Thank you for participating in our round. We apologise for C2 being too hard compared to C1.
UPD : Added an alternative solution to B, which contains the original solution with a diagram for better understanding.
Idea: ritam1234, Preparation: ritam1234
We observe that
Now observe that the sequence $$$n, n-1, \dots, 1$$$ satisfies the problem's condition (since $$$1 \ge 1 \ge \dots \ge 0$$$), so it is a valid answer.
2210B - Simply Sitting on Chairs
Idea: ritam1234, Preparation: ritam1234, Argentum47
Suppose the game ends at chair $$$k$$$ (or chair $$$n+1$$$ if the game ends after successfully visiting the $$$n$$$-th chair), and try to find the maximum number of chairs you can visit as a function of $$$k$$$.
We hope you have read the hint because the solution continues from that.
The maximum number of chairs which you can visit if your game ends at chair $$$k$$$ is given by $$$\left| \{ i : 1 \le i \lt k,\ p_i \le i \text{ or } p_i \ge k \} \right|$$$. We can obviously visit all chairs such that $$$p_i\le i$$$, and we can also visit all chairs with $$$p_i\ge k$$$: since our game ends at chair $$$k$$$, marking any chair from $$$k, k+1, k+2, \ldots, n$$$ does not matter.
Let's call this function $$$f(k)$$$. So our answer is given by
$$$\max\limits_{1 \le k \le n+1} f(k)$$$.
For now consider $$$k \lt n+1$$$.
Let $$$ct$$$ denote the count of chairs $$$i \lt k$$$ such that $$$p_i \ge k$$$.
Now, by an exchange argument, there must be at least $$$ct$$$ chairs to the right of $$$k$$$ such that $$$p_i\le i$$$, i.e., $$$\left|\{ i : k \le i \le n,\ p_i \le i \}\right| \ge ct$$$.
Therefore it is always optimal to choose $$$k=n+1$$$ for sitting on the maximum number of chairs possible.
Thus the answer is the count of indices such that $$$p_i\le i$$$.
You can refer to the following for a better idea of the exchange argument, credits to dipamsen.

2210C1 - A Simple GCD Problem (Easy Version)
Idea: simplelife, Preparation: Argentum47
Do we really need to consider all subarrays?
We only need to consider adjacent elements of the array.
Let's say we know $$$\gcd(a_{i-1}, a_{i})$$$ and $$$\gcd(a_{i}, a_{i+1})$$$. What is the minimum possible value of $$$a_i$$$ satisfying both?
Since $$$b_i \le a_i$$$, the problem essentially asks for the number of indices $$$i$$$ where $$$a_i$$$ can be strictly decreased ($$$b_i \lt a_i$$$) without changing the GCD of any subarray.
Claim: To preserve the GCD of all subarrays, it is sufficient to preserve the GCD of all adjacent elements (subarrays of length 2).
Proof: Assume we modify the array such that the GCD of adjacent elements remains unchanged for all $$$i$$$. We can show this preserves the GCD of any subarray of length $$$k \ge 3$$$. For a subarray of length 3, using the properties of the GCD operator, we know that $$$\gcd(a_{i-1}, a_i, a_{i+1}) = \gcd\left(\gcd(a_{i-1}, a_i), \gcd(a_i, a_{i+1})\right)$$$.
Since the GCD of adjacent elements is preserved by our assumption, the GCD of the triplet is also preserved. By induction, we can show that this holds for all subarrays of length $$$\ge 2$$$.
Now for each element $$$a_i$$$ ($$$1 \lt i \lt n$$$) let $$$A = \gcd(a_{i-1}, a_i)$$$ and $$$B = \gcd(a_i, a_{i+1})$$$.
For $$$a_i$$$ to maintain these GCDs, it must be a multiple of both $$$A$$$ and $$$B$$$. The smallest such value is $$$\text{lcm}(A, B)$$$, so $$$a_i$$$ can be reduced iff $$$\text{lcm}(A, B) \lt a_i$$$. For the endpoints, we need to consider only one adjacent pair: $$$a_1$$$ can be reduced iff $$$\gcd(a_1, a_2) \lt a_1$$$, and $$$a_n$$$ can be reduced iff $$$\gcd(a_{n-1}, a_n) \lt a_n$$$.
The answer is just the number of indices that can be reduced. The time complexity is $$$O(n \log(\max a_i))$$$.
2210C2 - A Simple GCD Problem (Hard Version)
Idea: simplelife, Preparation: Argentum47
Thanks to Intellegent for misreading $$$C1$$$ and Argentum47 for fully solving it.
If you are able to set the value at a particular index to something less than $$$a_i$$$, do it.
What can you say about remaining indices whose value is unchanged?
We can only increase the value of the indices with unchanged values, and while increasing, we can only change them to their multiples.
Do we need to consider all multiples? No!
We only need to consider prime multiples. Do we need to consider all prime multiples? No!
We only need to consider the first few prime multiples; $$$20$$$ should suffice, but why?
Note that, as discussed in the previous part, we only need to consider adjacent elements. How can we use this to implement our logic?
Use $$$dp$$$! Let $$$dp[i][j]$$$ denote the maximum number of changes up to index $$$i$$$, if we multiply the value at index $$$i$$$ by the $$$j$$$-th prime.
Refer to the editorial of C1 first. We consider only indices $$$i$$$ ($$$1 \lt i \lt n$$$); for the endpoints, a similar analysis applies. Let $$$A = \gcd(a_i, a_{i-1})$$$ and $$$B = \gcd(a_i, a_{i+1})$$$.
First of all, it is trivial to observe that if $$$b_i \lt \text{lcm}(A, B)$$$ or $$$a_i = \text{lcm}(A,B)$$$, it is impossible to reduce the value of $$$a_i$$$; we can only increase it. Furthermore, if we can reduce the value of $$$a_i$$$, it is always optimal to reduce it to the minimum possible value.
Now let's consider the unchanged values so far. They fall into one of the $$$2$$$ categories:

- $$$\text{lcm}(A,B) \gt b_i$$$
- $$$\text{lcm}(A,B) = a_i$$$
If an index falls in category $$$1$$$, it is easy to see that it is impossible to change it without altering some subarray GCD. However, if the index falls in category $$$2$$$, then we can multiply it by some number $$$x$$$ such that $$$a_i \cdot x \le b_i$$$ and the GCD is preserved.
It is easy to see that it is optimal to choose a prime number as $$$x$$$, since any composite number can be written as a product of primes, and choosing any one of those primes results in the same subarray GCDs.
Now notice that we can only choose prime numbers which are NOT factors of $$$a_{i-1}/ \gcd(a_i, a_{i-1})$$$ or $$$a_{i+1}/\gcd(a_i, a_{i+1})$$$, since if the prime were present in those values, the subarray GCD would increase.
Also notice that we don't have to consider all primes; just the first $$$20$$$ prime numbers will do, since the product of the first $$$20$$$ primes exceeds $$$10^{18}$$$. This is important because the product of adjacent elements is at most $$$10^{18}$$$, and hence cannot cover all these primes.
Now we can use dp to solve from here, where $$$dp[i][j]$$$ = maximum number of changes in the array from index $$$1$$$ to $$$i$$$, if we multiply $$$a_i$$$ by the $$$j$$$-th prime. For $$$j = 0$$$, define $$$dp[i][0]$$$ = maximum number of changes in the array from index $$$1$$$ to $$$i$$$, if we keep $$$a_i$$$ unchanged. Initially assign all values negative infinity.
Create a new array $$$c_i$$$, with $$$c_i = \text{lcm}(A,B)$$$ for all $$$i$$$. Furthermore, if $$$c_i \gt b_i$$$, set $$$c_i = a_i$$$. The reason is that if $$$a_i$$$ can be successfully reduced, we use that reduced value in our analysis; otherwise we use $$$a_i$$$ itself.
Now let's focus on the base case. For $$$i = 0$$$, if $$$c_0 \neq a_0$$$, then $$$dp[0][0] = 1$$$, else $$$dp[0][0] = 0$$$. Now for $$$j = 1$$$ to $$$j = 20$$$, let $$$x = c_0 \cdot p_j$$$. If $$$x \leq b_0$$$, $$$\gcd(x, c_1) = \gcd(a_0, a_1)$$$ and $$$x \neq a_0$$$, then multiplying is a valid option, so set $$$dp[0][j] = 1$$$; otherwise it is invalid and stays negative infinity. We take $$$\gcd(x, c_1)$$$ here and in the subsequent transitions as well, since we want to consider the least possible value at the next index, to minimise prime collisions.
Now for the remaining indices, let's say we multiplied the previous element by the $$$k$$$-th prime and are trying to multiply the current element by the $$$j$$$-th prime. Then the following conditions must hold (let $$$x = c_i \cdot p_j$$$ and $$$y = c_{i-1} \cdot p_k$$$):
- $$$x \le b_i$$$
- $$$\gcd(x, c_{i+1}) = \gcd(a_i, a_{i+1})$$$
- $$$\gcd(x, y) = \gcd(a_i, a_{i-1})$$$
- $$$x \neq a_i$$$
If these $$$4$$$ conditions hold, then multiplying by the $$$j$$$-th prime is valid, given we multiplied the previous number by the $$$k$$$-th prime. In this case, $$$dp[i][j] = \max(dp[i][j], dp[i-1][k] + 1)$$$.
Note that we will handle the $$$j = 0$$$ case explicitly, by setting $$$dp[i][0] = \max_k(dp[i-1][k]) + 1$$$ if $$$c_i \lt a_i$$$, and $$$dp[i][0] = \max_k(dp[i-1][k])$$$ otherwise. This ensures we always have a valid transition (the number is unchanged) in case of impossibility (the case where $$$a_i$$$ can never be changed), so the $$$dp[i]$$$ values are not all negative infinity. (Basically, even if we can't change the value, we have to keep the number as $$$a_i$$$.)
The overall time complexity of this logic is $$$O(n \cdot L^{2} \cdot \log(\max a_i))$$$, where $$$L$$$ is the number of primes used. This comfortably passes within the time constraints in both Python and C++. For larger $$$L$$$, like $$$25$$$-$$$30$$$, some basic optimisations, like exiting the loop if $$$c_i \cdot p_j \gt b_i$$$, might be required depending on the implementation.
What we described is the simplest solution to this problem that passes comfortably within these time constraints. Can you find a more efficient algorithm, one which passes comfortably for, say, $$$n = 2 \cdot 10^{5}$$$ or even $$$n = 10^{6}$$$?
#include <bits/stdc++.h>
using namespace std;
typedef long long ll;
void solve()
{
    ll n;
    cin >> n;
    vector<ll> a(n), b(n), c(n);
    for(ll i = 0; i < n; i++) cin >> a[i];
    for(ll i = 0; i < n; i++) cin >> b[i];
    vector<ll> primes = {1, 2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, 47, 53, 59, 61, 67, 71, 73};
    for(ll i = 0; i < n; i++)
    {
        if (i == 0) c[i] = gcd(a[i], a[i+1]);
        else if (i == n-1) c[i] = gcd(a[i-1], a[i]);
        else c[i] = lcm(gcd(a[i], a[i-1]), gcd(a[i], a[i+1]));
        if (c[i] > b[i]) c[i] = a[i];
    }
    ll L = primes.size();
    vector<vector<ll>> dp(n, vector<ll>(L, -1e18));
    // set up the base cases
    for(ll i = 0; i < L; i++)
    {
        if (i == 0)
        {
            if (c[0] == a[0]) dp[0][0] = 0;
            else dp[0][0] = 1;
            continue;
        }
        ll val = c[0] * primes[i];
        if (val <= b[0] and gcd(val, c[1]) == gcd(a[0], a[1]) and val != a[0]) dp[0][i] = 1;
    }
    // make the transitions
    for(ll i = 1; i < n; i++)
    {
        for(ll j = 0; j < L; j++)
        {
            for(ll k = 0; k < L; k++)
            {
                // j = 0 is a special case: we keep the value unchanged, and always want something assigned here
                if (j == 0)
                {
                    if (c[i] == a[i]) dp[i][j] = max(dp[i][j], dp[i-1][k]);
                    else dp[i][j] = max(dp[i][j], dp[i-1][k]+1);
                    continue;
                }
                // check all conditions: multiplying this by prime j and the previous by prime k keeps everything in check
                ll val1 = c[i] * primes[j];
                ll val2 = c[i-1] * primes[k];
                if (val1 <= b[i] and gcd(val1, val2) == gcd(a[i], a[i-1]) and val1 != a[i])
                {
                    if (i < n-1)
                    {
                        if (gcd(val1, c[i+1]) == gcd(a[i], a[i+1])) dp[i][j] = max(dp[i][j], dp[i-1][k] + 1);
                    }
                    else dp[n-1][j] = max(dp[n-1][j], dp[n-2][k] + 1);
                }
            }
        }
    }
    cout << max(0LL, *max_element(dp[n-1].begin(), dp[n-1].end())) << '\n';
}
int main()
{
    ll t = 1;
    cin >> t;
    while(t--) solve();
    return 0;
}
Idea: simplelife, Preparation: simplelife
Thanks to awesomeguy856 for helping me in writing the editorial for this problem.
Think of some invariants.
Notice that the operation performed is reversible, so if you can convert $$$s$$$ to $$$t$$$, you can also convert $$$t$$$ to $$$s$$$.
First, consider the tree representations of $$$"("+s+")"$$$ and $$$"("+t+")"$$$. The tree representation of an $$$RBS$$$ can be found with a stack, where each bracket pair is a node, and its parent is the bracket pair at the top of the stack when it is initially added.
The swapping operation becomes the following on a tree: either

- take two nodes $$$u$$$ and $$$v$$$ and swap the subtrees of a contiguous range of children of $$$u$$$ with the disjoint subtrees of a contiguous range of children of $$$v$$$, or
- swap the subtrees of two disjoint contiguous ranges of children of $$$u$$$, for any node $$$u$$$.
We observe two invariants in this process:

- the number of leaves (since entire subtrees are swapped, no new leaves are created and no leaves are destroyed);
- the depth of the highest node $$$u$$$ with multiple children (or the lack thereof). This is because for any node $$$v$$$ in the fixed chain above $$$u$$$, no swaps can be made, as every non-fixed node is in the subtree of the only child of $$$v$$$. Furthermore, $$$u$$$ will always have multiple children: if at any point $$$u$$$ had only one child, then reversing the operations would turn a fixed node into a non-fixed node, contradicting the previous sentence.
If these two invariants differ between $$$s$$$ and $$$t$$$, we immediately know the answer is NO, and these conditions can be checked easily on a tree.
We claim that if $$$s$$$ and $$$t$$$ have the same values for these invariants, then the answer is YES.
Proof:
Let $$$u$$$ be the highest node with multiple children (i.e. the lowest fixed node). Consider the process of turning the number of children of $$$u$$$ into the number of leaves. If the number of children of $$$u$$$ is less than the number of leaves, then there must exist a lower node $$$v$$$ that has multiple children. Swap the subtrees of the children of $$$v$$$ with the subtree of any child of $$$u$$$ that is not an ancestor of $$$v$$$. Then, the number of children of $$$u$$$ has increased. Continuing this process eventually leads to a tree where $$$u$$$ is the only node with multiple children (a bunch of chains attached to $$$u$$$). Then, we make it so that only the leftmost child $$$c$$$ of $$$u$$$ is allowed to have a child. If any other child $$$v$$$ of $$$u$$$ has a child, then swap the subtree of $$$v$$$ with the lowest node in the chain attached to $$$u$$$. Then, $$$v$$$ is replaced with a leaf. Continuing this process turns every child of $$$u$$$ besides $$$c$$$ into a leaf, as desired.
This process ends with a unique tree based on the two invariants we found, so both $$$s$$$ and $$$t$$$ can reach this tree through some swaps. Since operations are reversible, $$$s$$$ can be transformed into this tree, then be transformed into $$$t$$$. So the answer is YES.
#include<bits/stdc++.h>
using namespace std;
void solve(){
    int n;
    cin>>n;
    string s,t;
    cin>>s>>t;
    auto count_adjacent_pairs = [](string &s) -> int {
        int m=s.size();
        int ct=0;
        for(int i=0;i<m-1;i++){
            if(s[i]=='(' and s[i+1]==')')ct++;
        }
        return ct;
    };
    auto count_outer_brackets = [](string &s) -> int {
        int m=s.size();
        vector<int> match(m,-1);
        stack<int> indices;
        for(int i=0;i<m;i++){
            if(s[i]==')'){
                match[indices.top()]=i;
                indices.pop();
            }
            else{
                indices.push(i);
            }
        }
        int ct=0;
        int l=0,r=m-1;
        while(match[l]==r){
            l++;
            r--;
            ct++;
        }
        return ct;
    };
    if(count_adjacent_pairs(s)==count_adjacent_pairs(t) and count_outer_brackets(s)==count_outer_brackets(t)){
        cout<<"YES\n";
        return;
    }
    cout<<"NO\n";
}
int main(){
    int t;
    cin>>t;
    while(t--){
        solve();
    }
    return 0;
}
2210E - Binary Strings are Simple?
Idea: Argentum47, simplelife, Preparation: ritam1234, __baozii__, Argentum47
Thanks to wuhudsm for buffing this problem.
Consider a binary string of length $$$n$$$. Cyclically rotating the string to the left removes the character from the front and places it at the back. Analyse the change in inversions modulo $$$n$$$ if the front character is '$$$0$$$', and do the same for '$$$1$$$'.
Let $$$c_1$$$ be the number of $$$1$$$'s in the binary string and $$$c_0$$$ the number of $$$0$$$'s.
Let's consider the inversions due to each individual $$$0$$$ and $$$1$$$ in the string. Say at index $$$i$$$ we have a $$$1$$$. Then the number of inversions caused by it equals the number of $$$0$$$'s in the string from index $$$i+1$$$ to $$$n$$$. If instead we have a $$$0$$$ at that index, it contributes $$$0$$$ inversions.
What happens if we cyclically rotate the string? If the first character was a $$$1$$$, it was creating $$$c_0$$$ inversions, which will no longer be there, so the number of inversions decreases by $$$c_0$$$. If the first character was a $$$0$$$, then we are essentially placing that $$$0$$$ at the back, increasing inversions by $$$c_1$$$ ($$$1$$$ inversion for each index containing a $$$1$$$).
Now let's say the initial number of inversions is $$$x$$$. After one cyclic rotation, it becomes either $$$x + c_1$$$ or $$$x - c_0$$$, and we add this value to the set. Now observe that $$$x - c_0 \equiv x + n - c_0 \equiv x + c_0 + c_1 - c_0 \equiv x + c_1 \pmod{n}$$$.
It essentially means that no matter what we do, the number of inversions increases by $$$c_1$$$ modulo $$$n$$$. Therefore the set contains $$$x, x + c_1, x + 2 \cdot c_1, \ldots, x + (n-1) \cdot c_1 \pmod{n}$$$. It can be shown that this sequence contains exactly $$$n/ \gcd (n, c_1)$$$ distinct elements. Hence $$$f(l, r) = n/ \gcd (n, c_1)$$$.
It isn't really useful to work with $$$\gcd$$$ directly. Try to find some useful properties of $$$\gcd$$$ and work with them.
Let's say you know the $$$i$$$-th character of the string. How do you determine the $$$i+1$$$-th character while minimising the cost of a query?
First, it's easy to see that a binary string and its complement return the same answer for all queries, so we need at least two guesses. Hence we will first assume the first character of the string, build the string from this first character, and then, if this string is not correct, its complement is definitely correct.
Then we can see that if we query only ranges of even length, we can determine the parity of the number of '$$$1$$$'s present in the range (given $$$m$$$ is even, $$$\gcd(k,m)$$$ is odd if $$$k$$$ is odd, and even if $$$k$$$ is even).
Thus we can first find the string using the following method ---
- We can start determining all characters from left to right.
- At position $$$i$$$, we query the longest even-length prefix which includes the $$$i$$$-th position (basically, if $$$i$$$ is odd we query $$$(2,i)$$$, else $$$(1,i)$$$).
- Since we already know the characters up to the $$$i-1$$$-th position, we can decide whether the $$$i$$$-th character is '$$$1$$$' or '$$$0$$$' based on the parity we receive.
However, the cost of the queries is $$$\sum_{i=1}^{n} \frac{n}{i} \approx n\ln(n)$$$, which is greater than $$$3\cdot n$$$ for sufficiently large $$$n$$$.
To reduce the cost, we will have to query from both ends.
To determine the $$$i+1$$$-th character, assuming we know the $$$i$$$-th character, we can treat $$$(i,i+1)$$$ as a pair.
For this pair, we can choose the longest even prefix or the longest even suffix (similar to what we did earlier), whichever minimises the cost, and make two queries. Since we know the parities of both queries and the $$$i$$$-th character, we can determine the $$$i+1$$$-th character.
We can see that the total cost of the queries turns out to be approximately
The bound was set to $$$3\cdot n$$$ to allow some other weaker solutions to pass using the same idea.
There also exists a solution using trees, which basically uses the same query pattern as mentioned above (we can determine the longest even prefix or longest even suffix for each position and take the longer of these queries).
We make a tree out of all these queries, assigning the parities of the queries to the edges. Then, assuming the $$$1$$$-st character, we find the rest of the characters using a DFS/BFS.
#include <bits/stdc++.h>
using namespace std;
map<pair<int,int>,int> quers;
int ask(int l,int r){
    if(l>r)return 0;
    if(quers.count({l,r})){
        return quers[{l,r}];
    }
    cout<<"? "<<l<<" "<<r<<endl;
    int x;
    cin>>x;
    return quers[{l,r}]=((r-l+1)/x)%2;
}
void solve() {
    int n;
    cin>>n;
    string s;
    for(int i=0;i<n;i++)s+='0';
    quers.clear();
    if(n<=6){
        int cu;
        for(int i=2;i<=n;i++){
            cu=ask(i-1,i);
            s[i-1]=((s[i-2]-'0')^cu)+'0';
        }
        cout<<"! "<<s<<endl;
        int x;
        cin>>x;
        if(x==1)return;
        for(auto &x:s){
            if(x=='0')x='1';
            else x='0';
        }
        cout<<"! "<<s<<endl;
        cin>>x;
        return;
    }
    for(int i=2;i<n;i++){
        // to guess i
        // 1 + (i)%2 i
        // 1 + (i)%2 i-2
        // i+1 n-(n-i)%2
        // i-1 n-(n-i)%2
        int quer1=i-(i)%2;
        int quer2=n-(n-i)%2-i;
        if(quer1>quer2){
            int type1=ask(1+(i)%2,i);
            int type2=ask(1+(i)%2,i-2);
            s[i-1]=((s[i-2]-'0')^type1^type2)+'0';
        }
        else{
            int type1=ask(i+1,n-(n-i)%2);
            int type2=ask(i-1,n-(n-i)%2);
            s[i-1]=((s[i-2]-'0')^type1^type2)+'0';
        }
    }
    int la=ask(1+(n&1),n);
    int ct=0;
    for(int i=(n&1);i<n;i++){
        ct+=(s[i]=='1');
    }
    ct&=1;
    s[n-1]=(la^ct)+'0';
    cout<<"! "<<s<<endl;
    int x;
    cin>>x;
    if(x==1)return;
    for(auto &x:s){
        if(x=='0')x='1';
        else x='0';
    }
    cout<<"! "<<s<<endl;
    cin>>x;
}
signed main() {
    ios_base::sync_with_stdio(false);
    cin.tie(NULL);
    int t; cin >> t;
    while (t--)
        solve();
    return 0;
}
#include <bits/stdc++.h>
using namespace std;
void dfs(int u, int p, vector<vector<pair<int,int>>> &g, vector<int> &pref)
{
    for(auto [v,parity]:g[u])
    {
        if(v==p)continue;
        pref[v]=pref[u]^parity;
        dfs(v,u,g,pref);
    }
}
void solve() {
    int n;
    cin>>n;
    vector<vector<pair<int,int>>> g(n+1);
    vector<int> p(n+1);
    string s;
    for(int i=2;i<=n;i++)
    {
        int f=0;
        if(i%2)f=1;
        int l=n;
        if((i%2)!=(l%2))l--;
        if(i-f>l-i)
        {
            cout<<"? "<<f+1<<" "<<i<<"\n";
            cout.flush();
            int x;
            cin>>x;
            int par=((i-f)/x)%2;
            g[i].push_back({f,par});
            g[f].push_back({i,par});
        }
        else
        {
            cout<<"? "<<i+1<<" "<<l<<"\n";
            cout.flush();
            int x;
            cin>>x;
            int par=((l-i)/x)%2;
            g[i].push_back({l,par});
            g[l].push_back({i,par});
        }
    }
    p[1]=0;
    dfs(0,-1,g,p);
    dfs(1,-1,g,p);
    for(int i=1;i<=n;i++)
    {
        int r=p[i]^p[i-1];
        if(r==0)s+='0';
        else s+='1';
    }
    cout<<"! "<<s<<"\n";
    cout.flush();
    int w;
    cin>>w;
    if(w)return;
    for(int i=0;i<n;i++)
    {
        if(s[i]=='0')s[i]='1';
        else s[i]='0';
    }
    cout<<"! "<<s<<"\n";
    cout.flush();
    cin>>w;
}
int main() {
    int t=1; cin >> t;
    while(t--)
        solve();
    return 0;
}
Idea: simplelife, Preparation: __baozii__
Thanks to __baozii__ for solving this problem.
For simplicity of the editorial, denote $$$\max(b_1,b_2,\ldots,b_i)$$$ as prefix-max($$$i$$$) and $$$\min(b_1,b_2,\ldots,b_i)$$$ as prefix-min($$$i$$$).
You may think that first choosing prefix-maxes and then prefix-mins is always optimal, but is it always correct?
Suppose at an index $$$i$$$ you choose prefix-max($$$i$$$) or prefix-min($$$i$$$); how does it affect the inversions with the left side and with the right side? Analyse all the cases properly.
First we will solve this problem for a single query and then build up the optimal solution step by step.
Suppose we choose a prefix-max($$$i$$$) at index $$$i$$$.
Then,
- It will never create an inversion with the left side of the array ($$$j \lt i$$$).
- It will always create an inversion with any prefix-min on the right side of the array, and never creates an inversion with a prefix-max ($$$j \gt i$$$).
Now suppose we choose a prefix-min($$$i$$$) at index $$$i$$$.
Then,
- It will always create an inversion with the left side of the array ($$$j \lt i$$$), except with a prefix-min whose value equals this prefix-min.
- It will always create an inversion with any prefix-min on the right side of the array that is smaller than this prefix-min, and never creates an inversion with a prefix-max ($$$j \gt i$$$).
Thus we can see that our choice at an index $$$i$$$ (whether we take prefix-max($$$i$$$) or prefix-min($$$i$$$)) is independent of any index $$$j$$$ where prefix-min($$$j$$$) $$$\neq$$$ prefix-min($$$i$$$).
Thus we can divide this array into blocks having equal prefix-min.
Since the choices in each block are independent of other blocks, we can maximise the number of inversions in each block independently and add the contributions of all blocks to the final answer.
Now for each block, let $$$s$$$ be the length of the array that comes before this block and $$$t$$$ be the length of the current block.
We can easily prove that within a block, first choosing prefix-maxes and then prefix-mins is the optimal ordering to maximise the number of inversions.
Let $$$x$$$ ($$$x\leq t$$$) be the number of elements we choose to be prefix-maxes in this block. Then the number of inversions contributed by this block is $$$(s+x)\cdot(t-x)$$$.
This is a concave (downward-opening) quadratic, maximised at $$$x=\frac{t-s}{2}$$$. So for this block we can just choose the closest non-negative integer value as the optimal $$$x$$$.
For the single-query version, we can just divide the array into blocks, choose the optimal $$$x$$$ for each block independently, and sum the contributions over all blocks.
Now, to optimise the solution, we will divide the blocks into two cases.
- Case 1: $$$t-s \gt 0$$$
- Case 2: $$$t-s\leq 0$$$
For blocks in Case $$$2$$$, we can just choose $$$x=0$$$, i.e., all elements in the block are prefix-mins.
Now notice that for Case $$$1$$$, $$$t$$$ must be greater than $$$s$$$, i.e., the current block must be larger than the entire array that comes before it. So whenever we encounter a block of type $$$1$$$, the current array length more than doubles, which means Case $$$1$$$ happens at most $$$\log_2(n)$$$ times, where $$$n$$$ is the length of the whole array.
Now for implementation details,
We can handle all cases of type $$$1$$$ in $$$O(1)$$$ time.
Now we need an efficient method to jump over blocks of type $$$2$$$.
An efficient way to do this is by using binary lifting.
Consider the following subproblem ---
The current array length is $$$s$$$, and you have to jump over $$$k$$$ consecutive blocks of lengths $$$b_1,b_2,\ldots,b_k$$$, all of type $$$2$$$.
Let $$$tot=\sum_{i=1}^{k} b_i$$$.
The number of inversions due to these $$$k$$$ blocks will be $$$\sum_{i=1}^{k} b_i \cdot (s + b_1 + \dots + b_{i-1}) = s \cdot tot + \frac{tot^2 - \sum_{i=1}^{k} b_i^2}{2}$$$. We can precompute the sums of $$$b_i$$$ and of $$$b_i^2$$$ along the jump chains.
We also need to store the maximum block length we can jump over in case of type $$$2$$$. This can also be done using binary lifting.
The intended time complexity of the solution is $$$O((n+q)\cdot \log(n))$$$, but a reasonably implemented $$$O(n\cdot \log(n) + q\cdot \log^2(n))$$$ also passes.
This can also be implemented using a data structure like a segment tree or a Fenwick tree; refer to Dominater069's code.
#include <bits/stdc++.h>
using namespace std;
using i64 = long long;
constexpr int B = 21;
void solve() {
    int n, q;
    cin >> n >> q;
    vector<int> a(n);
    for (int i = 0; i < n; i++) cin >> a[i];
    vector<i64> next(n, n), st;
    for (int i = 0; i < n; i++) {
        while (!st.empty() && a[i] < a[st.back()]) {
            next[st.back()] = i;
            st.pop_back();
        }
        st.push_back(i);
    }
    vector<vector<int>> g(n + 1);
    for (int i = 0; i < n; i++) g[next[i]].push_back(i);
    vector<array<int, B>> up(n + 1), pa(n + 1);
    vector<i64> sum1(n + 1), sum2(n + 1);
    for (int i = 0; i <= n; i++) up[i].fill(n), pa[i].fill(n);
    auto dfs = [&](auto &&self, int u) -> void {
        for (auto v : g[u]) {
            int d = u - v;
            for (int i = 0; i < B; i++) {
                up[v][i] = (d >= 1 << i) ? v : up[u][i];
                if (i == 0) pa[v][i] = u;
                else pa[v][i] = pa[pa[v][i - 1]][i - 1];
            }
            sum1[v] = sum1[u] + d;
            sum2[v] = sum2[u] + 1LL * d * d;
            self(self, v);
        }
    };
    dfs(dfs, n);
    while (q--) {
        i64 l, r;
        cin >> l >> r;
        l--;
        i64 u = min(r, next[l]);
        i64 x = (u - l - 1) / 2;
        i64 ans = x * (u - l - 1 - x);
        i64 t = u - l;
        while (u < r) {
            int i = (t <= 1) ? 0 : (31 - __builtin_clz(t));
            assert(i < B && i >= 0);
            i64 v = up[u][i];
            if (v > r) {
                v = u;
                for (int j = B - 1; j >= 0; j--) if (pa[v][j] <= r) v = pa[v][j];
            }
            i64 d1 = sum1[u] - sum1[v], d2 = sum2[u] - sum2[v];
            ans += t * d1 + (d1 * d1 - d2) / 2;
            t += d1;
            u = v;
            if (u < r) {
                i64 s = min(r, next[u]) - u;
                i64 x = clamp((s - t) / 2, 0LL, s);
                ans += -x * x + (s - t) * x + s * t;
                t += s;
                u = min(r, next[u]);
            }
        }
        cout << ans << "\n";
    }
}
int main() {
    ios::sync_with_stdio(0);
    cin.tie(0);
    int t;
    cin >> t;
    while (t--) solve();
}
#include <bits/stdc++.h>
using namespace std;
#define int long long
#define INF (int)1e18
mt19937_64 RNG(chrono::steady_clock::now().time_since_epoch().count());
struct FenwickTree{
    int n;
    vector <int> f;
    vector <int> b;
    inline void add(int i, int x){
        b[i] += x;
        for (int j = i; j <= n; j += j & (-j)){
            f[j] += x;
        }
    }
    inline void modify(int i, int x){
        add(i, x - b[i]);
    }
    inline void init(int nn, vector <int> a){
        n = nn;
        if (a.size() == n){
            vector <int> a2;
            a2.push_back(0);
            for (auto x : a) a2.push_back(x);
            a = a2;
        }
        f.resize(n + 1);
        b.resize(n + 1);
        for (int i = 0; i <= n; i++) f[i] = 0, b[i] = 0;
        for (int i = 1; i <= n; i++){
            modify(i, a[i]);
        }
    }
    inline int query(int x){
        int ans = 0;
        for (int i = x; i; i -= i & (-i)){
            ans += f[i];
        }
        return ans;
    }
    inline int query(int l, int r){
        return query(r) - query(l - 1);
    }
};
void Solve()
{
    // make edges l -> nx[l]
    // binary lifting
    // cost(i) needs to be computed
    // sequence length > prefix => balances at (seq + prefix) / 2
    // otherwise balances at seq length * prefix
    // the first can happen only log times
    // for all active i, need to add (i - l) * val[i]
    // fix the sum for i = l and i such that sequence length > prefix
    // find first val[i] > current, is it guaranteed to be active?
    int n, q; cin >> n >> q;
    vector <int> p(n + 1);
    for (int i = 1; i <= n; i++){
        cin >> p[i];
    }
    vector<vector<pair<int, int>>> adj(n + 1);
    for (int i = 1; i <= q; i++){
        int l, r; cin >> l >> r;
        adj[l].push_back({r, i});
    }
    vector <int> nxs(n + 1, n + 1);
    {
        stack <pair<int, int>> st;
        for (int i = 1; i <= n; i++){
            while (!st.empty() && st.top().first > p[i]){
                nxs[st.top().second] = i;
                st.pop();
            }
            st.push({p[i], i});
        }
    }
    vector<vector<int>> jump(21, vector<int>(n + 2));
    for (int i = 1; i <= n + 1; i++){
        jump[0][i] = (i > n) ? i : nxs[i];
    }
    for (int i = 1; i <= 20; i++){
        for (int j = 1; j <= n + 1; j++){
            jump[i][j] = jump[i - 1][jump[i - 1][j]];
        }
    }
    vector <int> val(n + 1);
    for (int i = 1; i <= n; i++){
        val[i] = nxs[i] - i;
    }
    // fenwick for i * val[i] and val[i]
    stack <pair<int, int>> st; // stack for active values
    FenwickTree f1, f2;
    vector <int> vv(n);
    f1.init(n, vv); f2.init(n, vv);
    for (int i = 1; i <= n; i++){
        f1.modify(i, val[i]);
        f2.modify(i, val[i] * i);
    }
    set <int> ss;
    vector<vector<int>> at(n + 1);
    for (int i = 1; i <= n; i++){
        int r = i - val[i];
        if (r >= 1){
            at[r].push_back(i);
        }
    }
    auto deac = [&](int x){
        ss.erase(x);
        f1.modify(x, 0);
        f2.modify(x, 0);
    };
    vector <int> ans(q + 1);
    for (int l = n; l >= 1; l--){
        while (!st.empty() && st.top().first > p[l]){
            deac(st.top().second);
            st.pop();
        }
        st.push({p[l], l});
        ss.insert(l);
        for (auto x : at[l]){
            ss.erase(x);
        }
        for (auto [r, i] : adj[l]){
            ans[i] += f2.query(l, r) - l * f1.query(l, r);
            auto calc = [&](int x){
                // cout << "UPD " << x << "\n";
                int vv = min(val[x], r - x + 1);
                if (x == l) vv--;
                if (vv > x - l){
                    vv += (x - l);
                    ans[i] += (vv / 2) * ((vv + 1) / 2);
                } else {
                    ans[i] += vv * (x - l);
                }
            };
            int pos = l;
            // while (pos <= r){
            //     if (nxs[pos] > r && val[pos] <= (pos - l)){
            //         ans[i] -= val[pos] * (pos - l);
            //         calc(pos);
            //     }
            //     pos = nxs[pos];
            // }
            for (int j = 20; j >= 0; j--){
                if (jump[j][pos] <= r){
                    pos = jump[j][pos];
                }
            }
            assert(nxs[pos] > r);
            if (!ss.count(pos)){
                ans[i] -= val[pos] * (pos - l);
                calc(pos);
            }
            for (auto x : ss){
                if (x <= r){
                    ans[i] -= val[x] * (x - l);
                    calc(x);
                } else {
                    break;
                }
            }
        }
    }
    for (int i = 1; i <= q; i++){
        cout << ans[i] << "\n";
    }
}
int32_t main()
{
    auto begin = std::chrono::high_resolution_clock::now();
    ios_base::sync_with_stdio(0);
    cin.tie(0);
    int t = 1;
    // freopen("in", "r", stdin);
    // freopen("out", "w", stdout);
    cin >> t;
    for(int i = 1; i <= t; i++)
    {
        //cout << "Case #" << i << ": ";
        Solve();
    }
    auto end = std::chrono::high_resolution_clock::now();
    auto elapsed = std::chrono::duration_cast<std::chrono::nanoseconds>(end - begin);
    cerr << "Time measured: " << elapsed.count() * 1e-9 << " seconds.\n";
    return 0;
}







