Mo's Algorithm on Trees [Tutorial]

10 лет назад, скрыть # ^ |

← Rev. 2 →

+8

Nope, it will be [ST(P), ST(P)].

Consider the path from 3 to 5. In this case, P = 2

EN(3) = 6 and ST(5) = 7, so we consider the range [6, 7] in A[] corresponding to the nodes [3, 5] giving us the values of nodes 3 and 5.

Our query range does not consider the value of the lca as ST(P) < EN(u) < ST(v) < EN(P). Hence we must account for the value of P separately.

→ Ответить

»

meintoo

10 лет назад, скрыть # ^ |

+8

Ok !!!
Thanks
Nice article anyways

→ Ответить

»

10 лет назад, скрыть # ^ |

+13

Thank You :D

→ Ответить

»

sampriti

10 лет назад, скрыть # |

+23

Thanks! That was a really nice tutorial!

→ Ответить

»

http://mirror.codeforces.com/problemset/problem/375/D

10 лет назад, скрыть # ^ |

+13

Thanks a lot :)

→ Ответить

»

brainstorm

10 лет назад, скрыть # |

0

Nice tutorial :)
can you give links to some more problems on which similar approach can be applied ?

→ Ответить

»

sbansalcs

10 лет назад, скрыть # ^ |

0

→ Ответить

»

10 лет назад, скрыть # ^ |

0

This can be done with standard Mo's Algorithm, because the queries are on subtrees and not paths.

→ Ответить

»

sbansalcs

10 лет назад, скрыть # ^ |

0

Oh sorry, I thought this guy was asking for any problems related to the algorithms described above.

→ Ответить

»

hmrockstar

9 лет назад, скрыть # ^ |

← Rev. 2 →

0

but in each query, there is a new k. I wrote code for this problem, after whole implementation, i noticed that i missed the point that there is alway a new k for each query. Now i am not getting how can i solve this prob!

my code!

→ Ответить

»

komendart

10 лет назад, скрыть # |

← Rev. 2 →

+5

BTW, we can find number of distinct values in a subarray [l, r] of a offline in O((q+n)\log n).

Let's sort all queries by l_i.

d_i = 1 if i is the first occurence of a_i in a[l...n] otherwise d_i = 0.

So, query (l_i, r_i) is finding sum of d[l_i...r_i].

When we move from query with l_i to query with l_i + 1 we must update only one or zero elements of d_i. It can be done in $\text{[math]}$ if we precalculated for each i next occurence of a_i in array.

→ Ответить

»

gongy

10 лет назад, скрыть # ^ |

+10

If you maintain the tree persistently, you can have an online solution as well.

→ Ответить

»

9 лет назад, скрыть # ^ |

-6

can you elaborate it . how to handle it online ? thanks in advance.

→ Ответить

»

bhishma

9 лет назад, скрыть # ^ |

0

I think this is related to a current running contest.

→ Ответить

»

9 лет назад, скрыть # ^ |

-6

yes

→ Ответить

»

9 лет назад, скрыть # ^ |

0

which contest ?

→ Ответить

»

9 лет назад, скрыть # ^ |

0

codechef feburary long challenge

→ Ответить

»

9 лет назад, скрыть # ^ |

0

In curiosity i asked this question too early . sorry for that . you can answer it after contest is over

→ Ответить

»

9 лет назад, скрыть # ^ |

+1

Yeah , 3 days early :P

→ Ответить

»

FLYSKY

9 лет назад, скрыть # ^ |

0

Contest has ended 3 months ago..

Can anyone please answer this question now ?

→ Ответить

»

FLYSKY

9 лет назад, скрыть # ^ |

0

Can you please explain it how to do it online ?

→ Ответить

»

9 лет назад, скрыть # ^ |

-7

Create an array of next occurences and build a persistent segment tree on that . The key idea is that number of distinct values in [L,R] is number of values whose next occurence is > R .

→ Ответить

»

bluemmb

10 лет назад, скрыть # |

0

Thanks, Is known who used this idea on trees first time ?

→ Ответить

»

belltolls

10 лет назад, скрыть # ^ |

+5

It must have been known from before. But I guess this is the first proper tutorial/blog for it.

→ Ответить

»

xrisk

10 лет назад, скрыть # |

+1

Totally went over my head! Excellent blog!

→ Ответить

»

R2__D2

10 лет назад, скрыть # ^ |

+3

If it goes over your head, How do you realize it's an excellent ?

→ Ответить

»

svg_af

10 лет назад, скрыть # |

0

Thanks a lot you made my day !!

I've been obsessing about COT2 for almost two months without anything that comes to mind

if only i could upvote more than once

→ Ответить

»

10 лет назад, скрыть # ^ |

+1

Thanks! I'm glad that you found it useful :)

→ Ответить

»

10 лет назад, скрыть # |

0

I implemented this algorithm on the COT2 problem on SPOJ(http://www.spoj.com/problems/COT2/). I am getting WA. Can someone help me identify the bug in my code? http://ideone.com/aLS5Yx

→ Ответить

»

10 лет назад, скрыть # ^ |

+8

I found the mistake. Thanks for the nice tutorial.

→ Ответить

»

10 лет назад, скрыть # ^ |

0

What was the bug ?

→ Ответить

»

10 лет назад, скрыть # ^ |

0

Lol. Bro that was 7 months ago.

→ Ответить

»

10 лет назад, скрыть # ^ |

0

Can you share your corrected code ? Because I'm getting a WA too .

→ Ответить

»

10 лет назад, скрыть # ^ |

← Rev. 2 →

0

Sure. Link

→ Ответить

»

vasandani68

10 лет назад, скрыть # ^ |

0

Can u please explain ur add and del functions. How are u maintaining the result after ignoring all those indexes which have occured 2 times?

→ Ответить

»

10 лет назад, скрыть # ^ |

0

Recently I solved one question using Mo's algorithm, and I remembered about this comment here. I overwrote the solution on the same link. Here is the solution for COT2. I think its self-explanatory how it is working.

→ Ответить

»

NiKS001

10 лет назад, скрыть # |

← Rev. 2 →

0

Can someone please provide the algorithm for Problem 1?

The best I could get is (N^2)*logN [as the sum of sizes of sets of each node is O(N^2) — Worst case linear graph with all values distinct]

→ Ответить

»

10 лет назад, скрыть # ^ |

← Rev. 4 →

+3

Maintain a set of values for each node in the tree. Let set(u) be the set of all values in the subtree rooted at u. We want size(set(u)) for all u.

Let a node u have k children, v₁, v₂...v_k. Every time you want to merge set(u) with set(v_i), pop out the elements from the smaller set and insert them into the larger one. You can think of it like implementing union find, based on size.

Consider any arbitrary node value. Every time you remove it from a certain set and insert it into some other, the size of the merged set is atleast twice the size of the original.

Say you merge sets x and y. Assume size(x) ≤ size(y). Therefore, by the algorithm, you will push all the elements of x into y. Let xy be the merged set. size(xy) = size(x) + size(y). But size(y) ≥ size(x).

So size(xy) ≥ 2 * size(x).

Thus, each value will not move more than log n times. Since each move is done in O(logn), the total complexity for n values amounts to O(nlog² n)

Code

→ Ответить

»

NiKS001

10 лет назад, скрыть # ^ |

+5

Awesome! Thanks for the great explanation and code!

→ Ответить

»

I_love_Captain_America

9 лет назад, скрыть # ^ |

← Rev. 3 →

0

I don't get the proof. Can you explain it a little more?

size(xy) ≥ 2 * size(x)

I think, size(xy) >= size(y), and size(xy) <= size(x) + size(y)

Thus, each value will not move more than log n times.

How?

EDIT I think I understood. For a particular value to be included the maximum number of times in a move operation from set(x) to set(xy) where size(x) <= size(y), this value must be moved for each of it's ancestor upto root. That is only possible if the height of the tree is at most log n.

But the size(xy) >= 2 * size(x) seems incorrect. I think you meant that the size of subtree of parent of x >= 2 * size(x).

→ Ответить

»

iit2015023

9 лет назад, скрыть # ^ |

0

We cannot use the size function of the set to compare the sizes of the set as it would otherwise lead to N^2 complexity.Am i right?

→ Ответить

»

9 лет назад, скрыть # ^ |

+6

set.size() is O(1).

→ Ответить

»

iit2015023

9 лет назад, скрыть # ^ |

0

I thought it is O(n).Thanks for the info.

→ Ответить

»

himanshujaju

10 лет назад, скрыть # |

0

Very neatly written tutorial. You make it seem amazingly easy!

→ Ответить

»

10 лет назад, скрыть # ^ |

0

Thanks a lot :)

→ Ответить

»

mbrc

10 лет назад, скрыть # |

+11

Superb idea! :D

Thanks! :D

→ Ответить

»

demon_cross

10 лет назад, скрыть # |

0

Nicely written!

→ Ответить

»

10 лет назад, скрыть # ^ |

0

Thanks a lot :D

→ Ответить

»

naruto09

10 лет назад, скрыть # |

0

shouldn't it be end time of u to start time of v in case 1.If we start from start time of u then u will be included 2 times one for its start time and once for end time.Correct me if i am wrong..

→ Ответить

»

10 лет назад, скрыть # ^ |

0

Case 1 implies that u is an ancestor of v.
Therefore, we won't visit u twice in the range [ST(u), ST(v)] as EN(u) > ST(v).

→ Ответить

»

baobab

10 лет назад, скрыть # |

0

Has anyone managed to get accepted on the SPOJ problem with a Java solution? I'm getting NZEC Runtime Error, but it looks like it's actually due to time limit exceeding.

→ Ответить

»

10 лет назад, скрыть # |

+3

Update 1: Added sample problems.

→ Ответить

»

Absolut

10 лет назад, скрыть # ^ |

0

For the "Frank Sinatra" problem. How could you find the less value not present in the path?

I realize that any value greather than the size of the tree wouldn't change the answer. So, if i have at most 1E5 different values I can build a BIT. pos[i] = 1 if value i is present in the path. Then I binary search the less value k wich sum[0...k] is less than k. That would be my answer. However the complexity is O(N*sqrt(N)*log(N)*log(N)) and I think is excesive.

→ Ответить

»

10 лет назад, скрыть # ^ |

0

The complexity wouldn't be $\text{[math]}$ , it would be $\text{[math]}$ .
The first term is because you update your bit atmost $\text{[math]}$ times and the second term is because you binary search once for each query.

→ Ответить

»

Absolut

10 лет назад, скрыть # ^ |

0

Thanks, my mistake.

So, it is the best completely? Or there is another approach

→ Ответить

»

10 лет назад, скрыть # ^ |

0

You can solve the problem in $\text{[math]}$ by doing square root decomposition on the values. Each update would be done in constant time and you will take additional $\text{[math]}$ time per query to find the block which has the smallest value.

Code

#include "bits/stdc++.h"
using namespace std;

typedef pair < int, int > pi;
typedef vector < int > vi;
typedef vector < pi > vpi;

const int MAXN = 1e5 + 10;
const int SQRT = 320;
const int LOGN = 20;

int A[2*MAXN];
pi B[MAXN];
int V[MAXN], C[MAXN];
int D[SQRT + 10][SQRT + 10], S[SQRT + 10];
int P[MAXN];
vpi G[MAXN];
int W[MAXN], R[MAXN];

struct query {
    int x;
    int y;
    int ind;
};

query Q[MAXN];

bool comp(query a, query b) {
    if (int(a.x/SQRT) != int(b.x/SQRT)) return (int(a.x/SQRT) < int(b.x/SQRT));
    return a.y < b.y;
}

int cont = 0;
void dfs(int x, int p) {
    V[x] = 1;
    P[x] = p;
    A[cont] = x;
    B[x].first = cont;
    cont++;
    for (int i = 0; i < G[x].size(); ++i) {
        int y = G[x][i].first;
        int k = G[x][i].second;
        if (!V[y]) {
            C[y] = k;
            dfs(y, p + 1);
        }
    }
    A[cont] = x;
    B[x].second = cont;
    cont++;
}

void add(int x) {
    int k = C[A[x]];
    if (k > MAXN) return;
    if (!W[A[x]]) {
        if (D[k/SQRT][k%SQRT] == 0) ++S[k/SQRT];
        ++D[k/SQRT][k%SQRT];
        ++W[A[x]];
    }
    else {
        if (D[k/SQRT][k%SQRT] == 1) --S[k/SQRT];
        --D[k/SQRT][k%SQRT];
        --W[A[x]];
    }
}

int main() {
    ios_base::sync_with_stdio(false);
    cin.tie(0);
    int n, m;
    cin >> n >> m;
    for (int i = 1; i < n; ++i) {
        int x, y, val;
        cin >> x >> y >> val;
        --x;
        --y;
        G[x].push_back(pi(y, val));
        G[y].push_back(pi(x, val));
    }
    dfs(0, 0);
    C[0] = MAXN + 1;
    for (int i = 0; i < m; ++i) {
        int x, y;
        cin >> x >> y;
        --x;
        --y;
        if (B[x].first > B[y].first) swap(x, y);
        if (B[x].second > B[y].second) {
            Q[i].x = B[x].first + 1;
            Q[i].y = B[y].first;
        }
        else {
            Q[i].x = B[x].second;
            Q[i].y = B[y].first;
        }
        Q[i].ind = i;
    }
    sort(Q, Q + m, comp);
    int a = 0;
    int b = -1;
    for (int i = 0; i < m; ++i) {
        while (b < Q[i].y) {
            ++b;
            add(b);

        }
        while (a > Q[i].x) {
            --a;
            add(a);
        }
        while (b > Q[i].y) {
            add(b);
            --b;
        }
        while (a < Q[i].x) {
            add(a);
            ++a;
        }
        for (int w = 0; w < SQRT; ++w) {
            if (S[w] < SQRT) {
                for (int j = 0; j < SQRT; ++j) {
                    if (D[w][j] == 0) {
                        R[Q[i].ind] = w*SQRT + j;
                        break;
                    }
                }
                break;
            }
        }
    }
    for (int i = 0; i < m; ++i) cout << R[i] << endl;
}

→ Ответить

»

_LNHTD_

6 лет назад, скрыть # ^ |

← Rev. 3 →

0

In your code, I saw this:

if (B[x].first > B[y].first) swap(x, y);
        if (B[x].second > B[y].second) {
            Q[i].x = B[x].first + 1;
            Q[i].y = B[y].first;
        }
        else {
            Q[i].x = B[x].second;
            Q[i].y = B[y].first;
        }
        Q[i].ind = i;

Can " if (B[x].second > B[y].second) " replace LCA(x, y) == x ?

I also want to ask does the O( N * sqrt(N) * log(N) + N * log (N) ) algorithm pass this problem ?

→ Ответить

»

sussy_Boi

12 месяцев назад, скрыть # ^ |

0

If you can merge the split(ed) queries back pretty fast, then does that mean this problem can also be solved using HLD? (I am just curious.)

→ Ответить

»

SProf

10 лет назад, скрыть # |

0

can it gives me tle,if i can't use weight compress?

→ Ответить

»

10 лет назад, скрыть # ^ |

0

If you do not compress weights, you'll need a map and that would add an additional log(n) factor. However, you might be able to squeeze your solution within the TL with an unordered_map.

→ Ответить

»

meshanya

10 лет назад, скрыть # |

+39

BTW, there is a standard solution for the first problem (see this link in Russian). For each of the colors order all the vertices of this color according to the dfs traversal, let the vertices be labelled v₁, v₂, ..., v_k. Add +1 to each of these verticies, and add -1 to the LCAs of the neighboring vertices lca(v₁, v₂), lca(v₂, v₃), ..., lca(v_k - 1, v_k). If you sum up the values inside a subtree, you get the number of distinct elements in it.

Since the ordering can be done in O(n), and in theory you can answer lca queries for a static tree in O(1) with O(n) pre-processing, you have a linear solution (assuming 0 ≤ A[x] < N).

→ Ответить

»

10 лет назад, скрыть # ^ |

0

Thanks! This idea is pretty cool :)

→ Ответить

»

-synx-

9 лет назад, скрыть # ^ |

+3

I understand the merging sets optimization!
Can you explain how we can utilize this in Problem 1 (unique elements in subtree) to achieve $\text{[math]}$ ?
As far as I see, lets say $\text{[math]}$
then we want, to change $\text{[math]}$
which can be done optimally when $\text{[math]}$ ,
how do you propose to do it for $\text{[math]}$ ?

→ Ответить

»

howsiwei

10 лет назад, скрыть # |

+5

Isn't the time complexity of Mo's algorithm O(N*sqrt(Q)) instead of O(Q*sqrt(N))?

→ Ответить

»

10 лет назад, скрыть # ^ |

0

The complexity of Mo's depends on the number of times we increment/decrement the curL, curR variables. This link explains the time complexity of Mo's algorithm.

→ Ответить

»

howsiwei

10 лет назад, скрыть # ^ |

← Rev. 2 →

+10

If the size of each block is k, then the time complexity of moving the left pointer is O(Q*k) and the time complexity of moving the right pointer is O(N/k*N). The optimal value of k is N/sqrt(Q) which results in total time complexity O(N*sqrt(Q)).

→ Ответить

»

https://ideone.com/MG3XbK

10 лет назад, скрыть # ^ |

0

That is true. However, in most cases upper bounds on Q and N are equal (or pretty close), so it doesn't make a difference.

→ Ответить

»

aashishkr

10 лет назад, скрыть # |

0

I wrote the code for COT2 judge gives runtime error at 10th testcase pls help me i cant find the error thanks in advance

pls see my code

→ Ответить

»

Peregrine_Falcon

7 лет назад, скрыть # ^ |

0

I think it's for the weight value. They haven't said anything about the limit. I compressed the weights.

→ Ответить

»

sbansalcs

10 лет назад, скрыть # |

+19

Awesome Tutorial!

→ Ответить

»

10 лет назад, скрыть # ^ |

+3

Thank you!

→ Ответить

»

alphaguy4

10 лет назад, скрыть # |

0

Can anyone explain how to linearize the tree .. (Not binary tree but any tree in general)

As in Problem 1..

→ Ответить

»

10 лет назад, скрыть # ^ |

0

Click

→ Ответить

»

uttom

10 лет назад, скрыть # |

0

I got Runtime Error. Here is my code There is any wrong my crealting lca tabel or anything else. Thanks in advance.

→ Ответить

»

ankeshgupta007

10 лет назад, скрыть # |

0

amazing tutorial!

→ Ответить

»

stould

10 лет назад, скрыть # |

← Rev. 3 →

0

If the tree store the values on the edges, you could store these values on the children (going from the root), and change the Case 2 to:

if(P == u || P == v) check(P);
.... asnwer the query
if(P == u || P == v) check(P);

→ Ответить

»

mochow

10 лет назад, скрыть # |

0

Why don't you write more tutorials?

→ Ответить

»

10 лет назад, скрыть # |

+3

COT2 code link doesnt work .

→ Ответить

»

10 лет назад, скрыть # ^ |

+7

Updated.

→ Ответить

»

10 лет назад, скрыть # |

0

Why does this get WA for COT2 :/ ?

→ Ответить

»

ace_pocket

10 лет назад, скрыть # |

+5

Could you explain your idea for the problem of finding number of inversions in a (u, v) path in a Tree T.

→ Ответить

»

9 лет назад, скрыть # ^ |

0

Just maintain a BIT during Mo's

→ Ответить

»

Vicennial

10 лет назад, скрыть # |

← Rev. 2 →

0

If I flatten the above tree, my array would be:
8 3 1 6 4 7 10 14 13
Suppose I need to use Mo's algorithm for subtrees(assume I need to find sum of values of each subtree indicated by the query)
For a given query 'Vj' how would I find its end range index in the array?
Eg if given query is node '6', the starting range would be idx 3 and ending would be idx 5.

→ Ответить

»

https://wcipeg.com/wiki/Heavy-light_decomposition

10 лет назад, скрыть # ^ |

+1

Store the starting and ending times for every node during your dfs .

→ Ответить

»

Vicennial

10 лет назад, скрыть # ^ |

0

Thanks, understood it after a bit of googling about discovery/begin/end times.

→ Ответить

»

Tobby_And_Friends

10 лет назад, скрыть # |

0

For a problem like this: http://lightoj.com/volume_showproblem.php?problem=1348 where I need to return sum of all the nodes in a given path & update the value of a node, how should I approach using this technique of linearizing the tree? I mean since I need to ignore nodes which have occurrence of 2 so the range becomes discontinuous for a segment tree structure.

→ Ответить

»

Boxer

9 лет назад, скрыть # ^ |

0

I think you should use Heavy light decomposition(HLD).

→ Ответить

»

aniervs

7 лет назад, скрыть # ^ |

← Rev. 3 →

0

It can be done without HLD. Let be dp[u] the sum of values of path from root downto u. The sum of values on the path from u to v is dp[u]+dp[v]-2*dp[L]+val[L] where L is lca(u,v). When you change the value of a node u, you must change dp[v] for all v belonging to subtree of u. With an euler tour the subtree of a node becomes a continous subarray, so you can easily update it with a segment tree or similars.

→ Ответить

»

arjun95

9 лет назад, скрыть # |

0

How can apply this method if weight is given on edges instead vertices

→ Ответить

»

9 лет назад, скрыть # ^ |

0

Root the tree arbitarily . Map the weight of edge (parent-child) to the child .

→ Ответить

»

arjun95

9 лет назад, скрыть # ^ |

0

and what about the weight of root?

→ Ответить

»

9 лет назад, скрыть # ^ |

0

Are you saying that both edges and nodes have weights ?

→ Ответить

»

arjun95

9 лет назад, скрыть # ^ |

0

no, i mean as you said map the weight of edge (parent-child) to the child but root has no parent so what value is map to the root of the tree?

→ Ответить

»

9 лет назад, скрыть # ^ |

0

nothing is mapped to the root . A tree has n — 1 edges which would be mapped to n — 1 vertices of the tree .

→ Ответить

»

I_love_Captain_America

9 лет назад, скрыть # ^ |

0

assign it an impossible value like -INFINITY. This means whenever you see this value, you know it's not allowed, and you ignore it.

→ Ответить

»

hiddentesla

9 лет назад, скрыть # |

0

how to calculate LCA fast? i only know the O(n) algorithm...

→ Ответить

»

9 лет назад, скрыть # ^ |

+5

Click

→ Ответить

»

CuSO45H2O

9 лет назад, скрыть # |

0

In this case, our query range would be[EN(u), ST(v)] + [ST(P), ST(P)].

Consider on this case, if we select 3 and 8 on the tree given to explain the DFS-Order, the range[EN(u), ST(v)] contains the whole subtree S(5) which is not on our query path. Are we supposed to judge every node in the range or Am I missing something? Thx!

→ Ответить

»

hiddentesla

9 лет назад, скрыть # ^ |

0

you only consider nodes which appear once in the range, so maintain a frequency count of the current nodes, if a node appears twice, remove it from the list of nodes

→ Ответить

»

fane_faiz

9 лет назад, скрыть # |

+4

Man, you nailed the use DFS. BTW thanx for the amazing article.

→ Ответить

»

vatsal

9 лет назад, скрыть # |

0

"One easy way to solve this is to flatten the tree into an array by doing a Preorder traversal" Isn't Preorder traversal done on a binary tree? The given tree may not be a binary tree. Where am I going wrong?

→ Ответить

»