The Ultimate Reroot Template

#	User	Rating
1	tourist	3880
2	jiangly	3669
3	ecnerwala	3654
4	Benq	3627
5	orzdevinwang	3612
6	Geothermal	3569
6	cnnfls_csy	3569
8	jqdai0815	3532
9	Radewoosh	3522
10	gyh20	3447

#	User	Contrib.
1	awoo	161
1	maomao90	161
3	adamant	156
4	maroonrk	153
5	-is-this-fft-	148
5	atcoder_official	148
5	SecondThread	148
8	Petr	147
9	nor	144
10	TheScrasse	142

Hi Codeforces!

Have you ever had this issue before?

If yes, then you have come to the right place! This is a blog about my super easy to use template for (reroot) DP on trees. I really believe that this template is kind of revolutionary for solving reroot DP problems. I've implemented it both in Python and in C++ (the template supports Python2, Python3 and >= C++14). Using this template, you will be able to easily solve > 2000 rated reroot problems in a couple of minutes, with a couple of lines of code.

A big thanks goes out to everyone that has helped me by giving feedback on blog and/or discussing reroot with me, nor, meooow, qmk, demoralizer, jeroenodb, ffao. And especially a huge thanks goes to nor for helping out making the C++ version of the template.

1. Introduction / Motivation

As an example, consider this problem: 1324F - Maximum White Subtree. The single thing that makes this problem difficult is that you need for every node $$$u$$$ find the maximum white subtree containing $$$u$$$. Had this problem only asked to find the answer for a specific node $$$u$$$, then a simple dfs solution would have worked.

Simple dfs solution

def dfs(node, parent=-1):
  nodeDP = 0
  for nei in graph[node]:
    if nei != parent:
      nodeDP = nodeDP + max(dfs(nei, node), 0)
  return nodeDP + 2 * color[node] - 1
print(dfs(u))

But 1324F - Maximum White Subtree requires you to find the answer for every node. This forces you to use a technique called rerooting. Long story short, it is a mess to code. Maybe you could argue that for this specific problem it isn't all that bad. But it is definitely not as easy to code as the dfs solution above.

What if I told you that it is possible to take the logic from the dfs function above, put it inside of a "black box", and get the answer for all $$$u$$$ in $$$O(n \log n)$$$ time? Well it is, and that is what this blog is all about =)

In order to extract the logic from the simple dfs solution, let us first create a generic template for DP on trees and implement the simple dfs solution using its interface. Note that the following code contain the exact same logic as the simple dfs solution above. It solves the problem for a specific node $$$u$$$.

Simple dfs solution (using treeDP template)

def treeDP(root, graph, default, combine, finalize = lambda nodeDP,node,eind: nodeDP):
  DP = [0] * len(graph)
  def dfs(node, parent=-1):
    nodeDP = default[node]
    for eind, nei in enumerate(graph[node]):
      if nei != parent:
        neiDP = dfs(nei, node)
        nodeDP = combine(nodeDP, neiDP, node, eind)
    parent_eind = -1 if parent == -1 else graph[node].index(parent)
    DP[node] = finalize(nodeDP, node, parent_eind)
    return DP[node]
  
  dfs(root)
  return DP

default = [0] * n

def combine(nodeDP, neiDP, node, eind):
  return nodeDP + max(neiDP, 0)

def finalize(nodeDP, node, eind):
  return nodeDP + 2 * color[node] - 1

print(treeDP(u, graph, default, combine, finalize)[u])

Now, all that remains to solve the full problem is to switch out the treeDP function with the ultimate reroot template. The template returns the output of treeDP for every node $$$u$$$, in $$$O(n \log n)$$$ time! It is just that easy. 240150867

Solution to problem F using the ultimate reroot template

default = [0] * n

def combine(nodeDP, neiDP, node, eind):
  return nodeDP + max(neiDP, 0)

def finalize(nodeDP, node, eind):
  return nodeDP + 2 * color[node] - 1

rootDP, _, _ = rerooter(graph, default, combine, finalize)
print(*rootDP)

The takeaway from this is example is that the reroot template makes it almost trivial to solve complicated reroot problems. For example, suppose we modify 1324F - Maximum White Subtree such that both nodes and edges have colors. Normally this modification would be complicated and would require an entire overhaul of the solution. However, with the ultimate reroot template, the solution is simply:

Solution to problem F if both edges and nodes are colored

default = [0] * n

def combine(nodeDP, neiDP, node, eind):
  return nodeDP + max(neiDP + 2 * edge_color[node][eind] - 1, 0)

def finalize(nodeDP, node, eind):
  return nodeDP + 2 * color[node] - 1

rootDP, _, _ = rerooter(graph, default, combine, finalize)
print(*rootDP)

2. Collection of reroot problems and solutions

Here is a collection of reroot problems on Codeforces, together with some short and simple solutions in both Python and C++ using the rerooter template. These are nice problems to practice on if you want to try out using this template. The difficulty rating ranges between 1700 and 2600. I've also put together a GYM contest with all of the problems: Collection of Reroot DP problems (difficulty rating 1700 to 2600).

(1700 rating) 219D - Choosing Capital for Treeland: Python solution, C++ solution
(2300 rating) 543D - Road Improvement: Python solution, C++ solution
(2200 rating) 592D - Super M: Python solution, C++ solution
(2600 rating) 627D - Preorder Test: Python solution, C++ solution
(2100 rating) 852E - Casinos and travel: Python solution, C++ solution
(2300 rating) 960E - Alternating Tree: Python solution, C++ solution; Or alternatively using "edge DP":; Python solution, C++ solution
(1900 rating) 1092F - Tree with Maximum Cost: Python solution, C++ solution
(2200 rating) 1156D - 0-1-Tree: Python solution, C++ solution
(2400 rating) 1182D - Complete Mirror: Python solution, C++ solution
(2100 rating) 1187E - Tree Painting: Python solution, C++ solution
(2000 rating) 1294F - Three Paths on a Tree: Python solution, C++ solution
(1800 rating) 1324F - Maximum White Subtree: Python solution, C++ solution
(2500 rating) 1498F - Christmas Game: Python solution, C++ solution
(2400 rating) 1626E - Black and White Tree: Python solution, C++ solution
(2500 rating) 1691F - K-Set Tree: Python solution, C++ solution
(2400 rating) 1794E - Labeling the Tree with Distances: Python solution, C++ solution
(2500 rating) 1796E - Colored Subgraphs: Python solution, C++ solution
(1700 rating) 1881F - Minimum Maximum Distance: Python solution, C++ solution
104008G - Group Homework: Python solution, C++ solution
104665H - Alice Learns Eertree!: Python solution, C++ solution

3. Understanding the `rerooter` black box

The following is the black box rerooter implemented naively:

Template (naive O(n^2) version)

def treeDP(root, graph, default, combine, finalize = lambda nodeDP,node,eind: nodeDP):
  DP = [0] * len(graph)
  def dfs(node, parent=-1):
    nodeDP = default[node]
    for eind, nei in enumerate(graph[node]):
      if nei != parent:
        neiDP = dfs(nei, node)
        nodeDP = combine(nodeDP, neiDP, node, eind)
    parent_eind = -1 if parent == -1 else graph[node].index(parent)
    DP[node] = finalize(nodeDP, node, parent_eind)
    return DP[node]
  
  dfs(root)
  return DP

def rerooter(graph, default, combine, finalize = lambda nodeDP,node,eind: nodeDP):
  n = len(graph)
  rootDP = []
  edgeDP = {}
  for root in range(n):
    DP = treeDP(root, graph, default, combine, finalize)
    rootDP.append(DP[root])
    for nei in graph[root]:
      edgeDP[root, nei] = DP[nei]
    
  forwardDP = [[edgeDP[node, nei] for nei in graph[node]] for node in range(n)]
  reverseDP = [[edgeDP[nei, node] for nei in graph[node]] for node in range(n)]

  return rootDP, forwardDP, reverseDP

rerooter outputs three variables.

rootDP is a list, where rootDP[node] = dfs(node).
forwardDP is a list of lists, where forwardDP[node][eind] = dfs(nei, node), where nei = graph[node][eind].
reverseDP is a list of lists, where reverseDP[node][eind] = dfs(node, nei), where nei = graph[node][eind].

If you don't understand the definitions of rootDP/forwardDP/reverseDP, then I recommend reading the naive $$$O(n^2)$$$ implementation of rerooter. It should be fairly self explanatory.

The rest of this blog is about the techniques of how to make rerooter run in $$$O(n \log n)$$$. So if you just want to use rerooter as a black box, then you don't have to read or understand the rest of this blog.

One last remark. If you've ever done rerooting before, you might recall that rerooting usually runs in $$$O(n)$$$ time. So why does this template run in $$$O(n \log n)$$$? The reason for this is that I restrict myself to use the combine function in a left folding procedure, e.g. combine(combine(combine(nodeDP, neiDP1), neiDP2), neiDP3). My template is not allowed to do for example combine(nodeDP, combine(neiDP1, combine(neiDP2, neiDP3))). While this limitation makes the template run slower, $$$O(n \log n)$$$ instead of $$$O(n)$$$, it also makes it a lot easier to use the template. If you still think that the $$$O(n)$$$ version is superior, then I don't think you've understood how nice and general the $$$O(n \log n)$$$ version truly is.

4. Rerooting and exclusivity

The general idea behind rerooting is that we first compute the DP as normal for some arbitrary node as the root (I use node = 0 for this). After we have done this we can "move" the root of the tree by updating the DP value of the old root and the DP value of a neighbour to the old root. That neighbour then becomes the new root.

Let $$$u$$$ denote the current root, and let $$$v$$$ denote the neighbour of $$$u$$$ that we want to move the root to. At this point, we already know the value of dfs(v, u) since $$$u$$$ is the current root. But in order to be able to move the root from $$$u$$$ to $$$v$$$, we need to find the new DP value of $$$u$$$, i.e. dfs(u, v).

If we think about this in terms of forwardDP and reverseDP, then we currently know forwardDP[u], and our goal is to compute reverseDP[u]. This can be done naively in $$$O(\text{deg}(u)^2)$$$ time with a couple of for loops by calling combine $$$O(\text{deg}(u)^2)$$$ times, and then calling finalize $$$O(\text{deg}(u))$$$ times.

The bottle neck here are the $$$O(\text{deg}(u)^2)$$$ calls to combine. So for now, let us separate out the part of the code that calls combine from the rest of the code into a function called exclusive. The goal of the next section will then be to speed up the naively implemented exclusive function to run in $$$O(\text{deg}(u) \text{log} (\text{deg}(u)))$$$ time.

Rerooting using exclusivity (O(sum deg^2) version)

def exclusive(A, zero, combine, node):
    n = len(A)

    exclusiveA = [zero] * n
    for i in range(n):
        for j in range(n):
            if i != j:
                exclusiveA[i] = combine(exclusiveA[i], A[j], node, j)
    return exclusiveA

def treeDP(root, graph, default, combine, finalize = lambda nodeDP,node,eind: nodeDP):
  DP = [0] * len(graph)
  def dfs(node, parent=-1):
    nodeDP = default[node]
    for eind, nei in enumerate(graph[node]):
      if nei != parent:
        neiDP = dfs(nei, node)
        nodeDP = combine(nodeDP, neiDP, node, eind)
    parent_eind = -1 if parent == -1 else graph[node].index(parent)
    DP[node] = finalize(nodeDP, node, parent_eind)
    return DP[node]
  
  dfs(root)
  return DP

def rerooter(graph, default, combine, finalize = lambda nodeDP,node,eind: nodeDP):
    n = len(graph)
    rootDP = [0] * n
    forwardDP = [None] * n
    reverseDP = [None] * n
 
    # Compute DP for root=0
    DP = treeDP(0, graph, default, combine, finalize)
     
    # Do rerooting using the exclusive function
    def dfs(node, parent=-1, parent_eind=-1):
        forwardDP[node] = [DP[nei] for nei in graph[node]]
        rerootDP = exclusive(forwardDP[node], default[node], combine, node)
        reverseDP[node] = [finalize(nodeDP, node, eind) for eind, nodeDP in enumerate(rerootDP)]
        nodeDP = combine(rerootDP[0], forwardDP[node][0], node, 0) if n > 1 else default[node]
        rootDP[node] = finalize(nodeDP, node, -1)
        for eind, nei in enumerate(graph[node]):
            if nei != parent:
                DP[node] = reverseDP[node][eind]
                dfs(nei, node, eind)
    dfs(0)
    return rootDP, forwardDP, reverseDP

5. The exclusive segment tree

We are almost done implementing the fast reroot template. The only operation left to speed up is the function exclusive. Currently it runs in $$$O(\sum \text{deg}^2)$$$ time. The trick to make exclusive run in $$$O(\sum \text{deg} \log{(\text{deg})})$$$ time is to create something similar to a segment tree.

Suppose you have a segment tree where each node in the segment tree accumulates all of the values outside of its interval. The leaves of such a segment tree can then be used as the output of exclusive. I call this data structure the exclusive segment tree.

Example: Exclusive segment tree of size n = 8

The exclusive segment tree is naturally built from top to bottom, taking $$$O(n \log n)$$$ time. Here is an implementation of rerooter using the exclusive segment tree:

Rerooting using exclusivity (O(sum deg log(deg)) version)

def exclusive(A, zero, combine, node):
    n = len(A)
    exclusiveA = [zero] * n # Exclusive segment tree
 
    # Build exclusive segment tree
    for bit in range(n.bit_length())[::-1]:
        for i in range(n)[::-1]:
            # Propagate values down the segment tree    
            exclusiveA[i] = exclusiveA[i // 2]
        for i in range(n & ~(bit == 0)):
            # Fold A[i] into exclusive segment tree
            ind = (i >> bit) ^ 1
            exclusiveA[ind] = combine(exclusiveA[ind], A[i], node, i)
    return exclusiveA

def treeDP(root, graph, default, combine, finalize = lambda nodeDP,node,eind: nodeDP):
  DP = [0] * len(graph)
  def dfs(node, parent=-1):
    nodeDP = default[node]
    for eind, nei in enumerate(graph[node]):
      if nei != parent:
        neiDP = dfs(nei, node)
        nodeDP = combine(nodeDP, neiDP, node, eind)
    parent_eind = -1 if parent == -1 else graph[node].index(parent)
    DP[node] = finalize(nodeDP, node, parent_eind)
    return DP[node]
  
  dfs(root)
  return DP

def rerooter(graph, default, combine, finalize = lambda nodeDP,node,eind: nodeDP):
    n = len(graph)
    rootDP = [0] * n
    forwardDP = [None] * n
    reverseDP = [None] * n
 
    # Compute DP for root=0
    DP = treeDP(0, graph, default, combine, finalize)
     
    # Do rerooting using the exclusive function
    def dfs(node, parent=-1, parent_eind=-1):
        forwardDP[node] = [DP[nei] for nei in graph[node]]
        rerootDP = exclusive(forwardDP[node], default[node], combine, node)
        reverseDP[node] = [finalize(nodeDP, node, eind) for eind, nodeDP in enumerate(rerootDP)]
        nodeDP = combine(rerootDP[0], forwardDP[node][0], node, 0) if n > 1 else default[node]
        rootDP[node] = finalize(nodeDP, node, -1)
        for eind, nei in enumerate(graph[node]):
            if nei != parent:
                DP[node] = reverseDP[node][eind]
                dfs(nei, node, eind)
    dfs(0)
    return rootDP, forwardDP, reverseDP

This algorithm runs in $$$O(\sum \text{deg} \log{(\text{deg})})$$$, so we are essentially done. However, this implementation uses recursive DFS which especially for Python is a huge drawback. Recursion in Python is both relatively slow and increadibly memory hungry. So for a far more practical version, I've also implemented this same algorithm using a BFS instead of a DFS. This gives us the final version of the ultimate rerooter template!

Rerooting using exclusivity (O(sum deg log(deg)) version with BFS)

def exclusive(A, zero, combine, node):
    n = len(A)
    exclusiveA = [zero] * n # Exclusive segment tree
 
    # Build exclusive segment tree
    for bit in range(n.bit_length())[::-1]:
        for i in range(n)[::-1]:
            # Propagate values down the segment tree    
            exclusiveA[i] = exclusiveA[i // 2]
        for i in range(n & ~(bit == 0)):
            # Fold A[i] into exclusive segment tree
            ind = (i >> bit) ^ 1
            exclusiveA[ind] = combine(exclusiveA[ind], A[i], node, i)
    return exclusiveA
 
def rerooter(graph, default, combine, finalize = lambda nodeDP,node,eind: nodeDP):
    n = len(graph)
    rootDP = [0] * n
    forwardDP = [None] * n
    reverseDP = [None] * n
 
    # Compute DP for root=0
    DP = [0] * n
    bfs = [0]
    P = [0] * n
    for node in bfs:
        for nei in graph[node]:
            if P[node] != nei:
                P[nei] = node
                bfs.append(nei)
 
    for node in reversed(bfs):
        nodeDP = default[node]
        for eind, nei in enumerate(graph[node]):
            if P[node] != nei:
                nodeDP = combine(nodeDP, DP[nei], node, eind)
        DP[node] = finalize(nodeDP, node, graph[node].index(P[node]) if node else -1)
    # DP for root=0 done
    
    # Use the exclusive function to reroot 
    for node in bfs:
        DP[P[node]] = DP[node]
        forwardDP[node] = [DP[nei] for nei in graph[node]]
        rerootDP = exclusive(forwardDP[node], default[node], combine, node)
        reverseDP[node] = [finalize(nodeDP, node, eind) for eind, nodeDP in enumerate(rerootDP)]
        rootDP[node] = finalize((combine(rerootDP[0], forwardDP[node][0], node, 0) if n > 1 else default[node]), node, -1)
        for nei, dp in zip(graph[node], reverseDP[node]):
            DP[nei] = dp
    return rootDP, forwardDP, reverseDP

using it = vector<size_t>::iterator; function<void(it, it, const S&)> exclude = [&](it l, it r, const S &val) { if(l == r) return ; if(l+1 == r) { up[*l] = val; return ; } auto m = l + (r-l)/2; exclude(l, m, accumulate(m, r, val, cf)); exclude(m, r, accumulate(l, m, val, cf)); };

Comments (61)

Write comment?

GOD_0F_DEATH

7 months ago, # |

God tier blog!

→ Reply

arvindf232

There is one minor point that confused me a bit: "Only-left folding" refers to only accumulating values of dp from child to parent, and not allowing combining between various children. It does not demand that the children are always processed in the same order. Of course, no problems I have solved ever violated this assumption.

I would like to mention something similar that I have been using: In the blog you basically explained two options, but I usually do the third option:

Give up a factor of (upto) $$$O(\log n)$$$ , or
Give up left folding, or
Assume Reversibility

The third option, in the terminology of your blog, it would be making both a combine and De-combine function. This option is much closer to typical implementation of a reroot tree DP. Sum and Xor can be easily reversed. Children max are a bit more difficult but can be achieved by keeping two maximum values. Thus this is easily O(n). Most of the time, the reverse function is, well, exact reverse of the combine function. Comparing the first option and the third option, this saved the $$$\log $$$ by double code length of the combine function, so it could be worth it -- doubling something two to three lines isn't really so bad.

It is a shame that there is no perfect template that does not have to give something up. It is also unfortunate that all three methods used slightly different representations so it is hard to code all three of them in the same template.

pajenegod

7 months ago, # ^ |

Speaking about that. We (me and nor) have made many different versions of the template. For example we have a version that supports an optional de-combine argument.

If you are interested in that, then take a look at these two C++ submissions 240160904, 240160922. The first submission calls rerooter without a de-combine function (running in $$$O(n \text{log} (n))$$$). The second submission calls rerooter with a de-combine function (so it runs in $$$O(n)$$$ time).

Here is what I've noticed after solving a ton of reroot problems on Codeforces.

I've yet to find a reroot problem where C++ TLE's with the $$$O(n \text{log} (n))$$$ template. In fact, using Python I've only TLEd once (on problem 627D - Preorder Test).
I usually don't see that big of a speedup going from $$$O(n \text{log} (n))$$$ to $$$O(n)$$$. This might be because the worst case input (a star) is missing from the test data. Note that for random trees (ish bounded max degree), even the $$$O(n \text{log} (n))$$$ version of the template runs in $$$O(n)$$$.

nor

← Rev. 2 →

+11

In case someone finds the BFS non-intuitive (you should definitely check it out by the way, it's really cool), and prefer keeping a DFS in your template in the off-chance that you need to modify the template (which should happen very rarely if not never), here are two submissions with the DFS template we came up with: 240165750 and 240165755.

Note that this will have very similar speed to the BFS template, but in terms of memory usage, this takes more memory than BFS due to recursion. It has never been an issue in all the problems we've seen, but the memory overhead is somewhat significant, so don't get rid of your BFS template in case you encounter a problem with a tight ML.

Golovanov399

Is it really safe to append to the list you are iterating over in python?

Surprisingly (for C++ users) yes. Iteration in Python is a completely different beast from what we assume in C++ — iterators are much more complicated in terms of book-keeping (even iteration itself is done by catching an exception each iteration).

+34

From what I understand, the answer to that is yes. In general, standard library data-structures in Python throw exceptions if it isn't ok. Take for example

A = {1, 2, 3}
for a in A:
  A.add(4)

It throws an exception: RuntimeError: set changed size during iteration.

I'm personally a huge fan of appending to lists while iterating over them. It runs very fast, and it makes coding BFS super easy. Probably my favourite Python algorithm ever is this BFS algorithm I made for trees

BFS = [0]
for u in BFS:
  for v in graph[u]:
    graph[v].remove(u)
  BFS += graph[u]

One of the reasons why this is so nice is that you can reuse the BFS list. For example, if you want to compute subtree_size, then you can just add on

subtree_size = [1] * n
for u in BFS[::-1]:
  subtree_size[u] += sum(subtree_size[v] for v in graph[u])

oversolver

1) more problems:

592D - Super M

1156D - 0-1-Tree

1332F - Independent Set

2) as I see your template not works with forests. might be useful here:

1101D - GCD Counting

ABC248G

3) by my experience exlusive on c++ works faster with recursive version instead of bit magic version. is this ok (nor)?

4) why this template requires 3 lambdas? As I know with 3 we can acheive $$$O(n)$$$ time and advantage of this $$$O(nlogn)$$$ method that we need only 2 lambdas to describe. I didn't solve all problems yet, do I miss something?

Can you share the recursive version you're using?

full code: 240168593 (1825 ms)

same code but with exclude from blog: 240203678 (3000 ms)

I tried using this in the template in the blog. The performance gains seem insignificant if you think about the granularity of CF timing (the gain is around 2 times the least count of the timer that CF uses), so in another submission, it might as well may perform worse than the submission in the blog.

Template: https://mirror.codeforces.com/contest/960/submission/240131037

Template but recursive exclusive: https://mirror.codeforces.com/contest/960/submission/240230032

It is nowhere near the difference that you showed in your submissions. The drastic difference in your submissions is most likely because of the fact that you allocate the vector and copy its contents, on top of doing the same work that the recursive code is doing (allocations are very costly and can easily overpower an O(n log n) algorithm in terms of cost).

Why you tried on task where input takes probably half of the time and solution are fast? :D

Nuh, allocations are nothing. Atleast I moved allocation of vector out of lambda and nothing changes. I think recursion is more cache friendly.

← Rev. 5 →

I submitted on the same problem, and the results didn't change with our template.

Template (3119ms): https://mirror.codeforces.com/contest/627/submission/240130592

Template but recursive exclusive (2931ms — 7% better, probably not significant, my other submissions on the same problem have a large variance, plus with larger execution times, time variance tends to be higher): https://mirror.codeforces.com/contest/627/submission/240232312

Template but without explicitly returning/storing all of edge_dp/redge_dp (1606ms — 2x speedup): https://mirror.codeforces.com/contest/627/submission/240234284

On a quick skim of your submissions, I am not exactly sure where the bottlenecks are, but you don't seem to be computing edge_dp/redge_dp, so there is no one-to-one correspondence between your submission and the submissions using this template — in all likelihood the 1606ms submission is the one you should be comparing your submission to.

On the gap between your submissions — it might be possible that a[i] is a very bad order for the cache. But in my experience, allocations are much more unpredictable and can take arbitrary amounts of time — this comes from doing some serious constant-optimizations and getting fastest solutions on online judges. It is especially bad when you do many small vector allocations, for example, the code that takes 1606ms also doesn't do repeated allocations (i.e., calls to reserve) that we were doing earlier.

Also, if you care about performance, I would recommend not using std::function unless you really need type erasure for things like storing lambdas in a vector (in which case you can just convert a lambda to std::function which is easy to do).

6 months ago, # ^ |

I've added 592D - Super M and 1156D - 0-1-Tree to the list. But, I'm not sure why 1332F - Independent Set should be considered a reroot problem. As far as I can tell it is just a standard tree DP problem.

I agree that it might be nice to add support for forests to the template. Doing that would at the same time add support for 1-indexing. But, it would also make the template itself more complicated.

o__x

pajenegod could you please explain the specific purpose of merge_into, finalize_merge and base in the cpp template? Thank you.

← Rev. 4 →

+10

Here is a step guide for how to use the reroot template. I will use 1324F - Maximum White Subtree as an example.

1. Write some code/pseudocode that solves the problem for a specific root u.

2. Identify `default`, `combine` and `finalize` (which in the C++ version of the template are called `base`, `merge_into`, `finalize_merge `).

3. Fill in `default`, `combine` and `finalize` and call `rerooter`.

kevaljain

← Rev. 3 →

sorry, messed up the input format. i wish cf had an option to delete comments

demoralizer

+28

Me wondering when did I ever discuss rerooting with pajengod orz sir. (probably 2-3years ago on AC or Errichto discord server)

+25

Yeah. I initially started working on this blog like 2 years ago. Sometimes around that time, I remember discussing it with you over on the AC server. I never finished it back then. But this year I had some spare time of Christmas so I decided to finally finish up the blog.

defnotmee

Setters for the next round are shrivelling in fear that their rerooting problem will get DESTROYED by The Ultimate Reroot Template™

NeroZein

Great Blog !

Btw, what's that "edge DP" technique ?

"edge DP" stores the dp value of a subtree (if you had done it without thinking about rerooting). Note that any of the endpoints of an edge can be a root, so you need to specify which endpoint is the root.

+16

Here is the idea behind "Edge DP". Consider some edge $$$(u,v)$$$ with edge_index $$$i$$$ (meaning graph[u][i] is $$$v$$$).

So we are in this situation:

Suppose now that we would want to analyze something about this edge. Maybe we want to know what happens if we remove (u,v) from the tree. Or maybe we want to know how many paths with some certain property that pass through (u,v). There are then two natural questions:

1. What is the DP value of the subtree with subtree root v?

2. What is the DP value of the subtree with subtree root u?

Using the reroot template, the answer to 1 is given by forwardDP[u][i] and the answer to 2 is given by reverseDP[u][i].

So not only can the reroot template be used to solve standard reroot problems. The template can also be used to solve "Edge DP" problems, where you are tasked to compute some property of each edge.

BERNARD

See 868E - Policeman and a Tree

lrvideckis

more problems https://open.kattis.com/problems/joiningnetwork https://open.kattis.com/problems/fulldepthmorningshow

gardengnome

Thanks for this, very cool. I have taken your template and put in a Python class. Extending that class and overwriting the default_value, combine and finalise methods in line with this blog is all that's required, see here: https://mirror.codeforces.com/contest/1324/submission/240262918.

ko_osaga

+56

If you still think that the $$$O(n)$$$ version is superior, then I don't think you've understood how nice and general the $$$O(n \log n)$$$ version truly is.

What? Linear solution is easy, just store all prefix/suffix aggregates. Code

-15

The $$$O(n \text{log} (n))$$$ reroot template only ever merge a child into its parent (using the combine function). On the other hand, the $$$O(n)$$$ method of using prefix/suffix merges multiple children into each other in some weird way. This is a fundamental difference that makes the $$$O(n \text{log} (n))$$$ template a lot easier to use, especially on more complicated reroot problems.

+51

I feel when the post talks about the faster $$$O(n)$$$ version, it repeatedly uses strong words such as "weird, fundamental, superior" while avoiding any actual comparison, which makes it feel very fishy. You have a weird restriction of merging a child into its parent and it slows up your solution. I think it is unnecessary. Can you elaborate, with sufficient examples, why that is an "unweird" assumption and why the natural order of children is "weird"?

Maybe 219D - Choosing Capital for Treeland is a good example to show my point.

First as a warm up, suppose we want to solve the fixed root version of the problem: "If the capital is at node $$$1$$$, how many roads would have to be inverted?". The solution to this problem is simply

Solution for a fixed root

n = int(input())
 
graph = [[] for _ in range(n)]
direction = [[] for _ in range(n)]
for _ in range(n - 1):
    u,v = [int(x) - 1 for x in input().split()]
    graph[u].append(v)
    graph[v].append(u)
    direction[u].append(0)
    direction[v].append(1)

def dfs(node, parent=-1):
  nodeDP = 0
  for eind, nei in enumerate(graph[node]):
    if nei != parent:
      neiDP = dfs(nei, node)
      nodeDP = nodeDP + neiDP + direction[node][eind]
  return nodeDP
print(dfs(0))

To solve the full problem, we can essentially plug this code into the $$$O(n \text{log} (n))$$$ reroot template.

Solution to 219D using reroot template

240150642

n = int(input())
 
graph = [[] for _ in range(n)]
direction = [[] for _ in range(n)]
for _ in range(n - 1):
    u,v = [int(x) - 1 for x in input().split()]
    graph[u].append(v)
    graph[v].append(u)
    direction[u].append(0)
    direction[v].append(1)

default = [0] * n
 
def combine(nodeDP, neiDP, node, eind):
    return nodeDP + neiDP + direction[node][eind]
 
def finalize(nodeDP, node, eind):
    return nodeDP
 
rootDP, forwardDP, reverseDP = rerooter(graph, default, combine, finalize)
 
ans = min(rootDP)
print(ans)
print(*[node + 1 for node in range(n) if rootDP[node] == ans])

On the other hand, if you want to solve 219D - Choosing Capital for Treeland in $$$O(n)$$$ using prefix/suffix, then I don't directly see how to do it. I'm sure it is still possible. But it is definitely not as easy as solving the fixed root version of the problem.

Come on, with my $$$O(n)$$$ template it's just "implement a+b": Code

I don't think you've understood the $$$O(n)$$$ solution.

-28

I mean, yes, you've solved the problem. But from what I can gather, you had to make rather messy changes to your "template".

gph[y].erase(find(all(gph[y]), pi{i ^ 1, x}));
pae[y] = (i ^ 1);

and

if (~pae[z]) pref[0] = up_root(rev_dp[z], pae[z]);
for (int i = 0; i < sz(gph[z]); i++) {
  pref[i + 1] = suff[i] = 
  up_root(dp[gph[z][i][1]], gph[z][i][0]);
}

Would you agree with me that what you have done here is more complicated than solving the fixed root version of the problem?

+30

I did not changed them from my github template...

shash4321

+13

Here is an O(n) implementation of 219D using prefix/suffix. The problem-specific details are in solve (for I/O) and the ART struct.

219D

-38

This method requires only two functions to implement — this is only advantage, less thinking. In most problems with additional $$$O(logn)$$$ factor solution still fits into TL.

+15

If I were (1) cautious and (2) able to give up some functionality, I can remove the E() function, and merge take_vertex and up_root function into one. I don't think we gain too much from such "less thinking".

-25

Cool, but I can't see how they can be merged

Just like

elem take_vertex(elem DP, int v) 
elem up_root(elem DP, int e)

elem take_vertex_and_up_root(elem DP, int v, int e)

which is probably what the author is doing, I suppose.

I think very last take_vertex for each vertex is still required.

OR take_vertex_and_up_root must maintain "nothing" values from e

Yeah, there should be e = NIL option if you insist on the merged function

So after some sleeping I can definitely say that this looks like trash against just one link operation

maybe then stop sleeping and solve more problem to figure why not

-18

Lack of sleep will make me feel this method more convenient? Genius.

phtniit

why the line 45 on your library is vector<elem> pref(sz(gph[z]) + 1); instead of vector<elem> pref(sz(gph[z]) + 1, E());

I guess I thought pref[0] = up_root(rev_dp[z], pae[z]); supplies the base case so it's unnecessary. But it looks sketchy when z is a root... I will just change to vector<elem> pref(sz(gph[z]) + 1, E());.

I try to solve this using your library, but it could not pass the example. After some debugging, i found that line...
Anyway, your template is amazing!
BTW, could you show how to solve this problem in best(your) way, i solve it using your code again, but with struggling work...

Here you go. Note that the inner function (last 10~15 lines of solve) is changed so that it can read the final answer more conveniently. I did not, but you may want to change some design specifics (e.g. adding edge/vertex parameter to take_vertex, up_root or likewise). I generally don't care too much about specific design choices and it's really up to you to figure what could be a way to enhance this.

thanks, learn a lot!
Your code is much simpler than mine.
how could i did not notice that it was two-diameter's combine.

alexwice

can someone paste this working code into a pastebin, i trying to learn but the link wont work.

sorry I fixed.

Here is how I would solve 104008G - Group Homework 240418335

Python solution

n = int(input())
A = [int(x) for x in input().split()]
 
graph = [[] for _ in range(n)]
for _ in range(n - 1):
    u,v = [int(x) - 1 for x in input().split()]
    graph[u].append(v)
    graph[v].append(u)
 
default = [(A[node], 0) for node in range(n)]
 
def combine(nodeDP, neiDP, node, eind):
    diameter, longest_seen_path = nodeDP
    child_diameter, child_longest_path = neiDP
    diameter = max(diameter, child_diameter, longest_seen_path + child_longest_path + A[node])
    longest_seen_path = max(longest_seen_path, child_longest_path)
    return diameter, longest_seen_path
 
def finalize(nodeDP, node, eind):
    node_diameter, longest_seen_path = nodeDP
    return node_diameter, longest_seen_path + A[node]
 
rootDP, forwardDP, reverseDP = rerooter(graph, default, combine, finalize)
 
ans = 0
 
# Two cases
# Case 1. The two paths intersect in exactly one node
for node in range(n):
    # Find the (up to) 4 longest paths ending in the children of node
    paths = [child_longest_path for child_diameter, child_longest_path in forwardDP[node]]
    val = sum(sorted(paths, reverse=True)[:4])
    ans = max(val, ans)
 
# Case 2. The two paths are disjoint
# Find some edge seperating the two paths
for node in range(n):
    for eind, nei in enumerate(graph[node]):
        # Find max length of any two paths seperated by edge (node, nei)
        # I.e. the sum of diameters of the two subtrees seperated by edge (node, nei)
        diameter1,_ = forwardDP[node][eind]
        diameter2,_ = reverseDP[node][eind]
        ans = max(ans, diameter1 + diameter2)
 
print(ans)

Thanks for your sharing.
Your blog (as well as the discussions) really helps me, i am just the one always struggling in rerooting...

One more thing, your solution is not Ultimate: It assumes that the aggregate of any set of subproblems are always small. We can always create a tree DP problem that violates this property and also be solvable. The trick is we just throw in some data structure problems, e.g:

An aggregate is defined as a $$$k$$$-th element of subtree DP values
An aggregate is defined as a mode of subtree DP values
An aggregate is defined as a union of intervals of subtree DP values, that is also encoded into an interval with an appropriate hash.

This makes less sense as a CP problem, but nonetheless, they have a near-linear time solution. So you do make some compromises here.

I guess the possible extra compromise is that I assume associativity, but I don't think that's a big problem, since:

Almost everything is associative
The order of children practically doesn't matter
If it actually does, you can prly get around in a way similar to how you get around weird aggregates listed over there
I'm pretty sure, for a problem where order of children matters in terms of associativity, it will mess up your solution too

TripleM5da

Btw I always assumed you can forget about rerooting and do memoized edge dp but the problem in edge dp is that if your state is $$$dp[edge]$$$ and inside you're iterating over $$$adj[node]$$$ that can go up to $$$O(N^2)$$$ in the case of star trees.

My solution for the above issue would be to calculate that you can calculate the solutions of edges without doing the normal calculation by doing some inc-exec tricks for example :

Let's say I have a star tree and I want max solve(edge).
If that edge is oriented $$$par=u$$$ and $$$node=v$$$
I already calculated the answer for two edges oriented with $$$par=v$$$
Then $$$dp[edge]$$$ can be figured out easily by going through the solution of those two edges.

I tried this once or twice and it worked but I find it very difficult to do from scratch most of the times and it does have it's own fuck ups dpending on the problem

I'm not trying to do this all day, but I just found another reason why your take is not so superior, and this is the most nontrivial and funny one.

What this code does is a special case of rake-compress paradigm, which is important when you want to use Top trees. The implementation of top tree I know requires three kind of subtree aggregation method, and they exactly correspond to what I do:

Non-path edge node (rake), which corresponds to merge function
Path edge node (compress), which corresponds to up_root function
Vertex node, which corresponds to take_vertex function

Note that the compress node isn't exactly the same as what up_root does. While up_root is incremental (add a single edge in the back), compress node merges two long paths. This makes the implementation in top tree harder, because usually you have to store information assuming both side of endpoint being a root, kinda similar as why seg-tree version of maximum subsegment is harder than the linear one. On the other hand, for the rake node it is exactly equivalent to the merge function, and the implementation here is usually straightforward because this is similar to how most tree DP works.

So if you are somehow not used to this way of implementation and having trouble adapting it, maybe now is high time, before chill ds enjoyers copy more top trees to cheese Div1F.

Duelist1234

There is a cf discord server?

McPqndq

+14

shash4321 and I created this template for rerooting: https://github.com/kelly-dance/mactl/blob/main/content/various/AllRoots.h

Our approach uses prefixes/suffixes as discussed by ko_osaga here https://mirror.codeforces.com/blog/entry/124286#comment-1103422.

We assume commutativity and associativity of our merge operation, but do not require reversibility.

Utilizing the template only requires modifying the ART struct. Or occasionally adding .promote(i,-1) on line 65. operator+ should merge aggregates. promote takes the aggregate for some child nodes and upgrades it to be the aggregate representing the parent.

I just finished solving all the problems linked in this post using this template. It was a great collection, thanks for putting this together!

Here are a couple problems that shash4321, asiad, and I have encountered that have not yet been linked by other commenters:

https://mirror.codeforces.com/contest/1825/problem/D1 (210638113)

https://cses.fi/problemset/task/1132 (https://cses.fi/paste/cc3cef8f5f02c90f7c0235/)

https://cses.fi/problemset/task/1133 (https://cses.fi/paste/883468c70b26846c5d50b1/)

https://atcoder.jp/contests/dp/tasks/dp_v (https://atcoder.jp/contests/dp/submissions/49030122) (This problem is what inspired shash4321 to make the template!)

https://mirror.codeforces.com/problemset/problem/1796/E (240178306)

https://mirror.codeforces.com/contest/1794/problem/E (shash4321's solution: 196311155)

https://mirror.codeforces.com/gym/104665/problem/H (227755196)

https://open.kattis.com/problems/nuremberg (https://gist.github.com/kelly-dance/9a7b320213d2523e8585c1a4bf205f90)

Note that some of my linked submissions utilize older versions of our template.

One of the main downsides of trying to use a catch all template for these problems is memory usage. We haven't figured out how to pass this problem with our template: https://www.acmicpc.net/problem/14875

Modified the template to use less memory, the implementation can be seen here: 240644685

6 months ago, # |

Auto comment: topic has been updated by pajenegod (previous revision, new revision, compare).

nnqadhl

2 months ago, # |

can anyone help me with problem 1881F- Minimum maximum distance with rerooting ?. I think that when reroot vertex u with its parent, we should consider the maximum dp of others children of parent, but this seems give me a TLE.

https://mirror.codeforces.com/contest/1881/submission/261603990

here is my sol