SnoopyCodes's blog

By SnoopyCodes, history, 6 months ago, In English

So, a few months ago, I was working on Euler Tour from the usaco guide website. There, they explained how you could compress a tree into an array of start[] and end[] arrays to build a segment tree on top of it. However, with LCA (least common ancestor, they built a RMQ (range minimum query) segtree off of the full inorder traversal of the tree, which confused me greatly. After all, in the examples of summing subtrees and updating vertexes, we hadn't needed to do that. They basically used a different way to euler tour, which confused me. Additionally, I was still greatly uncomfortable with changing the size of the segtree, and a segtree with 2 * (2n-1) memory (I use the iterative segtree) pissed me off. I gave up and went back to dynamic programming.

Yesterday, however, I returned. The most annoying thing, in my opinion, was how the start[] and end[] arrays were combined so the segtree could be built, and I had to remember two ways to do the euler tour. Indeed, in my code to declare the segtree I wrote the following:

peak commenting
standard LCA with segtree explanation

So, I present a way to build the segtree off of only the array ordering vertices in which we list them, similar to the other one. To understand this, make sure you know RMQ segtree (I don't know fenwick tree/BIT but it likely also works here) and how to euler tour for, say, Subtree Queries on CSES, or have read the corresponding section (18.2) in CPH or whatever.

written for CSES Company Queries II

To see why this works, let's take a look at an example.

The corresponding array we build off of (and the depths):

yes

Essentially what we are building off of is the first array, where the nodes are ordered by when we first meet them in the dfs traversal. Then: To find the LCA of two nodes, if any of the two is not within the others subtree, then the LCA is the parent node of the RMQ (based on depth, of course) in the range between the two. Of course, if one is contained within another's subtree, the LCA is that node of least depth.

Rudimentary attempt at a proof: The parent node will not be present in this range, because it must have already been visited before. Then we must prove that the parent of the node with least depth in this range is the LCA. This is done by noticing that we are required to traverse to a yet unvisited node (after visiting the first vertex) that is one level under the LCA in order to get to the second node.

The code can be further simplified by removing the stop[] array and modifying the query call to query(start[a] + 1, start[b] + 1), and then checking for the edge case where a == b. However, for the sake of keeping consistency with euler tour, I have kept the stop[] array.

So, is this any better than before? We built on a segtree of half the size, after all.

Well, not really. Because segtree operations take O(log N) after processing, actually, we are reducing the time per query by from like 20 operations to 19. And likely because we are adding a vector to track the parent of each vertex, it isn't any faster. Memory isn't any better, either.

So what's the use?

Now, at least, there aren't two ideas for euler tour running inside my head. I just understand that I can use start[] and stop[] arrays for this. It's mostly for simplicity, really. And because I'm doing usaco guide, I probably won't learn binary lifting until later, as it's in platinum.

This was just a fun thing to think about and do. Open to any code simplifications and blog criticisms! It's my first time writing one. Thanks to iframe_ for helping optimize and proofread, and thank you for reading!

  • Vote: I like it
  • +56
  • Vote: I do not like it

»
6 months ago, # |
  Vote: I like it +3 Vote: I do not like it

Auto comment: topic has been updated by SnoopyCodes (previous revision, new revision, compare).

»
6 months ago, # |
  Vote: I like it +3 Vote: I do not like it

Auto comment: topic has been updated by SnoopyCodes (previous revision, new revision, compare).

»
6 months ago, # |
  Vote: I like it +1 Vote: I do not like it

happy snoopy

»
6 months ago, # |
  Vote: I like it 0 Vote: I do not like it

this is my preferred way of doing lca because you can also find the next node on path.

but i like to write it like:

if (a==b) // lca is a
if (start[b] < start[a]) { swap(a, b); }
//lca is: par[query(start[a] + 1, start[b] + 1)]
  • »
    »
    6 months ago, # ^ |
      Vote: I like it 0 Vote: I do not like it

    Yes, this is the simplification where we can remove the stop[] array. I hadn't considered the problem of finding next node on path, but it should be easily doable with this too.

»
6 months ago, # |
  Vote: I like it +1 Vote: I do not like it

snoopy orz