What's the point of maintaining stack in virtual tree?

#	User	Rating
1	Benq	3792
2	VivaciousAubergine	3647
3	Kevin114514	3603
4	jiangly	3583
5	strapple	3515
6	tourist	3470
7	dXqwq	3436
8	Radewoosh	3415
9	Otomachi_Una	3413
10	Um_nik	3376

#	User	Contrib.
1	Qingyu	158
2	adamant	152
3	Um_nik	146
4	Dominater069	144
5	errorgorn	141
6	cry	139
7	Proof_by_QED	136
8	YuukiS	135
9	chromate00	134
9	TheScrasse	134

In every virtual tree implementation I can find (https://en.oi-wiki.org/graph/virtual-tree/, https://mirror.codeforces.com/blog/entry/140066 , for instance), there seems to be an unnecessarily complex method to construct edges involving some kind of stack. The stack method used here is thankfully much simpler than both of these, but can't we simplify it even further to not use a stack at all?

That is, instead of:

for (int i = 1; i < (int)res.size(); i++) {
	while (tin[res[i]] > tout[stk.back()]) { stk.pop_back(); }

	vadj[stk.back()].push_back(res[i]);
	stk.push_back(res[i]);
}

(From the USACO Guide implementation)

We could just write:

for (int i = 1; i < (int)res.size(); i++) {
	vadj[lca(res[i - 1], res[i])].push_back(res[i]);
}

Is there any case in which this approach is wrong? I made this modification and still got accepted on the relevant AtCoder problem.

Edit: the usaco.guide module has since been updated, but I'm still not sure what advantage the second implementation on OI Wiki here has over the first one (which matches more closely with what I've described here), since both implementations seems to have O(n log n) runtime.

vector<int> res = key; sort(res.begin(), res.end(), sort_tin); for (int i = 1; i < (int)key.size(); i++) { res.push_back(lca(key[i - 1], key[i])); } sort(res.begin(), res.end(), sort_tin); res.erase(unique(res.begin(), res.end()), res.end());

int dfn[MAXN]; int h[MAXN], m, a[MAXN], len; // store key nodes bool cmp(int x, int y) { return dfn[x] < dfn[y]; // sort by the dfs order } void build_virtual_tree() { sort(h + 1, h + m + 1, cmp); // sort key nodes by the dfs order for (int i = 1; i < m; ++i) { a[++len] = h[i]; a[++len] = lca(h[i], h[i + 1]); // insert lca } a[++len] = h[m]; sort(a + 1, a + len + 1, cmp); // sort every nodes on the virtual tree by the dfs order len = unique(a + 1, a + len + 1) - a - 1; // remove duplicated for (int i = 1, lc; i < len; ++i) { lc = lca(a[i], a[i + 1]); conn(lc, a[i + 1]); // connect the edge } }

Comments (23)

Write comment?

shsh

12 months ago, hide # |

Auto comment: topic has been updated by shsh (previous revision, new revision, compare).

→ Reply

feining_for_gm

shsh orz

tinca_matei

If you have the following complete binary tree with 7 nodes, indexed from top to bottom, left to right with 1..7, then if you want to build the virtual tree of the leaves, i.e. 4..7, you should get the entire tree, but with the bottom implementation, when you do lca(5, 6) = 1, you will add an edge from 1 to 6 which does not make it a tree anymore I think.

12 months ago, hide # ^ |

← Rev. 2 →

However res is computed like so:

So in the case you mentioned, res = {1, 2, 4, 5, 3, 6, 7} and my implementation still works right?

oToToT

Consider a tree

1 2
2 3
2 4

And you try to build virtual tree on [1, 3, 4]. I think your algorithm will generate edges

1 3
2 4

However, we will expect

Check my comment above; in this case res already contains all 4 nodes.

← Rev. 3 →

+10

I think this means that your code differs a lot from normal implementation. Can you show the full implementation? It helps the discussion.

In case you don't want to provide the full implementation. I think the one described in the Chinese OI-Wiki may be related.

Sorry, I should’ve specified better, but the code that I’m basing my implementation off of is the last USACO Guide link.

← Rev. 4 →

Ah I see. Sorry I miss that link.

I think then it's unnecessary to maintain a stack there. You can check the first implementation in the Chinese OI-Wiki (not the en one) as I shared above.

The code looks like

Oh, I didn't even notice there were two implementations on OI Wiki! Is there any advantage of the second implementation over the first one then? It seems strictly more complicated with no noticeable advantage.

For me, it's just that the one maintaining the stack is easier to understand and prove the correctness.

mychecksdead

If you are lazy to write O(1) LCA, maybe adding the extra constant factor factor to building virtual tree may not be good, idk. (Personally, I don't use O(1) lca often, it's not that much needed in every case.) Also, if the problem forces you to use O(N) memory, you need to use segment tree min queries for LCA, which will also give you log, thus by using the stack implementation you may save some constant factor. But obviosuly this is not very relevant...

mzen

$$$O(N)$$$ prepare $$$O(N)$$$ memory $$$O(1)$$$ get RMQ is possible, but it's somewhat complex: https://mirror.codeforces.com/blog/entry/78931

AksLolCoding

-8

Sparse table LCA has worked for me in almost every problem I've seen, so I've never used $$$O(1)$$$/$$$O(N)$$$ LCA. It also has a great constant factor

-firefly-

This approach is correct, and OI-Wiki also introduces it as a naive approach before introducing the monostack approach.

The key is that you have to introduce either a much more complex algorithm like Four Russian to achieve a $$$O(n)\sim O(1)$$$ LCA, or an extra $$$\log$$$. By using a monostack you can build the virtual tree in $$$\Theta(n)$$$ without much coding.

But the common implementation still call LCA every time a new node needs to be pushed into the stack.

Yes, this is my concern as well, as you can see lca is being called on line 10 of the second implementation. Wouldn't this lead to the same complexity? -firefly-

Furyna

-19

As the author of the USACO.guide article and a big fan of shsh, I'm honored that he optimized my code, and everyone can blame my stupidity for the inefficiency and confusion caused here.

No trust, Furyna orz

shsh's blog