An efficient, shorter and easier way to find LCA(Lowest common ancestor) in offline without Tarjan's

№	Пользователь	Рейтинг
1	Benq	3792
2	VivaciousAubergine	3647
3	Kevin114514	3603
4	jiangly	3583
5	strapple	3515
6	tourist	3470
7	dXqwq	3436
8	Radewoosh	3415
9	Otomachi_Una	3413
10	Um_nik	3376

№	Пользователь	Вклад
1	Qingyu	158
2	adamant	152
3	Proof_by_QED	146
3	Um_nik	146
5	Dominater069	144
6	errorgorn	141
7	cry	139
8	YuukiS	135
9	chromate00	134
9	TheScrasse	134

Hi, everyone!

Today I want to present my novel idea of finding $$$LCA$$$ of two nodes in at most $$$O(N + \sum_{j=1}^{Q} \log_2(\max(\text{depth}_{u_j}, \text{depth}_{v_j}) + 1))$$$ in offline. As we can see, algorithm depends heavily on the depth of queries, so in random trees it might work faster and memory is also pure $$$O(N + Q)$$$.

Prerequisites

Euler tour technique
Basic binary search
Understanding tree

From the time complexity and prerequisites you might have guessed that we use ancestors of nodes, it is correct! Using properties of DFS we can keep ancestors sorted by depth in $$$O(N)$$$, example pseudocode:

dfs(v):
   ancestors.add(v)
   for u in a[v]: dfs(u)
   ancestors.pop()

This is true because the moment we finished the subtree of $$$v$$$ node, we no longer can have it as a ancestor for other nodes. Since it is always sorted by depth, which is what we need for our binary search. Our task transforms into finding right-most node in our ancestors, which is ancestor of both $$$u$$$ and $$$v$$$. However we don't need out-order to check, because if current query's node partner was already visited in our DFS, then we can find LCA using only in-order.

Code

#include <bits/stdc++.h>

using namespace std;

int32_t main() {
    ios::sync_with_stdio(false);
    cin.tie(nullptr);
    cout.tie(nullptr);
    int n, q, tim = 0;
    cin >> n >> q;
    vector<vector<int>> a(n);
    for (int i = 1; i < n; i++) {
        int p;
        cin >> p, --p, a[p].emplace_back(i);
    }
    vector<vector<pair<int, int>>> Q(n);
    for (int i = 0; i < q; i++) {
        int x, y;
        cin >> x >> y, --x, --y;
        Q[x].emplace_back(y, i);
        Q[y].emplace_back(x, i);
    }
    vector<int> st, tin(n), ans(q);
    auto lca = [&] (int x, int y) {
        int l = 0, r = st.size() - 1, top = -1;
        while (l <= r) {
            int mid = (l + r) >> 1;
            if (tin[st[mid]] <= tin[x] && tin[st[mid]] <= tin[y]) l = mid + 1, top = st[mid];
            else r = mid - 1;
        }
        return top;
    };
    auto dfs = [&] (auto &dfs, int v) -> void {
        tin[v] = tim++;
        st.emplace_back(v);
        for (auto [u, i] : Q[v]) if (tin[u]) ans[i] = lca(u, v);
        for (int u : a[v]) dfs(dfs, u);
        st.pop_back();
    };
    dfs(dfs, 0);
    for (int i : ans) cout << i + 1 << '\n';
    return 0;                    
}

Verified code on a problem: https://cses.fi/problemset/task/1688/

Thanks for reading, I hope you have a nice day.

Комментарии (6)

Написать комментарий?

Nasa

4 часа назад, скрыть # |

← Rev. 3 →

I found about the thing called LPD(longest-path-decomposition), and I am wondering whether this with parallel binary search for each node's queries on the paths of LPD can optimize the time complexity(we are doing a lot of useless binary searches), essentially we only need stack of ancestors for the leaf in the LPD parts. I think it should become something like:

$$$O\left( N + \sum_{p \in \text{paths}} Q_p \cdot \log_2(|p|) \right)$$$

Where $$$Q_p$$$ is number of queries in that LPD path, of course everything here is theoritical, I wonder whether someone can calculate time complexity in a random tree, or whether it is correct.

→ Ответить

BLOBVISGOD

← Rev. 2 →

The average distance between two random nodes in a random tree is asymptotically approximately $$$\sqrt{\frac{n\pi}{2}}-1$$$. Therefore, in random trees there is not much improvement, as $$$\log\left(\sqrt{\frac{n\pi}{2}}-1\right) \approx \frac{1}{2}\log(n)$$$.

3 часа назад, скрыть # ^ |

For the idea in the blog, yes it is true. It worked 2x faster than normal binary jumping approach on https://cses.fi/problemset/task/1688/ but I am more interested in this optimization: https://mirror.codeforces.com/blog/entry/153399?#comment-1362593 .

I am not sure I have the right idea, but it seems like you are doing HLD?

yes

Z_i_a_d_M_G_25

3 часа назад, скрыть # |

orz Nasa

Блог пользователя Nasa