#	User	Rating
1	tourist	3880
2	jiangly	3669
3	ecnerwala	3654
4	Benq	3627
5	orzdevinwang	3612
6	Geothermal	3569
6	cnnfls_csy	3569
8	jqdai0815	3532
9	Radewoosh	3522
10	gyh20	3447

#	User	Contrib.
1	awoo	161
2	maomao90	160
3	adamant	156
4	maroonrk	153
5	atcoder_official	148
5	-is-this-fft-	148
5	SecondThread	148
8	Petr	147
9	nor	144
10	TheScrasse	142

It is not uncommon to have interactive tree problems where you are allowed to query some connected component of the tree and use the return value to determine whether the answer is in the connected component or outside the connected component (Link to example problem). The general approach for these kinds of problems is to always choose a connected component of size $$$\frac{n}{2}$$$. However, there are also problems where the allowed queries are more restricted, preventing $$$\frac{n}{2}$$$ halving from being possible. This blog covers one of those types of problems.

Definitions

Subtree: $$$S(r, u)$$$ contains the set of vertices in the subtree of vertex $$$u$$$ if the tree was rooted at vertex $$$r$$$.
Neighbour: $$$N(u)$$$ contains the set of vertices that are directly adjacent to vertex $$$u$$$.
Extended subtree: $$$ES(r, V) = \bigcup_{v\in V} S(r, v)\text{ if } v\in N(r)$$$. In other words, an extended subtree is a combination of the subtrees of a chosen set of vertices that are directly adjacent to the root.

Problem Structure

There is a hidden special vertex in a tree with $$$n$$$ vertices. Find the special vertex using at most $$$\lceil\log_{1.5}n\rceil$$$ of the following query:

Choose an extended subtree of the tree. The grader will return whether the special vertex is in the chosen extended subtree. More formally, choose any vertex $$$r$$$ and a subset of neighbours $$$V \subseteq N(r)$$$, then the grader will return whether the special vertex $$$x \in ES(r, V)$$$.

Solution

Let us denote the size of the chosen extended subtree as $$$x$$$. If the grader returns that the special vertex is in the extended subtree, we can combine the root with the extended subtree to form a new tree with $$$x + 1$$$ vertices. Otherwise, the vertices outside of the extended subtree form a tree with $$$n - x$$$ vertices. This means that with each query, we can reduce the size of the tree to at least $$$\max(x + 1, n - x)$$$.

Since we are allowed to use $$$\lceil\log_{1.5}n\rceil$$$ queries, we will be able to solve this problem if the size of the tree reduces to at least $$$\frac{2n}{3}$$$ after each query. Solving the inequality $$$\max(x + 1, n - x) \le \frac{2n}{3}$$$, we get $$$\frac{n}{3} \le x \le \frac{2n}{3} - 1$$$. Now, let us see whether it is always possible to find an extended subtree of size between $$$\frac{n}{3}$$$ and $$$\frac{2n}{3} - 1$$$.

Let us root the tree at its centroid. Recall that all the children of the centroid have subtree sizes less than or equal to $$$\frac{n}{2}$$$. If there exists a child with subtree size more than or equal to $$$\frac{n}{3}$$$, then that lone subtree can be the chosen extended subtree since $$$\frac{n}{3} \le x \le \frac{n}{2} \le \frac{2n}{3} - 1$$$ $$$^\dagger$$$. From this point onwards, we will assume that all children of the centroid have subtree sizes less than $$$\frac{n}{3}$$$.

We will add the children of the centroid one by one into the extended subtree until the first time that the size of the extended subtree becomes more than or equal to $$$\frac{n}{3}$$$. Note that this is always possible as taking all children will result in a size of $$$n - 1$$$ which is more than or equal to $$$\frac{n}{3}$$$. Let $$$x$$$ be the size of the resultant extended subtree and let $$$p$$$ be the size of the last child that was added into the extended subtree. We can obtain the following inequalities: $$$x \ge \frac{n}{3}$$$, $$$x - p < \frac{n}{3}$$$, and $$$p < \frac{n}{3}$$$. Combining the second and third inequality, we obtain $$$x < \frac{2n}{3}$$$. Hence, $$$\frac{n}{3} \le x \le \frac{2n}{3} - 1$$$ and the desired extended subtree is found.

$$$^\dagger$$$ Note that $$$\frac{n}{2} \le \frac{2n}{3} - 1$$$ might not be true when $$$n \le 5$$$, however, this is not very important as $$$n - 1 \le \lceil \log_{1.5} n \rceil$$$ when $$$n \le 5$$$, so we can just query every subtree except for vertex 1 to get the answer.

Optimality

It might feel like $$$\lceil\log_{1.5}n\rceil$$$ is a loose bound and we can do better. However, we can prove that it is not possible to do any better. Special thanks to pavement and jamessngg for helping me to come up with this proof. Consider the tree below.

Jellyfish tree

There are $$$n = 3k + 1$$$ vertices in the tree. The closest we can get to an extended subtree of size $$$\frac{n}{2}$$$ is either a single chain of size $$$k$$$ or two chains having size $$$2k$$$. Both of those options will result in a tree of size $$$2k + 1 \approx \frac{2n}{3}$$$ in the worst case after a query. This shows that for all $$$n = 3k + 1$$$, there exists a tree where it is not possible to reduce the size of the tree by less than $$$\frac{2n}{3}$$$ in a single query. We can use similar trees but with different chain lengths to come up with a similar proof for $$$n = 3k + 2$$$ and $$$n = 3k + 3$$$ to obtain the proof for all $$$n$$$.

An issue with this proof is that it does not account for amortisation. For the tree that was shown above, even though it is not possible to reduce the size of the tree by less than $$$\frac{2n}{3}$$$ in the first query, the next query will be done on a single chain of length $$$2k + 1$$$, which makes it easy for the remaining queries to reduce the size of the chain by half each time. Hence, this proof is not quite complete yet and I would appreciate it if anyone could tell me if they come up with a new idea.

Example

Link to problem

Abridged statement

You are given a tree consisting of $$$n$$$ nodes and there are two distinct nodes $$$a$$$ and $$$b$$$ that you have to determine. You can ask up to 40 queries of the form: $$$k\ \ x_1\ \ x_2\ \ldots\ x_k$$$ and the grader will return the distance between the lowest common ancestor of $$$x_1, x_2, \ldots, x_k$$$ if the tree was rooted at $$$a$$$ and the lowest common ancestor of $$$x_1, x_2, \ldots, x_k$$$ if the tree was rooted at $$$b$$$.

Solution

If we query all the vertices, we will be able to determine the distance between $$$a$$$ and $$$b$$$. Let this distance be $$$d$$$. Suppose we query an extended subtree together with its root. If the return value is $$$0$$$, it means that both $$$a$$$ and $$$b$$$ are outside of the extended subtree. If the return value is $$$d$$$, it means that both $$$a$$$ and $$$b$$$ are in the extended subtree or the root. Lastly, if the return value is between $$$1$$$ and $$$d - 1$$$, we know the distance that $$$a$$$ and $$$b$$$ are away from the root, so we can do two separate binary searches to find each of them.

Using the above-mentioned algorithm to choose the optimal extended subtree, we can solve this problem using $$$1 + \log_{1.5} n + 2\log_2 n \le 40$$$ queries. Note that there is a solution that uses only $$$2\log_2 n$$$ queries in the editorial of the problem, however, I feel like my solution is easier to come up with as it was the solution that I came up with during the contest.

Code

#include <bits/stdc++.h>
using namespace std;

const int MAXN = 1005;

int n;
vector<int> adj[MAXN];
bool bad[MAXN];
vector<int> ans;

int ask(vector<int> qv) {
    cout << "? " << qv.size();
    for (int i : qv) {
        cout << ' ' << i;
    }
    cout << endl;
    int d; cin >> d;
    return d;
}
void answer(int x, int y) {
    cout << "! " << x << ' ' << y << endl;
    exit(0);
}

int sub[MAXN];
void dfs(int u, int p) {
    sub[u] = 1;
    for (int v : adj[u]) {
        if (v == p || bad[v]) {
            continue;
        }
        dfs(v, u);
        sub[u] += sub[v];
    }
}
int getc(int u, int p, int s) {
    for (int v : adj[u]) {
        if (v == p || bad[v]) {
            continue;
        }
        if (sub[v] > s / 2) {
            return getc(v, u, s);
        }
    }
    return u;
}
vector<int> stk;
void pushsub(int u, int p) {
    stk.push_back(u);
    for (int v : adj[u]) {
        if (v == p || bad[v]) {
            continue;
        }
        pushsub(v, u);
    }
}
void getl(int u, int p, int tl) {
    if (tl == 0) {
        stk.push_back(u);
    }
    for (int v : adj[u]) {
        if (v == p || bad[v]) {
            continue;
        }
        getl(v, u, tl - 1);
    }
}
int search(int r, vector<int> stk, int d) {
    int lo = 0, hi = stk.size() - 1;
    while (lo < hi) {
        int mid = lo + hi >> 1;
        vector<int> qv;
        qv.push_back(r);
        for (int i = lo; i <= mid; i++) {
            qv.push_back(stk[i]);
        }
        if (ask(qv) == d) {
            hi = mid;
        } else {
            lo = mid + 1;
        }
    }
    return stk[hi];
}

int main() {
    cin >> n;
    for (int i = 1; i < n; i++) {
        int u, v; cin >> u >> v;
        adj[u].push_back(v);
        adj[v].push_back(u);
    }
    vector<int> qv(n);
    iota(qv.begin(), qv.end(), 1);
    int d = ask(qv);
    int r = 1;
    while (1) {
        dfs(r, -1);
        if (sub[r] == 2) {
            assert(d == 1);
            for (int v : adj[r]) {
                if (!bad[v]) {
                    answer(r, v);
                    return 0;
                }
            }
            assert(0);
        }
        assert(sub[r] > 2);
        r = getc(r, -1, sub[r]);
        dfs(r, -1);
        int sm = 1;
        pair<int, int> rg = {-1, -1};
        for (int i = 0; i < adj[r].size(); i++) {
            int v = adj[r][i];
            if (bad[v]) {
                continue;
            }
            if (sub[v] >= sub[r] / 3) {
                rg = {i, i};
                break;
            }
            sm += sub[v];
            if (sm >= sub[r] / 3) {
                rg = {0, i};
                break;
            }
        }
        auto [s, e] = rg;
        stk.clear();
        stk.push_back(r);
        for (int i = s; i <= e; i++) {
            int v = adj[r][i];
            if (bad[v]) {
                continue;
            }
            pushsub(v, r);
        }
        int td = ask(stk);
        if (td == d) {
            for (int i = 0; i < adj[r].size(); i++) {
                if (s <= i && i <= e) {
                    continue;
                }
                int v = adj[r][i];
                bad[v] = 1;
            }
        } else if (td == 0) {
            for (int i = s; i <= e; i++) {
                int v = adj[r][i];
                bad[v] = 1;
            }
        } else {
            stk.clear();
            for (int i = s; i <= e; i++) {
                int v = adj[r][i];
                if (bad[v]) {
                    continue;
                }
                getl(v, r, td - 1);
            }
            int x = search(r, stk, td);
            stk.clear();
            for (int i = 0; i < adj[r].size(); i++) {
                if (s <= i && i <= e) {
                    continue;
                }
                int v = adj[r][i];
                if (bad[v]) {
                    continue;
                }
                getl(v, r, d - td - 1);
            }
            int y = search(r, stk, d - td);
            answer(x, y);
            return 0;
        }
    }
    return 0;
}

Conclusion

I hope this will be interesting and useful to some people 🥺. Please let me know in the comments if you can come up with a better proof that handles the amortisation or maybe even come up with a more optimal algorithm that can perform better than $$$\log_{1.5} n$$$ queries by making use of the amortisation. If you are aware of other problems that can be solved using this technique, feel free to let me know in the comments as well. Thank you for your support 😁.

Rev.	By	When	Δ	Comment
en9	maomao90	2023-09-17 07:22:15	76	Tiny change: ' but with uneven chains to come ' -> ' but with different chain lengths to come ' (published)
en8	maomao90	2023-09-17 07:10:28	6461	Tiny change: 'e form: $k x_1 x_2 \ldots x_' -> 'e form: $k\, x_1\, x_2\, \ldots x_'
en7	maomao90	2023-09-16 19:01:12	60
en6	maomao90	2023-09-16 18:59:15	2623
en5	maomao90	2023-09-16 09:35:33	474
en4	maomao90	2023-09-16 09:25:12	4	Tiny change: ' the tree by at least ' -> ' the tree to at least '
en3	maomao90	2023-09-16 09:24:18	293
en2	maomao90	2023-09-16 09:13:50	527
en1	maomao90	2023-09-16 07:28:39	1674	Initial revision (saved to drafts)

Definitions

Problem Structure

Solution

Optimality

Example

Abridged statement

Solution

Conclusion

History