Editorial of Yandex.Algorithm 2017 Round 1

#	User	Rating
1	ecnerwala	3649
2	Benq	3581
3	orzdevinwang	3570
4	Geothermal	3569
4	cnnfls_csy	3569
6	tourist	3565
7	maroonrk	3531
8	Radewoosh	3521
9	Um_nik	3482
10	jiangly	3468

#	User	Contrib.
1	maomao90	174
2	awoo	164
3	adamant	162
4	TheScrasse	159
5	nor	158
6	maroonrk	156
7	-is-this-fft-	151
8	SecondThread	147
9	orz	146
10	pajenegod	145

Problem A. Long-Term Mail Storage.

The probelm asked to simply simulate the process. One can keep yet unread mails in any data structure (array, deque, set, whatever) and iterate through time. For any moment of time x we first check whether there is a new incoming letter and add it to the structure if that is the case. Now, check if the ``feeling guilty'' condition is satisfied. If so, compute k and remove k oldest letter from the structure. Doing all this in the most straightforward way would result in O(nT) complexity.

Excercise: solve the problem in O(n) time.

Problem B. Lassies Versus Machine.

First of all, note that if Dusya and Lusia split n in x and n - x they would get exactly the same set of banknotes as a change as if they split in x + 5000 and n - x - 5000. That means, we only have to check all possibilities to split from a to min(a + 5000, n - a). For each possibility we compute the change in O(1) time. If some girl wants to pay y the change will be (5000k - y) mod 500, where $\text{[math]}$ .

Excercise: prove that it's enough to check only values from a to min(a + 499, n - a).

Problem C. Effecient Management Returns.

There are many different approaches for this problem and almost all solutions one can imagine will work as the size of the answer will never exceed $\text{[math]}$ (excersise: prove). This editorial contains only one possible linear time solution.

Proceed vertices one by one. After we have processed first i - 1 vertices (v₁, v₂, ..., v_i - 1) we would like to keep a way to dirstibute them among k_i teams T₁, T₂, ..., T_{k_i} that will satisfy all the requirements. When we add a new node v_i we should find any component T_j, such that there is no node $\text{[math]}$ and $\text{[math]}$ . Assuming $\text{[math]}$ this can be done in $\text{[math]}$ time by simply making an array of boolean marks and traversing the list of all neighbours. If there is no such T_j, we consider k_i + 1 = k_i + 1, i.e. we create a new set for this vertex. However, we actually do not need this $\text{[math]}$ time to set up the boolean array, as we can use only one array and mark it with a number of iteration instead of a simple \emph{true}. Thus, we set marks in O(deg(v_i)) time and then simply consider all sets one by one till we find first valid. Obviously, we will skip no more then deg(v_i) sets till we find first possible match, thus the running time will be O(deg(v_i)) and the total running time is O(n + m).

Problem D. The Sting.

First of all we would like to slightly change how we treat a bet. Define c_i = a_i + b_i. Now, if we accept the i-th bet we immediately take b_i and then pay back c_i in case this bet plays. Define as A some subset of bets, $\text{[math]}$ , i.e. the total profit we get from subset A. Define as L(A) the total amount we will have to pay in case the game result will be "team looses", i.e. $\text{[math]}$ . Similarly we introduce D(A) and W(A). Now, the profit of Ostap if he accepts subset A is S(A) - max(L(A), D(A), W(A)).

In this form it's not clear how to solve the problem as we simultaneously want to maximize S(A) from the one hand, but minimize maximum from the other hand. If we fix the value max(L(A), D(A), W(A)) = k the problem will be, what is the maximum possible sum of b_i if we pick some subset of "loose" ("draw", "win") bets with the sum of c_i not exceeding k. Such values $\text{[math]}$ can be computed for each outcome independetly using knapsack dynamic programming. The complexity of such solution is O(nL), where $\text{[math]}$ .

Problem E. Random Value of Mode.

To start with consider O(n²) dynamic programming solution. Let dpleft(i, j) be the optimum expected value if Gleb has visited all shops on segment from i to j inclusive and is now standing near the shop i. In the same way dpright(i, j) as the optimum expected value if segment from i to j was visited and Gleb stands near shop j.

We are not going to consider all the formulas there, but here is how we compute dpleft(i, j), picking the minimum of two possibilities go left or go right:

dpleft(i, j) = min(1 + t_i - 1 + p_i - 1·|i - 1 - x| + dpleft(i - 1, j)·(1 - p_i - 1)

(j + 1 - i) + t_j + 1 + p_j + 1·|j + 1 - x| + dpright(i, j + 1)·(1 - p{j + 1}))

To move forward we should have a guess that if there are no p_i = 0 we are going to visit many shops with a really small probability. Indeed, the smallest possible positive probability is one percent, that is 0.01 which is pretty large. The probability to visit k shops with p_i > 0 and not find a proper coat is 0.99^k, that for k = 5000 is about 1.5·10^- 22. Assuming t_i ≤ 1000 and n ≤ 300 000 the time required to visit the whole mall is not greater than 10⁹, thus for k = 5000 it will affect the answer by less than 10^- 12. Actually, assuming we only need relative error k = 3000 will be sufficient.

Now, we find no more than k shops with p_i > 0 and i < x and no more than k shops with p_i > 0 and i > x. Compress shops with p_i = 0 between them and compute quaratic dynamic programming. The overall running time will be O(n + k²).

Problem F. Measure Twice, Divide Once.

We need to assign each vertex a single positive integer x_v~--- the number of the process iteration when this vertex will be deleted. For the reason that will be clear soon we will consider x_v = 0 to stand for the last iteration, i.e. the greater value of x correspond to the earlier iterations of the process. One can prove that an assignment of positive integers x_v is correct if and only if for any two vertices u and v such that x_u = x_v the maximum value of x_w for all w on the path in the tree from u to v is greater than x_u. That is necessary and sufficient condition for any two vertices removed during one iteration to be in different components.

Pick any node as a root of the tree. Denote as C(v) the set of direct children of v and as S(v)~--- the subtree of node v. Now, after we set values of x in a subtree v we only care about different values of x_u, $\text{[math]}$ that are not "closed", i.e. there is no value greater between the corresponding node and the root of a subtree (node v). Denote as d(v, mask) boolean value whether it's possible to set values of x in a subtree of node v to have values mask unclosed. Because of centroid decomposition we know there is no need to use values of x greater than $\text{[math]}$ , thus there are no more than $\text{[math]}$ different values of mask, i.e. O(n). d(v, mask) can be recomputed if we know d(u, mask) for all $\text{[math]}$ in O(n³) time. Indeed, if one child u_i uses mask m_i we know:-

We have to set x_v greater than any i that occurs in more than one mask.
We can set x_v to any i that doesn't occur in any m_i.
If we set x_v = i, all j < i are set to 0 in m.

Now, one can notices that according to the following process if mask m₁ is a submask of m₂ it is always better lexicographically smaller than mask m₂ it always affects the result in a better way. Now, we claim that if for some subtree v we consider the minimum possible m, such that d(v, m) is true, for each m₁ > m there exists m₂ submask of m₁ such that d(v, m₂) is true. Indeed, consider the first (highest) bit i where m and m₁ differ. Because m₁ is greater than m it will have 1 at this position, while m will have 0. If there is no 1 in m anymore it is itself a submask of m₁, otherwise, this means x_v < i. We can set x_v = i and obtain a submask of m₁.

The above means we should only care about obtaining a lexicographically smallest mask for each subtree S(v). To do this we use the above rules to merge the results in all $\text{[math]}$ . This can be easily done in $\text{[math]}$ or in O(n) if one uses a lot of bitwise operations.

	Rev.	Lang.	By	When	Δ	Comment
	ru1		GlebsHP	2017-05-21 03:07:09	9552	Первая редакция перевода на Русский
	en1		GlebsHP	2017-05-14 22:46:33	8279	Initial revision (published)

Problem A. Long-Term Mail Storage.

Problem B. Lassies Versus Machine.

Problem C. Effecient Management Returns.

Problem D. The Sting.

Problem E. Random Value of Mode.

Problem F. Measure Twice, Divide Once.

History