Blog entries - Codeforces

mango_lassi's blog

Young Tableaus and the Hook Length Formula

By mango_lassi, 3 years ago, In English

Boring backstory of this blog

In 300IQ contest 3, there was a problem where you had to count the number of permutations with two disjoint longest increasing subsequences. We did this contest virtually while practising for ICPC and didn't solve the problem during the contest, so I was very interested to see how it could be solved. It turns out the solution uses something called Young diagrams, and unless you already know what they are, the editorial is impossible to understand.

I asked if someone knew about Young diagrams on the competitive programming discord, and got linked a paper from the Chinese IOI selection camp, written by yfzcsc. If you can read Chinese, you should just read the paper, as it covers a lot more than I do here. With the help of my teammate YaoBIG I believe I understand enough to write an English-language blog on this topic for the rest of us :)

Definitions

We call a sequence $$$\lambda$$$ of positive integers a partition of $$$n$$$, if $$$\lambda_i \geq \lambda_{i + 1}$$$ and $$$\sum_{i = 1}^{|\lambda|} \lambda_i = n$$$. We denote by $$$P_n$$$ the set of partitions of $$$n$$$.

For a partition $$$\lambda$$$, we define the Young diagram $$$Y(\lambda)$$$ as the set of pairs $$$(i, j)$$$ of positive integers such that $$$i \leq |\lambda|$$$ and $$$j \leq \lambda_i$$$. We call the pairs $$$s = (i, j) \in Y(\lambda)$$$ the squares of the diagram.

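A normalised Young tableau $$$P$$$ of size $$$n$$$ is a pair $$$(\lambda, p)$$$ of a partition $$$\lambda$$$ of $$$n$$$, and a permutation $$$p$$$ indexed by $$$Y(\lambda)$$$, such that $$$p_{i, j} < \min(p_{i + 1, j}, p_{i, j + 1})$$$, where cells outside $$$Y(\lambda)$$$ are ignored. In other words, the values increase along every row and down every column.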

We use natural definitions of a transpose: we denote by $$$\lambda^T$$$ the transpose partition, that is $$$\lambda^T_j = |\{i : \lambda_i \geq j\}|$$$, let $$$Y^T(\lambda) = Y(\lambda^T)$$$, and define the transpose $$$P^T = (\lambda^T, p^T)$$$ of a normalised Young tableau such that $$$p_{i, j} = p^T_{j, i}$$$.
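
As a small illustration of the definition of $$$\lambda^T$$$, here is a minimal sketch that computes the transpose partition in $$$\mathcal{O}(n)$$$ time. The function name `transposePartition` is made up for this example, and the code uses 0-indexed rows and columns.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>
using namespace std;

// Returns the transpose partition: result[j] = number of rows i with lambda[i] > j (0-indexed).
// Assumes lambda is a non-increasing sequence of positive integers.
vector<int> transposePartition(const vector<int>& lambda) {
	assert(is_sorted(lambda.rbegin(), lambda.rend())); // lambda must be non-increasing
	if (lambda.empty()) return {};
	vector<int> lambda_t(lambda[0], 0);
	for (int len : lambda) {
		// A row of length len contributes one square to each of the first len columns
		for (int j = 0; j < len; ++j) ++lambda_t[j];
	}
	return lambda_t;
}
```

For example, the partition $$$(3, 1)$$$ has the transpose $$$(2, 1, 1)$$$, and transposing twice gives back the original partition.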

What is a Young tableau and why are they interesting

Why is it interesting to define Young tableaus? It turns out that there is a one-to-one mapping $$$f$$$ from permutations $$$\pi$$$ of length $$$n$$$ to pairs $$$(P, Q) = ((\lambda, p), (\lambda, q))$$$ of standard Young tableaus of size $$$n$$$ that share the same partition. The mapping has the following properties:

1. $$$\sum_{i \leq m} \lambda_i$$$ is the maximum total size of $$$m$$$ disjoint increasing subsequences of the permutation
2. $$$\sum_{j \leq m} \lambda_j^T$$$ is the maximum total size of $$$m$$$ disjoint decreasing subsequences of the permutation
3. If $$$\pi^{-1}$$$ is the inverse permutation, then $$$f(\pi^{-1}) = (Q, P)$$$
4. If $$$rev(\pi)$$$ is the reversed permutation, then $$$f(rev(\pi)) = (P^T, ??)$$$ (actually, the second component is the normalised result of applying the Schützenberger algorithm to $$$Q$$$, but we don't care about that)
5. The mapping can be computed in $$$\mathcal{O}(n \sqrt{n} \log n)$$$ time. The inverse can be computed in $$$\mathcal{O}(n^2)$$$ time
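
For $$$m = 1$$$, the first two properties say that the first row of the shape has the length of the longest increasing subsequence and the first column has the length of the longest decreasing one. The sketch below checks this on a given permutation, assuming the plain row-insertion procedure described in the next section. The helper names `rskShape`, `longestMonotone` and `checkShapeProperties` are made up for this example.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>
using namespace std;

// Shape (row lengths) of the tableaus for pi, found by plain row insertion in O(n^2)
vector<int> rskShape(const vector<int>& pi) {
	vector<vector<int>> rows;
	for (int cur : pi) {
		bool placed = false;
		for (auto& row : rows) {
			auto it = upper_bound(row.begin(), row.end(), cur);
			if (it == row.end()) { row.push_back(cur); placed = true; break; }
			swap(*it, cur); // bump the first larger value to the next row
		}
		if (!placed) rows.push_back({cur});
	}
	vector<int> shape;
	for (auto& row : rows) shape.push_back((int)row.size());
	return shape;
}

// Length of the longest increasing (or decreasing) subsequence by O(n^2) DP
int longestMonotone(const vector<int>& pi, bool increasing) {
	int n = pi.size(), best = 0;
	vector<int> dp(n, 1);
	for (int i = 0; i < n; ++i) {
		for (int j = 0; j < i; ++j) {
			bool ok = increasing ? (pi[j] < pi[i]) : (pi[j] > pi[i]);
			if (ok) dp[i] = max(dp[i], dp[j] + 1);
		}
		best = max(best, dp[i]);
	}
	return best;
}

// Asserts properties 1 and 2 for m = 1 on a non-empty permutation
void checkShapeProperties(const vector<int>& pi) {
	vector<int> shape = rskShape(pi);
	assert(shape[0] == longestMonotone(pi, true));         // first row = LIS
	assert((int)shape.size() == longestMonotone(pi, false)); // first column = LDS
}
```

For instance, the permutation $$$(2, 0, 3, 1, 4)$$$ has shape $$$(3, 2)$$$: its longest increasing subsequence has length $$$3$$$ and its longest decreasing subsequence has length $$$2$$$.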

If $$$c(s)$$$ is the number of valid $$$p$$$ for the partition $$$s$$$, the existence of the mapping shows that $$$n! = \sum_{s \in P_n} c(s)^2$$$. By property 3, the number of permutations $$$\pi$$$ such that $$$\pi = \pi^{-1}$$$ is $$$\sum_{s \in P_n} c(s)$$$. The value we want to compute in the problem presented at the start of the blog is $$$\sum_{s \in P_n : s_1 = s_2} c(s)^2$$$.

The number of partitions of $$$75$$$ is small, so we can loop over all of them. It remains to compute $$$c(s)$$$ fast. For this, we can use the hook-length formula, which we can compute in $$$\mathcal{O}(n)$$$ time.
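
To make the two counting identities concrete, here is a rough sketch that loops over all partitions of a small $$$n$$$ and sums $$$c(s)^2$$$ and $$$c(s)$$$. It uses the hook-length formula (introduced later in this blog) in exact arithmetic rather than modulo a prime, so it only works while $$$n!$$$ fits in a `long long`. The names `countTableaus`, `forEachPartition` and `tableauSums` are made up for this example.

```cpp
#include <algorithm>
#include <cassert>
#include <utility>
#include <vector>
using namespace std;
using ll = long long;

// Number of tableaus of shape lambda via the hook-length formula, in exact arithmetic
ll countTableaus(const vector<int>& lambda) {
	int n = 0;
	for (int len : lambda) n += len;
	assert(n <= 20); // n! must fit in a long long
	ll fact = 1, hooks = 1;
	for (int i = 1; i <= n; ++i) fact *= i;
	for (int i = 0; i < (int)lambda.size(); ++i) {
		for (int j = 0; j < lambda[i]; ++j) {
			int below = 0; // squares strictly below (i, j) in column j
			for (int r = i + 1; r < (int)lambda.size() && lambda[r] > j; ++r) ++below;
			hooks *= (lambda[i] - j - 1) + below + 1;
		}
	}
	return fact / hooks;
}

// Calls visit(lambda) for every partition of n with all parts at most maxPart
template<class F>
void forEachPartition(int n, int maxPart, vector<int>& lambda, F&& visit) {
	if (n == 0) { visit(lambda); return; }
	for (int part = min(n, maxPart); part >= 1; --part) {
		lambda.push_back(part);
		forEachPartition(n - part, part, lambda, visit);
		lambda.pop_back();
	}
}

// Returns {sum of c(s)^2, sum of c(s)} over all partitions s of n
pair<ll, ll> tableauSums(int n) {
	ll squares = 0, plain = 0;
	vector<int> lambda;
	forEachPartition(n, n, lambda, [&](const vector<int>& s) {
		ll c = countTableaus(s);
		squares += c * c;
		plain += c;
	});
	return {squares, plain};
}
```

For $$$n = 4$$$ the values of $$$c(s)$$$ are $$$1, 3, 2, 3, 1$$$, so the sums are $$$24 = 4!$$$ and $$$10$$$, the number of permutations of length $$$4$$$ that are their own inverse.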

The mapping: Robinson–Schensted–Knuth correspondence

The name is fancy, but the mapping isn't hard (though proofs of its properties are).

Let's first describe how the first tableau $$$P$$$ is computed. We loop over the rows from first to last, and in each row, find the first element that is larger than the value we are inserting. If we found such a value, we swap it with the value we are inserting and proceed to the next row. Otherwise, we add the value we are inserting to the end of the row and are done.

In every step, the length of exactly one row of the tableau $$$P$$$ increases by $$$1$$$. In step $$$i$$$, we insert $$$i$$$ to tableau $$$Q$$$ at the end of the row whose size increased in the first tableau. This way, both tableaus always have the same shape. By a trivial induction proof, the values in the cells of every row and column in both tableaus are strictly increasing. Since every insertion step can also be undone (as described below), the mapping is an injection.

The naive way to perform the insertion takes $$$\mathcal{O}(n)$$$ time, thus we can compute the mapping in $$$\mathcal{O}(n^2)$$$ time.

To compute the inverse, we undo the insertions one by one, starting from the last. Let $$$i$$$ be the row s.t. $$$q_{i, s_i} = n$$$. This is the row whose size increased by $$$1$$$ in the insertion of the last value. Let $$$v \leftarrow p_{i, s_i}$$$, then delete cell $$$(i, s_i)$$$ from both tableaus.

Now, if $$$i = 1$$$, we have $$$\pi_n = v$$$ and can recurse to find the rest of $$$\pi$$$. Otherwise, the reason $$$v$$$ entered row $$$i$$$ was because some smaller value replaced it on row $$$i - 1$$$. This value is the largest value on row $$$i - 1$$$ that is smaller than $$$v$$$. Swap that value with $$$v$$$, subtract $$$1$$$ from $$$i$$$, and repeat.

O(n^2) code for the mapping and its inverse

#include <bits/stdc++.h>
using namespace std;
using Tableau = vector<vector<int>>;

// Calculates the Robinson–Schensted–Knuth mapping in O(n^2)
pair<Tableau, Tableau> rskMapping(const vector<int>& pi) {
	Tableau p, q;
	for (int ind = 0; ind < pi.size(); ++ind) {
		int cur = pi[ind];
		bool found = 0;
		for (int i = 0; i < p.size() && !found; ++i) {
			int j = upper_bound(p[i].begin(), p[i].end(), cur) - p[i].begin();
			if (j == p[i].size()) {
				p[i].push_back(cur);
				q[i].push_back(ind);
				found = 1;
			} else {
				swap(p[i][j], cur);
			}
		}
		if (! found) {
			p.emplace_back(vector<int>(1, cur));
			q.emplace_back(vector<int>(1, ind));
		}
	}
	return {p, q};
}

// Calculates the inverse of the Robinson–Schensted–Knuth mapping in O(n^2)
vector<int> rskInverse(int n, Tableau p, Tableau q) {
	vector<int> pi(n);
	for (int ind = n-1; ind >= 0; --ind) {
		int i = 0, j = 0;
		for (; i < q.size(); ++i) {
			for (j = 0; j < q[i].size() && q[i][j] != ind; ++j) {}
			if (j < q[i].size()) break;
		}

		int cur = p[i][j];
		p[i].pop_back();
		q[i].pop_back();

		for (--i; i >= 0; --i) {
			j = upper_bound(p[i].begin(), p[i].end(), cur) - p[i].begin();
			swap(p[i][j - 1], cur);
		}
		pi[ind] = cur;
	}
	return pi;
}

For the faster algorithm for computing the mapping, note that computing just the first $$$\lceil \sqrt{n} \rceil$$$ rows of $$$P$$$ can be done in $$$\mathcal{O}(n \sqrt{n} \log n)$$$ time. Using property $$$4$$$, we can compute the first $$$\lceil \sqrt{n} \rceil$$$ columns of $$$P$$$ in the same time. Since there cannot be more than $$$\lceil \sqrt{n} \rceil$$$ rows with at least $$$\lceil \sqrt{n} \rceil$$$ cells, this way we find the entire tableau in $$$\mathcal{O}(n \sqrt{n} \log n)$$$ time. To find $$$Q$$$, we use property $$$3$$$ and then repeat the above.

I am not aware of any way to calculate the inverse mapping in $$$\mathcal{O}(n \sqrt{n} \log n)$$$ time.

O(n sqrt(n) log(n)) code for the mapping


#include <bits/stdc++.h>
using namespace std;
using Tableau = vector<vector<int>>;

// Calculates the first k rows of P in the Robinson-Schensted-Knuth mapping in O(nk log n)
Tableau partialRSK(const vector<int>& pi, int k) {
	Tableau p(k);
	for (int ind = 0; ind < pi.size(); ++ind) {
		int cur = pi[ind];
		for (int i = 0; i < k; ++i) {
			int j = upper_bound(p[i].begin(), p[i].end(), cur) - p[i].begin();
			if (j < p[i].size()) swap(p[i][j], cur);
			else {
				p[i].push_back(cur);
				break;
			}
		}
	}
	while(!p.empty() && p.back().empty()) p.pop_back();
	return p;
}

// Calculates the Robinson-Schensted-Knuth mapping in O(n sqrt(n) log(n)) time
pair<Tableau, Tableau> fastRSKMapping(vector<int> pi) {
	int n = pi.size(), k = 1;
	if (n == 0) return {}; // p_cols[0] below needs at least one element
	while(k*k < n) ++k;

	vector<int> inv_pi(n);
	for (int i = 0; i < n; ++i) inv_pi[pi[i]] = i;

	Tableau p = partialRSK(pi, k), q = partialRSK(inv_pi, k);
	reverse(pi.begin(), pi.end()); reverse(inv_pi.begin(), inv_pi.end());
	Tableau p_cols = partialRSK(pi, k), q_cols = partialRSK(inv_pi, k);

	p.resize(p_cols[0].size()); q.resize(q_cols[0].size());
	for (int i = k; i < p.size(); ++i) {
		for (int j = 0; j < p_cols.size() && p_cols[j].size() > i; ++j) {
			p[i].emplace_back(p_cols[j][i]);
			q[i].emplace_back(q_cols[j][i]);
		}
	}
	return {p, q};
}

Unfortunately, the proofs for properties $$$1$$$ to $$$4$$$ are long, involved and very hard to understand (at least for my counting-challenged brain), so I won't put them here. The proof for properties $$$1$$$ and $$$2$$$ for $$$m > 1$$$ is due to Greene. Properties $$$3$$$ and $$$4$$$ appear as theorems 3.2.1 and 4.1.1 here.

Hook-Length Formula

For $$$\lambda \in P_n$$$ and $$$(i, j) \in Y(\lambda)$$$, we define $$$h_{\lambda}(i, j)$$$ as the number of squares in the diagram that are directly below or directly to the right of square $$$(i, j)$$$, counting $$$(i, j)$$$ itself. Formally written, $$$h_{\lambda}(i, j) = (\lambda^T_j - i) + (\lambda_i - j) + 1$$$. The hook-length theorem states that $$$t(\lambda) = c(\lambda)$$$ for

\begin{equation} t(\lambda) = \frac{n!}{\prod_{(i, j) \in Y(\lambda)} h_\lambda(i, j)} \end{equation}

Computing $$$t(\lambda)$$$ is indeed easy to implement in $$$\mathcal{O}(n)$$$ time, and unlike many other results stated in this blog, the hook-length formula's proof is not hard and very beautiful. The below proof is from here.

Code for the hook-length formula

#include <bits/stdc++.h>
using namespace std;
using Tableau = vector<vector<int>>;
using ll = long long;
const ll MOD = (ll)1e9 + 7;
ll modPow(ll a, ll b) { return (b == 0) ? 1 : ((b & 1) ? a * modPow(a, b ^ 1) % MOD : modPow(a*a % MOD, b >> 1)); }

// Calculates the number of standard Young tableau of shape lambda in O(n) using the hook-length formula
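ll hookLength(int n, const vector<int>& lambda) {
	if (lambda.empty()) return 1; // empty diagram
	int rows = lambda.size();
	// lambda_t[j] = number of rows with more than j cells (the transpose partition)
	vector<int> lambda_t(lambda[0]);
	for (int i = 0, j = lambda[0] - 1; i < rows; ++i) {
		while(j >= 0 && (i + 1 == rows || lambda[i+1] <= j)) lambda_t[j--] = i + 1;
	}

	// num = n!, div = product of all hook lengths
	ll num = 1, div = 1;
	for (int i = 1; i <= n; ++i) num = num * i % MOD;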
	for (int i = 0; i < lambda.size(); ++i) {
		for (int j = 0; j < lambda[i]; ++j) {
			ll mult = 1 + (lambda[i] - j - 1) + (lambda_t[j] - i - 1);
			div = div * mult % MOD;
		}
	}
	return num * modPow(div, MOD - 2) % MOD;
}

Proof of the hook-length formula

Denote by $$$\lambda^-$$$ the set of partitions $$$\mu$$$ of $$$n-1$$$ such that $$$Y(\mu) \subseteq Y(\lambda)$$$. These partitions have the same Young diagram as $$$\lambda$$$ with one corner square removed.

Note that $$$c(\lambda)$$$ satisfies the recurrence $$$c(\lambda) = \sum_{\mu \in \lambda^-} c(\mu)$$$ as the maximum value has to occur in some corner cell.
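
The recurrence can be turned directly into a slow reference implementation, which is handy for checking a hook-length implementation on small shapes. This is only a sketch: the name `countByRecurrence` is made up, the empty partition is used as the base case (it has exactly one tableau, which agrees with $$$c(\mu) = 1$$$ for the partition of $$$1$$$), and the running time grows with the number of sub-partitions, so it is not meant for large $$$n$$$.

```cpp
#include <algorithm>
#include <cassert>
#include <map>
#include <vector>
using namespace std;
using ll = long long;

// c(lambda) by the corner-removal recurrence, memoised on the partition
ll countByRecurrence(const vector<int>& lambda, map<vector<int>, ll>& memo) {
	assert(is_sorted(lambda.rbegin(), lambda.rend())); // lambda must be non-increasing
	if (lambda.empty()) return 1; // the empty diagram has exactly one (empty) tableau
	auto it = memo.find(lambda);
	if (it != memo.end()) return it->second;
	ll res = 0;
	for (int i = 0; i < (int)lambda.size(); ++i) {
		// Row i ends in a corner iff the next row is strictly shorter (or there is no next row)
		bool corner = (i + 1 == (int)lambda.size() || lambda[i + 1] < lambda[i]);
		if (!corner) continue;
		vector<int> mu = lambda;
		if (--mu[i] == 0) mu.pop_back(); // only the last row can become empty
		res += countByRecurrence(mu, memo);
	}
	return memo[lambda] = res;
}
```

For the shape $$$(3, 2)$$$ the corners are the ends of both rows, so $$$c((3, 2)) = c((2, 2)) + c((3, 1)) = 2 + 3 = 5$$$, which matches $$$5! / (4 \cdot 3 \cdot 1 \cdot 2 \cdot 1)$$$ from the hook-length formula.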

We define a random walk on the Young diagram $$$Y(\lambda)$$$ of the partition. The random walk starts at a uniformly random square of $$$Y(\lambda)$$$, and in every step moves from $$$(i, j)$$$ to a uniformly random square in $$$H_\lambda(i, j) \setminus \{(i, j)\}$$$ where $$$H_\lambda(i, j)$$$ is the set of squares in $$$Y(\lambda)$$$ directly below or to the right of $$$(i, j)$$$. Thus the probability of ending up in square $$$(i', j') \in H_\lambda(i, j) \setminus \{(i, j)\}$$$ is $$$\frac{1}{h_\lambda(i, j) - 1}$$$. When $$$h_\lambda(i, j) = 1$$$, the random walk stops.

After the random walk stops, remove the square it ended up at from $$$Y(\lambda)$$$. We'll show that the partition corresponding to the resulting diagram is $$$\mu \in \lambda^-$$$ with probability $$$t(\mu) / t(\lambda)$$$. Since the process always results in some $$$\mu \in \lambda^-$$$, the probabilities must sum to one, so we must then have $$$t(\lambda) = \sum_{\mu \in \lambda^-} t(\mu)$$$. This is the same recurrence that $$$c$$$ satisfies, and since $$$t(\mu) = c(\mu) = 1$$$ for the only partition $$$\mu$$$ of $$$1$$$, the result follows by induction.

First, note that if $$$Y(\lambda) = Y(\mu) \cup \{(i, j)\}$$$, we have \begin{equation} \frac{t(\mu)}{t(\lambda)} = \frac{1}{n} \prod_{k < i} \frac{h_\lambda(k, j)}{h_\lambda(k, j) - 1} \prod_{k < j} \frac{h_\lambda(i, k)}{h_\lambda(i, k) - 1} = \frac{1}{n} \prod_{k < i} \left(1 + \frac{1}{h_\lambda(k, j) - 1}\right) \prod_{k < j} \left(1 + \frac{1}{h_\lambda(i, k) - 1}\right) \end{equation}

For a random walk $$$((a_1, b_1), \dots, (a_k, b_k))$$$, let $$$A = \{a_1, \dots, a_k\}$$$ be the set of visited $$$y$$$-coordinates, and $$$B = \{b_1, \dots, b_k\}$$$ be the set of visited $$$x$$$-coordinates. Since the square the walk ends up in is $$$(a_k, b_k) = (\max A, \max B)$$$, the pair $$$(A, B)$$$ uniquely determines where the walk ends (though it does not uniquely determine the walk).

By $$$p(A, B | a, b)$$$, denote the probability that a random walk starting at $$$(a, b)$$$ has the given sets $$$A$$$ and $$$B$$$. If $$$\alpha = \max A$$$ and $$$\beta = \max B$$$, we claim that \begin{equation} p(A, B\ | a, b) = \prod_{i \in A \setminus \{\alpha\}} \frac{1}{h_\lambda(i, \beta) - 1} \prod_{j \in B \setminus \{\beta\}} \frac{1}{h_\lambda(\alpha, j) - 1} \end{equation} This is an easy proof by induction on $$$|A| + |B|$$$: if $$$A = \{a_1, \dots, a_s\}$$$ and $$$B = \{b_1, \dots, b_t\}$$$ in increasing order, with $$$a_1 = a, b_1 = b$$$ and $$$a_s = \alpha, b_t = \beta$$$, \begin{equation} p(A, B\ | a, b) = \frac{1}{h_\lambda(a, b) - 1} \left(p(A \setminus \{a\}, B | a_2, b) + p(A, B \setminus \{b\} | a, b_2)\right) \end{equation} which by induction equals \begin{equation} \frac{1}{h_\lambda(a, b) - 1} \left((h_\lambda(a, \beta) - 1) + (h_\lambda(\alpha, b) - 1)\right)\prod_{i \in A \setminus \{\alpha\}} \frac{1}{h_\lambda(i, \beta) - 1} \prod_{j \in B \setminus \{\beta\}} \frac{1}{h_\lambda(\alpha, j) - 1} \end{equation} Since $$$(\alpha, \beta)$$$ is a corner, we have $$$\lambda_\alpha = \beta$$$ and $$$\lambda^T_\beta = \alpha$$$, so $$$h_\lambda(a, b) - 1 = (h_\lambda(a, \beta) - 1) + (h_\lambda(\alpha, b) - 1)$$$, thus the claimed equation for $$$p(A, B\ | a, b)$$$ holds.

Now, we are done, as \begin{equation} \sum_{a \leq \alpha, b \leq \beta} \frac{1}{n} \sum_{\stackrel{A \subseteq [n]}{\stackrel{\min A = a}{\max A = \alpha}}} \sum_{\stackrel{B \subseteq [n]}{\stackrel{\min B = b}{\max B = \beta}}} p(A, B | a, b) = \frac{1}{n} \prod_{k < \alpha} \left(1 + \frac{1}{h_\lambda(k, \beta) - 1}\right) \prod_{k < \beta} \left(1 + \frac{1}{h_\lambda(\alpha, k) - 1}\right) \end{equation} You can see this by looping $$$y$$$ from $$$\alpha$$$ to $$$1$$$ and deciding if $$$y$$$ should be in $$$A$$$, then doing the same for $$$x, \beta$$$ and $$$B$$$. The $$$\frac{1}{n}$$$ factor comes from the random selection of the starting square.

The above proof for the hook-length formula also gives an unbiased sampler for $$$p$$$ given $$$\lambda$$$, though I doubt that will ever be useful in competitive programming.
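
As a rough sketch of such a sampler, one can repeat the random walk $$$n$$$ times: each walk ends in a corner of the current diagram, that corner receives the largest unused value, and the corner is then removed. The code below assumes this procedure. The names `sampleTableau` and `isStandardTableau` are made up for this example, and no attempt is made to optimise the walk.

```cpp
#include <cassert>
#include <random>
#include <vector>
using namespace std;

// Samples a tableau of shape lambda with entries 1..n by repeating the hook walk
vector<vector<int>> sampleTableau(vector<int> lambda, mt19937& rng) {
	int n = 0;
	for (int len : lambda) n += len;
	vector<vector<int>> p(lambda.size());
	for (size_t i = 0; i < lambda.size(); ++i) p[i].assign(lambda[i], 0);

	for (int val = n; val >= 1; --val) {
		// Uniformly random starting square among the val remaining squares
		int r = uniform_int_distribution<int>(0, val - 1)(rng);
		int i = 0;
		while (r >= lambda[i]) r -= lambda[i++];
		int j = r;
		while (true) {
			int right = lambda[i] - j - 1; // squares to the right of (i, j) in row i
			int below = 0;                 // squares below (i, j) in column j
			for (int t = i + 1; t < (int)lambda.size() && lambda[t] > j; ++t) ++below;
			if (right + below == 0) break; // hook length 1: the walk stops
			int step = uniform_int_distribution<int>(0, right + below - 1)(rng);
			if (step < right) j += step + 1;
			else i += step - right + 1;
		}
		assert(j == lambda[i] - 1); // the walk always stops at the end of a row
		p[i][j] = val;
		--lambda[i]; // remove the corner; rows of length 0 are simply never entered
	}
	return p;
}

// Checks that values increase along every row and down every column
bool isStandardTableau(const vector<vector<int>>& p) {
	for (size_t i = 0; i < p.size(); ++i) {
		for (size_t j = 0; j < p[i].size(); ++j) {
			if (j + 1 < p[i].size() && p[i][j] > p[i][j + 1]) return false;
			if (i + 1 < p.size() && j < p[i + 1].size() && p[i][j] > p[i + 1][j]) return false;
		}
	}
	return true;
}
```

Since the walk removes $$$\mu$$$ with probability $$$t(\mu) / t(\lambda)$$$, multiplying these probabilities over all $$$n$$$ steps gives $$$1 / t(\lambda)$$$ for every tableau, which is why the result should be uniform.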

Problems

300IQ contest 3 problem D

GP of southeastern europe 2021 problem D

If you know any more problems where this technique can be applied, please share a link with me. In particular, there probably exists a problem where you have to find for every $$$m$$$ the maximum total size of $$$m$$$ disjoint increasing subsequences.

math, permutations, tutorial, combinatorics

IOI will have honourable mentions starting 2022

By mango_lassi, history, 3 years ago, In English

A participant who scored strictly more than 50% of contestants on at least one day, but does not receive a medal, will be awarded an honourable mention.

The rules change will be effective starting 2022.

So, if you would be awarded a bronze medal based on the results of at least one day, but do not receive a medal, you will receive a HM.

#ioi, #ioi2022, honourable mention

EGOI 2021

By mango_lassi, 3 years ago, In English

The first ever European Girls Olympiad In Informatics (EGOI) is now over. You can view the scoreboard of the official contest on the EGOI 2021 webpage. Congratulations to the medalists, and in particular to Alisa Gladchenko AliceG and Ekaterina Shilyaeva AlFlen for the top scores!

We are also organising a virtual contest with the problems from the olympiad. The two contest days will be on codeforces. There is a 28-hour period during which you can take the 5-hour contest. The first contest will open in a few hours and last until midnight on Saturday, and the second will last from the start of Sunday to early Monday morning. All times are UTC+2!

Day 1: Friday 18.6. 20:00 — Saturday 19.6. 23:59 (EGOI 2021 Day 1)

Day 2: Sunday 20.6. 00:00 — Monday 21.6. 04:00 (EGOI 2021 Day 2)

Feel free to discuss the tasks here after the virtual contest frame ends.

A big shout out to the organisers and volunteers.

EDIT: Congratulations to ko_osaga for being the only one to fully solve Lanterns, and to Radewoosh for fully solving Double Move. I am sorry for messing up the settings of the virtual contest (which made it display only the first 5 hours for day 1, and no results at all for day 2). Unfortunately I do not know how to fix that :(

#egoi, egoi 2021

Bitwise range AND/OR with no updates in O(n) preprocessing and O(1) time per query

By mango_lassi, history, 3 years ago, In English

There was a blog on this topic that I was about to comment on, but it seems that it got deleted :(

It was asking for a solution in $$$\mathcal{O}(n)$$$ preprocessing and $$$\mathcal{O}(1)$$$ query time. There are no updates. We make the standard assumptions that input integers have $$$\mathcal{O}(\log n)$$$ bits and operations on $$$\mathcal{O}(\log n)$$$-bit integers take $$$\mathcal{O}(1)$$$ time.

The obvious solutions to range AND and range OR are to

Count the number of times every bit appears in every prefix ($$$\mathcal{O}(n \log n)$$$ preprocessing and $$$\mathcal{O}(\log n)$$$ query time)
Use an interval AND / OR segment tree ($$$\mathcal{O}(n)$$$ preprocessing and $$$\mathcal{O}(\log n)$$$ query time)
Use a sparse table ($$$\mathcal{O}(n \log n)$$$ preprocessing and $$$\mathcal{O}(1)$$$ query time).
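
For reference, here is a minimal sketch of the sparse table option for range AND (range OR is the same with `|`). It relies on AND being idempotent, so two overlapping power-of-two windows can cover the query. The struct name `AndSparseTable` and its layout are made up for this example.

```cpp
#include <cassert>
#include <vector>
using namespace std;

struct AndSparseTable {
	vector<int> lg;                 // lg[len] = floor(log2(len))
	vector<vector<unsigned>> table; // table[h][i] = AND of a[i .. i + 2^h - 1]

	// O(n log n) preprocessing
	explicit AndSparseTable(const vector<unsigned>& a) : lg(a.size() + 1, 0) {
		int n = a.size();
		for (int len = 2; len <= n; ++len) lg[len] = lg[len / 2] + 1;
		table.push_back(a);
		for (int h = 1; (1 << h) <= n; ++h) {
			table.emplace_back(n - (1 << h) + 1);
			for (int i = 0; i + (1 << h) <= n; ++i) {
				table[h][i] = table[h - 1][i] & table[h - 1][i + (1 << (h - 1))];
			}
		}
	}

	// AND of a[l..r], both ends inclusive and 0-indexed. O(1) per query
	unsigned query(int l, int r) const {
		assert(0 <= l && l <= r && r < (int)table[0].size());
		int h = lg[r - l + 1];
		// Two windows of length 2^h cover [l, r]; overlap is fine since x & x = x
		return table[h][l] & table[h][r - (1 << h) + 1];
	}
};
```

On the array $$$(12, 10, 14, 7)$$$, the query for positions $$$1$$$ to $$$3$$$ returns $$$10 \,\&\, 14 \,\&\, 7 = 2$$$.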

Four Russians is a standard trick to achieve $$$\mathcal{O}(n)$$$ preprocessing and $$$\mathcal{O}(1)$$$ query time. However, I don't see how it can be used here to achieve that: while we can solve all queries of length $$$\Omega(\log n)$$$ in $$$\mathcal{O}(1)$$$ time with $$$\mathcal{O}(n)$$$ preprocessing, it is not clear how to solve queries entirely inside a single block. This is because each block contains $$$\mathcal{O}(\log^2 n)$$$ bits that matter, not $$$\mathcal{O}(\log n)$$$ as is the case for the cartesian tree of the block, when computing interval minimum.

So the best complexity I could come up with is based on iterating the Four Russians trick, which achieves either

$$$\mathcal{O}(n \log^* n)$$$ precalc and $$$\mathcal{O}(1)$$$ time per query (where $$$\log^* n$$$ is the iterated logarithm)
$$$\mathcal{O}(n)$$$ precalc and $$$\mathcal{O}(\log \log \dots \log n)$$$ time per query (taking logarithm any fixed number of times)

Solution with iterated Four Russians

This is pretty good (theoretically, of course this is pointless to do in practice), but is there some way to get $$$\mathcal{O}(n)$$$ preprocessing and $$$\mathcal{O}(1)$$$ query time? Is there even a solution that uses $$$\mathcal{O}(n)$$$ space and has $$$\mathcal{O}(1)$$$ queries?

#range query, #sparse table, #data structure, #four russians

Finding minimum residue of a linear function in O(log M) time

By mango_lassi, history, 3 years ago, In English

I saw a blog (thanks to oversolver for finding it!) earlier today asking how to solve the following problem:

Given a linear function $$$ax + b$$$, find the minimum $$$x \geq 0$$$ such that $$$ax + b \in [L, R]\ (\text{mod } M)$$$.

To solve that problem, we can make the following reduction: If $$$gcd(a, M) > 1$$$, we divide everything by the GCD. The $$$x$$$ for which $$$ax + b = L\ (\text{mod } M)$$$ is $$$(L - b) a^{-1}\ (\text{mod } M)$$$. Denote this value by $$$b_0$$$. Then, the minimum $$$x$$$ to get to $$$L + y$$$ is $$$a^{-1} y + b_0\ (\text{mod } M)$$$. This gives us a reduced problem:

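Find the minimum value of $$$ay + b \text{ mod } M$$$ over $$$0 \leq y \leq k$$$ (here $$$a$$$, $$$b$$$ and $$$k$$$ stand for $$$a^{-1}$$$, $$$b_0$$$ and $$$R - L$$$ of the original problem).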

This seemed pretty hard, but surprisingly I figured out how to do it in $$$\mathcal{O}(\log M)$$$ time! The algorithm is as follows:

In every step, we reduce the modulus from $$$M$$$ to $$$\min(a, M - a) \leq M / 2$$$. This guarantees we do at most $$$\mathcal{O}(\log M)$$$ steps.

To reduce it to $$$a$$$, we consider the first value $$$s$$$ among $$$[0, a)$$$ achieved by $$$ay + b$$$. If $$$b < a$$$, it is $$$b$$$. Otherwise, it is $$$(b - M) \text{ mod } a$$$. We check in $$$\mathcal{O}(1)$$$ if we reach $$$s$$$ for some $$$y \leq k$$$. If we do, set $$$M$$$ to $$$a$$$, $$$a$$$ to $$$-M \text{ mod } a$$$ and $$$b$$$ to $$$s$$$. Otherwise, output $$$b$$$: the values in $$$[0, a)$$$ are the only ones whose previous value was larger, so if we never attain them, the sequence only increases and the first value we attain is the smallest.

To reduce it to $$$M - a$$$, we consider the first value $$$s = b \text{ mod } (M - a)$$$ among $$$[0, M - a)$$$ achieved by $$$ay + b$$$. If we reach $$$s$$$ for some $$$y \leq k$$$, set $$$M$$$ to $$$M - a$$$, $$$a$$$ to $$$a \text{ mod } (M - a)$$$ and $$$b$$$ to $$$s$$$. Otherwise, we can calculate the number of steps to go from $$$v$$$ to $$$v - (M - a)$$$ for $$$v \geq M - a$$$, and from this the smallest value we can reach with $$$y \leq k$$$.
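
When implementing something like this, a brute-force reference for both the original and the reduced problem is useful for stress-testing on small inputs. The sketch below is such a reference, not part of the fast algorithm. The names `minRemBrute` and `firstInRangeBrute` are made up for this example, and the range is assumed to satisfy $$$0 \leq L \leq R < M$$$.

```cpp
#include <cassert>
using ll = long long;

// Minimum of (a*x + b) mod m over 0 <= x <= k, by direct iteration. O(k) time
ll minRemBrute(ll a, ll b, ll m, ll k) {
	assert(m > 0 && k >= 0);
	ll cur = ((b % m) + m) % m, step = ((a % m) + m) % m, best = cur;
	for (ll x = 1; x <= k; ++x) {
		cur = (cur + step) % m;
		if (cur < best) best = cur;
	}
	return best;
}

// Minimum x >= 0 with (a*x + b) mod m in [le, ri], or -1 if there is none. O(m) time
ll firstInRangeBrute(ll a, ll b, ll m, ll le, ll ri) {
	assert(m > 0);
	ll cur = ((b % m) + m) % m, step = ((a % m) + m) % m;
	// The residues repeat with a period dividing m, so m steps see every reachable value
	for (ll x = 0; x < m; ++x) {
		if (le <= cur && cur <= ri) return x;
		cur = (cur + step) % m;
	}
	return -1;
}
```

For example, with $$$a = 3$$$, $$$b = 1$$$ and $$$M = 10$$$ the values are $$$1, 4, 7, 0, 3, 6, \dots$$$, so the first $$$x$$$ landing in $$$[5, 6]$$$ is $$$x = 5$$$.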

code


#include <bits/stdc++.h>
using namespace std;
using ll = long long;

pair<ll, ll> extEucMod(ll a, ll b, ll p) {
	if (b == 0) return {1, 0};
	ll m = a / b;
	auto sub = extEucMod(b, a - b * m, p);
	return {sub.second, (sub.first - m*sub.second) % p};
}
ll modInv(ll a, ll p) {
	ll res = extEucMod(p, a, p).second;
	return (res < 0 ? res + p : res);
}
ll gcd(ll a, ll b) { return (b == 0 ? a : gcd(b, a % b)); }
ll mSub(ll a, ll b, ll m) { return (a >= b ? a - b : a - b + m); }
ll posMod(ll a, ll m) { ll res = a % m; return (res < 0 ? res + m : res); }
ll getSteps(ll t, ll ia, ll b, ll m) { return mSub(t, b, m) * ia % m; }

// Returns minimum value of ax + b (mod m) for x \in [0, k]. O(log m) time
ll minRem(ll a, ll b, ll m, ll k) {
	for (ll g = gcd(a, m); g > 1;) return g * minRem(a/g, b/g, m/g, k) + (b % g);
	for (ll b0 = b, m0 = m, ia0 = modInv(a, m), na, nb, nm; a; a = na, b = nb, m = nm) {
		if (a > m - a) {
			na = a % (m - a);
			nb = b % (m - a);
			nm = m - a;
			for (ll steps = getSteps(nb, ia0, b0, m0); steps > k;) {
				ll add = steps - getSteps(nb + nm, ia0, b0, m0);
				return nb + nm * ((steps - k + (add - 1)) / add);
			}
		} else {
			na = posMod(-m, a);
			nb = (b < a ? b : posMod(b - m, a));
			nm = a;
			if (getSteps(nb, ia0, b0, m0) > k) break;
		}
	}
	return b;
}

// Returns minimum x such that ax + b (mod m) \in [le, ri] or -1 if there is no such x. O(log m) time
ll firstInRange(ll a, ll b, ll m, ll le, ll ri) {
	for (ll g = gcd(a, m); g > 1;) return firstInRange(a/g, b/g, m/g, le/g + (le % g > b % g), ri/g - (ri % g < b % g));
	if (le > ri) return -1; // impossible
	ll ia = modInv(a, m);
	return minRem(ia, mSub(le, b, m) * ia % m, m, ri - le);
}


int main() {
	ios_base::sync_with_stdio(false);
	cin.tie(0);

	ll a, b, m, le, ri;
	cin >> a >> b >> m >> le >> ri;
	cout << firstInRange(a, b, m, le, ri) << '\n';
}

What I wanted to ask is, is this a known problem, and is there a simpler or perhaps even faster solution to it? The problem seems fairly simple, so I doubt nobody has thought about it before.

#number theory, #euclidean algorithm, #continued fractions

Top rated users list?

By mango_lassi, history, 5 years ago, In English

Is there any way to view top rated users (with inactive users included)? (The "Rating (All)" tab would logically be this, but the list is just the same as "Rating". Further, it seems you need to login to even see the "Rating (All)" tab. What even is it?)
Is there a list of users sorted by their max. rating?
Is there some convenient way to show only European contestants or do any filtering other than by country, city or organisation?

I've tried to google for these but haven't found anything. If any of these lists are hidden somewhere on the site, or any external sites have these lists, please inform me.

top rated, feature request, filter, top rating

NWERC 2019

By mango_lassi, history, 5 years ago, In English

The 2019 NWERC (Northwestern European regional contest) is tomorrow. You should see the scoreboard here once the contest starts.

There will be an online mirror here.

Let's discuss the problems after the contest!

#acmicpc2019, online mirror, acm icpc regional

Problem H (Game on the Tree) from CCPC 2018

By mango_lassi, history, 5 years ago, In English

This problem

I have a solution I think is correct but can't get it to pass, and apparently even my n^2 code (standard game theory iterating through all states solution) that should obviously work gets WA. They both give the same answers for random small inputs.

I can't find editorials or test data for the contest, so seeing so many teams have +1 or more in this problem, is there some tricky corner case I might have missed?

My code if it is of any interest:

code

#include <iostream>
#include <vector>
#include <utility>
#include <algorithm>
using namespace std;
const int INF = (int)1e9 + 7;

bool findCycle(int i, vector<int>& cycle, vector<int>& par, const vector<vector<int>>& conns) {
	for (auto t : conns[i]) {
		if (par[i] == t) continue;
		if (par[t] != -1) {
			if (par[t] == i) swap(t, i);
			while(i != t) {
				cycle.push_back(i);
				i = par[i];
			}
			cycle.push_back(i);
			return true;
		} else {
			par[t] = i;
			bool found = findCycle(t, cycle, par, conns);
			if (found) return true;
		}
	}
	return false;
}

void dfs(int i, vector<int>& par, vector<int>& ban, const vector<vector<int>>& conns) {
	for (auto t : conns[i]) {
		if (par[t] != -1) continue;
		if (ban[t] != -1) continue;
		par[t] = i;
		dfs(t, par, ban, conns);
	}
}
vector<int> getDists(int s, const vector<vector<int>>& conns) {
	int n = conns.size();
	vector<int> dist(n, INF);
	dist[s] = 0;

	vector<int> que = {s};
	for (int j = 0; j < que.size(); ++j) {
		int i = que[j];
		for (auto t : conns[i]) {
			if (dist[t] == INF) {
				dist[t] = dist[i] + 1;
				que.push_back(t);
			}
		}
	}
	return dist;
}

int findClosest(const vector<int>& list, const vector<int>& dist) {
	int rd = INF;
	int ri = -1;
	for (int i = 0; i < list.size(); ++i) {
		if (dist[list[i]] < rd) {
			rd = dist[list[i]];
			ri = i;
		}
	}
	return ri;
}

void solve(int ti) {
	int n;
	cin >> n;
	vector<vector<int>> conns(n);
	for (int i = 0; i < n; ++i) {
		int a, b;
		cin >> a >> b;
		--a; --b;
		conns[a].push_back(b);
		conns[b].push_back(a);
	}
	int m;
	cin >> m;
	vector<int> dest(m);
	for (int i = 0; i < m; ++i) {
		cin >> dest[i];
		--dest[i];
	}

	int a, b;
	cin >> a >> b;
	--a; --b;

	vector<int> cycle;
	vector<int> par(n, -1);

	par[0] = 0;
	findCycle(0, cycle, par, conns);

	int k = cycle.size();
	vector<int> cyc_ind(n, -1);
	for (int j = 0; j < k; ++j) cyc_ind[cycle[j]] = j;

	// See if A can win without doing anything related to the cycle
	bool win = false;
	vector<int> a_dists = getDists(a, conns);
	vector<int> b_dists = getDists(b, conns);
	for (auto i : dest) {
		if (a_dists[i] <= b_dists[i]) win = true;
	}

	// See if A can win by playing in the cycle
	if (! win) {
		for (int i = 0; i < n; ++i) par[i] = -1;
		for (auto i : cycle) {
			par[i] = i;
			dfs(i, par, cyc_ind, conns);
		}
		// cyc_win[i] = true for i in cycle if branch leading to win starts from i
		vector<bool> cyc_win(n, false);
		for (auto i : dest) {
			for (int j = i; !cyc_win[j]; j = par[j]) {
				cyc_win[j] = true;
			}
		}

		// Find closest nodes in cycle for a and b
		int cai = findClosest(cycle, a_dists);
		int cbi = findClosest(cycle, b_dists);
		int ca = cycle[cai];
		int cb = cycle[cbi];
		int da = a_dists[ca];
		int db = b_dists[cb];

		// Find closest cyc_win to the left and to the right of ca
		int le_win = -1;
		int ri_win = -1;
		for (int j = cai;; j = (j-1+k)%k) {
			if (cyc_win[cycle[j]]) {
				le_win = j;
				break;
			}
		}
		for (int j = cai;; j = (j+1)%k) {
			if (cyc_win[cycle[j]]) {
				ri_win = j;
				break;
			}
		}
		
		//     A
		//     |da
		// x - x - x
		// | da1   | da2
		// lw      rw
		// | db1   | db2
		// x - x - x
		//     | db
		//     B

		int da1 = (cai - le_win + k) % k;
		int da2 = (ri_win - cai + k) % k;
		int db1 = (le_win - cbi + k) % k;
		int db2 = (cbi - ri_win + k) % k;

		if (db1 + db2 > k) {
			// A and B are on the same side
			// We need to figure out which side of A B is in
			db1 = (k - db1) % k;
			db2 = (k - db2) % k;
			int dab = abs(da1 - db1);

			if (db + dab < da) {
				win = false; // B can block A from entering cycle
			} else {
				if (db1 <= da1) {
					if (db + (k - db2) < da + da2) {
						win = false; // B can go all the way around
					} else {
						win = true;
					}
				} else {
					if (db + (k - db1) < da + da1) {
						win = false; // B can go all the way around
					} else {
						win = true;
					}
				}
			}
		} else {
			// A and B are on opposite sides
			if (db + min(da1 + db1, da2 + db2) < da) {
				win = false; // B can block A from entering cycle
			} else if (le_win == ri_win) {
				win = false; // Block messages
			} else {
				// Can B get into a position where it can get to 
				int ddif1 = db1 - da1 + 1; // Move this many times left for standstill
				int ddif2 = db2 - da2 + 1; // Move this many times right for standstill
				if (ddif1 + ddif2 <= 0 && min(0, max(ddif1, ddif2)) + db <= da) {
					win = false; // B can get to standstill
				} else {
					win = true; // A can escape standstill
				}
			}
		}
	}

	// Output result
	cout << "Case " << (ti+1) << ": ";
	if (win) cout << "Panda" << '\n';
	else cout << "Sheep\n";
}

int main() {
	ios_base::sync_with_stdio(false);
	cin.tie(0);

	int t;
	cin >> t;
	for (int ti = 0; ti < t; ++ti) solve(ti);
}

n^2 code

#include <iostream>
#include <vector>
#include <utility>
#include <algorithm>
using namespace std;
const int INF = (int)1e9 + 7;

const int N = 1010;
int win[2][N][N];
int rem[2][N][N];
vector<pair<int, pair<int, int>>> que;

void resolve(int s, int i, int j, int r) {
	if (rem[s][i][j] <= 0) return;
	win[s][i][j] = r;
	rem[s][i][j] = 0;
	que.push_back({s, {i, j}});
}

void solve(int ti) {
	int n;
	cin >> n;

	que.clear();
	for (int s = 0; s < 2; ++s) {
		for (int i = 0; i < n; ++i) {
			for (int j = 0; j < n; ++j) {
				win[s][i][j] = 0;
				rem[s][i][j] = 0;
			}
		}
	}

	vector<vector<int>> conns(n);
	for (int i = 0; i < n; ++i) {
		int a, b;
		cin >> a >> b;
		--a; --b;
		conns[a].push_back(b);
		conns[b].push_back(a);
	}
	for (int i = 0; i < n; ++i) conns[i].push_back(i);

	int m;
	cin >> m;
	vector<int> dest(m);
	for (int i = 0; i < m; ++i) {
		cin >> dest[i];
		--dest[i];
	}

	int a, b;
	cin >> a >> b;
	--a; --b;

	for (int i = 0; i < n; ++i) {
		for (int j = 0; j < n; ++j) {
			rem[0][i][j] = conns[i].size();
			rem[1][i][j] = conns[j].size();
		}
	}

	for (auto i : dest) {
		for (int j = 0; j < n; ++j) {
			resolve(0, i, j, 1);
			if (i != j) resolve(1, i, j, 1);
		}
	}
	for (int i = 0; i < n; ++i) {
		resolve(1, i, i, -1);
	}
	for (int x = 0; x < (int)que.size(); ++x) {
		int s = que[x].first;
		int i = que[x].second.first;
		int j = que[x].second.second;
		int r = (s ? 1 : -1);

		if (s == 0) {
			if (win[s][i][j] == r) {
				for (auto t : conns[j]) {
					resolve(s^1, i, t, r);
				}
			} else {
				for (auto t : conns[j]) {
					if (rem[s^1][i][t] == 1) resolve(s^1, i, t, -r);
					else --rem[s^1][i][t];
				}
			}
		} else {
			if (win[s][i][j] == r) {
				for (auto t : conns[i]) {
					resolve(s^1, t, j, r);
				}
			} else {
				for (auto t : conns[i]) {
					if (rem[s^1][t][j] == 1) resolve(s^1, t, j, -r);
					else --rem[s^1][t][j];
				}
			}

		}
	}

	// Output result
	bool ans = (win[0][a][b] == 1);
	cout << "Case " << (ti+1) << ": ";
	if (ans) cout << "Panda" << '\n';
	else cout << "Sheep\n";
}

int main() {
	ios_base::sync_with_stdio(false);
	cin.tie(0);

	int t;
	cin >> t;
	for (int ti = 0; ti < t; ++ti) solve(ti);
}

Submissions: 54559534 54559186

bug, corner case, case analysis, game theory

mango_lassi
5 years ago
2

Non-Recursive HLD Implementation

By mango_lassi, history, 5 years ago, In English

I wrote this pretty short implementation of Heavy-Light Decomposition today:

code

#include <iostream>
#include <utility>
#include <vector>
using namespace std;

struct HLD {
	vector<int> par; // Parent
	vector<int> pp; // Path parent (Highest ancestor in same heavy-edge segment)
	vector<int> ind; // HLD index (Index in the array HLD maps nodes to)

	// p: parent of node i. Must have p[i] < i (p[0] = -1).
	HLD(const vector<int>& p) : par(p), pp(p.size()), ind(p.size(), -1) {
		int n = p.size();
		vector<int> siz(n, 1); // subtree size
		for (int i = n-1; i > 0; --i) siz[par[i]] += siz[i];

		vector<int> pc(n, -1); // Preferred child
		for (int i = n-1; i > 0; --i) {
			if (2*siz[i] >= siz[par[i]]) pc[par[i]] = i;
		}

		int cur = 0; // Current position in array
		for (int i = 0; i < n; ++i) {
			if (ind[i] != -1) continue;
			for (int j = i; j != -1; j = pc[j], ++cur) {
				ind[j] = cur;
				pp[j] = i;
			}
		}
	}
	// Get intervals corresponding to path between a and b
	vector<pair<int, int>> get(int a, int b) {
		vector<pair<int, int>> res;
		while(true) {
			if (ind[b] < ind[a]) swap(a, b);
			if (ind[pp[b]] <= ind[a]) {
				res.push_back({ind[a], ind[b]});
				return res;
			} else {
				res.push_back({ind[pp[b]], ind[b]});
				b = par[pp[b]];
			}
		}
	}
};

It uses no recursion, which is nice, and takes data in the index-of-parent format, which can be a small plus or a large minus (I needed HLD over Aho-Corasick automata, so that input format is useful there). The intervals returned by get are in decreasing order of index, not in the order they appear along the path, so order-dependent updates such as adding an arithmetic sequence along a path are not directly possible.

The par array stores the parent of every node, ind gives the index of each node in the array that HLD maps the nodes to, and pp gives the highest ancestor of the node that is mapped to the same contiguous segment.

get returns a vector of $$$O(\log n)$$$ intervals such that the HLD index of every node on the path between $$$a$$$ and $$$b$$$ lies in exactly one of the returned intervals. Specifically, it returns all nonempty intersections of heavy paths with the path between $$$a$$$ and $$$b$$$, in decreasing order (the first interval has the highest start point and end point). There are $$$O(\log n)$$$ such intervals: a light edge leads to a child $$$c$$$ with $$$2 \cdot siz[c] < siz[par[c]]$$$, so every path contains $$$O(\log n)$$$ light edges, and consecutive heavy intervals on the path are separated by a light edge.
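To make the three arrays and the output of get concrete, here is a small worked example. It is only a sketch: the struct is a copy of the one above with an extra input check, and the 7-node tree `EXAMPLE_PAR` is made up for this illustration.

```cpp
#include <cassert>
#include <utility>
#include <vector>
using namespace std;

// Copy of the HLD struct from the post, plus an input check,
// so that the example compiles on its own.
struct HLD {
	vector<int> par, pp, ind; // parent, path parent, HLD index

	HLD(const vector<int>& p) : par(p), pp(p.size()), ind(p.size(), -1) {
		int n = p.size();
		for (int i = 1; i < n; ++i) assert(0 <= p[i] && p[i] < i); // required input format
		vector<int> siz(n, 1); // subtree size
		for (int i = n-1; i > 0; --i) siz[par[i]] += siz[i];

		vector<int> pc(n, -1); // preferred child
		for (int i = n-1; i > 0; --i) {
			if (2*siz[i] >= siz[par[i]]) pc[par[i]] = i;
		}

		int cur = 0; // current position in array
		for (int i = 0; i < n; ++i) {
			if (ind[i] != -1) continue;
			for (int j = i; j != -1; j = pc[j], ++cur) {
				ind[j] = cur;
				pp[j] = i;
			}
		}
	}
	vector<pair<int, int>> get(int a, int b) {
		vector<pair<int, int>> res;
		while (true) {
			if (ind[b] < ind[a]) swap(a, b);
			if (ind[pp[b]] <= ind[a]) {
				res.push_back({ind[a], ind[b]});
				return res;
			}
			res.push_back({ind[pp[b]], ind[b]});
			b = par[pp[b]];
		}
	}
};

// Example tree, made up for this illustration:
// node 0 has children 1 and 2, node 1 has children 3 and 4,
// node 3 has child 5 and node 2 has child 6.
const vector<int> EXAMPLE_PAR = {-1, 0, 0, 1, 1, 3, 2};
```

With `HLD h(EXAMPLE_PAR)`, the heavy paths are 0-1-3-5, 2-6 and 4, so `h.ind` is {0, 1, 4, 2, 6, 3, 5} and `h.pp` is {0, 0, 2, 0, 4, 0, 2}. The path 4-1-0-2-6 is then reported by `h.get(4, 6)` as the intervals (6, 6), (4, 5), (0, 1), which is decreasing order rather than path order.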

get works because the path from $$$b$$$ to $$$pp[b]$$$ corresponds to the contiguous interval $$$[ind[pp[b]], ind[b]]$$$ in the array HLD maps the nodes to. Therefore, if $$$ind[pp[b]] \leq ind[a] \leq ind[b]$$$, then $$$a$$$ is on the path between $$$pp[b]$$$ and $$$b$$$. If instead $$$ind[a] < ind[pp[b]]$$$, then no node on the segment from $$$pp[b]$$$ to $$$b$$$ is an ancestor of $$$a$$$, since the HLD index of a node's ancestor is always less than the index of the node itself.
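The same argument can be applied to whichever endpoint currently has the larger index, without swapping the two. That suggests one possible way to get the pieces in path order, if that is ever needed. The function below is a hypothetical variant, not part of the original implementation: the name `orderedPath` and the (from, to) convention are assumptions made for this sketch, and it takes the three arrays of the struct as plain arguments.

```cpp
#include <cassert>
#include <utility>
#include <vector>
using namespace std;

// Hypothetical variant of get (not from the original post). Returns pairs
// (from, to) of HLD indices in the order the path from a to b visits them.
// from > to: the piece is walked towards the root (decreasing index).
// from < to: the piece is walked away from the root (increasing index).
// from == to: the piece is a single node.
vector<pair<int, int>> orderedPath(const vector<int>& par, const vector<int>& pp,
                                   const vector<int>& ind, int a, int b) {
	assert(pp.size() == par.size() && ind.size() == par.size());
	vector<pair<int, int>> up, down; // pieces on a's side and on b's side
	while (true) {
		if (ind[a] > ind[b]) {
			if (ind[pp[a]] <= ind[b]) { // b is on a's heavy path, above a
				up.push_back({ind[a], ind[b]});
				break;
			}
			up.push_back({ind[a], ind[pp[a]]});
			a = par[pp[a]];
		} else {
			if (ind[pp[b]] <= ind[a]) { // a is on b's heavy path, at or above b
				down.push_back({ind[a], ind[b]});
				break;
			}
			down.push_back({ind[pp[b]], ind[b]});
			b = par[pp[b]];
		}
	}
	// Pieces on b's side were collected bottom-up, so append them reversed.
	up.insert(up.end(), down.rbegin(), down.rend());
	return up;
}
```

With an `HLD h` built as above, the call would be `orderedPath(h.par, h.pp, h.ind, a, b)`. The number of pieces is the same as for get, but a range structure used with it has to handle both walking directions.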

Since get also finds the LCA, this also gives a nice LCA implementation:

code

        // Get LCA of a and b
        int lca(int a, int b) {
                while(true) {
                        if (ind[b] < ind[a]) swap(a, b); 
                        if (ind[pp[b]] <= ind[a]) return a;
                        else b = par[pp[b]];
                }
        }
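A cheap way to sanity-check this lca is to compare it against a naive one, which is easy to write under the same p[i] < i assumption. The following is a sketch only: `hldLca` repeats the loop above as a free function over the three arrays, while `naiveLca` and `lcaAgrees` are hypothetical helpers introduced for this check.

```cpp
#include <cassert>
#include <utility>
#include <vector>
using namespace std;

// Same loop as lca in the post, written as a free function over the three arrays.
int hldLca(const vector<int>& par, const vector<int>& pp, const vector<int>& ind, int a, int b) {
	while (true) {
		if (ind[b] < ind[a]) swap(a, b);
		if (ind[pp[b]] <= ind[a]) return a;
		b = par[pp[b]];
	}
}

// Hypothetical reference implementation that relies only on par[i] < i.
// Ancestors always have smaller labels, so the node with the larger label
// cannot be an ancestor of the other one and can safely be lifted.
int naiveLca(const vector<int>& par, int a, int b) {
	while (a != b) {
		if (a > b) a = par[a];
		else b = par[b];
	}
	return a;
}

// Returns true if both functions agree on every pair of nodes.
bool lcaAgrees(const vector<int>& par, const vector<int>& pp, const vector<int>& ind) {
	assert(pp.size() == par.size() && ind.size() == par.size());
	int n = par.size();
	for (int a = 0; a < n; ++a) {
		for (int b = 0; b < n; ++b) {
			if (hldLca(par, pp, ind, a, b) != naiveLca(par, a, b)) return false;
		}
	}
	return true;
}
```

With an `HLD h` built as above, `lcaAgrees(h.par, h.pp, h.ind)` compares the two on all pairs. The naive version takes time proportional to the depth per query, so this is only meant for small random tests.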

Here's Caves and Tunnels solved with this HLD-implementation. Changing the tree format is pretty nasty, so the code isn't that clean :/

code

#include <algorithm>
#include <iostream>
#include <utility>
#include <vector>
using namespace std;
using ll = long long;

struct HLD {
	vector<int> par; // Parent
	vector<int> pp; // Path parent
	vector<int> ind; // HLD index

	// p: parent of node i. Must have p[i] < i (p[0] = -1).
	HLD(const vector<int>& p) : par(p), pp(p.size()), ind(p.size(), -1) {
		int n = p.size();
		vector<int> siz(n, 1); // subtree size
		for (int i = n-1; i > 0; --i) siz[par[i]] += siz[i];

		vector<int> pc(n, -1); // Preferred child
		for (int i = n-1; i > 0; --i) {
			if (2*siz[i] >= siz[par[i]]) pc[par[i]] = i;
		}

		int cur = 0; // Current position in array
		for (int i = 0; i < n; ++i) {
			if (ind[i] != -1) continue;
			for (int j = i; j != -1; j = pc[j], ++cur) {
				ind[j] = cur;
				pp[j] = i;
			}
		}
	}
	// Get intervals corresponding to path between a and b
	vector<pair<int, int>> get(int a, int b) {
		vector<pair<int, int>> res;
		while(true) {
			if (ind[b] < ind[a]) swap(a, b);
			if (ind[pp[b]] <= ind[a]) {
				res.push_back({ind[a], ind[b]});
				return res;
			} else {
				res.push_back({ind[pp[b]], ind[b]});
				b = par[pp[b]];
			}
		}
	}
};

struct SegTree {
	vector<ll> seg;
	int h = 1;

	SegTree(int n) {
		while(h < n) h <<= 1;
		seg.resize(2*h, 0);
	}
	ll recGet(int a, int b, int i, int ia, int ib) const {
		if (b < ia || ib < a) return 0;
		if (a <= ia && ib <= b) return seg[i];
		int im = (ia + ib) >> 1;
		ll res = 0;
		res = max(res, recGet(a, b, 2*i, ia, im));
		res = max(res, recGet(a, b, 2*i+1, im+1, ib));
		return res;
	}
	ll get(int a, int b) const {
		return recGet(a, b, 1, 0, h-1);
	}
	void add(int i, int v) {
		i += h;
		seg[i] += v;
		for (i >>= 1; i >= 1; i >>= 1) {
			seg[i] = max(seg[2*i], seg[2*i+1]);
		}
	}
};

const int N = 1e5;
vector<int> conns[N];
int ord[N];

int dfs(int i, int p = -1, int j = 0) {
	ord[i] = j;
	j += 1;
	for (auto t : conns[i]) {
		if (t != p) j = dfs(t, i, j);
	}
	return j;
}

int main() {
	ios_base::sync_with_stdio(false);
	cin.tie(0);

	int n;
	cin >> n;
	for (int i = 0; i < n-1; ++i) {
		int a, b;
		cin >> a >> b;
		--a; --b;
		conns[a].push_back(b);
		conns[b].push_back(a);
	}

	dfs(0);
	vector<int> par(n, -1);
	for (int i = 0; i < n; ++i) {
		for (auto t : conns[i]) {
			if (ord[t] < ord[i]) par[ord[i]] = ord[t];
		}
	}

	HLD hld(par);
	SegTree seg(n);

	int q;
	cin >> q;
	for (int j = 0; j < q; ++j) {
		char t;
		cin >> t;
		if (t == 'I') {
			int i, v;
			cin >> i >> v;
			i = ord[i-1];
			seg.add(hld.ind[i], v);
		} else {
			int a, b;
			cin >> a >> b;
			a = ord[a-1]; b = ord[b-1];

			auto ints = hld.get(a, b);
			ll res = 0;
			for (auto pr : ints) {
				res = max(res, seg.get(pr.first, pr.second));
			}
			cout << res << '\n';
		}
	}
}
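The relabelling in the solution above uses a recursive dfs, which is slightly at odds with the non-recursive HLD. One possible alternative, sketched here under the assumption that the tree is given as adjacency lists with at least one node, is to relabel in BFS order instead. `Relabeled` and `toParentArray` are hypothetical names, not part of the original solution.

```cpp
#include <cassert>
#include <vector>
using namespace std;

// Hypothetical helper (not part of the original post): relabels the nodes of
// a tree in BFS order from root, so that every parent gets a smaller label
// than its children, as the HLD constructor requires.
struct Relabeled {
	vector<int> ord; // ord[v] = new label of original node v
	vector<int> par; // par[x] = new label of the parent of new label x, -1 for the root
};

Relabeled toParentArray(const vector<vector<int>>& adj, int root = 0) {
	int n = adj.size();
	Relabeled res{vector<int>(n, -1), vector<int>(n, -1)};
	vector<int> que = {root}; // BFS queue of original labels; position in it = new label
	res.ord[root] = 0;
	for (int x = 0; x < (int)que.size(); ++x) {
		int v = que[x];
		for (int t : adj[v]) {
			if (res.ord[t] != -1) continue; // already visited, so t is the parent of v
			res.ord[t] = que.size();
			res.par[res.ord[t]] = res.ord[v];
			que.push_back(t);
		}
	}
	assert((int)que.size() == n); // the tree must be connected
	return res;
}
```

The returned `par` can be passed straight to the HLD constructor, and `ord` plays the same role as the ord array in the solution: queries on original node v are translated to `ord[v]` first.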

hld, implementation, lca, datastructure

mango_lassi
5 years ago
1