Blog entries - Codeforces

Hello,

Since the editorial posted for the ACPC 2018 is rather complicated (no offense), I decided to write my own tutorial with much simpler solutions. There are two problems missing, and, hopefully, they will be available soon. Hope you like it!

101991A - Awesome Shawarma

Hint

Tutorial

Initially, all edges of the given tree are bridges since removing any of them disconnects the graph. So, we start with $\text{[math]}$ bridges. Joining the end-points of a simple path with length $\text{[math]}$ edges, removes $\text{[math]}$ bridges from the tree (resulting in $\text{[math]}$ bridges). Since we want the resulting graph to have the number of bridges in $\text{[math]}$ , then $\text{[math]}$ which means $\text{[math]}$ . So, all the problem is asking for is the number of simple paths $\text{[math]}$ of length in the range $\text{[math]}$ since these are the paths that give the desired number of bridges when short-circuited.
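The counting argument above can be written out as a short derivation. The symbol names here are assumptions chosen for illustration: $n$ is the number of vertices, $d$ is the number of edges on the short-circuited path, and $[L, R]$ is the allowed range for the number of bridges.

```latex
\text{bridges after joining} = (n - 1) - d, \qquad
L \le (n - 1) - d \le R
\iff
(n - 1) - R \;\le\; d \;\le\; (n - 1) - L .
```

Under these assumptions, the task reduces to counting simple paths whose length $d$ lies between $(n-1)-R$ and $(n-1)-L$.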

How will we count the number of paths with the given length? Centroid Decomposition! Consider the centroid tree where every vertex is the centroid of its subtree. Every path between two vertices in the initial tree passes through their lowest common ancestor (LCA) in the centroid tree. We will pick a vertex as the LCA and check for all paths that pass through it.

Consider the vertex $\text{[math]}$ to be the centroid of the initial tree. For every node $\text{[math]}$ in the tree, if $\text{[math]}$ has a distance $\text{[math]}$ from $\text{[math]}$ , $\text{[math]}$ should be matched to a vertex $\text{[math]}$ with distance to $\text{[math]}$ in the range $\text{[math]}$ such that $\text{[math]}$ in the centroid tree. To be sure that $\text{[math]}$ and $\text{[math]}$ have $\text{[math]}$ , they should be reachable from different neighbors of $\text{[math]}$ in $\text{[math]}$ . So, perform a $\text{[math]}$ on each child of $\text{[math]}$ in order and record the distances of the encountered nodes from $\text{[math]}$ . When finished with a certain child, record these distances globally. In the next child $\text{[math]}$ , whenever a node is encountered, check how many vertices in the previously processed children match with it and add the value to the answer. When done with this child, also record its values globally so that future vertices can match with the current ones. Keep in mind that vertices with a distance from the centroid in the range $\text{[math]}$ also form valid paths to the centroid itself, so they are counted even without a matching vertex (the centroid plays the role of the match).

When done with this process, we would have found all the paths passing through $\text{[math]}$ . Now, delete $\text{[math]}$ from the tree and repeat the same process on each of the formed components (which are also trees). For each component, find its centroid and count the number of paths through it.

Now, what is an efficient way of counting the number of nodes with a certain distance from the chosen root? Using a BIT (Fenwick tree). When done with a child of the current centroid, update the BIT with the gathered distances: for each distance, increment its corresponding position in the BIT by the number of vertices with this distance from the centroid. Whenever we want to change the centroid and repeat, we empty the BIT and fill it with zeroes.
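A minimal sketch of such a BIT, indexed by distance from the centroid, might look as follows. The class and method names (`DistanceBit`, `add`, `range`) are hypothetical and only illustrate the idea.

```java
// Minimal Fenwick tree (BIT) over distances 0..maxDist. All names are illustrative.
class DistanceBit {
    private final long[] tree; // internal indices are 1-based

    DistanceBit(int maxDist) {
        tree = new long[maxDist + 2];
    }

    // Record `count` more vertices at distance `dist` from the centroid.
    void add(int dist, long count) {
        for (int i = dist + 1; i < tree.length; i += i & -i) tree[i] += count;
    }

    // Number of recorded vertices with distance in [0, dist]; 0 if dist < 0.
    long prefix(int dist) {
        long sum = 0;
        for (int i = Math.min(dist + 1, tree.length - 1); i > 0; i -= i & -i) sum += tree[i];
        return sum;
    }

    // Number of recorded vertices with distance in [lo, hi].
    long range(int lo, int hi) {
        if (lo > hi) return 0;
        return prefix(hi) - prefix(Math.max(lo, 0) - 1);
    }
}
```

When resetting between centroids, undoing only the positions that were touched (calling `add(dist, -count)` for each recorded distance) keeps the reset proportional to the size of the current component rather than to the size of the whole array.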

All $\text{[math]}$ operations are done in $\text{[math]}$ and performing $\text{[math]}$ continuously on the centroids of components is $\text{[math]}$ since no component is of size greater than $\text{[math]}$ . (See definition of centroid)

Complexity $\text{[math]}$
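Putting the pieces together, the whole procedure can be sketched as below. This is an illustration under assumptions, not the reference solution: vertices are 0-based, `lo` and `hi` (with `lo >= 1`) are the already-derived bounds on the path length, the class name `PathCounter` is made up, and the recursive DFS would need to be made iterative for very deep trees.

```java
import java.util.ArrayList;
import java.util.List;

// Counts simple paths with length in [lo, hi] using centroid decomposition + a BIT.
class PathCounter {
    private final List<List<Integer>> adj = new ArrayList<>();
    private final boolean[] removed;
    private final int[] size;
    private final long[] bit; // Fenwick tree over distances, 1-based internally
    private final int lo, hi;
    private long answer;

    private PathCounter(int n, int[][] edges, int lo, int hi) {
        for (int i = 0; i < n; i++) adj.add(new ArrayList<>());
        for (int[] e : edges) { adj.get(e[0]).add(e[1]); adj.get(e[1]).add(e[0]); }
        removed = new boolean[n];
        size = new int[n];
        bit = new long[n + 2];
        this.lo = lo;
        this.hi = hi;
    }

    static long count(int n, int[][] edges, int lo, int hi) {
        PathCounter pc = new PathCounter(n, edges, lo, hi);
        pc.decompose(0);
        return pc.answer;
    }

    private void bitAdd(int d, long v) {
        for (int i = d + 1; i < bit.length; i += i & -i) bit[i] += v;
    }

    private long bitPrefix(int d) { // vertices at distance <= d; 0 when d < 0
        long s = 0;
        for (int i = Math.min(d + 1, bit.length - 1); i > 0; i -= i & -i) s += bit[i];
        return s;
    }

    private int computeSize(int v, int parent) {
        size[v] = 1;
        for (int u : adj.get(v)) if (u != parent && !removed[u]) size[v] += computeSize(u, v);
        return size[v];
    }

    private int findCentroid(int v, int parent, int total) {
        for (int u : adj.get(v))
            if (u != parent && !removed[u] && size[u] > total / 2) return findCentroid(u, v, total);
        return v;
    }

    private void collect(int v, int parent, int depth, List<Integer> depths) {
        depths.add(depth);
        for (int u : adj.get(v)) if (u != parent && !removed[u]) collect(u, v, depth + 1, depths);
    }

    private void decompose(int start) {
        int c = findCentroid(start, -1, computeSize(start, -1));
        List<Integer> recorded = new ArrayList<>();
        bitAdd(0, 1);      // the centroid itself, at distance 0
        recorded.add(0);
        for (int child : adj.get(c)) {
            if (removed[child]) continue;
            List<Integer> depths = new ArrayList<>();
            collect(child, c, 1, depths);
            // Match against the centroid and all previously processed children.
            for (int d : depths) answer += bitPrefix(hi - d) - bitPrefix(lo - d - 1);
            for (int d : depths) { bitAdd(d, 1); recorded.add(d); }
        }
        for (int d : recorded) bitAdd(d, -1); // reset the BIT for the next centroid
        removed[c] = true;
        for (int u : adj.get(c)) if (!removed[u]) decompose(u);
    }
}
```

For example, on the path 0-1-2-3, `PathCounter.count(4, new int[][]{{0,1},{1,2},{2,3}}, 2, 3)` counts the pairs (0,2), (1,3) and (0,3).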

101991B - Baklava Tray

Hint

Tutorial

101991C - Coffee

Hint

Tutorial

101991D - Dull Chocolates

Hint

Tutorial

101991E - Exciting Menus

Hint

Tutorial

Since the popularity of the chosen substring is the number of strings that have it as a prefix, then the chosen string should always be the prefix of some string. Consider putting all strings into a trie, such that each node represents a prefix of one or more of the given strings. The chosen string at the end should definitely be one of the nodes of the trie.

The popularity of a certain prefix can be found by keeping a count of the strings that passed through this node during insertion. This resolves the $\text{[math]}$ value of the node.

The length of the prefix is also easy to compute since it is equivalent to the depth of the node in the trie (the node's distance from the root). This resolves the $\text{[math]}$ value of the node.

We are left with the last value which is $\text{[math]}$ . If we get this value from the prefix of the string, we may end up with a wrong answer. Consider that the best substring to choose is $\text{[math]}$ and it occurs in the string $\text{[math]}$ . If we take it as a prefix $\text{[math]}$ , we get a value of $\text{[math]}$ , but if we consider it as the other substring $\text{[math]}$ , we get a value of $\text{[math]}$ , which is better. How do we resolve that? Note that the other $\text{[math]}$ , $\text{[math]}$ , is also part of a prefix of the string $\text{[math]}$ . So, we now have two substrings, $\text{[math]}$ and $\text{[math]}$ , where $\text{[math]}$ is a proper suffix of $\text{[math]}$ .

Recall how the Aho-Corasick algorithm for pattern matching constructs the trie and its suffix links (failure links). In the algorithm, every node (prefix) has a special edge pointing to a node (another prefix) which is the longest proper suffix of the first node. So, consider the maximum $\text{[math]}$ value for some node. We can push this value up through the suffix link to the other prefix node which is a proper suffix of the current one. Hence, we update the value of $\text{[math]}$ in the other node to make it the maximum of all possible values. By repeating this step in a bottom-up manner, we will have the maximum possible value of $\text{[math]}$ for the current substring stored in its node. Constructing these suffix link edges can be done in $\text{[math]}$ , where $\text{[math]}$ is the length of the strings (given that the alphabet is of finite size), and traversing the nodes through these links to push the joy level values can be done using regular $\text{[math]}$ (note the topological sorting) on the nodes. In the worst case, the number of nodes in the trie will be equal to the total number of characters in the strings, which is at most $\text{[math]}$ .

Now, every node, which is a prefix of some string, has stored in it the $\text{[math]}$ , the $\text{[math]}$ , and the maximum $\text{[math]}$ that it can have. All that is left is to traverse all the nodes and find which one has the highest quality. For every node, compute its quality and compare it to a global maximum.
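The trie bookkeeping and the push along suffix links can be sketched as follows. Several things here are assumptions made for illustration: lowercase letters only, one non-negative joy value per character position of each string (`joyAt`), and the names `MenuTrie`, `insert` and `pushJoyAlongSuffixLinks`. How the three stored values combine into the quality is left to the caller.

```java
// Trie with per-node popularity, depth and best joy, plus Aho-Corasick suffix links.
class MenuTrie {
    final int[][] next;     // next[v][c]: child of node v on letter c, 0 if absent (0 is the root)
    final int[] popularity; // number of inserted strings having this prefix
    final int[] depth;      // length of the prefix
    final long[] joy;       // best joy value known for this prefix
    int nodes = 1;

    MenuTrie(int totalLength) {
        next = new int[totalLength + 1][26];
        popularity = new int[totalLength + 1];
        depth = new int[totalLength + 1];
        joy = new long[totalLength + 1];
    }

    // joyAt[i] is the (assumed) joy value attached to the prefix ending at position i.
    void insert(String s, long[] joyAt) {
        int v = 0;
        for (int i = 0; i < s.length(); i++) {
            int c = s.charAt(i) - 'a';
            if (next[v][c] == 0) {
                next[v][c] = nodes;
                depth[nodes] = depth[v] + 1;
                nodes++;
            }
            v = next[v][c];
            popularity[v]++;
            joy[v] = Math.max(joy[v], joyAt[i]);
        }
    }

    // Builds suffix links with a BFS, then pushes joy values from deeper nodes
    // to their longest proper suffix in reverse BFS order.
    void pushJoyAlongSuffixLinks() {
        int[] fail = new int[nodes];
        int[] order = new int[nodes];
        int head = 0, tail = 0;
        order[tail++] = 0;
        while (head < tail) {
            int v = order[head++];
            for (int c = 0; c < 26; c++) {
                int u = next[v][c];
                if (u == 0) continue;
                int f = fail[v];
                while (f != 0 && next[f][c] == 0) f = fail[f];
                fail[u] = (v != 0 && next[f][c] != 0) ? next[f][c] : 0;
                order[tail++] = u;
            }
        }
        for (int i = nodes - 1; i > 0; i--) {
            int v = order[i];
            joy[fail[v]] = Math.max(joy[fail[v]], joy[v]);
        }
    }
}
```

Reverse BFS order works as the topological order here because a suffix link always points to a strictly shallower node.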

Complexity $\text{[math]}$

101991F - Flipping El-fetiera

Hint

Tutorial

101991G - Greatest Chicken Dish

Hint

Tutorial

First, let's define some handy structures. Let's create an array $\text{[math]}$ which stores the logarithm base 2 of integers $\text{[math]}$ . Let's also create a sparse table on GCD so that we can find the GCD of all elements in a given interval $\text{[math]}$ in constant time $\text{[math]}$ . The idea of a sparse table is to store in $\text{[math]}$ the GCD of all elements in the interval $\text{[math]}$ . $\text{[math]}$ .
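A sketch of the two structures, assuming a 0-based array of positive values; the class name `GcdSparseTable` and its fields are illustrative. Each query combines two overlapping power-of-two blocks with a single `gcd` call, which is valid because GCD is idempotent.

```java
// Sparse table answering GCD queries on a static array.
class GcdSparseTable {
    private final int[] log;      // log[i] = floor(log2(i)) for i >= 1
    private final long[][] table; // table[j][i] = gcd of a[i .. i + 2^j - 1]

    GcdSparseTable(long[] a) {
        int n = a.length;
        log = new int[n + 1];
        for (int i = 2; i <= n; i++) log[i] = log[i / 2] + 1;
        int levels = log[n] + 1;
        table = new long[levels][];
        table[0] = a.clone();
        for (int j = 1; j < levels; j++) {
            int len = n - (1 << j) + 1;
            table[j] = new long[len];
            for (int i = 0; i < len; i++)
                table[j][i] = gcd(table[j - 1][i], table[j - 1][i + (1 << (j - 1))]);
        }
    }

    static long gcd(long x, long y) {
        while (y != 0) {
            long t = x % y;
            x = y;
            y = t;
        }
        return x;
    }

    // GCD of a[l..r], inclusive, 0-based, l <= r.
    long query(int l, int r) {
        int j = log[r - l + 1];
        return gcd(table[j][l], table[j][r - (1 << j) + 1]);
    }
}
```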

Since the queries are offline, it seems appropriate to apply MO's algorithm. The idea is to divide the interval $\text{[math]}$ into blocks of size $\text{[math]}$ , so we sort the queries by $\text{[math]}$ and then by $\text{[math]}$ if they happen to be in the same block. Review MO's algorithm for a better understanding of the solution.
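The query ordering used by Mo's algorithm can be sketched like this. The representation of a query as `{l, r, originalIndex}` and the name `MoOrder` are assumptions for illustration; `blockSize` is typically chosen close to the square root of the array length.

```java
import java.util.Arrays;
import java.util.Comparator;

class MoOrder {
    // Each query is {l, r, originalIndex}. Returns a sorted copy:
    // first by the block containing l, then by r inside the same block.
    static int[][] sortQueries(int[][] queries, int blockSize) {
        int[][] sorted = queries.clone();
        Arrays.sort(sorted, Comparator
                .<int[]>comparingInt(q -> q[0] / blockSize)
                .thenComparingInt(q -> q[1]));
        return sorted;
    }
}
```

Keeping the original index in each query makes it easy to print the answers in input order after processing them in sorted order.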

Now, if we have the solution for a certain interval, we can loop on the queries in the sorted order and answer them all in $\text{[math]}$ . The answer is the number of segments in the interval $\text{[math]}$ that have GCD $\text{[math]}$ . Since the value of $\text{[math]}$ changes among queries, the answer should be stored in an array. Let's define $\text{[math]}$ that stores, for every value $\text{[math]}$ the number of segments in $\text{[math]}$ with GCD equal to $\text{[math]}$ . Therefore, the answer to a particular query is $\text{[math]}$ .

How will we do the editing of the $\text{[math]}$ array as we move the interval's starting and ending points? First, notice an important property. Consider all the segments starting at index $\text{[math]}$ . Initially, the GCD is $\text{[math]}$ . As we move the end of the interval to the right, the GCD can either stay the same or be divided by some integer greater than or equal to 2. So, there cannot be more than $\text{[math]}$ distinct GCD values for segments starting at a given index. This is extremely useful. For every index $\text{[math]}$ , we can store the distinct GCD values of segments starting at $\text{[math]}$ , together with the ending index at which each of these GCDs starts appearing. The first GCD is $\text{[math]}$ and it starts appearing at index $\text{[math]}$ . We can perform a binary search to find the maximum index at which the GCD remains the same (remember the useful sparse table we created), or we can just walk on powers of two from greatest to least and add to the index as long as there is no change in the GCD value. Repeat the same procedure, but now fix the right end of the interval and consider segments ending at a fixed index. Now, we have two arrays of maps or two arrays of vectors containing pairs (the GCD and the index).
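The lists of distinct GCD values can also be built without the sparse table. The sketch below uses a different technique from the binary search described above: it derives the list for index `i` from the list for index `i + 1`, relying on the same property that each list is short. The names `GcdRuns` and `startingAt` are illustrative.

```java
import java.util.ArrayList;
import java.util.List;

class GcdRuns {
    static long gcd(long x, long y) {
        while (y != 0) {
            long t = x % y;
            x = y;
            y = t;
        }
        return x;
    }

    // runs.get(i) is a list of pairs {g, firstEnd}: the segment a[i..e] has GCD g
    // for every e from firstEnd up to (but excluding) the firstEnd of the next pair.
    static List<List<long[]>> startingAt(long[] a) {
        int n = a.length;
        List<List<long[]>> runs = new ArrayList<>();
        for (int i = 0; i < n; i++) runs.add(null);
        for (int i = n - 1; i >= 0; i--) {
            List<long[]> cur = new ArrayList<>();
            cur.add(new long[]{a[i], i});
            if (i + 1 < n) {
                for (long[] run : runs.get(i + 1)) {
                    long g = gcd(run[0], a[i]);
                    // Only open a new run when the GCD actually changes.
                    if (g != cur.get(cur.size() - 1)[0]) cur.add(new long[]{g, run[1]});
                }
            }
            runs.set(i, cur);
        }
        return runs;
    }
}
```

The lists for segments ending at a fixed index can be obtained the same way by scanning left to right.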

Back to MO's algorithm, when we decrement $\text{[math]}$ , check the array of GCDs starting at $\text{[math]}$ . Loop on this array and, for every GCD value, add to the $\text{[math]}$ array, the number of segments starting at $\text{[math]}$ and having this GCD. This count can be obtained by subtracting the index of appearance of the next GCD value from the index of appearance of the current GCD value. However, note that we should not exceed the $\text{[math]}$ value that we have since we will be counting segments that should not be in the solution. The same goes for incrementing $\text{[math]}$ , but this time we subtract the number of segments from the $\text{[math]}$ array. And for incrementing or decrementing $\text{[math]}$ , we follow the same steps but using the other array where we fixed the ending point and varied the starting point.

Now, we can increment and decrement the interval in $\text{[math]}$ , and all we need to do is to go over the queries as we sorted them and store their answers. Print the answer of the queries in the order that the queries were given in the input.

Complexity $\text{[math]}$

101991H - Hawawshi Decryption

Hint

Tutorial

101991I - Ice-cream Knapsack

Hint

Tutorial

101991K - Khoshaf

Hint

Tutorial

The first thing to observe is that the numbers themselves don't matter. What matters is the remainder of each number modulo 3, so we can limit our work to the elements $\text{[math]}$ only.

We usually find the sum of an interval $\text{[math]}$ in an array by constructing the prefix sum array and computing $\text{[math]}$ . Let's apply this to the problem at hand. If we want the sum $\text{[math]}$ to be divisible by 3, then $\text{[math]}$ (the two prefix sums have the same remainder when divided by 3). If we consider the prefix sum array as consisting of only zeroes, ones, and twos (the remainder of the sum by 3 instead of the sum itself), then we can get the number of sub-arrays of sum divisible by 3 simply by picking 2 zeroes or 2 ones or 2 twos (since subtracting the two will give a sum divisible by 3). Also, picking a zero alone will give a sum divisible by 3. So, if we have the prefix sum array with the remainders of sums by 3 instead of the sums themselves, the number of sub-arrays of sum divisible by 3 is

$\text{[math]}$ where $\text{[math]}$ are the number of zeroes, ones, and twos in the prefix sum array.
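One formula consistent with this description is $\binom{c_0}{2} + \binom{c_1}{2} + \binom{c_2}{2} + c_0$, where the prefix sum array does not include the empty prefix; the names $c_0, c_1, c_2$ are assumptions. The sketch below checks that formula against a brute-force count (the class and method names are illustrative).

```java
class DivisibleByThree {
    // Counts sub-arrays with sum divisible by 3 from the remainder counts of the prefix sums.
    static long byFormula(int[] a) {
        long[] c = new long[3];
        int rem = 0;
        for (int x : a) {
            rem = (rem + x % 3 + 3) % 3; // remainder of the running prefix sum
            c[rem]++;
        }
        return c[0] * (c[0] - 1) / 2 + c[1] * (c[1] - 1) / 2 + c[2] * (c[2] - 1) / 2 + c[0];
    }

    // Reference count by checking every sub-array directly.
    static long byBruteForce(int[] a) {
        long count = 0;
        for (int l = 0; l < a.length; l++) {
            long sum = 0;
            for (int r = l; r < a.length; r++) {
                sum += a[r];
                if (sum % 3 == 0) count++;
            }
        }
        return count;
    }
}
```

For the array `{1, 2, 3, 4}` the prefix remainders are 1, 0, 0, 1, so the counts are two zeroes, two ones and no twos, and both methods give 4.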

Also, note that $\text{[math]}$ and $\text{[math]}$ . For $\text{[math]}$ . Therefore, the number of zeroes, ones, and twos cannot exceed 150 each or else we cannot find exactly $\text{[math]}$ sub-arrays (but much more).

For every possible value of zeroes, ones, and twos (3 nested loops, each from 0 to 150), if they give us exactly $\text{[math]}$ (based on the formula above), then we add to the answer the number of ways we can make a prefix sum array having this many zeroes, ones, and twos.

To count the number of prefix sum arrays that have a certain number of zeroes, ones, and twos, we use a simple DP. Solve all values of this DP before doing the looping discussed above.

$\text{[math]}$ , where $\text{[math]}$ are the number of zeroes, ones, twos placed or remaining (in the range $\text{[math]}$ ) and $\text{[math]}$ is the previous sum found (previous element) in the range $\text{[math]}$ .

$\text{[math]}$ initially since adding no elements means a sum of 0, and all other values of $\text{[math]}$ are 0. Filling the $\text{[math]}$ is straight-forward since we can add a number whose remainder by 3 is 0, 1, or 2 to make the new prefix sum with remainder 0, 1, or 2 by 3.
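A forward version of such a table could be filled as sketched below. The state layout, the modulus `1_000_000_007`, and the names `PrefixRemainderDp`, `build`, `limit` and `cnt` are all assumptions for illustration: `cnt[d]` is the number of allowed element values with remainder `d`, and `limit` bounds each of the three counters.

```java
class PrefixRemainderDp {
    static final long MOD = 1_000_000_007L; // assumed modulus

    // dp[z][o][t][s]: number of arrays whose prefix-sum remainders contain
    // z zeroes, o ones and t twos, with s the remainder of the last prefix sum.
    static long[][][][] build(int limit, long[] cnt) {
        long[][][][] dp = new long[limit + 1][limit + 1][limit + 1][3];
        dp[0][0][0][0] = 1; // no elements placed: the running sum is 0
        for (int z = 0; z <= limit; z++)
            for (int o = 0; o <= limit; o++)
                for (int t = 0; t <= limit; t++)
                    for (int s = 0; s < 3; s++) {
                        long ways = dp[z][o][t][s];
                        if (ways == 0) continue;
                        for (int d = 0; d < 3; d++) { // remainder of the appended element
                            int ns = (s + d) % 3;
                            int nz = z + (ns == 0 ? 1 : 0);
                            int no = o + (ns == 1 ? 1 : 0);
                            int nt = t + (ns == 2 ? 1 : 0);
                            if (nz > limit || no > limit || nt > limit) continue;
                            dp[nz][no][nt][ns] = (dp[nz][no][nt][ns] + ways * (cnt[d] % MOD)) % MOD;
                        }
                    }
        return dp;
    }
}
```

Every transition increases exactly one of the three counters, so iterating the counters in increasing order processes each state before the states that depend on it.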

The number of elements $\text{[math]}$ that give a remainder $\text{[math]}$ when divided by $\text{[math]}$ is $\text{[math]}$ , so the number of elements in the range $\text{[math]}$ that give a remainder $\text{[math]}$ when divided by $\text{[math]}$ is $\text{[math]}$
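The counting by remainder can be sketched as follows, assuming the range consists of positive integers with `1 <= lo <= hi`; the names `RemainderCount`, `upTo` and `inRange` are illustrative.

```java
class RemainderCount {
    // Number of integers x in [1, n] with x % 3 == r, for r in {0, 1, 2}.
    static long upTo(long n, int r) {
        if (n <= 0) return 0;
        return r == 0 ? n / 3 : (n - r + 3) / 3;
    }

    // Number of integers x in [lo, hi] with x % 3 == r, by subtracting two prefixes.
    static long inRange(long lo, long hi, int r) {
        return upTo(hi, r) - upTo(lo - 1, r);
    }
}
```

For instance, in the range [4, 10] the values 6 and 9 have remainder 0, the values 4, 7 and 10 have remainder 1, and the values 5 and 8 have remainder 2.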

Complexity $\text{[math]}$

101991L - Looking for Taste

Hint

Tutorial


Hello,

As you might know, the Palindromic Tree is a new data structure that has been introduced recently and can solve some queries related to palindromes in linear time O(length of string).

First, I will explain the basic idea of the data structure and then the code.

The basic idea of the Palindromic Tree is that a palindrome is actually a shorter palindrome with the same character added before and after it. For example, “ababa” is a palindrome since “bab” is a palindrome and we add the character ‘a’ before and after it. The same applies to “bab” and so on. Therefore, to add a new palindrome of length L, there should be some other palindrome of length L-2 to which we add a character before and after.

Note: The palindromic tree implementation below is not limited to English alphabets, so it accepts any character.

The palindromic tree (not actually a tree) is made up of several vertices or nodes. Each node stores the string that it represents. So the nodes of the palindromic tree store their corresponding palindromes. Each node also stores a list of its children (edges to children are weighted according to character). Let’s say we are initially on a node representing a palindrome X, then the child on edge with weight ‘a’ represents the palindrome aXa. The child on edge with weight ‘b’ from aXa means that it represents the palindrome baXab. All this is based on our knowledge that X is a palindrome, so adding ‘a’ or ‘b’ to it keeps it a palindrome. In addition, every node has a link (edge but unweighted and different from the weighted edges to children). This link points to the longest palindromic proper suffix of the current node palindrome. I will explain the usefulness of such a link later. We don’t consider the node itself as the longest palindromic suffix simply because every palindrome is the longest palindromic suffix of itself. By excluding the node itself, we also avoid self loops.

So, we said that every palindrome of length L is made up of a smaller palindrome of length L - 2. But what about “a” or “aa”? We do the same thing! “aa” is actually an empty string with ‘a’ added on both sides. For “a”, however, the case is a bit different. We consider that there is some imaginary string of length = -1. When we add ‘a’ to both sides of this imaginary string, the length becomes -1 + 2 = 1 which is the length of “a” and this makes sense. Therefore, we initially have 2 root nodes in the palindromic tree. The first node has length = -1 and is the imaginary node mentioned above (I’ll call it the imaginary root). The second node is for an empty string and has length = 0 (I’ll call it the real root). The longest palindromic proper suffix of the real root is the imaginary root since it cannot be the node itself. The longest palindromic proper suffix of the imaginary root is itself since we cannot go any higher, and it is imaginary. This is also explained later.

Throughout the program, we store a node called the current node. This node represents the last inserted node. Initially, the current node is the real root.

To insert a character ‘a’ for example, we need to find the palindrome X such that aXa will be the current palindrome. X should exist in the tree since it came before inserting ‘a’. Remember that ‘a’ is being inserted now at the end of the string, so the last inserted character is the last character of X, whatever X is. So, it makes sense to start checking from the stored current node, since it contains the last characters in the string (the longest palindromic suffix of the string inserted until now). It is important to see that the current node is the longest palindromic suffix of the string inserted until the last character before ‘a’, and that’s why we start checking from it. If we find an ‘a’ before the substring of the current node in the main string, then great! We make a new child for the current node with an edge weight of ‘a’. What if we don’t find ‘a’? It is optimal to move to the longest palindromic proper suffix of the current node. To make the palindrome aXa, X should be a palindrome, and it is also best for X to be of maximal length. Also, X should be a suffix of the string inserted until now. That is why we move to the suffix node linked to the current node. We keep on doing so until we reach a node that has ‘a’ before it. We might reach the real root (“aa” is the palindrome), or continue to reach the imaginary root, where the search always succeeds (“a” alone is the palindrome). Let’s denote the found parent by a temporary node “temp” and refer to it like that.

Note that the palindromic tree only stores unique palindromes, so if temp already has a child with edge ‘a’ (the inserted character), we just make this child the current node and terminate the method to avoid overwriting any of the previous data (we don’t create a new node).

Now that we have inserted the new character (new node), we need to find its longest palindromic suffix. The longest palindromic suffix of the new node will have the form of aYa where Y is a palindrome already inserted in the tree before inserting ‘a’. All we need to do is to find Y and get its child at the edge with weight ‘a’ or the inserted character and make this child the suffix palindrome of the new node that we inserted earlier. It can be proven that Y and its desired child always exist.

How do we do that efficiently? We use temp! We know that temp stores a node X such that aXa is the current palindrome. However, we can’t use temp itself since the child at edge weighted ‘a’ from temp is the currently inserted node. This would create a self loop and would not be a proper suffix. So, we start, not from temp, but from the longest palindromic suffix of temp. We keep on going up through the longest palindromic suffixes until we reach the desired one (the one which has ‘a’ before it). Since we are traversing the longest palindromic suffixes, we guarantee that our result will be a suffix and will be the longest possible. In the extreme case, we will reach the imaginary root. When we find our desired node, we make its child at edge weighted ‘a’ the longest palindromic suffix of the currently inserted node.

If for some reason we are at the imaginary root and try to go to its longest suffix, we will return to the imaginary root, and this is the purpose of linking it to itself.

Don’t forget to make the newly inserted node the current node saved in the program. If the new node was created, set the current node to it. If it was found to be a duplicate (case discussed above), set the current node to the node that was present first and don’t create a new node.

Now, for the coding part…

We have a problem. How do we store all the palindrome strings without affecting the memory used? The real question is: Do we actually need to store the palindromic strings? It is enough for each node to keep track of the length of the palindrome that it holds and the end index of this palindrome in the main string (or the start index).

Also, instead of making edges with weights, we can make a map. This maps the character to its corresponding child node, and makes adding children or finding them easy and fast.

First, we’ll create a class Node

import java.util.ArrayList;
import java.util.HashMap;

class Node{
	HashMap<Character, Node> next = new HashMap<>(); // the node's children and their weights
	Node suffix; // the longest palindromic proper suffix
	int length, endIndex; // the length of the palindrome and its end's index in the main string
	public Node(int length, int endIndex) {
		this.length = length;
		this.endIndex = endIndex;
	}
}

Second, we'll declare and initialize the 3 main nodes and the character arraylist which will be the inserted string

Node imaginaryRoot = new Node(-1, -1), realRoot = new Node(0, -1), currNode = realRoot;
// imaginary and real roots don't have an end index since they do not represent palindromes in the string
ArrayList<Character> str = new ArrayList<>();

void init(){ // to initialize the palindromic tree
	imaginaryRoot.suffix = imaginaryRoot;
	realRoot.suffix = imaginaryRoot;
}

Third, we'll implement the method that adds characters to the inserted string and to the palindromic tree

void addChar(char ch) {
	int index = str.size(); // the index where the new character will be inserted
	str.add(ch); // adding the character to the main string
	Node temp = currNode; // making use of the previously inserted node
	// first condition is just a bounds check; the second checks whether the palindrome has the current character ch before it
	// this loop terminates at the imaginary root at the latest, since there the character is compared to itself
	// equals() is used because != on boxed Character objects compares references, which is only reliable for small cached code points
	while(index - temp.length - 1 < 0 || !str.get(index - temp.length - 1).equals(str.get(index)))
		temp = temp.suffix;
	if(temp.next.containsKey(ch)) { // node already exists, don't overwrite
		currNode = temp.next.get(ch); // set current node to the required node
		return; // exit the method to avoid overwriting data
	}
	currNode = new Node(temp.length + 2, index); // new node has size of parent + 2, and ends at the index where the new character ch was inserted
	temp.next.put(ch, currNode); // make the current node a child of temp with edge weight = ch
	if(currNode.length == 1) { // if the current palindrome has a length of 1 (only ch)
		currNode.suffix = realRoot; // the longest palindromic suffix will be the empty string
		return; // exit to avoid finding suffix
	}
	temp = temp.suffix; // start checking from the suffix of temp
	// same search as above, starting from the suffix of temp: bounds check first, then compare the character before the palindrome with ch
	// this loop also terminates at the imaginary root at the latest, since there the character is compared to itself
	while(index - temp.length - 1 < 0 || !str.get(index - temp.length - 1).equals(str.get(index)))
		temp = temp.suffix;
	currNode.suffix = temp.next.get(ch); // set the suffix of the current node as the child of temp at edge weighted ch
}

Note: when inserting the nodes, you can keep track of the longest palindrome, the palindrome count, or even an array or list of all palindromes. You can also DFS the tree at any time to find these values.
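As an illustration of that note, here is a compact, self-contained variant of the structure that tracks the number of distinct palindromes and the longest one while inserting. It follows the same steps as the code above; the class name `PalindromeStats`, the helper `findParent` and the use of a `StringBuilder` instead of an `ArrayList` are choices made for this sketch only.

```java
import java.util.HashMap;
import java.util.Map;

class PalindromeStats {
    private static final class Node {
        final Map<Character, Node> next = new HashMap<>();
        Node suffix;                // longest palindromic proper suffix
        final int length, endIndex; // palindrome length and the index of its last character
        Node(int length, int endIndex) {
            this.length = length;
            this.endIndex = endIndex;
        }
    }

    private final StringBuilder text = new StringBuilder();
    private final Node imaginaryRoot = new Node(-1, -1);
    private final Node realRoot = new Node(0, -1);
    private Node current = realRoot;
    private Node longest = realRoot;
    private int distinct = 0;

    PalindromeStats() {
        imaginaryRoot.suffix = imaginaryRoot;
        realRoot.suffix = imaginaryRoot;
    }

    // Follows suffix links from `from` until the palindrome is preceded by text[index].
    private Node findParent(Node from, int index) {
        Node node = from;
        while (index - node.length - 1 < 0
                || text.charAt(index - node.length - 1) != text.charAt(index)) {
            node = node.suffix;
        }
        return node;
    }

    void addChar(char ch) {
        int index = text.length();
        text.append(ch);
        Node parent = findParent(current, index);
        Node existing = parent.next.get(ch);
        if (existing != null) { // palindrome already stored, nothing new to count
            current = existing;
            return;
        }
        Node created = new Node(parent.length + 2, index);
        parent.next.put(ch, created);
        created.suffix = created.length == 1
                ? realRoot
                : findParent(parent.suffix, index).next.get(ch);
        current = created;
        distinct++;
        if (created.length > longest.length) longest = created;
    }

    int distinctPalindromes() {
        return distinct;
    }

    // The palindrome is recovered from its length and end index; no string is stored per node.
    String longestPalindrome() {
        return text.substring(longest.endIndex - longest.length + 1, longest.endIndex + 1);
    }

    static PalindromeStats of(String s) {
        PalindromeStats stats = new PalindromeStats();
        for (char c : s.toCharArray()) stats.addChar(c);
        return stats;
    }
}
```

For the string “ababa”, the distinct palindromes are “a”, “b”, “aba”, “bab” and “ababa”, so `PalindromeStats.of("ababa").distinctPalindromes()` is 5 and the longest palindrome is the whole string.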


karamkontar's blog