Blog entries - Codeforces

#	User	Rating
1	ecnerwala	3648
2	Benq	3580
3	orzdevinwang	3570
4	cnnfls_csy	3569
5	Geothermal	3568
6	tourist	3565
7	maroonrk	3530
8	Radewoosh	3520
9	Um_nik	3481
10	jiangly	3467

#	User	Contrib.
1	maomao90	174
2	awoo	164
2	adamant	164
4	TheScrasse	159
4	nor	159
6	maroonrk	156
7	-is-this-fft-	150
8	SecondThread	147
9	orz	146
10	pajenegod	145

Problem A - IQ Test

We can store two values, count_odd and count_even, as the number of odd or even elements in the series. We can also store last_odd and last_even as the index of the last odd/even item encountered. If only one odd number appears --- output last_odd; otherwise only one even number appears, so output last_even.

Problem B - Telephone Numbers

There are many ways of separating the string into clusters of 2 or 3 characters. One easy way is to output 2 characters at a time, until you have only 2 or 3 characters remaining. Here is a possible C++ solution:

<code>
for( i=0; i<n; i++ )
{
putchar(buf[i]);
if( i%2 && i<n-(n%2)-2 ) putchar('-');
}
</code>

Problem C - Roads in Berland

If you are familiar with the Floyd-Warshall algorithm, then this solution may be easier to see.

Initially, we are given a matrix D, where D[i][j] is the distance of shortest path between city i and city j. Suppose we build a new road between a and b with length shorter than D[a][b]. How do we update the rest of the graph accordingly?

Define a new matrix D', whose entries D'[i][j] are the minimum path distance between i and j while taking into account the new road ab. There are three possibilities for each i, j:

D'[i][j] remains unchanged by the new road. In this case D'[i][j] = D[i][j]
D'[i][j] is shorter if we use the new road ab. This means that the new path i, v₁, v₂, ..., v_n, j must include the road a, b. If we connect the vertices i, a, b, j together in a path, then our new distance will be D[i][a] + length(ab) + D[b][j].
Lastly, we may have to use the road ba. (Note that this may not be the same as road ab.) In this case, we have D'[i][j] = D[i][b] + length(ab) + D[a][j].

Thus, for each new road that we build, we must update each path i, j within the graph. Then we must sum shortest distances between cities. Updating the matrix and summing the total distance are both O(N²), so about 300² operations. Lastly, there are at most 300 roads, so in total there are about 300³ operations.

One thing to note is that the sum of all shortest distances between cities may be larger than an int; thus, we need to use a long when calculating the sum.

Problem D - Roads not only in Berland

Before we start this problem, it is helpful to know about the union find data structure. The main idea is this: given some elements x₁, x₂, x₃, ..., x_n that are partitioned in some way, we want to be able to do the following:

merge any two sets together quickly
find the parent set of any x_i

This is a general data structure that sometimes appears in programming competitions. There are a lot of ways to implement it; one good example is written by Bruce Merry (aka BMerry) here.

Back to the problem: Every day we are allowed to build exactly 1 road, and close exactly 1 road. Thus, we can break the problem into two parts:

How do we connect the parts of the graph that are disconnected?
How do we remove roads in a way that does not disconnect parts of the graph?

Let build be the list all roads that need to be built, and let close be the list of nodes that need to be closed. We can show that in fact, these lists are of the same size. This is because the connected graph with n nodes is a tree if and only if it has n - 1 edges. Thus, if we remove more roads than than we build, then the graph is disconnected. Also, if we build more roads than we remove, then we have some unnecessary roads (the graph is no longer a tree).

Now consider the format of the input data:
a₁, b₁
a₂, b₂
...
a_n - 1, b_n - 1
We can show that edge (a_i, b_i) is unnecessary if and only if the nodes a_i, b_i have already been connected by edges (a₁, b₁), (a₂, b₂), ..., (a_i - 1, b_i - 1). In other words, if the vertices a_i, b_i are in the same connected component before we, add (a_i, b_i) then we do not need to add (a_i, b_i). We can use union-find to help us solve this problem:

<code>
for( i from 1 to n-1 )
{
if( find(a_i)=find(b_i) ) close.add(a_i, b_i);
    else merge(a_i, b_i);
}
</code>

In other words, we treat each connected component as a set. Union find allows us to find the connected component for each node. If the two connected components are the same, then our new edge is unnecessary. If they are different, then we can merge them together (with union find). This allows us to find the edges that we can remove.

In order to find the edges that we need to add to the graph, we can also use union-find: whenever we find a component that is disconnected from component 1, then we just add an edge between them.

<code>
for( i from 2 to n )
if( find(v_i)!=find(v₁) )
    {
        then merge(v₁, v_i);
        build.add(v₁, v_i);
    }
</code>

We just need to store the lists of roads that are unnecessary, and the roads that need to be built.

Problem E - Test

The way I solved this problem is with a hash function. Hash functions can fail on certain cases, so in fact, my solution is not 'correct'. However, it passed all the test cases =P

Let the input strings be s₀, s₁, s₂. We can build the shortest solution by permuting the strings and then trying to 'attach' them to each other. I.e., we need to find the longest overlapping segments at the end of string a and the beginning of string b. The obvious brute force solution won't run in time. However, we can use a hash function to help us calculate the result in O(n) time, where n is min(len(a), len(b)). The hash function that I used was the polynomial hash(x₀, x₁, ..., x_n) = x₀ + ax₁ + a²x₂ + ... + aⁿx_n. This polynomial is a good hash function in this problem because it has the following useful property:
Given hash(x_i, ..., x_j), we can calculate the following values in O(1) time:

hash(x_i - 1, x_i, ..., x_j) = x_i - 1 + a × hash(x_i, ..., x_j)
hash(x_i, ..., x_j, x_j + 1) = hash(x_i, ..., x_j) + a^{j + 1 - i} × x_j + 1

In other words, if we know the hash for some subsequence, we can calculate the hash for the subsequence and the previous element, or the subsequence and the next element. Given two strings a, b, we can calculate the hash functions starting from the end of a and starting from the beginning of b. If they are equal for length n, then that means that (maybe) a and b overlap by n characters.

Thus, we can try every permutation of s₀, s₁, s₂, and try appending the strings to each other. There is one last case: if s_i is a substring of s_j for some i ≠ j, then we can just ignore s_i. We can use hash functions to check that one string is contained within another one.

Full text and comments »

choice's blog