ACM-ICPC Upsolve - Codeforces

→ Pay attention

Before contest
Codeforces Round 941 (Div. 1)
35:24:28
Register now »

*has extra registration

Before contest
Codeforces Round 941 (Div. 2)
35:24:28
Register now »

*has extra registration

→ Streams

AMA: TheOneYouWant

By aryanc403

Before stream 11:19:28

Atcoder ABC #351 Short Solution Discussion

By aryanc403

Before stream 34:29:28

View all →

→ Top rated

#	User	Rating
1	ecnerwala	3649
2	Benq	3581
3	orzdevinwang	3570
4	Geothermal	3569
4	cnnfls_csy	3569
6	tourist	3565
7	maroonrk	3531
8	Radewoosh	3521
9	Um_nik	3482
10	jiangly	3468

Countries | Cities | Organizations

View all →

→ Top contributors

#	User	Contrib.
1	maomao90	174
2	awoo	164
3	adamant	162
4	TheScrasse	159
5	nor	158
6	maroonrk	156
7	-is-this-fft-	151
8	SecondThread	147
9	orz	146
10	pajenegod	145

View all →

→ Find user

→ Recent actions

Detailed →

Pranayhalo's blog

ACM-ICPC Upsolve

By Pranayhalo, history, 4 years ago, In English

Wondering if there are any good resources (or editorials) to aid when trying to upsolve regional ACM-ICPC contests. I see they have coded solutions, but without some editorial, I am not able to fully understand why what works. I am specifically asking for the Mid-Central PC 2018. If not, would anyone be able to help me in solving the following problem from MCPC18 K: Repeated Substrings? Thanks!

#acm-icpc, #upsolving

Pranayhalo
4 years ago
3

Comments (3)

Write comment?

ankeet

4 years ago, # |

+11

Compute the LCP array of the string and find the index of the maximum value. Code

Another possible solution is to use binary search on substring length + hashing.

→ Reply

Pranayhalo

4 years ago, # ^ |

I understand and have coded the LCP solution now, thanks! I am not sure if I fully understand the alternative solution. I understand how you can binary search (if a length of k is valid, then all lengths 1...k is valid; if a length of k is not valid, then all lengths k...n is not valid). If I want to check if some length of k is valid, how would this be done efficiently? There are n-k+1 substrings of length k so O(n) substrings of length k, and we just add all of these substrings to a set and if we ever add a duplicate, we know that length k is valid? I am not sure how efficient a set is for strings (is it O(n^2) for n insertions as each insertion takes O(n) time since we must compare the strings one at a time?). I think this is where hashing comes in, but I am not sure how to use this correctly.

→ Reply

ankeet

4 years ago, # ^ |

Yes, if you insert the entire string into the set then you will get O(n^2). You will need to do something faster. If you fix the substring length as k, then as you said there are n-k+1 strings of length k. Now, instead of putting the strings themselves into the set, you can put the hashes of the strings into the set. If the set contains duplicates then you know (with high probability) that a length-k substring has appeared twice.

Now, the important question is how to compute the hash of s[i..j] efficiently. Assume the strings are 1-indexed First, we're going to need to define a convenient hash function:

$$$h[i] = (s[1]p^1 + s[2]p^2 + ... + s[i]p^i) \text{ mod } M$$$

Where p is a small prime and M is a big prime (p=101,M=10^9+7 should be fine). Also for convenience h[0] = 0. Now we also have:

$$$\text{ hash of }s[i..j] = \frac{(h[j] - h[i - 1])}{ p^{i - 1}}$$$

Where the right hand side is evaluated mod M. Lastly, it is possible that there are some hash-collisions. In this case, you may use a combination of two primes and two moduli to make the collisions less likely.

→ Reply