Given a list of strings, build a DAG in which each node is a string and there is an edge x->y iff string x is a proper substring of y? - Codeforces



Revision en1, by pabloskimg, 2018-10-21 07:45:01

There are at most N = 10^4 strings, and the total length of all strings concatenated is at most MAXLEN = 10^6. What is the most efficient way to build the DAG described in the title? The naive way is to compare each pair of strings (X, Y), which takes O(N^2) comparisons, and to check for each pair whether X is a substring of Y, which costs O(|X|·|Y|) with a brute-force scan, i.e. up to O(MAXLEN^2). Two improvements come to mind. First, sort the strings by length, so that each string X can only be a substring of strings to its right. Second, use rolling hashing to reduce each substring check to O(MAXLEN). Is it possible to do even better? I have a feeling that a suffix array could help, but I'm not sure exactly how. The motivating problem is this one
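For concreteness, here is a minimal Python sketch of the improved baseline described above: sort by length, then test each shorter string against each longer one with a rolling hash (Rabin-Karp). The names `contains` and `build_substring_dag`, the constants `BASE` and `MOD`, and the choice to return edges as index pairs are all assumptions made for illustration. Strings are assumed to be non-empty. This is not a solution to the "can we do better" question, only the starting point it asks to beat.

```python
MOD = (1 << 61) - 1   # large prime modulus (illustrative choice)
BASE = 131            # hash base (illustrative choice)

def contains(text, pattern):
    """Rabin-Karp check: True if pattern occurs somewhere in text."""
    m, n = len(pattern), len(text)
    if m == 0:
        return True
    if m > n:
        return False
    target = 0  # hash of the pattern
    window = 0  # hash of the current length-m window of text
    for i in range(m):
        target = (target * BASE + ord(pattern[i])) % MOD
        window = (window * BASE + ord(text[i])) % MOD
    top = pow(BASE, m - 1, MOD)  # weight of the character leaving the window
    for start in range(n - m + 1):
        # Compare the actual characters on a hash match to rule out collisions.
        if window == target and text[start:start + m] == pattern:
            return True
        if start + m < n:
            window = ((window - ord(text[start]) * top) * BASE
                      + ord(text[start + m])) % MOD
    return False

def build_substring_dag(strings):
    """Return edges (i, j) meaning strings[i] is a proper substring of strings[j]."""
    # Indices sorted by length: a string can only be a substring of later ones.
    order = sorted(range(len(strings)), key=lambda i: len(strings[i]))
    edges = []
    for pos, i in enumerate(order):
        for j in order[pos + 1:]:
            x, y = strings[i], strings[j]
            # Equal lengths would mean x == y, which is not a *proper* substring.
            if len(x) < len(y) and contains(y, x):
                edges.append((i, j))
    return edges

if __name__ == "__main__":
    print(build_substring_dag(["abc", "b", "abcd", "bc", "xyz"]))
```

Each check against Y costs about O(|Y|) when hash collisions are rare, and each Y is checked against at most N shorter strings, so the whole loop is roughly O(N · MAXLEN). With the stated limits that is about 10^10 character steps, which is why something faster is needed.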

Tags

#strings, substring search
