Given a list of strings, build a DAG in which each node is a string and there is an edge x->y iff x is proper substring of y

→ Pay attention

Before contest
Codeforces Round 968 (Div. 2)
43:33:17
Register now »

*has extra registration

→ Streams

Atcoder ABC368 Solution Discussion

By aryanc403

Before stream 18:43:17

Codeforces Round 968 Solution Discussion

By aryanc403

Before stream 45:38:17

View all →

→ Top rated

#	User	Rating
1	tourist	3947
2	jiangly	3734
3	Radewoosh	3646
4	jqdai0815	3620
4	Benq	3620
6	orzdevinwang	3612
7	ecnerwala	3581
8	Geothermal	3569
8	cnnfls_csy	3569
10	ksun48	3479

Countries | Cities | Organizations

View all →

→ Top contributors

#	User	Contrib.
1	awoo	162
2	maomao90	160
3	nor	156
4	adamant	155
4	cry	155
4	atcoder_official	155
4	-is-this-fft-	155
8	maroonrk	153
9	SecondThread	147
10	Petr	146

View all →

→ Find user

→ Recent actions

Detailed →

Given a list of strings, build a DAG in which each node is a string and there is an edge x->y iff x is proper substring of y

Revision en5, by pabloskimg, 2018-10-21 17:55:36

There are at most N = 10^4 strings, each string is at most MAXLEN = 1000 characters long, but the length of the concatenation of all strings is at most 10^6. What would be the more efficient way to build a DAG as described in the title? The naive way would be comparing each pair of strings (X,Y), which leads to O(N^2) comparisons, and then for each pair to check whether X is substring of Y in O(MAXLEN^2). The naive solution could be improved by first sorting strings by length so that each string X can only be substring of strings to the right, and also we could use Rolling Hashing to reduce the complexity of substring search to O(MAXLEN). Is it possible to do even better? I've got the feeling that Suffix Array could be of help, but I'm not sure of exactly how. The motivating problem is this one

#strings, substring search

History

Revisions

Rev.	By	When	Δ	Comment
en5	pabloskimg	2018-10-21 17:55:36	1
en4	pabloskimg	2018-10-21 07:50:28	63
en3	pabloskimg	2018-10-21 07:47:51	12	Tiny change: 'substring verification to O(MAXL' -> 'substring search to O(MAXL'
en2	pabloskimg	2018-10-21 07:45:53	7
en1	pabloskimg	2018-10-21 07:45:01	1009	Initial revision (published)