Given a list of strings, build a DAG in which each node is a string and there is an edge x->y iff x is proper substring of y

→ Pay attention

Before contest
Codeforces Round (Div. 2)
3 days
Register now »

*has extra registration

→ Top rated

#	User	Rating
1	Benq	3792
2	VivaciousAubergine	3647
3	Kevin114514	3611
4	jiangly	3583
5	strapple	3515
6	tourist	3470
7	dXqwq	3436
8	Radewoosh	3415
9	Otomachi_Una	3413
10	Um_nik	3376

Countries | Cities | Organizations

View all →

→ Top contributors

#	User	Contrib.
1	Qingyu	163
2	adamant	150
3	Um_nik	146
4	Dominater069	144
5	errorgorn	141
6	cry	139
7	Proof_by_QED	136
8	YuukiS	135
9	chromate00	134
9	TheScrasse	134

View all →

→ Find user

→ Recent actions

Detailed →

pabloskimg's blog

Given a list of strings, build a DAG in which each node is a string and there is an edge x->y iff x is proper substring of y

By pabloskimg, history, 8 years ago, In English

There are at most N = 10^4 strings, each string is at most MAXLEN = 1000 characters long, but the length of the concatenation of all strings is at most 10^6. What would be the more efficient way to build a DAG as described in the title? The naive way would be comparing each pair of strings (X,Y), which leads to O(N^2) comparisons, and then for each pair to check whether X is substring of Y in O(MAXLEN^2). The naive solution could be improved by first sorting strings by length so that each string X can only be substring of strings to the right, and also we could use Rolling Hashing to reduce the complexity of substring search to O(MAXLEN). Is it possible to do even better? I've got the feeling that Suffix Array could be of help, but I'm not sure of exactly how. The motivating problem is this one