Number of different substrings of a string

→ Обратите внимание

До соревнования
Pinely Round 4 (Div. 1 + Div. 2)
26:20:27
Зарегистрироваться »

*есть доп. регистрация

→ Трансляции

Atcoder ABC #364 Solution Discussion

aryanc403

До начала 01:30:27

Codeforces Pinely Round 4 (Div 1 + Div 2) Solution Discussion

Shayan

До начала 29:25:27

Всё →

→ Лидеры (рейтинг)

№	Пользователь	Рейтинг
1	tourist	3880
2	jiangly	3669
3	ecnerwala	3654
4	Benq	3627
5	orzdevinwang	3612
6	Geothermal	3569
6	cnnfls_csy	3569
8	jqdai0815	3532
9	Radewoosh	3522
10	gyh20	3447

Страны | Города | Организации

Всё →

→ Лидеры (вклад)

№	Пользователь	Вклад
1	awoo	161
2	maomao90	160
3	adamant	156
4	maroonrk	153
5	atcoder_official	148
5	-is-this-fft-	148
5	SecondThread	148
8	Petr	147
9	nor	144
10	TheScrasse	142

Всё →

→ Найти пользователя

→ Прямой эфир

Детальнее →

Блог пользователя XLR8ST

Number of different substrings of a string

Автор XLR8ST, 9 лет назад, По-английски

How can i find the no. of distinct substrings of a string using Z-FUNCTION/Z-ARRAY ?

Time complexity should be less than O(n²).

I know there is a way using suffix array but i am more interested in solving this using Z-array/Z-function.

zalgorithm, substring

XLR8ST
9 лет назад
18

Комментарии (16)

Показать архивные | Написать комментарий?

mkirsche

9 лет назад, # |

Let Z[i] be the Z-value array of the suffix of S starting at position i. Then, for each position k, compute maxZ[k], the maximum of Z[0][k], Z[1][k-1], Z[2][k-2], ..., Z[k-1][1] (the maximum prefix of S[k..n) that also occurs in some earlier prefix of S). Then, all substrings starting at position k of length up to maxZ[k] have already occurred so you do not want to count those, but you count all longer substrings. Therefore, the answer is sum{k=0 to n} (n — k — maxZ[k]).

→ Ответить

XLR8ST

9 лет назад, # ^ |

-9

I'm unable to understand you . Why is the Z array 2-d ?

→ Ответить

mkirsche

9 лет назад, # ^ |

Each Z[i] is its own 1D array so Z is a 2D array

→ Ответить

smx

9 лет назад, # ^ |

How much have you have complexity?

→ Ответить

mkirsche

9 лет назад, # ^ |

It's O(n^2) time and space complexity. Here is a link to some Java code for it. You don't actually need to make a 2-D array though. You can just create a new 1-D array in each iteration of the outer for loop which results in O(n) memory.

→ Ответить

smx

9 лет назад, # ^ |

But XLR8ST asked something about less then O(N^2). And this is really interesting. Of course, we have suffix structurers such as suffix tree, but it's intresting to know about something easier than suffix structurers.

→ Ответить

prem.ktiw

8 лет назад, # ^ |

-11

http://ideone.com/hNMki9 this one uses a suffix array (nlogn) construction + LCP array (n) construction. Together they make the overall complexity nlogn. There is also one linear time suffix array calculation approach. If you use SA + LCP approach then you can count no. of distinct substrings in a string in time similar to the construction time of SA + LCP because, after SA + LCP is constructed it takes only linear time to count .

→ Ответить

TooNewbie

8 лет назад, # ^ |

← Rev. 2 →

-10

prem.kvit you dont have to share your hackerrank solution!! also your solution is wrong

→ Ответить

smx

9 лет назад, # ^ |

-8

you defined Z as 1-dimensional array and after that used them such as 2-dimensional.

→ Ответить

XLR8ST

9 лет назад, # ^ |

How ? Can you explain using code ?

→ Ответить

smx

9 лет назад, # ^ |

i am very intresting in it.

→ Ответить

Kino

8 лет назад, # |

for each prefix i of the word, reverse it and do z_function over it, the number of new distinct substrings that end in the prefix i is (the length of the prefix) — (maximum value in the z_function array) the pseudo code look like this:

string s; cin >> s;
int sol = 0
foreach i to s.size()-1
    string x = s.substr( 0 , i+1 );
    reverse( x.begin() , x.end() );
    vector<int> z = z_function( x );
    //this work too
    //vector<int> z = prefix_functionx(x); 
    int mx = 0;
    foreach j to x.size()-1
        mx = max( mx , z[j] );
    sol += (i+1) - mx; 

cout << sol;

→ Ответить

SyberCage

2 года назад, # |

← Rev. 2 →

-20

The following is the wrong algorithm, sorry for this.

I guess it is better to use trie data structure that will reduce the time complexity to O(26*N) but it effectively use to give number of distinct substrings as you want to calculate.

Code link: https://wtools.io/paste-code/bDox

→ Ответить