Help finding sum of the number of distinct characters in all the distinct substrings of string S.

→ Обратите внимание

До соревнования
Pinely Round 4 (Div. 1 + Div. 2)
20:00:39
Зарегистрироваться »

*есть доп. регистрация

→ Трансляции

Codeforces Pinely Round 4 (Div 1 + Div 2) Solution Discussion

Shayan

До начала 23:05:39

Всё →

→ Лидеры (рейтинг)

№	Пользователь	Рейтинг
1	tourist	3880
2	jiangly	3669
3	ecnerwala	3654
4	Benq	3627
5	orzdevinwang	3612
6	Geothermal	3569
6	cnnfls_csy	3569
8	jqdai0815	3532
9	Radewoosh	3522
10	gyh20	3447

Страны | Города | Организации

Всё →

→ Лидеры (вклад)

№	Пользователь	Вклад
1	awoo	161
2	maomao90	160
3	adamant	156
4	maroonrk	153
5	atcoder_official	149
6	-is-this-fft-	148
6	SecondThread	148
8	Petr	147
9	nor	144
10	cry	142

Всё →

→ Найти пользователя

→ Прямой эфир

Детальнее →

Блог пользователя deepak1527

Help finding sum of the number of distinct characters in all the distinct substrings of string S.

Автор deepak1527, история, 5 лет назад, По-английски

How to solve the following problem. Find the sum of the number of distinct characters in all the distinct substrings of S. 1<=|S|<=100000. S="aabb"
Set of distinct sub-strings of "aabb" = {a,b,aa,ab,bb,aab,abb,aabb} sum = 1 + 1 + 1 + 2 + 1 + 2 + 2 + 2 = 12.

Thanks!

deepak1527
5 лет назад
4

Комментарии (4)

Написать комментарий?

aviroop123

5 лет назад, # |

+28

Can you give the link to the problem so that I can verify that it's not a part of some ongoing contest?

→ Ответить

deepak1527

5 лет назад, # ^ |

No it's not a part of any ongoing contest, this problem was asked in coding round of a company and that round has ended.

→ Ответить

m0nk3ydluffy

5 лет назад, # |

+19

I would like to propose a solution. It would be nice if someone can verify it.

The main idea is that for each character, we have to figure out its contribution to the final sum i.e. for each character $$$S_{i}$$$ in our string, we have to find how many distinct substrings exist such that $$$S_{i}$$$ is the first occurrence of its kind in that substring? This can be answered using a Suffix Automaton. (If we didn't have to count distinct substrings, the problem would have an easier solution.)

So we will build a Suffix Automaton on the given string. Lets name the starting node of the automaton as $$$t_{0}$$$. Consider an edge from node $$$u$$$ to node $$$v$$$ using the character $$$c$$$. The contribution of $$$c$$$ to the final sum = $$$dp[u][c] * dp[v]$$$ where,

$$$dp[u][c]$$$ = The number of paths from $$$t_{0}$$$ to $$$u$$$ such that we never use an edge with character $$$c$$$

$$$dp[v]$$$ = The number of paths that begin from node $$$v$$$ and end at any other node.

The above two dp's can be calculated fairly easily. The final answer should be the sum of the contribution of each edge in the Suffix Automaton.

Since you mentioned that this question was asked in the coding round of a company, it should have an easier solution.

→ Ответить

MSchallenkamp

5 лет назад, # |

We can solve this with a suffix array by recognizing that each unique substring can be represented as a prefix of a suffix.

Let's construct a suffix array. Now we'll walk forward, starting with the first string in the suffix array. For the first string in the suffix array all of the prefixes of this suffix are valid. For the next string only the prefixes longer than the longest common prefix between it and the last string are unique. All the others were counted as part of some earlier suffix. This will take nlogn time to construct the suffix array, nlogn time to find the longest common prefix between every two suffixes, and finally n time to sum all the values for each suffix.

→ Ответить

Соревнования по программированию 2.0

Время на сервере: 27.07.2024 21:34:21 (i2).

Десктопная версия, переключиться на мобильную.

При поддержке