Monster_Nerd's blog

By Monster_Nerd, history, 3 years ago, In English

Recently I got TLE on this problem, E. Tree-String Problem, using a straightforward DFS + KMP. Later I learned that there is an optimized version of KMP. The code below is taken from Arpa's code.

// Prefix (failure) function of the pattern p:
// f[i] = length of the longest proper prefix of p[0..i-1] that is also a suffix of it
// (f is assumed to be zero-initialized, so f[0] = f[1] = 0).
int k = 0;
for(int i = 1; i < p.size(); i++){
    while(k && p[i] != p[k]) k = f[k];
    if(p[i] == p[k]) k++;
    f[i + 1] = k;
}

// KMP automaton: nxt[i][j] = number of matched characters of p after being in
// state i (i characters matched so far) and reading the letter 'a' + j.
for(int i = 0; i < p.size(); i++)
    for(int j = 0; j < z; j++)
        nxt[i][j] = p[i] - 'a' == j ? i + 1 : bool(i) * nxt[ f[i] ][j];

Now my question is: can I always use this version (as long as the memory limit allows)? Is it consistent, and can anyone briefly explain what it does?

»
3 years ago, # |

Yes, it's consistent. If you simplify the code a bit, it's easy to understand.
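For example, nxt[k][c] just answers: "if the first k characters of p are currently matched and the next character of the text is 'a' + c, how many characters of p are matched afterwards?". A minimal sketch of scanning some string text with it (text and the match handling are placeholders; f and nxt are assumed to be exactly the ones from the post):

int st = 0;                          // characters of p matched so far
for (int i = 0; i < (int)text.size(); i++) {
    st = nxt[st][text[i] - 'a'];     // one table lookup, no while-loop
    if (st == (int)p.size()) {       // an occurrence of p ends at position i
        // ... report or count the match here ...
        st = f[p.size()];            // continue from the longest proper border
    }
}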

»
3 years ago, # |
Rev. 4

Your original code TLEs on this problem because its worst-case runtime is $$$\Omega(|t| + n \cdot \min (|t|, \sum |s_v|))$$$. The failing testcase is probably a larger version of the following:

10
1 aaaaaaaaaa
2 x
2 x
2 x
2 x
2 x
2 x
2 x
2 x
aaaaaaaaaay

Your original code performs the backtracking part of KMP at match-time. When parsing a single string, this doesn't affect asymptotic complexity and doesn't have a big impact on the constant factor because each character added to a partial match is removed at most once. But in the tree setting, the partial match characters may be removed $$$n-2$$$ times in a test case like the one above. Arpa's version of KMP gets around this by preprocessing all of the potential backtracking steps so that they may be performed in (un-amortized) $$$O(1)$$$ at match time, restoring a more reasonable $$$O(n + |\sigma | |t| + \sum |s_v|)$$$ time complexity.

EDIT: Complexities were rewritten a bit more precisely.
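To make that concrete, here is a rough sketch of the matching DFS under that scheme. It is only an illustration: t denotes the pattern (p in the snippet from the question), the names s, adj and cnt are placeholders rather than the problem's actual input format, and f and nxt are assumed to be built from t exactly as in the question.

#include <bits/stdc++.h>
using namespace std;

string t;                          // the pattern (p in the snippet from the question)
vector<string> s;                  // s[v] = string written on vertex v (placeholder)
vector<vector<int>> adj;           // children lists of the rooted tree (placeholder)
vector<long long> cnt;             // occurrences of t ending inside s[v] (placeholder)
vector<int> f;                     // prefix function of t, size |t| + 1, built as in the question
vector<vector<int>> nxt;           // automaton table, built as in the question

// state = number of characters of t matched along the path from the root
// down to (but not including) vertex v's string.
void dfs(int v, int state) {
    for (char c : s[v]) {
        state = nxt[state][c - 'a'];     // one O(1) lookup per character
        if (state == (int)t.size()) {    // an occurrence of t ends inside s[v]
            cnt[v]++;
            state = f[t.size()];         // continue from the longest proper border
        }
    }
    for (int u : adj[v]) dfs(u, state);  // each child resumes from this state
}

Every character of every $$$s_v$$$ is now consumed exactly once with a single table lookup, which is where the $$$\sum |s_v|$$$ term in the complexity above comes from.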

  • »
    »
    3 years ago, # ^ |

    Thanks a lot, buddy. I had tried hard to figure out the failing test case and couldn't see what was going wrong. Now I completely get it, and at last I understand why the optimization was required. Thanks a lot.