Multiple Queries/String Matching

→ Pay attention

Before contest
Codeforces Round 941 (Div. 1)
4 days

Before contest
Codeforces Round 941 (Div. 2)
4 days

→ Streams

CodeChef Starters 131 Solution Discussion

By aryanc403

Before stream 35:15:45

View all →

→ Top rated

#	User	Rating
1	ecnerwala	3650
2	Benq	3582
3	Geothermal	3570
3	orzdevinwang	3570
5	cnnfls_csy	3569
6	tourist	3565
7	maroonrk	3532
8	Radewoosh	3522
9	Um_nik	3483
10	jiangly	3468

Countries | Cities | Organizations

View all →

→ Top contributors

#	User	Contrib.
1	maomao90	174
2	awoo	164
3	adamant	163
4	TheScrasse	159
5	nor	158
6	maroonrk	156
7	-is-this-fft-	151
8	SecondThread	147
9	orz	146
10	pajenegod	145

View all →

→ Find user

→ Recent actions

Detailed →

Hepic_Antony_Skarlatos's blog

Multiple Queries/String Matching

By Hepic_Antony_Skarlatos, history, 8 years ago, In English

What is the most efficient algorithm when the problem gives you a string of N length and asks you to answer in Q queries if the ith word of length M (where M is much lesser than N -> M << N) is contained into 'N length word' ?

Thanks in advance !

Comments (8)

Write comment?

gabrielsimoes

8 years ago, # |

← Rev. 2 →

Well, if M is really small, you can compute hash for all possible words of size <= M inside the string N. Then, just compute the hash for the ith word and check if the same hash was found inside the string N.

Edit: this would be O(n*m + q*m)

→ Reply

gabrielsimoes

8 years ago, # ^ |

← Rev. 2 →

Could anyone tell me if that will work?

→ Reply

tenshi_kanade

8 years ago, # |

← Rev. 3 →

You can compute the suffix array for the string of length N and then answer each query in O(M * _logN), making the algorithm O(Q * M * _logN). If M is small, it should run in time.

→ Reply

Hepic_Antony_Skarlatos

8 years ago, # ^ |

I just know a MlogN algorithm. How I will get that in O(M) ?

→ Reply

tenshi_kanade

8 years ago, # ^ |

← Rev. 2 →

Yes, you're right. I fixed the typo. I'd need to know the actual constraints, but I guess this solution should be fast enough.

→ Reply

_index

8 years ago, # ^ |

I think you can get O(M) per query using suffix automaton.

→ Reply

radoslav11

8 years ago, # ^ |

Yep. Just build a suffix automaton on the string of length N and after that for each query run a dfs from the start node of the automaton. If you can do all M transitions between the automaton states then the small string is contained in the big one. The time complexity is O(M)*O(Q)=O(M*Q).

→ Reply

aka.Sohieb

8 years ago, # |

+25

I think the most efficient algorithm for this kind of problems is Aho–Corasick

→ Reply