Time complexity of regex object c++

→ Pay attention

Before contest
Educational Codeforces Round 165 (Rated for Div. 2)
30:15:18
Register now »

→ Streams

CF Edu Round #165 Solution Discussion

By aryanc403

Before stream 32:25:18

Educational Round 165 Solution Discussion (with myself)

By Shayan

Before stream 32:25:18

View all →

→ Top rated

#	User	Rating
1	ecnerwala	3649
2	Benq	3581
3	jiangly	3578
4	orzdevinwang	3570
5	Geothermal	3569
5	cnnfls_csy	3569
7	tourist	3565
8	maroonrk	3531
9	Radewoosh	3521
10	Um_nik	3482

Countries | Cities | Organizations

View all →

→ Top contributors

#	User	Contrib.
1	maomao90	174
2	awoo	164
3	adamant	161
4	TheScrasse	159
5	nor	158
6	maroonrk	156
7	-is-this-fft-	152
8	SecondThread	146
8	orz	146
10	pajenegod	145

View all →

→ Find user

→ Recent actions

Detailed →

Giaco's blog

Time complexity of regex object c++

By Giaco, history, 14 months ago, In English

Hello everyone!

I'm looking for the time complexity of the builder of c++ regex object. From a fast web search, i didn't find an answer (at least from the top 3 google search results lol). The stackoverflow's answer does not give an isnight of what it's happening in the constructor.

If you wonder why i'm looking for this, here are my two submission for the problem 1800A - Is It a Cat?.

Tle 196106344
Accepted 196106502

regex, timecomplexity, c++

Giaco
14 months ago
1

Comments (1)

Write comment?

nor

14 months ago, # |

← Rev. 2 →

+27

C++ regex is too slow in some cases to be usable practically. Constructing a regex object can be very slow due to possibly having exponentially many nodes in the resulting automaton (which in turn depends on the choice of regex you're using: ECMAScript, basic POSIX, and so on; and the type of matches you want can potentially make the language non-regular). The bottom line: don't use C++ regex.

A great resource on a simpler variant of the underlying machinery is here and shows why you should prefer using NFA based implementations over DFA based ones. In particular, you can use a dp to get a linear time (assuming the automaton size and alphabet size are constant) matching algorithm on a linear size automaton (in the size of the expression).

A substring matching algorithm for regular languages can be found in the next post, which is here.

Fun fact: I use this example a lot whenever someone comes up to me and asks me why theory is important, and why they should care about complexity, when obviously computers are getting faster (they're not, even on a hardware scale, let alone with the kind of code software developers write).

→ Reply