-arma-'s blog

By -arma-, history, 7 years ago, In English

Hi Codeforces, in this topic I want to explain what sqrt decomposition is, walk through an example, and list some problems that can be solved with it.

The whole idea of sqrt decomposition is to reduce complexity by splitting the work into blocks. For example, suppose I have an algorithm with O(N^2) complexity and an alternative with O(k·N + N^2/k) complexity, where k is the size of the blocks I made. Then setting k = sqrt(N) gives an O(N·sqrt(N)) algorithm.

We'll solve a problem to clarify. The problem is this:

You're given an array A with n elements and q queries of 2 types: "? l r" — print the sum of the values between l (inclusive) and r (exclusive), i.e. over [l, r); "+ l r x" — increase every element in [l, r) by x. Constraints: n <= 1e5, q <= 1e5. (By the way, you can solve this problem more easily with a segment tree or a Fenwick tree.)

Let's split A into n/k consecutive buckets, each of which (except possibly the last) holds k elements. Then we can do the following: if a query covers a whole bucket, we update/read a single variable stored for that bucket; otherwise we touch the elements one by one. Each query thus involves at most 2 incomplete buckets and at most n/k complete buckets, so the total complexity is O(q·n/k + q·k). Since the product of q·n/k and q·k is constant, their sum is minimized when q·n/k = q·k, i.e. k = sqrt(n), giving O(q·sqrt(n)) — with q on the order of n, that is O(n·sqrt(n)).

There is sample code that does this here.

Read 342E - Xenia and Tree; first think about how you would solve it, then read on.

Think about how we could solve it if, for example, O(q^2) were acceptable.

Answer

Now let's try to improve the algorithm by partitioning the queries into groups of size k and see how that helps.

Answer

So now we have an O(n·q/k + q·k) algorithm; as we proved before, this is minimized at k = sqrt(n), and the algorithm runs in O(q·sqrt(n)).

Here are some other problems that can be solved by sqrt decomposition: 44C - Holidays, 13E - Holes, 86D - Powerful array, 398D - Instant Messanger and 455D - Serega and Fun.

In the end, as you saw from the problems, there is no single type of problem that sqrt decomposition solves. Another thing you should notice is that your bucket size should not always be sqrt(n): if you have, say, an O(q·k + q·n·log n / k) algorithm, then by the same argument as before the minimum is at k^2 = n·log n, i.e. k = sqrt(n·log n).
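As a sanity check of that claim, the minimization is just AM-GM applied to the two terms:

```latex
f(k) = ak + \frac{b}{k} \ge 2\sqrt{ab},
\qquad \text{with equality iff } ak = \frac{b}{k}, \text{ i.e. } k = \sqrt{b/a}.
```

With a = q and b = q·n·log n this gives k = sqrt(n·log n), not sqrt(n).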


»
7 years ago, # |

Thanks a lot :) Very nice implementation of sqrt decomposition.

»
7 years ago, # |

Good tutorial.

»
6 months ago, # |

"OK, we can recompute after every k queries and store, for each node, the answer as of the last multiple of k, using a BFS started from the red nodes."

Please explain this clearly... why don't you all remember that we are newbies? You should explain your approach clearly and be clear in your words. Please explain.

  • »
    »
    6 months ago, # ^ |

You're right. Sorry.

So after k queries, I want to find the closest red node to each node in the tree. Let's reformulate this problem: we have a tree $$$T$$$ and, for every node $$$v$$$ of $$$T$$$, we want to find $$$d_v$$$, the distance from $$$v$$$ to its closest red node.

To solve this problem, I make an observation: the closest red node to a node $$$v$$$ is either the highest red node in $$$v$$$'s subtree, or the closest red node to its parent (since any other path has to go through the parent).

You can solve this subproblem linearly by figuring out the highest red node in each subtree with a DFS (return the highest red node observed to the parent, and take the highest over all children).

Then for the root of $$$T$$$, the answer has to be the highest red node in its subtree, as there are no other nodes available. For each of the root's children we can then find the answer, because we know both the highest red node in the subtree and the parent's answer. So with a DFS or BFS we can find the answer for all nodes.

Now that we know the answers with the red nodes up to query $$$ik$$$, we can answer query $$$ik + j$$$ by figuring out whether any of the $$$j$$$ new red nodes is closer than the last answer we found for $$$T$$$.

    Feel free to write back if it's still unclear.

    • »
      »
      »
      6 months ago, # ^ |

Can you please explain in detail how you derived the time complexity? Also, you wrote that naively we would have a time complexity of O(q^2) — how is that?

      • »
        »
        »
        »
        6 months ago, # ^ |

Go watch Errichto's video.

        • »
          »
          »
          »
          »
          6 months ago, # ^ |

I watched it, but there he gives a convex hull example, so I couldn't understand.

      • »
        »
        »
        »
        6 months ago, # ^ |

Let's say I'm dividing the queries into batches of $$$k$$$ and we have $$$q$$$ queries in total. After every batch I do an $$$O(n)$$$ pass to update the closest red node for each node. And for every query I have at most $$$k$$$ recent red nodes to compare with the most recent value from the last batch.

        So in total we have $$$q$$$ queries each taking time $$$O(k)$$$ and at each of the $$$q/k$$$ batches we have a processing of $$$O(n)$$$. So it's going to be of $$$O(qk + nq/k)$$$.

        To find the distance between two nodes on a tree, you can find the Least Common Ancestor (LCA, you can find many tutorials on how to calculate it in logarithmic time) and the rest is just simple calculations based on the nodes' height. You can draw the path from each of the two nodes to the root, and you can see that you have the shortest (only) path between the nodes and you have traversed the distance between the LCA and the root twice.
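In symbols, the distance described above is:

```latex
\operatorname{dist}(u, v) = \operatorname{depth}(u) + \operatorname{depth}(v) - 2\,\operatorname{depth}(\operatorname{lca}(u, v))
```

The two depth terms count both root-to-node paths, and the subtracted term removes the doubly counted root-to-LCA segment.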

        • »
          »
          »
          »
          »
          6 months ago, # ^ |

OK, now I understand it. Thanks for replying to a comment 6 years after its original post! Just one last thing to clarify: I understand the O(n) processing time via BFS after each batch, and the O(k) time per query (in the worst case) comes because, after each colour update, we compare the stored minimum distances against the new red nodes of that batch when answering minimum-distance queries. Am I right? Please confirm.

          • »
            »
            »
            »
            »
            »
            6 months ago, # ^ |

            No worries.

            Didn't get what you mean.

After each batch, the closest red node for every node of the tree is updated (you need those to find the values anyway, I guess).

After each query you compare the last solution you have from the batch boundary with all new red nodes in the current batch.

            • »
              »
              »
              »
              »
              »
              »
              6 months ago, # ^ |

Yes, I got it. I was trying to say the same thing: after each query we update the solution for each blue node in the batch by comparing the previous solution with the new list of red nodes. Thanks for taking so much of your precious time, and sorry for the confusion!

              • »
                »
                »
                »
                »
                »
                »
                »
                6 months ago, # ^ |

Also, please point me to some useful beginner-friendly and intermediate resources, videos, and problems on sqrt decomposition — there is so much on Codeforces that it becomes very difficult to filter the content.