#	User	Rating
1	ecnerwala	3648
2	Benq	3580
3	orzdevinwang	3570
4	cnnfls_csy	3569
5	Geothermal	3568
6	tourist	3565
7	maroonrk	3530
8	Radewoosh	3520
9	Um_nik	3481
10	jiangly	3467

#	User	Contrib.
1	maomao90	174
2	awoo	164
2	adamant	164
4	TheScrasse	159
4	nor	159
6	maroonrk	156
7	-is-this-fft-	150
8	SecondThread	147
9	orz	146
10	pajenegod	145

Hello, Codeforces!

Some time ago I invented an interesting data structure and I'd like to share it with the community. Maybe, it was known before, and if you knew about it before my blog post, please share the link to the source.

What can the data structure do?

Given an array a that contains n elements and the operation op that satisfies some criteria:

Associative property: (x op y)op z = x op(y op z).
Commutative property (optional, not required): x op y = y op x.

So, such operations as gcd, min, max, sum, multiplication, and, or, xor, etc. satisfy these conditions.

Also we have some queries l, r. For each query, we need to find a_l op a_l + 1 op ... op a_r. Let's call such queries q(l, r).

My data structure can process such queries in O(1) time with $\text{[math]}$ preprocessing time and $\text{[math]}$ memory.

How it works?

1. Sqrt

Let's do a sqrt-decomposition. We divide our array in $\text{[math]}$ blocks, each block has size $\text{[math]}$ . For each block, we compute:

Answers to the queries that lie in the block and begin at the beginning of the block (_prefix-op_)
Answers to the queries that lie in the block and begin at the end of the block (_suffix-op_)

And we'll calculate another one thing:

between[i, j] i ≤ j -- answer to the query that begins at the start of block i and ends at the end of block j. Note that we have $\text{[math]}$ blocks, so the size of this array will be $\text{[math]}$ .

Let's see the example.

Let op be + (we calculate sum on the segment) and we have the following array a:

1 2 3 4 5 6 7 8 9

It will be divided onto three blocks: 1 2 3, 4 5 6 and 7 8 9.

For first block prefix-op is 1 3 6 and suffix-op is 6 5 3.

For second block prefix-op is 4 9 15 and suffix-op is 15 11 6.

For third block prefix-op is 7 15 24 and suffix-op is 24 17 9.

between array is:

6 21 45
- 15 39
-  - 24

It's obvious to see that these arrays can be easily calculated in O(n) time and memory.

We can answer such queries using these arrays. If the query doesn't fit in one block, we can divide it onto three parts: suffix of a block, then some segment of contigious blocks and then prefix of some block. We can answer the query by taking op of some segment from suffix-op, then some segment from in between, then some segment from _prefix-op$.

But if we have queries that entirely fit into one block, we cannot process them using these three arrays. So, we need to do something.

2. Tree

We cannot answer only the queries that entirely fit in one block. But what if we build the same stucture as described above for each block? Yes, we can do it. And we do it recursively, until we reach the block size of 1 or 2. Answers for such blocks can be calculated easily in O(1).

So, we get a tree. Each node of the tree represents some segment of the array. Node that represents array segment with size k has $\text{[math]}$ children -- for each block. Also each node contains the three arrays described above for the segment it contains. The root of the tree represents the entire array.

Also it's obvious that the size of this tree is $\text{[math]}$ , because if some vertex of the sqrt-tree represented an array with length k, then its children will have length $\text{[math]}$ . $\text{[math]}$ , so $\text{[math]}$ decreases two times every layer of the tree and its height is $\text{[math]}$ . The time for building and memory usage will be $\text{[math]}$ , because every element of the array appears exactly once in each layer of the tree.

Now we can answer the queries in $\text{[math]}$ . We can go down on the tree until we meet a segment with length 1 or 2 (answer for it can be calculated in O(1) time) or meet a segment in which our query doesn't fit entirely into one block. See the first section on how to answer the query in this case.

OK, now we can do $\text{[math]}$ per query. Can it be done faster?

3. Optimizing the query complexity

One of the most obvious optimization is to binary search the node we need in the tree. Using binary search, we can reach the $\text{[math]}$ complexity per query. Can we do it even faster?

The answer is yes. Let's assume the following two things:

The block size is a power of two.
All the blocks are equal on each layer.

To reach this, we can add some zero elements to our array so that its size becomes a power of two.

When we use this, some block sizes may become twice larger to be a power of two, but it still be $O(\sqrt{k}) in size and linear complexity for building the arrays in one block remains.

Now, we can easily check if the query fits entirely into a block size 2^k. Let's write the ranges of the query, l and r (we use 0-indexation) in binary form. For instance, let's assume k = 4, l = 39, r = 46. The binary representation of l and r is:

l = 39(10) = 100111(2)

r = 46(10) = 101110(2)

Remember that one layer contains segments of the equal size, and the block on one layer have also equal size (in our case, their size is 2^k = 2⁴ = 16. The blocks cover the array entirely, so the first block covers elements (0 - 15) (000000 - 001111 in binary), the second one covers elements (16 - 31) (010000 - 011111 in binary) and so on. We see that the indices of the positions covered by one block may differ only in k (in our case, 4) last bits.

So, we need to check if nothing more that k smallest bits differ (or l xor r doesn't exceed 2^k - 1).

Using this observation, we can find a layer that is suitable to answer the query quickly. How to do this:

For each i that doesn't exceed the array size, we find the highest bit that is equal to 1. To do this quickly, we use DP and a precalculated array.
Now, for each q(l, r) we find the highest bit of l xor r and, using this information, it's easy to choose the layer on which we can process the query easily. We can also use a precalculated array here.

For more details, see the code below.

So, using this, we can answer the queries in O(1) each.

Conclusion

We have a data structures that asymptotically answers the queries on a segment very fast. But, the constant of this data structures is big enough. Also it doesn't support changing the elements. But it can be modified to preform modification queries in something about $\text{[math]}$ for modifying a single element. But in this case, query complexity becomes $\text{[math]}$ . You can think how to do this as an exercise.

https://paste.ubuntu.com/26367387/

If you have some suggestions on how to improve this data structure or how to modify it, or something is written unclear and needs more explanation, feel free to write in the comments.

Rev.	By	When	Δ	Comment
en15	gepardo	2018-04-24 22:44:35	80
en14	gepardo	2018-01-12 18:37:10	224	Removed the commutative property
en13	gepardo	2018-01-12 13:25:17	11	Tiny change: 'y to $O(n\alpha(n))$ for pre' -> 'y to $O(n\log* n)$ for pre'
en12	gepardo	2018-01-12 12:56:59	222	Text was rewritten a little
en11	gepardo	2018-01-12 12:50:08	2	Tiny change: '-01-12] decsribed in t' -> '-01-12] described in t'
en10	gepardo	2018-01-12 12:49:47	372	Added improvement by geniucos
en9	gepardo	2018-01-12 11:35:41	1	Tiny change: '(\sqrt{k}) in size a' -> '(\sqrt{k})$ in size a'
en8	gepardo	2018-01-12 00:30:18	254	Added link to the article
en7	gepardo	2018-01-11 22:37:23	1	Tiny change: ' structures that asym' -> ' structure that asym'
en6	gepardo	2018-01-11 22:17:37	132	Added link to the problem
en5	gepardo	2018-01-11 22:10:37	1
en4	gepardo	2018-01-11 21:58:21	0	(published)
en3	gepardo	2018-01-11 21:57:36	4447	More minor fixes
en2	gepardo	2018-01-11 21:10:15	12	Minor fixes
en1	gepardo	2018-01-11 21:09:18	7034	Initial revision (saved to drafts)