Big integers with negative digits: The Trygub numbers

#	User	Rating
1	tourist	3880
2	jiangly	3669
3	ecnerwala	3654
4	Benq	3627
5	orzdevinwang	3612
6	Geothermal	3569
6	cnnfls_csy	3569
8	jqdai0815	3532
9	Radewoosh	3522
10	gyh20	3447

#	User	Contrib.
1	awoo	161
1	maomao90	161
3	adamant	156
4	maroonrk	153
5	-is-this-fft-	148
5	atcoder_official	148
5	SecondThread	148
8	Petr	147
9	nor	144
10	TheScrasse	142

Hi everyone!

Today we will talk about something that might be useful in programming competitions. Yay! Also great thanks to antontrygubO_o for sharing this with me and others in a Discord server.

Let's implement a data structure that maintains a big number $$$N$$$ in base $$$b$$$ and supports the following:

Given (possibly negative) integers $$$|x|, |y| \leq n$$$, add $$$x b^y$$$ to $$$N$$$.
Given $$$k$$$ and assuming $$$N \geq 0$$$, print the $$$k$$$-th digit of $$$N$$$.
Check if $$$N$$$ is positive, negative or equals to $$$0$$$.

Each operation should take at most $$$O(\log n)$$$ amortized time and $$$O(q)$$$ memory. While some approach you may think of immediately would imply using segment tree, or a similar structure, the solution proposed here only requires std::map, so it's much shorter and easier to implement (at the slight expense of increased constant factor). It may be used in the following problems:

If you implement the big integers in these numbers the standard way (i.e. keeping digits in the $$$[0, b)$$$ segment, carefully executing carries, etc), you will quickly learn that you may get in trouble because you may be forced to do and undo a lot of carry operations which chain up and so you need to frequently change large segments between the values of $$$0$$$ and $$$b-1$$$.

Now, stop being fooled by the non-negative propaganda! You don't have to do it! Let's give ourselves some slack and allow negative digits. Well, just a bit of them. Instead of maintaining digits in $$$[0,b)$$$, let's now maintain them in the interval $$$(-b, b)$$$. It seems like a tiny change, but the effect is tremendous. On one hand, the representation of any number is not unique anymore. On the other hand, when we actually reach the value $$$b$$$ or $$$-b$$$, we wrap them back to $$$0$$$, and carry $$$1$$$ or $$$-1$$$ to the next digit correspondingly.

Noticed anything? The carry now wraps us from the endpoints of the interval to its middle instead of from one endpoint to another! It would be easy to add $$$1$$$ to a particular bit, turn it into $$$b$$$ and cause a chain of carries by it. But! If after that we add $$$-1$$$ to the same bit, it will not wrap all the bits back to $$$b-1$$$! It will just change this specific bit to $$$-1$$$! So, we give up the uniqueness of the representation, but we gain a whole lot of stability in exchange.

The C++ implementation for the first two queries is also quite concise:

code

struct trygub_num {
    const int base = 1 << 30;

    map<int, int> digs;

    void add(int a, int b) {
        digs[b] += a;
        int t;
        do {
            t = digs[b] / base;
            digs[b + 1] += t;
            digs[b] -= t * base;
            if(digs[b] == 0) {
                digs.erase(b);
            }
            b++;
        } while(t);
        if(digs[b] == 0) {
            digs.erase(b);
        }
    }

    int get(int k) {
        auto it = digs.lower_bound(k);
        int ans = it == end(digs) || it->first > k ? 0 : it->second;
        if(it != begin(digs) && prev(it)->second < 0) {
            ans--;
        }
        return (ans + base) % base;
    }
} me;

I tested it on #2302. 「NOI2017」整数 and it works!

P.S. Applying it to 1817E - Half-sum and 1810F - M-tree is left to the curious reader as an exercise :)

P.P.S. Is this trick well-known? Does it have a name?

#if __has_include("pch.hpp") #include "pch.hpp" #else #include <bits/stdc++.h> #include <ext/pb_ds/assoc_container.hpp> #include <ext/pb_ds/tree_policy.hpp> #endif using namespace std; typedef long long ll; typedef unsigned long long ull; typedef long double ld; using namespace __gnu_pbds; typedef tree<ll,null_type,less_equal<ll>,rb_tree_tag,tree_order_statistics_node_update> order_set; mt19937 mt_rand(chrono::high_resolution_clock::now().time_since_epoch().count()); //ld rand(ld a, ld b) {uniform_real_distribution<ld> uni(a, b); return uni(mt_rand);} //const ld PI=3.141592653589793238462643383279; const int mxN=2e6+50000; const int mod=998244353; const int mxlogN=18; const ll inf=2e18; const int iinf=1e9; const int K=20; struct trygub_num { int base; map<int, int> digs; void setup(int b) { digs.clear(); base=b; } void add(int a, int b) { digs[b] += a; int t; do { t = digs[b] / base; digs[b + 1] += t; digs[b] -= t * base; if(digs[b] == 0) { digs.erase(b); } b++; } while(t); if(digs[b] == 0) { digs.erase(b); } } }; struct trygub { int base; map<int,int> digs; void setup(int b) { digs.clear(); base=b; } void add(int a, int b) { if(!a) return; auto it=digs.insert({b,0}).first; it->second+=a; vector<map<int,int>::const_iterator> todel; while(1) { int t=it->second/base; if(!t) break; it->second-=t*base; if(!it->second) todel.push_back(it); if(next(it)==digs.end()||next(it)->first!=b+1) digs.insert(next(it),{b+1,0}); it++; it->second+=t; b++; } if(!it->second) todel.push_back(it); for(auto it:todel) digs.erase(it); } }; int a[mxN], b[mxN]; struct test { int n, base, bnd; }; int main() { ios_base::sync_with_stdio(0); cin.tie(0); trygub_num num; trygub num2; vector<test> tests={ {400000,2,100000}, {400000,3,100000}, {400000,4,100000}}; int cnt=1; for(auto t:tests) { int n=t.n; for(int i=0; i<n; i++) a[i]=mt_rand(), b[i]=mt_rand()%t.bnd; num.setup(t.base); num2.setup(t.base); time_t start, stop; start=clock(); for(int i=0; i<n; i++) num.add(a[i],b[i]); stop=clock(); cout << "TEST " << cnt++ << "\n"; cout << "Original time " << (double)(stop - start) / CLOCKS_PER_SEC << "\n"; start=clock(); for(int i=0; i<n; i++) num2.add(a[i],b[i]); stop=clock(); cout << "Optimized time " << (double)(stop - start) / CLOCKS_PER_SEC << "\n"; } }

Comments (11)

Write comment?

adamant

15 months ago, # |

+58

At the request of jeroenodb and in honor of great antontrygubO_o I also suggest calling the structure "Trygub num".

→ Reply

ffao

+20

Lyrically

+77

Finally adamant posted something that we can understand:P

bvd

+30

Allowing negative digits in number representations has already appeared in Concrete Mathematics, page 15-16, but this is the first time I see the "wrapping around" part.

← Rev. 3 →

Now that I think about it, the intended solution to 102354E - Decimal Expansion uses pentagonal number theorem to represent the constant $$$\frac{9}{10}\frac{99}{100}\frac{999}{1000} \dots$$$ as a Trygub number in base $$$10$$$ and recover the corresponding digit. So many applications!

alwyn

Another problem: INC 2022 C — Powers of Two

jeroenodb

← Rev. 6 →

+24

I really wanted to know why the amortized complexity is good. So I decided to prove it for myself:

We will be using a potential function, which represents how bad we messed up the data structure. If the potential function is high it means we need to clean up a lot of trash.

Let's take as potential function:

$$$\phi(\text{trygub num DS}) = c\log(n) \sum | \text{digs}[i]|/b$$$

The absolute value of the digits, times a constant $$$c \log(n)$$$, which isn't too important for now. It's intuitively clear that the higher the absolute values of the digits, the more carries can happen in future operations, so such a state of the data structure is worse. Good to note is that this potential function starts at $$$0$$$ and its value is always non-negative.

To prove the amortized bound, let's see how the potential changes with the add operation. (It is the only operation that changes the potential function).

In the $$$\texttt{add}$$$ function, if there are $$$k$$$ carries, the function needs to do $$$O( (1+ k) \cdot \log(n))$$$ work. When a carry is happening, it means we subtract $$$\text{carry} \cdot b$$$ from $$$\text{digs}[i]$$$ and add back $$$\text{carry}$$$ to $$$\text{digs}[i+1]$$$. The carrying always decreases $$$|\text{digs}[i]|$$$ and either increases or decreases $$$|\text{digs}[i+1]|$$$. In the worst-case, the potential function decreases by at least $$$ c \log(n) \cdot (1 - \frac{1}{b})$$$. So

$$$T_\text{amortized} = \Delta \phi + \text{actual work} \leq -k \cdot c \log(n) (1 - \frac{1}{b} ) + O( (k+1) \log(n))$$$

If the constant $$$c$$$ is chosen big enough, such that the $$$-k$$$ term is bigger than the $$$+k$$$ term inside the big Oh notation, the $$$+k$$$ and $$$-k$$$ terms can cancel. So $$$T_\text{amortized} = O(\log(n))$$$.

We didn't yet account for the extra potential that is due to the addition of $$$x$$$ to the digit. If we can assume $$$x \leq b$$$, then this adds potential $$$\leq c \log(n) \cdot b / b = c \log(n)$$$, so it is still amortized fast.

If $$$b \ll x $$$, the amortized analysis does not work (you can prove some slightly worse bounds though). Luckily, this can be easily fixed. It doesn't bother us if we change the base used in the data structure to $$$b^\prime = b^k$$$, for $$$k>0$$$, so we just choose a large enough $$$k$$$, such that $$$b^k \geq \max X$$$.

You can even prove if $$$\max X = O( \text{base}^c)$$$, for some constant $$$c$$$, the amortized analysis still works.

15 months ago, # ^ |

Btw, a good way to think about how a potential function actually proves an amortized bound is this:

The potential function can be seen as a bank account, where you can put money into to save it for later. Whenever you're decreasing the potential you are taking away money. Everyone knows money = time. So in this analogy, to pay for the time of a slow operation, you take some money out of the bank account. With the money you saved in earlier operations by paying extra and putting it in the bank account. By ensuring the potential function is non-negative you are ensuring you're never spending more money than you have in the bank.

peltorator

+16

Now, stop being fooled by the non-negative propaganda!

Thank you adamant for keeping Codeforces from propaganda!

toniskrijelj

2 months ago, # |

We can utilize the fact that additions will be done on some segment [b,b+x], so we can insert and update values in map using an iterator. I did some benchmarking and optimized version gets reasonable faster.

struct trygub
{
    int base;
    map<int,int> digs;
    void setup(int b)
    {
        digs.clear();
        base=b;
    }
    void add(int a, int b)
    {
        if(!a) return;
        auto it=digs.insert({b,0}).first;
        it->second+=a;
        vector<map<int,int>::const_iterator> todel;
        while(1)
        {
            int t=it->second/base;
            if(!t) break;
            it->second-=t*base;
            if(!it->second) todel.push_back(it);
            if(next(it)==digs.end()||next(it)->first!=b+1)
            digs.insert(next(it),{b+1,0});
            it++; it->second+=t;
            b++;
        }
        if(!it->second) todel.push_back(it);
        for(auto it:todel) digs.erase(it);
    }
    int get(int k)
    {
        auto it=digs.lower_bound(k);
        int ans=0;
        if(it!=digs.end()&&it->first==k) ans=it->second;
        if(it!=digs.begin()&&prev(it)->second<0) ans--;
        return (ans+base)%base;
    }
    int le0()
    {
        if(digs.empty()) return 1;
        return digs.rbegin()->second<0;
    }
    void pop()
    {
        while(digs.size()>1)
        {
            auto [k,c]=*digs.rbegin();
            auto [k2,c2]=*next(digs.rbegin());
            if(k2+1==k&&-c2==base-1&&c==1)
            {
                digs.erase(prev(digs.end()));
                digs.rbegin()->second=1;
            }
            else break;
        }
    }
};

I also added function to check if the number is <= 0, and pop function which clears digs map so that last non-zero digit is at last map element or last-1 (implemented for positive trygub num).

benchmark code

Further, we can utilize array and set if we know that additions will be on values <= maxN. But this actually doesn't get faster than optimized map, only in certain cases.

set code

int cnt[mxN];
struct trygub2
{
    set<int> digs;
    int base;
    void setup(int b)
    {
        base=b;
        for(auto x:digs) cnt[x]=0;
        digs.clear();
    }
    void add(int a, int b)
    {
        if(!a) return;
        cnt[b]+=a;
        auto it=digs.insert(b).first;
        vector<set<int>::const_iterator> todel;
        while(1)
        {
            int t=cnt[b]/base;
            if(!t) break;
            if(next(it)==digs.end()||*next(it)!=b+1)
            digs.insert(next(it),b+1);
            cnt[b]-=t*base;
            cnt[b+1]+=t;
            if(!cnt[b]) todel.push_back(it);
            it++, b++;
        }
        if(!cnt[b]) todel.push_back(it);
        for(auto it:todel) digs.erase(it);
    }
    int get(int k)
    {
        auto it=digs.lower_bound(k);
        int ans=cnt[k];
        if(it!=digs.begin()&&cnt[*prev(it)]<0) ans--;
        return (ans+base)%base;
    }
    int le0()
    {
        if(digs.empty()) return 1;
        return cnt[*digs.rbegin()]<0;
    }
    void pop()
    {
        while(digs.size()>1)
        {
            auto k=*digs.rbegin(), k2=*next(digs.rbegin());
            int c=cnt[k],c2=cnt[k2];
            if(k2+1==k&&c==1&&c2==(-base+1))
            {
                digs.erase(prev(digs.end()));
                cnt[k]=0, cnt[k2]=1;
            }
            else break;
        }
    }
};

mano54

2 months ago, # ^ |

-13

that's too hard

adamant's blog