KACTL ModMul. It says that it runs around 2x faster than naive
(__int128_t)a * b % M
When I ran my benchmarks with -O2, the results were similar. Am I mistaken?
№ | Пользователь | Рейтинг |
---|---|---|
1 | tourist | 3690 |
2 | jiangly | 3647 |
3 | Benq | 3581 |
4 | orzdevinwang | 3570 |
5 | Geothermal | 3569 |
5 | cnnfls_csy | 3569 |
7 | Radewoosh | 3509 |
8 | ecnerwala | 3486 |
9 | jqdai0815 | 3474 |
10 | gyh20 | 3447 |
Страны | Города | Организации | Всё → |
№ | Пользователь | Вклад |
---|---|---|
1 | maomao90 | 174 |
2 | awoo | 164 |
3 | adamant | 163 |
4 | TheScrasse | 159 |
5 | nor | 157 |
6 | maroonrk | 155 |
7 | -is-this-fft- | 152 |
8 | Petr | 146 |
8 | orz | 146 |
10 | BledDest | 145 |
KACTL ModMul. It says that it runs around 2x faster than naive
(__int128_t)a * b % M
When I ran my benchmarks with -O2, the results were similar. Am I mistaken?
Название |
---|
This is only one mod operation, try doing more operations and benchmarking.
I did that. I did 1e5 runs and their running time was basically the same.
1e5 operations is not very many. That should take around 1 millisecond. Try something like 1e10 of them to spot a consistent difference. Also make sure the compiler can't optimize it out.