Pollard Rho Integer Factorization

Please subscribe to the official Codeforces channel in Telegram via the link https://t.me/codeforces_official. ×

→ Pay attention

Before contest
Codeforces Round 961 (Div. 2)
21:19:50
Register now »

*has extra registration

→ Streams

Codeforces Round 1995 (Div 2) - Official Solution Discussion

By Shayan

Before stream 23:24:49

Codeforces Round 960 Solution Discussion

By aryanc403

Before stream 23:29:49

CodeChef Starters 144 Solution Discussion

By aryanc403

Before stream 47:44:49

View all →

→ Top rated

#	User	Rating
1	tourist	3880
2	jiangly	3669
3	ecnerwala	3654
4	Benq	3627
5	orzdevinwang	3612
6	Geothermal	3569
6	cnnfls_csy	3569
8	jqdai0815	3532
9	Radewoosh	3522
10	gyh20	3447

Countries | Cities | Organizations

View all →

→ Top contributors

#	User	Contrib.
1	awoo	161
2	maomao90	160
3	adamant	156
4	maroonrk	153
5	-is-this-fft-	148
5	SecondThread	148
5	atcoder_official	148
8	Petr	147
9	nor	144
9	TheScrasse	144

View all →

→ Find user

→ Recent actions

Detailed →

saru95's blog

Pollard Rho Integer Factorization

By saru95, history, 9 years ago, In English

I am a newbie . I was studying this particular algorithm and I have some doubts that have arisen. Would be glad if someone could clear them .

My code goes like :

int pollardRho(int n) {
  if(n%2==0)
    return 2 ;
  srand (time(NULL)) ; 
  int x, y , g=1 , a;
  x = rand() % n + 1 ;
  y = x ;
  a = rand() % n + 1 ;
  while(g==1) {
    x = ((x*x) + a)%n ;  
    y = ((y*y) + a)%n ;
    y = ((y*y) + a)%n ;  
    g = gcd(abs(x - y), n) ;
  }
  return g ;
}

What I infer is that the use of (x*x) + a mod n is to generate another pseudo random number . But why cant we write it as :

int pollardRho(int n) {
  if(n%2==0)
    return 2 ;
  srand (time(NULL)) ; 
  int x, y , g=1 , a;
  while(g==1) {
    x = rand() % n + 1 ;
    y = rand() % n + 1 ;
    g = gcd(abs(x - y), n) ;
  }
  return g ;
}

where the rand() itself generates a new number . Basically, I want to know what is the significance of the equation being used .

factorisation, integer

saru95
9 years ago
5

Comments (5)

Write comment?

AlexandruValeanu

9 years ago, # |

You do realise that in the second one you only generate one x and one y? The first one generates many pairs (x, y).

→ Reply

saru95

9 years ago, # ^ |

Yes, but then I can put statements involving random in the while loop .

→ Reply

vitux

9 years ago, # |

because rand() generates integer in the interval [0..RAND_MAX], and RAND_MAX almost always equals 32767 (It depends on complator, and in popular compilators it equals 32767).

Actually, rand() is not recommended to use in C++ at all, because it generates small numbers, have no addtional parametres, and have bad distribution.

But C++11 library , which realises good random is much harder to use, so big amount of people still using rand() in CP codes.

→ Reply

Kaban-5

9 years ago, # |

Second code just calculates gcd(k, n), where k is random integer (but not really uniform, because you spoil the distribution by subtracting numbers) until it is not 1. For n = pq, where p and q are primes, your chance of success on each iteration is $\text{[math]}$ . If p and q are close to 10⁹, it is very small. Plus what vitux said.

First code is more clever. It, basically, looks at sequence r modulo n, where r₀ and a are chosen randomly and $\text{[math]}$ for 1 ≤ i. Over any prime modulo this sequence is cyclic, possibly not strictly. So, we can see that if n has a prime divisor $\text{[math]}$ , than we will after a few iterations (not more than p) find two elements in this sequence that are equal modulo p, so their difference is divisible by p, and so is n, so gcd(n, abs(x - y)) is divisible by p, so if x != y then gcd(n, abs(x - y)) is non-trivial divisor of n (it is divisible by p, so it is more than 1, it is less than n, because 0 ≤ x < n and 0 ≤ y < n). Why we will find such x and y? On k-th iteration x = r_k and y = r_2k, so if k is bigger than start of periodic part of sequence modulo p and divides length of period of sequence modulo p, then x and y are equal modulo p.

So, first algorithm finds non-trivial divisor of n in $\text{[math]}$ (amount of iterations multiplied by calculating gcd). But actually, it finds it faster because our sequence is almost random modulo p, so birthday paradox applies and total length of non-periodic part and first period is about $\text{[math]}$ , so complexity is closer to $\text{[math]}$ , kind of that.

→ Reply

hellman_

9 years ago, # |

← Rev. 13 →

Kaban-5 answered very comprehensively, I will just add that x² + a is also a compressing function, because x² maps p values to half of p values (mod p, for odd primes p). This fastens the algorithm, since working set size continuosly decreases.

WTF: why "pi divided by 2" is replaced with cur date in my comment?

PS: If you really want to try (but it's a bad idea!) to use rand() or random() instead of x² + a, you have to seed random with current number, because the numbers must form a chain:

x = ((x*x) + a)%n ;

becomes

srand(x);
x = rand() % n ;

→ Reply