Pollard rho factorization doubt.

#	User	Rating
1	ecnerwala	3648
2	Benq	3580
3	orzdevinwang	3570
4	cnnfls_csy	3569
5	Geothermal	3568
6	tourist	3565
7	maroonrk	3530
8	Radewoosh	3520
9	Um_nik	3481
10	jiangly	3467

#	User	Contrib.
1	maomao90	174
2	adamant	164
2	awoo	164
4	TheScrasse	160
5	nor	159
6	maroonrk	156
7	SecondThread	150
8	-is-this-fft-	148
9	pajenegod	146
10	BledDest	144

I am reading the article on this website https://www.cs.colorado.edu/~srirams/courses/csci2824-spr14/pollardsRho.html to understand Pollard rho factorization.

I will explain my doubt with an example.

Suppose for N = p * q, we have 35 = 5 * 7 where N = 35,p = 5,q = 7.

Probability of finding factor 5 from [1...35] is 1 / 35.

To increase the probability of getting the factor 5, we search for number x such that GCD(x, N) = 5. Numbers 5, 10, 15, 20, 25, 30 satisfy the equation GCD(x, N) = 5. So probability of finding factor 5 increases to 6 / 35.

Now they have mentioned that to increase the probability further, we find pairs of numbers such that |X_i - X_j| = Y_i where Y_i is 5, 10, 15, 20, 25, 30.

Number of pairs satisfying the criteria |X_i - X_j| = 5 is 30 * 2 = 60, similarly number of pairs satisfying |X_i - X_j| = 10 is 25 * 2 = 50 and so on..

So to get factor 5, we need to search (30 + 25 + 20 + 15 + 10 + 5) * 2 pairs out of 35 * 35 pairs. So the probability of getting required pair is (210) / (35 * 35) = 6 / 35 which is same as the one where we found factor 5 using GCD(x, N) without searching for pairs.

So my doubt is, to find required factor, why do we search for $\text{[math]}$ such that GCD(|X_i - X_j|, N) = p instead of GCD(X_i, N) = p as their probabilities are same ?

Comments (3)

Write comment?

hellman_

6 years ago, # |

The probability is the same but you need to take less X_i. For example you take k elements X_i, or you can take $\text{[math]}$ elements X_i and take all pairs — the amount will be the same (roughly). However, you can note that you still have to check each element/pair and the time complexity is still the same. And here comes the Pollard rho algorithm which performs a random walk and captures the correct pair at some point.

→ Reply

zephyr_23

6 years ago, # ^ |

Yes, I understand that pollard rho performs random walk and captures correct pair. But why can't pollard rho perform a random walk to capture a single value as I asked in the question? According to the probability calculation, it should not matter whether we search for a pair or a single value right (as probabilities are same) ?

You can do walk and capture single value but then you have to walk mod N through roughly N^1 / 2 values until you get multiple of p (which you get with probability 1 / q).

To see why rho is better, consider your walk mod N but modulo p. We expect to have a loop in values mod p after p^1 / 2 ≤ N^1 / 4 steps. Then a cycle-finding algorithm allows to find two values a_i, a_j that have same value mod p but probably have different values mod q. The time complexity is proportional to size of the loop. That's where you get improvement and get N^1 / 4 instead of N^1 / 2.

zephyr_23's blog