Off by one errors / Edge Cases in Binary Search/ 2-Pointer Technique

→ Pay attention

Before contest
Codeforces Round 968 (Div. 2)
39:12:03
Register now »

*has extra registration

→ Streams

Atcoder ABC368 Solution Discussion

By aryanc403

Before stream 14:22:02

Codeforces Round 968 Solution Discussion

By aryanc403

Before stream 41:17:02

View all →

→ Top rated

#	User	Rating
1	tourist	3947
2	jiangly	3734
3	Radewoosh	3646
4	jqdai0815	3620
4	Benq	3620
6	orzdevinwang	3612
7	ecnerwala	3581
8	Geothermal	3569
8	cnnfls_csy	3569
10	ksun48	3479

Countries | Cities | Organizations

View all →

→ Top contributors

#	User	Contrib.
1	awoo	162
2	maomao90	160
3	nor	156
4	cry	155
4	adamant	155
4	atcoder_official	155
4	-is-this-fft-	155
8	maroonrk	153
9	SecondThread	147
10	Petr	146

View all →

→ Find user

→ Recent actions

Detailed →

viralm's blog

Off by one errors / Edge Cases in Binary Search/ 2-Pointer Technique

By viralm, history, 8 years ago, In English

Often when trying to solve a problem involving Binary Search or 2-Pointer Technique, I make off by one errors, and/or fail to handle edge cases. Even if I know that my solution might fail on a particular edge case, to correct it takes a lot of time. I would like to know a method/implementation, such that I can code up the solutions without having to worry about edge cases, etc. For Binary Search, I know of one such method where to avoid infinite loop, we can use the following code: Say we want to find the maximum index in an array which satisfies certain property -->

while(hi-lo>1)
{
    int mid=(lo+hi)/2;
    int chk=check(mid);
    if(chk==1) lo=mid;
    else hi=mid-1;
}
int ans;
if(check(hi)==1) ans=hi;
else ans=lo;

If any better approach is available(for Binary Search), you are welcome to comment. Also, can you give a good implementation for 2 — Pointer Technique.

Thanks in advance.

binary seach, two-pointers, implementation

viralm
8 years ago
8

Comments (8)

Write comment?

dush1729

8 years ago, # |

Suppose we want to find the maximum index at which function returns a true value.

while(lo<=hi)
{
   mid=(lo+hi)/2;
   if(f(mid)) ans=mid, lo=mid+1;
   else hi=mid-1;
}
print ans

→ Reply

viralm

8 years ago, # ^ |

Your code gets into an infinite loop when lo==hi.

→ Reply

dush1729

8 years ago, # ^ |

It won't. Depending on what f(mid) returns either lo will be incremented or hi will be decremented which will make lo<=hi false.

→ Reply

viralm

8 years ago, # ^ |

Sorry. You are right.

→ Reply

_index

8 years ago, # |

Here.

→ Reply

viralm

8 years ago, # ^ |

Can you explain your code?

→ Reply

yeputons

8 years ago, # |

← Rev. 2 →

+23

I'd like to copy to my a part of my Quora answer almost verbatim here:

The most common mistake in implementing binary search is trying to remember or guess off-by-ones, correct termination conditions and pre-checks instead of understanding invariant of the algorithm. Some peoplestick to a specific implementation of binary search. But once you have an invariant and follow it, you won't make a mistake and will be able to write and understand any kind of binary search. Moreover, you will be able to find bugs easily.

Example: I want to find first element ≥ X in a sorted array. Invariant: I have a interval from L to R such that a_L < X and a_R ≥ X. Then, after I check element M in between, I set either L or R to M, preserving the invariant. Loop is terminated when answer is obvious — that is, there are no elements strictly between L and R, and answer is R. Oh, and initialization is easy as well: I can assume that my array has - ∞ before the beginning and + ∞ after the end. As I will never read these elements, I just initialize L = - 1 and R = l, where l is length of the array.

Another example: I want to find element X. Invariant: a[L]<X and a[R]>X. Initialization: same. If a_M is X, I return answer, otherwise I change L or R. If there are no elements between L and R, there is no answer.

So, my recommendation: basically, understand what exactly your binary search is required to return and what whether there are any "corner" cases or cases where answer is non-existent. If there are, you're walking on the edge and have to be extra careful with invariants.

Typically, you binary search problem should be formulated in the following form: there is a predicate f(x) such that it does not hold for some prefix of the array (possibly empty) and it holds for the remaining suffix, binary search is to find that border. If it's your case, then invariant is simple: f(a_L) = 0, f(a_R) = 1. Say, C++'s lower_bound and upper_bound's invariants are formulated in that way: lower_bound returns first element which is ≥ X (assuming that there is + ∞ after array's end), upper_bound return first element which is > X (under the same assumption). No corner cases.

→ Reply

riadwaw

8 years ago, # ^ |

← Rev. 2 →

BTW, after you formulated predicate, you may often use std::partition_point

→ Reply