科技回声

8 条评论

This depend entirely on being able to do the interpolation on your keys. He computes:<pre><code> offset = (key - min) / (max - min) </code></pre> but not all data can have arithmetic done on it like that. What if the data in your array are, for example, names?It's pretty trivial to beat binary search if your data are integers, or other keys on which you can do arithmetic. You can make the uniform distribution assumption, check it as you go, degrade to binary search if the assumption seems flawed, and then proceed from there.However, if you're given an array of unknown objects, a comparison function, and no way to do arithmetic on your objects, you can't beat binary search.

评论 #1438812 未加载

评论 #1438785 未加载

colomon将近 15 年前

I'm fascinated that so many of the commenters here seem to be focusing on the fact that the algorithm mentioned doesn't work in every case. My reaction was different -- I thought of a case where it would be less than ideal, and allowed it to inspire me to find an algorithm that worked better than naive binary search for that case, too.In particular, I thought of the case of searching arrays of strings. If you are doing repeated searches against the same array, you can easily build a table breaking the array down into subarrays all sharing the same first character. (I'm assuming ASCII there!) Using that can quickly eliminate 5 or so comparisons that would be required with a naive binary search.Okay, that's not brilliant, but might be useful in some circumstances. My bigger point is I thought this article was a very useful reminder that even a terrific general purpose algorithm like binary search can be beaten if you can apply specific domain knowledge to the search.

评论 #1439055 未加载

评论 #1439839 未加载

评论 #1461438 未加载

bnoordhuis将近 15 年前

Caveat emptor: The article fails to mention that interpolation search's worst-case performance is O(n), considerably worse than binary search's O(log n).

评论 #1438767 未加载

Tichy将近 15 年前

I have thought about this years ago, simply because it is also kind of the way humans search in telephone books. (Advantage of being born in a time when telephone books where still in use). I think humans would also do repeated interpolation, as some letters would be "thicker" (more pages in the book) than others.Bionics for the win...

aplari将近 15 年前

You can beat pretty much any algorithm if you make convenient extra assumptions. In this case they have suitable data for the implication search (good for them!), but the title promises too much.You can't beat binary search without new assumptions.

oinopion将近 15 年前

It works well only on uniformly distributed keys. Not everyone has pleasure to work with such data.

评论 #1438515 未加载

评论 #1439039 未加载

pascal_cuoq将近 15 年前

I do not know this language (Java?), but doesn't the line<pre><code> int offset = (int) (((size - 1) * ((long) key - min)) / (max - min)); </code></pre> compute (max - min) as an int (potentially overflowing), and then convert it to long, defeating the precautions obviously taken against this kind of event with the cast to long elsewhere?

评论 #1439085 未加载

panic将近 15 年前

Interesting article, but without actual numbers it's hard to tell whether the change made any difference in performance.

评论 #1438615 未加载

8 条评论

RiderOfGiraffes将近 15 年前

评论 #1438812 未加载

评论 #1438785 未加载

colomon将近 15 年前

评论 #1439055 未加载

评论 #1439839 未加载

评论 #1461438 未加载

bnoordhuis将近 15 年前

Caveat emptor: The article fails to mention that interpolation search's worst-case performance is O(n), considerably worse than binary search's O(log n).

评论 #1438767 未加载

Tichy将近 15 年前

aplari将近 15 年前

oinopion将近 15 年前

It works well only on uniformly distributed keys. Not everyone has pleasure to work with such data.

评论 #1438515 未加载

评论 #1439039 未加载

pascal_cuoq将近 15 年前

评论 #1439085 未加载

panic将近 15 年前

Interesting article, but without actual numbers it's hard to tell whether the change made any difference in performance.

评论 #1438615 未加载

Beating Binary Search

8 条评论

Beating Binary Search

8 条评论