On the Efficient Determination of Most Near Neighbors. Mark S. Manasse

On the Efficient Determination of Most Near Neighbors

Год выпуска: 0

Автор произведения: Mark S. Manasse

Серия: Synthesis Lectures on Information Concepts, Retrieval, and Services

Жанр: Компьютеры: прочее

Издательство: Ingram

isbn: 9781627054942

Краткое описание:

The time-worn aphorism «close only counts in horseshoes and hand grenades» is clearly inadequate. Close also counts in golf, shuffleboard, archery, darts, curling, and other games of accuracy in which hitting the precise center of the target isn't to be expected every time, or in which we can expect to be driven from the target by skilled opponents.
This book is not devoted to sports discussions, but to efficient algorithms for determining pairs of closely related web pages—and a few other situations in which we have found that inexact matching is good enough – where proximity suffices. We will not, however, attempt to be comprehensive in the investigation of probabilistic algorithms, approximation algorithms, or even techniques for organizing the discovery of nearest neighbors. We are more concerned with finding nearby neighbors; if they are not particularly close by, we are not particularly interested.
In thinking of when approximation is sufficient, remember the oft-told joke about two campers sitting around after dinner. They hear noises coming towards them. One of them reaches for a pair of running shoes, and starts to don them. The second then notes that even with running shoes, they cannot hope to outrun a bear, to which the first notes that most likely the bear will be satiated after catching the slower of them. We seek problems in which we don't need to be faster than the bear, just faster than the others fleeing the bear.