Locality Sensitive Hashing

conceptAlgorithm

Try in Playground →

Overview

Developed byPiotr Indyk and Rajeev Motwani

Founded1998

Use caseapproximate nearest neighbor search in high-dimensional spaces

Technical

Protocols

Hamming distance Jaccard similarity cosine similarity

Integrates with

machine learning frameworks Apache Spark MLlib

Also see

Alternative to

exhaustive k-nearest neighbors search brute force nearest neighbor search exact nearest neighbor search k-d trees tree-based nearest neighbor methods

Based onprobabilistic hashing techniques

Knowledge graph stats

Claims36

Avg confidence90%

Avg freshness99%

Last updatedUpdated 5 days ago

WikidataQ1641203

Trust distribution

100% unverified

Governance

Not assessed

Contribute governance data →

Locality Sensitive Hashing

concept

Algorithmic technique that hashes similar input items into the same buckets with high probability.

Compare with...

developed by

Value	Trust	Confidence	Freshness	Sources
Piotr Indyk and Rajeev Motwani	○Unverified	High	Fresh	1

primary use case

Value	Trust	Confidence	Freshness	Sources
approximate nearest neighbor search in high-dimensional spaces	○Unverified	High	Fresh	1
approximate nearest neighbor search	○Unverified	High	Fresh	1
similarity search in large datasets	○Unverified	High	Fresh	1
dimensionality reduction for high-dimensional data	○Unverified	High	Fresh	1
similarity search and clustering of documents	○Unverified	Moderate	Fresh	1
duplicate detection in large datasets	○Unverified	Moderate	Fresh	1
deduplication and clustering	○Unverified	Moderate	Fresh	1
recommendation systems and information retrieval	○Unverified	Moderate	Fresh	1

founded year

Value	Trust	Confidence	Freshness	Sources
1998	○Unverified	High	Fresh	1

alternative to

Value	Trust	Confidence	Freshness	Sources
exhaustive k-nearest neighbors search	○Unverified	High	Fresh	1
brute force nearest neighbor search	○Unverified	High	Fresh	1
exact nearest neighbor search	○Unverified	High	Fresh	1
k-d trees	○Unverified	Moderate	Fresh	1
tree-based nearest neighbor methods	○Unverified	Moderate	Fresh	1

supports protocol

Value	Trust	Confidence	Freshness	Sources
Hamming distance	○Unverified	High	Fresh	1
Jaccard similarity	○Unverified	High	Fresh	1
cosine similarity	○Unverified	Moderate	Fresh	1

supports model

Value	Trust	Confidence	Freshness	Sources
MinHash for Jaccard similarity	○Unverified	High	Fresh	1
random projection for Euclidean distance	○Unverified	Moderate	Fresh	1

based on

Value	Trust	Confidence	Freshness	Sources
probabilistic hashing techniques	○Unverified	High	Fresh	1
probabilistic dimension reduction techniques	○Unverified	Moderate	Fresh	1

implemented by

Value	Trust	Confidence	Freshness	Sources
Apache Spark MLlib	○Unverified	High	Fresh	1
scikit-learn	○Unverified	Moderate	Fresh	1

integrates with

Value	Trust	Confidence	Freshness	Sources
machine learning frameworks	○Unverified	Moderate	Fresh	1
Apache Spark MLlib	○Unverified	Moderate	Fresh	1

Alternatives & Similar Tools

exhaustive k-nearest neighbors search

alternative to

Compare →

brute force nearest neighbor search

alternative to

Compare →

exact nearest neighbor search

alternative to

Compare →

k-d trees

alternative to

Compare →

tree-based nearest neighbor methods

alternative to

Compare →

Commonly Used With

machine learning frameworks Apache Spark MLlib

Related entities

Claim count: 36Last updated: 4/5/2026Edit history