Locality Sensitive Hashing
Algorithm
Overview
Developed byPiotr Indyk and Rajeev Motwani
Founded1998
Use caseapproximate nearest neighbor search in high-dimensional spaces
Technical
Integrates with
Knowledge graph stats
Claims36
Avg confidence90%
Avg freshness99%
Last updatedUpdated 5 days ago
WikidataQ1641203
Trust distribution
100% unverified
Governance
Not assessed
Locality Sensitive Hashing
concept
Algorithmic technique that hashes similar input items into the same buckets with high probability.
Compare with...developed by
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Piotr Indyk and Rajeev Motwani | ○Unverified | High | Fresh | 1 |
primary use case
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| approximate nearest neighbor search in high-dimensional spaces | ○Unverified | High | Fresh | 1 |
| approximate nearest neighbor search | ○Unverified | High | Fresh | 1 |
| similarity search in large datasets | ○Unverified | High | Fresh | 1 |
| dimensionality reduction for high-dimensional data | ○Unverified | High | Fresh | 1 |
| similarity search and clustering of documents | ○Unverified | Moderate | Fresh | 1 |
| duplicate detection in large datasets | ○Unverified | Moderate | Fresh | 1 |
| deduplication and clustering | ○Unverified | Moderate | Fresh | 1 |
| recommendation systems and information retrieval | ○Unverified | Moderate | Fresh | 1 |
founded year
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| 1998 | ○Unverified | High | Fresh | 1 |
alternative to
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| exhaustive k-nearest neighbors search | ○Unverified | High | Fresh | 1 |
| brute force nearest neighbor search | ○Unverified | High | Fresh | 1 |
| exact nearest neighbor search | ○Unverified | High | Fresh | 1 |
| k-d trees | ○Unverified | Moderate | Fresh | 1 |
| tree-based nearest neighbor methods | ○Unverified | Moderate | Fresh | 1 |
supports protocol
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Hamming distance | ○Unverified | High | Fresh | 1 |
| Jaccard similarity | ○Unverified | High | Fresh | 1 |
| cosine similarity | ○Unverified | Moderate | Fresh | 1 |
supports model
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| MinHash for Jaccard similarity | ○Unverified | High | Fresh | 1 |
| random projection for Euclidean distance | ○Unverified | Moderate | Fresh | 1 |
based on
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| probabilistic hashing techniques | ○Unverified | High | Fresh | 1 |
| probabilistic dimension reduction techniques | ○Unverified | Moderate | Fresh | 1 |
implemented by
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Apache Spark MLlib | ○Unverified | High | Fresh | 1 |
| scikit-learn | ○Unverified | Moderate | Fresh | 1 |
integrates with
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| machine learning frameworks | ○Unverified | Moderate | Fresh | 1 |
| Apache Spark MLlib | ○Unverified | Moderate | Fresh | 1 |