Skip to content

Commit 6fb2dbd

Browse files
authored
fix: inconsistent FLAT/HNSW algorithm recommendations (#409)
Updated algorithm recommendations in docs/api/schema.rst to provide consistent, performance-based guidance: FLAT Algorithm: • Before: Mixed recommendations (<10K in some places, <100K in others) • After: Consistent <100K vectors recommendation across all sections HNSW Algorithm: • Before: 10K-1M+ vectors (creating a gap between FLAT and HNSW) • After: 100K-1M+ vectors (providing clear transition point)
1 parent 2cbbae4 commit 6fb2dbd

File tree

1 file changed

+4
-4
lines changed

1 file changed

+4
-4
lines changed

docs/api/schema.rst

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -160,7 +160,7 @@ HNSW (Hierarchical Navigable Small World) - Graph-based approximate search with
160160

161161
**Use HNSW when:**
162162

163-
- Medium to large datasets (10K-1M+ vectors) requiring high recall rates
163+
- Medium to large datasets (100K-1M+ vectors) requiring high recall rates
164164
- Search accuracy is more important than memory usage
165165
- Need general-purpose vector search with balanced performance
166166
- Cross-platform deployments where hardware-specific optimizations aren't available
@@ -400,7 +400,7 @@ Algorithm Selection Guide
400400
- Minimal overhead
401401
- Exact but slow for large data
402402
* - **HNSW**
403-
- General purpose (10K-1M+ vectors)
403+
- General purpose (100K-1M+ vectors)
404404
- 95-99% recall, O(log n) search
405405
- Moderate (graph overhead)
406406
- Fast approximate search
@@ -414,13 +414,13 @@ When to Use Each Algorithm
414414
--------------------------
415415

416416
**Choose FLAT when:**
417-
- Dataset size < 10,000 vectors
417+
- Dataset size < 100,000 vectors
418418
- Exact results are mandatory
419419
- Simple setup is preferred
420420
- Query latency is not critical
421421

422422
**Choose HNSW when:**
423-
- Dataset size 10K - 1M+ vectors
423+
- Dataset size 100K - 1M+ vectors
424424
- Need balanced speed and accuracy
425425
- Cross-platform compatibility required
426426
- Most common choice for production

0 commit comments

Comments
 (0)