Use FastEmbed for local, offline embedding generation with no API costs and complete privacy
BGE Models (Recommended)
Model | Dimensions | Description | Best For |
---|---|---|---|
BAAI/bge-small-en-v1.5 | 384 | Compact, fast | Development, quick prototyping |
BAAI/bge-base-en-v1.5 | 768 | Balanced quality/speed | General production use |
BAAI/bge-large-en-v1.5 | 1024 | Highest quality | Quality-critical applications |
Sentence Transformers
Model | Dimensions | Description |
---|---|---|
sentence-transformers/all-MiniLM-L6-v2 | 384 | Fast, lightweight |
sentence-transformers/all-mpnet-base-v2 | 768 | High quality, balanced |
Model Caching
Batch Processing
Hardware Optimization
Metric | FastEmbed (BGE-Large) | OpenAI (3-Large) | Notes |
---|---|---|---|
Dimensions | 1024 | 3072 | OpenAI has higher dimensionality |
Quality | Very Good | Excellent | OpenAI slightly better for complex tasks |
Speed | Fast (local) | Medium (API) | FastEmbed faster for small batches |
Cost | Free | $0.13/1M tokens | FastEmbed eliminates API costs |
Privacy | Complete | API-dependent | FastEmbed keeps data local |