Embeddings

Definition

Numerical representations of data (text, images, audio) in high-dimensional space where similar items are close together.

Use Cases

Provider Equivalents

Frequently Asked Questions

What’s the difference between embeddings and vector databases?
Embeddings are the vectors (numbers) produced by a model to represent meaning. A vector database (or vector index) is the system that stores those vectors and lets you quickly search for “nearest” vectors (most similar items). You often use both: generate embeddings with a model, then store/search them in a vector database.
When should I use embeddings?
Use embeddings when you need meaning-based matching instead of exact keyword matching. Common cases include semantic search, retrieval-augmented generation (RAG) for chatbots over your documents, recommendations (“items like this”), deduplication/near-duplicate detection, clustering topics, and similarity-based classification.
How much do embeddings cost?
Costs usually come from (1) generating embeddings (priced by input size, often per token for text or per image) and (2) storing/querying vectors (database/index compute, storage, and read/write operations). Total cost depends on how many items you embed, how often you re-embed (e.g., when content changes), vector dimension size, and query volume/latency requirements.

Category: ai-ml

Difficulty: advanced

Related Terms

See Also