JarvisBitz Tech
System Blueprint

Search Blueprint

Ingest → Embed → Index → Retrieve → Re-Rank → Present. Intelligent search from raw data to ranked results.

The Pipeline

Six stages from raw data to ranked results


01

Data Ingestion

Multi-source connectors, format parsing, incremental sync, deduplication.

Raw data flows in from REST APIs, databases, file stores, webhooks, and third-party SaaS platforms. Format-aware parsers normalize structured, semi-structured, and unstructured content into a clean canonical form. Change-detection and incremental sync ensure only new or modified records enter the pipeline, while deduplication guards prevent redundant processing.

Technical Stack
REST API connectors
Database connectors
File parsers
Webhook listeners
Change detection
Queue management
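The change-detection and deduplication step can be sketched with content hashing: each record is hashed in a canonical form, and a record only re-enters the pipeline when its hash differs from the last ingested version. This is a minimal illustration — the record shape and the in-memory hash store are hypothetical; a production system would persist hashes in a durable store.

```python
import hashlib
import json


def content_hash(record: dict) -> str:
    """Stable hash of a record's canonical JSON form, used for change detection."""
    canonical = json.dumps(record, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def incremental_sync(records, seen_hashes):
    """Yield only records that are new or modified since the last sync.

    `seen_hashes` maps record id -> last ingested content hash.
    """
    for record in records:
        h = content_hash(record)
        if seen_hashes.get(record["id"]) == h:
            continue  # unchanged since last sync — skip as a duplicate
        seen_hashes[record["id"]] = h
        yield record
```

Running the same batch twice yields nothing the second time; editing a record makes only that record flow through again.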
Search Strategies

Four retrieval paradigms compared

Each strategy has distinct strengths. Hybrid search combines the strengths of keyword and semantic retrieval, making it the usual choice for production workloads.

Keyword (BM25)

Term-frequency matching with inverse document frequency weighting. The workhorse of traditional search.

Strengths
Exact match precision
Blazing fast
No model dependencies
Interpretable scoring
Trade-offs
No semantic understanding
Misses synonyms
Vocabulary mismatch
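The term-frequency/inverse-document-frequency scoring behind BM25 fits in a few lines. A minimal sketch over pre-tokenized documents, using the standard k1 and b defaults (real engines add stemming, stopword handling, and inverted indexes):

```python
import math
from collections import Counter


def bm25_scores(query_terms, docs, k1=1.5, b=0.75):
    """Score each tokenized doc against the query with classic BM25."""
    N = len(docs)
    avgdl = sum(len(d) for d in docs) / N
    # document frequency: how many docs contain each query term
    df = {t: sum(1 for d in docs if t in d) for t in query_terms}
    scores = []
    for doc in docs:
        tf = Counter(doc)
        score = 0.0
        for t in query_terms:
            if df[t] == 0:
                continue  # term appears nowhere — contributes nothing
            idf = math.log(1 + (N - df[t] + 0.5) / (df[t] + 0.5))
            # length-normalized term frequency with saturation
            denom = tf[t] + k1 * (1 - b + b * len(doc) / avgdl)
            score += idf * tf[t] * (k1 + 1) / denom
        scores.append(score)
    return scores
```

Note the trade-offs listed above fall straight out of the formula: a document that never contains the query term scores exactly zero, no matter how closely related it is in meaning.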

Semantic (Vector)

Embedding-based search that matches by meaning, not surface text. Handles paraphrasing and synonyms naturally.

Strengths
Meaning-based matching
Handles synonyms
Cross-lingual potential
Robust to typos
Trade-offs
Needs embedding model
Higher latency
Opaque scoring
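At its core, vector retrieval is nearest-neighbor search by cosine similarity over embeddings. A minimal sketch, assuming embeddings have already been produced by a model upstream (the toy 2-D vectors below are illustrative; real indexes use approximate nearest-neighbor structures like HNSW rather than a linear scan):

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def semantic_search(query_vec, index, top_k=3):
    """index: list of (doc_id, embedding) pairs. Returns top_k (doc_id, score)."""
    scored = [(doc_id, cosine(query_vec, emb)) for doc_id, emb in index]
    return sorted(scored, key=lambda x: x[1], reverse=True)[:top_k]
```

Because matching happens in embedding space rather than on surface tokens, paraphrases and synonyms land near each other — the "opaque scoring" trade-off is the flip side of the same property.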

Hybrid

Fuses dense vector search with sparse keyword matching via reciprocal rank fusion for the best of both worlds.

Strengths
Best overall accuracy
Semantic + exact match
Tunable fusion weights
Production-proven
Trade-offs
More infrastructure
Dual index maintenance
Tuning complexity
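Reciprocal rank fusion needs no score calibration between the two indexes: each result list contributes 1 / (k + rank) per document, and the sums are re-sorted. A minimal sketch with the commonly used constant k = 60 (weighting per list is one of the tunable knobs mentioned above; it is omitted here for brevity):

```python
def reciprocal_rank_fusion(rankings, k=60):
    """Fuse ranked result lists: score(d) = sum over lists of 1 / (k + rank).

    rankings: list of ranked doc-id lists (e.g. one from BM25, one from vectors).
    Returns doc ids sorted by fused score, best first.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)
```

A document ranked well by both lists outscores one ranked first by only one of them — which is exactly why fusion lifts overall accuracy.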

Multi-Modal

Unified search across text, images, and structured data using multi-modal embedding models.

Strengths
Cross-modal retrieval
Image + text queries
Richer understanding
Unified index
Trade-offs
Requires multi-modal embeddings
Larger vectors
Higher compute cost

Embedding Models

Embedding selection guide

Your embedding model is the foundation of search quality. We benchmark against your domain data to find the optimal choice.

01

OpenAI · standard tier

Dims: 1,536
MTEB: 61.0
Latency: ~50ms
Cost: $$

02

Cohere · Embed

Dims: 1,024
MTEB: 64.5
Latency: ~40ms
Cost: $$

03

BAAI · BGE (large)

Dims: 1,024
MTEB: 63.9
Latency: ~15ms
Cost: Self-host

04

Microsoft · E5 (self-host)

Dims: 4,096
MTEB: 66.6
Latency: ~120ms
Cost: Self-host

05

Voyage AI · embeddings

Dims: 1,024
MTEB: 65.1
Latency: ~45ms
Cost: $$

06

Alibaba · GTE

Dims: 3,584
MTEB: 67.2
Latency: ~100ms
Cost: Self-host

Benchmarks based on the MTEB leaderboard. Actual performance varies by domain — we run A/B evaluations against your data before committing to a model.
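One way such an A/B evaluation can be scored is recall@k over a labeled query set: for each query, does at least one known-relevant document appear in the top k results from a given model? A minimal sketch — the query IDs and result shapes are illustrative, not a fixed evaluation harness:

```python
def recall_at_k(results, relevant, k=10):
    """Fraction of queries with at least one relevant doc in the top-k results.

    results:  {query_id: ranked list of doc ids} from one retrieval config.
    relevant: {query_id: set of doc ids judged relevant for that query}.
    """
    hits = sum(
        1
        for qid, ranked in results.items()
        if set(ranked[:k]) & relevant.get(qid, set())
    )
    return hits / len(results)
```

Running the same labeled queries through two candidate embedding models and comparing their recall@k gives a domain-specific signal that leaderboard averages cannot.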

Build intelligent search for your data.

Describe your data sources and search requirements. We'll design the embedding, indexing, and retrieval pipeline.