Retrieval

Vector Index

The vector index provides fast semantic candidate retrieval for paraphrase-heavy queries.

Embedding path

A bi-encoder converts memory text and queries into a shared dense space. Vector similarity retrieves semantically close passages even when wording differs.

Normalize vectors consistently for both ingest and query paths to avoid score drift.

HNSW tradeoffs

Higher `ef_search` improves recall but increases latency.
Higher `M` improves graph connectivity but uses more memory.
Batch insertion patterns influence graph quality and cold-start behavior.

Typical tuning

vector_index:
  metric: cosine
  m: 24
  ef_construction: 200
  ef_search_default: 64

Operational checks

Track recall@k on a fixed evaluation set.
Monitor p95 query latency by entity size bucket.
Verify vector-id to memory-id mapping consistency after restarts.

Architecture

Vector Index

Embedding path

HNSW tradeoffs

Operational checks