Cognitive Extraction
Transform raw text into structured knowledge using integrated Neural and Symbolic extractors.
- Neural Entity Extraction (BERT-NER)
- Autonomous Relationship Discovery
- Implicit Preference Detection
Aletheia is currently under development. For early access,contact us.
Hybrid vector + BM25 search, knowledge graphs, deterministic analytics, and fact supersession in a single Rust binary. Self-host or deploy on our platform with one click.
Single binary, zero deps
Vector, FTS, Graph, KV, Analytics
Timestamps, decay, supersession
Typed RDF with BFS traversal
Deterministic numeric queries
Air-gapped, your infra
One-click deploy
Hybrid retrieval latency
Vector databases are giant warehouses of static text. They find words, but they do not understand life. They lose context, ignore the passage of time, and drown in their own noise.
Retrieves conflicting data from 2 years ago exactly like data from 2 minutes ago. No concept of evolving truth.
Cannot accurately aggregate or count facts (e.g. 'How many cars do I own?'). Relies entirely on the LLM to do math.
Stores every single conversational 'uh' and 'um' instead of maintaining a clean, structured user profile.
When life changes (e.g. moving from NYC to SF), Aletheia marks the old fact as stale, ensuring the LLM always gets the latest truth.
Built-in execution layer accurately computes numeric and temporal queries before hitting the LLM, fixing benchmark failures.
Distills thousands of words into compact, continuous user profiles. We track the deltas, you save on context windows.
Aletheia is architected to resolve the fundamental recall failures of standard vector databases. Preliminary benchmarks on LongMemEval-S indicate significant improvements in handling numeric and temporal aggregation tasks.
Aletheia tracks the evolution of truth. When new facts arrive, old ones are superseded, preventing stale data from leaking into your agent's current worldview.
Raw chat logs are noise. Aletheia acts as a cognitive filter, distilling human rambling into a clean, queryable lattice of facts.
I used to love coffee, but now I only drink tea. Aletheia does not hallucinate your old preferences. It updates your profile in real time.
Our engine automatically discards greetings and filler, keeping only the high-value semantic facts that actually matter for personalization.
Our White Mercedes engine ensures your user's identity is not locked inside a single chat window.
Hey! I just bought a white Mercedes! What should I do first?
GPT-4o detects: User Ownership → Vehicle: Mercedes (White)
Fact Integration
What was that maintenance tip for my car?
Claude 3.5 recalls: "For your white Mercedes, I recommend..."
Engineered for builders who need more than just a place to dump text files.
Memory that follows the user, not the model. Switch from GPT-4 to Claude to Llama and Aletheia keeps the brain intact across every integration.
When life changes, Aletheia knows. If a user moves from NYC to LA, the old fact is marked as superseded so stale context stops leaking into answers.
Built with Rust as a single compiled binary. Deployment is fast and recall latency stays in the sub-100ms range without orchestration drama.
Aletheia is not just storage; it is a multi-stage cognitive processor that transforms raw noise into reliable agentic state.
Automatically detects if the user is asking for numbers, preferences, or narrative history.
Applies a secondary precision pass to ensure the top-k candidates are semantically perfect.
Computes aggregates (sums, counts) before delivery, preventing LLM arithmetic errors.
Experience how Aletheia organizes memories. Drag nodes to interact with the underlying graph logic where new facts supersede the old.
Nodes represent discrete semantic facts, preferences, and entities stored within the Rust engine.
Red nodes indicate **superseded memories**—stale data that has been automatically invalidated by more recent truths.
We did not just build a wrapper. We built a high-performance memory kernel from the ground up for safety and scale.
Ultimate memory safety and blazing-fast execution. No garbage collection pauses, just low-level performance where it matters.
No complex Docker chains. One file, zero configuration, instant memory synchronization across your stack.
Human-like recall speeds that keep up with your fastest LLM workflows without turning memory into the bottleneck.

Every layer from ingest semantics to production operations is implemented with one cohesive design system and runtime story.
Transform raw text into structured knowledge using integrated Neural and Symbolic extractors.
Track and aggregate numeric truth (money, counts, distances) with absolute deterministic precision.
The engine models memory as something that changes over time, not a static bag of embeddings.
SDK and HTTP entry points are designed to keep local-first workflows aligned with hosted deployments.
The product has a clear progression: ingest fidelity, retrieval intelligence, and operational reliability.
Raw events are normalized, deduplicated, and expanded into durable memories with lineage.
Semantic and lexical candidates are fused, reranked, then filtered by temporal policy before response.
Teams deploy one memory engine surface from local bench runs to hosted multi-tenant workloads.
Experience Aletheia's real-time ingestion and recall loop. Store a fact, then retrieve it across model contexts.
Ingestion Layer
Serverless Pulse
Ping the engine to reduce cold-start latency before testing the demo.
Retrieval Kernel
Ingest Latency
---
Query Latency
---
Retrieved Memory Hits
Book A Session
Discuss architecture, integration strategy, and production rollout for your memory stack.
One binary, five memory substrates, zero lock-in. Start building persistent, self-improving agents today.
Open Source Engine
The Rust-powered temporal memory engine that runs anywhere. Hybrid vector + BM25 search, knowledge graph traversal, deterministic analytics, and fact supersession — all in a single binary. No cloud dependency. No vendor lock-in.
Managed Cloud Service
A full SaaS experience on top of the core engine. Deploy clusters in one click, manage your team, track usage with analytics, explore knowledge graphs visually, and never worry about infrastructure.
We provide the tooling to make persistent memory a first-class citizen in your development workflow, from local testing to global scale.
An OpenAI-compatible gateway that automatically injects memories into your agent's system prompt. Zero code changes required.
Unified command-line tool to manage your engine, run local benchmarks, and monitor memory logs in real-time.
Built-in support for the Model Context Protocol. Connect Aletheia directly to Claude Code, Cursor, and agentic IDEs.