Three Operating Modes
You choose the privacy-accuracy tradeoff. Switch anytime. Your data stays consistent across all modes.
Mode A
Local Guardian
Zero cloud calls. All memory operations — storage, encoding, retrieval, lifecycle — execute locally. Your data never leaves your machine under any circumstance.
LoCoMo Retrieval — data stays local
Pure zero-LLM (no LLM at any stage)
Best for: Regulated industries, air-gapped networks, privacy-conscious developers, EU compliance requirements.
$ slm mode a
Mode B
Smart Local
Everything in Mode A, plus a local LLM via Ollama for answer synthesis and enhanced fact extraction. All processing stays on your machine — nothing sent to any cloud.
Best for: Developers who want composed answers but need data to stay local, and teams whose machines have 16GB+ RAM to run local models.
$ slm mode b
Mode C
Full Power
Maximum accuracy. Cloud LLM participates at every layer — fact extraction, answer synthesis, agentic multi-round retrieval. Data leaves your machine for processing. This is the configuration comparable to other memory systems in the field.
LoCoMo — competitive with EverMemOS (92.3%)
Best for: Teams that need maximum accuracy and have organizational approval for cloud access.
$ slm mode c
Feature Comparison
| Feature | A | B | C |
|---|---|---|---|
| Semantic search | | | |
| BM25 keyword search | | | |
| Entity graph traversal | | | |
| Temporal retrieval | | | |
| Fisher-Rao scoring | | | |
| Sheaf consistency | | | |
| Langevin lifecycle | | | |
| Cross-encoder reranking | | | |
| LLM answer synthesis | No | Yes | Yes |
| Agentic multi-round retrieval | | | |
| Data leaves device | No | No | Yes |
| EU AI Act compliant | | | |
| Internet required | | | |
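The "Best for" notes under each mode can be read as a small decision rule. The sketch below is one possible reading, not logic that ships with slm: `recommend_mode` and `mode_command` are hypothetical helper names, and the 16 GB threshold is taken from the Mode B note. Only the `slm mode a|b|c` command itself comes from this page.

```python
from __future__ import annotations

import shlex

VALID_MODES = {"a", "b", "c"}


def recommend_mode(wants_composed_answers: bool,
                   cloud_approved: bool,
                   ram_gb: float) -> str:
    """Pick a mode letter from the 'Best for' notes (an assumed reading)."""
    if not wants_composed_answers:
        return "a"  # Mode A: no LLM at any stage, data stays local
    if cloud_approved:
        return "c"  # Mode C: cloud LLM, data leaves the machine
    if ram_gb >= 16:
        return "b"  # Mode B: local LLM via Ollama, data stays local
    return "a"      # not enough RAM for a local model: stay on Mode A


def mode_command(mode: str) -> list[str]:
    """Build the argv for the `slm mode <x>` command shown on this page."""
    if mode not in VALID_MODES:
        raise ValueError(f"unknown mode: {mode!r}")
    return ["slm", "mode", mode]


if __name__ == "__main__":
    mode = recommend_mode(wants_composed_answers=True,
                          cloud_approved=False,
                          ram_gb=32)
    # Print the command rather than running it, so the sketch
    # works on a machine without slm installed.
    print(shlex.join(mode_command(mode)))  # prints: slm mode b
```

Because the helper only builds the command, the choice can be reviewed before anything is switched. Passing the returned list to `subprocess.run` would execute it on a machine where slm is installed.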
Frequently Asked Questions
Can I switch between modes after installation?
Which mode should I start with?
Does Mode A really work without any LLM?
What LLM providers does Mode C support?
Start with Mode A. Upgrade when you need to.
One install. Choose your mode. Switch anytime. Your memories stay consistent.