SuperLocalMemory adapts to your infrastructure — from zero-cloud local installs to cloud-LLM power mode. Pick once, change any time.
Mode A uses TF-IDF statistical embeddings — no neural network, no GPU, no external API. All storage lives in SQLite on your machine. Retrieval uses the Fisher-Rao information metric for mathematically-grounded recall, not cosine heuristics.
Mode B plugs into your local Ollama instance to generate high-quality neural embeddings. Same Fisher-Rao retrieval, same local storage — but with the semantic depth of a full language model running on your hardware.
Mode C uses OpenAI or Anthropic embedding APIs for the highest-quality semantic representations. Your memories stay local — only embedding generation hits the cloud. Ideal when accuracy matters more than absolute privacy.
Every tradeoff, visible at a glance. No asterisks.
| Feature | Mode A | Mode B | Mode C |
|---|---|---|---|
| No internet required | ✓ | ✓ | – |
| No GPU needed | ✓ | – | ✓ |
| No API key | ✓ | ✓ | – |
| Setup time | < 1 min | ~5 min | ~2 min |
| LoCoMo accuracy | 74.8% | 82.3% | 87.7% |
| Embedding type | TF-IDF (statistical) | Ollama (neural, local) | OpenAI / Anthropic |
| Storage backend | SQLite + in-memory | SQLite + LanceDB | SQLite + LanceDB |
| Recommended for | Privacy · CI/CD · Default | Local GPU teams | Max accuracy |
SLM re-indexes your existing memories using the new embedding method in the background. No data loss. No interruption. The command takes under a second — re-indexing happens asynchronously.