Numbers that matter
Audited against 1,000-chunk noise haystacks. No cherry-picked queries.
Two modes. One memory.
GLIA runs as a browser extension and an MCP server simultaneously, sharing the same database. Use either or both.
For quick, everyday chats. The browser extension invisibly injects context from your codebase directly into your prompts on Claude.ai and ChatGPT, letting you chat without copy-pasting.
- Auto-intercepts prompts before sending
- Prepends relevant project context silently
- Save full conversations with one click
- Works across 7 AI platforms
For local development. The MCP Server hooks directly into your code editor, allowing the AI to recall memories automatically based on your current project path.
- Native tools: recall_context, store_memory
- search_memory across all projects globally
- Auto-identifies project from working directory
- Zero-Docker — single SQLite file
Both interfaces read and write the same database. Save in ChatGPT, recall in Cursor. Instantly.
Everything your AI needs to actually remember
Not a wrapper. Not a cloud service. A local memory infrastructure that plugs into every tool you already use.
Hybrid RAG Engine
Three search layers fused: Sentence Vector + Chunk Vector + FTS5 keyword. Surgical trimming returns only the matching sentences.
7 AI Platforms
Auto-intercepts prompts on Claude, ChatGPT, Gemini, DeepSeek, Grok, Copilot, and Mistral. No copy-paste required.
Native MCP Tools
recall_context, store_memory, search_memory, list_projects, identify_project and more — native tool calls in every coding agent.
Shared Memory Bridge
Memory saved in a browser chat is instantly available in your coding tool. One SQLite database. Two interfaces.
100% Project Isolation
recall_context is SQL-scoped to the project. Project A's data never leaks into Project B, even with semantically similar queries.
Portable JSON Sessions
Share context across machines or with teammates instantly. Download any project session as a clean JSON file and import it on another PC.
Zero-Docker Mode
Set GLIA_STORAGE_MODE=sqlite and eliminate all containers. SQLite + sqlite-vec delivers full RAG on any machine.
Knowledge Graph
Conversations are extracted into a D3 force-directed graph of entities and relationships. Browse your project's architecture visually.
HyDE Retrieval
Hypothetical Document Embeddings generate a synthetic answer to your query, then search by that embedding — improving recall on rephrased queries.
Stop re-explaining yourself.
One command. Persistent memory across every AI tool you use. Runs entirely on your machine.
