Memory Indexer

A cognitive memory system for LLMs that implements a human-inspired 3-axis memory architecture.

The Problem

LLMs face a fundamental constraint: finite context windows.

┌─────────────────────────────────────────────────────┐
│  Session 1  │  Session 2  │  Session 3  │  Current  │
│   (lost)    │   (lost)    │   (lost)    │  (active) │
└─────────────────────────────────────────────────────┘

Current workarounds fall short:

| Approach | Limitation |
| --- | --- |
| Summarization | Information loss, extra LLM calls |
| Sliding Window | Important early context lost |
| Full History | Hits token limits quickly |
| RAG | Not optimized for conversation context |

The Solution

Memory Indexer provides Zero Context Engineering—you focus on your prompt, we handle all memory management.

Before (manual context management):

class ChatService:
    def chat(self, message):
        # You manage: history, summarization, token counting,
        # context assembly, profile loading, fact extraction...
        if self.count_tokens(self.history) > MAX_TOKENS:
            self.history = self.summarize(self.history)  # 😓

After (with Memory Indexer):

class ChatService:
    async def chat(self, message):                     # async: the body awaits
        await memory.store(session, message)           # Auto-classify, auto-place
        context = await memory.recall(message)         # Intelligent retrieval
        return await llm.generate(context, message)    # Done.

"The goal of memory is not to transmit the most accurate information over time, but to guide and optimize intelligent decision-making by only preserving valuable information." — Richards & Frankland (2017)

Role & Scope

| What It Is | What It Isn't |
| --- | --- |
| General-purpose memory primitives | A chatbot framework |
| Cognitive science-based architecture | A vector database |
| MCP server for any LLM client | Tied to specific use cases |
| Domain-agnostic building blocks | An opinionated application |

Core Architecture

The 3-Axis Memory Model gives each memory three orthogonal dimensions:

Type × Scope × Tier = What × When × Where

| Axis | Values | Cognitive Basis |
| --- | --- | --- |
| Type | Episodic, Semantic, Procedural, Fact, Reflection | Tulving's memory classification |
| Scope | Turn, Topic, Session, User | Temporal reach (seconds → forever) |
| Tier | Buffer, Short, Long, Archive | Atkinson-Shiffrin + Baddeley |
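As a rough illustration of the three axes, the sketch below models a memory as a record carrying one value per axis. It is written in Python for brevity and is not the SDK's type system: the names `MemoryType`, `Scope`, `Tier`, and `MemoryRecord` are assumptions made for this example, and only the axis values come from the table above.

```python
from dataclasses import dataclass
from enum import Enum


class MemoryType(Enum):  # "What" axis
    EPISODIC = "episodic"
    SEMANTIC = "semantic"
    PROCEDURAL = "procedural"
    FACT = "fact"
    REFLECTION = "reflection"


class Scope(Enum):  # "When" axis
    TURN = "turn"
    TOPIC = "topic"
    SESSION = "session"
    USER = "user"


class Tier(Enum):  # "Where" axis
    BUFFER = 0
    SHORT = 1
    LONG = 2
    ARCHIVE = 3


@dataclass(frozen=True)
class MemoryRecord:
    """Hypothetical record: one value per axis plus the content."""
    content: str
    type: MemoryType
    scope: Scope
    tier: Tier


# The axes are independent, so any combination of values is a valid coordinate.
AXIS_COMBINATIONS = len(MemoryType) * len(Scope) * len(Tier)

record = MemoryRecord("User prefers dark mode", MemoryType.FACT, Scope.USER, Tier.SHORT)
print(record.type.name, record.scope.name, record.tier.name)
```

Because the axes are orthogonal, changing a record's tier (for example, promoting it) does not require touching its type or scope.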

Tier Promotion Pipeline (Atkinson-Shiffrin + Tulving):

┌─────────────────────────────────────────────────────┐
│  Buffer (T0) - Sensory Store                        │
│  TTL: 60s idle │ 500 tokens │ 3 turns               │
├─────────────────────────────────────────────────────┤
│  Short (T1) - Working Memory (Miller's 7±2)         │
│  Capacity: 9 items, auto-promote when exceeded      │
├─────────────────────────────────────────────────────┤
│  Long (T2) - Episodic Memory                        │
│  Session-level events and experiences               │
├─────────────────────────────────────────────────────┤
│  Archive (T3) - Semantic Memory                     │
│  Promotion: Confidence ≥ 0.8 AND Confirms ≥ 3       │
└─────────────────────────────────────────────────────┘
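The promotion rules in the diagram can be sketched as follows. This is a minimal Python model, not the library's implementation: the class and method names are hypothetical, the Buffer tier's TTL handling is omitted, and moving the oldest item out of Short on overflow is an assumption (the diagram only says items auto-promote when capacity is exceeded).

```python
from collections import deque
from dataclasses import dataclass

SHORT_CAPACITY = 9            # from the diagram: Short (T1) holds 9 items
ARCHIVE_MIN_CONFIDENCE = 0.8  # from the diagram: Confidence >= 0.8
ARCHIVE_MIN_CONFIRMS = 3      # from the diagram: Confirms >= 3


@dataclass
class Item:
    text: str
    confidence: float = 0.0
    confirms: int = 0


def is_archive_ready(item: Item) -> bool:
    """Both thresholds must hold (logical AND) for promotion to Archive."""
    return (item.confidence >= ARCHIVE_MIN_CONFIDENCE
            and item.confirms >= ARCHIVE_MIN_CONFIRMS)


class TieredMemory:
    """Hypothetical three-tier holder: Short -> Long -> Archive."""

    def __init__(self):
        self.short = deque()
        self.long = []
        self.archive = []

    def add_to_short(self, item: Item) -> None:
        # Assumption: on overflow, the oldest Short item moves to Long.
        self.short.append(item)
        while len(self.short) > SHORT_CAPACITY:
            self.long.append(self.short.popleft())

    def promote_to_archive(self) -> list:
        # Partition Long into items that meet both thresholds and the rest.
        ready = [i for i in self.long if is_archive_ready(i)]
        self.long = [i for i in self.long if not is_archive_ready(i)]
        self.archive.extend(ready)
        return ready
```

Adding eleven items to an empty `TieredMemory` leaves nine in Short and moves the two oldest to Long; an item in Long with confidence 0.9 but only two confirmations stays in Long.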

Benchmark Summary

| Operation | Latency | Throughput |
| --- | --- | --- |
| Store | ~2.3 μs | 435K ops/s |
| Recall (limit 5) | ~1.5 μs | 667K ops/s |
| Store→Recall workflow | ~3.8 μs | 263K ops/s |

Measured with in-memory storage and mock embeddings, so these figures exclude embedding generation and disk I/O. See Benchmark Details for full results.
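The throughput figures are consistent with single-threaded throughput being the reciprocal of latency. The short Python check below reproduces them from the latencies in the table; the function name `ops_per_second` is made up for this example.

```python
def ops_per_second(latency_us: float) -> float:
    """Convert a per-operation latency in microseconds to operations per second."""
    return 1_000_000 / latency_us


latencies_us = {
    "Store": 2.3,
    "Recall (limit 5)": 1.5,
    "Store→Recall workflow": 3.8,
}

for name, latency in latencies_us.items():
    print(f"{name}: {ops_per_second(latency) / 1000:.0f}K ops/s")
```

The workflow latency (~3.8 μs) is also the sum of the Store and Recall latencies (~2.3 μs + ~1.5 μs).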

Quick Start

As MCP Server

dotnet tool install -g MemoryIndexer.Mcp

Configure Claude Desktop (%APPDATA%\Claude\claude_desktop_config.json):

{
  "mcpServers": {
    "memory-indexer": {
      "command": "memory-indexer-mcp"
    }
  }
}

As SDK

dotnet add package MemoryIndexer.Sdk

// Register your embedding service BEFORE AddMemoryIndexer()
services.AddSingleton<IEmbeddingService>(myEmbeddingService);

// InMemory storage (default)
services.AddMemoryIndexer(options =>
{
    options.Embedding.Dimensions = 1536;  // Match your embedding model
});

// Or with SQLite persistent storage
services.AddMemoryIndexer(options =>
{
    options.Storage.ConnectionString = "memories.db";
    options.Embedding.Dimensions = 1536;
}).WithSqliteVec();

// Store
await memoryService.StoreAsync("user123", "User prefers dark mode", importance: 0.8f);

// Recall
var results = await memoryService.RecallAsync("user123", "UI preferences", limit: 5);

Samples

MemoryChatApp

A web-based chat app that demonstrates the Context Budget API: intelligent recall replaces the full conversation history.

Traditional: messages = [msg1, msg2, ... msgN]  → Token cost: O(n)
This Demo:   context = recall(query, budget=2000)  → Token cost: O(1)

Features:

Token-budget-aware context building (RecentHeavy, Balanced, SemanticHeavy strategies)
4-tier memory visualization (Buffer → Short → Long → Archive)
Session isolation with cross-session user facts
Flexible embeddings (inject your own IEmbeddingService) with LLM support (GpuStack/OpenAI)
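To make the idea of token-budget-aware context building concrete, here is a small Python sketch. It is not the Context Budget API: the strategy names come from the feature list above, but the recency weights, the `Candidate` fields, and the greedy selection are assumptions chosen for illustration.

```python
from dataclasses import dataclass

# Hypothetical recency weights per strategy; the real parameters may differ.
RECENCY_WEIGHT = {"RecentHeavy": 0.8, "Balanced": 0.5, "SemanticHeavy": 0.2}


@dataclass
class Candidate:
    text: str
    recency: float     # 0..1, where 1 is the most recent memory
    similarity: float  # 0..1, where 1 is most similar to the query
    tokens: int        # token cost of including this memory


def build_context(candidates, budget, strategy="Balanced"):
    """Greedily pick the highest-scoring memories that still fit the budget."""
    w = RECENCY_WEIGHT[strategy]
    ranked = sorted(
        candidates,
        key=lambda c: w * c.recency + (1 - w) * c.similarity,
        reverse=True,
    )
    chosen, used = [], 0
    for c in ranked:
        if used + c.tokens <= budget:
            chosen.append(c)
            used += c.tokens
    return chosen, used


candidates = [
    Candidate("old but relevant fact", recency=0.1, similarity=0.9, tokens=60),
    Candidate("recent small talk", recency=0.9, similarity=0.2, tokens=60),
    Candidate("mid-session note", recency=0.5, similarity=0.5, tokens=60),
]

for strategy in RECENCY_WEIGHT:
    chosen, used = build_context(candidates, budget=100, strategy=strategy)
    print(strategy, [c.text for c in chosen], used)
```

Whatever the strategy, the returned context never exceeds the budget, which is why the token cost stays bounded as the conversation grows.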

cd samples/MemoryChatApp
.\start-dev.ps1               # Opens frontend + backend

Twenty Questions Game

An AI-vs-AI demo in which two LLM agents play 20 Questions using only memory recall, with no chat history injection.

Traditional: messages: [Q1, A1, Q2, A2, ... Q19, A19]  ← O(n) growing context
This Demo:   user: "Alpha says: Yes"                   ← O(1) constant context

What It Proves:

Agents build coherent multi-turn strategy via memory_recall() only
O(1) context maintenance regardless of conversation length
Memory isolation between agents works correctly
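The two properties claimed above, constant-size context and per-agent isolation, can be shown with a toy model. The Python sketch below is not the sample's code: `AgentMemory` and `build_prompt` are hypothetical names, and recall ranks by naive keyword overlap rather than embeddings.

```python
from collections import defaultdict


class AgentMemory:
    """Toy per-agent store; each agent can only recall its own memories."""

    def __init__(self):
        self._by_agent = defaultdict(list)

    def store(self, agent_id, text):
        self._by_agent[agent_id].append(text)

    def recall(self, agent_id, query, limit=5):
        # Rank this agent's memories by how many query words they share.
        words = set(query.lower().split())
        own = self._by_agent.get(agent_id, [])
        ranked = sorted(
            own,
            key=lambda t: len(words & set(t.lower().split())),
            reverse=True,
        )
        return ranked[:limit]


def build_prompt(memory, agent_id, message, limit=5):
    """Prompt = at most `limit` recalled memories plus the new message."""
    return memory.recall(agent_id, message, limit) + [message]


memory = AgentMemory()
for n in range(50):
    memory.store("alpha", f"Q{n}: is it an animal? no")
memory.store("beta", "secret word is piano")

# After 50 stored turns the prompt still holds at most limit + 1 entries.
print(len(build_prompt(memory, "alpha", "is it an animal", limit=5)))
```

Here the prompt length depends only on `limit`, not on how many turns were stored, and a recall for `"alpha"` never returns the entry stored for `"beta"`.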

cd samples/TwentyQuestionsGame
dotnet run                    # Auto-detect LLM provider
dotnet run -- --local         # Use local ONNX model (no API key)

Custom Storage Backends

Memory Indexer provides an IMemoryStore interface for custom storage implementations. Use it to integrate with PostgreSQL, Qdrant, Redis, Pinecone, or any other storage system.

using MemoryIndexer.Utilities;

public class MyPostgresStore : IMemoryStore
{
    public async Task<MemoryUnit> StoreAsync(MemoryUnit memory, CancellationToken ct)
    {
        memory.PrepareForStore();   // Extension: sets Id, CreatedAt, UpdatedAt
        memory.ValidateForStore();  // Extension: validates required fields

        // Your storage logic here
        await _db.Memories.AddAsync(MapToEntity(memory), ct);
        await _db.SaveChangesAsync(ct);
        return memory;
    }
    // ... implement other IMemoryStore methods
}

// Register your custom store
services.AddSingleton<IMemoryStore, MyPostgresStore>();
services.AddMemoryIndexer(options => options.Embedding.Dimensions = 1536);

See Custom IMemoryStore Implementation Guide for complete patterns including hybrid PostgreSQL+Qdrant setups.

Documentation

| Document | Description |
| --- | --- |
| Architecture | System design, 3-axis model, tier/type details |
| Intelligence | Conflict resolution, adaptive retrieval, graph traversal |
| Evaluation | KPIs, NIAH tests, multi-needle scenarios |
| Health | Health checks, Kubernetes probes |
| Benchmarks | Performance measurements |
| Guides | Configuration, custom storage, usage patterns |
| Roadmap | Feature timeline and status |

Research Foundation

Built on recent LLM memory research and established cognitive psychology models:

MemGPT: OS-inspired virtual memory paging
Mem0/Mem0g: Graph-based memory networks
H-MEM: Hierarchical memory with index routing
Cognitive Psychology: Atkinson-Shiffrin, Baddeley, Tulving models

License

MIT License - see LICENSE for details.

Built by iyulab
