Update the MIRIX code base to be async-native by LiaoJianhe · Pull Request #118 · Mirix-AI/MIRIX

LiaoJianhe · 2026-03-06T15:51:39Z

MIRIX Async-Native Rewrite

1. Why Async-Native

MIRIX is a multi-agent system where every user request fans out into
database queries, Redis lookups, LLM API calls, embedding computations,
and Kafka messages -- all I/O-bound. The previous sync codebase serialized
these operations: each blocked thread sat idle waiting for a network
response, and concurrency was limited to the thread-pool size.

Rewriting the stack to be async-native delivers several concrete benefits:

Higher throughput on the same hardware.
A single event loop multiplexes thousands of in-flight I/O operations
without dedicating a thread to each one. Connection pools (asyncpg, Redis,
httpx) are shared across all coroutines, so the server handles more
concurrent users with fewer file descriptors, less memory, and less
context-switching overhead.

End-to-end consistency with FastAPI / Uvicorn.
FastAPI is async-first. When route handlers are async def and directly
await the server/manager/agent/LLM chain, there is no implicit offload
to a thread-pool executor. This removes an entire class of subtle bugs
(thread-safety of shared state, session leaks across threads) and makes
the call stack easy to reason about.

Natural streaming and SSE.
LLM token streaming and Server-Sent Events map directly to async
generators. No background thread is needed to feed an SSE response; the
generator yields tokens as they arrive from the LLM provider.

In-process background workers.
Queue consumers (memory extraction, cleanup) run as asyncio.Tasks in
the same process. This simplifies deployment (one container, one process)
while keeping workers non-blocking.

Lower tail latency.
asyncio.sleep-based retries and exponential back-off do not occupy a
thread during the wait, freeing the loop to serve other requests.

2. High-Level Changes

2.1 External Library Migrations

Layer	Sync (before)	Async (after)
Database driver	`pg8000` / `psycopg2-binary`	asyncpg (PostgreSQL), aiosqlite (SQLite)
SQLAlchemy	`create_engine`, `sessionmaker`, `Session`	`create_async_engine`, `async_sessionmaker`, `AsyncSession`
Redis	`redis.Redis`	redis.asyncio.Redis with `hiredis`
HTTP client	`requests`	httpx.AsyncClient
OpenAI	`openai.OpenAI`	`openai.AsyncOpenAI`
Anthropic	`anthropic.Anthropic`	`anthropic.AsyncAnthropic`
Azure OpenAI	`AzureOpenAI`	`AsyncAzureOpenAI`
Google AI	sync `genai` calls	async `genai` + `httpx.AsyncClient`
Kafka	sync kafka-python (if used)	aiokafka (`AIOKafkaProducer`, `AIOKafkaConsumer`)
Web search	`duckduckgo_search`	asyncddgs
Google APIs	sync `google-api-python-client`	aiogoogle
Test runner	sync pytest	pytest-asyncio (`asyncio_mode = "auto"`)

2.2 Application-Layer Changes

ORM base (mirix/orm/sqlalchemy_base.py)
All CRUD methods (create, read, update, delete, list) are
async def. Sessions are used via async with session. Retry decorators
use asyncio.sleep().

Service managers (mirix/services/)
All 16 managers are async:

#	Manager	File
1	UserManager	`user_manager.py`
2	ClientManager	`client_manager.py`
3	ToolManager	`tool_manager.py`
4	AdminUserManager	`admin_user_manager.py`
5	OrganizationManager	`organization_manager.py`
6	BlockManager	`block_manager.py`
7	MessageManager	`message_manager.py`
8	CloudFileMappingManager	`cloud_file_mapping_manager.py`
9	StepManager	`step_manager.py`
10	AgentManager	`agent_manager.py`
11	RawMemoryManager	`raw_memory_manager.py`
12	EpisodicMemoryManager	`episodic_memory_manager.py`
13	SemanticMemoryManager	`semantic_memory_manager.py`
14	ProceduralMemoryManager	`procedural_memory_manager.py`
15	ResourceMemoryManager	`resource_memory_manager.py`
16	KnowledgeVaultManager	`knowledge_vault_manager.py`

Every manager method uses async with self.session_maker() and await
for all database operations.

LLM API layer (mirix/llm_api/)
LLMClientBase.send_llm_request() and request() are async. All
provider clients (OpenAI, Anthropic, Azure, Google, Cohere, Mistral, AWS
Bedrock) use their respective async SDK classes. Streaming responses are
AsyncGenerator. retry_with_exponential_backoff() uses
asyncio.sleep().

Agent execution (mirix/agent/agent.py)
step(), inner_step(), _get_ai_reply(), _handle_ai_response(),
execute_tool_and_persist_state(), and save_agent() are all async.
Built-in tools (core, memory, extras) are async. User-defined tools
execute in ToolExecutionSandbox via asyncio.create_subprocess_exec()
(no thread pool).

MetaAgent (mirix/agent/meta_agent.py)
MetaAgent.step(), initialize(), and sub-agent orchestration are async.
MessageQueue uses asyncio.Lock instead of threading.Lock.

Queue system (mirix/queue/)

MemoryQueue wraps asyncio.Queue.
KafkaQueue uses aiokafka (fully async producer/consumer).
QueueWorker runs as an asyncio.Task in the main event loop.

Server (mirix/server/server.py)
AsyncServer (renamed from the former SyncServer) exposes async
methods: send_messages(), _step(), load_agent(), create_agent().
A backward-compatible alias SyncServer = AsyncServer is retained for
external callers that have not yet updated.

REST API (mirix/server/rest_api.py)
All route handlers are async def and directly await server methods.
Zero asyncio.to_thread wrappers on the request path. SSE streaming uses
sse_async_generator().

Client SDK (mirix/client/remote_client.py)
MirixClient uses httpx.AsyncClient with RetryTransport. All public
methods (add, send_message, create_agent, etc.) are async.
MirixClient.create() is an async factory for initialization.

Observability (mirix/observability/langfuse_client.py)
Singleton initialization uses asyncio.Lock for coroutine-safe
double-checked locking. The sync LangFuse SDK is called via
asyncio.to_thread (see Section 3.1).

Tests (tests/, pyproject.toml)
pytest-asyncio with asyncio_mode = "auto". Fixtures in conftest.py
are async. asyncio_default_fixture_loop_scope = "session".

3. Remaining Synchronous Code

The request-serving hot path is fully async. The items below are the only
remaining synchronous touch-points. Each is intentional.

3.1 LangFuse SDK


Where	`mirix/observability/langfuse_client.py`
What	`Langfuse()` init, `.flush()`, `.shutdown()` are sync SDK calls, wrapped with `await asyncio.to_thread(...)`.
Why	No official async LangFuse client exists.
Impact	Low. Observability is off the hot path. `to_thread` borrows a thread from the default executor briefly; it does not block the event loop or limit request concurrency.

3.2 Gmail OAuth


Where	`mirix/functions/mcp_client/gmail_client.py`
What	`authenticate_gmail_local()` blocks waiting for a browser OAuth redirect. Called via `await asyncio.to_thread(...)`.
Why	The OAuth flow is inherently blocking (human in the loop).
Impact	Low. One-time auth; not on the per-request path.

3.3 SQLAlchemy DDL at Startup


Where	`mirix/server/server.py`, `ensure_tables_created()`
What	`await conn.run_sync(Base.metadata.create_all)`
Why	SQLAlchemy's DDL/metadata API is sync-only; `run_sync` is the documented pattern for async engines.
Impact	None at runtime. Runs once during application startup.

3.4 Cleanup Job Entry Point


Where	`mirix/jobs/cleanup_raw_memories.py`
What	`asyncio.run(delete_stale_raw_memories_async(threshold))` in `__main__`.
Why	Standard pattern for a standalone script invoked by cron; it bootstraps its own event loop.
Impact	None. Separate process; does not affect the API server.

3.5 Pure CPU Helpers -- Intentionally Sync


Where	`mirix/utils.py`, `mirix/services/utils.py`, and private helpers in memory managers (`_clean_text_for_search`, `_parse_embedding_field`, `_count_word_matches`, `_preprocess_text_for_bm25`).
What	String manipulation, regex, JSON parsing, token counting, date formatting, UUID generation. Zero I/O.
Why	Adding `async def` to a function that never `await`s provides no concurrency benefit. The event loop only yields at `await` points, so an `async def` body with no awaits runs identically to a plain `def` -- but with extra coroutine-object overhead. A function should be `async def` if and only if it performs I/O. (Note: `mirix/services/utils.py::build_query` is correctly `async def` because it awaits `embedding_model()`.)
Impact	None. These run in microseconds. If a future helper became CPU-heavy, the correct fix would be `asyncio.to_thread` (offload to a thread), not `async def`.

3.6 Server Class Naming (Resolved)

The class formerly named SyncServer has been renamed to AsyncServer
as part of this change set. All imports, type hints, docstrings, and tests
have been updated. A backward-compatible alias SyncServer = AsyncServer
is retained in mirix/server/server.py.

4. Summary

The MIRIX application is async-native from the HTTP boundary through the
server, agents, service managers, ORM, database, Redis, Kafka, and
LLM/embedding clients. The only remaining sync touch-points are:

LangFuse -- sync SDK wrapped in asyncio.to_thread; low impact.
Gmail OAuth -- blocking by design; wrapped in to_thread; rare.
Startup DDL -- one-time run_sync; no runtime impact.
Cleanup script -- asyncio.run() in __main__; separate process.
Pure CPU helpers -- no I/O; async def would add overhead, not benefit.
Server naming -- SyncServer renamed to AsyncServer; alias kept.

None of these limit MIRIX's ability to scale request throughput or
concurrent users. The critical path is fully async.

@L-u-k-e

* feat: multi scope clients * fix: format and passing tests * fix: fix langfuse tests * fix: fix local client tests * fix: fix tests * chore: format * feat: scoped core memory * fix: fix some test bugs * chore: tests * Apply suggestion from @L-u-k-e * Apply suggestion from @L-u-k-e --------- Co-authored-by: Jianhe Liao <jianhe_liao@intuit.com>

feat: Allow clients to add `filter_tags` to blocks and use them for cross user searches

* feat: support multiple filter operators in tag search * chore: remove integration test * fix: messageToDict * feat: block filter tag updates - always-apply on save plus new update mode options (#58) * gfeat: update block filter tags * fix: fix bugs

Made-with: Cursor

LiaoJianhe · 2026-03-06T15:52:22Z

Wrong PR target branch

L-u-k-e and others added 11 commits February 17, 2026 17:43

fix: fix docker test script

9777ef8

feat: block filter tags

a17d164

feat: add param to other send_message call sites

3b05a90

chore: logging

a38959c

fix: put stream back

e26881e

Merge pull request #56 from LiaoJianhe/lp/cross-user-search-core-memory

9c7201b

feat: Allow clients to add `filter_tags` to blocks and use them for cross user searches

feat: async native implementation for MIRIX agents and services

101ddd9

Made-with: Cursor

test: adapt PR #57/#58 tests for async (fixtures, await, AsyncMock)

d9a77d1

Made-with: Cursor

[VEPEAGE-525] Change Mirix code base to be async-native

135a93b

LiaoJianhe closed this Mar 6, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update the MIRIX code base to be async-native#118

Update the MIRIX code base to be async-native#118
LiaoJianhe wants to merge 11 commits intoMirix-AI:re-orgfrom
LiaoJianhe:jianhe-async-mar03

LiaoJianhe commented Mar 6, 2026

Uh oh!

LiaoJianhe commented Mar 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

LiaoJianhe commented Mar 6, 2026

MIRIX Async-Native Rewrite

1. Why Async-Native

2. High-Level Changes

2.1 External Library Migrations

2.2 Application-Layer Changes

3. Remaining Synchronous Code

3.1 LangFuse SDK

3.2 Gmail OAuth

3.3 SQLAlchemy DDL at Startup

3.4 Cleanup Job Entry Point

3.5 Pure CPU Helpers -- Intentionally Sync

3.6 Server Class Naming (Resolved)

4. Summary

Uh oh!

LiaoJianhe commented Mar 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants