Head to head

ArgosBrain vs Letta

Letta pays an LLM call on every read. ArgosBrain pays $0.

What Letta does
Tiered memory (core/archival/recall) where every read is an LLM tool call.
What ArgosBrain does differently
Deterministic, zero-token structural reads with no LLM on the hot path.
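The difference can be sketched in a few lines. This is an illustrative example, not ArgosBrain's real API: the `Symbol` type and `structural_read` function are hypothetical, standing in for whatever index the engine actually keeps. The point is that the read is a plain map lookup: deterministic, token-free, and off the LLM hot path.

```rust
use std::collections::HashMap;

// Hypothetical symbol record; field names are illustrative only.
#[derive(Debug, Clone, PartialEq)]
struct Symbol {
    file: String,
    line: u32,
}

// A structural read: a plain index lookup. No model call, no tokens,
// and the same answer every time for the same index state.
fn structural_read<'a>(index: &'a HashMap<String, Symbol>, name: &str) -> Option<&'a Symbol> {
    index.get(name)
}

fn main() {
    let mut index = HashMap::new();
    index.insert(
        "parse_config".to_string(),
        Symbol { file: "src/config.rs".into(), line: 42 },
    );

    // The hot path is a hash lookup, not an agent deciding to search.
    let hit = structural_read(&index, "parse_config");
    assert!(hit.is_some());
    assert!(structural_read(&index, "missing_symbol").is_none());
    println!("{:?}", hit.unwrap());
}
```

Contrast this with a tiered-memory read, where an agent must first choose to issue a search tool call and an LLM pass prices every retrieval.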
What Letta is

The baseline, stated fairly.

Letta (formerly MemGPT; Apache-2.0, Letta AI / Charles Packer) ships a tiered memory architecture — core (always in context), archival (vector-searchable), recall (message history) — with agent self-edits via tool calls.

Sleep-time compute performs background LLM summarization during idle time. Letta Code (March 2026) is their memory-first coding harness and the #1 model-agnostic open-source agent on Terminal-Bench.

How it actually works

Technical facts.

Sources: Letta docs · Letta repo · Letta Code

Verdict

Where each one wins.

↑ Where ArgosBrain wins
  • $0/query vs LLM-per-read. This is the sharpest cost difference in the whole competitive set.
  • Deterministic reads. Letta's agent has to decide to search; ArgosBrain answers structurally.
  • No sleep-time compute cost. Our consolidation is a tokio task, not an LLM call.
  • Symbol precision. Structural reads return exact symbols; Letta's archival tier returns vector-search approximations.
↑ Where Letta wins
  • Agent Development Environment (ADE) — visual debugger, stateful agent graph.
  • Terminal-Bench #1 OSS (Letta Code) — a real number we don't match on full-agent benchmarks.
  • Richer agent loop (self-editing memory) is conceptually ambitious.
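The consolidation point above can be sketched. ArgosBrain's actual implementation is a tokio task; the version below uses a `std::thread` and an mpsc channel so it runs with no external crates, and the `Event` type and merge rule are illustrative assumptions, not the real consolidation logic. What it shows is the shape of the claim: consolidation is ordinary background computation with zero token cost.

```rust
use std::sync::mpsc;
use std::thread;

// Illustrative memory event; not ArgosBrain's real types.
struct Event {
    text: String,
}

// Background consolidation: drain incoming events and fold them into a
// compact summary. Pure deterministic computation -- no LLM call involved.
fn spawn_consolidator(rx: mpsc::Receiver<Event>) -> thread::JoinHandle<String> {
    thread::spawn(move || {
        let mut summary = String::new();
        for ev in rx {
            // Toy merge rule: concatenate with a separator.
            if !summary.is_empty() {
                summary.push_str("; ");
            }
            summary.push_str(&ev.text);
        }
        summary
    })
}

fn main() {
    let (tx, rx) = mpsc::channel();
    let handle = spawn_consolidator(rx);

    tx.send(Event { text: "user prefers Rust".into() }).unwrap();
    tx.send(Event { text: "project uses tokio".into() }).unwrap();
    drop(tx); // closing the channel ends the consolidation loop

    let summary = handle.join().unwrap();
    println!("{summary}");
}
```

Sleep-time compute in Letta does the analogous consolidation via background LLM summarization, which is where its idle-time token cost comes from.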
When to choose which

Honest recommendation.

Choose Letta if
  • You want a full agent framework
  • You want built-in stateful memory
  • Token cost at read time is not a concern
Choose ArgosBrain if
  • You want a memory layer any agent can use
  • You need zero token cost at retrieval