Umi

Memory for AI agents - entity extraction, dual retrieval, and evolution tracking.

Overview

Umi provides infrastructure for building AI agents that:

Extract structured knowledge from unstructured text via LLM-powered entity extraction
Retrieve intelligently using dual-mode search (fast vector + LLM semantic expansion)
Track evolution by detecting when memories update, extend, or contradict each other
Test deterministically with simulation testing for reproducible results

Quick Start

Python

Status: ✅ Feature Complete (v0.2.0) - Full async Memory API with real provider support. See PYTHON.md for full details.

# Install from source (not yet on PyPI)
cd umi-py
pip install maturin
maturin develop  # Build and install locally

import asyncio
import umi

async def main():
    # Production with real providers
    memory = umi.Memory.with_anthropic(
        anthropic_key="sk-ant-...",
        openai_key="sk-...",
        db_path="./umi_db"
    )

    # Store information
    result = await memory.remember("Alice works at Acme Corp")
    print(f"Stored {result.entity_count()} entities")

    # Retrieve information
    entities = await memory.recall("Who works at Acme?")
    for entity in entities:
        print(f"- {entity.name}: {entity.content}")

asyncio.run(main())

What works:

Full Memory API with async/await support
Real LLM providers (Anthropic, OpenAI)
Storage backends (Lance, Postgres)
CoreMemory, WorkingMemory, Entity, options types
Deterministic testing with sim providers

See PYTHON.md for comprehensive documentation and examples.

Rust

cargo add umi-memory

use umi_memory::umi::{Memory, RememberOptions, RecallOptions};

#[tokio::main]
async fn main() -> Result<(), Box<dyn std::error::Error>> {
    // Simulation mode (deterministic, no API calls)
    let mut memory = Memory::sim(42);

    // Remember information
    let result = memory
        .remember("Alice works at Acme Corp", RememberOptions::default())
        .await?;
    println!("Stored {} entities", result.entity_count());

    // Recall memories
    let results = memory
        .recall("Alice", RecallOptions::default())
        .await?;

    for entity in results {
        println!("  - {}: {}", entity.name, entity.content);
    }

    Ok(())
}

Architecture

┌─────────────────────────────────────────────────────────────────────┐
│                   Python: pip install umi (PyO3 bindings)           │
│                                                                     │
│  ┌─────────────────────────────────────────────────────────────┐   │
│  │         Memory (thin wrapper around Rust Memory API)        │   │
│  └─────────────────────────┬───────────────────────────────────┘   │
└────────────────────────────┼─────────────────────────────────────────┘
                             │ PyO3 (exposing Rust to Python)
┌────────────────────────────┴─────────────────────────────────────────┐
│                          Rust: umi-memory                            │
│                                                                      │
│  ┌──────────────┐  ┌──────────────┐  ┌──────────────────────────┐  │
│  │   Memory     │  │EntityExtract │  │  DualRetriever           │  │
│  │  (Main API)  │  │  (LLM-based) │  │  (Fast + LLM semantic)   │  │
│  └──────┬───────┘  └──────┬───────┘  └──────────┬───────────────┘  │
│         │                 │                      │                  │
│  ┌──────┴─────────────────┴──────────────────────┴───────────────┐  │
│  │       EvolutionTracker (Memory relationship detection)        │  │
│  └──────────────────────────┬───────────────────────────────────┘  │
│                             │                                       │
│  ┌──────────────────────────┴───────────────────────────────────┐  │
│  │           LLMProvider (Sim | Anthropic | OpenAI)             │  │
│  └──────────────────────────┬───────────────────────────────────┘  │
│                             │                                       │
│  ┌──────────────┐  ┌──────────────┐  ┌──────────────────────────┐  │
│  │ CoreMemory   │  │WorkingMemory │  │  ArchivalMemory          │  │
│  │  (32KB ctx)  │  │  (1MB TTL)   │  │  (Unlimited vector)      │  │
│  └──────┬───────┘  └──────┬───────┘  └──────────┬───────────────┘  │
│         │                 │                      │                  │
│  ┌──────┴─────────────────┴──────────────────────┴───────────────┐  │
│  │         StorageBackend (Sim | Postgres | Lance)               │  │
│  └───────────────────────────────────────────────────────────────┘  │
└──────────────────────────────────────────────────────────────────────┘

Features

Memory Hierarchy

Tier	Purpose	Size	Persistence	Access Pattern
Core	Always in LLM context	~32KB	Per-session	Read/Write via Memory
Working	Session KV store	~1MB	TTL-based	Fast key-value ops
Archival	Long-term semantic search	Unlimited	Persistent	Vector similarity

Component Status

Component	Description	Status	Tests
umi-memory	Rust core (memory tiers, DST)	✅ Complete	~813
umi-py	PyO3 bindings (full Memory API, async/await)	✅ Complete	23
Memory API	Main orchestrator with remember/recall	✅ Complete	✓
MemoryConfig	Global configuration system	✅ Complete	✓
EntityExtractor	LLM-powered entity extraction	✅ Complete	✓
DualRetriever	Fast + LLM semantic search	✅ Complete	✓
EvolutionTracker	Memory relationship detection	✅ Complete	✓
SimLLMProvider	Deterministic LLM simulation	✅ Complete	✓
LanceVectorBackend	Production vector storage	✅ Complete	✓
PostgresVectorBackend	Postgres-based storage	✅ Complete	✓

LLM Providers (Rust)

use umi_memory::llm::{SimLLMProvider, LLMProvider};

// Simulation (deterministic, no API)
let llm = SimLLMProvider::with_seed(42);

// Anthropic Claude (requires 'anthropic' feature)
#[cfg(feature = "anthropic")]
let llm = umi_memory::llm::AnthropicProvider::new(api_key);

// OpenAI GPT (requires 'openai' feature)
#[cfg(feature = "openai")]
let llm = umi_memory::llm::OpenAIProvider::new(api_key);

Note: Python bindings support all LLM providers through convenient constructors. See PYTHON.md.

Storage Backends

use umi_memory::umi::Memory;
use umi_memory::dst::SimConfig;
use umi_memory::embedding::SimEmbeddingProvider;
use umi_memory::llm::SimLLMProvider;
use umi_memory::storage::{SimStorageBackend, SimVectorBackend};

// Quick start with simulation (all components)
let mut memory = Memory::sim(42);

// Or construct with explicit components
let llm = SimLLMProvider::with_seed(42);
let embedder = SimEmbeddingProvider::with_seed(42);
let vector = SimVectorBackend::new(42);
let storage = SimStorageBackend::new(SimConfig::with_seed(42));
let mut memory = Memory::new(llm, embedder, vector, storage);

// Production with LanceDB (requires 'lance' feature)
#[cfg(feature = "lance")]
{
    use umi_memory::llm::AnthropicProvider;
    use umi_memory::embedding::OpenAIEmbeddingProvider;
    use umi_memory::storage::{LanceVectorBackend, LanceStorageBackend};

    let llm = AnthropicProvider::new(api_key);
    let embedder = OpenAIEmbeddingProvider::new(openai_key);
    let vector = LanceVectorBackend::connect("./vectors.lance").await?;
    let storage = LanceStorageBackend::connect("./storage.lance").await?;
    let mut memory = Memory::new(llm, embedder, vector, storage);
}

Production Mode (Rust)

use umi_memory::umi::Memory;
use std::env;

#[tokio::main]
async fn main() -> Result<(), Box<dyn std::error::Error>> {
    // Production with real providers (requires 'anthropic', 'openai', 'lance' features)
    #[cfg(all(feature = "anthropic", feature = "openai", feature = "lance"))]
    {
        use umi_memory::llm::AnthropicProvider;
        use umi_memory::embedding::OpenAIEmbeddingProvider;
        use umi_memory::storage::{LanceVectorBackend, LanceStorageBackend};

        let llm = AnthropicProvider::new(env::var("ANTHROPIC_API_KEY")?);
        let embedder = OpenAIEmbeddingProvider::new(env::var("OPENAI_API_KEY")?);
        let vector = LanceVectorBackend::connect("./vectors.lance").await?;
        let storage = LanceStorageBackend::connect("./storage.lance").await?;

        let mut memory = Memory::new(llm, embedder, vector, storage);

        // Use memory
        let result = memory.remember("Alice works at Acme Corp", Default::default()).await?;
        println!("Stored {} entities", result.entity_count());
    }

    Ok(())
}

Note: Python bindings (v0.2.0) include full Memory API with async/await, real LLM providers, and convenient constructors like Memory.with_anthropic(). See PYTHON.md for Python examples.

Configuration

Memory Configuration (Rust)

use umi_memory::umi::{Memory, MemoryConfig};
use std::time::Duration;

// Default configuration
let mut memory = Memory::sim(42);

// Custom configuration
let config = MemoryConfig::default()
    .with_core_memory_bytes(64 * 1024)  // 64KB core memory
    .with_recall_limit(20)              // Return up to 20 results
    .without_embeddings();              // Disable embedding generation

let mut memory = Memory::sim_with_config(42, config);

// Configuration options:
// - core_memory_bytes: Size of core memory (default: 32KB)
// - working_memory_bytes: Size of working memory (default: 1MB)
// - working_memory_ttl: TTL for working memory (default: 1 hour)
// - generate_embeddings: Enable/disable embeddings (default: true)
// - embedding_batch_size: Batch size for embedding generation (default: 100)
// - default_recall_limit: Default limit for recall (default: 10)
// - semantic_search_enabled: Enable semantic search (default: true)
// - query_expansion_enabled: Enable query expansion (default: true)

Environment Variables

Variable	Description	Default	Required
`ANTHROPIC_API_KEY`	Anthropic API key	-	For Claude
`OPENAI_API_KEY`	OpenAI API key	-	For GPT
`UMI_LOG_LEVEL`	Logging level	`info`	No
`DST_SEED`	Deterministic test seed	Random	No

Python Package Extras

Extra	Purpose	Install
`anthropic`	Anthropic Claude support	`pip install "umi[anthropic]"`
`openai`	OpenAI GPT support	`pip install "umi[openai]"`
`dev`	Development tools	`pip install "umi[dev]"`
`all`	All extras	`pip install "umi[all]"`

Rust Crate Features

Feature	Description	Enable
`lance`	LanceDB vector backend	`features = ["lance"]`
`postgres`	Postgres storage backend	`features = ["postgres"]`
`anthropic`	Anthropic LLM provider	`features = ["anthropic"]`
`openai`	OpenAI LLM provider	`features = ["openai"]`

Testing

Deterministic Simulation Testing (DST)

Umi uses deterministic simulation for reliable, reproducible tests:

// Rust - Deterministic simulation
use umi_memory::dst::SimConfig;
use umi_memory::umi::Memory;

// Same seed = same results
let mut memory1 = Memory::sim(42);
let mut memory2 = Memory::sim(42);

// Both produce identical results

Running Tests

# Rust tests (~813 tests)
cargo test -p umi-memory --all-features

# All tests with specific DST seed
DST_SEED=12345 cargo test --all-features

# Rust with coverage
cargo tarpaulin --all-features --out Html

# Python tests (23 tests with pytest)
cd umi-py
pip install pytest pytest-asyncio
pytest tests/ -v

Fault Injection (Rust)

use umi_memory::dst::{Simulation, SimConfig, FaultConfig, FaultType};

// Test with simulated failures
let sim = Simulation::new(SimConfig::with_seed(42))
    .with_fault(FaultConfig::new(FaultType::StorageWriteFail, 0.1));  // 10% failure

sim.run(|env| async move {
    let mut memory = env.create_memory();
    // Operations may fail due to fault injection
    Ok(())
}).await.unwrap();

Note: Python bindings currently use simulation providers for testing. Fault injection with real providers is planned.

Test Coverage

Category	Tests	Status
Rust Core	~813	✅ Passing
Python Bindings	23	✅ Passing

Performance

Metric	Target	Current
Entity extraction	<2s	~1.5s
Vector search (10K)	<100ms	~50ms
Vector search (1M)	<500ms	~300ms
Memory write	<10ms	~5ms
Dual retrieval	<3s	~2s

Development

Python Development

cd umi-py
pip install maturin pytest pytest-asyncio
maturin develop  # Build and install locally

# Run tests (23 tests)
pytest tests/ -v

# Test basic functionality
python test_bindings.py

Rust Development

# Build
cargo build --all-features

# Run tests
cargo test --all-features

# Run linter
cargo clippy --all-features -- -D warnings

# Format
cargo fmt --check

# Run benchmarks
cargo bench -p umi-memory

Pre-commit Checklist

Before every commit:

# Rust (required)
cargo fmt && cargo clippy --all-features -- -D warnings && cargo test --all-features

# Python (if modifying Python bindings)
cd umi-py && pytest tests/ -v

Project Structure

umi/
├── umi-memory/             # Rust core library
│   ├── src/
│   │   ├── lib.rs          # Public exports
│   │   ├── umi/            # Memory orchestrator
│   │   ├── extraction/     # EntityExtractor
│   │   ├── retrieval/      # DualRetriever
│   │   ├── evolution/      # EvolutionTracker
│   │   ├── storage/        # Sim + Lance backends
│   │   ├── llm/            # LLM providers
│   │   ├── memory/         # Memory tiers
│   │   ├── dst/            # Deterministic simulation
│   │   └── constants.rs    # TigerStyle limits
│   └── benches/            # Benchmarks
│
├── umi-py/                 # PyO3 Python bindings (v0.2.0)
│   ├── src/lib.rs
│   ├── tests/              # Pytest tests (23 tests)
│   ├── examples/           # Example scripts
│   └── umi.pyi             # Type stubs
│
├── docs/                   # Documentation
│   └── adr/                # Architecture Decision Records
│
├── .github/workflows/      # CI/CD
│   ├── ci.yml              # Python CI
│   └── ci-rust.yml         # Rust CI
│
├── VISION.md               # Project vision and roadmap
├── CLAUDE.md               # Development guidelines
├── README.md               # This file
├── Cargo.toml              # Rust workspace
└── pyproject.toml          # Python package

Roadmap

See VISION.md for detailed roadmap.

Next priorities:

✅ PyO3 bindings (P0) - Connect Python to Rust core (Completed v0.2.0)
PyPI publishing (P0) - Public release of Python bindings
Integration tests with real providers (P1) - Test with actual API keys
Crates.io publishing (P1) - Rust crate distribution

Engineering Principles

Umi follows TigerStyle (Safety > Performance > DX):

Explicit constants with units: MEMORY_BYTES_MAX, TTL_SECONDS_DEFAULT
2+ assertions per non-trivial function
No silent truncation or implicit conversions
Deterministic simulation testing for critical paths
Every component has a simulation implementation

See CLAUDE.md for detailed development guidelines.

Inspiration

memU - Dual retrieval architecture
Mem0 - Entity extraction and evolution tracking
Supermemory - Temporal metadata
TigerBeetle - TigerStyle engineering
FoundationDB - Deterministic simulation testing

Contributing

See CLAUDE.md for development guidelines.

Key principles:

Simulation-first - Every component has a Sim implementation
Tests must pass - All tests before commit
TigerStyle - Explicit limits, assertions, no silent failures
No stubs - Complete implementations or don't merge
Graceful degradation - Handle LLM failures elegantly

License

MIT - See LICENSE for details

Name		Name	Last commit message	Last commit date
Latest commit History 98 Commits
.github/workflows		.github/workflows
.progress		.progress
docs		docs
scripts		scripts
umi-memory		umi-memory
umi-py		umi-py
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.toml		Cargo.toml
LICENSE		LICENSE
PYTHON.md		PYTHON.md
README.md		README.md
RELEASING.md		RELEASING.md
VISION.md		VISION.md
clippy.toml		clippy.toml
pyproject.toml		pyproject.toml

License

rita-aga/umi

Folders and files

Latest commit

History

Repository files navigation

Umi

Overview

Quick Start

Python

Rust

Architecture

Features

Memory Hierarchy

Component Status

LLM Providers (Rust)

Storage Backends

Production Mode (Rust)

Configuration

Memory Configuration (Rust)

Environment Variables

Python Package Extras

Rust Crate Features

Testing

Deterministic Simulation Testing (DST)

Running Tests

Fault Injection (Rust)

Test Coverage

Performance

Development

Python Development

Rust Development

Pre-commit Checklist

Project Structure

Roadmap

Engineering Principles

Inspiration

Contributing

License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 2

Uh oh!

Languages

Packages