TheUltimateRAG is a production-ready, modular, and highly flexible Retrieval-Augmented Generation (RAG) boilerplate. Built with FastAPI, LangChain, and ChromaDB, it is designed to be the robust foundation for your next AI application.

🚀 TUG (TheUltimateRAG)

A Modular, Production-Ready Foundation for Next-Generation AI Applications

Build scalable, secure, and intelligent RAG (Retrieval-Augmented Generation) systems without reinventing the wheel.

Python 3.10+ FastAPI LangChain License: MIT

🔗 Official Website & Documentation
👉 https://theultimaterag.vercel.app/

Key Features • Architecture • Getting Started • Visualizer • API • Contributing


📖 What is TUG (TheUltimateRAG)?

TUG (TheUltimateRAG) is a real-world, production-grade RAG framework, not just another tutorial or demo project.

It is designed to solve common problems developers face when moving from simple prototypes to scalable AI systems, such as:

  • Multi-user data separation
  • Long-term memory handling
  • Organizational knowledge sharing
  • Clean, modular architecture

Whether you’re building:

  • A corporate knowledge assistant
  • A legal or research AI
  • A personal second-brain
  • Or a multi-tenant SaaS AI platform

👉 TUG (TheUltimateRAG) gives you a strong, extensible backend foundation.

For a complete walkthrough, architecture deep-dives, and usage examples,
📘 visit the official documentation:
https://theultimaterag.vercel.app/


🌟 Key Features (Explained Simply)

| Feature | What It Means for You |
| --- | --- |
| ⚡ High-Performance API | Built with FastAPI for fast, async, and scalable AI services |
| 🛡️ True Multi-Tenant Isolation | Each user’s data is fully isolated and secure |
| 🏢 Organization-Level Knowledge | Share documents across teams without duplicating data |
| 🧠 Session-Aware Memory | Conversations retain context naturally across turns |
| 🔍 Hybrid Semantic Search | Metadata-aware vector search with logical filters |
| 👁️ RAG Visualizer GUI | Real-time visualization of retrieval, context, and generation |

πŸ—οΈ System Architecture (Designed for Flexibility)

The system follows a plug-and-play architecture.
You can replace or extend any core component without breaking the rest of the system.

  • Swap vector databases
  • Change LLM providers
  • Add custom memory logic
  • Introduce agent workflows

```mermaid
graph TD
    Client[Client / Frontend] -->|HTTP / JSON| API[FastAPI Gateway]

    subgraph "Core RAG Engine"
        API --> Logic[Orchestrator]
        Logic -->|Retrieve Context| Vector[Vector Store Manager]
        Logic -->|Conversation State| Memory[Session Memory]
        Logic -->|Generate Response| LLM[LLM Service]
    end

    subgraph "Data Layer"
        Vector <-->|Embeddings| Chroma[(ChromaDB)]
        Memory <-->|Chat Logs| Cache[(In-Memory / Redis)]
    end
```
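
One way the swap points above stay decoupled is by coding the orchestrator against a small interface rather than a concrete provider. A hedged sketch of that pattern (class names are illustrative, not the project's real ones):

```python
from typing import Protocol

class LLMProvider(Protocol):
    """Anything that can turn a prompt into a completion."""
    def generate(self, prompt: str) -> str: ...

class EchoProvider:
    """Stand-in provider; a real one would call OpenAI, Ollama, Anthropic, etc."""
    def generate(self, prompt: str) -> str:
        return f"echo: {prompt}"

class Orchestrator:
    """Depends only on the LLMProvider interface, so backends swap freely."""
    def __init__(self, llm: LLMProvider) -> None:
        self.llm = llm

    def answer(self, question: str, context: str) -> str:
        return self.llm.generate(f"Context: {context}\nQuestion: {question}")

rag = Orchestrator(EchoProvider())
print(rag.answer("What is TUG?", "TUG is a RAG boilerplate."))
```

Replacing `EchoProvider` with another class that implements `generate` changes the backend without touching the orchestrator.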

📖 Detailed architecture explanation available at: 👉 https://theultimaterag.vercel.app/


🚀 Getting Started Quickly

Requirements

  • Python 3.10+
  • Node.js & npm (for the Visualizer UI)
  • API keys (OpenAI, Anthropic, etc.)

Installation Steps

Option 1: Install via pip (Recommended)

```bash
pip install ultimaterag
```

Option 2: Run from Source

If you cloned the repository, install dependencies first:

```bash
pip install -e .
```

πŸ” Environment Configuration for TheUltimateRAG

To run TheUltimateRAG correctly, you must create and configure a .env file.
This file stores environment-specific settings such as API keys, database configs, and runtime options.

The project uses Pydantic Settings + python-dotenv, so all variables defined in .env are automatically loaded at startup.


πŸ“ Step 1: Create the .env File

At the root of the project, create a file named:

.env

βš™οΈ Step 2: Required & Optional Environment Variables

Below is a complete reference of supported environment variables, grouped by purpose.

You only need to configure the parts relevant to your setup.


🧩 Core Application Settings

```env
APP_NAME=TheUltimateRAG
APP_ENV=development        # development | production
DEBUG=true
```

| Variable | Description |
| --- | --- |
| APP_NAME | Application name |
| APP_ENV | Runtime environment |
| DEBUG | Enable/disable debug logs |

🤖 LLM & Embedding Providers

```env
LLM_PROVIDER=openai        # openai | ollama | anthropic
EMBEDDING_PROVIDER=openai  # openai | ollama | huggingface
MODEL_NAME=gpt-3.5-turbo
```

| Variable | Description |
| --- | --- |
| LLM_PROVIDER | LLM backend to use |
| EMBEDDING_PROVIDER | Embedding model provider |
| MODEL_NAME | Chat model name |

🔑 API Keys (Required Based on Provider)

OpenAI

```env
OPENAI_API_KEY=sk-xxxxxxxxxxxxxxxx
```

Anthropic

```env
ANTHROPIC_API_KEY=sk-ant-xxxxxxxxxxxx
```

⚠️ Note: If LLM_PROVIDER or EMBEDDING_PROVIDER is set to openai, OPENAI_API_KEY must be provided; otherwise a warning is shown at startup.

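That provider/key consistency check can be sketched as a plain function (illustrative logic, not the project's actual validation code):

```python
def check_provider_keys(env: dict) -> list[str]:
    """Warn when a configured provider is missing its API key."""
    warnings = []
    providers = {env.get("LLM_PROVIDER"), env.get("EMBEDDING_PROVIDER")}
    if "openai" in providers and not env.get("OPENAI_API_KEY"):
        warnings.append("OPENAI_API_KEY is required when using the openai provider")
    if "anthropic" in providers and not env.get("ANTHROPIC_API_KEY"):
        warnings.append("ANTHROPIC_API_KEY is required when using the anthropic provider")
    return warnings

print(check_provider_keys({"LLM_PROVIDER": "openai"}))  # warns: key missing
```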

🧠 Ollama Configuration (Local Models)

```env
OLLAMA_BASE_URL=http://localhost:11434
```

Use this only if you are running Ollama locally.


πŸ—‚οΈ Vector Database Configuration

ChromaDB (Default – Local)

VECTOR_DB_TYPE=chroma
VECTOR_DB_PATH=./chroma_db_data
EMBEDDING_DIMENSION=1536

PostgreSQL + PGVector

VECTOR_DB_TYPE=postgres
POSTGRES_HOST=localhost
POSTGRES_PORT=5432
POSTGRES_DB=vector_db
POSTGRES_USER=postgres
POSTGRES_PASSWORD=postgres
Variable Description
VECTOR_DB_TYPE chroma or postgres
VECTOR_DB_PATH Local ChromaDB storage path
EMBEDDING_DIMENSION Vector embedding size

🧠 Memory & Conversation Storage (Redis)

```env
MEMORY_WINDOW_SIZE=10
MEMORY_WINDOW_LIMIT=10

REDIS_HOST=localhost
REDIS_PORT=6379
REDIS_DB=0
REDIS_USER=default
REDIS_PASSWORD=
```

The system automatically builds the Redis connection URL internally.
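
For reference, building that URL from the variables above looks roughly like this (a sketch of the internal behavior; the project's real helper may differ):

```python
def build_redis_url(host: str = "localhost", port: int = 6379, db: int = 0,
                    user: str = "default", password: str = "") -> str:
    """Assemble a redis:// connection URL; auth is included only when a password is set."""
    auth = f"{user}:{password}@" if password else ""
    return f"redis://{auth}{host}:{port}/{db}"

print(build_redis_url())                   # redis://localhost:6379/0
print(build_redis_url(password="s3cret"))  # redis://default:s3cret@localhost:6379/0
```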


🔄 How .env Is Loaded

The project uses:

  • python-dotenv
  • pydantic-settings

```python
from dotenv import load_dotenv

load_dotenv()          # pulls .env values into the process environment
settings = Settings()  # the project's pydantic-settings class then reads them
```

So no manual loading is required.
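
The effect is the pattern sketched below, shown with only the standard library so the idea is visible (field names follow the .env reference above; the project's real Settings class is defined with pydantic-settings, not this dataclass):

```python
import os
from dataclasses import dataclass

@dataclass
class AppSettings:
    """Stdlib-only sketch of what pydantic-settings automates:
    every field falls back to a default unless the environment overrides it."""
    app_name: str = "TheUltimateRAG"
    llm_provider: str = "openai"
    vector_db_type: str = "chroma"

    @classmethod
    def from_env(cls) -> "AppSettings":
        return cls(
            app_name=os.getenv("APP_NAME", cls.app_name),
            llm_provider=os.getenv("LLM_PROVIDER", cls.llm_provider),
            vector_db_type=os.getenv("VECTOR_DB_TYPE", cls.vector_db_type),
        )

os.environ["LLM_PROVIDER"] = "ollama"  # simulate a value loaded from .env
settings = AppSettings.from_env()
print(settings.llm_provider)  # ollama
```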


✅ Minimal .env (Quick Start)

If you want to get started quickly, this is enough:

```env
OPENAI_API_KEY=sk-xxxxxxxxxxxxxxxx
LLM_PROVIDER=openai
EMBEDDING_PROVIDER=openai
VECTOR_DB_TYPE=chroma
```

📘 Refer to the full configuration guide here: 👉 https://theultimaterag.vercel.app/

3️⃣ Run the Platform (CLI)

You can use the installed ultimaterag CLI to control the system.

Start the Server:

```bash
ultimaterag start
# Options: --host 0.0.0.0 --port 8000 --reload
```

or via python:

```bash
python app.py
```

Other CLI Commands:

  • ultimaterag version : Show the current version
  • ultimaterag about : Show project information
  • ultimaterag license : Show the usage license
  • ultimaterag help : Show the full help guide


🧪 Custom Implementation Example

We provide a standalone example, example.py, in the root directory to demonstrate how to build a custom application using ultimaterag as a library.

How to Run

```bash
# Ensure you are in the project root
python example.py
```

This starts a custom FastAPI server on port 8001 with a specific /ask endpoint that uses the RAG engine directly.

Test the Custom Endpoint:

```bash
curl -X POST "http://localhost:8001/ask" \
     -H "Content-Type: application/json" \
     -d '{"query": "What is UltimateRAG?"}'
```

🖥️ RAG Visualizer GUI

A dedicated React-based GUI lets you:

  • Inspect retrieved documents
  • Understand context flow
  • Debug hallucinations
  • Optimize retrieval strategies

```bash
cd rag_visualizer
npm install
npm run dev
```

📑 API Endpoints Overview

Access live API documentation at: 👉 http://localhost:8000/docs

Core APIs

  • POST /api/v1/chat β†’ Chat with your knowledge base
  • POST /api/v1/ingest β†’ Secure document ingestion

Agent & Advanced APIs

  • GET /api/v1/agent/tools
  • POST /api/v1/agent/search
  • POST /api/v1/agent/workflow β†’ Self-correcting RAG pipelines
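
From Python, the chat endpoint can be called with just the standard library. Note the {"query": ...} payload shape is an assumption modeled on the /ask example above; check the live /docs page for the authoritative request schema:

```python
import json
import urllib.request

API_BASE = "http://localhost:8000"

def build_chat_request(query: str) -> urllib.request.Request:
    """Build a POST request for /api/v1/chat (payload shape assumed, see /docs)."""
    return urllib.request.Request(
        f"{API_BASE}/api/v1/chat",
        data=json.dumps({"query": query}).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

def chat(query: str) -> dict:
    """Send the request to a running server and decode the JSON reply."""
    with urllib.request.urlopen(build_chat_request(query)) as resp:
        return json.load(resp)

req = build_chat_request("What is TUG?")
print(req.full_url, req.get_method())  # http://localhost:8000/api/v1/chat POST
```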

📘 Full API reference: 👉 https://theultimaterag.vercel.app/


🤝 Contributing

Contributions are welcome and encouraged 🚀

  1. Fork the repository
  2. Create a feature branch
  3. Commit your changes
  4. Open a Pull Request

See CONTRIBUTING.md for guidelines.

🎓 Learning & Documentation

📘 Tutorials, guides, and full documentation: 👉 https://theultimaterag.vercel.app/

Built with ❤️ by Matrixxboy. Empowering real-world RAG systems.
