ChatMyDocs 🗂️🤖 — Local RAG Chatbot

Badges: License MIT · Python · FastAPI · Node · Vite · ChromaDB · CI

ChatMyDocs lets you upload PDFs/DOCX/TXT and chat with them locally. No cloud LLM required — it runs against Ollama (local LLM + embeddings), LangChain, ChromaDB, FastAPI, and a clean React + Vite + Chakra UI front‑end.

🧑‍💻 Non-technical? Think of it like “Ctrl+F on steroids” for your own documents — private and on your machine.


✨ Features

  • Upload PDF / DOCX / TXT and ask questions about them
  • Local inference with Ollama (e.g. llama3, mxbai-embed-large)
  • RAG pipeline (retrieve relevant chunks → grounded answers)
  • Sleek chat UI (typing indicator, right‑aligned user bubbles, empty state for file picker)
  • One-click Reset corpus and re‑ingest
  • Dev-friendly: FastAPI, LangChain, Chroma, Chakra UI
  • CI: Ruff + Pytest + ESLint + Typecheck on GitHub Actions

🧭 Demo

Add a GIF or screenshots here after you run it locally.

Screenshots to include: empty state · selecting files · chatting with docs.


🧱 Architecture (high level)

React (Vite + Chakra)  ─┬── Chat UI / Upload / History
                        │
                        └── calls REST
FastAPI ────────────────────────────────────────────────────────────
  /ingest  -> save uploads → loaders → splitter → Chroma.add_documents (embeds via Ollama)
  /query   -> retriever (Chroma) → LLM (Ollama) via LangChain → answer + sources
  /stats   -> count vectors
  /reset   -> delete & recreate collection + refresh chain

Local services
  Ollama  : LLM (e.g. llama3) + Embeddings (mxbai-embed-large)
  Chroma  : on-disk vector store (./backend/chroma_store)
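
To make the ingest and query paths concrete, here is a minimal sketch of the same flow in LangChain. It is illustrative only: the package names (langchain_ollama, langchain_chroma, langchain_text_splitters) and parameters mirror the defaults documented below, not necessarily the repo's actual module layout.

# Minimal sketch of the ingest/query flow above; not the repo's actual code.
from langchain_ollama import ChatOllama, OllamaEmbeddings
from langchain_chroma import Chroma
from langchain_community.document_loaders import PyPDFLoader
from langchain_text_splitters import RecursiveCharacterTextSplitter

embeddings = OllamaEmbeddings(model="mxbai-embed-large", base_url="http://localhost:11434")
store = Chroma(collection_name="chatmydocs",
               embedding_function=embeddings,
               persist_directory="chroma_store")

# /ingest: load -> split -> embed & store in Chroma
chunks = RecursiveCharacterTextSplitter(chunk_size=800, chunk_overlap=100) \
    .split_documents(PyPDFLoader("example.pdf").load())
store.add_documents(chunks)

# /query: retrieve top-k chunks, then ask the local LLM a grounded question
retriever = store.as_retriever(search_kwargs={"k": 4})
question = "What does the document conclude?"
context = "\n\n".join(d.page_content for d in retriever.invoke(question))
llm = ChatOllama(model="llama3", base_url="http://localhost:11434")
print(llm.invoke(f"Answer from this context only:\n{context}\n\nQ: {question}").content)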

🚀 Quickstart (Windows)

0) Prereqs

  • Python 3.12
  • Node 20+ (or latest LTS)
  • Ollama installed & running: https://ollama.com/
  • Pull models (first time):
ollama pull llama3
ollama pull mxbai-embed-large
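
If you want to confirm both models are available before starting the backend, Ollama's local REST API lists pulled models at /api/tags. A small Python check (assumes the requests package is installed):

# Sanity check: list models known to the local Ollama server.
import requests

tags = requests.get("http://localhost:11434/api/tags", timeout=5).json()
print([m["name"] for m in tags.get("models", [])])
# Expect entries like "llama3:latest" and "mxbai-embed-large:latest".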

1) Clone

git clone https://github.com/thereds11/ChatMyDocs.git
cd ChatMyDocs

2) Backend (FastAPI)

cd backend
python -m venv .venv
.\.venv\Scripts\activate
pip install -r requirements.txt

(optional) Configure — create backend\.env to override defaults:

BASE_URL=http://localhost:11434
LLM_MODEL=llama3
EMBED_MODEL=mxbai-embed-large
PERSIST_DIR=chroma_store
COLLECTION_NAME=chatmydocs
CHUNK_SIZE=800
CHUNK_OVERLAP=100
TOP_K=4
ALLOWED_ORIGINS=["http://localhost:8001","http://127.0.0.1:8001"]
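
For reference, these keys map naturally onto a pydantic-settings class. A minimal sketch follows; the field names are assumed from the .env keys above, and the repo's actual settings module may differ.

# Illustrative settings class; env keys above match fields case-insensitively.
from pydantic_settings import BaseSettings, SettingsConfigDict

class Settings(BaseSettings):
    model_config = SettingsConfigDict(env_file=".env")

    base_url: str = "http://localhost:11434"
    llm_model: str = "llama3"
    embed_model: str = "mxbai-embed-large"
    persist_dir: str = "chroma_store"
    collection_name: str = "chatmydocs"
    chunk_size: int = 800
    chunk_overlap: int = 100
    top_k: int = 4
    # A JSON-style list in .env parses into a Python list
    allowed_origins: list[str] = ["http://localhost:8001", "http://127.0.0.1:8001"]

settings = Settings()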

Run:

uvicorn app.main:app --reload --port 8000

Health check:

curl http://localhost:8000/health

3) Frontend (Vite + React)

cd ..\frontend
npm i

Create frontend\.env.local:

VITE_API_URL=http://localhost:8000

Run:

npm run dev

Open: http://localhost:8001/ (or the port Vite prints)

4) Use it

  1. On first load, you’ll see Get started (file picker centered).
  2. Choose files → Confirm → wait for ingest (first run may take longer while models warm up).
  3. You’ll get a bot message: “I’m ready to answer…”
  4. Ask away. Sources appear under the transcript.

🧪 Developer Experience

Lint / Format / Test (backend)

cd backend
.\.venv\Scripts\activate
.\run_lint.bat --fix
.\test.bat

Lint / Typecheck / Build (frontend)

cd frontend
npm run lint
npm run typecheck
npm run build

CI runs both workflows on push/PR: see Actions tab.


🔌 API (for tinkerers)

Method  Path      Body                           Returns
GET     /health   (none)                         { ok: true }
GET     /stats    (none)                         { documents: number }
POST    /ingest   multipart/form-data (files)    { added_chunks, total_documents }
POST    /query    { "question": "..." }          { answer: string, sources: [] }
POST    /reset    (none)                         { ok: true, message: "..." }
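
If you prefer scripting the API over the UI, here is a quick Python session against the endpoints above (assumes the requests package; the multipart field name "files" follows the table):

import requests

API = "http://localhost:8000"

# Ingest one or more files (multipart field name: "files")
with open("report.pdf", "rb") as f:
    resp = requests.post(f"{API}/ingest",
                         files=[("files", ("report.pdf", f, "application/pdf"))],
                         timeout=300)
print(resp.json())  # {"added_chunks": ..., "total_documents": ...}

# Ask a question and print the grounded answer plus its sources
resp = requests.post(f"{API}/query", json={"question": "Summarize the report."}, timeout=300)
print(resp.json()["answer"])
print(resp.json()["sources"])

# Wipe the corpus and start over
print(requests.post(f"{API}/reset", timeout=60).json())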

🛠️ Troubleshooting (Windows tips)

  • Ingest hangs on first run: the embedding model loads lazily, so the first /ingest can be slow. The frontend request timeout was bumped and the backend logs progress; watch the terminal for "/ingest: adding N chunks…" and let it finish.
  • Reset then ingest hangs: /reset rebuilds the Chroma instance and refreshes the chain (sketched below). If the server was killed mid‑ingest, stop and restart the backend.
  • "Model not found": run ollama list; if llama3 or mxbai-embed-large is missing, run ollama pull <name>.
  • CORS errors: change ALLOWED_ORIGINS in backend/.env and restart the backend.
  • Chroma lock on Windows: avoid deleting chroma_store while the server is running.
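
For the curious, the reset path described above roughly amounts to dropping the Chroma collection, recreating it empty, and pointing the chain at the fresh store. A simplified sketch, assuming the langchain_chroma wrapper; the repo's actual handler may differ:

# Simplified sketch of what /reset does; not the repo's literal code.
from langchain_chroma import Chroma
from langchain_ollama import OllamaEmbeddings

embeddings = OllamaEmbeddings(model="mxbai-embed-large", base_url="http://localhost:11434")
store = Chroma(collection_name="chatmydocs", embedding_function=embeddings,
               persist_directory="chroma_store")

store.delete_collection()                                 # drop every vector
store = Chroma(collection_name="chatmydocs",              # recreate the collection empty
               embedding_function=embeddings,
               persist_directory="chroma_store")
retriever = store.as_retriever(search_kwargs={"k": 4})    # refresh the chain's retriever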

🧭 Roadmap

  • Streaming responses (token‑by‑token)
  • Source preview (page snippets)
  • Conversation history (local)
  • Multi‑doc collections / named workspaces
  • Light Docker image (optional)
  • Packaging (one‑click binaries)

🤝 Contributing

PRs welcome!

Please:

  • use Conventional Commits (feat: …, fix: …, chore: …)
  • run linters & tests before pushing:
cd backend && .\.venv\Scripts\activate && .\run_lint.bat --fix && .\test.bat
cd ..\frontend && npm run lint && npm run typecheck

📜 License

Released under the MIT License.

Copyright © 2025 Your Name

See LICENSE for the full text.

🙏 Acknowledgements
