End-to-end documentation to set up your own local & fully private LLM server on Debian. Equipped with chat, web search, RAG, model management, MCP servers, image generation, and TTS.
-
Updated
Mar 2, 2026
End-to-end documentation to set up your own local & fully private LLM server on Debian. Equipped with chat, web search, RAG, model management, MCP servers, image generation, and TTS.
Send text from browser to Kokoro-FastAPI for TTS generation
Text-to-speech CLI tool that uses the Kokoro model for inference. Runs extremely fast locally with or without a GPU. Render smooth speech faster than real-time on most machines. Use Kokoro from CLI or the FastAPI webserver via HTTP requests or directly in the browser. Supports audio playback from the CLI, web interface, or download in many formats.
Kokoro-FastAPI为基础,支持hexgrad/Kokoro-82M-v1.1-zh模型,优化中英文混读,中文语音更自然,支持 Docker 自动化化部署,支持 NVIDIA GPU 加速和 CPU 推理两种运行模式
Add a description, image, and links to the kokoro-fastapi topic page so that developers can more easily learn about it.
To associate your repository with the kokoro-fastapi topic, visit your repo's landing page and select "manage topics."