Super STT enables effortless voice-to-text in any application, using the most advanced speech models that run 100% locally.
-
Updated
Jan 13, 2026 - Rust
Super STT enables effortless voice-to-text in any application, using the most advanced speech models that run 100% locally.
Voxtral is a state-of-the-art model developed to handle both speech transcription and audio understanding with remarkable accuracy and efficiency. This demo interface lets you run the Voxtral model on powerful GPUs to evaluate its performance and see how it can be used for transcription and deeper analysis.
A Web UI for easy subtitle using various models including voxtral
Effortless Push-to-Talk Transcription, Anywhere.
Offline Speech-to-Text (STT) service using Mistral's Voxtral model with Wyoming protocol compatibility for Home Assistant Assist integration.
speech to text gui for different (mostly Whisper, also Voxtral) models and backends, including whisper.cpp, mlx-whisper, faster-whisper, ctranslate2; applies pyannote for diarization
Voice note taking utility that uses cloud audio multimodal models for single pass transcription and text cleanup
Self-hosted AI Suite, GDPR-compliant, featuring Multi-LLM Chat (Mistral, Nebius, etc), Audio Transcription (Gladia, Deepgram, AssemblyAI, Voxtral, Whisper, etc), Image Gen (Flux, etc), Cloud Storage integration, etc
Mistral Voxtral plugin for the cjm-transcription-plugin-system library - provides local speech-to-text transcription through vLLM with configurable model selection and parameter control.
Mistral Voxtral plugin for the cjm-transcription-plugin-system library - provides local speech-to-text transcription through 🤗 Transformers with configurable model selection and parameter control.
Add a description, image, and links to the voxtral topic page so that developers can more easily learn about it.
To associate your repository with the voxtral topic, visit your repo's landing page and select "manage topics."