VIBEVOICE
// Open-Source Frontier Voice AI
VibeVoice
Open-Source Frontier Voice AI
13EmergingUnknown
What it does
π° News 2026-01-21: π£ We open-sourced VibeVoice-ASR, a unified speech-to-text model designed to handle 60-minute long-form audio in a single pass, generating structured transcriptions containing Who (Speaker), When (Timestamps), and What (Content), with support for User-Customized Context. Try it in Playground. - βοΈ VibeVoice-ASR is natively multilingual, supporting over 50 languages β check the
Getting Started
git
git clone https://github.com/microsoft/VibeVoice