T
ToolShelf
VIBEVOICE
// Open-Source Frontier Voice AI

VibeVoice

Open-Source Frontier Voice AI

13EmergingUnknown
License
MIT
Updated
Today

What it does

πŸ“° News 2026-01-21: πŸ“£ We open-sourced VibeVoice-ASR, a unified speech-to-text model designed to handle 60-minute long-form audio in a single pass, generating structured transcriptions containing Who (Speaker), When (Timestamps), and What (Content), with support for User-Customized Context. Try it in Playground. - ⭐️ VibeVoice-ASR is natively multilingual, supporting over 50 languages β€” check the

Getting Started

git
git clone https://github.com/microsoft/VibeVoice

Platforms

πŸͺŸwindows🍎mac🐧linux

Install Difficulty

moderate

Built With

python

Community Reactions