VibeVoice

Open-Source Frontier Voice AI

13 · Emerging · Unknown

License

MIT

Updated

1mo ago

What it does

📰 News 2026-01-21: 📣 We open-sourced VibeVoice-ASR, a unified speech-to-text model designed to handle 60-minute long-form audio in a single pass, generating structured transcriptions containing Who (Speaker), When (Timestamps), and What (Content), with support for User-Customized Context. Try it in Playground.

- ⭐️ VibeVoice-ASR is natively multilingual, supporting over 50 languages; check the...
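The Who / When / What structure described above can be pictured with a small sketch. This is only an illustration under stated assumptions: the `Segment` class, its field names, and the helper functions are hypothetical and are not taken from VibeVoice-ASR, whose actual output format is not shown on this page. The sample data is made up.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One transcribed utterance (hypothetical schema, not VibeVoice-ASR's real output)."""
    speaker: str    # Who
    start_s: float  # When: start time in seconds
    end_s: float    # When: end time in seconds
    text: str       # What


def format_timestamp(seconds: float) -> str:
    """Render seconds as MM:SS, truncating fractional seconds."""
    minutes, secs = divmod(int(seconds), 60)
    return f"{minutes:02d}:{secs:02d}"


def render(segments: list[Segment]) -> str:
    """Return a readable transcript, one line per utterance, ordered by start time."""
    lines = []
    for seg in sorted(segments, key=lambda s: s.start_s):
        span = f"{format_timestamp(seg.start_s)}-{format_timestamp(seg.end_s)}"
        lines.append(f"[{span}] {seg.speaker}: {seg.text}")
    return "\n".join(lines)


if __name__ == "__main__":
    # Made-up example data, for illustration only.
    demo = [
        Segment("Speaker 2", 12.0, 15.5, "Thanks, glad to be here."),
        Segment("Speaker 1", 0.0, 11.2, "Welcome to the show."),
    ]
    print(render(demo))
```

Keeping speaker, time span, and text together in one record makes it straightforward to sort, filter by speaker, or export to subtitle-style formats later.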

Getting Started

git

git clone https://github.com/microsoft/VibeVoice

Platforms

🪟 Windows · 🍎 Mac · 🐧 Linux

Install Difficulty

moderate

Built With

python

Similar Tools

36 · Emerging

Mcpproxy Go

Supercharge AI Agents, Safely

Active · 123 · open-source

36 · Emerging

Routilux

Routines-based, event-driven workflow orchestration for Python—compose complex data/AI pipelines and run concurrent w...

Active · 153 · open-source

36 · Emerging

Flowfeat

FlowFeat: Pixel-Dense Embedding of Motion Profiles (NeurIPS 2025 Spotlight)

Active · 111 · open-source

36 · Emerging

Csv Ai Analyzer

A self-hosted, browser-based AI CSV analyzer

Active · 71 · open-source

36 · Emerging

Llumen

🕯️ A lightweight but powerful LLM chat application

Active · 80 · open-source

33 · Emerging

Zen7 Payment Agent

Zen7 Payment Agent is the first implementation project of DePA (Decentralized Payment Agent), pioneers next-generatio...

Active · 171 · open-source