T
ToolShelf
DIA2
// TTS model capable of streaming conversational audio in realtime.

Dia2

TTS model capable of streaming conversational audio in realtime.

13EmergingUnknown
License
Apache-2.0
Updated
Today

What it does

Dia2 is a streaming dialogue TTS model created by Nari Labs. The model does not need the entire text to produce the audio, and can start generating as the first few words are given as input. You can condition the output on audio, enabling natural conversations in realtime. We provide model checkpoints (1B, 2B) and inference code to accelerate research. The model only supports up to 2 minutes of

Getting Started

git
git clone https://github.com/nari-labs/dia2

Platforms

🪟windows🍎mac🐧linux

Install Difficulty

moderate

Built With

python

Community Reactions