Dia2
TTS model capable of streaming conversational audio in real time.
What it does
Dia2 is a streaming dialogue TTS model created by Nari Labs. The model does not need the entire text before it produces audio: it can start generating as soon as the first few words are given as input. You can also condition the output on audio, which enables natural conversations in real time. We provide model checkpoints (1B, 2B) and inference code to accelerate research. The model only supports up to 2 minutes of
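To make the streaming idea concrete, here is a minimal toy sketch in Python. It is not Dia2's API or architecture: `stream_tts`, `fake_synthesize`, `lookahead`, and `SAMPLES_PER_WORD` are all hypothetical names invented for illustration, and the "model" just returns silence. The only point it shows is that audio chunks can be emitted after a few words have arrived, without waiting for the full text.

```python
from collections import deque
from typing import Deque, Iterable, Iterator, List

SAMPLES_PER_WORD = 4_000  # hypothetical chunk size, not a Dia2 parameter


def fake_synthesize(word: str) -> List[float]:
    """Stand-in for a model forward pass: returns a chunk of silent samples."""
    return [0.0] * SAMPLES_PER_WORD


def stream_tts(words: Iterable[str], lookahead: int = 2) -> Iterator[List[float]]:
    """Yield one audio chunk per word once `lookahead` words are buffered.

    Hypothetical sketch: output starts after the first few words arrive,
    rather than after the whole text has been read.
    """
    buffer: Deque[str] = deque()
    for word in words:
        buffer.append(word)
        if len(buffer) >= lookahead:
            # Enough context is buffered: emit audio for the oldest word.
            yield fake_synthesize(buffer.popleft())
    # Input is exhausted: flush the words still waiting in the buffer.
    while buffer:
        yield fake_synthesize(buffer.popleft())


if __name__ == "__main__":
    text = "hello there how are you".split()
    for i, chunk in enumerate(stream_tts(text)):
        print(f"chunk {i}: {len(chunk)} samples")
```

With `lookahead=2`, the first chunk is produced after only two words have been read from the input, which is the behavior the description above refers to.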
Getting Started
git clone https://github.com/nari-labs/dia2