ToolShelf

LinaCodec

A highly compressive and high-quality neural audio codec for speech models.

13 · Emerging · Unknown
Updated: Today

What it does

LinaCodec is an audio tokenizer that compresses audio to just 12.5 tokens per second (171 bps) and decodes to 48 kHz audio.

https://github.com/user-attachments/assets/77094afd-2d5d-490e-b283-9100e74a69a4

Compression: 12.5 tokens/sec (60x more compressed than DAC).
Audio Quality: 48 kHz output (much clearer than the standard 16 kHz or 24 kHz).
Encoder Speed: 200x realtime.
Decoder Speed: 400x
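To put the headline numbers in context, here is a minimal Python sketch of the arithmetic behind them. It uses only the figures quoted in this listing (12.5 tokens/sec, 171 bps, 200x realtime encoding) and does not call LinaCodec itself. The function names are hypothetical, and rounding the token count up to a whole token is an assumption; the actual codec may pad or truncate differently.

```python
import math

# Figures quoted in the listing above.
TOKENS_PER_SECOND = 12.5   # token rate
BITRATE_BPS = 171.0        # stated bitrate in bits per second
ENCODER_RTF = 200.0        # "200x realtime" encoder speed


def token_count(duration_s: float) -> int:
    """Estimated tokens for a clip of the given length.

    Rounding up to a whole token is an assumption, not documented behavior.
    """
    return math.ceil(duration_s * TOKENS_PER_SECOND)


def bits_per_token() -> float:
    """Average bits carried by each token, implied by the two quoted rates."""
    return BITRATE_BPS / TOKENS_PER_SECOND


def encode_time_s(duration_s: float) -> float:
    """Estimated wall-clock encode time at the quoted realtime factor."""
    return duration_s / ENCODER_RTF


if __name__ == "__main__":
    clip_s = 60.0
    print(token_count(clip_s))          # 750
    print(round(bits_per_token(), 2))   # 13.68
    print(encode_time_s(clip_s))        # 0.3
```

By these figures, a one-minute clip becomes about 750 tokens, each carrying roughly 13.68 bits on average, and would take about 0.3 seconds to encode at the quoted speed.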

Getting Started

git clone https://github.com/ysharma3501/LinaCodec

Platforms

🪟 Windows, 🍎 macOS, 🐧 Linux

Install Difficulty

moderate

Built With

python

Community Reactions