Media & Design Tools

13 Emerging

LongLive

LongLive: Real-time Interactive Long Video Generation

Unknown open-source

🪟🍎🐧

13 Emerging

LuxTTS

A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.

Unknown open-source

🪟🍎🐧

13 Emerging

Audioghost Ai

Extract any sound with text prompts. Memory-optimized SAM-Audio with modern UI.

Unknown open-source

🪟🍎🐧

13 Emerging

Oxdraw

Diagram-as-code tool written in Rust with draggable editing

Unknown open-source

🪟🍎🐧

13 Emerging

Thinking With Video

We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThi...

Unknown open-source

🪟🍎🐧

13 Emerging

Open Agent Builder

🔥 Visual workflow builder for AI agents powered by Firecrawl - drag-and-drop web scraping pipelines with real-time e...

Unknown open-source

🪟🍎🐧

13 Emerging

openreel-video

OpenReel Video - Professional browser-based video editor. Open source CapCut alternative. 100% browser-based, no inst...

Unknown open-source

🪟🍎🐧

13 Emerging

Tambourine Voice

Your personal voice interface into any app. Speak naturally and your words appear wherever your cursor is, with fully...

Unknown open-source

🪟🍎🐧

13 Emerging

Vipe

ViPE: Video Pose Engine for Geometric 3D Perception

Unknown open-source

🪟🍎🐧

13 Emerging

Fun Audio Chat

Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.

Unknown open-source

🪟🍎🐧

13 Emerging

8mb.local

A free, local, self-hosted video compressor web UI designed for performance and ease of use. Inspired by 8mb.video.

Unknown open-source

🪟🍎🐧

13 Emerging

Glyph

Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"

Unknown open-source

🪟🍎🐧

13 Emerging

Sticker Dream

Voice-activated sticker dreamer and printer.

Unknown open-source

🪟🍎🐧

13 Emerging

Skyfall GS

Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery

Unknown open-source

🪟🍎🐧

13 Emerging

React Glass Keep

Glass Keep is a Keep Notes alternative with a glass design. Made with React + Tailwind.

Unknown open-source

🪟🍎🐧

13 Emerging

LTX-2

Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.

Unknown open-source

🪟🍎🐧

13 Emerging

Mdserve

Fast markdown preview server with live reload and theme support.

Unknown open-source

🪟🍎🐧

13 Emerging

HoloCine

Official Implementations for Paper - HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives

Unknown open-source

🪟🍎🐧

13 Emerging

Video As Prompt

Official repo for paper "Video-As-Prompt: Unified Semantic Control for Video Generation"

Unknown open-source

🪟🍎🐧

13 Emerging

Qwen3-ASR

Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multi...

Unknown open-source

🪟🍎🐧

13 Emerging

MoCha

MoCha: End-to-End Video Character Replacement without Structural Guidance

Unknown open-source

🪟🍎🐧

13 Emerging

Diffusion Gpt

From baby GPT to diffusion GPT: An annotated implementation of a character-level discrete diffusion model (adapted fr...

Unknown open-source

🪟🍎🐧

13 Emerging

Bolna

Conversational voice AI agents

Unknown open-source

🪟🍎🐧

13 Emerging

Wan Alpha

High-Quality Text-to-Video Generation with Alpha Channel

Unknown open-source

🪟🍎🐧
