Spark Audio

community

SparkAudio

AI & ML interests

Audio Generation and Understanding.

SparkAudio's activity

lmxue

authored a paper 2 days ago

Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens

Paper • 2503.01710 • Published 11 days ago • 3

Xinsheng-Wang

authored a paper 2 days ago

Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens

Paper • 2503.01710 • Published 11 days ago • 3

Xinsheng-Wang

updated a model 7 days ago

SparkAudio/Spark-TTS-0.5B

Text-to-Speech • Updated 7 days ago • 8.91k • 401

Xinsheng-Wang

published a model 16 days ago

SparkAudio/Spark-TTS-0.5B

Text-to-Speech • Updated 7 days ago • 8.91k • 401

lmxue

authored a paper 17 days ago

Audio-FLAN: A Preliminary Release

Paper • 2502.16584 • Published 19 days ago • 34

lmxue

authored 2 papers about 1 year ago

ChatMusician: Understanding and Generating Music Intrinsically with LLM

Paper • 2402.16153 • Published Feb 25, 2024 • 60

Amphion: An Open-Source Audio, Music and Speech Generation Toolkit

Paper • 2312.09911 • Published Dec 15, 2023 • 55