FunAudioLLM

Popular repositories

  1. CosyVoice Public

    Multilingual large voice generation model with full-stack support for inference, training, and deployment.

    Python · 19.3k stars · 2.2k forks

  2. SenseVoice Public

    Multilingual Voice Understanding Model

    Python · 7.4k stars · 690 forks

  3. FunMusic Public

    A fundamental toolkit designed for music, song, and audio generation

    Python · 1.3k stars · 131 forks

  4. ThinkSound Public

    [NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.

    Python · 1.1k stars · 67 forks

  5. Fun-ASR Public

    Fun-ASR is a large end-to-end speech recognition model launched by Tongyi Lab.

    Python · 799 stars · 61 forks

  6. Fun-Audio-Chat Public

    Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.

    Python · 711 stars · 69 forks

Repositories

Showing 10 of 12 repositories
  • Fun-ASR Public

    Fun-ASR is a large end-to-end speech recognition model launched by Tongyi Lab.

    Python · 799 stars · Apache-2.0 · 61 forks · 40 issues · 0 pull requests · Updated Jan 25, 2026
  • FunResearch Public

    This repository is maintained by the Speech Team at Alibaba’s Tongyi Lab and serves as an open-source platform for our cutting-edge research in speech, audio, and NLP technologies. We believe in accelerating scientific progress through transparent collaboration, and we invite the global research community to explore, reproduce, and build upon our work.

    Python · 16 stars · Apache-2.0 · 1 fork · 0 issues · 0 pull requests · Updated Jan 25, 2026
  • Fun-Audio-Chat Public

    Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.

    Python · 711 stars · Apache-2.0 · 69 forks · 8 issues · 2 pull requests · Updated Jan 22, 2026
  • FunAudioLLM.github.io Public
    HTML · 56 stars · MIT · 10 forks · 0 issues · 1 pull request · Updated Jan 21, 2026
  • CosyVoice Public

    Multilingual large voice generation model with full-stack support for inference, training, and deployment.

    Python · 19,320 stars · Apache-2.0 · 2,162 forks · 858 issues · 15 pull requests · Updated Jan 19, 2026
  • MME-Emotion Public

    Official repository for the paper “MME-Emotion: A Holistic Evaluation Benchmark for Emotional Intelligence in Multimodal Large Language Models”

    Python · 19 stars · MIT · 2 forks · 1 issue · 0 pull requests · Updated Jan 17, 2026
  • SenseVoice Public

    Multilingual Voice Understanding Model

    Python · 7,428 stars · 690 forks · 169 issues · 4 pull requests · Updated Dec 30, 2025
  • ThinkSound Public

    [NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.

    Python · 1,141 stars · 67 forks · 32 issues · 1 pull request · Updated Nov 25, 2025
  • CV3-Eval Public
    Python · 171 stars · Apache-2.0 · 14 forks · 7 issues · 0 pull requests · Updated Aug 25, 2025
  • OmniAudio Public
    Python · 8 stars · 3 forks · 0 issues · 0 pull requests · Updated May 21, 2025