Skip to content
View ddlBoJack's full-sized avatar

Highlights

  • Pro

Block or report ddlBoJack

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ddlBoJack/README.md

Hi there 👋

Ziyang's GitHub stats

Pinned Loading

  1. X-LANCE/SLAM-LLM X-LANCE/SLAM-LLM Public

    A Framework for Speech, Language, Audio, Music Processing with Large Language Model

    Python 1k 116

  2. Omni-Captioner Omni-Captioner Public

    [ICLR 2026] Data Pipeline, Models, and Benchmark for Omni-Captioner.

    Python 138

  3. MMAR MMAR Public

    [NeurIPS 2025] Benchmark data and code for MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix

    Python 213 5

  4. MMAE MMAE Public

    MMAE: A Massive Multitask Audio Editing Benchmark

    Python 95 4

  5. emotion2vec emotion2vec Public

    [ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

    Python 1.1k 88

  6. MT4SSL MT4SSL Public

    [INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by Integrating Multiple Targets

    Python 45 4