High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
-
Updated
Oct 28, 2025 - Python
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
A collection of resources and papers on Diffusion Models
HunyuanVideo: A Systematic Framework For Large Video Generation Model
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
Official repository for LTX-Video
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
A curated list of recent diffusion models for video generation, editing, and various other applications.
Taming Stable Diffusion for Lip Sync!
The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU clusters.
《Pytorch实用教程》(第二版)无论是零基础入门,还是CV、NLP、LLM项目应用,或是进阶工程化部署落地,在这里都有。相信在本书的帮助下,读者将能够轻松掌握 PyTorch 的使用,成为一名优秀的深度学习工程师。
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
MAGI-1: Autoregressive Video Generation at Scale
Diffusion model papers, survey, and taxonomy
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
Add a description, image, and links to the diffusion-models topic page so that developers can more easily learn about it.
To associate your repository with the diffusion-models topic, visit your repo's landing page and select "manage topics."