// repositories (100)

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

0
0
Updated 2mo ago

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

0
0
Updated 2mo ago

SDXL implementation of AnimateDiff.

1
0
Updated 2mo ago

Tooll 3 is an open source software to create realtime motion graphics.

1
0
Updated 3mo ago

🎨 GPT for video generation ⚡️

2
0
Updated 5mo ago

Isoflow Diagram as Code and AI Integration to build diagram as code using AI

0
0
Updated 6mo ago
0
0
Updated 6mo ago

Model components of the Llama Stack APIs

0
0
Updated 7mo ago

A Unity MCP server that allows MCP clients like Claude Desktop or Cursor to perform Unity Editor actions.

0
0
Updated 7mo ago

[T-PAMI 2025] Official implementation for "SVGDreamer++: Advancing Editability and Diversity in Text-Guided SVG Generation" https://arxiv.org/abs/2411.17832

0
0
Updated 8mo ago

Converts raster images into SVG in ComfyUI.

0
0
Updated 8mo ago
0
0
Updated 8mo ago

Bridge between ComfyUI and blender ComfyUI-BlenderAI-node addon - Advance Nodes and English Translations.

0
0
Updated 8mo ago

Used for AI model generation, next-generation Blender rendering engine, texture enhancement&generation (based on ComfyUI)

0
0
Updated 8mo ago

An open protocol enabling communication and interoperability between opaque agentic applications.

0
0
Updated 9mo ago

Development repository for the Triton language and compiler

0
0
Updated 9mo ago
0
0
Updated 9mo ago

Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment

0
0
Updated 9mo ago

HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation

0
0
Updated 9mo ago
0
0
Updated 9mo ago

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

0
0
Updated 10mo ago
0
0
Updated 10mo ago

HunyuanVideo: A Systematic Framework For Large Video Generation Model

0
0
Updated 10mo ago

gradio WebUI for AdvancedLivePortrait

0
0
Updated 11mo ago

:fire: 2D and 3D Face alignment library build using pytorch

0
0
Updated 11mo ago

Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis

0
0
Updated 11mo ago

[CVPR 2024] Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution

0
0
Updated 11mo ago

Used for AI model generation, next-generation Blender rendering engine, texture enhancement&generation (based on ComfyUI)

0
0
Updated 11mo ago

Official extension for Blender

0
0
Updated 11mo ago

Text to 4D Worlds in Blender

0
0
Updated 11mo ago
0
0
Updated 11mo ago

Use AI Agents directly in Blender.

0
0
Updated 11mo ago

Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"

0
0
Updated 11mo ago

UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer

0
0
Updated 11mo ago
0
0
Updated 11mo ago

This extension integrates ByteDance's UNO-FLUX model into ComfyUI, allowing you to use UNO's powerful text-to-image generation with reference capabilities.

0
0
Updated 11mo ago
JavaScript
0
0
Updated 11mo ago

Better than SHAP for Keyword Importance

Python
0
0
Updated 11mo ago

Creates prompts for Video Models by sequence analysis and prompting using Qwen2.5-VL models from Alibaba.

0
0
Updated 11mo ago

Official inference repo for FLUX.1 models

0
0
Updated 1y ago

Rectified Flow Inversion (RF-Inversion) - ICLR 2025

0
0
Updated 1y ago

Nodes for image juxtaposition for Flux in ComfyUI

0
0
Updated 1y ago
0
0
Updated 1y ago

Official inference repo for FLUX.1 models

0
0
Updated 1y ago

Taming FLUX for Image Inversion & Editing; OpenSora for Video Inversion & Editing! (Official implementation for Taming Rectified Flow for Inversion and Editing.)

0
0
Updated 1y ago

Flow is a custom node designed to provide a user-friendly interface for ComfyUI.

JavaScript
0
0
Updated 1y ago

LLM inference in C/C++

0
0
Updated 1y ago

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

0
0
Updated 1y ago

A Web UI simplify the AI videos generation using Hunyuan Video Diffusion Model

0
0
Updated 1y ago

FastVideo is an open-source framework for accelerating large video diffusion model.

0
0
Updated 1y ago

A pipeline parallel training script for diffusion models.

0
0
Updated 1y ago
0
0
Updated 1y ago
0
0
Updated 1y ago

Image composition toolbox: everything you want to know about image composition or object insertion

0
0
Updated 1y ago

[ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion

0
0
Updated 1y ago

real time face swap and one-click video deepfake with only a single image

0
0
Updated 1y ago

pix2pix3D: Generating 3D Objects from 2D User Inputs

0
0
Updated 1y ago

Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.

0
0
Updated 1y ago

Bring portraits to life!

0
0
Updated 1y ago

Various AI scripts. Mostly Stable Diffusion stuff.

0
0
Updated 1y ago

ComfyUI nodes for LivePortrait

0
0
Updated 1y ago

Select a portrait, click to move the head around (please use your own space / GPU!)

0
0
Updated 1y ago

EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

0
0
Updated 1y ago

AI Photo Editing with Inpainting

0
0
Updated 1y ago

AI-Powered Photo Editor (Python, PyQt6, PyTorch)

0
0
Updated 1y ago

A web app that allows you to select a subject and then change its background, OR keep the background and change the subject.

0
0
Updated 1y ago
0
0
Updated 1y ago
0
0
Updated 1y ago

[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation

0
0
Updated 1y ago

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

0
0
Updated 1y ago

Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.

0
0
Updated 1y ago
0
0
Updated 1y ago

Prompt, run, edit, and deploy full-stack web applications using any LLM you want!

0
0
Updated 1y ago

A simple and easy-to-use fx sounds generator

0
0
Updated 1y ago

[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

0
0
Updated 1y ago

Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models

0
0
Updated 1y ago

Demo for NVIDIA's Fewshot Vid2vid

0
0
Updated 1y ago

Automatic1111 Stable Diffusion WebUI Video Extension

0
0
Updated 1y ago

Pytorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.

0
0
Updated 1y ago

Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).

0
0
Updated 1y ago

Text-to-Song: Towards Controllable Music Generation Incorporating Vocal and Accompaniment

0
0
Updated 1y ago

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

0
0
Updated 1y ago

Turn your words into music! Describe a sound (e.g., happy, spooky) and this app generates a short piece based on your text.

0
0
Updated 1y ago

Audio Prompt Adapter: Unleashing music editing abilities for text-to-music with lightweight finetuning [ISMIR 2024]

0
0
Updated 1y ago

Code for Investigating Personalization Methods in Text to Music Generation

0
1
Updated 1y ago

some generative audio tools for ComfyUI

0
0
Updated 1y ago

Mustango: Toward Controllable Text-to-Music Generation

0
0
Updated 1y ago

Text-to-Audio/Music Generation

0
0
Updated 1y ago

Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech (Edge-TTS, F5-TTS), and Translation.

0
0
Updated 1y ago

Automatically generate and overlay subtitles for any video.

0
0
Updated 1y ago

Code for the paper "Jukebox: A Generative Model for Music"

0
0
Updated 1y ago

A trainable PyTorch reproduction of AlphaFold 3.

0
0
Updated 1y ago

A HTML5 video player with a parser that saves traffic

0
0
Updated 1y ago

Source code for the SIGGRAPH 2024 paper "X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention"

0
0
Updated 1y ago

[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

0
0
Updated 1y ago

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

0
0
Updated 1y ago

A nearly-live implementation of OpenAI's Whisper.

0
0
Updated 1y ago

Robust Speech Recognition via Large-Scale Weak Supervision

0
0
Updated 1y ago

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.

0
0
Updated 1y ago
[beta]v0.14.0