Model components of the Llama Stack APIs

0
0
Pushed 1y ago

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

0
0
Pushed 1y ago

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

0
0
Pushed 1y ago

SDXL implementation of AnimateDiff.

1
0
Pushed 1y ago

Tooll 3 is an open source software to create realtime motion graphics.

1
0
Pushed 1y ago

🎨 GPT for video generation ⚡️

2
0
Pushed 1y ago

Isoflow Diagram as Code and AI Integration to build diagram as code using AI

0
0
Pushed 7mo ago

No description

0
0
Pushed 1y ago

A Unity MCP server that allows MCP clients like Claude Desktop or Cursor to perform Unity Editor actions.

0
0
Pushed 9mo ago

[T-PAMI 2025] Official implementation for "SVGDreamer++: Advancing Editability and Diversity in Text-Guided SVG Generation" https://arxiv.org/abs/2411.17832

0
0
Pushed 1y ago

Converts raster images into SVG in ComfyUI.

0
0
Pushed 10mo ago

No description

0
0
Pushed 11mo ago

Bridge between ComfyUI and blender ComfyUI-BlenderAI-node addon - Advance Nodes and English Translations.

0
0
Pushed 10mo ago

Used for AI model generation, next-generation Blender rendering engine, texture enhancement&generation (based on ComfyUI)

0
0
Pushed 10mo ago

An open protocol enabling communication and interoperability between opaque agentic applications.

0
0
Pushed 1y ago

Development repository for the Triton language and compiler

0
0
Pushed 11mo ago

No description

0
0
Pushed 12mo ago

Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment

0
0
Pushed 11mo ago

HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation

0
0
Pushed 11mo ago

No description

0
0
Pushed 11mo ago

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

0
0
Pushed 1y ago

No description

0
0
Pushed 1y ago

HunyuanVideo: A Systematic Framework For Large Video Generation Model

0
0
Pushed 1y ago

gradio WebUI for AdvancedLivePortrait

0
0
Pushed 1y ago

:fire: 2D and 3D Face alignment library build using pytorch

0
0
Pushed 1y ago

Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis

0
0
Pushed 1y ago

[CVPR 2024] Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution

0
0
Pushed 1y ago

Used for AI model generation, next-generation Blender rendering engine, texture enhancement&generation (based on ComfyUI)

0
0
Pushed 1y ago

Official extension for Blender

0
0
Pushed 1y ago

Text to 4D Worlds in Blender

0
0
Pushed 1y ago

No description

0
0
Pushed 1y ago

Use AI Agents directly in Blender.

0
0
Pushed 1y ago

Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"

0
0
Pushed 1y ago

UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer

0
0
Pushed 1y ago

No description

0
0
Pushed 1y ago

This extension integrates ByteDance's UNO-FLUX model into ComfyUI, allowing you to use UNO's powerful text-to-image generation with reference capabilities.

0
0
Pushed 1y ago
JavaScript
0
0
Pushed 1y ago

Better than SHAP for Keyword Importance

Python
0
0
Pushed 1y ago

Creates prompts for Video Models by sequence analysis and prompting using Qwen2.5-VL models from Alibaba.

0
0
Pushed 1y ago

Official inference repo for FLUX.1 models

0
0
Pushed 1y ago

Rectified Flow Inversion (RF-Inversion) - ICLR 2025

0
0
Pushed 1y ago

Nodes for image juxtaposition for Flux in ComfyUI

0
0
Pushed 1y ago

No description

0
0
Pushed 1y ago

Official inference repo for FLUX.1 models

0
0
Pushed 1y ago

Taming FLUX for Image Inversion & Editing; OpenSora for Video Inversion & Editing! (Official implementation for Taming Rectified Flow for Inversion and Editing.)

0
0
Pushed 1y ago

Flow is a custom node designed to provide a user-friendly interface for ComfyUI.

JavaScript
0
0
Pushed 1y ago

LLM inference in C/C++

0
0
Pushed 1y ago

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

0
0
Pushed 1y ago

A Web UI simplify the AI videos generation using Hunyuan Video Diffusion Model

0
0
Pushed 1y ago

FastVideo is an open-source framework for accelerating large video diffusion model.

0
0
Pushed 1y ago
0
0
Pushed 1y ago

A pipeline parallel training script for diffusion models.

0
0
Pushed 1y ago
0
0
Pushed 2y ago
0
0
Pushed 1y ago

Image composition toolbox: everything you want to know about image composition or object insertion

0
0
Pushed 1y ago

[ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion

0
0
Pushed 1y ago

real time face swap and one-click video deepfake with only a single image

0
0
Pushed 1y ago

pix2pix3D: Generating 3D Objects from 2D User Inputs

0
0
Pushed 2y ago

Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.

0
0
Pushed 1y ago

Bring portraits to life!

0
0
Pushed 1y ago

Various AI scripts. Mostly Stable Diffusion stuff.

0
0
Pushed 1y ago

ComfyUI nodes for LivePortrait

0
0
Pushed 1y ago

Select a portrait, click to move the head around (please use your own space / GPU!)

0
0
Pushed 1y ago

EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

0
0
Pushed 1y ago

AI Photo Editing with Inpainting

0
0
Pushed 1y ago

AI-Powered Photo Editor (Python, PyQt6, PyTorch)

0
0
Pushed 2y ago

A web app that allows you to select a subject and then change its background, OR keep the background and change the subject.

0
0
Pushed 2y ago

No description

0
0
Pushed 2y ago

No description

0
0
Pushed 1y ago

[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation

0
0
Pushed 1y ago

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

0
0
Pushed 1y ago

Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.

0
0
Pushed 1y ago

No description

0
0
Pushed 1y ago

Prompt, run, edit, and deploy full-stack web applications using any LLM you want!

0
0
Pushed 1y ago

A simple and easy-to-use fx sounds generator

0
0
Pushed 1y ago

[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

0
0
Pushed 2y ago

Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models

0
0
Pushed 2y ago

Demo for NVIDIA's Fewshot Vid2vid

0
0
Pushed 3y ago

Automatic1111 Stable Diffusion WebUI Video Extension

0
0
Pushed 1y ago

Pytorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.

0
0
Pushed 3y ago

Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).

0
0
Pushed 1y ago

Text-to-Song: Towards Controllable Music Generation Incorporating Vocal and Accompaniment

0
0
Pushed 1y ago

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

0
0
Pushed 1y ago

Turn your words into music! Describe a sound (e.g., happy, spooky) and this app generates a short piece based on your text.

0
0
Pushed 1y ago

Audio Prompt Adapter: Unleashing music editing abilities for text-to-music with lightweight finetuning [ISMIR 2024]

0
0
Pushed 1y ago

Code for Investigating Personalization Methods in Text to Music Generation

0
1
Pushed 2y ago

some generative audio tools for ComfyUI

0
0
Pushed 2y ago

Mustango: Toward Controllable Text-to-Music Generation

0
0
Pushed 1y ago

Text-to-Audio/Music Generation

0
0
Pushed 1y ago

Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech (Edge-TTS, F5-TTS), and Translation.

0
0
Pushed 1y ago

Automatically generate and overlay subtitles for any video.

0
0
Pushed 1y ago

Code for the paper "Jukebox: A Generative Model for Music"

0
0
Pushed 1y ago

A trainable PyTorch reproduction of AlphaFold 3.

0
0
Pushed 1y ago

A HTML5 video player with a parser that saves traffic

0
0
Pushed 1y ago

Source code for the SIGGRAPH 2024 paper "X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention"

0
0
Pushed 1y ago

[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

0
0
Pushed 1y ago

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

0
0
Pushed 1y ago

A nearly-live implementation of OpenAI's Whisper.

0
0
Pushed 1y ago

Robust Speech Recognition via Large-Scale Weak Supervision

0
0
Pushed 1y ago

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.

0
0
Pushed 1y ago
v0.3.3[beta]