The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
SDXL implementation of AnimateDiff.
Tooll 3 is open-source software for creating real-time motion graphics.
🎨 GPT for video generation ⚡️
Isoflow: diagram-as-code with AI integration, for building diagrams as code using AI.
Model components of the Llama Stack APIs
A Unity MCP server that allows MCP clients like Claude Desktop or Cursor to perform Unity Editor actions.
[T-PAMI 2025] Official implementation for "SVGDreamer++: Advancing Editability and Diversity in Text-Guided SVG Generation" https://arxiv.org/abs/2411.17832
Converts raster images into SVG in ComfyUI.
Bridge between ComfyUI and Blender: the ComfyUI-BlenderAI-node addon, with advanced nodes and English translations.
Used for AI model generation, a next-generation Blender rendering engine, and texture enhancement & generation (based on ComfyUI).
An open protocol enabling communication and interoperability between opaque agentic applications.
Development repository for the Triton language and compiler
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
gradio WebUI for AdvancedLivePortrait
:fire: 2D and 3D face alignment library built using PyTorch
Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis
[CVPR 2024] Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution
Official extension for Blender
Text to 4D Worlds in Blender
Use AI Agents directly in Blender.
Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"
UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer
This extension integrates ByteDance's UNO-FLUX model into ComfyUI, allowing you to use UNO's powerful text-to-image generation with reference capabilities.
hailuo automation
Better than SHAP for Keyword Importance
Creates prompts for Video Models by sequence analysis and prompting using Qwen2.5-VL models from Alibaba.
Official inference repo for FLUX.1 models
Rectified Flow Inversion (RF-Inversion) - ICLR 2025
Nodes for image juxtaposition for Flux in ComfyUI
Taming FLUX for Image Inversion & Editing; OpenSora for Video Inversion & Editing! (Official implementation for Taming Rectified Flow for Inversion and Editing.)
Flow is a custom node designed to provide a user-friendly interface for ComfyUI.
LLM inference in C/C++
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
A web UI that simplifies AI video generation using the Hunyuan video diffusion model.
FastVideo is an open-source framework for accelerating large video diffusion models.
A pipeline parallel training script for diffusion models.
Image composition toolbox: everything you want to know about image composition or object insertion
[ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion
Real-time face swap and one-click video deepfakes with only a single image.
pix2pix3D: Generating 3D Objects from 2D User Inputs
Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.
Bring portraits to life!
Various AI scripts. Mostly Stable Diffusion stuff.
ComfyUI nodes for LivePortrait
Select a portrait, click to move the head around (please use your own space / GPU!)
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
AI Photo Editing with Inpainting
AI-Powered Photo Editor (Python, PyQt6, PyTorch)
A web app that allows you to select a subject and then change its background, OR keep the background and change the subject.
[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
Image inpainting tool powered by a SOTA AI model. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
Prompt, run, edit, and deploy full-stack web applications using any LLM you want!
A simple and easy-to-use FX sound generator
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models
Demo for NVIDIA's Fewshot Vid2vid
Automatic1111 Stable Diffusion WebUI Video Extension
Pytorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.
Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).
Text-to-Song: Towards Controllable Music Generation Incorporating Vocal and Accompaniment
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Turn your words into music! Describe a sound (e.g., happy, spooky) and this app generates a short piece based on your text.
Audio Prompt Adapter: Unleashing music editing abilities for text-to-music with lightweight finetuning [ISMIR 2024]
Code for Investigating Personalization Methods in Text to Music Generation
some generative audio tools for ComfyUI
Mustango: Toward Controllable Text-to-Music Generation
Text-to-Audio/Music Generation
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech (Edge-TTS, F5-TTS), and Translation.
Automatically generate and overlay subtitles for any video.
Code for the paper "Jukebox: A Generative Model for Music"
A trainable PyTorch reproduction of AlphaFold 3.
An HTML5 video player with a parser that saves traffic
Source code for the SIGGRAPH 2024 paper "X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention"
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
A nearly-live implementation of OpenAI's Whisper.
Robust Speech Recognition via Large-Scale Weak Supervision
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods, covering single- and multi-node GPU setups. Supports default & custom datasets for applications such as summarization and Q&A, along with a number of inference solutions such as HF TGI and vLLM for local or cloud deployment. Includes demo apps showcasing Meta Llama for WhatsApp & Messenger.