Stars
The ultimate training toolkit for finetuning diffusion models - NOW WITH DYNAMIC AUDIO IMPLEMENTATION FOR LTX 2 TRAINING (AND MORE)
Fully local, private and cross platform Speech-to-Text with LLM Post-processing
extension to automatically claim epic games free games
Handle multiprompts and images within one run. Quick OutputLists from spreadsheet, JSON, multiline text, numberranges for sequential processing. Combinations of lists and prompts. Load any file wit…
dffdeeq / Qwen3-TTS-streaming
Forked from QwenLM/Qwen3-TTSFork with streaming inference support + ~6× faster inference
A set of ComfyUI nodes designed specifically for the Z-Image / Z-Image Turbo model.
Custom nodes for ComfyUI by Light-x02. Optimize and simplify workflows, adding utilities, samplers, schedulers and various tools (Flux, images, etc.) to enrich and extend ComfyUI’s capabilities.
phazei / MediaSyncer
Forked from WhatDreamsCost/MediaSyncerMedia Player that can play multiple videos/images at once in sync. Easily drag and drop media to rearrange them on a grid.
ComfyUI's Photoshop-like layered canvas editor to your ComfyUI workflow. This node is perfect for complex compositing, inpainting, and outpainting, featuring multi-layer support, masking, blend mod…
View Image and Video Metadata of ComfyUI as well as of ForgeUI or Automatic 1111 generations images in Easily Readable Format
A ComfyUI node implementation for ByteDance's Sa2VA
[ICLR 2026] ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation
PGCRT / musubi-tuner_Wan2.2_GUI
Forked from kohya-ss/musubi-tunerTesting things..
abdo1819 / Kimi-Audio
Forked from MoonshotAI/Kimi-AudioKimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation
ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
Modification of the KSampler for running models like Wan2.2 a14B
A ComfyUI custom node integration for multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, Echo-TTS, Qwen3-TTS, Cozy Voice 3, Step Audio EditX, IndexTTS-2, Chatterbox (cl…
An interactive image adjustment node for ComfyUI, with an easy-to-use graphical interface and realtime preview.
This repository is dedicated to maintaining, updating, fixing bugs and keeping up to date my inpainting ComfyUI workflow, previously hosted on CivitAI called: Proper Flux Control-Net inpainting and…
Separate stems (vocals, bass, drums, other) from audio. Recombine, tempo match, slice/crop audio
Fixes AI pixel art or sprite web uploads
This node preserves image quality by selectively merging only the changed regions from AI-generated edits back into the original image.
Media Player that can play multiple videos/images at once in sync. Easily drag and drop media to rearrange them on a grid.
Modified version of Chatterbox that accepts text files as input and no character restrictions. I use it to make audiobooks, especially for my kids.
An ComfyUI custom node integration for multi-language High-quality Text-to-Speech and Voice Conversion nodes using ResembleAI's Chatterbox TTS and F5-TTS with unlimited text length, SRT timing Char…
cjeen / LoRAEdit
Forked from tdrussell/diffusion-pipeWe achieves high-quality first-frame guided video editing given a reference image, while maintaining flexibility for incorporating additional reference conditions.
ComfyUI nodes to crop before sampling and stitch back after sampling that speed up inpainting
AlexanderChen1989 / canvas_tab
Forked from Lerc/canvas_tabComfyUI canvas editor page