ComfyUI PiD

Custom node for using nvidia/PiD in ComfyUI with the native PiD workflow, without relying on PiD Conditioning or the core KSampler.

What This Package Does

Loads the official PiD checkpoints for flux, sd3, and flux2.
Supports prompt pre-encoding to reduce repeated text encoder cost.
Supports image encoding with PiD's own encoder to generate compatible latents.
Decodes LATENT -> IMAGE in the standard flow or in tiles for larger resolutions.

Included Nodes

PiD Load Model
- Selects backbone and checkpoint_variant.
- Keeps only the current model runtime active.
PiD Encode Prompt
- Inputs: PID_MODEL, prompt.
- Output: PID_PROMPT.
PiD Encode Image
- Inputs: PID_MODEL, IMAGE.
- Output: LATENT.
PiD Decode Latent
- Inputs: PID_MODEL, LATENT, prompt, pid_inference_steps, seed, degrade_sigma.
- Optional input: PID_PROMPT.
- Output: IMAGE.
PiD Decode Latent Tiled
- Inputs: PID_MODEL, LATENT, tile_size, tile_overlap, tile_batch_size, prompt, pid_inference_steps, seed, degrade_sigma.
- Optional input: PID_PROMPT.
- Output: IMAGE.
PiD KSampler
- Inputs: PID_MODEL, LATENT, prompt, pid_inference_steps, seed, degrade_sigma, keep_model_loaded_on_gpu, use_tiled, tile_size, tile_overlap, tile_batch_size.
- Optional input: PID_PROMPT.
- Output: IMAGE.

Recommended Flow

Load the model with PiD Load Model.
If you plan to reuse the same prompt, use PiD Encode Prompt.
Use a LATENT compatible with the selected backbone, or generate one with PiD Encode Image.
Decode with PiD Decode Latent, PiD Decode Latent Tiled, or PiD KSampler.

Example

Example workflow: workflow_pid_flux2_2kto4k_tiled.json
Example output image:

Example comparison image showing input and model result:

Installation

Place this folder inside ComfyUI/custom_nodes/.
Install the dependencies in the ComfyUI environment:

pip install -r requirements.txt

Restart ComfyUI.
When the PiD weights are downloaded automatically, they are stored inside this custom node folder under upstream-pid/checkpoints/.
In a typical ComfyUI installation, that means the files will be saved to:

ComfyUI/custom_nodes/ComfyUI-PiD/upstream-pid/checkpoints/

Supported Backbones

flux
sd3
flux2

Official Resources And Disclaimer

Official model repository: nvidia/PiD on Hugging Face
Official PiD source repository: nv-tlabs/PiD on GitHub
The released PiD models are subject to NVIDIA's own model license and usage terms. Always review the official model card and license before downloading, using, sharing, or adapting any of the provided weights.
This custom node is developed for personal use and experimentation in a controlled environment.
Running this node, downloading the models, and using them on your own hardware is the sole responsibility of each user.
Any use outside the applicable model license terms, usage restrictions, or deployment limitations is the sole responsibility of the user who chooses to do so.

Important Notes

The backbone selected in PiD Load Model must match the latent family.
The first load downloads weights from the nvidia/PiD repository.
Downloaded PiD checkpoint files are stored locally in upstream-pid/checkpoints/ inside this custom node directory.
The PiD runtime also loads the Efficient-Large-Model/gemma-2-2b-it text encoder, so the first run requires a significant amount of VRAM, RAM, and disk space.
Development and validation for this custom node were tested on an NVIDIA RTX 3090.
The recommended minimum GPU memory for practical use is 16 GB of VRAM.
Environments with 12 GB or 8 GB of VRAM were not tested, but they may still work depending on the workflow and settings used.
This node uses CUDA. There is no practical CPU support in this wrapper.
This package does not keep conditioner, core KSampler, or extra experimental nodes outside the main PiD flow.
upstream-pid is a vendored minimal subset of NVIDIA PiD kept only for the runtime pieces this node uses, not the full original project.
PiD Decode Latent Tiled and PiD KSampler use tile_batch_size to process multiple same-sized tiles per call and reduce overhead.
PiD KSampler includes keep_model_loaded_on_gpu so you can decide whether the PiD network stays resident on GPU after sampling.
The decode node shows only height x width in a compact read-only resolution field below the preview, without creating extra graph outputs.
Small tiles such as 256 are decoded with extra internal context before the final crop to reduce green tint and color collapse.
Green artifacts can also appear in non-tiled decoding when aspect-ratio changes or custom resolutions push the latent outside the model's most stable size/alignment range. This is not limited to 4:3; the bigger issue is usually latent/grid alignment and using a checkpoint variant outside the resolution range where it is most stable.
As a practical rule, 2k is usually the safer choice around a 512 base workflow, while 2kto4k is usually the safer choice around a 1024 base workflow.
Using the 2k checkpoint variant with 1024 in direct non-tiled decoding can still be accepted by the model, but it is more likely to produce green artifacts, color collapse, or unstable results than the same workflow at 512.
Tiled decoding is often more stable at larger resolutions because the model processes smaller local regions instead of one large full-frame decode, so 1024 and larger outputs tend to behave better in tiled mode than in direct mode.
For best stability, prefer generating or encoding directly at the target aspect ratio and keep dimensions aligned to the backbone: multiples of 32 are safer for flux and sd3, and multiples of 64 are safer for flux2.
For flux2, the encode path is especially strict because the VAE uses an effective 16x spatial compression and an internal 2x2 patchification step, so dimensions that drift away from the safe multiples are more likely to fail or produce unstable colors.
The nvidia/PiD model has its own NVIDIA usage terms. Check the model card license before distributing or using it in production.

Local Validation

The unit tests in this repository validate:

ComfyUI node contracts;
LATENT to IMAGE conversion;
channel validation per backbone;
image encoding and prompt cache behavior;
correct runtime delegation with mocks.

Run:

python -m unittest discover -s tests -v

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
tests		tests
upstream-pid		upstream-pid
web		web
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
compare.png		compare.png
nodes.py		nodes.py
pid_512-2048_flux1_upscale_example.png		pid_512-2048_flux1_upscale_example.png
pid_runtime.py		pid_runtime.py
requirements.txt		requirements.txt
workflow_pid_flux2_2kto4k_tiled.json		workflow_pid_flux2_2kto4k_tiled.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ComfyUI PiD

What This Package Does

Included Nodes

Recommended Flow

Example

Installation

Supported Backbones

Official Resources And Disclaimer

Important Notes

Local Validation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 1

Languages

Folders and files

Latest commit

History

Repository files navigation

ComfyUI PiD

What This Package Does

Included Nodes

Recommended Flow

Example

Installation

Supported Backbones

Official Resources And Disclaimer

Important Notes

Local Validation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 1

Languages

Packages