GitHub - SQCU/futudiffu: future diffusers. it's the future!

futudiffu

future diffusers.

what is this repository?

the future of diffusers!

no, seriously?

modern deep learning models are trained by unsupervised learning on lots of different data.

the more they see, the more they learn.

but modern deep learning models are not simply pretrained and then released.

there are other things you have to do besides 'pretraining' to make a usable machine learning model people can deploy and run.

this repository fills several important gaps in 'midtraining' and 'posttraining', allowing diffusion models to be adapted to specific tasks.

major features:

various kernels you shouldn't need to look at
verifiable reward functions, with a usage example, to promote ordinary software development over rewards
pairwise ranking reward model training code to teach unsupervised models to 'look' for visual features
two demonstration BTRM heads for PINKIFY/THISNOTTHAT rankings
total liberation from comfyui; we're all free now, you never need to drag the nodes/noodles around ever again.
todo: stepcount and activation quantization distillation reward models as an alternative to reward-weighted odds maximization distillation
DDGRPO for denoising diffusion (porting in progress)
todo: total replacement of the buggy shim code in the first-pass codebase
todo: SSDIT text encoder quantization aware distillation training
todo: vlm-as-judge RLVR support (super advanced feature: requires cross integration w/ primeintellect environments to train judge VLMs)
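
a minimal sketch of the pairwise ranking objective behind the reward model training item above, assuming BTRM stands for a Bradley-Terry reward model (the readme does not spell the acronym out). the function names are hypothetical and are not taken from this repository; the scores stand in for whatever scalar a reward head emits for each image in a ranked pair.

```python
import math


def bradley_terry_loss(score_preferred: float, score_rejected: float) -> float:
    """Negative log-likelihood that the preferred item outranks the rejected one.

    Under the Bradley-Terry model, P(preferred > rejected) = sigmoid(s_p - s_r).
    """
    margin = score_preferred - score_rejected
    # -log(sigmoid(margin)) equals softplus(-margin); this form avoids overflow
    # when the margin is large in either direction.
    return max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))


def mean_pairwise_loss(pairs) -> float:
    """Average the loss over (preferred_score, rejected_score) pairs."""
    return sum(bradley_terry_loss(p, r) for p, r in pairs) / len(pairs)


# Hypothetical scores: each tuple is (score of the pinker image, score of the other).
example_pairs = [(1.2, -0.3), (0.1, 0.4), (2.0, 0.0)]
print(mean_pairwise_loss(example_pairs))
```

the loss only depends on the score difference within a pair, so the reward head learns a ranking rather than an absolute scale.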

r_theta validation

This is a compact demonstration that reward models implemented as low-rank adapters over pretrained models use the existing residual stream and feature circuits learned from unsupervised objectives.

A reward adapter (r_theta) is trained via BTRM to predict whether an image is more or less pink, and whether it is more like reference_image_a while also less like reference_image_b.
The composites below show reference (no adapter) model sampling trajectories on the left and r_theta-intervened model trajectories on the right.
The plots show per-step BTRM scores from both the pinkify and thisnotthat reward heads, for the reference model's sampling trajectories and for the reward-intervened model's sampling trajectories.
Reward models trained to detect pinkness don't make sampled images more pink; reward adapters are not policy adapters.
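
To make the "low-rank adapter plus reward head" idea concrete, here is a toy sketch in plain Python. It assumes the common LoRA-style form, where a frozen base projection W is corrected by a rank-r product B·A and a scalar head reads the result. All names and shapes are hypothetical, and the repository's actual adapter layout may differ.

```python
def matvec(matrix, vector):
    """Multiply a row-major matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def adapted_features(hidden, base_weight, lora_a, lora_b, scale=1.0):
    """Frozen base projection plus a low-rank correction: W h + scale * B (A h)."""
    base_out = matvec(base_weight, hidden)
    low_rank = matvec(lora_b, matvec(lora_a, hidden))
    return [b + scale * d for b, d in zip(base_out, low_rank)]


def reward_score(hidden, base_weight, lora_a, lora_b, head):
    """Scalar reward: a linear head applied to the adapted features."""
    feats = adapted_features(hidden, base_weight, lora_a, lora_b)
    return sum(h * f for h, f in zip(head, feats))


# Toy sizes: hidden width 3, adapter rank 1.
hidden = [1.0, 2.0, 3.0]
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
lora_a = [[1.0, 0.0, 0.0]]          # shape (rank, width)
lora_b = [[0.5], [0.0], [0.0]]      # shape (width, rank)
head = [1.0, 0.0, 0.0]

print(reward_score(hidden, identity, lora_a, lora_b, head))
```

With `lora_b` set to all zeros the adapted features equal the base features, which is why such an adapter can start from the pretrained model's existing representation. In this sketch the adapter only changes the score that is read out, not how images are sampled.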

policy intervention validation

scripts_ii/validate_policy_intervention.py is a resumable, incremental-persistence script that compares DDGRPO-trained policy adapters against a BTRM-only reference across prompts, seeds, and resolutions.
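
A resumable, incrementally persisted sweep of this kind can be sketched as follows. This is an illustration of the pattern only, not the contents of validate_policy_intervention.py. The JSONL layout, the function names, and the `evaluate` callback are assumptions introduced here.

```python
import itertools
import json
from pathlib import Path


def load_done(results_path: Path) -> set:
    """Collect the keys of runs already persisted to the JSONL results file."""
    if not results_path.exists():
        return set()
    with results_path.open() as fh:
        return {json.loads(line)["key"] for line in fh if line.strip()}


def run_sweep(results_path: Path, prompts, seeds, resolutions, evaluate) -> int:
    """Evaluate every missing (prompt, seed, resolution) cell, appending as we go.

    Returns the number of cells evaluated during this call.
    """
    done = load_done(results_path)
    new_runs = 0
    for prompt, seed, res in itertools.product(prompts, seeds, resolutions):
        key = json.dumps([prompt, seed, res])
        if key in done:
            continue  # finished in an earlier run, so skip it
        record = {"key": key, "result": evaluate(prompt, seed, res)}
        # Append one line per finished cell so an interrupted run loses
        # at most the cell that was in flight.
        with results_path.open("a") as fh:
            fh.write(json.dumps(record) + "\n")
        new_runs += 1
    return new_runs
```

Calling `run_sweep` again with the same results file skips every cell already on disk, and widening the prompt, seed, or resolution lists only evaluates the new cells.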

if this is a diffusion model repo where do i click on buttons and write 'prompts'?

uv run python scripts_ii/launch_server.py

uv run python scripts_ii/launch_yeetums.py --inference-url http://localhost:8000 --port 8079

why?

brain hurt after trying to cram for mats / anthropic fellows code screens in ancient dead languages no longer used in ml, needed cooldown exercise

Folders and files (21 commits):
.claude
batch_rollout_validation
bench
bench_renders
btrm_dataset
btrm_dataset_v2_heattest
docs
heattest_renders
lora_dumps_test_refactor
packed_dataset
plans
prompts
ref_papers
remote_validation
scripts
scripts_ii
src/futudiffu
src_ii
stream_comfyui
stream_compat_bf16
stream_futudiffu
stream_futudiffu_bf16
stream_futudiffu_f16te
tests
training_output
validation_renders
yeetums_gallery
.gitignore
.mcp.json
CLAUDE.md
bootstrap.py
paths4claude.md
pyproject.toml
readme.md
remote_target.json.example
sourcemap.md
