Flux.2 Klein 4b-9B(GGUF/FP8/BF16) Image Gen & Editing

install flux2 klein bf16/fp8/gguf in comfyui

You usually accept high latency and heavy hardware requirements, if you want great quality like what happened with Flux 2 Dev. You often sacrifice detail, prompt adherence, or editing flexibility if you want speed. For developers and creators building real time applications, this becomes a serious bottleneck.

Waiting several seconds per image is not acceptable for interactive tools, live previews, or production pipelines. On top of that, many high quality models are either too large, too slow, or locked behind restrictive licenses. Flux2 Klein by Black forest labs do a real time image generation and editing, without sacrificing quality.

Flux 2 Klein Image Generation showcase

It unifies text to image, image editing, and multi reference workflows into a single compact architecture. No switching models. No hacks. Just fast, end-to-end inference that can complete in under a second.

(a) FLUX.2 klein 4B

Its the most accessible version of the family.
-4 billion parameters
-Supports Img To Img Editing and Multi-reference generation
-Designed to run on consumer GPUs like RTX 3090 or 4070
-Requires roughly 13GB VRAM
-Fully open under Apache 2.0, making it safe for commercial use.

The focus here is latency critical workflows interactive UIs, local development, and production environments where speed matters more than anything else.

Flux 2 Klein Image Editing showcase

(b) FLUX.2 klein 9B

This is Text to Image generation the flagship small model
-9 billion parameter flow model
-Supports Single and Multi-reference generation
-Uses an 8B Qwen3 text embedder
-Step-distilled down to 4 inference steps
-Achieves quality comparable to models 5× larger, in under half a second

The trade off here is licensing. Its available under a BFL non commercial license, making it ideal for research, experimentation, and creative exploration.

Installation

1. First, install ComfyUI if you are new user. Older user need to update ComfyUI from the Manager by selecting Update ComfyUI option.

2. Download Flux 2 Klein 4B model or Flux 2 Klein 9B model from hugging face repository:-

(a) flux-2-klein-4b.safetensors - This is the distilled model that runs fast but have some quality degradation.
(b) flux-2-klein-base-4b.safetensors - The base variant, inference is slow but have the capability to generate in best quality.

(c) flux-2-klein-9b.safetensors- Its 9b parameter models takes almost 20GB of VRAM gives you ultimate image generation without compromising quality.

Both(a) and (b) can be run on 8GB of VRAM. Save it into ComfyUI/models/diffusion_models folder.

GGUF-

For GGUF models, make sure you have ComfyUI-GGUF custom node by City 96. If not yet done, just install from Manager by selecting Custom Nodes Manager option. Update it if already using this. To get the detailed overview about GGUF, follow our quantized tutorial.

(a) Flux Klein 4b GGUF by unsloth

(a) Flux Klein 9b GGUF by unsloth

Save it into ComfyUI/models/unet folder. Text encoders and vae will same as provided below.

3. Download Text encoder for Flux2 Klein 4b or Text Encoder for Flux2 Klein 9B from hugging face repository. There are multiple models to choose from. Choose the smaller one, if you donot have enough VRAM. Save it into ComfyUI/models/text_encoders folder.

4. Download VAE (flux2-vae.safetensors) from hugging face repository. Both 9b and 4b variant uses the same Vae model. Save it into ComfyUI/models/vae folder.

5. Restart ComfyUI and refresh it to take effect.

Workflow

1. Download the workflows from our Hugging face repository. Then drag and drop into comfyUI.

Workflows listed below-

(a) Flux2_Klein_T2Img.json (Text To Image Workflow)- It has both distilled and base workflow included. Select and bypass the nodes for specific image generation.

(b) Flux2_Klein_Image_Edit_4b_distilled.json

(c) Flux2_klein_image_edit_4b_base.json

(d) Flux2_klein_image_edit_9b_base.json

(e) Flux2_klein_image_edit_9b_distilled.json

If using the GGUF model, just load the workflow and replace the Load diffusion model loader to unet loader node.

2. Put the positive prompt into Text box and set the configuration. Be descriptive and detailed as much as you want. The model cannot auto enhance your prompt. What so ever you put is what your results will be.

Use this prompt structure to get the best out of it:

Subject → Setting → Details → Lighting → Atmosphere

KSampler Settings:

Base variant
steps- 20-50
CFG-4
Sampler-Euler

Distilled variant
steps- 4
CFG-1
Sampler-Euler

3. Now, finally hit Run to start the image generation.

The results that we shown here are not cherry picked. We used Flux 2 Klein 9B(Base & Distilled) variant. We posted whatever we got at our first attempt to test the model's real performance.

Text To Image Generation

Flux2 klein Distilled model output

Flux 2 klein Base model output

Prompt- This is a realistic, analog-style photo of Persian model. It captures a scene where she is attending a secret Illuminati party. She is winking, raising one hand above her head making a V-sign, and holding a highball glass in the other hand. The party features dazzling lasers and lights, and an LED screen displaying the Illuminati symbol. Aliens, Elon Musk, Donald Trump, and famous celebrities are dancing. The photo looks raw and unedited, characterized by visible film grain and the intense lighting from a flash. Cool lighting. Analog style. A candid, honest photo.

Here, the model has generated clones that's not acceptable. And in distilled generation, an alien wore fancy costume that's little weird kind of. It fails while handling anatomy.

Flux 2 Klein text rendering and realism test

Flux 2 Klein Base Model output

Prompt- Ultra-realistic street fashion portrait of a young woman leaning against a textured brick wall, wearing a red baseball cap with white embroidered text-"FLUX 2 Klein", black oversized sunglasses, a white ribbed tank top, and a burgundy bandana tied around her neck. A dark denim jacket is casually draped over one shoulder. Blonde, slightly tousled shoulder-length hair with natural flyaways. Soft neutral makeup with natural skin texture and subtle freckles visible. Confident, relaxed expression. Golden hour sunlight casting warm highlights and soft shadows across her face and clothing, strong side lighting creating depth and contrast. Shallow depth of field with creamy background bokeh, brick wall receding into blur. Shot at eye level, medium close-up framing, cinematic color grading, realistic skin tones, high dynamic range, sharp focus on face, professional fashion photography, DSLR quality, 85mm lens look, f/1.8, natural urban aesthetic.

Here, text rendering is very good, detailed and seems realistic. Prompt adherence also followed well.

Flux 2 klein distilled variant

Prompt- High-quality anime-style illustration of two fashionable women standing in stylized pose, viewed from a slightly low angle, looking confidently toward the horizon. The woman on the right has long wavy dark hair, sharp elegant facial features, light blush on cheeks, and wears a teal-blue long coat over a red blouse, with gold hoop earrings. The woman on the left has straight light-brown hair with bangs, softer facial features, and wears a red coat over a blue high-neck top with subtle patterns. Clean line art with smooth outlines, vibrant yet balanced color palette, soft cel-shading with gentle gradients, crisp highlights on hair and clothing. Bright daytime sky with soft clouds in the background, minimal environment details to keep focus on characters. Strong sense of confidence, calm determination, and modern fashion aesthetics. Studio-quality anime illustration, sharp focus, no noise, high resolution, modern anime art style, editorial character illustration.

Image Editing

Here, we are using single image as input to do the edit.

Image editing using Flux 2 Klein Base variant

Prompt- Remove the man who is wearing black shades.

You can enable and disable the nodes if do not want more images as input. Officially max 10 input images are supported. In the workflow, basically max 5 images are supported to get the more you can create new similar nodes and connect them as shown in the image editing workflow.

The 4B model feels like a perfect fit for developers who actually want to ship products, while the 9B model is ideal for creators who care deeply about prompt adherence and output diversity, without waiting around. The model is bad in anatomy but performs better for realism.

Flux.2 Klein 4b-9B(GGUF/FP8/BF16) Image Gen & Editing

Installation

Workflow

Posted by Administrator

Search This Blog

Popular Posts

Sulphur 2 -The Uncensored LTX2.3 Video Generation

Best 19 Ltx 2.3 LoRA Models for Optimized Video Generation

Install Forge Neo WebUI- Better than Forge & Automatic1111

Wan2.2 (FP16/FP8/GGUF) VideoGen locally