Lahr Carpet Cleaning — Image Generation Pipeline

Model Stack

Prompt
gen-images-flux.py
28 images (16 hero + 12 svc)
API
ComfyUI
localhost:8188
UNet (model)
FLUX.1 Schnell
Q8_0 GGUF · 12GB · 12B params
Sampler
KSampler
4 steps · euler · cfg=1.0
Decode
FLUX AE
ae.safetensors · 108MB
Output
JPEG → WebP
1024×576 · q92 → q80

Text Encoders

CLIP-L
clip_l.safetensors
235MB · short prompts
+
T5-XXL fp8
t5xxl_fp8_e4m3fn
4.6GB · long prompt understanding
Node
DualCLIPLoader
type: flux

Hardware

GPU
AMD Radeon (2GB VRAM)
Execution
CPU only (VRAM too small)
Speed
~4 min / image
Total ETA
~1h50m for 28 images

Model Files on Disk

UNet
flux1-schnell-Q8_0.gguf · 12GB
T5-XXL
t5xxl_fp8_e4m3fn.safetensors · 4.6GB
CLIP-L
clip_l.safetensors · 235MB
VAE
ae.safetensors · 108MB (official BFL)

Generation Progress

4 / 28
14% — reload page to update

Prompt Strategy

Low-angle perspective (35mm / 24mm lens specified in prompt)
Carpet/floor texture sharp in foreground — subject recedes into bokeh
Shallow depth of field + vanishing point for depth cues
No people, no machines, no equipment
Finger Lakes / upstate NY context for residential scenes

Previous model: RealVisXL V5.0 fp16 (SDXL 3.5B) — rejected: flat angles, poor depth
Current model: FLUX.1 Schnell (12B transformer) — better spatial understanding