Skip to content
AI Primer
workflow

Krea 2 Turbo community releases GGUF ports: RTX 3090 tests report 1.9x int8 speedups

Builders published GGUF conversions, loader nodes, and local benchmarks for Krea 2 Turbo after yesterday’s open-weights release, alongside new multi-style and watercolor tests. The follow-up matters because creators now have clearer ways to run, tune, and style-push Krea locally on smaller VRAM budgets.

4 min read
Krea 2 Turbo community releases GGUF ports: RTX 3090 tests report 1.9x int8 speedups
Krea 2 Turbo community releases GGUF ports: RTX 3090 tests report 1.9x int8 speedups

TL;DR

You can pull the GGUF conversion, grab the ComfyUI GGUF loader, compare the 3090 int8 side-by-side, and browse Krea's own moodboard gallery. One of the more useful community add-ons was a reference-image text encoder, and one of the weirder early tells was hellorob's claim that Krea finally nailed a public-domain watercolor dataset Midjourney missed.

GGUF ports

Krea 2's first useful community move was not another gallery post. It was shrinking the thing.

r/StableDiffusion

Krea2 GGUFs and GGUF loaders available

0 comments

According to molbal's Reddit post, the BF16 source weights are 26.6 GB and FP8 is about half that, while the Q4_0 GGUF lands around 7.6 GB and is aimed at 8 GB GPUs. TheLocalLab's companion workflow post framed the same pitch more directly: 8 GB should work.

That matters because the open-weights release immediately became a packaging story. The community now has three practical local paths:

Int8 benchmarks

The fastest early optimization came from a community quant, not the base drop.

r/StableDiffusion

Krea 2 Turbo on a 3090: int8 is ~1.9× faster than fp8 (same sampler, same seed)

0 comments

r/ComfyUI

Z-Image vs Boogu vs Krea 2 Turbo — local benchmark on a single RTX 3090

0 comments

WinResponsible9977's follow-up held sampler, seed, prompt, and resolution constant, then swapped only precision. The result was 0.65 it/s for fp8 and 1.27 it/s for int8 ConvRot on a 3090, which cut image time from 14.8 seconds to 7.7 seconds.

The same post also makes a useful correction to the usual low-bit story: on this setup, int8's win was speed, not memory. Peak VRAM was 18.8 GB for fp8 and 19.2 GB for int8, so the gain came from Ampere's INT8 tensor cores rather than a smaller footprint.

Style range

The reason creators kept posting Krea tests is simple: it appears unusually good at jumping styles without falling apart.

r/StableDiffusion

Krea 2 Turbo: 100+ styles on the same scene

0 comments

hellorob said Krea reproduced faded watercolor gradients and bleeding edges from an early-1900s illustration book, with better prompt adherence than the other models they tried. fragilesleep took a different angle, generating thousands of children's-book images from Krea's official moodboards and publishing a 100-plus-style gallery built from one base scene.

Other users kept finding the same pattern in narrower tests. a comic-cover experiment said the model knows many characters by name, and a costume-design comparison found stronger apparel-language adherence than Z Base, even though it missed requested background figures.

Prompt stack

The best community tips were already getting pretty specific, which is usually a good sign a model has escaped pure novelty mode.

r/StableDiffusion

Some important Krea usage tips I've found / not seen discussed here.

0 comments

r/ComfyUI

Krea 2 Turbo ... wow

0 comments

According to Different_Fix_2217's tip list, the most common stack was the raw model plus the turbo LoRA at 0.6 weight, around 12 steps, and a different VAE than the default. The same post also linked a reference-image text encoder, suggested more explicit naming for characters and series, and said Krea knows many artists by name.

Two prompt habits came up repeatedly in that thread:

Reference and img2img

The last interesting reveal is that people were already treating Krea as more than a txt2img style toy.

r/StableDiffusion

Testing IMG2IMG in Krea 2 (ComfyUI, 12Gb 4070)

0 comments

r/StableDiffusion

Krea 2

0 comments

lazyspock's img2img test reported workable denoise settings from 0.4 to 0.75 on a 12 GB 4070 using a GGUF build, while the usage tips thread pointed to a reference-image encoder for feeding visual inputs into the text side. That combination, img2img plus reference conditioning, is where these early local ports start looking less like a benchmark hobby and more like a real style workflow.

Further reading

Discussion across the web

Where this story is being discussed, in original context.

Share on X