Skip to content
AI Primer
release

Diffusers 0.38.0 adds Ace-Step 1.5 pipelines and Flash Attention 4 support

Hugging Face released Diffusers 0.38.0 with new audio and image pipelines, Flash Attention 4, FlashPack loading, and Ring Anything for context parallelism. Use the new profiling guidance to tune diffusion performance as you adopt the added model coverage.

3 min read
Diffusers 0.38.0 adds Ace-Step 1.5 pipelines and Flash Attention 4 support
Diffusers 0.38.0 adds Ace-Step 1.5 pipelines and Flash Attention 4 support

TL;DR

You can jump straight from RisingSayak's release-notes post to the full GitHub release notes, and RisingSayak's release thread packs most of the useful nouns into one short list: new audio pipelines, Flash Attention 4, FlashPack, and Ring Anything. Even RisingSayak's follow-up post was basically an instruction to go read the changelog twice.

Pipelines

Diffusers 0.38.0 looks like a model-coverage release first. The items called out in the launch thread span audio and image generation rather than a single modality.

  • Ace-Step 1.5
  • LongCat-AudioDiT
  • Ernie-Image

That is a useful signal for Diffusers users because the library keeps widening from image-centric diffusion plumbing into a broader set of generation pipelines, including audio, as RisingSayak's release thread explicitly notes.

Flash Attention 4, FlashPack, Ring Anything

The infrastructure side of the release is tighter than the version number suggests. RisingSayak's feature list groups three additions together:

  • Flash Attention 4 support
  • Loading with FlashPack
  • Ring Anything as a new backend for context parallelism

Those are not end-user model names. They are library-level hooks for faster attention, loading, and distributed execution, the kind of release-note items that matter more to people running larger diffusion workloads than to people just trying one new checkpoint. The official details live in the GitHub release notes.

Profiling example

The last item in the launch thread is easy to miss: Diffusers 0.38.0 adds an example showing how to profile a DiffusionPipeline and hunt for performance bottlenecks.

That is new information, not just packaging. Library releases often add support for more models and kernels, but RisingSayak's performance note also points readers toward a workflow for measuring where their own pipeline is slow, with the linked release notes as the place to inspect the implementation details.

Further reading

Discussion across the web

Where this story is being discussed, in original context.

On X· 3 threads
TL;DR2 posts
Flash Attention 4, FlashPack, Ring Anything1 post
Profiling example1 post
Share on X