Skip to content
AI Primer

FlashQLA

High-performance linear attention kernel library built on TileLang.

Open-source high-performance linear attention kernel library built on TileLang for GDN Chunked Prefill, with fused forward/backward optimizations and NVIDIA Hopper performance tuning.

Recent stories

1 linked story
AI PrimerAI Primer

Your daily guide to AI tools, workflows, and creative inspiration.

© 2026 AI Primer. All rights reserved.