MoonMath AI Open-Sources a HIP Attention Kernel for AMD MI300X That Beats AITER v3 on Every Shape and Rounding Mode

MoonMath AI team has released a bf16 forward attention kernel for AMD’s MI300X GPU. It is written in HIP, not hand-written assembly. The code is open-source under the MIT license. The MoonMath.ai team reports it beats AITER v3, AMD’s own optimized kernel, on every tested shape. Bare-metal access came from HotAisle, an AMD cloud provider. Attention is the fused softmax(QKᵀ/√d)·V operation inside every transformer. The MI300X is AMD’s CDNA3 data-center GPU, with the ISA targe

Get smart on it

A team has released open-source code that performs a core AI computation called attention more efficiently on AMD's MI300X graphics processor. The code, written in HIP programming language rather than lower-level assembly, consistently outperforms AMD's own optimized version across all tested configurations and rounding modes. The speedup comes primarily from careful memory placement: storing certain data structures in different cache levels and registers to minimize data movement between fast and slow storage. The kernel has already been used in real applications, improving video diffusion generation speed by 1.23 times on MI300X hardware with no quality loss.

Virginia Approves First-Ever Data Center Power Tax

Virginia’s new electricity tax on data centers, including self-generated power, is projected to generate $600M annually.

Hardware & ComputeOpen story →

The Breaking Points 2035: A Data Center Space Odyssey

Orbital data centers promise relief from terrestrial power challenges, but their future may hinge on a harder question: repair infrastructure or replace fleets.

Hardware & ComputeOpen story →

MoonMath AI Open-Sources a HIP Attention Kernel for AMD MI300X That Beats AITER v3 on Every Shape and Rounding Mode

Virginia Approves First-Ever Data Center Power Tax

The Breaking Points 2035: A Data Center Space Odyssey

Chevron Lands 20-Year Microsoft Deal to Power West Texas AI Campus

Data Centers Take Training into Their Own Hands Amid Talent Shortages

Nvidia says its AI data center design runs hotter to use a lot less water

Nvidia wants to cut data center water use, but that’s not the same as fixing AI’s water problem