isztld.com — Dávid Isztl

2026-02-07

The trick that makes FlashAttention possible. Step-by-step from PyTorch to Triton, with no magic operators.

#flashattention #triton #cuda #optimization

2026-01-30

A deep dive into attention: the math, three implementations, and how it maps to GPU memory hierarchies. With interactive visualizations.

#attention #transformers #pytorch #cuda

2026-01-21

GPU preprocessing is supposed to be faster. With a Ryzen 9 9950X3D vs RTX PRO 6000, the results surprised me.

#pytorch #gpu #optimization #benchmarks

2026-01-16

Current AI models follow a single reasoning path and can't recover from mistakes. What if the solution is making them more human?

#llm #reasoning #ai #research

2026-01-14

I replaced NumPy with PyTorch. FFT is now 1700× faster. Here's what nobody is talking about.

#pytorch #numpy #gpu #benchmarks

Recent Posts