Mufeez

Blog

Feb 28, 2026

(WIP) Optimizing NVFP4 Grouped GEMM on Blackwell

238μs to 20μs using tcgen05 MMA, TMA async loads, cluster multicast, etc.

ml sys
4 min

Apr 1, 2025

Flushable SSTables in Pebble

~60% performance improvement when ingested SSTs overlap with the memtable

databases
11 min

Oct 6, 2024

Enforcing Robust Type Safety with ESLint

Surfacing ~250 type inconsistencies in Linear's type definitions

dev-ex
6 min

Aug 13, 2024

Shipping Linear Drafts

Making drafts a first-class entity throughout Linear

product
10 min

Jul 15, 2024

Concurrent manual compactions

Segmenting the key space and parallelizing execution for a 30% performance improvement

databases
11 min