Latest
-
LLMs and Napkin Problems
— 2026-06-02
On May 20th, Tim Gowers advised fellow mathematicians to sit down before reading the tweet that was to follow. In it, he declared that AI had solved Erdős’s unit-distance problem, a celebrated problem in discrete geometry first posed by Paul Erdős.
-
Auto-Researching Tagore's Songs
— 2026-05-18
Rabindranath Tagore wrote roughly 2,000 songs — collectively Rabindra Sangeet. Earlier this year I curated them from widely available public sources into a structured dataset and put it on Hugging Face:
-
The Chimeras of Tokenization
— 2026-04-15
The Byte Pair Encoding (BPE) method is odd!
-
Clustering Graphs: Spectral vs HAC
— 2026-04-02
You are looking at a plot comparing two clustering algorithms on a parameterized graph model. As the parameter changes, the graph transitions from one regime to another, and the preferred algorithm changes with it (as evidenced by the y-axis: higher is better). The rest of the post will build up to this plot.
-
Flash Attention in a Jiffy
— 2026-03-18
To celebrate the release of Flash Attention 4, I think it will be fun to work through the basic idea of Flash Attention. To keep things simple, we will focus on the forward pass and only on memory write traffic.
-
AI v. Tagore
— 2026-02-26
Sarvam AI has garnered well-deserved attention for their recent sequence of “drops” of ML models for Indic languages (Bangla included). I’ve used and recommended their amazing dubbing model myself.