- formatting
- images
- links
- math
- code
- blockquotes
- external-services
•
•
•
•
•
•
-
Can we really identify LLM Generated Text? The promise and limits of watermarking
Exploring the theoretical and practical aspects of watermarking techniques for detecting AI-generated content, including trade-offs, failure modes, and information-theoretic limits
-
R squared in Machine Learning
Meaning, Explanation & more
-
Training a simple bigram character level model on tiny stories
Training a simple bigram character level model on tiny stories
-
The Evolution of FlashAttention
We present a mathematical & technical overview of FlashAttention and its evolution across versions 1 to 4. We explain why IO-aware design became central to scalable transformers and how these kernels shape modern long-context LLMs as memory patterns and hardware limits shift. We then describe the changes across versions with Triton examples and place these kernels in the context of recent work on efficient attention. We close by outlining principles that can guide the next generation of attention algorithms.
-
Machine Learning and AI Resources
A collection of links to essential courses on machine learning, deep learning, natural language processing, and artificial intelligence.