Library
Curated reading list with quick filters and links to notes.
A living bookshelf of papers, kernels, and system guides that influence my research. Use the search box to filter by title, author, venue, or tag.
Attention Is All You Need
Introduces the Transformer architecture, demonstrating that self-attention alone can outperform recurrent and convolutional models for sequence transduction.
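A minimal NumPy sketch of the paper's core operation, scaled dot-product attention softmax(QK^T / sqrt(d_k))V; the function and variable names are my own shorthand, not the paper's reference code:

    import numpy as np

    def scaled_dot_product_attention(Q, K, V):
        # softmax(Q K^T / sqrt(d_k)) V, computed row-wise over queries
        d_k = Q.shape[-1]
        scores = Q @ K.T / np.sqrt(d_k)               # query-key similarities
        scores -= scores.max(axis=-1, keepdims=True)  # stabilize the softmax
        weights = np.exp(scores)
        weights /= weights.sum(axis=-1, keepdims=True)
        return weights @ V                            # attention-weighted values

    # Self-attention over 4 toy tokens of dimension 8: Q = K = V = x.
    rng = np.random.default_rng(0)
    x = rng.standard_normal((4, 8))
    print(scaled_dot_product_attention(x, x, x).shape)  # (4, 8)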
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
Proposes an IO-aware tiled attention kernel that reduces memory traffic and speeds up training while remaining exact.
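The accumulation trick behind the tiling is an online softmax: process K and V in blocks while tracking a running row maximum and normalizer, so the full n-by-n score matrix is never materialized. A NumPy sketch of just that scheme, with illustrative names; the real kernel fuses this loop into GPU SRAM tiles:

    import numpy as np

    def tiled_attention(Q, K, V, block=2):
        # Exact attention, accumulated one key/value block at a time.
        n, d = Q.shape
        out = np.zeros((n, d))
        row_max = np.full(n, -np.inf)   # running max of each query's scores
        row_sum = np.zeros(n)           # running softmax normalizer
        for s in range(0, n, block):
            Kb, Vb = K[s:s+block], V[s:s+block]
            scores = Q @ Kb.T / np.sqrt(d)            # one (n, block) tile
            new_max = np.maximum(row_max, scores.max(axis=1))
            scale = np.exp(row_max - new_max)         # rescale old partials
            p = np.exp(scores - new_max[:, None])
            out = out * scale[:, None] + p @ Vb
            row_sum = row_sum * scale + p.sum(axis=1)
            row_max = new_max
        return out / row_sum[:, None]

The result matches the naive softmax(QK^T / sqrt(d))V to numerical precision, which is why the method stays exact while cutting memory traffic.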
Efficient Attention Mechanisms for Large Language Models: A Survey
Surveys the design space of efficient attention variants for LLMs, covering algorithmic approaches and hardware implications.
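As a taste of that design space, a toy NumPy sketch of one family the survey covers, sliding-window (local) attention, where each query attends only to keys within a fixed distance; a practical kernel would compute only the banded scores instead of masking a full matrix, which is where the hardware implications come in:

    import numpy as np

    def sliding_window_attention(Q, K, V, window=2):
        # Each query i attends only to keys j with |i - j| <= window.
        n, d = Q.shape
        scores = Q @ K.T / np.sqrt(d)
        i = np.arange(n)
        scores[np.abs(i[:, None] - i[None, :]) > window] = -np.inf  # mask far keys
        weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
        weights /= weights.sum(axis=-1, keepdims=True)
        return weights @ V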
Bookshelf
Whatever I am, I am because of them
Deep Learning for Vision Systems
AI Engineering
Deep Learning Foundations and Concepts
Build a Large Language Model (From Scratch)
Ensemble Methods for Machine Learning
A Simple Guide to Retrieval Augmented Generation
Machine Learning for Tabular Data
Math and Architectures of Deep Learning
Inside Deep Learning: Math, Algorithms, Models
Getting Started with Natural Language Processing
Natural Language Processing in Action
Hands-On Large Language Models
Designing Large Language Model Applications
How Large Language Models Work
LLMs in Production
GPU Programming with C++ and CUDA