
Posts by Umar Jamil

Moving to China was the best experience of my life. Learnt a new language, met my wife, discovered a new cuisine I’m in love with (especially the one from Xi’an) and I finally understood what it means to work hard consistently, while cherishing the present. Highly recommend to everyone who’s “lost”!

1 year ago

Finally I can find ML-related content without having to endure cherry-picked content on how everybody but two guys is bad

1 year ago
Optimizing AI Inference at Character.AI (Part Deux) At Character.AI, we’re building personalized AI entertainment. In order to offer our users engaging, interactive experiences, it's critical we achieve highly efficient inference, or the process by whi...

Amazing job by CharacterAI as usual: research.character.ai/optimizing-a...

1 year ago
Flash Attention derived and coded from first principles with Triton (Python)
YouTube video by Umar Jamil

In this video, I'll be deriving and coding Flash Attention from scratch. No prior knowledge of CUDA or Triton is required.

Link to the video: youtu.be/zy8ChVd_oTM

All the code will be written in Python with Triton, but no prior knowledge of Triton is required.

1 year ago