Harikanth Lingutla — AI/ML Engineer

I build practical AI systems, GPU projects, and software tools. Writing about PyTorch, CUDA, Triton, and building AI products.
https://harikanth.site/
Language: en-us

Flash Attention Explained
https://harikanth.site/blog/flash-attention-explained/
A practical walkthrough of how Flash Attention reduces memory traffic and speeds up transformer training.
Mon, 12 May 2025 00:00:00 GMT
Categories: Deep Learning, CUDA, GPU Kernels

GPU Kernels — First Notes
https://harikanth.site/blog/gpu-kernels-notes/
Notes from learning CUDA memory hierarchy, occupancy, and writing my first custom kernels.
Mon, 28 Apr 2025 00:00:00 GMT
Categories: CUDA, GPU Kernels, PyTorch