-
LeetGPU Challenge #7: Reverse Array (Triton)
January 13, 2026Implementing a GPU kernel to reverse an array in place with Triton. Covering the thought process and intuition behind it.
Arayz · 4 min read ·gputriton+4 -
LeetGPU Challenge #6: 1D Convolution (Triton)
January 11, 2026Implementing a 1D convolution GPU kernel with Triton. Covering the thought process and intuition behind it, and going over how we use broadcasting to efficiently handle the sliding window pattern.
Arayz · 9 min read ·gputriton+4 -
LeetGPU Challenge #5: Matrix Addition (Triton)
January 7, 2026Implementing a matrix addition GPU kernel with Triton. We'll be treating the row major matrix as a flat array and reuse the vector addition pattern we covered before.
Arayz · 4 min read ·gputriton+4 -
LeetGPU Challenge #4: Color Inversion (Triton)
January 6, 2026Implementing a color inversion GPU kernel with Triton. Covering the thought process, the intuition behind it, and going over pointer reinterpretation and bitwise XOR operations for pixel data manipulation.
Arayz · 7 min read ·gputriton+4 -
LeetGPU Challenge #3: Matrix Transpose (Triton)
January 4, 2026Implementing a matrix transpose GPU kernel with Triton. Covering the thought process and intuition behind it, and going over memory coalescing as a key optimization concept for efficient GPU memory access patterns.
Arayz · 15 min read ·gputriton+4 -
LeetGPU Challenge #2: Matrix Multiplication (Triton)
January 2, 2026Implementing a matrix multiplication GPU kernel with Triton. Covering the thought process and intuition behind it, and going over important optimizations like tiling, local accumulators, and grouped ordering for optimal performance.
Arayz · 25 min read ·gputriton+4 -
LeetGPU Challenge #1: Vector Addition (Triton)
December 30, 2025Implementing a vector addition GPU kernel with Triton. Covering the thought process and the intuition behind it, basic GPU programming concepts and code walkthrough.
Arayz · 12 min read ·gputriton+4