The first episode of my new blog series is live! This series dives into frame pointers and their critical role in building a CPU profiler.
This episode will explore the fundamentals of a program's execution environment. By the end, you'll understand how frame pointers become essential for stack walking, a key technique in profiling.
The second episode will tackle the practical side: coding a simple sampling-based profiler with help from the Linux kernel.
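As a small taste of the stack walking mentioned above, here is a toy simulation rather than real profiler code. It assumes the common x86-64 layout in which the word at the frame pointer holds the caller's saved frame pointer and the next word holds the return address. The `memory` dictionary, the addresses, and the function names are all made up for illustration.

```python
WORD = 8  # assumed word size in bytes (64-bit target)


def walk_stack(memory, frame_pointer, max_depth=64):
    """Follow the chain of saved frame pointers and collect return addresses.

    `memory` is a hypothetical dict mapping addresses to stored words.
    Assumed layout: memory[fp] is the caller's saved frame pointer and
    memory[fp + WORD] is the return address into the caller.
    """
    return_addresses = []
    for _ in range(max_depth):
        # A null or unmapped frame pointer marks the end of the chain.
        if frame_pointer == 0 or frame_pointer not in memory:
            break
        if frame_pointer + WORD not in memory:
            break
        return_addresses.append(memory[frame_pointer + WORD])
        next_fp = memory[frame_pointer]
        # The stack grows downward, so each caller's frame must sit at a
        # higher address. Anything else suggests a corrupted chain.
        if next_fp != 0 and next_fp <= frame_pointer:
            break
        frame_pointer = next_fp
    return return_addresses


def build_demo_stack():
    """Build three fake frames: leaf -> middle -> main."""
    memory = {
        0x7000: 0x7020, 0x7008: 0x401136,  # leaf frame, returns into middle
        0x7020: 0x7040, 0x7028: 0x401152,  # middle frame, returns into main
        0x7040: 0x0,    0x7048: 0x401020,  # main frame, saved FP of 0 ends the chain
    }
    return memory, 0x7000


memory, fp = build_demo_stack()
print([hex(address) for address in walk_stack(memory, fp)])
```

A real profiler would read these words from the sampled thread's stack and map each return address back to a function symbol. The chain only exists if the code was compiled to keep frame pointers, which is why they matter so much here.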
GPU "threads" aren't exactly CPU threads, and GPU "cores" aren't really CPU cores. It's better to think of threads as lanes of a SIMD instruction and cores as ALUs.
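One way to picture the thread/SIMD analogy above is a toy model in which a single instruction is applied to every lane at once, and a per-lane mask decides which lanes keep the result. This is a sketch under that assumption, not a description of any particular GPU. The four-lane width and the names `simd_execute` and `run_branch` are invented for illustration.

```python
def simd_execute(op, lanes, active_mask):
    """Apply one instruction to every lane; masked-off lanes keep their old value."""
    return [op(value) if active else value
            for value, active in zip(lanes, active_mask)]


def run_branch(lanes):
    """Model `if x % 2 == 0: x += 10 else: x *= 2` across all lanes.

    Every lane steps through both sides of the branch. The mask selects
    which lanes actually commit the result of each side.
    """
    even_mask = [value % 2 == 0 for value in lanes]
    odd_mask = [not is_even for is_even in even_mask]
    lanes = simd_execute(lambda x: x + 10, lanes, even_mask)  # "then" side
    lanes = simd_execute(lambda x: x * 2, lanes, odd_mask)    # "else" side
    return lanes


print(run_branch([1, 2, 3, 4]))  # → [2, 12, 6, 14]
```

The point of the model is that the "threads" do not branch independently: when lanes disagree on a condition, the group pays for both sides of the branch.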
GPU rendering is typically either "immediate mode" or "tile-based". Tiling is more common on mobile GPUs, though Nvidia has also used it.