Why?
Because memory behavior dominates performance, we need to write programs in a way that makes good use of the memory hierarchy.
What methods?
The scope we’ll focus on is Single function.
Focus:
- Inner loops with predictable access patterns (e.g., array access)
- Improving spatial locality
Assume the compiler handles temporal locality by placing scalar variables in registers