Measuring Data Access Latency in Large CPU Caches

Author

Shaotong Sun

Mentors

Chen Ding and Michael Scott

Abstract

This practitioner paper describes a new, multi-locality benchmark program for testing memory access latency and using it to study recent AMD machines equipped with 3D vertical cache(V-Cache) that can be over 1 GiB in total size on a single node. The latency study shows that these large caches differ from traditional LLCs in two aspects: the V-Cache is partitioned rather than shared, and the cache replacement policy is more similar to random than it is to LRU.

Measuring Data Access Latency in Large CPU Caches