|
Evaluating a number of cache coherency misses based on a statistical model
Evgeny Velesevich Institute for System Programming of the Russian Academy of Sciences
Abstract:
False cache sharing happens when different threads executing in parallel on distinct processor cores simultaneously update the variables that reside in the same cache line. This results in the invalidation of the current memory state data that is saved in the cache utilized by the first thread’s core, and also in the necessity to stall for the second thread on the memory state data update.
We suggest in this paper to evaluate the number of cache misses using code instrumentation and post-mortem trace analysis: the probability of the false sharing cache miss (defined as a memory write issued by one thread between two consecutive memory accesses issued by another thread) is calculated based on the gathered event trace, where each observed event is a memory access with a timestamp. The tracer tool is implemented as a GCC compiler pass inserting necessary tracing instructions before each memory access. The pass is scheduled after all other optimization passes that allow to use the tracer for optimized code. The post-mortem analyzer is a separate application that gets the trace collection gathered on a sample application input data as its own input. Program slowdown in our approach is 10 times, and it is dependent on a sampling probability but it does not depend on a cache line size.
Keywords:
false cache sharing, GCC compiler, code instrumentation.
Citation:
Evgeny Velesevich, “Evaluating a number of cache coherency misses based on a statistical model”, Proceedings of ISP RAS, 27:4 (2015), 39–48
Linking options:
https://www.mathnet.ru/eng/tisp163 https://www.mathnet.ru/eng/tisp/v27/i4/p39
|
|