Investigating performance counters related to memory
As part of my investigation to create a page for STREAM, I have tried to reconcile things with underlying performance counters. This page documents some of that work.
Continue reading →As part of my investigation to create a page for STREAM, I have tried to reconcile things with underlying performance counters. This page documents some of that work.
Continue reading →I have updated wspy to dump a “processtable.csv” file at the same time it dumps a “processtable.txt” file. This gives me several advantages: I’ve separated the format of output from collecting instrumentation. Hence, I can run things once to collect … Continue reading →
As described in top down performance counter analysis part 1, top down analysis is an approach that uses key performance counters to characterize an application and then successively drills down with further refinement. On Intel x86 processors, this first level … Continue reading →
As described in top down performance counter analysis part 1, top down analysis is an approach that uses key performance counters to characterize an application and then successively drills down with further refinement. On Intel x86 processors, this first level … Continue reading →
In this posting I summarize top-down performance counter analysis to evaluate workloads and show how this can be measured on Haswell using likwid-perfctr and perf. In part 2 to follow, I’ll describe how top down metrics have been added to … Continue reading →
Geekbench is a system benchmark that I see frequently in comparing Android devices such as tablets or mobile phones. The benchmark is also available for PC processors running Windows and Linux. Geekbench is developed and sold by Primate Labs. I … Continue reading →
This article summarizes three overall metrics to assess performance of a workload running on a microprocessor. These metrics provide an overall picture of what determines the performance and provide clues on where to dig for a deeper understanding. Following the … Continue reading →
wspy is both a code base and a tool for performance analysis. As a code base, I modify wspy to run particular experiments and measurements. This lets me experiment trying particular techniques and interfaces to create a quick and dirty … Continue reading →
wspy is now enhanced to provide two new metrics, for a particular process tree: on_cpu – what percentage of the total time are all the cores scheduled to run in either user or system time? on_core – what percentage of … Continue reading →
Found a new tool for creating and browsing kernel ftrace events.
Continue reading →