Last active
April 8, 2021 11:17
-
-
Save estsaon/8d71f1535eaa0e4891dbc0aa6f0c91e1 to your computer and use it in GitHub Desktop.
MEM_DP likwid group file for HaswellEP
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
SHORT Overview of arithmetic and main memory performance | |
EVENTSET | |
FIXC0 INSTR_RETIRED_ANY | |
FIXC1 CPU_CLK_UNHALTED_CORE | |
FIXC2 CPU_CLK_UNHALTED_REF | |
PMC0 AVX_INSTS_CALC | |
MBOX0C0 CAS_COUNT_RD | |
MBOX0C1 CAS_COUNT_WR | |
MBOX1C0 CAS_COUNT_RD | |
MBOX1C1 CAS_COUNT_WR | |
MBOX2C0 CAS_COUNT_RD | |
MBOX2C1 CAS_COUNT_WR | |
MBOX3C0 CAS_COUNT_RD | |
MBOX3C1 CAS_COUNT_WR | |
MBOX4C0 CAS_COUNT_RD | |
MBOX4C1 CAS_COUNT_WR | |
MBOX5C0 CAS_COUNT_RD | |
MBOX5C1 CAS_COUNT_WR | |
MBOX6C0 CAS_COUNT_RD | |
MBOX6C1 CAS_COUNT_WR | |
MBOX7C0 CAS_COUNT_RD | |
MBOX7C1 CAS_COUNT_WR | |
METRICS | |
Runtime (RDTSC) [s] time | |
Runtime unhalted [s] FIXC1*inverseClock | |
Clock [MHz] 1.E-06*(FIXC1/FIXC2)/inverseClock | |
CPI FIXC1/FIXC0 | |
Packed DP [MFLOP/s] 1.0E-06*(PMC0*4.0)/time | |
Memory read bandwidth [MBytes/s] 1.0E-06*(MBOX0C0+MBOX1C0+MBOX2C0+MBOX3C0+MBOX4C0+MBOX5C0+MBOX6C0+MBOX7C0)*64.0/time | |
Memory read data volume [GBytes] 1.0E-09*(MBOX0C0+MBOX1C0+MBOX2C0+MBOX3C0+MBOX4C0+MBOX5C0+MBOX6C0+MBOX7C0)*64.0 | |
Memory write bandwidth [MBytes/s] 1.0E-06*(MBOX0C1+MBOX1C1+MBOX2C1+MBOX3C1+MBOX4C1+MBOX5C1+MBOX6C1+MBOX7C1)*64.0/time | |
Memory write data volume [GBytes] 1.0E-09*(MBOX0C1+MBOX1C1+MBOX2C1+MBOX3C1+MBOX4C1+MBOX5C1+MBOX6C1+MBOX7C1)*64.0 | |
Memory bandwidth [MBytes/s] 1.0E-06*(MBOX0C0+MBOX1C0+MBOX2C0+MBOX3C0+MBOX4C0+MBOX5C0+MBOX6C0+MBOX7C0+MBOX0C1+MBOX1C1+MBOX2C1+MBOX3C1+MBOX4C1+MBOX5C1+MBOX6C1+MBOX7C1)*64.0/time | |
Memory data volume [GBytes] 1.0E-09*(MBOX0C0+MBOX1C0+MBOX2C0+MBOX3C0+MBOX4C0+MBOX5C0+MBOX6C0+MBOX7C0+MBOX0C1+MBOX1C1+MBOX2C1+MBOX3C1+MBOX4C1+MBOX5C1+MBOX6C1+MBOX7C1)*64.0 | |
Operational intensity (PMC0*4.0)/((MBOX0C0+MBOX1C0+MBOX2C0+MBOX3C0+MBOX4C0+MBOX5C0+MBOX6C0+MBOX7C0+MBOX0C1+MBOX1C1+MBOX2C1+MBOX3C1+MBOX4C1+MBOX5C1+MBOX6C1+MBOX7C1)*64.0) | |
LONG | |
Formulas: | |
Packed DP [MFLOP/s] = 1.0E-06*(AVX_INSTS_CALC*4)/runtime | |
Memory read bandwidth [MBytes/s] = 1.0E-06*(SUM(MBOXxC0))*64.0/runtime | |
Memory read data volume [GBytes] = 1.0E-09*(SUM(MBOXxC0))*64.0 | |
Memory write bandwidth [MBytes/s] = 1.0E-06*(SUM(MBOXxC1))*64.0/runtime | |
Memory write data volume [GBytes] = 1.0E-09*(SUM(MBOXxC1))*64.0 | |
Memory bandwidth [MBytes/s] = 1.0E-06*(SUM(MBOXxC0)+SUM(MBOXxC1))*64.0/runtime | |
Memory data volume [GBytes] = 1.0E-09*(SUM(MBOXxC0)+SUM(MBOXxC1))*64.0 | |
Operational intensity = (AVX_INSTS_CALC*4)/((SUM(MBOXxC0)+SUM(MBOXxC1))*64.0) |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment