Examples to extract GEM5 & XS performance counter

This is a tool to extract GEM5 & XS performance counter from the output of GEM5 & XS simulation.

Examples to extract GEM5 & XS performance counter

We use batch.py to extract the performance counter for each checkpoint.

Get full option list of batch.py with

batch.py -h

To use batch.py anywhere, you can add gem5_data_proc to you PATH:

export PATH='/path/to/gem5_data_proc':$PATH

Use batch.py to extract GEM5's cache performance counters:

batch.py -s /path/to/results/top/directory  --cache -f stats.txt

Include only a specific benchmark like gromacs:

batch.py -s /path/to/results/top/directory  --cache -f stats.txt -F gromacs

Use batch.py to extract XS's cache & branch performance counters:

batch.py -s /path/to/results/top/directory --cache --branch --xiangshan -f simulator_err.txt

One-shot runner (recommended)

Use run.py to extract CSV and compute weighted + score in one command. It auto-detects XS format by searching simulator_err.txt in the directory. By default it enables all YAML groups (equivalent to batch.py --groups all).

python3 run.py /path/to/results/tag --out-dir results
python3 run.py /path/to/results/tag --out-dir results -j /path/to/cluster.json
python3 run.py /path/to/results/tag --out-dir results -g basic,branch,tage  # override groups

The legacy wrapper is still available (it calls run.py internally):

bash example-scripts/gem5-topdown-tag.sh /path/to/results/tag
bash example-scripts/gem5-topdown-tag.sh /path/to/results/tag -g basic,branch,tage,mbtb

YAML targets (recommended for adding new counters)

Targets can be defined in targets/*.yaml and enabled by group name. This is the preferred way to add new counters without editing utils/target_stats.py.

List groups:

python3 batch.py --list-groups

Enable YAML groups (missing counters are allowed and kept as NaN):

python3 batch.py -s /path/to/results --groups basic,branch,tage -o results/run.csv
python3 batch.py -s /path/to/results --groups all -o results/run.csv
python3 run.py /path/to/results --out-dir results -g basic,branch,tage

Common extra groups:

python3 run.py /path/to/results --out-dir results -g basic,branch,fetch
python3 run.py /path/to/results --out-dir results -g basic,intel_topdown

Local (not committed) extensions can be put under targets/local/*.yaml (gitignored).

Compare weighted CSV (web UI)

Compare two weighted CSVs in a local web UI:

python3 compare_weighted.py results/a-weighted.csv results/b-weighted.csv

Features:

Group filter (multi-select, union): show one or more YAML groups at a time
Click x in a column header to hide a column; reset hidden to restore
only changed to show columns with any diff
export csv to export current view as diff strings: +12.34% (1.23 -> 1.38)

Example for eval targets

The eval targets trick makes use of Python's eval to avoid creating new options for every new stat group

Use eval target to extract GEM5's memory bandwidth:

batch.py -s /path/to/results/top/directory --eval-stat mem_targets

Using eval target to extract GEM5's memory bandwidth and memory dependency counters:

batch.py -s /path/to/results/top/directory --eval-stat mem_targets#mem_dep_targets

Using eval target to extract XS's memory bandwidth and memory dependency counters:

batch.py -s /path/to/results/top/directory -X --eval-stat mem_targets#mem_dep_targets

Compute weighted performance

Unified weighted metric computation with batch.py

Now we use batch.py to compute the performance for each checkpoint. Then we use simpoint_cpt/compute_weighted.py to compute weighted metrics and scores Example usage here:

export PYTHONPATH=`pwd`

example_stats_dir=/nfs-nvme/home/share/zyy/gem5-results/example-outputs

mkdir -p results

python3 batch.py -s $example_stats_dir -t --topdown-raw -o results/example.csv  # The topdown results for each checkpoint

python3 simpoint_cpt/compute_weighted.py \
    -r results/example.csv \
    -j simpoint_cpt/resources/spec06_rv64gcb_o2_20m.json \
    -o results/example-weighted.csv  # The weighted topdown counters for each benchmark

python3 simpoint_cpt/compute_weighted.py \
    -r results/example.csv \
    -j simpoint_cpt/resources/spec06_rv64gcb_o2_20m.json \
    --score results/example-score.csv  # The SPEC score for each benchmark and overll score

Analysis topdown performance

First, we need to get the topdown outputs for one tests

bash example-scripts/gem5-topdown-tag.sh spec_ideal_numBr6

Then, we can use topdown/draw_new.py to analyze the topdown performance, and draw pictures, save to figure/

python3 topdown/draw_new.py -f=1 -p  -t1=spec_ideal_numBr4 -t2=spec_ideal_numBr6
# -f=1 means the highest level of detail
# -p means print the level 1 percentage and diff two tags outputs
# -t1=spec_ideal_numBr4 means the first tag
# -t2=spec_ideal_numBr6 means the second tag

python3 topdown/draw_new.py -f=3 -c=Frontend -t1=spec_ideal_numBr4 -t2=spec_ideal_numBr6
# -f=3 means the most detailed level
# -c=Frontend means the category, choises: Frontend, Backend, BadSpec

Dual-core performance

stats parser will obtain XS_CORE_ID from environment variables to choose which core to compute score:

export XS_CORE_ID
python3 batch.py -s $example_stats_dir -o results/$tag-core$core.csv -X

Full scripts to obtain dual-core performance is in example-scripts/xs-dual-core.sh

How to add more interested stats

Simple stats target group

See cache_targets defined in utils/target_stats.py and its usage in batch.py.

Simple stats target group contains a list of targets. Each entry of the list is a regex. batch.py will ``search'' for the pattern in given stats file, and name it with the first match group in parentheses. For example

(l3\.demandAcc)esses::total'
        ^
        The first match group, used as name

Complex stats target

Complex stats target group is a dictionary. The key of an entry is the name of the target. The value of an entry has two possible types: list or str.

If str, it is the regex to search. (xs_cache_targets_nanhu in utils/target_stats.py is an example.)

If list, like xs_cache_targets_22_04_nanhu, value[0] is the regex to search, while values[1] is how many times such pattern repeats. This is to handle the case that one pattern repeats multiple times in specific version of XS. The occurs because the performance counter of different banks of L2/L3 caches are named the same. Because this is to handle the buggy behavior in RTL, this type of stats group is rarely used and is out of maintained.

Assumed directory structure

A typical directory structure of GEM5 results looks like:

.
|-- bwaves_1299
|   |-- completed
|   |-- dcache_miss.db
|   |-- dramsim3.json
|   |-- dramsim3.txt
|   |-- dramsim3epoch.json
|   |-- log.txt
|   `-- m5out
|       |-- TableHitCnt.txt
|       |-- altuseCnt.txt
|       |-- config.ini
|       |-- config.json
|       |-- misPredIndirect.txt
|       |-- misPredIndirectStream.txt
|       |-- missHistMap.txt
|       |-- stats.txt
|       |-- topMisPredictHist.txt
|       `-- topMisPredicts.txt
`-- gcc_2000
    |-- completed
    |-- dcache_miss.db
    |-- dramsim3.json
    |-- dramsim3.txt
    |-- dramsim3epoch.json
...

A typical directory structure of XS looks like:

.
|-- GemsFDTD_1041040000000_0.022405
|   |-- simulator_err.txt
|   `-- simulator_out.txt
|-- GemsFDTD_1121140000000_0.004928
|   |-- simulator_err.txt
|   `-- simulator_out.txt
|-- GemsFDTD_1175660000000_0.022268
|   |-- simulator_err.txt
|   `-- simulator_out.txt
...

Name		Name	Last commit message	Last commit date
Latest commit History 275 Commits
area		area
energy		energy
example-scripts		example-scripts
loop_buffer		loop_buffer
not_used_for_long		not_used_for_long
omegaflow_figure		omegaflow_figure
omegaflow_model		omegaflow_model
outputs		outputs
pick		pick
points		points
sample_method		sample_method
scripts		scripts
simple_figures		simple_figures
simpoint_cpt		simpoint_cpt
targets		targets
topdown		topdown
utils		utils
warmup_analysis		warmup_analysis
.gitignore		.gitignore
AGENTS.md		AGENTS.md
README.md		README.md
analyze		analyze
batch.py		batch.py
checkpoint-0-0-0.lst		checkpoint-0-0-0.lst
cluster-0-0.json		cluster-0-0.json
cluster_top3.csv		cluster_top3.csv
collect_ipc.py		collect_ipc.py
compare.py		compare.py
compare_weighted.py		compare_weighted.py
example.sh		example.sh
filter.py		filter.py
find_var.py		find_var.py
flow_cmp.py		flow_cmp.py
ipc_bar_cmp.py		ipc_bar_cmp.py
local_configs.py		local_configs.py
mail.py		mail.py
paths.py		paths.py
pldm_log_get.py		pldm_log_get.py
requirements.txt		requirements.txt
run.py		run.py
smt_target_stats.py		smt_target_stats.py
st_stat.py		st_stat.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Examples to extract GEM5 & XS performance counter

One-shot runner (recommended)

YAML targets (recommended for adding new counters)

Compare weighted CSV (web UI)

Example for eval targets

Compute weighted performance

Analysis topdown performance

Dual-core performance

How to add more interested stats

Simple stats target group

Complex stats target

Assumed directory structure

About

Uh oh!

Releases

Packages

Languages

jensen-yan/gem5_data_proc

Folders and files

Latest commit

History

Repository files navigation

Examples to extract GEM5 & XS performance counter

One-shot runner (recommended)

YAML targets (recommended for adding new counters)

Compare weighted CSV (web UI)

Example for eval targets

Compute weighted performance

Analysis topdown performance

Dual-core performance

How to add more interested stats

Simple stats target group

Complex stats target

Assumed directory structure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages