Conversation

Contributor

@mlim19 mlim19 commented Oct 3, 2025

It seems perf has a known issue with memory utilization when it runs continuously. As observed on an idle system, where few processes are running and the output files are not large, perf has to be restarted once its internal memory usage grows beyond a threshold.
The solution I implement here is to restart the perf collection when the memory (RSS) growth exceeds 100MB compared to the initial RSS size. The threshold can be changed depending on the use case, and we could also expose it as a command line argument.

This issue is different from the one reported in #990, which is related to perf output handling.
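
As a rough illustration of the approach described above, here is a minimal sketch of a growth-based check, assuming psutil is available; the function name and baseline handling are hypothetical and not the PR's exact code:

import psutil

_RSS_GROWTH_THRESHOLD = 100 * 1024 * 1024  # 100MB in bytes; could be exposed as a command line argument

def should_restart_for_growth(perf_pid: int, baseline_rss: int) -> bool:
    # Read perf's current resident set size and compare it against the baseline
    # recorded shortly after perf was started.
    current_rss = psutil.Process(perf_pid).memory_info().rss
    return current_rss - baseline_rss > _RSS_GROWTH_THRESHOLD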

Description

Related Issue

Motivation and Context

How Has This Been Tested?

Screenshots

Checklist:

  • I have read the CONTRIBUTING document.
  • I have updated the relevant documentation.
  • I have added tests for new logic.

@dkorlovs dkorlovs force-pushed the fix_perf_slow_memory_util_growth branch 3 times, most recently from 1c26438 to 7f7610d on October 9, 2025 18:01
should_restart_time_based = (
time_elapsed >= self._RESTART_AFTER_S and current_rss >= self._PERF_MEMORY_USAGE_THRESHOLD
)
should_restart_growth_based = memory_growth > self._RSS_GROWTH_THRESHOLD
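
For readers outside the diff, here is a minimal sketch of how these two conditions might feed the restart decision; the OR combination and the _restart_perf helper are assumptions, not necessarily the PR's exact code:

if should_restart_time_based or should_restart_growth_based:
    # Hypothetical helper: stop the running perf process, start a fresh one,
    # and clear the recorded baseline so it is re-collected after the restart.
    self._restart_perf()
    self._baseline_rss = None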
Contributor

thanks, looks good.
Can we get some metrics behind this: what is the average growth rate of perf's memory from initialization to post-profiling, for a busy vs. non-busy system?

Contributor Author

The system I tested on is an AWS instance with a small number of CPUs, so our tests may not represent your use case. On an idle system I see around 50MB as the baseline RSS and 1~2MB of growth every profiling duration. I tried running more CPU-intensive workloads, but that did not increase the RSS much because the system has limited memory and CPUs. So it would be good if you could evaluate this change on your side with real use cases. Can you try that?

Contributor

@prashantbytesyntax prashantbytesyntax Oct 16, 2025

Yes, this would need testing in environments where there are a lot of processes (over 1k-1.5k) running. I would suggest we first test the root cause fix in #1002 and then test this fix.
@ashokbytebytego ^ ^

# we use double for dwarf.
_MMAP_SIZES = {"fp": 129, "dwarf": 257}
_RSS_GROWTH_THRESHOLD = 100 * 1024 * 1024 # 100MB in bytes
_BASELINE_COLLECTION_COUNT = 3 # Number of function calls to collect RSS before setting baseline
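
A minimal sketch of how a baseline could be derived from the first few RSS samples; the attribute names _baseline_rss and _rss_samples are hypothetical and the PR's actual implementation may differ:

def _update_baseline_rss(self, current_rss: int) -> None:
    # Collect the first few RSS readings, then freeze the baseline once enough
    # samples have been seen, so normal startup growth is not counted as leakage.
    if self._baseline_rss is not None:
        return
    self._rss_samples.append(current_rss)
    if len(self._rss_samples) >= self._BASELINE_COLLECTION_COUNT:
        self._baseline_rss = max(self._rss_samples)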
Contributor

thanks, I would suggest having some data to back the choice of this baseline collection count.

Contributor Author

mlim19 commented Oct 9, 2025

Here is a screenshot showing how memory utilization drops when perf gets restarted:
[screenshot: memory utilization dropping after a perf restart]

@mlim19 mlim19 force-pushed the fix_perf_slow_memory_util_growth branch from 7f7610d to 495eaae on October 23, 2025 17:45
Contributor Author

mlim19 commented Nov 6, 2025

This PR is on hold because Prashant mentioned that #1002 resolves most of the issues related to memory usage. We will decide whether or not to merge it sometime after #1002 is merged.

@mlim19 mlim19 marked this pull request as draft November 8, 2025 22:08