The Profiler Performance Engineering #26
@@ -0,0 +1,189 @@
---
title: "The Profiler Performance Engineering"
I think we should try to find a better title. Perhaps "The Profiler, a story of what it does and doesn't show"? :)
Modern software systems are increasingly complex, and ensuring their performance under real-world conditions is critical
to delivering reliable and scalable applications. Traditional performance testing often focuses on high-level metrics such
as average response time or throughput. While these metrics are useful, they can obscure deeper system inefficiencies and
bottlenecks. To uncover these hidden issues, a more granular and methodical approach is required: one that examines the
system at the level of its individual resources.
change required-one to "required. - One"?
This document introduces a performance engineering workflow that integrates profiling techniques with the
https://www.brendangregg.com/usemethod.html[USE Method] (Utilization, Saturation, and Errors) to diagnose and resolve
performance issues. By combining performance testing tools like Hyperfoil with low-level profilers such as
https://github.com/async-profiler/async-profiler[async-profiler] and https://man7.org/linux/man-pages/man1/perf-stat.1.html[perf],
we demonstrate how to identify CPU stalls, cache misses, and poor memory access patterns. Through a real-world benchmarking scenario,
we show how profiling data can guide code optimizations and system configuration changes that lead to measurable
improvements in Instructions Per Cycle (IPC) and overall system responsiveness.
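
As a concrete preview, IPC is a headline number in "perf stat" output. A minimal sketch, assuming a hypothetical
PID and measurement window (not the exact commands used in the benchmark below):

[source,shell]
----
# Attach to a running process (hypothetical PID 12345) for 30 seconds;
# the "insn per cycle" figure in the report is the IPC.
perf stat -p 12345 -- sleep 30
----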
## Software Development Life Cycle

A *software developer* implements features based on defined requirements, such as creating multiple endpoints to solve a
specific business problem. Once development is complete, the *performance engineering* team gathers SLAs from stakeholders
and designs performance tests that reflect real business use cases. These tests typically measure metrics like average
response time. For each release that affects the business logic, the performance tests are rerun to detect any regressions.
If a regression is found, the team receives feedback to address it.
There is nothing wrong with this approach, but we can go even further.

### Personas

*Software Developer*: A professional who designs, builds, tests, and maintains software applications or systems.

*Performance Engineering*: Ensures that a software system meets performance requirements under expected workloads. This
involves creating and maintaining performance tests, using tools like Hyperfoil and web-based scenarios, to simulate
real-world usage. The results provide valuable feedback to the team. If the system passes these tests, the product is
considered ready for General Availability (GA).

*Profiler Performance Engineering*: Analyzes performance test results by profiling the source code to uncover system
bottlenecks. The process typically begins by identifying which resource (CPU, memory, disk, or network) the team has
chosen to stress, guiding the analysis toward the root cause of any performance degradation.

### Java Developer Persona Belt

* Software Developer: Eclipse IDE, IBM Semeru JDK
* Performance Engineering: Hyperfoil and a web-based application
I do not think this section adds much value to the message we should focus on in this blog. @franz1981 ?

I would like to show what we do and how we can fit under other teams. Like a "benchmarking-post".
* Profiler Performance Engineering: async-profiler, jfrconv, perf, sar
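
To make the belt concrete, here is roughly how each profiling tool might be invoked. This is a sketch only: the
PID, durations, and file names are placeholders; the profiler.sh launcher is the async-profiler 2.x entry point
(newer releases ship an asprof binary instead), and the jfrconv arguments in particular may vary by release:

[source,shell]
----
PID=12345   # placeholder: the JVM process to profile

# async-profiler: 30 seconds of CPU sampling rendered as a flame graph
./profiler.sh -d 30 -f /tmp/flamegraph.html "$PID"

# jfrconv: convert an existing JFR recording to a flame graph
jfrconv recording.jfr /tmp/recording.html

# perf: hardware counter statistics (IPC, cache misses, ...) over 30 seconds
perf stat -p "$PID" -- sleep 30

# sar: system-wide CPU utilization, sampled every second, 30 samples
sar -u 1 30
----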
## The USE Method

According to Brendan Gregg, the **U**tilization, **S**aturation, and **E**rrors (USE) Method is a methodology for analyzing the
performance of any system. It directs the construction of a checklist, which for server analysis can be used for
quickly identifying resource bottlenecks or errors. It begins by posing questions, and then seeks answers, instead of
beginning with given metrics (partial answers) and trying to work backwards.

### Terminology definitions

* *resource*: all physical server functional components (CPUs, disks, busses, ...)
* *utilization*: the average time that the resource was busy servicing work
* *saturation*: the degree to which the resource has extra work which it can't service, often queued
* *errors*: the count of error events

The metrics are usually expressed in the following terms:

* *utilization*: as a percent over a time interval, e.g., "one disk is running at 90% utilization"
* *saturation*: as a queue length, e.g., "the CPUs have an average run queue length of four"
* *errors*: scalar counts, e.g., "this network interface has had fifty late collisions"
I do not think this section adds much value to the message we should focus on in this blog. @franz1981 ?

Most of those involved in performance may not be aware of this. This section aims to provide an introduction to the topic without the user opening a new tab and reading the entire USE Method article.
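
Each of the three signals can be read with stock Linux tools. A sketch of such a checklist, mirroring the three
examples above (sampling intervals and counts are arbitrary):

[source,shell]
----
# Utilization: per-device disk busy percentage (the %util column)
iostat -x 1 5

# Saturation: CPU run-queue length (the "r" column)
vmstat 1 5

# Errors: per-interface network error counters (rxerr/s, txerr/s)
sar -n EDEV 1 5
----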
### SUT CPU analysis

We can start by looking at the "perf stat" output for the SUT's (System Under Test) application PID. "perf stat" is a
powerful Linux command-line tool that gathers performance counter statistics from a running process or from the system
as a whole.
change to: for the SUT's application PID?
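
A minimal sketch of how such a measurement might be taken, whether against the SUT or the load driver (the PID
variable and the 60-second window are placeholders):

[source,shell]
----
# PID is a placeholder for the process under observation. In the report,
# the "task-clock ... CPUs utilized" line is the utilization metric
# discussed below.
perf stat -p "$PID" -- sleep 60
----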
This metric indicates that 5.2 CPU cores are being utilized. For this test, we have a constraint of only 15 CPU cores
available. Therefore, 5.2 ÷ 15 equals approximately 34%, meaning the CPU is busy 34% of the time. This suggests that
the loader is not a highly utilized resource, so we could experiment by increasing its injection rate to raise the
system load and observe the impact. However, this is not guaranteed to succeed, as other bottlenecks might limit the
outcome. In our case, the loader can sustain that increase in the injection rate, and now the perf stat output for the SUT is:
I think we need to inform about at what rate the load driver was pushing when we saw the 34% utilization and what we changed it to to drive higher load.
## Software Development Life Cycle
Change to === for proper asciidoc formatting