Skip to content

Commit

Permalink
[extension/pprof] Improve documentation for pprof usage (#28665)
Browse files Browse the repository at this point in the history
**Description:** Update readme with instructions on how to use pprof

**Link to tracking Issue:**
#26095

**Testing:** Not applicable since no functional code was changed.

**Documentation:** The extension readme now contains information what a
profile is, how it can be generated and analysed.

Co-authored-by: Tyler Helmuth <[email protected]>
  • Loading branch information
Jamie-Ullerich and TylerHelmuth authored Nov 20, 2023
1 parent dae730c commit 399a74b
Showing 1 changed file with 57 additions and 0 deletions.
57 changes: 57 additions & 0 deletions extension/pprofextension/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -48,3 +48,60 @@ extensions:
The full list of settings exposed for this exporter are documented [here](./config.go)
with detailed sample configurations [here](./testdata/config.yaml).
### Go Profiling with pprof basics
The profiler can be used to improve a program.
The most common usage is a CPU profile, which determines where the program spends the most time while actively consuming resources.
After generating a profile, we can interpret it in different ways.
Go's _pprof_ offers a text, visualization, or web-based analysis.
To collect a meaningful profile, it should run on an idle machine and if that is not possible, it is best to generate the profile several times to get consistent results.
The profiler stops the program multiple times per second and collects information (such as program counters) at that point in time.
This is called a sample, and a profile is a collection of those samples.
#### Generating a profile
The extension enables the collection of profiling data expected by _pprof_.
To generate a profile, include the extension in your program and run it to start the server.
If you are using the default config, it will listen on `localhost:1777`.
To save a CPU profile on your machine, run `go tool pprof http://localhost:1777/debug/pprof/profile\?seconds\=30`.
This will enter the interactive mode of _pprof_, where you can analyze the profile.

There are different endpoints for other types of profiles.
For instance, the memory can be analyzed using `go tool pprof http://localhost:1777/debug/pprof/heap`.
To see all available profiles, visit `http://localhost:1777/debug/pprof/` in your browser.

#### Analyzing a profile

After running the above command to save the profile, _pprof_ will enter the interactive mode.
From here, the profiles can be analyzed.

Use the command `web` to open an image of the complete call graph in your browser.
Each box corresponds to a function in the program, and it is sized according to the number of samples in which this function was running.
This means, if the box of a function is bigger, it was executed more often than a function in a smaller box.
The arrows between boxes show the connectivity of the functions.
If there is an arrow from box A to B, A called B.
The numbers along the edges represent how often that call happened.
This includes every call of a recursive function.
The color of the edges also represents that number.
A red edge means more resources were used, whereas grey indicates the used resources were close to zero.

However, the complete call graph can be a bit noisy.
A good place to start breaking it down is using the `topN` command.
It will show you the top `N` nodes, consuming the most resources.
The output is a table, where the first two columns show the number and percentage of total samples where the function was running (`flat`).
The third column shows the total percentage, for instance stating that function X was running in 20% of the samples.
The two remaining columns show the cumulative (`cum`) numbers of the profile.

From here, the results can be filtered.
Choose one of the top consuming functions which you would like to analyze.
_pprof_ uses a regex-based search to filter for functions matching the input.
Type `web <function name>` to show the call graph for this specific function.
The image in your browser should now be more clear and less cluttered.

The `list` command is also useful.
Type `list <function name>` to see the source code of your function, annotated with the resource consumption (`flat` and `cum` columns like in the `topN` command).
If you prefer to view it in your browser, use the `weblist <function name>` command instead.
In this view, you can see which line exactly used the most resources and start to improve it.

0 comments on commit 399a74b

Please sign in to comment.