[extension/pprof] Improve documentation for pprof usage (open-telemet…

…ry#28665) **Description:** Update readme with instructions on how to use pprof **Link to tracking Issue:** open-telemetry#26095 **Testing:** Not applicable since no functional code was changed. **Documentation:** The extension readme now contains information what a profile is, how it can be generated and analysed. Co-authored-by: Tyler Helmuth <[email protected]>
rnishtala-sumo · Nov 20, 2023 · 399a74b · 399a74b
1 parent dae730c
commit 399a74b
Showing 1 changed file with 57 additions and 0 deletions.
diff --git a/extension/pprofextension/README.md b/extension/pprofextension/README.md
@@ -48,3 +48,60 @@ extensions:
 
 The full list of settings exposed for this exporter are documented [here](./config.go)
 with detailed sample configurations [here](./testdata/config.yaml).
+
+
+### Go Profiling with pprof basics
+
+The profiler can be used to improve a program.
+The most common usage is a CPU profile, which determines where the program spends the most time while actively consuming resources.
+After generating a profile, we can interpret it in different ways.
+Go's _pprof_ offers a text, visualization, or web-based analysis.
+To collect a meaningful profile, it should run on an idle machine and if that is not possible, it is best to generate the profile several times to get consistent results.
+
+The profiler stops the program multiple times per second and collects information (such as program counters) at that point in time.
+This is called a sample, and a profile is a collection of those samples.
+
+#### Generating a profile
+The extension enables the collection of profiling data expected by _pprof_.
+To generate a profile, include the extension in your program and run it to start the server.
+If you are using the default config, it will listen on `localhost:1777`.
+To save a CPU profile on your machine, run `go tool pprof http://localhost:1777/debug/pprof/profile\?seconds\=30`.
+This will enter the interactive mode of _pprof_, where you can analyze the profile.
+
+There are different endpoints for other types of profiles.
+For instance, the memory can be analyzed using `go tool pprof http://localhost:1777/debug/pprof/heap`.
+To see all available profiles, visit `http://localhost:1777/debug/pprof/` in your browser.
+
+#### Analyzing a profile
+
+After running the above command to save the profile, _pprof_ will enter the interactive mode.
+From here, the profiles can be analyzed.
+
+Use the command `web` to open an image of the complete call graph in your browser.
+Each box corresponds to a function in the program, and it is sized according to the number of samples in which this function was running.
+This means, if the box of a function is bigger, it was executed more often than a function in a smaller box.
+The arrows between boxes show the connectivity of the functions.
+If there is an arrow from box A to B, A called B.
+The numbers along the edges represent how often that call happened.
+This includes every call of a recursive function.
+The color of the edges also represents that number.
+A red edge means more resources were used, whereas grey indicates the used resources were close to zero.
+
+However, the complete call graph can be a bit noisy.
+A good place to start breaking it down is using the `topN` command.
+It will show you the top `N` nodes, consuming the most resources.
+The output is a table, where the first two columns show the number and percentage of total samples where the function was running (`flat`).
+The third column shows the total percentage, for instance stating that function X was running in 20% of the samples.
+The two remaining columns show the cumulative (`cum`) numbers of the profile.
+
+From here, the results can be filtered.
+Choose one of the top consuming functions which you would like to analyze.
+_pprof_ uses a regex-based search to filter for functions matching the input.
+Type `web <function name>` to show the call graph for this specific function.
+The image in your browser should now be more clear and less cluttered.
+
+The `list` command is also useful.
+Type `list <function name>` to see the source code of your function, annotated with the resource consumption (`flat` and `cum` columns like in the `topN` command).
+If you prefer to view it in your browser, use the `weblist <function name>` command instead.
+In this view, you can see which line exactly used the most resources and start to improve it.
+