This repository contains examples of how to profile Python applications using the `cProfile`, `snakeviz`, `line_profiler`, and `memory_profiler` modules.
To reproduce the same environment, I suggest using Conda as your package manager. If you have it installed, you can create the environment from `environment.yml` with

```
conda env create -f environment.yml
```

and activate it with

```
conda activate profiling
```
Time-based profiling allows you to see how much time your application spends in each one of its components.
We use the `cProfile` module to profile an entire Python script. In each example folder, you will find a `time/app-overview` folder that contains the relevant code, along with a `profile.sh` script that runs the Python code with `cProfile` enabled. This script generates a file, `example.prof`, that contains the profiling data.
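For reference, here is a minimal sketch of the kind of invocation such a script wraps; the `work` function below is a made-up stand-in for the example code:

```python
import cProfile

def work():
    """A stand-in workload; the real code lives in time/app-overview."""
    return sum(i * i for i in range(1_000_000))

# Equivalent to what a profile.sh wrapper typically runs:
#   python -m cProfile -o example.prof app.py
cProfile.run("work()", "example.prof")
```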
Even though it is possible to get statistics directly from `cProfile`, a great way to visualize the profiling results is with `snakeviz`. It's very easy to use: for each example, you will find a `visualize.sh` script that, when run, will launch `snakeviz` in a browser tab. Below is how a typical result looks:
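If you prefer to stay in the terminal, the statistics mentioned above can also be read directly with the standard-library `pstats` module; a minimal sketch, assuming `example.prof` has already been generated:

```python
import pstats

# Load the data written by cProfile and print the ten most expensive
# entries, sorted by cumulative time.
stats = pstats.Stats("example.prof")
stats.sort_stats("cumulative").print_stats(10)
```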
Once you have spotted which functions, methods, or routines consume most of the time in your application, you may want to dig deeper to see exactly which instructions under each of them are the hot ones. For each example, in `time/line-by-line`, we use `line_profiler` for that, which requires decorating the target function with `@profile`. The `profile.sh` script calls the relevant binary (`kernprof`) to generate the profiling data, which can then be visualized with the `visualize.sh` script. A typical output is:
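As a concrete illustration, here is a minimal sketch of a function set up for `line_profiler`; the function is made up rather than taken from the repository. Note that `profile` is injected as a builtin by `kernprof` at run time, so this file only works when launched through it, e.g. `kernprof -l -v example.py`:

```python
@profile  # provided by kernprof; running plain Python raises NameError
def build_squares(n):
    squares = []
    for i in range(n):        # per-line hit counts and timings land here
        squares.append(i * i)
    return squares

if __name__ == "__main__":
    build_squares(1_000_000)
```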
Understanding your Python application in terms of time is definitely an important step, but to better characterize your application's workload, we also need to understand how it uses memory.
We use the `memory_profiler` module to get an overview of how much memory a Python script is using as a function of time. For each example, the `memory/app-overview` folder contains the code to be profiled and a `profile.sh` script that uses the relevant binary (`mprof`) to generate the profiling data, which can be visualized using the `visualize.sh` script. A typical output is:
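To give a sense of what `mprof` records, `memory_profiler` also exposes its sampling machinery as a Python API; a minimal sketch, where `allocate` is a made-up workload rather than one of the repository's examples:

```python
from memory_profiler import memory_usage

def allocate():
    data = [0] * 10_000_000  # a large list, roughly 80 MB on CPython
    return sum(data)

# Sample the process's memory every 0.1 s while allocate() runs,
# which is essentially what mprof does before plotting.
samples = memory_usage((allocate, (), {}), interval=0.1)
print(f"peak: {max(samples):.1f} MiB")
```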
We can also target individual functions with the `@profile` decorator. `memory_profiler` will then show the amount of memory that the process associated with the Python interpreter is using as your code evolves, line by line. For each example, under `memory/line-by-line`, the `profile.sh` script runs the profiler and shows the results. A typical output is:
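For illustration, here is a minimal sketch of a decorated function, again made up rather than taken from the repository. Unlike with `kernprof`, the decorator can be imported explicitly, so the script also runs as plain Python and prints the line-by-line table when the function returns:

```python
from memory_profiler import profile

@profile
def grow_and_shrink():
    big = [0] * 10_000_000  # shows up as a memory increment
    small = big[:100]       # keep only a tiny slice
    del big                 # shows up as a decrement
    return small

if __name__ == "__main__":
    grow_and_shrink()
```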
Profiling Jupyter notebooks directly involves jumping through some hoops. The simplest alternative is to copy the content of your cells into a Python script. It is possible to get the same effect with the `nbconvert` module:

```
jupyter nbconvert <YourNB>.ipynb --to script
```

which will generate a `<YourNB>.py` script. Sometimes it looks quite ugly, though.