Skip to content

Commit

Permalink
try to refer to the Figure
Browse files Browse the repository at this point in the history
  • Loading branch information
yiitozer committed Mar 12, 2024
1 parent 064964b commit edd6574
Showing 1 changed file with 4 additions and 4 deletions.
8 changes: 4 additions & 4 deletions paper/paper.md
Original file line number Diff line number Diff line change
Expand Up @@ -105,14 +105,14 @@ but also for educators, students, composers, sound designers, and individuals ex

# Core Functionalities
In the following, we briefly describe some of the main modules included in the Python toolbox libsoni.
For an illustration of some core functionalities, we also refer to Figure\ref{fig:libsoni}. A comprehensive API
For an illustration of some core functionalities, we also refer to Figure \ref{libsoni}. A comprehensive API
documentation of libsoni is publicly accessible through GitHub[^1]. Furthermore, the applications of core
functionalities are illustrated by educational Jupyter notebooks as an integral part of libsoni, providing
illustrative code examples within concrete MIR scenarios.

![Illustration of some core functionalities provided by the Python toolbox libsoni. \textbf{(a)} Logo
of libsoni. \textbf{(b)} Sonification of F0 trajectories. \textbf{(c)} Sonification of piano roll representations.
\textbf{(d)} Sonification of time–frequency representations.](teaser.pdf)\label{fig:libsoni}
\textbf{(d)} Sonification of time–frequency representations.](teaser.pdf){#libsoni width="100%"}



Expand All @@ -136,7 +136,7 @@ recording (rather than a musical score), the melody corresponds to a sequence of
(also called F0 values) representing the pitches of the tones. In real performances, these sequences often form
complex time--frequency patterns known as frequency trajectories, which may include continuous frequency glides
(glissando) or frequency modulations (vibrato). In libsoni, the F0 module allows for sonifying a sequence of frame-wise
frequency values that correspond to manually annotated or estimated F0 values (see also Figure~\ref{fig:libsoni}b). This module offers a
frequency values that correspond to manually annotated or estimated F0 values (see also Figure \ref{fig:libsoni}b). This module offers a
variety of adjustable parameters, allowing for the inclusion of additional partials to tonally enrich the sonification,
thereby generating sounds of different timbre. Moreover, users have the option to adjust the amplitude of each predicted
F0 value based on its confidence level, as provided by an F0 estimator. This allows for insights into the reliability
Expand All @@ -145,7 +145,7 @@ of the predictions.
## Piano-Roll Representations (\texttt{libsoni.pianoroll})
A symbolic score-based representation describes each note by parameters such as start time, duration, pitch, and other
attributes. This representation is closely related to MIDI encodings and is often visualized in the form of
two-dimensional piano-roll representations (see also Figure~\ref{fig:libsoni}c). In these representations, time is
two-dimensional piano-roll representations (see also Figure \ref{libsoni}c). In these representations, time is
encoded on the horizontal axis, pitch on the vertical axis, and each note is represented by an axis-parallel rectangle
indicating onset, pitch, and duration. This representation is widely used in several MIR tasks, including automatic
music transcription [@BenetosDDE19_MusicTranscription_SPM] und music score--audio music synchronization
Expand Down

0 comments on commit edd6574

Please sign in to comment.