Use MAEST as title in the documentation.
This gives the model a cleaner URL and is consistent with other
models whose titles do not reference the training dataset (e.g.,
OpenL3, CREPE).
palonso committed Oct 20, 2023
1 parent 6b043f1 commit 92f0831
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions doc/sphinxdoc/models.rst
@@ -137,8 +137,8 @@ Models:
*Note: We provide models operating with a fixed batch size of 64 samples since it was not possible to port the version with dynamic batch size from ONNX to TensorFlow. Additionally, an ONNX version of the model with* `dynamic batch <https://essentia.upf.edu/models/feature-extractors/discogs-effnet/discogs-effnet-bsdynamic-1.onnx>`_ *size is provided.*


-Discogs-MAEST
-^^^^^^^^^^^^^
+MAEST
+^^^^^

Music Audio Efficient Spectrogram Transformer (`MAEST <https://github.com/palonso/MAEST/>`_) trained to predict music style labels using an in-house dataset annotated with Discogs metadata.
We offer versions of MAEST trained with sequence lengths ranging from 5 to 30 seconds (``5s``, ``10s``, ``20s``, and ``30s``), and trained starting from different initial weights: from random initialization (``fs``), from `DeiT <https://doi.org/10.48550/arXiv.2012.12877>`_ pre-trained weights (``dw``), and from `PaSST <https://doi.org/10.48550/arXiv.2106.07139>`_ pre-trained weights (``pw``). Additionally, we offer a version of MAEST trained following a teacher-student setup (``ts``).