Question on quality of Training Files #344

ChrisSmithUNR · 2024-05-24T20:39:00Z

First, thanks so much for helping create this awesome tool!

I am a PhD student, relatively new to bioacoustics, and I am trying to train a model to identify American pika vocalizations. I have a few questions:

1. How important is it to use only high-quality training data, as opposed to also including sonograms where the calls show up only faintly? My recordings cover a wide range of quality, and I was wondering how that is likely to impact the model.
2. One recorder was placed near a stream. How important is it to apply a high-pass filter to remove the lower-pitched background noise? (Pika call frequency is well above the background.)
3. Pika vocalizations are repeated notes in various rhythms, classified as either a "short call" (a few notes) or a "long call" (usually 5+ notes strung together). The individual notes of a "short call" and a "long call" are quite similar in structure; only the repetition and the gaps between notes differ. Is there a way to help the model recognize the difference between these two call types, given that there are often few differences within the 3-second sample window?
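
For reference, the kind of high-pass filtering asked about in the stream-noise question above could be sketched roughly as follows. This is a minimal first-order filter in plain Python, only meant to illustrate the idea; it is not what any particular tool does internally, and a steeper filter would probably be a better fit for real recordings. The sample rate, the cutoff, and the two test-tone frequencies are placeholder assumptions, not values measured from pika calls or stream noise.

```python
import math


def high_pass(samples, sample_rate_hz, cutoff_hz):
    """First-order (RC-style) high-pass filter: attenuates content below cutoff_hz."""
    if not samples:
        return []
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)
    dt = 1.0 / sample_rate_hz
    alpha = rc / (rc + dt)
    out = [samples[0]]
    for i in range(1, len(samples)):
        # Each output follows the change in the input, so slow drifts decay away.
        out.append(alpha * (out[-1] + samples[i] - samples[i - 1]))
    return out


def rms(values):
    """Root-mean-square level of a signal."""
    return math.sqrt(sum(v * v for v in values) / len(values))


def tone(freq_hz, sample_rate_hz, seconds=0.5):
    """Pure sine tone, used here as a stand-in for real audio."""
    n = int(sample_rate_hz * seconds)
    return [math.sin(2.0 * math.pi * freq_hz * i / sample_rate_hz) for i in range(n)]


# All values below are hypothetical placeholders, not measurements.
SAMPLE_RATE_HZ = 48_000
CUTOFF_HZ = 1_000
LOW_TONE_HZ = 100     # stands in for low-pitched background noise
HIGH_TONE_HZ = 5_000  # stands in for a higher-pitched call

low = tone(LOW_TONE_HZ, SAMPLE_RATE_HZ)
high = tone(HIGH_TONE_HZ, SAMPLE_RATE_HZ)

# Fraction of each tone's level that survives the filter.
low_gain = rms(high_pass(low, SAMPLE_RATE_HZ, CUTOFF_HZ)) / rms(low)
high_gain = rms(high_pass(high, SAMPLE_RATE_HZ, CUTOFF_HZ)) / rms(high)

print(f"low tone kept:  {low_gain:.2f}")
print(f"high tone kept: {high_gain:.2f}")
```

With these placeholder settings, the low test tone is strongly attenuated while the high test tone passes mostly unchanged, which is the effect the question is about.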

Thanks for your help!

Sonogram image of the call types:
wdfw02210.pdf