Question on quality of Training Files #344
Unanswered
ChrisSmithUNR
asked this question in
Q&A
Replies: 0 comments
First, thanks so much for helping create this awesome tool!
I am a PhD student who is relatively new to bioacoustics, and I am trying to train a model to identify American pika vocalizations. I have a few questions:
How important is it to use only high-quality training data, as opposed to also including sonograms in which the calls show up only faintly? My recordings span a wide range of quality, and I was wondering how that is likely to affect the model.
One recorder was placed near a stream. How important is it to apply a high-pass filter to remove the lower-pitched background noise? (Pika call frequencies are well above the background.)
Pika vocalizations are repeated notes in various rhythms, classified as either a "short call" (a few notes) or a "long call" (usually 5+ notes strung together). The individual notes of a "short call" and a "long call" are quite similar in structure; only the repetition and the gaps between notes differ. Is there a way to help the model recognize the difference between these two call types, given that there are often few differences within the 3-second sample window?
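To make the short-call vs. long-call distinction concrete, here is a minimal sketch of the rule as described above, written as a post-processing step on note onset times. It is only an illustration, not part of the tool: the onset times are assumed to come from some upstream note detector that is not shown, the function name `group_notes_into_calls` and the `max_gap_s` value are hypothetical, and the 5-note threshold is taken from the "usually 5+ notes" description.

```python
from typing import List, Tuple


def group_notes_into_calls(
    onsets: List[float],
    max_gap_s: float = 0.5,        # hypothetical: largest gap still treated as the same call
    long_call_min_notes: int = 5,  # from the "usually 5+ notes" description
) -> List[Tuple[str, List[float]]]:
    """Group note onset times (in seconds) into calls and label each call."""
    calls: List[List[float]] = []
    current: List[float] = []
    for t in sorted(onsets):
        # A gap longer than max_gap_s closes the current call and starts a new one.
        if current and t - current[-1] > max_gap_s:
            calls.append(current)
            current = []
        current.append(t)
    if current:
        calls.append(current)
    # Label by note count only; the notes themselves are assumed to look alike.
    return [
        ("long" if len(notes) >= long_call_min_notes else "short", notes)
        for notes in calls
    ]


# Made-up onset times: three notes, a long pause, then six notes.
example_onsets = [0.0, 0.3, 0.6, 5.0, 5.2, 5.4, 5.6, 5.8, 6.0]
for label, notes in group_notes_into_calls(example_onsets):
    print(label, len(notes))
```

Because the rule depends on note counts and gaps rather than on the shape of a single note, it can be applied across consecutive 3-second windows instead of within just one.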
Thanks for your help!
Sonogram image of the call types:
wdfw02210.pdf