sample entries from the ACLEW spreadsheet
$ python sample.py aclew_spreadsheet.csv output.csv
or as a function:
from sample import sample  # assumes sample.py defines a function named sample
df = sample("data/ACLEW_corpora.csv", "output/aclew_sampled.csv")
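For readers who want a feel for what sampling entries from a spreadsheet can look like, here is a minimal sketch using only the standard library. It is not the code in sample.py: the names sample_rows and sample_csv, the n and seed parameters, and the choice of uniform sampling without replacement are all assumptions made for illustration.

```python
import csv
import random


def sample_rows(rows, n, seed=None):
    """Return up to n rows drawn uniformly without replacement (hypothetical helper)."""
    rng = random.Random(seed)
    return rng.sample(rows, min(n, len(rows)))


def sample_csv(in_path, out_path, n, seed=None):
    """Read a CSV, sample n rows, write them to out_path, and return them."""
    with open(in_path, newline="") as f:
        reader = csv.DictReader(f)
        fieldnames = reader.fieldnames or []
        rows = list(reader)

    picked = sample_rows(rows, n, seed)

    # Write the header plus the sampled rows, keeping the original column order.
    with open(out_path, "w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(picked)
    return picked
```

Passing a fixed seed makes the selection reproducible, which may be useful when the sampled list has to be shared between annotators.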
splice the selected audio files
$ python splice.py input_dir output_dir
input_dir is a folder containing all the audio files
as a function:
from splice import splice  # assumes splice.py defines a function named splice
splice(input_dir="data/audio_input", output_dir="output/spliced_audio_out")
choose a random set of five 2-minute regions (5x2min) in a daylong audio file and generate ELAN templates.
$ python templgen.py
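The region selection described above can be sketched as follows. This is an illustration only, not the logic of templgen.py: the function name choose_regions, its parameters, and the requirement that regions do not overlap are assumptions. Times are in whole seconds.

```python
import random


def choose_regions(total_s, n_regions=5, region_s=120, seed=None):
    """Pick n_regions non-overlapping (start, end) windows inside [0, total_s]."""
    free = total_s - n_regions * region_s
    if free < 0:
        raise ValueError("recording is too short for the requested regions")

    rng = random.Random(seed)
    # Draw sorted offsets in the leftover "free" time, then shift the i-th
    # offset by the length of the i regions placed before it. Sorted offsets
    # guarantee that each region starts at or after the previous one ends.
    offsets = sorted(rng.randint(0, free) for _ in range(n_regions))
    return [
        (off + i * region_s, off + (i + 1) * region_s)
        for i, off in enumerate(offsets)
    ]
```

For a 16-hour recording, choose_regions(16 * 3600) returns five (start, end) pairs of 120 seconds each, sorted by start time; these could then be written into whatever template format is needed.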