Elpis is a tool which allows language workers with minimal computational experience to build their own speech recognition models to automatically transcribe audio. Elpis provides a way to use multiple speech recognition systems for orthographic or phonemic transcription. The current systems included are Kaldi and Huggingface Transformers wav2vec2.
Documentation is here.
This software is the product of academic research funded by the Australian Research Council Centre of Excellence for the Dynamics of Language. If you use the software in an academic setting, please cite it appropriately as follows:
Foley, B., Arnold, J., Coto-Solano, R., Durantin, G., Ellison, T. M., van Esch, D., Heath, S., Kratochvíl, F., Maxwell-Smith, Z., Nash, D., Olsson, O., Richards, M., San, N., Stoakes, H., Thieberger, N. & Wiles, J. (2018). Building Speech Recognition Systems for Language Documentation: The CoEDL Endangered Language Pipeline and Inference System (Elpis). In S. S. Agrawal (Ed.), The 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU) (pp. 200–204). Available on https://www.isca-archive.org/sltu_2018/foley18_sltu.pdf.