AfroLID, a neural LID toolkit for 517 African languages and varieties. AfroLID exploits a multi-domain web dataset manually curated from across 14 language families utilizing five orthographic systems
GitHub link: https://github.com/UBC-NLP/afrolid
Online demo link: https://demos.dlnlp.ai/afrolid
The full documentation contains instructions for getting started, translation using diffrent methods, intergrate AfroLID with your code, and provides more examples.
afrolid(-py) is Apache-2.0 licensed. The license applies to the pre-trained models as well.
If you use AfroLID toolkit or the pre-trained models for your scientific publication, or if you find the resources in this repository useful, please cite our paper as follows:
useful, please cite our paper as follows:
@article{adebara2022afrolid, title={AfroLID: A Neural Language Identification Tool for African Languages}, author={Adebara, Ife and Elmadany, AbdelRahim and Abdul-Mageed, Muhammad and Inciarte, Alcides Alcoba}, booktitle = "Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP)", month = December, year = "2022", }