Create an API for your WordWeaver verb conjugator
This project is the outcome of an idea from Owennatékha Brian Maracle to create an online verb conjugator for Kanyen'kéha. The first iteration of this project required the use of a Finite State Transducer as the language model. This version is decoupled and instead uses flat-file, gzipped JSON to store the data needed for the conjugator front-end. This was decided on for the following reasons:
- It vastly simplifies the codebase. FSTs can be written in a variety of creative ways, and the process of having a modular plugin-architecture for them was difficult to maintain.
- It improves the speed. Querying an FST in production is fast, but still not as fast as querying this API.
- Less overhead. There's no need to index your data or do other performance-enhancing things that you would need to do with a database or with sorting out the serialization/deserialization architecture with an FST.
- Offline use. FSTs cannot run offline on mobile devices. Because the endpoints are fixed, the WordWeaver-UI can cache them with a service worker to allow complete offline functionality (except for file requests like returning docx, latex or csv files for conjugations)
Note: there is a limitation with using a flat file architecture. We have some optimizations in place so all you should need to do is drop your un-zipped, plain ol' JSON in here. Most conjugators that we have built fall somewhere in the 10,000 conjugations (French) to 300,000 (Mohawk) range. There are some variables here, like how many morphemes are defined for each conjugation etc. But generally speaking, after compression, our small French-type conjugators weigh in at around ~40KiB and our big Mohawk-type conjugators weigh in at < 2MiB which for most connections is perfectly fine. Especially considering this is only sent potentially once over the wire. If you have a monstrously large conjugator, say 1,000,000+ conjugations, you may have to look into other options. For more information, please visit the docs!
- Docker
- Docker Compose
- Python 3
-
After making sure you have the prerequisite technologies, clone this repo
-
In the root of the repo, make the following changes:
- Create a new folder in
src/wordweaver/data/
(ideally with the ISO code of your language) - Add an empty
__init__.py
file - Copy one of the
models.py
files in the other data directories and past it into your folder - Create four JSON files,
conjugations.json
,options.json
,pronouns.json
andverbs.json
- Populate the files with the data required for your language
- Update the
models.py
file to reflect your data structure - Pip install
wordweaver
(cd wordweaver && pip install -e .
) - Validate your data:
wordweaver validate
and fix any type errors - Once the validation passes, go back to the root and run
docker-compose build
- Then, to run your local version,
docker-compose up
- Then, visit
http://localhost/docs
to see the details of using your API. - You can stop the service by running
docker-compose down
- Change the
WWLANG
environment variable inenv-backend.env
to the name of your language folder
- Create a new folder in
-
Head over to wordweaver-UI repo and configure that for usage with this API.
The following instructions are for deploying using docker compose.
To run locally and deploy a basic service, you must have Docker installed, and then follow the steps from above:
- Once the validation passes, go back to the root and run
docker-compose build
- Then, to run your local version,
docker-compose up
- You can stop the service by running
docker-compose down
To build the production version:
Requirements:
- VPS
- Domain
- DNS control
- Set up DNS to point to your VPS that will host the site.
- Clone this repo
- Change the email and domain in
.env
- run
init-letsencrypt.sh
to get an SSL cert for your site - run
docker-compose -f docker-compose.prod.yml up
to spin up your site
Please visit the docs for more information.
Feel free to dive in! Open an issue or submit PRs.
This repo follows the Contributor Covenant Code of Conduct.