A simple script to parse a text corpus in the Khmer language and convert it to a form that can be imported by FieldWorks.
- Install .Net Core SDK 3.0 or later
- Run
dotnet tool restore
- Run
dotnet restore
- Run
dotnet fsi src/Khmer/doit.fsx file1.txt file2.txt ... fileNNN.txt > all-output.txt
to create output file - Run
dotnet fsi src/Khmer/split.fsx all-output.txt
to createsplit-output-NNN.sfm
files