You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Dear Collatex creators,
thank you so much for making your tool available!! It sounds super useful. Sadly, I haven't been able to run it. It would be lovely if you can show me an example of how to read a .txt file using the Command Line Interface.
Say I have output-adobe.txt + output-tesseract.txt + original.txt and want to compare them.
I open collatex like: C:\Users\xxxx\Desktop> java -jar collatex-tools-1.7.1.jar
and then?
The text was updated successfully, but these errors were encountered:
Typing:
C:\Users\xxxxx\Desktop> java -jar collatex-tools-1.7.1.jar output-adobe.txt output-tesseract.txt original.txt
should produce an alignment table in JSON format.
If you don't like the JSON format there are other formats, for example comma separated values (CSV).
Although I was not able to get the encoding right :(
In the documentation I found that:
plain text version can also be provided in other encodings supported by the Java Platform and will be converted to Unicode before comparison. The command line interface is one such interface which supports character set conversions
Dear Collatex creators,
thank you so much for making your tool available!! It sounds super useful. Sadly, I haven't been able to run it. It would be lovely if you can show me an example of how to read a .txt file using the Command Line Interface.
Say I have output-adobe.txt + output-tesseract.txt + original.txt and want to compare them.
I open collatex like:
C:\Users\xxxx\Desktop> java -jar collatex-tools-1.7.1.jar
and then?
The text was updated successfully, but these errors were encountered: