Skip to content

Latest commit

 

History

History
21 lines (15 loc) · 939 Bytes

README.md

File metadata and controls

21 lines (15 loc) · 939 Bytes

abby2dsed.py - Converts Abby FineReader XML* to Dsed-File, embeddabe to DJVU.

  • for example, for Public Domain scans you can get Abby-xml for free from archive.org, or use Free-to-Try version of FineReader, or buy a FineReader.

Example of an ABBY xml (in DOWNLOAD OPTIONS - ABBYY GZ): https://archive.org/details/maupassant_selected_by_tolstoi_transl_lazareva_1894_rus

Usage:

  abby2dsed.py filename.xml

It will produce the file filename.dsed in a same folder (inside it will have only unicode text with bracket-like structure, human-readable and editable). All coordinates are starting from left-down corner of an each page.

To embed dsed to DJVU, use

djvused -u -s -f filename.dsed filename.djvu

(djvused is a part of DJVULibre package in Linux, there are Windows and MacOS/OSX versions somewhere).

xml2dsed.py - I have used it as an example, it was published in italian Wikisource, you'll find a link inside