This module provides access to Transkribus PageXML files via Xquery functions. It is designed to be used in context of a Basex xml database, but should work with other xml databases as well.
Transkribus uses the PageXML format to store its data. There are a couple extensions including the use of the attribute @custom to store annotations on text regions, lines and tokens.
This API provides access to these structured strings. Furthermore it provides a simple mechanic to query structures spanning over multiple pages, which can be linked using Transkribus structural metadata.
xquerydoc -x /src -o /docs -f markdown