Process and related code to tag Bible words with Strong's numbers

sabdelmalik/BibleTagging

Bible Tagging

This project was started specifically to tag the Smith & Van Dyck Arabic Bible translation with Strong’s numbers. However, the process can be applied to virtually any Bible; it was recently applied to a Zokam (Burmese) Bible.

Process Summary

  1. The Bible text is provided in a Verse Per Line (VPL) format.
  2. The text is then normalized and stemmed: diacritics and punctuation marks are removed, then words are reduced to a basic form (not exactly the roots of the words).
  3. A publicly available tool, Berkeley Aligner, is used to map the text to Strong’s numbers. The process is not perfect, but has around 75% accuracy (which can vary between languages).
  4. As a last step, the mapped text is converted to OSIS format from which a SWORD module can be created. SWORD modules are used by many Bible study tools.
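
To illustrate the normalization part of step 2, here is a minimal Python sketch. It is not the code used in this repository. The VPL layout assumed here (`Book Chapter:Verse text`) and the helper names `parse_vpl_line` and `normalize` are hypothetical, and stemming is language-specific so it is left out.

```python
import unicodedata


def parse_vpl_line(line):
    """Split an assumed 'Book Chapter:Verse text' line into (reference, text)."""
    book, chapter_verse, text = line.strip().split(" ", 2)
    return f"{book} {chapter_verse}", text


def normalize(text):
    """Remove diacritics and punctuation, then collapse whitespace."""
    # NFD decomposition turns diacritics into separate combining marks.
    decomposed = unicodedata.normalize("NFD", text)
    kept = []
    for ch in decomposed:
        category = unicodedata.category(ch)
        if category == "Mn":
            # Nonspacing combining mark (e.g. Arabic harakat, Latin accents): drop it.
            continue
        if category.startswith("P"):
            # Punctuation: replace with a space so adjacent words stay separate.
            kept.append(" ")
            continue
        kept.append(ch)
    return " ".join("".join(kept).split())


if __name__ == "__main__":
    ref, text = parse_vpl_line("Gen 1:1 In the beginning, God created...")
    print(ref, "|", normalize(text))
```

Note that NFD decomposition also strips marks that are part of some precomposed letters, which may or may not be wanted for a given language.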

Refer to From-VPL-Bible-to-tagged-SWORD-module in this repository for details.
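
As a rough illustration of step 4, the sketch below renders one tagged verse as an OSIS fragment, using the common OSIS convention of a `<w>` element with a `lemma="strong:..."` attribute. The function name `osis_verse` and the input shape (a list of word and Strong's number pairs) are assumptions made for this example; the repository's actual converter may differ.

```python
from xml.sax.saxutils import escape, quoteattr


def osis_verse(osis_id, tagged_words):
    """Render (word, strongs_or_None) pairs as a single OSIS <verse> element."""
    parts = []
    for word, strongs in tagged_words:
        if strongs:
            # Tagged word: wrap it in <w> with the Strong's number as the lemma.
            lemma = quoteattr("strong:" + strongs)
            parts.append(f"<w lemma={lemma}>{escape(word)}</w>")
        else:
            # Untagged word: emit as plain (escaped) text.
            parts.append(escape(word))
    return f"<verse osisID={quoteattr(osis_id)}>{' '.join(parts)}</verse>"


if __name__ == "__main__":
    print(osis_verse("Gen.1.1", [("In", None), ("beginning", "H7225")]))
```

A complete OSIS document also needs the enclosing header, book, and chapter elements, which are omitted here.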
