DHBern/PTT_phonebook_data_tsv


PTT_phonebook_data_tsv

This data and the online search platform historic.localsearch make old telephone directory entries accessible to everyone through a free, public online database. historic.localsearch was created in a collaboration between the PTT archive, Digital Humanities at the University of Bern, and localsearch.

Data

The aim is to make historical data accessible to a broad public.

This repository contains the data underlying historic.localsearch.ch. All TSV files are in the folder "tsv" and can be loaded into an SQL database.
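
As a rough illustration of the loading step, the following Python sketch reads one TSV file into a SQLite table. It is not the import procedure used for historic.localsearch.ch: the function name `load_tsv`, the table name, and the file path are hypothetical, and the column layout of the files is not documented here, so the sketch assumes a header row and stores every column as text.

```python
import csv
import sqlite3


def load_tsv(conn, tsv_path, table):
    """Load one tab-separated file into a SQLite table (all columns as TEXT).

    Assumption: the first row of the file is a header with column names.
    Rows whose field count differs from the header are skipped.
    Returns the number of rows inserted.
    """
    with open(tsv_path, newline="", encoding="utf-8") as handle:
        # QUOTE_NONE keeps stray quote characters from OCR text as plain data.
        reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
        header = next(reader)
        columns = ", ".join(
            '"{}" TEXT'.format(name.replace('"', '""')) for name in header
        )
        placeholders = ", ".join("?" for _ in header)
        conn.execute('CREATE TABLE IF NOT EXISTS "{}" ({})'.format(table, columns))
        rows = [row for row in reader if len(row) == len(header)]
        conn.executemany(
            'INSERT INTO "{}" VALUES ({})'.format(table, placeholders), rows
        )
    conn.commit()
    return len(rows)
```

A call such as `load_tsv(sqlite3.connect("phonebook.db"), "tsv/example.tsv", "entries")` would then create and fill the table; both file names here are placeholders. SQLite is used only because it ships with Python, and the same approach carries over to other SQL databases.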

All data has been processed automatically from the OCRed phone book data. OCR errors are to be expected on several levels:

  • Errors in recognized characters (the OCR was not specifically trained on the typefaces used)
  • Errors in layout identification: in several cases the columns were not correctly identified, so entries are mixed up
  • Advertisements mixed with phone number entries

As a consequence, three different batches have been built:

  • A – Data from the beginning to 1910: Data on the level of pages
  • B – Data from 1911 to 1937: Data on the level of segmented entries
  • C – Data from 1938 to 1950: Data on the level of unsegmented entries
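
The year ranges above can be expressed as a small lookup, which may help when deciding how to treat a record. This is a sketch only: the function name `batch_for_year` is hypothetical, and because the first year covered is not stated, no lower bound is checked.

```python
def batch_for_year(year):
    """Return the batch letter (A, B or C) for a phone book year.

    A: up to 1910 (page level)
    B: 1911 to 1937 (segmented entries)
    C: 1938 to 1950 (unsegmented entries)
    """
    if year > 1950:
        raise ValueError("no batch is defined for years after 1950")
    if year <= 1910:
        return "A"
    if year <= 1937:
        return "B"
    return "C"
```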

Application

The folder "code_manual_replacement" contains a bash script and a document listing the replaced strings.
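
For readers who want to see the general idea of such a replacement pass, here is a minimal Python sketch. It is not the bash script from the repository: the function name `apply_replacements` and the example pair are hypothetical and are not taken from the document of replaced strings.

```python
def apply_replacements(text, replacements):
    """Apply each (old, new) string pair to the text, in the given order."""
    for old, new in replacements:
        text = text.replace(old, new)
    return text


# Hypothetical example pair, for illustration only.
example_pairs = [("Ziirich", "Zürich")]
corrected = apply_replacements("Meier, Ziirich", example_pairs)
```

Plain string replacement is order-dependent, so a later pair can alter the result of an earlier one. Keeping the pairs in a documented list makes the pass reproducible.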
