Skip to content

Latest commit

 

History

History
168 lines (133 loc) · 8.53 KB

README.md

File metadata and controls

168 lines (133 loc) · 8.53 KB

StructureDataset PHOIBLE

CLDF Metadata: StructureDataset-metadata.json

Sources: sources.bib

PHOIBLE's phoneme inventories are modeled as follows:

  • Phonemes are rows in ParameterTable
  • Inventories are rows in inventories.csv
  • Each row in ValueTable marks a Phoneme as member of an inventory
property value
dc:conformsTo CLDF StructureDataset
dc:identifier https://phoible.org/
dc:license https://creativecommons.org/licenses/by/4.0/
dcat:accessURL https://github.com/cldf-datasets/phoible
prov:wasDerivedFrom
  1. phoible/dev v2.0-54-g435f657
  2. cldf-datasets/phoible v2.0.1-1-g7cde4cb
  3. Glottolog v4.3
prov:wasGeneratedBy
  1. python: 3.8.5
  2. python-packages: requirements.txt
rdf:ID phoible
rdf:type http://www.w3.org/ns/dcat#Distribution
property value
dc:conformsTo CLDF ValueTable
dc:extent 105459

Columns

Name/Property Datatype Description
ID string Primary key
Language_ID string References languages.csv::ID
Parameter_ID string References parameters.csv::ID
Value string
Source list of string (separated by ;) References sources.bib::BibTeX-key
Marginal boolean Marginal phonemes are those that are notably different phonologically from the majority of segments found in a particular language. For example, loanwords containing non-native sounds can introduce marginal phonemes into the borrowing language. Any type of phoneme described as “marginal”, “dubious” or “occurs only in loan words” is included in the database alongside other phonemes, but is marked with a boolean 'True' value. Ordinary phonemes typically have the value 'False', although for data sources that explicitly exclude marginal phonemes, the value is NULL, i.e. ''.
Allophones list of string (separated by ) If a phonological description includes some information about allophonic alternations, it is assumed that phonemes with no allophones mentioned do not exhibit “major” allophonic variation (in the judgment of the original author(s) of the language description). For such phonemes, the sole allophone is treated as identical with the phonemic representation. In phonological descriptions where no information about allophonic alternations was given about any of the phonemes, the description is considered incomplete with regard to allophonic information, and all phonemes of that inventory are given an empty value for their allophones.
Inventory_ID string References inventories.csv::ID

PHOIBLE includes distinctive feature data for every phoneme in every inventory. The feature system was created by the PHOIBLE developers to be descriptively adequate cross-linguistically. In other words, if two phonemes differ in their graphemic representation, then they should necessarily differ in their featural representation as well (regardless of whether those two phonemes coexist in any known doculect). The feature system is loosely based on the feature system in Hayes (2009) with some additions drawn from Moisik & Esling (2011).

property value
dc:conformsTo CLDF ParameterTable
dc:extent 3164

Columns

Name/Property Datatype Description
ID string Primary key
Name string
Description string
SegmentClass string
tone string
stress string
syllabic string
short string
long string
consonantal string
sonorant string
continuant string
delayedRelease string
approximant string
tap string
trill string
nasal string
lateral string
labial string
round string
labiodental string
coronal string
anterior string
distributed string
strident string
dorsal string
high string
low string
front string
back string
tense string
retractedTongueRoot string
advancedTongueRoot string
periodicGlottalSource string
epilaryngealSource string
spreadGlottis string
constrictedGlottis string
fortis string
raisedLarynxEjective string
loweredLarynxImplosive string
click string
property value
dc:conformsTo CLDF LanguageTable
dc:extent 2177

Columns

Name/Property Datatype Description
ID string Primary key
Name string
Macroarea string
Latitude decimal
Longitude decimal
Glottocode string
ISO639P3code string
Family_Glottocode string Glottocode of the top-level family this variety belongs to.
Family_Name string Name of the top-level family this variety belongs to.

This table lists the phoneme inventories which are aggregated in PHOIBLE

property value
dc:extent 3020

Columns

Name/Property Datatype Description
ID string Primary key
Name string
Inventory_source_ID string References inventory_sources.csv::ID
Source list of string (separated by ;) References sources.bib::BibTeX-key
URL string
count_phonemes integer
count_consonants integer
count_vowels integer
count_tones integer
property value
dc:extent 10

Columns

Name/Property Datatype Description
ID string Primary key
Name string
Description string
Readme string
Contents string Semi-structured description of the contents of inventories fromthis contribution
Source list of string (separated by ;) References sources.bib::BibTeX-key
URL string
with_tones boolean 1 if the inventories from this source containtones, 0 if tone is systematically left out.