CLDF Metadata: StructureDataset-metadata.json
Sources: sources.bib
Table data.csv
Values are coded datapoints, i.e. measurements of a variable for a society.
Note: Missing data is signaled by an empty Value column.
property | value |
---|---|
dc:conformsTo | CLDF ValueTable |
dc:extent | 631668 |
Name/Property | Datatype | Description |
---|---|---|
ID | string Regex: [a-zA-Z0-9_\-]+ |
Primary key |
Soc_ID | string |
References societies.csv::ID |
Var_ID | string |
References variables.csv::ID |
Value | string |
Values for categorical and ordinal variables reference the corresponding code via the Code_ID column. Values for continuous variables have the measured number in the Value column and an empty Code_ID. |
Code_ID | string |
References codes.csv::ID |
Comment | string |
|
Source | list of string (separated by ; ) |
References sources.bib::BibTeX-key |
sub_case |
string |
More specific description of the population the data refer to in terms of society or area. |
year |
string Regex: -?[0-9]{1,4}(-[0-9]{4})? |
Focal year, i.e. the time period to which the data refer. |
source_coded_data |
string |
The source of the coded data, which was aggregated in this dataset. |
admin_comment |
string |
Table societies.csv
The aggregated D-PLACE data lists two kinds of entities in its LanguageTable: Societies, i.e. cultural groups for which D-PLACE datasets provide data, and Languoids, i.e. language (varieties) which are referenced in Phlorest phylogenies. The two kinds are marked as "society" and "languoid"respectively in the type column.
property | value |
---|---|
dc:conformsTo | CLDF LanguageTable |
dc:extent | 6174 |
Name/Property | Datatype | Description |
---|---|---|
ID | string Regex: [a-zA-Z0-9_\-]+ |
Primary key |
Name | string |
|
Latitude | decimal ≥ -90 ≤ 90 |
|
Longitude | decimal ≥ -180 ≤ 180 |
|
Glottocode | string Regex: [a-z0-9]{4}[1-9][0-9]{3} |
|
Name_and_ID_in_source |
string |
Society names identified as pejorative have been replaced with a preferred, English-language ethnonym. The name (and ID) as given in the source dataset is kept in this field. |
xd_id |
string |
“cross-data-set” identifier, used to link societies present in different datasets, if they share a focal location. Note: If this field is empty, other fields such as Name, Glottocode, focal year and location may be used to identify societies across datasets if appropriate. |
alt_names_by_society |
list of string (separated by ; ) |
A list of ‘alternate’ names for the society; includes, where available, one or more autonyms in the society’s own language, as well as other commonly encountered ethnonyms. |
main_focal_year |
integer |
Focal year specifying the time period to which the data refer, given as number of years BCE - if negative - or CE. |
HRAF_name_ID |
string |
Name(s) and ID(s) of the corresponding society in HRAF (the Human Relations Area Files) |
HRAF_ID |
string |
ID of the corresponding society in HRAF |
origLat |
decimal ≥ -90 ≤ 90 |
Uncorrected latitude as given in the source. |
origLong |
decimal ≥ -270 ≤ 180 |
Uncorrected longitude as given in the source. |
comment | string |
|
glottocode_comment |
string |
Comment on the Glottocode assignment. |
region |
string |
World Geographical Scheme for Recording Plant Distributions level2 region |
type |
string Valid choices: society languoid |
|
Language_Level_Glottocodes |
list of string (separated by ) |
Glottocode(s) of the language-level languoid(s) in Glottolog associated with the languoid specified by Glottocode. Matches the "Glottocode" column for languages, but differs for dialects and lists all contained languages for subgroups. The language-level Glottocodes can be used to match societies to languages in the Glottolog classification trees. |
ISO639P3code | string |
|
Contribution_ID | string |
References contributions.csv::ID |
Table variables.csv
Variables are cultural features or practices, or environmental descriptors.
property | value |
---|---|
dc:conformsTo | CLDF ParameterTable |
dc:extent | 2987 |
Name/Property | Datatype | Description |
---|---|---|
ID | string Regex: [A-Za-z.0-9_]+([0-9]+)? |
Primary key |
Name | string |
|
Description | string |
|
ColumnSpec | json |
|
category |
list of string Valid choices: Anthropometry Architecture Ceramics and Art Ceremony Childhood Class Climate Clothing Community Community organization Data Quality Death Demography Dwelling Dwellings Ecology Economics Economy Games Gender Gossip Health Household Housing Infancy Kinship Labor Labour Law and Judicial Process Leadership Life cycle Marriage Material culture Metalworking Modernization Mourning Physical Landscape Political Organization Politics Population Property Religion Ritual Settlement Settlements Sexual practices Social Organization and Stratification Special Knowledge and Practices Subsistence Technology Tools Utensils War Warfare Watercraft and Navigation Wealth Transactions Wealth transactions and Textiles (separated by , ) |
|
type |
string Valid choices: Continuous Categorical Ordinal |
Variables may be categorical (and then must be accompanied by a list of possible ‘codes’, i.e. rows in Codetable. Variables can also be continuous (e.g. Population size) or ordinal. Ordinal variables are accompanied by a list of codes (like categorical variables). The order of codes is encoded as ord column in CodeTable. |
unit |
string |
The unit of measurement |
source_comment |
string |
A note about the source of this variable. |
changes |
string |
Notes about how a variable may have been derived from the source. |
comment | string |
|
Contribution_ID | string |
References contributions.csv::ID |
Table codes.csv
property | value |
---|---|
dc:conformsTo | CLDF CodeTable |
dc:extent | 15450 |
Name/Property | Datatype | Description |
---|---|---|
ID | string Regex: [a-zA-Z0-9_\-]+ |
Primary key |
Var_ID | string |
The parameter or variable the code belongs to. References variables.csv::ID |
Name | string |
|
Description | string |
|
ord |
integer |
Table media.csv
For better re-usability, D-PLACE provides the Glottolog classification trees and Phlorest phylogenies in the NEXUS file format.
property | value |
---|---|
dc:conformsTo | CLDF MediaTable |
dc:extent | 109 |
Name/Property | Datatype | Description |
---|---|---|
ID | string Regex: [a-zA-Z0-9_\-]+ |
Primary key |
Name | string |
|
Description | string |
|
Media_Type | string Regex: [^/]+/.+ |
|
Download_URL | anyURI |
|
Path_In_Zip | string |
Table contributions.csv
Both, individual D-PLACE datasets as well as Phlorest phylogenies and Glottolog classification trees are citable units - and should be cited, if their data is used.
property | value |
---|---|
dc:conformsTo | CLDF ContributionTable |
dc:extent | 122 |
Name/Property | Datatype | Description |
---|---|---|
ID | string Regex: [a-zA-Z0-9_\-]+ |
Primary key |
Name | string |
|
Description | string |
|
Contributor | string |
|
Citation | string |
|
DOI |
string |
|
type |
string Valid choices: dataset phylogeny |
D-PLACE aggregates two kinds of data: D-PLACE datasets, i.e. lists variables and coded values for cultural groups and language phylogenies from Phlorest. |
Source | list of string (separated by ; ) |
References sources.bib::BibTeX-key |
Table trees.csv
D-PLACE contains the summary trees of Phlorest phylogenies and classification trees for Glottolog families which are associated with at least two societies in D-PLACE.
property | value |
---|---|
dc:conformsTo | CLDF TreeTable |
dc:extent | 109 |
Name/Property | Datatype | Description |
---|---|---|
ID | string Regex: [a-zA-Z0-9_\-]+ |
Primary key |
Name | string |
Name of tree as used in the tree file, i.e. the tree label in a Nexus file or the 1-based index of the tree in a newick file |
Description | string |
Describe the method that was used to create the tree, etc. |
Tree_Is_Rooted | boolean Valid choices: Yes No |
Whether the tree is rooted (Yes) or unrooted (No) (or no info is available (null)) |
Tree_Type | string Valid choices: summary sample |
Whether the tree is a summary (or consensus) tree, i.e. can be analysed in isolation, or whether it is a sample, resulting from a method that creates multiple trees |
Tree_Branch_Length_Unit | string Valid choices: change substitutions years centuries millennia |
The unit used to measure evolutionary time in phylogenetic trees. |
Media_ID | string |
References a file containing a Newick representation of the tree, labeled with identifiers as described in the LanguageTable (the Media_Type column of this table should provide enough information to chose the appropriate tool to read the newick) References media.csv::ID |
Source | list of string (separated by ; ) |
References sources.bib::BibTeX-key |
Contribution_ID | string |
References contributions.csv::ID |