Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CLDR-17921 merge DDL v47 branch to main #4087

Open
wants to merge 4 commits into
base: main
Choose a base branch
from
Open

CLDR-17921 merge DDL v47 branch to main #4087

wants to merge 4 commits into from

Commits on Sep 18, 2024

  1. CLDR-17897 Make ConvertLanguageData Consistent (#4015)

    If we re-run ConvertLanguageData on unrelated data, it will update the order and values of some other data -- this fixes inconsistencies with the XML outputs to match expectations. The biggest change was updating values in `language_script.tsv` to demote script variations to secondary when they really are not expected. Furthermore I added explicit annotations to `country_language_population.tsv` when the writing system for a country was a variant.
    
    Scripts ran:
    
     mvn package -DskipTests=true
     java -jar tools/cldr-code/target/cldr-code.jar ConvertLanguageData
     java -jar tools/cldr-code/target/cldr-code.jar GenerateLikelySubtags
    conradarcturus authored Sep 18, 2024
    Configuration menu
    Copy the full SHA
    8ac1a2f View commit details
    Browse the repository at this point in the history

Commits on Sep 27, 2024

  1. CLDR-17921 merge from main

    srl295 committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    4f68a60 View commit details
    Browse the repository at this point in the history

Commits on Oct 1, 2024

  1. CLDR-11567 Add population counts for El Salvadoran Languages (#4064)

    El Salvador has a number of local languages not yet included in CLDR -- this adds them.
    
    See https://unicode-org.atlassian.net/browse/CLDR-11567 and https://translatorswithoutborders.org/language-data-for-el-salvador
    
    Re-generated the xml with mvn package -DskipTests=true &&  java -jar tools/cldr-code/target/cldr-code.jar ConvertLanguageData &&  java -jar tools/cldr-code/target/cldr-code.jar GenerateLikelySubtags
    conradarcturus authored Oct 1, 2024
    Configuration menu
    Copy the full SHA
    248cded View commit details
    Browse the repository at this point in the history
  2. CLDR-10015 Update errors in Likely Subtags (#4068)

    `zlm` and `apd` are not languages of Togo -- there's a copy-paste error in the Likely Subtags overrides.
    
    This fixes `apd` (Sudanese Arabic) by adding population counts -- I'll note that the ethnologue estimate is 48,000,000 million people, but the last Sudan census we have in the data shows only ~41 million people in the country -- so I want to be cautious. Since I had problems finding a reliable census I just duplicated the Standard Arabic value since likely most people in Sudan that use Arabic speak the Sudanese dialect but write Standard Arabic/ar.
    
    For `zlm` -- Malay (individual language) that's even harder to get a thorough population value so I just fixed the override entry in GenerateLikelySubtags.
    conradarcturus authored Oct 1, 2024
    Configuration menu
    Copy the full SHA
    47b44af View commit details
    Browse the repository at this point in the history