Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Each Cōdex should also explain related files #20

Closed
fititnt opened this issue Feb 11, 2022 · 2 comments
Closed

Each Cōdex should also explain related files #20

fititnt opened this issue Feb 11, 2022 · 2 comments

Comments

@fititnt
Copy link
Member

fititnt commented Feb 11, 2022

Related:


Currently each /Cōdex/@lat-Latn both document all direct information on the main /Dictiōnāria/@lat-Latn and attach explanations of lingual and interlingual information which de facto will appear on the /Cōdex/@lat-Latn. This can be improved, but already works.

However, we also need to do a similar approach, but for the archive files themselves. Such documentation needs to be targeted: only appear for files which actually are pre-compiled. Also already with exact file names.

Reasoning

Each /Cōdex/@lat-Latn have as target audience humans likely to review or fix errors on source of the Information. However, we know that users also intended to create derived works (or which may be tempted to do a lot of copy and pasting) may not know that we explicitly have machine readable files.

We're also having issues with Unicode rendering of the graphenes. So more than one option of Codex needs to be documented and extra information.

The PDF (despite being less accessible) has the advantage of allowing embed fonts so we're sure end users can at bare minimum see the content. However this depends on our realistic skill to implement it correctly. The issue on this point ishttps://github.com/EticaAI/multilingual-lexicography/issues/13. The ASCIIdoctor version is already the one we use to prepare the PDF (and could be used to export to many other formats) so we document that version too.

Limitation

The embed descriptions of files should not be overlong as this would be better done by dedicated books which explain the files better. The motivation for this is also that the explanation of the files would be better to not be hard to be translated. And by keeping short and focused, we somewhat help with this.

@fititnt
Copy link
Member Author

fititnt commented Apr 16, 2022

Status quo

Each Cōdex already uses heavily these group of dictionaries

  • [1603:1:51] //Dictiōnāria Linguārum ad MMXXII ex Numerordĭnātĭo//
    • for linguistic content (like language codes, example ara-Arab = Macrolingua Arabica (Abecedarium Arabicum) )
  • [1603:1:7] //Dictiōnāria basibus dē rēs interlinguīs//
    • for inter-linguistic content (like the properties, example: ix_wikip655 = /translator/@eng-Latn)

Other not so integrated dictionaries

  • [1603:1:1] //Numeroclatura Generālium ad MM ex Numerordĭnātĭo//
  • [1603:1:99] Nōtitiae temporāriōrum circā librārium
    • This is what Brazilians would call gambiarra. Not ideal, but temporary messages previously hard-coded on Codex generation are at least moved to here

Dictionaries NOT integrated (need prepare dictionaries first)

These ones are done as we develop the things. so they do exist as dictionaries, but are not used yet 100% integrated to explain the Codex

  • [1603:1:6] /Dictiōnāria de terminī typīs/
  • [1603:1:101] //Dictiōnāria Archīōrum ad MMXXII ex Numerordĭnātĭo//

@fititnt
Copy link
Member Author

fititnt commented Jul 22, 2022

Already implemented some months ago. Minor improvements could be done (likely move some things from python code to tabular format) but the main issues was solved.

@fititnt fititnt closed this as completed Jul 22, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant