Support 歴史的仮名遣い (historical kana orthography) #31

Open
jet082 opened this issue May 17, 2023 · 1 comment

Comments

@jet082

jet082 commented May 17, 2023

As the title says, it would be nice to be able to automatically change ゑ to え, and even を to お (as in をとこ), as well as to transform けふ to きょう, くわ to か, げう to ぎょう, etc.

I'm sure you know this very well, but there is also a convenient list here if it is helpful: https://www.bunka.go.jp/kokugo_nihongo/sisaku/joho/joho/kijun/naikaku/gendaikana/index.html
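To make the request concrete, here is a minimal sketch of the context-free part only: the obsolete kana ゑ and ゐ (and their katakana forms) can be mapped character by character. The table and the function name `modernize_obsolete_kana` are hypothetical and are not part of jaconv. を is deliberately left out, because it is still in use as a particle and cannot be replaced blindly.

```python
# Hypothetical table: obsolete kana -> modern kana (single characters only).
OBSOLETE_KANA = str.maketrans({
    "ゑ": "え",
    "ゐ": "い",
    "ヱ": "エ",
    "ヰ": "イ",
})


def modernize_obsolete_kana(text: str) -> str:
    """Replace obsolete kana one character at a time; all other text is unchanged."""
    return text.translate(OBSOLETE_KANA)


print(modernize_obsolete_kana("こゑ"))  # こえ
```

Multi-character spellings such as けふ or くわ do not fit this character-level approach, since they depend on the word they appear in.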

@ikegami-yukino
Owner

@jet082
Implementing 歴史的仮名遣い conversion is difficult.
Japanese is an agglutinative language and is written without word delimiters.
For example, if parts of speech are not considered, "こどもをとこやにいかせた" is converted to "こどもおとこやにいかせた", because the particle を followed by とこや is mistaken for the word をとこ.
Therefore, supporting 歴史的仮名遣い requires an external module such as a POS tagger.
However, jaconv has a policy of implementing string conversions without using external modules.
If I were to implement 歴史的仮名遣い conversion, I would do it in a separate module rather than in jaconv.
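
The failure described above can be reproduced with a short sketch. The rule table, `naive_convert`, and `convert_tokens` are hypothetical names used only for illustration; none of them exist in jaconv, and the token list is segmented by hand to stand in for the output of an external tokenizer or POS tagger.

```python
# Hypothetical whole-word rules: historical spelling -> modern spelling.
WORD_RULES = {"をとこ": "おとこ", "けふ": "きょう"}


def naive_convert(text: str) -> str:
    """Replace every matching substring, ignoring word boundaries."""
    for old, new in WORD_RULES.items():
        text = text.replace(old, new)
    return text


def convert_tokens(tokens: list[str]) -> str:
    """Convert only tokens that match a rule exactly, then rejoin them.

    Assumes the tokens come from some external segmentation step.
    """
    return "".join(WORD_RULES.get(token, token) for token in tokens)


sentence = "こどもをとこやにいかせた"

# Substring matching rewrites the particle を plus the start of とこや.
print(naive_convert(sentence))  # こどもおとこやにいかせた

# With word boundaries known, the sentence is left unchanged.
tokens = ["こども", "を", "とこや", "に", "いかせた"]  # segmented by hand
print(convert_tokens(tokens))  # こどもをとこやにいかせた
```

The second function only works because the boundaries are supplied from outside, which is exactly the dependency that falls outside jaconv's policy.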
