Commit
feat: EditorUtils uses the document's locale for splitting words (#1175)
* feat: EditorUtils
  - Update javadoc for getWordEnd and getWordStart utility functions
  - Remove "TODO: change to use document's locale"
  - Grab the word using the source and target locale of the project
* chore: add test case in English first steps pane
* chore: add task dependency for acceptance test
* test: add the case
  - Test EditorUtils.getWord* with a loaded project from Chinese to Japanese in an English environment
* refactor: reduce duplicated code
* fix: fix copyright header typo
* feat: use ICU4J for BreakIterator to support CJ
  - Add unit test for getBoundary method with English, Japanese and Chinese
* refactor: give locale by callers of EditorUtils
* docs: javadoc of EditorUtils
  - Update javadoc
  - @deprecated for methods with old signature
  - Simplify some lines
* chore: bump [email protected]
* chore: fix typo in dependency
* chore: fix degraded merge for dependencies

Signed-off-by: Hiroshi Miura <[email protected]>
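The locale-aware word lookup described in the commit message can be sketched roughly as follows. This is not the code from the commit: the class name `WordBoundarySketch` and the methods `wordStart` and `wordEnd` are hypothetical, and the sketch uses the JDK's `java.text.BreakIterator` as a stand-in so that it runs without extra dependencies. The commit itself switches to ICU4J, whose `com.ibm.icu.text.BreakIterator` offers the same `getWordInstance`, `setText`, `preceding` and `following` calls and adds dictionary-based segmentation for Chinese and Japanese.

```java
import java.text.BreakIterator;
import java.util.Locale;

// Hypothetical helper, not the EditorUtils implementation from the commit.
final class WordBoundarySketch {

    // Start offset of the word that contains (or begins at) the given offset.
    static int wordStart(String text, int offset, Locale locale) {
        BreakIterator it = BreakIterator.getWordInstance(locale);
        it.setText(text);
        int pos = Math.max(0, Math.min(offset, text.length()));
        // If pos already sits on a boundary, treat it as the word start.
        return it.isBoundary(pos) ? pos : it.preceding(pos);
    }

    // End offset (exclusive) of the word that contains the given offset.
    static int wordEnd(String text, int offset, Locale locale) {
        BreakIterator it = BreakIterator.getWordInstance(locale);
        it.setText(text);
        int pos = Math.max(0, offset);
        if (pos >= text.length()) {
            return text.length();
        }
        // First boundary strictly after pos.
        return it.following(pos);
    }

    public static void main(String[] args) {
        String text = "Hello world";
        int start = wordStart(text, 7, Locale.ENGLISH);
        int end = wordEnd(text, 7, Locale.ENGLISH);
        System.out.println(text.substring(start, end));
    }
}
```

Passing the locale in from the caller, as the "give locale by callers" item suggests, keeps the helper free of any dependency on the currently loaded project.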
Showing 20 changed files with 389 additions and 31 deletions.
@@ -0,0 +1,3 @@
project_stats.txt
project_stats.json
*.bak
Empty file.
@@ -0,0 +1,3 @@
# Glossary in tab-separated format -*- coding: utf-8 -*-
介绍	紹介
中的	中心的な
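To illustrate the tab-separated layout of the glossary fixture above, here is a minimal reader sketch. It is an assumption-laden illustration, not OmegaT's glossary reader: the class `GlossarySketch` and its `parse` method are hypothetical, and the sketch assumes only that lines starting with `#` are comments and that the first two tab-separated columns are the source and target terms.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical reader for a tab-separated glossary; not OmegaT's own parser.
final class GlossarySketch {

    // Maps each source term to its target term, preserving file order.
    static Map<String, String> parse(String content) {
        Map<String, String> entries = new LinkedHashMap<>();
        for (String line : content.split("\\R")) {
            // Skip blank lines and comment lines such as the header.
            if (line.isEmpty() || line.startsWith("#")) {
                continue;
            }
            String[] cols = line.split("\t", -1);
            // Assumed layout: source term, target term, optional further columns.
            if (cols.length >= 2) {
                entries.put(cols[0], cols[1]);
            }
        }
        return entries;
    }
}
```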
@@ -0,0 +1,33 @@
<?xml version='1.0' encoding='UTF-8'?>
<omegat>
    <project version="1.0">
        <source_dir>source</source_dir>
        <source_dir_excludes>
            <mask>**/.svn/**</mask>
            <mask>**/CVS/**</mask>
            <mask>**/.cvs/**</mask>
            <mask>**/.git/**</mask>
            <mask>**/.hg/**</mask>
            <mask>**/.repositories/**</mask>
            <mask>**/desktop.ini</mask>
            <mask>**/Thumbs.db</mask>
            <mask>**/.DS_Store</mask>
            <mask>**/~$*</mask>
        </source_dir_excludes>
        <target_dir>target</target_dir>
        <tm_dir>tm</tm_dir>
        <glossary_dir>glossary</glossary_dir>
        <glossary_file>.-glossary.txt</glossary_file>
        <dictionary_dir>dictionary</dictionary_dir>
        <export_tm_dir></export_tm_dir>
        <export_tm_levels></export_tm_levels>
        <source_lang>zh-CN</source_lang>
        <target_lang>ja-JP</target_lang>
        <source_tok>org.omegat.tokenizer.LuceneSmartChineseTokenizer</source_tok>
        <target_tok>org.omegat.tokenizer.LuceneJapaneseTokenizer</target_tok>
        <sentence_seg>true</sentence_seg>
        <support_default_translations>true</support_default_translations>
        <remove_tags>true</remove_tags>
        <external_command></external_command>
    </project>
</omegat>
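The `source_lang` and `target_lang` values in this project file are the codes from which a word-splitting locale could be derived. The sketch below shows one way to turn them into `java.util.Locale` objects using only JDK classes. It is an illustration under assumptions: `ProjectLocaleSketch` and `localeOf` are hypothetical names, and OmegaT reads its project file through its own classes rather than this way.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.Locale;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;

// Hypothetical helper; OmegaT loads project settings through its own code.
final class ProjectLocaleSketch {

    // Reads the text of the first element named tagName (for example
    // "source_lang") and converts the language code into a Locale.
    static Locale localeOf(String projectXml, String tagName) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new ByteArrayInputStream(
                            projectXml.getBytes(StandardCharsets.UTF_8)));
            String code = doc.getElementsByTagName(tagName)
                    .item(0).getTextContent().trim();
            return Locale.forLanguageTag(code);
        } catch (Exception e) {
            throw new IllegalStateException("Cannot read " + tagName, e);
        }
    }
}
```

For the fixture above, `localeOf(xml, "source_lang")` would yield a locale with language `zh` and country `CN`, which could then be handed to a word `BreakIterator`.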
Empty file.
4 changes: 4 additions & 0 deletions
test-acceptance/data/project_CN_JP/omegat/last_entry.properties
@@ -0,0 +1,4 @@
#Thu Nov 07 21:30:29 JST 2024
LAST_ENTRY_NUMBER=1
LAST_ENTRY_SRC=\u592A\u5E73\u5BFA\u4E2D\u7684\u6587\u7B14\u5854
LAST_ENTRY_FILE=source.txt
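The `LAST_ENTRY_SRC` value is stored with `\uXXXX` escapes, the standard encoding for non-ASCII text in Java properties files. The following sketch shows how `java.util.Properties` decodes it. The class `LastEntrySketch` and its `parse` method are hypothetical and only illustrate the file format; they are not how OmegaT reads this file.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.Properties;

// Hypothetical helper showing how the escaped value is decoded.
final class LastEntrySketch {

    // Loads properties from text; Unicode escapes are decoded on load.
    static Properties parse(String content) {
        Properties props = new Properties();
        try {
            props.load(new StringReader(content));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return props;
    }

    public static void main(String[] args) {
        // Doubled backslashes keep the escapes literal in the Java source,
        // so Properties.load performs the decoding, not the compiler.
        String content = "LAST_ENTRY_NUMBER=1\n"
                + "LAST_ENTRY_SRC=\\u592A\\u5E73\\u5BFA\\u4E2D\\u7684\\u6587\\u7B14\\u5854\n"
                + "LAST_ENTRY_FILE=source.txt\n";
        System.out.println(parse(content).getProperty("LAST_ENTRY_SRC"));
    }
}
```

Decoded, the value is 太平寺中的文笔塔, which matches the first line of the project's `source.txt` and the source segment in `project_save.tmx`.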
Empty file.
17 changes: 17 additions & 0 deletions
test-acceptance/data/project_CN_JP/omegat/project_save.tmx
@@ -0,0 +1,17 @@
<?xml version='1.0' encoding='UTF-8'?>
<!DOCTYPE tmx SYSTEM "tmx11.dtd">
<tmx version="1.1">
  <header creationtool="OmegaT" o-tmf="OmegaT TMX" adminlang="EN-US" datatype="plaintext" creationtoolversion="6.1.0_0_50ff299ad" segtype="sentence" srclang="zh-CN"/>
  <body>
<!-- Default translations -->
    <tu>
      <tuv lang="zh-CN">
        <seg>太平寺中的文笔塔</seg>
      </tuv>
      <tuv lang="ja-JP" changeid="Hiroshi Miura" changedate="20241107T122621Z" creationid="Hiroshi Miura" creationdate="20241107T122621Z">
        <seg>太平寺の中心的なペン塔</seg>
      </tuv>
    </tu>
<!-- Alternative translations -->
  </body>
</tmx>
@@ -0,0 +1,5 @@
太平寺中的文笔塔

文筆塔原是江苏省常州市太平寺中的塔。太平寺始建于南北朝齐梁时期,是常州最古老的佛寺之一,今已不存。
文笔塔为砖木结构,七级八面,每级4个拱门,中有旋梯。塔下有曲池、拱桥。
“夕照塔影”为文笔胜景。现存塔为光绪末年(1905-1908年)重建
Empty file.
Empty file.