Skip to content

Commit

Permalink
Release v2.0.0
Browse files Browse the repository at this point in the history
  • Loading branch information
danny0838 committed Apr 18, 2024
1 parent 690c2d5 commit c65e208
Show file tree
Hide file tree
Showing 21 changed files with 1,269 additions and 0 deletions.
14 changes: 14 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
簡繁祕書-簡繁轉換暨校對程式
----------------------------

* [線上版](https://danny0838.github.io/sts-lib/)
* [簡繁祕書專案主頁](https://github.com/danny0838/sts-lib)


隨著技術的進步,簡繁轉換軟體多如繁星,似乎毋庸介紹亦無須研發。然而簡繁轉換正確率之瓶頸一直無法突破,簡體轉繁體尤為明顯,從「字」的層次上說,「幹」擾、頭「發」、乙「醜」等怪詞層出不窮;從「詞」的層次上說,「小明帶了一台計算機去考試」,天曉得他實際上帶的是「电脑」還是「计算机」?更有不可能判斷的句子如對照表「簡繁轉換常見錯字:干[干幹乾榦]、叶[葉叶]」……繁轉簡的問題少得多,但偶爾還是會發生,比如「乾隆」不是「干隆」,「原著」不是「原着」,「癥結」也不是「症结」。

顯然,在真正的人工智慧出來以前,簡繁轉換永遠不完美,只要對「正確」有所要求,事後人工校對絕對不可避免。因此,本程式的想法也很簡單──用機器輔助人工校對。亦即先用程式轉換,但把「所有轉換過可能出錯」(一對多轉換及二個字以上的詞)的地方上色,並允許使用者逐一校對、選字、或自動套用到相同的字,如此便可省下很多校對時間,至少可以保證檢查過一遍後,不再有任何簡繁轉換上的錯誤。

本線上轉換工具支援文字轉換及檔案轉換。前者只要在輸入區填入文字,就會自動轉換並且可以互動式校訂。後者可以用按鈕或拖放選擇一或多個檔案,就會逐一轉換後自動下載。預設檔案輸入輸出編碼皆是UTF-8,如要輸入其他編碼的檔案,可在進階選項設定。

目前內建 [OpenCC](https://github.com/BYVoid/OpenCC) 的轉換方案,並且修正了 OpenCC 演算法缺陷導致一些地區詞無法正常轉換的問題(詳見[相關問題回報](https://github.com/BYVoid/OpenCC/issues/475))。未來有機會再擴充,如中文維基百科的轉換詞庫。
1 change: 1 addition & 0 deletions dicts/opencc/hk2s.tlist

Large diffs are not rendered by default.

1 change: 1 addition & 0 deletions dicts/opencc/hk2t.tlist
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
{"一":{"口":{"吃":{"個":{"":["一口喫個"]},"成":{"":["一口喫成"]}}},"家":{"三":{"口":{"":["一家三口"]}},"五":{"口":{"":["一家五口"]}},"六":{"口":{"":["一家六口"]}},"四":{"口":{"":["一家四口"]}}}},"七":{"星":{"巖":{"":["七星巖"]}}},"世":{"胄":{"":["世胄"]}},"介":{"胄":{"":["介冑"]}},"傅":{"巖":{"":["傅巖"]}},"免":{"胄":{"":["免冑"]}},"冠":{"胄":{"":["冠冑"]}},"千":{"巖":{"競":{"秀":{"":["千巖競秀"]}},"萬":{"壑":{"":["千巖萬壑"]},"谷":{"":["千巖萬谷"]}}}},"口":{"吃":{"":["口吃"]}},"台":{"山":{"":["台山"]},"州":{"":["台州"],"地":{"區":{"":["台州地區"]}},"市":{"":["台州市"]}},"":["臺"]},"吃":{"口":{"":["喫口","吃口"],"令":{"":["吃口令"]},"飯":{"":["喫口飯"]}},"吃":{"":["喫喫","吃吃"]},"子":{"":["喫子","吃子"]},"":["喫"]},"名":{"胄":{"":["名胄"]}},"國":{"胄":{"":["國胄"]}},"圍":{"巖":{"":["圍巖"]}},"地":{"胄":{"":["地胄"]}},"壓":{"胄":{"子":{"":["壓冑子"]}}},"士":{"胄":{"":["士胄"]}},"大":{"巖":{"桐":{"":["大巖桐"]}}},"天":{"台":{"女":{"":["天台女"]},"宗":{"":["天台宗"]},"山":{"":["天台山"]},"縣":{"":["天台縣"]}},"潢":{"貴":{"胄":{"":["天潢貴胄"]}}}},"奇":{"巖":{"":["奇巖"]}},"寶":{"胄":{"":["寶胄"]}},"小":{"巖":{"洞":{"":["小巖洞"]}}},"岫":{"巖":{"縣":{"":["岫巖縣"]}}},"峯":{"巖":{"":["峯巖"]}},"嵌":{"巖":{"":["嵌巖"]}},"巉":{"巖":{"":["巉巖"]}},"巖":{"壁":{"":["巖壁"]},"居":{"":["巖居"],"穴":{"處":{"":["巖居穴處"]}},"谷":{"飲":{"":["巖居谷飲"]}}},"岸":{"":["巖岸"]},"巉":{"":["巖巉"]},"巖":{"":["巖巖"]},"徼":{"":["巖徼"]},"手":{"縣":{"":["巖手縣"]}},"村":{"":["巖村"]},"洞":{"":["巖洞"]},"流":{"圈":{"":["巖流圈"]}},"牆":{"":["巖牆"],"之":{"下":{"":["巖牆之下"]}}},"畫":{"":["巖畫"]},"穴":{"":["巖穴"],"之":{"士":{"":["巖穴之士"]}}},"薔":{"薇":{"":["巖薔薇"]}},"邑":{"":["巖邑"]},"郎":{"":["巖郎"]},"阻":{"":["巖阻"]},"陛":{"":["巖陛"]}},"帝":{"胄":{"":["帝胄"]}},"幽":{"巖":{"":["幽巖"]},"棲":{"巖":{"谷":{"":["幽棲巖谷"]}}}},"張":{"口":{"":["張口"]}},"懸":{"巖":{"":["懸巖"],"峭":{"壁":{"":["懸巖峭壁"]}}},"胄":{"":["懸冑"]}},"攀":{"巖":{"":["攀巖"]}},"支":{"胄":{"":["支胄"]}},"教":{"胄":{"":["教胄"]}},"景":{"胄":{"":["景胄"]}},"望":{"胄":{"":["望胄"]}},"末":{"胄":{"":["末胄"]}},"村":{"胄":{"":["村胄"]}},"枕":{"巖":{"漱":{"流":{"":["枕巖漱流"]}}}},"枝":{"胄":{"":["枝胄"]}},"氏":{"胄":{"":["氏胄"]}},"洪":{"胄":{"":["洪胄"]}},"浙":{"江":{"天":{"台":{"縣":{"":["浙江天台縣"]}}}}},"清":{"胄":{"":["清胄"]}},"灰":{"巖":{"殘":{"丘":{"":["灰巖殘丘"]}}}},"玄":{"胄":{"":["玄胄"]}},"甲":{"胄":{"":["甲冑"],"魚":{"類":{"":["甲冑魚類"]}}}},"皇":{"胄":{"":["皇胄"]}},"石":{"灰":{"巖":{"洞":{"":["石灰巖洞"]}}}},"神":{"胄":{"":["神胄"]}},"簪":{"纓":{"世":{"胄":{"":["簪纓世胄"]}}}},"系":{"胄":{"":["系胄"]}},"紅":{"巖":{"":["紅巖"]}},"絕":{"巖":{"":["絕巖"]}},"緒":{"胄":{"":["緒胄"]}},"纂":{"胄":{"":["纂胄"]}},"胃":{"口":{"":["胃口"]}},"胄":{"嗣":{"":["胄嗣"]},"子":{"":["胄子"]},"序":{"":["胄序"]},"族":{"":["胄族"]},"甲":{"":["冑甲"]},"監":{"":["胄監"]},"科":{"":["冑科"]},"緒":{"":["胄緒"]},"胤":{"":["胄胤"]},"裔":{"":["胄裔"],"繁":{"衍":{"":["胄裔繁衍"]}}},"閥":{"":["胄閥"]}},"胡":{"雪":{"巖":{"":["胡雪巖"]}}},"胤":{"胄":{"":["胤胄"]}},"苗":{"胄":{"":["苗胄"]}},"英":{"胄":{"":["英胄"]}},"華":{"胄":{"":["華胄"]}},"血":{"胄":{"":["血胄"]}},"裔":{"胄":{"":["裔胄"]}},"訓":{"胄":{"":["訓胄"]}},"試":{"胄":{"":["試胄"]}},"豪":{"門":{"貴":{"胄":{"":["豪門貴胄"]}}}},"貝":{"胄":{"":["貝冑"]}},"貴":{"胄":{"":["貴胄"]}},"賢":{"胄":{"":["賢胄"]}},"蹇":{"吃":{"":["蹇吃"]}},"躬":{"擐":{"甲":{"胄":{"":["躬擐甲冑"]}}}},"遐":{"胄":{"":["遐胄"]}},"遙":{"胄":{"":["遙胄"]},"遙":{"華":{"胄":{"":["遙遙華胄"]}}}},"遠":{"胄":{"":["遠胄"]}},"遺":{"胄":{"":["遺胄"]}},"鄧":{"艾":{"吃":{"":["鄧艾吃"]}}},"重":{"巖":{"疊":{"嶂":{"":["重巖疊嶂"]}}}},"金":{"胄":{"":["金胄"]}},"鎧":{"胄":{"":["鎧冑"]}},"鑿":{"巖":{"":["鑿巖"]}},"門":{"胄":{"":["門胄"]}},"開":{"口":{"":["開口"]}},"雲":{"巖":{"區":{"":["雲巖區"]}}},"非":{"層":{"巖":{"":["非層巖"]}}},"韓":{"侂":{"胄":{"":["韓侂冑"]}}},"飮":{"胄":{"":["飮冑"]}},"骨":{"巖":{"巖":{"":["骨巖巖"]}}},"高":{"胄":{"":["高胄"]}},"魚":{"胄":{"":["魚冑"]}},"鮮":{"胄":{"":["鮮胄"]}},"鴻":{"胄":{"":["鴻胄"]}},"黃":{"巖":{"區":{"":["黃巖區"]},"島":{"":["黃巖島"]}},"炎":{"貴":{"胄":{"":["黃炎貴胄"]}}}},"齒":{"胄":{"":["齒胄"]}},"龍":{"巖":{"":["龍巖"],"市":{"":["龍巖市"]},"村":{"":["龍巖村"]}},"胄":{"":["龍胄"]}},"偽":{"":["僞"]},"兑":{"":["兌"]},"叁":{"":["叄"]},"只":{"":["只"]},"衹":{"":["只"]},"啓":{"":["啓"]},"啟":{"":["啓"]},"囱":{"":["囪"]},"妝":{"":["妝"]},"粧":{"":["妝"]},"媪":{"":["媼"]},"媯":{"":["嬀"]},"悦":{"":["悅"]},"愠":{"":["慍"]},"户":{"":["戶"]},"捝":{"":["挩"]},"揾":{"":["搵"]},"抬":{"":["擡"]},"敚":{"":["敓"]},"敍":{"":["敘"]},"敘":{"":["敘"]},"枴":{"":["柺"]},"棁":{"":["梲"]},"稜":{"":["棱"]},"棱":{"":["棱"]},"榅":{"":["榲"]},"枱":{"":["檯"]},"氲":{"":["氳"]},"涚":{"":["涗"]},"温":{"":["溫"]},"濕":{"":["溼"]},"溈":{"":["潙"]},"潀":{"":["潨"]},"煴":{"":["熅"]},"為":{"":["爲"]},"痴":{"":["癡"]},"皂":{"":["皁"]},"秘":{"":["祕"]},"税":{"":["稅"]},"灶":{"":["竈"]},"粽":{"":["糉"]},"糉":{"":["糉"]},"糭":{"":["糉"]},"緼":{"":["縕"]},"才":{"":["纔"]},"唇":{"":["脣"]},"脱":{"":["脫"]},"腽":{"":["膃"]},"卧":{"":["臥"]},"煙":{"":["菸"]},"蒀":{"":["蒕"]},"葱":{"":["蔥"]},"蒍":{"":["蔿"]},"藴":{"":["蘊"]},"蜕":{"":["蛻"]},"眾":{"":["衆"]},"衞":{"":["衛"]},"核":{"":["覈"]},"説":{"":["說"]},"踴":{"":["踊"]},"輼":{"":["轀"]},"醖":{"":["醞"]},"缽":{"":["鉢"]},"鈎":{"":["鉤"]},"鋭":{"":["銳"]},"針":{"":["鍼"]},"閲":{"":["閱"]},"鰛":{"":["鰮"]}}
Loading

0 comments on commit c65e208

Please sign in to comment.