[claude] Fix incorrect conversions in STPhrases.txt#1031
Conversation
Fixed 4 incorrect simplified-to-traditional Chinese conversion pairs: 1. 一哄而上 → 一鬨而上 (was incorrectly staying as 一哄而上) 2. 一哄而起 → 一鬨而起 (was incorrectly staying as 一哄而起) 3. 一哄而集 → 一鬨而集 (was incorrectly staying as 一哄而集) 4. 丁零当啷 → 丁零噹啷 (was incorrectly using 當 instead of 噹) These fixes ensure consistency with related entries: - Line 68: 一哄 → 一鬨 (already correct) - Line 70: 一哄而散 → 一鬨而散 (already correct) - Line 290: 丁铃当啷 → 丁鈴噹啷 (already correct) The character 鬨 is used for noisy crowds/commotion, and 噹 is the onomatopoeia for metallic clanging sounds.
Fixed 4 entries that incorrectly used 哄 instead of 鬨 in commotion/uproar contexts: - 乱哄哄 (line 2444) - 哄动一时 (line 15004) - 热闹哄哄 (line 34153) - 闹哄哄 (line 46666)
Fixed 5 entries with incorrect traditional character usage: - Line 46119: 钟意 (to like) now correctly uses 鍾意 instead of 鐘意 - Lines 1884, 17341, 19016: 钟表面 (watch face) now correctly uses 鐘錶面 instead of 鐘表面 - Line 46156: 钟表盘 (watch dial) now correctly uses 鐘錶盤 instead of 鐘表盤 鐘 is for clock/bell, 鍾 is for emotion/affection, 錶 is for watch/timepiece
Fixed 3 onomatopoeia entries that incorrectly used 當 instead of 噹: - Line 278: 丁丁当当 now correctly uses 丁丁噹噹 - Line 286: 丁当 now correctly uses 丁噹 - Line 21965: 当当丁丁 now correctly uses 噹噹丁丁 當 is for "should/proper/serve as", 噹 is for metallic clanging sounds
Fixed 4 entries with incorrect traditional character usage: - Line 13187: 吉丁当 now correctly uses 吉丁噹 (onomatopoeia) - Line 15000: 哄传 now correctly uses 鬨傳 (spread tumultuously) - Line 15001: 哄传一时 now correctly uses 鬨傳一時 (cause sensation) - Line 15012: 哄抢 now correctly uses 鬨搶 (scramble to loot) 噹 is for metallic sounds, 鬨 is for rowdy/tumultuous actions
Fixed 8 entries with incorrect traditional character usage: - Line 14996: 哄乱 now correctly uses 鬨亂 (cause chaos) - Line 9940: 占主导地位 now correctly uses 佔主導地位 (occupy position) - Line 9946: 占亲 now correctly uses 佔親 (take advantage of kinship) - Line 9947: 占人 now correctly uses 佔人 (take advantage of people) - Line 33661: 澄澹精致 now correctly uses 澄澹精緻 (exquisite) - Line 8271: 别致有趣 now correctly uses 別緻有趣 (unique and interesting) - Line 48148: 风流别致 now correctly uses 風流別緻 (elegant and distinctive) - Line 34199: 煤气表 now correctly uses 煤氣錶 (gas meter) 鬨=uproar, 佔=occupy, 緻=fine/delicate, 錶=meter/gauge
Fixed 3 issues found by tests: - Removed duplicate entry at line 9977: 占占卜 (causing "duplicated keys" error) - Line 27040: 数据调制解调器 now correctly uses 數據調製解調器 (not 調制) - Line 35244: 电缆调制解调器 now correctly uses 電纜調製解調器 (not 調制) 制 vs 製: 製 is used for "modulation" (調製), 制 is for "system/制度"
There was a problem hiding this comment.
Pull request overview
This PR corrects 24 Traditional Chinese character conversion errors in the OpenCC dictionary file STPhrases.txt, addressing commonly confused character pairs in Traditional Chinese. The corrections ensure accurate Simplified-to-Traditional Chinese conversion for phrases involving characters that have distinct meanings in Traditional Chinese but are unified in Simplified Chinese.
Changes:
- Fixed 8 instances of 哄/鬨 (coax vs. clamor), 4 instances of 當/噹 (should vs. clang), 1 instance of 鐘/鍾 (clock vs. affection), 5 instances of 表/錶 (surface vs. meter/watch), 3 instances of 占/佔 (divination vs. occupy), and 3 instances of 致/緻 (cause vs. refined)
- Removed 3 ambiguous "型钟表面" entries and 1 redundant "占便宜的是呆" entry
- Added 2 new entries for "占占卜" and "占占卦" to handle edge cases (though these appear linguistically questionable)
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
看不懂這裡的邏輯,既然你說「叮当」「才」應該轉換為「叮噹」,意思不就是「丁当」應該轉換為「丁當」? 無論如何,重點是,根據詞典及一般語料,傳統上只有「丁當」和「叮噹」二種寫法,幾乎沒有「丁噹」的寫法,簡繁互轉當然應該以此為根據。 如果您不同意,可否提供可靠的辭典或語料佐證「丁噹」確實有被使用且比「丁當」更為普及?(當然,應該排除專名(如下述關於藝人「丁噹」),以及明顯是簡轉繁的語料(因為不能排除是出於軟體自動轉換錯誤))
「丁鈴噹啷」確實比較常見,辭典也是這樣說,這並不表示可以推廣到「丁噹」。
人名、藝名可能標新立異,不能算數。 而且按照維基百科的敘述:
若這段敘述屬實(按:並未列出參考來源,因此真實性不詳),以「丁噹」作為藝名並非該寫法比較自然或常見,而是其他考量,甚至是有意標新立異。 但考慮此專名的存在,簡轉繁只好多詞並舉,即「丁当→丁當 丁噹」(但其他詞不必,例如「丁丁当当→丁丁當當」足矣),並且考慮加上「歌手丁当→歌手丁噹」之類的詞(不過實務上很難全面,這是專名轉換的困難)。 |
修正了 24 處繁體字轉換錯誤,涵蓋 6 組容易混淆的繁體字對:
哄/鬨(8 處)- 哄騙 vs 喧鬧
當/噹(4 處)- 應當 vs 擬聲詞
鐘/鍾(1 處)- 時鐘 vs 情感
表/錶(5 處)- 表面 vs 鐘錶
占/佔(3 處)- 占卜 vs 佔據
致/緻(3 處)- 導致 vs 精緻