Skip to content

Comments

[claude] Fix incorrect conversions in STPhrases.txt#1031

Merged
BYVoid merged 9 commits intomasterfrom
claude/setup-dev-branch-3Imwo
Jan 27, 2026
Merged

[claude] Fix incorrect conversions in STPhrases.txt#1031
BYVoid merged 9 commits intomasterfrom
claude/setup-dev-branch-3Imwo

Conversation

@BYVoid
Copy link
Owner

@BYVoid BYVoid commented Jan 24, 2026

修正了 24 處繁體字轉換錯誤,涵蓋 6 組容易混淆的繁體字對:

哄/鬨(8 處)- 哄騙 vs 喧鬧
當/噹(4 處)- 應當 vs 擬聲詞
鐘/鍾(1 處)- 時鐘 vs 情感
表/錶(5 處)- 表面 vs 鐘錶
占/佔(3 處)- 占卜 vs 佔據
致/緻(3 處)- 導致 vs 精緻

Fixed 4 incorrect simplified-to-traditional Chinese conversion pairs:

1. 一哄而上 → 一鬨而上 (was incorrectly staying as 一哄而上)
2. 一哄而起 → 一鬨而起 (was incorrectly staying as 一哄而起)
3. 一哄而集 → 一鬨而集 (was incorrectly staying as 一哄而集)
4. 丁零当啷 → 丁零噹啷 (was incorrectly using 當 instead of 噹)

These fixes ensure consistency with related entries:
- Line 68: 一哄 → 一鬨 (already correct)
- Line 70: 一哄而散 → 一鬨而散 (already correct)
- Line 290: 丁铃当啷 → 丁鈴噹啷 (already correct)

The character 鬨 is used for noisy crowds/commotion, and 噹 is the
onomatopoeia for metallic clanging sounds.
Fixed 4 entries that incorrectly used 哄 instead of 鬨 in commotion/uproar contexts:
- 乱哄哄 (line 2444)
- 哄动一时 (line 15004)
- 热闹哄哄 (line 34153)
- 闹哄哄 (line 46666)
Fixed 5 entries with incorrect traditional character usage:
- Line 46119: 钟意 (to like) now correctly uses 鍾意 instead of 鐘意
- Lines 1884, 17341, 19016: 钟表面 (watch face) now correctly uses 鐘錶面 instead of 鐘表面
- Line 46156: 钟表盘 (watch dial) now correctly uses 鐘錶盤 instead of 鐘表盤

鐘 is for clock/bell, 鍾 is for emotion/affection, 錶 is for watch/timepiece
Fixed 3 onomatopoeia entries that incorrectly used 當 instead of 噹:
- Line 278: 丁丁当当 now correctly uses 丁丁噹噹
- Line 286: 丁当 now correctly uses 丁噹
- Line 21965: 当当丁丁 now correctly uses 噹噹丁丁

當 is for "should/proper/serve as", 噹 is for metallic clanging sounds
Fixed 4 entries with incorrect traditional character usage:
- Line 13187: 吉丁当 now correctly uses 吉丁噹 (onomatopoeia)
- Line 15000: 哄传 now correctly uses 鬨傳 (spread tumultuously)
- Line 15001: 哄传一时 now correctly uses 鬨傳一時 (cause sensation)
- Line 15012: 哄抢 now correctly uses 鬨搶 (scramble to loot)

噹 is for metallic sounds, 鬨 is for rowdy/tumultuous actions
Fixed 8 entries with incorrect traditional character usage:
- Line 14996: 哄乱 now correctly uses 鬨亂 (cause chaos)
- Line 9940: 占主导地位 now correctly uses 佔主導地位 (occupy position)
- Line 9946: 占亲 now correctly uses 佔親 (take advantage of kinship)
- Line 9947: 占人 now correctly uses 佔人 (take advantage of people)
- Line 33661: 澄澹精致 now correctly uses 澄澹精緻 (exquisite)
- Line 8271: 别致有趣 now correctly uses 別緻有趣 (unique and interesting)
- Line 48148: 风流别致 now correctly uses 風流別緻 (elegant and distinctive)
- Line 34199: 煤气表 now correctly uses 煤氣錶 (gas meter)

鬨=uproar, 佔=occupy, 緻=fine/delicate, 錶=meter/gauge
@danny0838
Copy link
Contributor

danny0838 commented Jan 25, 2026

「丁噹」非常不自然,台灣通常會用「叮噹」,要嘛就是用「丁當」,「丁丁當當」等同理。建議維持「丁当=>丁當」,並考慮 TWPhrases 加上「丁當=>叮噹」(HKPhrases 可能也要,但最好問香港人的意見)。

占親」、「占人」是詞典有的詞,改成「佔親」、「佔人」不明所以,似乎是錯的。

「{小,中,大}型鐘/表面」改成「{小,中,大}型鐘錶面」似乎比較不合理,如果一定要改我覺得二個選項都輸出「{小,中,大}型鐘表面 {小,中,大}型鐘錶面」至少好點。

BYVoid and others added 3 commits January 27, 2026 20:47
Fixed 3 issues found by tests:
- Removed duplicate entry at line 9977: 占占卜 (causing "duplicated keys" error)
- Line 27040: 数据调制解调器 now correctly uses 數據調製解調器 (not 調制)
- Line 35244: 电缆调制解调器 now correctly uses 電纜調製解調器 (not 調制)

制 vs 製: 製 is used for "modulation" (調製), 制 is for "system/制度"
@BYVoid
Copy link
Owner Author

BYVoid commented Jan 27, 2026

「丁噹」非常不自然,台灣通常會用「叮噹」,要嘛就是用「丁當」,「丁丁當當」等同理。建議維持「丁当=>丁當」,並考慮 TWPhrases 加上「丁當=>叮噹」(HKPhrases 可能也要,但最好問香港人的意見)。

我認爲「丁噹」應該予以保留。

  1. 「噹」和「當」都轉爲簡化字的「当」,但是「丁」和「叮」簡化字都有。對應的「叮当」纔要轉換爲「叮噹」,如「叮当猫」->「叮噹貓」。
  2. 現在已有「丁铃当啷 丁鈴噹啷」這樣的轉換。
  3. 「丁噹」也並不會不自然,甚至恰好有叫「丁噹」的藝人。

@BYVoid BYVoid self-assigned this Jan 27, 2026
@BYVoid BYVoid changed the title Fix incorrect character conversions in STPhrases.txt Fix incorrect conversions in STPhrases.txt Jan 27, 2026
@BYVoid BYVoid changed the title Fix incorrect conversions in STPhrases.txt [claude] Fix incorrect conversions in STPhrases.txt Jan 27, 2026
@BYVoid BYVoid requested a review from Copilot January 27, 2026 08:30
Copy link

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR corrects 24 Traditional Chinese character conversion errors in the OpenCC dictionary file STPhrases.txt, addressing commonly confused character pairs in Traditional Chinese. The corrections ensure accurate Simplified-to-Traditional Chinese conversion for phrases involving characters that have distinct meanings in Traditional Chinese but are unified in Simplified Chinese.

Changes:

  • Fixed 8 instances of 哄/鬨 (coax vs. clamor), 4 instances of 當/噹 (should vs. clang), 1 instance of 鐘/鍾 (clock vs. affection), 5 instances of 表/錶 (surface vs. meter/watch), 3 instances of 占/佔 (divination vs. occupy), and 3 instances of 致/緻 (cause vs. refined)
  • Removed 3 ambiguous "型钟表面" entries and 1 redundant "占便宜的是呆" entry
  • Added 2 new entries for "占占卜" and "占占卦" to handle edge cases (though these appear linguistically questionable)

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

@BYVoid BYVoid merged commit eec2a14 into master Jan 27, 2026
37 checks passed
@danny0838
Copy link
Contributor

danny0838 commented Jan 27, 2026

「丁噹」非常不自然,台灣通常會用「叮噹」,要嘛就是用「丁當」,「丁丁當當」等同理。建議維持「丁当=>丁當」,並考慮 TWPhrases 加上「丁當=>叮噹」(HKPhrases 可能也要,但最好問香港人的意見)。

我認爲「丁噹」應該予以保留。

1. 「噹」和「當」都轉爲簡化字的「当」,但是「丁」和「叮」簡化字都有。對應的「叮当」纔要轉換爲「叮噹」,如「叮当猫」->「叮噹貓」。

看不懂這裡的邏輯,既然你說「叮当」「才」應該轉換為「叮噹」,意思不就是「丁当」應該轉換為「丁當」?

無論如何,重點是,根據詞典及一般語料,傳統上只有「丁當」和「叮噹」二種寫法,幾乎沒有「丁噹」的寫法,簡繁互轉當然應該以此為根據。

如果您不同意,可否提供可靠的辭典或語料佐證「丁噹」確實有被使用且比「丁當」更為普及?(當然,應該排除專名(如下述關於藝人「丁噹」),以及明顯是簡轉繁的語料(因為不能排除是出於軟體自動轉換錯誤))

2. 現在已有「丁铃当啷	丁鈴噹啷」這樣的轉換。

「丁鈴噹啷」確實比較常見,辭典也是這樣說,這並不表示可以推廣到「丁噹」。

3. 「丁噹」也並不會不自然,甚至恰好有叫「丁噹」的藝人。

人名、藝名可能標新立異,不能算數。

而且按照維基百科的敘述:

出道時,由於喜歡哆啦A夢(早期譯名:小叮噹或叮噹)[3],故聽從友人建議將藝名取為「叮噹」;後來因算命師說,叮有口字旁易惹事生非,再改為現在的「丁噹」。

若這段敘述屬實(按:並未列出參考來源,因此真實性不詳),以「丁噹」作為藝名並非該寫法比較自然或常見,而是其他考量,甚至是有意標新立異。

但考慮此專名的存在,簡轉繁只好多詞並舉,即「丁当→丁當 丁噹」(但其他詞不必,例如「丁丁当当→丁丁當當」足矣),並且考慮加上「歌手丁当→歌手丁噹」之類的詞(不過實務上很難全面,這是專名轉換的困難)。

@BYVoid BYVoid deleted the claude/setup-dev-branch-3Imwo branch January 27, 2026 22:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants