TW518479B - Phonetic spelling conversion system and method capable of handling multi-tone words - Google Patents

Phonetic spelling conversion system and method capable of handling multi-tone words Download PDF

Info

Publication number
TW518479B
TW518479B TW090118427A TW90118427A TW518479B TW 518479 B TW518479 B TW 518479B TW 090118427 A TW090118427 A TW 090118427A TW 90118427 A TW90118427 A TW 90118427A TW 518479 B TW518479 B TW 518479B
Authority
TW
Taiwan
Prior art keywords
conversion
character
pinyin
chinese
database
Prior art date
Application number
TW090118427A
Other languages
Chinese (zh)
Inventor
Kuang-Shin Lin
Qiang Xia
Original Assignee
Inventec Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inventec Corp filed Critical Inventec Corp
Priority to TW090118427A priority Critical patent/TW518479B/en
Application granted granted Critical
Publication of TW518479B publication Critical patent/TW518479B/en

Links

Abstract

The present invention provides a phonetic spelling conversion system and method capable of handling multi-tone words, which is used to handle the input and conversion of Chinese phonetic and phonetic notation between simplified Chinese words and traditional Chinese words. The mapping conversion table of ""Chinese phonetic and phonetic notation"" provided by the present invention uses the flags to determine the inner mapping conversion database mode, which only needs a set of mapping Chinese database for realizing multiple conversion functions of ""Chinese word/Chinese phonetic"", ""Chinese word/phonetic notation"", and ""Chinese word/phonetic notation"". The present invention is further characterized in that the mapping conversion table of ""Chinese phonetic and phonetic notation"" can provide a method for handling multiple pronunciations for multi-tone words in the conversion process.

Description

518479 五、發明說明(1) 【發明領域】 本發明係為一種可處理多音字的拼音轉換系統及方 法,特別是指一種應用於電腦可執行之操作系統平台上, 用以進行簡體中文漢字、繁體中文漢字之間漢語拼音和注 音拼音的輸入與轉換。 【發明背景】 隨著企業與企業間的往來互動越來越頻繁,彼此間的 文件資訊的交流溝通也成了日常生活中重要而且不可或缺 的一項重要工作,在網際網路的如此快速發達的今天,造 成大部分的資訊往來交流所採用的方式都是透過網路來傳 遞電子文件貢訊的模式來達成的’而這樣的方式也顯得最 方便迅速而最有經濟效益。 然而由於現行世界各地所使用的文字語言在電腦上具 有許多不同的編碼方式,所以往往在資訊交流的過程當中 會造成接收資訊的雙方在内容的解讀及認知上一項潛在的 困擾因素,例如:繁體漢字與簡體漢字,就有根本上編碼 方式的差別存在,也因此對於在所使用的操作系統平台上 載入所謂的繁體簡體漢字互換功能的軟體需求也就伴隨著 企業間日益頻繁的互動往來而變得越來越殷切。 由於繁體、簡體文字所採行的拼音方式有所不同,所 以在拼音編碼的方式上也產生出繁體字採「注音拼音」的 編碼方式’而簡體字採「漢語拼音」編碼方式的基本差異 存在,導致在進行繁體簡體漢字互換的過程中,我們通常 必須要將原始文字的發音先做必要的轉換,以便擷取得到518479 V. Description of the Invention (1) [Field of the Invention] The present invention relates to a system and method for pinyin conversion capable of processing multi-syllabic characters, particularly to a computer-executable operating system platform for performing simplified Chinese characters, Input and conversion of Chinese pinyin and Zhuyin pinyin between traditional Chinese characters. [Background of the Invention] As businesses interact with each other more and more frequently, the exchange of documents and information between them has also become an important and indispensable task in daily life, so fast on the Internet Today, as a result of the development, most of the methods of information exchange are achieved through the mode of transmitting electronic documents via the Internet, and this method seems to be the most convenient, fast and most economical. However, due to the many different encoding methods used on the computer for the current languages used around the world, the process of information exchange often causes a potential confounding factor in the interpretation and cognition of the content receiving parties, such as: There is a fundamental difference in encoding methods between traditional Chinese characters and simplified Chinese characters. Therefore, software requirements for loading the so-called traditional and simplified Chinese character interchange function on the operating system platform used are accompanied by increasingly frequent interactions between enterprises. It became more and more eager. Since the traditional and simplified Chinese characters have different pinyin coding methods, the traditional Chinese characters use the "Zhuyin Pinyin" coding method, while the simplified characters use the "Chinese Pinyin" coding method. As a result, in the process of swapping traditional and simplified Chinese characters, we usually have to convert the pronunciation of the original text first, in order to retrieve it.

第4頁 518479 五、發明說明(2) 欲轉換字詞的正確發音,最後產生出正確的繁體或簡體字 詞。 就過去習知的技術而言,幾種類似的繁體簡體漢字互 換功能的軟體在處理繁體轉簡體或是簡體轉繁體中「字轉 音」的部分,幾乎都是透過在内部分別建立兩套儲存有各 自發音符號和其對應之簡體字和繁體字的對應資料庫,然 後在處理過程中將原始文字的發音符號拿來跟對應資料庫 中所記載的轉換發音資料逐一比對,以找到所需要轉換的 正確發音符號。這樣的方式必然會產生下列幾個問題:首 先,為了提高比對結果的成功率,必須花費大量的人力以 及時間去從事資料庫的維護工作,造成人力、金錢、時間 成本上不必要的浪費;再者,使用人力維護資料庫不但容 易因為人員的疏失而造成資料庫内容上產生謬誤,也極難 保證兩者資料庫内容的一致性;此外,與日倶增的龐大資 料庫内容也將造成硬體空間上沉重的負擔,當有大量文件 資訊需要進行識別轉換時更直接會造成執行效率上的不 彰。 而從另一項習知的技術中發現,這些具有繁體簡體漢 字互換功能的軟體在處理繁體轉簡體或是簡體轉繁體中 「字轉音」的部分,並不能夠為多音字的轉換提供一種有 效的解決方法,也就是在現有方法中只能依照資料庫所儲 存的現有資料去提供有限的發音内容,往往是該多音字的 某一種發音,而不能處理該多音字的其他發音,造成在轉 換過程當中可能因為資料庫内容的不夠完整而影響到比對Page 4 518479 V. Description of the invention (2) The correct pronunciation of the word to be converted, and finally the correct traditional or simplified word is produced. As far as the conventional technology is concerned, several similar softwares for traditional Chinese-Simplified Chinese character interchange function deal with the part of traditional Chinese-to-Simplified Chinese or traditional Chinese-to-Chinese traditional Chinese character transliteration, almost by internally creating two sets of storage. There is a corresponding database of each pronunciation symbol and its corresponding simplified and traditional characters, and then in the processing process, the pronunciation symbols of the original text are compared with the converted pronunciation data recorded in the corresponding database one by one to find the required conversion Correct pronunciation symbol. This method will inevitably produce the following problems: First, in order to improve the success rate of comparison results, it must spend a lot of manpower and time to engage in the maintenance of the database, causing unnecessary waste of manpower, money and time costs; In addition, the use of human resources to maintain the database is not only easy to cause errors in the contents of the database due to the negligence of personnel, but also extremely difficult to ensure the consistency of the contents of the two databases; in addition, the increasing content of the huge database with the increase of daily will also cause The heavy burden on the hardware space, when there is a large amount of document information that needs to be identified and converted, it will directly cause the implementation efficiency to be inconsistent. And from another known technology, it is found that these softwares with the function of interchange of traditional and simplified Chinese characters are not able to provide a kind of conversion for polyphonic characters when processing traditional Chinese or simplified Chinese traditional characters An effective solution, that is, in the existing methods, only limited pronunciation content can be provided according to the existing data stored in the database. Often, it is one pronunciation of the polysyllabic character, but it cannot handle other pronunciations of the polysyllabic character. The conversion process may affect the comparison due to the incompleteness of the database content

第5頁 518479 五、發明說明(3) 的結果,進而造成對資訊的解讀及認知上產生語意上的錯 誤,甚至嚴重地還會影響到工作進度上的問題。 因此,在對於繁體簡體漢字互換功能的軟體需求逐漸擴增 時,我們正需要一種能夠同時提供簡體中文漢字、繁體中 文漢字的輸入和轉換並且可以提供多音字處理的系統及方 法,來解決前述技術上不足之問題。 【發明之概述及目的】 有鑑於此,本發明為了解決上述習知技術所產生之問 題,提出一種可處理多音字的拼音轉換系統及方法: 首先,由本發明所提供”漢語拼音/注音拼音π對應關係轉 換表,使我們可以在系統中只需要透過一套對應之漢字資 料庫,即可實現在一個操作系統平台中,進行簡體中文漢 字、繁體中文漢字的輸入和轉換,進而達到漢語拼音/注 音拼音的轉換功能。這樣的方式不但可以節省硬體所需要 的空間,同時也降低人工輸入資料庫的成本,減少發生不 必要的錯誤,提高了執行上的速度。 此外,由於在本發明所提之”漢語拼音/注音拼音”對 應關係轉換表中,具有可處理多音字的特殊對應關係存 在,可以使得本系統及方法可以在不使用轉換演算法則 (Algorithm)、不使用詞庫的情況下,即能夠處理漢語拼 音以及注音拼音轉換過程中的多音字的多種發音,解決了 過去傳統簡體中文漢字、繁體中文漢字間漢語拼音和注音 拼音輸入與轉換的過程中,無法有效針對多音字的部分進 行處理的問題。Page 5 518479 V. The result of the description of the invention (3), which leads to semantic errors in the interpretation and cognition of information, and even seriously affects the problem of work progress. Therefore, when the software requirements for the interchange function of traditional and simplified Chinese characters are gradually expanding, we are in need of a system and method that can simultaneously provide input and conversion of simplified Chinese characters and traditional Chinese characters and can provide multi-sound word processing to solve the aforementioned technologies. On the problem. [Summary and Purpose of the Invention] In view of this, in order to solve the problems caused by the conventional technology described above, the present invention proposes a system and method for pinyin conversion that can handle multi-syllabic characters: First, the "Chinese Pinyin / Zhuyin Pinyin π" provided by the present invention Correspondence conversion table, so that we can use only a set of corresponding Chinese character database in the system to achieve the input and conversion of simplified Chinese characters, traditional Chinese characters in an operating system platform, and then achieve Chinese pinyin / Zhuyin Pinyin conversion function. This method can not only save the space required by the hardware, but also reduce the cost of manual input of the database, reduce unnecessary errors, and increase the speed of execution. In addition, due to the It is mentioned that in the "Chinese Pinyin / Zhuyin Pinyin" correspondence conversion table, there is a special correspondence relationship that can handle multi-syllabic characters. This system and method can be used without the use of conversion algorithms and thesaurus. , That is, it can handle the conversion of Chinese Pinyin and Zhuyin Pinyin Phonetic pronunciation of many words, to solve the traditional past Simplified Chinese characters, Pinyin and phonetic Pinyin input process and the transition between traditional Chinese characters, it can not effectively carry out part of the problem for the processing of multi-tone words.

第6頁 518479 五、發明說明(4) 有關本發明之詳細内容及技術,茲就配合圖式說明如 下: 【發明之詳細說明】 本發明係提出一種可處理多音字的拼音轉換系統及方 法,主要係應用於電腦可執行之操作系統平台上,用以提 供簡體中文漢字、繁體中文漢字之間漢語拼音和注音拼音 的輸入與轉換之系統及方法,同時還可根據對應關係轉換 資料庫中特定内容實現對多音字的處理。 本發明以一較佳實施例說明本方法之具體可行性, 「第1圖」係本發明所提之可處理多音字的拼音轉換系統 及方法之功能模組圖,其主要的系統功能模組包含(1 )資 料庫載入模組1 0 1 ;( 2 )字元對應關係產生模組1 0 2 ;( 3 )字元 轉換產生模組103; (4)多音字轉換模組104;及(5)輸出模組 105 : (1 )資料庫載入模組1 0 1 :用以根據操作系統中所使 用之漢字資料庫來決定載入系統之漢字資料庫類型。 (2 )字元對應關係產生模組1 0 2 :用以根據資料庫載 入模組1 0 1所提供之載入漢字資料庫類型來決定内部的字 元對應關係轉換表類型,產生出對應關係轉換資料庫。 (3 )字元轉換產生模組1 0 3 :用以接受使用者由輸入 設備所輸入之字率來進行轉換動作,根據字元對應關係產 生模組1 0 2所提供之對應關係轉換資料庫,產生轉換後的Page 6 518479 V. Description of the invention (4) The detailed content and technology of the present invention will be described in conjunction with the drawings as follows: [Detailed description of the invention] The present invention proposes a system and method for pinyin conversion that can handle multi-syllabic characters. It is mainly used on computer-executable operating system platforms to provide systems and methods for inputting and converting Chinese pinyin and Zhuyin pinyin between simplified Chinese characters and traditional Chinese characters, and can also convert specific data in the database according to the corresponding relationship. The content realizes the processing of polyphonic characters. The present invention uses a preferred embodiment to illustrate the specific feasibility of the method. "Figure 1" is a functional module diagram of the pinyin conversion system and method capable of processing multi-syllabic characters according to the present invention, and its main system function modules Contains (1) database loading module 1 0 1; (2) character correspondence generation module 1 0 2; (3) character conversion generation module 103; (4) polyphonic character conversion module 104; and (5) Output module 105: (1) Database loading module 1 01: It is used to determine the type of Chinese character database to be loaded into the system according to the Chinese character database used in the operating system. (2) Character correspondence generation module 1 02: used to determine the type of the internal character correspondence conversion table according to the type of the loaded Chinese character database provided by the database loading module 1 01, and generate correspondence Relationship Transformation Database. (3) Character conversion generating module 103: It is used to accept the word rate input by the user to perform the conversion action, and generates the correspondence conversion database provided by module 1 02 according to the character correspondence. To generate the converted

第7頁 518479 五、發明說明(5) 組1 0 3所傳來的多音字處理需求,將多音字的部分進行對 應的轉換處理。 (5 )輸出模組1 0 5 ·•用以接受由字元轉換產生模組1 0 3 以及多音字轉換模組1 0 4所傳來的轉換後字元組,並將此 字元組輸出至輸出設備中。 請參閱「第2圖」,該圖係本發明所提之可處理多音 字的拼音轉換系統及方法之整體流程圖,說明本系統主要 的方法如下: 首先,我們必須在所使用的系統操作平台上面載入一 套漢字資料庫(步驟2 0 1 ),這套漢字資料庫的類型可以是π 漢字/漢語拼音π的對應資料庫也可以是一套π漢字/注音拼 音π的對應資料庫。接著,本系統會根據我們在系統操作 平台中所載入之漢字資料庫的類型,由本系統所提供之字 元對應關係產生模組1 0 2,來決定本系統内部所需要建立 的字元對應關係轉換表類型(步驟2 0 2 ),而此一字元對應 關係轉換表的類型可以是:"漢語拼音對注音拼音”或者是 π注音拼音對漢語拼音π的内部字元對應關係,然後再依據 字元對應關係產生模組1 0 2所決定之内部字元對應關係, 來產生出對應關係轉換資料庫(步驟2 0 3 ),也就是π漢語拼 音對注音拼音”對應關係轉換資料庫或者是”注音拼音對漢 語拼音π對應關係轉換資料庫。當使用者由輸入設備輸入 一欲轉換之文字字串(步驟2 0 4 )時,則本系統所提供之字 元轉換產生模組1 0 3,便會首先判斷使用者所輸入之字串 是否為漢字字串?(步驟2 0 5 )以及是否需要進行多音字處Page 7 518479 V. Description of the invention (5) Polyphonic word processing requirements transmitted from group 103, correspondingly convert the polyphonic characters. (5) Output module 1 0 5 · • is used to accept the converted character group transmitted by the character conversion generating module 1 3 and the polyphonic character conversion module 1 0 4 and output the character group Into the output device. Please refer to "Figure 2", which is an overall flow chart of the system and method for pinyin conversion that can handle multi-syllabic characters. The main methods of this system are as follows: First, we must operate the system on the platform Loaded with a set of Chinese character database (step 2 0 1), the type of this set of Chinese character database can be π Chinese character / Chinese Pinyin π corresponding database or a set of π Chinese character / Zhuyin Pinyin π corresponding database. Next, the system will generate a module 102 from the character correspondence provided by the system according to the type of the Chinese character database loaded in the system operating platform to determine the character correspondence that needs to be established within the system. The type of the relationship conversion table (step 202), and the type of the one-character correspondence relationship conversion table can be: " Chinese Pinyin vs Zhuyin Pinyin "or π Zhuyin Pinyin vs. Han Pinyin's internal character correspondence, and then According to the internal character correspondence determined by the module 102, the corresponding correspondence conversion database is generated (step 203), that is, the “π Chinese Pinyin vs Zhuyin Pinyin” correspondence conversion database. Or it is "Zhuyin Pinyin to Chinese Pinyin π correspondence conversion database. When the user inputs a text string to be converted from the input device (step 2 0 4), the character conversion provided by this system generates module 1 0 3, it will first determine whether the character string entered by the user is a Chinese character string? (Step 2 0 5) and whether a polyphonic character is required.

第8頁 518479 五、發明說明(6) 理?(步驟2 0 5 ),當該字串不是屬於漢字字串的話,則本 系統將會結束流程離開本系統的運作,但如果是漢字字串 的話,則本系統會進一步判斷使用者所輸入的字串(步驟 2 0 4 )是否有多音字必須作進一步處理(步驟2 0 5 ),如果有 的話那麼本系統所提供之多音字轉換模組1 0 4便會讓使用 者依據實際需求自行選擇該多音字之相對應的發音字元 (步驟2 0 6 ),若沒有多音字需要額外作處理的話則系統會 繼續進行系統的流程,將使用者所輸入之字串(步驟2 0 4 ) 依據載入之漢字資料庫(步驟2 0 1 )產生出完整的對應字元 組(步驟2 0 7 ),然後再根據字元對應關係產生模組1 0 2所建 立之對應關係轉換資料庫進行對應字元組轉換的動作,產 生出轉換後的對應字元組(步驟2 0 8 ),最後透過系統操作 平台之輸出設備將轉換後之對應字元組輸出(步驟2 0 9 )。 請參閱「第3圖」,該圖係本發明所提之可處理多音 字的拼音轉換系統及方法的π漢語拼音/注音拼音π對應關 係轉換表,說明本系統中所定義之π漢語拼音/注音拼音π 字元發音對應關係對照。 【達成之功效】 本發明以自行建立之”漢語拼音/注音拼音”對應關係 轉換表,只需要在電腦可執行之操作系統平台上提供一套 π漢字/漢語拼音π或者π漢字/注音拼音π相對應之漢字資料 庫,便能夠實現π漢字/漢語拼音π、π漢字/注音拼音”以及 π注音/拼音”等多種拼音轉換的功能。 不但節省硬體空間,減少維護資料庫内容的人力、時Page 8 518479 V. Explanation of the invention (6) Principle? (Step 2 0 5), when the string is not a Chinese character string, the system will end the process and leave the operation of the system, but if it is a Chinese character string, the system will further determine the user input Whether the string (step 2 0 4) has polyphonic characters must be further processed (step 2 0 5). If so, the polyphonic character conversion module 1 0 4 provided by the system will allow users to make their own decisions based on actual needs. Select the corresponding pronunciation character of the multi-syllabic character (step 2 06). If there is no poly-syllabic character that needs additional processing, the system will continue the process of the system to input the character string entered by the user (step 2 0 4) According to the loaded Chinese character database (step 2 0 1), a complete corresponding character group (step 2 7) is generated, and then the correspondence conversion database created by the module 1 2 is generated according to the character correspondence relationship. The corresponding character group conversion action generates a converted corresponding character group (step 208), and finally outputs the converted corresponding character group through the output device of the system operation platform (step 209). Please refer to "Figure 3", which is the π Chinese Pinyin / Zhuyin Pinyin π correspondence conversion table for the system and method for processing multi-syllabic characters in the present invention, which illustrates the π Chinese Pinyin / Zhuyin pinyin π character pronunciation correspondence. [Achieved effect] The present invention uses a self-established "Chinese Pinyin / Zhuyin Pinyin" correspondence conversion table. It only needs to provide a set of π Chinese characters / Chinese Pinyin or π Chinese characters / Zhuyin Pinyin on a computer-executable operating system platform. Corresponding Chinese character database can realize multiple pinyin conversion functions such as π Chinese character / Chinese pinyin π, π Chinese character / Zhuyin pinyin, and π Zhuyin / pinyin. Not only saves hardware space, but also reduces the manpower and time required to maintain the contents of the database

JIL 第9頁 518479 五、發明說明(7) 間與成本,還減少人工作業 此提昇執行上的效率。並且 關係轉換表在不使用演算法 便能夠提供在拼音、注音轉 理方法,解決以往只能處理 規、龐大詞庫建立的方式來 雖然本發明以前述之較 並非用以限定本發明,任何 明之精神和範圍内,當可作 明之保護範圍當視後附之申 上發生錯誤的機率,更可以因 該”漢語拼音/注音拼音π對應 則,也不採用詞庫的情況下, 換過程中多音字的多種發音處 單音字或是必須透過演算法 解決之既有瓶頸。 佳具體實施例揭露如上,然其 熟習此技藝者,在不脫離本發 些許之更動與潤飾,因此本發 請專利範圍所界定者為準。JIL Page 9 518479 V. Description of the invention (7) Time and cost, also reducing manual work This improves the efficiency of execution. In addition, the relationship conversion table can provide the method of transliteration in pinyin and Zhuyin without using an algorithm, which solves the previous method that can only deal with the establishment of a large and large vocabulary. Although the present invention is not intended to limit the present invention to the foregoing, any clear Within the spirit and scope, when the scope of protection that can be made clear depends on the probability of an error occurring in the attached application, it can also be caused by the "Chinese Pinyin / Zhuyin Pinyin π correspondence, and without thesaurus. Monophonic characters in multiple pronunciations of pronunciations or existing bottlenecks that must be solved by algorithms. The best specific embodiments are disclosed above, but those who are familiar with this technique will not deviate from the changes and retouches of this issue, so the scope of this patent The ones defined shall prevail.

第10頁 518479 圖式簡單說明 第 1圖 係本發明 所 提 之 可 處 理 多 音 字 的 拼 音 轉 換 系 統 法 之功能模 組圖; 第 2圖 係本發明 所 提 之 可 處 理 多 音 字 的 拼 音 轉 換 糸 統 法 之整體流 程圖; 以 及 第 3圖 係本發明 所 提 之 可 處 理 多 音 字 的 拼 音 轉 換 糸 統 法 之▼ 4莫語拼音 /注音拼音π 對 應 關 係 轉 換 表 〇 [ 符號說明 ] Π )1 資 料 庫 載 入 模 組 102 字 元 對 應 關 係 產 生 模 組 103 字 元 轉 換 產 生 模 組 104 多 音 字 轉 換 模 組 105 輸 出 模 組 步 驟 201 載 入 漢 字 資 料 庫 步 驟 202 根 據 漢 字 資 料 庫 決 定 對 應 關 係 轉 換 表 步 驟 203 根 據 對 應 關 係 轉 換 表 產 生 轉 換 資 料 庫 步 驟 204 接 收 使 用 者 m 入 欲 轉 換 字 串 步 驟 205 判 斷 是 否 為 漢 字 及 是 否 需 多 音 字 處 理 步 驟 206 選 擇 多 字 所 需 對 應 發 音 字 元 步 驟 207 產 生 fm 入 字 串 之 對 應 字 元 組 步 驟 208 根 據 轉 換 資 料 庫 將 該 字 元 組 轉 換 步 驟 209 輸 出 轉 換 後 之 字 元 組 « «Page 10 518479 Brief Description of Drawings Figure 1 is a functional module diagram of a method for processing a pinyin conversion system capable of processing polyphones according to the present invention; Figure 2 is a system of pinyin conversion systems capable of processing a multiphonetic character according to the present invention; The overall flow chart of the method; and FIG. 3 is the method of the pinyin conversion system that can handle multi-syllabic characters in the present invention. ▼ 4 Moyu Pinyin / Zhuyin Pinyin π Correspondence Conversion Table 0 [Symbol Description] Π) 1 Database Load module 102 Character correspondence generation module 103 Character conversion generation module 104 Multi-sound character conversion module 105 Output module step 201 Load Chinese character database step 202 Determine correspondence mapping table according to Chinese character database Step 203 According to Correspondence conversion table generating conversion database step 204 receiving user m into conversion string step Step 205 Determine whether it is a Chinese character and whether polyphonic character processing is required. Step 206 Select the corresponding pronunciation character required for multiple characters. Step 207 Generate the corresponding character group of the fm input string. Step 208 Convert the character group according to the conversion database. Step 209 Output Transformed character group ««

第11頁Page 11

Claims (1)

518479 六、申請專利範圍 1. 一種可處理多音字的拼音轉換系統,用以進行簡體中文 漢字、1繁體中文漢字之間漢語拼音和注音拼音的輸入與 轉換,該系統至少包含: 一資料庫載入模組,用以根據一系統操作平台中所使 用之一漢字資料庫來決定一載入之漢字資料庫類型; 一字元對應關係產生模組,用以根據該資料庫載入模 組所提供之該載入漢字資料庫類型來決定一字元對應關 係轉換表類型,產生出一對應關係轉換資料庫; 一字元轉換產生模組,用以接受一使用者由一輸入設 備所輸入之一字•進行轉換動作,根據該字元對應關係 產生模組提供之該對應關係轉換資料庫,產生一轉換後 的對應字元組; 一多音字轉換模組,用以接受該字元轉換產生模組傳 來之一多音字處理需求,進行對應的轉換處理;以及 一輸出模組,用以接受該字元轉換產生模組以及該多 音字轉換模組傳來的一轉換後字元組,將該字元組輸出 至一輸出設備當中。 2. 如申請專利範圍第1項所述之可處理多音字的拼音轉換 系統,該漢字資料庫以及該載入之漢字資料庫類型至少 包含有:”漢字/漢語拼音”以及”漢字/注音拼音拼音”之 對應資料庫。 3. 如申請專利範圍第1項所述之可處理多音字的拼音轉換 系統,該字元對應關係轉換表類型以及該對應關係轉換 資料庫係以一程式語言旗標(F 1 ag )的方式決定其内部對518479 6. Scope of Patent Application 1. A Pinyin conversion system capable of processing multi-syllabic characters for inputting and converting Chinese pinyin and Zhuyin pinyin between simplified Chinese characters and 1 traditional Chinese characters, the system includes at least: a database containing The input module is used to determine the type of a Chinese character database to be loaded according to a Chinese character database used in a system operation platform; a character correspondence generating module is used to load the module based on the database The type of the loaded Chinese character database is provided to determine the type of a character correspondence conversion table to generate a correspondence conversion database. A character conversion generating module is used to accept a user input from an input device. One character • Perform a conversion operation, and generate a converted corresponding character group according to the corresponding conversion database provided by the character corresponding relationship generation module; a multi-sound word conversion module for receiving the character conversion generated A polyphonic word processing request from the module is received to perform corresponding conversion processing; and an output module is used to accept the character conversion to generate a module Group and a converted character group transmitted from the polyphonic word conversion module, and output the character group to an output device. 2. According to the Pinyin conversion system capable of processing multi-syllabic characters as described in item 1 of the scope of patent application, the type of the Chinese character database and the type of the loaded Chinese character database at least include: "Chinese characters / Pinyin" and "Chinese characters / Zhuyin pinyin The corresponding database of Pinyin ". 3. As in the Pinyin conversion system capable of processing multi-syllabic characters as described in item 1 of the scope of patent application, the character correspondence conversion table type and the correspondence conversion database are in the form of a programming language flag (F 1 ag) Determine its internal 第12頁 518479 六、申請專利範圍 應模式,該程式語言旗標屬於一布林(Boo lean)資料型 態。 4. 如申請專利範圍第1項所述之可處理多音字的拼音轉換 系統,該字元對應關係轉換表中所儲存的係π漢語拼音1 和π注音拼音”間的發音符號對照組合。 5. 如申請專利範圍第1項所述之可處理多音字的拼音轉換 系統,該輸入設備可以從鍵盤、滑鼠、數位觸碰式面板 及聲音辨識系統的組合中任意擇一。 6. —種可處理多音字的拼音轉換方法,用以進行簡體中文 漢字、繁體中文漢字之間漢語拼音和注音拼音的輸入與 轉換,該方法至少包含下列步驟: 在一電腦可執行之系統操作平台載入一漢字資料庫; 根據該載入漢字資料庫,決定一對應關係轉換表類 型; 根據。該對應關係轉換表類型,決定一對應關係轉換資 料庫類型; 接受一使用者由一輸入設備輸入一字串; 判斷該字串是否屬於一漢字串及是否需要一多音字處 理; 根據該使用者自行選擇一多音字所需要之正確發音符 號; 根據該字元對應關係轉換表將該字串轉換產生一轉換 後對應子兀組, 根據該轉換後對應字元組輸出至一輸出設備;及Page 12 518479 VI. Application scope of patent application mode, the programming language flag belongs to a Boo lean data type. 4. According to the Pinyin conversion system capable of processing multi-syllabic characters as described in item 1 of the scope of the patent application, the combination of the pronunciation symbols between π Chinese Pinyin 1 and π Zhuyin Pinyin stored in the character correspondence conversion table is 5 The input device can be selected from a combination of a keyboard, a mouse, a digital touch panel, and a voice recognition system, as described in item 1 of the scope of the patent application. A multi-syllabic pinyin conversion method for inputting and converting Chinese pinyin and Zhuyin pinyin between simplified Chinese characters and traditional Chinese characters. The method includes at least the following steps: Chinese character database; According to the loaded Chinese character database, determine a correspondence conversion table type; According to. The corresponding correspondence conversion table type, determine a correspondence conversion database type; Accept a user to input a string from an input device ; Determine whether the string belongs to a Chinese character string and whether a polyphonic word processing is required; A correct pronunciation symbol required for a phonetic character; converting the string into a corresponding sub-group according to the character correspondence conversion table, and outputting the corresponding character group to an output device according to the converted corresponding character group; and 第13頁 518479 六、申請專利範圍 根據判斷該字串之結果決定一結束操作。 7. 如申請專利範圍第6項所述之可處理多音字的拼音轉換 方法,該漢字資料庫至少包含有:π漢字/漢語拼音”以 及π漢字/注音拼音拼音”之對應資料庫。 8. 如申請專利範圍第6項所述之可處理多音字的拼音轉換 方法,該字元對應關係轉換表類型以及該對應關係轉換 資料庫係以一程式語言旗標(F 1 ag )的方式決定其内部對 應模式,該程式語言旗標屬於一布林(Boo lean)資料型 態。 9 .如申請專利範圍第6項所述之可處理多音字的拼音轉換 方法,該字元對應關係轉換表中所儲存的係π漢語拼音^ 和π注音拼音π間的發音符號對照組合。 1 0 .如申請專利範圍第9項所述之可處理多音字的拼音轉換 方法,該字元對應關係轉換表中沒有使用到演算法則技 術或是以建立詞庫的方式作為轉換輔助,其字元轉換過 程完全使用該字元轉換表中所列之發音符號以逐一比對 轉換的方式來進行。 1 1.如申請專利範圍第6項所述之可處理多音字的拼音轉換 方法,該輸入設備可以從鍵盤、滑鼠、數位觸碰式面板 及聲音辨識系統的組合中任意擇一。Page 13 518479 VI. Scope of patent application Determine the end operation based on the result of judging the string. 7. As described in item 6 of the scope of the patent application, the method for processing the pinyin conversion of multi-syllabic characters, the Chinese character database includes at least: π Chinese character / Chinese pinyin "and π Chinese character / Zhuyin pinyin pinyin". 8. As described in item 6 of the scope of the patent application, the method of pinyin conversion capable of processing multi-syllabic characters, the type of the character correspondence conversion table and the correspondence conversion database are in the form of a programming language flag (F 1 ag) Determines its internal mapping mode. The programming language flag belongs to a Boo lean data type. 9. The method of pinyin conversion for multi-syllabic characters as described in item 6 of the scope of patent application, the character correspondence stored in the conversion table of the character correspondence is a combination of pronunciation symbols between π Chinese Pinyin ^ and π Zhuyin Pinyin π. 10. According to the pinyin conversion method for multi-syllabic characters described in item 9 of the scope of the patent application, the algorithm for the conversion of the character correspondence relationship is not used in the algorithm or the method of establishing thesaurus is used as a conversion aid. The meta conversion process completely uses the pronunciation symbols listed in the character conversion table to perform the conversion one by one. 1 1. According to the pinyin conversion method capable of processing multi-syllabic characters as described in item 6 of the scope of patent application, the input device can be selected from a combination of a keyboard, a mouse, a digital touch panel and a voice recognition system. 第14頁Page 14
TW090118427A 2001-07-27 2001-07-27 Phonetic spelling conversion system and method capable of handling multi-tone words TW518479B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW090118427A TW518479B (en) 2001-07-27 2001-07-27 Phonetic spelling conversion system and method capable of handling multi-tone words

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW090118427A TW518479B (en) 2001-07-27 2001-07-27 Phonetic spelling conversion system and method capable of handling multi-tone words

Publications (1)

Publication Number Publication Date
TW518479B true TW518479B (en) 2003-01-21

Family

ID=27801478

Family Applications (1)

Application Number Title Priority Date Filing Date
TW090118427A TW518479B (en) 2001-07-27 2001-07-27 Phonetic spelling conversion system and method capable of handling multi-tone words

Country Status (1)

Country Link
TW (1) TW518479B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101859295B (en) * 2009-04-07 2013-03-13 英业达股份有限公司 System and method for converting simplified Chinese character/word and traditional Chinese character/word with labels and prompts

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101859295B (en) * 2009-04-07 2013-03-13 英业达股份有限公司 System and method for converting simplified Chinese character/word and traditional Chinese character/word with labels and prompts

Similar Documents

Publication Publication Date Title
US8200475B2 (en) Phonetic-based text input method
US7493559B1 (en) System and method for direct multi-modal annotation of objects
US10380241B2 (en) Modular system and method for managing chinese, japanese, and korean linguistic data in electronic form
TWI496012B (en) Modular system and method for managing chinese, japanese, and korean linguistic data in electronic form
JP2014139809A (en) Shared language model
US11853689B1 (en) Computer-implemented presentation of synonyms based on syntactic dependency
WO2010036457A2 (en) Editing 2d structures using natural input
Le et al. Fine-grained event trigger detection
JPH07114558A (en) Chinese character conversion correcting process system
US20080313607A1 (en) Unified input stack
TW518479B (en) Phonetic spelling conversion system and method capable of handling multi-tone words
TWI269193B (en) Keyword sector-index data-searching method and it system
KR100188498B1 (en) Method and apparatus for telephone modification of documents
JPH0344764A (en) Mechanical translation device
Wang et al. Theoretical and architectural support for input device adaptation
Elliott et al. Computer based mathematics using continuous speech recognition
JP3867775B2 (en) Sentence creation device by word input
WO1999052041A1 (en) Opening and holographic template type of language translation method having man-machine dialogue function and holographic semanteme marking system
JP3219527B2 (en) Information processing device
Vergo A statistical approach to multimodal natural language interaction
Cherid et al. Building Natural Language Understanding System from User Manual to Execute Office Application Functions
EP1221082B1 (en) Use of english phonetics to write non-roman characters
JPS60186904A (en) Program input system
Netzer et al. Using semantic authoring for blissymbols communication boards
Feng et al. The Chinese digraphia problem in the information age

Legal Events

Date Code Title Description
GD4A Issue of patent certificate for granted invention patent
MM4A Annulment or lapse of patent due to non-payment of fees