TW200530856A - System, method and program for identifying corresponding translation - Google Patents

System, method and program for identifying corresponding translation Download PDF

Info

Publication number
TW200530856A
TW200530856A TW93138040A TW93138040A TW200530856A TW 200530856 A TW200530856 A TW 200530856A TW 93138040 A TW93138040 A TW 93138040A TW 93138040 A TW93138040 A TW 93138040A TW 200530856 A TW200530856 A TW 200530856A
Authority
TW
Taiwan
Prior art keywords
sentence
translation
natural
original
phrase
Prior art date
Application number
TW93138040A
Other languages
Chinese (zh)
Other versions
TWI281111B (en
Inventor
Jacobson Yoko
Original Assignee
Lab For Language Technology Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lab For Language Technology Inc filed Critical Lab For Language Technology Inc
Publication of TW200530856A publication Critical patent/TW200530856A/en
Application granted granted Critical
Publication of TWI281111B publication Critical patent/TWI281111B/en

Links

Landscapes

  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Suitable translations of phrases to be translated in original sentences having a high probability of obtaining natural translated sentences as sentences in an object language from the original sentences in an original language are obtained. A plurality of natural sentences, in the original language composed of a plurality of words, correlated with translated sentences in the object language are registered in corresponding translation database. When a phrase to be translated in natural sentences in the original language is specified, the corresponding translation database is searched for natural sentences containing the phrase to be translated (102). When no natural sentence completely coinciding with the original sentence is found (negative in 104), a degree of coincidence between each of the extracted natural sentences and the original sentence is evaluated, and a translation having the greatest degree of coincidence is displayed (108-112). When a plurality of natural sentences having the largest degree of coincidence is found (positive in 110), words having a high probability (correlation) of existing in the same sentence as the phrase to be translated are identified and translated sentences of natural sentences containing words having high correlation are displayed (114-118). When no natural sentences containing words having a high correlation are found (negative in 116), the original sentences are searched for words replaceable with targeted words (120-128), and the translated sentences of the natural sentences of which targeted words are replaced with the alternative words in the original sentence are displayed.

Description

200530856 五、發明說明α) 【發明所屬之技術領域】 本發明係有關用於識別一種相對應的翻譯之系統、裝置與 程式,特別是有關一原始語言寫的一原始句子中,至少識 別一翻譯字彙之翻譯之翻譯識別系統,與用以識別應用於 一翻譯識別系統之一翻譯之裝置,以及用以允許一電腦具 備上述翻譯識別系統之功能之程式。 【先前技術】 預期能夠了解一機器翻譯系統將敘述於某一自然語言 (natural language) 中之一句子(一原始句子:an original sentence)翻譯成為欽述於另一語言中(一對 象語言:an object language)之一句子(翻譯的句子) 已有很長的時間,並且提過很多有關機器翻譯改進之技 術。 例如,日本專利申請公告-公開(J a p a n e s e P a t e n t Application Laid-Open: JP-A)第 1 1 - 3 2 8 1 7 8號專利揭露 一技術,其中一片語所表示的一組表現型式(e x p r e s s i〇η forms)係儲存於一硬體裝置所安裝之一英語-日語片語辭 典之中,以及在語法分析過程之英文原文中檢索耦合協調 連接(coordinating conjunction)之包含字彙的表現型 式,並且當檢索表示表格儲存於英語-日語片語辭典時, 或當識別一包含檢索表現型式之字彙的字首(prefix)或 字尾(suffix)時,檢索表現型式視為一詞素(丨norpheme200530856 V. Description of the invention α) [Technical field to which the invention belongs] The present invention relates to a system, a device, and a program for identifying a corresponding translation, and particularly to at least one translation in an original sentence written in an original language. A translation recognition system for the translation of vocabulary, a device for identifying a translation applied to a translation recognition system, and a program for allowing a computer to have the functions of the above translation recognition system. [Prior art] It is expected to understand that a machine translation system translates a sentence (an original sentence) narrated in a natural language into a language (an object language: an object language) sentence (translated sentence) has been a long time, and many techniques related to machine translation improvement have been mentioned. For example, Japanese Patent Application Laid-Open (JP-A) Patent No. 1 1-3 2 8 1 7 8 discloses a technology in which a group of expressions expressed in one phrase (expressi 〇η forms) are stored in one of the English-Japanese phrase dictionaries installed on a hardware device, and retrieve the coordinating conjunction's vocabulary-containing expressions in the original English text of the grammatical analysis process, and when The search expression form is regarded as a morpheme when it is stored in the English-Japanese phrase dictionary, or when a prefix or suffix containing a vocabulary of the search expression is identified

第8頁 200530856 五、發明說明(2) )以分析一句子結構,並且不需分開字彙。 此外,日本專利申請公告-公開(JP-A)第0 6 - 3 1 4 2 9 4號專 利揭露一技術,其中事先儲存每個多意(m u 1 t i v 〇 c a 1)字 彙之特性表格(c h a r a c t e r i s t i c t a b 1 e)相關涵義 (m e a n i n g s)與特性,以及產生第一語言寫於輸入的原始 句子中之一特性紀錄,並且基於產生的特性表格與儲存的 特性表格相比較之原始句子所處理之特性而撿選與顯示一 多意字彙之涵義。 【發明内容】 將敘述於一原始語言中之一原始句子,翻譯成為敘述於一 對象語言(〇 b j e c t 1 a n g u a g e)中之一翻譯句子之工作, 係透過一翻譯過程來完成,其中敘述於原始語言中之原始 句子之字彙轉換成為對象語言中之字彙,並且基於了解句 子通常包含字彙和文法以及依據對象語言之文法規則,重 新排列所轉換的字彙。翻譯過程係採取利用電腦來執行翻 譯之機器翻譯,並且相對應的翻譯之系統係按照字彙單元 (w 〇 r d u n i t)登錄於辭典,而字彙係順序從原始句子中 揀選,並且重複檢索所揀選字彙之相對應的翻譯,以將原 始句子按照字彙單元置換成為相對應的翻譯,並且識別一 部分言詞(speech)之每個字彙以分析句子結構,基於經 常使用之句子結構之分析結果以及根據對象語言之文法規 則,按照字彙單元重新排列相對應的翻譯,以得到慣用 200530856 五、發明說明(3) (g en e ra 1 1 y u s e d)之翻譯句子(相對應的翻譯)。 在機器翻譯中,即使原始句子中之一刻板片語 (stereotyped phrase)可以應用敘述於日本專利中請第 1 1 - 3 2 8 1 7 8號專利之技術,並且具有得到上述片語之可能 的合適翻譯,即使存在於原始句子中之一多意字彙,也可 以運用敘述於日本專利申請第0 6 - 3 1 4 2 9 4號專利之技術可 能得到上述多意字彙之特定翻譯(敘述於日本專利申請第 0 6 - 3 1 4 2 9 4號專利之技術需要複雜的工作,例如分析撿選 多意字彙與含意以及撿選多意字彙之特性,以事先準備特 性表格)。然而,在機器翻譯中,因為即使得到的翻譯沒 有文法錯誤以及按照字彙單元翻譯也沒有錯誤,仍然常發 生非自然(u η n a t u r a 1)發現翻譯的句子作為一對象語言 之句子,即使運用敘述於,例如日本專利第1 1 - 3 2 8 1 7 8號 與第0 6 - 3 1 4 2 9 4號專利之技術,實際情況係機器翻譯之技 術仍然不能得到實際使用之適合的翻譯精確性。 估計這些事實情況之起因係採用機器翻譯之翻譯過程,當 人們用他們的母語(native language)說或寫一句子 時,明顯不同於創造一個句子之過程。也就是說,當人們 用他們的母語說或寫一句子時,過程中他們運用文法規則 而檢索未使用過的字彙,但是實際上,基於取決於上下文 的字彙之前以及有關本身片語的背景知識之後,從累積於 一個別大腦之龐大詞塊(chunks)之數目(含意之質量:Page 8 200530856 V. Description of the Invention (2)) to analyze the structure of a sentence without the need to separate the vocabulary. In addition, Japanese Patent Application Publication-Publication (JP-A) No. 0 6-3 1 4 2 9 4 discloses a technology in which a characteristic table (characteristictab) of each mu 1 tiv 〇ca 1 vocabulary is stored in advance. 1 e) Relevant meanings and features, and generate a feature record in the original sentence written in the first language, and based on the features processed by the original sentence compared with the generated feature table and the stored feature table Choose and display the meaning of a multi-meaning word. [Summary of the Invention] The work of translating an original sentence narrated in an original language into a translated sentence narrated in an object language (〇bject 1 anguage) is accomplished through a translation process, in which the narration is described in the original language The vocabulary of the original sentence in is converted into the vocabulary in the target language, and the converted vocabulary is rearranged based on the understanding that the sentence usually contains vocabulary and grammar and according to the grammar rules of the target language. The translation process uses machine translation using a computer to perform translation, and the corresponding translation system is registered in the dictionary according to the vocabulary unit (word unit), and the vocabulary is sequentially selected from the original sentence, and the selected vocabulary is repeatedly retrieved. Corresponding translations to replace the original sentence with the corresponding vocabulary unit, and identify each vocabulary of a part of speech to analyze the sentence structure, based on the analysis results of frequently used sentence structures and the grammar of the target language Rules, rearrange the corresponding translations according to the vocabulary units to get the commonly used 200530856 V. Invention Description (3) (g en e ra 1 1 yused) translation sentences (corresponding translations). In machine translation, even if one of the stereotyped phrases in the original sentence can be applied to the technology described in the Japanese Patent No. 1 1-3 2 8 1 7 8 and has the possibility of obtaining the above phrase Appropriate translation, even if a polysemy exists in the original sentence, the specific translation of the polysemy can be obtained using the technology described in Japanese Patent Application No. 0 6-3 1 4 2 9 4 (narrated in Japan) The technology of the patent application No. 0 6-3 1 4 2 9 4 requires complicated work, such as analyzing the characteristics of selecting polysemes and meanings and selecting polysemes to prepare characteristic tables in advance). However, in machine translation, even if the translation obtained is free of grammatical errors and translation according to vocabulary units, it is still unnatural (u η natura 1) to find the translated sentence as a sentence of the target language, even if narrative For example, the technologies of Japanese Patent Nos. 1 1-3 2 8 1 7 8 and 0 6-3 1 4 2 9 4 are actually machine translation technologies that still cannot obtain the appropriate translation accuracy for practical use. It is estimated that the cause of these facts is the translation process using machine translation. When people speak or write a sentence in their native language, it is obviously different from the process of creating a sentence. That is, when people speak or write a sentence in their native language, they use grammar rules to retrieve unused vocabulary, but in fact, based on the context of the context-dependent vocabulary and background knowledge about the phrase itself After that, from the number of huge chunks (meaning quality:

第10頁 200530856 五、發明說明(4) 字彙、片語、刻板措辭(stereotyped express i ο 排列、句子結構,以及句子)中,連接其所檢索( 況而撿選)之合適的字彙、片語以及句子來創造句 因此,了解取決於上下文之相關字彙之前與相關字 有關片語本身的背景知識以及當人們用其母語說或 同時用相同的方法以作為當人們用其母語說或寫時 的單位(也可以稱為詞塊,chunks),尋找原始句 對應的翻譯而得到自然翻譯作為對象語言中之句子 而,真實情況係按照詞塊尋找相對應的翻譯之翻譯 創造,而未能於機器翻譯系統中實行係因為上述翻 之創造不容易識別原文或有關片語本身的背景知識 詞塊之產生也會含糊不清。 將上述事實作為考量建立本發明,目的在於得到相 翻譯識別系統、相對應的翻譯識別方法以及程式。 為了達到上述目的,本發明相關之申請專利範圍第 翻譯識別系統包含儲存裝置,用以儲存於原始語言 複數個字彙之複數個自然句子,上述之自然句子與 言中之翻譯相關聯;檢索裝置用以從儲存於前述之 置中之原始語言之複數個自然句子之中,檢索原始 原始句子中包含翻譯片語之自然句子;相對應的翻 裝置根據上述檢索裝置與上述原始句子檢索而撿選 ns)、 根據情 子。 彙後、 寫時, 之檢索 子之相 〇 缺 / η、 句子之 譯句子 ,並且 對應的 1項之 中包含 對象語 儲存裝 語言之 譯識別 之自然 200530856 五、發明說明(5) 句子之間,用以決定一致之程度,並且識別包含至少在基 於決定一致之程度而選擇之自然句子之相對應的翻譯中之 上述翻譯片語,以作為至少上述原始句子中之上述翻譯片 語。 根據本發明之申請專利範圍第1項,原始語言中包含複數 個字彙之自然句子係儲存於自然句子與對象語言中之翻譯 相關之儲存裝置。在傳統機器翻譯系統之辭典中,本發明 相關之自然句子係至少為未經編輯或處理,例如按照字彙 單元相除與多意字彙之撿選之句子、子句、片語、刻板措 辭以及慣用語(c ο 1 1 〇 c a t i ο η)中之任一種,並且在按照 儲存自然句子之字彙單元與對象語言中之翻譯相關之句子 之辭典中登錄之例子中,排除多意字彙提供很多相對應的 翻譯之必要性(多意字彙係有關很多翻譯,因為必須選出 適合的翻譯,所以可能會有不正確的選擇)以及可以將相 對應於一自然句子之翻譯轉換成為對象語言中之自然句 子。此外,字彙與另外的自然句子本身相對應的翻譯可以 儲存於本發明之儲存裝置。 根據本發明之申請專利範圍第1項,從儲存於儲存裝置之 原始語言中之複數個自然句子之中,檢索裝置檢索包含敘 述於原始語言(句子、子句、片語、刻板措辭,以及慣用 語中之任一種)中之原始句子之翻譯片語之自然句子。翻 譯片語意指特別在原始句子中翻譯之片語,可以是一字彙Page 10 200530856 V. Description of the invention (4) The vocabulary, phrases, and stereotyped expressions (stereotyped express i ο arrangement, sentence structure, and sentences) are connected to the appropriate vocabulary and phrases searched (and sometimes selected) As well as sentences to create sentences. Therefore, to understand the background knowledge of the phrase itself related to related words before context-dependent vocabulary and when people speak in their mother tongue or both use the same method as when people speak or write in their mother tongue The unit (also known as chunks) searches for the translation of the original sentence and obtains the natural translation as a sentence in the target language. The actual situation is based on the translation of the chunk to find the corresponding translation. The implementation in the translation system is because the above-mentioned translation creation is not easy to identify the original text or the generation of background knowledge chunks related to the phrase itself will also be ambiguous. The present invention was established by taking the above facts into consideration, and the purpose is to obtain a corresponding translation recognition system, a corresponding translation recognition method, and a program. In order to achieve the above purpose, the translation recognition system of the scope of patent application related to the present invention includes a storage device for storing a plurality of natural sentences in a plurality of vocabularies of the original language, and the above natural sentences are related to the translation in the language; the retrieval device is used for Retrieve natural sentences containing translated phrases from the original original sentence from the plurality of natural sentences stored in the original language in the aforementioned center; the corresponding translation device selects ns according to the above retrieval device and the above original sentence retrieval ), According to love. After rendezvous, at the time of writing, the search terms are missing / η, the translated sentence of the sentence, and the corresponding 1 item contains the nature of the target language storage language translation recognition 200530856 V. Description of the invention (5) Between the sentences To determine the degree of agreement, and to identify the above translation phrases included in at least the corresponding translations of natural sentences selected based on the degree of agreement determination, as at least the above translation phrases in the original sentence. According to the first patent application scope of the present invention, a natural sentence containing a plurality of words in the original language is stored in a storage device related to the translation of the natural sentence and the target language. In the dictionary of the traditional machine translation system, the natural sentences related to the present invention are at least unedited or processed, such as the selection of sentences, clauses, phrases, stereotyped words, and idiomatic phrases that are divided by the vocabulary unit and the multi-word vocabulary. (C ο 1 1 〇cati ο η), and in the example registered in the dictionary that stores the vocabulary units of natural sentences and sentences related to translation in the target language, the exclusion of polysemes provides a lot of correspondence The need for translation (multi-word vocabulary is related to many translations, because suitable translations must be selected, so there may be incorrect choices), and translations corresponding to a natural sentence can be converted into natural sentences in the target language. In addition, translations of vocabulary corresponding to other natural sentences themselves can be stored in the storage device of the present invention. According to item 1 of the scope of patent application of the present invention, the retrieval device retrieves from the plurality of natural sentences stored in the original language of the storage device the narration contained in the original language (sentences, clauses, phrases, stereotyped words, and idiomatic expressions). Any one of the languages), the natural sentence of the translated phrase of the original sentence. Translation phrase means a phrase translated especially in the original sentence and can be a word

第12頁 200530856 五、發明說明(6) 或包含複數個字彙。舉例而言,當使用者(翻譯者)使用 本發明相關之翻譯識別系統作為用於翻譯工作之電子辭典 時,翻譯者可以明確說明翻譯片語。此外,例如當使用本 發明相關之翻譯識別系統作為一部分之機器翻譯系統或自 動解釋系統(automatic interpretation system)時, 使用本發明相關之翻譯識別系統識別之相對應的翻譯執行 機器翻譯之機器翻譯系統或自動解釋系統可以明確說明翻 譯片語。另外,也可以由使用者(翻譯者等)明確說明包 含翻譯片語之原始句子,或自動明確說明(例如,自動識 別一句子或包含翻譯片語之一子句以作為原始句子)。由 於檢索裝置檢索包含一翻譯片語之一自然句子,檢索結果 係包含於相對應的翻譯中之翻譯片語之翻譯的一自然句子 之撿選。 根據申請專利範圍第1項之第一翻譯識別裝置,由檢索裝 置之檢索所撿選之自然句子之間決定一致之程度,並且識 別至少在基於一致之程度而選擇之自然句子之相對應的翻 譯中之翻譯片語之翻譯,以作為至少原始句子中翻譯片語 之翻譯。因此,在撿選檢索裝置之檢索所撿選之高可能之 自然句子(例如,使用具有相同意義之一自然句子以作為 於原始句子中多意字彙)之間,自然句子與原始句子相似 以作為具有高一致之程度之自然句子,並且至少識別翻譯 中之翻譯片語以作為原始句子中之翻譯片語(理所當然, 片語之翻譯除了識別相對翻譯中之翻譯片語之外,並且依Page 12 200530856 V. Description of the invention (6) Or it may contain plural words. For example, when a user (translator) uses a translation recognition system related to the present invention as an electronic dictionary for translation work, the translator can specify the translation phrase. In addition, for example, when using a machine translation system or an automatic interpretation system as a part of the translation recognition system related to the present invention, a machine translation system that performs machine translation using a corresponding translation identified by the translation recognition system related to the present invention Or the automatic interpretation system can clearly explain the translation phrase. Alternatively, the user (translator, etc.) can explicitly specify the original sentence containing the translation phrase, or can automatically specify it (for example, automatically recognize a sentence or include a clause of the translation phrase as the original sentence). Since the retrieval device retrieves a natural sentence containing a translation phrase, the retrieval result is a selection of a natural sentence translated by the translation phrase included in the corresponding translation. According to the first translation recognition device of the first patent application scope, the degree of agreement between the natural sentences selected by the search of the retrieval device is determined, and the corresponding translation of the natural sentence selected based on at least the degree of agreement is identified The translation of the translated phrase in the translation of at least the original sentence in the original sentence. Therefore, between the high-probability natural sentences picked by the retrieval device (for example, using a natural sentence with the same meaning as the polysemous word in the original sentence), the natural sentence is similar to the original sentence as Natural sentences with a high degree of consistency, and at least the translation phrase in the translation is recognized as the translation phrase in the original sentence (as a matter of course, the translation of the phrase is in addition to identifying the translation phrase in the relative translation, and according to

第13頁 200530856 五、發明說明(7) 據與原始句子一致之程度以作為原始句子中片語之翻譯 因此,根據本發明之申請專利範圍第1項,考量原始句子 中識別詞塊之標點符號(p u n c t u a t i ο η)之最大技術困 難,係儲存相對應於原始語言中的翻譯之原始語言中之複 數個自然句子,並且從儲存的自然句子中揀選與原始語言 中之原始句子具有高一致之程度之自然句子,以及識別相 對應於所撿選之自然句子之自然翻譯中之翻譯,以作為至 少原始語言中之翻譯片語。因此,本發明使得到之自然翻 譯能夠等於相對應於按照詞塊經常執行之原始句子之翻 譯,以作為不需要識別原始句子中詞塊之標點符號之一結 果,並且使能夠得到之原始句子中翻譯片語之合適翻譯 (從原始句子中得到之高可能之翻譯,以作為對象語言中 之句子)。 根據本發明之申請專利範圍第1項,可以了解儲存於自然 句子與儲存裝置中之本身相對應的句子,不需要執行複雜 的編輯與程序,例如將自然句子除以字彙單元、撿選多意 字彙以及當儲存自然句子與本身相對應的翻譯於,至少, 儲存裝置時,列舉多意字彙之翻譯盡可能與每個相關的字 彙一樣多。此外,根據本發明之申請專利範圍第1項,在 檢索裝置之檢索所撿選之自然句子與原始句子之間決定一 致之程度,基於決定一致之程度而檢選之一自然句子可以 200530856 五、發明說明(8) 得到合適的翻譯,並且不需要執行複雜的程序,例如在傳 統機器翻譯系統中識別一部分之言詞(speech)與句子結 構之分析,並且可以簡化程序。 根據本發明之申請專利範圍第1項之儲存裝置中儲存自然 句子與本身相對應的翻譯,自然句子有可能完全與儲存於 儲存裝置中之原始句子一致。考量此點,如同上述,例如 申請專利範圍第2項,較佳的翻譯識別裝置從儲存於儲存 裝置之原始語言之複數個自然句子中,也檢索與原始句子 完全一致之自然句子,並且當撿選與檢索裝置之檢索之原 始句子完全一致之自然句子時,第一翻譯識別裝置識別與 原始句子完全一致之自然句子之翻譯。因此,當與原始句 子完全一致之自然句子係儲存於儲存裝置時,可以得到原 始句子之翻譯。 根據本發明之申請專利範圍第1項,在檢索所撿選之自然 句子與原始句子之間得到一致之程度,例如下列的方法。 也就是說,根據本發明之申請專利範圍第3項具有一特 性,根據本發明之申請專利範圍第1項,在檢索裝置之檢 索所撿選之自然句子與原始句子之間,第一識別裝置計算 與原始句子一致之字彙之數目而評估一致之程度,使得當 一致的字彙之數目增加時,與原始句子一致之程度也隨之 提南。在自然句子與原始句子之間,表示一致之程度之一 致的字彙之數目,係一重要的指標。藉由評估一致之程Page 13 200530856 V. Description of the invention (7) According to the degree of consistency with the original sentence, it is used as the translation of the phrase in the original sentence. Therefore, according to item 1 of the scope of patent application of the present invention, consider the punctuation marks in the original sentence. The biggest technical difficulty of (punctuati ο η) is to store a plurality of natural sentences in the original language corresponding to the translation in the original language, and to select from the stored natural sentences with a high degree of consistency with the original sentences in the original language Natural sentences and identifying translations in the natural translations corresponding to the selected natural sentences as translation phrases in at least the original language. Therefore, the present invention enables a natural translation to be equivalent to a translation corresponding to an original sentence often performed in accordance with a chunk, as a result of not needing to recognize punctuation marks in the original sentence, and enables the original sentence to be obtained. Appropriate translation of the translation phrase (highly probable translation obtained from the original sentence as a sentence in the target language). According to item 1 of the scope of patent application of the present invention, it is possible to understand the sentences corresponding to the natural sentences stored in the storage device, without the need to perform complicated editing and procedures, such as dividing natural sentences by vocabulary units, and selecting multiple ideas. The vocabulary and when storing translations corresponding to the natural sentence itself, at least when storing the device, list as many translations of the multi-sense vocabulary as possible for each related vocabulary. In addition, according to the first patent application scope of the present invention, the degree of agreement between the natural sentence selected by the retrieval device and the original sentence is determined. Based on the degree of agreement, a natural sentence can be selected. 200530856 Description of the Invention (8) A proper translation is obtained, and it is not necessary to execute a complicated program, such as analyzing a part of speech and sentence structure analysis in a traditional machine translation system, and the program can be simplified. According to the invention, the storage device of item 1 of the patent application stores the translation of the natural sentence corresponding to itself, and the natural sentence may be completely consistent with the original sentence stored in the storage device. Taking this into consideration, as described above, for example, in the scope of patent application No. 2, the preferred translation recognition device also retrieves natural sentences that are completely consistent with the original sentence from the plurality of natural sentences stored in the original language of the storage device, and should pick up When selecting a natural sentence that is identical to the original sentence retrieved by the retrieval device, the first translation recognition device recognizes the translation of the natural sentence that is identical to the original sentence. Therefore, when a natural sentence that is completely consistent with the original sentence is stored in the storage device, a translation of the original sentence can be obtained. According to item 1 of the scope of patent application of the present invention, the degree of agreement between the retrieved natural sentence and the original sentence is retrieved, such as the following method. That is, according to item 3 of the scope of patent application of the present invention, there is a characteristic. According to item 1 of the scope of patent application of the present invention, between the natural sentence and the original sentence selected by the retrieval device, the first recognition device Calculate the number of words that are consistent with the original sentence and evaluate the degree of agreement, so that as the number of words that coincide with each other increases, the degree of agreement with the original sentence is also raised. Between natural sentences and original sentences, the number of words that express one degree of consistency is an important indicator. By assessing a consistent process

第15頁 200530856 五、發明說明(9) 度,使得當一致的字彙之數目增加時,與原始句子一致之 程度也隨之提高,並且基於一致之程度之高準確性而撿選 與原始句子相近之一自然句子。 根據本發明之申請專利範圍第3項,雖然可能直接使用一 致的字彙之計算數目作為一致之程度,如上所述,例如申 請專利範圍第4項,較佳的係使用一致的字彙之計算數目 除以包含翻譯片語之字彙之數目,以作為一致之程度。因 此,由此項程序,基於包含翻譯片語之字彙之數目可以標 準化(normalizing) —致字彙之數目之數值(value)得 到關於本發明之一致之程度,用以撿選與具有使用一致之 程度之高度準確性之原始句子相近之自然句子,並且與包 含翻譯片語之字彙之數目不相關。為了計算一致之字彙, 可以分辨包含翻譯片語的字彙之數目(評估一致的字彙之 數目之數值)以及其他字彙,除了藉由不同的權重 (weights)(權重設定(weights set)使得權重包含片 語之字彙係較高(heav i er))之外,並且使用包含翻譯 片語之字彙之數目乘以一致的字彙之字彙之數目,而需要 加總數值所得到之數值以作為一致的字彙之數目。 根據本發明之申請專利範圍第3項,允許第一翻譯識別系 統在檢索裝置之檢索而檢選之自然句子與原始句子之間, 計算未一致(mismatched)的字彙之數目,並且評估與原 始句子一致之程度,使得當計算的字彙之數目減少時,與Page 15 200530856 V. Description of the invention (9) degree, when the number of consistent vocabulary increases, the degree of consistency with the original sentence also increases, and based on the high accuracy of the degree of consistency, the selection is similar to the original sentence One of the natural sentences. According to item 3 of the scope of patent application of the present invention, although it is possible to directly use the calculated number of consistent vocabularies as the degree of consistency, as described above, for example, for item 4 of the scope of patent application, it is better to use the calculated number of consistent vocabularies divided The number of vocabularies containing translated phrases is taken as the degree of consistency. Therefore, from this procedure, the number of vocabularies containing translation phrases can be normalized-the value of the number of vocabulary values is used to obtain a degree of agreement with the present invention to select the degree of consistency with use The highly accurate original sentence is similar to the natural sentence and is not related to the number of words containing the translated phrase. In order to calculate a consistent vocabulary, the number of vocabularies containing translation phrases (the value of the number of consistent vocabularies evaluated) and other vocabulary can be distinguished, except that different weights (weights set) are used to make the weights include slices. The word vocabulary of the language is higher (heav i er)), and the number of vocabulary including the translation phrase is multiplied by the number of the vocabulary of the consistent vocabulary, and the value obtained by adding the total value is used as the word number. According to item 3 of the scope of patent application of the present invention, the first translation recognition system is allowed to calculate the number of mismatched words between the natural sentence and the original sentence selected by the retrieval device and to evaluate the original sentence. The degree of consistency, so that when the number of words calculated is reduced, and

第16頁 200530856 五、發明說明(10) 原始句子一致之程度也隨之提高,如申請專利範圍第5項 所敘述。因此,可能在評估一致之程度之自然句子與原始 句子之間,更增進評估一致之程度之準確性,使得當未一 致的字彙之數目減少時,一致之程度也隨之提高,使用上 述之未一致的字彙之數目,另外如根據申請專利範圍第3 項所敘述之一致字彙之數目。 儲 中 明 發 本 在 原 之 言 語 始 原 ,於 現 項 3 出 常 圍經 範要 利只 專, 請子 φ-句 據然 根自 如之 ,置 法裝 方存 此儲 由於 藉存 彙 彙 字 字 之e)的 h 中 ΐ 應 子丨對 句這相 始卩為 a 作V菜 以· 字 J 的 cn)現 • 1 一 出 Γ 常 之 > 經 7之 中 多 子及/很 句 含 文α包 英d子 为 ο 句 例 當 相一 不高 子有 句具 始而 原彙 與字 上的 際現 實出 選常 撿經 以為 可因 中為 其作 , 以 隹, 能子 可句 一 然 有自 具一 ,之 時近 致專 一 請 算申 言 從例 的述 佳所 較上 ,如 題, 問中 個彙 這字 慮之 考算 。 計 子的 然之 白 S 之數 度之 程囊 之字 致的 第 義 定 先 預 將 置 裝 別 識 可用 為使 因以 ,可 此, 因鄉音 。影 外之 在上 除彙 排字 彙的 字致 的一 現在 謂 : 河出彙 f-K-r -#f {子 經的 }現 , d e 出 項 η 6 i 常 第ef經 d 圍 e 除 範pr排 利{以 精 更 間 之 子 句 始 原 與。 子X) 6 句 d η 然.1 白 ( 在標 為指 作之 以度 g程 數之 之致 彙一 字應 的反 致地 〆確 根據本發明之申請專利範圍第3項或第5項,較佳的如上所 述例如申請專利範圍第7項,當計算一致或未一致的字彙Page 16 200530856 V. Description of the invention (10) The degree of consistency of the original sentence has also increased, as described in item 5 of the scope of patent application. Therefore, it is possible to improve the accuracy of the degree of consistency between the evaluation of the natural sentence and the original sentence, so that when the number of non-consistent words decreases, the degree of consistency also increases. The number of consistent vocabulary, and the number of consistent vocabulary as described in item 3 of the scope of patent application. Chu Zhongming's original text was in the original language. In the current item 3, the Circumvention of the Scriptures and Fan Li only specializes, please subordinate φ-sentence as it should be. You can save this deposit due to the debit and exchange of e) In the middle of h, 子 子 丨 the beginning of the opposite sentence is a for V. The word cn for the word J) Now • 1 out of Γ Changzhi > d is for ο sentence example when the first one is not high, the sentence has a beginning and the original sink and the word, the actual reality is often chosen to think that it can be done in the cause, so that the can, but the sentence can have its own First, at that time, I would like to ask you to consider the application of the testimony from the example of the description, such as the question, the question of the question in the question. Of course, the definition of the number of degrees in the white S and the degree of the process of the capsule is to make the device identification available in advance so that it can be, therefore, due to the accent. The one in the shadow of the word suffix above the hui hui is now called: river outbound fKr-# f {子 经 的} present, de output term η 6 i often ef d d e e except Fan pr row profit {to The clause between the finer ones begins with the original.子 X) 6 sentences d η ran. 1 White (indicated as the meaning of the degree of the course of the degree of the convergent response should be based on the correct application of the invention according to the scope of the patent application item 3 or 5 , Preferably as described above, for example, the 7th in the scope of patent application, when calculating consistent or non-consistent vocabulary

第17頁 200530856 五、發明說明(11) 時考量因為單數/複數(s i n g u 1 a r / p 1 u r a 1)或時態 (tense)以作為一致之字彙而具有曲折形式 (i n f 1 e c t i ο η)之字彙,第一翻譯識別裝置計算一字彙以 作為一致的字彙。在表格中事先登錄相關之字彙,並且研 究是否只在表格中顯示登錄之未一致之字尾,也可以發現 因為單數/複數或時態而具有曲折形式之字彙。因此,基 本地(fundamentally)考量,將因為單數/複數或時態而 具有曲折形式之字彙之影響排除在外,在一致或未一致的 字彙之數目上,也可能使用一致或未一致的字彙之數目以 作為在自然句子與原始句子之間更精確地反應一致之程度 之指標。另外,較佳的也考慮使用寫在上文或下文之信件 的英文句子中之字彙等,以作為一致的字彙。 根據本發明之申請專利範圍第3項,較佳的如上所述例如 申請專利範圍第8項,當計算一致的字彙時,第一翻譯識 別裝置不重複計算使用兩次或多次的字彙。因此,在一致 的字彙上可以排除使用兩次或多次一致的字彙之影響,並 且可以使用一致的字彙之數目以作為在自然句子與原始句 子之間更精確地反應一致之程度之指標。如上所述取代避 免重複計算,可以限制計算使用兩次或多次預先定義的數 目η ( η - 2)之一致的字彙之數目的最大數目之次數。 根據本發明之申請專利範圍第3項或第5項,較佳的如上所 述例如申請專利範圍第9項,第一翻譯識別裝置在檢索裝Page 17 200530856 V. Description of the invention (11) When considering the singular / plural (singu 1 ar / p 1 ura 1) or tense as a consistent vocabulary, it has a zigzag form (inf 1 ecti ο η) Vocabulary, the first translation recognition device calculates a vocabulary as a consistent vocabulary. Register the relevant vocabulary in advance in the form, and study whether to display only the inconsistent suffixes registered in the form. You can also find that the vocabulary has a zigzag form because of the singular / plural or tense. Therefore, fundamentally (fundamentally) consideration is to exclude the influence of vocabulary in zigzag form due to singular / plural or tense. It is also possible to use the number of consistent or non-consistent vocabulary on the number of consistent or inconsistent vocabulary Used as an indicator of the degree of more consistent response between natural sentences and original sentences. In addition, it is also preferable to use the vocabulary in the English sentence of the letter above or below as a consistent vocabulary. According to item 3 of the scope of patent application of the present invention, preferably as described above, for example, item 8 of the scope of patent application, when calculating a consistent vocabulary, the first translation recognition device does not repeatedly calculate the vocabulary used two or more times. Therefore, the effect of using two or more consistent vocabularies can be ruled out on the consistent vocabulary, and the number of consistent vocabularies can be used as an indicator of the degree of more consistent response between the natural sentence and the original sentence. Instead of avoiding double counting as described above, the maximum number of times that the number of consistent vocabularies using two or more predefined numbers η (η-2) can be calculated can be limited. According to item 3 or item 5 of the scope of patent application of the present invention, preferably as described above, for example, item 9 of the scope of patent application, the first translation recognition device searches for

第18頁 200530856 五、發明說明(12) 置之檢索而撿選之自然句子與原始句子之間評估字彙之排 列順序之相似度,並且評估與原始句子一致之程度,使得 當字彙之排列順序之相似度提高時,與原始句子一致之程 度也隨之提高。即使句子包含相同的字彙,一自然句子有 時依據字彙之排列順序而會有不同的意思(翻譯),基於 如上述之由字彙之排列順序評估一致之程度,可以選擇與 具有高度準確性之原始句子相似之一自然句子並且評估一 致之程度,使得當字彙之排列順序之相似度提高,一致之 程度也隨之提高。 根據本發明之申請專利範圍第3項或第5項,較佳的,如上 所述例如申請專利範圍第1 0項,第一翻譯識別裝置在與檢 索裝置之檢索所撿選之自然句子一致之字彙中,計算未一 致的字彙之數目,並且評估與原始句子一致之程度,使得 當計算在一致的字彙中未一致的字彙之數目減少時,與原 始句子一致之程度隨之提高。因此,增進一致之程度之準 確性,並且可以選擇與具有高度準確性之原始句子相近之 自然句子。 因此,根據本發明之申請專利範圍第1項,有時只基於一 致之程度,難以識別(選擇)與原始句子相近之自然句 子,例如檢選具有相同的一致之程度之複數個自然句子 );雖然,上述之識別係視儲存於儲存裝置之自然句子 (翻譯)之種類而定。考慮上述之問題,根據本發明之申Page 18 200530856 V. Description of the invention (12) Search the natural sentence selected and the original sentence to evaluate the similarity of the order of the vocabulary, and evaluate the degree of consistency with the original sentence, so that the order of the vocabulary As the degree of similarity increases, the degree of agreement with the original sentence also increases. Even if the sentences contain the same vocabulary, a natural sentence sometimes has different meanings (translations) depending on the arranging order of the vocabulary. Based on the evaluation of the degree of consistency from the arranging order of the vocabulary as described above, you can choose the original Sentences are similar to a natural sentence and the degree of consistency is evaluated, so that when the similarity of the arrangement order of the vocabulary increases, the degree of consistency also increases. According to item 3 or item 5 of the scope of patent application of the present invention, preferably, as described above, for example, item 10 of the scope of patent application, the first translation recognition device is consistent with the natural sentence selected by the retrieval device. In the vocabulary, the number of inconsistent vocabularies is calculated, and the degree of agreement with the original sentence is evaluated, so that when the number of inconsistent vocabularies in the consistent vocabulary is reduced, the degree of agreement with the original sentence increases. Therefore, the accuracy of the degree of consistency is improved, and a natural sentence close to the original sentence with high accuracy can be selected. Therefore, according to item 1 of the scope of patent application of the present invention, sometimes based on the degree of consistency, it is difficult to identify (select) natural sentences that are similar to the original sentence, such as selecting a plurality of natural sentences with the same degree of consistency); Although, the above identification depends on the kind of natural sentence (translation) stored in the storage device. Considering the problems described above, according to the application of the present invention

第19頁 200530856 五、發明說明(13) 請專利範圍第1項,較佳的如上所述例如申請專利範圍第 1 1項,搜尋裝置用以搜尋檢選裝置之檢索所撿選之自然句 子,上述檢索裝置係用於翻譯片語與經常出現於另外提供 之原始語言之相同文章中之經常出現的字彙,參照搜尋裝 置發現的特定經常出現片語與存在於原始句子,第一翻譯 識別裝置搜尋有關包含每個翻譯片語以及特定經常出現的 片語之自然句子中的翻譯片語之經常出現的翻譯,如同在 檢索裝置之檢索所撿選的自然句子之間包含每個翻譯片語 之翻譯的自然句子之翻譯,而且第一翻譯識別裝置識別經 常出現的翻譯以作為原始句子中翻譯片語之翻譯。 當一翻譯片語與特定經常出現片語經常出現(稱為慣用語 )於原始句子中之原始語言之相同句子時,具有高可能的 翻譯片語之翻譯與包含每個翻譯片語以及特定經常出現的 片語之自然句子中之相對應的翻譯之翻譯片語之翻譯一 致。然而,雖然翻譯片語很可能識別成為自然句子之相對 應的翻譯中之翻譯片語之合適翻譯,上述自然句子包含每 個特定經常出現的片語,而具有翻譯片語之不合適的翻譯 之自然句子也可能混合在儲存於儲存裝置之上述自然句子 中。基於上述,根據本發明之申請專利範圍第1 1項,當搜 尋具有高度相關之經常出現的片語,在了解經常出現的片 語以及包含每個翻譯片語之自然句子之翻譯之間,參考原 始句子中特定經常出現的片語,而發現用於翻譯片語以及 包含每個特定經常出現的片語之自然句子中之經常出現的Page 19, 200530856 V. Description of the invention (13) Please refer to item 1 of the patent range, preferably as described above, for example, for item 11 of the patent application range. The above search device is used to translate phrases and words that often appear in the same article in the original language provided separately. With reference to specific frequently-occurring phrases found in the search device and existing sentences, the first translation recognition device searches Regarding the frequently occurring translations of natural phrases containing each translated phrase and specific frequently occurring phrases, as in the case of including each translated phrase between natural sentences selected by the retrieval device search Translation of natural sentences, and the first translation recognition device recognizes frequently occurring translations as translations of translation phrases in the original sentence. When a translated phrase is the same as the original sentence in the original sentence, which is often found in the original sentence (referred to as idiomatic phrase), the translation of the highly possible translated phrase includes each translated phrase and the specific regular phrase. The translations of the corresponding translations in the natural sentences of the emerging phrases are consistent. However, although the translated phrase is likely to be a proper translation of the translated phrase in the corresponding translation of the natural sentence, the above-mentioned natural sentence contains each specific frequently occurring phrase, and has an inappropriate translation of the translated phrase. Natural sentences may also be mixed in the above-mentioned natural sentences stored in the storage device. Based on the above, according to item 11 of the scope of patent application of the present invention, when searching for highly relevant and frequently occurring phrases, between understanding frequently occurring phrases and the translation of natural sentences containing each translated phrase, refer to Specific frequently-occurring phrases in the original sentence, and frequently-occurring ones found in natural sentences used to translate phrases and include each specific frequently-occurring phrase

第20頁 前專圍而片沒原 的請範語譯,慮翻 t)申利片翻此考的 X 之專的與因,適 t明請現語,果合 發申出片子結之 文本之常照句。囊 原據述經參譯慮字 將根敘之及翻考意 別。後中以的入多 識譯其章,應列到 以翻明文譯對子得 難的發同#相句略 ,適本相的之始粗 擇合據於應子原以 選之根中對句之可 多慮及集相然後子 很考以時之自與句 有入C同語一前始 況列項由片之文原 情囊1 ,譯現原之 據字第3翻出將後 依之圍項一時要與 4 為後範1J?>同必前。 因與利第識語有文譯 200530856 五、發明說明(14) 翻譯。因此,基於經常出現的片語而可以得到原始句子中 翻譯片語之合適的翻譯(從原始句子中得到自然翻譯之相 對應的翻譯具有高可能性,以作為對象語言中之句子), 即使只基於一致之程度也很難識別(選擇)一合適的自然 句子。 登錄表格中之經常出現於原始語言之相同原文中之片語並 且參照表格,以搜尋上述經常出現的片語。根據本發明之 申請專利範圍第1 1項,基於檢索裝置之檢索所撿選之自然 句子發現經常出現的片語有兩個優點,其中可以節省準備 上述表格的時間與人力,並且也可以節省所需要儲存表格 之儲存容量。 根據本發明之申請專利範圍第1項,較佳的用以識別一選On page 20, if you have n’t done the original, please ask the translation of the idiom, consider turning t) Shen Li ’s review of the special and cause of the X, suitable for the present language, Guo Hefa issued the final text of the film As usual. Capsules According to the reference translation, the meaning of the reference will be explained in the root and review. The chapters of the post-Chinese-Israeli translation should be listed in the same sentence, which is difficult to translate in the plain text. The relative selection is based on the rough selection of Ying Ziyuan's choice. Sentences of the sentence can be taken into consideration, and then the time is taken into account. The self-sentence and the sentence are included in the same predecessor of the C language. The original item is from the original text of the film, and the original word is translated. According to the enclosing term, it is necessary to be the same as 4 as the post-normative 1J? ≫ Yin Yudi has a text translation 200530856 V. Description of the invention (14) Translation. Therefore, appropriate translations of the translated phrases in the original sentence can be obtained based on the frequently occurring phrases (the corresponding translation of the natural translation from the original sentence is highly likely to be used as the sentence in the target language), even if only It is also difficult to identify (select) a suitable natural sentence based on the degree of consistency. Register the phrases that often appear in the same original text as the original language and refer to the table to search for the frequently occurring phrases mentioned above. According to item 11 of the scope of patent application of the present invention, there are two advantages of frequently occurring phrases found based on the search of natural sentences selected by the retrieval device. Among them, time and manpower for preparing the above-mentioned forms can be saved, and Requires storage capacity for the form. According to item 1 of the scope of patent application of the present invention, it is preferred to identify a selection

第21頁 200530856 五、發明說明(15) 擇性片語(alternative phrase)之識別裝置可以取代存 在於原始句子中之標的片語(targeted phrase),但是 不包含另外提供的檢索裝置之檢索所撿選之原始句子中之 自然句子,並且第一翻譯識別裝置識別由識別裝置所識別 的選擇性片語之翻譯,以及至少包含檢索裝置之檢索所撿 選之自然句子之間之每個翻譯片語之一自然句子之相對應 的翻譯句子中之翻譯片語,以作為至少原始句子中翻譯片 語之翻譯,如申請專利範圍第1 2項所敘述。 根據本發明之申請專利範圍第1項,一自然句子有時由檢 索裝置之檢索而撿選,而於自然句子中,原始句子之特定 片語取代另一個片語。在此,如果原始句子與自然句子之 含意相似,考慮特定的片語具有其他片語的取代性。即 使,例如與檢索裝置之檢索所撿選之自然句子具有相同的 一致程度之複數個自然句子,當用於原始句子之一特定片 語的一自然句子取代自然句子中特定片語之具有可取代之 其他片語(選擇性片語)時,因為自然句子與原始句子很 可能具有相同的含意,較佳的用以選擇自然句子。 另一方面,根據本發明之申請專利範圍第1 2項,選擇性片 語由識別裝置所識別,而選擇性片語係取代檢索裝置之檢 索所撿選之標的片語,並且不包含存在原始句子中之自然 句子之中,而第一翻譯識別裝置識別由識別裝置所識別的 選擇性片語之翻譯,以及至少包含檢索裝置之檢索所撿選 200530856 五、發明說明(16) 之自然句子之間之每個翻譯片語之一自铁 譯句子中之翻嗶Η 1 ^ ^ …、习子之相對應的 4片語,以作為至少屌私4 2山 翻譯。因此,即使σ Α+ f始句子中翻譯片語 人“一 p使,、基&致之程度也很難識j 存在而得到原始句子中之 一 原始句子之一自^< _ 1彳罩势丨古^Γ 口適的翻澤(從 言中之一句子)。 彳下為對象語 合適的自然句子,可以基於;= 選擇 而得到原始句子中之一翻譯 :口勺存在或不 •” t I 2 °口 合適的翻譯(從 本發明相關之申請專利範圍f 擇性片語之識別可以由表格中互相具有夂:;,行-Page 21 200530856 V. Description of the invention (15) Alternative phrase recognition device can replace the target phrase in the original sentence, but it does not include the search phrase The natural sentence in the selected original sentence, and the first translation recognition means recognizes the translation of the selective phrase recognized by the recognition means, and each translation phrase including at least the natural sentence selected by the retrieval means by the retrieval means The translation phrase in the corresponding translation sentence of one natural sentence is used as the translation of at least the translation sentence in the original sentence, as described in item 12 of the scope of patent application. According to item 1 of the scope of patent application of the present invention, a natural sentence is sometimes selected by a retrieval device, and in a natural sentence, a specific phrase of the original sentence replaces another phrase. Here, if the meaning of the original sentence is similar to that of the natural sentence, consider that the specific phrase has the substitution of other phrases. Even if, for example, a plurality of natural sentences having the same degree of consistency as the natural sentences picked up by the retrieval means of the retrieval device, when a natural sentence used for a specific phrase in one of the original sentences replaces a specific phrase in the natural sentence, it can be substituted. In other phrases (optional phrases), because natural sentences are likely to have the same meaning as the original sentences, it is better to choose natural sentences. On the other hand, according to item 12 of the scope of patent application of the present invention, the optional phrase is recognized by the recognition device, and the optional phrase replaces the selected target phrase by the search of the retrieval device, and does not include the existence of the original Among the natural sentences in the sentence, the first translation recognition device recognizes the translation of the optional phrase recognized by the recognition device, and at least the retrieval selected by the retrieval device 200530856 V. Description of the natural sentence of (16) One of each of the translated phrases is translated from the iron translation sentence 1 ^ ^…, the corresponding 4 phrases of Xizi, as a translation of at least the private 4 2 mountain. Therefore, even if σ Α + f starts with the phrase "a p make, the base & cause is difficult to recognize the existence of j and get one of the original sentences in the original sentence ^ < _ 1 彳Mask potential 丨 Ancient ^ Γ Translation of mouth fit (from a sentence in a sentence). His Majesty is a natural sentence suitable for the target language, which can be based on; = Selection to get one of the original sentences Translation: presence or absence of mouth ”T I 2 ° appropriate translation (from the scope of the patent application related to the invention f identification of optional phrases can have each other in the form 夂:;, line-

:^ ΐ夕; 例如,從儲存於儲存裳置之複數個 幻句 ,可以檢索一包含標的片語之自麸句子而勃 、"自缺勺早目f 子中,檢索與檢索所 遠之自…、句子具有相同的句子結構之一自然句子以及取 檢索所撿選之自然句子中之識別的片語,以作為選擇性 語,如申請專利第i 3項所敘述。並且,在此可以節省準 上述表格的時間與人力,並且也可以節省所需要儲存表 之儲存容量。 關於片語本身大量的(immeasurable amount)之背景知 識根據片語之存在而具有很多特性,很難撿選全部而將他 們依照種類(c a t e g 〇 r i e s)加以分類。根據本發明之申請 專利範圍第1 2項與第1 3項,識別相同句子結構中之可取代 之一片語以作為相同種類之一片語,基於有關片語本身之: ^ ΐ 夕; For example, from a plurality of magic sentences stored in the storage dress, you can retrieve a phrase containing the subject phrase from the bran sentence, "quote since the lack of a spoon early item f, search and retrieve far away Since ..., the sentence has a natural sentence with the same sentence structure, and retrieves the identified phrase in the selected natural sentence as an alternative language, as described in the item i 3 of the patent application. In addition, the time and manpower required for the above table can be saved, and the storage capacity of the required storage table can also be saved. A large amount of background knowledge about the phrase itself (immeasurable amount) has many characteristics based on the existence of the phrase, and it is difficult to select all and classify them according to the type (ca t e g ο r i e s). According to the application of the present invention, the items 12 and 13 of the patent scope identify a replaceable phrase in the same sentence structure as a phrase of the same kind, based on the relevant phrase itself.

第23頁 200530856 五、發明說明(17) 背景知識可以粗略得到合適的翻譯,而不需要得到依照種 類而加以分類之全部片語之背景知識。 關於本發明之申請專利範圍第1 4項之翻譯識別系統包含一 儲存裝置,用以儲存在原始語言中包含複數個字彙之複數 個自然句子,而自然句子係與原始語言中之翻譯相關聯; 一檢索裝置用以從儲存於上述儲存裝置中之複數個自然句 子中,檢索包含原始語言之原始句子中之翻譯片語;一搜 尋裝置用以搜尋用於上述翻譯片語之上述檢索裝置之檢索 所撿選之自然句子,與經常出現於原始語言中之相同句子 之經常出現的片語;以及第二翻譯識別裝置用以搜尋上述 翻譯片語與包含每個特定經常出現的片語之自然句子之翻 譯句子中之翻譯片語之經常出現的翻譯,並且用以識別所 發現經常出現的片語以作為上述原始語言中至少上述翻譯 片語之翻譯,而第二翻譯識別裝置係參照上述搜尋裝置所 發現與存在於上述原始句子之特定經常出現的片語,以及 參照包含每個上述檢索裝置之檢索所撿選的自然句子之間 之上述翻譯片語之自然句子之相對應的翻譯句子,與在自 然句子之翻譯句子中之翻譯片語之經常出現的翻譯。 根據本發明之申請專利範圍第1 4項,一儲存裝置以及所提 供之類似本發明之申請專利範圍第1項之一檢索裝置。搜 尋裝置搜尋用於翻譯片語之檢索裝置之檢索所撿選之自然 句子,以及經常出現在原始語言之相同句子中之經常出現Page 23 200530856 V. Description of the invention (17) The background knowledge can be roughly translated without the need to obtain the background knowledge of all the phrases classified according to categories. The translation recognition system for item 14 of the scope of patent application for the present invention includes a storage device for storing a plurality of natural sentences including a plurality of words in the original language, and the natural sentence system is associated with the translation in the original language; A retrieval device for retrieving translation phrases in the original sentence including the original language from the plurality of natural sentences stored in the storage device; a retrieval device for retrieving the retrieval of the retrieval device for the translation phrases The selected natural sentences are frequently-occurring phrases that are the same sentences that often appear in the original language; and the second translation recognition device is used to search for the above-mentioned translated phrases and natural sentences that include each specific frequently-occurring phrase The frequently occurring translation of the translated phrase in the translated sentence, and used to identify the frequently found phrase found as the translation of at least the above translated phrase in the original language, and the second translation recognition device refers to the above search device Certain frequently-occurring phrases found and present in the original sentence, as well as references Each of the above-mentioned retrieval means retrieves the translation sentence corresponding to the natural sentence of the above-mentioned translation phrase between the selected natural sentences, and the frequently occurring translation of the translation phrase in the translation sentence of the natural sentence. According to item 14 of the scope of patented application of the present invention, a storage device and a retrieval device provided with one of the scope of patented scope of application similar to the present invention. The search device searches for natural sentences picked up by the search device used to translate the phrase, and frequently appears in the same sentence that often appears in the original language.

第24頁 200530856 五、發明說明(18) 的片語。第二翻譯識別裝置用以搜尋翻譯片語與包含每個 特殊經常出現的片語之自然句子之翻譯句子中之翻譯片語 之經常出現的翻譯,並且識別所找到之經常出現的片語以 作為至少原始語言中翻譯片語之翻譯,而第二翻譯識別裝 置係參照上述搜尋裝置所發現與存在於上述原始句子之特 定經常出現的片語,以及參照包含每個上述檢索裝置之檢 索所撿選的自然句子之間之上述翻譯片語之自然句子之相 對應的翻譯句子,與在自然句子之翻譯句子中之翻譯片語 之經常出現的翻譯。因此,根據申請專利範圍第11項之相 同的方法,即使儲存於儲存裝置之自然句子中混合具有翻 譯片語之不合適的翻譯之自然句子,並且包含每個翻譯片 語與特定經常出現的片語,不需要受到此問題的影響,就 可以得到原始句子中翻譯片語之合適的翻譯(從原始句子 中得到高可能的自然翻譯之一翻譯,以作為原始句子中之 一句子)〇 根據本發明之申請專利範圍第1項或第1 4項,雖然翻譯片 語可以包含複數個字彙,當字彙之數目包含之翻譯片語增 加時,即使檢索裝置執行檢索之後,自然句子仍具有包含 而不能撿選所有的翻譯片語之可能性。考量此問題,如上 所述,例如申請專利範圍第1 5項,當翻譯片語係包含複數 個字彙,較佳的從儲存於儲存裝置之原始語言中複數個自 然句子之中,檢索裝置檢索包含至少包含翻譯片語之複數 個字彙中之一個之自然句子(也就是說,檢索包含全部翻Page 24 200530856 V. The phrase of invention description (18). The second translation recognition device searches for frequently occurring translations of the translated phrase and the translated phrase in the natural sentence of each particular frequently occurring phrase, and recognizes the frequently occurring phrase found as At least the translation of the translation phrase in the original language, and the second translation recognition device refers to the specific frequently-occurring phrase found by the search device and existing in the original sentence, and refers to the search that includes each of the search devices. The translated sentences corresponding to the natural sentences of the above-mentioned translation phrases between natural sentences and the frequently occurring translations of the translated phrases in the translated sentences of natural sentences. Therefore, according to the same method as the item 11 of the scope of patent application, even if the natural sentences stored in the storage device are mixed with inappropriate translated natural sentences with translation phrases, and each translation phrase is included with a specific frequently-occurring phrase You do not need to be affected by this problem to get a proper translation of the translated phrase in the original sentence (one of the high-potential natural translations is obtained from the original sentence as a sentence in the original sentence). The scope of patent application for the invention is item 1 or item 14. Although the translation phrase can include multiple words, when the number of words included in the word phrase increases, even after the retrieval device performs the search, the natural sentence still contains and cannot The possibility to pick all the translated phrases. Considering this problem, as described above, for example, in the case of the 15th patent application, when the translation phrase contains a plurality of words, it is better to retrieve a plurality of natural sentences from the original language stored in the storage device. A natural sentence that contains at least one of the plural words of the translation phrase (that is, the search contains all translations

第25頁 200530856 五、發明說明(19) 譯片語之自然句子,與包含一部分翻譯片語之自然句子 )。因此,明確說明片語包含很多的字彙以作為翻譯片 語,即使包含全部翻譯片語之自然句子不儲存於儲存裝 置,可以至少從翻譯片語之相對應的翻譯中檢索裝置之檢 索而得到自然句子(相對應的翻譯句子)。 關於根據本發明之申請專利範圍第1 6項之翻譯識別方法, 包含第一步驟,從包含儲存於儲存裝置之複數個字彙之自 然句子中,用以檢索原始語言之原始句子中包含翻譯片語 之自然句子,而自然句子係與對象語言之翻譯相關聯;以 及第二步驟,在上述第一步驟中,用以決定檢索所撿選之 自然句子與上述原始句子之間之一致之程度,並且用以識 別至少在基於上述決定之一致的程度而選擇之自然句子之 相對應的翻譯中之上述翻譯片語之翻譯,以作為原始句子 中上述翻譯片語之翻譯。因此,根據本發明之申請專利範 圍第1項之相同的方法,可以得到原始句子中翻譯片語之 合適的翻譯(從原始句子中得到一自然翻譯之高可能的翻 譯,以作為對象語言中之一句子)。 關於根據本發明之申請專利範圍第1 7項之翻譯識別方法, 包含第一步驟,從包含儲存於儲存裝置之複數個字彙之原 始語言之自然句子中,用以檢索原始語言之原始句子中包 含翻譯片語之自然句子,自然句子係與對象語言之翻譯相 關聯;第二步驟,用以搜尋在上述用於翻譯片語,以及原Page 25 200530856 V. Description of the invention (19) Natural sentences of translated phrases and natural sentences containing a part of translated phrases). Therefore, it is clearly stated that the phrase contains a lot of vocabulary as a translation phrase, and even if a natural sentence including all the translated phrases is not stored in the storage device, it can be retrieved by at least the retrieval of the device from the corresponding translation of the translated phrase to obtain the natural Sentences (corresponding translated sentences). The translation recognition method according to item 16 of the scope of patent application of the present invention includes a first step, from a natural sentence containing a plurality of vocabularies stored in a storage device, to retrieve a translation phrase contained in the original sentence of the original language A natural sentence, which is related to the translation of the target language; and a second step, in the first step described above, to determine the degree of agreement between the retrieved natural sentence and the original sentence, and To identify the translation of the above translation phrase in the corresponding translation of at least the natural sentence selected based on the degree of consistency of the above decisions, as the translation of the above translation phrase in the original sentence. Therefore, according to the same method of the first patent application scope of the present invention, a proper translation of the translated phrase in the original sentence can be obtained (a natural translation with a high probability of translation is obtained from the original sentence as the target language. A sentence). Regarding the translation recognition method according to item 17 of the scope of patent application of the present invention, the method includes a first step, from a natural sentence containing an original language of a plurality of vocabularies stored in a storage device, and the original sentence used to retrieve the original language includes Translate the natural sentences of the phrases. The natural sentences are related to the translation of the target language. The second step is to search for the phrases used in the translation as well as the original.

第26頁 200530856 五、發明說明(20) 始語言之經常出現於相同句子之經常出現的片語之第一步 驟中檢索裝置之檢索所撿選之自然句子;以及第三步驟用 以搜尋上述翻譯片語與用於包含於每個上述特定經常出現 的片語之自然句子之相對應的翻譯句子中之翻譯片語之經 常出現的翻譯,並且識別所發現經常出現的翻譯,以作為 原始句子中上述翻譯片語之翻譯,而第三步驟係參照上述 第二步驟中發現之特定經常出現的片語與存在於上述第一 步驟中之檢索所撿選之自然句子之間之上述原始句子中, 並且參照每個包含上述翻譯片語之自然句子之相對應的翻 譯。因此,根據申請專利範圍第1 4項之相同的方法,可以 得到原始句子中翻譯片語之合適的翻譯(從原始句子中得 到一自然翻譯之高可能的翻譯,以作為對象語言中之一句 子)。 關於根據本發明之申請專利範圍第1 8項之程式,允許一電 腦連接到一用於儲存包含複數個字彙之原始語言中自然句 子之儲存裝置,從儲存於上述儲存裝置之原始語言之複數 個自然句子中,用以儲存包含複數個字彙之原始語言中之 複數個自然句子,上述字彙具有對象語言中相對應的翻 譯,例如用以檢索包含於原始語言之原始句子中翻譯片語 之自然句子之一檢索裝置之功能,並且第一翻譯識別裝置 用以決定在檢索裝置之檢索所撿選之自然句子與上述原始 句子之間之一致之程度,以及用以識別至少在基於決定之 一致程度而選擇之自然句子之翻譯中的上述翻譯片語之翻Page 26 200530856 V. Description of the invention (20) The first step in the first language that often appears in the same sentence often appears in the first step of the retrieval device to retrieve the selected natural sentence; and the third step is to search for the above translation Phrases and frequently used translations of the translated sentences used in the corresponding translations of the natural sentences contained in each of the above-mentioned specific frequently-occurring phrases, and identify the frequently-occurring translations found as the original sentences The translation of the above-mentioned translation phrase, and the third step refers to the above-mentioned original sentence between the specific frequently-occurring phrase found in the above-mentioned second step and the natural sentence existing in the retrieval and selection in the above-mentioned first step, And refer to the corresponding translation of each natural sentence containing the above translation phrase. Therefore, according to the same method as in item 14 of the scope of the patent application, a suitable translation of the translated phrase in the original sentence can be obtained (a natural translation with a high probability of translation is obtained from the original sentence as a sentence in the target language ). Regarding the program according to item 18 of the scope of patent application according to the present invention, a computer is allowed to be connected to a storage device for storing natural sentences in an original language containing a plurality of vocabularies, and from the plurality of original languages stored in the above storage device A natural sentence is used to store a plurality of natural sentences in an original language including a plurality of vocabularies, and the above vocabulary has a corresponding translation in the target language, such as a natural sentence used to retrieve the translated phrase contained in the original sentence of the original language One of the functions of a retrieval device, and the first translation recognition device is used to determine the degree of agreement between the natural sentence selected by the retrieval of the retrieval device and the original sentence, and to identify at least the degree of agreement based on the decision Translation of the above translations in the translation of natural sentences of choice

第27頁 200530856 五、發明說明(21) 譯,以作為至少上述原始句子中之上述翻譯片語之翻譯。 關於根據本發明之申請專利範圍第1 8項之程式,使一電腦 連接到用於儲存原始語言中包含複數個字彙之自然句子之 一儲存裝置之一程式,上述字彙係具有對象語言中相對應 的翻譯,例如上述檢索裝置與第一翻譯識別裝置之功能。 因此,當上述電腦執行關於本發明之根據申請專利範圍第 1 8項之程式時,電腦具有如根據申請專利範圍第1項之翻 譯識別裝置之功能。因此,根據本發明之申請專利範圍第 1項,可以得到原始句子中翻譯片語之合適的翻譯(從原 始句子中得到一自然翻譯之高可能的翻譯,以作為對象語 言中之一句子)。 關於根據本發明之申請專利範圍第1 9項之程式,使一電腦 連接到一用於儲存包含複數個字彙之原始語言中自然句子 之儲存裝置,從儲存於上述儲存裝置之原始語言之複數個 自然句子中,用以儲存包含複數個字彙之原始語言中之複 數個自然句子,上述字彙具有對象語言中相對應的翻譯, 例如用以檢索包含於原始語言之原始句子中翻譯片語之自 然句子之一檢索裝置之功能,以及一搜尋裝置用以搜尋由 用於上述翻譯片語與經常出現在原始語言之相同句子中之 經常出現的片語之上述檢索裝置之檢索所撿選之自然句 子,並且一第二翻譯識別裝置用以搜尋上述翻譯片語與用 於每個包含上述經常出現的片語之自然句子之相對應的翻Page 27 200530856 V. Description of the invention (21) Translation as a translation of at least the above-mentioned translation phrase in the original sentence. Regarding the program according to item 18 of the scope of patent application according to the present invention, a computer is connected to a program for storing a natural sentence containing a plurality of vocabularies in the original language. Translation, such as the functions of the retrieval device and the first translation recognition device. Therefore, when the above-mentioned computer executes the program according to item 18 of the scope of patent application for the present invention, the computer has the function of a translation recognition device according to the item 1 of scope of patent application. Therefore, according to the first patent application scope of the present invention, a proper translation of the translated phrase in the original sentence can be obtained (a natural translation with a high possibility of translation is obtained from the original sentence as one sentence in the target language). Regarding the program according to item 19 of the scope of patent application of the present invention, a computer is connected to a storage device for storing natural sentences in an original language including a plurality of words, and the plurality of original languages stored in the storage device are Natural sentences are used to store plural natural sentences in the original language containing plural words. The above words have corresponding translations in the target language, for example, natural sentences used to retrieve the translated phrases contained in the original sentence of the original language. A function of a retrieval device, and a search device for searching for a natural sentence selected by the retrieval of the retrieval device used for the above-mentioned translated phrase and a phrase frequently appearing in the same sentence that often appears in the original language, And a second translation recognition device is used to search for the translation of the above-mentioned translated phrases and corresponding to each natural sentence containing the above-mentioned frequently-occurring phrases.

第28頁 200530856 五、發明說明(22) 譯句子中之翻譯片語之經常出現的翻譯,並且用以識別所 找到之經常出現的翻譯以作為上述原始句子中上述翻譯片 語之翻譯,而第二翻譯識別裝置係參照上述搜尋裝置發現 與存在於上述檢索裝置之檢索所撿選之自然句子之間之上 述原始句子之特定經常出現的片語,並且參照每個包含上 述翻譯片語之自然句子之相對應的翻譯。 關於根據本發明之申請專利範圍第1 9項之程式,使一電腦 連接到一用於儲存原始語言中包含複數個字彙之自然句子 之一儲存裝置之程式,上述字彙具有對象語言中相對應的 翻譯(建立具備儲存裝置之電腦,或透過電性連接線連接 到另一電腦),例如上述檢索裝置、搜尋裝置以及第二翻 譯識別裝置之功能。因此,當上述電腦執行關於根據本發 明之申請專利範圍弟1 9項之程式時,電腦具有例如根據本 發明之申請專利範圍第1 4項之翻譯識別裝置之功能。因 此,根據本發明之申請專利範圍第1 4項之相同方法,可以 得到原始句子中翻譯片語之合適的翻譯(從原始句子中得 到一自然翻譯之高可能的翻譯,以作為對象語言中之一句 子)。 如上所述,本發明從儲存於儲存裝置之原始語言之複數個 自然句子中,檢索原始語言之原始句子中包含翻譯片語之 自然句子’決定檢索所檢選的自然句子與原始句子之間之 一致之程度,並且至少識別基於所決定之一致程度而選擇Page 28 200530856 V. Description of the invention (22) Frequently occurring translations of translated phrases in translated sentences, and used to identify the frequently occurring translations found as translations of the aforementioned translated phrases in the original sentence, and The second translation recognition device refers to a specific frequently occurring phrase of the original sentence that is found between the search device and the natural sentence that exists in the search and retrieval of the search device, and refers to each natural sentence that includes the translated phrase. Corresponding translation. Regarding the program according to item 19 of the scope of patent application of the present invention, a program for connecting a computer to a storage device for storing a natural sentence containing a plurality of vocabulary in the original language, the above vocabulary has the corresponding Translation (creating a computer with a storage device, or connecting to another computer through an electrical cable), such as the functions of the above-mentioned retrieval device, search device, and second translation identification device. Therefore, when the above-mentioned computer executes the program concerning the item 19 of the scope of patent application according to the present invention, the computer has a function of a translation recognition device according to the item 14 of the scope of patent application of the present invention, for example. Therefore, according to the same method as in item 14 of the scope of patent application of the present invention, a proper translation of the translated phrase in the original sentence can be obtained (a highly probable translation of natural translation is obtained from the original sentence as the target language. A sentence). As described above, the present invention retrieves natural sentences containing translation phrases from the original sentences in the original language from the plurality of natural sentences stored in the storage device to determine the search between the selected natural sentence and the original sentence. The degree of agreement, and at least identify the choice based on the degree of agreement determined

第29頁 200530856 五、發明說明(23) 之自然句子之翻譯中翻譯片語之翻譯,以作為至少原始語 言中翻譯片語之翻譯。因此,本發明有一特別的效用 (e X c e 1 1 e n t e f f e c t),其中在很可能得到自然翻譯句子 的情況,從原始語言之原始句子中可以得到原始句子中翻 譯片語之合適的翻譯。 此外,本發明搜尋在每個包含特定經常出現的片語以及經 常出現的翻譯之自然句子之相對應的翻譯中之翻譯片語, 並且從儲存於儲存裝置之原始語言中之複數個自然句子之 中,檢索原始語言之原始句子中包含翻譯片語之自然句子 而識別所發現的經常出現的翻譯,以作為原始句子中上述 翻譯片語之翻譯,以及搜尋用於翻譯片語以及經常出現於 原始語言之相同句子中經常出現的片語之檢索所撿選之自 然句子,並且參照每個包含特定經常出現的片語以及原始 句子中翻譯片語之自然句子之相對應的翻譯。 因此,本發明具有一很好的效用(excellent effect), 其中在很可能得到自然翻譯句子的情況下,從原始語言之 原始句子中可以得到原始句子中翻譯片語之合適的翻譯。 【實施方式】 參考圖示詳細敘述本發明之一最佳實施例。首先,敘述一 裝置運用本發明於一電子辭典系統,當使用者翻譯指定片 語時提供使用者具有翻譯一指定片語之翻譯。Page 29 200530856 V. The translation of the translation phrase in the translation of the natural sentence of the description of the invention (23), as a translation of the translation phrase in at least the original language. Therefore, the present invention has a special utility (e X c e 1 1 e n t e f f e c t), in which it is possible to obtain a naturally translated sentence, and from the original sentence of the original language, an appropriate translation of the translated phrase in the original sentence can be obtained from the original sentence of the original language. In addition, the present invention searches for a translation phrase in each corresponding translation including a certain frequently occurring phrase and a frequently occurring translated natural sentence, and from the plurality of natural sentences stored in the original language stored in the storage device, In the original sentence, the original sentence of the original language is searched for natural sentences containing the translation phrase to identify the frequently occurring translations found as the translation of the above translation phrase in the original sentence, and the search is used to translate the phrase and often appear in the original Retrieve selected natural sentences that are frequently found in the same sentence of the language, and refer to the corresponding translation of each natural sentence that contains the specific frequently-occurring phrase and the translated phrase in the original sentence. Therefore, the present invention has an excellent effect, in which a natural translation sentence is likely to be obtained, and a proper translation of the translation phrase in the original sentence can be obtained from the original sentence in the original language. [Embodiment] A preferred embodiment of the present invention will be described in detail with reference to the drawings. First, a device is described that uses the present invention in an electronic dictionary system to provide a user with a translation that translates a designated phrase when the user translates the designated phrase.

第30頁 200530856 五、發明說明(24) 圖一顯示一個人電腦(PC) 1 0能夠具有上述電子辭典之功 月匕°個人電月向1〇包含中央處理機(central processing unit· CPU) 1〇Α、唯讀記憶體(Reacj-〇nly memory: ROM )10B、I1远機存取記憶體(rand〇m-access memory: RAM) ioc以及輸入/輸出埠(port) i〇D,並且透過匯流排(bus )1 〇 E互相連結以裝配一個人電腦,其匯流排包含資料 (data)匯流排、位址(address)匯流排以及控制 (control)匯流排等。 顯示為(display)包含陰極射線管(Cathode Ray Tube· CRT)與液晶顯示器(Liquid Crystal Display : LCD)等’透過使用者輸入資料到鍵盤(keyboard) 14 專’滑鼠(mouse) 16、硬碟裝置(hard disk drive: HDD) 18、CD唯讀記憶體(CD-ROM) 2 4讀取資料之CD唯讀 吕己憶體裝置(C D - R 0 M d r i v e) 2 0以及能夠讀取文件資料之 掃描器2 2連結到輸入/輸出琿1 〇 D,例如很多輸入/輸出裝 置。 一翻譯識別程式(相對應於敘述於申請專利範圍丨8與丨9中 之程式)使個人電腦1 〇具有裝配於個人電腦1 〇之硬體裝置 之電子辭典系統之功能。儲存上述翻譯識別程式之資料之 一相對應的翻譯資料庫(corresponding translation database)用來識別儲存於硬體裝置1 8中之合適翻譯◦有Page 30 200530856 V. Description of the invention (24) Figure 1 shows that a personal computer (PC) 10 can perform the functions of the electronic dictionary described above. The personal electrical month direction 10 includes a central processing unit (CPU) 1〇 Α, read-only memory (Reacj-〇nly memory: ROM) 10B, I1 remote machine access memory (random-access memory: RAM) ioc and input / output port (port) i〇D, and through the convergence The bus 10E is connected to each other to assemble a personal computer, and the bus includes a data bus, an address bus, and a control bus. The display includes a cathode ray tube (CRT), a liquid crystal display (LCD), and the like. 'Input data to the keyboard through the user's keyboard 14' 'mouse' 16, hard disk Device (hard disk drive: HDD) 18. CD read-only memory (CD-ROM) 2 4 CD-only read-only device (CD-R 0 M drive) 2 0 and read data The scanner 22 is connected to the input / output unit 100D, for example, many input / output devices. A translation recognition program (corresponding to the program described in the patent application range 丨 8 and 丨 9) enables the personal computer 10 to have the function of an electronic dictionary system equipped with a hardware device of the personal computer 10. A corresponding translation database is used to store the corresponding translation identification program data, which is used to identify the appropriate translation stored in the hardware device 18

第31頁 200530856 五 、發明說明 (25) 數 個 安 裝 ( 轉 移 ) 翻 譯 識 別 程 式 到 個 人 電 腦 1 0的 裝 置 〇 例 如 事 先 紀 錄 翻 譯 識 別 程 式 與 CD唯 讀 記 憶 體 2 4中 之 設 定 程 式 而 安 裝 翻 譯 識 別 程 式 0 將 CD唯 石貝 記 憶 體 2 4插 入 CD唯 讀 記 憶 體 裝 置 2 0之 中 並 且 命 令 中 央 處 理 器 1 ( 〕A執行上述設 定 程 式 以 從 CD唯 讀 記 憶 體 2 4中 順 序 5貝 取 翻 譯 識 別 程 式 並 且 在 硬 體 裝 置 3中 順 序 書 寫 所 讀 取 翻 譯 識 別 程 式 〇 相 對 應 的 翻 譯 資 料 庫 可 以 儲 存 在 硬 體 裝 置 18中 例 如 事 先 紀 錄 在 CD隨 機 存 取 記 憶 體 24中 相 對 應 翻 譯 資 料 庫 , 並 且 當 安 裝 翻 譯 識 別 程 式 時 更 改 設 定 程 序 以 便 同 時 在 硬 體 裝 置 18中 書 寫 相 對 應 翻 譯 資 料 庫 〇 複 數 個 字 彙 組 成 之 缺 句 子 與 用 原 始 語 言 敘 述 之 大 量 原 文 資 料 ( 句 子 子 句 片 語 刻 板 措 辭 以 及 慣 用 語 中 之 任 一 種 其 未 經 編 輯 或 處 理 如 其 後 所 敘 述 之 特 定 的 例 子 例 如 字 彙 相 除 、 多 意 字 彙 之 撿 選 等 ) 具 有 對 象 語 中 所 敘 述 之 相 對 應 原 文 資 料 之 翻 譯 ( 這 些 翻 譯 亦 未 經 編 輯 或 處 理 之 白 句 子 例 如 字 彙 相 除 多 意 字 彙 之 撿 選 等 如 其 後 所 敘 述 之 特 定 的 例 子 等 ) 係 登 錄 於 相 對 應 的 翻 譯 資 料 庫 〇 在 此 最 佳 實 施 例 中 5 硬 碟 裝 置 18儲 存 相 對 應 於 有 關 本 發 明 之 儲 存 裝 置 之 相 對 應 的 翻 譯 資 料 庫 0 在 紀 錄 媒 體 ( recording med i .um) 上 紀 錄 相 對 應 的 翻 譯 資 料 庫 用 以 使 用 相 對 應 的 翻 譯 資 料 庫 例 如 CD唯 5貝 記 憶 體 2 4與 DVD唯讀記憶體| (DVD-ROM) 並 且 從 紀 錄 媒 體 中 直 接Page 31, 200530856 V. Description of the invention (25) Several devices that install (transfer) translation recognition programs to personal computer 10, such as recording translation recognition programs and setting programs in CD read-only memory 24 and installing translation recognition in advance Program 0: Insert CD CD-ROM memory 2 4 into CD read-only memory device 2 0 and instruct CPU 1 () A to execute the above-mentioned setting procedure to obtain translation recognition from CD read-only memory 2 4 in sequence. Programs and sequentially write the read translation recognition programs in the hardware device 3. The corresponding translation database can be stored in the hardware device 18, such as the corresponding translation database recorded in the CD random access memory 24 in advance, and When the translation recognition program is installed, the setting program is changed so that the corresponding translation database is written in the hardware device 18 at the same time. Lack of sentences and a large amount of original materials narrated in the original language (sentences, phrase phrases, stereotyped words, and any of the idiomatic phrases that have not been edited or processed. Specific examples as described later such as word division, polysemous word picking (Selection, etc.) Translations of the corresponding source materials described in the target language (white sentences such as translations that have not been edited or processed, such as the selection of vocabulary, multi-word vocabulary, etc., as described in specific examples, etc.) Registered in the corresponding translation database. 0 In this preferred embodiment, the hard disk device 18 stores the corresponding translation database corresponding to the storage device of the present invention. 0 On a recording medium (recording med i.um). The corresponding translation database is used to use the corresponding translation database, such as CD 5 MB memory 2 4 and DVD Read memory | (DVD-ROM) and discipline record from the media and in direct

第32頁 200530856 五、發明說明(26) ^ - 讀取相對應的翻譯資料庫。因此在此最佳實施例中,媒體 儲存相對應的翻譯資料庫之功能,例如關於本 二 _ , + %明之儲存 衣置。此外’原始語言與對象語言中本身相對應的翻譯之 中的字彙係登錄於有關此最佳實施例之相對應的翻$資料 庫。增加自然句子以及本身的翻譯可以創造有關^最^實 施例之相對應的翻譯資料庫,例如需要存在於辭典=^二 中,舉例而言在原始語言中的字彙與本身相對應的翻譯。 在此敘述最佳實施例之效用。在一實施例中,卷 田敘述於原 始語言之原始句子中發現之敘述於對象語言中的翻譯之— 片語存在時(翻譯片語可以是字彙,或原始句子中包含複 數個連續排列的字彙之片語),使用者透過個人電腦丨⑽丸 行特定的操作用以控制原始句子中翻譯片語之翻譯< I 出。可以於個人電腦1 〇中讀取原文,例如原文資料(例如 使用者透過鍵盤1 4輸入之原文、使用字彙處理軟體創造之 原文,透過網際網路於網頁上瀏覽之原文、掃描器由、紀錄 字元(character)資料庫而得到之原文,以及光學字元 辨識(Optical Character Recognition: OCR)系統等) 處理之資料),以作為原始句子。 最佳實施例的操作,例如正確選擇一翻譯片語所顯示之内 容選單,相對地(r e v e r s e 1 y)顯示翻譯片語之後,選擇 在此情況中的翻譯片語而使用選擇一相對應的項目「相對 應的翻譯輸出(corresponding translation output)」Page 32 200530856 V. Description of the invention (26) ^-Read the corresponding translation database. Therefore, in this preferred embodiment, the function of the corresponding translation database is stored in the media, for example, about this two _, +% storage of clothes. In addition, the words in the translation corresponding to the original language and the target language itself are registered in the corresponding translation database of this preferred embodiment. Increasing the translation of natural sentences and its own translation can create a corresponding translation database for the ^ most ^ embodiment, for example, it needs to exist in the dictionary = ^ 二, for example, the translation of the vocabulary in the original language corresponding to itself. The effect of the preferred embodiment is described here. In one embodiment, the translation of Narada in the original sentence of the original language and the translation of the narrative in the target language—the phrase exists (the translation phrase can be a vocabulary, or the original sentence contains multiple consecutively arranged vocabularies Phrase), the user uses the personal computer to perform specific operations to control the translation of the translated phrase in the original sentence. Can read the original text in personal computer 10, such as the original text data (such as the original text entered by the user through the keyboard 14, the original text created using word processing software, the original text browsed on the Internet through the Internet, scanner origin, records The original text obtained from the character database and the data processed by the Optical Character Recognition (OCR) system, etc.) as the original sentence. The operation of the preferred embodiment, for example, correctly selects a content menu displayed in a translation phrase, and after displaying the translation phrase (reverse 1 y), selects a translation phrase in this case and selects a corresponding item using "Corresponding translation output"

第33頁 200530856 五、發明說明(27) 之操作,例如原始句子係顯示於顯示器1 2之上。也可以執 行一操作以明確說明(s p e c i f y)翻譯片語之翻譯之輸 出,例如選擇顯示於所需要的圖像(i con),舉例而言, 如上所述之使用内容選單内之工具列。當執行上述的操作 用以明確說明翻譯片語之翻譯之輸出,個人電腦1 0之中央 處理機1 0 A執行翻譯識別程式,以執行翻譯識別程序,如 圖2 A與2 B所顯示。翻譯識別程序係處理關於申請專利範圍 第1 6項與第1 7項所提供敘述之翻譯識別方法。執行此程 序,個人電腦1 0具有一電子辭典系統之功能(申請專利範 圍第1項與第1 4項敘述之翻譯識別系統)。 翻譯識別程序中,包含特定的翻譯片語之一單一原始句子 中之原文資料(處理過的原始句子)於步驟1 0 0中係為首 要,需要用以識別處理所需原始句子中的翻譯片語之資 料。處理過的原始句子可以是一包含翻譯片語,或包含一 子句、一片語、一刻板措辭,或一包含翻譯片語的慣用語 中任一種之句子。當假定一子句、一片語、一刻板措辭, 或一包含翻譯片語的慣用語中任一種為一處理過的原始句 子時,由使用者或透過翻譯識別程序自動決定,使子句、 片語、刻板措辭,或慣用語作為明確說明處理過的原始句 子。 在下一步驟1 0 2中,從登錄於相對應的翻譯資料庫之自然 句子中,使用步驟1 0 0中所需處理過的原始句子之原文資Page 33 200530856 V. Operation of invention description (27), for example, the original sentence is displayed on the display 1 2. It is also possible to perform an operation to specify (spe c c fy) the output of the translation of the translation phrase, such as selecting to display the desired image (i con), for example, using the toolbar in the content menu as described above. When the above operation is performed to clearly explain the translation output of the translation phrase, the central processing unit 10 A of the personal computer 10 executes the translation recognition program to execute the translation recognition program, as shown in FIGS. 2A and 2B. The translation recognition program deals with the translation recognition methods described in items 16 and 17 of the scope of patent application. By executing this procedure, the personal computer 10 has the function of an electronic dictionary system (the translation identification system described in the patent application scopes 1 and 14). In the translation recognition program, the original data (processed original sentence) in a single original sentence containing one of the specific translation phrases is given priority in step 100, and it is needed to identify the translation piece in the original sentence required for processing Language information. The processed original sentence can be a sentence containing a translation phrase, or a clause, a phrase, a stereotype, or a phrase containing a translation phrase. When it is assumed that any one of a clause, a phrase, a stereotyped phrase, or a idiom containing a translated phrase is a processed original sentence, it is automatically determined by the user or through a translation recognition program, so that the clause, the phrase Words, stereotyped words, or idiomatic phrases as clear explanations of the original sentence processed. In the next step 102, from the natural sentences registered in the corresponding translation database, use the original text of the original sentence processed in step 100.

第34頁 200530856 五、發明說明(28) 料,檢索一完全與處理過的原始句子一致之自然句子以作 為一關鍵(k e y),並且從登錄於相對應的翻譯資料庫之 自然句子中,使用步驟1 0 0中所需處理過的原始句子之原 文資料以作為一關鍵(k e y),並且檢索一包含翻譯片語 之自然句子(包含部分與原始句子一致之自然句子’因為 至少包含翻譯片語)。當翻譯片語係包含複數個字彙時, 自然句子包含至少同時檢索包含翻譯片語之複數個字彙中 任一種。步驟1 0 2相當於申請專利範圍第1項、第1 4項、第 1 8項,以及第1 9項所敘述之檢索裝置,也相當於申請專利 範圍第1 6項與第1 7項所敘述之第一步驟。步驟1 0 4與其後 的步驟,相當於申請專利範圍第1項與第1 8項敘述之第一 翻譯識別裝置。 步驟1 0 4中,詢問是否自然句子與步驟1 0 2之檢索之相對應 的翻譯資料庫中所撿選之翻譯的原始句子完全一致。如果 答案是肯定(positive),程式移至步驟106,以讀取與 處理過的原始語言一致之自然句子相關聯之對象語言中之 相對應的翻譯,並且登錄於相對應的翻譯資料庫,以及在 顯示器1 2上顯示相對應的翻譯以結束此程序。當顯示這些 相對應的翻譯時,讀取翻譯係搜尋翻譯片語之翻譯,並且 使得發現之翻譯片語之翻譯顯得重要(h 1 gh 1 1 gh t e d)。 在此例子中,使用者可以了解特定的翻譯片語之合適的翻 譯(從原始句子中得到一自然翻譯之具有高可能性之翻 譯,以作為對象語言中之一句子),並且可以了解包含特Page 34 200530856 V. Description of the invention (28) material, retrieve a natural sentence completely consistent with the processed original sentence as a key, and use it from the natural sentence registered in the corresponding translation database. The original data of the original sentence that needs to be processed in step 100 is used as a key, and a natural sentence containing the translated phrase (including a natural sentence partially consistent with the original sentence 'is included because it contains at least the translated phrase ). When the translation phrase contains plural words, the natural sentence contains at least one of the plural words containing the translated phrase to be retrieved at the same time. Step 102 corresponds to the search devices described in items 1, 14 and 18 of the scope of patent application, and item 19, and also corresponds to the items 16 and 17 of the scope of patent application. The first step of the narrative. Steps 104 and subsequent steps are equivalent to the first translation recognition device described in items 1 and 18 of the scope of patent application. In step 104, it is asked whether the natural sentence is exactly the same as the original sentence of the translation selected in the corresponding translation database retrieved in step 102. If the answer is positive, the program moves to step 106 to read the corresponding translation in the target language associated with the natural sentence consistent with the processed original language, and register it in the corresponding translation database, and The corresponding translation is displayed on the display 12 to end the procedure. When these corresponding translations are displayed, reading translation is searching for translations of translation phrases, and making translations of translations found to be important (h 1 gh 1 1 gh t e d). In this example, the user can understand the appropriate translation of a specific translation phrase (getting a natural translation from the original sentence with a high probability of translation as a sentence in the target language), and can understand the inclusion of special features

第35頁 200530856 五、發明說明(29) 定的翻譯片語之處理過的原始句子之合適的翻譯(自然的 翻譯句子,以作為於原始語言中之句子)。上述的步驟 1 0 4與1 0 6相當於如申請專利範圍第2項所敘述之第一翻譯 識別裝置。 (例子一) 上述的程序係更使用真實的例子來敘述。假設一原始語言 係英文,以及一對象語言係日文,當包含在一處理過的原 始句子中之「為了你的安全(For your safety)」明確 說明「為了你的安全,不要倉促衝進火車(F o r y o u r safety, don’t rush into the train)」以作為番il 譯片 語,因為原始語言以及相對應的翻譯係由一般翻譯程序中 之字彙來登錄字彙,對照(a g a i n s t)包含上述翻譯片語 之「安全(safety)」而選擇翻譯「anzen」。然而,在 關於本實施例之翻譯識別程序之中,因為原始語言中自然 句子以及對象語言中相對應的翻譯係登錄於相對應的翻譯 資料庫,完全與上述原始句子一致之自然句子係具有登錄 於相對應的翻譯資料庫之可能性。當自然句子與上述原始 句子完全一致時,登錄於與自然句子有關之相對應的翻譯 資料庫之翻譯係隨重要的翻譯片語之翻譯而顯示,如下列 所顯示: r Kiken desu kara ? kakekomi-josha wa oyame kudasa i」。 在一般翻譯程序中,很難去選擇「k i k e n」以作為Page 35 200530856 V. Description of the invention (29) Appropriate translation of the processed original sentences (naturally translated sentences as sentences in the original language). The above steps 104 and 106 are equivalent to the first translation recognition device described in item 2 of the scope of patent application. (Example 1) The above program is described using real examples. Suppose an original language is English and an object language is Japanese. When included in a processed original sentence, "For your safety" clearly states "For your safety, don't rush into the train ( Foryour safety, don't rush into the train) "as the translation phrase, because the original language and the corresponding translation are registered by the vocabulary in the general translation program, and the comparison contains the above translation phrase "Safety" and choose to translate "anzen". However, in the translation recognition program of this embodiment, since the natural sentences in the original language and the corresponding translation systems in the target language are registered in the corresponding translation database, the natural sentence systems that are completely consistent with the original sentences have registrations. The corresponding translation database. When the natural sentence is exactly the same as the original sentence, the translation registered in the translation database corresponding to the natural sentence is displayed with the translation of the important translation phrase, as shown below: r Kiken desu kara? Kakekomi- josha wa oyame kudasa i ". It ’s difficult to choose "ki i k e n" as

第36頁 200530856 五、發明說明(30) 「s a f e t y」字彙之相對應的翻譯,並且只可以得到一不自 ,然白勺翻5 譯,Ϊ列士口 「Anata no anzen no tame, sono ressha n i kakekomanaide kudasai」。另一方面,本發 明提供如母語(na t i v e 1 a n g u ag e)之自然翻譯。 當步驟1 0 6中顯示翻譯時,除了與處理過的原始句子完全 一致之自然句子之外,還可以顯示與步驟1 〇 2中之檢索所 撿選之處理過的原始句子部分一致之自然句子。與處理過 的原始句子部份一致之複數個自然句子一般係由檢索而撿 選。當顯示這些,為了顯示在顯示器丨2上一致之程度之遞 減的(d e s c e n d i n g)次序之每個自然句子之翻譯,較佳的 用於與處理過的原始句子部分一致之個別的自然句子,在 下列敘述中計算一致之程度。 另一方面’當在相對應的翻譯資料庫中步驟1 〇 2之檢索中 未發現與處理過的原始句子完全一致之自然句子時,在步 驟1 0 4中問述之答案係為否定。然後,程式移至步驟1 q 8, 在自然句子之子彙之間用以計算與原始句子之字彙中任一 種一致之子彙之數目(一致字彙之數目),以用於步驟 1 0 2中之檢索之相對應的翻譯中所撿選的自然句子,並且 基於計异一致字彙之數目之結果而計算與原始句子一致之 程度。 在本貫施例中,當裝配翻譯識別程式時,經常出現於原始P.36 200530856 V. Description of the invention (30) The corresponding translation of the word "safety", and you can only get a disapproval, but you can turn 5 translations. kakekomanaide kudasai ". On the other hand, the present invention provides a natural translation such as a native language (na t i v e 1 a n g u ag e). When the translation is displayed in step 106, in addition to the natural sentences that are completely consistent with the processed original sentence, it is also possible to display the natural sentences that are consistent with the processed original sentence portion selected in the search in step 102. . A plurality of natural sentences that are consistent with the processed original sentence part are generally selected by retrieval. When these are displayed, in order to show the translation of each natural sentence in a descending order of consistency on the display, it is preferred to use individual natural sentences that are consistent with the processed original sentence portion. The degree of agreement is calculated in the narrative. On the other hand, when a natural sentence completely identical to the processed original sentence is not found in the search of the corresponding translation database in step 102, the answer to the question in step 104 is negative. Then, the program moves to step 1 q 8 to calculate the number of sub-sinks (the number of consistent vocabularies) consistent with any of the vocabulary of the original sentence between the sub-sinks of the natural sentence for the retrieval in step 102 Corresponding natural sentences are selected in the corresponding translation, and the degree of agreement with the original sentence is calculated based on the result of calculating the number of different consistent words. In the present embodiment, when the translation recognition program is assembled, it often appears in the original

第37頁 200530856 五、發明說明(31) 語言中原始句子中之字彙(例如,在自然英文句子中之 「一 (a)」、「這(the)」、「以(to)」,以及「之 中(i η)」)於經常出現的字彙表格中,亦儲存於硬碟裝 置1 8。步驟1 0 8計算一致之數目同時參照經常出現的字彙 表格,以排除登錄於所計算之一致的字彙之數目中之經常 出現的字彙表格中之字彙。因此,可以避免在一致的字彙 之數目上經常出現的字彙之影響◦步驟1 0 8中之上述的程 序,相當於申請專利範圍第6項所敘述之第一翻譯識別裝 置。 在此最佳實施例中,當裝配翻譯識別程式時,因為登錄之 單數/複數或時態之差異而具有不同字尾之字彙之一曲折 形式的字彙表格亦儲存於硬碟裝置1 8。當一致的字彙之數 目於步驟1 0 8中計算並且出現只具有不一致的字尾之字彙 時,不管是否詢問單數/複數之差異中之不一致的字尾之 理由或是時態之差異,假設計算因為單數/複數或時態之 差異而具有不同字尾之字彙以作為一致的字彙。代替此差 異,只有不規則曲折形式之系統係登錄於曲折形式表格以 及規則曲折形式中,自動假設一致之字彙為可接受的。因 此,可以避免在一致的字彙之數目上因為單數/複數或時 態之差異(應該根本地了解字彙,以作為一致之字彙)而 具有不同字尾之字彙之影響。步驟10 8中之上述的程序, 相當於申請專利範圍第7項所敘述之第一翻譯識別裝置。Page 37 200530856 V. Description of the invention (31) The vocabulary of the original sentence in the language (for example, "a", "the", "to") in the natural English sentence, and " (I η) ″) in the frequently occurring vocabulary table, and also stored in the hard disk device 18. Step 108 calculates the number of coincidences while referring to the frequently occurring vocabulary table to exclude the words appearing in the frequently occurring vocabulary table among the calculated number of consistent vocabularies. Therefore, the influence of the vocabulary that often appears in the number of consistent vocabulary can be avoided. The above-mentioned procedure in step 108 is equivalent to the first translation recognition device described in item 6 of the scope of patent application. In this preferred embodiment, when a translation recognition program is assembled, a word form in a zigzag form having a suffix having a different suffix due to differences in registered singular / plural or tense is also stored in the hard disk device 18. When the number of consistent suffixes is calculated in step 108 and only suffixes with inconsistent suffixes appear, regardless of whether the reason for the inconsistent suffixes in the singular / plural difference or the tense difference is assumed, the calculation is assumed Words with different suffixes are used as consistent words due to differences in singular / plural or tense. Instead of this difference, only the system of the irregular zigzag form is registered in the zigzag form form and the regular zigzag form, and the consistent vocabulary is automatically assumed to be acceptable. Therefore, the influence of suffixes with different suffixes on the number of consistent vocabularies due to singular / plural or temporal differences (the vocabulary should be fundamentally understood as a consistent vocabulary) can be avoided. The above procedure in step 108 is equivalent to the first translation recognition device described in item 7 of the scope of patent application.

第38頁 200530856 五、發明說明(32) 在此最佳實施例中,即使字彙再次出現於自然句子,在步 驟1 0 8中計算一致字彙之數目的期間,一旦為了與某些原 始句子中之字彙一致之字彙之理由而計算字彙以作為一致 的字彙,為了避免重複計算出現二次或多次之一致的字 彙,不會計算兩次。因此即使自然句子之很多地方有相同 一致的字彙,也可以避免在一致的字彙之數目上受到一致 的字彙之影響。步驟1 0 8中之上述的程序,相當於申請專 利範圍第8項所敘述之第一翻譯識別裝置。 定義用以計算一致之程度之一公式,使得當一致的字彙增 加,一致之程度也隨之增加。例如,基於包含翻譯片語的 字彙之數目,而可以使用將一致的字彙之數目標準化 (normalizing)之計算公式(一致之程度等於一致的字 彙之數目除以包含翻譯片語的字彙之數目)。步驟1 0 8中 之上述的程序,相當於申請專利範圍第3項所敘述之第一 翻譯識別裝置(更具體而言如申請專利範圍第4項所敘述 下一個步驟1 1 0中,對照每個自然公式中計算一致之程度 以詢問是否存在具有最大一致之程度之複數個自然句子。 如果答案為否定,程式移至步驟1 1 2,用以讀取登錄於相 對應的翻譯資料庫之相對應的翻譯句子,上述翻譯資料庫 係具有最大的一致之程度之一相對應的自然句子,以及用 以搜尋用於翻譯片語之翻譯之讀取的翻譯句子,並且用以 200530856 五、發明說明(33) 將顯示器1 2上讀取的翻譯句子與發現重要的 (h i gh 1 i gh t e d)翻譯片語之翻譯一起顯示,並且結束此 程序。在此例子中,使用者可以了解特定翻譯片語之一合 適的翻譯(從原始句子中得到一自然翻譯之高可能之翻 譯,以作為對象語言中之一句子)。 步驟11 2中,除了具有在步驟1 0 2中之檢索所撿選之自然句 子之間之最大一致之程度之自然句子(在一致之程度之遞 減的次序中一某自然句子之數目,或是一致之程度係高於 特定的數值之全部的自然句子)之外,也從在一致之程度 之遞減的次序之相對應的翻譯資料庫中讀取,並且同時於 顯示器1 2上顯示。如果存在具有相同的一致之程度之複數 個自然句子,計算與每個自然句子中之原始句子不一致的 字彙之數目,相對應於具有相同一致的程度之自然句子之 翻譯句子,係依照相對應的自然句子中不一致的字彙之數 目之遞減的(從較小的)次序而顯示,以同時顯示翻譯句 子。 (例子2) 上述的程序更使用一真實的例子來敘述。假設原始語言係 英文,並且對象語言係日文,當包含一處理過的原始句子 中之一字彙「操作(operat i on)」,具體說明「系統突 然進入操作(The system suddenly came into o p e r a t i〇n)」,並且圖三中所顯示之自然句子(1)到Page 38 200530856 V. Description of the invention (32) In this preferred embodiment, even if the vocabulary reappears in the natural sentence, the period during which the number of consistent vocabulary is calculated in step 108 is once in order to match with some of the original sentences. The word vocabulary is calculated for the reason that the word vocabulary is consistent. To avoid repeated calculations, the word vocabulary is not calculated twice or more. Therefore, even if many parts of a natural sentence have the same consistent vocabulary, the number of consistent vocabularies can be prevented from being affected by the consistent vocabulary. The above procedure in step 108 is equivalent to applying for the first translation recognition device described in item 8 of the patent scope. A formula is defined to calculate the degree of agreement, so that as the word of agreement increases, the degree of agreement also increases. For example, based on the number of vocabularies containing translated phrases, a calculation formula that normalizes the number of consistent vocabularies (the degree of consistency is equal to the number of consistent vocabularies divided by the number of vocabularies containing translated phrases) can be used. The above-mentioned procedure in step 108 is equivalent to the first translation recognition device described in item 3 of the patent application scope (more specifically, as described in item 4 of the patent application scope in the next step 1 10), The degree of agreement in each natural formula is calculated to ask whether there are plural natural sentences with the greatest degree of agreement. If the answer is negative, the program moves to step 1 12 to read the phase registered in the corresponding translation database. Corresponding translation sentences, the above translation database has one of the largest degree of consistency, corresponding natural sentences, and translation sentences used to search for translations for translation of translated phrases, and is used for 200530856 V. Description of the invention (33) Display the translated sentence read on the display 12 together with the translation that found the important (hi gh 1 i gh ted) translation phrase, and end this process. In this example, the user can understand the specific translation piece A proper translation of the language (getting a natural translation of the highest possible translation from the original sentence as a sentence in the target language). Step 11 In step 2, The natural sentences with the greatest degree of agreement between the natural sentences selected in the search in step 102 (the number of natural sentences in a decreasing order of the degree of agreement, or the degree of agreement is high In addition to all the natural sentences of a specific value), it is also read from the corresponding translation database in a decreasing order of the degree of consistency, and displayed on the display 12 at the same time. If there is a The degree of plural natural sentences counts the number of words that are inconsistent with the original sentence in each natural sentence. Corresponding to the translated sentences of natural sentences with the same degree of consistency, it is based on the inconsistent words in the corresponding natural sentences. Decreasing (smaller) order is displayed to show translated sentences at the same time. (Example 2) The above program uses a real example to describe. Assume that the original language is English and the target language is Japanese. One word in the processed original sentence, "operat i on", specifically states "the system suddenly entered the operation (Th e system suddenly came into o p e r a t i〇n) ", and the natural sentences (1) to

第40頁 200530856 五、發明說明(34) (8)係與翻譯句子(1)到(8)相關聯,並且登錄於相 對應的翻譯資料庫以作為包含翻譯片語「操作 (operation)」之自然句子,除了圖三中所顯示之自然 句子(3)具有與具備「1」之原始句子之一致字彙之數目 1 0 0 %之一致之程度(亦計算自然句子中之「操作 (operation)」以作為如上述之一致的字彙),並且自 然句子(3)具有與具備「4之原始句子之一致字彙之數目 (從計算一致的字彙而撿選之字彙「這(the)」以作為 經常出現的字彙)4 0 0 %之一致之程度。因此,翻譯句子 (3)與自然句子(3)相關並且登錄於相對應的翻譯資料 庫係與重要的翻譯片語一起顯示,如下列所敘述: r Sisutemu ga kadosh i hajimeta」。 (例子3) 假設一原始語言係英文,並且一對象語言係日文,當包含 一處理過的原始句子中之一字彙「實施(operation )」,具體說明「一小生意係免除新法之實施(A sma 1 1 business is exempt from the operation of the new law)」,並且圖三中所顯示之自然句子(1)到(8)係 與相對應的翻譯句子(1)到(8) —起提供,並且登錄於 相對應的翻譯資料庫以作為包含翻譯片語「實施 (operation)」之自然句子,除了圖三中所顯示之自然 句子(2)具有與具備「1」之原始句子之一致字彙之數目 1 0 0 %之一致之程度(亦計算自然句子中之「操作 (operation)」以作為如上述之一致的字彙),並且自Page 40 200530856 V. Description of the invention (34) (8) is related to the translation sentences (1) to (8), and is registered in the corresponding translation database as the translation operation "operation" Natural sentences, except for the natural sentence (3) shown in Figure 3, which has a degree of agreement with the number of consistent vocabulary of the original sentence with "1" 100% (also calculates the "operation" in the natural sentence As the consistent vocabulary as above), and the natural sentence (3) has the same number of vocabulary as the original sentence with "4 (the vocabulary" the "selected from the calculation of the consistent vocabulary as frequently appear Vocabulary) of 4%. Therefore, the translated sentence (3) is related to the natural sentence (3) and registered in the corresponding translation database. It is displayed together with important translation phrases, as described below: r Sisutemu ga kadosh i hajimeta ”. (Example 3) Assume that an original language is English and an object language is Japanese. When it contains a word "operation" in a processed original sentence, specify "a small business is exempt from the implementation of the new law (A sma 1 1 business is exempt from the operation of the new law) ", and the natural sentences (1) to (8) shown in Figure 3 are provided from the corresponding translated sentences (1) to (8), And registered in the corresponding translation database as a natural sentence containing the translation phrase "operation", except that the natural sentence (2) shown in Figure 3 has the same vocabulary as the original sentence with "1" The degree of agreement of the number 100% (also calculates "operation" in natural sentences as a consistent vocabulary as described above), and since

第41頁 200530856 五、發明說明(35) 然句子(3)具有與具備「4」之原始句子之一致字彙之數 目(從計算一致的字彙而撿選之字彙「這(the)」以作 為經常出現的字彙)4 0 0 %之一致之程度。因此,翻譯句子 (2)與自然句子(2)相關聯並且登錄於相對應的翻譯資 料庫中係與重要的翻譯片語一起顯示,如下列所敘述: r Sh i mpo no tek i yo」 在英文句子中之字彙「實施(operation)」,係用於難 以識別合適的翻譯之一多意字彙。當從上述的例子2和例 子3中之容易的了解,在相對應的翻譯資料庫中登陸與自 然句子相關之翻譯句子中,可以得到在相同含意中使用多 一的字彙之一合適的自然句子(翻譯句子)以作為原始句 子,並且基於一致之程度而選擇一自然句子(翻譯句子 (例子4) 中文句子之例子中,因為沒有足夠了解中文語言難以分辨 大量之含意,常常難以識別應該得到之一翻譯中之單位。 在關於本發明之翻譯識別程序中,為了解決此問題,從登 錄於相對應的翻譯資料庫之自然句子中,檢索包含特定翻 譯片語之自然句子(當翻譯片語係包含複數個字彙時,同 時也檢索包含至少包括翻譯片語之複數個字彙中之任一種 之自然句子),並且顯示具有與全部的原始句子高一致之 程度之翻譯句子(包括自然句子之翻譯句子,除了具有高 一致之程度之自然句子)。因此,使用者可以了解從原始Page 41 200530856 V. Description of the invention (35) Natural sentence (3) has the same number of vocabulary as the original sentence with "4" (the vocabulary "the" selected from the calculated vocabulary is used as a regular The vocabulary that appears) is consistent with 400%. Therefore, the translated sentence (2) is associated with the natural sentence (2) and registered in the corresponding translation database, and displayed with important translation phrases, as described below: r Sh i mpo no tek i yo ”in The vocabulary "operation" in English sentences is used for a polysemous word that is difficult to identify a suitable translation. When it is easy to understand from the above examples 2 and 3, the corresponding translation database is registered in the translation sentence related to the natural sentence, and you can get a suitable natural sentence that uses one more word in the same meaning. (Translation sentence) as the original sentence, and a natural sentence is selected based on the degree of consistency (translated sentence (example 4) Chinese sentence example, because it is difficult to distinguish a large number of meanings without sufficient understanding of the Chinese language, it is often difficult to identify what should be obtained A unit in translation. In order to solve the problem in the translation recognition program of the present invention, a natural sentence containing a specific translation phrase is retrieved from the natural sentence registered in the corresponding translation database (when the translation phrase is When plural vocabularies are included, natural sentences containing at least one of the plural vocabularies including translation phrases are also retrieved, and translated sentences (including translated sentences including natural sentences) are displayed with a degree of consistency with all the original sentences , Except for natural sentences with a high degree of consistency). Therefore, users Can learn from the original

第42頁 200530856 五、發明說明(36) — 句子以及本身合適的翻譯中,參照顯示的翻譯句子應該同 時得到翻譯中之單位(u n i t)。 例如田中文之一原始句子「發展中的國家(展中国家 )」作為一原始浯言時,係翻譯成為日文以作為一對象語 言,如果具體說明「發()」、「發展(展)」、「發 展中(展中)」、「發展中國(展中国)」,或「發展 中國豕(展中国家)」中之任一種以作為翻譯片語,基於 下列次序中一致之程度而排列下列自然句子之翻譯句子。 (當具·體說明「發展中國家(展中国家)」以作為一翻譯 片語時,一致之程度也同時顯示於下列,然而,理所當然 依據具體說明片語個別之一致程度有所不同以作為翻譯片 語): 1·「發展中國家(展中国家)」=「展途上国(hatten tojokoku) 、 ( developing country)」 (一致之字彙之數目「5」,並且一致之程度1 〇 〇 %)。 2·「發展中國(展中国)」二「中国 展 (Chugokuwo hattensaseru) 、 ( to develop China)」(一致之字彙 之數目「4」,並且一致之程度80%)。 3.「發展中(展中)」=「展中 (hatten-chu no)、 (developing)」 (一致之字彙之數目「3」’並且一致之程度60%)。 4·「發展(展)」二「展 、展 (hattensuru、 hattensaseru) 、 ( develop)」Page 42 200530856 V. Description of the invention (36) — In the sentence and the proper translation itself, the translated sentence with reference to the display should also obtain the unit in translation (u n i t). For example, when one of the original sentences of Tian Chinese is "developing country (developing country)" as an original predicate, it is translated into Japanese as a target language. If you specify "fa ()", "development (exhibition)" , "Developing (Exhibiting)", "Developing China (Exhibiting China)", or "Developing China (Exhibiting Countries)" as translation phrases, arrange the following based on the degree of consistency in the following order Translated sentences of natural sentences. (When the description of "developing countries (developing countries)" is used as a translation phrase, the degree of consistency is also shown in the following. However, it is a matter of course that the degree of agreement among the specific phrases is different. Translation phrases): 1. "Developing countries (developing countries)" = "hatten tojokoku, (developing country)" (the number of consistent vocabulary "5", and the degree of consistency 100% ). 2. "Chugokuwo hattensaseru" and "to develop China" "(the number of consistent vocabulary is" 4 ", and the degree of consistency is 80%). 3. "Under development (in development)" = "hatten-chu no, (developing)" (the number of consistent vocabulary is "3" and the degree of consistency is 60%). 4 · "Development (Exhibition)" 2 "Exhibition, Exhibition (hattensuru, hattensaseru), (Develop)"

第43頁 發照 厂參 係,。 ,中位 中譯單 子翻之 命的中 本適譯 在合翻 {身到 子本得 句及時 始以同 原}該 從 J應 解/子 了家句 以帽譯 可Ϊ自 者 的 用{列 使家排 ,國所 此中述 因展上 200530856_ 五、發明說明(37) (一致之字彙之數目「2丨,並且一致之程度40%)。 5.「發( )」=「 、出 (hassuru、dasu)、 (start、 put out)」 (一致之字彙之數目「1」,並且一致之程度20%)。 當一原始句子與一翻譯片語兩者係「發()」,每個土 述的自然句子具有一致之字彙之數目「1」,並且一致之 程度1 0 0 %。然而,當彼此不一致的字彙之數目不相同時, 原始句子係排列於下列中不同於原始句子中之例子「發展 中國家(展中国家)」之次序··Page 43 Licensing, factory reference. , The mid-level translation of the order of the Chinese translation of the order of the Nakamoto in the translation {body to the child to get the sentence in time to start with the same original} should be interpreted from J should be / the family sentence can be interpreted by the use of the { Line up the ambassadors, as stated in the State Council because of the exhibition 200530856_ V. Invention Description (37) (the number of consistent vocabulary is "2 丨, and the degree of consistency is 40%). 5." Fa () "=", out (Hassuru, dasu), (start, put out) "(the number of consistent vocabulary is" 1 ", and the degree of consistency is 20%). When an original sentence and a translated phrase are both "fa ()", each natural spoken sentence has the same number of vocabulary "1", and the degree of consistency is 100%. However, when the number of inconsistent vocabularies is not the same, the original sentence system is arranged in the following order, which is different from the example "Developing Country (Exhibiting Country)" in the original sentence.

1. 厂 發 ( ) j ( 不一 致 的字彙之數目「 〇」) 2. 厂 發 展 ( 展) 」( 不 一致的字彙之數 目「1」 ) 3. 厂 發 展 中 (展 中) j (不一致的字彙之數目 「2」 ) 4. 厂 發 展 中 國( 展中 国) 」(不一致的字 彙之數 S厂 3」 ) 5. 厂 發 展 中國家( 展 中国家)」(不一 致的字 彙之 數目 厂 4」 ) 在 類 似 的 方 法中3 ,者一 田 原始句子與一翻譯 片語兩 者係 「發 展 ( 展) j 時, 基於 一 致之程度以及不_ -致的字彙, 原 始 句 子 也 排 列於下列不民 1於原始句子中之 例子「 發展 中國 家(展中国家)」或「發()」之次序·· 第44頁 200530856_ 五、發明說明(38) 1 ·「發展(展)j (一致的字彙之數目「2」,一致之程 度100%,並且不一致的字彙之數目「0」)。 2. 「發展中(展中)」(一致的字彙之數目「2」,一致 之程度100%,並且不一致的字彙之數目「1」)。 3. 「發展中國(展中国)」(一致的字彙之數目「2」, 一致之程度1 0 0 %,並且不一致的字彙之數目「2」)。 4. 「發展中國家(展中国家)」(一致的字彙之數目 「2」,一致之程度1 00%,並且不一致的字彙之數目「3」 )° 5 ·「發()」(一致的字彙之數目「1」·,一致之程度 5 0 %,並且不一致的字彙之數目「0」)。 即使當翻譯片語係與此相同時,翻譯句子係根據如上述顯 示如果與原始句子不相同之自然句子,依照一合適的次序 而排列。因此,使用者可以參照翻譯句子之排列結果,而 了解一相對於原始句子之一合適的翻譯。 當具有最大一致之程度之複數個自然句子存在時,步驟 1 1 0中之答案為肯定。接著,程式移至步驟1 1 4,以執行一 用以基於翻譯片語以及經常出現於原始語言中之相同的句 子(具有與翻譯片語高相關之字彙:相對應於如申請專 利範圍第1 1項、第1 4項,以及第1 7項所敘述之經常出現的 片語),而於步驟1 1 4與1 1 6中選擇自然句子之關聯性分 析。1. Factory issued () j (the number of inconsistent vocabulary "0") 2. Factory development (exhibition) "(the number of inconsistent vocabulary" 1 ") 3. Factory development (in exhibition) j (of the inconsistent vocabulary The number "2") 4. Factory development China (exhibiting China) "(the number of inconsistent vocabulary S factory 3") 5. Factory developing countries (developing country) "(the number of inconsistent vocabulary factory 4") in similar In the method 3, both the original field of Yitian and a translated phrase are "developed (exhibited) j. Based on the degree of consistency and inconsistent vocabulary, the original sentences are also arranged in the following inhabitants. 1 in the original sentence. Examples in the order of "developing countries (developing countries)" or "developing ()" page 44 200530856_ V. Description of invention (38) 1 · "Development (exhibition) j (number of consistent vocabulary" 2 ", The degree of consistency is 100%, and the number of inconsistent words is" 0 "). 2. "In development (in development)" (the number of consistent vocabularies is "2", the degree of consistency is 100%, and the number of inconsistent vocabularies is "1"). 3. "Developing China (Exhibiting China)" (the number of consistent vocabularies is "2", the degree of consistency is 100%, and the number of inconsistent vocabularies is "2"). 4. "Developing countries (developing countries)" (the number of consistent vocabularies is "2", the degree of consistency is 100%, and the number of inconsistent vocabularies is "3") ° 5 · "Fa ()" (consistent The number of vocabularies is "1", the degree of consistency is 50%, and the number of inconsistent vocabularies is "0"). Even when the translation phrase system is the same, the translation sentence system is arranged in a proper order according to the natural sentence which is displayed as described above if it is different from the original sentence. Therefore, the user can refer to the arrangement result of the translated sentences to understand a suitable translation relative to one of the original sentences. When a plurality of natural sentences with the greatest degree of consistency exist, the answer in step 1 10 is yes. Then, the program moves to steps 1 to 4 to execute a sentence based on the translation phrase and the same sentence that often appears in the original language (with a vocabulary highly related to the translation phrase: corresponding to the first paragraph of the scope of patent application). Item 1, item 14, and item 17 that are frequently encountered), and select the analysis of the relevance of natural sentences in steps 1 1 4 and 1 16.

第45頁 200530856 五、發明說明(39) 也就是說,步驟1 1 4中,翻譯片語與經常出現於原始語言 中之相同的句子是否包含於處理過的原始句子(與翻譯片 語具有一高相關性之字彙),係沒有調查而研究是否除了 原始句子中翻譯片語中之翻譯片語之外的每個字彙係包含 步驟1 0 2中所檢索之翻譯資料庫而檢選每個自然句子。在 此程序中,例如在識別除了處理過的原始句子中翻譯片語 之外的每個字彙之間具有出現於每個自然句子之次數之數 目,大於特定數目之次數的字彙(一次、二次,或多次) 以作為與翻譯片語具有一高相關性之字彙。上述步驟1 1 4 相當於如申請專利範圍第1 1項、第1 4項,以及第1 9項所敘 述之搜尋裝置,並且也如申請專利範圍第1 7項所敘述之第 二步驟。 下一步驟1 1 6中,詢問與翻譯片語具有高相關性之字彙是 否由步驟1 1 4中之檢索而發現。當答案為肯定時,程式移 至步驟1 1 8,從步驟1 0 2中檢索之相對應的翻譯資料庫所撿 選之每個自然句子之間,用以參照翻譯片語之翻譯句子, 以及每個包含步驟1 1 4中與所決定的翻譯片語具有高相關 性之字彙(在處理過的原始語言中的字彙)之自然句子, 並且用以識別上述的自然句子中翻譯片語之經常出現的翻 譯。翻譯片語之經常出現的翻譯,具體而言,當撿選翻譯 片語以及一單一自然句子作為一包含每個與翻譯片語具有 高相關性之字彙之一自然句子時,識別自然句子中翻譯片Page 45 200530856 V. Description of the invention (39) That is, in step 1 1 4 whether the translated phrase and the same sentence often appearing in the original language are included in the processed original sentence. Highly relevant vocabulary), no investigation was conducted to determine whether each vocabulary except the translated phrase in the original sentence contains the translation database retrieved in step 102 and checked each natural sentence. In this program, for example, the number of occurrences of each natural sentence between each vocabulary other than the translated phrase in the processed original sentence is recognized, and the vocabulary greater than a specific number of times (once, twice) , Or multiple times) as a vocabulary of high relevance to the translation phrase. The above steps 1 1 4 are equivalent to the search device described in item 11 of the patent application scope, item 14 and item 19, and also the second step described in the patent application scope item 17. In the next step 1 1 6, ask whether the vocabulary that has a high correlation with the translated phrase was found by searching in step 1 1 4. When the answer is affirmative, the program moves to step 1 18, between each natural sentence selected from the corresponding translation database retrieved in step 102, to refer to the translated sentence of the translation phrase, and Each natural sentence containing the vocabulary (words in the processed original language) that are highly relevant to the determined translation phrase in step 1 1 4 and is used to identify the often translated phrases in the natural sentences described above Appear translation. Often translations of translation phrases. Specifically, when a translation phrase and a single natural sentence are selected as a natural sentence containing each vocabulary that has a high relevance to the translated phrase, the translation in the natural sentence is identified. sheet

第46頁 200530856 五、發明說明(40) 語之翻譯以作為經常出現的翻譯。當撿選複數個自然句子 作為上述自然句子時,識別自然句子中翻譯片語之翻譯之 最經常出現之翻譯以作為經常出現的翻譯。 如上所提,當包含翻譯片語以及包含識別處理過的原始句 子之字彙之自然句子作為與存在翻譯片語有高相關性之字 彙時,自然句子可能使用翻譯片語之句子以作為具有相同 含意之句子,例如原始句子。具有翻譯片語之不同翻譯之 一自然句子,也可能混合於上述的自然句子中。另一方 面,在步驟1 1 8參照包含每個翻譯片語以及與翻譯片語具 有高相關性之字彙之自然句子之翻譯句子,識別上述的自 然句子中翻譯片語之經常出現的翻譯。因此,可以得到處 理過的原始句子中翻譯片語之合適的翻譯。 步驟1 1 9中,從包含每個翻譯片語以及與翻譯片語高相關 之字彙之自然句子之翻譯句子中,讀取登記於與自然句子 相關之相對應的翻譯資料庫之對象語言中之翻譯句子,在 自然句子中翻譯片語係與步驟1 1 8中所發現經常出現的翻 譯相關,並且讀取翻譯句子係與翻譯片語之翻譯以及在讀 取翻譯句子上強調與翻譯片語具有高相關性之字彙一起顯 示於顯示器1 2上,以結束此程序。在本例子中,使用者也 可以了解特定翻譯片語之合適的翻譯(從原始句子中得到 一自然翻譯之高可能之翻譯,以作為對象語言中之一句子 )。上述步驟1 1 6到1 1 9,重複相對應於如申請專利範圍第Page 46 200530856 V. Description of the invention (40) The translation of the language is used as a frequent translation. When plural natural sentences are selected as the above-mentioned natural sentences, the most frequently occurring translations of translations of translation phrases in natural sentences are identified as frequently occurring translations. As mentioned above, when a natural sentence containing a translated phrase and a vocabulary of a recognized original sentence is used as a vocabulary that is highly relevant to the existence of the translated phrase, the natural sentence may use the sentence of the translated phrase as having the same meaning Sentence, such as the original sentence. A natural sentence with different translations of the translated phrase may also be mixed in the natural sentence described above. On the other hand, in step 1 1 8 referring to the translated sentence containing each translated phrase and a natural sentence with a highly relevant vocabulary of the translated phrase, the frequently occurring translation of the translated phrase in the above-mentioned natural sentence is identified. Therefore, a proper translation of the translated phrase in the processed original sentence can be obtained. In step 1 19, from the translated sentences containing each translated phrase and a natural sentence with a high vocabulary related to the translated phrase, read the target language registered in the corresponding translation database related to the natural sentence. Translate sentences. In the natural sentence, the translation phrase series is related to the translation that is often found in step 1 1 8. Reading the translation sentence series is related to the translation of the translation phrase. In reading the translation sentence, it is emphasized that the translation phrase has The highly relevant vocabulary is displayed on the display 12 together to end the process. In this example, the user can also know the appropriate translation for a specific translation phrase (getting a high-probability translation from the original sentence as a sentence in the target language). The above steps 1 16 to 1 1 9 are repeated corresponding to

第47頁 200530856 五、發明說明(41) 1 1項所敘述之第一翻譯識別裝置,如申請專利範圍第1 4項 與第1 9項所敘述之第二翻譯識別裝置,以及如申請專利範 圍第1 7項所敘述之第三步驟。 步驟1 1 9中,也另外包含每個翻譯片語以及與翻譯片語高 相關之字彙之自然句子,並且上述翻譯片語在步驟1 0 2中 所撿選之自然句子之間係與經常出現的片語相關聯,從一 致之程度之遞減的次序之相對應的翻譯資料庫中讀取複數 個自然句子之翻譯(一致之程度之遞減的次序中之某一自 然句子之數目,或全部具有高於一致的程度之特定數值之 句子),並且同時根據一致之程度或不一致的字彙之次序 而排列於顯示器1 2上。 (例子5) 上述的程序係更敘述使用一真實的例子。假設一原始語言 係英文,並且一對象語言係日文,當包含一處理過的原始 句子中之一字彙「手術(operation)」係具體說明「我 接受移除直腸癌之手#ί ( I had an operation to remove a rectal cancer)」,並且圖三中所顯示之自然句子(1 )到(8)與其他自然句子係登錄於相對應的翻譯資料 庫,以作為包含翻譯片語「手術(〇 p e r a t i ο η)」之自然 句子,因為自然句子(6)到(8)兩者皆具有一致字彙之 數目「2」,所以只從一致之程度難以選擇一最合適的自 然句子(翻譯句子)。當從自然句子(1)到(8)中檢索Page 47 200530856 V. Description of the invention (41) The first translation recognition device described in item 11 is the second translation recognition device described in items 14 and 19 of the scope of patent application, and the scope of patent application The third step described in item 17. Step 1 1 9 also includes each translated phrase and a natural sentence with a high vocabulary related to the translated phrase, and the above-mentioned translated phrases are often related to the natural sentence selected in step 102. The phrases are related, and the translations of plural natural sentences are read from the corresponding translation database in a decreasing order of consistency (the number of natural sentences in a decreasing order of consistency, or all have Sentences with specific values higher than the degree of consistency), and at the same time arranged on the display 12 according to the degree of consistency or the order of the inconsistent vocabulary. (Example 5) The above-mentioned program further describes the use of a real example. Suppose an original language is English and an object language is Japanese. When it contains a word "operation" in a processed original sentence, it specifies "I accept the hand to remove rectal cancer # ί (I had an operation to remove a rectal cancer) ", and the natural sentences (1) to (8) shown in Figure 3 are registered in the corresponding translation database with other natural sentence systems as a translation phrase containing" operative (〇perati ο η) ", because both natural sentences (6) to (8) have the same number of vocabularies" 2 ", it is difficult to choose the most suitable natural sentence (translated sentence) only from the degree of consistency. When searching from natural sentences (1) to (8)

第48頁 200530856 五、發明說明(42) 除了處理過的原始句子(例如,「移除(remove)」、 「直腸的(rectal)」,或「癌症(cancer)」)中翻譯 片語「手術(operat i on)」之一關鍵字,因為自然句子 (6)到(8)包含「癌症(cancer)」,所以檢索「癌症 (cancer)」以作為在處理過的原始句子中與翻譯片語 「癌症(cancer)」高相關之字彙。Page 48 200530856 V. Description of the invention (42) In addition to processed original sentences (for example, "remove", "rectal", or "cancer"), the translation phrase "surgery" (Operat i on) "keyword, because natural sentences (6) to (8) contain" cancer ", so" cancer "is retrieved as a translation phrase in the processed original sentence The word "cancer" is highly relevant.

包含翻譯片語「手術(〇 p e r a t i ο η)」以及具有高相關之 字彙「癌症(cancer)」之自然句子,可能使用相同含意 中之翻譯片語「手術(〇 p e r a t i ο η)」之句子。然而,翻 譯片語之翻譯不同於翻譯片語中之自然句子,也具有混合 在登記於相對應的翻譯資料庫中之自然句子,並且符合上 述的情況之可能性。在本例子中,自然句子(8)相當於 本例子。雖然自然句子(8)包含翻譯片語「手術 (ope rat i on)」以及具有高相關性之字彙「癌症 (cancer)」,但是因為翻譯片語「手術(operat i on )」之翻譯係「sosa」,所以自然句子(8)不同於處理 過的原始句子。Natural sentences containing the translated phrase "operative (〇 p e r a t i ο η)" and the highly relevant vocabulary "cancer" may use the same meaning as the translated phrase "operative (〇 p e r a t i ο η)". However, the translation of the translated phrase is different from the natural sentences in the translated phrase. It also has the possibility of mixing the natural sentences registered in the corresponding translation database and conforming to the above situation. In this example, the natural sentence (8) corresponds to this example. Although the natural sentence (8) contains the translation phrase "ope rat i on" and the highly relevant vocabulary "cancer", but because the translation phrase "operat i on" is " sosa ", so the natural sentence (8) is different from the processed original sentence.

由於這些自然句子影響,為了防止不正確的識別翻譯片語 之翻譯,參照包含翻譯片語「手術(〇 p e r a t i ο η)」,以 及具有高相關性之字彙「癌症(cancer)」。翻譯片語 「手術(operat i on)」以及高相關之字彙「癌症 (cancer)」,除了自然句子(6)之外,係包含於自然Due to the influence of these natural sentences, in order to prevent incorrect recognition of the translation of the translation phrase, reference is made to the translation phrase "surgery (〇 p er a t i ο η)" and the highly relevant vocabulary "cancer". The translation phrase "operat i on" and the highly relevant vocabulary "cancer" are included in nature in addition to the natural sentence (6)

第49頁 200530856 五、發明說明(43)Page 49 200530856 V. Description of the invention (43)

句子中,並且上述中之一自然句子之翻譯片語「手術 (operat i on)」與自然句子中登記於相對應的翻譯片語 中之翻譯「shu j ut su」相關聯。因此,決定包含翻譯片語 「手術(operation)」以及高相關之字彙「癌症 (cancer)」之自然句子中之翻譯片語「手術 (〇 p e r a t i ο η)」之經常出現的翻譯為「s h u j u t s u」,包 含翻譯片語「手術(〇 P e r a t i ο η)」,與具有高相關性之 字彙「癌症(cancer)」之自然句子(6),選擇上述自 然句子t翻譯片語與所決定之經常出現相對應的翻譯,以 及與翻譯句子(6)相關之自然句子,並且登記於相對應 的翻譯資料庫強調與翻譯片語之翻譯高相關之字彙一起顯 示,如下列顯示。 厂 Jon — san no shujutsu-g〇、kare no gan wa naotta」。 因此,在本例子中,也選擇使用相同含意中之翻譯片語 「手術(operat i on)」之一合適的自然句子(翻譯句子 ),以作為原始句子。In the sentence, and the translation phrase "operat i on" of one of the above natural sentences is associated with the translation "shu j ut su" in the corresponding translation phrase registered in the natural sentence. Therefore, it is decided that the frequently-used translation of the translation phrase "operation (〇perati ο η)" in the natural sentence containing the translation phrase "operation" and the highly relevant vocabulary "cancer" is "shujutsu" , Including the natural phrase (6) including the translation phrase "operative (〇P erati ο η)" and the highly relevant vocabulary "cancer", choose the above-mentioned natural sentence t the translation phrase and the often appearing decision Corresponding translations, as well as natural sentences related to the translation sentence (6), are registered in the corresponding translation database to emphasize the vocabulary related to the translation of the translation phrase, as shown below. Factory Jon — san no shujutsu-g〇, kare no gan wa naotta. " Therefore, in this example, we also chose to use one of the appropriate natural sentences (translated sentences) as the original sentence.

參考圖三,雖然包含「手術(〇 p e r a t i ο η)」之自然句子 之數目係登錄以簡化敘述,但是實際上登錄更多自然句 子。當處理過的原始句子,舉例而言,係撿選而使用翻 譯片語「手術(operat i on)」,例如「傷害(i n j ury )」或「醫院(hospital)」之字彙之一句子,根據處理 過的原始句子之内容,以作為具有高相關性之字彙。當處Referring to FIG. 3, although the number of natural sentences including "operation (〇 p r a t i ο η)" is registered to simplify the narrative, more natural sentences are actually registered. When processing the original sentence, for example, it was picked and used the translated phrase "operat i on", such as "inj ury" or "hospital" as a sentence, according to The content of the processed original sentence as a highly relevant vocabulary. Everywhere

第50頁 200530856 五、發明說明(44) 理過的原始句子係,舉例而言,撿選而使用字彙 「 k a d 〇」,例如「系統(injury)」或「電月I ( hospital )」之含意中之翻譯片語「手術(〇 p e r a ΐ i ο η)」之一句 子,根據處理過的原始句子之内容以作為具有高相關性之 字彙。當處理過的原始句子,舉例而言,撿選而使用字彙 「ΐ e k i y 〇」,例如「法律(1 a w)」之含意中之翻譯片語 「手術(operation)」之一句子,根據處理過的原始句 子之内容,以作為具有高相關性之字彙。 另一方面,當不檢索與翻譯片語高相關之字彙時,步驟 1 1 6中問題之答案係否定。因此,程式移至步驟1 2 0,步驟 1 2 0到步驟1 3 4中,基於取代除了處理過的原始句子中之翻 譯片語之外的標的字彙之選擇性字彙(相對應於如申請專 利範圍第1 2項所敘述之選擇性片語)而選擇之自然句子來 執行計晝分析(scheme analysis)。 也就是說,識別存在於步驟1 2 0 (標的字彙)中之處理過 的原始句子之字彙,以及不存在於步驟1 2 0檢索之相對應 的翻譯資料庫所撿選的每個自然句子中之字彙。步驟1 2 2 中,從登錄於相對應的翻譯資料庫中,檢索包含標的字彙 之一自然句子。當存在複數個標的字彙時,步驟1 2 2中之 檢索係為了每個字囊而執行。步驟1 2 4中識別用以檢索母 個自然句子(如申請專利範圍第1 3項所敘述之「具有相同 句子結構之自然句子,以作為包含標的片語之自然句子之Page 50 200530856 V. Explanation of the invention (44) The original sentence system that has been processed, for example, pick and use the word "kad 〇", such as "injury" or "Dian Yue I (hospital)" One sentence in the translated phrase "operative (〇pera ΐ i ο η)" is based on the content of the processed original sentence as a highly relevant vocabulary. When processing the original sentence, for example, pick and use the vocabulary "ΐ ekiy 〇", such as a sentence in the meaning of "law (1 aw)" in the translated phrase "operation", according to the processed Content of the original sentence as a highly relevant vocabulary. On the other hand, when the vocabulary related to the translation phrase is not retrieved, the answer to the question in step 1 16 is negative. Therefore, the program moves to step 120, steps 120 to 134, based on the alternative vocabulary that replaces the target vocabulary other than the translated phrase in the processed original sentence (corresponding to the patent application (Optional phrase described in item 12 of the scope) and natural sentences selected to perform scheme analysis. That is, the vocabulary of the processed original sentence that exists in step 120 (the target vocabulary) is identified, and that it does not exist in each natural sentence selected by the corresponding translation database retrieved in step 120. Zigzag. In step 1 2 2, a natural sentence containing one of the target words is retrieved from the corresponding translation database registered in the corresponding translation database. When there are a plurality of target vocabularies, the search in step 1 2 2 is performed for each word bag. Steps 1 2 4 identify the natural sentences used to retrieve the mother (such as "natural sentences with the same sentence structure as described in item 13 of the scope of the patent application, as the natural sentences containing the target phrase"

ill 第51頁 200530856 五、發明說明(45) 檢索所撿選之自然句子」)的檢索情況,上述自然句子識 別只不同於步驟1 2 2之檢索所撿選之每個自然句子,以從 登錄於基於識別檢索情況之相對應的翻譯資料庫之自然句 子中來檢索自然句子。 在下一步驟1 2 6,詢問步驟1 2 4之檢索是否撿選一致的自然 句子。當存在於只有標的字彙取代其他字彙之相對應的翻 譯資料庫中之每個相同的翻譯句子,上述其他的字彙可能 可以用來取代標的字彙之一選擇性片語。因此,當答案係 為肯定時,程式移至步驟1 2 8,了解字彙取代檢索所撿選 之自然句子中之標的字彙以作為用於標的字彙之選擇性字 彙。上述步驟1 2 0到1 3 0,相當於如申請專利範圍第1 2項所 敘述之識別裝置(如申請專利範圍第1 3項中之更具體的說 明)〇 中 大 最 3 1有 驟具 步於 問 古口 在 存 代 取 否 是 彙 字 的 標 之 中 子 句 由 Γ\ 彙 字 性 擇 選 之 子 句 然 個 數 複 之 度 程 致 之子使 句 子然意 句自含 然個同 自數相 一 複在 當之能 。度可 }程子 代致句 取一然 來大自 度最關 程有有 的具定 致於決 一在, 之存時 值代彙 數取字 定彙性 特字擇 過的選 超標之 子 句 始 原 的 過2ffl 3 CD-d 理1 處驟 為步 作至 以移 子式 句程 之定 語肯 片為 譯案 翻答 3J mrc 如 此 因 最 有 具 於 錄 登 取 讀 以 之取 中來 庫度 料程 資的 譯致 _ 1 一 的之 應值 對數 相定 之特 關過 相超 子由 句C 然子 自句 之譯 度翻 程之 致語 一始 大原ill Page 51 200530856 V. Description of the invention (45) Retrieving selected natural sentences "), the above natural sentence recognition is only different from each natural sentence selected in the search of step 1 2 2 to retrieve from the registration The natural sentences are retrieved from the natural sentences in the corresponding translation database based on the recognition retrieval situation. In the next step 1 2 6, ask whether the search in step 1 2 4 selects a consistent natural sentence. When the same translation sentence exists in the corresponding translation database in which only the target vocabulary replaces the other vocabulary, the above other vocabulary may be used to replace one optional phrase of the target vocabulary. Therefore, when the answer is affirmative, the program moves to step 1 2 8 to understand that the vocabulary replaces the search for the target vocabulary in the selected natural sentence as an optional vocabulary for the target vocabulary. The above steps 120 to 130 are equivalent to the identification device described in item 12 of the scope of patent application (as described in more detail in item 13 of the scope of patent application). Step by step, ask whether Gukou is the mark of the hui word in the surrogate. The clause is chosen by Γ \ hui zi. The number of clauses is complex. The son of the degree makes the sentence self-contained and self-contained. Repeatedly. Degree can} Cheng Zi's generation of the sentence is taken from the most important degree. Some of the most important processes are determined to be determined, the existence of the value of the algebraic number of words, the definiteness of the selected word, the original clause of the clause that has been selected and exceeded. 2ffl 3 CD-d processing 1 step to step by step attributive sentence Ken Ken film as a translation 3J mrc Therefore, most of them have the ability to register for reading资 的 致 致 _ 1 The corresponding value of the one The logarithmic phase is related to the special super-subordinate sentence C Ranzi's self-sentence translation The translation of the speech

第52頁 200530856 五、發明說明(46) 代),並且包含取代標的字彙之選擇性字彙;用以搜尋用 於翻譯片語之翻譯以及選擇性字彙之翻譯之讀取翻譯句 子;讀取翻譯句子與認定重要的翻譯片語以及所標示之選 擇性字彙之翻譯一起顯示在顯示器1 2上,也用以作為一選 擇性的字彙,並且結束此程序。在本例子中,使用者也可 以了解特定翻譯片語之合適的翻譯(從原始句子中得到一 自然翻譯之具有高可能性之翻譯,以作為對象語言中之句 子)。上述步驟1 3 0以及步驟1 3 2中,相當於如申請專利範 圍第1 3項所敘述之第一翻譯識別裝置。 因此步驟1 3 2中,另外具有最大一致的程度之自然句子以 及包含取代步驟1 0 2中檢索所撿選之自然句子之間之標的 字彙之選擇性字彙,依照一致的程度之遞減的次序來讀取 複數個自然句子之翻譯句子(一致之程度之遞減的次序中 之某一自然句子之數目,或全部具有高於一致的程度之特 定數值之句子),並且同時根據一致之程度與不一至字彙 之數目,依照次序排列於顯示器1 2之上。 (例子6) 上述的程序係更敘述使用一真實的例子。假設一原始語言 係英文,並且一對象語言係日文,當具體說明一字彙「吃 (have)」以作為處理過的原始句子中包含「吃午餐 (have 1 unch)」之翻譯片語時,一包含「吃早餐(ha v e breakfast)」之自然句子(在相同含意中使用翻譯片語Page 52 200530856 V. Description of the invention (46th generation), and contains alternative words that replace the subject vocabulary; read translation sentences for searching for translation of translation phrases and translation of selective vocabulary; It is displayed on the display 12 together with the translations that are deemed important and the marked optional vocabulary, also used as an optional vocabulary, and the process ends. In this example, the user can also know the appropriate translation for a particular translation phrase (getting a natural translation with a high probability of translation as a sentence in the target language) from the original sentence. The above steps 130 and 132 are equivalent to the first translation recognition device described in item 13 of the patent application scope. Therefore, in step 1 2 2, the natural sentences with the greatest degree of consistency and the optional words containing the words that replace the target between the natural sentences selected in step 102 are searched in the descending order of the degree of consistency. Read translated sentences of plural natural sentences (the number of certain natural sentences in a decreasing order of consistency, or all sentences with a specific value higher than the degree of consistency), and at the same time according to the degree of consistency The number of words is arranged on the display 12 in order. (Example 6) The above-mentioned program further describes the use of a real example. Assume that an original language is English and an object language is Japanese. When the word "have" is specified as a translated phrase that includes "have 1 unch" in the processed original sentence, one Natural sentences containing "ha ve breakfast" (use translated phrases in the same meaning

第53頁 ,自索字Γ t 因取字C午早早 是個檢的選1 。有性吃Γ 吃Γ 於每所標檢 a子具擇^ 彙Γ 彙 200530856 五、發明說明(47) 「吃(have)」之自然句子以作為處理過的原始句子)係 登錄於相對應的翻譯資料庫中,沒有登錄一包含「吃午餐 (have lunch)」之翻譯片語,即使基於一致的程度或相 關性分析之調查也不能識別一適當的翻譯句子,處理過的 原始句子中之翻譯片語「吃(have)」或每個除了決定用 以與翻譯片語具有高相關性之外的字彙(例如,「午餐 (1 unch)」),係假設為標的字彙,並且檢索包含每個 標的字彙之自然句子。因此,檢索以作為標的字彙「午餐 (lunch)」,自然句子包含例如「吃午餐(eat lunch )」或「吃晚午餐(take a late lunch)」。 從上述檢索所撿遙的每個自然句子中,檢索只有在 然句子差異中標的字彙之自然句子。接著,當用於 撿選的自然句子以作為一自然句子的内容時,例如 彙「午餐(lunch)」,自然句子的内容,例如, 吃早餐(eat breakfast)」或「吃早午餐(take e breakfast)」以作為只有標的字彙差異之自然句 此,當決定一字彙「早餐(breakfast)」以作為 代標的字彙「午餐(1 unch)」之高可能性之一選 彙,在第一檢索所撿選之自然句子(包含翻譯片語 have)」之自然句子之檢索)之間可能取代標的字 餐(lunch)」之選擇性字彙之翻譯句子(包含 餐(have breakfast)」之翻譯句子)取代標的字 餐(breakfast)」,並且登錄於相對應的翻譯資On page 53, the self-retrieval word Γ t is chosen because it is a check 1 noon. Eating sexually Γ eating Γ is optional for each of the standard inspections huihui huihui 200530856 V. Description of the invention (47) The natural sentence "have" as the processed original sentence) is registered in the corresponding No translation phrase containing "have lunch" is registered in the translation database. Even a survey based on the degree of consistency or correlation analysis cannot identify a proper translation sentence. The translation in the processed original sentence The phrase "have" or each vocabulary other than the one that has been determined to have a high relevance to the translated phrase (for example, "lunch") is assumed to be the target vocabulary, and the search contains each The natural word of the target word. Therefore, the word "lunch" which is the target word is retrieved, and the natural sentence includes, for example, "eat lunch" or "take a late lunch". From each natural sentence picked up by the above search, a natural sentence with only the vocabulary of the successful bid in the natural sentence difference is retrieved. Then, when the natural sentence used for picking is used as the content of a natural sentence, for example, "lunch", the content of the natural sentence, for example, "eat breakfast" or "take brunch" "Breakfast" "as a natural sentence with only the difference in the target vocabulary. When determining the word" breakfast "as one of the high-possibility vocabulary" lunch ", select it in the first search office The selected natural sentences (including the search of the translated phrase have) "natural sentence search) may replace the optional word vocabulary (" have breakfast "" translation sentence) Targeted word meal (breakfast) "and registered in the corresponding translation

第54頁 200530856 五、發明說明(48) 料庫係顯示與強調的翻譯片語「吃(have) < taberu>」 以及標示的選擇性字彙「早餐(breakfast) < breakfast >」之翻譯◦因此,在此例子中選擇一合適的自然句子 (翻譯句子)使用相同含意中之翻譯片語「吃(have)」 以作為處理過的原始句子。 當也具體說明包含「有I苗(have cats)」之處理過的原 始句子中「有(h a v e )」以作為翻譯片語時,包含「有狗 (have dogs)」之自然句子係登錄於相對應的翻譯資料 庫,沒有登錄包含「有猶(have cats)」之自然句子, 即使基於一致的程度或相似方法中相關性分析之調查,也 不能識別一適當的翻譯句子,在第一檢索所撿選之自然句 子(包含翻譯片語「有(have)」之自然句子之檢索)之 間’假設字囊’例如「猶(c a t s )」為標的字彙’並且決 定字彙「狗(dogs)」為具有用上述之相同方法取代標的 字彙「貓(cats)」之高可能之選擇性字彙,標的字彙 「1¾ ( cat s)」取代選擇性字彙「狗(dogs)」之自然句 子(包含「有狗(have dogs)」之自然句子)相關之登 錄於相對應的翻譯資料庫中之翻譯句子,與強調的翻譯片 語「有(have)」之翻譯「kau」,以及所標示的選擇性 字彙「狗(d〇g s)」之翻譯「i nu」一起顯示。因此,在 例子中也選擇使用相同含意中之翻譯片語「有(have)」 之一合適的自然句子(翻譯句子),以作為處理過的原始 句子。Page 54 200530856 V. Description of the invention (48) Translation of the display and emphasis of the translation "have < taberu >" and the optional wording "breakfast & breakfast >" ◦ Therefore, in this example, a suitable natural sentence (translated sentence) is selected using the translated phrase "have" in the same meaning as the processed original sentence. When it is specified that "have" in the processed original sentence containing "have cats" is used as a translation phrase, the natural sentence containing "have dogs" is registered in the phase The corresponding translation database does not register a natural sentence containing "have cats". Even if it is based on the degree of consistency or similarity surveys in similar methods, an appropriate translation sentence cannot be identified. Pick the natural sentences (including the search of natural sentences with the phrase "have") between the 'hypothetical word bag' such as "cats" as the target vocabulary "and decide that the word" dogs "is It has the high possibility of using the same method to replace the target word "cats", and the target word "1¾ (cat s)" replaces the natural sentence of the optional word "dogs" (including "have dogs" (Have dogs) natural sentences) related translation sentences registered in the corresponding translation database, the translation "kau" with the emphasis translation phrase "have", and the marked options Vocabulary "Dog (d〇g s)" appears along with the translation of "i nu." Therefore, in the example, one of the appropriate natural sentences (translated sentences) with the same meaning in the translated phrase "have" was also selected as the processed original sentence.

第55頁 200530856 五、發明說明(49) (例子Ό 假設原始語言係日文,並且原始語言係英文,當具體說明 處理過的句子「o-yu wo kakeru」中之「kakeru」作為翻 譯片語時,一自然句子「mizu wo kakeru」(使用相同含 意中之翻譯片語「kakeru」之自然句子,以作為處理過的 原始句子)係登錄於相對應的翻譯資料庫,但是沒有登錄 「〇-yu wo kakeru」,並且即使基於一致的程度或相關性 分析之調查也不能識別一適合的翻譯句子,因為登錄其他 自然句子,例如「denwa wo kakeru」,會假設處理過的 原始句子中之「o-yu」為標的字彙,並且檢索包含標的字 彙之自然句子。因此,撿選自然句子,例如「〇 - y u n i tsukeru」、r 〇_yu wo wakasu」、「〇_yu wo nagasu」、「〇-yu de arau」、「o-yu kara hikiageru」,以及「〇-yu de atatameru」。 於是,從每個上述所撿選的自然句子中,檢索只有在每個 自然句子差異之標的字彙中之自然句子。檢索之結果可以 得到下列的字彙取代標的字彙「o-yu」 「〜ni tsukeru」 : r sosu」、厂 shoyu」、厂 mizu」 、 「 sutsukesu」 ,以及「 ude」 r 〜wakasu」 :無(None) 厂〜wo nagasu」·· 「denki」、「mizu」、「bangumi」, 以及r uwasa」 厂〜de arau」 : r shampu」、「 suponji」、Page 55 200530856 V. Description of the invention (49) (Example: Suppose the original language is Japanese and the original language is English. When the "kakeru" in the processed sentence "o-yu wo kakeru" is specified as the translation phrase , A natural sentence "mizu wo kakeru" (a natural sentence using the translated phrase "kakeru" in the same meaning as the processed original sentence) is registered in the corresponding translation database, but "〇-yu is not registered wo kakeru ", and even a survey based on the degree of consistency or relevance analysis cannot identify a suitable translated sentence, because registering other natural sentences, such as" denwa wo kakeru ", will assume the" o- "yu" is the target vocabulary, and natural sentences containing the target vocabulary are retrieved. Therefore, pick natural sentences such as "〇- yuni tsukeru", r 〇_yu wo wakasu "," 〇_yu wo nagasu "," 〇-yu de arau "," o-yu kara hikiageru ", and" 〇-yu de atatameru ". Therefore, from each of the natural sentences selected above, the search Natural sentences in the target vocabulary of natural sentence differences. The search results can be obtained by replacing the target vocabulary "o-yu" "~ ni tsukeru": r sosu ", factory shoyu", factory mizu "," sutsukesu ", and "Ude" r ~ wakasu ": None factory ~ wo nagasu" ... "denki", "mizu", "bangumi", and r uwasa "factory ~ de arau": r shampu "," suponji ",

第56頁 200530856 五、發明說明(50) 厂 sh i om i z U」 Λ 厂 m i z U」 以 及 r sekke η」 厂 〜kara h ik i ageru」 ; 厂 in i z uj ,^ Γ ka it e i」 厂 ka i g ij ,以及 「零 ( z e r o) j 厂 〜de atatame: ru」 • 厂 densh i- -r e n j i j 以及 厂 yonetsu」 在 上述的字 彙 之 間 1 此 字 彙 厂 m i zu」經 常 出 現並 且 句 子 厂 m i zu i NO kaker u」 存 在 o 因 此,決 定 字 彙「 m : i z 1 Jj 以 作 為具有取 代 處 理 過 的 原 始 句 子 「〇 一 y u w o kakeru」 中 之 標 的字彙「 〇- -yu」 之 可 能 之 選 擇 性字彙 ο 因 it匕, 在 第 一 檢 索 (包含翻 譯 片 語 之 白 狄 句 子 厂 kakeruj 之 檢索 ) 所 撿 選 之 自然句子 之 間 有 關 標 的 字 彙 厂 o-y u」 取 代 選擇 性 字 彙 厂 m i zu」之 句 子 厂 m i zu wo kakeruj 之 翻譯 並 且 登 錄 於相對應 的 翻 譯 資 料 庫 , 係 與 強調的 翻 譯 片語 厂 kakeru」 之 翻 譯 厂 倒 ( pour .) 」,以 及 標 示的 選 擇 性 片 語「k a k e r u」之翻譯「水(w a t e r)」一起顯示,如下列 所顯示。 「隹J『水』進入〜(pour *water氺into〜)」 在上述翻譯句子中,「『…』」(「氺···*」)表示「…」 係一選擇性字彙。因此,在本例子中,亦選擇使用處理過 的原始句子之相同含意中之翻譯片語「kakeru」之一合適 的自然句子(翻譯句子)。 在相似的方法中,當具體說明處理過的原始句子「3 woPage 56 200530856 V. Description of Invention (50) Factory sh i om iz U ″ Λ factory miz U ″ and r sekke η ″ factory ~ kara h ik i ageru ″; factory in iz uj, ^ Γ ka it ei ”factory ka ig ij, and "zero (j) j factory ~ de atatame: ru" • factory densh i- -renjij and factory yonetsu "between the above vocabulary 1 This vocabulary factory mi zu" often appears and the sentence factory mi zu i NO kaker u ”exists o Therefore, the vocabulary“ m: iz 1 Jj ”is determined as a possible alternative vocabulary with the target vocabulary“ 〇- -yu ”in place of the processed original sentence“ 〇 一 yuwo kakeru ”ο because it In the first search (including the translation of the Baidi sentence factory kakeruj search), the relevant word vocabulary factory oy u between the selected natural sentences was replaced by the sentence factory mi zu wo kakeruj. Translate and register in the corresponding translation database. Tone translation phrase plant kakeru "The translation plant down (pour.)", As well as selective phrase labeled the "k a k e r u" of translation "water (w a t e r)" is displayed together, as the one shown below. "隹 J『 水 "入 ~ (pour * water 氺 into ~)" In the above translation sentence, "『 ... 」" ("氺 ··· *") means "..." is an optional word. Therefore, in this example, it is also chosen to use a suitable natural sentence (translated sentence) that is one of the translated phrases "kakeru" in the same meaning of the processed original sentence. In a similar way, when specifying the processed original sentence "3 wo

第57頁 200530856 五、發明說明(51) 片語Page 57 200530856 V. Description of the invention (51)

),但是沒有登錄「3 wo kakeru」,並且即使基於一致 的程度或相似方法中一相關性分析之調查,也不能識別 適當的翻譯句子,翻譯與標的字彙「3」取代為r 之 然句子r /…------..... J kakeru」中之「kakeru」時,於相對應的翻譯資料庫中a 錄一自然句子「4 wo kakeru」(使用相同含意中之翻譯 「k a k e r u」之自然句子,以作為處理過的原始句子 p? ^ 1心 κλ* r a , … kakeru」相關並且登錄於在第一檢索所檢 wo • 个 似系尸坏檢 =自然句子(包含翻譯片/吾「kakeru」之自然句子之檢 索)之間之相對應的翻澤資料庫,係與重要的翻譯片七五 「kakeru」之翻譯(「乘(multiply)」)以及藉由二私 字彙,例如「3」為標的字彙而標示之選擇性字彙㈢「4又= 翻譯「four」一起顯示,並且決定字彙r 4」以作」之 取代標的字彙「3」之高可能之一選擇性字彙。 /、 「乘(multiply)以 4(氺 f0UI> 氺)」 因此,在本例子中,亦選擇使用處理過的原始句 含意中之翻譯片語「kakeru」之一合適的自麸^ 5 句子)。 口、旧目然句子(翻譯 ::方面’當在標的字彙差異中之自'然句子之檢索广右於 k符合的自然句子(當步驟i 2 6中答案係否 ’、/又欢 標的字彙取代選擇性字彙之自然句子不 :有曰或當 致程度之複數個自然句子(當步驟丨3 〇中答宰传、有取大— T二以識,卜,適的翻譯句子。因此,程式移至步驟 用以δ貝取母個步驟12 0中之檢索所撿選之全部的自然), But "3 wo kakeru" is not registered, and even based on the degree of consistency or a survey of correlation analysis in similar methods, the appropriate translated sentence cannot be identified. The "3" vocabulary of the translation and target is replaced by r and the sentence r /… ------..... J kakeru "" kakeru ", record a natural sentence" 4 wo kakeru "in the corresponding translation database a (use the translation" kakeru "in the same meaning "The natural sentence as the processed original sentence p? ^ 1 heart κλ * ra,… kakeru" is related and registered in the first search wo • A quasi-corpse bad inspection = natural sentence (including translation / The corresponding translation database of my "kakeru" natural sentence search) is related to the important translation of the seventy-five "kakeru" translation ("multiply") and the use of two private words, such as "3" is an optional word that is marked with the target word "4" = "four" is displayed together, and it is determined that the word "r 4" is used as a replacement of the target word "3", which is one of the highest possible words. , "Multiply" by 4 (氺 f0UI > 氺) ”Therefore, in this example, we also chose to use one of the translated phrases“ kakeru ”in the meaning of the processed original sentence, as appropriate. Spoken and old Mu Ran sentences (translation :: aspect 'When in the target vocabulary difference' since the search of Ran sentence wide right than k matches the natural sentence (when the answer is no in step i 2 6 ') Natural sentences that replace selective vocabulary are not: there are multiple natural sentences with a degree of succinctness or similarity (when step 丨 3 0 is answered, there is a big one-T two to understand, bu, appropriate translated sentences. Therefore, the program Go to step to retrieve all selected natural

200530856 五、發明說明(52) 句子,或依照一致的程度之遞減的次序之與複數個自然句 子相關之翻譯句子(一致之程度之遞減的次序中之某一自 然句子之數目,或全部具有高於一致的程度之特定數值之 句子),並且登記於相對應的翻譯資料庫,用以搜尋每個 用於翻譯片語之翻譯之讀取翻譯句子,依照一致的程度之 次序以及每個具有與發現強調的翻譯片語之翻譯之最大一 致程度之自然句子一致程度之遞減的次序之不一致字彙之 數目,於顯示器1 2上排列個別的翻譯句子,並且結束此程 序。在本例子中,使用者可以了解一些特定翻譯片語之候 選的合適翻譯。 在上述例子中,雖然敘述只使用一致的字彙之數目來計算 一致的程度之模式,但是模式並不限定於此實施例。可以 識別計算一致程度之公式,當一致之字彙之數目增加時, 使得一致之程度也隨之提高,以作為自然句子與原始句子 之間因為基於一致的字彙之數目以及不一致字彙之數目之 計算與評估一致之程度而增加之一致字彙之數目。此模式 相當於本發明之申請專利範圍第5項。也可以只基於一致 的字彙之數目而首先計算一致之程度,當即使執行一相關 性分析或一計晝分析,也難以選擇一單一的自然句子時用 以計算不一致的數目,並且用以選擇具有最小不一致字彙 之數目之自然句子(翻譯句子)。 評估一致之程度時,通常也可以使用一自然句子與一原始200530856 V. Description of Invention (52) Sentences, or translated sentences related to plural natural sentences in decreasing order of consistency (the number of natural sentences in decreasing order of consistency, or all have high Sentences with a specific value for the degree of consistency), and registered in the corresponding translation database, used to search each read translation sentence for the translation of the translation phrase, in accordance with the order of the degree of consistency and each with The number of inconsistent vocabularies in the descending order of the maximum consistent degree of translation of the emphasized translation phrase and the natural sentence agreement is found, the individual translated sentences are arranged on the display 12, and the process ends. In this example, the user can learn about suitable translations for some specific translation phrases. In the above example, although the mode that uses only the number of consistent words to calculate the degree of consistency is described, the mode is not limited to this embodiment. You can identify the formula for calculating the degree of consistency. When the number of consistent vocabularies increases, the degree of consistency also increases, as a natural sentence and the original sentence because of the calculation based on the number of consistent vocabularies and the number of inconsistent vocabularies. Increase the number of consistent vocabularies by assessing the degree of consistency. This mode is equivalent to item 5 of the patent application scope of the present invention. It is also possible to first calculate the degree of consistency based only on the number of consistent vocabularies. It is used to calculate the number of inconsistencies when it is difficult to select a single natural sentence even when performing a correlation analysis or a day-to-day analysis. Natural sentences with minimum number of inconsistent words (translated sentences). When assessing the degree of agreement, a natural sentence and a primitive

第59頁 200530856 五、發明說明(53) 句子之排列次序之相似度之程度 之字彙之自然句 原始句子增加或評估一致 子中之不一致字 ,或存在 彙之數目 之間 之數目與不 致 字彙之 也隨之 數目減 然句子 模式, 用存在 本發明 程序9 句子相 順序之 排列次序 提南’也 少時,與 與一原始 相當於根 於一致的 之申請專 具體了解 關之字彙 相似度之 字彙 之相 使得 原始 句子 據本 字彙 利範 考量 之程度 目用以評估一致之程度 與原始句子相關 ,在自然句子與 另外上述一致字彙 之數 似度增加時 當存 句子 之間使用字 發明之申請 中之 圍第 在一致 一致之 不一致 10項。 然句子與 中之不一致字彙 一致之程度之評 ,與原始 的字彙間 程度也隨 彙之排列 專利範圍 字彙之數 可以藉由 原始句子 之數目之 估: 句子一 之不一 之提高 次序之 第9項 目,相 ,例如 以及存 間,字 ,使得當 致之程度 致字彙之 〇在一自 相似度之 並且使 當於根據 ,下列之 在與原始 彙之排列 也就是說,根據原始句子中翻譯片語之差異(字彙之數目 ),每個原始句子中之字彙提供第一估計數值(f i r s t assessment value)。定義第一估計數值,使得原始句子 中之翻譯片語之第一估計數值為最大,並且除了原始句子 中之翻譯片語之外,每個字彙之第一估計數值減少以作為 原始句子中之翻譯片語增加造成的差異(以作為增加存在 字彙中間之數目)。舉例而言,當原始句子包含九個字彙 A 到 J (原始句子=A、B、C、D、E、F、G、H,以及 J), 具體說明原始句子中之第四個字彙D以作為翻譯片語時,Page 59 200530856 V. Description of the invention (53) The degree of similarity in the order of the sentences is similar to the natural words of the original sentence. Increasing or evaluating inconsistent words in the consensus sentence, or the number between the number of existing sinks and the number of non-words. It also reduced the number of sentence patterns, and used the sequence order of 9 sentences in the program of the present invention to mention “Nan'an”, and when it was similar to the original equivalent of a rooted application, the word vocabulary of similarity was specifically understood. The phase makes the original sentence according to the degree of vocabulary of the original word used to evaluate the degree of consistency related to the original sentence. When the number of similarities between the natural sentence and the other consistent vocabulary increases, the use of the word invention between the stored sentences There are 10 inconsistencies around the consensus. However, the evaluation of the degree of consistency between the inconsistent vocabulary in the sentence and the original vocabulary is also arranged with the vocabulary. The number of vocabulary in the patent scope can be estimated by the number of the original sentences: Items, phases, such as, and interstitial, words, so that the degree of consistency to the word vocabulary is at a level of self-similarity and basis, the following are arranged in the same order as the original vocabulary, that is, according to the translation of the original sentence in the original sentence The linguistic differences (the number of vocabulary), each vocabulary in the original sentence provides a first assessment value. Define the first estimated value so that the first estimated value of the translated phrase in the original sentence is the largest, and in addition to the translated phrase in the original sentence, the first estimated value of each word is reduced to serve as the translation in the original sentence The difference caused by the increase in phrases (as an increase in the number of words in the middle). For example, when the original sentence contains nine vocabularies A to J (original sentence = A, B, C, D, E, F, G, H, and J), specify the fourth vocabulary D in the original sentence to start with As a translation phrase,

第60頁 200530856 五、發明說明(54) 原始句子中每個字彙A到J提供下列第一估計數值: A二 1 · 0、B二 2. 0、C= 5· 0、D二 1 0· 0、E= 5. 0、F二 2· 0、G 二 1. 0、H= 0.5,以及 1二 0.2。 接著,每個包含步驟1 0 2之檢索,從相對應的翻譯資料庫 中所撿選的翻譯片語之自然句子係用以搜尋自然句子中之 每個字彙之間之原始句子中與任何字彙之一致之一致的字 彙,並且用以計算已提供給每個發現的一致的字彙之第一 估計數值之總合。第一估計數值之總合假設為一致之程度 (第一估計數值之總合),並且依照每個相關的自然句子 之一致之程度之遞減的次序,顯示檢索所撿選之每個自然 句子之翻譯句子。例如,當具有由檢索而撿選之下列字彙 排列之自然句子1到5 (其中字彙X係一任意的(arbitrary )字彙): 自然句子 1= ( D、X、X、E、F、G) 自然句子 2= ( D、X、E、F、G) 自然句子3= ( D、E、F、G) 自然句子 4= ( A、B、X、C、X、D) 自然句子 5二(D、E、F、G、H、 J) 每個自然句子之第一估計數值之總合係: 自然句子1 : 1 0 · 0 (字彙D) + 0 (字彙X) + 0 (字彙X) + 5.0(字彙 E) +2.0(字彙 F) +1·0(字彙 G)= 18.0 自然句子2 : 1 0 · 0 (字彙D) + 0 (字彙X) + 5 · 0 (字彙E) + 2· 0 (字彙 F) + 1. 0 (字彙 G)= 18· 0 200530856 五、發明說明(55)Page 60 200530856 V. Description of the invention (54) Each vocabulary A to J in the original sentence provides the following first estimated values: A 2 1 · 0, B 2 2. 0, C = 5 · 0, D 2 1 0 · 0, E = 5.0, F = 2.0, G = 1.0, H = 0.5, and 1.2 = 0.2. Next, each natural sentence that contains the search of step 102 is selected from the corresponding translation database. The natural sentences are used to search the original sentence and any vocabulary between each vocabulary in the natural sentence. The consistent vocabulary is used to calculate the sum of the first estimated values provided to each found consistent vocabulary. The sum of the first estimated values is assumed to be the degree of agreement (the sum of the first estimated values), and in accordance with the decreasing order of the degree of agreement of each related natural sentence, the retrieval of each natural sentence selected is retrieved. translate sentence. For example, when there are natural sentences 1 to 5 arranged by the following vocabulary selected by searching (where the vocabulary X is an arbitrary vocabulary): natural sentence 1 = (D, X, X, E, F, G) Natural sentence 2 = (D, X, E, F, G) Natural sentence 3 = (D, E, F, G) Natural sentence 4 = (A, B, X, C, X, D) Natural sentence 5 2 ( D, E, F, G, H, J) The sum of the first estimated values of each natural sentence: Natural sentence 1: 1 0 · 0 (Word D) + 0 (Word X) + 0 (Word X) + 5.0 (word vocabulary E) +2.0 (word vocabulary F) + 1 · 0 (word vocabulary G) = 18.0 natural sentence 2: 1 0 · 0 (word vocabulary D) + 0 (word vocabulary X) + 5 · 0 (word vocabulary E) + 2 · 0 (word vocabulary F) + 1. 0 (word vocabulary G) = 18 · 0 200530856 V. Description of the invention (55)

自然句子3 : 1 0. 0 (字彙D) + 5. 0 (字彙E) + 2 . 0 (字彙F )+ 1· 0 (字彙 G)= 18. 0 自然句子4 : 1 · 0 (字彙A) + 2 · 0 (字彙B) + 0 (字彙X)Natural Sentence 3: 1 0. 0 (Word D) + 5. 0 (Word E) + 2. 0 (Word F) + 1 0 (Word G) = 18. 0 Natural Sentence 4: 1 · 0 (Word A ) + 2 · 0 (Word B) + 0 (Word X)

+ 5.0(字彙 C) +0(字彙 X) +10.0(字彙 D)二 18.0 自然句子5: 10.0(字彙D) +5.0(字彙E) +2.0(字彙F )+1.0(字彙 G) +0.5(字彙 H) +0.2(字彙 J) =18.7 其中第一估計數值之總合(一致之程度)之自然句子5係 最大。 定義第一估計數值,使得當原始句子中翻譯片語以及除了 原始句子中翻譯片語外之每個字彙減少時,數值也隨之增 加。因此,如上所述,基於第一估計數值之總合,並且評 估每個包含翻譯片語之自然句子,上述自然句子包含靠近 於原始句子中翻譯片語之位置之很多字彙,換言之評估自 然句子具有包含很多包括原始句子中之翻譯片語之片語, 以及包括靠近翻譯片語之字彙之高可能性,以作為具有較 高一致程度之自然句子。 在上述實施例中,自然句子1到4具有相同一致之程度之數 值(第一估計數值之總合)。當複數個自然句子基於存在 於第一估計數值上而具有相同一致之程度之數值時,一致 的字彙與基於第一估計數值提供之第二估計數值之具有相 同一致之程度之數值之每個字彙之間之原始句子中之任何 字彙一致,第二估計數值係根據每個自然句子上之翻譯片+ 5.0 (word vocabulary C) +0 (word vocabulary X) +10.0 (word vocabulary D) 2 18.0 Natural sentence 5: 10.0 (word vocabulary D) +5.0 (word vocabulary E) +2.0 (word vocabulary F) +1.0 (word vocabulary G) +0.5 ( The vocabulary H) +0.2 (the vocabulary J) = 18.7 Of which the sum of the first estimated values (the degree of consistency) of the natural sentence 5 is the largest. The first estimated value is defined so that as the translation phrase in the original sentence and each vocabulary other than the translation phrase in the original sentence decrease, the value also increases. Therefore, as described above, based on the sum of the first estimated values, and each natural sentence containing the translated phrase is evaluated, the above natural sentence contains many vocabularies close to the position of the translated phrase in the original sentence. In other words, the estimated natural sentence has Contains many phrases that include translated phrases from the original sentence, as well as a high probability of including words that are close to the translated phrase, as natural sentences with a high degree of consistency. In the above embodiment, the natural sentences 1 to 4 have values of the same degree of agreement (the sum of the first estimated values). When a plurality of natural sentences have the same degree of consistency based on the values existing on the first estimated value, each word of the same vocabulary has the same degree of consistency as the second estimated value provided based on the first estimated value. Any vocabulary in the original sentence is consistent, the second estimated value is based on the translation piece in each natural sentence

第62頁 200530856 五、發明說明(56) 語中之差異(字彙之數目)而提供。也定義第二估計數 值,使得當每個原始句子中之翻譯片語增加造成差異時, 用於每個原始句子中之翻譯片語之第二估計數值為最大 (以作為增加存在字彙中間之數目)◦因此,基於第一估 計數值,計算提供每個具有相同一致之程度之數值之自然 句子中之每個一致的字彙之第二估計數值之總合,並且基 於第一估計數值,依照每個相關的自然句子之第二估計數 值之總合之遞減的次序(也包括關於本發明之一致之程度 ),而顯示每個具有相同一致之程度之數值之自然句子之 翻譯0 第示 供所 提列 字如 的, 致合 一總 個之 每值 中數 4 JnJ. 計 1估 子二 句第 然到 自得 述以 上用 在, ,值 如數 例計 為數 值之 數囊 計字 估間 二之 第語 之片 古口 古口 譯當 翻得 設使 假, , 值 中數 例計 施估 實二 列第 下義 加 增 估在定Page 62 200530856 V. Description of the invention (56) Differences in terms (number of words) are provided. The second estimated value is also defined so that when the increase in translation phrases in each original sentence causes a difference, the second estimated value for the translated phrases in each original sentence is the largest (as an increase in the number of words in the middle ) ◦ Therefore, based on the first estimated value, calculate the sum of the second estimated value of each consistent vocabulary in each natural sentence that provides a numerical value of the same degree of consistency, and based on the first estimated value, according to each The descending order of the sum of the second estimated values of the related natural sentences (including the degree of agreement with the present invention), and the translation of each natural sentence with a value of the same degree of agreement is shown. The numbers are as follows, each of the total values of the unity is 4 JnJ. The first sentence of the first sentence is counted to the content statement, and the value is used as the example of the number. The interpretation of Yuguchi Piangugugu should be set to false, and the value is estimated in several cases.

4少 、減 3 之 2隨 、也 r ▲ ο、值: ο數· 如計2 例估G 第 之 彙 字 的 致 - 他 其 設 假 時 如 例 然 然 + 1i ΠΖ f-tf 字 f 〒字-fci1 ο 句 4. ο .G )ο I 囊 3: F 囊 D)彙D)字D) 彙字彙彳彙 字{字.字 彙 字 彙 字4 少, minus 2 of 3, also r ▲ ο, value: ο number · If count 2 examples to estimate the confluence of the G number-as usual when he set a fake + 1i ΠZ f-tf 字 f 〒 字-fci1 ο Sentence 4. ο .G) ο I capsule 3: F capsule D) sink D) word D) huizihui 彳 huizi {字. 字 汇 字 汇 字

G + 字 0)2+ G + ,、+) 彙 E \)y X 字 X 5 囊 + X 彙 字G + word 0) 2+ G +, +) sink E \) y X word X 5 capsule + X sink word

E 彙 字 F 彙 字E sink word F sink word

第63頁 200530856 五、發明說明(57) 自然句子4: 0·2(字彙A) +0·5(字彙B) +0(字彙X) + 2.0(字彙 C) +0(字彙 X) +10.0(字彙 D) =12.7 因此,依照第二估計數值之總合之遞減的次序而顯示自然 句子1到4 (翻譯句子),也就是依照自然句子之次序3、 2、4、1 〇 定義第二估計數值,當每個自然句子中之字彙以及每個自 然句子中之翻譯片語之間之原始句子中之任何字彙一致之 一致之字彙之間之差異減少時,使得數值也隨之增加。因 此,自然句子包含與原始句子一致之較大字彙數目,並且 在於一致的字彙之間盡可能不包含不一致的字彙,也就是 說,基於第二評估數值之總合,藉由評估每個包含翻譯片 語之自然句子,自然句子具有評估包含原始句子中翻譯片 語之自然句子,以及片語可能包含接近翻譯片語之字彙, 以作為具有較高一致之程度之自然句子。 例如藉由下列的程序,可以了解在自然句子與原始句子中 字彙之排列次序之相似度之評估,以及考量存在於原始句 子與一致的字彙之間之自然句子中不一致字彙之數目之一 致之程度: 從存在於原始句子中翻譯片語之上邊(upstream side) 中之一前面的字彙君_ ( upstream word group)中,撿選 與翻譯片語(此處,一前面的字彙鄰近一翻譯片語)具有 200530856 五、發明說明(58) 最短字距(distance)之一前面的字彙,並且從存在於原 始句子中翻譯片語之下邊(downstream side)中之一後 面的字彙群(downstream word group)中,撿選與翻譯 片語具有最短字距之一後面的字彙。接著,翻譯片語存在 之位置,假設為檢索所撿選之包含翻譯片語之每個自然句 子中之參考位置(當翻譯片語存在於自然句子中很多複數 個地方時,存在於任何翻譯片語之位置假設為參考位置 ),從參考位置到用於已經撿選前面的字彙之上邊,用以 搜尋具有特定字彙(例如具有三個字彙)之數目之前面的 區域,並且從參考位置到用於已經撿選後面的字彙之下 邊,用以搜尋具有特定字彙(例如具有三個字彙)之數目 之後面的區域。提供上述搜尋所發現之前面的字彙與後面 的字彙中之自然句子,用於定義第三估計數值(the third assessment value),使得當參考位置與前面的字 彙之間以及參考字彙與後面的字彙之間之字距減少時,數 值也隨之增加。 撿面 ,前 中在 子 ’ 句時 始此 原C 在距 及字 以的 選短 撿最 中之 彙有 字具 的群 面彙 前字 從的 有面 沒前 還與 ,有 是具 於選 用’ 使選 中撿 索中 檢群 述囊 上字 C的 彙面 字後 一 從 單有 一 沒 在還 存且 間並 之, 語} 片 } 譯彙 翻字 與的 彙面 字前 的之Page 63 200530856 V. Description of the invention (57) Natural sentence 4: 0 · 2 (word vocabulary A) + 0 · 5 (word vocabulary B) +0 (word vocabulary X) + 2.0 (word vocabulary C) +0 (word vocabulary X) +10.0 (Word D) = 12.7 Therefore, the natural sentences 1 to 4 are displayed according to the descending order of the sum of the second estimated values (translated sentences), that is, according to the order of natural sentences 3, 2, 4, and 10. Define the second The estimated value will increase as the difference between the vocabulary in each natural sentence and the consistent vocabulary of any vocabulary in the original sentence that is consistent between the translated phrases in each natural sentence decreases. Therefore, a natural sentence contains a larger number of vocabulary consistent with the original sentence, and there is as little as possible inconsistent vocabulary between the consistent vocabulary, that is, based on the sum of the second evaluation values, by evaluating each containing translation The natural sentence of the phrase, the natural sentence has a natural sentence that includes the translated phrase in the original sentence, and the phrase may contain a vocabulary close to the translated phrase as a natural sentence with a high degree of consistency. For example, the following procedures can be used to understand the evaluation of the similarity of the order of the vocabulary in the natural sentence and the original sentence, and to consider the degree of agreement between the number of inconsistent vocabulary in the natural sentence that exists between the original sentence and the consistent vocabulary. : Pick and translate phrases from one of the upstream words group that exists in the upstream side of the original phrase in the original sentence (here, a previous word is adjacent to a translation phrase ) Have 200530856 V. Description of the invention (58) The vocabulary in front of one of the shortest distances, and the downstream word group from one of the downstream sides of the translation phrase existing in the original sentence In, pick and translate phrases that have one of the shortest kerning. Next, the location where the translation phrase exists is assumed to be a reference position in each natural sentence that contains the selected translation. (When a translation phrase exists in many plural places in a natural sentence, it exists in any translation sentence. The position of the language is assumed to be the reference position), from the reference position to the top of the vocabulary for which the previous vocabulary has been selected, to search for an area with a specific number of vocabularies (for example, with three vocabularies), and from the reference position to the Below the selected vocabulary, search for the area behind the number of specific vocabularies (for example, three vocabularies). Provide the natural words in the previous and subsequent vocabulary found in the search above to define the third assessment value, so that when the reference position and the previous vocabulary and between the reference vocabulary and the subsequent vocabulary As the space between words decreases, the value increases. Picking the face, the beginning of the middle and the beginning of the sub sentence, the original C in the short distance and the short of the word to pick the middle of the group of words with the face of the group face before the word from the face is not yet with, there are optional 'After selecting the Hui face word of the character C on the search group, the word “C” has not existed and has been merged with each other.} 片} Translate the words before the Hui face word

的字發 短一索 最單檢 之一之 有在前 具存先 群間從 彙之。 字語} 的片 } 面譯彙 後翻字 與與的 有彙面 具字後 選的之 撿面用 中後使 子在中 句,索 始時檢 原此述 在彳上 及距{ 以字囊One of the most single inspections is the first one, the first one, and the one from the group.字 的 的 的 片} Interpretation of the vocabulary and translation of the word with the confluence of the face and the word of the choice of picking the noodles with the middle and the post in the middle of the sentence, the original inspection of the original description on the 及 and the distance {with the word capsule

第65頁 200530856 五、發明說明(59) 現之前面的字彙之位置,到先前檢索發現之前面的字彙與 後面的字彙之每個自然句子中之上邊之具有特定字彙之數 目之區域(例如具有三個字彙)中,搜尋先前檢索之前面 的字彙,並且從先前之檢索發現之後面的字彙之位置到先 前檢索發現之前面的字彙與後面的字彙之每個自然句子中 之下邊,具有特定字彙之數目之區域搜尋先前檢索之後面 的字彙。提供這次的搜尋發現至少前面的字彙與後面的字 彙之自然句子,用以定義第三估計數值,使得當先前的搜 尋發現之前面的字彙與後面的字彙之位置,以及這次的搜 尋發現之前面的字彙與後面的字彙之位置之間之字距減少 時,數值也隨之增加。 可以使用只根據先前的搜尋發現之前面的字彙與後面的字 彙之位置,以及這次的搜尋發現之前面的字彙與後面的字 彙之位置之間之字距,同等定義之一估計數值,如前所 述。然而,考量原始句子中翻譯片語以及所搜尋之前面的 字彙或後面的字彙之間之字距,可以定義第三估計數值, 使得當原始句子中所搜尋之前面的字彙或後面的字彙,以 及翻譯片語之間之字距減少時,第三估計數值上之前面的 字彙或後面的字彙之影響也隨之更增加。取代定義第三估 計數值,使得當先前的搜尋發現之前面的字彙與後面的字 彙之位置,以及這次的搜尋發現之前面的字彙與後面的字 彙之位置之間之字距減少時,數值也隨之增加,如前所 述,可以定義第三估計數值,使得當上述參考位置以及這Page 65 200530856 V. Description of the invention (59) The position of the previous vocabulary, to the previous search and found the natural vocabulary of the previous vocabulary and the following vocabulary in each natural sentence with the number of specific vocabulary areas (such as Three vocabularies), search for the previous vocabulary of the previous search, and from the position of the previous vocabulary of the previous search to the previous search and previous natural vocabulary and the following vocabulary in each natural sentence, with a specific vocabulary The number of fields is searched for the word after the previous search. Provide this search to find natural sentences with at least the preceding vocabulary and the following vocabulary, to define the third estimated value, so that when the previous search finds the position of the preceding vocabulary and the following vocabulary, and this search finds the previous As the kerning between the vocabulary and the position of the subsequent vocabulary decreases, the value also increases. You can use one of the same definitions to estimate the value based on the previous search and the position of the next word, and the distance between the previous word and the position of the next word. Described. However, considering the gap between the translated phrase in the original sentence and the word vocabulary before or after the search, a third estimated value can be defined so that when the word vocabulary before or after the word search in the original sentence, and As the kerning between translated phrases decreases, the influence of the preceding or following vocabulary on the third estimate will increase. Instead of defining the third estimated value, when the previous search finds the position of the previous word and the subsequent word, and the current search finds that the kerning between the previous word and the position of the subsequent word decreases, the value also changes with The increase, as mentioned earlier, can define a third estimated value such that when the above reference position and this

第66頁 200530856 五、發明說明(60) 次發現之前面的字彙與後面的字彙之位置之間之字距減少 時,第三估計數值也隨之增加。 從原始句子中依照翻譯片語到重複上述程序之遞減的次序 之距離,撿選前面的字彙與後面的字彙,直到從原始句子 中不再有任何可以檢選之前面的字彙與後面的字彙。接 著,計算第三估計數值之總合提供給每個自然句子以及假 設一致之程度,並且依照每個相對應的自然句子之一致之 程度之遞減的次序(第三估計數值之總合),而顯示檢索 所撿選之每個自然句子之翻譯。 舉例而言,當一原始句子包含五個字彙A到E (原始句子= (A、B、C、D、E)),並且具體說明原始句子中之第三 個字彙C以作為翻譯片語,從原始句子中檢選第一字彙B與 D以分別作為一上邊的字彙與一下邊的字彙,以從前面的 字彙之翻譯片語到上邊中搜尋具有特定數目之字彙之區域 (例如具有三個字彙),並且從檢索相對應的翻譯資料庫 t撿選之每個包含翻譯片語之自然句子中之後面的字彙之 翻譯片語到下邊,用以搜尋具有特定字彙之數目之區域 (例如具有三個字彙)以提供第三估計數值給自然句子中 發現前面的字彙與後面的字彙。於是,從原始句子中檢選 第一字彙A與E以分別作為一上邊的字彙與一下邊的字彙, 從前面的字彙A之翻譯片語到上邊以搜尋具有特定字彙之 數目之區域(例如具有三個字彙),並且從自然句子中所Page 66 200530856 V. Description of the invention (60) When the kerf between the previous vocabulary and the position of the subsequent vocabulary was found to decrease, the third estimated value also increased. From the original sentence in accordance with the decreasing order of repeating the above procedure, select the previous vocabulary and the subsequent vocabulary, until there is no longer any original vocabulary and subsequent vocabulary that can be selected from the original sentence. Next, the total of the third estimated value is calculated and provided to each natural sentence and the degree to which the hypothesis is consistent, and in accordance with the decreasing order of the degree of consistency of each corresponding natural sentence (the total of the third estimated value), and Display the translation of each natural sentence retrieved. For example, when an original sentence contains five words A to E (original sentence = (A, B, C, D, E)), and the third word C in the original sentence is specified as a translation phrase, Select the first vocabulary B and D from the original sentence as an upper vocabulary and a lower vocabulary, respectively, to search the area with a specific number of vocabularies from the translation of the previous vocabulary to the upper (for example, three Vocabulary), and from the corresponding translation database t to retrieve the translation phrase of each subsequent natural vocabulary in the natural sentence containing the translation phrase to search below for a region with a specific number of vocabulary (for example, (Three vocabularies) to provide a third estimate to the preceding vocabulary and subsequent vocabulary found in natural sentences. Therefore, the first vocabulary A and E are selected from the original sentence as an upper vocabulary and a lower vocabulary, respectively, from the translation phrase of the previous vocabulary A to the upper to search for a region with a specific number of vocabularies (for example, the Three words), and from the natural sentence

第67頁 200530856 五、發明說明(61) 發現前面的字彙B與後面的字彙D之後面的字彙E之後面的 字彙D到下邊,用以搜尋具有特定字彙之數目之區域(例 如具有三個字彙),以提供第三估計數值給自然句子發現 之前面的字彙A與後面的字彙E。 提供上述的程序給下列十個自然句子(a)到(k),並且 依照第三估計數值之總合之遞減的次序來排列句子,如下 列所示,其中字彙X與「/」分別代表一任意的字彙以及標 點符號。 1. 自然句子(a)二(A、B、C、D、E) 2. 自然句子(b) = (A、B、C、D、/、B、C、D、E) 3. 自然句子(c) = ( B、C、D) 4·自然句子(d) = ( A、B、C、X、D、E) 5. 自然句子(e) = ( A、B、C、X、D) 6. 自然句子(f) = ( A、B、C、/、C、D、E) 7·自然句子(g) = (A、X、B、C、X、D) 8. 自然句子(h) = (B、C、/、C、D) 9. 自然句子(j) = ( B、X、C、/、C、X、D) 10. 自然句子(k)二(B、X、X、C、/、C、X、 X、D) 如上述提供第三估計數值給每個包含翻譯片語之自然句 子,並且基於第三估計數值之總合而評估,自然句子與原 始句子之字彙之排列次序具有高相似度,並且不一致字彙 之數目盡可能不存在於與原始句子一致之字彙之間,也就 是說,可以評估包含盡可能不包括不一致字彙之自然句子Page 67 200530856 V. Description of the invention (61) The preceding word vocabulary B and the following word vocabulary D are followed by the word vocabulary E and the word vocabulary D behind is used to search for a region with a specific number of word vocabularies (for example, with three word vocabularies) ) To provide a third estimate to the natural vocabulary A and the subsequent vocabulary E. Provide the above procedure to the following ten natural sentences (a) to (k), and arrange the sentences in descending order of the sum of the third estimated values, as shown below, where the vocabulary X and "/" represent one Arbitrary words and punctuation. 1. Natural sentence (a) two (A, B, C, D, E) 2. Natural sentence (b) = (A, B, C, D, /, B, C, D, E) 3. Natural sentence (C) = (B, C, D) 4. Natural sentences (d) = (A, B, C, X, D, E) 5. Natural sentences (e) = (A, B, C, X, D ) 6. Natural sentences (f) = (A, B, C, /, C, D, E) 7. Natural sentences (g) = (A, X, B, C, X, D) 8. Natural sentences ( h) = (B, C, /, C, D) 9. Natural sentence (j) = (B, X, C, /, C, X, D) 10. Natural sentence (k) 2 (B, X, X, C, /, C, X, X, D) As mentioned above, the third estimated value is provided to each natural sentence containing the translated phrase, and it is evaluated based on the sum of the third estimated value. The order of the vocabulary has a high degree of similarity, and the number of inconsistent vocabularies does not exist as much as possible between the vocabulary consistent with the original sentence, that is, it is possible to evaluate natural sentences that include as little as possible inconsistent vocabulary.

第68頁 200530856 五、發明說明(62) 之情況之原始句子之自然句子,以作為具有較高一致之程 度之自然句子。 可以了解在自然句子與原始句子之間評估考量字彙之排列 次序之相似度,以及存在與原始句子一致之字彙之間之自 然句子中不一致字彙之數目,例如下列的程序: 首先,從存在於原始句子中之翻譯片語(此次之前面的字 彙鄰近翻譯片語)上邊之前面的字彙群中,檢選與翻譯片 語具有最短距離之前面的字彙,於是假設存在於自然句子 之翻譯片語之位置為包含從檢索相對應的翻譯資料庫中所 撿選之翻譯片語之每個自然句子之參考位置(當存在於自 然句子中每個複數個地方中之翻譯片語時,假設存在於任 一翻譯片語中之位置為參考位置),從參考位置到上邊之 具有字彙之特定數目之區域,搜尋先前所撿選之前面的字 彙。程序由字距計算字彙之數目(更具體,由字距計算第 一字彙之數目以及由字距計算第二字彙之數目),並且測 量上述檢索中發現的前面的字彙之自然句子中參考位置與 前面的字彙之間之字距,而根據每個測量的距離以由字距 決定第一字彙之數目。Page 68 200530856 V. The natural sentence of the original sentence in the case of invention description (62), as a natural sentence with a high degree of consistency. You can understand the similarity of the order in which the vocabulary is considered between the natural sentence and the original sentence, and the number of inconsistent vocabularies in natural sentences that exist between the vocabulary consistent with the original sentence. For example, the following procedure: First, from the original In the translation phrase in the sentence (this time the previous vocabulary is adjacent to the translation phrase), the previous vocabulary group is selected to have the shortest distance from the translated vocabulary, so it is assumed that the translation phrase exists in the natural sentence The position is the reference position of each natural sentence containing the translation phrase selected from the corresponding translation database (when the translation phrase exists in each of the plural places in the natural sentence, it is assumed to exist in The position in any translation phrase is the reference position). From the reference position to the specific number of areas above it, search for the previously selected word. The program calculates the number of vocabulary from kerning (more specifically, the number of first vocabulary from kerning and the number of second vocabulary from kerning), and measures the reference position and The kerning between the previous vocabulary, and the number of the first vocabulary is determined by the kerning based on each measured distance.

第69頁 200530856 五、發明說明(63) 從先前之搜尋中發現所撿選之前面的字彙之位置到先前搜 尋中發現之前面的字彙之每個自然句子中之上邊之具有特 定字彙之數目之區域(例如具有三個字彙)中,搜尋先前 檢選之前面的字彙,並且測量在先前搜尋中發現之前面的 字彙與這次發現之前面的字彙之間位置之字距,以及在前 先前的搜尋中發現之前面的字彙與參考位置之間之字距, 加上所測量的字距,得到在這次搜尋中發現的前面的字彙 與參考位置之間之字距,並且根據每次所得到的距離以由 字距決定第一字彙之數目。依照原始句子中從翻譯片語之 字距之遞減的次序檢選前面的字彙,並且重複上述程序, 直到原始句子中不再有任何可撿選的前面的字囊。 舉例而言,當一原始句子包含五個字彙A到E (原始句子= (A、B、C、D、E)),以及具體說明原始句子中第四個 字彙D以作為翻譯片語,首先從原始句子中所撿選之字彙C 作為前面的字彙,並且在包含檢索相對應的翻譯資料庫中 所撿選之翻譯片語之自然句子中,從翻譯片語到上邊之具 有特定字彙之數目之區域搜尋前面的字彙,由所發現字彙 C之自然句子中之字距來決定第一字彙之數目。由此處之 字距以計算第一字彙之數目之結果的例子,如下列表格一 所顯示: 表格一、在檢索字彙C之時期的字距,計算字彙之數目之 結果的例子。Page 69 200530856 V. Description of the invention (63) The number of words with a specific word above each natural sentence from the position where the selected previous word was found in the previous search to the previous word found in the previous search Area (for example, with three vocabularies), search for the previous vocabulary of the previous check, and measure the kerning between the position of the previous vocabulary found in the previous search and the previous vocabulary found this time, and the previous previous search Find the kerning between the previous vocabulary and the reference position, and add the measured kerning to get the kerning between the previous vocabulary and the reference position found in this search, and according to the distance obtained each time The number of first words is determined by the kerning. Check the previous vocabulary in descending order of the kerning of the translation phrases in the original sentence, and repeat the above procedure until there are no longer any optional preceding sacs in the original sentence. For example, when an original sentence contains five vocabularies A to E (original sentence = (A, B, C, D, E)), and the fourth vocabulary D in the original sentence is specified as a translation phrase, first The vocabulary C selected from the original sentence is used as the previous vocabulary, and among the natural sentences containing the translation phrases selected in the corresponding translation database, the number of specific vocabularies from the translation phrase to the top The area searches for the previous vocabulary, and the number of the first vocabulary is determined by the kerning in the natural sentence of the found vocabulary C. Here is an example of the result of calculating the number of the first vocabulary, as shown in the following table. Table 1. An example of the result of calculating the number of vocabulary when the kerning of the word vocabulary C is retrieved.

第70頁 200530856 五、發明說明(64) 自然句子 字彙之字距 之一例子 字距=0 1 2 3 CD 1 0 0 0 CXD 0 1 0 0 CXXD 0 0 1 〇 前字 先的 在面 ,前 彙從 字’ 的中 面子 前句 為然 作 B 彙 字彙 選字 檢的 中面 子前 句現 始發 原所 從中 ’尋 著搜 接的 之 c$域 彙區 字 定 特 有 具 之 邊 彙 字 的 面 前 尋 搜 '子 之7 句 \)y ε然 彙 自 字 _ 3之 個 Β 三彙 ,字 如的 例面 C前 目現 數發 之由 彙, Β 字 之二 處格 此表 由列 。下 目如 數, 之子 彙例 字的 一果 第結 定之 決目 以數 距之 字彙 之字 中一 顯 第 算 計 · · 以示 表格二、在檢索字彙B之時期的字距,計算字彙之數目之 結果的例子。Page 70 200530856 V. Description of the invention (64) An example of the kerning of the natural vocabulary kerning = 0 1 2 3 CD 1 0 0 0 CXD 0 1 0 0 CXXD 0 0 1 〇 The preceding word comes first, before The middle face sentence of Hui Congzi 'is the B face of the middle face before the word selection of the word check. The original c $ domain of Huiyuan District, which has been searched for, has a unique edge of the word. Look for "Zi 7 sentences" in front of you \) y ε Ran Hui from the word _ 3 of the B three hui, the example of the word such as C before the current number issued by the hui, two words in the Β word are listed in this table. The number of words is as follows. The final result of the word vocabulary is determined by the number of words in the word space. · Table 2 is used to calculate the word space during the period of searching word word B. Calculate the number of word words. Examples of results.

第71頁 200530856 五、發明說明(65) 自然句子 字彙之字距 之一例子 字距=0 1 2 3 4 BCD 2 0 〇 0 〇 BXCD 1 1 0 0 0 BXXCD 1 0 1 〇 0 BCXD 0 2 0 0 0 BXCXD 0 1 1 0 0 BXXCXD 0 1 0 1 0 BCXXD 0 0 2 0 0 BXCXXD 0 0 1 1 0 BXXCXXD 0 0 1 0 1 更進一步,從原始句子中檢選字彙A作為前面的字彙,在 先前的搜尋中所發現前面的字彙B之自然句子中,從前面 的字彙B到上邊之具有特定字彙之數目(例如,三個字彙 )之區域搜尋前面的字彙A,由發現前面的字彙A之自然句 子中之字距以決定第一字彙之數目。由此處之字距以計算 第一字彙之數目之結果的例子,如下列表格三所顯示: 表格三、在檢索字彙A之時期的字距,計算字彙之數目之Page 71 200530856 V. Description of the invention (65) An example of the kerning of a natural sentence kerning kerning = 0 1 2 3 4 BCD 2 0 〇0 〇BXCD 1 1 0 0 0 BXXCD 1 0 1 〇0 BCXD 0 2 0 0 0 BXCXD 0 1 1 0 0 BXXCXD 0 1 0 1 0 BCXXD 0 0 2 0 0 BXCXXD 0 0 1 1 0 BXXCXXD 0 0 1 0 1 Further, check the vocabulary A from the original sentence as the previous vocabulary. Previously, In the natural sentence of the previous vocabulary B found in the search, search for the previous vocabulary A from the previous vocabulary B to the area with the number of specific vocabularies (for example, three vocabulary) above. The kerning in the sentence determines the number of first characters. An example of the result of the kerning here is to calculate the number of the first vocabulary, as shown in Table 3 below. Table 3. Calculating the kerning of the period of the vocabulary A, and calculating the number of vocabulary.

第72頁 200530856 五、發明說明(66) 結果的例子。 自然句子 字彙之字5 之一例子 字距=0 1 2 3 4 5 6 ABCD 3 0 0 0 0 0 0 AXBCD 2 1 0 0 0 0 0 AXXBCD 2 0 1 0 0 0 〇 ABXCD 1 2 0 0 0 0 〇 AXBXCD 1 1 1 0 0 〇 0 AXXBXCD 1 1 0 1 0 0 0 ABXXCD 1 0 2 0 0 0 0 AXBXXCD 1 0 1 1 0 〇 0 AXXBXXCD 1 0 1 0 1 0 0 ABCXD 0 3 0 0 0 0 0 AXBCXD 0 2 1 0 〇 0 0 AXXBCXD 0 2 0 1 0 0 0 ABXCXD 0 1 2 0 0 0 〇 AXBXCXD 0 1 1 1 0 0 0Page 72 200530856 V. Description of the invention (66) Examples of results. An example of a natural sentence vocabulary of 5: kerning = 0 1 2 3 4 5 6 ABCD 3 0 0 0 0 0 0 AXBCD 2 1 0 0 0 0 0 AXXBCD 2 0 1 0 0 0 〇 ABXCD 1 2 0 0 0 0 〇AXBXCD 1 1 1 0 0 〇0 AXXBXCD 1 1 0 1 0 0 0 ABXXCD 1 0 2 0 0 0 0 AXBXXCD 1 0 1 1 0 〇0 AXXBXXCD 1 0 1 0 1 0 0 ABCXD 0 3 0 0 0 0 0 AXBCXD 0 2 1 0 〇0 0 AXXBCXD 0 2 0 1 0 0 0 ABXCXD 0 1 2 0 0 0 〇AXBXCXD 0 1 1 1 0 0 0

200530856 五、發明說明(67) 1 AXXBXCXD ! 0 1 1 1 1 0 1 1 1 0 _ 1 0 ABXXCXD 0 1 0 2 0 0 0 AXBXXCXD 0 1 0 1 1 0 0 AXXBXXCXD 0 1 0 1 0 1 0 ABCXXD 0 0 3 0 0 0 0 AXBCXXD 0 0 2 1 0 . 0 0 AXXBCXXD 0 0 2 0 1 0 0 ABXCXXD 0 0 1 2 0 0 0 AXBXCXXD 0 0 1 1 •1 0 0 AXXBXCXXD 0 0 1 1 0 1 0 ABXXCXXD 0 0 1 0 2 0 0 AXBXXCXXD 0 0 1 0 1 1 0 AXXBXXCXXD 0 0 1 0 1 0 1200530856 V. Description of the invention (67) 1 AXXBXCXD! 0 1 1 1 1 1 0 1 1 1 0 _ 1 0 ABXXCXD 0 1 0 2 0 0 0 AXBXXCXD 0 1 0 1 1 0 0 AXXBXXCXD 0 1 0 1 0 1 0 ABCXXD 0 0 3 0 0 0 0 AXBCXXD 0 0 2 1 0. 0 0 AXXBCXXD 0 0 2 0 1 0 0 ABXCXXD 0 0 1 2 0 0 0 AXBXCXXD 0 0 1 1 • 1 0 0 AXXBXCXXD 0 0 1 1 0 1 0 ABXXCXXD 0 0 1 0 2 0 0 AXBXXCXXD 0 0 1 0 1 1 0 AXXBXXCXXD 0 0 1 0 1 0 1

IRlilll 第74頁 200530856 五、發明說明(68) 不同於前述之第一字彙之數目之一方法中之檢索,在包含 相對應的翻譯資料庫中所撿選之翻譯片語之每個自然句子 中測量字彙之間之字距,由加總之字距以作為由字距之第 二字彙之數目。更具體而言,在原始句子中前面的字彙群 之間,忽略鄰近翻譯片語之前面的字彙之存在的情況下 (把包含字彙C之每個自然句子視為一任意的字彙X之不一 致),與原始句子中之前面的字彙群所撿選之翻譯片語具 有最短之字距之前面的字彙,以及假設存在自然句子中之 翻譯片語之位置,為包含檢索相對應的翻譯資料庫中所撿 選之翻譯片語之每個自然句子之參考位置,以搜尋所撿選 之前面的字彙中,從參考位置到上邊之具有特定數目(例 如,三個字彙)之區域。在先前的搜尋中所發現之前面的 字彙之自然句子中,測量參考位置與前面的字彙之字距 (更具體而言,存在於參考位置與前面的字彙之間不一致 字彙之數目),並且假設由「1」加到所測量的字距得到 之字距為參考位置之字距,根據每個參考位置之字距由字 彙以決定第二字彙之數目。 還沒有從前面的字彙中撿選,以及從前面的字彙群中撿選 與翻譯片語具有最短的字距,並且從先前搜尋中發現之前 面的字彙之位置中之具有特定字彙之數目之區域,搜尋先 前搜尋中發現之前面的字彙之位置中之每個自然句子中之 先前所撿選之前面的字彙,在這次搜尋中發現前面的字彙 之自然句子中,測量在先前搜尋中發現前面的字彙以及這IRlilll Page 74 200530856 V. Description of the invention (68) Search in a method different from the number of the first vocabulary mentioned above, in each natural sentence containing the translation phrase selected in the corresponding translation database Measure the kerning between the kerics, and add the kerning to the number of the second kerning. More specifically, in the case where the existence of the vocabulary before the adjacent translation phrase is ignored between the preceding vocabulary groups in the original sentence (every natural sentence containing the vocabulary C is regarded as an inconsistency of an arbitrary vocabulary X) , The translation phrase selected by the previous vocabulary group in the original sentence has the shortest kerning distance, and the position of the translation phrase assumed to exist in the natural sentence is the translation database corresponding to the search The reference position of each natural sentence of the selected translation phrase is used to search for the area with a specific number (for example, three words) from the reference position to the top of the selected previous vocabulary. In previous natural words found in previous searches, measure the kerning between the reference position and the previous word (more specifically, the number of inconsistent words between the reference position and the previous word), and assume The kerning obtained by adding "1" to the measured kerning is the kerning of the reference position. According to the kerning of each reference position, the kerning is used to determine the number of the second kerning. Has not selected from the previous vocabulary, and selected from the previous vocabulary group and has the shortest kerning distance with the translation phrase, and found the previous vocabulary from the previous search in the area with the number of specific vocabulary , Search for the previously selected previous vocabulary in each of the natural sentences in the position where the previous vocabulary was found in the previous search, and in the natural sentence where the previous vocabulary was found in this search, measure the previous one found in the previous search Vocabulary and this

第75頁 200530856 五、發明說明(69) 次發現前面的字彙之間之位置的字距,將先前搜尋中發現 之前面的字彙與參考位置之間之字距加到所測量的字距, 以得到在這次搜尋發現之前面的字彙與參考位置之間之字 距,並且根據所得到的字距,由字距以決定第二字彙之數 目。依照原始句子中翻譯片語之字距之遞減的次序檢選前 面的字彙,並且重複上述程序直到原始句子中不再有任何 可撿選的前面的字彙。Page 75 200530856 V. Description of the invention (69) Find the kerning between the previous vocabulary, add the kerning between the previous vocabulary found in the previous search and the reference position to the measured kerning, and Obtain the kerning between the previous vocabulary and the reference position before the search, and determine the number of the second vocabulary by the kerning according to the obtained kerning. The previous vocabulary is checked in descending order of the kerning of the translated phrases in the original sentence, and the above process is repeated until there is no longer any selectable previous vocabulary in the original sentence.

子 句 始丨 \ly 原 E 1 , D 當 ,C 言 、 B 而 例A 舉C 子ξ 四 句 in第 始中 原 Μ子 Ε 句 _始 原彙W 字 個 五 含 包 說 喱 具 且 並 彙間 字之 語 片 譯 翻 為 作 以 彙 字 的 面 前 之 語 片 譯 # 近 鄰 略 忽 群 彙況 字情 的的 面在 前存 中之 C 子 句 始 原 在 先 首 囊 字 句所 然四 自格 個表 每列 旦里下 測如 序, 依果 A 結 之 量 與J B 淠 彙到 字得 之以 選距 檢字 中之 子置 句位 始考 原參· · ,中示 下子顯 的 果 結 之 距 字 之 目 數 之 彙 字 二 第 於 關 測 量 距 字 由 四A 格子 表例 自然句子 之一例子 B與D之間 的字距 (實際字距+1) (A與B之間 的字距) A與D之間 的字距 (BD+AB) ABCD 2 〇 2 AXBCD 2 1 3 AXXBCD 2 2 4 AB(XCD 3 0 3Clause beginning 丨 \ ly original E 1, D when, C, B, for example, A, C, C, ξ, fourth sentence in the beginning of the Central Plains, M, the sentence _ the beginning of the W, a five-inclusive package, and the union字 之 语 片片 翻翻 为 为 以 字 的 前 之 片片 译 # Nearest neighbor slightly huqunhuihui the face of the character, the C clause in the previous memory, the original first suffix, and the four character forms The order of each column is as follows. The amount of Ego and the result of JB are merged into the word. The test of the original parameter is based on the position of the child in the distance check word. Convergence of the number of eyes is two. The distance measurement is made up of four A. One of the natural sentences on the grid table is an example of the kerning between B and D (actual kerning +1) (kerning between A and B) A Kerning with D (BD + AB) ABCD 2 〇2 AXBCD 2 1 3 AXXBCD 2 2 4 AB (XCD 3 0 3

第76頁 200530856 五、發明說明(70) 基於上述測量結果,由字距決定第二字彙之數目,以得到 由字距計算第二字彙之數目之結果,如表格五所顯示。 表格五由字距計算第二字彙之數目之結果的例子。 自然句子 之一例子 字彙之字距 字距 =0 1 2 3 4 5 6 ABCD • 0 0 2 0 0 0 0 AXBCD 0 0 1 1 0 0 0 AXXBCD 0 0 1 〇 1 0 0 ABXCD 0 0 0 2 0 0 0 如上所述,當在包含翻譯片語之每個自然句子中,完成有 關原始句子中前面的字彙群之由字距計算第一字彙之數目 以及由字距計算第二字彙之數目時,在每個自然句子中, 彙第數 字擇之 二選彙 第距字 算字二 計由第 距從擇 字示選 由顯距 及果字 以結由 目的及 數算以 之計彙 彙且字 字並之 一,小 第果較 算結中 計算目 距計數 字之之 由目彙 較數字 比之一Page 76 200530856 V. Description of the invention (70) Based on the above measurement results, the number of second vocabulary is determined by the kerning to get the result of calculating the number of second vocabulary from the kerning, as shown in Table 5. Table 5 is an example of the result of calculating the number of second words from the kerning. An example of a natural sentence Kerning Kerning = 0 1 2 3 4 5 6 ABCD • 0 0 2 0 0 0 0 AXBCD 0 0 1 1 0 0 0 AXXBCD 0 0 1 〇1 0 0 ABXCD 0 0 0 2 0 0 0 As mentioned above, when in each natural sentence containing a translation phrase, the number of first vocabulary by kerning and the number of second vocabulary by kerning are completed in relation to the preceding vocabulary group in the original sentence, In each natural sentence, the number of digits is selected, the number of digits in the distance is calculated by the number of digits, and the word is selected by the distance. One of the words, and one of the ratios of the head to head in the calculation of the eye distance meter in the small calculation

體 具C 距 更 字 ο 4,之 估 ο 評有 的具 後 最一 L 為、 B作、 以 A 5 /(V 果子 結句 之然 距自 字一 之如 間例 之, 目說 來The form of the C distance is changed to ο 4, the estimate is ο the last one evaluated is L, B is used, and A 5 / (V is the conclusion of the fruit. The distance from the word is as an example.

D 由 及D by and

第77頁 200530856 五、發明說明(71) 字距之第一字彙之數目中字彙之數目3,同時自然句子在 字距0時具有字彙之數目0,在字距1時具有字彙之數目0, 在字距2時具有字彙之數目2,如表格五所顯示。因此,選 擇在字距0時具有最大的字彙之數目之字距之第一字彙之 數目以作為自然句子(A、B、C、D)之前面的字彙群之最 後的評估。上述的實施例應用於本例子,字彙之數目在由 字距之第一字彙之數目以及由字距之第二字彙之數目係為 不相同的◦當在字距0時字彙之數目0彼此相同時,在字距 1、2、…時之字距之字彙之數目依序比較,並且從由字距 選擇第一字彙之數目以及由字距選擇第二字彙之數目,在 相同字距具有較大數目之字彙之數目以作為最後的評估。 由字距計 句子之字 變自然句 (C、A、 距時字彙 字距時字 距中,在 數目為2, 然句子中 之最後的 B、D)視 數目之結 算第二字彙之數目之 彙之排列次序之相似 子中之字彙之部分之 B、D)當在第一字彙 之數目為0,在1之字 彙之數目為1的時候, 0之字距時字彙之數目 在2之字距時字彙之 由字距選擇第二字彙 評估。在上述例子中 為(C、 X、 X、 D)而 果的時候,同時將自 目的,在於正確評估與原始 度,並且也從原始句子中改 次序。例如,自然句子 ?之 字 2 之在 ο 在 ο ,為 中目 距數 字之 之彙 @ 字 數時 之距 字之 之彙 目字 數時 之距 彙字 字之 11 二在 第 時ο 同為 群 彙 字 的 面 前 為 作 〇 ο 以 為目 目數 數之 述 上 在 此 因Page 77 200530856 V. Description of the invention (71) The number of word vocabulary in the number of kerning of the first word is 3, and the natural sentence has the number of vocabulary of 0 when the kerning is 0, and the number of vocabulary of 0 when the kerning is 1, When the kerning is 2, the number of vocabularies is 2, as shown in Table 5. Therefore, the number of the first vocabulary with the largest number of vocabulary when the kerning is 0 is selected as the final evaluation of the vocabulary group preceding the natural sentence (A, B, C, D). The above embodiment is applied to this example. The number of vocabularies is different from the number of kerning first vocabularies and the number of kerning second vocabularies. When the kerning is 0, the number of vocabularies 0 is the same as each other. In the case of kerning 1, the number of kerning at kerning is sequentially compared, and the number of first kerning selected by kerning and the number of second kerning selected by kerning are compared in the same kerning. The number of large vocabularies is used as the final evaluation. From the kerning of the sentence to the natural sentence (C, A, kerning, kerning, kerning, the number is 2, but the last B, D in the sentence), depending on the number of settlement of the number of the second vocabulary The order of the vocabulary is similar to the vocabulary part of the vocabulary. When the number of the first vocabulary is 0, the number of the vocabulary of 1 is 1, and the number of vocabulary is 0. The kerning time of the word is evaluated by the kerning second word. In the case of (C, X, X, D) in the above example, the aim is to correctly evaluate and originality, and to change the order from the original sentence. For example, the natural sentence? The word 2 is in ο in ο, which is the confluence of numbers in the middle distance @ the number of words in the distance of the word in the distance of the word in the distance of 11 in the distance. The same is The word in front of the group is written as οο, because the number of items is described here because

C 第 B 子/Γ/Γ异、 A Al^J/aT 然距L 自字 c 將由子 當估句 ,評然 A Λ c 字C The Bth child / Γ / Γ dissimilarity, A Al ^ J / aT However, the distance L from the word c will be evaluated by the child, and the word A Λ c

D 之為 彙視D is the view

第78頁 200530856 五、發明說明(72) (X、A、B (移除掉字彙C) 、D)而評估由字距計算第二 字彙之數目之結果。因此,評估上述自然句子之包含於較 高完整性(integrity)之情況中之原始句子。 當執行上述用於包含檢索相對應的翻譯資料庫中所撿選之 翻譯片語之每個自然句子的程序,以決定最後的評估時, 使用存在於每個上述的自然句子之原始句子中之翻譯片語 之下邊中之後面的字彙群,執行相似的程序(由字距計算 第一字彙之數目以及由字距計算第二字彙之數目,並且關 於後面的字彙群之最後的評估之決定)。 於是,計算每個自然句子得到前面的字彙群之最後的評估 以及後面的字彙群之最後的評估(由相同的字距,假設每 個最後的評估中由字距計算之字彙之數目),以得到每個 自然句子之一廣泛的評估。基於每個自然句子之廣泛的評 估,依照字彙之間之字距之遞減的次序所表示來顯示每個 自然句子之翻譯句子。 更具體而言,例如在字距0時依照字彙之數目之遞減之次 序重複每個自然句子之排列,並且在字距1時依照字彙之 數目之遞減的次序的時候,在字距0時具有相同字彙之數 目之自然句子之其後的排列,同時在字距1時增加相當於 所比較字彙之數目之字距,由廣泛的評估所表示之依照字 彙之間之字距之遞減的次序來排列每個句子,並且顯示相Page 78 200530856 V. Description of the invention (72) (X, A, B (remove the word vocabulary C), D) and evaluate the result of calculating the number of the second vocabulary from the kerning. Therefore, the original sentence of the above-mentioned natural sentence in the case of higher integrity is evaluated. When executing the above procedure for searching each natural sentence of the selected translation phrase in the corresponding translation database to determine the final evaluation, use the one that exists in the original sentence of each of the above natural sentences. The following vocabulary groups below the translation phrase are executed similarly (the number of first vocabularies is calculated by kerning and the number of second vocabularies is calculated by kerning, and the final evaluation of the subsequent vocabulary groups is determined) . Then, calculate each natural sentence to get the last evaluation of the previous vocabulary group and the last evaluation of the subsequent vocabulary group (from the same kerning, assuming the number of vocabularies calculated by kerning in each final evaluation), to Get an extensive assessment of one of every natural sentence. Based on the extensive evaluation of each natural sentence, the translated sentences of each natural sentence are displayed in the decreasing order of the kerning between the words. More specifically, for example, when the kerning is 0, the arrangement of each natural sentence is repeated in the decreasing order of the number of vocabularies, and when the kerning is 1 in the decreasing order of the number of vocabularies, the The subsequent arrangement of natural sentences with the same number of vocabularies, while increasing the kerning equivalent to the number of compared vocabularies at the time of kerning 1, according to the descending order of the kerning between the vocabularies indicated by extensive evaluation. Arrange each sentence and display relative

第79頁 200530856 五、發明說明(73) 當 於 每 個 白 缺 句 子 之 翻譯 句 子。 根 據 此 點 5 基 於 廣 泛 白ί I評 估 而處 理 下列 顯 示 之 九 個 白 缺 /、、、 句 子 ( a) 到( j) 作 為 ,上 述 所提 及 以排 列 白 夕k /、、、 句 子 1 下 列 的 次 序 中 之 結 果 • 1. 白 缺 句 子 ( a) = ( Α、 Β 、C、 D、 'Ε) 2. 白 缺 句 子 ( b) ( Α、 Β 、C、 D、 | /、 Β、 C、 D E) 3. Pk 句 子 ( c) ( Α、 Β 、C、 X、 D) 4. 缺 句 子 ( d) ( Β、 C 、D) 5. 白 Pk 句 子 ( e) = ( Α、 X 、Β、 C、 X、 D) 6. 妙; 句 子 ( f) = ( Β、 C 、X、 D) 7. 白 Pk 句 子 ( g) = ( Β、 C 、/、 C、 D) 8. 白 缺 Μ、、 句 子 ( h) ( Β、 X 、C、 /、 C、 X、 D) 9. 白 Pk 句 子 ( j) ( Β、 X 、X、 C、 /、 C、 X、 1、 、] D) 上 述 的 程 序 中 由 字 彙 計 算 之第 二 字彙 之 數 § 係 不 需 要 的 〇 當 不 需 要 從 原 始 句 子 中 評估 白 然句 子 中 部 分 排 列 之 字 彙 時 可 以 省 略 由 字 距 計 算 之第 二 字彙 之 數 目 並 且 使 用 由 字 距 計 算 之 第 一 字 彙 之 數 目以 作 為最 後 的 評 估 0 照例,基於上述第一估計數值、第二估計數值/第三估計 數值/第四估計數值,一致程度之評估以及自然句子之選 擇或排列,通常可以由一相關性分析或一計畫分析而使用 自然句子之評估、選擇或排列。當檢索用以參考之翻譯句 子作為具有較全部原始句子範圍狹窄之片語時,基於第一Page 79 200530856 V. Description of the invention (73) The translated sentence for each sentence that is missing. Based on this point 5 based on the evaluation of the extensive white I, the following nine missing / ,,,, sentences (a) to (j) shown are processed as mentioned above, in order to arrange Bai Xi k /, ,, sentence 1 the following Results in the sequence • 1. White missing sentence (a) = (Α, Β, C, D, 'Ε) 2. White missing sentence (b) (Α, Β, C, D, | /, Β, C, DE) 3. Pk sentence (c) (Α, Β, C, X, D) 4. Missing sentence (d) (Β, C, D) 5. White Pk sentence (e) = (Α, X, B, C, X, D) 6. Wonderful; sentence (f) = (Β, C, X, D) 7. White Pk sentence (g) = (Β, C, /, C, D) 8. Bai Que M, , Sentence (h) (Β, X, C, /, C, X, D) 9. White Pk sentence (j) (Β, X, X, C, /, C, X, 1,,] D) above The number of the second vocabulary counted by the vocabulary in the procedure § is not needed. When it is not necessary to evaluate the Bairan sentence from the original sentence When arranging the partial vocabulary in the child, you can omit the number of the second vocabulary calculated by the kerning and use the number of the first vocabulary calculated by the kerning as the final evaluation. 0 As usual, based on the first and second estimated values The third / fourth estimated value / the fourth estimated value, the degree of agreement, and the selection or arrangement of natural sentences can usually be evaluated, selected, or arranged by a correlation analysis or a plan analysis using natural sentences. When retrieving a translated sentence for reference as a phrase with a narrower range than the entire original sentence, based on the first

第80頁 200530856 五、發明說明(74) 估計數值以及第二估計數值,或第三估計數值係更適合本 例子,用於評估自然句子一致之程度之模式,例如與此模 式比較中存在於原始句子之片語等,在原始句子與自然句 子之間,只使用一致字彙之數目用以評估一致之程度。舉 例而言,可以提供一檢索模式,例如一片語檢索模式,另 外還有一般檢索模式,並且基於第一估計數值以及第二估 計數值或第三估計數值,執行一致之程度之評估以及當片 語檢索模式時選擇自然句子之選擇或重新排列。 雖然一例子,其中一致之程度首先計算以選擇一自然句子 (識別相對應的翻譯),當不能只基於一致之程度而限制 合適的自然句子時,由一相關性分析(c 〇 r r e 1 a t i ο η a n a 1 y s i s)選擇一自然句子(識別相對應的翻譯),並且 當即使基於上述相關性分析也不能限制合適的自然句子 時,由一計畫分析(scheme analysis)選擇一自然句 子,本應用並不限制於此。可以計算個別包含檢索裝置之 檢索所撿選之翻譯片語之自然句子之一致之程度,執行相 關性分析以及計晝分析可以評估個別自然句子,並且根據 優先的次序選擇一自然句子,例如下列表格六中顯示之基 於評估結果。 表格六、選擇一自然句子優先之次序的例子。Page 80 200530856 V. Description of the invention (74) The estimated value and the second estimated value, or the third estimated value are more suitable for this example, and are used to evaluate the pattern of the degree of consistency of natural sentences. For example, this pattern exists in the original comparison. Sentence phrases, etc., use only the number of consistent vocabulary between the original sentence and the natural sentence to assess the degree of consistency. For example, a search mode can be provided, such as a phrase search mode, and there is also a general search mode, and based on the first estimated value and the second estimated value or the third estimated value, the evaluation of the degree of consistency and the current phrase are performed. Choose natural sentence selection or rearrange when searching mode. Although an example in which the degree of agreement is first calculated to select a natural sentence (identify the corresponding translation), when a suitable natural sentence cannot be restricted based on the degree of agreement only, a correlation analysis (c ο rre 1 ati ο η ana 1 ysis) select a natural sentence (recognize the corresponding translation), and when a suitable natural sentence cannot be restricted even based on the above-mentioned correlation analysis, a natural sentence is selected by a scheme analysis, this application It is not limited to this. It can calculate the consistency of the natural sentences of the translation phrases selected by the retrieval device, and perform correlation analysis and day-to-day analysis to evaluate individual natural sentences and select a natural sentence according to the priority order, such as the following table The results shown in No. 6 Middle School are based on the evaluation results. Table 6. Example of choosing a natural sentence priority order.

第81頁 200530856 五、發明說明(75)Page 81 200530856 V. Description of the invention (75)

一致之程度 相關性分析 (存在高相關性字彙) 計畫分析 (存在選擇性字彙) 是 否 局 是 1 否 2 5 低 是 3 .否 4 6 再者,計算個別自然句子之一致之程度以執行每個相關性 分析以及計晝分析,並且可以選擇具有最太廣泛的評估分 數(score)之自然句子。 廣泛的評估分數=a X (—致之程度)+ b X (相關性評估 分數)+ c X(計晝評估分數)Correlation analysis of the degree of consistency (the existence of highly relevant vocabulary) Planning analysis (the existence of selective vocabulary) Whether the Bureau is 1 No 2 5 Low Yes 3. No 4 6 Furthermore, the degree of consistency of individual natural sentences is calculated to perform each Correlation analysis and day-to-day analysis, and can choose natural sentences with the most extensive evaluation scores. Extensive evaluation score = a X (—degree of consistency) + b X (correlation evaluation score) + c X (daily evaluation score)

其中基於相關性分析之結果相關性評估分數係提供給個別 自然句子之一評估分數,並且基於計畫分析之結果計畫評 估分數係提供給個別自然句子之一評估分數,並且a、b以 及c係預先定義的係數。The correlation evaluation score based on the result of the correlation analysis is provided to one of the individual natural sentences, and the evaluation score based on the result of the plan analysis is provided to one of the individual natural sentences, and a, b, and c It is a predefined coefficient.

第82頁 200530856 五、發明說明(76) 先前敘述之翻譯識別程係中程式,係以公式表示(圖2 A與 2 B),使得詢問自然句子是否與步驟1 0 2中檢索相對應的 翻譯資料庫所撿選之處理過的原始句子完全一致(步驟 1 04),並且如果答案係為肯定,讀取以及顯示與處理過 的原始句子完全一致之自然句子之翻譯句子。然而此應用 並不限定於此。可以忽略上述的步驟1 0 4以及步驟1 0 6,並 且不管關於任何自然句子是否與步驟1 0 2中檢索而撿選處 理過的原始句子完全一致,可以無條件執行步驟1 0 8中之 程序以及其後之步驟(程序,例如計算一致之程度)。在 此模式中,當步驟1 0 2中檢索而撿選複數個自然句子時, 如果與原始句子完全一致之自然句子包含於複數個檢選的 自然句子中,自然句子之一致之程度係最大。因此,當執 行步驟1 0 4以及步驟1 0 6時可以得到相同的程序結果。 根據自然句子之數目以及登記於相對應的翻譯資料庫之翻 譯句子,選擇本發明中一翻譯之正確性,並且當自然句子 以及翻譯句子之數目增加時,選擇一翻譯之正確性也隨之 提高。因此,可以讀取關於本發明之翻譯識別系統中之翻 譯的原始句子以及翻譯句子,並且當本身執行選擇相對應 的翻譯資料庫之前或之後時,自動同時登錄讀取自然句子 以及翻譯句子。根據本發明,只需自動登錄相對應的翻譯 資料庫中之自然句子以及翻譯句子,並且不需要執行編輯 或處理,例如自然字彙除以字彙單位。因此,相對應的翻 譯資料庫中很容易地自動並且同時登錄資料(自然句子以Page 82 200530856 V. Description of the invention (76) The program in the translation recognition process described earlier is expressed by a formula (Figures 2 A and 2 B), which asks whether the natural sentence corresponds to the translation retrieved in step 102. The processed original sentences selected by the database are exactly the same (step 104), and if the answer is affirmative, the translated sentences of natural sentences that are exactly the same as the processed original sentences are read and displayed. However, this application is not limited to this. The above steps 104 and 106 can be ignored, and regardless of whether any natural sentence is exactly the same as the original sentence retrieved and selected in step 102, the procedure in step 108 can be performed unconditionally and Subsequent steps (procedures, such as calculating the degree of consistency). In this mode, when a plurality of natural sentences are selected and retrieved in step 102, if the natural sentences that are completely consistent with the original sentence are included in the plurality of selected natural sentences, the degree of agreement of the natural sentences is the greatest. Therefore, the same program results can be obtained when performing steps 104 and 106. According to the number of natural sentences and the translated sentences registered in the corresponding translation database, the correctness of a translation in the present invention is selected, and as the number of natural sentences and translated sentences increases, the correctness of selecting a translation also increases. . Therefore, it is possible to read the translated original sentence and the translated sentence in the translation recognition system of the present invention, and automatically register and read the natural sentence and the translated sentence at the same time when the corresponding translation database is executed before or after it is executed by itself. According to the present invention, only natural sentences and translated sentences in a corresponding translation database need to be automatically registered, and no editing or processing is required, such as dividing a natural word by a word unit. Therefore, the corresponding translation database is easily and automatically registered at the same time (natural sentences begin with

第83頁 200530856 五、發明說明(77) 及翻譯句子)。 雖然前面敘述之強調翻譯片語之翻譯之模式的例子,當顯 示一翻譯句子時,此模式並不限制於此。理所當然,模式 不強調翻譯片語之翻譯也包含於本發明之適當的之分數。 本發明中選擇一翻譯之正確性,也根據登錄於相對應的翻 譯資料庫中之自然句子以及翻譯句子之容量之贅詞 (r e d u n d a n c y)。當強調登錄於相對應的翻譯資料庫中之 自然句子以及翻譯句子之容量之贅詞時,選擇一翻譯之正 確性與登錄於相對應的翻譯資料庫(相對應的翻譯資料庫 之容量)中之自然句子,以及翻譯句子之數目成比例降 低。因此,當登錄於相對應的翻譯資料庫中之每個自然句 子以及翻譯句子,搜尋具有内容之高相似度之自然句子以 及翻譯句子,並且發現内容之高相似度之自然句子以及翻 譯句子,可以提供從相對應的翻譯資料庫中移除一組自然 句子以及翻譯句子之功能。 雖然前面敘述選擇使用一單一相對應的翻譯資料庫之一翻 譯的例子,但是運用並不限定於此。當處理過的原始句子 限制於一特定領域時,例如翻譯一特定領域之主選單而使 用此程式,可以將相對應的翻譯資料庫除以自然句子之領 域。因此,可以避免個別相對應的翻譯資料庫之容量增 加,並且得到,例如節省檢索自然句子之時間以及增進選Page 83 200530856 V. Description of the invention (77) and translated sentences). Although the example of the translation mode that emphasizes the translation of phrases is described above, when a translation sentence is displayed, the mode is not limited to this. As a matter of course, the mode does not emphasize that the translation of the translated phrase is also included in the appropriate score of the present invention. The correctness of selecting a translation in the present invention is also based on the natural sentences registered in the corresponding translation database and the redundant words (r e d u n d a n c y) of the capacity of the translated sentences. When emphasizing the natural sentences registered in the corresponding translation database and the redundant words in the capacity of the translated sentence, choose a correct translation and register in the corresponding translation database (the capacity of the corresponding translation database) The number of natural sentences and the number of translated sentences decreases proportionally. Therefore, when each natural sentence and translated sentence registered in the corresponding translation database is searched for natural sentences and translated sentences with high similarity of content, and natural sentences and translated sentences with high similarity are found, you can Provides the ability to remove a set of natural sentences and translate sentences from the corresponding translation database. Although the foregoing example of choosing to translate using a single corresponding translation database has been described, the application is not limited to this. When the processed original sentence is limited to a specific field, such as translating the main menu of a specific field and using this program, the corresponding translation database can be divided by the field of natural sentences. Therefore, it is possible to avoid an increase in the capacity of individual corresponding translation databases, and to obtain, for example, saving time for searching natural sentences and improving selection

第84頁 200530856 五、發明說明(78) 之效用 擇一翻譯之正確性 雖然個人電腦1 0根據特定 人電腦1 0,用以執行命令 別程序之模式的例子,但 可以運用一電腦系統38力口 述之電腦糸統3 8中之終端 一網路3 0,例如網際網路 (Local Area Network· 儲存媒體36等,連接到伺 庫,使得當一使用者透過 問時,伺服器3 4識別並且 所顯示。 參考圖四中表示 澤上做詢問(芬 句子之原文,例 言之原文資料到 語言中之原文到 位置(uniform 可以安裝此系統 候,同時存取( 的翻譯資料庫以 # (句子),並 操作如上述使用者透過上述之個 而顯示翻譯片語之翻譯之翻 是本發明並不限定於此。:又°曰 不發明 以裝配(conf igure),並且上 機3 2與一伺服器3 4兩者都連接到 C Internet)或區域網路 LAN),並且包含一硬體裝置 服裔3 4以儲存相對應的翻譯資 終端機32在線上(〇n iine)詢' 提供線上合適的翻譯,如圖四中 之電腦系統38,為了使一使用者能夠在 考圖=中之(1)),而具體說明一原# 如’藉由使用者透過終端機3 2傳送原始 伺服為3 4之操作,或使用者連接包含原 伺服器34網頁(Web page)之—致性資 resource l〇cator: URL)。在此例子% ,使得當伺服器34執行翻譯識別程序的 a c c e s s 1 n g)儲存於儲存媒體3 6之相對應 識別特定的原文(圖四中之(2) ) ^ 且將是別的翻譯(句子)傳輪到終端機Page 84 200530856 V. The utility of invention description (78) The correctness of alternative translation Although the personal computer 10 is an example of a model for executing commands and other programs according to a specific personal computer 10, a computer system 38 can be used. The terminal-network 30 in the spoken computer system 38, such as the Internet (Local Area Network, storage medium 36, etc.), is connected to the server so that when a user asks through it, the server 34 recognizes and Refer to Figure 4. Refer to Figure 4 to show the query on the ground (the original text of the sentence, the original text of the example to the original text in the language to the location (uniform can be installed in this system, and at the same time access the translation database of " ), And operate as the above-mentioned user through the above to display the translation of the translation phrase is the present invention is not limited to this: again, said not to invent to assemble (conf igure), and the machine 3 2 and a Servers 3 and 4 are both connected to the C Internet) or a local area network (LAN), and include a hardware device server 3 4 to store the corresponding translation information. Terminal 32 is online (Online). Close Appropriate translation, as shown in computer system 38 in Figure 4. In order to enable a user to examine the map = (of (1)), and specifically specify an original # such as' through the user through the terminal 3 2 to transfer the original servo For the operation of 3 or 4, or the user connection includes the original server 34 web page (consistent resource resource URL: URL). In this example,% 1, so that when the server 34 executes the translation recognition program, access 1 ng) is stored in the storage medium 36 and the corresponding original text is identified ((2) in Figure 4) ^ and it will be another translation (sentence ) Pass to the terminal

第85頁 200530856 五、發明說明(79) 以回答此詢問(參考圖四中之(3))。因為這樣的安裝 允許複數個使用者可以分享伺服器3 4之翻譯識別的功能 (以及相對應的翻譯資料庫),特別當相對應的翻譯資料 庫需要容量大時可以有效減少成本。 雖然前面敘述本發明運用至一電子辭典系統之模式的例 子,但是應用並不限定於此。理所當然,本發明也應用到 一機器翻譯(自動翻譯)系統、一自動解釋系統(用聽覺 解釋了解一聽覺上(acoustically)輸入的原始句子之系 統,識別所了解的原始句子之翻譯句子,並且傳輸聽覺上 所識別的翻譯句子)以及一語言學習系統等,圖五,係顯 示本發明所提供之一機器翻譯系統4 0之安裝之示意圖。參 考圖五,一原始句子輸入部分4 2將一翻譯的原始句子輸入 (原文資料)用以選擇一翻譯片語4 4以及一翻譯句子集合 (assembly)部分4 6之兩者的部分。當選擇一翻譯片語4 4 之部分重複地選擇已輸入翻譯的原始句子中之特定片語, 以作為一翻譯片語的時候,同時在翻譯的原始句子中選擇 一個別的片語,並且詢問一翻譯識別部分4 8以檢索所選擇 的翻譯片語之一翻譯。翻譯識別部分4 8係一部分相當於關 於本發明之翻譯識別系統,並且執行翻譯識別程序,同時 存取儲存於儲存媒體5 0中之相對應的翻譯資料庫以識別所 詢問的翻譯片語之翻譯,並且將識別的翻譯傳送到翻譯句 子集合部分4 6。Page 85 200530856 V. Description of the Invention (79) To answer this query (refer to (3) in Figure 4). Because this installation allows multiple users to share the translation recognition function of the server 34 (and the corresponding translation database), especially when the corresponding translation database needs a large capacity, it can effectively reduce costs. Although the example of the mode in which the present invention is applied to an electronic dictionary system is described above, the application is not limited to this. As a matter of course, the present invention is also applied to a machine translation (automatic translation) system, an automatic interpretation system (a system that understands an acoustically input original sentence using auditory interpretation, recognizes the translated sentence of the original sentence that is understood, and transmits Auditory-recognized translated sentences) and a language learning system, etc. Figure 5 is a schematic diagram showing the installation of a machine translation system 40 provided by the present invention. Referring to Fig. 5, an original sentence input section 4 2 inputs a translated original sentence (original data) to select both a translation phrase 4 4 and a translation sentence assembly 4 6. When a translation phrase 4 4 is selected, a specific phrase in the original sentence that has been input is repeatedly selected as a translation phrase, and at the same time, another phrase is selected in the translated original sentence, and asked A translation recognition section 48 translates by retrieving one of the selected translation phrases. The translation recognition part 48 is a part equivalent to the translation recognition system of the present invention, and executes the translation recognition program, and simultaneously accesses the corresponding translation database stored in the storage medium 50 to identify the translation of the requested translation phrase. And pass the recognized translation to the translation sentence set part 4 6.

第86頁 200530856 五、發明說明(80) 基於用於選擇翻譯片語4 4之部分傳送翻譯片語,翻譯句子 集合部分4 6連接從翻譯識別部分4 8傳送之翻譯(如果需 要,改變排列的次序)以集合原始語言中翻譯句子,翻譯 識別部分4 8傳送之翻譯以及原始句子輸入部分4 2傳送之翻 譯片語之原始句子。將集合於翻譯句子集合部分4 6之翻譯 句子傳送到翻譯句子輸出部分5 2,用以執行翻譯句子輸出 程序,例如顯示於一顯示器上,紀錄到一儲存媒體以及一 聽覺的輸出等。在上述所提及之安裝中,因為本發明應用 到翻譯片語之翻譯之識別,從原始句子中得到具有得到自 然翻譯句子作為原始句子中之一句子之高可能性之翻譯, 並且從翻譯句子輸出部分5 2中傳送之翻譯句子成為一自然 譯句子以作為原始句子中之一句子。再者,因為不需要 複雜的程序,例如識別一部分之講告或分析依據子結構, 所以程序可以被簡化。照例,參考圖五,透過此程序可以 線上傳送所得到的翻譯句子,已經說明例如識別以及集合 翻譯以回答翻譯句子之線上詢問,如圖四所顯示。 雖然登記於相對應的翻譯資料庫(儲存裝置)中之一對相 關句子(不同語言敘述一對句子)之一句,以及其他部分 分別稱為一自然句子與一翻譯句子,第一語言到第二語言 中,可以使用翻譯(翻譯句子之識別)之相對應的翻譯資 料庫以作為上述之第二語言到第一語言中之翻譯(翻譯句 子之識別)之資料庫。Page 86 200530856 V. Description of the invention (80) Transmit the translation phrase based on the part used to select the translation phrase 4 4 and the translation sentence set part 4 6 connect the translation transmitted from the translation recognition part 4 8 (if necessary, change the arrangement The sequence) is to gather the translated sentences in the original language, the translations transmitted by the translation recognition section 48 and the original sentences of the translated phrases transmitted by the original sentence input section 42. The translated sentences collected in the translated sentence collection part 46 are transmitted to the translated sentence output part 52 for executing the translated sentence output program, such as being displayed on a display, recording to a storage medium, and an audible output. In the above-mentioned installation, since the present invention is applied to the recognition of translation phrases, a translation having a high probability of obtaining a naturally translated sentence as one of the sentences in the original sentence is obtained from the original sentence, and the sentence is translated from The translated sentence transmitted in the output section 52 becomes a natural translated sentence as one of the original sentences. Furthermore, the procedure can be simplified because it does not require complicated procedures, such as identifying a part of the report or analyzing the substructure. As usual, referring to Figure 5, through this procedure, the resulting translated sentences can be transmitted online. It has been explained, for example, recognition and collective translation to answer online queries of translated sentences, as shown in Figure 4. Although one of a pair of related sentences (a pair of sentences in different languages) is registered in the corresponding translation database (storage device), and the other parts are called a natural sentence and a translated sentence, the first language to the second In the language, the corresponding translation database of the translation (identification of translated sentences) can be used as the database of the translation from the second language to the first language (identification of translated sentences).

第87頁 200530856 五、發明說明(81) 雖然前面敘述英文、日文以及中文之例子,以作為一原始 語言或一對象語言之一例子,但是並不限定於這些語言。 理所當然,可以應用到,例如,法文、德文、義大利文、 西班牙文以及葡萄牙文等,以作為原始語言或對象語言。 本發明以較佳實施例說明如上,然其並非用以限定本發明 所主張之專利權利範圍。其專利保護範圍當視後附之申請 專利範圍及其等同領域而定。凡熟悉此領域之技藝者,在 不脫離本專利精神或範圍内,所作之更動或潤飾,均屬於 本發明所揭示精神下所完成之等效改變或設計,且應包含 在下述之申請專利範圍内。 200530856 圖式簡早說明 【圖式簡單說明】 圖一係顯示一關於最佳實施例之個人電腦配置之方格示意 圖。 圖二A與二B係顯示翻譯識別程序之内容的流程圖。 圖三係顯示自然句子之例子與相對應翻譯之表格,其包含 登錄於一翻譯資料庫中之一字彙「操作」(operation 圖四係一敘述應用本發明之一電腦系統之模式,且透過網 路連結一終端機與一伺服器之方格示意圖。 圖五係顯示應用本發明之一機器翻譯系統之配置的方格示 意圖。 【主要元件符號說明】 個人電腦1 0 中央處理機1 0 A 唯讀記憶體1 0 B 隨機存取記憶體1 0 C 輸入/輸出埠1 0 D 匯流排1 0 E 顯示器1 2P.87 200530856 V. Description of the Invention (81) Although the examples in English, Japanese, and Chinese are described as an example of an original language or an object language, they are not limited to these languages. As a matter of course, it can be applied to, for example, French, German, Italian, Spanish, and Portuguese as the original language or the target language. The present invention has been described above with reference to the preferred embodiments, but it is not intended to limit the scope of patent rights claimed by the present invention. The scope of patent protection shall depend on the scope of the attached patent application and its equivalent fields. Anyone skilled in this field can make changes or modifications without departing from the spirit or scope of this patent, which are all equivalent changes or designs made in the spirit disclosed by the present invention, and should be included in the scope of patent application described below. Inside. 200530856 Brief description of the drawings [Simplified illustration of the drawings] Fig. 1 is a schematic diagram showing the configuration of the personal computer in the preferred embodiment. Figures 2A and 2B are flowcharts showing the contents of the translation recognition program. Figure 3 is a table showing examples of natural sentences and corresponding translations, which includes a vocabulary "operation" (operation) registered in a translation database. Figure 4 is a description of a mode of applying a computer system of the present invention, Grid diagram of a terminal connected to a server. Figure 5 shows a grid diagram of the configuration of a machine translation system to which the present invention is applied. [Description of main component symbols] Personal computer 1 0 Central processing unit 1 0 A Read memory 1 0 B Random access memory 1 0 C Input / output port 1 0 D Bus 1 0 E Display 1 2

第89頁 200530856 圖式簡单說明 鍵盤14 滑鼠1 6 硬碟裝置1 8 CD唯讀記憶體裝置20 掃描器2 2 CD唯讀記憶體24 網路30 終端機3 2 伺服器34 儲存媒體3 6 電腦糸統3 8 機器翻譯系統4 0 原始句子輸入部分4 2 翻譯片語4 4 翻譯句子集合部分4 6 翻譯識別部分4 8 儲存媒體5 0 翻譯句子輸出部分5 2 步驟 1 0 0、1 0 2、1 0 4、1 0 6、1 0 8、1 1 0、1 1 2、1 1 4 步驟 1 1 6、1 1 8、1 1 9、1 2 0、1 2 2、1 2 4、1 2 6、1 2 8、 132、 134 130Page 89 200530856 Brief description of the keyboard 14 Mouse 1 6 Hard disk device 1 8 CD read-only memory device 20 Scanner 2 2 CD read-only memory 24 Network 30 Terminal 3 2 Server 34 Storage media 3 6 Computer system 3 8 Machine translation system 4 0 Original sentence input part 4 2 Translation phrase 4 4 Translation sentence set part 4 6 Translation recognition part 4 8 Storage media 5 0 Translation sentence output part 5 2 Step 1 0 0, 1 0 2, 1 0 4, 1 0 6, 1 0 8, 1 1 0, 1 1 2, 1 1 4 Steps 1 1 6, 1 1 8, 1 1 9, 1 2 0, 1 2 2, 1 2 4, 1 2 6, 1 2 8, 132, 134 130

Claims (1)

200530856 六、申請專利範圍 1. 一種用於識別一相對應翻譯之系統,該系統包含: 一儲存裝置,用以儲存包含於一原始語言中之複數個字彙 之複數個自然句子’該自然句子與一標的語言之翻譯句子 相關聯; 一檢索裝置,用以從儲存於該儲存裝置之該原始語言中之 複數個自然句子中檢索自然句子’在該自然句子包含該原 始語言之原始句子中之翻譯片語;以及 一第一翻譯識別裝置,用以評估該檢索裝置之檢索所揀選 之自然句子之一致之程度,並且識別至少該自然句子之翻 譯句子中翻譯該片語之翻譯,以作為該翻譯片語之翻譯, 該自然句子係基於一致的程度之評估而選擇。 2 .如申請專利範圍第1項之用於識別一相對應翻譯之系 統,其中 該檢索裝置係從該儲存於該儲存裝置之該原始語言中之複 數個自然句子中,檢索與該原始句子完全一致之自然句 子;以及 當完全與該原始句子一致之自然句子與由該檢索裝置之檢 索而揀選時,該第一翻譯識別裝置識別與該原始句子完全 一致之自然句子之該翻譯句子,以作為該原始句子之翻譯 句子。 3 .如申請專利範圍第1項之用於識別一相對應翻譯之系 統,其中該翻譯識別裝置計算該檢索裝置之檢索所揀選之200530856 6. Scope of patent application 1. A system for identifying a corresponding translation, the system includes: a storage device for storing a plurality of natural sentences including a plurality of words in the original language, the natural sentence and A translation sentence of a target language is associated; a retrieval device for retrieving a natural sentence from a plurality of natural sentences stored in the original language of the storage device, where the natural sentence contains a translation of the original sentence of the original language A phrase; and a first translation recognition device for evaluating the degree of consistency of the natural sentence selected by the retrieval device, and identifying a translation of the phrase in at least the translated sentence of the natural sentence as the translation For the translation of the phrase, the natural sentence is selected based on the assessment of the degree of consistency. 2. The system for identifying a corresponding translation as described in item 1 of the scope of the patent application, wherein the retrieval device retrieves from the plurality of natural sentences stored in the original language of the storage device exactly the same as the original sentence A consistent natural sentence; and when a natural sentence that completely matches the original sentence is selected by a search by the retrieval device, the first translation recognition device recognizes the translated sentence of the natural sentence that completely matches the original sentence as The translated sentence of the original sentence. 3. If the system for identifying a corresponding translation is described in item 1 of the scope of the patent application, wherein the translation recognition device calculates the search selected by the retrieval device 200530856 六、申請專利範圍 自然句子’與該原始句子一致之字彙之數值’並且評估與 該原始句子一致之程度,使得當該計算一致的字彙之數目 增加時,與該原始句子一致之該程度隨之提高。 4.如申請專利範圍第3項之用於識別一相對應翻譯之系 統,其中該第一翻譯識別裝置將該計算一致的字彙之數 值,除以該計算包含該翻譯片語之字彙之數值,以計算該 一致之程度。 5 .如申請專利範圍第3項之用於識別一相對應翻譯之系 統, 其中該翻譯識別裝置亦計算由該檢索裝置之檢索所揀選之 自然句子,與該原始句子不一致之字彙之數值,並且評估 與該原始句子一致之程度,使得當該計算一致的字彙之數 值減少時,與該原始句子不一致之該程度隨之提高。 6. 如申請專利範圍第3項之用於識別一相對應翻譯之系 統,其中當計算該一致字彙之數值時,該第一翻譯識別裝 置不包含從計算之字彙中預先定義(predefined)之經常 出現的字彙。 7. 如申請專利範圍第3項或第5項之用於識別一相對應翻譯 之系統,其中當計算該一致的字彙之數目或該不一致的字 彙之數目時,藉由將該字彙視為一致的字彙,第一翻譯識200530856 VI. The scope of the patent application Natural sentence 'the value of the vocabulary consistent with the original sentence' and evaluate the degree of consistency with the original sentence, so that when the number of vocabulary consistent with the calculation increases, the degree of consistency with the original sentence varies Its improvement. 4. If the system for identifying a corresponding translation is described in item 3 of the scope of patent application, wherein the first translation recognition device divides the value of the vocabulary consistent with the calculation by the value of the vocabulary including the translated phrase in the calculation, To calculate the degree of consistency. 5. If the system for identifying a corresponding translation is described in item 3 of the scope of the patent application, the translation recognition device also calculates the value of the vocabulary inconsistent with the original sentence selected by the retrieval of the retrieval device, Evaluate the degree of agreement with the original sentence, so that when the number of words that are consistent with the calculation decreases, the degree of inconsistency with the original sentence increases. 6. If the system for identifying a corresponding translation is described in item 3 of the scope of patent application, when calculating the value of the consistent vocabulary, the first translation recognition device does not include the pre-defined frequently from the calculated vocabulary. Words that appear. 7. If a system for identifying a corresponding translation is used in item 3 or item 5 of the scope of patent application, when calculating the number of the consistent vocabulary or the number of the inconsistent vocabulary, the vocabulary is regarded as consistent Vocabulary 第92頁 200530856 六、申請專利範圍 別裝置用以計算由於單數/複數或時態不同而具有不同字 尾之字彙。 8. 如申請專利範圍第3項之用於識別一相對應翻譯之系 統,其中當計算該一致的字彙之數目時,該第一翻譯識別 裝置不需重覆計算出現二次或多次之一致的字彙。 9. 如申請專利範圍第3項或第5項之用於識別一相對應翻 譯之系統,其中該第一翻譯識別裝置評估該檢索裝置之檢 索所撿選之自然句子與該原始句子之間之字彙排列順序之 相似度,並且評估與該原始句子一致之該程度,使得當該 字彙排列順序之相似度提高時,與該原始句子一致之該程 度隨之提高。 1 0 .如申請專利範圍第3項或第5項之用於識別一相對應翻 譯之系統,其中該第一翻譯識別裝置在與該檢索裝置之檢 索所撿選之自然句子一致之字彙中,計算該不一致的字彙 之數目,並且評估與該原始句子一致之該程度,使得當存 在於該計算一致的字彙中不一致的字彙之數目減少時,與 該原始句子一致之該程度隨之提高。 1 1 .如申請專利範圍第1項之用於識別一相對應翻譯之系 統,更包含一搜尋裝置,用以搜尋用於該翻譯片語之該檢 索裝置之檢索所撿選之自然句子,與經常出現於該原始語Page 92 200530856 VI. Scope of patent application Other devices are used to calculate vocabulary with different suffixes due to different singular / plural or tense. 8. For the system for identifying a corresponding translation in item 3 of the scope of patent application, when calculating the number of the consistent vocabulary, the first translation recognition device does not need to repeatedly calculate the consistency that appears two or more times. Vocabulary. 9. If the system for identifying a corresponding translation is described in item 3 or 5 of the scope of patent application, wherein the first translation recognition device evaluates the relationship between the natural sentence selected by the retrieval device and the original sentence The similarity of the vocabulary arrangement order, and the degree of agreement with the original sentence is evaluated, so that when the similarity of the vocabulary arrangement order increases, the degree of agreement with the original sentence increases accordingly. 10. If the system for identifying a corresponding translation is described in item 3 or 5 of the scope of the patent application, wherein the first translation recognition device is in a vocabulary consistent with the natural sentence selected by the retrieval device, Calculate the number of inconsistent vocabularies, and evaluate the degree of agreement with the original sentence, so that when the number of inconsistent vocabularies in the calculated coincidence decreases, the degree of agreement with the original sentence increases. 1 1. If the system for identifying a corresponding translation in item 1 of the scope of patent application, further includes a search device for searching for a natural sentence selected by the retrieval of the retrieval device for the translation phrase, and Often appears in the primitive language 第93頁 200530856 六、申請專利範圍 言之相同句子中之經常出現的字彙; 其中該第一翻譯識別裝置參照該搜尋裝置發現與存在於該 原始句子中之特定經常出現的片語,以及參照包含每個該 翻譯片語的自然句子之翻譯句子,該翻譯片語在該用以搜 尋包含每個該翻譯片語之檢索裝置之檢索所檢選之自然句 子,以及用於經常出現的翻譯片語之翻譯之特定經常出現 片語之間用以識別發現經常出現的翻譯,以作為該原始句 子中該翻譯片語之翻譯。 1 2 .如申請專利範圍第1項之用於識別一相對應翻譯之系 統,更包含一識別裝置,用以識別在該原始句子中之選擇 性片語,並且取代不包含該檢索裝置之檢索所檢選之自然 句子中之標的片語, 其中該第一翻譯識別裝置識別至少自然句子之翻譯句子中 之該翻譯片語之翻譯,該自然句子之翻譯句子係包含每個 該識別裝置所識別之選擇性片語,以及在該檢索裝置之檢 索所撿選之自然句子中之該翻譯片語,以作為該原始句子 中至少該翻譯片語之翻譯。 1 3 .如申請專利範圍第1 2項之用於識別一相對應翻譯之系 統,其中該識別裝置搜尋儲存於包含該標的字彙之自然句 子之該儲存裝置之複數個自然句子,與搜尋儲存於該具有 相同句子結構的自然句子之儲存裝置中之自然句子,以作 為該搜尋所撿選之自然句子,並且識別該檢索所撿選之自Page 93 200530856 VI. Words that appear frequently in the same sentence as stated in the scope of the patent application; where the first translation recognition device refers to specific frequently-occurring phrases found by the search device and existing in the original sentence, and the reference contains A translation sentence of a natural sentence of each of the translated phrases, the translated sentence in the search for a natural sentence selected by a search device that includes each of the translated phrases, and a frequently used translation phrase The translation of certain frequently occurring phrases is used to identify and find frequently occurring translations as a translation of the translated phrase in the original sentence. 1 2. If the system for identifying a corresponding translation according to item 1 of the scope of the patent application, further includes an identification device for identifying the optional phrase in the original sentence, and replaces the search that does not include the retrieval device The selected phrase in the selected natural sentence, wherein the first translation recognition device recognizes the translation of the translation phrase in at least the translated sentence of the natural sentence, and the translation sentence of the natural sentence includes each recognized by the recognition device The optional phrase and the translated phrase in the natural sentence selected by the retrieval of the retrieval device are used as a translation of at least the translated phrase in the original sentence. 13. If the system for identifying a corresponding translation is described in item 12 of the scope of patent application, the identification device searches for a plurality of natural sentences of the storage device stored in a natural sentence containing the subject vocabulary, and searches for stored in The natural sentence in the storage device of the natural sentence having the same sentence structure is used as the natural sentence selected by the search, and the self-selected sentence selected by the search is identified 第94頁 200530856 六、申請專利範圍 然句子中之取代標的片語之片語,以作為該選擇性片語。 1 4. 一種用於識別一相對應翻譯之系統,該系統包含: 一儲存裝置,用以儲存包含於一原始語言中之複數個字彙 之複數個自然句子,該自然句子與一標的語言之翻譯句子 相關聯, 一檢索裝置,用以從儲存於該儲存裝置之該原始語言中之 複數個自然句子中檢索自然句子,該自然句子包含一原始 語言之原始句子之翻譯片語; 一搜尋裝置,用以搜尋用於該翻譯片語之該檢索裝置之檢 索所撿選之自然句子,與出現於該原始語言之相同句子中 之經常出現的翻譯;以及 一第二翻譯識別裝置,用以搜尋包含每個該翻譯片語之自 然句子之翻譯句子,與搜尋用於經常出現的翻譯片語之特 定經常出現的片語,該翻譯片語係參照該搜尋裝置所發現 與存在於該原始句子中之該特定經常出現片語,以及參照 包含於該檢索裝置之檢索所撿選之自然句子之間之每個該 翻譯片語之自然句子之翻譯句子,並且用於識別發現經常 出現的翻譯,以作為該原始句子中翻譯片語之翻譯。 1 5.如申請專利範圍第1項或第1 4項之用於識別一相對應 翻譯之系統,其中當該翻譯片語包含複數個字彙時,該檢 索裝置從儲存於該儲存裝置中之該原始語言之複數個自然 句子之中,檢索包含至少一個包括翻譯該片語之複數個字Page 94 200530856 6. Scope of Patent Application The phrase in place of the subject phrase in the sentence is used as the optional phrase. 1 4. A system for identifying a corresponding translation, the system includes: a storage device for storing a plurality of natural sentences contained in a plurality of words in an original language, and the translation of the natural sentence and a target language Sentence association, a retrieval device for retrieving a natural sentence from a plurality of natural sentences stored in the original language of the storage device, the natural sentence including a translation phrase of the original sentence of the original language; a search device, A natural sentence selected by the retrieval of the retrieval device for the translation phrase and a frequently occurring translation appearing in the same sentence of the original language; and a second translation recognition device for searching for a message containing The translation sentence of the natural sentence of each of the translation phrases, and the search for a specific frequently-occurring phrase for a frequently-occurring translation phrase, the translation phrase refers to the ones found in the search device and existing in the original sentence. The particular frequently occurring phrase, and referring to each of the natural sentences picked up by the retrieval device contained in the retrieval Natural sentence translated sentence of the translated phrases and for identifying the found frequently appearing translation, as the original sentence Translation of phrases. 1 5. If the system for identifying a corresponding translation is provided in item 1 or item 14 of the scope of the patent application, wherein when the translation phrase includes a plurality of words, the retrieval device retrieves from the stored in the storage device the Among the plural natural sentences in the original language, the search contains at least one plural word including the translation of the phrase 第95頁 200530856 六、申請專利範圍 彙之自然句子。 1 6. —種用於識別一相對應翻譯之方法,該方法包含: 一第一步驟,於該第一步驟中,檢索包含儲存於一儲存裝 置中之複數個字彙之原始語言之每個自然句子,該自然句 子與一對象語言之翻譯句子相關聯;以及 一第二步驟,於該第二步驟中,評估在該第一步驟檢索所 撿選之自然句子與該原始句子之間之一致之程度,並且至 少識別基於一致的程度之評估而選擇之自然句子之翻譯句 子中之翻譯該片語之翻譯,以作為該原始句子中至少翻譯 該片語之翻譯。 1 7. —種用於識別一相對應翻譯之方法,該方法包含: 一第一步驟,於該第一步驟中,檢索包含儲存於一儲存裝 置之複數個字彙之原始語言之每個自然句子,該自然句子 與一對象語言之翻譯句子相關聯; 一第二步驟,於該第二步驟中,該第一步驟檢索所撿選之 自然句子係用以搜尋該翻譯片語與經常出現於一原始語言 之相同句子中之經常出現的字彙;以及 一第三步驟,於該第三步驟中,包含每個該翻譯片語與特 定經常出現的片語之自然句子之翻譯句子,係用以搜尋識 別經常出現的翻譯片語之翻譯與發現經常出現的翻譯,以 作為該原始句子中之該翻譯片語之翻譯,該翻譯片語之翻 譯係參照該原始句子與該第二步驟發現之特定經常出現的Page 95 200530856 6. Scope of Patent Application Natural sentence of Huihui. 16. A method for identifying a corresponding translation, the method includes: a first step in which each of the natural language of the original language including a plurality of words stored in a storage device is retrieved A sentence, the natural sentence being associated with a translation sentence of an object language; and a second step in which the consistency between the selected natural sentence retrieved in the first step and the original sentence is evaluated Degree, and at least the translation of the phrase in the translation sentence of the natural sentence selected based on the evaluation of the degree of consistency is identified as the translation of at least the phrase in the original sentence. 17. A method for identifying a corresponding translation, the method includes: a first step in which each natural sentence containing an original language of a plurality of words stored in a storage device is retrieved , The natural sentence is associated with a translation sentence of an object language; a second step, in the second step, the first step retrieves the selected natural sentence system for searching the translation phrase and frequently appears in a Frequently occurring vocabulary in the same sentence of the original language; and a third step, in which the translated sentence containing the natural sentence of each of the translated phrase and a specific frequently occurring phrase is used to search Recognize the translation of frequently occurring translation phrases and find the frequently occurring translations as the translation of the translation phrase in the original sentence. The translation of the translation phrase refers to the original sentence and the specific frequently found in the second step appeared 第96頁 200530856 六、申請專利範圍 片語,並且於該第一步驟檢索所撿選之自然句子中,參照 包含每個該翻譯片語的自然句子之翻譯。 1 8. —種使電腦連接到一儲存裝置之程式,該儲存裝置從 儲存於該儲存裝置中之該原始語言之複數個自然句子中, 儲存包含與一對象語言中之翻譯相關之複數個字彙之一原 始語言中之複數個自然句子,該程式具有檢索裝置的功能 以檢索包含於該原始語言之原始句子中之翻譯片語之自然 句子;以及 一第一翻譯識別裝置,用以評估該檢索裝置之檢索所揀選 之自然句子與該原始句子之間的一致之程度,並且用以識 別至少自然句子之翻譯句子中之該翻譯片語之識別翻譯, 以作為該原始句子中該翻譯片語之翻譯,而該自然句子係 基於該評估一致之程度而選擇。 1 9. 一種使電腦連接到一儲存裝置之程式,從該儲存裝置 從儲存於該儲存裝置之該原始語言之複數個自然句子中, 儲存包含與一對象語言中之翻譯一致之複數個字彙之一原 始語言中之複數個自然句子,以具有檢索裝置的功能,該 檢索裝置的功能係用以檢索包含在該原始語言之原始句子 中翻譯片語之自然句子; 一搜尋裝置,用以搜尋該翻譯片語之檢索裝置之檢索所撿 選之自然句子,與出現於該原始語言之相同句子中之經常 出現的片語;以及 200530856 六、申請專利範圍 一第二翻譯識別裝置,用以搜尋包含每個該翻譯片語之自 然句子之翻譯句子,以及搜尋用於經常出現的翻譯片語之 特定經常出現的片語,並且用以識別發現經常出現的翻 譯,以作為該原始句子中該翻譯片語之翻譯,該特定經常 出現的片語係參照利用該搜尋裝置發現與存在於該原始句 子中之特定經常出現的片語,以及參照每個包含翻譯該片 語的自然句子之翻譯。Page 96 200530856 VI. Patent application phrase, and in the first step of searching the selected natural sentence, refer to the translation of the natural sentence containing each of the translated phrases. 1 8. —A program for connecting a computer to a storage device that stores, from a plurality of natural sentences of the original language stored in the storage device, a plurality of vocabularies related to translations in a target language A plurality of natural sentences in one of the original languages, the program having a function of a retrieval device to retrieve natural sentences of translated phrases contained in the original sentences of the original language; and a first translation recognition device for evaluating the retrieval The device searches for the degree of agreement between the selected natural sentence and the original sentence, and is used to identify the identification translation of the translation phrase in the translation sentence of at least the natural sentence as the translation phrase in the original sentence. Translation, and the natural sentence is selected based on the degree of consistency of the assessment. 1 9. A program for connecting a computer to a storage device, from the storage device, from a plurality of natural sentences of the original language stored in the storage device, storing a plurality of vocabulary that are consistent with a translation in an object language A plurality of natural sentences in an original language to have a function of a retrieval device, the function of the retrieval device is to retrieve the natural sentences of the translated phrases contained in the original sentences of the original language; a search device for searching the Retrieval of translation phrases by the retrieval device. Natural sentences picked up frequently appear in the same sentences as the original language; and 200530856 6. Patent application scope-a second translation recognition device for searching The translated sentence of the natural sentence of each of the translated phrases, as well as the search for a specific frequently-occurring phrase for a frequently-occurring translation phrase, and used to identify the frequently-occurring translation as the translation piece in the original sentence Language translation, the particular frequently-occurring phrase refers to the use of the search device to discover and exist in the original The phrase sentence in particular often appear, as well as reference translation of each sentence contains natural language translation of the film. 第98頁Chapter 98
TW93138040A 2003-12-15 2004-12-08 System and method for identifying corresponding translation TWI281111B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2003416778 2003-12-15
JP2004301302A JP4401269B2 (en) 2003-12-15 2004-10-15 Parallel translation judgment device and program

Publications (2)

Publication Number Publication Date
TW200530856A true TW200530856A (en) 2005-09-16
TWI281111B TWI281111B (en) 2007-05-11

Family

ID=34829141

Family Applications (1)

Application Number Title Priority Date Filing Date
TW93138040A TWI281111B (en) 2003-12-15 2004-12-08 System and method for identifying corresponding translation

Country Status (3)

Country Link
JP (1) JP4401269B2 (en)
MY (1) MY137235A (en)
TW (1) TWI281111B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI417747B (en) * 2006-04-19 2013-12-01 Raytheon Co Enhancing multilingual data querying

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4688629B2 (en) * 2005-10-25 2011-05-25 株式会社ジャストシステム Electronic message classification device and electronic message classification method
JP5032453B2 (en) * 2008-12-15 2012-09-26 株式会社東芝 Machine translation apparatus and machine translation program
TWI456415B (en) * 2010-12-16 2014-10-11 Inventec Corp Inquiring and transforming system and method for translation
TWI492072B (en) * 2013-12-19 2015-07-11 英業達股份有限公司 Input system and input method
JP6252204B2 (en) * 2014-01-28 2017-12-27 富士通株式会社 Information processing apparatus, intention analysis method, and program
JP7439429B2 (en) 2019-09-24 2024-02-28 カシオ計算機株式会社 Search device, search method, search program

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI417747B (en) * 2006-04-19 2013-12-01 Raytheon Co Enhancing multilingual data querying

Also Published As

Publication number Publication date
JP2005202924A (en) 2005-07-28
MY137235A (en) 2009-01-30
TWI281111B (en) 2007-05-11
JP4401269B2 (en) 2010-01-20

Similar Documents

Publication Publication Date Title
US9959776B1 (en) System and method for automated scoring of texual responses to picture-based items
Lu Automatic analysis of syntactic complexity in second language writing
Kauchak et al. Paraphrasing for automatic evaluation
US20070112553A1 (en) System, method, and program for identifying the corresponding translation
Calzada Pérez Corpus-based methods for comparative translation and interpreting studies: Mapping differences and similarities with traditional and innovative tools
Kit et al. Evaluation in machine translation and computer-aided translation
Hadla et al. Comparative study between meteor and bleu methods of mt: Arabic into english translation as a case study
Östling et al. Compounding in a Swedish blog corpus
Kashiha Recurrent formulas and moves in writing research article conclusions among native and nonnative writers
TW200530856A (en) System, method and program for identifying corresponding translation
Daðason Post-correction of Icelandic OCR text
De Schryver et al. Dictionary Writing System (DWS)+ Corpus Query Package (CQP): The Case of" TshwaneLex"
Curtotti et al. A right to access implies a right to know: An open online platform for research on the readability of law
Cui et al. CTAP for Chinese: a linguistic complexity feature automatic calculation platform
Elliott et al. A fluency error categorization scheme to guide automated machine translation evaluation
Hu et al. Word‐Combination‐Based Measures of Phraseological Diversity, Sophistication, and Complexity and Their Relationship to Second Language Chinese Proficiency and Writing Quality
Andersen Semi-automatic ESOL error annotation
Gaizauskas et al. Extracting bilingual terms from the Web
Lenci et al. How to harvest Word Combinations from corpora: Methods, evaluation and perspectives
Tyers et al. Unsupervised training of maximum-entropy models for lexical selection in rule-based machine translation
Wolk et al. Unsupervised tool for quantification of progress in L2 English phraseological
De Melo et al. On the utility of automatically generated wordnets
Guessoum et al. A methodology for evaluating Arabic machine translation systems
Ulitkin et al. Automatic evaluation of the quality of machine translation of a scientific text: the results of a five-year-long experiment
Gao et al. Computer-aided translation

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees