EP1111514B1 - Procédé et système de traduction de phrases - Google Patents
Procédé et système de traduction de phrases Download PDFInfo
- Publication number
- EP1111514B1 EP1111514B1 EP00127817A EP00127817A EP1111514B1 EP 1111514 B1 EP1111514 B1 EP 1111514B1 EP 00127817 A EP00127817 A EP 00127817A EP 00127817 A EP00127817 A EP 00127817A EP 1111514 B1 EP1111514 B1 EP 1111514B1
- Authority
- EP
- European Patent Office
- Prior art keywords
- phrase
- text
- database
- translation
- language
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/42—Data-driven translation
- G06F40/45—Example-based machine translation; Alignment
Definitions
- the invention generally relates to translating expressions from one natural language into another natural language, and in particular assisting a translator to get the right translation for any phrase.
- Any translator is evaluated according to two criteria: translation speed and translation quality.
- One difficulty affecting both of these criteria is the appearance of a word or group of words which makes the translator hesitate. Finding the suitable translation may lead to a time-consuming manual search, with no guarantee of the result.
- Contextual dictionaries allow for getting the translation of a word according to its context. This technique is strongly limited in the extent to which translations are possible, i.e. by looking up a contextual dictionary, the translator is provided with a low number of proposed translations only.
- multi-lingual terminology databases exist which are based on translations of pre-accepted terms. This technique is strongly restricted to the prestored set of terms, and the translator is not assisted in translating expressions which are not part of the set of pre-accepted terms.
- a further technique is based on the use of translation memory which stores already translated sentences.
- the system queries the database and automatically proposes a translation.
- this system requires matching complete sentences, even if the matching can be fuzzy, so that this technique is again strongly restricted in its applicability.
- the present invention has been made in consideration of the above situation and has as its primary object to assist a translator to achieve an improved quality of the resulting document.
- a further object of the present invention is to be compatible with existing technology and software tools.
- the database is phrase-indexed.
- Extracted word phrases directly index whole text fragments.
- the noun phrases are used to index a sentence database.
- the extracted noun phrases directly index whole sentences thereby leaving the recognition of the corresponding sub-units in the translated sentences to the translator. Therefore, no overall parse of the translated sentences is performed and no alignment of subtrees is necessary.
- the invention is further advantageous in that it makes use of already translated material and presents to the translator, in the preferred embodiment, sentences containing the respective noun phrase both in the source and target language.
- sentences containing the respective noun phrase both in the source and target language By using a phrase-indexed sentence database, both translation speed and translation quality are improved.
- word phrases are used for indexing text fragments representing a primary grammatical unit including at least one clause.
- Phrases are expressions consisting usually of but a few words, denoting a single idea or forming a separate part of a sentence. Specifically, a phrase is a group of two or more associated words, not containing a subject and predicate.
- Noun phrases are phrases involving either pronouns or nouns. Nouns are words used as the name of a thing, quality or action existing or conceived by the mind. Pronouns are words used as a substitute for a noun. Thus, noun phrases are e.g. "road test”, “fuel pressure test operations", or "verb phrase”. By contrast, verb phrases are phrases involving one or more verbs such as "broadened” or "having been fitted”.
- a clause is a group of words containing a subject and predicate, that is, clauses are syntactic constructions forming part of a sentence or constituting a whole simple sentence.
- Sentences are grammatical units of one or more words, bearing minimal syntactical relation to the words that precede or follow it, i.e. comprising a minimum sense of completeness and unity. Sentences express a complete thought, whether a statement of fact, a question, a command, or an exclamation.
- clauses and sentences can be defined as primary grammatical units. Whereas sentences may comprise clauses and phrases, and clauses may comprise phrases, phrases cannot comprise clauses or sentences.
- the preferred embodiment of the invention makes use of noun phrases to index sentences.
- the translation system comprises a control unit 14, which may be a computer of any kind such as a personal computer or a workstation, e.g. running a conventional operating system such as Windows NT or UNIX.
- the control unit 14 runs a software application which may be controlled by pointer device 12 or keyboard 13, using a display 11.
- the interface software may for instance be written in Visual Basic, and the resulting application can be an OLE server which can be integrated directly to any Visual Basic or C/C++ code. It will however be appreciated by those of ordinary skill in the art that many other kinds of implementations are also possible.
- sentence database 17 which might be an Access or Oracle database and which has preferably been generated using a UNIX workstation.
- the size of the database depends on the field in which the system is used. Again, it will be appreciated that many other implementations are possible.
- Sentence database 17 stores a plurality of sentences in English as the source language, and also stores to each English sentence the corresponding sentence in French as the target language.
- the sentence database 17 further includes a phrase index holding information to each noun phrase stored in any sentence of the database, indicating the sentences in which the respective noun phrase is to be found.
- the translation software running on control unit 14 further has access to phrase extractor 15, which extracts noun phrases from an input text using complex linguistic algorithms.
- the input text may come from any text source such as storage media, scanners, messages, speech recognition etc.
- FIG. 2 illustrates a flow chart of the process of generating the phrase-indexed sentence database 17, the system first obtains, in steps 21 and 22, a document written in the source language and a corresponding document written in the target language. Then, in step 23, in the pair of source and target documents the sentences are aligned. Aligning sentences means establishing a link between each source sentence and the corresponding target sentence. After the sentence have been aligned, noun phrases are extracted from each of the source sentences in step 24, and the extracted noun phrases are added to the phrase index of sentence database 17 together with the respective information concerning the sentence from which the noun phrase has been extracted.
- step 26 After the extracted noun phrases have been added to the phrase index in step 25, a decision is made in step 26 whether another pair of source/target documents is to be read in. If so, the process of generating the database returns to step 21 for obtaining the documents, aligning the sentences and extracting and storing the noun phrases. By repeating steps 21 to 26 for a set of translated document pairs, a comprehensive database will be built storing a high number of encountered noun phrases with the sentences holding them.
- step 31 the user inputs the text to be translated in step 31.
- the text is then displayed so that the user can select an unknown word or group of words, e.g. by double-clicking on the word.
- phrase extractor 15 extracts in step 33 all the possible noun phrases relating to the selected word.
- the set of possible noun phrases is then displayed in step 34, and one of the displayed noun phrases is selected, in step 35, either automatically or by user request.
- the longest of the extracted noun phrases is selected, with no need of user selection.
- This automated selection may also be performed depending on whether the extracted noun phrases exist in the database. In this case, the longest (sub) noun phrase existing in the database is selected. In any case, selection may be performed or changed by the user.
- FIG. 5 shows an example of how to present the mentioned information to the user.
- the control unit 14 displays a window 51 in which a number of fields is shown.
- the word which has been selected by double-clicking in step 32 is displayed in field 55.
- the set of possible noun phrases which has been extracted in step 33 is displayed. From this set of noun phrases the user has selected, in the example of FIG. 5, the noun phrase "road test", which is then shown in field 52.
- control unit 14 queries the phrase-indexed sentence database 17 in step 41. Since all the sentences in the phrase-indexed sentence database 17 are directly indexed by noun phrases, the system is able to retrieve all pairs of sentences indexed by the selected noun phrase without any substantive delay. The system then sorts in step 42 the retrieved pairs of sentences according to their relatedness to the original sentence. For this purpose all the noun phrases in the original sentence of the input text and those in the retrieved sentence are compared and the relatedness then depends on the number of common noun phrases. Thus, the system is able to present to the user in step 43 all the pairs of sentences with those sentences first which are closest to the one to be translated. An example of a displayed sorted set of paired sentences is depicted in FIG. 5 as field 54.
- step 44 of FIG. 4 the user selects one of the displayed sentences of field 54 which is then copied to fields 56 and 57 to allow the user to more intensively study the proposed translation.
- phrase extractor 15 extracts in step 45 from the selected sentence pair the noun phrase translation and inserts the translated noun phrase automatically into the translation of the input text.
- step 46 the translator decides whether a further noun phrase needs to be translated. If so, the process returns to step 32 and the user selects another unknown word.
- FIG. 6 illustrates the operation on field 52 according to a preferred embodiment, the capability of the system according to the invention to operate on noun sub-phrases is described.
- the system either automatically looks for partial matching noun phrases, or presents to the user in fields 61 and 62 a list of sub-phrases from which the user may select an entry for proceeding with the translation process. Assuming no sentence contains the noun phrase "fuel pressure test operations", the user is allowed to choose the sub-phrase "fuel pressure test" in field 62 for which the phrase index of the database 17 might have an entry.
- the translation system has been described as including a noun-phrase-indexed sentence database 17.
- the system further includes a terminology base 16 to which the translation application running on control unit 14 has access.
- the system queries the terminology base in step 36.
- the terminology base does include a translation of the selected noun phrase
- the system presents to the user in field 78 of FIG. 7 the retrieved translation.
- the user is then allowed to either approve the translation retrieved from the terminology base 16 and display it in field 78 or to approve the proposed translation retrieved from the phrase-indexed sentence database 17 and select it from the list displayed in field 74.
- the translation retrieved from the terminology base 16 is highlighted in the list of field 74 at each location where it appears.
- the present invention has many advantages in that it uses a noun-phrase-indexed sentence database in which noun phrases directly index whole sentences, leaving the recognition of the corresponding sub-units in the translated sentences to the user. Thus, no overall parse for translated sentences is performed. Nevertheless, the system according to the invention may be integrated among other conventional tools for translating expressions, such as example-based machine translation, contextual dictionary lookup, multi-lingual terminology databases, or translation memory. By providing a translation environment using phrase-indexed sentence databases and integrating existing technologies, any translator is provided with a powerful new translation aid.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Machine Translation (AREA)
Claims (9)
- Procédé de traduction d'un syntagme de mots depuis une première langue naturelle vers une seconde langue naturelle, ledit syntagme de mots étant un groupe de deux mots associés ou plus, le procédé comprenant les étapes consistant à :appliquer en entrée (31) un texte écrit dans une première langue,extraire (33) ledit syntagme de mots depuis ledit texte,interroger (41) une base de données (17) au sujet dudit syntagme de mots extrait, la base de données (17) comportant des paires de fragments de texte, chaque paire comportant un fragment de texte de la première langue et un fragment de texte correspondant de la seconde langue, lesdits fragments de texte représentant une unité grammaticale principale contenant au moins une proposition, ladite proposition étant un groupe de mots contenant un sujet et un prédicat, etobtenir (45) une traduction dudit syntagme de mots extrait sur la base d'une des paires de fragments de texte visualisé pendant l'étape d'interrogation de la base de données (17),caractérisé en ce que
ladite étape d'interrogation (41) d'une base de données (17) utilise un index de syntagmes de ladite base de données, ledit index de syntagmes répertoriant lesdits fragments de texte par syntagmes de mots. - Procédé selon la revendication 1, dans lequel ledit syntagme de mots est un syntagme nominal.
- Procédé selon la revendication 1, dans lequel ledit syntagme de mots est un syntagme verbal.
- Procédé selon la revendication 1, dans lequel ledit syntagme de mots est un prédicat impliquant au moins un verbe et un nom ou adjectif utilisé en tant que nom.
- Procédé selon la revendication 1, dans lequel l'étape consistant à obtenir une traduction comprend l'étape de présentation (43) à un utilisateur des paires visualisées de fragments de texte.
- Procédé selon la revendication 1, dans lequel l'étape consistant à obtenir une traduction comprend l'étape de présentation à un utilisateur des traductions proposées du syntagme de mots extrait sur la base des paires de fragments de texte visualisées.
- Procédé selon la revendication 5 ou 6, dans lequel l'étape consistant à obtenir une traduction comprend en outre l'étape d'approbation d'une des traductions visualisées par l'utilisateur et l'utilisation de la traduction approuvée en tant que traduction du syntagme de mots extrait.
- Système de traduction d'un texte appliqué en entrée depuis une langue source naturelle vers une langue cible naturelle, le système comprenant :un moyen de mémorisation destiné à mémoriser une base de données (16, 17) contenant une pluralité de paires de fragments de texte, lesdits fragments de texte représentant une unité grammaticale principale comportant au moins une proposition, ladite proposition étant un groupe de mots contenant un sujet et un prédicat, chaque paire incluant un fragment de texte dans la langue source et un fragment de texte correspondant dans la langue cible, chaque fragment de texte contenant au moins un syntagme de mots, ledit syntagme de mots étant un groupe de deux mots associés ou plus,un extracteur de syntagme (15) destiné à extraire un syntagme de mots depuis un fragment de texte dudit texte appliqué en entrée,un moyen de récupération de base de données (14) destiné à récupérer, à partir de ladite base de données (16, 17), des paires de fragments de texte qui contiennent le syntagme de mots extrait, etun moyen d'interface d'utilisateur (11, 12, 13) destiné à permettre à un utilisateur de sélectionner l'une desdites paires de fragments de texte récupérées afin d'obtenir une traduction dudit syntagme de mots extrait,caractérisé en ce que
ledit moyen de récupération de base de données (14) récupère des paires de fragments de texte depuis ladite base de données (16, 17) en utilisant un index de syntagmes de ladite base de données, ledit index de syntagmes répertoriant des fragments de texte par syntagmes de mots. - Support de mémorisation lisible par un ordinateur mémorisant des instructions destinées à un procédé de traduction d'un syntagme de mots à partir d'une première langue naturelle vers une seconde langue naturelle en exécutant les étapes consistant à :appliquer en entrée (31) un texte écrit dans une première langue,extraire (33) ledit syntagme de mots à partir dudit texte, ledit syntagme de mots étant un groupe de deux mots associés ou plus,interroger (41) une base de données (17) au sujet dudit syntagme de mots extrait, la base de données (17) contenant des paires de fragments de texte, chaque paire incluant un fragment de texte d'une première langue et un fragment de texte correspondant de la seconde langue, lesdits fragments de texte représentant une unité grammaticale principale contenant au moins une proposition, ladite proposition étant un groupe de mots contenant un sujet et un prédicat, etobtenir (45) une traduction dudit syntagme de mots extrait sur la base de l'une des paires de fragments de texte visualisées pendant l'étape d'interrogation de la base de données (17),caractérisé en ce que
ladite étape d'interrogation (41) d'une base de données (17) utilise un index de syntagmes de ladite base de données, ledit index de syntagmes répertoriant lesdits fragments de texte par syntagmes de mots.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/466,823 US6473729B1 (en) | 1999-12-20 | 1999-12-20 | Word phrase translation using a phrase index |
US466823 | 1999-12-20 |
Publications (2)
Publication Number | Publication Date |
---|---|
EP1111514A1 EP1111514A1 (fr) | 2001-06-27 |
EP1111514B1 true EP1111514B1 (fr) | 2006-08-02 |
Family
ID=23853235
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP00127817A Expired - Lifetime EP1111514B1 (fr) | 1999-12-20 | 2000-12-19 | Procédé et système de traduction de phrases |
Country Status (4)
Country | Link |
---|---|
US (1) | US6473729B1 (fr) |
EP (1) | EP1111514B1 (fr) |
JP (1) | JP4504555B2 (fr) |
DE (1) | DE60029732T2 (fr) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008080190A1 (fr) * | 2007-01-04 | 2008-07-10 | Thinking Solutions Pty Ltd | Analyse linguistique |
Families Citing this family (101)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6498921B1 (en) * | 1999-09-01 | 2002-12-24 | Chi Fai Ho | Method and system to answer a natural-language question |
US5836771A (en) * | 1996-12-02 | 1998-11-17 | Ho; Chi Fai | Learning method and system based on questioning |
EP1266313A2 (fr) | 1999-03-19 | 2002-12-18 | Trados GmbH | Systeme de gestion de flux des travaux |
US20060116865A1 (en) | 1999-09-17 | 2006-06-01 | Www.Uniscape.Com | E-services translation utilizing machine translation and translation memory |
US7389221B1 (en) * | 2000-07-17 | 2008-06-17 | Globalenglish Corporation | System and method for interactive translation |
US7660740B2 (en) | 2000-10-16 | 2010-02-09 | Ebay Inc. | Method and system for listing items globally and regionally, and customized listing according to currency or shipping area |
US6983240B2 (en) | 2000-12-18 | 2006-01-03 | Xerox Corporation | Method and apparatus for generating normalized representations of strings |
US8874431B2 (en) * | 2001-03-16 | 2014-10-28 | Meaningful Machines Llc | Knowledge system method and apparatus |
US20030083860A1 (en) * | 2001-03-16 | 2003-05-01 | Eli Abir | Content conversion method and apparatus |
US7860706B2 (en) | 2001-03-16 | 2010-12-28 | Eli Abir | Knowledge system method and appparatus |
US7689405B2 (en) * | 2001-05-17 | 2010-03-30 | Language Weaver, Inc. | Statistical method for building a translation memory |
WO2003005166A2 (fr) | 2001-07-03 | 2003-01-16 | University Of Southern California | Modele de traduction statistique fonde sur la syntaxe |
US7752266B2 (en) * | 2001-10-11 | 2010-07-06 | Ebay Inc. | System and method to facilitate translation of communications between entities over a network |
US20030101044A1 (en) * | 2001-11-28 | 2003-05-29 | Mark Krasnov | Word, expression, and sentence translation management tool |
US7620538B2 (en) | 2002-03-26 | 2009-11-17 | University Of Southern California | Constructing a translation lexicon from comparable, non-parallel corpora |
US8078505B2 (en) | 2002-06-10 | 2011-12-13 | Ebay Inc. | Method and system for automatically updating a seller application utilized in a network-based transaction facility |
US7941348B2 (en) * | 2002-06-10 | 2011-05-10 | Ebay Inc. | Method and system for scheduling transaction listings at a network-based transaction facility |
US8719041B2 (en) | 2002-06-10 | 2014-05-06 | Ebay Inc. | Method and system for customizing a network-based transaction facility seller application |
US7308399B2 (en) * | 2002-06-20 | 2007-12-11 | Siebel Systems, Inc. | Searching for and updating translations in a terminology database |
US7293015B2 (en) * | 2002-09-19 | 2007-11-06 | Microsoft Corporation | Method and system for detecting user intentions in retrieval of hint sentences |
US7171351B2 (en) * | 2002-09-19 | 2007-01-30 | Microsoft Corporation | Method and system for retrieving hint sentences using expanded queries |
US7194455B2 (en) * | 2002-09-19 | 2007-03-20 | Microsoft Corporation | Method and system for retrieving confirming sentences |
US9881308B2 (en) | 2003-04-11 | 2018-01-30 | Ebay Inc. | Method and system to facilitate an online promotion relating to a network-based marketplace |
US7742985B1 (en) | 2003-06-26 | 2010-06-22 | Paypal Inc. | Multicurrency exchanges between participants of a network-based transaction facility |
US8548794B2 (en) | 2003-07-02 | 2013-10-01 | University Of Southern California | Statistical noun phrase translation |
CA2549769A1 (fr) * | 2003-12-15 | 2005-06-30 | Laboratory For Language Technology Incorporated | Systeme, methode et programme pour reconnaitre la traduction correspondante |
US20100262621A1 (en) * | 2004-03-05 | 2010-10-14 | Russ Ross | In-context exact (ice) matching |
US7983896B2 (en) | 2004-03-05 | 2011-07-19 | SDL Language Technology | In-context exact (ICE) matching |
US8296127B2 (en) | 2004-03-23 | 2012-10-23 | University Of Southern California | Discovery of parallel text portions in comparable collections of corpora and training using comparable texts |
DE202005021923U1 (de) * | 2004-04-02 | 2011-06-09 | Star Ag | Computergestütztes Hilfsmittel für ein Verfahren zur Erstellung von fremdsprachigen Dokumenten |
US8666725B2 (en) | 2004-04-16 | 2014-03-04 | University Of Southern California | Selection and use of nonstatistical translation components in a statistical machine translation framework |
US9189568B2 (en) | 2004-04-23 | 2015-11-17 | Ebay Inc. | Method and system to display and search in a language independent manner |
GB2415518A (en) * | 2004-06-24 | 2005-12-28 | Sharp Kk | Method and apparatus for translation based on a repository of existing translations |
JP5452868B2 (ja) | 2004-10-12 | 2014-03-26 | ユニヴァーシティー オブ サザン カリフォルニア | トレーニングおよび復号のためにストリングからツリーへの変換を使うテキスト‐テキスト・アプリケーションのためのトレーニング |
US7937396B1 (en) * | 2005-03-23 | 2011-05-03 | Google Inc. | Methods and systems for identifying paraphrases from an index of information items and associated sentence fragments |
US8719244B1 (en) | 2005-03-23 | 2014-05-06 | Google Inc. | Methods and systems for retrieval of information items and associated sentence fragments |
US8676563B2 (en) | 2009-10-01 | 2014-03-18 | Language Weaver, Inc. | Providing human-generated and machine-generated trusted translations |
US8886517B2 (en) | 2005-06-17 | 2014-11-11 | Language Weaver, Inc. | Trust scoring for language translation systems |
JP4234698B2 (ja) * | 2005-06-20 | 2009-03-04 | 富士通株式会社 | 構造化文書処理システム |
US7653531B2 (en) * | 2005-08-25 | 2010-01-26 | Multiling Corporation | Translation quality quantifying apparatus and method |
US7937265B1 (en) | 2005-09-27 | 2011-05-03 | Google Inc. | Paraphrase acquisition |
US10319252B2 (en) | 2005-11-09 | 2019-06-11 | Sdl Inc. | Language capability assessment and training apparatus and techniques |
US20070162481A1 (en) * | 2006-01-10 | 2007-07-12 | Millett Ronald P | Pattern index |
US8176052B2 (en) * | 2006-03-03 | 2012-05-08 | Perfect Search Corporation | Hyperspace index |
US8266152B2 (en) * | 2006-03-03 | 2012-09-11 | Perfect Search Corporation | Hashed indexing |
US20080213734A1 (en) * | 2006-04-02 | 2008-09-04 | Steve George Guide | Method for Decoding Pictographic Signs Present on Ancient Artifacts |
US8943080B2 (en) | 2006-04-07 | 2015-01-27 | University Of Southern California | Systems and methods for identifying parallel documents and sentence fragments in multilingual document collections |
US10062062B1 (en) | 2006-05-25 | 2018-08-28 | Jbshbm, Llc | Automated teller machine (ATM) providing money for loyalty points |
US8668146B1 (en) | 2006-05-25 | 2014-03-11 | Sean I. Mcghie | Rewards program with payment artifact permitting conversion/transfer of non-negotiable credits to entity independent funds |
US7703673B2 (en) | 2006-05-25 | 2010-04-27 | Buchheit Brian K | Web based conversion of non-negotiable credits associated with an entity to entity independent negotiable funds |
US9704174B1 (en) | 2006-05-25 | 2017-07-11 | Sean I. Mcghie | Conversion of loyalty program points to commerce partner points per terms of a mutual agreement |
US8684265B1 (en) | 2006-05-25 | 2014-04-01 | Sean I. Mcghie | Rewards program website permitting conversion/transfer of non-negotiable credits to entity independent funds |
US7865352B2 (en) * | 2006-06-02 | 2011-01-04 | Microsoft Corporation | Generating grammatical elements in natural language sentences |
US8209163B2 (en) * | 2006-06-02 | 2012-06-26 | Microsoft Corporation | Grammatical element generation in machine translation |
US7725306B2 (en) * | 2006-06-28 | 2010-05-25 | Microsoft Corporation | Efficient phrase pair extraction from bilingual word alignments |
US20080027911A1 (en) * | 2006-07-28 | 2008-01-31 | Microsoft Corporation | Language Search Tool |
US8886518B1 (en) | 2006-08-07 | 2014-11-11 | Language Weaver, Inc. | System and method for capitalizing machine translated text |
US8639782B2 (en) | 2006-08-23 | 2014-01-28 | Ebay, Inc. | Method and system for sharing metadata between interfaces |
US8521506B2 (en) | 2006-09-21 | 2013-08-27 | Sdl Plc | Computer-implemented method, computer software and apparatus for use in a translation system |
US8433556B2 (en) | 2006-11-02 | 2013-04-30 | University Of Southern California | Semi-supervised training for statistical word alignment |
US9122674B1 (en) | 2006-12-15 | 2015-09-01 | Language Weaver, Inc. | Use of annotations in statistical machine translation |
US8468149B1 (en) | 2007-01-26 | 2013-06-18 | Language Weaver, Inc. | Multi-lingual online community |
NZ553484A (en) * | 2007-02-28 | 2008-09-26 | Optical Systems Corp Ltd | Text management software |
US8615389B1 (en) | 2007-03-16 | 2013-12-24 | Language Weaver, Inc. | Generation and exploitation of an approximate language model |
US8831928B2 (en) | 2007-04-04 | 2014-09-09 | Language Weaver, Inc. | Customizable machine translation service |
US9779079B2 (en) * | 2007-06-01 | 2017-10-03 | Xerox Corporation | Authoring system |
US8825466B1 (en) | 2007-06-08 | 2014-09-02 | Language Weaver, Inc. | Modification of annotated bilingual segment pairs in syntax-based machine translation |
US8548791B2 (en) * | 2007-08-29 | 2013-10-01 | Microsoft Corporation | Validation of the consistency of automatic terminology translation |
US7774347B2 (en) * | 2007-08-30 | 2010-08-10 | Perfect Search Corporation | Vortex searching |
US7774353B2 (en) * | 2007-08-30 | 2010-08-10 | Perfect Search Corporation | Search templates |
US7912840B2 (en) * | 2007-08-30 | 2011-03-22 | Perfect Search Corporation | Indexing and filtering using composite data stores |
US8135574B2 (en) * | 2007-11-15 | 2012-03-13 | Weikel Bryan T | Creating and displaying bodies of parallel segmented text |
KR100917784B1 (ko) * | 2007-12-24 | 2009-09-21 | 한성주 | 콘텐트에 대한 코멘트를 기반으로 한 집단 감성 정보 검색방법 및 시스템 |
US8032495B2 (en) * | 2008-06-20 | 2011-10-04 | Perfect Search Corporation | Index compression |
US9262403B2 (en) | 2009-03-02 | 2016-02-16 | Sdl Plc | Dynamic generation of auto-suggest dictionary for natural language translation |
GB2468278A (en) | 2009-03-02 | 2010-09-08 | Sdl Plc | Computer assisted natural language translation outputs selectable target text associated in bilingual corpus with input target text from partial translation |
US9189475B2 (en) * | 2009-06-22 | 2015-11-17 | Ca, Inc. | Indexing mechanism (nth phrasal index) for advanced leveraging for translation |
US8990064B2 (en) | 2009-07-28 | 2015-03-24 | Language Weaver, Inc. | Translating documents based on content |
US8380486B2 (en) | 2009-10-01 | 2013-02-19 | Language Weaver, Inc. | Providing machine-generated translations and corresponding trust levels |
US10417646B2 (en) | 2010-03-09 | 2019-09-17 | Sdl Inc. | Predicting the cost associated with translating textual content |
CN103038764A (zh) * | 2010-04-14 | 2013-04-10 | 惠普发展公司,有限责任合伙企业 | 用于关键字提取的方法 |
KR101416098B1 (ko) | 2010-10-12 | 2014-07-09 | 한국전자통신연구원 | 구단위 번역 지식 학습 방법 및 이를 수행하는 장치 |
KR101356417B1 (ko) * | 2010-11-05 | 2014-01-28 | 고려대학교 산학협력단 | 병렬 말뭉치를 이용한 동사구 번역 패턴 구축 장치 및 그 방법 |
US9128929B2 (en) | 2011-01-14 | 2015-09-08 | Sdl Language Technologies | Systems and methods for automatically estimating a translation time including preparation time in addition to the translation itself |
US11003838B2 (en) | 2011-04-18 | 2021-05-11 | Sdl Inc. | Systems and methods for monitoring post translation editing |
US8694303B2 (en) | 2011-06-15 | 2014-04-08 | Language Weaver, Inc. | Systems and methods for tuning parameters in statistical machine translation |
US8886515B2 (en) | 2011-10-19 | 2014-11-11 | Language Weaver, Inc. | Systems and methods for enhancing machine translation post edit review processes |
CN103164390B (zh) * | 2011-12-15 | 2016-05-18 | 富士通株式会社 | 文档处理方法和文档处理装置 |
CN102591859B (zh) * | 2011-12-28 | 2014-11-05 | 华为技术有限公司 | 一种重用行业标准格式文件的方法及相关装置 |
US8942973B2 (en) | 2012-03-09 | 2015-01-27 | Language Weaver, Inc. | Content page URL translation |
US10261994B2 (en) | 2012-05-25 | 2019-04-16 | Sdl Inc. | Method and system for automatic management of reputation of translators |
US9152622B2 (en) | 2012-11-26 | 2015-10-06 | Language Weaver, Inc. | Personalized machine translation via online adaptation |
US9183197B2 (en) | 2012-12-14 | 2015-11-10 | Microsoft Technology Licensing, Llc | Language processing resources for automated mobile language translation |
US9213694B2 (en) | 2013-10-10 | 2015-12-15 | Language Weaver, Inc. | Efficient online domain adaptation |
US10747419B2 (en) | 2015-09-25 | 2020-08-18 | CSOFT International | Systems, methods, devices, and computer readable media for facilitating distributed processing of documents |
US11200217B2 (en) | 2016-05-26 | 2021-12-14 | Perfect Search Corporation | Structured document indexing and searching |
US10635863B2 (en) | 2017-10-30 | 2020-04-28 | Sdl Inc. | Fragment recall and adaptive automated translation |
US10817676B2 (en) | 2017-12-27 | 2020-10-27 | Sdl Inc. | Intelligent routing services and systems |
US11256867B2 (en) | 2018-10-09 | 2022-02-22 | Sdl Inc. | Systems and methods of machine learning for digital assets and message creation |
RU2767965C2 (ru) * | 2019-06-19 | 2022-03-22 | Общество С Ограниченной Ответственностью «Яндекс» | Способ и система для перевода исходной фразы на первом языке целевой фразой на втором языке |
USD917549S1 (en) | 2019-06-20 | 2021-04-27 | Yandex Europe Ag | Display screen or portion thereof with graphical user interface |
Family Cites Families (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
NL8900600A (nl) | 1989-03-13 | 1990-10-01 | Bso Buro Voor Systeemontwikkel | Tweetalige kennisbank. |
JPH02301869A (ja) * | 1989-05-17 | 1990-12-13 | Hitachi Ltd | 自然言語処理システム保守支援方式 |
US5329446A (en) * | 1990-01-19 | 1994-07-12 | Sharp Kabushiki Kaisha | Translation machine |
JPH0417065A (ja) | 1990-05-11 | 1992-01-21 | Hitachi Ltd | 翻訳支援装置 |
GB2272091B (en) * | 1992-10-30 | 1996-10-23 | Canon Europa Nv | Apparatus for use in aligning bilingual corpora |
GB2279164A (en) * | 1993-06-18 | 1994-12-21 | Canon Res Ct Europe Ltd | Processing a bilingual database. |
JPH07114565A (ja) * | 1993-10-18 | 1995-05-02 | Sharp Corp | 電子辞書 |
EP0672989A3 (fr) * | 1994-03-15 | 1998-10-28 | Toppan Printing Co., Ltd. | Système de traduction automatique |
JP3377290B2 (ja) * | 1994-04-27 | 2003-02-17 | シャープ株式会社 | イディオム処理機能を持つ機械翻訳装置 |
JPH0950435A (ja) | 1995-08-04 | 1997-02-18 | Sharp Corp | 翻訳装置 |
JP3254642B2 (ja) * | 1996-01-11 | 2002-02-12 | 株式会社日立製作所 | 索引の表示方法 |
US6098034A (en) * | 1996-03-18 | 2000-08-01 | Expert Ease Development, Ltd. | Method for standardizing phrasing in a document |
US6161083A (en) * | 1996-05-02 | 2000-12-12 | Sony Corporation | Example-based translation method and system which calculates word similarity degrees, a priori probability, and transformation probability to determine the best example for translation |
EP0887748B1 (fr) | 1997-06-27 | 2007-06-27 | International Business Machines Corporation | Système pour extraire une terminologie multilingue |
US6535842B1 (en) | 1998-12-10 | 2003-03-18 | Global Information Research And Technologies, Llc | Automatic bilingual translation memory system |
-
1999
- 1999-12-20 US US09/466,823 patent/US6473729B1/en not_active Expired - Lifetime
-
2000
- 2000-12-15 JP JP2000381611A patent/JP4504555B2/ja not_active Expired - Fee Related
- 2000-12-19 DE DE60029732T patent/DE60029732T2/de not_active Expired - Lifetime
- 2000-12-19 EP EP00127817A patent/EP1111514B1/fr not_active Expired - Lifetime
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008080190A1 (fr) * | 2007-01-04 | 2008-07-10 | Thinking Solutions Pty Ltd | Analyse linguistique |
US8600736B2 (en) | 2007-01-04 | 2013-12-03 | Thinking Solutions Pty Ltd | Linguistic analysis |
Also Published As
Publication number | Publication date |
---|---|
DE60029732T2 (de) | 2006-11-30 |
DE60029732D1 (de) | 2006-09-14 |
EP1111514A1 (fr) | 2001-06-27 |
US6473729B1 (en) | 2002-10-29 |
JP4504555B2 (ja) | 2010-07-14 |
JP2001195404A (ja) | 2001-07-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1111514B1 (fr) | Procédé et système de traduction de phrases | |
JP4544674B2 (ja) | 選択文字列に関連する情報を提供するシステム | |
US5612872A (en) | Machine translation system | |
US6055528A (en) | Method for cross-linguistic document retrieval | |
JP4911028B2 (ja) | 単語翻訳装置、翻訳方法および翻訳プログラム | |
EP0399533B1 (fr) | Système et méthode de traduction par machine | |
US20020010574A1 (en) | Natural language processing and query driven information retrieval | |
JPH083815B2 (ja) | 自然言語の共起関係辞書保守方法 | |
JPH0689304A (ja) | テキスト処理システムにより使用されるテキストを準備する方法及び装置 | |
AU3825999A (en) | Machine-assisted translation tools | |
JPH0644296A (ja) | 機械翻訳装置 | |
US5608623A (en) | Special cooccurrence processing method and apparatus | |
US5075851A (en) | System for translating a source language word with a prefix into a target language word with multiple forms | |
JP3352799B2 (ja) | 機械翻訳方法及び機械翻訳装置 | |
JP4007630B2 (ja) | 対訳例文登録装置 | |
JPH1074207A (ja) | 情報検索装置及び情報検索方法 | |
Piperidis et al. | A Multi-level Framework for Memory-Based Translation Aid Tools | |
Bennison et al. | Designing a Tool for Exploiting Bilingual Comparable Corpora. | |
JPH0561902A (ja) | 機械翻訳システム | |
Khoroshilov et al. | Machine phraseological translation of scientific-technical texts based on the model of generalized syntagmas | |
Schmidt-Wigger | Term checking through term variation | |
JP3358100B2 (ja) | 日本語質問メッセージ解析方法及び装置 | |
JP4262529B2 (ja) | 全文検索装置、方法、プログラム及び記録媒体 | |
Nenadić et al. | Recognition and acquisition of compound names from corpora | |
JPH0320866A (ja) | テキストベース検索方式 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): DE FR GB |
|
AX | Request for extension of the european patent |
Free format text: AL;LT;LV;MK;RO;SI |
|
17P | Request for examination filed |
Effective date: 20011227 |
|
AKX | Designation fees paid |
Free format text: DE FR GB |
|
17Q | First examination report despatched |
Effective date: 20041104 |
|
GRAJ | Information related to disapproval of communication of intention to grant by the applicant or resumption of examination proceedings by the epo deleted |
Free format text: ORIGINAL CODE: EPIDOSDIGR1 |
|
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): DE FR GB |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D |
|
REF | Corresponds to: |
Ref document number: 60029732 Country of ref document: DE Date of ref document: 20060914 Kind code of ref document: P |
|
ET | Fr: translation filed | ||
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
26N | No opposition filed |
Effective date: 20070503 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 16 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 17 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: GB Payment date: 20161128 Year of fee payment: 17 Ref country code: DE Payment date: 20161121 Year of fee payment: 17 Ref country code: FR Payment date: 20161121 Year of fee payment: 17 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R119 Ref document number: 60029732 Country of ref document: DE |
|
GBPC | Gb: european patent ceased through non-payment of renewal fee |
Effective date: 20171219 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: ST Effective date: 20180831 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: DE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20180703 Ref country code: FR Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20180102 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GB Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20171219 |