EP0838765B1 - System und Verfahren zur Wiederauffindung von Dokumenten in mehreren Sprachen - Google Patents
System und Verfahren zur Wiederauffindung von Dokumenten in mehreren Sprachen Download PDFInfo
- Publication number
- EP0838765B1 EP0838765B1 EP97118378A EP97118378A EP0838765B1 EP 0838765 B1 EP0838765 B1 EP 0838765B1 EP 97118378 A EP97118378 A EP 97118378A EP 97118378 A EP97118378 A EP 97118378A EP 0838765 B1 EP0838765 B1 EP 0838765B1
- Authority
- EP
- European Patent Office
- Prior art keywords
- search
- documents
- key word
- searcher
- search result
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/58—Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/3332—Query translation
- G06F16/3337—Translation of the query language, e.g. Chinese to English
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/55—Rule-based translation
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
Definitions
- This invention relates to a document searching system and method which enables to search discretionally the documents described by plural different languages (multilingual documents) according to the key word written by the designated language (e.g. native language) and to display the search result by the designated language.
- the designated language e.g. native language
- the searcher When the database to be searched is written in another language from the searcher's native language, the searcher should translate the key words for searching from their native language to that used in the database prior to inputting the search.
- Such a searching system has been provided in which the search formula input is written by the searcher's native language and then is translated automatically to that used in the database to be searched. The search is then carried out in the database.
- Such a system is disclosed by Japanese Kokai Patent No. 8-202721 where the search result is translated automatically to the searcher 'a native language and then displayed.
- the documents to be searched are generally text data only, but they are usually supplemented by objects such as drawings, photographs or animations.
- each object is usually arranged in a designated area and shown together with the text data on the same page.
- the object is linked to the text data by assigning a tag with specified function in the document, and such text is referred to as hyper-linked text.
- SGML and HTML, used in WWW are two kinds of texts of this type.
- Such software such as viewers or browsers are generally utilized to interpret and develop the hyper-linked text and then display it.
- Such a system may be constructed by combining these techniques mentioned above as follows.
- the search condition formula is translated automatically to an equivalent one written in another language so as to include synonyms.
- the search is then carried out and then the search result is automatically translated to and displayed by the language defined during the input of the search condition formula.
- Hong Yoon Kim et al. "A Presentation Agent for Multilingual Services", Proceedings of the Network Operations and Management Symposium (Nom, Kissimmee, Feb. 14-17, 1994, vol. 2 of 3, February 1994, IEEE, pages 635-644) discloses a presentation agent for multilingual services, such as flight information systems, in which the items of a database are classified into three types. The items which are common in all languages are not translated, short vocabulary items are converted by simple conversion tables and long sentence items have to be translated either by a machine translator or are pre-translated before being entered into the database.
- EP-A-0 725 353 discloses a document retrieval and display system which retrieves source documents in different languages from servers linked by a communication network, translates the retrieved source documents as necessary, stores the translated documents, and displays the source documents and translated documents at a client device connected to the communication network.
- Patent abstracts of Japan 01185724 discloses an information retriever in which a key word is extracted from the query text.
- the system automatically generates the synonyms of the search formula and translates them to other languages. This happens even if the documents to be searched are written in the same language used in the original search terms. Thus, if both the searcher's native language and the other languages are included this may complicate the search.
- the documents to be searched include more than three languages such as Japanese, English. French and German
- plural translating functions and means that is, from English to Japanese, from English to French, and from English to German, are required.
- the translating function is generally applied to sentences and therefore, it is of a high technical level, has a large structure and is a complicated program. This causes the system response to be lowered when the program is packaged.
- the document searching system for multilingual documents of the present invention is characterized as described below to solve these problems.
- the system of the present invention is provided with and characterized by an input means to input a search command including a search key word designated by the searcher; a translation control means for the key word to translate the key word input by the searcher into another language used in the document to be searched; a search formula generating means to generate a search formula from the key word transferred from the translation control means based on the key word; a search means to search a document storage means according to the search formula transferred from the search formula generating means; a search result storage means to store the searched and selected documents; another translation control means for the search result to translate the documents stored in the search result storage means to the designated language; and a display means to display the results of the translation.
- the document searching system for multilingual documents of the present invention is provided with two translation means independent of each other.
- One is a translation control means for the key word to translate the key word written by the searcher's native language and the other is a translation control means for the search result which may translate the entire text of the documents selected as a search result.
- the key word translation means a simple translation system shall be applied because the objects of the translation are words such as nouns, verbs and so forth.
- the search result translation means a high-level translation system shall be applied because the objects of the translation are common sentences and the appropriate translation is required by deducing from the context of the document.
- Fig. 1 is a block diagram to show the functional structure of the system of the present invention.
- a document searching system for multilingual documents is provided with a client computer A, an application server B and a data server C.
- the client computer A has a display input means 1.
- the display input means 1 takes the information input by the client into the system and transfers it to a communication control means 2 in the application server B through the communication network and also receives the information from the communication control means and then displays it.
- the application server B comprises the communication control means 2; a translation control means 3 for a free key word to translate the free key word into the other required languages; a synonym search means 4 which is provided with the functions for registration and modification of the word in order to search and output the words having identical or equivalent meaning with the free key word input by the client; and a search formula generating means 5 which generates a search formula according to key words output from the translation control means 3 for the free key word.
- the application server B also comprises a primary storage means 8 for temporarily storing the search results; a translation control means 9 for optionally translating the search result; and an edit means 10 for editing the search results and outputting the edited results to the communication control means 2.
- a secondary storage means 11 is added to the primary storage means 8 to enable temporary storage of the document which may be the search result just prior to being displayed or the document not yet translated.
- the data server C comprises a search means 6 which works as a search engine; and a plural document storage means 7 which stores plural documents to be searched.
- the operation of the document searching system for multilingual documents will be described.
- the language used to define the search conditions is selected by the searcher (step 100) and then the databases to be searched are designated (step 101) and at the next step, the search condition is input (step 102).
- the search start button is operated (step 103)
- the translation control means 3 for the free key word and the synonym search means 4 start processing.
- the initial condition is set (step 200). Then the language used to define and input the search conditions is checked to see whether it coincides with that of the documents to be searched. In case they coincide, the processing goes to the synonym search routine 203 (Fig. 5). In case they do not coincide, the processing goes to the translation function main routine 202 (Fig. 4) and then moves to the synonym search routine 203.
- the search condition is redefined to add the synonyms to the original search condition input by the searcher and set to a search table (i). The "i" is a variable. The databases to be searched and the corresponding search formula are set in the search table.
- the databases to be searched are checked to see whether the next one is waiting or not. If a database is waiting, the variable "1" is added by 1 and the language for the next database waiting to be searched is set (step 207) and then the processing cycle returns to the step 201. When the next database is not waiting, the routine is terminated.
- Fig. 4 shows a main routine for the translation function.
- the search condition is input at step 300.
- the search condition consists of the language used by the searcher, the free word used for searching and the languages used in the databases to be searched. Therefore the number of search conditions is decided by the combination of those elements.
- the search condition is set, the language used to define the search conditions is translated into other languages that are used in the databases to be searched (step 301).
- the search word is checked to see whether it is translatable or not at step 302. If translatable, the translated search condition is added (step 303). If not translatable, nothing is added and the corresponding portion is left blank. Then the search condition is checked to see whether another search condition is waiting or not (step 304), and if waiting, the cycle returns to step 301.
- Fig. 5 shows a synonym search routine.
- the search condition is checked to see whether the word is set or not (step 400). If the word is set, the synonyms of the word are picked up from the synonym tables defined for each language used for searching (step 401). If the synonyms are stored in the table (step 402), these synonyms are picked up (step 403). If no synonym is stored in the table, nothing will be picked up. Then the search condition is checked to see whether another search condition is waiting or not at step 404, and if waiting, the cycle returns to step 400.
- the search formula generating means 5 receives the table in which the synonyms for the free key word in the search condition and the database to be searched are stored from the free key word translation means (step 500). Then the searcher is asked to confirm the contents of each search condition (step 501). The searcher checks whether the search condition should be changed or not (step 502) and if changed, revised data will be set (step 503). The revision by the searcher includes the confirmation, the addition and the deletion of synonyms. When step 502 results in no revision, the information from the free key word translation means is used as it is. Then the search formula is generated (step 504).
- the search formula mentioned above means that the Patent No. field of the records in which either of "KURUMA” "SHARYOU” or “JIDOUSHA” is included and the data of the corresponding Title field are output to the text field from the JP table in which the Japanese Patents are stored ; and that the Patent No. field of the records in which either of "car”, “vehicle”, “automotive” or “automobile” is included and the data of the corresponding Title field are output to the text field from the USP table in which the U.S. Patents are stored.
- search formula mentioned above is written to make a simple explanation for generating the search formula. Therefore other types of search formulas with individual and general links can be generated because the search method of the document with links for the data server which includes plural storage means of the document with links.
- the RDB search means and the search means of the document with link execute the search according to the information from the search formula generating means.
- General and known search methods are applied to these search methods, so their description has been omitted.
- the number of the selected documents as the search result, and their information including the identification of the sentence, the title, the author and the like and their management information are stored in the primary storage means 8 for the search result (step 600).
- This information is generally referred to as bibliographic data.
- the management information comprises language information to identify the kind of the languages and the location information to indicate the location of the sentence.
- the primary storage means 8 for the search result reserves a designated amount of memory when the searcher is logged in the system. The amount of memory defines the upper limit of the number of searches. This area of memory is reserved until the searcher logs-out and is then released at the same time of the log-out.
- the searcher is inquired at the step 601 about whether the bibliographic information mentioned above should be translated and displayed or not.
- step 602 When translation is required (step 602), the information stored in the primary storage means 8 for the search result is read out (step 603). Then the translation control means for the search results is called according to required individual language information and in the next step the translation is carried out (step 604). Then the bibliographic information is transmitted to each translation function (step 605) and it is then transferred to the search result edit routine (step 606) and at the next step 607 the search result list is displayed to the searcher.
- step 606 will be executed directly.
- the searcher selects the sentence (e.g.
- step 608 the management information for the selected sentence is read out (step 609) and the management information is delivered to the search means as a search condition (step 610).
- step 611 the search is executed by the search means and then the search result is stored in the secondary storage means 11 (step 612).
- step 612 the whole document and the corresponding objects are stored.
- the secondary storage means 11 is linked with the primary storage means 8.
- the searcher is inquired at the step 613 whether the translation should be carried out or not and then the necessity of the translation is judged (step 614) and if necessary, whole sentences or designated portions are translated at step 615.
- Fig. 8A is a process to show the list of the search result and a tag is assigned to show the list (step 700) and then another tag is assigned to show the number of hits in the search result (step 701).
- another tag is assigned for the display layout, for instance, it may be a tag for a button to display the result of the translation (step 702).
- Fig. 8B shows another case where each content to be displayed is respectively defined and a tag is assigned to allocate the button to the position of the drawing (step 703) and then another tag is assigned to link with the control bar on the display (step 704).
- another tag is assigned to structure the display layout (step 705).
- the display layout made by these processes will be described with reference to Fig. 9.
- Fig. 9 shows an example where the present invention is applied to the search system of a patent journal.
- Fig. 9(A) shows a basic display layout.
- the frame is divided into three areas : an area 20, an area 30 and an area 40.
- the area 20 is used to show a control bar in which is arranged the bibliographic information of the patent.
- a button 21 can be used to indicate the patent publication number, the assignee and the like and to display them on the area 30.
- a button 22 can be used to show the abstract of the patent; a button 23 can be used to show the claim of the patent; and button 24 can be used to show all of the drawings included in the patent.
- the area 30 is used to show the contents of the item designated by these buttons.
- the abstract information is displayed.
- the drawing number previously selected by the assignee (e.g. Fig. 1) is included in the abstract information.
- the image drawing corresponding to that Figure number is displayed on the area 40 (object frame).
- the drawing number included in the abstract can also call the corresponding drawing onto the display by clicking on that number.
- the select all drawings button 24 of the area 20 is clicked under this condition, the frame shown in Fig. 9B is displayed.
- This frame comprises an area 50 to show the currant image drawing; a button 51 to call the previous image drawing; a button 52 to call the following image drawing; and a button 53 to call all the drawings at the same time in one frame.
- the button 53 is selected under this condition, the frame shown in Fig. 9C is displayed.
- the document searching system for multilingual documents of the present invention is provided with two translation means that are independent of each other as described above. That is, one is a translation control means for the translating the key words written by the searcher's language into another language.
- the other translation control means is for the search results which may translate the whole documents selected as a search result.
- the key word translation means a simple translation system is applied because the objects of the translation are words such as nouns, verbs and so forth.
- the search result translation means a high-level translation system may be applied because the objects of the translation are common sentences and the appropriate translation is required by deducing from the context of the document. Therefore, in the processing steps of the free key word translation means where the search formula may be changed, added, modified or deleted frequently to improve the exactness of the search result, the processing speed in these steps is not decreased and therefore the whole system response may speed up.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Machine Translation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Information Transfer Between Computers (AREA)
Claims (10)
- Dokumentsuchsystem für mehrsprachige Dokumente, umfassend:Eingabemittel bzw. -einrichtungen zum Eingeben eines Suchbefehls, beinhaltend ein Suchschlüsselwort, eine Sprache, welche durch einen Suchenden bezeichnet ist, und eine Datenbank, welche zu durchsuchen ist, welche durch den Suchenden bezeichnet ist;Suchmittel zum Durchsuchen von Dokumentspeichermitteln für ein Speichern von Dokumenten;Suchresultat-Speichermittel zum Speichern der ausgewählten Dokumente;Suchresultat-Übersetzungsregel- bzw. -steuermittel zum Übersetzen des Suchresultats, welches in den Suchresultat-Speichermitteln gespeichert ist, in die in der Eingabe des Suchbefehls bezeichnete Sprache; undAnzeigemittel, um das Resultat der Übersetzung anzuzeigen, gekennzeichnet durch Schlüsselwort-Übersetzungssteuer- bzw. -regelmittel zum Übersetzen des Schlüsselworts, welches durch den Suchenden eingegeben wurde, in eine andere Sprache, welche in den zu durchsuchenden Dokumenten verwendet wird;Suchformel-Erzeugungsmittel zum Erzeugen einer Suchformel, umfassend Informationen für die Suche, wie beispielsweise das übersetzte Schlüsselwort, welches von den Schlüsselwort-Übersetzungssteuermitteln übertragen wurde, und die zu durchsuchende Datenbank;
worin die Schlüsselwort-Übersetzungssteuermittel und die Suchresultat-Übersetzungssteuermittel unabhängig voneinander sind, so daß die Schlüsselwort-Übersetzungssteuermittel das Schlüsselwort übersetzen und die Suchresultat-Übersetzungssteuermittel wenigstens Sätze der ausgewählten Dokumente übersetzen, unddie Suchmittel die Dokumentenspeichermittel gemäß der Suchformel durchsuchen, welche von den Suchformel-Erzeugungsmittel übertragen wurde. - Dokumentensuchsystem für mehrsprachige Dokumente nach Anspruch 1, worin die Dokumente Dokumente mit Links bzw. Verbindungen sind.
- Dokumentensuchsystem für mehrsprachige Dokumente nach Anspruch 1 oder 2, worin das System zusätzlich Synonym-Suchmittel umfaßt, um Synonyme des Schlüsselworts auszugeben, welches durch den Suchenden eingegeben ist.
- Dokumentensuchsystem für mehrsprachige Dokumente nach einem der vorangehenden Ansprüche, worin die Synonym-Suchmittel zusätzlich Synonym-Speichermittel und Schreibmittel umfassen, um die Synonym-Speichermittel neu zu schreiben.
- Dokumentensuchsystem für mehrsprachige Dokumente nach einem der vorangehenden Ansprüche, worin die Suchformel-Erzeugungsmittel mit Synonym-Modifikationsmitteln versehen sind, welche ein Ändern der ausgewählten Synonyme ermöglichen.
- Dokumentensuchsystem für mehrsprachige Dokumente nach einem der vorangehenden Ansprüche, worin die Suchresultat-Übersetzungssteuer- bzw. -regelmittel mit Übersetzungsbereich-Bezeichnungs- bzw. -Bestimmungsmitteln versehen sind, um einen zu übersetzenden Bereich zu bezeichnen.
- Dokumentensuchsystem für mehrsprachige Dokumente nach einem der vorangehenden Ansprüche, worin die Suchresultat-Speichermittel mit primären Speichermitteln, um bibliographische Information des Suchresultats zu speichern, und sekundären Speichermitteln versehen sind, um den gesamten Text der Dokumente zu speichern.
- Dokumentensuchsystem für mehrsprachige Dokumente nach einem der vorangehenden Ansprüche, worin das Dokument mit einem Link ein HTML-Dokument ist.
- Dokumentensuchverfahren für mehrsprachige Dokumente, umfassend die Schritte:Eingeben eines Suchbefehls, umfassend ein Such-Schlüsselwort, eine Sprache, welche durch einen Suchenden bezeichnet ist, und eine Datenbank, welche zu durchsuchen ist, welche durch den Suchenden bezeichnet oder bezeichenbar ist;Übersetzen des Schlüsselworts, welches durch den Suchenden eingegeben wird, in eine andere Sprache, welche in den zu durchsuchenden Dokumenten verwendet wird; undErzeugen einer Suchformel, welche Information für die Suche umfaßt, wie beispielsweise das übersetzte Schlüsselwort und die zu durchsuchende Datenbank;Durchsuchen von Dokumentenspeichermitteln zum Speichern von Dokumenten;Speichern der ausgewählten Dokumente in Suchresultat-Speichermitteln;Übersetzen der Dokumente, welche in den Suchresultat-Speichermitteln gespeichert werden, in die in der Eingabe des Suchbefehls bezeichnete Sprache; undAnzeigen des Resultats der Übersetzung,
- Dokumentensuchverfahren für mehrsprachige Dokumente, umfassend die Schritte:Eingeben eines Suchbefehls, beinhaltend ein Suchschlüsselwort, eine Sprache, welche durch einen Suchenden bezeichnet ist, und eine Datenbank, welche zu durchsuchen ist, welche durch den Suchenden bezeichnet oder bezeichenbar ist;Übertragen des Schlüsselworts, welches durch den Suchenden eingegeben wurde, in ein Dokumentensuchsystem für mehrsprachige Dokumente, umfassend:Schlüsselwort-Übersetzungssteuer- bzw. -regelmittel zum Übersetzen des Schlüsselworts, welches durch den Suchenden eingegeben wurde, in eine andere Sprache, welche in den zu durchsuchenden Dokumenten verwendet wird;Suchformel-Erzeugungsmlttel, um eine Suchformel zu erzeugen, umfassend Information für die Suche, wie beispielsweise das Schlüsselwort, welches von den Schlüsselwort-Übersetzungssteuer- bzw. -regelmitteln übertragen wird, und die zu durchsuchende Datenbank;Suchmittel zum Durchsuchen von Dokumentenspeichermitteln entsprechend der Suchformel, welche von den Suchformel-Erzeugungsmitteln übertragen wird;Suchresultat-Speichermittel zum Speichern der ausgewählten Dokumente;Suchresultat-Übersetzungssteuer- bzw. -regelmittel zum Übersetzen der ausgewählten Dokumente, welche in den Suchresultat-Speichermitteln gespeichert werden, in die bei dem Schritt einer Eingabe bezeichnete Sprache; undÜbertragungsmittel zum Übertragen des übersetzten Suchresultats, wobei das Verfahren weiters umfaßt:Anzeigen des Resultats der Übersetzung;
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US740044 | 1996-10-23 | ||
US08/740,044 US5956740A (en) | 1996-10-23 | 1996-10-23 | Document searching system for multilingual documents |
Publications (2)
Publication Number | Publication Date |
---|---|
EP0838765A1 EP0838765A1 (de) | 1998-04-29 |
EP0838765B1 true EP0838765B1 (de) | 2003-10-01 |
Family
ID=24974807
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP97118378A Expired - Lifetime EP0838765B1 (de) | 1996-10-23 | 1997-10-22 | System und Verfahren zur Wiederauffindung von Dokumenten in mehreren Sprachen |
Country Status (4)
Country | Link |
---|---|
US (3) | US5956740A (de) |
EP (1) | EP0838765B1 (de) |
JP (1) | JP3666004B2 (de) |
DE (1) | DE69725258T2 (de) |
Families Citing this family (102)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5956740A (en) * | 1996-10-23 | 1999-09-21 | Iti, Inc. | Document searching system for multilingual documents |
JP3282976B2 (ja) * | 1996-11-15 | 2002-05-20 | 株式会社キングジム | 文字情報処理装置及び方法 |
US6542888B2 (en) * | 1997-11-26 | 2003-04-01 | International Business Machines Corporation | Content filtering for electronic documents generated in multiple foreign languages |
US20130124987A1 (en) | 1998-02-23 | 2013-05-16 | Transperfect Global, Inc. | Translation management system |
US6526426B1 (en) * | 1998-02-23 | 2003-02-25 | David Lakritz | Translation management system |
US6623529B1 (en) * | 1998-02-23 | 2003-09-23 | David Lakritz | Multilingual electronic document translation, management, and delivery system |
US8489980B2 (en) | 1998-02-23 | 2013-07-16 | Transperfect Global, Inc. | Translation management system |
JP3666005B2 (ja) * | 1998-03-03 | 2005-06-29 | マツダ株式会社 | 特許情報検索システム、特許情報検索装置、情報検索中継装置並びに特許情報検索方法 |
DE19814348A1 (de) * | 1998-03-31 | 1999-10-07 | Ibm | System und Verfahren zur Kommunikation mit verschiedenen elektronischen Archivsystemen |
GB2337611A (en) * | 1998-05-20 | 1999-11-24 | Sharp Kk | Multilingual document retrieval system |
GB2338089A (en) * | 1998-06-02 | 1999-12-08 | Sharp Kk | Indexing method |
US6735585B1 (en) * | 1998-08-17 | 2004-05-11 | Altavista Company | Method for search engine generating supplemented search not included in conventional search result identifying entity data related to portion of located web page |
US7191393B1 (en) * | 1998-09-25 | 2007-03-13 | International Business Machines Corporation | Interface for providing different-language versions of markup-language resources |
US6349276B1 (en) * | 1998-10-29 | 2002-02-19 | International Business Machines Corporation | Multilingual information retrieval with a transfer corpus |
US6381598B1 (en) * | 1998-12-22 | 2002-04-30 | Xerox Corporation | System for providing cross-lingual information retrieval |
CA2367320A1 (en) | 1999-03-19 | 2000-09-28 | Trados Gmbh | Workflow management system |
US6954767B1 (en) * | 1999-03-31 | 2005-10-11 | Fuji Photo Film Co., Ltd. | Server and method for searching for image using image prefetch, designating database and storage devices for searching, and setting retrieval and processing parameters for search |
JP2001022787A (ja) * | 1999-07-13 | 2001-01-26 | Nippon Telegr & Teleph Corp <Ntt> | 多言語インタラクティブ情報検索システム及び多言語インタラクティブ情報検索プログラムを記録した記録媒体 |
US6418402B1 (en) * | 1999-07-27 | 2002-07-09 | International Business Machines Corporation | Method and system for utilizing machine translation as input correction |
CN1176432C (zh) * | 1999-07-28 | 2004-11-17 | 国际商业机器公司 | 提供本国语言查询服务的方法和系统 |
US20060116865A1 (en) * | 1999-09-17 | 2006-06-01 | Www.Uniscape.Com | E-services translation utilizing machine translation and translation memory |
US7925610B2 (en) * | 1999-09-22 | 2011-04-12 | Google Inc. | Determining a meaning of a knowledge item using document-based information |
US7149964B1 (en) * | 2000-02-09 | 2006-12-12 | Microsoft Corporation | Creation and delivery of customized content |
KR100372582B1 (ko) * | 2000-02-23 | 2003-02-17 | 가부시키가이샤 히타치세이사쿠쇼 | 데이터처리방법 및 시스템 및 그 처리프로그램을 기록한계산기판독이 가능한 기록매체 |
EP1281134A4 (de) * | 2000-03-17 | 2007-01-10 | Filesx Ltd | Beschleunigung von antworten auf anforderungen von benutzern an ein internet |
US20040006560A1 (en) * | 2000-05-01 | 2004-01-08 | Ning-Ping Chan | Method and system for translingual translation of query and search and retrieval of multilingual information on the web |
US6604101B1 (en) * | 2000-06-28 | 2003-08-05 | Qnaturally Systems, Inc. | Method and system for translingual translation of query and search and retrieval of multilingual information on a computer network |
FR2809509B1 (fr) * | 2000-05-26 | 2003-09-12 | Bull Sa | Systeme et procede d'internationalisation du contenu de documents a balises dans un systeme informatique |
GB2362971B (en) * | 2000-05-30 | 2004-03-24 | Com Nation Ltd | A method of searching the internet and an internet search engine |
US20010054011A1 (en) * | 2000-06-15 | 2001-12-20 | Nec Corporation | Contents conversion fee charging system, contents conversion fee charging method and storage medium storing program for controlling same |
JP2002024141A (ja) * | 2000-07-05 | 2002-01-25 | Nec Corp | 電子メールの翻訳を代行する方法、そのための装置及びシステム |
JP2002063201A (ja) * | 2000-08-17 | 2002-02-28 | Nova Asia:Kk | コンピュータネットワークを用いた多言語対応情報検索方法及び検索兼翻訳ツールバー |
US6999932B1 (en) | 2000-10-10 | 2006-02-14 | Intel Corporation | Language independent voice-based search system |
WO2002035369A1 (fr) * | 2000-10-13 | 2002-05-02 | Kabushiki Kaisha Toshiba | Systeme de relais de donnees, procede de relais de donnees, programme de relais de donnees, procede de traitement d'informations, et programme de traitement d'informations |
US20020099533A1 (en) * | 2001-01-23 | 2002-07-25 | Evan Jaqua | Data processing system for searching and communication |
JP3328913B1 (ja) * | 2001-08-03 | 2002-09-30 | 学校法人 慶應義塾 | 多言語文献検索システム |
US7146358B1 (en) * | 2001-08-28 | 2006-12-05 | Google Inc. | Systems and methods for using anchor text as parallel corpora for cross-language information retrieval |
US7392173B2 (en) * | 2001-09-06 | 2008-06-24 | International Business Machines Corporation | Method and system of presenting a document to a user |
JP3452558B2 (ja) * | 2001-09-25 | 2003-09-29 | インターナショナル・ビジネス・マシーンズ・コーポレーション | 翻訳対象のリソースと分野別辞書を関連付けるための方法、システムおよびプログラム |
EP1300773A1 (de) * | 2001-10-02 | 2003-04-09 | Sun Microsystems, Inc. | Informationsdienst unter Verwendung von einem Thesaurus |
US20030131316A1 (en) * | 2002-01-07 | 2003-07-10 | Kyle Brown | Internationalizing SGML documents |
US20030144912A1 (en) * | 2002-01-29 | 2003-07-31 | Mcgee Todd | Multilingual messaging system and method for e-commerce |
US6952691B2 (en) * | 2002-02-01 | 2005-10-04 | International Business Machines Corporation | Method and system for searching a multi-lingual database |
JP2003296223A (ja) * | 2002-03-29 | 2003-10-17 | Fuji Xerox Co Ltd | ウェブページ提供方法および装置並びにプログラム |
ATE330287T1 (de) * | 2002-04-12 | 2006-07-15 | Targit As | Verfahren zur verarbeitung von mehrsprachigen abfragen |
US7111237B2 (en) * | 2002-09-30 | 2006-09-19 | Qnaturally Systems Inc. | Blinking annotation callouts highlighting cross language search results |
CN1685313A (zh) * | 2002-09-30 | 2005-10-19 | 陈宁萍 | 由指针发起的对电子文档中文本信息的即时双语注释 |
US20040103087A1 (en) * | 2002-11-25 | 2004-05-27 | Rajat Mukherjee | Method and apparatus for combining multiple search workers |
US7580960B2 (en) | 2003-02-21 | 2009-08-25 | Motionpoint Corporation | Synchronization of web site content between languages |
US7451129B2 (en) * | 2003-03-31 | 2008-11-11 | Google Inc. | System and method for providing preferred language ordering of search results |
US7451130B2 (en) * | 2003-06-16 | 2008-11-11 | Google Inc. | System and method for providing preferred country biasing of search results |
US8306972B2 (en) * | 2003-03-31 | 2012-11-06 | Google Inc. | Ordering of search results based on language and/or country of the search results |
US20040243391A1 (en) * | 2003-05-28 | 2004-12-02 | Nelson David D. | Apparatus, system, and method for multilingual regulation management |
US20100070265A1 (en) * | 2003-05-28 | 2010-03-18 | Nelson David D | Apparatus, system, and method for multilingual regulation management |
US7854009B2 (en) * | 2003-06-12 | 2010-12-14 | International Business Machines Corporation | Method of securing access to IP LANs |
US20040268306A1 (en) * | 2003-06-30 | 2004-12-30 | Cheng Ken Prayoon | Methods, systems and computer program products for language independent data communication and display |
US8014997B2 (en) * | 2003-09-20 | 2011-09-06 | International Business Machines Corporation | Method of search content enhancement |
US20050065774A1 (en) * | 2003-09-20 | 2005-03-24 | International Business Machines Corporation | Method of self enhancement of search results through analysis of system logs |
US20050177358A1 (en) * | 2004-02-10 | 2005-08-11 | Edward Melomed | Multilingual database interaction system and method |
US20100262621A1 (en) * | 2004-03-05 | 2010-10-14 | Russ Ross | In-context exact (ice) matching |
US7983896B2 (en) | 2004-03-05 | 2011-07-19 | SDL Language Technology | In-context exact (ICE) matching |
US20060173829A1 (en) * | 2005-01-10 | 2006-08-03 | Neeman Yoni M | Embedded translation-enhanced search |
JP4710417B2 (ja) * | 2005-05-25 | 2011-06-29 | 富士ゼロックス株式会社 | 情報処理装置 |
US8521506B2 (en) * | 2006-09-21 | 2013-08-27 | Sdl Plc | Computer-implemented method, computer software and apparatus for use in a translation system |
KR100835172B1 (ko) * | 2006-10-16 | 2008-06-05 | 한국전자통신연구원 | 동의어를 이용한 정보 검색 시스템 및 그 방법 |
DE102006060173A1 (de) * | 2006-12-18 | 2008-04-10 | Zettwerk Software Engineering Gmbh | Rekursives sprachübergreifendes Recherche-System |
US7720856B2 (en) * | 2007-04-09 | 2010-05-18 | Sap Ag | Cross-language searching |
US8117194B2 (en) * | 2007-05-07 | 2012-02-14 | Microsoft Corporation | Method and system for performing multilingual document searches |
US8799307B2 (en) * | 2007-05-16 | 2014-08-05 | Google Inc. | Cross-language information retrieval |
US7917840B2 (en) * | 2007-06-05 | 2011-03-29 | Aol Inc. | Dynamic aggregation and display of contextually relevant content |
US8051061B2 (en) * | 2007-07-20 | 2011-11-01 | Microsoft Corporation | Cross-lingual query suggestion |
US7917488B2 (en) * | 2008-03-03 | 2011-03-29 | Microsoft Corporation | Cross-lingual search re-ranking |
KR100923936B1 (ko) * | 2008-03-14 | 2009-10-29 | 엔에이치엔(주) | 일본어 사전 서비스에 있어서 2개 단어 이상의 쿼리, 한글쿼리 또는 일반적인 쿼리의 입력 시 검색 결과를 제공하는방법 및 시스템 |
US8171041B2 (en) * | 2008-05-15 | 2012-05-01 | Enpulz, L.L.C. | Support for international search terms |
KR100956413B1 (ko) | 2008-05-21 | 2010-05-06 | 한국과학기술정보연구원 | 언어 교차 검색 방법 및 시스템 |
US8312032B2 (en) * | 2008-07-10 | 2012-11-13 | Google Inc. | Dictionary suggestions for partial user entries |
JP2010039721A (ja) * | 2008-08-04 | 2010-02-18 | Just Syst Corp | データ入力装置、方法、及びプログラム |
JP2010055235A (ja) * | 2008-08-27 | 2010-03-11 | Fujitsu Ltd | 翻訳支援プログラム、及び該システム |
US9262403B2 (en) * | 2009-03-02 | 2016-02-16 | Sdl Plc | Dynamic generation of auto-suggest dictionary for natural language translation |
GB2468278A (en) * | 2009-03-02 | 2010-09-08 | Sdl Plc | Computer assisted natural language translation outputs selectable target text associated in bilingual corpus with input target text from partial translation |
JPWO2010109594A1 (ja) * | 2009-03-24 | 2012-09-20 | 三菱電機インフォメーションシステムズ株式会社 | 文書検索装置、文書検索システム、文書検索プログラム、および文書検索方法 |
US20110251837A1 (en) * | 2010-04-07 | 2011-10-13 | eBook Technologies, Inc. | Electronic reference integration with an electronic reader |
US9569549B1 (en) | 2010-05-25 | 2017-02-14 | Amazon Technologies, Inc. | Location based recommendation and tagging of media content items |
US8843814B2 (en) * | 2010-05-26 | 2014-09-23 | Content Catalyst Limited | Automated report service tracking system and method |
US8769392B2 (en) * | 2010-05-26 | 2014-07-01 | Content Catalyst Limited | Searching and selecting content from multiple source documents having a plurality of native formats, indexing and aggregating the selected content into customized reports |
US11100098B2 (en) * | 2010-05-28 | 2021-08-24 | Oracle International Corporation | Systems and methods for providing multilingual support for data used with a business intelligence server |
EP2680162A1 (de) | 2010-07-13 | 2014-01-01 | Motionpoint Corporation | Lokalisierung eines Websiteinhalts |
US8862595B1 (en) | 2010-11-23 | 2014-10-14 | Google Inc. | Language selection for information retrieval |
US9128929B2 (en) | 2011-01-14 | 2015-09-08 | Sdl Language Technologies | Systems and methods for automatically estimating a translation time including preparation time in addition to the translation itself |
US8756120B2 (en) * | 2011-11-01 | 2014-06-17 | Sap Ag | Hybrid context-sensitive matching algorithm for retrieving product catalogue information |
US9098494B2 (en) * | 2012-05-10 | 2015-08-04 | Microsoft Technology Licensing, Llc | Building multi-language processes from existing single-language processes |
JP6110647B2 (ja) * | 2012-12-10 | 2017-04-05 | 株式会社アイ・オー・データ機器 | ストリーミングコンテンツ再生用セットトップボックスおよびテレビジョン |
US20140181691A1 (en) * | 2012-12-20 | 2014-06-26 | Rajesh Poornachandran | Sharing of selected content for data collection |
CN103905515A (zh) * | 2012-12-28 | 2014-07-02 | 北大方正集团有限公司 | 一种分布式互联网信息下载系统及方法 |
US10529013B2 (en) | 2013-07-01 | 2020-01-07 | Intuit Inc. | Identifying business type using public information |
US20150370891A1 (en) * | 2014-06-20 | 2015-12-24 | Sony Corporation | Method and system for retrieving content |
US20180082307A1 (en) * | 2016-09-19 | 2018-03-22 | Experian Health, Inc. | Selection of pre-arranged assistance from an electronic qualification transaction |
CN107688614B (zh) * | 2017-08-04 | 2018-08-10 | 平安科技(深圳)有限公司 | 意图获取方法、电子装置及计算机可读存储介质 |
US10635863B2 (en) | 2017-10-30 | 2020-04-28 | Sdl Inc. | Fragment recall and adaptive automated translation |
US10817676B2 (en) | 2017-12-27 | 2020-10-27 | Sdl Inc. | Intelligent routing services and systems |
US11256867B2 (en) | 2018-10-09 | 2022-02-22 | Sdl Inc. | Systems and methods of machine learning for digital assets and message creation |
CN111708911B (zh) * | 2020-06-17 | 2022-06-24 | 北京字节跳动网络技术有限公司 | 搜索方法、装置、电子设备及计算机可读存储介质 |
Family Cites Families (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS57152070A (en) * | 1981-03-13 | 1982-09-20 | Sharp Corp | Electronic interpreter |
JPH0743719B2 (ja) * | 1986-05-20 | 1995-05-15 | シャープ株式会社 | 機械翻訳装置 |
US4870610A (en) * | 1987-08-25 | 1989-09-26 | Bell Communications Research, Inc. | Method of operating a computer system to provide customed I/O information including language translation |
CA1280215C (en) * | 1987-09-28 | 1991-02-12 | Eddy Lee | Multilingual ordered data retrieval system |
JP2854000B2 (ja) * | 1988-01-20 | 1999-02-03 | エヌ・ティ・ティ・アドバンステクノロジ株式会社 | 検索装置 |
JPH0421180A (ja) * | 1990-05-16 | 1992-01-24 | Matsushita Graphic Commun Syst Inc | 文書検索装置 |
JPH04209068A (ja) * | 1990-12-05 | 1992-07-30 | Nippon Telegr & Teleph Corp <Ntt> | データベース検索方法 |
US5278980A (en) * | 1991-08-16 | 1994-01-11 | Xerox Corporation | Iterative technique for phrase query formation and an information retrieval system employing same |
JPH05165889A (ja) * | 1991-12-13 | 1993-07-02 | Canon Inc | 文書検索装置 |
US5523946A (en) * | 1992-02-11 | 1996-06-04 | Xerox Corporation | Compact encoding of multi-lingual translation dictionaries |
US5692176A (en) * | 1993-11-22 | 1997-11-25 | Reed Elsevier Inc. | Associative text search and retrieval system |
JPH07319917A (ja) * | 1994-05-24 | 1995-12-08 | Fuji Xerox Co Ltd | 文書データべース管理装置および文書データべースシステム |
US5995920A (en) * | 1994-12-22 | 1999-11-30 | Caterpillar Inc. | Computer-based method and system for monolingual document development |
US5987402A (en) * | 1995-01-31 | 1999-11-16 | Oki Electric Industry Co., Ltd. | System and method for efficiently retrieving and translating source documents in different languages, and other displaying the translated documents at a client device |
JP2780735B2 (ja) * | 1995-01-31 | 1998-07-30 | 沖電気工業株式会社 | 翻訳機能付文書検索表示システム |
US5708825A (en) * | 1995-05-26 | 1998-01-13 | Iconovex Corporation | Automatic summary page creation and hyperlink generation |
US6006221A (en) * | 1995-08-16 | 1999-12-21 | Syracuse University | Multilingual document retrieval system and method using semantic vector matching |
JPH0981569A (ja) * | 1995-09-12 | 1997-03-28 | Hitachi Ltd | 多カ国対応サービス提供システム |
JPH09114852A (ja) * | 1995-10-20 | 1997-05-02 | Matsushita Electric Ind Co Ltd | 情報検索装置 |
US5924089A (en) * | 1996-09-03 | 1999-07-13 | International Business Machines Corporation | Natural language translation of an SQL query |
US5956740A (en) * | 1996-10-23 | 1999-09-21 | Iti, Inc. | Document searching system for multilingual documents |
-
1996
- 1996-10-23 US US08/740,044 patent/US5956740A/en not_active Expired - Fee Related
-
1997
- 1997-10-22 EP EP97118378A patent/EP0838765B1/de not_active Expired - Lifetime
- 1997-10-22 DE DE69725258T patent/DE69725258T2/de not_active Expired - Fee Related
- 1997-10-23 JP JP29135797A patent/JP3666004B2/ja not_active Expired - Fee Related
-
1999
- 1999-09-21 US US09/399,769 patent/US6212537B1/en not_active Expired - Fee Related
-
2001
- 2001-02-21 US US09/788,679 patent/US6356894B2/en not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
DE69725258D1 (de) | 2003-11-06 |
US6212537B1 (en) | 2001-04-03 |
EP0838765A1 (de) | 1998-04-29 |
DE69725258T2 (de) | 2004-08-05 |
US6356894B2 (en) | 2002-03-12 |
US20010016860A1 (en) | 2001-08-23 |
US5956740A (en) | 1999-09-21 |
JPH10232883A (ja) | 1998-09-02 |
JP3666004B2 (ja) | 2005-06-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP0838765B1 (de) | System und Verfahren zur Wiederauffindung von Dokumenten in mehreren Sprachen | |
US9477656B1 (en) | Cross-lingual indexing and information retrieval | |
US6647364B1 (en) | Hypertext markup language document translating machine | |
US8359550B2 (en) | Method for dynamically generating a “table of contents” view of the HTML-based information system | |
CN1685341B (zh) | 跨语言搜索结果的闪烁注释标注加亮 | |
US6269189B1 (en) | Finding selected character strings in text and providing information relating to the selected character strings | |
US5797011A (en) | Method for controlling the translation of information on a display screen from a source language to a target language | |
US6094649A (en) | Keyword searches of structured databases | |
US6338059B1 (en) | Hyperlinked search interface for distributed database | |
US6470306B1 (en) | Automated translation of annotated text based on the determination of locations for inserting annotation tokens and linked ending, end-of-sentence or language tokens | |
US20060173829A1 (en) | Embedded translation-enhanced search | |
JP2008152760A (ja) | マシンアシスト翻訳ツール | |
JP2003030186A (ja) | 機械翻訳装置 | |
JP2002229981A (ja) | 文字列の正規化表示を生成するシステム | |
Pingali et al. | WebKhoj: Indian language IR from multiple character encodings | |
US20040098380A1 (en) | Method, system and apparatus for providing a search system | |
US6907562B1 (en) | Hypertext concordance | |
JP3352799B2 (ja) | 機械翻訳方法及び機械翻訳装置 | |
Fairon | GlossaNet: Parsing a web site as a corpus | |
Abdelali et al. | Cross-language information retrieval using ontology | |
WO1997048058A9 (en) | Automated translation of annotated text | |
GB2368167A (en) | Knowledge management software system | |
JP3328913B1 (ja) | 多言語文献検索システム | |
Richy | A hypertext electronic index based on the Grif structured document editor | |
JP2001101207A (ja) | 文書要約装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): DE FR GB |
|
AX | Request for extension of the european patent |
Free format text: AL;LT;LV;RO;SI |
|
17P | Request for examination filed |
Effective date: 19981029 |
|
AKX | Designation fees paid |
Free format text: DE FR GB |
|
RBV | Designated contracting states (corrected) |
Designated state(s): DE FR GB |
|
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: MAZDA MOTOR CORPORATION |
|
17Q | First examination report despatched |
Effective date: 20000724 |
|
GRAH | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOS IGRA |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: 7G 06F 17/30 B Ipc: 7G 06F 17/28 A |
|
RTI1 | Title (correction) |
Free format text: A DOCUMENT SEARCHING SYSTEM AND METHOD FOR MULTILINGUAL DOCUMENTS |
|
GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): DE FR GB |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D |
|
REF | Corresponds to: |
Ref document number: 69725258 Country of ref document: DE Date of ref document: 20031106 Kind code of ref document: P |
|
ET | Fr: translation filed | ||
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
26N | No opposition filed |
Effective date: 20040702 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: GB Payment date: 20061018 Year of fee payment: 10 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: DE Payment date: 20061019 Year of fee payment: 10 |
|
GBPC | Gb: european patent ceased through non-payment of renewal fee |
Effective date: 20071022 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: DE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20080501 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: ST Effective date: 20080630 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: FR Payment date: 20061010 Year of fee payment: 10 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GB Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20071022 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: FR Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20071031 |