US20010047255A1 - Translating apparatus, dictionary search apparatus, and translating method - Google Patents

Translating apparatus, dictionary search apparatus, and translating method Download PDF

Info

Publication number
US20010047255A1
US20010047255A1 US09/525,020 US52502000A US2001047255A1 US 20010047255 A1 US20010047255 A1 US 20010047255A1 US 52502000 A US52502000 A US 52502000A US 2001047255 A1 US2001047255 A1 US 2001047255A1
Authority
US
United States
Prior art keywords
context
translated
range
word
sentence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
US09/525,020
Other versions
US6418403B2 (en
Inventor
Masaru Fuji
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujitsu Ltd
Original Assignee
Fujitsu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujitsu Ltd filed Critical Fujitsu Ltd
Priority to US09/525,020 priority Critical patent/US6418403B2/en
Publication of US20010047255A1 publication Critical patent/US20010047255A1/en
Application granted granted Critical
Publication of US6418403B2 publication Critical patent/US6418403B2/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/242Dictionaries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/42Data-driven translation
    • G06F40/44Statistical methods, e.g. probability models
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/55Rule-based translation

Definitions

  • the present invention relates to a translating apparatus, a dictionary search apparatus, and a translating method, which are preferably applied for when translation is to be efficiently performed based on context in a machine translating process.
  • a mechanism and a method for performing a translation process using context information for a document to be translated have been developed. Additionally, a method or apparatus for learning context information, and a translating apparatus for performing a translation process using learned context information, have been developed.
  • An object of the present invention is to provide a translating apparatus, a dictionary search apparatus, and a translating method for translating a portion of a document efficiently and accurately.
  • a range to be translated and a context process range in original text are set separately. Therefore, even if only a portion of a document is specified as a range to be translated, it can be translated efficiently and accurately by referencing context; outside the range to be translated.
  • a range to be translated and a context process range in original text are separately set to perform a context process.
  • a translation process is performed by referencing context outside the range to be translated, thereby translating the portion of a document efficiently and accurately.
  • field estimation is made as a context process. Even if there is no word identifying a field in a range to be translated, the field can be identified according to context outside the range to be translated. As a result, a suitable translation process depending on a field can be performed.
  • document type estimation is made as a context process. Even if there is no expression which allows a document type to be identified in a range to be translated, a suitable translation process depending on a document type can be performed by identifying the document type according to context outside the range to be translated.
  • an abbreviation is estimated as a context process. Even if there is no complete form of the abbreviation in a range to be translated, it can be searched according to context outside the range to be translated. As a result, a translation process can be performed with completion of an abbreviation.
  • a reference destination is estimated as a context process. Even if a range to be translated includes no word pointed to by a pronoun included in this range, a translation process can be performed while replacing the pronoun depending on need, by searching the word pointed to by the pronoun according to context outside the range to be translated.
  • a sentence to be translated is identified according to a range to be translated. As a result, a translation process can be performed accurately.
  • a group of sentences to be translated is identified according to a range to be translated. As a result, a translation process can be performed accurately.
  • a sentence to be translated is identified by ignoring an inline tag. That is, a translation process can be accurately performed by removing a portion unrelated to translation.
  • a sentence to be translated is highlighted.
  • a translation process can be performed while recognizing the sentence to be translated.
  • a first word having a plurality of translated words, and one of second words closest to the location of the first word among second words representing the translated words of the first word, are highlighted.
  • a sentence to be translated is highlighted with a first method, and a context process range is highlighted with a second method.
  • a translation process can be performed while recognizing the context process range, by making a distinction between the sentence to be translated and the context process range.
  • a word at a reference destination and a word at a reference source are highlighted.
  • a translation process can be performed while recognizing a state of a context process.
  • a context process range is extended if a translated sentence corresponding to an original sentence cannot be identified. If context information which allows a translated sentence to be identified is not included in a range to be translated, the translated sentence can be identified according to context outside the range to be translated. As a result, a portion of a document can be translated efficiently and accurately.
  • a context process range is extended in sentences until a sentence to be translated corresponding to an original sentence can be determined.
  • a portion of a document can be translated efficiently and accurately by performing a context process such as estimating an abbreviation, estimating a reference destination, etc. efficiently.
  • a context process range is extended in paragraphs in correspondence with a range to be translated until a translated sentence corresponding to an original sentence can be determined.
  • a portion of a document can be translated efficiently and accurately by performing a context process such as estimating a field, estimating a document type, etc. efficiently.
  • a direction in which a context process range is extended is determined according to a type of a context process.
  • the context process is efficiently performed, thereby translating a portion of a document efficiently and accurately.
  • a context process range is extended in sentences in a forward direction until a translated sentence corresponding to an original sentence can be determined.
  • the context process such as estimating a reference destination is performed efficiently, thereby translating a portion of a document efficiently and accurately.
  • a context process range is set in correspondence with a range to be translated separated into paragraphs, if the range to be translated in original text straddles a plurality of paragraphs.
  • an inline tag is removed from a context process range. That is, by removing a portion unrelated to context, a context process can be performed efficiently.
  • a context process range is stopped from extending at a boundary of context.
  • a context process can be performed accurately and quickly by performing the context process in a context process range including context corresponding to a range to be translated.
  • context information is displayed in correspondence with a range to be translated in original text.
  • a translation process can be performed while recognizing a state of a context process in the range to be translated.
  • a plurality of pieces of context information are displayed as candidates if a translated sentence corresponding to an original sentence cannot be determined.
  • a translation process can be performed while recognizing a state of a context process being performed in a range to be translated.
  • context information includes a field name, document type, complete form of an abbreviation, or a word at a reference destination.
  • a context process of original text is performed based on a context process range which is set, and a translated word corresponding to an original word is determined based on text in the context process range.
  • a dictionary search is properly performed in consideration of context.
  • a context process range is extended if a translated word corresponding to an original word cannot be determined.
  • a dictionary search can be properly made by allowing a context process to be performed efficiently.
  • a context process range is extended in sentences until a translated word corresponding to an original word can be determined. That is, by allowing a context process such as estimating an abbreviation, etc., to be performed efficiently, a dictionary search can be made efficiently and accurately.
  • field estimation is made as a context process. If there are a plurality of translated words corresponding to an original word, the field of the original word is estimated with a context process in order to select a translated word corresponding to the estimated field. As a result, a dictionary search is performed accurately.
  • an abbreviated expression is estimated as a context process.
  • a dictionary search can be performed accurately.
  • a meaning of a word to be searched is determined according to context in a context process range.
  • a dictionary search is performed in consideration of context, thereby performing the dictionary search accurately.
  • context of original text outside a range to be translated is referenced when a portion of the original text is translated.
  • first and second translated words respectively belonging to first and second fields which correspond to a first original word in a range to be translated of original text
  • a second original word outside the range to be translated of the original text is referenced. If the second original word belongs to the first field, the first translated word is selected for the first original word.
  • the field of the first original word in the range to be translated is estimated based on context outside the range to be translated, thereby determining a translated word of the first original word based on the estimated field.
  • data in a conversion range is converted based on context in a context process range.
  • Data is converted in consideration of information other than the data in the conversion range, thereby converting the data accurately.
  • FIG. 1 is a block diagram showing the configuration of a data conversion apparatus according to an embodiment of the present invention
  • FIG. 2 is a block diagram showing the configuration of a translating apparatus according to the embodiment of the present invention.
  • FIG. 3 is a schematic diagram showing the configuration of a computer system of the translation apparatus according to the embodiment of the present invention.
  • FIG. 4 is a flowchart showing operations performed by a translating apparatus according to a first embodiment of the present invention
  • FIG. 5 shows an example of original text input to the translating apparatus according to the first embodiment of the present invention
  • FIG. 6 shows a correspondence between an original sentence (English) input to the translating apparatus according to the first embodiment of the present invention and a translated sentence (Japanese) output from the translating apparatus;
  • FIG. 7 shows a correspondence table between original words (English) and corresponding translated words (Japanese) provided by the translating apparatus according to the first embodiment of the present invention
  • FIG. 8 shows a correspondence table between an original sentence (English) input to the translating apparatus according to the first embodiment of the present invention and a translated sentence (French) output from the translating apparatus;
  • FIG. 9 shows a correspondence table between an original word (English) and corresponding translated words in respective fields (French), provided by the translating apparatus according to the first embodiment of the present invention
  • FIG. 10 shows a correspondence table between a document type and an expression method of translation, implemented by the translating apparatus according to a second embodiment of the present invention
  • FIG. 11 is a flowchart showing operations performed in a context process according to a third embodiment of the present invention.
  • FIG. 12 is a flowchart showing operations performed in a context process according to a fourth embodiment of the present invention.
  • FIG. 13 is a schematic diagram showing a method for specifying a translation portion of an original text input to the translation apparatus according to the first embodiment of the present invention
  • FIG. 14 shows a method for specifying a translation range of original text input to the translating apparatus according to the second embodiment of the present invention
  • FIG. 15 shows an example of original text into which inline tags are input to the translating apparatus according to the first embodiment of the present invention
  • FIG. 16 shows an example of original text into which inline tags are input to the translating apparatus according to the second embodiment of the present invention
  • FIGS. 17A to 17 C show a method for extending a context process range by the translating apparatus according to the second embodiment of the present invention
  • FIG. 18 shows a correspondence table between a method for extending a context process range and a context process type, implemented by the translating apparatus according to the third embodiment of the present invention
  • FIG. 19 is a flowchart showing the method for extending a context process range according to the third embodiment of the present invention.
  • FIG. 20 shows a state where context information is displayed for original text input to the translating apparatus according to the first embodiment of the present invention
  • FIG. 21 shows a state where context information is displayed for the original text input to the translating apparatus according to the second embodiment of the present invention
  • FIG. 22 shows a state where context information is displayed for the original text input to the translating apparatus according to the third embodiment of the present invention
  • FIG. 23 shows a state where context information is displayed for the original text input to the translating apparatus according to the fourth embodiment of the present invention.
  • FIG. 24 is a block diagram showing the configuration of a dictionary search apparatus according to an embodiment of the present invention.
  • FIG. 25 shows a method for specifying an original word input to the dictionary search apparatus according to the embodiment of the present invention
  • FIG. 26 is a flowchart showing operations performed by the dictionary search apparatus according to a first embodiment of the present invention.
  • FIG. 27 is a flowchart showing operations performed by the dictionary search apparatus according to a second embodiment of the present invention.
  • FIG. 1 is a block diagram showing the configuration of a data conversion apparatus according to an embodiment of the present invention.
  • the data conversion apparatus shown in this figure comprises a conversion range setting unit 100 for setting a conversion range of data, a context process range setting unit 101 for setting a context process range where a context process is performed, and a conversion processing unit 102 for converting the data in consideration of context in the context process range.
  • a conversion range setting unit 100 for setting a conversion range of data
  • a context process range setting unit 101 for setting a context process range where a context process is performed
  • a conversion processing unit 102 for converting the data in consideration of context in the context process range.
  • FIG. 2 is a simplified block diagram showing configuration of the translating apparatus according to the embodiment of the present invention.
  • This apparatus is intended to quickly perform a translation process by obtaining appropriate context from a neighborhood of a range to be translated depending on need, when a portion of a document is specified as the range to be translated.
  • a translation processing apparatus 9 comprises a user interface 8 , translation range identifying unit 4 , context processing unit 5 , translation processing unit 6 , and a context process range extending unit 7 .
  • the user interface 8 comprises an original text displaying unit 1 and an input unit 2 .
  • the original text displaying unit 1 is intended to display original text stored in a storing unit 3 for a translation operator.
  • the original text displaying unit 1 is implemented by, for example, a CRT display.
  • the input unit 2 is intended to accept a translation instruction given by a translation operator.
  • the translation operator instructs the input unit 2 of a range to be translated while watching the original text displaying unit 1 .
  • the input unit 2 is implemented by, for example, a mouse, a keyboard, etc.
  • the storing unit 3 is intended to store original text, which is implemented by, for example, a hard disk unit, optical disk unit, etc.
  • the translation range identifying unit 4 is intended to extract a portion to be translated in original text from the storing unit 3 , according to a translation instruction from the input unit 2 . Note that a system instructing a sentence to be translated using a sentence number, etc. does not always require the translation range identifying unit 4 .
  • the context processing unit 5 is intended to perform a context process in a context process range specified by the context process range extending unit 7 for a sentence to be translated identified by the translation range identifying unit 4 .
  • the translation processing unit 6 is intended to translate a sentence to be translated according to context obtained by the context processing unit 5 , and output a translated sentence.
  • the context process range extending unit 7 is intended to extend a context process range where a context is retrieved by reading an original text around a range to be translated from the storing unit 3 , if context information required by the context processing unit 5 cannot be obtained from a sentence to be translated which is identified by the translation range identifying unit 4 .
  • FIG. 3 shows an example of the translating apparatus implemented by a computer system according to the embodiment of the present invention.
  • a CPU 111 , a memory 112 , an input device 113 , a display 114 , and a disk device 115 are connected via a bus 110 .
  • the input unit; 2 is implemented by the input device 113
  • the original text displaying unit 1 is implemented by the display 114
  • the original text storing unit 3 is; implemented by the disk device 115 .
  • the translation range identifying unit 4 , context processing unit 5 , translation processing unit 6 and the context process range extending unit 7 are implemented by, for example, the CPU 111 and the memory 112 .
  • FIG. 4 is a flowchart showing the operations performed by the translating apparatus according to the embodiment of the present invention.
  • a translation operator inputs a translation instruction to an input unit 2 , and the input unit 2 accepts the translation instruction in step S 1 .
  • step 2 the translation range identifying unit 4 identifies a sentence to be translated in a document to be translated according to the translation instruction input to the input unit 2 .
  • step S 3 a context processing unit 5 performs a context process for the sentence to be translated identified by the translation range identifying unit 4 to extract context information.
  • step S 4 it is determined whether or not the context information extracted in step S 3 is sufficient for performing a translation process by the translation processing unit 6 . If the context information is insufficient for the translation process, the process goes to step S 5 .
  • a context process range extending unit 7 extends a context process range where the context process will be performed, beyond a sentence to be translated, and passes the original text in the extended context process range to a context processing unit 5 .
  • step S 4 If it is determined that the context information is sufficient for performing the translation process in step S 4 , the context processing unit 5 passes sufficient context information for performing the translation process as well as the document including the sentence to be translated identified by the translation range identifying unit 4 , to the translation processing unit 6 . Then, the process goes to step S 6 .
  • step S 6 the translation processing unit 6 translates the sentence to be translated included in the document to be translated, based on the context information passed from the context processing unit 5 .
  • an original text displaying unit 1 is assumed to display an original sentence numbered 31 “The talks were held in the Upper House.”, and an original sentence numbered 32 “This has forced the party to change its stance.”
  • a translation operator inputs the number of a desired sentence to be translated to an input unit 2 , and issues a translation instruction. If the translation operator specifies the sentence numbered 32 , the original sentence numbered 32 “This has forced the party to change its stance.” will be a translation target.
  • the context processing unit 5 performs a context process for the original sentence numbered 32 .
  • a dictionary is consulted for each original word included in the original sentence numbered 32 .
  • the translated word “ ” is selected for the original word “This”; the translated word “ ” is selected for the original word “forced”; the translated word “ ” or “ ” is selected for the original word “party”; the translated word “ ” is selected for the original word “change”; the translated word “ ” is selected for the original word “its”; and the translated word “ ” or “ ” is selected for the original word “stance”, as shown in FIG. 6.
  • the original word “has” is recognized as an auxiliary verb
  • the original word “the” is recognized as a definite article
  • the original word “to” is recognized as a function word.
  • FIG. 7 shows a correspondence table between the translated words of each of the original words “party” and “stance” and the corresponding fields.
  • the original sentence indicates the “general” field.
  • the word “ ” is selected as a translated word of the original word “stance”
  • the original sentence indicates a “sports” field. Note that the “general” field indicates a word available for all fields.
  • a context process range is extended in order to analyze context beyond the context process range of the original sentence to be translated.
  • the context processing unit 5 which receives the original sentence numbered 31 , consults a dictionary for each original word of this sentence. As a result, a compound “Upper House” is retrieved from the original sentence numbered 31 . Since the compound “Upper House” indicates the “political” field only, it is determined that the original sentence to be translated includes context related to the political field. Note that context information other than field information can be also extracted from the original sentence numbered 31 .
  • the translation processing unit 6 performs the translation process according to the context information, that is, the political field.
  • the word “ ” used in the political field is selected as a translated word of the original word “party”.
  • the translated word “ ” used in the sports field is deleted, and the translated word “ ” available for all fields is selected.
  • the original text displaying unit 1 is assumed to display the original sentence numbered 31 “The talks were held in the Upper House.”, and the original sentence numbered 32 “This has forced the party to change its stance.”.
  • a translation operator inputs a number of a sentence to be translated to the input unit 2 , and issues a translation instruction. Assuming that the translation operator specifies the sentence number 32 , the original sentence numbered 32 “This has forced the party to change its stance.” will be a translation target.
  • the context processing unit 5 then performs a context process for the original sentence numbered 32 .
  • a dictionary is consulted for each of the original words included in the original sentence numbered 32 .
  • a translated word “Ceci” is selected for the original word “This”; a translated word “a” is selected for the original word “has”; a translated word “force” is selected for the original word “forced”; a translated word “la” is selected for the original word “the”; a translated word “parti” or “groupe” is selected for the original word “party”; a translated word “a” is selected for the original word “to”; a translated word “changer” is selected for the original word “change”; a translated word “sa” is selected for the original word “its”; and a translated word “position” is selected for the original word “stance”, as shown in FIG. 8.
  • FIG. 9 shows a correspondence table between a translated word of the original word “party” and a field where the translated word is used.
  • a context process range is extended for analyzing the context beyond the range to be translated in the desired original text.
  • the context process range extending unit 7 extracts the original sentence numbered 31 “The talks were held in the Upper House.”, and passes the extracted sentence to the context processing unit 5 .
  • the context processing unit 5 which receives the original sentence numbered 31 , consults a dictionary for each original word of the sentence numbered 31 As a result, the compound “Upper House” is retrieved from the original sentence numbered 31 . Since the compound “Upper House” indicates the “political” field only, it is determined that the original sentence to be translated includes context related to the political field. Note that context information other than field information can be also extracted from the original sentence numbered 31 .
  • the translation processing unit 6 performs the translation process based on the context information, that is, the political field.
  • the word “parti” used in the political field is selected as a translated word of the original word “party”.
  • FIG. 10 shows a correspondence table between a field and a document type when English text is translated into Japanese text.
  • the original text is a political article
  • translation is performed in a “da/dearu” tone, and in a passive voice.
  • the original text is an operating manual
  • translation is performed in a “desu/masu” tone, and in an imperative form.
  • the imperative form and the passive voice are default expressions of translated text corresponding to an English expression such as “must be”, etc.
  • the field of the original text shown in FIG. 5 is the political field. Accordingly, the context processing unit 5 references the column of the political article shown in FIG. 10, and instructs the translation processing unit 6 to translate the article in the “da/dearu” tone and in a passive voice.
  • the translation processing unit 6 receives document type information as well as other context information from the context processing unit 5 , and performs the translation process.
  • a range to be translated includes no expression identifying a document type, it can be identified based on context outside the range to be translated by estimating a document type. As a result, a suitable translation process depending on a document type can be performed.
  • the original text displaying unit 1 is assumed to display the original text “In the last election, the Japan Socialist Party (JSP) enjoyed victory. The reason for JSP's victory was . . . ”.
  • a translation operator instructs the input unit 2 to translate the sentence beginning with “The reason . . . ” as a sentence to be translated.
  • the context processing unit 5 performs a context process for this sentence.
  • a dictionary is consulted for each of the words included in the sentence beginning with “The reason . . . ”.
  • the word “JSP” is recognized as an unregistered word.
  • the context process range extending unit 7 extracts the original sentence “In the last election, the Japan Socialist Party (JSP) enjoyed victory.”, and passes the extracted sentence to the context processing unit 5 .
  • the context processing unit 5 makes a comparison between the original sentence “In the last election, the Japan Socialist Party (JSP) enjoyed victory.” and the sentence to be translated including the unregistered word “JSP”.
  • the unregistered word “JSP” is determined as an abbreviation of the proper noun “Japan Socialist Party”.
  • the meaning of the abbreviation can be identified. Therefore, the result of the identification is passed to the translation processing unit 6 to perform a translation process.
  • the translation processing unit 6 then performs the translation process by using a translated word of the noun “Japan Socialist Party” as that of “JSP”.
  • the original sentence is translated as “ . . . ” if the context process is not performed.
  • the sentence is translated as “ . . . ” if the context process is performed. That is, a translated sentence that is easy to read can be obtained.
  • FIG. 11 is a flowchart showing a context process according to the third embodiment of the present invention.
  • the context processing unit 5 consults a dictionary for each word included in a sentence to be translated in step S 11 .
  • step S 12 it is determined whether or not an abbreviation is included in the sentence to be translated in step S 12 . If “YES”, the process goes to step S 13 .
  • step S 13 it is determined whether or not an expression which completes the abbreviation exists in a context process range. If “NO”, the process goes to step S 14 .
  • step S 14 the context process range extending unit 7 extends the context process range.
  • the context processing unit 5 then consults a dictionary for each word included in a sentence in the extended context process range in step S 15 . After that, the process goes back to step S 13 .
  • step S 13 If it is determined that the expression which completes the abbreviation exists in the context process range in step S 13 , the process goes to step S 16 .
  • step S 16 the abbreviation is completed in the sentence to be translated according to the expression which completes the abbreviation in the context process range.
  • step S 17 the translation processing unit 6 performs a translation process in consideration of the completed form of the abbreviation.
  • step S 12 if there is no abbreviation in the sentence to be translated in step S 12 , the process skips to step S 17 to perform a translation process without considering a completion of the abbreviation.
  • a translation operator instructs the input unit 2 to translate the sentence beginning with “The reason . . . ” as a sentence to be translated.
  • the context processing unit 5 performs a context process for this sentence.
  • a dictionary is consulted for each of the original words included in the sentence beginning with “The reason . . . ” .
  • what the pronoun “its” indicates is determined as being indefinite.
  • the context process range extending unit 7 extracts the original sentence “In the last election, the Japan Socialist Party enjoyed victory.”, and passes the extracted sentence to the context processing unit 5 .
  • the context processing unit 5 then performs a context process for the original sentence “In the last election, the Japan Socialist Party enjoyed victory.” As a result, it is determined that the pronoun “its;” included in the phrase “its victory” indicates the “Japan Socialist Party”. This is because the winner of the “victory” is the “Japan Socialist Party”.
  • a reference destination can be clarified according to context of a neighborhood.
  • the reference destination can be identified. Then, the result of the identification is passed to the translation processing unit 6 to perform a translation process.
  • the translation processing unit 6 performs the translation process using the context information indicating that the pronoun “its” points to “the Japan Socialist Party”, depending on need.
  • FIG. 12 is a flowchart showing a context process according to the fourth embodiment of the present invention.
  • the context processing unit 5 first consults a dictionary for each word in a sentence to be translated in step S 21 .
  • step S 22 it is determined whether or not there is a word having an ambiguous reference relationship in a range to be translated in step S 22 . If “YES”, the process goes to step S 23 .
  • step S 23 it is determined whether or not a context process range includes a word or a word sequence which could possibly be a reference destination. If “NO”, the process goes to step S 24 .
  • step S 24 the context process range extending unit 7 extends the context process range.
  • step S 25 After the context processing unit 5 consults a dictionary for each word in a sentence in the extended context process range in step S 25 , the process goes to step S 23 .
  • step S 23 If it is determined that the word or the word sequence could possibly be a reference destination in step S 23 , the process goes to step S 26 .
  • step S 26 the reference destination of the sentence in the extended context process range is identified.
  • step S 27 a process such as replacing the word having the ambiguous reference relationship in the range to be translated with a word at a reference destination, etc. is performed.
  • step S 28 the translation processing unit 6 performs a translation process in consideration of the word at the reference destination.
  • step S 22 if it Ls determined that the range to be translated includes no word having the ambiguous reference relationship, the process goes to step S 28 to perform the translation process without considering the word at the reference destination.
  • a range to be translated includes no word pointed to by a demonstrative pronoun included in that range, a word pointed to by the demonstrative pronoun can be retrieved from context outside the range to be translated, by estimating a reference destination. As a result, a translation process can be performed while replacing a demonstrative pronoun, depending on need.
  • a translation operator operates the input unit 2 to move the cursor 10 to an arbitrary position of a sentence to be translated for specification.
  • the translation range identifying unit 4 identifies the sentence to be translated. That is, this unit detects the position of the cursor 10 , and automatically determines which portion the translation operator intends to translate according to the position of the cursor 10 . Since translation is performed by sentence, the sentence pointed to by the cursor 10 is extracted.
  • the context processing unit 5 performs a context process for the original sentence “This has forced the party to change its stance.”, which is identified by the translation range identifying unit 4 . If context cannot be determined from this sentence only, the context process range extending unit 7 extends the context process range where the context process is performed, to the original sentence “The talks were held in the Upper House.”
  • the translation processing unit 6 then performs a translation process based on the context information obtained by the context processing unit 5 .
  • the translation range identifying unit 4 is intended to identify a range to be translated in original text where one sentence does not terminate in one line, according to shaded area 11 produced by dragging a mouse, etc.
  • a translation operator operates the input unit 2 , and produces a shaded area 11 in order to select a sentence to be translated.
  • the translation range identifying unit 4 identifies the sentence to be translated. That is, the translation range identifying unit 4 detects the range of the shaded area 11 , and automatically determines which portion the translation operator intends to translate according to the shaded area 11 .
  • the forward scanning finally reaches the last word “change” in the selected range of the shaded area 11 , without detecting an additional termination of a sentence other than the above described termination, as a result of continuing the forward scanning.
  • the range from the most recently detected termination to the last word “change” is determined as being incomplete, and the range “This has forced the party to change.” is removed from the range selected by the shaded area 11 .
  • the context processing unit 5 performs a context process for the original sentence “The talks were held in the Upper House.”, which is identified by the translation range identifying unit 4 . If context cannot be determined based on only this sentence, the context process range extending unit 7 extends a context process range where the context process is performed.
  • the translation processing unit 6 then performs a translation process based on context information obtained by the context processing unit 5 .
  • a sentence to be translated can be extracted by being suitably identified, thereby performing a translation process accurately.
  • a translation operator operates the input unit 2 , moves a cursor 41 to a character “i”, and issues a translation instruction.
  • the translation range identifying unit 4 then identifies a sentence to be translated. That is, this unit detects the position of the cursor 41 , and automatically determines which portion the translation operator intends to translate, according to the position of the cursor 41 . Since translation is performed by sentence, a sentence where the cursor 41 is positioned is extracted.
  • the context processing unit 5 performs a context process for the original sentence “This is a pen.”, which is identified by the translation range identifying unit 4 . If context cannot be determined according to only the original sentence “This is a pen.”, the context process range extending unit 7 extends a context process range where the context process is performed.
  • the translation processing unit 6 then performs a translation process based on the context information obtained by the context processing unit 5 .
  • a portion unrelated to translation can be removed by ignoring an inline tag, thereby a translation process can be performed accurately.
  • the translation operator operates the input unit 2 , moves a cursor 42 to a character “b”, and issues a translation instruction.
  • the translation range identifying unit 4 identifies a sentence to be translated.
  • the translation range identifying unit 4 detects the position of the cursor 42 , and automatically determines which portion the translation operator intends to translate, according to the position of the cursor 42 . Since translation is performed by sentence, the sentence where the cursor 42 is positioned is extracted.
  • the context processing unit 5 performs a context process for the original sentence “And that is a book.”, which is identified by the translation range identifying unit 4 . If context cannot be determined according to only the original sentence “And that is a book.”, the context process range extending unit 7 extends a context process range where the context process is performed.
  • the translation processing unit 6 then performs a translation process based on the context information obtained by the context processing unit 5 .
  • a portion unrelated to context can be removed by removing an inline tag from a context process range, thereby performing a context process accurately.
  • the context process range extending unit 7 further extends the context process range to sentences preceding or succeeding the sentence to be translated until the required context can be obtained.
  • the method for extending a context process range by sentence backwards and forwards is helpful for examining a correspondence where an interval between sentences is an important factor, etc.
  • a context process range may be extended to before and after the sentence to be translated in units of a plurality of sentences.
  • a context process such as estimating an abbreviation, estimating a reference destination, etc. can be performed efficiently, thereby translating a portion of a document efficiently and accurately.
  • the original text displaying unit 1 is assumed to display original text, and shaded area 20 is made in a sentence to be translated “The talks were held in the Upper House.” included in the original text.
  • the context processing unit 5 performs a context process for the sentence to be translated in which the shaded area 20 is made. If sufficient context information cannot be obtained from the shaded sentence as a result of the context process, the context process range extending unit 7 extends the context process range to an entire paragraph including the sentence to be translated.
  • the context process range extending unit 7 specifies the text contained in the shaded area 21 is made “Secondly, . . . The talks were held in the Upper House. This has forced the party to change its stance.”, as a context process range, as shown in FIG. 17B.
  • the context process range after extension is identified by detecting the first and the last sentences of a paragraph.
  • the context processing unit 5 extracts context information from the context process range contained in the shaded area 21 is made, as shown in FIG. 17B. If sufficient context information cannot be obtained from the context process range contained in the shaded area 21 is made, the context process range extending unit 7 extends the context process range to the entire text of the next paragraph.
  • the context process range extending unit 7 extends the context process range to the entire text of the preceding paragraph, as shown in FIG. 17C, and adds the sentence which begins with “Firstly,” to the context process range.
  • the text contained in the shaded area 22 is made “Firstly, . . . , secondly, . . . .
  • the talks were held in the Upper House. This has forced the party to change its stance.” is specified as the context process range.
  • the context processing unit 5 extracts context information from the context process range contained in the shaded area 22 , as shown in FIG. 17C. If sufficient context information is extracted from the context process range contained in the shaded area 22 , the context processing unit 5 passes the extracted context information to the translation processing unit 6 .
  • the translation processing unit 6 translates the sentence to be translated where the shaded area 20 is made, based on the context information passed from the context processing unit 5 .
  • a context process range may be extended before or after the sentence to be translated in units of a plurality of paragraphs.
  • a context process range is extended in paragraphs corresponding to a range to be translated until a translated sentence can be determined for an original sentence. Accordingly, a context process such as estimating a field, estimating a document type, etc. can be performed efficiently, thereby translating a portion of a document efficiently and accurately.
  • FIG. 18 shows a correspondence table between a type of context process and the method for extending a context process range. This table is pre-stored in the context process range extending unit 7 .
  • the context process range is extended in sentences before or after the sentence to be translated.
  • FIG. 19 is a flowchart showing the method for extending a context process range performed by the translating apparatus according to the third embodiment of the present invention.
  • the context processing unit 5 consults a dictionary for each word included in a sentence to be translated in step S 31 .
  • one type of context process is selected for the sentence to be translated, and the selected context process is performed in step S 32 .
  • step S 33 it is determined whether or not sufficient context information can be obtained from a range to be translated. If “NO”, the process goes to step S 34 .
  • step S 34 the correspondence table shown in FIG. 18 is referenced to obtain a method for extending a context process range according to the selected type of context process. If the type of context process is a field estimation or document type estimation, the context process range is extended in paragraphs before or after the sentence to be translated. If the type of context process is completion of an abbreviation, the context process range is extended in sentences before or after the sentence to be translated. If the type of context process is referring to a reference destination, the context process range is extended in sentences before the sentence to be translated.
  • step S 35 the context process range extending unit 7 extends the context process range based on the method for extending a context process range obtained in step S 34 .
  • the context processing unit 5 then performs a context process for text in the extended context process range in step S 36 .
  • step S 37 the context processing unit 5 determines whether or not sufficient context information can be obtained from the extended context process range. If “NO”, the process goes to step S 35 to repeatedly extend the context process range.
  • step S 38 If it is determined that sufficient context information can be obtained from the extended context process range, the process goes to step S 38 .
  • the context processing unit 5 determines whether or not all types of context processes have been performed in step S 38 . If “NO”, the process goes to step S 32 to perform a next type of context process, and completes the process after all types of context processes have been performed.
  • step S 33 If it is determined that sufficient context information can be obtained from the context process range in step S 33 , the process goes to step S 38 .
  • a context process can be efficiently performed by determining a direction in which a context process range is extended based on a type of context process, thereby translating a portion of a document efficiently and accurately.
  • a context process for estimating a reference destination can be efficiently performed by extending a context process range in sentences in a backward direction, thereby translating a portion of a document efficiently and accurately.
  • a context process can be performed in a context process range having a context corresponding to a range to be translated by terminating extension of the context process range at a boundary of the context, thereby performing the context process accurately and quickly.
  • the last sentence of the paragraph A can be translated based on only the context of the paragraph A, while the first sentence of the paragraph B can be translated based on only the context of the paragraph B. Accordingly, even if a topic suddenly changes at the change of paragraph from A to B, translation can be performed accurately.
  • a translation process can be performed for respective fields such as a baseball field, a weather field, and a transportation field, thereby performing the translation process accurately.
  • context corresponding to a range to be translated can be accurately extracted by dividing the range into paragraphs, even if a topic suddenly changes at a change to a next paragraph. As a result, a translation process can be performed accurately.
  • a translation process is then performed respectively for the last sentence of the paragraph A and the first sentence of the paragraph B that are determined to straddle the plurality of paragraphs.
  • the last sentence of the paragraph A can be translated independently from the first sentence of the paragraph B, and the first sentence of the paragraph B can be translated independently from the last sentence of the paragraph A. Therefore, even if a topic suddenly changes at the change of paragraph from A to B, translation can be performed accurately.
  • context corresponding to a context process range can be accurately extracted by setting a context process range for each divided range to be translated for each paragraph, when a topic changes at a change of paragraph.
  • a shaded area 43 is made for a sentence to be translated, which is identified by the translation range identifying unit 4 .
  • the context processing unit 5 performs a context process for the sentence to be translated, and detects a compound “Upper House” from the sentence to be translated in which the shaded area 43 is made. Since the compound “Upper House” is used in a political field only, it is determined that the sentence to be translated in which the shaded area is made, relates to the political field.
  • the original text displaying unit 1 displays context information 44 extracted by the context processing unit 5 from the sentence to be translated, at the side of the sentence to be translated. Since the sentence to be translated in which the shaded area 43 is made, for example, relates to the political field, a word “political” is displayed as context information 44 .
  • the translation operator can recognize context of a sentence to be translated, and can perform a translation process while checking if a suitable context process is performed.
  • a translation process can be performed while recognizing a state of a context process in a range to be translated, by displaying context information in correspondence with a range to be translated in original text.
  • a translation process can be performed while identifying a sentence to be translated, by highlighting the sentence to be translated.
  • the shaded area 45 is made for the original sentence “This has forced the party to change its stance.”, which is identified as a sentence to be translated by the translation range identifying unit 45 .
  • the context processing unit 5 performs a context process for the sentence to be translated in which the shaded area 45 is made, and the context processing unit 5 detects the original words “party” and “stance”, respectively having a plurality of translated words, from the sentence to be translated in which the shaded area 45 is made.
  • the original word “party” is used for a general field in addition to the political field, while the original word “stance” is used for the general field in addition to the sports field.
  • This original sentence contained in the shaded area 45 includes no word used in one particular field. Accordingly, the field of the sentence to be. translated in which the shaded area 45 is made, cannot be identified at this stage.
  • the original text displaying unit 1 displays the context information 46 yet to be determined, which is extracted by the context processing unit 5 from the sentence to be translated. For example, since the political and sports fields are candidates of the sentence to be translated in which the shaded area 45 is made, the words “political” and “sports” are displayed as the context information 46 .
  • a translation operator can recognize whether or not context of a sentence to be translated is determined. As a result, the translation operator can perform a translation process while checking to see if a suitable context process is performed.
  • a translation process can be performed while recognizing a state of a context process being performed in a range to be translated, by displaying a plurality of pieces of context information as candidates.
  • the context processing unit 5 performs a context process for a sentence to be translated, and determines that the original word “it” refers to the original word “party”.
  • the original text displaying unit 1 then displays the word at the reference destination in correspondence with the word at the reference source, based on the context information obtained by the context processing unit 5 . For example, areas 50 and 51 shaded in identical colors are provided for the original word “party” at the reference destination and the original word “it” at the reference source.
  • the translation operator can perform a translation process while checking to see if a process of reference relationship is properly performed.
  • a translation process can be performed while recognizing a state of a context process, by highlighting a word at a reference destination and a word at a reference source.
  • the context processing unit 5 performs a context process for a sentence to be translated, and locates a word “ ” used in a general field and a word “ ” used in the political field, as translated words of the original word “party”. Additionally, the compound “Upper House” used in the political field only, is located. Accordingly, it is determined that the sentence to be translated relates to the political field.
  • the word “ ” used in the political field is selected as a translated word of the original word “party”.
  • the original text displaying unit 1 displays a word having a plurality of translated words obtained by the context process based on the context information obtained by the context processing unit 5 , in correspondence with a word identifying a field. For example, areas 52 an 53 shaded in identical colors are made for the original word “party” having a plurality of translated words, and the compound “Upper House” used as a basis for selecting one of the plurality of translated words for the original word “party”.
  • a translation operator can perform a translation process while checking to see if a process of reference relationship is properly performed for a sentence to be translated.
  • the field of the context of the sentence including the original word “party” is determined as being “political”, by making a field estimation.
  • a translation process can be performed while recognizing a state of a context process, by highlighting a short substitute and its formal name.
  • a translation process may be performed while recognizing a field name, document type name, formal name or a word at a reference destination, by displaying the above described names or word.
  • context information can be concisely displayed by highlighting a word closest to the first word among second words identifying a translated word of the first word, thereby recognizing a state of a context process with ease.
  • a sentence to be translated and a context process range can be separately recognized by highlighting the sentence to be translated using a first method, and highlighting the context process range using a second method, thereby performing a translation process while recognizing the context process range.
  • FIG. 24 is a simplified block diagram showing configuration of the dictionary search apparatus according to the embodiment of the present invention. This apparatus is intended to properly perform a dictionary search process by obtaining appropriate context from a neighborhood of an original word for which a search operation is performed, depending on need.
  • the dictionary search apparatus comprises a searching unit 61 , context process range setting unit 62 , context processing unit 63 , and a determining unit 64 .
  • the searching unit 61 is intended to search a translated word for an original word, when specified.
  • the context process range setting unit 62 is intended to read original text around a specified original word, and set a context process range, if a plurality of translated words exist for the specified original word.
  • the context processing unit 63 is intended to perform a context process in a context process range set by the context process range setting unit 62 .
  • the determining unit 64 is intended to determine a translated word for an original word, based on context obtained by the context processing unit 63 .
  • a dictionary search can be performed in consideration of context by determining a translated word for an original word, thereby performing the dictionary search accurately.
  • the searching unit 61 then consults a dictionary for the original word “party”, and selects translated words “ ” or “ ” when an English-Japanese search is performed. Since there are a plurality of translated words for the original word “party” in this case, a suitable translated word cannot be determined.
  • a field is estimated to select the suitable translated word for the original word “party”. That is, if the original text relates to a general field, the word “ ” can be selected as a translated word of the original word “party”. Or, if the original text relates to the political field, the word “ ” is selected as the translated word of the original word “party”.
  • the context process range setting unit 62 sets the sentence “This has forced the party to change its stance.”, which includes the original word “party”, as a context process range, in order to identify a field of the original text.
  • the context processing unit 63 then performs a context process for the sentence “This has forced the party to change its stance.”, which is set by the context process range setting unit 62 . Since this sentence includes no word identifying a field in this case, the field of the original word “party” cannot be determined.
  • the context process range setting unit 62 extends the context process range, and sets the text “The talks were held in the Upper House. This has forced the party to change its stance.” as a context process range.
  • the context processing unit 63 then performs a context process for this text set by the context process range setting unit 62 . Since this text includes the compound “Upper House” used in the political field only, the field of the original word “party” can be determined to be the political field according to this compound.
  • the determining unit 64 therefore determines the word “ ” relating to the political field, as the translated word of the original word “party” shown in FIG. 9.
  • the searching unit 61 consults a dictionary for the original word “party”, and selects the translated words “parti” and “groupe”, when an English-French search is performed. Since there are a plurality of translated words for the original word “party” in this case, a suitable translated word cannot be determined.
  • the context process range setting unit 62 sets the sentence “This has forced the party to change its stance.”, which include the original word “party”, as a context process range, in order to identify the field of the original text.
  • the context processing unit 63 then performs a context process for the sentence “This has forced the party to change its stance.”, which is set by the context process range setting unit 62 . Since this sentence does not include a word identifying a field in this case, the field of the original word “party” cannot be determined.
  • the context process range setting unit 62 extends the context process range, and sets the text “The talks were held in the Upper House. This has forced the party to change its stance.” as a context process range.
  • the context processing unit 63 performs a context process for this text set by the context process range setting unit 62 . Since this text includes the compound “Upper House” used in the political field only in this case, the field of the original word “party” can be determined to be the political field, based on this compound.
  • the determining unit 64 therefore determines the word “parti” relating to the political field as the translated word of the original word “party”, and displays “parti” with a higher priority.
  • the word “parti” is displayed as a first candidate in the example shown in FIG. 9. If this word is displayed later, the order of the candidates “parti” and tribe” is changed in order to display “parti” first.
  • FIG. 26 is a flowchart showing operations performed by the dictionary search apparatus according to the first embodiment of the present invention.
  • a word to be searched is first identified according to a position of the cursor 70 in step S 41 .
  • step S 43 it is determined whether or not ambiguity of a field exists in the word to be searched. If “YES”, the process goes to step S 44 .
  • step S 44 a context process range is set for original text around the word to be searched, and a field estimation is made for the original text included in the set context process range.
  • step S 45 it is determined whether or not sufficient context information can be obtained from the context process range. If “NO”, the process goes to step S 46 .
  • step S 46 After the context process range is extended in step S 46 , the process goes back to step S 44 in order to repeatedly extend the context process range until sufficient context information can be obtained. If it is determined that sufficient context information can be obtained from the context process range in step S 45 , the process goes to step S 47 .
  • step S 47 a word definition matching an estimated field is moved to a first location.
  • step S 48 a result of the search is displayed.
  • step S 43 If it is determined that the word to be searched has no ambiguity of a field in step S 43 , the process goes to step S 48 to display the word definition of the word searched.
  • a translated word corresponding to an estimated field can be selected by estimating a field as a context process, if there are a plurality of translated words for an original word. As a result, a dictionary search can be performed accurately.
  • a translation operator who makes a search moves a cursor to, for example, the character “J” included in the word “JSP”, in order to specify this abbreviation as a word for which a search operation of a word definition is required.
  • JSP is recognized as the word for which a search operation of a word definition is required.
  • the searching unit 61 consults a dictionary for the word “JSP”, and determines this word as being unregistered. Since the word “JSP” is written in uppercase letters in this case, this word is determined as an abbreviation.
  • the context process range setting unit 62 sets the sentence “The reason for JSP's victory was ”, which includes the abbreviation “JSP”, as a context process range, in order to complete the abbreviation “JSP”.
  • the context processing unit 63 then performs a context process for the sentence “The reason for JSP's victory was . . . ”, which is set by the context process range setting unit 62 . Since this sentence includes no character string which completes the “JSP”, the abbreviation “JSP” cannot be completed.
  • the context process range setting unit 62 extends the context process range, and sets the original text “In the last election, “the Japan Socialist Party enjoyed victory. The reason for JSP's victory was . . . ”, as the context process range.
  • the context processing unit 63 performs a context process for this original text set by the context process range setting unit 62 . Since this original text includes the character string “Japan Socialist Party”, each of which begins with an uppercase letter “J”, “S”, and “P”, this character string is determined as the character string which completes the abbreviation “JSP”.
  • the searching unit 61 consults a dictionary for the character string “Japan Socialist Party”, and retrieves “ ” as the definition of this character string.
  • the determining unit 64 therefore determines “ ” as the definition of the word “JSP”.
  • FIG. 27 is a flowchart showing operations performed by the translating apparatus according to the second embodiment of the present invention.
  • a word to be searched is identified according to a position of a cursor in step S 51 .
  • step S 53 it is determined whether or not the word to be searched is an abbreviation. If “YES”, the process goes to step S 54 .
  • step S 54 a sentence including the word to be searched is set as a context process range.
  • step S 55 it is determined whether or not included in the context process range, and the abbreviation is completed.
  • step S 56 it is determined whether or not an expression completing the word to be searched exists in the context process range. If “NO”, the process goes to step S 57 .
  • step S 58 a dictionary is consulted for the completed word to be searched.
  • step S 59 a definition of the completed word to be searched is displayed.
  • step S 53 If the word to be searched is not determined as an abbreviation in step S 53 , the process goes to step S 59 to display a definition of the word to be searched.
  • a complete name of an abbreviation can be retrieved by performing a context process such as estimating an abbreviation, thereby performing a dictionary search operation efficiently.

Abstract

If a context process range extending unit cannot obtain context information required by a context processing unit, from a range to be translated, it extends the context process range. Then, the context processing unit performs a context process, and passes extracted context information to a translation processing unit in order to perform translation, based on the extended context process range.

Description

    FIELD OF THE INVENTION
  • The present invention relates to a translating apparatus, a dictionary search apparatus, and a translating method, which are preferably applied for when translation is to be efficiently performed based on context in a machine translating process. [0001]
  • DESCRIPTION OF THE RELATED ART
  • Conventional machine translation is performed in sentences due to the limitation of a processing capability. For example, if a text written in a foreign language is translated, a grammatical structure of the text to be translated and a knowledge of the words used are necessary. [0002]
  • However, an understandable text can often not be obtained if context information, that is, a meaning of one sentence to be translated in the entire text, is not properly interpreted, even if the grammatical structure of the text to be translated, and the knowledge of words, etc. used are known. [0003]
  • If it cannot be determined whether the English word “car” means either an “automobile” or a “railroad car” from the information included in a sentence, a sentence preceding or succeeding the sentence in question, or the context of the entire text must be referenced in order to determine the meaning of the one word. [0004]
  • Actually, not only the simple selection of a translated word, but also an interpretation of a complex meaning is required to be made based on the context. [0005]
  • In recent years, a short sentence has been able to be analyzed with high accuracy, thereby performing translation with higher accuracy according to a context process. [0006]
  • This context process enables the following processes to be performed, which could not be realized by using only a sentence. [0007]
  • Selecting a translated word by estimating a field of a document to be translated [0008]
  • Selecting a suitable expression method by estimating a type of a document to be translated [0009]
  • Removing ambiguity existing in a reference relationship [0010]
  • Completing an abbreviation [0011]
  • A mechanism and a method for performing a translation process using context information for a document to be translated, have been developed. Additionally, a method or apparatus for learning context information, and a translating apparatus for performing a translation process using learned context information, have been developed. [0012]
  • However, the translation process using a conventional context process is performed only when an entire document is specified as a translation target. [0013]
  • That is, if a portion of a document is specified as a translation target, the context process is not performed. [0014]
  • Therefore, when a portion of a document such as one sentence, one paragraph, etc. is translated, the entire document must be specified as a translation target in order to obtain accurate translation using the context process. As a result, it takes a large amount of time to perform the process. [0015]
  • SUMMARY OF THE INVENTION
  • An object of the present invention is to provide a translating apparatus, a dictionary search apparatus, and a translating method for translating a portion of a document efficiently and accurately. [0016]
  • According to a feature of the present invention, a range to be translated and a context process range in original text are set separately. Therefore, even if only a portion of a document is specified as a range to be translated, it can be translated efficiently and accurately by referencing context; outside the range to be translated. [0017]
  • According to an additional feature of the present invention, a range to be translated and a context process range in original text are separately set to perform a context process. As a result, a translation process is performed by referencing context outside the range to be translated, thereby translating the portion of a document efficiently and accurately. [0018]
  • According to a further feature of the present invention, field estimation is made as a context process. Even if there is no word identifying a field in a range to be translated, the field can be identified according to context outside the range to be translated. As a result, a suitable translation process depending on a field can be performed. [0019]
  • According to a still further feature of the present invention, document type estimation is made as a context process. Even if there is no expression which allows a document type to be identified in a range to be translated, a suitable translation process depending on a document type can be performed by identifying the document type according to context outside the range to be translated. [0020]
  • According to a still further feature of the present invention, an abbreviation is estimated as a context process. Even if there is no complete form of the abbreviation in a range to be translated, it can be searched according to context outside the range to be translated. As a result, a translation process can be performed with completion of an abbreviation. [0021]
  • According to a still further feature of the present invention, a reference destination is estimated as a context process. Even if a range to be translated includes no word pointed to by a pronoun included in this range, a translation process can be performed while replacing the pronoun depending on need, by searching the word pointed to by the pronoun according to context outside the range to be translated. [0022]
  • According to a still further feature of the present invention, a sentence to be translated is identified according to a range to be translated. As a result, a translation process can be performed accurately. [0023]
  • According to a still further feature of the present invention, a group of sentences to be translated is identified according to a range to be translated. As a result, a translation process can be performed accurately. [0024]
  • According to a still further feature of the present invention, a sentence to be translated is identified by ignoring an inline tag. That is, a translation process can be accurately performed by removing a portion unrelated to translation. [0025]
  • According to a still further feature of the present invention, a sentence to be translated is highlighted. As a result, a translation process can be performed while recognizing the sentence to be translated. [0026]
  • According to a still further feature of the present invention, a first word having a plurality of translated words, and one of second words closest to the location of the first word among second words representing the translated words of the first word, are highlighted. As a result, a state of a context process can be recognized with ease by displaying context information simply. [0027]
  • According to a still further feature of the present invention, a sentence to be translated is highlighted with a first method, and a context process range is highlighted with a second method. As a result, a translation process can be performed while recognizing the context process range, by making a distinction between the sentence to be translated and the context process range. [0028]
  • According to a still further feature of the present invention, a word at a reference destination and a word at a reference source are highlighted. As a result, a translation process can be performed while recognizing a state of a context process. [0029]
  • According to a still further feature of the present invention, an abbreviation and its complete form are highlighted. As a result, a translation process can be performed while recognizing a state of a context process. [0030]
  • According to a still further feature of the present invention, a context process range is extended if a translated sentence corresponding to an original sentence cannot be identified. If context information which allows a translated sentence to be identified is not included in a range to be translated, the translated sentence can be identified according to context outside the range to be translated. As a result, a portion of a document can be translated efficiently and accurately. [0031]
  • According to a still further feature of the present invention, a context process range is extended in sentences until a sentence to be translated corresponding to an original sentence can be determined. As a result, a portion of a document can be translated efficiently and accurately by performing a context process such as estimating an abbreviation, estimating a reference destination, etc. efficiently. [0032]
  • According to a still further feature of the present invention, a context process range is extended in paragraphs in correspondence with a range to be translated until a translated sentence corresponding to an original sentence can be determined. As a result, a portion of a document can be translated efficiently and accurately by performing a context process such as estimating a field, estimating a document type, etc. efficiently. [0033]
  • According to a still further feature of the present invention, a direction in which a context process range is extended is determined according to a type of a context process. As a result, the context process is efficiently performed, thereby translating a portion of a document efficiently and accurately. [0034]
  • According to a still further feature of the present invention, a context process range is extended in sentences in a forward direction until a translated sentence corresponding to an original sentence can be determined. The context process such as estimating a reference destination is performed efficiently, thereby translating a portion of a document efficiently and accurately. [0035]
  • According to a still further feature of the present invention, a context process range is set in correspondence with a range to be translated separated into paragraphs, if the range to be translated in original text straddles a plurality of paragraphs. As a result, even if a topic changes at the beginning of a next paragraph, a context process can be performed accurately by extracting context corresponding to the range to be translated correctly, thereby performing a context process correctly. [0036]
  • According to a still further feature of the present invention, an inline tag is removed from a context process range. That is, by removing a portion unrelated to context, a context process can be performed efficiently. [0037]
  • According to a still further feature of the present invention, a context process range is stopped from extending at a boundary of context. As a result, a context process can be performed accurately and quickly by performing the context process in a context process range including context corresponding to a range to be translated. [0038]
  • According to a still further feature of the present invention, context information is displayed in correspondence with a range to be translated in original text. As a result, a translation process can be performed while recognizing a state of a context process in the range to be translated. [0039]
  • According to a still further feature of the present invention, a plurality of pieces of context information are displayed as candidates if a translated sentence corresponding to an original sentence cannot be determined. As a result, a translation process can be performed while recognizing a state of a context process being performed in a range to be translated. [0040]
  • According to a still further feature of the present invention, context information includes a field name, document type, complete form of an abbreviation, or a word at a reference destination. As a result, a translation process can be performed while recognizing the field name, document type, complete form, or the reference destination. [0041]
  • According to a still further feature of the present invention, if a range to be translated in original text straddles a plurality of paragraphs, it is divided into a plurality of paragraphs to perform a context process. As a result, even if a topic changes at the beginning of a next paragraph, context corresponding to a range to be translated can be extracted correctly. Therefore, a translation process can be performed accurately. [0042]
  • According to a still further feature of the present invention, a context process of original text is performed based on a context process range which is set, and a translated word corresponding to an original word is determined based on text in the context process range. As a result, a dictionary search is properly performed in consideration of context. [0043]
  • According to a still further feature of the present invention, a context process range is extended if a translated word corresponding to an original word cannot be determined. As a result, a dictionary search can be properly made by allowing a context process to be performed efficiently. [0044]
  • According to a still further feature of the present invention, a context process range is extended in sentences until a translated word corresponding to an original word can be determined. That is, by allowing a context process such as estimating an abbreviation, etc., to be performed efficiently, a dictionary search can be made efficiently and accurately. [0045]
  • According to a still further feature of the present invention, field estimation is made as a context process. If there are a plurality of translated words corresponding to an original word, the field of the original word is estimated with a context process in order to select a translated word corresponding to the estimated field. As a result, a dictionary search is performed accurately. [0046]
  • According to a still further feature of the present invention, an abbreviated expression is estimated as a context process. By allowing a complete form of an abbreviation to be searched, a dictionary search can be performed accurately. [0047]
  • According to a still further feature of the present invention, a meaning of a word to be searched is determined according to context in a context process range. As a result, a dictionary search is performed in consideration of context, thereby performing the dictionary search accurately. [0048]
  • According to a still further feature of the present invention, context of original text outside a range to be translated is referenced when a portion of the original text is translated. As a result, even if the portion of the original text is specified as the range to be translated, the context outside the range to be translated is referenced. Therefore, the portion of the document can be translated efficiently and accurately. [0049]
  • According to a still further feature of the present invention, if there are first and second translated words respectively belonging to first and second fields, which correspond to a first original word in a range to be translated of original text, a second original word outside the range to be translated of the original text is referenced. If the second original word belongs to the first field, the first translated word is selected for the first original word. As a result, the field of the first original word in the range to be translated is estimated based on context outside the range to be translated, thereby determining a translated word of the first original word based on the estimated field. [0050]
  • According to a still further feature of the present invention, data in a conversion range is converted based on context in a context process range. Data is converted in consideration of information other than the data in the conversion range, thereby converting the data accurately.[0051]
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 is a block diagram showing the configuration of a data conversion apparatus according to an embodiment of the present invention; [0052]
  • FIG. 2 is a block diagram showing the configuration of a translating apparatus according to the embodiment of the present invention; [0053]
  • FIG. 3 is a schematic diagram showing the configuration of a computer system of the translation apparatus according to the embodiment of the present invention; [0054]
  • FIG. 4 is a flowchart showing operations performed by a translating apparatus according to a first embodiment of the present invention; [0055]
  • FIG. 5 shows an example of original text input to the translating apparatus according to the first embodiment of the present invention; [0056]
  • FIG. 6 shows a correspondence between an original sentence (English) input to the translating apparatus according to the first embodiment of the present invention and a translated sentence (Japanese) output from the translating apparatus; [0057]
  • FIG. 7 shows a correspondence table between original words (English) and corresponding translated words (Japanese) provided by the translating apparatus according to the first embodiment of the present invention; [0058]
  • FIG. 8 shows a correspondence table between an original sentence (English) input to the translating apparatus according to the first embodiment of the present invention and a translated sentence (French) output from the translating apparatus; [0059]
  • FIG. 9 shows a correspondence table between an original word (English) and corresponding translated words in respective fields (French), provided by the translating apparatus according to the first embodiment of the present invention; [0060]
  • FIG. 10 shows a correspondence table between a document type and an expression method of translation, implemented by the translating apparatus according to a second embodiment of the present invention; [0061]
  • FIG. 11 is a flowchart showing operations performed in a context process according to a third embodiment of the present invention; [0062]
  • FIG. 12 is a flowchart showing operations performed in a context process according to a fourth embodiment of the present invention; [0063]
  • FIG. 13 is a schematic diagram showing a method for specifying a translation portion of an original text input to the translation apparatus according to the first embodiment of the present invention; [0064]
  • FIG. 14 shows a method for specifying a translation range of original text input to the translating apparatus according to the second embodiment of the present invention; [0065]
  • FIG. 15 shows an example of original text into which inline tags are input to the translating apparatus according to the first embodiment of the present invention; [0066]
  • FIG. 16 shows an example of original text into which inline tags are input to the translating apparatus according to the second embodiment of the present invention; [0067]
  • FIGS. 17A to [0068] 17C show a method for extending a context process range by the translating apparatus according to the second embodiment of the present invention;
  • FIG. 18 shows a correspondence table between a method for extending a context process range and a context process type, implemented by the translating apparatus according to the third embodiment of the present invention; [0069]
  • FIG. 19 is a flowchart showing the method for extending a context process range according to the third embodiment of the present invention; [0070]
  • FIG. 20 shows a state where context information is displayed for original text input to the translating apparatus according to the first embodiment of the present invention; [0071]
  • FIG. 21 shows a state where context information is displayed for the original text input to the translating apparatus according to the second embodiment of the present invention; [0072]
  • FIG. 22 shows a state where context information is displayed for the original text input to the translating apparatus according to the third embodiment of the present invention; [0073]
  • FIG. 23 shows a state where context information is displayed for the original text input to the translating apparatus according to the fourth embodiment of the present invention; [0074]
  • FIG. 24 is a block diagram showing the configuration of a dictionary search apparatus according to an embodiment of the present invention; [0075]
  • FIG. 25 shows a method for specifying an original word input to the dictionary search apparatus according to the embodiment of the present invention; [0076]
  • FIG. 26 is a flowchart showing operations performed by the dictionary search apparatus according to a first embodiment of the present invention; and [0077]
  • FIG. 27 is a flowchart showing operations performed by the dictionary search apparatus according to a second embodiment of the present invention.[0078]
  • DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
  • FIG. 1 is a block diagram showing the configuration of a data conversion apparatus according to an embodiment of the present invention. The data conversion apparatus shown in this figure comprises a conversion [0079] range setting unit 100 for setting a conversion range of data, a context process range setting unit 101 for setting a context process range where a context process is performed, and a conversion processing unit 102 for converting the data in consideration of context in the context process range. As a result, data can be converted in consideration of context information obtained from other than data in a conversion range, thereby converting the data accurately.
  • Provided next is the explanation about a translating apparatus according to an embodiment of the present invention. FIG. 2 is a simplified block diagram showing configuration of the translating apparatus according to the embodiment of the present invention. This apparatus is intended to quickly perform a translation process by obtaining appropriate context from a neighborhood of a range to be translated depending on need, when a portion of a document is specified as the range to be translated. [0080]
  • In FIG. 2, a translation processing apparatus [0081] 9 comprises a user interface 8, translation range identifying unit 4, context processing unit 5, translation processing unit 6, and a context process range extending unit 7. The user interface 8 comprises an original text displaying unit 1 and an input unit 2.
  • The original [0082] text displaying unit 1 is intended to display original text stored in a storing unit 3 for a translation operator. The original text displaying unit 1 is implemented by, for example, a CRT display.
  • The [0083] input unit 2 is intended to accept a translation instruction given by a translation operator. The translation operator instructs the input unit 2 of a range to be translated while watching the original text displaying unit 1. The input unit 2 is implemented by, for example, a mouse, a keyboard, etc.
  • The [0084] storing unit 3 is intended to store original text, which is implemented by, for example, a hard disk unit, optical disk unit, etc.
  • The translation range identifying unit [0085] 4 is intended to extract a portion to be translated in original text from the storing unit 3, according to a translation instruction from the input unit 2. Note that a system instructing a sentence to be translated using a sentence number, etc. does not always require the translation range identifying unit 4.
  • The [0086] context processing unit 5 is intended to perform a context process in a context process range specified by the context process range extending unit 7 for a sentence to be translated identified by the translation range identifying unit 4.
  • The [0087] translation processing unit 6 is intended to translate a sentence to be translated according to context obtained by the context processing unit 5, and output a translated sentence.
  • The context process range extending unit [0088] 7 is intended to extend a context process range where a context is retrieved by reading an original text around a range to be translated from the storing unit 3, if context information required by the context processing unit 5 cannot be obtained from a sentence to be translated which is identified by the translation range identifying unit 4.
  • Provided next is the explanation about a computer system implemented as the translating apparatus according to the embodiment of the present invention. FIG. 3 shows an example of the translating apparatus implemented by a computer system according to the embodiment of the present invention. [0089]
  • In FIG. 3, a [0090] CPU 111, a memory 112, an input device 113, a display 114, and a disk device 115 are connected via a bus 110. For example, the input unit; 2 is implemented by the input device 113, the original text displaying unit 1 is implemented by the display 114, and the original text storing unit 3 is; implemented by the disk device 115. The translation range identifying unit 4, context processing unit 5, translation processing unit 6 and the context process range extending unit 7 are implemented by, for example, the CPU 111 and the memory 112.
  • Provided next is the explanation about operations performed by the translating apparatus according to the embodiment of the present invention, by referring to the drawings. FIG. 4 is a flowchart showing the operations performed by the translating apparatus according to the embodiment of the present invention. [0091]
  • In FIG. 4, a translation operator inputs a translation instruction to an [0092] input unit 2, and the input unit 2 accepts the translation instruction in step S1.
  • In [0093] step 2, the translation range identifying unit 4 identifies a sentence to be translated in a document to be translated according to the translation instruction input to the input unit 2.
  • In step S[0094] 3, a context processing unit 5 performs a context process for the sentence to be translated identified by the translation range identifying unit 4 to extract context information.
  • In step S[0095] 4, it is determined whether or not the context information extracted in step S3 is sufficient for performing a translation process by the translation processing unit 6. If the context information is insufficient for the translation process, the process goes to step S5.
  • Then, in step S[0096] 5, a context process range extending unit 7 extends a context process range where the context process will be performed, beyond a sentence to be translated, and passes the original text in the extended context process range to a context processing unit 5.
  • If it is determined that the context information is sufficient for performing the translation process in step S[0097] 4, the context processing unit 5 passes sufficient context information for performing the translation process as well as the document including the sentence to be translated identified by the translation range identifying unit 4, to the translation processing unit 6. Then, the process goes to step S6.
  • In step S[0098] 6, the translation processing unit 6 translates the sentence to be translated included in the document to be translated, based on the context information passed from the context processing unit 5.
  • As described above, by separately setting a range to be translated and a context process range in original text, context outside the range to be translated can be referenced even if only a portion of a document is specified as a range to be translated. As a result, the portion of the document can be translated efficiently and accurately. [0099]
  • Provided next is the explanation about a context process according to the first embodiment of the present invention, by referring to the drawings. The context process according to the first embodiment of the present invention is intended to estimate a field. Note that the following explanation is based on the assumption that a document to be translated includes sentences, each of which terminates in one line and is numbered. [0100]
  • Provided first is the explanation about an example of translating English text into the equivalent Japanese. [0101]
  • As shown in FIG. 5, an original [0102] text displaying unit 1 is assumed to display an original sentence numbered 31 “The talks were held in the Upper House.”, and an original sentence numbered 32 “This has forced the party to change its stance.”
  • A translation operator inputs the number of a desired sentence to be translated to an [0103] input unit 2, and issues a translation instruction. If the translation operator specifies the sentence numbered 32, the original sentence numbered 32 “This has forced the party to change its stance.” will be a translation target.
  • Then, the [0104] context processing unit 5 performs a context process for the original sentence numbered 32. In the context process, a dictionary is consulted for each original word included in the original sentence numbered 32. As a result of consulting the dictionary, the translated word “
    Figure US20010047255A1-20011129-P00001
    ” is selected for the original word “This”; the translated word “
    Figure US20010047255A1-20011129-P00002
    ” is selected for the original word “forced”; the translated word “
    Figure US20010047255A1-20011129-P00003
    ” or “
    Figure US20010047255A1-20011129-P00004
    ” is selected for the original word “party”; the translated word “
    Figure US20010047255A1-20011129-P00005
    ” is selected for the original word “change”; the translated word “
    Figure US20010047255A1-20011129-P00006
    ” is selected for the original word “its”; and the translated word “
    Figure US20010047255A1-20011129-P00007
    ” or “
    Figure US20010047255A1-20011129-P00008
    ” is selected for the original word “stance”, as shown in FIG. 6.
  • Furthermore, the original word “has” is recognized as an auxiliary verb, the original word “the” is recognized as a definite article, and the original word “to” is recognized as a function word. [0105]
  • Since there are a plurality of candidates for the original words “party” and “stance”, a suitable translated word must be selected using a context process. In the meantime, there is only one translated word corresponding to each of the original words “This”, “forced”, “change”, and “its”. Accordingly, a Japanese equivalent for each of the original words is determined. [0106]
  • Then, field estimation is made in order to select a suitable translated word for each of the original words “party” and “stance”. [0107]
  • FIG. 7 shows a correspondence table between the translated words of each of the original words “party” and “stance” and the corresponding fields. [0108]
  • In FIG. 7, if the Japanese word “[0109]
    Figure US20010047255A1-20011129-P00009
    ” is selected as a translated word of the original word “party”, the original sentence indicates a “general” field. If the translated word “
    Figure US20010047255A1-20011129-P00010
    ” is selected as a translated word of the original word “party”, the original sentence indicates a “political” field.
  • Furthermore, if the word “[0110]
    Figure US20010047255A1-20011129-P00011
    ” is selected as a translated word of the original word “stance”, the original sentence indicates the “general” field. Or, if the word “
    Figure US20010047255A1-20011129-P00012
    ” is selected as a translated word of the original word “stance”, the original sentence indicates a “sports” field. Note that the “general” field indicates a word available for all fields.
  • Accordingly, since the “political” and “sports” fields are candidates for the fields of the original words “party” and “stance” for which a translated word must be selected, a corresponding field cannot be determined uniquely. [0111]
  • As a result, if field estimation is made within a range of the original sentence numbered [0112] 32, this range includes no translated word indicating the political field or the sports field only. Therefore, a suitable translated word cannot be determined uniquely for each of the original words “party” and “stance”.
  • If context cannot be determined in a range to be translated included in the original text, a context process range is extended in order to analyze context beyond the context process range of the original sentence to be translated. [0113]
  • Assume that a sentence preceding the sentence to be translated shown in FIG. 5 is added to the context: process range. That is, the context process range extending unit [0114] 7 extracts the original sentence numbered 31 “The talks were held in the Upper House.”, and passes the extracted sentence to the context processing unit 5.
  • The [0115] context processing unit 5 which receives the original sentence numbered 31, consults a dictionary for each original word of this sentence. As a result, a compound “Upper House” is retrieved from the original sentence numbered 31. Since the compound “Upper House” indicates the “political” field only, it is determined that the original sentence to be translated includes context related to the political field. Note that context information other than field information can be also extracted from the original sentence numbered 31.
  • As a result of the context process in the extended context process range, sufficient context related to the original sentence to be translated can be obtained. This context information is therefore passed to the [0116] translation processing unit 6 to perform the translation process.
  • Then, the [0117] translation processing unit 6 performs the translation process according to the context information, that is, the political field. The word “
    Figure US20010047255A1-20011129-P00013
    ” used in the political field is selected as a translated word of the original word “party”. In addition, the translated word “
    Figure US20010047255A1-20011129-P00014
    ” used in the sports field is deleted, and the translated word “
    Figure US20010047255A1-20011129-P00015
    ” available for all fields is selected.
  • As a result, the translated sentence “[0118]
    Figure US20010047255A1-20011129-P00018
    Figure US20010047255A1-20011129-P00400
    Figure US20010047255A1-20011129-P00401
    ” can be obtained for the original sentence numbered 32 “This has forced the party to change its stance.” Provided next is the explanation about an example of translating English text into the equivalent French.
  • As shown in FIG. 5, the original [0119] text displaying unit 1 is assumed to display the original sentence numbered 31 “The talks were held in the Upper House.”, and the original sentence numbered 32 “This has forced the party to change its stance.”.
  • A translation operator inputs a number of a sentence to be translated to the [0120] input unit 2, and issues a translation instruction. Assuming that the translation operator specifies the sentence number 32, the original sentence numbered 32 “This has forced the party to change its stance.” will be a translation target.
  • The [0121] context processing unit 5 then performs a context process for the original sentence numbered 32. With the context process, a dictionary is consulted for each of the original words included in the original sentence numbered 32. As a result of consulting the dictionary: a translated word “Ceci” is selected for the original word “This”; a translated word “a” is selected for the original word “has”; a translated word “force” is selected for the original word “forced”; a translated word “la” is selected for the original word “the”; a translated word “parti” or “groupe” is selected for the original word “party”; a translated word “a” is selected for the original word “to”; a translated word “changer” is selected for the original word “change”; a translated word “sa” is selected for the original word “its”; and a translated word “position” is selected for the original word “stance”, as shown in FIG. 8.
  • Since there are a plurality of candidates for the original word “party”, a suitable translated word must be selected with the context process. In the meantime, there is only one equivalent for the original words “This”, “has”, “forced”, “the”, “to”, “change”, “its”, and “stance”. Accordingly, a translated word for each of the original words is determined uniquely. [0122]
  • Then, field estimation is made for selecting a suitable translated word for the original word “party”. [0123]
  • FIG. 9 shows a correspondence table between a translated word of the original word “party” and a field where the translated word is used. [0124]
  • In FIG. 9, if a word “parti” is selected as a translated word of the original word “party”, the original sentence indicates the “political” field. Or, if the word “groupe” is selected as the translated word of the original word “party”, the original sentence indicates the “general” field. The “general” field indicates a word available for all fields. [0125]
  • That is, the “political” and “general” fields will be candidates for the field of the original word “party” for which a translated word must be selected. Therefore, a corresponding field cannot be determined uniquely. [0126]
  • Therefore, even if the field estimation is made within a range to be translated in the original sentence numbered [0127] 32, this range includes no translated word indicating the political field only. As a result, a suitable translated word cannot be uniquely determined for the original word “party”.
  • If context cannot be determined within a range to be translated in desired original text, a context process range is extended for analyzing the context beyond the range to be translated in the desired original text. [0128]
  • Assume that a sentence preceding the sentence to be translated is added to the context process range in FIG. 5. That is, the context process range extending unit [0129] 7 extracts the original sentence numbered 31 “The talks were held in the Upper House.”, and passes the extracted sentence to the context processing unit 5.
  • The [0130] context processing unit 5 which receives the original sentence numbered 31, consults a dictionary for each original word of the sentence numbered 31 As a result, the compound “Upper House” is retrieved from the original sentence numbered 31. Since the compound “Upper House” indicates the “political” field only, it is determined that the original sentence to be translated includes context related to the political field. Note that context information other than field information can be also extracted from the original sentence numbered 31.
  • As a result of the context process performed for the extended context process range, sufficient context related to the original sentence to be translated can be obtained. Therefore, this context information is passed to the [0131] translation processing unit 6 in order to perform a translation process.
  • Then, the [0132] translation processing unit 6 performs the translation process based on the context information, that is, the political field. The word “parti” used in the political field is selected as a translated word of the original word “party”.
  • As a result, a translated sentence “Ceci a force la parti á changer sa position.” can be obtained for the original sentence “This has forced the party to change its stance.”[0133]
  • As described above, even if there is no translated word identifying a field in a range to be translated, the field can be identified from context outside the range to be translated. Therefore, a suitable translation process depending on a field can be performed. [0134]
  • Provided next is the explanation about a context process according to the second embodiment of the present invention, by referring to the drawings. This context process is intended to estimate a document type. [0135]
  • As a document type, a political article, thesis, operating manual, etc. can be specified. Translation is sometimes performed in a “da/dearu” tone, or performed without using a personal pronoun, depending on a document type. [0136]
  • FIG. 10 shows a correspondence table between a field and a document type when English text is translated into Japanese text. In this figure, if the original text is a political article, translation is performed in a “da/dearu” tone, and in a passive voice. If the original text is an operating manual, translation is performed in a “desu/masu” tone, and in an imperative form. Here, the imperative form and the passive voice are default expressions of translated text corresponding to an English expression such as “must be”, etc. [0137]
  • For example, the field of the original text shown in FIG. 5 is the political field. Accordingly, the [0138] context processing unit 5 references the column of the political article shown in FIG. 10, and instructs the translation processing unit 6 to translate the article in the “da/dearu” tone and in a passive voice.
  • The [0139] translation processing unit 6 receives document type information as well as other context information from the context processing unit 5, and performs the translation process.
  • As described above, even if a range to be translated includes no expression identifying a document type, it can be identified based on context outside the range to be translated by estimating a document type. As a result, a suitable translation process depending on a document type can be performed. [0140]
  • Provided next is the explanation about a context process according to the third embodiment of the present invention, by referring to the drawings. This context process is intended to complete an abbreviation. [0141]
  • The original [0142] text displaying unit 1 is assumed to display the original text “In the last election, the Japan Socialist Party (JSP) enjoyed victory. The reason for JSP's victory was . . . ”.
  • A translation operator instructs the [0143] input unit 2 to translate the sentence beginning with “The reason . . . ” as a sentence to be translated.
  • Then, the [0144] context processing unit 5 performs a context process for this sentence. With the context process, a dictionary is consulted for each of the words included in the sentence beginning with “The reason . . . ”. As a result of consulting the dictionary, the word “JSP” is recognized as an unregistered word.
  • The meaning of the unregistered word “JSP” cannot be determined according to this sentence only. [0145]
  • Therefore, if context cannot be determined within a range to be translated in original text to be translated, a context process range in the original text to be translated is extended in order to analyze the context beyond the range to be translated in the original text to be translated. [0146]
  • Assume that a sentence preceding the sentence to be translated is added to the context process range. That is, the context process range extending unit [0147] 7 extracts the original sentence “In the last election, the Japan Socialist Party (JSP) enjoyed victory.”, and passes the extracted sentence to the context processing unit 5.
  • The [0148] context processing unit 5 makes a comparison between the original sentence “In the last election, the Japan Socialist Party (JSP) enjoyed victory.” and the sentence to be translated including the unregistered word “JSP”.
  • As a result, the unregistered word “JSP” is determined as an abbreviation of the proper noun “Japan Socialist Party”. [0149]
  • To make such a comparison, factors such as that each of the words in the proper noun “Japan Socialist Party” begins with an uppercase letter, that the words beginning with the uppercase letters are in sequence, and that the abbreviation “JSP” generated by combining only the uppercase letters of the respective words is included, are used. [0150]
  • As described above, the meaning of the abbreviation can be clarified according to context of a neighborhood. [0151]
  • As a result of the context process performed in the extended context process range, the meaning of the abbreviation can be identified. Therefore, the result of the identification is passed to the [0152] translation processing unit 6 to perform a translation process.
  • The [0153] translation processing unit 6 then performs the translation process by using a translated word of the noun “Japan Socialist Party” as that of “JSP”.
  • As a result, the original sentence is translated as “[0154]
    Figure US20010047255A1-20011129-P00020
    . . . ” if the context process is not performed. In the meantime, the sentence is translated as “
    Figure US20010047255A1-20011129-P00021
    Figure US20010047255A1-20011129-P00402
    . . . ” if the context process is performed. That is, a translated sentence that is easy to read can be obtained.
  • FIG. 11 is a flowchart showing a context process according to the third embodiment of the present invention. [0155]
  • In FIG. 11, the [0156] context processing unit 5 consults a dictionary for each word included in a sentence to be translated in step S11.
  • Then, it is determined whether or not an abbreviation is included in the sentence to be translated in step S[0157] 12. If “YES”, the process goes to step S13.
  • In step S[0158] 13, it is determined whether or not an expression which completes the abbreviation exists in a context process range. If “NO”, the process goes to step S14.
  • In step S[0159] 14, the context process range extending unit 7 extends the context process range.
  • The [0160] context processing unit 5 then consults a dictionary for each word included in a sentence in the extended context process range in step S15. After that, the process goes back to step S13.
  • If it is determined that the expression which completes the abbreviation exists in the context process range in step S[0161] 13, the process goes to step S16.
  • In step S[0162] 16, the abbreviation is completed in the sentence to be translated according to the expression which completes the abbreviation in the context process range.
  • In step S[0163] 17, the translation processing unit 6 performs a translation process in consideration of the completed form of the abbreviation.
  • In the meantime, if there is no abbreviation in the sentence to be translated in step S[0164] 12, the process skips to step S17 to perform a translation process without considering a completion of the abbreviation.
  • As described above, even if a range to be translated includes no complete form of an abbreviation, it can be retrieved from context outside the range to be translated by estimating the abbreviation. As a result, a translation process can be performed with completion of the abbreviation, depending on need. [0165]
  • Provided next is the explanation about a context process according to the fourth embodiment of the present invention, by referring to the drawings. This context process is intended to identify a reference destination. [0166]
  • Assume that the original [0167] text displaying unit 1 displays the original text “In the last election, the Japan Socialist Party enjoyed victory. The reason for its victory was . . . ” .
  • A translation operator instructs the [0168] input unit 2 to translate the sentence beginning with “The reason . . . ” as a sentence to be translated.
  • Then, the [0169] context processing unit 5 performs a context process for this sentence. With the context process, a dictionary is consulted for each of the original words included in the sentence beginning with “The reason . . . ” . As a result of consulting the dictionary, what the pronoun “its” indicates is determined as being indefinite.
  • What the pronoun “its” indicates cannot be determined from only the sentence to be translated which begins with “The reason . . . ” . [0170]
  • Therefore, if context cannot be determined within a range to be translated in original text to be translated, a context process range is extended in order to analyze context beyond the range to be translated in the original text to be translated. [0171]
  • Assume that a sentence preceding the sentence to be translated is added to the context process range. That is, the context process range extending unit [0172] 7 extracts the original sentence “In the last election, the Japan Socialist Party enjoyed victory.”, and passes the extracted sentence to the context processing unit 5.
  • The [0173] context processing unit 5 then performs a context process for the original sentence “In the last election, the Japan Socialist Party enjoyed victory.” As a result, it is determined that the pronoun “its;” included in the phrase “its victory” indicates the “Japan Socialist Party”. This is because the winner of the “victory” is the “Japan Socialist Party”.
  • The following paper refers to such a context process. [0174]
  • Source Document: [0175]
  • Information Processing Society Study Report [0176]
  • Natural Language Processing Study Report [0177]
  • Issue Date: November 19, 1993 [0178]
  • Title of Paper: [0179]
  • Context Processing System DIANA Using Context Restriction and Context Selection [0180]
  • Number of Paper: 98-8 [0181]
  • Author: [0182]
  • Tetsuya Nasukawa [0183]
  • As described above, a reference destination can be clarified according to context of a neighborhood. [0184]
  • As a result of the context process performed in the extended context process range, the reference destination can be identified. Then, the result of the identification is passed to the [0185] translation processing unit 6 to perform a translation process.
  • Then, the [0186] translation processing unit 6 performs the translation process using the context information indicating that the pronoun “its” points to “the Japan Socialist Party”, depending on need.
  • As a result, the phrase “its victory” is translated as “[0187]
    Figure US20010047255A1-20011129-P00022
    ” if the context process is not performed. In the meantime, this phrase is translated as “
    Figure US20010047255A1-20011129-P00023
    ” if the context process is performed. That is, a translated sentence which is easy to read can be obtained.
  • FIG. 12 is a flowchart showing a context process according to the fourth embodiment of the present invention. [0188]
  • In FIG. 12, the [0189] context processing unit 5 first consults a dictionary for each word in a sentence to be translated in step S21.
  • Then, it is determined whether or not there is a word having an ambiguous reference relationship in a range to be translated in step S[0190] 22. If “YES”, the process goes to step S23.
  • In step S[0191] 23, it is determined whether or not a context process range includes a word or a word sequence which could possibly be a reference destination. If “NO”, the process goes to step S24.
  • In step S[0192] 24, the context process range extending unit 7 extends the context process range.
  • After the [0193] context processing unit 5 consults a dictionary for each word in a sentence in the extended context process range in step S25, the process goes to step S23.
  • If it is determined that the word or the word sequence could possibly be a reference destination in step S[0194] 23, the process goes to step S26.
  • In step S[0195] 26, the reference destination of the sentence in the extended context process range is identified.
  • In step S[0196] 27, a process such as replacing the word having the ambiguous reference relationship in the range to be translated with a word at a reference destination, etc. is performed.
  • In step S[0197] 28, the translation processing unit 6 performs a translation process in consideration of the word at the reference destination.
  • In step S[0198] 22, if it Ls determined that the range to be translated includes no word having the ambiguous reference relationship, the process goes to step S28 to perform the translation process without considering the word at the reference destination.
  • As described above, even if a range to be translated includes no word pointed to by a demonstrative pronoun included in that range, a word pointed to by the demonstrative pronoun can be retrieved from context outside the range to be translated, by estimating a reference destination. As a result, a translation process can be performed while replacing a demonstrative pronoun, depending on need. [0199]
  • Provided next is the explanation about operations performed by the translation range identifying unit [0200] 4 included in the translating apparatus according to the first embodiment of the present invention, by referring to the drawings. This unit is intended to identify a sentence to be translated included in original text where one sentence does not terminate in one line, according to a position of a cursor 10.
  • Assume that the original [0201] text displaying unit 1 displays the original text “The talks were held in the Upper House. This has forced the party to change its stance.”, as shown in FIG. 13.
  • In this case, the sentences do not terminate in one line, and they are also not numbered. [0202]
  • A translation operator operates the [0203] input unit 2 to move the cursor 10 to an arbitrary position of a sentence to be translated for specification.
  • Then, the translation range identifying unit [0204] 4 identifies the sentence to be translated. That is, this unit detects the position of the cursor 10, and automatically determines which portion the translation operator intends to translate according to the position of the cursor 10. Since translation is performed by sentence, the sentence pointed to by the cursor 10 is extracted.
  • For example, if the cursor is positioned at the character “e” as shown in FIG. 13, backward scanning is started from the character “e”. Then, the word beginning with an uppercase letter “This” is searched. Since the period “.” immediately follows the word “House” which immediately precedes the word “This”, the word “This” is determined to be the beginning of the sentence in which the [0205] cursor 10 is positioned.
  • Then, forward scanning is performed from the character “e” at which the [0206] cursor 10 is positioned. Since the period “.” immediately follows the word “stance”, this word is determined to be the last word of the sentence in which the cursor 10 is positioned.
  • As a result, the sentence “This has forced the party to change its stance” is determined as the sentence to be translated. [0207]
  • Then, the [0208] context processing unit 5 performs a context process for the original sentence “This has forced the party to change its stance.”, which is identified by the translation range identifying unit 4. If context cannot be determined from this sentence only, the context process range extending unit 7 extends the context process range where the context process is performed, to the original sentence “The talks were held in the Upper House.”
  • The [0209] translation processing unit 6 then performs a translation process based on the context information obtained by the context processing unit 5.
  • As a result, a translated sentence “[0210]
    Figure US20010047255A1-20011129-P00024
    Figure US20010047255A1-20011129-P00403
    Figure US20010047255A1-20011129-P00404
    ” can be obtained for the original sentence “This has forced the party to change its stance.”
  • Provided next is the explanation about operations performed by the translation range identifying unit [0211] 4 included in the translating apparatus according to the second embodiment of the present invention, by referring to the drawings. The translation range identifying unit 4 is intended to identify a range to be translated in original text where one sentence does not terminate in one line, according to shaded area 11 produced by dragging a mouse, etc.
  • Assume that the original [0212] text displaying unit 1 displays the original text “The talks were held in the Upper House. This has forced the party to change its stance.”, as shown in FIG. 14.
  • In this case, the sentences do not terminate in one line, and they are also not numbered. [0213]
  • A translation operator operates the [0214] input unit 2, and produces a shaded area 11 in order to select a sentence to be translated.
  • Then, the translation range identifying unit [0215] 4 identifies the sentence to be translated. That is, the translation range identifying unit 4 detects the range of the shaded area 11, and automatically determines which portion the translation operator intends to translate according to the shaded area 11.
  • For example, if the shaded [0216] area 11 is made over the range “The talks were held in the Upper House. This has forced the party to change” in FIG. 14, it is determined whether or not the beginning of the range contained in the shaded area 11, is the beginning of a sentence. Since the word beginning with the uppercase letter “The” is located at the beginning, this word is determined as the beginning of the sentence.
  • Then, forward scanning is started from the word “The” in order to detect a termination of the sentence. Since the period “.” exists between the word “House” and the word “This”, it is determined that the termination of the sentence exists between the words “House” and “This”. [0217]
  • The forward scanning finally reaches the last word “change” in the selected range of the shaded [0218] area 11, without detecting an additional termination of a sentence other than the above described termination, as a result of continuing the forward scanning.
  • Then, it is determined whether or not the last word “change” is the end of the sentence. Since there is no period “.” immediately after the last word “change”, the last word “change” is not determined as the end of the sentence. [0219]
  • Accordingly, the range from the most recently detected termination to the last word “change” is determined as being incomplete, and the range “This has forced the party to change.” is removed from the range selected by the shaded [0220] area 11.
  • As a result, the sentence “The talks were held in the Upper House.” is determined as the sentence to be translated. [0221]
  • The above described embodiment refers to the case where a sentence is completely included in the range selected by the shaded [0222] area 11. However, a sentence partially included in a range selected by the shaded area 11 may also be a sentence to be translated.
  • Then, the [0223] context processing unit 5 performs a context process for the original sentence “The talks were held in the Upper House.”, which is identified by the translation range identifying unit 4. If context cannot be determined based on only this sentence, the context process range extending unit 7 extends a context process range where the context process is performed.
  • The [0224] translation processing unit 6 then performs a translation process based on context information obtained by the context processing unit 5.
  • As a result, a translated sentence “[0225]
    Figure US20010047255A1-20011129-P00026
    Figure US20010047255A1-20011129-P00405
    ” can be obtained for the original sentence “The talks were held in the Upper House.”.
  • As described above, a sentence to be translated can be extracted by being suitably identified, thereby performing a translation process accurately. [0226]
  • Furthermore, a group of sentences to be translated is identified, thereby performing a translation process accurately. [0227]
  • Provided next is the explanation about a method for processing an inline tag performed by the translating apparatus according to the first embodiment of the present invention, by referring to the drawings. This method is intended to detect an inline tag and ignore it as a non-target in a range to be translated, if the inline tag of a markup language, etc. exists in original text. [0228]
  • Assume that the original [0229] text displaying unit 1 displays the original sentence “This is <B> a </B> pen.”, as shown in FIG. 15. In this case, “<B>” and “</B>” are inline tags used to emphasize the indefinite article “a”.
  • A translation operator operates the [0230] input unit 2, moves a cursor 41 to a character “i”, and issues a translation instruction.
  • The translation range identifying unit [0231] 4 then identifies a sentence to be translated. That is, this unit detects the position of the cursor 41, and automatically determines which portion the translation operator intends to translate, according to the position of the cursor 41. Since translation is performed by sentence, a sentence where the cursor 41 is positioned is extracted.
  • For example, since the [0232] cursor 41 is positioned at the character “i”, backward scanning is started from the character “i”. Then, the word beginning with an uppercase letter “This” is located in FIG. 15, and this word is determined as the beginning of the sentence where the cursor 41 is positioned.
  • Then, forward scanning is started from the character “i” at which the [0233] cursor 41 is positioned. Then, the inline tags “<B>” and “</B>” are detected using an interpreter of a markup language. The detected inline tags are ignored as a non-target of translation. Since the period “.” immediately follows the word “pen” as a result of continuing the forward scanning, the word “pen” is determined as the last word in the sentence in which the cursor 41 is positioned.
  • As a result, the sentence “This is a pen.” is determined as the sentence to be translated. [0234]
  • Then, the [0235] context processing unit 5 performs a context process for the original sentence “This is a pen.”, which is identified by the translation range identifying unit 4. If context cannot be determined according to only the original sentence “This is a pen.”, the context process range extending unit 7 extends a context process range where the context process is performed.
  • The [0236] translation processing unit 6 then performs a translation process based on the context information obtained by the context processing unit 5.
  • As a result, a translated sentence “[0237]
    Figure US20010047255A1-20011129-P00027
    ” can be obtained for the original sentence “This is a pen.”.
  • As described above, a portion unrelated to translation can be removed by ignoring an inline tag, thereby a translation process can be performed accurately. [0238]
  • Provided next is the explanation about a method for processing an inline tag performed by the translating apparatus according to the second embodiment of the present invention, by referring to the drawings. This method is intended to detect an inline tag, and ignore it as a non-target in a context process range to be extended, if the inline tag of a markup language, etc. is included in original text. [0239]
  • Assume that the original [0240] text displaying unit 1 displays the original text “This is <B> a </B> pen. And that is a book.” is displayed, as shown in FIG. 16. Here, “<B>” and “</B>” are inline tags used to emphasize the indefinite article “a”.
  • The translation operator operates the [0241] input unit 2, moves a cursor 42 to a character “b”, and issues a translation instruction.
  • The translation range identifying unit [0242] 4 identifies a sentence to be translated. The translation range identifying unit 4 detects the position of the cursor 42, and automatically determines which portion the translation operator intends to translate, according to the position of the cursor 42. Since translation is performed by sentence, the sentence where the cursor 42 is positioned is extracted.
  • Since the [0243] cursor 42 is positioned at the character “b” in FIG. 16, backward scanning is started from this character. The word “And” which begins with an uppercase letter is located as a result of the scanning, and this word is determined to be the beginning of the sentence in which the cursor 42 is positioned.
  • Then, forward scanning is started from the character “b” at which the [0244] cursor 42 is positioned. Since the period “.” immediately follows the word “book”, this word is determined to be the last word of the sentence in which the cursor 42 is positioned.
  • As a result, the sentence “And that is a book.” is determined as a sentence to be translated. [0245]
  • The [0246] context processing unit 5 performs a context process for the original sentence “And that is a book.”, which is identified by the translation range identifying unit 4. If context cannot be determined according to only the original sentence “And that is a book.”, the context process range extending unit 7 extends a context process range where the context process is performed.
  • If the context process range is extended to a sentence immediately preceding the original sentence “And that is a book.”, backward scanning is started from the word “pen”. The inline tags “<B>” and “</B>” are detected using an interpreter of a markup language, and these tags are removed from the context process range as a non-targets in the extended context process range. The word “This” which begins with an uppercase letter, is located as a result of continuing the backward scanning, and determined to be the beginning of the sentence in the context process range. [0247]
  • As a result, the original sentence “This is a pen.” is added to the context process range, and the [0248] context processing unit 5 detects context information from the extended context process range.
  • The [0249] translation processing unit 6 then performs a translation process based on the context information obtained by the context processing unit 5.
  • As a result, a translated sentence “[0250]
    Figure US20010047255A1-20011129-P00028
    ” is obtained for the original sentence “And that is a book.”
  • As described above, a portion unrelated to context can be removed by removing an inline tag from a context process range, thereby performing a context process accurately. [0251]
  • Provided next is the explanation about a method for extending a context process range performed by the translating apparatus according to the first embodiment of the present invention. This method is intended to extend a context process range sentence by sentence, backwards and forwards until required context can be obtained. [0252]
  • In FIG. 5, if context information cannot be obtained from only the original sentence numbered [0253] 32 “This has forced the party to change its stance.”, the context process range is extended to a sentence immediately preceding or succeeding the original sentence numbered 32.
  • If required context cannot be obtained even though the context process range is extended to the sentence immediately preceding or succeeding the sentence to be translated, the context process range extending unit [0254] 7 further extends the context process range to sentences preceding or succeeding the sentence to be translated until the required context can be obtained. The method for extending a context process range by sentence backwards and forwards is helpful for examining a correspondence where an interval between sentences is an important factor, etc.
  • If a sentence including required context is separate from a sentence to be translated, a context process range may be extended to before and after the sentence to be translated in units of a plurality of sentences. [0255]
  • As described above, by extending a context process range sentence by sentence, a context process such as estimating an abbreviation, estimating a reference destination, etc. can be performed efficiently, thereby translating a portion of a document efficiently and accurately. [0256]
  • Provided next is the explanation about a method for extending a context process range performed by the translating apparatus according to the second embodiment of the present invention, by referring to the drawings. This method is intended to extend the context process range in paragraphs before or after a sentence to be translated until required context can be obtained. [0257]
  • As shown in FIG. 17A, the original [0258] text displaying unit 1 is assumed to display original text, and shaded area 20 is made in a sentence to be translated “The talks were held in the Upper House.” included in the original text.
  • The [0259] context processing unit 5 performs a context process for the sentence to be translated in which the shaded area 20 is made. If sufficient context information cannot be obtained from the shaded sentence as a result of the context process, the context process range extending unit 7 extends the context process range to an entire paragraph including the sentence to be translated.
  • That is, the context process range extending unit [0260] 7 specifies the text contained in the shaded area 21 is made “Secondly, . . . The talks were held in the Upper House. This has forced the party to change its stance.”, as a context process range, as shown in FIG. 17B.
  • To identify the paragraph as the context process range, backward scanning is started from the shaded original sentence to be translated in order to check to see if any sentence exists. A sentence which begins with “Secondly,” is determined to precede the sentence to be translated where the shaded [0261] area 20 is made. Then, it is determined whether or not the sentence beginning with “Secondly,” is the beginning of the paragraph. Since this sentence is indented and an empty line exists between the preceding sentence beginning with “Firstly” and this line, it is determined to be the first sentence of the paragraph.
  • Next, forward scanning is started from the sentence to be translated where the shaded [0262] area 20 is made, in order to check to see if any sentence exists. The sentence “This has forced the party to change its stance.” follows the shaded sentence to be translated. Then, it is determined whether or not the detected sentence is the last sentence of the paragraph. Since no sentence immediately follows this sentence in the same line; there is an empty line between the succeeding sentence and this sentence; and the next sentence is indented, the sentence “This has forced the party to change its stance.” is determined to be the last sentence of the paragraph.
  • With the above described method, the context process range after extension is identified by detecting the first and the last sentences of a paragraph. [0263]
  • Then, the [0264] context processing unit 5 extracts context information from the context process range contained in the shaded area 21 is made, as shown in FIG. 17B. If sufficient context information cannot be obtained from the context process range contained in the shaded area 21 is made, the context process range extending unit 7 extends the context process range to the entire text of the next paragraph.
  • Assume that the context process range extending unit [0265] 7 extends the context process range to the entire text of the preceding paragraph, as shown in FIG. 17C, and adds the sentence which begins with “Firstly,” to the context process range. As a result, the text contained in the shaded area 22 is made “Firstly, . . . , secondly, . . . . The talks were held in the Upper House. This has forced the party to change its stance.” is specified as the context process range.
  • The [0266] context processing unit 5 extracts context information from the context process range contained in the shaded area 22, as shown in FIG. 17C. If sufficient context information is extracted from the context process range contained in the shaded area 22, the context processing unit 5 passes the extracted context information to the translation processing unit 6.
  • The [0267] translation processing unit 6 translates the sentence to be translated where the shaded area 20 is made, based on the context information passed from the context processing unit 5.
  • Note that the method for extending a context process range in paragraphs backwards and forwards is helpful for making a field determination where the existence/non-existence of a word identifying that a specific field exists is important. [0268]
  • If a sentence including required context is separate from a sentence to be translated, a context process range may be extended before or after the sentence to be translated in units of a plurality of paragraphs. [0269]
  • As described above, a context process range is extended in paragraphs corresponding to a range to be translated until a translated sentence can be determined for an original sentence. Accordingly, a context process such as estimating a field, estimating a document type, etc. can be performed efficiently, thereby translating a portion of a document efficiently and accurately. [0270]
  • Provided next is the explanation about the method for extending a context process range performed by the translating apparatus according to the third embodiment of the present invention, by referring to the drawings. This method is intended to change the method for extending a context process range depending on a required type of context process. [0271]
  • FIG. 18 shows a correspondence table between a type of context process and the method for extending a context process range. This table is pre-stored in the context process range extending unit [0272] 7.
  • In FIG. 18, when field estimation is made, a context process range is extended in paragraphs before or after a sentence to be translated. [0273]
  • When document type estimation is made, the context process range is extended in paragraphs before or after the sentence to be translated. [0274]
  • To complete an abbreviation, the context process range is extended in sentences before or after the sentence to be translated. [0275]
  • In the meantime, to refer to a reference destination, the context process range is extended in sentences before the sentence to be translated. [0276]
  • FIG. 19 is a flowchart showing the method for extending a context process range performed by the translating apparatus according to the third embodiment of the present invention. [0277]
  • In FIG. 19, the [0278] context processing unit 5 consults a dictionary for each word included in a sentence to be translated in step S31.
  • Then, one type of context process is selected for the sentence to be translated, and the selected context process is performed in step S[0279] 32.
  • In step S[0280] 33, it is determined whether or not sufficient context information can be obtained from a range to be translated. If “NO”, the process goes to step S34.
  • In step S[0281] 34, the correspondence table shown in FIG. 18 is referenced to obtain a method for extending a context process range according to the selected type of context process. If the type of context process is a field estimation or document type estimation, the context process range is extended in paragraphs before or after the sentence to be translated. If the type of context process is completion of an abbreviation, the context process range is extended in sentences before or after the sentence to be translated. If the type of context process is referring to a reference destination, the context process range is extended in sentences before the sentence to be translated.
  • In step S[0282] 35, the context process range extending unit 7 extends the context process range based on the method for extending a context process range obtained in step S34.
  • The [0283] context processing unit 5 then performs a context process for text in the extended context process range in step S36.
  • In step S[0284] 37, the context processing unit 5 determines whether or not sufficient context information can be obtained from the extended context process range. If “NO”, the process goes to step S35 to repeatedly extend the context process range.
  • If it is determined that sufficient context information can be obtained from the extended context process range, the process goes to step S[0285] 38.
  • The [0286] context processing unit 5 determines whether or not all types of context processes have been performed in step S38. If “NO”, the process goes to step S32 to perform a next type of context process, and completes the process after all types of context processes have been performed.
  • If it is determined that sufficient context information can be obtained from the context process range in step S[0287] 33, the process goes to step S38.
  • As described above, a context process can be efficiently performed by determining a direction in which a context process range is extended based on a type of context process, thereby translating a portion of a document efficiently and accurately. [0288]
  • Additionally, a context process for estimating a reference destination can be efficiently performed by extending a context process range in sentences in a backward direction, thereby translating a portion of a document efficiently and accurately. [0289]
  • Furthermore, a context process can be performed in a context process range having a context corresponding to a range to be translated by terminating extension of the context process range at a boundary of the context, thereby performing the context process accurately and quickly. [0290]
  • Provided next is the explanation about a method for dividing a range to be translated performed by the translating apparatus according to the first embodiment of the present invention. This method is intended to perform translation by dividing the range to be translated for each paragraph, if the range to be translated straddles a plurality of paragraphs. This is also because context may change at a change of paragraph. [0291]
  • That is, if the last sentence of a paragraph A and the first sentence of a paragraph B are simultaneously specified as sentences to be translated by a process of dragging a mouse, etc., the last sentence of the paragraph A and the first sentence of the paragraph B are scanned when a context process is performed. Then, it is detected that these sentences respectively belong to different paragraphs, so that the range to be translated is determined to straddle the plurality of paragraphs. [0292]
  • Then, a translation process is respectively performed for the last sentence of the paragraph A and the first sentence of the paragraph B that are determined to straddle the plurality of paragraphs. [0293]
  • As a result, the last sentence of the paragraph A can be translated based on only the context of the paragraph A, while the first sentence of the paragraph B can be translated based on only the context of the paragraph B. Accordingly, even if a topic suddenly changes at the change of paragraph from A to B, translation can be performed accurately. [0294]
  • Assuming that a topic of baseball changes to the one such as the weather of the next day, means of transportation to a baseball ground, etc., a translation process can be performed for respective fields such as a baseball field, a weather field, and a transportation field, thereby performing the translation process accurately. [0295]
  • As described above, context corresponding to a range to be translated can be accurately extracted by dividing the range into paragraphs, even if a topic suddenly changes at a change to a next paragraph. As a result, a translation process can be performed accurately. [0296]
  • Provided next is the explanation about a method for dividing a range to be translated, performed by the translating apparatus according to the second embodiment of the present invention. This method is intended to divide a range to be translated for each paragraph, and set each context process range for each of the divided ranges to be translated for each of the paragraphs. This is because context may often change as a paragraph changes. [0297]
  • If the last sentence of the paragraph A and the first sentence of the paragraph B are simultaneously specified as sentences to be translated by a process of dragging a mouse, etc., these sentences are scanned when a context process is performed. Then, it is detected that these sentences respectively belong to different paragraphs, so that the range to be translated is determined to straddle the plurality of paragraphs. [0298]
  • A translation process is then performed respectively for the last sentence of the paragraph A and the first sentence of the paragraph B that are determined to straddle the plurality of paragraphs. [0299]
  • If sufficient information cannot be obtained from the last sentence of the paragraph A when a translation process is performed for the last sentence of the paragraph A, a context process range is extended based on only the last sentence of the paragraph A. Then, context information is extracted from the extended context process range, and the last sentence of the paragraph A is translated. [0300]
  • In the meantime, if sufficient context information cannot be obtained from the first sentence of the paragraph B, a context process range is extended based on only the first sentence of the paragraph B. Then, context information is extracted from the extended context process range, and the first sentence of the paragraph B is translated. [0301]
  • That is, the last sentence of the paragraph A can be translated independently from the first sentence of the paragraph B, and the first sentence of the paragraph B can be translated independently from the last sentence of the paragraph A. Therefore, even if a topic suddenly changes at the change of paragraph from A to B, translation can be performed accurately. [0302]
  • As described above, context corresponding to a context process range can be accurately extracted by setting a context process range for each divided range to be translated for each paragraph, when a topic changes at a change of paragraph. [0303]
  • Provided next is the explanation about a method for displaying context information according to the first embodiment of the present invention, by referring to the drawings. This method is intended to display context information determined and obtained by a context process, at the side of a sentence to be translated. [0304]
  • Assume that the original [0305] text displaying unit 1 displays the original text “The talks were held in the Upper House. This has forced the party to change its stance.”, as shown in FIG. 20.
  • A shaded [0306] area 43 is made for a sentence to be translated, which is identified by the translation range identifying unit 4.
  • Then, the [0307] context processing unit 5 performs a context process for the sentence to be translated, and detects a compound “Upper House” from the sentence to be translated in which the shaded area 43 is made. Since the compound “Upper House” is used in a political field only, it is determined that the sentence to be translated in which the shaded area is made, relates to the political field.
  • Next, the original [0308] text displaying unit 1 displays context information 44 extracted by the context processing unit 5 from the sentence to be translated, at the side of the sentence to be translated. Since the sentence to be translated in which the shaded area 43 is made, for example, relates to the political field, a word “political” is displayed as context information 44.
  • As a result, the translation operator can recognize context of a sentence to be translated, and can perform a translation process while checking if a suitable context process is performed. [0309]
  • As described above, a translation process can be performed while recognizing a state of a context process in a range to be translated, by displaying context information in correspondence with a range to be translated in original text. [0310]
  • Furthermore, a translation process can be performed while identifying a sentence to be translated, by highlighting the sentence to be translated. [0311]
  • Provided next is the explanation about a method for displaying context information according to the second embodiment of the present invention, by referring to the drawings. This method is intended to display context information yet to be determined, which is obtained by a context process, at the side of a sentence to be translated. [0312]
  • Assume that the original [0313] text displaying unit 1 displays the original text “The talks were held in the Upper House. This has forced the party to change its stance”, as shown in FIG. 21A.
  • The shaded [0314] area 45 is made for the original sentence “This has forced the party to change its stance.”, which is identified as a sentence to be translated by the translation range identifying unit 45.
  • Then, the [0315] context processing unit 5 performs a context process for the sentence to be translated in which the shaded area 45 is made, and the context processing unit 5 detects the original words “party” and “stance”, respectively having a plurality of translated words, from the sentence to be translated in which the shaded area 45 is made. The original word “party” is used for a general field in addition to the political field, while the original word “stance” is used for the general field in addition to the sports field.
  • This original sentence contained in the shaded [0316] area 45, includes no word used in one particular field. Accordingly, the field of the sentence to be. translated in which the shaded area 45 is made, cannot be identified at this stage.
  • Therefore, the original [0317] text displaying unit 1 displays the context information 46 yet to be determined, which is extracted by the context processing unit 5 from the sentence to be translated. For example, since the political and sports fields are candidates of the sentence to be translated in which the shaded area 45 is made, the words “political” and “sports” are displayed as the context information 46.
  • With such a display, a translation operator can recognize whether or not context of a sentence to be translated is determined. As a result, the translation operator can perform a translation process while checking to see if a suitable context process is performed. [0318]
  • As described above, a translation process can be performed while recognizing a state of a context process being performed in a range to be translated, by displaying a plurality of pieces of context information as candidates. [0319]
  • Provided next is the explanation about a method for displaying context information according to the third embodiment of the present invention, by referring to the drawings. This method is intended to display a word at a reference destination, which is obtained by a context process, in correspondence with a word at a reference source. [0320]
  • Assume that the original [0321] text displaying unit 1 displays the original text “The talks were held in the Upper House. This has forced the party to change its stance.”, as shown in FIG. 22.
  • The [0322] context processing unit 5 performs a context process for a sentence to be translated, and determines that the original word “it” refers to the original word “party”.
  • The original [0323] text displaying unit 1 then displays the word at the reference destination in correspondence with the word at the reference source, based on the context information obtained by the context processing unit 5. For example, areas 50 and 51 shaded in identical colors are provided for the original word “party” at the reference destination and the original word “it” at the reference source.
  • As a result, the translation operator can perform a translation process while checking to see if a process of reference relationship is properly performed. [0324]
  • As described above, a translation process can be performed while recognizing a state of a context process, by highlighting a word at a reference destination and a word at a reference source. [0325]
  • Provided next is the explanation about a method for displaying context information according to the fourth embodiment of the present invention, by referring to the drawings. This method is intended to display a word having a plurality of translated words obtained by a context process in correspondence with a word identifying a field. [0326]
  • Assume that the original [0327] text displaying unit 1 displays the original text “The talks were held in the Upper House. This has forced the party to change its stance.”, as shown in FIG. 23.
  • The [0328] context processing unit 5 performs a context process for a sentence to be translated, and locates a word “
    Figure US20010047255A1-20011129-P00029
    ” used in a general field and a word “
    Figure US20010047255A1-20011129-P00030
    ” used in the political field, as translated words of the original word “party”. Additionally, the compound “Upper House” used in the political field only, is located. Accordingly, it is determined that the sentence to be translated relates to the political field.
  • As a result, the word “[0329]
    Figure US20010047255A1-20011129-P00031
    ” used in the political field is selected as a translated word of the original word “party”.
  • The original [0330] text displaying unit 1 displays a word having a plurality of translated words obtained by the context process based on the context information obtained by the context processing unit 5, in correspondence with a word identifying a field. For example, areas 52 an 53 shaded in identical colors are made for the original word “party” having a plurality of translated words, and the compound “Upper House” used as a basis for selecting one of the plurality of translated words for the original word “party”.
  • Accordingly, a translation operator can perform a translation process while checking to see if a process of reference relationship is properly performed for a sentence to be translated. [0331]
  • If there are a plurality of words that could be a basis for selecting a translated word for an original word, only a word closest to the original word may be displayed, in order to prevent a display from being confusing to the user. [0332]
  • For example, if there are a number of words used only in the political field in addition to the compound “Upper House” in a sentence to be translated, only the compound “Upper House” closest to the original word “party” is displayed in correspondence with the original word “party”. [0333]
  • In this case, searching for the word closest to the original word “party” to be used as the basis for selecting the translated word, is performed as described below. [0334]
  • That is, the field of the context of the sentence including the original word “party” is determined as being “political”, by making a field estimation. [0335]
  • Then, backward scanning is started from the original word “party”, and a word in the political field only is searched. [0336]
  • Next, a first searched word among words used in the political field only is displayed. In FIG. 23, the compound “Upper House”, for example, corresponds to the first searched word. [0337]
  • Additionally, a translation process can be performed while recognizing a state of a context process, by highlighting a short substitute and its formal name. [0338]
  • Note that a translation process may be performed while recognizing a field name, document type name, formal name or a word at a reference destination, by displaying the above described names or word. [0339]
  • Additionally, context information can be concisely displayed by highlighting a word closest to the first word among second words identifying a translated word of the first word, thereby recognizing a state of a context process with ease. [0340]
  • Furthermore, a sentence to be translated and a context process range can be separately recognized by highlighting the sentence to be translated using a first method, and highlighting the context process range using a second method, thereby performing a translation process while recognizing the context process range. [0341]
  • Provided next is the explanation about a dictionary search apparatus according to an embodiment of the present invention, by referring to the drawings. [0342]
  • FIG. 24 is a simplified block diagram showing configuration of the dictionary search apparatus according to the embodiment of the present invention. This apparatus is intended to properly perform a dictionary search process by obtaining appropriate context from a neighborhood of an original word for which a search operation is performed, depending on need. [0343]
  • In FIG. 24, the dictionary search apparatus comprises a searching [0344] unit 61, context process range setting unit 62, context processing unit 63, and a determining unit 64.
  • The searching [0345] unit 61 is intended to search a translated word for an original word, when specified.
  • The context process [0346] range setting unit 62 is intended to read original text around a specified original word, and set a context process range, if a plurality of translated words exist for the specified original word.
  • The [0347] context processing unit 63 is intended to perform a context process in a context process range set by the context process range setting unit 62.
  • The determining [0348] unit 64 is intended to determine a translated word for an original word, based on context obtained by the context processing unit 63.
  • As described above, a dictionary search can be performed in consideration of context by determining a translated word for an original word, thereby performing the dictionary search accurately. [0349]
  • Provided next is the explanation about operations performed by a dictionary search apparatus according to the first embodiment of the present invention. This apparatus is intended to estimate a field as a context process. [0350]
  • Assume that the original text “The talks were held in the Upper House. This has forced the party to change its stance.” is displayed, as shown in FIG. 25. First of all, a translation operator who performs a search moves a [0351] cursor 70 to a character “r” included in the original word “party”, in order to specify this word as an original word for which a search operation is to be performed.
  • Then, scanning is performed from the character “r” in backward and forward directions until a character space is detected. As a result, the word “party” is recognized as the original word for which the search operation is to be performed. [0352]
  • The searching [0353] unit 61 then consults a dictionary for the original word “party”, and selects translated words “
    Figure US20010047255A1-20011129-P00032
    ” or “
    Figure US20010047255A1-20011129-P00033
    ” when an English-Japanese search is performed. Since there are a plurality of translated words for the original word “party” in this case, a suitable translated word cannot be determined.
  • Then, a field is estimated to select the suitable translated word for the original word “party”. That is, if the original text relates to a general field, the word “[0354]
    Figure US20010047255A1-20011129-P00034
    ” can be selected as a translated word of the original word “party”. Or, if the original text relates to the political field, the word “
    Figure US20010047255A1-20011129-P00035
    ” is selected as the translated word of the original word “party”.
  • The context process [0355] range setting unit 62 sets the sentence “This has forced the party to change its stance.”, which includes the original word “party”, as a context process range, in order to identify a field of the original text.
  • The [0356] context processing unit 63 then performs a context process for the sentence “This has forced the party to change its stance.”, which is set by the context process range setting unit 62. Since this sentence includes no word identifying a field in this case, the field of the original word “party” cannot be determined.
  • Therefore, the context process [0357] range setting unit 62 extends the context process range, and sets the text “The talks were held in the Upper House. This has forced the party to change its stance.” as a context process range.
  • The [0358] context processing unit 63 then performs a context process for this text set by the context process range setting unit 62. Since this text includes the compound “Upper House” used in the political field only, the field of the original word “party” can be determined to be the political field according to this compound.
  • The determining [0359] unit 64 therefore determines the word “
    Figure US20010047255A1-20011129-P00036
    ” relating to the political field, as the translated word of the original word “party” shown in FIG. 9.
  • The searching [0360] unit 61 consults a dictionary for the original word “party”, and selects the translated words “parti” and “groupe”, when an English-French search is performed. Since there are a plurality of translated words for the original word “party” in this case, a suitable translated word cannot be determined.
  • Next, field estimation is made in order to select the suitable word for the original word “party”. If the original text relates to a general field, the word “groupe” can be selected as a translated word of the original word “party”. Or, if the original text relates to the political field, the word “parti” can be selected as the translated word of the original word “party”. [0361]
  • The context process [0362] range setting unit 62 sets the sentence “This has forced the party to change its stance.”, which include the original word “party”, as a context process range, in order to identify the field of the original text.
  • The [0363] context processing unit 63 then performs a context process for the sentence “This has forced the party to change its stance.”, which is set by the context process range setting unit 62. Since this sentence does not include a word identifying a field in this case, the field of the original word “party” cannot be determined.
  • Therefore, the context process [0364] range setting unit 62 extends the context process range, and sets the text “The talks were held in the Upper House. This has forced the party to change its stance.” as a context process range.
  • The [0365] context processing unit 63 performs a context process for this text set by the context process range setting unit 62. Since this text includes the compound “Upper House” used in the political field only in this case, the field of the original word “party” can be determined to be the political field, based on this compound.
  • The determining [0366] unit 64 therefore determines the word “parti” relating to the political field as the translated word of the original word “party”, and displays “parti” with a higher priority. The word “parti” is displayed as a first candidate in the example shown in FIG. 9. If this word is displayed later, the order of the candidates “parti” and groupe” is changed in order to display “parti” first.
  • FIG. 26 is a flowchart showing operations performed by the dictionary search apparatus according to the first embodiment of the present invention. [0367]
  • In this figure, a word to be searched is first identified according to a position of the [0368] cursor 70 in step S41.
  • Then, a dictionary is consulted for the word to be searched in step S[0369] 42.
  • In step S[0370] 43, it is determined whether or not ambiguity of a field exists in the word to be searched. If “YES”, the process goes to step S44.
  • In step S[0371] 44, a context process range is set for original text around the word to be searched, and a field estimation is made for the original text included in the set context process range.
  • In step S[0372] 45, it is determined whether or not sufficient context information can be obtained from the context process range. If “NO”, the process goes to step S46.
  • After the context process range is extended in step S[0373] 46, the process goes back to step S44 in order to repeatedly extend the context process range until sufficient context information can be obtained. If it is determined that sufficient context information can be obtained from the context process range in step S45, the process goes to step S47.
  • In step S[0374] 47, a word definition matching an estimated field is moved to a first location.
  • In step S[0375] 48, a result of the search is displayed.
  • If it is determined that the word to be searched has no ambiguity of a field in step S[0376] 43, the process goes to step S48 to display the word definition of the word searched.
  • As described above, even if a translated word cannot be determined for an original word, a context process can be efficiently performed for a dictionary search operation by extending a context process range. As a result, the dictionary search can be performed accurately. [0377]
  • In addition, a translated word corresponding to an estimated field can be selected by estimating a field as a context process, if there are a plurality of translated words for an original word. As a result, a dictionary search can be performed accurately. [0378]
  • For example, if first and second translated words respectively belonging to first and second fields for a first original word, and if a second original word outside a range to be translated belongs to the first field, the field of the first original word is estimated by selecting the first translated word for the first original word. As a result, a translated word of the first original word can be determined. [0379]
  • Provided next is the explanation about operations performed by the dictionary search apparatus according to the second embodiment of the present invention, by referring to the drawings. This apparatus completes an abbreviation as a context process. [0380]
  • Assume that the original text “In the last election, the Japan Socialist Party enjoyed victory. The reason for JSP's victory was . . . ” is displayed. [0381]
  • A translation operator who makes a search moves a cursor to, for example, the character “J” included in the word “JSP”, in order to specify this abbreviation as a word for which a search operation of a word definition is required. [0382]
  • Then, scanning is performed from the character “J” in forward and backward directions until a character space is detected. The word “JSP” is recognized as the word for which a search operation of a word definition is required. [0383]
  • The searching [0384] unit 61 consults a dictionary for the word “JSP”, and determines this word as being unregistered. Since the word “JSP” is written in uppercase letters in this case, this word is determined as an abbreviation.
  • The context process [0385] range setting unit 62 then sets the sentence “The reason for JSP's victory was ”, which includes the abbreviation “JSP”, as a context process range, in order to complete the abbreviation “JSP”.
  • The [0386] context processing unit 63 then performs a context process for the sentence “The reason for JSP's victory was . . . ”, which is set by the context process range setting unit 62. Since this sentence includes no character string which completes the “JSP”, the abbreviation “JSP” cannot be completed.
  • Accordingly, the context process [0387] range setting unit 62 extends the context process range, and sets the original text “In the last election, “the Japan Socialist Party enjoyed victory. The reason for JSP's victory was . . . ”, as the context process range. The context processing unit 63 performs a context process for this original text set by the context process range setting unit 62. Since this original text includes the character string “Japan Socialist Party”, each of which begins with an uppercase letter “J”, “S”, and “P”, this character string is determined as the character string which completes the abbreviation “JSP”.
  • Then, the searching [0388] unit 61 consults a dictionary for the character string “Japan Socialist Party”, and retrieves “
    Figure US20010047255A1-20011129-P00037
    ” as the definition of this character string.
  • The determining [0389] unit 64 therefore determines “
    Figure US20010047255A1-20011129-P00038
    ” as the definition of the word “JSP”.
  • FIG. 27 is a flowchart showing operations performed by the translating apparatus according to the second embodiment of the present invention. [0390]
  • In this figure, a word to be searched is identified according to a position of a cursor in step S[0391] 51.
  • Next, a dictionary is consulted for the word to be searched in step S[0392] 52.
  • In step S[0393] 53, it is determined whether or not the word to be searched is an abbreviation. If “YES”, the process goes to step S54.
  • In step S[0394] 54, a sentence including the word to be searched is set as a context process range.
  • Then in step S[0395] 55, it is determined whether or not included in the context process range, and the abbreviation is completed.
  • In step S[0396] 56, it is determined whether or not an expression completing the word to be searched exists in the context process range. If “NO”, the process goes to step S57.
  • The context process range is then extended to before and after the sentence including the word to be searched, and the process goes back to step S[0397] 54.
  • In the meantime, if the expression completing the abbreviation exists in the context process range, the process goes to step S[0398] 58.
  • In step S[0399] 58, a dictionary is consulted for the completed word to be searched.
  • In step S[0400] 59, a definition of the completed word to be searched is displayed.
  • If the word to be searched is not determined as an abbreviation in step S[0401] 53, the process goes to step S59 to display a definition of the word to be searched.
  • As described above, a complete name of an abbreviation can be retrieved by performing a context process such as estimating an abbreviation, thereby performing a dictionary search operation efficiently. [0402]

Claims (36)

What is claimed is:
1. A translating apparatus, comprising:
translation target range setting means for setting a range to be translated in an original text;
context process range setting means for setting a context process range in the original text which is at least partially different from the range to be translated; and
translation processing means for translating an original text in the range to be translated into a translated text, based on a context in the context process range.
2. A translating apparatus, comprising:
input means for inputting a range to be translated in an original text;
display means for displaying the original text;
context process range setting means for setting a context process range in the original text;
context processing means for performing a context process for the original text; and
translation processing means for translating an original text in the range to be translated into translated text, based on a context in the context process range.
3. The translating apparatus according to
claim 2
, wherein said context processing means makes a field estimation.
4. The translating apparatus according to
claim 2
, wherein said context processing means makes a document type estimation.
5. The translating apparatus according to
claim 2
, wherein said context processing means estimates an abbreviation.
6. The translating apparatus according to
claim 2
, wherein said context processing means estimates a reference destination.
7. The translating apparatus according to
claim 2
, further comprising:
translation range identifying means for identifying a sentence to be translated, based on the range to be translated, when the range to be translated in the original text, which is displayed by said displaying means, is input from said input means.
8. The translating apparatus according to
claim 2
, further comprising:
translation range identifying means for identifying a group of sentences to be translated, based on the range to be translated, when the range to be translated in the original text, which is displayed by said displaying means, is input from said input means.
9. The translating apparatus according to
claim 7
, wherein:
if an inline tag is included in the original text, said translation range identifying means identifies the sentence to be translated by ignoring the inline tag.
10. The translating apparatus according to
claim 7
, wherein:
said displaying means highlights the sentence to be translated, which is identified by said translation range identifying means.
11. The translating apparatus according to claim 2, wherein:
said displaying means highlights a first word having a plurality of translated words, and a second word closest to a location of the first word, among words which identify a translated word of the first word.
12. The translating apparatus according to
claim 7
, wherein:
said displaying means highlights the sentence to be translated, which is identified by said translation range identifying means, by using a first means, and highlights the context process range set by said context process range setting means, by using a second means.
13. The translating apparatus according to
claim 2
, wherein:
said displaying means highlights a word at a reference destination and a word at a reference source.
14. The translating apparatus according to
claim 2
, wherein:
said displaying means highlights an abbreviation and its complete name.
15. The translating apparatus according to
claim 2
, wherein:
if the translated text cannot be identified for the original text, said context process range setting means extends the context process range.
16. The translating apparatus according to
claim 15
, wherein:
said context process range setting means extends the context process range in sentences until the translated text can be determined for the original text.
17. The translating apparatus according to
claim 16
, wherein:
said context process range setting means extends the context process range in paragraphs which correspond to the range to be translated, until the translated text can be determined for the original text.
18. The translating apparatus according to
claim 15
, wherein:
said context process range setting means determines a direction to which the context process range is extended, according to a type of a context process.
19. The translating apparatus according to
claim 16
, wherein:
said context process range setting means extends the context process range in sentences in a forward direction until the translated text can be determined for the original text.
20. The translating apparatus according to
claim 2
, wherein:
if the range to be translated in the original text straddles a plurality of paragraphs, said context process range setting means sets each context process range in correspondence with each of ranges to be translated, which are divided in paragraphs.
21. The translating apparatus according to
claim 2
, wherein:
if an inline tag is included in the original text, said context process range setting means removes the inline tag from the context process range.
22. The translating apparatus according to
claim 2
, wherein:
said context process range setting means comprises:
boundary detecting means for detecting a boundary of context; and
terminating means for terminating an extension of the context process range at the boundary, when said boundary detecting means detects the boundary of context.
23. A translating apparatus, comprising:
input means for inputting a range to be translated in an original text;
context process range setting means for setting a context process range in the original text;
context processing means for performing a context process for the original text;
translation processing means for translating at least that part of the original text which is in the range to be translated into a translated text, based on context in the context process range; and
displaying means for displaying context information in correspondence with the range to be translated in the original text.
24. The translating apparatus according to
claim 23
, wherein:
if the translated text cannot be determined for the original text, said displaying means displays a plurality of pieces of context information, as candidates.
25. The translating apparatus according to
claim 23
, wherein:
the context information includes a field name, a document type name, a complete name corresponding to an abbreviation, or a word at a reference destination.
26. A translating apparatus, comprising:
translation target range setting means for setting a range to be translated in an original text;
context processing means for dividing the range to be translated into paragraphs in order to perform a context process, if the range to be translated in the original text straddles a plurality of paragraphs; and
translation processing means for translating the original text in the range to be translated into a translated text, based on context of each paragraph.
27. A dictionary search apparatus, comprising:
searching means for searching a translated word for an original word;
context process range setting means for setting a context process range in an original text;
context processing means for performing a context process for the original text; and
determining means for determining a translated word for the original word, based on context in the context process range.
28. The dictionary search apparatus according to
claim 27
, wherein:
if the translated word for the original word cannot be determined, said context process range setting means extends the context process range.
29. The dictionary search apparatus according to
claim 28
, wherein:
said context process range setting means extends the context process range in sentences until the translated word for the original word can be determined.
30. The dictionary search apparatus according to
claim 27
, wherein:
said context processing means makes a field estimation.
31. The dictionary search apparatus according to
claim 27
, wherein:
said context processing means estimates an abbreviation.
32. A dictionary search apparatus, comprising:
specifying means for specifying a word to be searched;
context process range setting means for setting a context process range;
context processing means for performing a context process in the context process range; and
determining means for determining a definition of the word to be searched, based on context of the context process range.
33. A translating method, comprising the steps of:
specifying a portion of an original text as a range to be translated;
referencing context of the original text outside the range to be translated, which is stored in a storage apparatus; and
translating the original text within the range to be translated.
34. A translating method, comprising the steps of:
referencing a second original word outside a range to be translated in an original text, if a first translated word belonging to a first field and a second translated word belonging to a second field exist for a first original word in the range to be translated in the original text; and
selecting the first translated word for the first original word, if the second original word belongs to the first field.
35. A data conversion apparatus, comprising:
conversion range setting means for setting a data conversion range;
context process range setting means for setting a context process range where a context process is performed; and
conversion processing means for converting the data, based on context of the context process range.
36. A computer-readable storage medium used to direct a computer to perform the functions of:
specifying a portion of an original text as a range to be translated;
referencing context of the original text outside the range to be translated, which is stored in a storage apparatus; and
translating the original text within the range to be translated.
US09/525,020 1995-11-27 2000-03-14 Translating apparatus, dictionary search apparatus, and translating method Expired - Fee Related US6418403B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US09/525,020 US6418403B2 (en) 1995-11-27 2000-03-14 Translating apparatus, dictionary search apparatus, and translating method

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
JP30698195A JP3952216B2 (en) 1995-11-27 1995-11-27 Translation device and dictionary search device
JP7-306981 1995-11-27
JP07-306981 1995-11-27
US08/728,852 US6151570A (en) 1995-11-27 1996-10-10 Translating apparatus, dictionary search apparatus, and translating method
US09/525,020 US6418403B2 (en) 1995-11-27 2000-03-14 Translating apparatus, dictionary search apparatus, and translating method

Related Parent Applications (2)

Application Number Title Priority Date Filing Date
US08/728,852 Division US6151570A (en) 1995-11-27 1996-10-10 Translating apparatus, dictionary search apparatus, and translating method
US08/728,852 Continuation US6151570A (en) 1995-11-27 1996-10-10 Translating apparatus, dictionary search apparatus, and translating method

Publications (2)

Publication Number Publication Date
US20010047255A1 true US20010047255A1 (en) 2001-11-29
US6418403B2 US6418403B2 (en) 2002-07-09

Family

ID=17963588

Family Applications (2)

Application Number Title Priority Date Filing Date
US08/728,852 Expired - Lifetime US6151570A (en) 1995-11-27 1996-10-10 Translating apparatus, dictionary search apparatus, and translating method
US09/525,020 Expired - Fee Related US6418403B2 (en) 1995-11-27 2000-03-14 Translating apparatus, dictionary search apparatus, and translating method

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US08/728,852 Expired - Lifetime US6151570A (en) 1995-11-27 1996-10-10 Translating apparatus, dictionary search apparatus, and translating method

Country Status (2)

Country Link
US (2) US6151570A (en)
JP (1) JP3952216B2 (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030046062A1 (en) * 2001-08-31 2003-03-06 Cartus John R. Productivity tool for language translators
US20090106016A1 (en) * 2007-10-18 2009-04-23 Yahoo! Inc. Virtual universal translator
US7543270B1 (en) * 2004-01-23 2009-06-02 The Math Works, Inc. Tracing output descriptions generated from corresponding input descriptions
US20090193332A1 (en) * 2008-01-28 2009-07-30 Samsung Electronics Co., Ltd. Method and apparatus for editing letters
US20100030760A1 (en) * 2008-08-04 2010-02-04 Justsystems Corporation Data input apparatus and data input method
US20100082324A1 (en) * 2008-09-30 2010-04-01 Microsoft Corporation Replacing terms in machine translation
US20100161311A1 (en) * 2008-12-19 2010-06-24 Massuh Lucas A Method, apparatus and system for location assisted translation
US20110202333A1 (en) * 2001-03-16 2011-08-18 Meaningful Machines, LLC Knowledge System Method and Apparatus
US20110219304A1 (en) * 2010-03-05 2011-09-08 Casio Computer Co., Ltd. Dictionary information display device
US8050906B1 (en) * 2003-06-01 2011-11-01 Sajan, Inc. Systems and methods for translating text
US11694041B2 (en) 2018-05-15 2023-07-04 Iflytek Co., Ltd. Chapter-level text translation method and device

Families Citing this family (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3768105B2 (en) * 2001-01-29 2006-04-19 株式会社東芝 Translation apparatus, translation method, and translation program
US7188064B2 (en) * 2001-04-13 2007-03-06 University Of Texas System Board Of Regents System and method for automatic semantic coding of free response data using Hidden Markov Model methodology
US20070265834A1 (en) * 2001-09-06 2007-11-15 Einat Melnick In-context analysis
MXPA04004573A (en) * 2001-11-13 2005-11-23 Applied Business Ct Ed Technol Systems and methods for rendering multilingual information on output device.
US8667390B2 (en) 2002-03-25 2014-03-04 Adobe Systems Incorporated Asynchronous access to structured data
US7810024B1 (en) * 2002-03-25 2010-10-05 Adobe Systems Incorporated Efficient access to text-based linearized graph data
US20040193557A1 (en) * 2003-03-25 2004-09-30 Olsen Jesse Dale Systems and methods for reducing ambiguity of communications
US8170863B2 (en) * 2003-04-01 2012-05-01 International Business Machines Corporation System, method and program product for portlet-based translation of web content
US8135575B1 (en) 2003-08-21 2012-03-13 Google Inc. Cross-lingual indexing and information retrieval
WO2005086021A2 (en) * 2004-03-02 2005-09-15 Melingo, Ltd. Embedded translation document method and system
GB2415518A (en) * 2004-06-24 2005-12-28 Sharp Kk Method and apparatus for translation based on a repository of existing translations
WO2006072882A2 (en) * 2005-01-10 2006-07-13 Melingo, Ltd. Embedded translation-enhanced search
JP4428266B2 (en) * 2005-03-22 2010-03-10 富士ゼロックス株式会社 Translation apparatus and program
JP2006276903A (en) * 2005-03-25 2006-10-12 Fuji Xerox Co Ltd Document processing device
JP2006276918A (en) * 2005-03-25 2006-10-12 Fuji Xerox Co Ltd Translating device, translating method and program
GB2444084A (en) * 2006-11-23 2008-05-28 Sharp Kk Selecting examples in an example based machine translation system
US7983897B2 (en) * 2007-02-14 2011-07-19 Google Inc. Machine translation feedback
DE102007010036A1 (en) * 2007-03-01 2008-09-04 Vodafone Holding Gmbh Method for electronic provision of detail information for shortcut represented with in presentation, involves electronically detecting shortcut by using detection device of electronic terminal
US7890493B2 (en) * 2007-07-20 2011-02-15 Google Inc. Translating a search query into multiple languages
US9262409B2 (en) * 2008-08-06 2016-02-16 Abbyy Infopoisk Llc Translation of a selected text fragment of a screen
JP6097529B2 (en) * 2012-11-15 2017-03-15 株式会社ゼンリンデータコム Information processing apparatus, information processing method, and program
US9323715B2 (en) * 2013-11-14 2016-04-26 Cavium, Inc. Method and apparatus to represent a processor context with fewer bits
US20150220514A1 (en) * 2014-02-04 2015-08-06 Ca, Inc. Data processing systems including a translation input method editor
US10140293B2 (en) * 2015-05-18 2018-11-27 Google Llc Coordinated user word selection for translation and obtaining of contextual information for the selected word
JP6930179B2 (en) * 2017-03-30 2021-09-01 富士通株式会社 Learning equipment, learning methods and learning programs

Family Cites Families (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2523745B1 (en) 1982-03-18 1987-06-26 Bull Sa METHOD AND DEVICE FOR PROTECTING SOFTWARE DELIVERED BY A SUPPLIER TO A USER
JPS638864A (en) * 1986-06-27 1988-01-14 Sharp Corp Translating device
US5155680A (en) 1986-10-24 1992-10-13 Signal Security Technologies Billing system for computing software
US4796181A (en) 1986-10-24 1989-01-03 Wiedemer John D Billing system for computer software
US5047928A (en) 1986-10-24 1991-09-10 Wiedemer John D Billing system for computer software
US4916614A (en) * 1986-11-25 1990-04-10 Hitachi, Ltd. Sentence translator using a thesaurus and a concept-organized co- occurrence dictionary to select from a plurality of equivalent target words
GB2199170A (en) * 1986-11-28 1988-06-29 Sharp Kk Translation apparatus
GB2199680A (en) 1986-11-28 1988-07-13 Sharp Kk Translating apparatus
JPS63223962A (en) 1987-03-13 1988-09-19 Hitachi Ltd Translating device
US5005127A (en) 1987-10-26 1991-04-02 Sharp Kabushiki Kaisha System including means to translate only selected portions of an input sentence and means to translate selected portions according to distinct rules
US5195032A (en) 1987-12-24 1993-03-16 Sharp Kabushiki Kaisha Device for designating a processing area for use in a translation machine
US4989233A (en) 1989-04-11 1991-01-29 Evanston Enterprises, Inc. Systems for capturing telephonic mass responses
FR2654851B1 (en) 1989-11-23 1992-03-27 Schlumberger Ind Sa METHOD AND SYSTEM FOR SECURE OPERATION OF SOFTWARE.
US5497319A (en) * 1990-12-31 1996-03-05 Trans-Link International Corp. Machine translation and telecommunications system
US5251130A (en) 1991-04-18 1993-10-05 International Business Machines Corporation Method and apparatus for facilitating contextual language translation within an interactive software application
JPH05120324A (en) 1991-10-30 1993-05-18 Sharp Corp Language processing system
JPH05314166A (en) * 1992-05-08 1993-11-26 Sharp Corp Electronic dictionary and dictionary retrieval device
US5327497A (en) 1992-06-04 1994-07-05 Integrated Technologies Of America, Inc. Preboot protection of unauthorized use of programs and data with a card reader interface
US6278967B1 (en) * 1992-08-31 2001-08-21 Logovista Corporation Automated system for generating natural language translations that are domain-specific, grammar rule-based, and/or based on part-of-speech analysis
US5386369A (en) 1993-07-12 1995-01-31 Globetrotter Software Inc. License metering system for software applications
WO1995004427A1 (en) 1993-08-03 1995-02-09 A.D.S. Communications, Inc. Automated dispatch system and method
JPH0756957A (en) * 1993-08-03 1995-03-03 Xerox Corp Method for provision of information to user
US5689560A (en) 1994-04-25 1997-11-18 International Business Machines Corporation Method and apparatus for enabling trial period use of software products: method and apparatus for allowing a try-and-buy user interaction
US5533103A (en) 1994-04-28 1996-07-02 Electronic Information Systems, Inc. Calling system and method
JPH0844719A (en) * 1994-06-01 1996-02-16 Mitsubishi Electric Corp Dictionary access system
US5633909A (en) 1994-06-17 1997-05-27 Centigram Communications Corporation Apparatus and method for generating calls and testing telephone equipment
JP2773652B2 (en) 1994-08-04 1998-07-09 日本電気株式会社 Machine translation equipment
JPH08329105A (en) * 1995-05-31 1996-12-13 Canon Inc Method and device for processing document
US5995922A (en) * 1996-05-02 1999-11-30 Microsoft Corporation Identifying information related to an input word in an electronic dictionary

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110202333A1 (en) * 2001-03-16 2011-08-18 Meaningful Machines, LLC Knowledge System Method and Apparatus
US8818789B2 (en) * 2001-03-16 2014-08-26 Meaningful Machines Llc Knowledge system method and apparatus
US6993473B2 (en) * 2001-08-31 2006-01-31 Equality Translation Services Productivity tool for language translators
US20030046062A1 (en) * 2001-08-31 2003-03-06 Cartus John R. Productivity tool for language translators
US8050906B1 (en) * 2003-06-01 2011-11-01 Sajan, Inc. Systems and methods for translating text
US7543270B1 (en) * 2004-01-23 2009-06-02 The Math Works, Inc. Tracing output descriptions generated from corresponding input descriptions
US8725490B2 (en) * 2007-10-18 2014-05-13 Yahoo! Inc. Virtual universal translator for a mobile device with a camera
US20090106016A1 (en) * 2007-10-18 2009-04-23 Yahoo! Inc. Virtual universal translator
US20090193332A1 (en) * 2008-01-28 2009-07-30 Samsung Electronics Co., Ltd. Method and apparatus for editing letters
US20100030760A1 (en) * 2008-08-04 2010-02-04 Justsystems Corporation Data input apparatus and data input method
US20100082324A1 (en) * 2008-09-30 2010-04-01 Microsoft Corporation Replacing terms in machine translation
US20100161311A1 (en) * 2008-12-19 2010-06-24 Massuh Lucas A Method, apparatus and system for location assisted translation
US9323854B2 (en) * 2008-12-19 2016-04-26 Intel Corporation Method, apparatus and system for location assisted translation
US20110219304A1 (en) * 2010-03-05 2011-09-08 Casio Computer Co., Ltd. Dictionary information display device
CN102193902A (en) * 2010-03-05 2011-09-21 卡西欧计算机株式会社 Dictionary information display device
US11694041B2 (en) 2018-05-15 2023-07-04 Iflytek Co., Ltd. Chapter-level text translation method and device

Also Published As

Publication number Publication date
US6151570A (en) 2000-11-21
US6418403B2 (en) 2002-07-09
JPH09146950A (en) 1997-06-06
JP3952216B2 (en) 2007-08-01

Similar Documents

Publication Publication Date Title
US6418403B2 (en) Translating apparatus, dictionary search apparatus, and translating method
JP3969628B2 (en) Translation support apparatus, method, and translation support program
US6516296B1 (en) Translating apparatus, dictionary search apparatus, and translating method
US5349368A (en) Machine translation method and apparatus
US6539116B2 (en) Information processing apparatus and method, and computer readable memory therefor
US20110093254A1 (en) Method and System for Using Alignment Means in Matching Translation
JPH07282063A (en) Machine translation device
JP2001195404A (en) Phase translation method and system
JP2002297588A (en) Method, program and apparatus for automatic creation for dictionary of translation example
JPH08305730A (en) Automatic method for selection of key phrase from document of machine-readable format to processor
US5890182A (en) Sentence processing method and apparatus
JP4113235B2 (en) Translation support device
US20070083855A1 (en) Method and apparatus to enhance context for specific instances of output text in source files
JP2960936B2 (en) Dependency analyzer
JP2838984B2 (en) General-purpose reference device
JPS6118074A (en) Pre-editing system
JPH07334526A (en) Dictionary retrieval and display device
JPH0765005A (en) Document fast read support display device, word processor and digital copying device
JPH1115826A (en) Document analyzer and its method
JP3999771B2 (en) Translation support program, translation support apparatus, and translation support method
JP2005189955A (en) Document processing method, document processor, control program, and recording medium
JP2893239B2 (en) Translation support dictionary search and display system
JP2608384B2 (en) Machine translation apparatus and method
JPH11203281A (en) Electronic dictionary retrieving device and medium stored with control program for the device
JP3939151B2 (en) Translated word selection device, translated word selection program, and medium storing translated word selection program

Legal Events

Date Code Title Description
FEPP Fee payment procedure

Free format text: PAYER NUMBER DE-ASSIGNED (ORIGINAL EVENT CODE: RMPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

FPAY Fee payment

Year of fee payment: 4

FPAY Fee payment

Year of fee payment: 8

REMI Maintenance fee reminder mailed
LAPS Lapse for failure to pay maintenance fees
STCH Information on status: patent discontinuation

Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362

FP Lapsed due to failure to pay maintenance fee

Effective date: 20140709