JP2009080727A

JP2009080727A - Translation device and program

Info

Publication number: JP2009080727A
Application number: JP2007250709A
Authority: JP
Inventors: Masahiro Kato; 雅弘加藤
Original assignee: Fuji Xerox Co Ltd
Current assignee: Fujifilm Business Innovation Corp
Priority date: 2007-09-27
Filing date: 2007-09-27
Publication date: 2009-04-16
Anticipated expiration: 2027-09-27
Also published as: JP4998176B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide a translation device and a program, improving convenience of a user when simultaneously outputting an original text and a translated text. <P>SOLUTION: This translation device 2 reads a document including the original sentence to acquire an original image. The translation device 2 specifies an original text area included in the document image. The translation device 2 recognizes the original text comprising characters included in the original text area and generates the translated text. The translation device 2 arranges the translated text inside the original text area of the document image as parent characters, and arranges the original text as kana characters alongside it to generate a translated document. <P>COPYRIGHT: (C)2009,JPO&INPIT

Description

本発明は、翻訳装置及びプログラムに関する。 The present invention relates to a translation apparatus and a program.

原文を含む原稿のスキャン画像を解析し、スキャン画像に含まれる原文を翻訳して出力する装置がある。例えば、下記特許文献１には、原文と翻訳文を行単位で交互に配置した画像を出力する方法が記載されている。この方法によると、例えば、利用者は翻訳結果の確認を容易に行うことができる。
特開平５−３２４７２０号公報 There is a device that analyzes a scanned image of a document including an original text, and translates and outputs the original text included in the scanned image. For example, the following Patent Document 1 describes a method of outputting an image in which an original sentence and a translated sentence are alternately arranged in line units. According to this method, for example, the user can easily confirm the translation result.
JP-A-5-324720

しかしながら、特許文献１の方法では、原文の文字サイズよりも翻訳文の文字サイズを小さくするため、原文の文字サイズによっては、翻訳文の文字サイズが小さくなりすぎてしまい、翻訳結果の確認が容易でなくなる場合がある。そもそも、利用者にとっての重要性は、原文よりも翻訳文の方が高い傾向にある。 However, in the method of Patent Document 1, since the character size of the translated sentence is made smaller than the character size of the original sentence, the character size of the translated sentence becomes too small depending on the character size of the original sentence, and it is easy to check the translation result. May disappear. To begin with, translations tend to be more important for users than for originals.

本発明は、上記課題を鑑みてなされたものであって、原文と翻訳文とを同時に出力する場合においての、利用者の利便性を向上させる翻訳装置及びプログラムの提供を、その目的の一つとする。 The present invention has been made in view of the above problems, and one of its purposes is to provide a translation device and a program that improve the convenience of the user when outputting an original sentence and a translated sentence simultaneously. To do.

請求項１の発明は、原文領域を含む原稿画像を取得する原稿画像取得手段と、前記原稿画像の前記原文領域を特定する原文領域特定手段と、前記原稿画像の前記原文領域に含まれる文字からなる原文を認識して翻訳文を生成する翻訳手段と、前記原稿画像の前記原文領域内に、翻訳文を親文字として配置するとともに、原文をルビ文字として配置し、翻訳文書を生成する翻訳文書生成手段と、を含むことを特徴とする翻訳装置である。ここで、ルビ文字とは、翻訳文書において翻訳文が縦書きである場合には翻訳文の文字列（親文字の文字列）の右又は左に添えて配置され、翻訳文が横書きである場合には親文字の文字列の上又は下に添えて配置される文字であって、その文字サイズが親文字の文字サイズよりも小さい文字をいう。 According to the first aspect of the present invention, there is provided a document image acquiring unit that acquires a document image including a source region, a source region specifying unit that specifies the source region of the document image, and characters included in the source region of the document image. A translation means for recognizing the original sentence to generate a translation sentence, and a translation document in which the translation sentence is arranged as a parent character in the original sentence area of the manuscript image and the original sentence is arranged as a ruby character to generate a translation document And a generation unit. Here, the ruby character is placed along the right or left of the character string of the translated sentence (parent character string) when the translated sentence is written vertically in the translated document, and the translated sentence is written horizontally. Is a character that is placed above or below the character string of the parent character and whose character size is smaller than the character size of the parent character.

請求項２の発明は、請求項１の発明において、前記原文領域に含まれる原文の文字サイズを特定する文字サイズ特定手段と、親文字の文字サイズを、前記文字サイズ特定手段により特定された文字サイズに基づいて決定する文字サイズ決定手段、をさらに含むことを特徴とする。 According to a second aspect of the present invention, in the first aspect of the present invention, the character size specifying means for specifying the character size of the original text included in the original text area, and the character size specified by the character size specifying means for the character size of the parent character. Character size determining means for determining based on the size is further included.

請求項３の発明は、請求項２の発明において、前記原文領域のサイズと、翻訳文の文字数と、前記決定された親文字の文字サイズと、に基づいて、少なくとも１つの行からなる翻訳文を配置する翻訳文領域を前記原文領域内に規則的に設定する翻訳文領域設定手段と、前記各翻訳文領域の間に、少なくとも１つの行からなる原文を配置するルビ文字領域を設定するルビ文字領域設定手段と、をさらに含むことを特徴とする。 According to a third aspect of the present invention, in the second aspect of the invention, a translated sentence comprising at least one line based on the size of the original text area, the number of characters of the translated sentence, and the determined character size of the parent character. A translated sentence area setting means for regularly setting a translated sentence area in which the text is arranged in the original sentence area, and a ruby character area in which the source sentence consisting of at least one line is set between the translated sentence areas. And a character area setting means.

請求項４の発明は、請求項３の発明において、前記原文領域に含まれる原文の各行の文字行画像を取得する文字行画像取得手段をさらに含み、前記翻訳文書生成手段は、前記各ルビ文字領域に、前記各文字行画像を配置する、ことを特徴とする。 The invention of claim 4 further includes character line image acquisition means for acquiring a character line image of each line of the original text included in the original text area according to the invention of claim 3, wherein the translated document generation means Each character line image is arranged in an area.

請求項５の発明は、請求項４の発明において、前記各文字行画像の高さが、翻訳文領域の間隔よりも大きいか否かを判断する判断手段と、その高さが前記間隔よりも大きいと判断された文字行画像を、その高さが前記間隔以下となるように縮小する第１縮小手段と、をさらに含むことを特徴とする。 According to a fifth aspect of the present invention, in the invention of the fourth aspect, the judging means for judging whether or not the height of each character line image is larger than the interval between the translated sentence regions, and the height is higher than the interval. It further includes first reduction means for reducing the character line image determined to be large so that the height thereof is equal to or less than the interval.

請求項６の発明は、請求項４又は５の発明において、前記原文領域内に配置される翻訳文の行数よりも前記文字行画像の数が大きい場合に、各文字行画像の長さを１より小さい倍率で縮小する第２縮小手段をさらに含み、前記第２縮小手段により縮小された各文字行画像の連結画像を各ルビ文字領域の長さで分割し、各分割画像を、新たな文字行画像とすることを特徴とする。 The invention of claim 6 is the invention of claim 4 or 5, wherein when the number of character line images is larger than the number of lines of translated sentences arranged in the source text area, the length of each character line image is set. A second reduction unit that reduces the image at a magnification smaller than 1 and divides the connected images of the character line images reduced by the second reduction unit by the length of each ruby character region, A character line image is used.

請求項７の発明は、請求項６の発明において、前記１より小さい倍率は、前記原文領域内に配置される翻訳文の行数を前記文字行画像の数で割った商以下の倍率である、ことを特徴とする。 According to a seventh aspect of the present invention, in the sixth aspect of the present invention, the magnification smaller than 1 is a magnification equal to or lower than a quotient obtained by dividing the number of translated text lines arranged in the source text area by the number of character line images. It is characterized by that.

請求項８の発明は、原文領域を含む原稿画像を取得する原稿画像取得手段、前記原稿画像の前記原文領域を特定する原文領域特定手段、前記原稿画像の前記原文領域に含まれる文字からなる原文を認識し、翻訳する翻訳手段、前記原稿画像の前記原文領域内に、翻訳文を親文字として配置するとともに、原文をルビ文字として配置し、翻訳文書を生成する翻訳文書生成手段としてコンピュータを機能させるプログラムである。 The invention according to claim 8 is a document image acquisition means for acquiring a document image including a text area, a text area specifying means for specifying the text area of the text image, and a text composed of characters included in the text area of the text image. A computer functioning as a translation document generating means for recognizing and translating a text, arranging a translated text as a parent character in the original text area of the manuscript image and arranging the original text as a ruby character to generate a translated document It is a program to let you.

請求項１及び請求項８の発明によれば、原文と翻訳文とを同時に出力する場合においての、利用者の利便性を向上させることができる。 According to the first and eighth aspects of the invention, it is possible to improve the convenience for the user when the original text and the translated text are output simultaneously.

請求項２の発明によれば、原文画像における原文の文字サイズに基づいて、親文字の文字サイズを決定することができる。 According to the invention of claim 2, the character size of the parent character can be determined based on the character size of the original text in the original text image.

請求項３の発明によれば、規則正しく、原文と翻訳文とを出力することができる。 According to the invention of claim 3, the original sentence and the translated sentence can be output regularly.

請求項４の発明によれば、原文を誤りなく出力することができる。 According to the invention of claim 4, the original text can be output without error.

請求項５の発明によれば、ルビ文字が親文字に重ならないように図ることができる。 According to the invention of claim 5, it is possible to prevent the ruby character from overlapping the parent character.

請求項６の発明によれば、翻訳文の行数よりも原文の行数を少なくすることができる。 According to the invention of claim 6, the number of lines in the original sentence can be made smaller than the number of lines in the translated sentence.

請求項７の発明によれば、翻訳文の行数よりも原文の行数を確実に少なくすることができる。 According to the invention of claim 7, the number of lines of the original sentence can be surely reduced from the number of lines of the translated sentence.

本発明の実施形態について図面を参照しながら説明する。図１は、本発明の実施形態に係る翻訳装置２のハードウェア構成図の一例である。同図に示すように、翻訳装置２は、中央処理部４０と、主記憶部４２と、補助記憶部４４と、画像読取部４６と、操作部４８と、ディスプレイ５０を含む。各部は、データバス５４を介して、相互に通信可能である。なお、翻訳装置２は、図示しないネットワークインターフェイスやプリンタ等も含む。 Embodiments of the present invention will be described with reference to the drawings. FIG. 1 is an example of a hardware configuration diagram of a translation apparatus 2 according to an embodiment of the present invention. As shown in the figure, the translation apparatus 2 includes a central processing unit 40, a main storage unit 42, an auxiliary storage unit 44, an image reading unit 46, an operation unit 48, and a display 50. Each unit can communicate with each other via a data bus 54. The translation device 2 includes a network interface, a printer, etc. (not shown).

中央処理部４０は、ＣＰＵやＭＰＵ等である。中央処理部４０は、主記憶部４２に予め格納されているプログラムに従って動作し、各部を制御する。また、中央処理部４０は、主記憶部４２に格納される情報を用いて演算し、主記憶部４２に出力する。なお、上記プログラムは、主記憶部４２に格納されるものに限らず、ＣＤ−ＲＯＭやＤＶＤ−ＲＯＭやＤＶＤ−ＲＯＭ等の情報記憶媒体に格納されるものであってもよいし、ネットワークから供給されるものであってもよい。 The central processing unit 40 is a CPU, MPU, or the like. The central processing unit 40 operates according to a program stored in advance in the main storage unit 42 and controls each unit. In addition, the central processing unit 40 calculates using information stored in the main storage unit 42 and outputs it to the main storage unit 42. The program is not limited to the one stored in the main storage unit 42 but may be stored in an information storage medium such as a CD-ROM, DVD-ROM, or DVD-ROM, or supplied from a network. It may be done.

主記憶部４２は、ＲＡＭやＲＯＭ等のメモリ素子である。主記憶部４２は、上記プログラムを格納する。また、主記憶部４２は、各部から入力される情報を格納する。 The main storage unit 42 is a memory element such as a RAM or a ROM. The main storage unit 42 stores the program. Further, the main storage unit 42 stores information input from each unit.

補助記憶部４４は、ハードディスク等である。補助記憶部４４は、中央処理部４０からの制御信号に従い、主記憶部４２に格納される情報を記憶する。 The auxiliary storage unit 44 is a hard disk or the like. The auxiliary storage unit 44 stores information stored in the main storage unit 42 in accordance with a control signal from the central processing unit 40.

画像読取部４６は、スキャナである。画像読取部４６は、中央処理部４０からの制御信号に従って、原稿を読み取り、読取った原稿画像を主記憶部４２に出力する。ここで、原稿は、第１言語の原文を含む。ここでは、第１言語は、日本語である。 The image reading unit 46 is a scanner. The image reading unit 46 reads a document in accordance with a control signal from the central processing unit 40 and outputs the read document image to the main storage unit 42. Here, the manuscript includes the original text in the first language. Here, the first language is Japanese.

操作部４８は、キーボードやマウス等である。操作部４８は、操作情報を主記憶部４２に出力する。ディスプレイ５０は、中央処理部４０からの制御信号に従い、主記憶部４２に格納される情報を表示する。 The operation unit 48 is a keyboard, a mouse, or the like. The operation unit 48 outputs operation information to the main storage unit 42. The display 50 displays information stored in the main storage unit 42 in accordance with a control signal from the central processing unit 40.

図２は、中央処理部４０が上記プログラムに従って動作することにより、翻訳装置２にて実現される機能のブロック図である。ここでは、本発明に関連する機能を中心として示す。同図に示すように、翻訳装置２は、原稿画像取得部６０と、原文領域特定部６２と、文字行画像取得部６４と、翻訳部６６と、親文字サイズ決定部６８と、配置領域設定部７０と、翻訳文書生成部７６と、翻訳結果出力部７８と、を含む。配置領域設定部７０は、親文字領域設定部７２と、ルビ文字領域設定部７４と、を含む。 FIG. 2 is a block diagram of functions realized by the translation apparatus 2 when the central processing unit 40 operates according to the program. Here, the functions related to the present invention are mainly shown. As shown in the figure, the translation apparatus 2 includes a document image acquisition unit 60, a source text region specification unit 62, a character line image acquisition unit 64, a translation unit 66, a parent character size determination unit 68, and an arrangement region setting. A section 70, a translation document generation section 76, and a translation result output section 78. The arrangement region setting unit 70 includes a parent character region setting unit 72 and a ruby character region setting unit 74.

原稿画像取得部６０は、画像読取部４６を中心として実現される。原稿画像取得部６０は、原稿を読み取り、原稿画像の画像データ（ビットマップデータ）を取得する。 The document image acquisition unit 60 is realized with the image reading unit 46 as the center. The document image acquisition unit 60 reads a document and acquires image data (bitmap data) of the document image.

原文領域特定部６２は、中央処理部４０を中心として実現される。原文領域特定部６２は、原稿画像取得部６０によって取得された原稿画像のレイアウトを解析し、原稿画像に含まれる原文領域の位置・範囲を特定する。本実施形態では、原文領域特定部６２は、図３に示すように、原文領域を、原文画像を囲む矩形領域として検出し、そのＸ座標とＹ座標の範囲を特定する。 The original text region specifying unit 62 is realized with the central processing unit 40 as the center. The original text area specifying unit 62 analyzes the layout of the original image acquired by the original image acquisition unit 60 and specifies the position / range of the original text area included in the original image. In the present embodiment, as shown in FIG. 3, the original text area specifying unit 62 detects the original text area as a rectangular area surrounding the original text image, and specifies the range of the X and Y coordinates.

文字行画像取得部６４は、中央処理部４０を中心として実現される。文字行画像取得部６４は、原文領域ごとに、当該原文領域に含まれる原文の各行の文字行画像を取得する。 The character line image acquisition unit 64 is realized with the central processing unit 40 as the center. The character line image acquisition unit 64 acquires, for each original text area, a character line image of each line of the original text included in the original text area.

本実施形態では、文字行画像取得部６４は、図４に示すように、原文領域ごとに射影演算を行い、垂直方向の軸（Ｙ軸）に対する文字画素の射影ヒストグラムを生成する。そして、文字行画像取得部６４は、原文領域ごとに、射影ヒストグラム基づいて、文字行領域の位置・範囲を特定する。すなわち、文字行画像取得部６４は、文字画素数が所定値ＴＨ以上であるＹ座標の範囲を特定し、特定した範囲を、それぞれ、各文字行領域の垂直方向の範囲とする。また、原文領域のＸ座標の範囲を、各文字行領域の水平方向の範囲とする。こうして、文字行画像取得部６４は、図５に示すように、原文領域ごとに、文字行領域を特定する。 In the present embodiment, as shown in FIG. 4, the character line image acquisition unit 64 performs a projection operation for each original text region, and generates a projection histogram of character pixels with respect to the vertical axis (Y axis). Then, the character line image acquisition unit 64 specifies the position / range of the character line region for each original text region based on the projection histogram. That is, the character line image acquisition unit 64 specifies a range of Y coordinates in which the number of character pixels is equal to or greater than a predetermined value TH, and sets the specified range as a range in the vertical direction of each character line region. Further, the range of the X coordinate of the original text area is set as the horizontal range of each character line area. Thus, the character line image acquisition unit 64 specifies a character line area for each original text area as shown in FIG.

そして、文字行画像取得部６４は、原文領域ごとに、各文字行領域内の画像（すなわち、文字行画像）を複製する。こうして、文字行画像取得部６４は、原文領域ごとに、各文字行画像を取得する。 And the character line image acquisition part 64 duplicates the image (namely, character line image) in each character line area | region for every original text area | region. Thus, the character line image acquisition unit 64 acquires each character line image for each original text area.

翻訳部６６は、中央処理部４０を中心として実現される。翻訳部６６は、原文領域に含まれる文字からなる原文を認識して第２言語の翻訳文を生成する。 The translation unit 66 is realized with the central processing unit 40 as the center. The translation unit 66 recognizes an original sentence composed of characters included in the original sentence area and generates a translated sentence in the second language.

本実施形態では、翻訳部６６は、原文領域ごとに、当該原文領域から取得された文字行画像の一つ一つに含まれる文字を認識することにより、当該原文領域に含まれる原文を認識する。そして、翻訳部６６は、原文領域ごとに、認識した原文の翻訳文のテキストを生成する。ここでは、第２言語は英語である。 In the present embodiment, the translation unit 66 recognizes the original text included in the original text area by recognizing the characters included in each character line image acquired from the original text area for each original text area. . And the translation part 66 produces | generates the text of the translated sentence of the recognized original sentence for every original sentence area | region. Here, the second language is English.

親文字サイズ決定部６８は、中央処理部４０を中心として実現される。親文字サイズ決定部６８は、原文領域ごとに、後述する翻訳文書生成部７６が当該原文領域内に翻訳文の文字を親文字として配置する際の文字サイズを決定する。ここでは、文字サイズとは、フォントサイズである。 The parent character size determination unit 68 is realized with the central processing unit 40 as the center. The parent character size determination unit 68 determines, for each source text area, a character size when a translation document generation unit 76, which will be described later, arranges the text of the translation sentence as a parent character in the source text area. Here, the character size is the font size.

本実施形態では、親文字サイズ決定部６８は、原文領域ごとに、当該原文領域から取得された文字行画像の縦幅から、当該原文領域に含まれる原文の文字サイズを特定し、特定した文字サイズより大きい文字サイズ（例えば、特定した文字サイズより１ポイント大きい文字サイズ）を、当該原文領域に配置される親文字の文字サイズとする。 In the present embodiment, the parent character size determination unit 68 specifies the character size of the original text included in the original text area from the vertical width of the character line image acquired from the original text area for each original text area, and specifies the specified character. A character size larger than the size (for example, a character size one point larger than the specified character size) is set as the character size of the parent character arranged in the original text area.

配置領域設定部７０は、中央処理部４０を中心として実現される。配置領域設定部７０は、親文字領域設定部７２と、ルビ文字領域設定部７４と、を含む。 The arrangement area setting unit 70 is realized with the central processing unit 40 as the center. The arrangement region setting unit 70 includes a parent character region setting unit 72 and a ruby character region setting unit 74.

親文字領域設定部７２は、原文領域ごとに、当該原文領域のサイズと、翻訳文の文字数と、当該原文領域に配置される親文字の文字サイズと、に基づいて、親文字を配置する親文字領域を当該原文領域内に設定する。 The parent character area setting unit 72 sets, for each source text area, a parent character that arranges a parent character based on the size of the source text area, the number of characters in the translated sentence, and the character size of the parent text placed in the source text area. Set the character area in the source text area.

本実施形態では、親文字領域設定部７２は、原文領域の横幅と、当該原文領域に配置される親文字の文字サイズの横幅と、に基づいて、当該原文領域に配置可能な親文字の一行あたりの文字数を算出する。そして、親文字領域設定部７２は、翻訳文の文字数（親文字の文字数）から、原文領域に親文字を配置した場合の行数を算出する。例えば、原文領域に配置可能な親文字の一行あたりの文字数が１００文字で、親文字の文字数が４５０文字であれば、算出される行数は５行となる。 In the present embodiment, the parent character area setting unit 72 uses a line of parent characters that can be placed in the source text area based on the width of the source text area and the width of the character size of the parent text placed in the source text area. Calculate the number of characters per. Then, the parent character area setting unit 72 calculates the number of lines when the parent characters are arranged in the original sentence area from the number of characters of the translated sentence (number of characters of the parent character). For example, if the number of characters per line that can be arranged in the original text area is 100 characters and the number of characters of the parent character is 450 characters, the calculated number of lines is 5.

そして、親文字領域設定部７２は、原文領域内に、算出した行数と同数の配置領域を設定する。具体的には、親文字領域設定部７２は、算出した行数で原文領域を横に等分してできた矩形領域の一つ一つを、配置領域として設定する。なお、図６（ａ）は、算出した行数が３行である場合に、原文領域内に設定される配置領域の一例を示す。 The parent character area setting unit 72 sets the same number of arrangement areas as the calculated number of lines in the original sentence area. Specifically, the parent character area setting unit 72 sets each rectangular area formed by equally dividing the original text area by the calculated number of lines as an arrangement area. FIG. 6A shows an example of an arrangement area set in the original sentence area when the calculated number of lines is three.

そして、親文字領域設定部７２は、図６（ｂ）に示すように、各配置領域の底部に、親文字の文字サイズの縦幅をその縦幅とし、原文領域の横幅をその横幅とする親文字領域を設定する。従って、各親文字領域は、等間隔に設定される。 Then, as shown in FIG. 6B, the parent character area setting unit 72 sets the vertical width of the character size of the parent character as the vertical width and the horizontal width of the original text area as the horizontal width at the bottom of each arrangement area. Set the parent character area. Therefore, each parent character area is set at equal intervals.

ルビ文字領域設定部７４は、原文領域ごとに、当該原文領域内に、原文の文字をルビ文字として配置するためのルビ文字領域を設定する。本実施形態では、ルビ文字領域設定部７４は、ルビ文字領域を、各配置領域内に一つずつ設定する。具体的には、ルビ文字領域設定部７４は、図６（ｂ）に示すように、配置領域内の親文字領域以外の矩形領域をルビ文字領域として設定する。従って、各ルビ文字領域の縦幅は、配置領域の縦幅から親文字領域の縦幅を引いた差となり、各ルビ文字領域の横幅は、原文領域の横幅となる。 The ruby character area setting unit 74 sets, for each original text area, a ruby character area for arranging original text characters as ruby characters in the original text area. In the present embodiment, the ruby character area setting unit 74 sets one ruby character area in each arrangement area. Specifically, as shown in FIG. 6B, the ruby character area setting unit 74 sets a rectangular area other than the parent character area in the arrangement area as the ruby character area. Accordingly, the vertical width of each ruby character area is a difference obtained by subtracting the vertical width of the parent character area from the vertical width of the arrangement area, and the horizontal width of each ruby character area is the horizontal width of the original text area.

翻訳文書生成部７６は、原稿画像の各原文領域内に、翻訳文を親文字として配置するとともに、原文をルビ文字として配置して、翻訳文書を生成する。なお、図７は、図３に示す原稿画像に基づいて生成された翻訳文書を示す。 The translated document generation unit 76 arranges the translated text as a parent character in each original text area of the original image, and arranges the original text as a ruby character to generate a translated document. FIG. 7 shows a translated document generated based on the document image shown in FIG.

本実施形態では、翻訳文書生成部７６は、原稿画像の各原文領域内を特定色で塗り潰す。そして、翻訳文書生成部７６は、図８に示すように、原稿領域ごとに、当該原稿領域内に設定された各ルビ文字領域に、当該原文領域から取得された各文字行画像を配置する。 In the present embodiment, the translation document generation unit 76 fills each original text area of the document image with a specific color. Then, as shown in FIG. 8, the translated document generation unit 76 arranges each character line image acquired from the original text area in each ruby character area set in the original area for each original area.

但し、翻訳文書生成部７６は、場合によっては、各文字行画像を縮小してから配置する。 However, the translation document generation unit 76 arranges the character line images after reducing them in some cases.

すなわち、翻訳文書生成部７６は、注目している原文領域内の各ルビ文字領域に当該原文領域から取得された文字行画像を配置する場合、当該原文領域から取得された文字行画像の中から縦幅の最も大きいもの（注目文字行画像）を選択し、注目文字行画像の縦幅がルビ文字領域の縦幅よりも大きいか否かを判断する。そして、注目文字行画像の縦幅がルビ文字領域の縦幅よりも大きければ、図９に示すように、各文字行画像の縦幅を縮小する。ここでは、各文字行画像の縦幅を、ルビ文字領域の縦幅を注目文字行画像の縦幅で割った商以下の倍率で、縮小する。これにより、各文字行画像の縦幅が、ルビ文字領域の縦幅以下に縮小される。 That is, when the translated document generation unit 76 arranges a character line image acquired from the original text region in each ruby character region in the original text region of interest, the translated document generation unit 76 selects from the character line images acquired from the original text region. The one with the largest vertical width (attention character line image) is selected, and it is determined whether or not the vertical width of the attention character line image is larger than the vertical width of the ruby character area. If the vertical width of the target character line image is larger than the vertical width of the ruby character area, the vertical width of each character line image is reduced as shown in FIG. Here, the vertical width of each character line image is reduced at a magnification equal to or lower than the quotient obtained by dividing the vertical width of the ruby character area by the vertical width of the target character line image. As a result, the vertical width of each character line image is reduced below the vertical width of the ruby character area.

また、翻訳文書生成部７６は、注目している原文領域内の各ルビ文字領域に当該原文領域から取得された文字行画像を配置する場合、当該原文領域内に設定された親文字領域の数よりも当該原文領域から取得された文字行画像の数が多い場合、図１０に示すように各文字行画像の横幅を縮小する。例えば、翻訳文書生成部７６は、各文字行画像の横幅を、親文字領域の数を文字行画像の数で割った商以下の倍率で、縮小する。なお、倍率は、１より小さい所定の倍率であってもよい。 In addition, when the translated document generation unit 76 places a character line image acquired from the original text area in each ruby character area in the original text area of interest, the number of parent character areas set in the original text area If the number of character line images acquired from the original text area is larger than the width of each character line image, as shown in FIG. For example, the translation document generation unit 76 reduces the width of each character line image at a magnification equal to or lower than the quotient obtained by dividing the number of parent character regions by the number of character line images. The magnification may be a predetermined magnification smaller than 1.

そして、翻訳文書生成部７６は、以上のようにして縦幅と横幅の少なくとも一方が縮小された各文字行画像を、各ルビ文字領域にルビ文字領域、翻訳文書生成部７６は、各文字行画像の横幅を縮小した場合、図１１（ａ）に示すように、各文字行画像を横方向に連結してルビ文字領域の横幅の長さで分割する。そして、図１１（ｂ）に示すように、各分割画像を、新たな文字行画像として、各ルビ文字領域に配置する。 Then, the translation document generation unit 76 converts each character line image reduced in at least one of the vertical width and the horizontal width as described above into a ruby character area, and the translation document generation unit 76 converts each character line image When the horizontal width of the image is reduced, as shown in FIG. 11A, the character line images are connected in the horizontal direction and divided by the horizontal width of the ruby character area. Then, as shown in FIG. 11B, each divided image is arranged in each ruby character area as a new character line image.

以上のようにして各文字行画像を各ルビ文字領域に配置すると、翻訳文書生成部７６は、各親文字領域に、親文字サイズ決定部６８により決定された文字サイズで、翻訳文の文字画像を、親文字として配置する。こうして、翻訳文書生成部７６は、翻訳文書の画像データ（ビットマップデータ）を生成する。 When each character line image is arranged in each ruby character area as described above, the translation document generation unit 76 uses the character size determined by the parent character size determination unit 68 in each parent character area, and the character image of the translated sentence. Are placed as parent characters. Thus, the translation document generation unit 76 generates image data (bitmap data) of the translation document.

翻訳結果出力部７８は、中央処理部４０を中心として実現される。翻訳結果出力部７８は、翻訳文書生成部７６により生成された翻訳文書をディスプレイ５０に出力させる。なお、翻訳結果出力部７８は、翻訳文書をプリンタに印刷出力させるようにしてもよい。 The translation result output unit 78 is realized with the central processing unit 40 as the center. The translation result output unit 78 causes the display 50 to output the translation document generated by the translation document generation unit 76. The translation result output unit 78 may cause the printer to print out the translated document.

次に、上記の機能を有する翻訳装置２が、実行する処理の一例を、図１２のフローチャートを参照しながら説明する。なお、本実施形態では、利用者が翻訳装置２に原稿を読み取らせ翻訳指示操作を行った場合に、この処理が実行される。 Next, an example of processing executed by the translation apparatus 2 having the above function will be described with reference to the flowchart of FIG. In the present embodiment, this process is executed when the user causes the translation apparatus 2 to read a document and performs a translation instruction operation.

翻訳装置２は、原稿画像取得部６０の機能により、原稿を読み取って原稿画像の画像データを取得する。そして、翻訳装置２は、取得した原稿画像にスキュー補正や下地除去処理等の公知の高画質化処理を施す（Ｓ１０１）。そして、翻訳装置２は、原文領域特定部６２の機能により、原稿画像のレイアウトを解析し、原稿画像に含まれる原文領域の位置・範囲を特定する（Ｓ１０２）。 The translation device 2 reads the document and acquires the image data of the document image by the function of the document image acquisition unit 60. Then, the translation apparatus 2 performs known image quality enhancement processing such as skew correction and background removal processing on the acquired document image (S101). Then, the translation device 2 analyzes the layout of the document image by using the function of the original text region specifying unit 62 and specifies the position / range of the original text region included in the original image (S102).

そして、翻訳装置２は、文字行画像取得部６４の機能により、原文領域ごとに、原文の各行の文字行領域を特定し、各文字行領域内の画像（すなわち、文字行画像）を複製して取得する（Ｓ１０３）。 Then, the translation device 2 specifies the character line area of each line of the original sentence for each original text area by the function of the character line image acquisition unit 64, and duplicates the image (that is, the character line image) in each character line area. (S103).

そして、翻訳装置２は、公知の処理により、Ｓ１０３のステップで取得した文字行画像ごとに、当該文字行画像に含まれる原文の文字を認識する（Ｓ１０４）。そして、翻訳装置２は、文字行画像ごとに認識した文字を原文領域単位で統合することにより、原文領域ごとに、当該原文領域に含まれる原文を認識する（Ｓ１０５）。そして、翻訳装置２は、原文領域ごとに、認識した原文の翻訳文のテキストを生成する（Ｓ１０６）。 And the translation apparatus 2 recognizes the character of the original text contained in the said character line image for every character line image acquired by the step of S103 by a well-known process (S104). And the translation apparatus 2 recognizes the original text contained in the said original text area | region for every original text area | region by integrating the character recognized for every character line image for every original text area | region (S105). And the translation apparatus 2 produces | generates the text of the recognized translation of the original sentence for every original text area | region (S106).

そして、翻訳装置２は、Ｓ１０２のステップで特定された原文領域のうちから１つの注目原文領域を選択する（Ｓ１０７）。そして、翻訳装置２は、注目原文領域内を所定色（例えば白）で塗り潰す（Ｓ１０８）。ここで、翻訳装置２は、注目原文領域のうち文字行領域以外の領域の平均色を検出し、当該平均色で注目原文領域内を塗り潰すようにしてもよい。 Then, the translation apparatus 2 selects one target original text area from the original text areas specified in step S102 (S107). Then, the translation apparatus 2 fills the target original text area with a predetermined color (for example, white) (S108). Here, the translation apparatus 2 may detect an average color of an area other than the character line area in the target original text area, and fill the target text area with the average color.

そして、翻訳装置２は、親文字サイズ決定部６８の機能により、注目原文領域内に配置する親文字の文字サイズを決定する（Ｓ１０９）。 Then, the translation apparatus 2 determines the character size of the parent character to be arranged in the target original text area by the function of the parent character size determination unit 68 (S109).

すなわち、翻訳装置２は、注目原文領域から取得された文字行画像の１つに着目し、その縦幅に基づいて、注目原文領域に含まれる原文の文字サイズを特定する。そして、翻訳装置２は、特定した文字サイズより１ポイント大きい文字サイズを、注目原文領域に配置する親文字の文字サイズとして決定する。 That is, the translation device 2 focuses on one of the character line images acquired from the target original text area, and specifies the character size of the original text included in the target text area based on the vertical width. Then, the translation apparatus 2 determines a character size that is one point larger than the specified character size as the character size of the parent character to be arranged in the target original text area.

また、翻訳装置２は、着目している文字行画像に基づいて、注目原文領域に含まれる原文の文字色を検知し、検知した文字色とは異なる文字色を、親文字の文字色として決定する（Ｓ１１０）。例えば、翻訳装置２は、検知した文字色と所定色差のある文字色を、親文字の文字色として決定する。 Further, the translation device 2 detects the character color of the original text included in the target original text region based on the focused character line image, and determines a character color different from the detected character color as the character color of the parent character. (S110). For example, the translation device 2 determines a character color having a predetermined color difference from the detected character color as the character color of the parent character.

そして、翻訳装置２は、親文字領域設定部７２の機能により、注目原文領域に親文字を配置した場合の行数を算出し、注目原文領域内に、算出した行数と同数の配置領域を設定する。そして、翻訳装置２は、各配置領域の底部に親文字領域を設定する（図６参照）。また、翻訳装置２は、ルビ文字領域設定部７４の機能により、各配置領域内の親文字領域以外の矩形領域を、ルビ文字領域として設定する（Ｓ１１１）。 Then, the translation device 2 calculates the number of lines when the parent character is arranged in the target text area by the function of the parent text area setting unit 72, and sets the same number of arrangement areas as the calculated number of lines in the target text area. Set. Then, translation apparatus 2 sets a parent character area at the bottom of each arrangement area (see FIG. 6). Also, the translation device 2 sets a rectangular area other than the parent character area in each arrangement area as the ruby character area by the function of the ruby character area setting unit 74 (S111).

そして、翻訳装置２は、注目原文領域内に設定した親文字領域の数よりも注目原文領域から取得した文字行画像の数が多いか否かを判断する（Ｓ１１２）。そして、親文字領域の数が文字行画像の数以上である場合（Ｓ１１２のＮ）、翻訳装置２は、注目原文領域から取得された文字行画像の中から最も大きい縦幅の文字行画像（注目文字行画像）を選択し、その縦幅がルビ文字領域の縦幅よりも大きいか否かを判断する（Ｓ１１３）。そして、翻訳装置２は、注目文字行画像の縦幅がルビ文字領域の縦幅以下であれば（Ｓ１１３のＮ）、Ｓ１１４に進む。一方、翻訳装置２は、注目文字行画像の縦幅がルビ文字領域の縦幅よりも大きければ（Ｓ１１３のＹ）、各文字行画像の縦幅を、ルビ文字領域の縦幅を注目文字行画像の縦幅で割った商以下の倍率で、縮小し（Ｓ１１５）、Ｓ１１４のステップに進む。 Then, the translation apparatus 2 determines whether or not the number of character line images acquired from the target text area is larger than the number of parent character areas set in the target text area (S112). If the number of parent character areas is equal to or greater than the number of character line images (N in S112), the translation device 2 determines that the character line image having the largest vertical width (from the character line images acquired from the original text area of interest) The target character line image) is selected, and it is determined whether or not the vertical width is larger than the vertical width of the ruby character area (S113). If the vertical width of the target character line image is equal to or smaller than the vertical width of the ruby character area (N in S113), the translation device 2 proceeds to S114. On the other hand, if the vertical width of the target character line image is larger than the vertical width of the ruby character area (Y in S113), the translation apparatus 2 sets the vertical width of each character line image to the vertical width of the ruby character area. The image is reduced at a magnification equal to or lower than the quotient divided by the vertical width of the image (S115), and the process proceeds to step S114.

一方、注目原文領域内に設定した親文字領域の数よりも注目原文領域から取得した文字行画像の数が多い場合（Ｓ１１２のＹ）、翻訳装置２は、各文字行画像の横幅を、親文字領域の数を文字行画像の数で割った商以下の倍率で、縮小する（Ｓ１１６）。なお、ここでの倍率は１より小さい所定の倍率であってもよい。そして、翻訳装置２は、注目文字行画像の縦幅がルビ文字領域の縦幅よりも大きいか否かを判断する（Ｓ１１７）。そして、翻訳装置２は、注目文字行画像の縦幅がルビ文字領域の縦幅以下であれば（Ｓ１１７のＮ）、Ｓ１１８に進む。一方、翻訳装置２は、注目文字行画像の縦幅がルビ文字領域の縦幅よりも大きければ（Ｓ１１７のＹ）、各文字行画像の縦幅を、ルビ文字領域の縦幅を注目文字行画像の縦幅で割った商以下の倍率で縮小して（Ｓ１１９）、Ｓ１１８のステップに進む。なお、翻訳装置２は、Ｓ１１６及びＳ１１９のステップにおいて、各文字行画像を全体的に縮小するようにしてもよい。 On the other hand, when the number of character line images acquired from the target text area is larger than the number of parent character areas set in the target text area (Y in S112), the translation apparatus 2 determines the width of each character line image as the parent text area. Reduction is performed at a magnification equal to or less than the quotient obtained by dividing the number of character areas by the number of character line images (S116). Note that the magnification here may be a predetermined magnification smaller than 1. Then, the translation device 2 determines whether or not the vertical width of the target character line image is larger than the vertical width of the ruby character area (S117). If the vertical width of the target character line image is equal to or smaller than the vertical width of the ruby character area (N in S117), the translation apparatus 2 proceeds to S118. On the other hand, if the vertical width of the target character line image is larger than the vertical width of the ruby character area (Y in S117), the translation apparatus 2 sets the vertical width of each character line image to the vertical width of the ruby character area. The image is reduced at a magnification equal to or smaller than the quotient divided by the vertical width of the image (S119), and the process proceeds to step S118. Note that the translation apparatus 2 may reduce each character line image as a whole in steps S116 and S119.

そして、翻訳装置２は、Ｓ１１８のステップにおいて、各文字行画像を横方向に連結してルビ文字領域の横幅の長さで分割し、各分割画像を、新たな文字行画像として、Ｓ１１４のステップに進む。 Then, in step S118, translation apparatus 2 concatenates each character line image in the horizontal direction and divides it by the horizontal width of the ruby character region, and sets each divided image as a new character line image, step S114. Proceed to

そして、翻訳装置２は、Ｓ１１４のステップにおいて、各文字行画像を、各ルビ文字領域に配置し、各親文字領域に、翻訳文の文字の画像、すなわち、親文字の画像を、Ｓ１０９のステップで決定された文字サイズで配置する（Ｓ１２０）。 Then, in step S114, the translation device 2 arranges each character line image in each ruby character area, and in each parent character area, a character image of the translated sentence, that is, a parent character image, is placed in step S109. The character size determined in step S120 is arranged (S120).

そして、翻訳装置２は、すべての原文領域を対象にＳ１０８乃至Ｓ１２０のステップを実行したか否かを判断する（Ｓ１２１）。そして、すべての原文領域を対象にＳ１０８乃至Ｓ１２０のステップを実行していない場合（Ｓ１２１のＮ）、翻訳装置２は、他の注目原文領域を選択して、Ｓ１０８以降のステップを実行する。一方、すべての原文領域を対象にＳ１０８乃至Ｓ１２０のステップを実行した場合（Ｓ１２１のＹ）、翻訳装置２は、ここでまでのステップにより生成した翻訳文書の画像データに基づいて、翻訳文書をディスプレイ５０に出力したり、印刷出力したりする（Ｓ１２２）。以上が、翻訳装置２が実行する処理の具体的内容である。なお、翻訳装置２は、原稿が複数頁からなる場合は、各頁の原稿画像について図１２の処理を実行すればよい。 Then, the translation apparatus 2 determines whether or not steps S108 to S120 have been executed for all the original text regions (S121). If the steps S108 to S120 are not executed for all the original text regions (N in S121), the translation apparatus 2 selects another target text region and executes the steps after S108. On the other hand, when the steps S108 to S120 are executed for all the original text regions (Y in S121), the translation apparatus 2 displays the translation document based on the image data of the translation document generated by the steps so far. 50 or print out (S122). The above is the specific content of the processing executed by the translation apparatus 2. Note that the translation apparatus 2 may perform the process of FIG. 12 on the original image of each page when the original consists of a plurality of pages.

以上のように、本発明の実施形態に係る翻訳装置２によれば、原文の文字がルビ文字として配置され、翻訳文の文字が親文字として配置された翻訳文書が出力されるので、利用者にとっての利便性が向上する。 As described above, according to the translation device 2 according to the embodiment of the present invention, a translated document in which the original text characters are arranged as ruby characters and the translated text characters are arranged as parent characters is output. Convenience for the user is improved.

本発明の実施形態は、上記実施形態だけに限らない。 The embodiment of the present invention is not limited to the above embodiment.

例えば、翻訳装置２は、Ｓ１０５のステップで認識した原文のテキストの画像を各ルビ文字領域に描画することにより翻訳文書の画像データを生成するようにしてもよい。但し、翻訳装置２は、ルビ文字（原文の文字）の画像を、ルビ文字のサイズが親文字（翻訳文の文字）よりも小さくなるように描画する。 For example, the translation apparatus 2 may generate image data of the translation document by drawing an image of the original text recognized in step S105 in each ruby character area. However, the translation device 2 draws the image of the ruby character (original character) so that the size of the ruby character is smaller than that of the parent character (translated character).

また、例えば、翻訳装置２は、Ｓ１１１のステップにおいて、各親文字領域を、各配置領域の底部ではなく、上部に設定するようにしてもよい。すなわち、親文字の各行の文字列の下にルビ文字の各行の文字列を配置するようにしてもよい。 Further, for example, the translation apparatus 2 may set each parent character area not at the bottom of each arrangement area but at the top in the step of S111. That is, the character string of each line of ruby characters may be arranged below the character string of each line of the parent character.

また、例えば、翻訳装置２は、Ｓ１１５のステップにおいて、各文字行画像を、全体的に縮小するようにしてもよい。この場合、翻訳装置２は、Ｓ１１５のステップの後、Ｓ１１８のステップと同様に、各文字行画像を横方向に連結してルビ文字領域の横幅の長さで分割し、各分割画像を、新たな文字行画像として、Ｓ１１４のステップに進む。 For example, the translation apparatus 2 may reduce each character line image as a whole in step S115. In this case, after the step of S115, as in the step of S118, the translation apparatus 2 concatenates each character line image in the horizontal direction and divides it by the horizontal width of the ruby character region, and each divided image is newly added. As a simple character line image, the process proceeds to step S114.

また、例えば、翻訳装置２は、Ｓ１１１のステップにおいて、各配置領域を、原文領域の上部から下部になるに従ってその縦幅が徐々に大きくなるように設定するようにしてもよい。これにより、各親文字領域は、その間隔が原文領域の上部から下部になるに従って徐々に大きくなるように設定される。なお、この場合、翻訳装置２は、Ｓ１１３やＳ１１７のステップにおいて、注目文字行画像の縦幅が、最上部の配置領域に設定したルビ文字領域の縦幅よりも大きいか否かを判断するようにし、Ｓ１１５やＳ１１９のステップにおいて、当該ルビ文字領域の縦幅を注目文字行画像の縦幅で割った商以下の倍率で、各文字行画像の縦幅を縮小することが考えられる。 For example, the translation apparatus 2 may set each arrangement area so that the vertical width gradually increases from the upper part to the lower part of the original sentence area in step S111. Thereby, each parent character area is set so that the interval gradually increases from the top to the bottom of the original text area. In this case, the translation apparatus 2 determines whether or not the vertical width of the target character line image is larger than the vertical width of the ruby character area set in the uppermost arrangement area in steps S113 and S117. In step S115 or S119, it is conceivable to reduce the vertical width of each character line image by a magnification equal to or less than the quotient obtained by dividing the vertical width of the ruby character area by the vertical width of the target character line image.

また、例えば、翻訳装置２は、原稿画像自身に親文字とルビ文字とを配置しているが、原稿画像の複製画像に親文字とルビ文字とを配置するようにしてもよい。 Further, for example, the translation device 2 arranges the parent character and the ruby character in the document image itself, but may arrange the parent character and the ruby character in a duplicate image of the document image.

また、翻訳装置２は、上記実施形態では翻訳文書を画像データ形式で生成しているが、他のデータ形式で生成するようにしてもよい。例えば、翻訳装置２は、原稿画像（正確には、各原文領域に文字行画像が配置された状態の原稿画像）に対する親文字の描画位置、描画サイズ、描画色等のデータを含む描画データとして、翻訳文書を生成するようにしてもよい。 Moreover, although the translation apparatus 2 has produced | generated the translation document in the image data format in the said embodiment, you may make it produce | generate in another data format. For example, the translation device 2 uses drawing data including data such as a drawing position, a drawing size, and a drawing color of a parent character for a document image (more precisely, a document image in which a character line image is arranged in each original text area). A translation document may be generated.

また、本発明は、第１言語と第２言語がどのような言語であっても、適用可能である。例えば、本発明は、フランス語をドイツ語に翻訳する場合にも適用可能である。 In addition, the present invention is applicable regardless of the language of the first language and the second language. For example, the present invention can be applied to a case where French is translated into German.

また、本発明は、横書きの原文を含む原稿だけでなく、縦書きの原文を含む原稿にも適用可能である。 Further, the present invention can be applied not only to a document including a horizontally written original but also to a document including a vertically written original.

本発明の実施形態に係る翻訳装置のハードウェア構成の一例を示す図である。It is a figure which shows an example of the hardware constitutions of the translation apparatus which concerns on embodiment of this invention. 本発明の実施形態に係る翻訳装置の機能ブロックの一例を示す図である。It is a figure which shows an example of the functional block of the translation apparatus which concerns on embodiment of this invention. 原稿画像の一例を示す図である。It is a figure which shows an example of a manuscript image. 文字行領域を特定する様子を説明するための図である。It is a figure for demonstrating a mode that a character line area is specified. 文字行領域を到底する様子を説明するための図である。It is a figure for demonstrating a mode that a character line area is reached. 原文領域に、配置領域、親文字領域、ルビ文字領域を設定する様子を説明するための図である。It is a figure for demonstrating a mode that an arrangement | positioning area | region, a parent character area | region, and a ruby character area | region are set to an original sentence area | region. 翻訳文書の一例を示す図である。It is a figure which shows an example of a translation document. 原文領域に文字行画像を配置する様子を説明するための図である。It is a figure for demonstrating a mode that a character line image is arrange | positioned in an original text area | region. 文字行画像の縦幅を縮小する様子を説明するための図である。It is a figure for demonstrating a mode that the vertical width of a character line image is reduced. 文字行画像の横幅を縮小する様子を説明するための図である。It is a figure for demonstrating a mode that the horizontal width of a character line image is reduced. 新たな文字行画像を生成する様子を説明するための図である。It is a figure for demonstrating a mode that a new character line image is produced | generated. 本発明の実施形態に係る翻訳装置が実行する処理の一例を示すフローチャート図である。It is a flowchart figure which shows an example of the process which the translation apparatus concerning embodiment of this invention performs.

Explanation of symbols

２翻訳装置、４０中央処理部、４２主記憶部、４４補助記憶部、４６画像読取部、４８操作部、５０ディスプレイ、５４データバス、６０原稿画像取得部、６２原文領域特定部、６４文字行画像取得部、６６翻訳部、６８親文字サイズ決定部、７０配置領域設定部、７２親文字領域設定部、７４ルビ文字領域設定部、７６翻訳文書生成部、７８翻訳結果出力部。 2 translation device, 40 central processing unit, 42 main storage unit, 44 auxiliary storage unit, 46 image reading unit, 48 operation unit, 50 display, 54 data bus, 60 original image acquisition unit, 62 original text region specifying unit, 64 character line Image acquisition unit, 66 translation unit, 68 parent character size determination unit, 70 arrangement region setting unit, 72 parent character region setting unit, 74 ruby character region setting unit, 76 translation document generation unit, 78 translation result output unit

Claims

Document image acquisition means for acquiring a document image including a source text area;
Original text area specifying means for specifying the original text area of the original image;
Translation means for recognizing an original text composed of characters included in the original text area of the original image and generating a translated text;
In the original text area of the original image, a translated text is arranged as a parent character, a translated text is arranged as a ruby character, and a translated document is generated to generate a translated document
The translation apparatus characterized by including.

Character size specifying means for specifying the character size of the original text included in the original text area;
Character size determining means for determining a character size of a parent character based on the character size specified by the character size specifying means;
The translation apparatus according to claim 1, further comprising:

Based on the size of the original text area, the number of characters of the translated text, and the determined character size of the parent character, a translated text area in which the translated text consisting of at least one line is arranged in the original text area. A translation area setting means to be set to
Ruby character area setting means for setting a ruby character area in which an original sentence consisting of at least one line is arranged between the translated sentence areas;
The translation apparatus according to claim 2, further comprising:

A character line image acquisition means for acquiring a character line image of each line of the original text included in the original text area;
The translated document generation means arranges each character line image in each ruby character region.
The translation apparatus according to claim 3.

Determining means for determining whether or not the height of each character line image is larger than the interval between translated sentence regions;
First reduction means for reducing a character line image whose height is determined to be larger than the interval so that the height is equal to or less than the interval;
The translation apparatus according to claim 4, further comprising:

A second reduction means for reducing the length of each character line image at a magnification smaller than 1 when the number of the character line images is larger than the number of lines of translated sentences arranged in the original sentence region;
Dividing the concatenated image of each character line image reduced by the second reduction means by the length of each ruby character region, and making each divided image a new character line image;
The translation apparatus according to claim 4 or 5, characterized in that:

The magnification smaller than 1 is a magnification equal to or lower than the quotient obtained by dividing the number of translated text lines arranged in the original text area by the number of character line images.
The translation apparatus according to claim 6.

A document image acquisition means for acquiring a document image including an original area;
Original text area specifying means for specifying the original text area of the document image;
Translation means for recognizing and translating an original text consisting of characters included in the original text area of the original image;
A translation document generating means for arranging a translation sentence as a parent character in the original sentence area of the original image and arranging the original sentence as a ruby character to generate a translation document;
As a program that allows the computer to function.