JP2007052614A

JP2007052614A - Documentat management device, document management system and document management method

Info

Publication number: JP2007052614A
Application number: JP2005236981A
Authority: JP
Inventors: Akio Yamashita; 明男山下; Kazunari Hashimoto; 一成橋本
Original assignee: Fuji Xerox Co Ltd
Current assignee: Fujifilm Business Innovation Corp
Priority date: 2005-08-17
Filing date: 2005-08-17
Publication date: 2007-03-01
Anticipated expiration: 2025-08-17
Also published as: JP4797507B2

Abstract

<P>PROBLEM TO BE SOLVED: To make it possible to provide accurate translation data in a short time without need of conversion time or translation time and to provide the latest version of translation origin to a user only by reading the old version of an original by providing data to a user using the latest version of translation data which are preliminarily stored in a storage means. <P>SOLUTION: In a translation server, when the latest version of translation data corresponding to an original text preliminarily exist in a document database (step S14: YES), the translation data are used. When not translation data but the latest version of document data exist, the document data are translated to obtain translation data (step S17). When neither the latest version of document data nor translation data exist, the document data obtained from the original are translated (step S19) to use the translation data. <P>COPYRIGHT: (C)2007,JPO&INPIT

Description

本発明は、例えば古いバージョンの文書データから最新版の文書データが得られる文書管理装置、文書管理システムおよび文書管理方法に関する。 The present invention relates to a document management apparatus, a document management system, and a document management method that can obtain the latest version of document data from, for example, an old version of document data.

近年のグローバルな情報流通、経済活動など、国境を越えた活動の広がりにより、ある言語（例えば、日本語）から別の言語（例えば、英語、中国語）へ変換する翻訳の需要が高まっている。しかし、翻訳を業者（翻訳者）に依頼すると、一般に価格が高く、また時間もかかってしまうため、コンピュータなどの機械を用いて自動的に翻訳する自動翻訳の需要が高まっている。 Due to the spread of cross-border activities such as global information distribution and economic activities in recent years, there is an increasing demand for translation from one language (eg Japanese) to another (eg English, Chinese). . However, when a contractor (translator) is requested to translate, it is generally expensive and time-consuming, so that there is an increasing demand for automatic translation using a machine such as a computer.

一般的に、自動翻訳は、文章が印刷された紙をスキャナ装置で走査してイメージ化された画像データを取得し、この画像データを文字認識（例えば、ＯＣＲ（Optical Character Recognition：光学的文字認識））処理によってテキスト化した文書データに変換し、この文書データを機械翻訳のソフトウェアを用いて第１言語から第２言語に翻訳するものである。
この機械翻訳のソフトウェアは、文書解析によりテキスト化された文書データから単語単位で文字を認識させ、翻訳辞書を参照して単語の意味を導き、係り受けの関係から翻訳文を作成するといった具合で行っている。 In general, automatic translation is performed by scanning a paper on which text is printed with a scanner device to obtain imaged image data, and characterizing the image data (for example, OCR (Optical Character Recognition)). )) The document data is converted into text data by processing, and the document data is translated from the first language into the second language using machine translation software.
This machine translation software recognizes characters in word units from document data that has been converted to text by document analysis, derives the meaning of words by referring to a translation dictionary, and creates a translation sentence from a dependency relationship. Is going.

また、機械翻訳では、その翻訳精度は翻訳辞書に記憶された単語数に依存しているため、記憶容量に対応したデータ数によって機械翻訳の精度が設定されてしまう。さらに、日本語を他の言語に翻訳する場合を考えると、日本語は主語を省略して記載する場合があるため、機械翻訳では省略されている主語を補って翻訳することは困難である。
これに対し、人による翻訳は、辞書に記憶されていない新語に対しても、翻訳者の経験によって対応させて翻訳させることもでき、前後の文章から省略されている主語を補完して翻訳することも可能である。
このように、機械翻訳による翻訳精度は、文字認識による精度と機械翻訳のソフトウェアによる精度によって決められしまうため、人による翻訳よりも精度が落ちることになる。例えば、第２言語で記載された翻訳データを第１言語に戻す場合、２度翻訳処理を行うため、得られた文書データはかなり難解な文章となってしまい、理解できなくなってしまう虞がある。 In machine translation, since the translation accuracy depends on the number of words stored in the translation dictionary, the accuracy of machine translation is set by the number of data corresponding to the storage capacity. Further, considering the case of translating Japanese into another language, Japanese may be described with the subject omitted, so that it is difficult to translate with the subject omitted in machine translation.
On the other hand, human translation can also translate new words that are not stored in the dictionary according to the experience of the translator, complementing the subject that is omitted from the previous and subsequent sentences. It is also possible.
Thus, since the translation accuracy by machine translation is determined by the accuracy by character recognition and the accuracy by machine translation software, the accuracy is lower than human translation. For example, when the translation data described in the second language is returned to the first language, the translation processing is performed twice, so that the obtained document data becomes a rather difficult sentence and may not be understood. .

ここで、翻訳精度を上げる技術としては、文字認識により認識した文字を翻訳する際、翻訳辞書に該当する単語が存在しない場合には、自動的に単語の修正を行う技術（特許文献１、参照）、文字列の一部或いは全部を置換したバーコードを原稿に印刷し、必要に応じて専用のバーコード読取装置を用いて、このバーコードを読取って文字列に変換する技術（特許文献２、参照）がある。 Here, as a technique for improving the translation accuracy, when a word recognized by character recognition is translated, if a corresponding word does not exist in the translation dictionary, a technique for automatically correcting the word (see Patent Document 1, Reference). ), A technique of printing a barcode in which a part or all of a character string is replaced on a document, and reading the barcode and converting it into a character string using a dedicated barcode reader as required (Patent Document 2) , See).

特開平５−５４０７０号公報Japanese Patent Laid-Open No. 5-54070 特開平８−８３２８３号公報JP-A-8-83283

しかし、特許文献１が機械翻訳のソフトウェアの精度を上げるものであり、特許文献２がイメージデータからテキストデータに変換する文字認識の精度を上げるものである。しかし、何れの技術であっても、所詮人による翻訳精度には到達することができない。 However, Patent Document 1 increases the accuracy of machine translation software, and Patent Document 2 increases the accuracy of character recognition for converting image data into text data. However, with any technique, the accuracy of translation by the person concerned cannot be reached.

一方、文書データの管理を行っていないと、ある文書に対して改訂版が存在する場合であっても、利用者が翻訳を要求する原稿が旧バージョンであったとしても、この利用者は最新版の存在を知らずに翻訳処理を行う。このため、利用者に対して最新版の情報を提供することができない。 On the other hand, if document data is not managed, even if there is a revised version for a document, even if the manuscript that the user requests to translate is an old version, this user Perform the translation process without knowing the existence of the version. For this reason, the latest version information cannot be provided to the user.

本発明は、前述した先行技術に鑑みてなされたもので、入力される文書データに関連した文書データがある場合には、その文書データを利用することで、短時間に高い精度の文書データを得るとともに、例えば、関連したデータが改訂による最新のバージョン情報の場合には、利用者に対して最新の情報を容易に提供することが可能となる文書管理装置、文書管理システムおよび文書管理方法を提供することを目的とする。 The present invention has been made in view of the above-described prior art. When there is document data related to input document data, the document data is used to obtain highly accurate document data in a short time. For example, when the related data is the latest version information by revision, a document management apparatus, a document management system, and a document management method capable of easily providing the latest information to the user are provided. The purpose is to provide.

前記目的を達成するために、本発明が採用する文書管理装置は、文書データおよび当該文書データを識別する識別情報が格納された文書データ格納手段と、前記文書データ格納手段に格納された前記識別情報を所定の区分ルールに従ってグループとして扱う識別情報処理手段と、識別情報とともに入力される文書データを受け取る受信手段と、前記受信手段が受け取った識別情報と同一の識別情報が前記文書データ格納手段に記憶されているか否かを判定する第１の判定手段と、前記第１の判定手段の判定が肯定的である場合に、前記識別情報処理手段によって同一のグループとして区分されている識別情報が前記文書データ格納手段内にあるか否かを判定する第２の判定手段と、前記第２の判定手段の判定が肯定的な場合に、前記同一のグループ内の他の識別情報を予め決められた選択ルールに基づいて選択し、選択された識別情報に対応する文書データを前記文書データ格納手段から読み出して出力する文書データ出力手段と、を具備したことを特徴とする文書管理装置を提供する。 In order to achieve the above object, the document management apparatus employed by the present invention includes document data and document data storage means storing identification information for identifying the document data, and the identification stored in the document data storage means. Identification information processing means for handling information as a group according to a predetermined classification rule, receiving means for receiving document data input together with identification information, and identification information identical to the identification information received by the receiving means is stored in the document data storage means When the determination of the first determination means for determining whether or not stored and the determination of the first determination means is affirmative, the identification information classified as the same group by the identification information processing means The second determination unit that determines whether or not the document data storage unit is in the document data storage unit and the same determination unit when the determination by the second determination unit is positive. Document data output means for selecting other identification information in the group based on a predetermined selection rule, and reading out and outputting document data corresponding to the selected identification information from the document data storage means. A document management apparatus characterized by the above is provided.

この発明によれば、第１の判定手段により識別情報と同一の識別情報が文書データ格納手段に記憶されていると判定され、第２の判定手段により同一のグループとして区分されている識別情報が文書データ格納手段内にあると判定された場合、文書データ出力手段が、予め決められた選択ルールに基づき、同一のグループ内の他の識別情報に対応する文書データを文書データ格納手段から読み出して出力する。これにより、識別情報をキーとして、入力される文書データに対応した文書データを出力することができる。
例えば、入力される文書データがバージョンの古い文書に対応したものであって、最新の文書データが文書データ格納手段に格納されている場合には、入力される文書データの識別情報をキーにして最新の文書データが出力される。 According to this invention, it is determined by the first determination means that the same identification information as the identification information is stored in the document data storage means, and the identification information classified as the same group by the second determination means If it is determined that the document data is stored in the document data storage unit, the document data output unit reads out document data corresponding to other identification information in the same group from the document data storage unit based on a predetermined selection rule. Output. Thereby, the document data corresponding to the input document data can be output using the identification information as a key.
For example, when the input document data corresponds to an old document and the latest document data is stored in the document data storage means, the identification information of the input document data is used as a key. The latest document data is output.

上記構成において、前記区分ルールは、前記識別情報の格納エリアによる区分であることが好ましい。 The said structure WHEREIN: It is preferable that the said division | segmentation rule is the division by the storage area of the said identification information.

上記構成において、前記識別情報は、予め決められた体系に従った内容に設定され、前記区分ルールは、前記識別情報の内容の共通事項によって区分するルールであることが好ましい。 In the above configuration, it is preferable that the identification information is set to contents according to a predetermined system, and the classification rule is a rule for classifying according to common matters of the contents of the identification information.

上記構成において、前記識別情報は、文書の種類を示すコードとバージョン情報とを含んだ所定の体系に従った内容に設定され、前記第２の判定手段における予め決められた選択ルールは、最新のバージョン情報を含んだ識別情報を選択するというルールに設定されていることが好ましい。 In the above configuration, the identification information is set to a content according to a predetermined system including a code indicating a document type and version information, and the predetermined selection rule in the second determination means is the latest information. It is preferable that the rule is set to select identification information including version information.

上記構成において、前記識別情報は、文書の種類を示すコード、バージョン情報および文書が表される言語形式を含んだ所定の体系に従った内容に設定され、前記第２の判定手段における予め決められた選択ルールは、最新のバージョン情報と指定の言語形式を含んだ識別情報を選択するというルールに設定されていることが好ましい。 In the above configuration, the identification information is set to a content according to a predetermined system including a code indicating a document type, version information, and a language format in which the document is represented, and is determined in advance by the second determination unit. The selection rule is preferably set to a rule for selecting identification information including the latest version information and a specified language format.

上記構成において、前記識別情報は、所定の体系に従った識別コードまたは文書の名称を示す文書名であることが好ましい。 In the above configuration, the identification information is preferably an identification code or a document name indicating a document name according to a predetermined system.

上記構成において、前記文書データ出力手段は、前記他の文書データと前記受信手段が受け取った文書データの相違点を検出し、出力する文書データ中の相違点に対応する部分には、表示上の特徴付けとなる修飾処理を施すことが好ましい。 In the above configuration, the document data output means detects a difference between the other document data and the document data received by the receiving means, and a portion corresponding to the difference in the output document data is displayed on the display. It is preferable to perform a modification process that is a characteristic.

上記構成において、前記文書データ出力手段は、前記他の文書データと前記受信手段が受け取った文書データの相違点を検出するに際し、両者の言語が違う場合には、前記受信手段が受け取った文書データを前記他の文書データの言語に翻訳した後に相違点検出を行うことが好ましい。 In the above configuration, when the document data output means detects the difference between the other document data and the document data received by the receiving means and the language of the two is different, the document data received by the receiving means It is preferable to detect the difference after translating the text into the language of the other document data.

上記構成において、前記第１の判定手段の判定が肯定的でも前記第２の判定手段の判定が否定的となった識別情報については、前記第１の判定手段の判定で肯定的となった識別情報と同一のグループに区分するとともに、当該識別情報およびこれに対応する文書データを前記文書データ格納手段に更新記憶するデータ更新記憶手段を備えることが好ましい。 In the above configuration, for identification information in which the determination by the first determination unit is affirmative but the determination by the second determination unit is negative, the identification that is affirmative by the determination by the first determination unit It is preferable to provide data update storage means for dividing the information into the same group as the information and for storing the identification information and corresponding document data in the document data storage means.

上記構成において、前記更新記憶手段は、前記第１の判定手段の判定が否定的となった識別情報については、新たな区分に分類して、当該識別情報およびこれに対応する文書データを前記文書データ格納手段に更新記憶することが好ましい。 In the above configuration, the update storage means classifies the identification information for which the determination by the first determination means is negative into a new classification, and the identification information and the document data corresponding thereto are classified into the document Preferably, the data storage means is updated and stored.

上記構成において、前記識別情報は、特定文字列または特定画像であることが好ましい。 The said structure WHEREIN: It is preferable that the said identification information is a specific character string or a specific image.

上記構成において、原稿から画像を読み取って識別情報および入力される文書データを生成して前記受信手段に受け渡す画像読取手段を備えることが好ましい。 In the above-described configuration, it is preferable that the image reading unit includes an image reading unit that reads an image from a document, generates identification information and input document data, and delivers the document data to the receiving unit.

本発明に係る文書管理装置によれば、入力される文書データに関連した文書データが予め記憶された文書データ格納手段に格納されている場合には、その文書データを利用することで、文字認識処理を行う必要がなく、精度の高い文書を得ることができる。しかも、格納された文書データであるため、文字認識等の変換時間が必要ないため、短時間で正確な文書データを得ることができる。 According to the document management apparatus of the present invention, when document data related to input document data is stored in a previously stored document data storage unit, character recognition is performed by using the document data. There is no need to perform processing, and a highly accurate document can be obtained. In addition, since the stored document data does not require conversion time such as character recognition, accurate document data can be obtained in a short time.

例えば、関連したデータが改訂による複数個の文書データの場合には、入力される文書が旧バージョンの文書データであった際、この文書に対して最新版の文書データが文書データ格納手段に格納されていれば、最新版の文書データを出力でき、利用者に対して最新の文書データを提供することが可能となる。 For example, when the related data is a plurality of document data by revision, when the input document is the old version of the document data, the latest version of the document data is stored in the document data storage means for this document. If so, the latest document data can be output, and the latest document data can be provided to the user.

＜Ａ．第１実施形態＞
以下、図面を参照し、本発明に係る第１実施形態を説明する。図１は、本実施形態による翻訳システム１の全体構成を示す図である。このシステム１は、ネットワーク３００を介して接続された複合機１００および翻訳サーバ２００を具備している。
複合機１００は、コピー、スキャナ、プリンタの機能を備える。複合機１００は、ネットワーク３００を介して翻訳サーバ２００或いはパソコン等の利用者端末（図示せず）に接続されている。翻訳サーバ２００は、翻訳処理を専門に行うサーバ装置であり、複合機１００から原文に対応するデータを受信し、受信した原文に対して関連した翻訳データを生成して返信する機能を有する。 <A. First Embodiment>
A first embodiment according to the present invention will be described below with reference to the drawings. FIG. 1 is a diagram showing an overall configuration of a translation system 1 according to the present embodiment. The system 1 includes a multifunction peripheral 100 and a translation server 200 connected via a network 300.
The multifunction device 100 has functions of a copy, a scanner, and a printer. The multifunction device 100 is connected to a user terminal (not shown) such as a translation server 200 or a personal computer via a network 300. The translation server 200 is a server device that specializes in translation processing, and has a function of receiving data corresponding to the original text from the multi-function peripheral 100, generating related translation data for the received original text, and returning the data.

本実施形態によるシステム１は、関連文書を改訂による得られるバージョンの異なった取扱説明書とし、システム１は、複合機１００に読み込まれる原文Ｍが旧バージョンの文書であった場合には、最新版の原文に対応した翻訳データに基づいた翻訳原稿Ｔが得られるものである。 In the system 1 according to the present embodiment, the related document is a different version of the instruction manual obtained by the revision, and the system 1 is the latest version when the original text M read into the multifunction peripheral 100 is an old version document. A translation manuscript T based on translation data corresponding to the original is obtained.

＜Ａ−１．複合機１００の構成＞
図２は、複合機１００の機能構成を示すブロック図である。この複合機１００は、読み取り部１１０、画像データ生成部１２０、指示入力部１３０、送受信部１４０、印刷部１５０およびこれらを制御する制御部１６０を具備する。制御部１６０は、図示しないＣＰＵ，ＲＡＭおよびＲＯＭを備えており、各種処理を司る機能を有する。 <A-1. Configuration of MFP 100>
FIG. 2 is a block diagram illustrating a functional configuration of the multifunction peripheral 100. The multi-function device 100 includes a reading unit 110, an image data generation unit 120, an instruction input unit 130, a transmission / reception unit 140, a printing unit 150, and a control unit 160 that controls them. The control unit 160 includes a CPU, a RAM, and a ROM (not shown) and has a function for performing various processes.

指示入力部１３０は、表示部を兼ねたタッチパネルやスイッチ等から構成され、利用者の操作により各種の指示情報が入力される。その指示情報は、翻訳対象言語（以下、指定言語という），関連文書の種類（旧バージョン、最新版），出力方式（表示、印刷），版の異なった文書中の相違点を表示するか否か等である。
送受信部１４０は、翻訳サーバ２００等のネットワーク３００に接続された外部機器（図示せず）との間でデータの授受を行う。印刷部１５０は、画像データ生成部１２０で生成された画像データを受けて用紙（記録材）上に画像を印刷する。 The instruction input unit 130 includes a touch panel that also serves as a display unit, a switch, and the like, and various instruction information is input by a user operation. Whether the instruction information indicates the language to be translated (hereinafter referred to as the specified language), the type of the related document (old version, latest version), the output method (display, print), or the differences in the different versions of the document Or the like.
The transmission / reception unit 140 exchanges data with an external device (not shown) connected to the network 300 such as the translation server 200. The printing unit 150 receives the image data generated by the image data generation unit 120 and prints an image on a sheet (recording material).

そして、複合機１００は、読み取り部１１０が読み取った画像データを、送受信部１４０を経由して外部の装置に出力することによりスキャナ機として機能する。また、読み取り部１１０が読み取った画像データに応じた画像を画像印刷部１５０から出力することによりコピー機として機能する。さらに、複合機１００は、外部からデータ送受信部１４０を経由して受信したデータに応じた画像を画像印刷部１５０において出力することにより、プリンタ機として機能する。 The multifunction device 100 functions as a scanner device by outputting the image data read by the reading unit 110 to an external device via the transmission / reception unit 140. Further, the image printing unit 150 outputs an image corresponding to the image data read by the reading unit 110 to function as a copier. Further, the multi-function device 100 functions as a printer by outputting an image corresponding to data received from the outside via the data transmission / reception unit 140 in the image printing unit 150.

ここで、図１に示すように、原稿Ｍには、余白の所定位置にこの原文文書ＩＤを示すバーコードＢＭが印刷されている。翻訳原稿Ｔには、余白の所定位置にこの翻訳文書ＩＤを示すバーコードＢＴが印刷される。なお、本実施形態では、原稿Ｍに対し、改訂によって何種類かのバージョンの異なった文書が存在するものとする。 Here, as shown in FIG. 1, the original M is printed with a barcode BM indicating the original document ID at a predetermined position in the margin. On the translated document T, a barcode BT indicating the translated document ID is printed at a predetermined position in the margin. In the present embodiment, it is assumed that several types of different documents exist for the original M due to revision.

＜Ａ−２．翻訳サーバ２００の構成＞
図３は、翻訳サーバ２００の機能構成を示すブロック図である。翻訳サーバ２００は、文書データベース２１０が接続された文書データ管理部２２０、翻訳辞書部２３０、送受信部２４０およびこれらを制御する制御部２５０を具備する。制御部２５０は、図示しないＣＰＵ，ＲＡＭおよびＲＯＭを備えており、各種処理を司る機能を有する。 <A-2. Configuration of Translation Server 200>
FIG. 3 is a block diagram illustrating a functional configuration of the translation server 200. The translation server 200 includes a document data management unit 220 to which a document database 210 is connected, a translation dictionary unit 230, a transmission / reception unit 240, and a control unit 250 for controlling them. The control unit 250 includes a CPU, a RAM, and a ROM (not shown) and has a function for performing various processes.

送受信部２４０は、複合機１００等のネットワーク３００に接続された外部機器（図示せず）との間でデータの授受を行う。翻訳辞書部２３０は、制御部２５０からの指示に基づき、原文データを指示言語に翻訳する際に用いられるものであり、種々の言語に対する辞書を備えている。 The transmission / reception unit 240 exchanges data with an external device (not shown) connected to the network 300 such as the multifunction peripheral 100. The translation dictionary unit 230 is used when the original text data is translated into the instruction language based on the instruction from the control unit 250, and includes dictionaries for various languages.

文書データベース２１０には、テキスト形式の文書データ（ファイル）が文書ＩＤ毎に対応付けて格納されている。この文書データの文書データベース２１０への格納処理は、後述する図６のステップＳ１８における処理のみでなく、複合機１００のスキャナ機能、コピー機能、プリンタ機能を動作させた際、或いは利用者によるマニュアル操作された際に行われる。この際、格納処理は、原稿のバージョンと文書データ（原文データ或いは翻訳データ）を文書ＩＤと共に文書データベース２１０に格納させ、文書データ管理部２２０に翻訳データの文書ＩＤと文書名を記憶更新させる。 In the document database 210, text-format document data (file) is stored in association with each document ID. The storage process of the document data in the document database 210 is not limited to the process in step S18 of FIG. 6 to be described later, but also when the scanner function, copy function, and printer function of the multi-function peripheral 100 are operated, or manually operated by the user. When it is done. At this time, the storage process stores the document version and document data (original text data or translation data) in the document database 210 together with the document ID, and causes the document data management unit 220 to store and update the document ID and document name of the translation data.

文書データ管理部２２０は、文書データベース２１０に格納されている文書データを管理するもので、図４に示すようなグループ化された複数のデータテーブル毎に、それぞれ識別番号と文書名の対として記憶されている。データテーブルは、関連した文書毎に１つ形成される。
本実施形態では、関連した文書を例えば取扱説明書とし、第１版から第３版までの文書データが存在する場合を例に挙げて示す。
このデータテーブルには、例えば、日本語の文書ＩＤと日本語文書名（ファイル名）、英語の文書ＩＤと英語文書名（ファイル名）、中国語の文書ＩＤと中国語文書名（ファイル名）が、改訂のバージョン毎にそれぞれ記憶されている。つまり、このデータテーブルでは、第１版から第３版までの取扱説明書の文書名が、日本語，英語，中国語の言語形式の文書名として記憶され、さらに、この文書名に対応した文書データが文書データベース２１０に格納されることになる。
この例の文書ＩＤは、初めの３文字を共通の文字列にして各文書が関連（例えば、取扱説明書）あることを示し、「−」（ハイフン）以下の添え字の十の位が文書の言語形式（第１言語、第２言語、第３言語）、１の位がバージョン（第１版、第２版、第３版）を示している。 The document data management unit 220 manages the document data stored in the document database 210, and stores it as a pair of identification number and document name for each of a plurality of grouped data tables as shown in FIG. Has been. One data table is created for each related document.
In the present embodiment, for example, a related document is an instruction manual, and document data from the first version to the third version exists as an example.
In this data table, for example, Japanese document ID and Japanese document name (file name), English document ID and English document name (file name), Chinese document ID and Chinese document name (file name), It is stored for each revision version. That is, in this data table, the document names of the instruction manuals from the first version to the third version are stored as document names in Japanese, English, and Chinese language formats, and the document corresponding to this document name Data is stored in the document database 210.
The document ID in this example uses the first three characters as a common character string to indicate that each document is related (for example, an instruction manual), and the tenth digit of the subscript following “-” (hyphen) is the document. Language format (first language, second language, third language), the first place indicates the version (first version, second version, third version).

さらに、図４のデータテーブルについて説明すると、原文が日本語の第１版の文書であった場合には、この文書の文書ＩＤは“００１−１１”となり、この文書ＩＤ“００１−１１”に対応した改訂版の文書ＩＤは、第２版“００１−１２”、第３版“００１−１３”となる。さらに、他の言語形式の対応は、英語の文書ＩＤは、第１版“００１−２１”、第２版“００１−２２”、第３版“００１−２３”となり、中国語の文書ＩＤは、第１版“００１−３１”、第２版“００１−３２”、第３版“００１−３３”となる。
例えば、原文が英語の第２版の文書であった場合には、この文書の文書ＩＤは“００１−２２”となり、この文書ＩＤ“００１−２２”のバージョンの異なった文書ＩＤは、第１版“００１−２１”、第３版“００１−２３”となる。さらに、他の言語形式におけるバージョンの対応は、日本語の文書ＩＤは、第１版“００１−１１”、第２版“００１−１２”、第３版“００１−１３”となり、中国語の文書ＩＤは、第１版“００１−３１”、第２版“００１−３２”、第３版“００１−３３”となる。 Further, the data table of FIG. 4 will be described. When the original text is a first-version document in Japanese, the document ID of this document is “001-11”, and this document ID is “001-11”. The corresponding revised document IDs are the second version “001-12” and the third version “001-13”. Furthermore, the correspondence of other language formats is that the English document IDs are the first version “001-21”, the second version “001-22”, and the third version “001-23”, and the Chinese document ID is The first version “001-31”, the second version “001-32”, and the third version “001-33”.
For example, if the original text is a second edition document in English, the document ID of this document is “001-22”, and the document ID having a different version of this document ID “001-22” is the first document ID. The version is “001-21” and the third version is “001-23”. Furthermore, the correspondence between versions in other language formats is that the Japanese document IDs are the first version “001-11”, the second version “001-12”, and the third version “001-13”. The document IDs are the first version “001-31”, the second version “001-32”, and the third version “001-33”.

また、文書データ管理部２２０には、１つの関連した文書に対して１つのデータテーブルが形成されるため、文書データ管理部２２０では、複数のグループ化した文書を管理するため、複数のデータテーブルを備えることになる。 Further, since one data table is formed for one related document in the document data management unit 220, the document data management unit 220 manages a plurality of data tables in order to manage a plurality of grouped documents. Will be provided.

＜Ａ−３．翻訳システム１の動作＞
次に、図５のシーケンスチャートに基づき、翻訳システム１の全体の動作について説明する。
まず、利用者が複合機１００の指示入力部１３０を操作して指示情報を入力することにより、制御部１６０はこの指示情報を記憶する（ステップＳ１）。この指示情報は、翻訳対象語となる指示言語、改訂版があった場合に出力する版の種類等である。本実施形態では、指示言語を英語、改訂版の出力を最新版とする。 <A-3. Operation of translation system 1>
Next, the overall operation of the translation system 1 will be described based on the sequence chart of FIG.
First, when the user operates the instruction input unit 130 of the multifunction peripheral 100 to input instruction information, the control unit 160 stores the instruction information (step S1). This instruction information includes the instruction language to be translated, the type of version to be output when there is a revised version, and the like. In the present embodiment, the instruction language is English, and the revised version is the latest version.

次に、利用者が読み取り部１１０に原稿Ｍをセットして所定のスイッチを操作することにより、制御部１６０は読み取り部１１０を動作させて原稿Ｍを読み取る。これにより、イメージ形式の原文画像データを取得して記憶する（ステップＳ２）。
さらに、複合機１００の制御部１６０は、記憶された指示情報および原文画像データを送受信部１４０を経由して翻訳サーバ２００に送信する（ステップＳ３）。 Next, when the user sets the document M on the reading unit 110 and operates a predetermined switch, the control unit 160 operates the reading unit 110 to read the document M. Thereby, the original image data in the image format is acquired and stored (step S2).
Further, the control unit 160 of the multifunction peripheral 100 transmits the stored instruction information and original text image data to the translation server 200 via the transmission / reception unit 140 (step S3).

翻訳サーバ２００の制御部２５０では、指示情報および原文画像データを送受信部２４０で受信することにより、後述する翻訳処理を行い（ステップＳ４）、この翻訳処理によって得られた翻訳データを送受信部２４０を経由して複合機１００に送信する（ステップＳ５）。 The control unit 250 of the translation server 200 receives the instruction information and the original text image data by the transmission / reception unit 240 to perform a later-described translation process (step S4), and transmits the translation data obtained by the translation process to the transmission / reception unit 240. Then, the data is transmitted to the multi function device 100 (step S5).

複合機１００の制御部１６０では、送受信部１４０で翻訳データを受信することにより、この翻訳データを画像データ生成部１２０に転送し、画像データ生成部１２０がテキスト形式の翻訳データからイメージ形式の翻訳画像データを生成する（ステップＳ６）。この際、翻訳データにこの翻訳データの文書ＩＤを示すバーコードを印刷するためのデータが付加されている場合には、変換したバーコードの画像データを翻訳画像データに付加する。
さらに、制御部１６０は、画像データ生成部１２０で生成された翻訳画像データを印刷部１５０に転送し、印刷部１５０がこの翻訳画像データを用紙に印刷して翻訳原稿Ｔを出力する（ステップＳ７）。 The control unit 160 of the MFP 100 receives the translation data by the transmission / reception unit 140 and transfers the translation data to the image data generation unit 120. The image data generation unit 120 converts the image format translation data into the image format translation data. Image data is generated (step S6). At this time, if data for printing a barcode indicating the document ID of the translation data is added to the translation data, the converted barcode image data is added to the translation image data.
Further, the control unit 160 transfers the translation image data generated by the image data generation unit 120 to the printing unit 150, and the printing unit 150 prints the translation image data on a sheet and outputs a translation document T (step S7). ).

次に、図６のフローチャートに基づき、翻訳サーバ２００における翻訳処理について説明する。
この翻訳サーバ２００は、複合機１００から送信された指示情報および原文画像データ（文書ＩＤを含む）を送受信部２４０で受信することによって動作を開始する。 Next, the translation processing in the translation server 200 will be described based on the flowchart of FIG.
The translation server 200 starts operation by receiving the instruction information and the original image data (including the document ID) transmitted from the multi-function device 100 by the transmission / reception unit 240.

制御部２５０は、この制御部２５０内でイメージ形式の原文画像データを文字認識処理（例えば、ＯＣＲ処理）してテキスト形式のデータに変換し、文書ＩＤを示すバーコードがある場合には、このバーコード部分の画像データから文書ＩＤを抽出する（ステップＳ１１）。本実施形態の場合には、第１版（日本語）の取扱説明書を示す文書ＩＤ“００１−１１”が抽出されたものとする。 The control unit 250 converts the original text image data in the image format into text format data by performing character recognition processing (for example, OCR processing) in the control unit 250, and if there is a barcode indicating the document ID, The document ID is extracted from the image data of the barcode part (step S11). In the case of the present embodiment, it is assumed that the document ID “001-11” indicating the instruction manual of the first version (Japanese) is extracted.

次に、制御部２５０は、ステップＳ１１の処理により、バーコードによる文書ＩＤが抽出されたか否かを判定する（ステップＳ１２）。制御部２５０は、文書ＩＤが抽出された場合には（ステップＳ１２；ＹＥＳ）、ステップＳ１３に移行する。 Next, the control unit 250 determines whether or not a document ID based on a barcode has been extracted by the process of step S11 (step S12). When the document ID is extracted (step S12; YES), the control unit 250 proceeds to step S13.

制御部２５０は、抽出された文書ＩＤを検索キーとして、文書データ管理部２２０（データーテーブル）の文書ＩＤとを照合させることにより、文書データベース２１０に格納されている文書データおよび翻訳データを検索して検索結果を得る（ステップＳ１３）。
図４のデータテーブルの例では、抽出された文書ＩＤが“００１−１１”である場合には、初めの３文字を検索キーとしてデータテーブルが選択され、日本語の第１版の文書データを始め、第２版，第３版の日本語の文書データ、さらに第１〜第３版の英語の翻訳データ、中国語の翻訳データが文書データベース２１０に格納されていることを検索結果として得る。 The control unit 250 searches the document data and translation data stored in the document database 210 by collating the document ID of the document data management unit 220 (data table) with the extracted document ID as a search key. The search result is obtained (step S13).
In the example of the data table in FIG. 4, when the extracted document ID is “001-11”, the data table is selected using the first three characters as a search key, and the first version document data in Japanese is selected. First, the second and third editions of Japanese document data, and the first to third editions of English translation data and Chinese translation data are stored in the document database 210 as search results.

制御部２５０は、検索結果および指示情報から文書データベース２１０に翻訳データ（この場合、最新版の英語）が格納されているか否かを判定する（ステップＳ１４）。翻訳対象語となる指示言語（英語）で且つ最新版の翻訳データ（この場合、文書ＩＤ“００１−２３”）が文書データベース２１０に格納されていた場合には（ステップＳ１４；ＹＥＳ）、該当する翻訳データ（この場合、文書ＩＤ“００１−２３”）を読み出し、この翻訳データを記憶エリア（図示せず）に一時的に記憶する（ステップＳ１５）。 The control unit 250 determines whether translation data (in this case, the latest version of English) is stored in the document database 210 from the search result and the instruction information (step S14). If the instruction language (English) to be translated and the latest version of translation data (in this case, document ID “001-23”) is stored in the document database 210 (step S14; YES), it corresponds. Translation data (in this case, document ID “001-23”) is read out, and this translation data is temporarily stored in a storage area (not shown) (step S15).

一方、ステップＳ１４の判定において、最新版の英語による翻訳データが文書データベース２１０に格納されていない場合には（ステップＳ１４；ＮＯ）、制御部２５０は、検索結果から文書データベース２１０に原文の言語による最新版の文書データ（この場合、文書ＩＤ“００１−１３”）が格納されているか否かを判定する（ステップＳ１６）。
制御部２５０は、最新版の文書データ（この場合、文書ＩＤ“００１−１３”）が文書データベース２１０に格納されていた場合には（ステップＳ１６；ＹＥＳ）、この最新版の文書データを読み出し、この文書データに基づいた翻訳処理を翻訳辞書部２３０を利用して行い、この処理で得られた翻訳データを記憶エリアに一時的に記憶する（ステップＳ１７）。 On the other hand, if it is determined in step S14 that the latest English translation data is not stored in the document database 210 (step S14; NO), the control unit 250 uses the search result in the document database 210 according to the original language. It is determined whether or not the latest version of the document data (in this case, the document ID “001-13”) is stored (step S16).
When the latest version of the document data (in this case, the document ID “001-13”) is stored in the document database 210 (step S16; YES), the control unit 250 reads the latest version of the document data, The translation processing based on the document data is performed using the translation dictionary unit 230, and the translation data obtained by this processing is temporarily stored in the storage area (step S17).

さらに、制御部２５０は、ステップＳ１７で得られた翻訳データを文書データベース２１０に格納すると共に、文書データ管理部２２０のデータテーブルに原文の文書ＩＤの添え字を変えた文書ＩＤを翻訳文の文書名（ファイル名）に対応付けて加えることによりデータベースを更新する（ステップＳ１８）。 Further, the control unit 250 stores the translation data obtained in step S17 in the document database 210, and converts the document ID obtained by changing the subscript of the original document ID into the data table of the document data management unit 220. The database is updated by adding it in association with the name (file name) (step S18).

一方、制御部２５０は、受信した原文画像データから文書ＩＤが抽出されなかった場合（ステップＳ１２；ＮＯ）、或いは文書データベース２１０に原文データに対応した最新版の文書データが格納されていない場合には（ステップＳ１６；ＮＯ）、原文画像データをテキスト化したデータに対して翻訳処理を行い、この処理で得られた翻訳データを記憶エリアに一時的に記憶する（ステップＳ１９）。この処理後に、文書データベース２１０に翻訳前の原文データを文書ＩＤと共に格納し、文書データ管理部２２０に原文データを文書ＩＤに対応した新たなデータテーブルとして作成するようにしてもよい。 On the other hand, when the document ID is not extracted from the received original text image data (step S12; NO), or when the latest version of the document data corresponding to the original text data is not stored in the document database 210, the control unit 250 (Step S16; NO), translation processing is performed on the text data of the original image data, and the translation data obtained by this processing is temporarily stored in the storage area (Step S19). After this processing, the original text data before translation may be stored in the document database 210 together with the document ID, and the original text data may be created in the document data management unit 220 as a new data table corresponding to the document ID.

このように、翻訳サーバ２００の制御部２５０では、三通りの処理のうち何れかの処理により得られた最新版の翻訳データを送受信部２４０を介して複合機１００に送信する（図５中のステップＳ５）。この送信される翻訳データには、翻訳文を表すデータと共に、この翻訳文を示す文書ＩＤが含まれている。この翻訳データを受けた複合機１００は、原文に対して最新版の翻訳文が印刷して翻訳文献Ｔを出力する。 As described above, the control unit 250 of the translation server 200 transmits the latest version of the translation data obtained by any one of the three processes to the MFP 100 via the transmission / reception unit 240 (in FIG. 5). Step S5). The translation data to be transmitted includes a document ID indicating the translated sentence as well as data representing the translated sentence. Upon receiving this translation data, the MFP 100 prints the latest version of the translated text on the original text and outputs the translated document T.

＜Ａ−４．第１実施形態の効果＞
このように、本実施形態による翻訳システム１では、予め翻訳サーバ２００の文書データベース２１０に原文に関連した最新版の翻訳データが格納されている場合には、その翻訳データを複合機１００に送信することにより、ＯＣＲ処理や翻訳処理を省略でき、翻訳精度の高い翻訳文を利用者に提供することができる。しかも、翻訳処理を行わなわずに済むため、翻訳時間を大幅に短縮することができる。 <A-4. Effects of First Embodiment>
As described above, in the translation system 1 according to the present embodiment, when the latest version of translation data related to the original text is stored in the document database 210 of the translation server 200 in advance, the translation data is transmitted to the MFP 100. Thus, OCR processing and translation processing can be omitted, and a translated sentence with high translation accuracy can be provided to the user. In addition, since the translation process is not required, the translation time can be greatly shortened.

また、翻訳データが文書データベース２１０に格納されていない場合であっても、原文の文書データが文書データベース２１０に格納されている場合には、この文書データを翻訳処理することにより、ＯＣＲ処理を行わずに済むため、ＯＣＲ処理による文字の誤認識を防止でき、原文をＯＣＲ処理および翻訳処理を行って得た翻訳文に比べて誤翻訳を低減することができる。
以上のように、本実施形態による翻訳システム１は、利用者に対して精度の高い翻訳文を提供することが可能となる。 Even if the translation data is not stored in the document database 210, if the original document data is stored in the document database 210, the OCR process is performed by translating the document data. Therefore, erroneous recognition of characters due to OCR processing can be prevented, and erroneous translation can be reduced as compared with a translation obtained by performing OCR processing and translation processing on an original sentence.
As described above, the translation system 1 according to the present embodiment can provide a translated sentence with high accuracy to the user.

このように、本実施形態による翻訳システム１は、改訂等によりバージョンの異なった文書があることを知らずに、利用者が複合機１００を利用して機械翻訳を行おうとした場合、最新版の翻訳データが出力され、利用者に対して最新の翻訳データを自動的に提供することができる。 As described above, the translation system 1 according to the present embodiment, when the user tries to perform machine translation using the multifunction device 100 without knowing that there is a document having a different version due to revision or the like, Data is output and the latest translation data can be automatically provided to the user.

＜Ｂ．第２実施形態＞
以下、図面を参照しつつ、本発明に係る第２実施形態を説明する。なお、第２実施形態の特徴は、原文のバーコードから特定される文書と最新版の文書とを照合するこにより相違点を検出し、この相違点を特徴付けて表現した翻訳データを得る点にある。
本実施形態では、前述した第１実施形態と同一の構成要件に同一の符号を付し、その説明は省略するものとする。 <B. Second Embodiment>
The second embodiment according to the present invention will be described below with reference to the drawings. The feature of the second embodiment is that a difference is detected by collating the document specified from the barcode of the original text with the latest version of the document, and obtaining translation data that characterizes and expresses the difference. It is in.
In the present embodiment, the same components as those in the first embodiment described above are denoted by the same reference numerals, and the description thereof is omitted.

＜Ｂ−１．翻訳システムの動作＞
本実施形態による翻訳システムの全体の動作については、前述した第１実施形態で示した図５のシーケンスチャートと同様であるので、その説明は省略する。
次に、図７および図８のフローチャートに基づき、翻訳サーバ２００における翻訳処理および相違点検出処理について説明する。
この翻訳サーバ２００は、複合機１００から送信された指示情報および原文画像データを送受信部２４０で受信することによって動作を開始する。 <B-1. Operation of translation system>
Since the overall operation of the translation system according to the present embodiment is the same as that of the sequence chart of FIG. 5 shown in the first embodiment, description thereof will be omitted.
Next, translation processing and difference detection processing in the translation server 200 will be described based on the flowcharts of FIGS. 7 and 8.
The translation server 200 starts the operation by receiving the instruction information and the original text image data transmitted from the multi-function device 100 by the transmission / reception unit 240.

制御部２５０は、その制御部２５０内でイメージ形式の原文画像データを文字認識処理（例えば、ＯＣＲ処理）してテキスト形式のデータに変換し、文書ＩＤを示すバーコードがある場合には、このバーコード部分の画像データから文書ＩＤを抽出する（ステップＳ１１）。本実施形態の場合には、第１版（日）の取扱説明書を示す文書ＩＤ“００１−１１”が抽出されたものとする。 The control unit 250 converts the original text image data in the image format into text data by performing character recognition processing (for example, OCR processing) in the control unit 250, and if there is a barcode indicating the document ID, The document ID is extracted from the image data of the barcode part (step S11). In the case of the present embodiment, it is assumed that the document ID “001-11” indicating the instruction manual for the first version (date) is extracted.

制御部２５０は、抽出された文書ＩＤを検索キーとして、文書データ管理部２２０（データーテーブル）の文書ＩＤとを照合させることにより、文書データベース２１０に格納されている文書データおよび翻訳データを検索して検索結果を得る（ステップＳ１３）。
図４のデータテーブルの例では、抽出された文書ＩＤが“００１−１１”である場合には、初めの３文字を検索キーとしてデータテーブルが選択され、日本語の第１版の文書データを始め、第２版，第３版の日本語の文書データ、さらに第１〜第３版の英語の翻訳データ、第１〜第３版の中国語の翻訳データが文書データベース２１０に格納されていることを検索結果として得る。 The control unit 250 searches the document data and translation data stored in the document database 210 by collating the document ID of the document data management unit 220 (data table) with the extracted document ID as a search key. The search result is obtained (step S13).
In the example of the data table in FIG. 4, when the extracted document ID is “001-11”, the data table is selected using the first three characters as a search key, and the first version document data in Japanese is selected. First, the second and third editions of Japanese document data, the first to third editions of English translation data, and the first to third editions of Chinese translation data are stored in the document database 210. Is obtained as a search result.

ここで、翻訳サーバ２００における相違点検出処理を、図８のフローチャートに基づいて説明する。
制御部２５０は、検索結果から文書データベース２１０に原文の翻訳データ（この場合、第１版の英語）が格納されているか否かを判定する（ステップＳ３１）。翻訳対象語となる指示言語（英語）で且つ第１版の翻訳データ（この場合、文書ＩＤ“００１−２１”）が文書データベース２１０に格納されていた場合には（ステップＳ３１；ＹＥＳ）、この翻訳データを読み出し、この翻訳データを記憶エリア（図示せず）に一時的に記憶する（ステップＳ３５）。 Here, the difference detection process in the translation server 200 will be described based on the flowchart of FIG.
The control unit 250 determines whether or not the original translation data (in this case, the first version of English) is stored in the document database 210 from the search result (step S31). If the instruction language (English) to be translated and the first version of translation data (in this case, document ID “001-21”) are stored in the document database 210 (step S31; YES), this The translation data is read, and this translation data is temporarily stored in a storage area (not shown) (step S35).

一方、ステップＳ３１の判定において、指示言語（英語）で且つ第１版の翻訳データ（この場合、文書ＩＤ“００１−２１”）が文書データベース２１０に格納されていない場合には（ステップＳ３１；ＮＯ）、制御部２５０は、原文の文書データ（この場合、第１版の日本語）が文書データベース２１０に有るか否かを判定する（ステップＳ３２）。日本語で且つ第１版の文書データ（この場合、文書ＩＤ“００１−１１”）が文書データベースに格納されていた場合には（ステップＳ３２；ＹＥＳ）、この文書データを読み出し、この文書データに基づいた翻訳処理を翻訳辞書部２３０を利用して行い（ステップＳ３３）、この処理で得られた翻訳データを記憶エリアに一時的に記憶する（ステップＳ３５）。 On the other hand, if it is determined in step S31 that the instruction language (English) and the first version of the translation data (in this case, the document ID “001-21”) are not stored in the document database 210 (step S31; NO) ), The control unit 250 determines whether or not the original document data (in this case, the first version of Japanese) exists in the document database 210 (step S32). If the document data in Japanese and the first version (in this case, the document ID “001-11”) is stored in the document database (step S32; YES), this document data is read out, and this document data is stored in the document data. The translation process based on the translation dictionary unit 230 is performed (step S33), and the translation data obtained by this process is temporarily stored in the storage area (step S35).

さらに、制御部２５０は、受信した原文画像データから文書ＩＤが抽出されなかった場合（図７中ステップＳ１２；ＮＯ）、或いは文書データベース２１０に原文データに対応した第１版の文書データが格納されていない場合には（ステップＳ３２；ＮＯ）、原文画像データをテキスト化したデータに対して翻訳処理を行い（ステップＳ３４）、この処理で得られた翻訳データを記憶エリアに一時的に記憶する（ステップＳ３５）。
このステップＳ３１〜Ｓ３５までの処理が、原文に対応した翻訳データを生成する処理である。 Further, when the document ID is not extracted from the received original text image data (step S12 in FIG. 7; NO), the control unit 250 stores the first version of the document data corresponding to the original text data in the document database 210. If not (step S32; NO), translation processing is performed on the text data of the original text data (step S34), and the translation data obtained by this processing is temporarily stored in the storage area (step S34). Step S35).
The processing from step S31 to S35 is processing for generating translation data corresponding to the original text.

また、制御部２５０は、検索結果および指示情報から文書データベース２１０に翻訳データ（この場合、最新版の英語）が格納されているか否かを判定する（ステップＳ３６）。翻訳対象語となる指示言語（英語）で且つ最新版の翻訳データ（この場合、文書ＩＤ“００１−２３”）が文書データベース２１０に格納されていた場合には（ステップＳ３６；ＹＥＳ）、この翻訳データを読み出し、この翻訳データを記憶エリア（図示せず）に一時的に記憶する（ステップＳ３９）。 Further, the control unit 250 determines whether translation data (in this case, the latest version of English) is stored in the document database 210 from the search result and the instruction information (step S36). If the instruction language (English) to be translated and the latest version of translation data (in this case, document ID “001-23”) is stored in the document database 210 (step S36; YES), this translation Data is read, and this translation data is temporarily stored in a storage area (not shown) (step S39).

一方、ステップＳ３６の判定において、指示言語（英語）で且つ最新版の翻訳データ（この場合、文書ＩＤ“００１−２３”）が文書データベース２１０に格納されていない場合には（ステップＳ３６；ＮＯ）、制御部２５０は、検索結果から原文の言語による最新版の文書データ（この場合、文書ＩＤ“００１−１３”）が文書データベース２１０に有るか否かを判定する（ステップＳ３７）。ステップＳ３７の判定処理において、最新版の文書データ（この場合、文書ＩＤ“００１−１３”）が文書データベース２１０に有る場合には（ステップＳ３７；ＹＥＳ）、この文書データを読み出し、この文書データに基づいた翻訳処理を翻訳辞書部２３０を利用して行い（ステップＳ３８）、この処理で得られた翻訳データを記憶エリアに一時的に記憶する（ステップＳ３９）。 On the other hand, if it is determined in step S36 that the instruction language (English) and the latest translation data (in this case, the document ID “001-23”) is not stored in the document database 210 (step S36; NO). The control unit 250 determines whether or not the latest document data (in this case, the document ID “001-13”) in the original language exists in the document database 210 from the search result (step S37). In the determination process in step S37, when the latest version of the document data (in this case, the document ID “001-13”) is present in the document database 210 (step S37; YES), this document data is read out, and this document data is stored in this document data. The translation process based on the translation dictionary unit 230 is performed (step S38), and the translation data obtained by this process is temporarily stored in the storage area (step S39).

さらに、制御部２５０は、ステップＳ３７の判定において、文書データベース２１０に最新版の文書データが格納されていない場合には（ステップＳ３７；ＮＯ）、最新版の翻訳データが入手できない旨を記憶エリアに一時的に記憶する（ステップＳ３９）。
このステップＳ３６〜Ｓ３９までの処理が、最新版の翻訳データを生成する処理である。 Further, if the latest version of the document data is not stored in the document database 210 in the determination in step S37 (step S37; NO), the control unit 250 informs the storage area that the latest version of translation data cannot be obtained. Store temporarily (step S39).
The processes from step S36 to step S39 are processes for generating the latest version of translation data.

そして、制御部２５０は、記憶エリアに記憶された原文の翻訳データと最新版の翻訳データを照合させてその相違点を検出する（ステップＳ４０）。
制御部２５０は、ステップＳ４０で抽出された原文の翻訳データと最新版の翻訳データとの相違点を特徴付けた修飾処理を施した翻訳データとして生成する（図７中のステップＳ２２）。ここで、修飾処理としては、追記された箇所にアンダーラインを付し、削除された箇所に取消線を引いて表示したり、文字に色を付けたり、文字網掛け処理をしたり、文字のフォントを変えたり、文字ポイントを変えたり等、利用者に対して識別できる表示による修飾である。
なお、最新版の翻訳データおよび文書データが文書データベース２１０に格納されていない場合には（ステップＳ３６，３７；ＮＯ）、最新の文書データがないため、相違点の検出は行われないことになる。 Then, the control unit 250 collates the original translation data stored in the storage area with the latest translation data and detects the difference (step S40).
The control unit 250 generates translation data that has been subjected to modification processing that characterizes the difference between the translation data of the original text extracted in step S40 and the latest version of translation data (step S22 in FIG. 7). Here, as modification processing, an underline is added to the added part, a strike-through line is drawn at the deleted part, a character is colored, a character shading process is performed, It is a display modification that can be identified to the user, such as changing the font or changing the character point.
If the latest translation data and document data are not stored in the document database 210 (steps S36 and 37; NO), no difference is detected because there is no latest document data. .

＜Ｂ−２．第２実施形態の効果＞
本実施形態による翻訳システムにおいても、第１の実施形態と同様に、利用者に対して精度の高い翻訳文を提供するという効果を奏する。
また、バージョンの古い文書を複合機１００で読み込むことにより、最新版の文書を翻訳した翻訳原稿Ｔが得られ、しかも改訂された部分を特徴付けて表示できるため、利用者に対して改訂部分を印象付けることができる。 <B-2. Effect of Second Embodiment>
Also in the translation system according to the present embodiment, as in the first embodiment, there is an effect that a translated sentence with high accuracy is provided to the user.
In addition, by reading an old version of the document with the multifunction peripheral 100, a translation manuscript T obtained by translating the latest version of the document can be obtained, and the revised part can be characterized and displayed. You can make an impression.

＜Ｂ−３．第２実施形態の変形例＞
第２実施形態では、旧版の文書と新版の文書との相違点を抽出するのに、翻訳後のデータを用いて照合することにより行ったが、本発明はこれに限らず、原文の言語の段階で相違点を抽出しても、翻訳に一般的に用いられる中間言語の段階で相違点を抽出するようにしてもよい。 <B-3. Modification of Second Embodiment>
In the second embodiment, the difference between the old version document and the new version document is extracted by collating using the translated data. However, the present invention is not limited to this, and the language of the source language is not limited to this. Even if the difference is extracted at the stage, the difference may be extracted at an intermediate language stage generally used for translation.

＜Ｃ．変形例＞
以上、本発明の実施形態について説明したが、本発明は上述した各実施形態に限定されるものではなく、種々の態様が可能である。
前記各実施形態では、複合機１００の読み取り部１１０で、読み取った原文画像データを翻訳サーバ２００に送信するようにしたが、本発明は、原稿の所定位置にバーコードが印刷されていれば、原稿画像データからバーコードの識別ＩＤのみを読み取り、翻訳サーバ２００に送信するようにしてもよい。これにより、バーコードのデータ容量に比べて大容量となる原文画像データを送信することがなくなり、ネットワーク３００を経由しての送信時間を大幅に短縮できる。 <C. Modification>
As mentioned above, although embodiment of this invention was described, this invention is not limited to each embodiment mentioned above, A various aspect is possible.
In each of the embodiments described above, the read original image data is transmitted to the translation server 200 by the reading unit 110 of the multifunction peripheral 100. However, in the present invention, if a barcode is printed at a predetermined position of the document, Only the barcode identification ID may be read from the document image data and transmitted to the translation server 200. As a result, the original image data that has a larger capacity than the barcode data capacity is not transmitted, and the transmission time via the network 300 can be greatly reduced.

翻訳サーバ２００の文書データベース２１０に翻訳データおよび文書データが格納されていない場合、原文画像データを再度複合機１００から翻訳サーバ２００に送信すればよい。
また、複合機１００の読み取り部１１０で原稿Ｍを読み取った段階で、ＯＣＲ処理を施してイメージ形式の原文画像データからテキスト形式の原文入力データに変換してもよい。但し、この原文入力データと予め文書データベース２１０に格納されている文書データとは異なったものである。 If translation data and document data are not stored in the document database 210 of the translation server 200, the original text image data may be transmitted again from the multifunction peripheral 100 to the translation server 200.
Alternatively, when the document M is read by the reading unit 110 of the multifunction peripheral 100, OCR processing may be performed to convert the original text image data in the image format into the original text input data in the text format. However, the original text input data is different from the document data stored in the document database 210 in advance.

前記各実施形態では、複合機１００と翻訳サーバ２００とをネットワーク３００で接続する翻訳システムとして説明したが、本発明はこれに限らず、複合機１００内に翻訳サーバ２００の機能を備え、複合機１００が翻訳装置してもよい。このような複合機１００は、複合機１台で仕事をサポートできる比較的小さな会社への適用が好適である。この際、文書データベース２１０に格納される文書データは、図６のステップＳ１８による記憶更新のみでなく、複合機１００のスキャナ機能、コピー機能、プリンタ機能を動作させた際に、文書データベース２１０に格納させてもよい。この際、スキャナ・コピー・プリントされる原稿の文書データを文書ＩＤおよびバージョン（版番）と共に文書データベース２１０に格納すると共に、文書データ管理部２２０に翻訳データの文書ＩＤと文書名を記憶更新するようにすればよい。 In each of the above-described embodiments, the translation system in which the multifunction peripheral 100 and the translation server 200 are connected via the network 300 has been described. However, the present invention is not limited to this, and the multifunction peripheral 100 includes the function of the translation server 200. 100 may be a translation device. Such a multifunction device 100 is preferably applied to a relatively small company that can support work with one multifunction device. At this time, the document data stored in the document database 210 is stored in the document database 210 when the scanner function, copy function, and printer function of the multi-function peripheral 100 are operated, as well as the storage update in step S18 of FIG. You may let them. At this time, the document data of the original to be scanned, copied and printed is stored in the document database 210 together with the document ID and version (version number), and the document ID and document name of the translation data are stored and updated in the document data management unit 220. What should I do?

また、前記各実施形態では、利用者が最新の翻訳文を得ることを目的として翻訳システムについて詳述したが、本発明はこれに限らず、入力された識別番号に対して異なったバージョンの文書データを出力する文書管理システムとしてもよいことは勿論である。
さらに、読み取り部１１０は、スキャナ機構を前提に記載したが、ネットワーク３００を介して接続されたユーザ端末（例えば、パーソナルコンピュータ）からの印刷指令信号に重畳された文書データをキーにして異なったバージョンの文書データを出力するようにしてもよい。この際、ユーザ端末に対してその旨を通知してモニタに表示させて、利用者からの指示を促してもよい。 In each of the above-described embodiments, the translation system has been described in detail for the purpose of obtaining the latest translated sentence by the user. However, the present invention is not limited to this, and different versions of the document with respect to the input identification number. Of course, it may be a document management system that outputs data.
Furthermore, although the reading unit 110 has been described on the assumption of the scanner mechanism, different versions are provided using document data superimposed on a print command signal from a user terminal (for example, a personal computer) connected via the network 300 as a key. The document data may be output. At this time, the user terminal may be notified of this and displayed on the monitor to prompt an instruction from the user.

前記実施形態のデータテーブルでは、図４に示すように、共通の文字列を有する文書ＩＤとして文書データを関連付けたが、本発明はこれに限らず、異なった文書ＩＤであっても、１つのデータテーブルで関連付けられるようにしてもよい。
本発明は、翻訳可能な言語は、第１〜第３言語の３つに限らず、２または４以上の言語に対応できることは勿論である。 In the data table of the embodiment, as shown in FIG. 4, document data is associated as a document ID having a common character string. However, the present invention is not limited to this, and even with different document IDs, You may make it relate by a data table.
In the present invention, the translatable languages are not limited to three of the first to third languages, and can naturally correspond to two or four or more languages.

前各記実施形態では、文書管理システムとして版の異なった翻訳文書データを管理する場合に例示したが、本発明はこれに限らず、１カ国の言語に対して改訂によるバージョンの異なった文書データを管理する場合に用いても、見積書・納品書・請求書といった一連の文書データの管理する場合に用いても、特許書類のような願書・明細書・要約書といった一連の文書データの管理する場合に用いてもよいことは勿論である。 In each of the above-described embodiments, the translation document data of different versions is managed as the document management system. However, the present invention is not limited to this, and the document data of different versions according to revisions for one language. Management of a series of document data, such as patent documents, application forms, specifications, summaries, etc., whether used to manage a series of document data such as quotations, invoices or invoices Of course, it may be used.

前記各実施形態では、文書ＩＤの抽出を用紙に印刷されたバーコードから抽出する構成としたが、本発明はこれに限らず、指定領域に存在する文字列を識別情報（文書ＩＤ）としてもよい。例えば、用紙に印刷された文章が横書きの場合、文書の中央部分の列を所定領域とし、この所定領域をＯＣＲ処理することによりこの文字列を文書ＩＤとしてもよい。 In each of the above embodiments, the document ID is extracted from the barcode printed on the paper. However, the present invention is not limited to this, and a character string existing in the designated area may be used as identification information (document ID). Good. For example, when the text printed on the paper is written horizontally, the central column of the document may be set as a predetermined area, and the character string may be set as the document ID by performing OCR processing on the predetermined area.

また、原稿に利用者が注釈や付箋等のアノテーションを記載している場合には、読み取り部１１０で原稿を読み取る際に、アノテーションに記載された文字とその位置情報を算出し、翻訳サーバ２００から翻訳データが返信された場合には、この翻訳データ、アノテーションの位置情報に基づき、印刷部１５０が翻訳文と共に、翻訳されたアノテーションを元の原稿と同じ位置に印刷する。 In addition, when the user describes annotations such as annotations and sticky notes on the manuscript, when the manuscript is read by the reading unit 110, the characters described in the annotation and the position information thereof are calculated, and the translation server 200 When the translated data is returned, the printing unit 150 prints the translated annotation together with the translated text at the same position as the original document based on the translated data and the positional information of the annotation.

図６のステップＳ１８で文書データベース２１０に記憶更新される翻訳データは、翻訳サーバ２００による機械翻訳による翻訳データを格納させるようにしたが、ここでは格納させずに、この翻訳データに基づき翻訳者が補正した翻訳データ、或いは翻訳者が原文の原稿から翻訳した翻訳データを文書データベース２１０に新たに格納させるようにしてもよい。 The translation data stored and updated in the document database 210 in step S18 in FIG. 6 is stored as translation data by machine translation by the translation server 200. However, the translation data is not stored here but the translator based on this translation data. The corrected translation data or the translation data translated by the translator from the original document may be newly stored in the document database 210.

前記各実施形態では、複合機１００および翻訳サーバ２００で翻訳システム１を構成するものとして述べたが、個人的なクライアント端末（例えば、パーソナルコンピュータ）と翻訳サーバ２００とをネットワーク３００を介して接続した翻訳システムとしてもよい。
このシステムでは、クライアント端末の表示部に、テキスト形式の翻訳データ或いは原文データと翻訳データとを比較させながら表示させることも可能となる。さらに、データを表示部に表示し、マウス等の領域指定手段によって文章領域を指定することにより、この文章領域に対応した翻訳文を表示させる等、種々の使用方法が適用できる。 In each of the embodiments described above, the MFP 100 and the translation server 200 are described as constituting the translation system 1. However, a personal client terminal (for example, a personal computer) and the translation server 200 are connected via the network 300. It may be a translation system.
In this system, the translation data in the text format or the original text data and the translation data can be displayed on the display unit of the client terminal while being compared. Furthermore, various usage methods can be applied, such as displaying data on the display unit and designating a text area by an area designating means such as a mouse to display a translation corresponding to the text area.

なお、図１においては図面が煩雑になるのを防ぐため、複合機１００および翻訳サーバ２００はそれぞれ１台のみ図示されているが、翻訳システム１は、複合機１００および翻訳サーバ２００を複数有してもよいことは勿論である。 In FIG. 1, only one MFP 100 and one translation server 200 are shown in order to prevent the drawing from becoming complicated, but the translation system 1 has a plurality of MFPs 100 and translation servers 200. Of course, it may be.

また、上述の実施形態においては、複合機１００は制御部１６０が、翻訳サーバ２００は制御部２５０がそれぞれプログラムを実行することにより実現されたが、この一部または全部をハードウェアで実現する構成としてもよい。 In the above-described embodiment, the MFP 100 is realized by the control unit 160 and the translation server 200 is executed by the control unit 250, respectively. It is good.

本発明の実施形態に係る翻訳システムの構成を示す図である。It is a figure which shows the structure of the translation system which concerns on embodiment of this invention. 本発明実施形態に用いられる複合機の構成を示すブロック図である。1 is a block diagram illustrating a configuration of a multifunction machine used in an embodiment of the present invention. 本発明実施形態に用いられる翻訳サーバの構成を示すブロック図である。It is a block diagram which shows the structure of the translation server used for this invention embodiment. 文書データ管理部に記憶されているデータテーブルを示す図である。It is a figure which shows the data table memorize | stored in the document data management part. 実施形態による翻訳システムの動作を示す流れ図である。It is a flowchart which shows operation | movement of the translation system by embodiment. 第１実施形態による翻訳サーバにおける翻訳処理の動作を示す流れ図である。It is a flowchart which shows the operation | movement of the translation process in the translation server by 1st Embodiment. 第２実施形態による翻訳サーバにおける翻訳処理の動作を示す流れ図である。It is a flowchart which shows the operation | movement of the translation process in the translation server by 2nd Embodiment. 図７に続く翻訳処理の動作を示す流れ図である。It is a flowchart which shows the operation | movement of the translation process following FIG.

Explanation of symbols

１…翻訳システム、１００…複合機、１１０…読み取り部、１２０…画像形成部、１３０…指示入力部、１４０…送受信部、１５０…印刷部、１６０…制御部、２００…翻訳サーバ、２１０…文書データベース、２２０…文書データ管理部、２３０…翻訳辞書部、２４０…送受信部、２５０…制御部、３００…ネットワーク。 DESCRIPTION OF SYMBOLS 1 ... Translation system, 100 ... MFP, 110 ... Reading part, 120 ... Image forming part, 130 ... Instruction input part, 140 ... Transmission / reception part, 150 ... Printing part, 160 ... Control part, 200 ... Translation server, 210 ... Document Database, 220 ... Document data management unit, 230 ... Translation dictionary unit, 240 ... Transmission / reception unit, 250 ... Control unit, 300 ... Network.

Claims

Document data storage means storing document data and identification information for identifying the document data;
Identification information processing means for handling the identification information stored in the document data storage means as a group according to a predetermined classification rule;
Receiving means for receiving document data input together with identification information;
First determination means for determining whether or not the same identification information as the identification information received by the reception means is stored in the document data storage means;
A second determining unit that determines whether the identification information classified as the same group by the identification information processing unit is in the document data storage unit when the determination by the first determination unit is affirmative; A determination means;
When the determination by the second determination unit is affirmative, other identification information in the same group is selected based on a predetermined selection rule, and the document data corresponding to the selected identification information is selected. Document management apparatus comprising: document data output means for reading out and outputting from the document data storage means.

The document management apparatus according to claim 1, wherein the classification rule is classification based on a storage area of the identification information.

The document management apparatus according to claim 1, wherein the identification information is set to contents according to a predetermined system, and the classification rule is a rule for classifying according to common items of the contents of the identification information. .

The identification information is set to a content according to a predetermined system including a code indicating a document type and version information,
The document management apparatus according to claim 3, wherein the predetermined selection rule in the second determination unit is set to a rule of selecting identification information including the latest version information.

The identification information is set to contents according to a predetermined system including a code indicating the type of document, version information, and a language format in which the document is represented,
The document according to claim 3, wherein the predetermined selection rule in the second determination means is set to a rule of selecting identification information including the latest version information and a specified language format. Management device.

The document management apparatus according to claim 1, wherein the identification information is an identification code according to a predetermined system or a document name indicating a document name.

The document data output means detects a difference between the other document data and the document data received by the reception means, and a portion corresponding to the difference in the output document data has a display characteristic. The document management apparatus according to claim 1, wherein a modification process is performed.

When the document data output means detects the difference between the other document data and the document data received by the receiving means, if the language of the two is different, the document data received by the receiving means The document management apparatus according to claim 7, wherein the difference is detected after the document data is translated into a language.

Even if the determination by the first determination means is positive, the identification information for which the determination by the second determination means is negative is the same as the identification information for which the determination by the first determination means is positive. 9. The document management apparatus according to claim 1, further comprising a data update storage unit that divides the data into groups, and updates and stores the identification information and corresponding document data in the document data storage unit. .

The update storage means classifies the identification information for which the determination by the first determination means is negative into a new category, and stores the identification information and the corresponding document data in the document data storage means. The document management apparatus according to claim 9, wherein the document management apparatus is updated and stored.

The document management apparatus according to claim 1, wherein the identification information is a specific character string or a specific image.

12. The document management apparatus according to claim 1, further comprising an image reading unit that reads an image from a document, generates identification information and input document data, and transfers the document data to the receiving unit.

A terminal device and a server device that communicates with the terminal device via a network;
The server device is
Document data storage means storing document data and identification information for identifying the document data;
Identification information processing means for handling the identification information stored in the document data storage means as a group according to a predetermined classification rule;
Receiving means for receiving document data input together with identification information;
First determination means for determining whether or not the same identification information as the identification information received by the reception means is stored in the document data storage means;
A second determining unit that determines whether the identification information classified as the same group by the identification information processing unit is in the document data storage unit when the determination by the first determination unit is affirmative; A determination means;
When the determination by the second determination unit is affirmative, other identification information in the same group is selected based on a predetermined selection rule, and the document data corresponding to the selected identification information is selected. Document data output means for reading out and outputting from the document data storage means,
The terminal device is
An image reading unit that reads an image from a document, generates identification information and input document data, and delivers the document data to the receiving unit;
A document management system comprising: data display means for printing or displaying the document data output from the document data output means.

A document data storage step for storing document data and identification information for identifying the document data in the document data storage means;
An identification information processing step of handling the identification information stored in the document data storage means as a group according to a predetermined classification rule;
A receiving step for receiving document data input together with identification information;
A first determination step of determining whether or not the same identification information as the identification information received in the reception step is stored in the document data storage unit;
A second step of determining whether or not the identification information classified as the same group by the identification information processing step is in the document data storage means when the determination of the first determination step is affirmative; A determination step;
If the determination in the second determination step is affirmative, other identification information in the same group is selected based on a predetermined selection rule, and the document data corresponding to the selected identification information is selected. A document data output step of reading out and outputting from the document data storage means.