JP2009130398A

JP2009130398A - Information processing apparatus and method

Info

Publication number: JP2009130398A
Application number: JP2007299836A
Authority: JP
Inventors: Hiroki Yamamoto; 寛樹山本; Yasuo Okuya; 泰夫奥谷
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2007-11-19
Filing date: 2007-11-19
Publication date: 2009-06-11

Abstract

PROBLEM TO BE SOLVED: To retain the alternative character string given to an image when an electronic document is printed. SOLUTION: An analysis means analyzes a structured document. When the analysis means detects a link to image data, an acquisition means obtains the image data. When the analysis means detects a character string describing the image data, an embedding means embeds the character string, as a digital watermark, into the image data obtained by the acquisition means. When watermark information is later detected, the extracted alternative character string is read aloud. COPYRIGHT: (C)2009,JPO&INPIT

Description

The present invention relates to a technique for ensuring information accessibility for visually impaired persons.

With the spread of communication infrastructure such as the Internet and mobile phones, information exchange and services based on electronic documents, such as the Web and e-mail, have come into wide use. Because most electronic documents in circulation consist of text and images, they are not easy for visually impaired people to use. Various efforts toward information accessibility are under way so that visually impaired people can use information services without difficulty.

For example, software called a screen reader, which reads aloud the text displayed on a screen using text-to-speech technology, is commercially available. Well-known examples include JAWS from Freedom Scientific and Home Page Reader from IBM. In addition, Patent Document 1 discloses an example of a braille display that converts character information into braille for tactile reading. With a screen reader or a device such as that of Patent Document 1, character information can be converted into speech and thereby obtained.

On the premise that such screen readers are used, JIS X 8341-3 of the Japanese Industrial Standards sets out guidelines for creating Web content so that information can also be obtained about image data. It recommends that an alternative character string describing the content of an image be given, as an ALT attribute, to the tag that describes the reference to the image data.

In addition, Patent Document 2 discloses an information presentation device that extracts image-related information, such as the author's name, creation date, and title, from header information contained in image data and reads it aloud.

Patent Document 1: Japanese Patent No. 3732756
Patent Document 2: JP 2002-175176 A

Even when an electronic document follows the JIS X 8341-3 guidelines, the character string given as the ALT attribute is not printed when the document is printed. In other words, even for an electronic document written with visually impaired readers in mind, information about its images is lost on printing. The information presentation device disclosed in Patent Document 2 has the same limitation: the header information of an image is not printed, so a visually impaired person cannot obtain information about the image from the printout.

One conceivable remedy is to print the alternative character string given as the ALT attribute together with the document. However, there are also cases in which changing the layout of the printed document is undesirable.

To solve the above problems, an object of the present invention is to provide an information processing apparatus that embeds the alternative character string given to an image into that image as a digital watermark and prints it. A further object is to provide an information processing apparatus that extracts the alternative character string from an image in a printed document and presents the extracted alternative character string by voice.

According to one aspect of the present invention, there is provided an information processing apparatus comprising: analysis means for analyzing a structured document; acquisition means for acquiring image data when the analysis means detects a link to the image data; and embedding means for embedding, when the analysis means detects a character string describing the image data, the character string as a digital watermark in the image data acquired by the acquisition means.

According to the present invention, the alternative character string of an image described in the original electronic document can be retained throughout the process of printing and scanning the electronic document. Therefore, when a visually impaired person scans a printout of an electronic document and checks its content by voice, the alternative character string of the image is not lost, so the same information as is available from the original electronic document can be obtained, which improves convenience. Furthermore, because the alternative character string need not be printed directly, the same document layout can be maintained between the displayed electronic document and the printout.

Preferred embodiments of the present invention are described in detail below with reference to the drawings. The present invention is not limited to the following embodiments, which merely show specific examples advantageous for carrying out the invention. Moreover, not all combinations of the features described in the following embodiments are essential to the solution provided by the invention.

<Embodiment 1>
This embodiment is described using, as an example, the processing of an electronic document written in HTML (Hyper Text Markup Language), a representative markup language, but it is not limited to this. The embodiment can be applied not only to HTML but to any structured document written in a description language that, like HTML, controls the structure and layout of a document according to a predetermined format.

In the following, a descriptor used to control the layout of a document is referred to as a tag.

(Configuration of the information processing apparatus)
FIG. 1 is a diagram illustrating the functional configuration of the information processing apparatus according to this embodiment.

In FIG. 1, the information processing apparatus 100 includes a display unit 101, a printing unit 102, a scanning unit 103, an audio output unit 104, an electronic document processing unit 110, and a print document processing unit 120.

The display unit 101 consists of a display device such as a liquid crystal display and displays various kinds of information made up of images, characters, and the like. The printing unit 102 consists of a printing device such as a printer or copier and prints characters and images. The scanning unit 103 consists of a scanning device such as a scanner or copier and reads the characters and images on a printed page by converting them into digital data. The audio output unit 104 consists of a D/A conversion circuit and a speaker; it converts a digital audio signal into an analog audio signal and outputs it from the speaker. The electronic document processing unit 110 acquires the electronic document 191 and performs the processing involved in displaying or printing it. The print document processing unit 120 acquires the print document 192 via the scanning unit 103 and outputs, from the audio output unit 104, speech that reads the content of the document aloud.

Reference numeral 191 denotes an electronic document written in HTML. The electronic document 191 is stored on an external server (not shown) connected via the Internet or a LAN. Alternatively, it may be stored in a storage device (not shown), such as an HDD, provided in the information processing apparatus 100. Reference numeral 192 denotes a print document obtained by printing the electronic document.

FIG. 2 shows an example of the electronic document 191 written in HTML. The document is an access guide to a conference venue. FIG. 3 shows an example of the result of processing the electronic document of FIG. 2 with the electronic document processing unit 110 and displaying or printing it.

Next, the electronic document processing unit 110 and the print document processing unit 120 are described in detail.

As shown in FIG. 1, the electronic document processing unit 110 includes an electronic document acquisition unit 111, a document structure analysis unit 112, an alternative character string extraction unit 113, a document display control unit 114, a document print control unit 115, and a watermark information embedding unit 116.

The electronic document acquisition unit 111 acquires the electronic document 191.

The document structure analysis unit 112 analyzes the acquired electronic document and obtains the structure of the document according to the attributes of each document element described by tags. If the acquired electronic document contains link information for an image, the linked image is acquired via the electronic document acquisition unit 111. Furthermore, if a tag describes an alternative character string for the image, the alternative character string extraction unit 113 extracts it. In FIG. 2, the portion 201 enclosed by the broken-line frame is the description of information about image data. In HTML, the part following "<img src=" at the beginning of the line gives the link information for the image, and the part following the subsequent "alt=" gives the alternative character string. The portion 201 in FIG. 2 states that the image is "map1.jpg" and that its alternative character string is "Access Map Shin-Yokohama". Accordingly, when the information processing apparatus 100 handles an electronic document written in HTML, the alternative character string extraction unit 113 extracts the character string following "alt=". The document structure information, image data, alternative character strings, and other items obtained by the document structure analysis unit 112 and the alternative character string extraction unit 113 are temporarily stored in a storage device (not shown), such as a RAM, provided in the information processing apparatus 100.
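The extraction step described above can be illustrated with a short sketch. It is not the patent's implementation: it assumes Python's standard `html.parser` module as the parser, and the names `ImgAltCollector` and `extract_image_alts` are hypothetical. It collects the `src` and `alt` values of each `<img>` tag, which correspond to the image link and the alternative character string of portion 201.

```python
from html.parser import HTMLParser


class ImgAltCollector(HTMLParser):
    """Collects (src, alt) pairs from <img> tags; alt is None when absent."""

    def __init__(self):
        super().__init__()
        self.images = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs with lower-cased names.
        if tag == "img":
            attr_map = dict(attrs)
            self.images.append((attr_map.get("src"), attr_map.get("alt")))


def extract_image_alts(html_text):
    """Return a list of (src, alt) tuples in document order."""
    collector = ImgAltCollector()
    collector.feed(html_text)
    collector.close()
    return collector.images


# Hypothetical input modeled on the example values quoted from FIG. 2.
sample = (
    '<p>Access</p>'
    '<img src="map1.jpg" alt="Access Map Shin-Yokohama">'
    '<img src="logo.png">'
)
print(extract_image_alts(sample))
# [('map1.jpg', 'Access Map Shin-Yokohama'), ('logo.png', None)]
```

An image without an `alt` attribute yields `None`, which lets later steps tell "no alternative character string described" apart from an empty one.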

The document display control unit 114 controls the content displayed on the display unit 101 according to the document structure analyzed by the document structure analysis unit 112.

The document print control unit 115 generates print data that the printing unit 102 can accept, according to the document structure analyzed by the document structure analysis unit 112. At this time, for any image that has a corresponding alternative character string, the watermark information embedding unit 116 embeds the alternative character string in the image data as a digital watermark. For the electronic document shown in FIG. 2, the alternative character string "Access Map Shin-Yokohama" is embedded at print time in the corresponding image, shown at 302 in FIG. 3. A known digital watermarking technique, such as the method disclosed in Japanese Patent Laid-Open No. 2000-106624, can be used for the embedding.
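To make the idea of carrying a character string inside image data concrete, here is a deliberately simple sketch. It is not the method of Japanese Patent Laid-Open No. 2000-106624 and is not claimed to survive printing and scanning: it swaps in plain least-significant-bit (LSB) embedding purely for illustration. The image is assumed to be a flat list of 8-bit grayscale values, and `embed_text` and `extract_text` are hypothetical names.

```python
def embed_text(pixels, text):
    """Return a copy of pixels with text hidden in the lowest bit of each pixel.

    Layout (an assumption of this sketch): a 16-bit big-endian byte count,
    followed by the UTF-8 bytes of the text, one bit per pixel.
    """
    payload = text.encode("utf-8")
    if len(payload) > 0xFFFF:
        raise ValueError("payload too long for the 16-bit length header")
    data = len(payload).to_bytes(2, "big") + payload
    bits = [(byte >> shift) & 1 for byte in data for shift in range(7, -1, -1)]
    if len(bits) > len(pixels):
        raise ValueError("image too small for this payload")
    marked = list(pixels)
    for i, bit in enumerate(bits):
        marked[i] = (marked[i] & ~1) | bit  # overwrite only the lowest bit
    return marked


def extract_text(pixels):
    """Return the embedded text, or None if no plausible payload is found."""

    def read_bytes(start_bit, count):
        out = bytearray()
        for b in range(count):
            value = 0
            for k in range(8):
                value = (value << 1) | (pixels[start_bit + b * 8 + k] & 1)
            out.append(value)
        return bytes(out)

    if len(pixels) < 16:
        return None
    length = int.from_bytes(read_bytes(0, 2), "big")
    if length == 0 or 16 + length * 8 > len(pixels):
        return None
    try:
        return read_bytes(16, length).decode("utf-8")
    except UnicodeDecodeError:
        return None


image = [128] * 400  # hypothetical 20x20 gray image
marked = embed_text(image, "Access Map Shin-Yokohama")
print(extract_text(marked))  # Access Map Shin-Yokohama
```

Each pixel value changes by at most 1, so the marked image looks the same on screen. A watermark intended for printed output would need a scheme that tolerates the distortions of printing and scanning, which LSB embedding does not.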

Next, the print document processing unit 120 is described in detail.

As shown in FIG. 1, the print document processing unit 120 includes a scan image acquisition unit 121, a scan image processing unit 122, a voice generation unit 123, and a watermark information extraction unit 124.

The scan image acquisition unit 121 acquires a scan image, that is, the image obtained by scanning the print document 192 with the scanning unit 103.

The scan image processing unit 122 processes the acquired scan image using known OCR (Optical Character Recognition) technology. It first separates the scan image into image regions and text regions; image regions are kept as images, while text regions undergo character recognition and are converted into text.

The voice generation unit 123 generates voice data that reads the content of the document aloud, based on the electronic document assembled by the scan image processing unit 122. If the electronic document contains an image, the watermark information extraction unit 124 extracts the text information contained in the image, and voice data for reading it aloud is generated from the extracted text information. If no text information can be extracted from the image, either no read-out data is generated for the image portion, or read-out data indicating that the document contains an image, for example "This is an image", may be generated. The text watermarked into the image is extracted with an existing digital watermarking technique, corresponding to the one the watermark information embedding unit 116 uses to embed the text information.

Next, the operation of the information processing apparatus 100 is described.

(Description of the watermark embedding flow)
First, the operation of the electronic document processing unit 110, from acquiring an electronic document to printing it, is described with reference to the flowchart of FIG. 4.

In step S401, the electronic document acquisition unit 111 acquires an electronic document. The location where the electronic document is held is not restricted: the unit can acquire an electronic document held in the information processing apparatus 100 itself, or one held on another apparatus or server connected via a network.

Next, in steps S402 to S411, the document structure analysis unit 112 analyzes the acquired electronic document and obtains its structure. The document print control unit 115 generates print data for printing the electronic document based on the analysis result. The description of the acquired electronic document is analyzed one predetermined processing unit at a time. The processing unit may be, for example, a line or a tag.

When the document structure analysis unit 112 detects a link to image data such as that shown at 201 in FIG. 2 (YES in S402), the electronic document acquisition unit 111 acquires the image data (S403). The acquired image data is temporarily held in the information processing apparatus 100. If an alternative character string corresponding to the image data is described (YES in S405), that is, if the document structure analysis unit 112 detects a character string describing the image data, the alternative character string extraction unit 113 extracts the alternative character string (S406). In an HTML document, the character string describing image data is given by the alt attribute.

In S407, the watermark information embedding unit 116 embeds the extracted alternative character string in the corresponding image data as a digital watermark. In S408, the document print control unit 115 generates print data for the image data in which the digital watermark was embedded in S407.

When the above processing is applied to the electronic document shown in FIG. 2, print data is generated for an image in which the alternative character string "Access Map Shin-Yokohama" is embedded as a digital watermark in the image data map1.jpg described at 201. In the print example shown in FIG. 3, the alternative character string is thus watermarked into the region 302.

If the description being processed is not an image (NO in S402), the document structure analysis unit 112 analyzes it according to the description of the electronic document, in the same way as in ordinary HTML document analysis (S404). Based on the analysis result of S404, the document print control unit 115 generates print data (S410).

If there is no description specifying an alternative character string for the image (NO in S405), the document print control unit 115 generates, in S409, print data for the image data acquired in S403.

When all processing has finished for the electronic document acquired in S401 (YES in S411), the print data generated by the document print control unit 115 is transferred to the printing unit 102 and printed (S412), and the processing ends.
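The branching of FIG. 4 described above can be summarized in a short sketch. It is only an outline under stated assumptions: processing units are modeled as dictionaries, and `build_print_data`, `fetch_image`, and `embed_watermark` are hypothetical names standing in for the document print control unit, the image acquisition of S403, and the watermark embedding of S407. The step numbers in the comments follow the text.

```python
def build_print_data(units, fetch_image, embed_watermark):
    """Walk the parsed units of a document and assemble print data.

    units: list of dicts, either {"kind": "image", "src": ..., "alt": ...}
           or {"kind": "text", "text": ...} (a modeling assumption).
    """
    print_data = []
    for unit in units:                                   # repeated until S411 is YES
        if unit["kind"] == "image":                      # S402: link to image data?
            image = fetch_image(unit["src"])             # S403: acquire the image
            alt = unit.get("alt")                        # S405: alt string described?
            if alt is not None:
                image = embed_watermark(image, alt)      # S406-S407: extract and embed
            print_data.append(("image", image))          # S408 (marked) or S409 (plain)
        else:
            print_data.append(("text", unit["text"]))    # S404 and S410
    return print_data                                    # handed to the printer in S412


# Stand-in callables so the sketch runs without real images.
def fake_fetch(src):
    return "<" + src + ">"


def fake_embed(image, alt):
    return image + "+wm(" + alt + ")"


units = [
    {"kind": "text", "text": "Access guide"},
    {"kind": "image", "src": "map1.jpg", "alt": "Access Map Shin-Yokohama"},
    {"kind": "image", "src": "logo.png"},
]
result = build_print_data(units, fake_fetch, fake_embed)
print(result)
```

Passing the fetch and embed steps in as callables keeps the control flow separate from the choice of watermarking technique, which the text leaves open.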

(Description of the flow for scanning and reading a print document aloud)
Next, the operation of the print document processing unit 120, which acquires a print document by scanning it and reads the content of the scanned document aloud using generated speech, is described with reference to the flowchart of FIG. 5.

First, in S501, the scan image acquisition unit 121 acquires the scan image produced when the scanning unit 103 scans the print document. Next, the scan image processing unit 122 performs image processing on the acquired scan image and divides it into image regions and text regions (S502).

In the subsequent steps S503 to S510, the voice generation unit 123 generates voice data for each of the divided regions.

If the region being processed by the scan image processing unit 122 is an image region (YES in S503), the watermark information extraction unit 124 extracts watermark information from the image (S504). If the extracted information is an alternative character string (YES in S506), the voice generation unit 123 generates voice data that reads the extracted alternative character string aloud (S507). If no watermark information can be extracted, or if the extracted watermark information is not an alternative character string (NO in S506), voice data for a predetermined message is generated in S508. The predetermined message preferably notifies the listener that an image is present, for example "This is an image".

If the region being processed by the scan image processing unit 122 is a text region (NO in S503), the scan image processing unit 122 performs character recognition on that region of the scan image (S505). The voice generation unit 123 then generates voice data for the recognized character string (S509).

The processing of S503 to S509 is repeated until all regions have been handled. When processing has finished for all regions (YES in S510), the voice data generated by the voice generation unit 123 is output from the audio output unit 104, and the processing ends.
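The per-region decisions of FIG. 5 can be outlined in the same way. This is a sketch under assumptions, not the apparatus itself: regions are modeled as dictionaries, the function returns the strings to be spoken rather than audio, and `build_speech_script`, `extract_watermark`, and `recognize_text` are hypothetical names standing in for the voice generation unit, the watermark extraction of S504, and the character recognition of S505.

```python
def build_speech_script(regions, extract_watermark, recognize_text,
                        fallback="This is an image"):
    """Return the list of strings to read aloud, one per region, in page order.

    regions: list of dicts {"kind": "image" | "text", "data": ...}
             (a modeling assumption of this sketch).
    """
    script = []
    for region in regions:                                 # repeated until S510 is YES
        if region["kind"] == "image":                      # S503: image region?
            alt = extract_watermark(region["data"])        # S504: try to read the watermark
            if alt:                                        # S506: usable alt string?
                script.append(alt)                         # S507: read the alt string
            else:
                script.append(fallback)                    # S508: predetermined message
        else:
            script.append(recognize_text(region["data"]))  # S505 and S509
    return script


# Stand-in callables: the "scan data" here is already text, for illustration only.
def fake_extract(data):
    return data.get("watermark")


def fake_ocr(data):
    return data["text"]


regions = [
    {"kind": "text", "data": {"text": "Access guide"}},
    {"kind": "image", "data": {"watermark": "Access Map Shin-Yokohama"}},
    {"kind": "image", "data": {}},
]
script = build_speech_script(regions, fake_extract, fake_ocr)
print(script)
# ['Access guide', 'Access Map Shin-Yokohama', 'This is an image']
```

Treating both a missing watermark and an empty string as "no alternative character string" keeps the fallback message in one place.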

The embodiment above describes a configuration in which a single information processing apparatus contains both the electronic document processing unit 110 and the print document processing unit 120. However, the present invention is also applicable when the electronic document processing unit 110 and the print document processing unit 120 are distributed across separate information processing apparatuses.

In addition, when the voice generation unit 123 generates speech for the alternative character string in S507 of FIG. 5, simply reading the alternative character string aloud may make it hard for the listener to distinguish it from the surrounding text. The voice generation unit 123 may therefore add a message indicating that an image is present, such as "There is an image of XX" (where XX is the alternative character string).

Furthermore, although printing and voice output were described, at S411 of FIG. 4 and S511 of FIG. 5, as being performed after the entire document has been processed, printing or voice output may instead be performed incrementally each time a certain amount of processing has finished.

As described above, the information processing apparatus of this embodiment can retain the alternative character string of an image described in the original electronic document throughout the process of printing and scanning the electronic document. Therefore, when a visually impaired person scans a printout of an electronic document and checks its content by voice, the alternative character string of the image is not lost, so information equivalent to that available from the original electronic document can be obtained, which improves convenience. Furthermore, because the alternative character string need not be printed directly, the same document layout can be maintained between the displayed electronic document and the printout.

<Embodiment 2>
Embodiment 1 above describes the case in which print data is generated while the electronic document acquired by the electronic document acquisition unit 111 is being analyzed. Alternatively, the apparatus may be configured to display the content of the electronic document on the display unit 101 through similar processing and then print the content displayed on the display unit 101 all at once. The operation of the information processing apparatus in this case is described with reference to the flowchart of FIG. 6.

(Description of the watermark embedding flow)
First, in S601, the electronic document acquisition unit 111 acquires an electronic document. Next, in S602 to S611, the document structure analysis unit 112 analyzes the acquired electronic document and obtains its structure, and the document display control unit 114 generates display data for displaying the electronic document based on the analysis result. As in Embodiment 1, the description of the acquired electronic document is analyzed one predetermined processing unit at a time.

If the description being processed is one that displays an image (YES in S602), the electronic document acquisition unit 111 acquires the described image data (S603). The acquired image data is temporarily held in the information processing apparatus 100. If an alternative character string corresponding to the image is described (YES in S605), the alternative character string extraction unit 113 extracts the alternative character string (S606). Then, in S607, the watermark information embedding unit 116 embeds the extracted alternative character string in the corresponding image data as a digital watermark. In the following S608, the document display control unit 114 generates display data for the image data in which the digital watermark was embedded in S607.

If the description being processed is not an image (NO in S602), the document structure analysis unit 112 analyzes it in S604 according to the description of the electronic document, in the same way as in ordinary HTML document analysis. The document display control unit 114 generates display data based on the analysis result of S604 (S610).

If there is no description specifying an alternative character string for the image (NO in S605), the document display control unit 114 generates, in S609, display data for the image data acquired in S603.

When all processing has finished for the electronic document acquired in S601 (YES in S611), the document display control unit 114 transfers the generated display data to the display unit 101, which displays it. Then, in S613, the document print control unit 115 generates print data based on the content displayed on the display unit 101, and in S614 the printing unit 102 prints it and the processing ends.

As described above, with the information processing apparatus of Embodiment 2, even when a screenshot of a screen or window is printed, the printout retains the alternative character string of each image, which improves convenience.

<Embodiment 3>
Embodiments 1 and 2 described the case where an alternative character string is embedded in an image as a digital watermark. However, when an image used in the electronic document is small and can carry only a small amount of watermark information, or when the alternative character string is long, it may not be possible to embed the alternative character string in the image. To handle such cases, Embodiment 3 embeds the alternative character string information in the document as a whole. The information processing apparatus of this embodiment can be realized with the same configuration as the information processing apparatus 100 of Embodiments 1 and 2. The operation of the information processing apparatus in this embodiment is described below with reference to the flowchart of FIG. 7.
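
To illustrate why a small image may be unable to carry its alternative character string, the following sketch compares the payload size with an assumed embedding capacity. The text does not specify a watermarking scheme, so the figure of one payload bit per `pixels_per_bit` pixels, the UTF-8 encoding, and both function names are assumptions made purely for illustration.

```python
def capacity_bytes(width: int, height: int, pixels_per_bit: int = 64) -> int:
    """Assumed capacity: one payload bit per `pixels_per_bit` pixels."""
    return (width * height // pixels_per_bit) // 8


def fits_in_image(alt_text: str, width: int, height: int) -> bool:
    """True if the UTF-8 encoded alternative string fits the assumed capacity."""
    return len(alt_text.encode("utf-8")) <= capacity_bytes(width, height)


# A 32x32 icon: 1024 pixels -> 16 bits -> 2 bytes of assumed capacity
print(fits_in_image("Access Map Shin-Yokohama", 32, 32))
# A 320x240 image: 76800 pixels -> 1200 bits -> 150 bytes of assumed capacity
print(fits_in_image("Access Map Shin-Yokohama", 320, 240))
```

When such a check fails, embedding only a short identifier in the image and carrying the full string elsewhere in the document, as this embodiment does, becomes the more practical choice.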

(Description of the watermark embedding flow)
First, in S701, the electronic document acquisition unit 111 acquires an electronic document. Next, in S702 to S712, the document structure analysis unit 112 analyzes the acquired electronic document to obtain its structure, and the document print control unit 115 generates print data for printing the electronic document based on the analysis result. In S702, when the description being processed is one that displays an image, such as 201 in FIG. 2 (YES in S702), the electronic document acquisition unit 111 acquires the image data referenced by the description (S703). If an alternative character string corresponding to the image is described (YES in S705), the alternative character string extraction unit 113 extracts it (S706) and stores the correspondence between the image and the alternative character string in the information processing apparatus (S707). To record this correspondence, identification information unique to the image is assigned, and the correspondence between that identification information and the alternative character string is stored. The identification information may be, for example, a number assigned in the order in which images are acquired during analysis of the electronic document, or the image file name. Then, in S708, the watermark information embedding unit 116 embeds the identification information unique to the image in the image as a digital watermark. In S709, the document print control unit 115 generates print data for the image data in which the digital watermark was embedded in S708.

If the description being processed in S702 is not an image (NO in S702), the document structure analysis unit 112 analyzes the document in S704 according to the electronic document's description, in the same way as ordinary HTML document analysis. The document print control unit 115 generates print data based on the analysis result of S704 (S711).

If there is no description specifying an alternative character string for the image (NO in S705), the document print control unit 115 generates print data for the image data acquired in S703 (S710).

When all processing of the electronic document acquired in S701 is complete (YES in S712), the watermark information embedding unit 116 embeds the stored correspondence between images and alternative character strings in the document as a digital watermark. The document print control unit 115 then generates print data for the document (S713), transfers the print data to the printing unit 102, and has it printed (S714), which ends the process.
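
The bookkeeping in S706 to S708 and the document-level payload can be sketched as follows. This is only an illustration under stated assumptions: `build_correspondence` is a hypothetical helper, sequential numbers are used as the identification information (one of the options the text mentions), and JSON is an assumed serialization that the text does not prescribe. The watermark embedding itself is not shown.

```python
import json


def build_correspondence(images):
    """Assign sequential IDs to images that have alt text (S706, S707).

    `images` is a list of (file_name, alt_text_or_None) in document order.
    Returns (per_image_ids, table): the ID to embed in each image, or None
    when the image has no alternative string, and the ID -> alt-text table.
    """
    per_image_ids = []
    table = {}
    next_id = 1
    for _file_name, alt in images:
        if alt is None:
            per_image_ids.append(None)  # printed without a watermark (S710)
            continue
        per_image_ids.append(next_id)   # ID to embed in the image (S708)
        table[next_id] = alt
        next_id += 1
    return per_image_ids, table


ids, table = build_correspondence(
    [("map.png", "Access Map Shin-Yokohama"), ("line.png", None), ("logo.png", "Company logo")]
)
# Document-level payload; JSON is an assumed serialization, not the patent's
payload = json.dumps(table, ensure_ascii=False)
print(ids)
print(payload)
```

Each image then needs to carry only a small number, while the longer strings travel in the document-level watermark.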

(Extraction of alternative character strings and voice output)
Next, the operation of scanning a document in which the alternative character strings are embedded in the document as a whole, and conveying its contents by voice, is described with reference to the flowchart of FIG. 8.

First, in S801, the scan image acquisition unit 121 acquires the scan image obtained when the scanning unit 103 scans the printed document.

Next, in S802, the watermark information extraction unit 124 extracts the watermark information embedded in the document. If the extracted watermark information contains the correspondence between image identification information and alternative character strings (YES in S803), the correspondence is stored (S804). If no watermark information could be extracted, or if the extracted watermark information does not contain such a correspondence (NO in S803), the process proceeds to S805.

In S805, the scan image processing unit 122 performs image processing on the acquired scan image and divides it into image regions and text regions.

In the subsequent steps S806 to S813, the voice generation unit 123 generates voice data for each of the divided regions.

In S806, when the region being processed by the scan image processing unit 122 is an image region (YES in S806), the watermark information extraction unit 124 extracts watermark information from that image (S807). If the extracted information is image identification information (YES in S809), the voice generation unit 123 uses the stored correspondence to generate voice data for the alternative character string that corresponds to the extracted identification information (S810).

If watermark information cannot be extracted in S809, or if the extracted watermark information is not image identification information, voice data for a predetermined message announcing that an image is present is generated in S811.

In S806, when the region being processed by the scan image processing unit 122 is a text region (NO in S806), the scan image processing unit 122 performs character recognition on that region of the scan image (S808). The voice generation unit 123 then generates voice data for the recognized character string (S812).

The scan image processing described in S806 to S812 is repeated until all regions have been handled. When all regions have been processed (YES in S813), the voice data generated by the voice generation unit 123 is output from the voice output unit 104, and the process ends.
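
The region loop of S806 to S813 can be sketched as follows. Region segmentation, character recognition, watermark extraction, and speech synthesis are outside the scope of this illustration and are represented by plain values. `build_utterances` and `IMAGE_NOTICE` are hypothetical names, and falling back to the notice when an identifier is missing from the stored table is an assumption of this sketch.

```python
IMAGE_NOTICE = "There is an image."  # hypothetical fallback message (S811)


def build_utterances(regions, table):
    """Return the strings to be spoken, one per region, in reading order.

    `regions` is a list of ("text", recognized_text) or ("image", extracted_id);
    extracted_id is None when no watermark could be extracted from the image.
    `table` maps identification information to alternative character strings.
    """
    utterances = []
    for kind, value in regions:
        if kind == "text":
            utterances.append(value)          # recognized text (S808, S812)
        elif value in table:
            utterances.append(table[value])   # look up alt text by ID (S810)
        else:
            utterances.append(IMAGE_NOTICE)   # no usable watermark (S811)
    return utterances


table = {1: "Access Map Shin-Yokohama"}
regions = [("text", "How to reach us"), ("image", 1), ("image", None)]
print(build_utterances(regions, table))
```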

As described above, the information processing apparatus of Embodiment 3 can embed the information about alternative character strings as a digital watermark in the document rather than in the images. Compared with embedding in an image, this makes it possible to embed alternative character strings effectively when the image is small or when the alternative character string to be embedded is long.

In addition, with the operation described in the flowchart of FIG. 8, the document can be scanned and its contents conveyed by voice even when the alternative character string information is embedded in the document itself.

<Embodiment 4>
In Embodiments 1 to 3, the watermark information embedding unit 116 embeds in the image the alternative character string extracted by the alternative character string extraction unit 113. The invention is not limited to this, however: based on the analysis result of the document structure analysis unit 112, other information attached to the image, such as the image file name or the image size, may be embedded together with it.

Also, when embedding the alternative character string, the extracted string need not be embedded as is; other information that is equivalent to it when read aloud may be embedded instead. Examples of such equivalent information are given below.

(1) Shin-Yokohama → シンヨコハマ
(2) Shin-Yokohama → 新横浜
(3) Shin-Yokohama → /SH, I, X, Y, O, K, O, H, A, M, A/

(1) is an example in which an alternative character string written in romaji is converted into a kana string. Katakana is used here, but hiragana may be used instead. (2) is an example of conversion into kanji. (3) is an example of conversion into information that the voice generation unit 123 can accept, in this case a string of symbols representing sounds. Control information needed for generating speech, such as accent information, may be added to it as well. The examples above convert romaji into kana, kanji, or a string of sound symbols, but conversely, when the original alternative character string is written in kana or kanji, information converted into romaji may be used.

Also, when the watermark information embedding unit 116 embeds the alternative character string information and the size of the image prevents all of that information from being embedded, compressed information may be embedded instead. Examples of compressing the information are given below.

(1) Partial omission of the alternative character string
Part of the alternative character string is omitted according to the amount of information that can be embedded, for example shortening "Access Map Shin-Yokohama" to "Access Map". As in this example, only as many whole words as fit may be embedded, starting from the beginning of the string. Alternatively, language processing techniques that summarize a text or extract its important words may be used so that only a summary or the important words of the alternative character string are embedded.
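
The word-by-word omission described above can be sketched as follows. `truncate_by_words` is a hypothetical helper, and measuring the capacity in UTF-8 bytes is an assumption. Splitting on whitespace only suits space-delimited text; Japanese text would need a word segmenter, which is not shown.

```python
def truncate_by_words(alt_text: str, capacity_bytes: int) -> str:
    """Keep as many leading whole words as fit in `capacity_bytes` (UTF-8)."""
    kept = []
    for word in alt_text.split():
        candidate = " ".join(kept + [word])
        if len(candidate.encode("utf-8")) > capacity_bytes:
            break
        kept.append(word)
    return " ".join(kept)


print(truncate_by_words("Access Map Shin-Yokohama", 12))  # Access Map
```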

(2) Conversion to another representation with less information
For example, "Shin-Yokohama" is converted to 「シンヨコハマ」, that is, to another representation that yields equivalent speech when it is later extracted and read aloud. In this example, the amount of information to be embedded is reduced by converting 13 romaji characters into 6 katakana characters.
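
The saving in this example is counted in characters. Whether it also reduces the number of embedded bits depends on how each character is encoded, which the text does not specify. The following check uses only Python's built-in codecs; the three encodings are chosen here as assumptions for comparison.

```python
romaji = "Shin-Yokohama"
katakana = "シンヨコハマ"

print(len(romaji), len(katakana))  # character counts: 13 6
for encoding in ("utf-8", "utf-16-le", "shift_jis"):
    # Encoded size in bytes of each representation under this encoding
    print(encoding, len(romaji.encode(encoding)), len(katakana.encode(encoding)))
```

Under UTF-16 and Shift_JIS the katakana form is smaller in bytes (12 against 26 and 13 respectively), whereas under UTF-8 it is larger (18 against 13). The benefit therefore depends on choosing a payload encoding in which kana are compact.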

<Other embodiments>
Embodiments of the present invention have been described in detail above. The present invention may be applied to a system made up of a plurality of devices, or to an apparatus consisting of a single device.

The present invention is also achieved by supplying a program that realizes the functions of the embodiments described above to a system or apparatus, directly or remotely, and having a computer included in that system or apparatus read and execute the supplied program code.

Accordingly, the program code itself that is installed on a computer in order to realize the functions and processing of the present invention on that computer also realizes the present invention. In other words, the computer program for realizing those functions and processing is itself one aspect of the present invention.

In that case, the program may take any form as long as it has the functions of the program, such as object code, a program executed by an interpreter, or script data supplied to an OS.

Examples of computer-readable recording media for supplying the program include a flexible disk, a hard disk, an optical disk, a magneto-optical disk, an MO, a CD-ROM, a CD-R, and a CD-RW. Other examples include magnetic tape, a non-volatile memory card, a ROM, and a DVD (DVD-ROM, DVD-R).

The program may also be downloaded from a website on the Internet using a browser on a client computer. That is, the computer program of the present invention itself, or a compressed file that includes an automatic installation function, may be downloaded from the website to a recording medium such as a hard disk. The program code constituting the program of the present invention may also be divided into a plurality of files, with each file downloaded from a different website. In other words, a WWW server that lets a plurality of users download the program files for realizing the functions and processing of the present invention on a computer may also be a constituent element of the present invention.

The program of the present invention may also be encrypted, stored on a computer-readable storage medium such as a CD-ROM, and distributed to users. In this case, only users who satisfy predetermined conditions are allowed to download, from a website via the Internet, the key information for decryption. The encrypted program is then decrypted with that key information, executed, and installed on the computer.

The functions of the embodiments described above may also be realized by the computer executing the program it has read. An OS or the like running on the computer may perform part or all of the actual processing based on the instructions of the program. In this case, too, the functions of the embodiments described above can be realized.

Furthermore, the program read from the recording medium may be written to a memory provided on a function expansion board inserted into the computer or in a function expansion unit connected to the computer. A CPU or the like provided on that function expansion board or in that function expansion unit may then perform part or all of the actual processing based on the instructions of the program. The functions of the embodiments described above may also be realized in this way.

FIG. 1 is a diagram showing the functional configuration of the information processing apparatus according to Embodiment 1 of the present invention.
FIG. 2 is a diagram showing an example of an electronic document described in HTML according to Embodiment 1 of the present invention.
FIG. 3 is a diagram showing an example of the display or print result of an electronic document described in HTML according to Embodiment 1 of the present invention.
FIG. 4 is a flowchart illustrating the operation of processing an electronic document in the information processing apparatus of Embodiment 1 of the present invention.
FIG. 5 is a flowchart illustrating the operation of processing a printed document in the information processing apparatus of Embodiment 1 of the present invention.
FIG. 6 is a flowchart illustrating the operation of processing an electronic document in the information processing apparatus of Embodiment 2 of the present invention.
FIG. 7 is a flowchart illustrating the operation of processing an electronic document in the information processing apparatus of Embodiment 3 of the present invention.
FIG. 8 is a flowchart illustrating the operation of processing a printed document in the information processing apparatus of Embodiment 3 of the present invention.

Explanation of symbols

100 Information processing apparatus
101 Display unit
102 Printing unit
103 Scanning unit
104 Voice output unit
110 Electronic document processing unit
111 Electronic document acquisition unit
112 Document structure analysis unit
113 Alternative character string extraction unit
114 Document display control unit
115 Document print control unit
116 Watermark information embedding unit
120 Print document processing unit
121 Scan image acquisition unit
122 Scan image processing unit
123 Voice generation unit
124 Watermark information extraction unit
191 Electronic document
192 Print document

Claims

An information processing apparatus comprising:
analysis means for analyzing a structured document;
acquisition means for acquiring image data when the analysis means detects a link to the image data; and
embedding means for embedding, when the analysis means detects a character string describing the image data, the character string as a digital watermark in the image data acquired by the acquisition means.

The information processing apparatus according to claim 1, further comprising print control means for generating print data for the image data in which the character string has been embedded by the embedding means.

The information processing apparatus according to claim 1, further comprising display control means for generating display data of image data in which the character string is embedded by the embedding means.

The information processing apparatus according to claim 3, further comprising print control means for generating print data based on the display contents produced from the display data generated by the display control means.

The information processing apparatus according to claim 1, wherein the structured document is a document described in HTML.

The information processing apparatus according to claim 5, wherein the analysis means detects the character string from the description of an alt attribute in the document described in HTML.

The information processing apparatus according to claim 1, wherein the embedding means converts the notation of the character string detected by the analysis means into any one of katakana, kanji, romaji, and a character string representing sounds, and embeds the converted character string in the image data as a digital watermark.

The information processing apparatus according to claim 1, wherein the embedding means compresses the amount of information of the character string detected by the analysis means according to the size of the image data acquired by the acquisition means, and embeds the compressed character string in the image data as a digital watermark.

An information processing apparatus comprising:
reading means for reading a printed document produced from the print data generated by the information processing apparatus according to claim 2; and
extraction means for extracting, from the reading result of the reading means, a character string embedded as a digital watermark.

The information processing apparatus according to claim 9, further comprising voice generation means for generating voice data based on the character string extracted by the extraction means.

The information processing apparatus according to claim 10, wherein, if the extraction means cannot extract the character string, the voice generation means generates voice data notifying that the reading result of the reading means contains an image.

An information processing method comprising:
an analysis step in which analysis means analyzes a structured document;
an acquisition step in which acquisition means acquires image data when a link to the image data is detected in the analysis step; and
an embedding step in which embedding means embeds, when a character string describing the image data is detected in the analysis step, the character string as a digital watermark in the image data acquired by the acquisition means.

A program for causing a computer to function as the information processing apparatus according to any one of claims 1 to 11.

A computer-readable storage medium storing the program according to claim 13.