JP2020060963A

JP2020060963A - Apparatus and method for information processing, and program

Info

Publication number: JP2020060963A
Application number: JP2018191996A
Authority: JP
Inventors: 欽也本田; Kinya Honda
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2018-10-10
Filing date: 2018-10-10
Publication date: 2020-04-16

Abstract

To provide an apparatus for information processing adapted to improve the convenience for a visually handicapped user.SOLUTION: An apparatus for information processing has acquiring means that is effectively set with a screen reader function and if a print target file is designated, acquires analysis information of the print target file and control means that performs control in a manner to audibly output the analysis information acquired by the acquiring means.SELECTED DRAWING: Figure 8

Description

本発明は、情報処理装置、情報処理方法及びプログラムに関する。 The present invention relates to an information processing device, an information processing method, and a program.

米国では電子・電気技術に対して、視覚障がい者にも使いやすくすることを義務づけられており、アクセシビリティを高めることが要望されてきている。また、日本でも同様に、アクセシビリティを高めることが要望されてきている。
これに対して、スクリーンリーダ機能が搭載されたスマートフォンが登場している。スクリーンリーダ機能とは、視覚障がい者が画面を操作するために、情報を音声で読み上げることによって操作を補助する機能である。例えば、ｉＰｈｏｎｅ（登録商標）には、ＶｏｉｃｅＯｖｅｒというスクリーンリーダ機能が搭載されている。
図１６は、スクリーンリーダ機能の設定画面の一例を示している。図１６の例では、領域１６０１へのタップ操作を介して、スクリーンリーダ機能のＯＮ／ＯＦＦが設定される。
印刷やスキャンに関連するデバイスやアプリケーションにおいても、アクセシビリティを強化して、視覚障がい者でも印刷を可能にすることが求められている。これに対して、特許文献１には、複写機等の原稿読み取り装置において、スキャンした原稿の種別を音声で伝えることで、視覚障がい者でも原稿の概要を知ることを可能にする技術が開示されている。 In the United States, electronic and electrical technologies are obliged to make them easier to use for people with visual impairments, and there is a demand for improved accessibility. Similarly, in Japan, there is a demand for enhancing accessibility.
On the other hand, smartphones equipped with a screen reader function have appeared. The screen reader function is a function for assisting the operation by reading information by voice so that the visually impaired person can operate the screen. For example, iPhone (registered trademark) has a screen reader function called VoiceOver.
FIG. 16 shows an example of a screen for setting the screen reader function. In the example of FIG. 16, ON / OFF of the screen reader function is set through a tap operation on the area 1601.
There is a demand for enhancing accessibility in devices and applications related to printing and scanning so that even visually impaired people can print. On the other hand, Patent Document 1 discloses a technique in which a document reading device such as a copying machine transmits the type of a scanned document by voice so that even a visually impaired person can know the outline of the document. ing.

特開２００５−２６０６５５号公報JP, 2005-260655, A

視覚障がい者のユーザが印刷機能を利用する場合、スクリーンリーダ機能により読み上げられるファイル名称の音声情報等を元にファイルを選択していた。しかしながら、ファイル名だけでは印刷したいファイルが正しく選択されているか否かをユーザが判断できない場合がある。例えば、ファイル名が日付ベースでつけられる場合、「２０１８０５２１１１３０００．ｐｄｆ」というファイル名のファイルが複数作成されることがある。この場合、ファイル名が読み上げられても、ユーザは、コンテンツの内容を把握できない可能性がある。また、印刷プレビューの画面が表示されたとしても、目視が困難なユーザは、確認できない恐れもある。
このように、視覚障がい者のユーザにとって、単にファイル名称を読み上げるだけでは、印刷するファイルが正しく選択されているか否かを判断することが困難な場合がある。
本発明は上述の問題点の少なくとも１つを鑑みなされたものである。本発明は、視覚障がい者のユーザにとっての利便性を向上させることを目的の１つとする。 When a user who is visually impaired uses the print function, he or she selects the file based on the voice information of the file name read by the screen reader function. However, the user may not be able to determine whether or not the file to be printed is correctly selected only by the file name. For example, when the file name is given on a date basis, a plurality of files with the file name “20120521113000.pdf” may be created. In this case, even if the file name is read aloud, the user may not be able to grasp the details of the content. Further, even if the print preview screen is displayed, there is a possibility that the user who has difficulty in visual confirmation cannot confirm the screen.
As described above, it may be difficult for the visually impaired user to determine whether or not the file to be printed is correctly selected by merely reading the file name.
The present invention has been made in view of at least one of the above problems. One of the objects of the present invention is to improve the convenience for the visually impaired user.

本発明の情報処理装置は、スクリーンリーダ機能が有効であり、印刷対象ファイルが指定された場合、前記印刷対象ファイルの解析情報を取得する取得手段と、前記取得手段により取得された前記解析情報を音声出力するよう制御する制御手段と、を有する。 In the information processing apparatus of the present invention, when the screen reader function is effective and a print target file is designated, an acquisition unit that acquires analysis information of the print target file and the analysis information acquired by the acquisition unit are displayed. And a control means for controlling the audio output.

本発明の１つの側面によれば、視覚障がい者のユーザにとっての利便性を向上させることができる。 According to one aspect of the present invention, it is possible to improve convenience for a visually impaired user.

データ処理システムのシステム構成の一例を示す図である。It is a figure which shows an example of the system configuration of a data processing system. データ処理装置のハードウェア構成の一例を示す図である。It is a figure which shows an example of the hardware constitutions of a data processing apparatus. 印刷処理装置のハードウェア構成の一例を示す図である。3 is a diagram illustrating an example of a hardware configuration of a print processing apparatus. FIG. データ処理装置の機能構成の一例を示す図である。It is a figure showing an example of functional composition of a data processor. トップメニュー画面の一例を示す図である。It is a figure which shows an example of a top menu screen. ファイル選択画面の一例を示す図である。It is a figure which shows an example of a file selection screen. 印刷プレビュー画面の一例を示す図である。It is a figure which shows an example of a print preview screen. データ処理装置の処理の一例を示すフローチャートである。It is a flow chart which shows an example of processing of a data processor. ＰＤＦの構造の一例を説明する図である。It is a figure explaining an example of the structure of PDF. 文字認識処理の一例を説明する図である。It is a figure explaining an example of character recognition processing. データ処理装置の処理の一例を示すフローチャートである。It is a flow chart which shows an example of processing of a data processor. 印刷画像の一例を示す図である。It is a figure which shows an example of a print image. 文字属性情報の一例を示す図である。It is a figure which shows an example of character attribute information. データ処理装置の処理の一例を示すフローチャートである。It is a flow chart which shows an example of processing of a data processor. データ処理装置の処理の一例を示すフローチャートである。It is a flow chart which shows an example of processing of a data processor. スクリーンリーダ機能の設定画面の一例を示す図である。It is a figure which shows an example of the setting screen of a screen reader function.

以下に、本発明の実施の形態の一例を、図面に基づいて詳細に説明する。 Hereinafter, an example of an embodiment of the present invention will be described in detail with reference to the drawings.

＜実施形態１＞
図１は、本実施形態のデータ処理システムのシステム構成の一例を示す図である。図１の例では、データ処理システムは、データ処理装置１０１、無線ＬＡＮターミナル１０２、印刷処理装置１０４、印刷処理装置１０５を含む。データ処理装置１０１、無線ＬＡＮターミナル１０２、印刷処理装置１０４、印刷処理装置１０５は、ＬＡＮ１０３を介して相互に通信可能に接続されている。
データ処理装置１０１は、印刷処理装置１０４又は１０５に対して印刷ジョブ送信の指示等を行う情報処理装置である。データ処理装置１０１は、例えば、スマートフォン、タブレット装置、パーソナルコンピュータ（ＰＣ）、ノートＰＣ等である。本実施形態では、データ処理装置１０１は、モバイル形態の情報処理装置である。しかし、データ処理装置１０１は、モバイル形態でない情報処理装置であることとしてもよい。
印刷処理装置１０４は、プリンタ機能、コピー機能、スキャナ機能、ファクス送信機能等を備えるプリンタ、複合機等の印刷処理装置である。印刷処理装置１０５は、印刷処理装置１０４と同様に、プリンタ機能、コピー機能、スキャナ機能、ファクス送信機能等を備えるプリンタ、複合機等の印刷処理装置である。 <Embodiment 1>
FIG. 1 is a diagram showing an example of the system configuration of the data processing system of the present embodiment. In the example of FIG. 1, the data processing system includes a data processing device 101, a wireless LAN terminal 102, a print processing device 104, and a print processing device 105. The data processing apparatus 101, the wireless LAN terminal 102, the print processing apparatus 104, and the print processing apparatus 105 are communicably connected to each other via the LAN 103.
The data processing apparatus 101 is an information processing apparatus that gives a print job transmission instruction to the print processing apparatus 104 or 105. The data processing device 101 is, for example, a smartphone, a tablet device, a personal computer (PC), a notebook PC, or the like. In the present embodiment, the data processing device 101 is a mobile information processing device. However, the data processing device 101 may be an information processing device that is not in a mobile form.
The print processing apparatus 104 is a print processing apparatus such as a printer or a multifunction peripheral having a printer function, a copy function, a scanner function, and a fax transmission function. Like the print processing apparatus 104, the print processing apparatus 105 is a print processing apparatus such as a printer or a multifunction peripheral having a printer function, a copy function, a scanner function, a fax transmission function, and the like.

ＬＡＮ１０３は、データ処理システムの各装置が接続されているＬＡＮである。無線ＬＡＮターミナル１０２は、ネットワーク・ルーター機能を有した無線ＬＡＮの親機であって、ＬＡＮ１０３の設置場所の中でＷｉ−Ｆｉを通じた無線ＬＡＮ機能の実現のため用いられる。
また、データ処理装置１０１は、モバイル端末であることから、Ｗｉ−Ｆｉ機能を有効にすることで、無線ＬＡＮターミナル１０２を介してＬＡＮ１０３に参加することができる。データ処理装置１０１は、無線ＬＡＮターミナル１０２が提供する無線ＬＡＮエリアに入ると、予め設定していた認証情報を利用してＬＡＮ１０３のネットワークに参加することができる。
無線信号１０６、１０７、１０８それぞれは、データ処理装置１０１と、印刷処理装置１０４又は１０５と、が送受信するＢｌｕｅｔｏｏｔｈ（登録商標）ＬＥ（ＢｌｕｅｔｏｏｔｈＬｏｗＥｎｅｇｙ）による無線信号である。この無線信号が到達し合う周辺のコンピューターデバイス間においては、ＷＰＡＮ（ＷｉｒｅｌｅｓｓＰｅｒｓｏｎａｌＡｒｅａＮｅｔｗｏｒｋ）を形成し通信を行うことができる。 The LAN 103 is a LAN to which each device of the data processing system is connected. The wireless LAN terminal 102 is a parent device of a wireless LAN having a network router function, and is used in the installation place of the LAN 103 for realizing the wireless LAN function through Wi-Fi.
Since the data processing device 101 is a mobile terminal, the data processing device 101 can participate in the LAN 103 via the wireless LAN terminal 102 by enabling the Wi-Fi function. When the data processing apparatus 101 enters the wireless LAN area provided by the wireless LAN terminal 102, it can participate in the network of the LAN 103 using the authentication information set in advance.
The wireless signals 106, 107, and 108 are wireless signals based on Bluetooth (registered trademark) LE (Bluetooth Low Energy) transmitted and received by the data processing apparatus 101 and the print processing apparatus 104 or 105. A WPAN (Wireless Personal Area Network) can be formed and communicated between peripheral computer devices that the wireless signals reach each other.

図２は、データ処理装置１０１のハードウェア構成の一例を示す図である。本実施形態では、データ処理装置１０１は、小型端末用のオペレーティングシステムや、通話、データ通信を制御するためのプログラムの実行を行う。
データ処理装置１０１は、ＣＰＵ２０２、ＲＯＭ２０３、ＲＡＭ２０４、ＮｅｔｗｏｒｋＣｏｎｔｒｏｌｌｅｒ２０５、音声制御部２０６、表示制御部２０７、入力制御部２０８、記憶装置２０９、位置検出制御部２１０を含む。各要素は、システムバス２０１を介して相互に通信可能に接続されている。
ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）２０２は、データ処理装置１０１を制御する中央演算装置である。ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）２０３は、データ処理装置１０１のオペレーティングシステム、通話、データ通信等を制御するアプリケーションのプログラム等を記憶する記憶装置である。データ通信を制御するアプリケーションとは、例えば、印刷アプリケーション、Ｍａｉｌソフト、Ｗｅｂブラウザ等である。ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）２０４は、ＣＰＵ２０２のワークメモリやデータの一時的な記憶領域として機能する記憶装置である。ＣＰＵ２０２は、実行対象のプログラムを実行する際に、そのプログラムをＲＡＭ２０４に展開する。 FIG. 2 is a diagram illustrating an example of a hardware configuration of the data processing device 101. In the present embodiment, the data processing device 101 executes an operating system for a small terminal and a program for controlling a telephone call and data communication.
The data processing device 101 includes a CPU 202, a ROM 203, a RAM 204, a network controller 205, a voice control unit 206, a display control unit 207, an input control unit 208, a storage device 209, and a position detection control unit 210. The respective elements are connected via a system bus 201 so that they can communicate with each other.
A CPU (Central Processing Unit) 202 is a central processing unit that controls the data processing device 101. A ROM (Read Only Memory) 203 is a storage device that stores an operating system of the data processing device 101, an application program that controls a call, data communication, and the like. The application that controls data communication is, for example, a print application, Mail software, a web browser, or the like. A RAM (Random Access Memory) 204 is a storage device that functions as a work memory of the CPU 202 or a temporary storage area for data. When executing the program to be executed, the CPU 202 expands the program in the RAM 204.

ＮｅｔｗｏｒｋＣｏｎｔｒｏｌｌｅｒ２０５は、外部の装置との間でのデータの通信に用いられるコントローラである。ＮｅｔｗｏｒｋＣｏｎｔｒｏｌｌｅｒ２０５は、ＬＡＮ通信部２１１と、電話データ通信部２１２と、ＢＬＥ通信部２１３と、を含む。
ＬＡＮ通信部２１１は、無線ＬＡＮターミナル１０２を介して、ＬＡＮ１０３のネットワークへの参加に用いられる。電話データ通信部２１２は、携帯キャリアの提供するネットワークへの参加に用いられる。ＢＬＥ通信部２１３は、ＢｌｕｅｔｏｏｔｈＬＥによる無線信号が到達し合う周辺のコンピューターデバイス間においてＷＰＡＮを形成するために用いられる。
ＮｅｔｗｏｒｋＣｏｎｔｒｏｌｌｅｒ２０５は、例えば、無線ＬＡＮのネットワークに参加可能な場合、無線ＬＡＮへの接続を優先する。そして、ＮｅｔｗｏｒｋＣｏｎｔｒｏｌｌｅｒ２０５は、データ処理装置１０１が無線ＬＡＮのネットワークエリアから外れた場合、携帯キャリアが提供する無線通信ネットワークへの参加を行うような排他制御を行う。しかし、ＮｅｔｗｏｒｋＣｏｎｔｒｏｌｌｅｒ２０５は、ＢｌｕｅｔｏｏｔｈＬＥの通信を行う場合、他の通信との間で、排他制御を行わない。 The NetworkController 205 is a controller used for data communication with an external device. The network controller 205 includes a LAN communication unit 211, a telephone data communication unit 212, and a BLE communication unit 213.
The LAN communication unit 211 is used to participate in the network of the LAN 103 via the wireless LAN terminal 102. The telephone data communication unit 212 is used to participate in a network provided by a mobile carrier. The BLE communication unit 213 is used to form a WPAN between peripheral computer devices that wireless signals by Bluetooth LE reach each other.
For example, when the network controller 205 can participate in the wireless LAN network, the network controller 205 gives priority to the connection to the wireless LAN. Then, the Network Controller 205 performs exclusive control such that the data processing apparatus 101 participates in the wireless communication network provided by the mobile carrier when the data processing apparatus 101 is out of the wireless LAN network area. However, the NetworkController 205 does not perform exclusive control with other communication when performing Bluetooth LE communication.

音声制御部２０６は、マイク・スピーカ２１４を介した音声データの入出力に用いられる制御部である。音声制御部２０６は、例えば、通話アプリケーションが起動しユーザが電話をしているときに用いられる。また、音声制御部２０６は、後述するスクリーンリーダ機能により出力された音声データをスピーカを介して出力する。
表示制御部２０７は、ディスプレイ２１５に出力される情報の制御を行う制御部である。入力制御部２０８は、データ処理装置１０１のボタンやタッチパネル２１６等の入力部を介してユーザにより入力された情報を受け付ける制御部である。
データ処理装置１０１上で実現されるアプリケーションは、音声制御部２０６、表示制御部２０７、入力制御部２０８を利用して、ネットワーク通信情報やデータ処理装置１０１のさまざまな情報をユーザに提供する。 The voice control unit 206 is a control unit used for inputting / outputting voice data via the microphone / speaker 214. The voice control unit 206 is used, for example, when the call application is activated and the user is making a call. Further, the voice control unit 206 outputs the voice data output by the screen reader function described later through the speaker.
The display control unit 207 is a control unit that controls the information output to the display 215. The input control unit 208 is a control unit that receives information input by the user via the buttons of the data processing apparatus 101 or the input unit such as the touch panel 216.
The application implemented on the data processing device 101 uses the voice control unit 206, the display control unit 207, and the input control unit 208 to provide network communication information and various information of the data processing device 101 to the user.

記憶装置２０９は、不揮発性の記憶装置であり、データ処理装置１０１の再起動後も保持しておく必要のある各種動作モード設定や、稼働ログ、各種プログラム、各種設定情報等を記憶する。記憶装置２０９は、例えば、ハードディスクドライブ（ＨＤＤ）、ソリッドステートドライブ（ＳＳＤ）、フラッシュメモリ等である。
位置検出制御部２１０は、ＧＰＳセンサー２１７を介してデータ処理装置１０１の位置情報を取得し、オペレーティングシステムに提供する制御部である。
これらのデータ処理装置１０１において、ＣＰＵ２０２がＲＯＭ２０３、記憶装置２０９等に記憶されたプログラムにしたがって処理を実行することで、図４で後述する機能、図９、１１、１４、１５で後述するフローチャートの処理等が実現される。即ち、ＣＰＵ２０２、ＲＯＭ２０３、記憶装置２０９は、所謂コンピュータとして機能する。なお、複数のプロセッサ、メモリ、及びストレージを協働させ各処理を実行することもできる。また、一部の処理は、ＡＳＩＣ等のハードウェア回路を用いて実行することもできる。 The storage device 209 is a non-volatile storage device, and stores various operation mode settings, operation logs, various programs, various setting information, and the like that need to be retained even after the data processing device 101 is restarted. The storage device 209 is, for example, a hard disk drive (HDD), a solid state drive (SSD), a flash memory, or the like.
The position detection control unit 210 is a control unit that acquires the position information of the data processing device 101 via the GPS sensor 217 and provides it to the operating system.
In these data processing devices 101, the CPU 202 executes the processes in accordance with the programs stored in the ROM 203, the storage device 209, etc., so that the functions described below with reference to FIG. 4 and the flowcharts described below with reference to FIGS. Processing and the like are realized. That is, the CPU 202, the ROM 203, and the storage device 209 function as a so-called computer. It should be noted that a plurality of processors, memories, and storages can cooperate to execute each process. Also, some processing can be executed by using a hardware circuit such as an ASIC.

図３は、印刷処理装置１０４のハードウェア構成の一例を示す図である。本実施形態では、印刷処理装置１０４は、スキャナ機能と、プリンタ機能を有する複合機（ＭＦＰ（ＭｕｌｔｉＦｕｎｃｔｉｏｎＰｅｒｐｈｅｒａｌ））を想定しているがこれに限定されるものではない。読取機能を有さないプリンタ等の印刷処理装置であってもよい。本実施例では、一例として印刷処理装置が以下に説明する各種構成要件を備えるものとする。
印刷処理装置１０４は、Ｉ／Ｏ３０１、Ｉ／Ｆ制御部３０２、ＲＡＭ３０３、ＲＡＭ制御部３０４、画像データ調歩回路３０５、プリンタエンジン３０６、エンジンＩ／Ｆ３０７、メインコントローラ３０８、を含む。また、印刷処理装置１０４は、スキャナコントローラ３０９、プリンタコントローラ３１０、ユーザインターフェース３１２、スキャナエンジン３１３を含む。
Ｉ／Ｏ３０１は、外部の装置との間の接続に用いられるインターフェースである。Ｉ／Ｏ３０１は、ＬＡＮ通信部３１４、ＢＬＥ通信部３１５を含む。ＬＡＮ通信部３１４は、ＬＡＮ１０３等の通信媒介を介して、データ処理装置１０１との間で通信を行う。ＢＬＥ通信部３１５は、ＢｌｕｅｔｏｏｔｈＬＥを用いたＷＰＡＮの形成に用いられる。印刷処理装置１０４は、Ｉ／Ｏ３０１を通して、デバイスＩＤやスキャンイメージをデータ処理装置１０１に送信する。また、印刷処理装置１０４は、Ｉ／Ｏ３０１を通して、データ処理装置１０１から各種の制御コマンドを受信し、受信した制御コマンドに応じた処理を行う。
Ｉ／Ｆ制御部３０２は、印刷処理装置１０４に搭載されているスキャナ、プリンタ、ファクス等のデバイスに対してデバイスＩＤを発行する制御を行う制御部である。ＲＡＭ３０３は、Ｉ／Ｏ３０１を介して取得された制御コマンド等の外部データや、スキャナエンジン３１３で読み取られたイメージのデータ等の一時的な記憶領域として機能する記憶装置である。また、ＲＡＭ３０３は、プリンタコントローラ３１０で展開されたプリンタエンジン３０６に渡される前のイメージの記憶等に用いられる。ＲＡＭ制御部３０４は、ＲＡＭ３０３内の領域の割り当て管理を行う制御部である。
画像データ調歩回路３０５は、ＲＡＭ制御部３０４によりＲＡＭ３０３に展開されたイメージをプリンタエンジン３０６の回転にあわせて出力する装置である。プリンタエンジン３０６は、紙等の出力メディアにイメージを現像する装置である。メインコントローラ３０８は、エンジンＩ／Ｆ３０７を介してプリンタエンジン３０６の各種制御を行うコントローラである。また、メインコントローラ３０８は、スキャナコントローラ３０９やプリンタコントローラ３１０に対して、Ｉ／Ｏ３０１経由でデータ処理装置１０１から受信した制御言語の適切な振り分け処理を行う。更に、メインコントローラ３０８は、それぞれのコントローラやユーザインターフェース３１２からの指示をうけてプリンタエンジン３０６やスキャナエンジン３１３の制御を行う。
スキャナコントローラ３０９は、データ処理装置１０１から送信されたスキャン制御コマンドをメインコントローラ３０８が解釈可能な内部実行命令に分解する。また、スキャナコントローラ３０９は、スキャナエンジン３１３で読み取られたイメージをスキャン制御コマンドに変更する。プリンタコントローラ３１０は、データ処理装置１０１から送信された印刷ジョブとして受けたＰＤＬ（ＰａｇｅＤｅｓｃｒｉｐｔｉｏｎＬａｎｇｕａｇｅ）データを、メインコントローラ３０８が解釈可能な、展開イメージ等を含む内部実行命令に分解する。展開イメージは、プリンタエンジン３０６まで送信され、用紙等の出力メディアに印刷される。
本実施形態では、印刷処理装置１０５のハードウェア構成は、印刷処理装置１０４のハードウェア構成と同様である。 FIG. 3 is a diagram illustrating an example of the hardware configuration of the print processing apparatus 104. In the present embodiment, the print processing apparatus 104 is assumed to be a multifunction peripheral (MFP (Multi Function Peripheral)) having a scanner function and a printer function, but the present invention is not limited to this. It may be a print processing device such as a printer having no reading function. In this embodiment, as an example, the print processing apparatus has various constituent features described below.
The print processing apparatus 104 includes an I / O 301, an I / F control unit 302, a RAM 303, a RAM control unit 304, an image data start / stop circuit 305, a printer engine 306, an engine I / F 307, and a main controller 308. The print processing apparatus 104 also includes a scanner controller 309, a printer controller 310, a user interface 312, and a scanner engine 313.
The I / O 301 is an interface used for connection with an external device. The I / O 301 includes a LAN communication unit 314 and a BLE communication unit 315. The LAN communication unit 314 communicates with the data processing device 101 via a communication medium such as the LAN 103. The BLE communication unit 315 is used to form a WPAN using Bluetooth LE. The print processing apparatus 104 transmits the device ID and the scan image to the data processing apparatus 101 via the I / O 301. The print processing apparatus 104 also receives various control commands from the data processing apparatus 101 through the I / O 301 and performs processing according to the received control commands.
The I / F control unit 302 is a control unit that performs control for issuing a device ID to a device such as a scanner, a printer, or a fax mounted on the print processing apparatus 104. The RAM 303 is a storage device that functions as a temporary storage area for external data such as control commands acquired via the I / O 301 and data for images read by the scanner engine 313. Further, the RAM 303 is used for storing an image before being transferred to the printer engine 306 expanded by the printer controller 310. The RAM control unit 304 is a control unit that manages allocation of areas in the RAM 303.
The image data start / stop circuit 305 is a device that outputs the image developed in the RAM 303 by the RAM control unit 304 in accordance with the rotation of the printer engine 306. The printer engine 306 is a device that develops an image on an output medium such as paper. The main controller 308 is a controller that performs various controls of the printer engine 306 via the engine I / F 307. Further, the main controller 308 performs appropriate distribution processing of the control language received from the data processing apparatus 101 via the I / O 301 to the scanner controller 309 and the printer controller 310. Further, the main controller 308 controls the printer engine 306 and the scanner engine 313 in response to instructions from the respective controllers and the user interface 312.
The scanner controller 309 decomposes the scan control command transmitted from the data processing apparatus 101 into an internal execution command that can be interpreted by the main controller 308. Further, the scanner controller 309 changes the image read by the scanner engine 313 into a scan control command. The printer controller 310 decomposes PDL (Page Description Language) data received as a print job transmitted from the data processing apparatus 101 into an internal execution command that can be interpreted by the main controller 308 and that includes a developed image. The developed image is transmitted to the printer engine 306 and printed on an output medium such as paper.
In this embodiment, the hardware configuration of the print processing apparatus 105 is the same as the hardware configuration of the print processing apparatus 104.

図４は、データ処理装置１０１の機能構成の一例を示す図である。データ処理装置１０１は、ＯＳ４１０、アプリケーション４０１、その他のアプリケーション４０８を含む。
ＯＳ４１０は、データ処理装置１０１の全体を制御するためのＯＳ（ＯｐｅｒａｔｉｎｇＳｙｓｔｅｍ）である。本実施形態のＯＳ４１０は、操作画面に表示されている文字列等の情報を音声で読み上げることによって操作を補助するスクリーンリーダ機能を有している。ＯＳとしてｉＯＳ（登録商標）を採用する場合、スクリーンリーダ機能は、アクセシビリティ向上のためにプリインストールされているＶｏｉｃｅＯｖｅｒ（登録商標）機能によって提供される。また、ＯＳとしてＡｎｄｒｏｉｄ（登録商標）を採用する場合、スクリーンリーダ機能は、アクセシビリティ向上のためにプリインストールされているＴａｌｋＢａｃｋ機能によって提供される。これらのスクリーンリーダ機能は、図１６に一例を示すＯＳ４１０の設定画面から有効（ＯＮ）、無効（ＯＦＦ）を切り替えることができる。これらのスクリーンリーダに関する設定は、記憶装置２０９に記憶される。スクリーンリーダ機能がＯＮに設定されている場合、データ処理装置１０１は操作画面の読み上げを実行する。読み上げは、例えばユーザが表示オブジェクトや表示文字列をタップしたことに従って実行される。また、読み上げは、操作画面が遷移したことに従って実行される。
アプリケーション４０１は、印刷処理を制御するアプリケーションである。アプリケーション４０１は、ＵＩ部４０２、探索部４０３、印刷制御部４０４、解析部４０５、保存制御部４０６を含む。アプリケーション４０１は、アプリケーションストア等からユーザ操作を介してダウンロードされ、データ処理装置１０１にインストールされているものとする。
ＵＩ部４０２は、アプリケーション４０１への情報の入出力を制御するＵＩ部であり、アプリケーション４０１内の設定を変更させるためのユーザインターフェースを提供する。探索部４０３は、データ処理装置１０１が参加するＬＡＮ１０３上において、データ処理装置１０１がＳＮＭＰで管理されるネットワーク機器を探索する。そして、探索部４０３は、探索したネットワーク機器の中から、印刷が実行可能な印刷処理装置を探索する。更に、探索部４０３は、ＢＬＥの送受信が可能なＷＰＡＮ内において、印刷を実行可能な印刷処理装置を探索する。
印刷制御部４０４は、印刷処理装置に対して送信する印刷ジョブを生成し、生成した印刷ジョブを対応する印刷処理装置に送信する。また、解析部４０５は、画像やＰＤＦ等のファイルを解析する。保存制御部４０６は、解析部４０５による解析結果を、記憶装置２０９に記憶する。 FIG. 4 is a diagram illustrating an example of a functional configuration of the data processing device 101. The data processing device 101 includes an OS 410, an application 401, and another application 408.
The OS 410 is an OS (Operating System) for controlling the entire data processing apparatus 101. The OS 410 of the present embodiment has a screen reader function that assists the operation by reading aloud information such as a character string displayed on the operation screen. When the iOS (registered trademark) is adopted as the OS, the screen reader function is provided by the VoiceOver (registered trademark) function preinstalled for improving accessibility. When Android (registered trademark) is adopted as the OS, the screen reader function is provided by the TalkBack function preinstalled for improving accessibility. These screen reader functions can be switched between valid (ON) and invalid (OFF) from the setting screen of the OS 410, an example of which is shown in FIG. The settings relating to these screen readers are stored in the storage device 209. When the screen reader function is set to ON, the data processing device 101 reads the operation screen. The reading is executed according to, for example, the user tapping the display object or the display character string. The reading is executed according to the transition of the operation screen.
The application 401 is an application that controls print processing. The application 401 includes a UI unit 402, a search unit 403, a print control unit 404, an analysis unit 405, and a storage control unit 406. The application 401 is assumed to be downloaded from the application store or the like through a user operation and installed in the data processing apparatus 101.
The UI unit 402 is a UI unit that controls input / output of information to / from the application 401, and provides a user interface for changing settings in the application 401. The search unit 403 searches the LAN 103 to which the data processing apparatus 101 participates for a network device managed by the data processing apparatus 101 by SNMP. Then, the search unit 403 searches for a print processing apparatus capable of executing printing from the searched network devices. Further, the search unit 403 searches for a print processing apparatus capable of executing printing in the WPAN capable of transmitting / receiving BLE.
The print control unit 404 generates a print job to be transmitted to the print processing apparatus, and transmits the generated print job to the corresponding print processing apparatus. The analysis unit 405 also analyzes files such as images and PDFs. The storage control unit 406 stores the analysis result of the analysis unit 405 in the storage device 209.

図５は、アプリケーション４０１が起動された際に、アプリケーション４０１によりディスプレイ２１５に表示されるトップメニュー画面の一例を示す図である。トップメニュー画面は、ユーザからの各種入力操作の受け付けに利用されるユーザインターフェースの一例である。ボタン５０１は、印刷を実行する印刷処理装置を選択するための画面への遷移の指示に用いられるボタンである。アプリケーション４０１は、ボタン５０１の選択を検知すると、ディスプレイ２１５に、印刷を実行する印刷処理装置を選択するための画面を表示する。図５の例では、印刷処理を実行する印刷処理装置として、プリンタが選択されている。ボタン５０２は、印刷対象となるファイルの選択・印刷の実行の指示を行うための画面であるファイル選択画面６０１への遷移の指示に用いられるボタンである。アプリケーション４０１は、ボタン５０２の選択を検知すると、ディスプレイ２１５にファイル選択画面６０１を表示する。
図６は、ファイル選択画面６０１の一例を示す図である。図６の例ではファイル選択画面６０１内に、印刷されるファイルの候補となるファイル６０２〜６０４がリスト形式で表示されている。ユーザは、タッチパネル２１６を介して、ファイル６０２〜６０４の中から印刷したいファイルを指定する。アプリケーション４０１は、タッチパネル２１６、ファイル選択画面６０１を介して、ユーザから印刷対象のファイルの指定を受け付ける。以下では、印刷対象のファイルを、印刷対象ファイルとする。そして、アプリケーション４０１は、印刷対象ファイルの指定を受付けると、ディスプレイ２１５に印刷プレビュー画面７０１を表示する。印刷プレビューとは、印刷対象ファイルの内容の確認に用いられる情報である。印刷プレビュー画面とは、印刷対象ファイルの内容の確認に用いられる画面である。なお、スクリーンリーダ機能が有効な場合、アプリケーション４０１は、ＯＳ４１０が提供するスクリーンリーダ機能と協働しマイク・スピーカ２１４のスピーカを介して、ファイル６０２〜６０４のファイル名を順番に音声で出力する。ここで、スクリーンリーダ機能とは、情報を音声出力することで、ユーザの操作を補助する機能である。 FIG. 5 is a diagram showing an example of a top menu screen displayed on the display 215 by the application 401 when the application 401 is activated. The top menu screen is an example of a user interface used for receiving various input operations from the user. The button 501 is a button used for instructing a transition to a screen for selecting a print processing apparatus that executes printing. Upon detecting the selection of the button 501, the application 401 displays a screen for selecting a print processing apparatus that executes printing on the display 215. In the example of FIG. 5, the printer is selected as the print processing apparatus that executes the print processing. The button 502 is a button used for instructing transition to a file selection screen 601 which is a screen for selecting a file to be printed and instructing execution of printing. When detecting the selection of the button 502, the application 401 displays the file selection screen 601 on the display 215.
FIG. 6 is a diagram showing an example of the file selection screen 601. In the example of FIG. 6, files 602 to 604 that are candidates for files to be printed are displayed in a list format on the file selection screen 601. The user designates a file to be printed from the files 602-604 via the touch panel 216. The application 401 receives the designation of the file to be printed from the user via the touch panel 216 and the file selection screen 601. In the following, the file to be printed is the file to be printed. Then, when the application 401 receives the designation of the print target file, it displays the print preview screen 701 on the display 215. The print preview is information used to confirm the content of the print target file. The print preview screen is a screen used to confirm the contents of the file to be printed. When the screen reader function is valid, the application 401 cooperates with the screen reader function provided by the OS 410 to sequentially output the file names of the files 602 to 604 by voice through the speaker of the microphone / speaker 214. Here, the screen reader function is a function of assisting a user's operation by outputting information by voice.

図７は、ファイル選択画面６０１を介して印刷対象ファイルが指定された際にディスプレイ２１５に表示される印刷プレビュー画面７０１の一例を示す図である。
印刷プレビュー画面７０１には、印刷対象ファイルの内容が表示される。ボタン７０２は、印刷の指示に用いられるボタンである。また、ボタン７０３は、前のページに戻ることの指示に用いられるボタンである。アプリケーション４０１は、ボタン７０３の選択を検知すると、ディスプレイ２１５にファイル選択画面６０１を表示する。視覚が健常なユーザは、印刷プレビュー画面７０１を視認することで、印刷対象ファイルの内容を確認することができる。ユーザは、確認して問題なければ、ボタン７０２を選択して、印刷対象ファイルの印刷をデータ処理装置１０１に指示する。アプリケーション４０１は、ボタン７０２の選択を検知すると、印刷対象ファイルの印刷を行うよう制御する。
しかし、視覚障がい者は、印刷プレビュー画面を目視確認できない。そのため、印刷対象ファイルの内容を確認できない。
本実施形態では、データ処理装置１０１が、スクリーンリーダ機能が有効な場合に、印刷対象ファイルを解析して、印刷対象ファイルの印刷プレビューを取得し、取得した印刷プレビューを音声出力する処理について説明する。印刷対象ファイルの解析とは、印刷対象ファイルを調べることで印刷対象ファイルに関連する情報を取得することであり、例えば、印刷対象ファイルに含まれる情報を取得したり、印刷対象ファイルが示す画像の内容を示す情報を取得したりすることである。また、印刷対象ファイルの解析により取得される情報を、印刷対象ファイルの解析情報とする。 FIG. 7 is a diagram showing an example of the print preview screen 701 displayed on the display 215 when a file to be printed is designated via the file selection screen 601.
The print preview screen 701 displays the contents of the print target file. The button 702 is a button used for instructing printing. A button 703 is a button used for instructing to return to the previous page. When detecting the selection of the button 703, the application 401 displays the file selection screen 601 on the display 215. A user with normal vision can confirm the contents of the print target file by visually recognizing the print preview screen 701. If the user confirms and there is no problem, the user selects the button 702 to instruct the data processing apparatus 101 to print the file to be printed. When detecting the selection of the button 702, the application 401 controls to print the print target file.
However, visually impaired persons cannot visually confirm the print preview screen. Therefore, the contents of the print target file cannot be confirmed.
In the present embodiment, a process will be described in which the data processing apparatus 101 analyzes a print target file, acquires a print preview of the print target file, and outputs the acquired print preview by voice when the screen reader function is enabled. . The analysis of the file to be printed is to obtain the information related to the file to be printed by checking the file to be printed. For example, the information included in the file to be printed or the image of the file to be printed is acquired. It is to acquire information indicating the contents. Further, the information acquired by the analysis of the print target file is set as the analysis information of the print target file.

図８は、本実施形態のデータ処理装置１０１の処理の一例を示すフローチャートの一例である。
Ｓ８０１において、アプリケーション４０１は、ユーザによるボタン５０２の選択を検知し、ディスプレイ２１５にファイル選択画面６０１を表示する。
Ｓ８０２において、アプリケーション４０１は、ファイル選択画面６０１を介して、ユーザから印刷対象ファイルの指定を受け付ける。
Ｓ８０３において、アプリケーション４０１は、ディスプレイ２１５に、Ｓ８０２で受け付けた指定が示す印刷対象ファイルの印刷プレビュー画面を表示する。
Ｓ８０４において、アプリケーション４０１は、スクリーンリーダ機能が有効か否かを判定する。本実施形態では、アプリケーション４０１は、スクリーンリーダ機能が有効が否かをＯＳ４１０に問い合わせる。ＯＳ４１０は、記憶装置２０９に記憶されたスクリーンリーダ機能が有効か否かを示す情報に基づいて、スクリーンリーダ機能が有効か否かを判定する。アプリケーション４０１は、スクリーンリーダ機能が有効であると判定した場合、処理をＳ８０５に進め、スクリーンリーダ機能が無効であると判定した場合、図８の処理を終了する。 FIG. 8 is an example of a flowchart showing an example of processing of the data processing apparatus 101 of this embodiment.
In step S801, the application 401 detects the selection of the button 502 by the user and displays the file selection screen 601 on the display 215.
In step S 802, the application 401 receives a print target file designation from the user via the file selection screen 601.
In step S 803, the application 401 displays on the display 215 a print preview screen of the print target file indicated by the designation accepted in step S 802.
In step S804, the application 401 determines whether the screen reader function is valid. In this embodiment, the application 401 inquires of the OS 410 whether the screen reader function is valid. The OS 410 determines whether the screen reader function is valid based on the information stored in the storage device 209 that indicates whether the screen reader function is valid. If the application 401 determines that the screen reader function is valid, the process proceeds to step S 805, and if the application 401 determines that the screen reader function is invalid, the process of FIG. 8 ends.

Ｓ８０５において、アプリケーション４０１は、解析部４０５を介して、Ｓ８０２で指定された印刷対象ファイルにアクセシビリティ情報が含まれるか否かを判定する。アクセシビリティ情報とは、ファイルへのアクセシビリティ向上のためにファイル内の予め定められた領域に組み込まれた情報である。
印刷対象ファイルがＰＤＦファイルである場合、アプリケーション４０１は、印刷対象ファイル内にドキュメント構造タグ（ＳｔｒｕｃｔＴｒｅｅＲｏｏｔ）が含まれているか否かを判定する。ドキュメント構造タグは、ＰＤＦファイルに組み込まれた情報であり、文書の内容に関する構造を示す情報である。ドキュメント構造タグの情報には、例えば、章と節による文書の編成や、表、脚注を識別する情報等がある。アプリケーション４０１は、印刷対象ファイル内のドキュメント構造タグを解析することで、印刷対象ファイルのアクセシビリティ情報を取得することができる。 In step S805, the application 401 determines whether the print target file designated in step S802 includes accessibility information via the analysis unit 405. The accessibility information is information incorporated in a predetermined area in the file to improve accessibility to the file.
When the print target file is a PDF file, the application 401 determines whether or not the document structure tag (StructureTreeRoot) is included in the print target file. The document structure tag is information incorporated in the PDF file and is information indicating the structure related to the content of the document. The information of the document structure tag includes, for example, the organization of documents by chapters and sections, information for identifying tables and footnotes, and the like. The application 401 can acquire the accessibility information of the print target file by analyzing the document structure tag in the print target file.

ＰＤＦファイルの構造は、オブジェクトの階層構造とみなすことができる。図９を用いて、ＰＤＦファイルの構造の一例について説明する。ツリー９０１は、あるＰＤＦファイルの構造を表現したツリーである。ドキュメント構造タグは、［文書ルートカタログ（Ｃａｔａｌｏｇ）］―［ＳｔｒｕｃｔＴｒｅｅＲｏｏｔ］のオブジェクト（辞書）に相当する。
図９の例では、ドキュメント構造タグは、タグ９０３にあたる。
一方、印刷対象ファイルが画像ファイルである場合、アプリケーション４０１は、その画像ファイルのフォーマットの仕様にのっとって、印刷対象ファイルにアクセシビリティ情報が含まれるか否かを判定する。しかし、画像ファイルには、アクセシビリティ情報が含まれない場合がある。その場合、アプリケーション４０１は、印刷対象ファイルにアクセシビリティ情報が含まれないと判定する。 The structure of the PDF file can be regarded as a hierarchical structure of objects. An example of the structure of the PDF file will be described with reference to FIG. The tree 901 is a tree expressing the structure of a certain PDF file. The document structure tag corresponds to an object (dictionary) of [document root catalog (Catalog)]-[StructureTreeRoot].
In the example of FIG. 9, the document structure tag corresponds to the tag 903.
On the other hand, when the print target file is an image file, the application 401 determines whether the print target file includes accessibility information according to the format specifications of the image file. However, the image file may not include accessibility information. In that case, the application 401 determines that the file to be printed does not include accessibility information.

Ｓ８０６において、アプリケーション４０１は、解析部４０５を介して、印刷対象ファイルからアクセシビリティ情報を取得し、取得したアクセシビリティ情報の内容を示す文字列を取得する。アプリケーション４０１は、アクセシビリティ情報が文字列の情報である場合、アクセシビリティ情報が示す文字列を取得する。また、アプリケーション４０１は、アクセシビリティ情報が文字列と異なる情報（例えば、画像に対応する位置の情報、画像が撮影された時刻の情報等）である場合、以下のようにする。即ち、アプリケーション４０１は、アクセシビリティ情報が示す内容（例えば、画像に対応する位置の情報、画像が撮影された時刻の情報等）を示す文字列を取得する。そして、アプリケーション４０１は、ＯＳ４１０が提供するスクリーンリーダ機能と協働してマイク・スピーカ２１４のスピーカを介して、取得した文字列を、音声出力する。
印刷対象ファイルがＰＤＦの場合、アプリケーション４０１は、ＰＤＦファイルのフォーマットのドキュメント構造タグ内の子要素である構造要素（ＳｔｒｕｃｔＥｌｅｍ）からアクセシビリティ情報を取得する。図９の例では、アプリケーション４０１は、構造要素９０４、９０５の情報を、アクセシビリティ情報として取得する。 In step S806, the application 401 acquires the accessibility information from the print target file via the analysis unit 405, and acquires the character string indicating the content of the acquired accessibility information. When the accessibility information is character string information, the application 401 acquires the character string indicated by the accessibility information. If the accessibility information is information different from the character string (for example, information on the position corresponding to the image, information on the time when the image was captured, etc.), the application 401 performs the following. That is, the application 401 acquires a character string indicating the content indicated by the accessibility information (for example, information on the position corresponding to the image, information on the time when the image was captured, etc.). Then, the application 401 cooperates with the screen reader function provided by the OS 410 to output the acquired character string by voice through the speaker of the microphone / speaker 214.
When the file to be printed is a PDF, the application 401 acquires accessibility information from a structure element (StructElem) that is a child element within the document structure tag of the PDF file format. In the example of FIG. 9, the application 401 acquires the information of the structural elements 904 and 905 as accessibility information.

Ｓ８０７において、アプリケーション４０１は、解析部４０５を介して、印刷対象ファイル内のコンテンツデータに文字列が含まれるか否かを判定する。ファイルのコンテンツデータとは、そのファイルが保存する対象の中身を示すデータであり、画像ファイルの場合は画像のデータであり、テキストファイルの場合はテキストデータである。
印刷対象ファイルがＭｉｃｒｏｓｏｆｔＷｏｒｄ（登録商標）等のアプリケーションから作成されたＰＤＦであるとすると、印刷対象ファイルは、コンテンツデータ内に文字コードを持つ文字列を含む場合がある。その場合、アプリケーション４０１は、印刷対象ファイルのコンテンツデータ内の文字列を検出して、印刷対象ファイル内のコンテンツデータに文字列が含まれると判定する。
一方、印刷対象ファイルが複写機で原稿をスキャンすることで作成されたＰＤＦファイルであるとすると、印刷対象ファイルは、コンテンツとしてスキャン画像を含み、文字列を含まない場合がある。その場合、アプリケーション４０１は、印刷対象ファイルのコンテンツデータ内から文字列を検出できないため、印刷対象ファイル内のコンテンツデータに文字列が含まれないと判定する。
アプリケーション４０１は、印刷対象ファイル内のコンテンツデータに文字列が含まれると判定した場合、処理をＳ８０８に進め、含まれないと判定した場合、処理をＳ８０９に進める。
Ｓ８０８において、アプリケーション４０１は、印刷対象ファイル内のコンテンツデータから文字列を取得する。そして、アプリケーション４０１は、ＯＳ４１０が提供するスクリーンリーダ機能と協働してマイク・スピーカ２１４のスピーカを介して、取得した文字列を、音声出力する。 In step S 807, the application 401 determines whether the content data in the print target file includes a character string via the analysis unit 405. The content data of a file is data indicating the contents to be stored in the file, which is image data in the case of an image file and text data in the case of a text file.
If the print target file is a PDF created from an application such as Microsoft Word (registered trademark), the print target file may include a character string having a character code in the content data. In that case, the application 401 detects the character string in the content data of the print target file and determines that the content data in the print target file includes the character string.
On the other hand, if the print target file is a PDF file created by scanning an original with a copying machine, the print target file may include a scan image as content and not a character string. In that case, the application 401 cannot detect the character string in the content data of the print target file, and therefore determines that the content data in the print target file does not include the character string.
If the application 401 determines that the content data in the print target file includes a character string, the process proceeds to step S808. If it is determined that the character string is not included in the content data, the process proceeds to step S809.
In step S808, the application 401 acquires a character string from the content data in the print target file. Then, the application 401 cooperates with the screen reader function provided by the OS 410 to output the acquired character string by voice through the speaker of the microphone / speaker 214.

Ｓ８０９において、アプリケーション４０１は、解析部４０５を介して、印刷対象ファイルの印刷画像に対して、以下で説明する領域分割処理を行う。印刷画像とは、印刷対象ファイルが実際にどのように印刷されるかを示す画像である。本実施形態では、印刷対象ファイルの印刷画像は、Ｓ８０３で表示された印刷プレビュー画面の画像である
領域分割処理について説明する。
領域分割処理とは、画像内から文字列が存在する領域を分割する処理である。本実施形態では、文字列には、１つの文字も含むこととする。本実施形態では、アプリケーション４０１は、以下の（１）〜（５）の処理を行うことで、印刷対象ファイルに対して領域分割処理を実行する。
（１）二値化処理
アプリケーション４０１は、解析部４０５を介して、印刷対象ファイルの印刷画像に対して２値化を行うことにより、２値画像を取得する。この２値化により、印刷画像における予め定められた閾値より濃い色の画素は、黒画素となる。また、その閾値より薄い色の画素は、白画素となる。なお、本実施形態では、印刷画像が、１００ＤＰＩであるとする。しかし、印刷画像は、この解像度に限定されず、２００ＤＰＩ等の他の解像度であってもよい。 In step S 809, the application 401 performs the area division processing described below on the print image of the print target file via the analysis unit 405. The print image is an image showing how the print target file is actually printed. In this embodiment, the print image of the print target file is the image of the print preview screen displayed in S803. The area division processing will be described.
The area dividing process is a process of dividing an area in the image where a character string exists. In this embodiment, the character string includes one character. In the present embodiment, the application 401 performs the area dividing process on the print target file by performing the following processes (1) to (5).
(1) Binarization Processing The application 401 acquires a binary image by binarizing the print image of the print target file via the analysis unit 405. By this binarization, the pixels of the color darker than the predetermined threshold in the print image become black pixels. In addition, pixels of a color lighter than the threshold value become white pixels. In the present embodiment, it is assumed that the print image has 100 DPI. However, the print image is not limited to this resolution and may have another resolution such as 200 DPI.

（２）黒画素塊検出処理
アプリケーション４０１は、解析部４０５を介して、（１）の処理で取得した２値画像に対して、８連結で繋がる黒画素の輪郭を追跡することにより、８方向の何れかの方向で連続して存在する黒画素の塊（黒画素塊）を検出する。ここで、８連結とは、ある画素を基準として、左上、左、左下、下、右下、右、右上、上の８つの方向のうちの何れかの方向で、その画素と同じ色（本実施形態では黒）の画素が連続しているという意味である。また、４連結とは、ある画素を基準として、左、下、右、上の４つの方向の何れかの方向で、その画素と同じ色の画素が連続しているという意味である。
アプリケーション４０１は、８方向に存在する８つの隣接画素の何れもが黒画素ではない単独の黒画素を検出しないこととなる。一方、８方向に存在する８つの隣接画素の何れか１つにでも黒画素が存在する黒画素は、その隣接する黒画素と共に、黒画素塊として検出されることになる。図１０の黒画素塊１４０１は、解析部４０５を介して検出された黒画素塊の一例である。また、アプリケーション４０１は、解析部４０５を介して検出した黒画素塊の外接矩形の位置情報（四頂点のＸ、Ｙ座標情報のこと）を取得する。なお、印刷画像内では、Ｘ軸は右方向に伸び、Ｙ軸は下方向に伸びているものとする。外接矩形の幅は、Ｘ軸方向の長さ、外接矩形の高さは、Ｙ軸方向の長さを示す。図１０の矩形１４０２は、黒画素塊１４０１の外接矩形である。なお、本実施形態では、矩形とは、四辺の全てがＸ座標軸、Ｙ座標軸の何れかと平行な矩形であり、斜め向きの矩形ではないとする。 (2) Black pixel block detection processing The application 401 traces, through the analysis unit 405, the contours of black pixels that are connected in eight connections in the binary image acquired in the processing of (1), and thus in eight directions. A black pixel block (black pixel block) that continuously exists in either direction is detected. Here, 8-connection means any one of the eight directions of upper left, left, lower left, lower, lower right, right, upper right, and upper with respect to a certain pixel, and has the same color (main This means that black pixels are continuous in the embodiment. Further, 4-connected means that a pixel having the same color as the pixel is continuous in any of the four directions of left, bottom, right, and top with respect to a certain pixel.
The application 401 will not detect a single black pixel in which none of the eight adjacent pixels existing in eight directions is a black pixel. On the other hand, a black pixel in which a black pixel exists in any one of the eight adjacent pixels existing in eight directions is detected as a black pixel block together with the adjacent black pixel. The black pixel block 1401 in FIG. 10 is an example of the black pixel block detected via the analysis unit 405. Further, the application 401 acquires the position information of the circumscribing rectangle of the black pixel block detected through the analysis unit 405 (X and Y coordinate information of four vertices). In the print image, the X axis extends rightward and the Y axis extends downward. The width of the circumscribing rectangle indicates the length in the X-axis direction, and the height of the circumscribing rectangle indicates the length in the Y-axis direction. A rectangle 1402 in FIG. 10 is a circumscribed rectangle of the black pixel block 1401. In the present embodiment, the rectangle is a rectangle in which all four sides are parallel to either the X coordinate axis or the Y coordinate axis, and is not an obliquely oriented rectangle.

（３）表領域検出処理
アプリケーション４０１は、解析部４０５を介して、（２）の処理で検出した黒画素塊それぞれについて、以下の条件１〜３の全てに該当するか否かを判定する。そして、アプリケーション４０１は、条件１〜３の全てに該当する黒画素塊を、表の枠線を示す黒画素塊であると判断する。
条件１：黒画素塊の外接矩形の幅、高さが閾値（例えば、１００画素、０．２５ｃｍ等）以上である。
条件２：外接矩形の内部における黒画素塊が占める割合が予め定められた閾値（例えば、２０パーセント）以下である。
条件３：黒画素塊の最大幅と外接矩形の幅との差が予め定められた閾値（例えば、１０画素等）以下であり、かつ、黒画素塊の最大高さと外接矩形の高さとの差が予め定められた閾値（例えば、１０画素等）以下である。
アプリケーション４０１は、表の枠線を構成すると判断した黒画素塊の外接矩形の位置情報を、保存制御部４０６を介して記憶装置２０９に記憶する。ここで記憶された位置情報を持つ外接矩形の領域を、以下では、表領域とする。なお、図１０の例では、黒画素塊１４０１は、表の枠線を構成する黒画素塊と判定されたとする。そのため、矩形１４０２の領域は、表領域となる。 (3) Surface Area Detection Processing The application 401 determines, via the analysis unit 405, whether or not all of the following conditions 1 to 3 are satisfied for each black pixel block detected in the processing of (2). Then, the application 401 determines that the black pixel block that satisfies all of the conditions 1 to 3 is the black pixel block that indicates the frame line of the table.
Condition 1: The width and height of the circumscribed rectangle of the black pixel block are equal to or more than a threshold value (for example, 100 pixels, 0.25 cm, etc.).
Condition 2: The proportion of black pixel blocks in the circumscribed rectangle is equal to or less than a predetermined threshold value (for example, 20%).
Condition 3: The difference between the maximum width of the black pixel block and the width of the circumscribing rectangle is less than or equal to a predetermined threshold (for example, 10 pixels), and the difference between the maximum height of the black pixel block and the height of the circumscribing rectangle. Is less than or equal to a predetermined threshold (for example, 10 pixels).
The application 401 stores the position information of the circumscribed rectangle of the black pixel block that is determined to form the frame of the table in the storage device 209 via the storage control unit 406. The circumscribed rectangle area having the position information stored here is hereinafter referred to as a table area. In the example of FIG. 10, it is assumed that the black pixel block 1401 is determined to be the black pixel block forming the frame line of the table. Therefore, the area of the rectangle 1402 becomes a table area.

（４）認識セルの特定処理
アプリケーション４０１は、解析部４０５を介して、表領域から、認識対象の領域である認識セルを特定する。アプリケーション４０１は、認識セルを特定するために、表領域内部の白画素の輪郭を追跡することにより、白画素塊を検出する。アプリケーション４０１は、黒画素塊を求めた処理と同様の処理で白画素塊を検出する。アプリケーション４０１は、検出した白画素塊が予め定められた条件に合致する場合、その白画素塊の外接矩形の領域を認識セルとして特定する。この予め定められた条件は、以下の条件ａ〜ｃである。
条件ａ：白画素塊の外接矩形の幅、高さが予め定められた閾値（例えば、２０画素等）以上である。
条件ｂ：外接矩形の内部における黒画素塊の占める割合が予め定められた閾値（例えば、２０％等）以下である。
条件ｃ：白画素塊の最大幅と外接矩形の幅との差が予め定められた閾値（例えば、５画素等）以下であり、かつ、白画素塊の最大高さと外接矩形の高さとの差が予め定められた閾値（例えば、５画素等）以下である。
図１３の例では、領域１４０３、１４０４が、解析部４０５を介して、認識セルとして特定される。アプリケーション４０１は、特定した認識セルの位置情報を、保存制御部４０６を介して、記憶装置２０９に記憶する。 (4) Recognition Cell Identification Processing The application 401 identifies a recognition cell, which is a recognition target area, from the table area via the analysis unit 405. The application 401 detects a white pixel block by tracing the outline of the white pixel inside the table area in order to identify the recognition cell. The application 401 detects a white pixel block by the same process as the process of obtaining the black pixel block. When the detected white pixel block matches the predetermined condition, the application 401 specifies the area of the circumscribed rectangle of the white pixel block as a recognition cell. The predetermined conditions are the following conditions ac.
Condition a: The width and height of the circumscribed rectangle of the white pixel block are equal to or larger than a predetermined threshold value (for example, 20 pixels).
Condition b: The ratio of black pixel blocks in the circumscribed rectangle is equal to or less than a predetermined threshold value (for example, 20%).
Condition c: the difference between the maximum width of the white pixel block and the width of the circumscribing rectangle is less than or equal to a predetermined threshold (for example, 5 pixels), and the difference between the maximum height of the white pixel block and the height of the circumscribing rectangle. Is less than or equal to a predetermined threshold value (for example, 5 pixels).
In the example of FIG. 13, the areas 1403 and 1404 are specified as recognition cells via the analysis unit 405. The application 401 stores the position information of the identified recognition cell in the storage device 209 via the storage control unit 406.

（５）文字領域の特定処理
アプリケーション４０１は、解析部４０５を介して、（４）の処理で特定した表領域内の各認識セルの内部に、その各認識セルに内接する白画素塊によって囲まれた黒画素塊があるか否かを判定する。そして、アプリケーション４０１は、黒画素塊があると判定した場合、あると判定された全ての黒画素塊に外接矩形を設定する。
更に、解析部４０５は、一つの認識セルの中に複数の外接矩形を設定した場合、外接矩形同士の距離が予め定められた閾値（例えば、２０画素、０．５ｃｍ等）以下であるか否かを判定する。より具体的には、アプリケーション４０１は、１つの認識セル内に含まれる外接矩形を一つ一つ選択し、選択された外接矩形からの距離が閾値以内である外接矩形を検出する。
そして、アプリケーション４０１は、閾値以下の距離だけ離れた外接矩形を検出した場合、閾値以下の距離だけ離れた外接矩形同士を統合して、新たな外接矩形を生成する。即ち、アプリケーション４０１は、両方の外接矩形に外接する新たな外接矩形を生成し、生成した外接矩形の情報を記憶装置２０９に記憶し、選択された外接矩形と検出された外接矩形との情報を記憶装置２０９から削除する。 (5) Character Area Identification Processing The application 401, via the analysis unit 405, surrounds each recognition cell in the table area identified in the processing of (4) with a white pixel block inscribed in each recognition cell. It is determined whether there is a black pixel block that has been removed. Then, when the application 401 determines that there is a black pixel block, it sets the circumscribing rectangle for all the black pixel blocks that are determined to be present.
Furthermore, when a plurality of circumscribing rectangles are set in one recognition cell, the analysis unit 405 determines whether the distance between the circumscribing rectangles is equal to or less than a predetermined threshold value (for example, 20 pixels, 0.5 cm, etc.). To determine. More specifically, the application 401 selects each circumscribing rectangle included in one recognition cell and detects a circumscribing rectangle whose distance from the selected circumscribing rectangle is within a threshold value.
When the application 401 detects circumscribing rectangles separated by a distance equal to or smaller than the threshold, the application 401 integrates the circumscribed rectangles separated by the distance equal to or smaller than the threshold to generate a new circumscribed rectangle. That is, the application 401 generates a new circumscribing rectangle that circumscribes both circumscribing rectangles, stores the information of the generated circumscribing rectangle in the storage device 209, and stores the information of the selected circumscribing rectangle and the detected circumscribing rectangle. It is deleted from the storage device 209.

その後、アプリケーション４０１は、その認識セル内の外接矩形をまた初めから一つ一つ選択し、互いの間の距離が閾値以下である外接矩形同士を統合していく。アプリケーション４０１は、以上の処理を繰り返す。即ち、互いの間の距離が閾値以下である外接矩形が無くなるまで、外接矩形同士の統合が繰り返される。
このように、本実施形態では、アプリケーション４０１は、一つの認識セルの内部に存在する外接矩形同士の統合を行うが、認識セルをまたぐ外接矩形同士の統合を行わない。
以上の処理が完了した際に、記憶装置２０９に記憶されている外接矩形の情報は、文字列が存在する領域である文字領域を示す情報となる。解析部４０５は、認識セルの内部に存在する文字領域の位置情報を、対応する認識セルと関連付けて記憶装置２０９に記憶する。以上の処理により、アプリケーション４０１は、印刷画像内の表領域内に存在する文字領域を特定する。
図１３の例では、領域１４０５、１４０６それぞれは、文字領域として特定された領域である。また、領域１４０５、１４０６それぞれの位置情報は、認識セルである領域１４０３の情報と関連付けられて、記憶装置２０９に記憶される。
また、アプリケーション４０１は、公知のＯＣＲ（ＯｐｔｉｃａｌＣｈａｒａｃｔｅｒＲｅｃｏｇｎｉｔｉｏｎ）の技術を用いて、印刷画像内の表領域以外の部分に存在する文字列が存在する文字領域を特定する。そして、アプリケーション４０１は、特定した文字領域の位置情報を、記憶装置２０９に記憶する。
アプリケーション４０１は、以上の（１）〜（５）の処理を実行することで、印刷画像から、文字列が存在する領域である文字領域を分割する。 After that, the application 401 selects the circumscribing rectangles within the recognition cell again one by one from the beginning, and integrates the circumscribing rectangles whose distances are equal to or less than the threshold value. The application 401 repeats the above processing. That is, the integration of the circumscribed rectangles is repeated until there is no circumscribed rectangle whose distance between them is less than or equal to the threshold value.
As described above, in this embodiment, the application 401 integrates circumscribing rectangles existing inside one recognition cell, but does not integrate circumscribing rectangles that cross recognition cells.
When the above process is completed, the circumscribed rectangle information stored in the storage device 209 becomes information indicating a character area in which a character string exists. The analysis unit 405 stores the position information of the character area existing inside the recognition cell in the storage device 209 in association with the corresponding recognition cell. Through the above processing, the application 401 identifies the character area existing in the table area in the print image.
In the example of FIG. 13, areas 1405 and 1406 are areas specified as character areas. Further, the position information of each of the areas 1405 and 1406 is stored in the storage device 209 in association with the information of the area 1403 which is the recognition cell.
Further, the application 401 uses a known OCR (Optical Character Recognition) technique to specify a character area in which a character string exists in a portion other than the table area in the print image. Then, the application 401 stores the position information of the specified character area in the storage device 209.
The application 401 divides a character area, which is an area in which a character string exists, from the print image by executing the above processes (1) to (5).

Ｓ８１０において、アプリケーション４０１は、解析部４０５を介して、印刷画像内に文字列が含まれるか否かを判定する。より具体的には、アプリケーション４０１は、Ｓ８０９で特定した文字領域が存在するか否かに基づいて、印刷画像内に文字列が含まれるか否かを判定する。アプリケーション４０１は、Ｓ８０９で特定した文字領域が存在する場合、印刷画像内に文字列が含まれると判定して、処理をＳ８１１に進め、Ｓ８０９で特定した文字領域が存在しない場合、印刷画像内に文字列が含まれないと判定して、処理をＳ８１２に進める。
Ｓ８１１において、アプリケーション４０１は、解析部４０５を介して、印刷画像におけるＳ８０９で特定した文字領域それぞれに対して文字認識を行い、文字列を取得する。そして、アプリケーション４０１は、マイク・スピーカ２１４のスピーカを介して、取得した文字列を音声出力する。
アプリケーション４０１は、文字認識により取得した文字列を、対応する文字領域と関連付けて記憶装置２０９に記憶する。それにより、文字領域が表領域内に存在する場合、文字列は、文字領域と予め関連付けられている認識セルとも関連付けられることになる。また、アプリケーション４０１は、文字認識に失敗した場合、文字領域に関連付けられる文字列を取得できないこととなる。
また、アプリケーション４０１は、文字認識を行う際に、更に、認識率を取得してもよい。認識率は、文字を正しく認識できたか否かを何かしらの数値で示した値である。その場合、アプリケーション４０１は、認識した文字列と関連付けて認識率についても、記憶装置に記憶することとしてもよい。 In step S810, the application 401 determines whether the print image contains a character string via the analysis unit 405. More specifically, the application 401 determines whether or not a character string is included in the print image based on whether or not the character area specified in S809 exists. If the character area specified in S809 exists, the application 401 determines that the print image includes a character string, advances the process to S811, and if the character area specified in S809 does not exist, the application 401 determines in the print image. It is determined that the character string is not included, and the process proceeds to S812.
In step S811, the application 401 performs character recognition on each of the character areas specified in step S809 in the print image through the analysis unit 405, and acquires a character string. Then, the application 401 outputs the acquired character string by voice through the speaker of the microphone / speaker 214.
The application 401 stores the character string acquired by the character recognition in the storage device 209 in association with the corresponding character area. Thereby, when the character area exists in the table area, the character string is also associated with the recognition cell previously associated with the character area. Further, if the character recognition fails, the application 401 cannot acquire the character string associated with the character area.
In addition, the application 401 may further acquire the recognition rate when performing character recognition. The recognition rate is a numerical value indicating whether or not a character was correctly recognized. In this case, the application 401 may store the recognition rate in the storage device in association with the recognized character string.

Ｓ８１２において、アプリケーション４０１は、解析部４０５を介して、外部の画像解析サービスを利用して、印刷画像がどのような画像であるかを認識する認識処理の結果の情報を取得する。本実施形態では、アプリケーション４０１は、ＧｏｏｇｌｅＣｌｏｕｄＶｉｓｉｏｎＡＰＩ（登録商標）等のクラウドコンピューティングを用いた画像解析のサービス（クラウドサービス）に印刷画像の認識処理を依頼することで、応答として印刷画像の認識処理の結果（認識結果）の情報を取得する。認識処理の結果の情報は、例えば、何等かのオブジェクト（例えば、人物、動物、風景等）の画像であることを示す情報等である。そして、アプリケーション４０１は、マイク・スピーカ２１４のスピーカを介して、取得した認識処理の結果の情報を音声出力する。
本実施形態では、アプリケーション４０１は、Ｓ８１２で、クラウドコンピューティングを用いた画像解析サービスを利用することとしたが、利用しないこととしてもよい。その場合、アプリケーション４０１は、例えば、画像内の被写体を検出するためのオフラインの推論エンジンを用いて、印刷画像に対する認識処理を実行し、認識処理の結果の情報を取得することとしてもよい。その場合、アプリケーション４０１は、例えば、ＯＳ４１０が提供するＡＰＩ等に印刷画像の認識処理を依頼し、応答として認識処理の結果の情報を取得する。続いて、ＯＳ４１０が提供するスクリーンリーダ機能と協働して取得した情報を音声出力する。
Ｓ８０６、Ｓ８０８、Ｓ８１１、Ｓ８１２それぞれで音声出力される情報（アクセシビリティ情報を示す文字列、印刷対象ファイルに含まれる文字列、印刷画像から認識された文字列、印刷画像に対する認識処理の結果の情報）それぞれは、解析情報の一例である。 In step S 812, the application 401 uses the external image analysis service via the analysis unit 405 to acquire information on the result of the recognition process for recognizing what the print image is. In the present embodiment, the application 401 requests a print image recognition process from an image analysis service (cloud service) that uses cloud computing, such as Google Cloud Vision API (registered trademark), and prints the print image as a response. Acquires information on the result of recognition processing (recognition result). The information on the result of the recognition processing is, for example, information indicating that it is an image of some object (for example, a person, an animal, a landscape, etc.). Then, the application 401 outputs the obtained information of the recognition processing result by voice through the speaker of the microphone / speaker 214.
In the present embodiment, the application 401 uses the image analysis service using cloud computing in S812, but may not use it. In that case, the application 401 may execute the recognition process for the print image by using, for example, an offline inference engine for detecting a subject in the image, and acquire information on the result of the recognition process. In that case, the application 401 requests, for example, an API provided by the OS 410 to perform recognition processing of the print image, and acquires the information of the recognition processing result as a response. Subsequently, the information acquired in cooperation with the screen reader function provided by the OS 410 is output as voice.
Information output by voice in each of S806, S808, S811, and S812 (character string indicating accessibility information, character string included in print target file, character string recognized from print image, information on result of recognition process for print image) Each is an example of analysis information.

なお、Ｓ８０６、Ｓ８０８、Ｓ８１１、Ｓ８１２の何れかの処理において、情報の音声出力が行われている最中に、ユーザが予め定められた操作を行った場合、アプリケーション４０１は、音声出力を途中で中止することとする。
図１１を用いて、データ処理装置１０１が音声出力を中止する処理について説明する。
図１１（ａ）、（ｂ）の処理の開始の際には、Ｓ８０６、Ｓ８０８、Ｓ８１１、Ｓ８１２の何れかの処理で情報が音声出力されているとする。本実施形態では、アプリケーション４０１は、マルチスレッド処理により図８のフローチャートの処理と、図１１（ａ）、（ｂ）の何れかのフローチャートの処理と、を並列して実行することとする。
図１１（ａ）のフローチャートの処理について説明する。
Ｓ１００１において、アプリケーション４０１は、ボタン７０２が選択されたか否かを判定する。アプリケーション４０１は、ボタン７０２が選択されたと判定した場合、処理をＳ１００２に進め、ボタン７０２が選択されていないと判定した場合、図１１の処理を終了する。
Ｓ１００２において、アプリケーション４０１は、Ｓ８０６、Ｓ８０８、Ｓ８１１、Ｓ８１２の何れかの処理で実行されている音声出力処理を中止するよう制御する。
Ｓ１００３において、アプリケーション４０１は、印刷対象ファイルの印刷処理を実行するよう制御する。 Note that in any of the processes of S806, S808, S811, and S812, if the user performs a predetermined operation during the audio output of information, the application 401 outputs the audio midway. It will be canceled.
A process in which the data processing apparatus 101 stops the audio output will be described with reference to FIG. 11.
At the time of starting the processing of FIGS. 11A and 11B, it is assumed that information is output as voice by any of the processing of S806, S808, S811, and S812. In this embodiment, the application 401 executes the process of the flowchart of FIG. 8 and the process of the flowchart of FIG. 11A or 11B in parallel by the multithread process.
The process of the flowchart of FIG. 11A will be described.
In step S1001, the application 401 determines whether the button 702 has been selected. If the application 401 determines that the button 702 has been selected, the process advances to step S1002, and if it determines that the button 702 has not been selected, the process of FIG. 11 ends.
In step S1002, the application 401 controls so as to stop the audio output process executed in any of the processes of S806, S808, S811, and S812.
In step S1003, the application 401 controls to execute print processing of the print target file.

図１１（ｂ）のフローチャートの処理について説明する。
Ｓ１００４において、アプリケーション４０１は、ボタン７０３が選択されたか否かを判定する。アプリケーション４０１は、ボタン７０３が選択されたと判定した場合、処理をＳ１００５に進め、ボタン７０３が選択されていないと判定した場合、図１１（ｂ）の処理を終了する。
Ｓ１００５において、アプリケーション４０１は、Ｓ８０６、Ｓ８０８、Ｓ８１１、Ｓ８１２の何れかの処理で実行されている音声出力処理を中止するよう制御する。
Ｓ１００６において、アプリケーション４０１は、ディスプレイ２１５に、ファイル選択画面６０１を表示する。
以上の図１１の処理により、データ処理装置１０１は、音声出力処理をキャンセルして、別の処理に進むことができる。
なお、本実施形態では、アプリケーション４０１は、印刷プレビュー画面７０１に移動したタイミング以外に、タッチパネル２１６へのユーザによるタップ操作を検知した場合に、Ｓ８０３〜Ｓ８１２の処理を実行することとする。これにより、タッチパネル２１６へのタップ操作に応じて、再度、印刷対象ファイルの解析情報が音声出力される。そのため、ユーザが、解析情報を聞き返したい場合に聞き返すことができるようになる。 The process of the flowchart of FIG. 11B will be described.
In step S1004, the application 401 determines whether the button 703 has been selected. If the application 401 determines that the button 703 has been selected, the process advances to step S1005, and if it determines that the button 703 has not been selected, the process of FIG. 11B ends.
In step S1005, the application 401 controls to stop the audio output process executed in any of the processes of S806, S808, S811, and S812.
In step S1006, the application 401 displays the file selection screen 601 on the display 215.
Through the processing in FIG. 11 described above, the data processing apparatus 101 can cancel the audio output processing and proceed to another processing.
In addition, in the present embodiment, the application 401 executes the processes of S 803 to S 812 when a tap operation by the user on the touch panel 216 is detected in addition to the timing of moving to the print preview screen 701. As a result, the analysis information of the print target file is again voice output in response to the tap operation on the touch panel 216. Therefore, when the user wants to hear back the analysis information, it can be heard back.

以上、本実施形態の処理により、データ処理装置１０１は、印刷対象ファイルの解析情報を、音声出力することで、ユーザに提供できる。これにより、視覚障がい者のユーザが、印刷するファイルが正しく選択されているか否かをより容易に判断することができるようになる。困難な場合がある。即ち、データ処理装置１０１は、印刷の際の視覚障がい者のユーザにとっての利便性を向上できる。 As described above, according to the processing of the present embodiment, the data processing apparatus 101 can provide the user with the analysis information of the print target file by voice output. This allows the visually impaired user to more easily determine whether or not the file to be printed is correctly selected. It can be difficult. That is, the data processing apparatus 101 can improve convenience for the visually impaired user at the time of printing.

＜実施形態２＞
実施形態１では、データ処理装置１０１が、印刷対象ファイルを解析してから解析情報を取得して、取得した解析情報を音声出力する処理について説明した。しかし、印刷対象ファイルの解析情報に含まれる文字数によっては、解析情報の音声出力に時間がかかってしまう。例えば、図１２のような請求書の文書ファイルの印刷画像から取得された文字列を音声出力すると時間がかかり、ユーザの混乱を招く可能性がある。
そこで、本実施形態では、データ処理装置１０１が、印刷対象ファイルから、特定の解析情報を取得し、取得した特定の解析情報を音声出力する処理について説明する。
本実施形態では、データ処理装置１０１が、印刷対象ファイルから、特定の解析情報として、文書のタイトルを取得し、音声出力する。
本実施形態のデータ処理システムのシステム構成は、実施形態１と同様である。また、データ処理システムの各構成要素のハードウェア構成及び機能構成についても、実施形態１と同様である。 <Embodiment 2>
In the first embodiment, the processing in which the data processing apparatus 101 analyzes the print target file, acquires the analysis information, and outputs the acquired analysis information by voice has been described. However, depending on the number of characters included in the analysis information of the print target file, it takes time to output the analysis information by voice. For example, outputting a character string obtained from a print image of a document file of an invoice as shown in FIG. 12 takes time, which may cause confusion for the user.
Therefore, in the present embodiment, a process in which the data processing apparatus 101 acquires specific analysis information from a print target file and outputs the acquired specific analysis information by voice will be described.
In the present embodiment, the data processing apparatus 101 acquires the title of a document as specific analysis information from the file to be printed and outputs it as a voice.
The system configuration of the data processing system of this embodiment is the same as that of the first embodiment. Further, the hardware configuration and the functional configuration of each component of the data processing system are the same as in the first embodiment.

アプリケーション４０１は、解析部４０５を介して、印刷対象ファイルの印刷画像から文字属性情報１２０１を取得する。
図１２の文書１１０１から取得される文字属性情報１２０１について説明する。図１３は、文書１１０１から取得された文字属性情報１２０１の一例を示す図である。文字属性情報とは、文書内に含まれる文字列の属性を示す情報である。なお、図１３の例では、文字属性情報１２０１は、ＸＭＬ形式となっているが、別の形式でもよい。例えば、文字属性情報１２０１は、解析部４０５内部でのみ使用される情報であるため、バイナリ形式でもよい。
例えば、文書１１０１内の文字列１１０２から、文字属性情報１２０１における要素１２０２の情報が取得される。
要素１２０２には、以下のように、「文字列（ｓｔｒｉｎｇ）」、「文字サイズ（ｓｉｚｅ）」、「文字位置（ｐｏｓｉｔｉｏｎ）」、「タイプ（ｔｙｐｅ）」の項目が含まれる。図１３の例では、「文字列（ｓｔｒｉｎｇ）」は、Ｉｎｖｏｉｃｅである。また、「文字サイズ（ｓｉｚｅ）」は、２４である。また、「文字位置（ｐｏｓｉｔｉｏｎ）」は、対応する文字列に外接する文字枠１１０３の左上の座標［２０、１０］と右下の座標［１００、３４］とである。また、「タイプ（ｔｙｐｅ）」は、Ｔｅｘｔである。「タイプ（ｔｙｐｅ）」の項目は、例えば、表内のデータである場合、Ｔａｂｌｅとなる。タイプは必要に応じて種類が増減されることとしてもよい。 The application 401 acquires the character attribute information 1201 from the print image of the print target file via the analysis unit 405.
The character attribute information 1201 acquired from the document 1101 of FIG. 12 will be described. FIG. 13 is a diagram showing an example of the character attribute information 1201 acquired from the document 1101. The character attribute information is information indicating the attribute of the character string included in the document. Note that in the example of FIG. 13, the character attribute information 1201 is in the XML format, but another format may be used. For example, since the character attribute information 1201 is information used only inside the analysis unit 405, it may be in a binary format.
For example, the information of the element 1202 in the character attribute information 1201 is acquired from the character string 1102 in the document 1101.
The element 1202 includes items of "character string (string)", "character size (size)", "character position (position)", and "type (type)" as described below. In the example of FIG. 13, the “character string (string)” is Invoice. The “character size (size)” is 24. The “character position” is the upper left coordinate [20, 10] and the lower right coordinate [100, 34] of the character frame 1103 circumscribing the corresponding character string. Also, the “type” is Text. The item of “type” is Table, for example, when the data is in a table. The types may be increased or decreased as necessary.

以下では、文字属性情報１２０１を取得する処理について説明する。
（印刷対象ファイルが画像ファイル又は画像のみを含むＰＤＦファイルである場合）
アプリケーション４０１は、印刷対象ファイルに対して、図８のＳ８０９で説明した領域分割処理を実行することで、印刷対象ファイルの印刷画像内の文字領域を特定する。そして、アプリケーション４０１は、特定した文字領域それぞれについて、文字領域が表領域内に存在する場合、対応する文字列のタイプをＴａｂｌｅとして、文字領域が表領域内に存在しない場合、対応する文字列のタイプをＴｅｘｔとして、取得する。
そして、アプリケーション４０１は、印刷画像内の各文字領域に対して文字認識処理を実行することで、各文字領域に対応する文字列と文字サイズとを取得する。
アプリケーション４０１は、取得したタイプ、文字列、文字サイズ、及び文字領域の位置情報を用いて、印刷対象ファイルの文字属性情報１２０１を作成する。アプリケーション４０１は、作成した文字属性情報１２０１を、記憶装置２０９に記憶する。しかし、アプリケーション４０１は、作成した文字属性情報１２０１を、記憶装置２０９に記憶せずに、ＲＡＭ２０４に記憶してもよい。 The process of acquiring the character attribute information 1201 will be described below.
(When the file to be printed is an image file or a PDF file containing only images)
The application 401 identifies the character area in the print image of the print target file by performing the area division process described in S809 of FIG. 8 on the print target file. Then, for each of the specified character areas, the application 401 sets the type of the corresponding character string as Table when the character area exists in the table area, and when the character area does not exist in the table area, Get the type as Text.
Then, the application 401 acquires the character string and the character size corresponding to each character area by performing the character recognition process on each character area in the print image.
The application 401 creates the character attribute information 1201 of the print target file using the acquired type, character string, character size, and position information of the character area. The application 401 stores the created character attribute information 1201 in the storage device 209. However, the application 401 may store the created character attribute information 1201 in the RAM 204 instead of storing it in the storage device 209.

（印刷対象ファイルが文字コードを含むＰＤＦファイルである場合）
印刷対象ファイルが、文字コードを含むＰＤＦの場合、印刷対象ファイル内に文字列、文字サイズ、文字位置の情報が含まれている。そのため、アプリケーション４０１は、印刷対象ファイルから、それらの情報を取得する。
ＰＤＦフォーマットの仕様では、表というオブジェクトの定義は無い。そのため、ＰＤＦファイル内のある文字列が、表内の文字列か否かを、単純にＰＤＦファイル内の情報から判断できない。ただし、ドキュメント構造タグにより、表内の文字列が定義されていれば、アプリケーション４０１は、このドキュメント構造タグから、ＰＤＦファイル内の文字列が、表内の文字列か否かを判定できる。そして、アプリケーション４０１は、判定結果に基づいて、印刷画像内の文字列のタイプを取得する。
また、ドキュメント構造タグにより、表内の文字列が定義されていない場合、アプリケーション４０１は、印刷対象ファイルを一旦画像に変換して、変換後の画像に対して領域分割処理を行うことで、印刷画像内の各文字列が表内の文字列か否かを判定する。そして、アプリケーション４０１は、判定結果に基づいて、印刷画像内の文字列のタイプを取得する。 (When the file to be printed is a PDF file containing a character code)
When the print target file is a PDF including a character code, the print target file includes information about a character string, a character size, and a character position. Therefore, the application 401 acquires such information from the print target file.
The PDF format specification does not define a table object. Therefore, it cannot be simply determined from the information in the PDF file whether a certain character string in the PDF file is a character string in the table. However, if the character string in the table is defined by the document structure tag, the application 401 can determine whether the character string in the PDF file is the character string in the table from the document structure tag. Then, the application 401 acquires the type of the character string in the print image based on the determination result.
Further, when the character string in the table is not defined by the document structure tag, the application 401 converts the print target file into an image once, and performs area division processing on the converted image to print. It is determined whether each character string in the image is a character string in the table. Then, the application 401 acquires the type of the character string in the print image based on the determination result.

以上の処理により、アプリケーション４０１は、解析部４０５を介して、印刷対象ファイルの文字属性情報１２０１を作成する。
続いて、文字属性情報１２０１の情報を元に、文書のタイトルを示す文字列を取得し、取得した文字列を音声出力する処理について説明する。
処理の概要として、アプリケーション４０１が文字属性情報１２０１内の複数の＜ｄａｔａ＞要素の中から、以下の条件α、βを満たす文字列情報を探索する処理である。
条件α：印刷画像内で、相対的に上側、左側に位置する文字列である。
条件β：印刷画像内で、相対的にフォントサイズが大きい文字列である。
条件α、βは、文書内のタイトルは、文書内で上側、左側に位置し、かつ、フォントサイズも他の文字列よりも大きい、との仮定に基づく条件である。 Through the above processing, the application 401 creates the character attribute information 1201 of the print target file via the analysis unit 405.
Next, a process of acquiring the character string indicating the title of the document based on the information of the character attribute information 1201 and outputting the acquired character string by voice will be described.
The outline of the process is a process in which the application 401 searches the plurality of <data> elements in the character attribute information 1201 for character string information that satisfies the following conditions α and β.
Condition α: a character string located relatively on the upper side and the left side in the print image.
Condition β: a character string having a relatively large font size in the print image.
The conditions α and β are conditions based on the assumption that the title in the document is located on the upper side and the left side in the document and the font size is larger than that of other character strings.

図１４は、本実施形態のデータ処理装置１０１の処理の一例を示すフローチャートである。図１４を用いて、データ処理装置１０１が、文字属性情報１２０１の情報を元に、文書のタイトルを示す文字列を取得し、取得した文字列を音声出力する処理について説明する。
Ｓ１３０１において、アプリケーション４０１は、解析部４０５を介して、ＰＤＦファイルである印刷対象ファイル内に、タイトルの情報が含まれているか否かを判定する。印刷対象ファイル内にアクセシビリティ情報として、タイトルが定義された情報が含まれる場合がある。そこで、アプリケーション４０１は、解析部４０５を介して、印刷対象ファイルのドキュメント構造タグ（ＳｔｒｕｃｔＴｒｅｅＲｏｏｔ）内から、タイトルとして定義されている構造要素を探索する。アプリケーション４０１は、探索できた場合、処理をＳ１３０２に進め、探索できなかった場合、処理をＳ１３０３に進める。
本実施形態では、アプリケーション４０１は、印刷対象ファイルのドキュメント構造タグ（ＳｔｒｕｃｔＴｒｅｅＲｏｏｔ）内から、タイトルとして定義されている構造要素を探索することで、タイトルの情報を探索することとした。しかし、アプリケーション４０１は、印刷対象ファイルのドキュメント構造タグ（ＳｔｒｕｃｔＴｒｅｅＲｏｏｔ）内から、タイトルとして定義されている構造要素を探索しないこととしてもよい。例えば、印刷対象ファイル内にメタデータとして、タイトルの情報が含まれる場合がある。そこで、アプリケーション４０１は、印刷対象ファイルのメタデータから、タイトルを示すメタデータを探索することとしてもよい。 FIG. 14 is a flowchart showing an example of processing of the data processing device 101 of this embodiment. A process in which the data processing apparatus 101 acquires a character string indicating the title of a document based on the information of the character attribute information 1201 and outputs the acquired character string by voice will be described with reference to FIG. 14.
In step S1301, the application 401 determines, via the analysis unit 405, whether the print target file, which is a PDF file, includes title information. Information that defines a title may be included as accessibility information in the file to be printed. Therefore, the application 401 searches the document structure tag (StructureTreeRoot) of the file to be printed for a structural element defined as a title via the analysis unit 405. If the application 401 can be searched, the process proceeds to step S1302, and if the application cannot be searched, the process proceeds to step S1303.
In the present embodiment, the application 401 searches for the title information by searching for the structural element defined as the title in the document structure tag (StructureTreeRoot) of the print target file. However, the application 401 may not search for the structural element defined as the title in the document structure tag (StructureTreeRoot) of the print target file. For example, title information may be included as metadata in the print target file. Therefore, the application 401 may search the metadata of the print target file for the metadata indicating the title.

Ｓ１３０２において、アプリケーション４０１は、ＯＳ４１０が提供するスクリーンリーダ機能と協働してＳ１３０１で探索したタイトルの情報を、マイク・スピーカ２１４のスピーカを介して、音声出力する。
Ｓ１３０３において、アプリケーション４０１は、解析部４０５を介して、印刷対象ファイルから文字属性情報１２０１を作成する。
Ｓ１３０４において、アプリケーション４０１は、解析部４０５を介して、Ｓ１３０３で作成した文字属性情報１２０１内の＜ｄａｔａ＞要素のうち、文書内における上側の領域として予め定められた領域内に存在する＜ｄａｔａ＞要素を抽出する。本実施形態では、印刷画像に表される文書の領域をｙ方向に３分割した場合の最上部の領域を、文書内における上側の領域として予め定められた領域とする。また、文書内における上側の領域として予め定められた領域は、文書の領域をｙ方向に３分割した場合の最上部の領域に限定されない。例えば、文書内における上側の領域として予め定められた領域は、文書の領域をｙ方向に５分割した場合の最上部の領域であってもよい。
例えば、文書の下側にロゴがあり、ロゴの形によっては、サイズの大きい文字列と判定されてしまう場合がある。このような文字列を、文書のタイトルとして取得しないようにするために、アプリケーション４０１は、Ｓ１３０４の処理を行う。 In step S1302, the application 401 outputs the information of the title searched in step S1301 as a voice through the speaker of the microphone / speaker 214 in cooperation with the screen reader function provided by the OS 410.
In step S1303, the application 401 creates the character attribute information 1201 from the print target file via the analysis unit 405.
In step S1304, the application 401, via the analysis unit 405, exists in the area defined in advance as the upper area in the document among the <data> elements in the character attribute information 1201 created in step S1303. Extract the element. In the present embodiment, the uppermost area in the case where the area of the document represented by the print image is divided into three in the y direction is the area determined in advance as the upper area in the document. Further, the predetermined area as the upper area in the document is not limited to the uppermost area when the area of the document is divided into three in the y direction. For example, the predetermined region as the upper region in the document may be the uppermost region when the document region is divided into five in the y direction.
For example, there is a logo on the lower side of the document, and it may be determined that the character string has a large size depending on the shape of the logo. In order not to acquire such a character string as the title of the document, the application 401 performs the process of S1304.

Ｓ１３０５において、アプリケーション４０１は、解析部４０５を介して、Ｓ１３０４で抽出した＜ｄａｔａ＞要素のうち、フォントサイズが最も大きい＜ｄａｔａ＞要素を全て抽出する。
Ｓ１３０６において、アプリケーション４０１は、解析部４０５を介して、Ｓ１３０４で抽出した＜ｄａｔａ＞要素のうち、以下のような＜ｄａｔａ＞要素を抽出する。即ち、アプリケーション４０１は、Ｓ１３０５で抽出した＜ｄａｔａ＞要素に対応するフォントサイズとのサイズ差が、予め定められた閾値（例えば、０．５ｐ、１ｐ等）以下であるフォントサイズの＜ｄａｔａ＞要素を全て抽出する。
文字認識によるフォントサイズの特定処理は、精度が１００％ではない場合がある。そのため、フォントサイズが実際のサイズと比べて、０．５〜１．０ｐほどずれて特定される場合が生じうる。そこで、アプリケーション４０１は、そのような場合に対応して、Ｓ１３０６の処理を実行する。 In step S1305, the application 401 extracts all the <data> elements having the largest font size from the <data> elements extracted in step S1304 via the analysis unit 405.
In step S1306, the application 401 extracts the following <data> element from the <data> elements extracted in step S1304 via the analysis unit 405. That is, the application 401 has a font size <data> element whose size difference from the font size corresponding to the <data> element extracted in S1305 is less than or equal to a predetermined threshold value (for example, 0.5p, 1p, etc.). To extract all.
The accuracy of the font size specifying process by character recognition may not be 100%. Therefore, the font size may be specified by being displaced by 0.5 to 1.0 p from the actual size. Therefore, the application 401 executes the process of S1306 in response to such a case.

Ｓ１３０７において、アプリケーション４０１は、解析部４０５を介して、Ｓ１３０５とＳ１３０６とで抽出した＜ｄａｔａ＞要素中から、もっとも左上に位置する＜ｄａｔａ＞要素の文字列を、タイトルを示す文字列として取得する。より具体的には、アプリケーション４０１は、各＜ｄａｔａ＞要素の「文字位置（ｐｏｓｉｔｉｏｎ）」が示す文字枠の左上の座標値を取得する。そして、アプリケーション４０１は、取得した座標値のうち、対応するｘ座標値とｙ座標値との和が最も小さい座標値を特定する。アプリケーション４０１は、特定した座標値に対応する＜ｄａｔａ＞要素を、最も左上に存在する文字列を示す＜ｄａｔａ＞要素として特定する。アプリケーション４０１は、特定した＜ｄａｔａ＞要素に対応する文字列を、タイトルを示す文字列として取得する。
Ｓ１３０８において、アプリケーション４０１は、ＯＳ４１０が提供するスクリーンリーダ機能と協働してＳ１３０７で取得した文字列を、マイク・スピーカ２１４のスピーカを介して音声出力する。 In step S1307, the application 401 obtains, through the analysis unit 405, the character string of the <data> element located at the upper left as the character string indicating the title from the <data> elements extracted in steps S1305 and S1306. . More specifically, the application 401 acquires the upper left coordinate value of the character frame indicated by the “character position (position)” of each <data> element. Then, the application 401 identifies the coordinate value having the smallest sum of the corresponding x coordinate value and y coordinate value among the acquired coordinate values. The application 401 specifies the <data> element corresponding to the specified coordinate value as the <data> element indicating the character string that exists at the upper left. The application 401 acquires the character string corresponding to the specified <data> element as the character string indicating the title.
In step S1308, the application 401 outputs the character string acquired in step S1307 in cooperation with the screen reader function provided by the OS 410 via the speaker of the microphone / speaker 214.

なお、本実施形態では、アプリケーション４０１は、印刷対象ファイルに対応する文書のタイトルを示す文字列を１つだけ取得することとした。しかし、タイトルの候補が複数ある場合に対応するため、アプリケーション４０１は、印刷対象ファイルに対応する文書のタイトルを示す文字列を複数取得してもよい。その場合、アプリケーション４０１は、Ｓ１３０７で、Ｓ１３０５とＳ１３０６とで抽出した＜ｄａｔａ＞要素中から、各＜ｄａｔａ＞要素の「文字位置（ｐｏｓｉｔｉｏｎ）」が示す文字枠の左上の座標値を取得する。そして、アプリケーション４０１は、取得した座標値のうち、対応するｘ座標値とｙ座標値との和が、予め定められた閾値以下の座標値を複数特定する。アプリケーション４０１は、特定した座標値それぞれに対応する＜ｄａｔａ＞要素を、タイトルを示す文字列を示す＜ｄａｔａ＞要素として特定する。アプリケーション４０１は、特定した複数の＜ｄａｔａ＞要素に対応する複数の文字列を、タイトルを示す文字列として取得することとなる。そして、アプリケーション４０１は、ＯＳ４１０が提供するスクリーンリーダ機能と協働して特定した複数の文字列を音声出力する。
また、本実施形態では、アプリケーション４０１は、タイトルに相当する文字列を、音声出力の対象となる特定の文字列として取得した。しかし、アプリケーション４０１は、他の種類の文字列を音声出力の対象となる文字列として取得してもよい。例えば、アプリケーション４０１は、印刷画像から抽出した文字列に基づき要約を作成し、作成した要約を、音声出力の対象となる文字列として取得してもよい。なお、アプリケーション４０１は、公知の要約アルゴリズムを用いることで、印刷画像から抽出した文字列から要約を作成できる。アプリケーション４０１は、例えば、各文のグラフ表現におけるベクトルの類似度に基づき重要な文を抽出するＬｅｘＲａｎｋ等のアルゴリズムを用いることができる。また、アプリケーション４０１は、外部のクラウドサーバ等と連携して要約を作成することとしてもよい。 In this embodiment, the application 401 acquires only one character string indicating the title of the document corresponding to the print target file. However, since there is a plurality of title candidates, the application 401 may acquire a plurality of character strings indicating the title of the document corresponding to the print target file. In this case, in step S1307, the application 401 acquires the coordinate values at the upper left of the character frame indicated by the “character position (position)” of each <data> element from the <data> elements extracted in steps S1305 and S1306. Then, the application 401 specifies a plurality of coordinate values in which the sum of the corresponding x coordinate value and y coordinate value is equal to or less than a predetermined threshold value among the acquired coordinate values. The application 401 identifies the <data> element corresponding to each of the identified coordinate values as the <data> element indicating the character string indicating the title. The application 401 will acquire a plurality of character strings corresponding to the identified plurality of <data> elements as a character string indicating a title. Then, the application 401 outputs the plurality of specified character strings by voice in cooperation with the screen reader function provided by the OS 410.
Further, in the present embodiment, the application 401 acquires the character string corresponding to the title as the specific character string that is the target of audio output. However, the application 401 may acquire another type of character string as a character string that is the target of voice output. For example, the application 401 may create a summary based on the character string extracted from the print image, and acquire the created summary as a character string that is a target of voice output. The application 401 can create a summary from the character string extracted from the print image by using a known summary algorithm. The application 401 can use, for example, an algorithm such as LexRank that extracts important sentences based on the degree of vector similarity in the graph representation of each sentence. Further, the application 401 may create an abstract in cooperation with an external cloud server or the like.

以上、本実施形態の処理により、データ処理装置１０１は、印刷対象ファイルが指定された際に、印刷画像に対応する文書に関する特定の情報のみ音声出力できる。これにより、視覚障がい者のユーザは、より容易に、印刷対象ファイルの内容を確認することができる。 As described above, according to the processing of the present embodiment, when the file to be printed is designated, the data processing apparatus 101 can output only specific information regarding the document corresponding to the print image by voice. This allows the visually impaired user to more easily confirm the content of the print target file.

＜実施形態３＞
数値のみの文字列と表内の文字列とについては、音声で読み上げられても分かりづらく、かえって混乱を招く可能性があった。
そこで、本実施形態では、データ処理装置１０１が数値のみの文字列と表内の文字列とを、音声出力させないように制御する処理について説明する。
本実施形態のデータ処理システムのシステム構成は、実施形態１と同様である。また、データ処理システムの各構成要素のハードウェア構成及び機能構成についても、実施形態１と同様である。 <Embodiment 3>
As for the character strings containing only numerical values and the character strings in the table, it was difficult to understand even when read aloud, and there was a possibility of causing confusion.
Therefore, in the present embodiment, a process will be described in which the data processing apparatus 101 controls a character string having only numerical values and a character string in the table so as not to output a voice.
The system configuration of the data processing system of this embodiment is the same as that of the first embodiment. Further, the hardware configuration and the functional configuration of each component of the data processing system are the same as in the first embodiment.

図１５は、本実施形態のデータ処理装置１０１の処理の一例を示すフローチャートである。本実施形態では、印刷対象ファイルは、文書の画像を示す画像ファイルであるとする。
本実施形態では、データ処理装置１０１は、図１４のフローチャートのＳ１３０１〜Ｓ１３０３と同様の処理を実行する。データ処理装置１０１は、Ｓ１３０３の処理を実行した後に、図１５の処理を実行する。
Ｓ１５０１において、アプリケーション４０１は、解析部４０５を介して、Ｓ１３０３で作成した文字属性情報１２０１に含まれる＜ｄａｔａ＞要素の中で、タイプがＴａｂｌｅのものを削除する。
Ｓ１５０２において、アプリケーション４０１は、解析部４０５を介して、文字属性情報１２０１に含まれる＜ｄａｔａ＞要素の中で、予め定められた種類の文字を含まない文字列をもつ＜ｄａｔａ＞要素を削除する。本実施形態では、予め定められた種類の文字は、ひらがな、カタカナ、漢数字を除く漢字、アルファベットであるとする。これにより、アプリケーション４０１は、アラビア数字、漢数字等の数値のみで構成されるような文字列をもつ＜ｄａｔａ＞要素を削除できる。
Ｓ１５０３において、アプリケーション４０１は、マイク・スピーカ２１４のスピーカを介して、文字属性情報１２０１に残っている＜ｄａｔａ＞要素の文字列を音声出力する。 FIG. 15 is a flowchart showing an example of processing of the data processing apparatus 101 of this embodiment. In the present embodiment, the print target file is an image file showing an image of a document.
In the present embodiment, the data processing apparatus 101 executes the same processing as S1301 to S1303 in the flowchart of FIG. The data processing apparatus 101 executes the processing of S1303 and then the processing of FIG.
In step S1501, the application 401 deletes, via the analysis unit 405, the <data> element included in the character attribute information 1201 created in step S1303, of the type Table.
In step S1502, the application 401 deletes, through the analysis unit 405, a <data> element having a character string that does not include a character of a predetermined type among the <data> elements included in the character attribute information 1201. . In the present embodiment, it is assumed that the predetermined types of characters are hiragana, katakana, kanji excluding kanji and alphabets. As a result, the application 401 can delete the <data> element having a character string that is composed only of numerical values such as Arabic numerals and Chinese numerals.
In step S1503, the application 401 outputs the character string of the <data> element remaining in the character attribute information 1201 by voice through the speaker of the microphone / speaker 214.

本実施形態では、印刷対象ファイルは、文書の画像を示す画像ファイルであるとした。しかし、印刷対象ファイルは、文書の画像をコンテンツとして含むＰＤＦファイルであるとしてもよい。その場合、データ処理装置１０１は、印刷対象ファイルは、文書の画像を示す画像ファイルである場合と同様の処理を実行する。
また、印刷対象ファイルは、文字列をコンテンツとして含むＰＤＦファイルであるとしてもよい。その場合、データ処理装置１０１は、印刷対象ファイルにコンテンツとして含まれる文字列を取得し、取得した文字列から、表に格納される文字列と、予め定められた種類の文字を含まない文字列と、を除いた文字列を音声出力する。 In the present embodiment, the print target file is an image file showing an image of a document. However, the print target file may be a PDF file including the image of the document as content. In that case, the data processing apparatus 101 executes the same process as when the print target file is an image file showing an image of a document.
Further, the print target file may be a PDF file including a character string as content. In that case, the data processing apparatus 101 acquires a character string included in the print target file as content, and based on the acquired character string, a character string stored in a table and a character string that does not include a character of a predetermined type. The character string excluding and is output as voice.

以上、本実施形態では、データ処理装置１０１は、表に含まれている文字列、又は、数値のみで構成されているような文字列を音声出力しないように制御した。これにより、ユーザが混乱してしまう可能性を低減できる。 As described above, in the present embodiment, the data processing apparatus 101 controls so that the character strings included in the table or the character strings that are composed only of numerical values are not output by voice. This can reduce the possibility of user confusion.

＜その他の実施形態＞
本発明は、上述の実施形態の１以上の機能を実現するプログラムを、ネットワーク又は記憶媒体を介してシステム又は装置に供給し、そのシステム又は装置のコンピュータにおける１つ以上のプロセッサがプログラムを読み出し実行する処理でも実現可能である。また、１以上の機能を実現する回路（例えば、ＡＳＩＣ）によっても実現可能である。 <Other embodiments>
The present invention supplies a program that implements one or more functions of the above-described embodiments to a system or apparatus via a network or a storage medium, and one or more processors in a computer of the system or apparatus read and execute the program. It can also be realized by the processing. It can also be realized by a circuit (for example, ASIC) that realizes one or more functions.

例えば、上述したデータ処理システムの機能構成の一部又は全てをハードウェアとしてデータ処理装置１０１等に実装してもよい。
以上、本発明の実施形態の一例について詳述したが、本発明は係る特定の実施形態に限定されるものではない。例えば、上述した各実施形態を任意に組み合わせる等してもよい。 For example, part or all of the functional configuration of the data processing system described above may be implemented as hardware in the data processing device 101 or the like.
Although one example of the embodiment of the present invention has been described in detail above, the present invention is not limited to the specific embodiment. For example, the above-described embodiments may be arbitrarily combined.

１０１データ処理システム
２０２ＣＰＵ 101 data processing system 202 CPU

Claims

An acquisition means for acquiring analysis information of the print target file when the screen reader function is enabled and the print target file is designated;
Control means for controlling to output the analysis information acquired by the acquisition means by voice;
Information processing device having a.

The information according to claim 1, wherein the acquisition unit acquires the analysis information when the screen reader function is enabled and the print target file is designated through a tap operation on the input unit by a user. Processing equipment.

The information according to claim 1 or 2, wherein the acquisition unit acquires the analysis information based on the accessibility information when accessibility information used for accessibility improvement is included in a predetermined area in the print target file. Processing equipment.

The information processing apparatus according to claim 1, wherein the acquisition unit acquires the character string as the analysis information when the content data of the print target file includes a character string.

The information processing apparatus according to claim 4, wherein the acquisition unit acquires, as the analysis information, the character string indicating a title of a document corresponding to the print target file.

The information processing apparatus according to claim 4, wherein the acquisition unit acquires the character string not included in a table as the analysis information.

7. The information processing apparatus according to claim 4, wherein the acquisition unit acquires the character string including characters of a predetermined type as the analysis information.

Further comprising requesting means for requesting the recognition of the file to be printed to the operating system or cloud service of the information processing device,
The information processing apparatus according to claim 1, wherein the acquisition unit acquires, as the analysis information, information indicating a result of recognition processing obtained as a response to the request.

The acquisition unit acquires, as a result of the recognition processing, one of a character string indicating a recognition result of an image included in the print target file and a character string indicating a summary of a document included in the print target file. The information processing apparatus according to claim 8.

The information processing apparatus according to claim 1, wherein the acquisition unit acquires, as the analysis information, a character string included in a print image of a document corresponding to the print target file.

The information processing apparatus according to claim 10, wherein the acquisition unit acquires, as the analysis information, the character string indicating a title of a document corresponding to the print target file.

The information processing apparatus according to claim 11, wherein the acquisition unit acquires the character string that indicates the title and is not included in a table as the analysis information.

The information processing apparatus according to claim 11 or 12, wherein the acquisition unit acquires the character string including characters of a predetermined type as the analysis information.

An information processing method executed by an information processing device, comprising:
An acquisition step of acquiring analysis information of the print target file when the screen reader function is enabled and a print target file is designated;
A control step of controlling to output the analysis information acquired in the acquisition step by voice;
Information processing method including.

A program for causing a computer to function as each unit of the information processing apparatus according to claim 1.