JP2007081841A

JP2007081841A - Image reading device

Info

Publication number: JP2007081841A
Application number: JP2005267073A
Authority: JP
Inventors: Shoji Inoue; 省次井上
Original assignee: Fuji Xerox Co Ltd
Current assignee: Fujifilm Business Innovation Corp
Priority date: 2005-09-14
Filing date: 2005-09-14
Publication date: 2007-03-29

Abstract

<P>PROBLEM TO BE SOLVED: To provide an image reading device which is capable of outputting a document having several pages in a suitable mode without giving inconvenience to a user. <P>SOLUTION: The image reading device 100 is equipped with an image reading unit 1, an information extraction unit 2, a memory unit 3, a UI unit 4, an image outputting unit 5, and a control unit 6. The control unit 6 makes the image reading unit 1 read out a document having several pages and generate image data, and makes the information extraction unit 2 extract page numbers from the image data. When the control unit 6 detects that the page numbers is increased or decreased irregularly, it determines that the lacuna of pages occurs at this part. When the pages are read out corresponding to a part where the lacuna of pages occurs, the control unit 6 sorts out the whole image data in a proper order on the basis of the page numbers of the image data and outputs them. <P>COPYRIGHT: (C)2007,JPO&INPIT

Description

本発明は、画像を光学的に読み取る画像読取装置に係り、特に、書籍や資料を読み取る際のユーザの利便性を向上させるための技術に関する。 The present invention relates to an image reading apparatus that optically reads an image, and more particularly to a technique for improving user convenience when reading a book or a document.

書籍や資料等の複数ページに渡る文書をスキャナや複写機で読み取るときには、ユーザは１ページずつページを変えながら読み取る必要がある。このような単調な作業を延々と繰り返していると、人間は往々にしてミスをしてしまう。ここでいうミスとは、例えば、必要なページを飛ばしてしまったり、同じページを繰り返し読み取ってしまったり、あるいは必要なページの読み取りは終わったのに不必要なページまで読み取ってしまう、といった類のものである。一般に、このようなミスは自分でも気付かないうちに行ってしまうため、その発見は困難である。 When reading a document that covers a plurality of pages, such as a book or a document, with a scanner or a copying machine, the user needs to read the pages while changing the pages one page at a time. When such a monotonous task is repeated endlessly, humans often make mistakes. Mistakes here are, for example, skipping necessary pages, reading the same page repeatedly, or reading unnecessary pages even after reading necessary pages. Is. In general, it is difficult to find such mistakes because they make them without even being aware of them.

また、スキャナや複写機の一部には、複数ページの文書を連続的に読み取る自動原稿送り装置（ＡＤＦ）を備えるものがある。ＡＤＦは大量の文書を読み取るときに大きな威力を発揮するが、複数枚の文書を重ねて供給してしまう、いわゆる「重送」を生じさせることがあり、かかる場合には一部ページが欠落して読み取られてしまうことになる。 Some scanners and copiers include an automatic document feeder (ADF) that continuously reads a document of a plurality of pages. ADF is very powerful when reading a large amount of documents, but it may cause so-called “multiple feeding” that feeds multiple documents in a stack, and in such a case, some pages are missing. Will be read.

特許文献１には、これらの不都合を回避するものとして、連続コピー時に各々の文書からページ数を読み取り、そのページ数が連続的に増加していない場合には、連続コピーの完了後にユーザに通知する技術が開示されている。このような技術を用いれば、ユーザは欠落しているページを容易に認識することができるので、欠落しているページを追加コピーして補うことができる。
特開２００２−１３９９６５号公報 In order to avoid these disadvantages, Patent Document 1 reads the number of pages from each document during continuous copying, and notifies the user after completion of continuous copying if the number of pages has not increased continuously. Techniques to do this are disclosed. If such a technique is used, the user can easily recognize the missing page, so that the missing page can be supplemented by additional copying.
JP 2002-139965 A

しかしながら、上述の特許文献１に記載の技術を用いた場合、欠落しているページを補完するためには、（１）文書の中から欠落しているページの原稿を探し出し、（２）探し出した原稿をコピーし、（３）複数ページの複写物の中からページの欠落した箇所を特定し、（４）（２）でコピーされた複写物を（３）で特定した箇所に挿入する、という極めて煩雑な作業をユーザが行う必要がある。特に、ページの欠落した箇所を特定するという（２）の作業は、文書が膨大なページ数に渡るときには困難を極める。 However, when the technique described in Patent Document 1 is used, in order to complement the missing page, (1) the document of the missing page is searched from the document, and (2) the document is searched. The manuscript is copied, (3) the missing page is identified from the multi-page copy, and the copy copied in (4) and (2) is inserted into the location specified in (3). The user needs to perform extremely complicated work. In particular, the operation (2) of identifying a missing page portion is extremely difficult when the document has an enormous number of pages.

また、特許文献１に記載の技術においては、各ページの比較をページ番号を基準に行っているため、ページ番号が表示されていない文書に対しては何ら効果を奏さないという不都合もある。 Further, in the technique described in Patent Document 1, since each page is compared on the basis of the page number, there is a disadvantage that it is not effective for a document in which the page number is not displayed.

本発明は上述の事情に鑑みてなされたものであり、その目的は、ユーザにかかる不便を強いることなく、複数ページに渡る文書を好適な態様で出力することにある。 The present invention has been made in view of the above-described circumstances, and an object of the present invention is to output a document extending over a plurality of pages in a preferable manner without inconvenience to the user.

上述の目的を達成するために、本発明に係る画像読取装置は、ページ番号が表示されている複数枚の原稿を１枚ずつ読み取って、各原稿を表す画像データを順次生成する読取手段と、前記読取手段により生成された画像データからページ番号を抽出する抽出手段と、前記抽出手段が、第１の画像データから第１のページ番号を抽出し、その直後に第２の画像データから第２のページ番号を抽出したときに、前記第１のページ番号と前記第２のページ番号との差が１でない場合には、前記第２の画像データを削除する削除手段と、前記読取手段により生成され、かつ、前記削除手段により削除されない画像データを出力する出力手段とを備える。
または、本発明に係る画像読取装置は、ページ番号が表示されている複数枚の原稿を１枚ずつ読み取って、各原稿を表す画像データを順次生成する読取手段と、前記読取手段により生成された画像データからページ番号を抽出する抽出手段と、前記抽出手段による抽出結果が、Ｎを３以上の整数、ｄを正または負の整数、Ｐを２ｄより大である整数としたときに、（Ｎ−２）枚目のページ番号が（Ｐ−２ｄ）、（Ｎ−１）枚目のページ番号が（Ｐ−ｄ）であり、かつ、Ｎ枚目のページ番号がＰでない場合に、当該Ｎ枚目の画像データを削除する削除手段と、前記読取手段により生成され、かつ、前記削除手段により削除されない画像データを出力する出力手段とを備える。 In order to achieve the above-described object, an image reading apparatus according to the present invention includes a reading unit that reads a plurality of documents on which page numbers are displayed one by one and sequentially generates image data representing each document; An extraction unit that extracts a page number from the image data generated by the reading unit, and the extraction unit extracts a first page number from the first image data, and immediately after that, a second number is extracted from the second image data. If the difference between the first page number and the second page number is not 1 when the page number is extracted, the deletion unit deletes the second image data and the reading unit generates the page number. And an output unit that outputs image data that is not deleted by the deleting unit.
Alternatively, the image reading apparatus according to the present invention reads a plurality of originals on which page numbers are displayed one by one and sequentially generates image data representing each original, and the reading means generates the image data. When the extraction means for extracting the page number from the image data, and the extraction result by the extraction means, N is an integer greater than or equal to 3, d is a positive or negative integer, and P is an integer greater than 2d, (N -2) If the page number is (P-2d), the (N-1) th page number is (P-d), and the Nth page number is not P, the N A deletion unit that deletes the image data of the first sheet; and an output unit that outputs image data generated by the reading unit and not deleted by the deletion unit.

かかる画像読取装置によれば、ページ番号が不規則に増加（ないし減少）したことを検出すると、このページ番号が抽出された画像データは削除される。つまり、出力手段により出力されるのは、ページ番号が規則的に増加（ないし減少）している画像データのみとなる。 According to such an image reading apparatus, when it is detected that the page number is irregularly increased (or decreased), the image data from which the page number is extracted is deleted. That is, only the image data whose page number is regularly increased (or decreased) is output by the output means.

また、本発明に係る画像読取装置は、ページ番号が表示されている複数枚の原稿を１枚ずつ読み取って、各原稿を表す画像データを順次生成する読取手段と、前記読取手段により生成された画像データからページ番号を抽出する抽出手段と、前記抽出手段が、第１の画像データから第１のページ番号を抽出し、その直後に第２の画像データから第２のページ番号を抽出したときに、前記第１のページ番号と前記第２のページ番号とが同一でなく、かつ、その差が１でない場合に、前記第１のページ番号より大きく前記第２のページ番号より小さい値か、あるいは前記第２のページ番号より大きく前記第１のページ番号より小さい値を記憶する第２の記憶手段と、前記抽出手段によりページ番号を抽出された画像データについて、そのページ番号が前記第２の記憶手段に記憶された値に一致するか否かを判断する判断手段と、前記判断手段の判断結果が肯定的である場合に、前記第１の記憶手段から各ページの画像データとページ番号とを読み出し、ページ番号が増加あるいは減少するような順番で画像データを出力する出力手段とを備える。
または、本発明に係る画像読取装置は、ページ番号が表示されている複数枚の原稿を１枚ずつ読み取って、各原稿を表す画像データを順次生成する読取手段と、前記読取手段により生成された画像データからページ番号を抽出する抽出手段と、前記読取手段により生成された画像データと、前記抽出手段により抽出された当該画像データのページ番号とを関連付けて記憶する第１の記憶手段と、前記抽出手段による抽出結果が、Ｎを３以上の整数、ｄを正または負の整数、Ｐを２ｄより大である整数としたときに、（Ｎ−２）枚目のページ番号が（Ｐ−２ｄ）、（Ｎ−１）枚目のページ番号が（Ｐ−ｄ）であり、かつ、Ｎ枚目のページ番号がＰでない場合に、（Ｐ−ｄ）より大きくＮ枚目のページ番号より小さい値か、あるいはＮ枚目のページ番号より大きく（Ｐ−ｄ）より小さい値を記憶する第２の記憶手段と、前記抽出手段によりページ番号を抽出された画像データについて、そのページ番号が前記第２の記憶手段に記憶された値に一致するか否かを判断する判断手段と、前記判断手段の判断結果が肯定的である場合に、前記第１の記憶手段から各ページの画像データとページ番号とを読み出し、ページ番号が増加あるいは減少するような順番で画像データを出力する出力手段とを備える。 The image reading apparatus according to the present invention reads a plurality of originals on which page numbers are displayed one by one, sequentially generates image data representing each original, and the reading means generates the image data. Extraction means for extracting the page number from the image data, and when the extraction means extracts the first page number from the first image data and immediately after that extracts the second page number from the second image data In addition, when the first page number and the second page number are not the same and the difference is not 1, the value is larger than the first page number and smaller than the second page number, Alternatively, the second storage means for storing a value larger than the second page number and smaller than the first page number, and the page number of the image data extracted by the extraction means. Determining means for determining whether or not the value matches the value stored in the second storage means, and if the determination result of the determination means is affirmative, the image of each page from the first storage means Output means for reading out the data and the page number and outputting the image data in the order in which the page number increases or decreases;
Alternatively, the image reading apparatus according to the present invention reads a plurality of originals on which page numbers are displayed one by one and sequentially generates image data representing each original, and the reading means generates the image data. Extraction means for extracting page numbers from image data; first storage means for storing image data generated by the reading means; and page numbers of the image data extracted by the extraction means in association with each other; When the extraction result by the extraction unit is N as an integer of 3 or more, d as a positive or negative integer, and P as an integer greater than 2d, the (N-2) th page number is (P-2d). ) And (N−1) th page number is (Pd) and the Nth page number is not P, it is larger than (Pd) and smaller than the Nth page number. Value or Nth page A second storage means for storing a value larger than the number (Pd) and a value stored in the second storage means for the image data from which the page number has been extracted by the extraction means; If the judgment result of the judgment means is positive and the judgment result of the judgment means is affirmative, the image data and page number of each page are read from the first storage means, and the page number increases. Alternatively, output means for outputting the image data in a decreasing order.

かかる画像読取装置によれば、ページ番号が不規則に増加（ないし減少）し、読み取られたページに欠落が生じている場合であっても、その後に欠落箇所に相当する画像データが入力されれば、画像データはソート（並び替え）されてページ番号が規則的に増加（ないし減少）するような順番で出力される。 According to such an image reading apparatus, even when the page number is irregularly increased (or decreased) and the read page is missing, image data corresponding to the missing portion is input thereafter. For example, the image data is sorted (rearranged) and output in the order in which the page numbers regularly increase (or decrease).

また、本発明に係る画像読取装置は、文書を表す複数枚の原稿であって、かつ、文書の区切りを示す文字または番号である区分情報が表示されている原稿を１枚ずつ読み取り、各原稿を表す画像データを順次生成する読取手段と、前記読取手段により生成された画像データから前記区分情報を抽出する抽出手段と、Ｎを２以上の整数としたときに、前記抽出手段によりＮ枚目において抽出された区分情報が（Ｎ−１）枚目において抽出された区分情報と異なる場合に、文書の区切りの変化をユーザに通知する通知手段と、前記抽出手段により区分情報を抽出された画像データを出力する出力手段とを備える。 The image reading apparatus according to the present invention reads a single document that is a plurality of originals representing a document and that displays classification information that is a character or a number indicating a document separation one by one. Reading means for sequentially generating image data representing the image data, extracting means for extracting the classification information from the image data generated by the reading means, and when N is an integer of 2 or more, the extracting means When the category information extracted in step (N-1) is different from the category information extracted in the (N-1) th image, a notification means for notifying the user of a change in document separation, and an image from which the category information is extracted by the extraction means Output means for outputting data.

なお、この画像読取装置は、前記抽出手段によりＮ枚目において抽出された区分情報が（Ｎ−１）枚目において抽出された区分情報と異なる場合に、当該Ｎ枚目の画像データを削除する削除手段を備える構成であることが望ましい。 The image reading apparatus deletes the Nth image data when the classification information extracted on the Nth sheet by the extraction unit is different from the classification information extracted on the (N-1) th sheet. It is desirable for the configuration to include a deleting unit.

かかる画像読取装置によれば、文書の区切りを示す章番号や章タイトル等の区分情報を抽出することにより、文書の区切りをユーザに通知することが可能となる。そのため、たとえ文書にページ番号が表示されていない場合であっても、ユーザが気付かないうちに文書の不要な箇所までをも読み取ってしまうといった不都合を解消することができる。 According to such an image reading apparatus, it is possible to notify a user of a document delimiter by extracting section information such as a chapter number and a chapter title indicating a document delimiter. Therefore, even if the page number is not displayed on the document, it is possible to eliminate the inconvenience of reading even an unnecessary part of the document without the user's knowledge.

以上のように、本発明によれば、ユーザに不便を強いることなく、複数ページに渡る文書を好適な態様で出力することが可能となる。 As described above, according to the present invention, it is possible to output a document extending over a plurality of pages in a suitable manner without inconvenience to the user.

［１．構成］
以下においては、本発明の好適な実施形態を例示し、図面を参照しつつ詳細に説明する。図１は、本発明の一実施形態に係る画像読取装置１００を示したブロック図である。画像読取装置１００は、画像読取部１と、情報抽出部２と、記憶部３と、ＵＩ（User Interface）部４と、画像出力部５と、制御部６とを備える。 [1. Constitution]
In the following, preferred embodiments of the present invention will be illustrated and described in detail with reference to the drawings. FIG. 1 is a block diagram showing an image reading apparatus 100 according to an embodiment of the present invention. The image reading apparatus 100 includes an image reading unit 1, an information extraction unit 2, a storage unit 3, a UI (User Interface) unit 4, an image output unit 5, and a control unit 6.

画像読取部１はいわゆるスキャナの機能を有しており、プラテンガラスに載置された原稿を光学的に読み取って、原稿を表す画像データを生成する。なお、画像読取部１は、複数ページの原稿を１枚ずつ連続的に読み取る、いわゆるＡＤＦ（Auto Document Feeder）を備えるものであってもよい。画像読取部１がＡＤＦを備えていない場合には、ユーザがページを１枚ずつ読み取らせればよい。 The image reading unit 1 has a so-called scanner function, and optically reads a document placed on a platen glass and generates image data representing the document. The image reading unit 1 may include a so-called ADF (Auto Document Feeder) that continuously reads a plurality of pages of documents one by one. When the image reading unit 1 does not include the ADF, the user may read the pages one by one.

情報抽出部２は画像読取部１により生成された画像データにＯＣＲ（Optical Character Recognition）処理を施し、各々の画像データからページ番号や章番号等の識別情報を抽出する。情報抽出部２は、識別情報を画像データの任意の位置から抽出することができるが、識別情報の位置があらかじめ決められている原稿を読み取り対象としている場合には、その位置だけにＯＣＲ処理を施してもよい。例えば、原稿のいわゆるフッタ領域や、原稿の下部または下部中央に相当する位置にだけＯＣＲ処理を施してもよい。また、読み取り対象となる位置は、ユーザが指定できるように構成してもよい。このような構成とすれば、識別情報を高精度に抽出することが可能となる。 The information extraction unit 2 performs OCR (Optical Character Recognition) processing on the image data generated by the image reading unit 1 and extracts identification information such as a page number and a chapter number from each image data. The information extraction unit 2 can extract the identification information from an arbitrary position of the image data. However, when a document whose identification information position is determined in advance is to be read, an OCR process is performed only on that position. You may give it. For example, the OCR process may be performed only on a so-called footer region of the document or a position corresponding to the lower part or the lower center of the document. Moreover, you may comprise so that a user can designate the position used as reading object. With such a configuration, identification information can be extracted with high accuracy.

記憶部３はハードディスク等の記憶装置であり、種々のデータを記憶することができる。ＵＩ部４はタッチパネル、スピーカおよびボタンを備えた入力装置であり、ユーザに各種の情報を通知するとともにユーザからの入力を受け付ける。画像出力部５は画像データをプリンタや記憶装置等の外部装置に出力する。制御部６はＣＰＵ（Central Processing Unit）やＲＡＭ（Random Access Memory）、ＲＯＭ（Read Only Memory）等を備えた演算処理装置であり、ＲＯＭに記憶されたプログラムを実行することによって読み取りから出力に至るまでの各部の動作を制御する。 The storage unit 3 is a storage device such as a hard disk and can store various data. The UI unit 4 is an input device that includes a touch panel, a speaker, and buttons, and notifies the user of various types of information and receives input from the user. The image output unit 5 outputs the image data to an external device such as a printer or a storage device. The control unit 6 is an arithmetic processing unit including a CPU (Central Processing Unit), a RAM (Random Access Memory), a ROM (Read Only Memory), and the like, and from reading to output by executing a program stored in the ROM. Control the operation of each part up to.

［２．動作］
以上の構成のもと、画像読取装置１００は原稿を読み取って外部装置に出力する。このとき、画像読取装置１００は画像データから識別情報を抽出し、抽出した情報に基づいてその後の動作を決定する。そこで、以下では識別情報に応じた動作例を挙げ、画像読取装置１００の動作について説明する。ここでは、画像読取装置１００が原稿を連続的に読み取る場合の動作を説明する。 [2. Operation]
With the above configuration, the image reading apparatus 100 reads a document and outputs it to an external device. At this time, the image reading apparatus 100 extracts identification information from the image data, and determines a subsequent operation based on the extracted information. Accordingly, the operation of the image reading apparatus 100 will be described below with an operation example corresponding to the identification information. Here, an operation when the image reading apparatus 100 continuously reads a document will be described.

［２−１．動作例１］
ここでは、識別情報がページ番号であって、ページの欠落があった場合の動作について説明する。ここで、「ページの欠落」とは、例えば１（ページ）、２（ページ）、３（ページ）、５（ページ）…、というように、ページ番号の増加が不規則になった場合を意味するものである。もちろん、ページ番号が規則的に増加する場合だけでなく、規則的に減少する場合も含むことができる。 [2-1. Operation example 1]
Here, an operation when the identification information is a page number and a page is missing will be described. Here, “missing page” means a case where the increase of the page number becomes irregular, such as 1 (page), 2 (page), 3 (page), 5 (page)... To do. Of course, not only the case where the page number increases regularly but also the case where the page number decreases regularly can be included.

なお、例えば書籍を見開きの状態、すなわち左右のページが向かい合う状態で読み取ろうとした場合、片側のページ番号について見れば、このページ番号は２（ページ）、４（ページ）、６（ページ）…、というように増加していく。同様に、両面コピーされている原稿の一方の面だけを連続的に読み取ろうとした場合にも、ページ番号は２（ページ）、４（ページ）、６（ページ）…、というように増加していく。このような場合も、ページ番号は規則的に増加していると言える。つまり、上述の「ページの欠落」を一般化すると、「それまで等差で増加（あるいは減少）していたページ番号が等差で増加（あるいは減少）しなかった場合」と言い換えることができる。 For example, when trying to read a book in a spread state, that is, when the left and right pages are facing each other, this page number is 2 (page), 4 (page), 6 (page) ... And so on. Similarly, when only one side of an original that has been copied on both sides is to be read continuously, the page number increases to 2 (page), 4 (page), 6 (page), and so on. Go. Even in such a case, it can be said that the page number is regularly increasing. In other words, generalizing the above “missing pages” can be rephrased as “when the page number that has been increased (or decreased) by the same difference until then has not increased (or decreased) by the same difference”.

図２および図３は、本動作例において画像読取装置１００が行う処理を示したフローチャートであり、以下、これらの図に沿って説明する。はじめに、画像読取装置１００の制御部６は、画像データを生成して記憶する一連の処理（以下「画像データ記憶処理」という。）を行う（Ｓ１０１）。この画像データ記憶処理を詳細に示したのが図３である。すなわち、このステップにおいて、制御部６は画像読取部１に原稿を読み取らせて画像データを生成し（Ｓ１）、読み取った原稿が何枚目の原稿であるかをカウントし（Ｓ２）、生成された画像データのページ番号を情報抽出部２に抽出させ（Ｓ３）、画像データとページ番号とを記憶部３に記憶させる（Ｓ４）。なお、以下においては、ステップＳ２においてカウントされたカウント値をＮ、ステップＳ３において抽出されるＮ枚目のページ番号をＰ_Ｎとする。そして記憶部３は、各々の画像データについて、カウント値Ｎとページ番号Ｐ_Ｎとを関連付けながら記憶していく。 2 and 3 are flowcharts showing processing performed by the image reading apparatus 100 in this operation example, and will be described below with reference to these drawings. First, the control unit 6 of the image reading apparatus 100 performs a series of processes for generating and storing image data (hereinafter referred to as “image data storage process”) (S101). FIG. 3 shows this image data storage process in detail. That is, in this step, the control unit 6 causes the image reading unit 1 to read a document to generate image data (S1), counts what number of the read document is (S2), and is generated. The page number of the image data is extracted by the information extraction unit 2 (S3), and the image data and the page number are stored in the storage unit 3 (S4). In the following, the counted count value in step S2 N, N-th page number extracted in step S3 and P _N. The storage unit 3, for each of the image data, will store in relationship to the count value N and the page number P _N.

画像データを記憶したら、制御部６は直前のカウント値Ｎが２より大きいか否かを判断する（Ｓ１０２）。この処理を行う理由は、少なくとも２ページ分のページ番号を抽出した後でなければ、ページ番号がどのように増減しているかを特定することができない（後述する値ｄを算出することができない）からである。そのため、直前のカウント値Ｎが２以下であれば（Ｓ１０２；ＮＯ）、制御部６は画像データ記憶処理を再度行う。 After storing the image data, the control unit 6 determines whether or not the immediately preceding count value N is larger than 2 (S102). The reason why this process is performed is that after the page numbers for at least two pages are extracted, it is not possible to specify how the page numbers increase or decrease (a value d described later cannot be calculated). Because. Therefore, if the immediately preceding count value N is 2 or less (S102; NO), the control unit 6 performs the image data storage process again.

直前のカウント値Ｎが２より大きい場合、すなわち３ページ分以上の画像データが生成されている場合には（Ｓ１０２；ＹＥＳ）、Ｎ枚目に読み取られた画像データのページ番号Ｐ_Ｎと（Ｎ−１）枚目に読み取られた画像データのページ番号Ｐ_Ｎ−１の差分が決められた値ｄであるか否かを判断する（Ｓ１０３）。 When the immediately preceding count value N is larger than 2, that is, when image data for three pages or more is generated (S102; YES), the page number P _N of the image data read for the Nth sheet and (N -1) It is determined whether or not the difference between the page numbers _PN-1 of the image data read for the first sheet is a predetermined value d (S103).

なお、値ｄの算出は、（Ｎ−１）枚目に読み取られた画像データのページ番号Ｐ_Ｎ−１から（Ｎ−２）枚目に読み取られた画像データのページ番号Ｐ_Ｎ−２を減算することにより求められる。すなわち、ページ番号Ｐ_Ｎを等差数列であると考えれば、値ｄはこの等差数列の公差に相当するものである。例えば、ページ番号が１ずつ規則的に増加する場合の値ｄは「１」、ページ番号が２ずつ規則的に減少する場合の値ｄは「−２」、といった具合になる。また、値ｄをあらかじめ決められた値（例えば「１」）とする場合には、値ｄを算出する必要はなく、そのためステップＳ１０２の処理も省略可能である。 The calculation of the value d is the page number P _N-2 of the (N-1) th from the page number P _N-1 of the image data read in (N-2) th image data read in It is obtained by subtraction. That is, given the page number P _N to be arithmetic progression, the value d is equivalent to the tolerance of the arithmetic progression. For example, the value d when the page number increases regularly by 1 is “1”, the value d when the page number decreases regularly by 2 is “−2”, and so on. Further, when the value d is set to a predetermined value (for example, “1”), it is not necessary to calculate the value d, and therefore the process of step S102 can be omitted.

ステップＳ１０３において、ページ番号の差分「Ｐ_Ｎ−Ｐ_Ｎ−１」が値ｄに等しい場合には（Ｓ１０３；ＹＥＳ）、制御部６はページ番号の増加（あるいは減少）が規則的であると判断し、ＵＩ部４のタッチパネルにページ番号Ｐ_Ｎを表示させる（Ｓ１０４）。一方、ページ番号の差分「Ｐ_Ｎ−Ｐ_Ｎ−１」が値ｄと異なる場合には（Ｓ１０３；ＮＯ）、制御部６は欠落しているページに相当するページ番号Ｐ_Ｘを推測し、これをＲＡＭに一時的に記憶する（Ｓ１０５）。ページ番号Ｐ_Ｘは、例えば、（Ｎ−１）枚目に読み取られた画像データのページ番号Ｐ_Ｎ−１から値ｄを減算したり、Ｎ枚目に読み取られた画像データのページ番号Ｐ_Ｎに値ｄを加算することによって得られる。このとき制御部６は、例えば「ページの欠落がありました。」といった内容の警告をＵＩ部４を介してユーザに通知する（Ｓ１０６）。 In step S103, when the page number difference “P _N −P _N−1 ” is equal to the value d (S103; YES), the control unit 6 determines that the increase (or decrease) in the page number is regular. Then, the page number _PN is displayed on the touch panel of the UI unit 4 (S104). On the other hand, if the difference between the page number _"P _{N -P N-1"} is different from the value d (S103; NO), the control unit 6 infers the page number _{P X} that corresponds to the page missing, this Is temporarily stored in the RAM (S105). Page number P _X, for example, (N-1) th or subtracting the page number P _N-1 from the value d of the image data read in, the page number of the image data read in the N-th P _N Is obtained by adding the value d to. At this time, the control unit 6 notifies the user of a warning with a content such as “There was a missing page” via the UI unit 4 (S106).

その後、制御部６は引き続き画像データ記憶処理を実行する（Ｓ１０７）。この処理はステップＳ１０１の処理と同様である。続いて、制御部６はステップＳ１０３と同様の要領でページ番号の差分の判断を行う（Ｓ１０８）。このときの判断が肯定的であれば（Ｓ１０８；ＹＥＳ）、制御部６はＵＩ部４のタッチパネルにページ番号Ｐ_Ｎを表示させる（Ｓ１０９）。この処理はステップＳ１０４と同様の処理である。一方、このときの判断が否定的であれば（Ｓ１０８；ＮＯ）、制御部６はＲＡＭに記憶されているページ番号Ｐ_Ｘと直前のページ番号Ｐ_Ｎとが一致するか否かを判断する（Ｓ１１０）。 Thereafter, the control unit 6 continues to execute the image data storage process (S107). This process is the same as the process of step S101. Subsequently, the control unit 6 determines the difference between the page numbers in the same manner as in step S103 (S108). If the affirmative determination at this time (S108; YES), the control unit 6 to display the page number _{P N} on the touch panel of the UI unit 4 (S109). This process is the same as step S104. On the other hand, if the negative and the determination of this time (S108; NO), the control unit 6 determines whether the page number P _N just before the page number P _X stored in the RAM matches ( S110).

これらのページ番号が一致する場合（Ｓ１１０；ＹＥＳ）、欠落していたページがユーザによる割り込み等により補完されたことを意味しているので、制御部６はＵＩ部４のタッチパネルにページ番号Ｐ_Ｎを表示させる（Ｓ１０９）。一方、これらのページ番号が一致しない場合（Ｓ１１０；ＮＯ）、ページの欠落が再び生じたことを意味している。そこで、ページ番号が一致しない場合には、上述したステップＳ１０５、Ｓ１０６と同様の処理を実行する（Ｓ１１１、Ｓ１１２）。 If these page numbers match (S110; YES), it means that the missing page has been supplemented by an interruption or the like by the user, so the control unit 6 displays the page number P _{N on} the touch panel of the UI unit 4. Is displayed (S109). On the other hand, if these page numbers do not match (S110; NO), it means that the page is missing again. Therefore, if the page numbers do not match, the same processing as in steps S105 and S106 described above is executed (S111 and S112).

以上の処理が終了したら、制御部６は全ての原稿を読み取ったか否かを判断する（Ｓ１１３）。この処理は、画像読取部１がＡＤＦを備えている場合であれば、ＡＤＦに原稿が載置されているかを判断すればよく、画像読取部１がＡＤＦを備えていない場合であれば、原稿の読み取りが所定の時間以上行われていないかを判断すればよい。読み取るべき原稿が残っている場合には（Ｓ１１３；ＮＯ）、制御部６は上述のステップＳ１０７からの処理を繰り返す。一方、全ての原稿を読み取った場合には（Ｓ１１３；ＹＥＳ）、制御部６は記憶部３に記憶された画像データをソートし、そのページ番号が規則的に増加または減少するような順番で出力する（Ｓ１１４）。 When the above processing is completed, the control unit 6 determines whether all the originals have been read (S113). In this process, if the image reading unit 1 has an ADF, it is only necessary to determine whether the document is placed on the ADF. If the image reading unit 1 does not have an ADF, the document What is necessary is just to judge whether reading is not performed more than predetermined time. If there remains a document to be read (S113; NO), the control unit 6 repeats the processing from step S107 described above. On the other hand, when all the originals have been read (S113; YES), the control unit 6 sorts the image data stored in the storage unit 3 and outputs them in the order in which the page numbers increase or decrease regularly. (S114).

画像読取装置１００がこのような処理を行うことで、ユーザはページの欠落を瞬時に知ることが可能となる。このとき、ユーザは欠落した箇所の原稿を容易に特定することができるので、原稿を１ページ分戻して読み取らせたり、重送されてしまった原稿を割り込ませたりすることによって、欠落したページを補完することができる。また、画像読取装置１００は画像データをソートしてから出力するので、原稿を割り込ませるのはいかなるタイミングであってもよい。ゆえに、この画像読取装置１００によれば、ユーザに不便を強いることなく、複数ページに渡る文書を適切な順番で出力することが可能となる。 When the image reading apparatus 100 performs such processing, the user can instantly know that a page is missing. At this time, since the user can easily identify the original of the missing part, the user can read back the original by one page, read the original, or interrupt the original that has been double-fed. Can be complemented. In addition, since the image reading apparatus 100 sorts and outputs the image data, the document may be interrupted at any timing. Therefore, according to the image reading apparatus 100, it is possible to output documents over a plurality of pages in an appropriate order without inconvenience to the user.

［２−２．動作例２］
上述の動作例１においては、「それまで等差で増加（あるいは減少）していたページ番号が等差で増加（あるいは減少）しなかった場合」に、ページに欠落が生じているとみなして処理を行った。しかしながら、「それまで等差で増加（あるいは減少）していたページ番号が等差で増加（あるいは減少）しなかった場合」には、例えば１（ページ）、２（ページ）、３（ページ）、３（ページ）…、というように、同じページ番号が続けて抽出された場合も含まれている。そこで以下では、このようにページに重複が生じた場合の動作について説明する。 [2-2. Operation example 2]
In the operation example 1 described above, it is assumed that a page is missing when “a page number that has been increased (or decreased) by an equal difference has not increased (or decreased) by an equal difference”. Processed. However, in the case where “the page number that has been increased (or decreased) by an equal difference has not increased (or decreased) by an equal difference”, for example, 1 (page), 2 (page), 3 (page) 3 (pages)... And the same page number is extracted continuously. Therefore, in the following, the operation in the case where duplication occurs in the page will be described.

図４は、本動作例において画像読取装置１００が行う処理を示したフローチャートであり、以下、この図に沿って説明する。はじめに、画像読取装置１００の制御部６は、上述の第１動作例と同様の画像データ記憶処理を行う（Ｓ２０１）。画像データを記憶したら、制御部６は直前のカウント値Ｎが１より大きいか否かを判断する（Ｓ２０２）。この処理を行う理由は、少なくとも２ページ分のページ番号を抽出しなければ、ページ番号の重複を判断することができないからである。そのため、直前のカウント値Ｎが１であれば（Ｓ２０２；ＮＯ）、制御部６は画像データ記憶処理を再度行う。 FIG. 4 is a flowchart showing processing performed by the image reading apparatus 100 in this operation example, and will be described below with reference to this figure. First, the control unit 6 of the image reading apparatus 100 performs the same image data storage process as in the first operation example (S201). After storing the image data, the control unit 6 determines whether or not the immediately preceding count value N is greater than 1 (S202). The reason for performing this process is that it is not possible to determine the overlap of page numbers unless the page numbers for at least two pages are extracted. Therefore, if the immediately preceding count value N is 1 (S202; NO), the control unit 6 performs the image data storage process again.

直前のカウント値Ｎが１より大きい場合、すなわち２ページ分以上の画像データが生成されている場合には（Ｓ２０２；ＹＥＳ）、Ｎ枚目に読み取られた画像データのページ番号Ｐ_Ｎと（Ｎ−１）枚目に読み取られた画像データのページ番号Ｐ_Ｎ−１が等しいか否かを判断する（Ｓ２０３）。Ｐ_Ｎ＝Ｐ_Ｎ−１でない場合（Ｓ２０３；ＮＯ）、制御部６はページが重複していないと判断し、ＵＩ部４のタッチパネルにページ番号Ｐ_Ｎを表示させる（Ｓ２０４）。 If the previous count value N is greater than 1, that is, when the image data of more than two pages are generated (S202; YES), a page number P _N of the image data read in the N-th (N -1) It is determined whether or not the page numbers P _{N-1 of} the image data read on the first sheet are equal (S203). If P _N = not _{P N-1 (S203; NO} ), the control unit 6 determines that the page is not duplicated, and displays the page number _{P N} on the touch panel of the UI unit 4 (S204).

一方、Ｐ_Ｎ＝Ｐ_Ｎ−１である場合には（Ｓ２０３；ＹＥＳ）、制御部６はページの重複が生じていると判断し、例えば「ページの重複がありました。」といった内容の警告をＵＩ部４を介してユーザに通知する（Ｓ２０５）。このときＵＩ部４のタッチパネルに表示されるのは、例えば図５のような画面である。ここで、制御部６はページ番号が重複した画像データを削除するか否かをユーザに問い合わせ（Ｓ２０６）、ユーザの選択に応じてその後の処理を異ならせる。 On the other hand, if P _N = P _N−1 (S203; YES), the control unit 6 determines that page duplication has occurred, and gives a warning such as “There was page duplication.” The user is notified via the UI unit 4 (S205). At this time, for example, a screen as shown in FIG. 5 is displayed on the touch panel of the UI unit 4. Here, the control unit 6 inquires of the user whether or not to delete the image data with the duplicate page number (S206), and changes the subsequent processing according to the user's selection.

このとき、ユーザが重複したページを削除する旨の指示を入力したときには（Ｓ２０６；ＹＥＳ）、制御部６は重複していると判断されたページ番号の画像データのいずれか一方を削除する（Ｓ２０７）。一方、ユーザが重複したページを削除しない旨の指示を入力したときには（Ｓ２０６；ＮＯ）、制御部６は重複したページをそのままにしておく。 At this time, when the user inputs an instruction to delete the duplicate page (S206; YES), the control unit 6 deletes one of the image data of the page numbers determined to be duplicated (S207). ). On the other hand, when the user inputs an instruction not to delete the duplicate page (S206; NO), the control unit 6 leaves the duplicate page as it is.

以上の処理が終了したら、制御部６は全ての原稿を読み取ったか否かを判断する（Ｓ２０８）。この処理は、画像読取部１がＡＤＦを備えている場合であれば、ＡＤＦに原稿が載置されているかを判断すればよく、画像読取部１がＡＤＦを備えていない場合であれば、原稿の読み取りが所定の時間以上行われていないかを判断すればよい。読み取るべき原稿が残っている場合には（Ｓ２０８；ＮＯ）、制御部６は上述のステップＳ２０１からの処理を繰り返す。一方、全ての原稿を読み取った場合には（Ｓ２０８；ＹＥＳ）、制御部６は記憶部３に記憶された画像データを読み取った順番で出力する（Ｓ２０９）。 When the above processing is completed, the control unit 6 determines whether all the originals have been read (S208). In this process, if the image reading unit 1 has an ADF, it is only necessary to determine whether the document is placed on the ADF. If the image reading unit 1 does not have an ADF, the document What is necessary is just to judge whether reading is not performed more than predetermined time. If there remains a document to be read (S208; NO), the control unit 6 repeats the processing from step S201 described above. On the other hand, when all the originals have been read (S208; YES), the control unit 6 outputs the image data stored in the storage unit 3 in the order of reading (S209).

画像読取装置１００がこのような処理を行うことで、ユーザはページの重複を瞬時に知り、不要であれば削除することが可能となる。ゆえに、この画像読取装置１００によれば、ユーザに不便を強いることなく、複数ページに渡る文書を適切な順番で出力することが可能となる。 When the image reading apparatus 100 performs such processing, the user can instantly know page duplication and delete it if unnecessary. Therefore, according to the image reading apparatus 100, it is possible to output documents over a plurality of pages in an appropriate order without inconvenience to the user.

なお、画像読取装置１００の実際の処理においては、ページの欠落と重複は同時に考慮すべきものである。そのため、本動作例の処理と上述の動作例１の処理とは、実際には同時にあるいは組み合わされて実行されることが望ましい。上述したように、ページの欠落とページの重複は、ともに「それまで等差で増加（あるいは減少）していたページ番号が等差で増加（あるいは減少）しなかった場合」であるという点で共通であり、ページ番号が直前のページ番号と同一である場合のみがページの重複である。このことを利用すれば、ページの欠落とページの重複は１回の処理で判別することが可能である。 In the actual processing of the image reading apparatus 100, missing pages and overlapping pages should be considered simultaneously. Therefore, it is desirable that the processing of this operation example and the processing of the above operation example 1 are actually executed simultaneously or in combination. As described above, both missing pages and overlapping pages are “when the page number that has been increased (or decreased) by the same difference until then has not increased (or decreased) by the same difference”. Only when they are common and the page number is the same as the previous page number, the page overlaps. By utilizing this fact, it is possible to determine missing pages and overlapping pages in a single process.

［２−３．動作例３］
上述の動作例２においては、ページ番号が同一の画像データが存在する場合に、これを同一ページの原稿であるとみなし、一方を削除する処理を行った。しかしながら、ページ番号が同一の画像データは必ずしも同一ページの原稿であるとは限らず、例えば、ページ番号自体に誤りがあり、本来は同一のページでないのに同じページ番号が付与されてしまっている場合も考えられる。このような場合に、ページ番号が同一の画像データのいずれか一方が削除されてしまうと、ユーザが必要とする情報が失われてしまう可能性もある。本動作例は、このような場合の不都合を解消することを目的としたものである。 [2-3. Operation example 3]
In the above-described operation example 2, when image data with the same page number exists, this is regarded as a document of the same page, and the process of deleting one is performed. However, image data with the same page number is not necessarily an original with the same page. For example, there is an error in the page number itself, and the same page number is assigned even though it is not originally the same page. Cases are also conceivable. In such a case, if any one of the image data with the same page number is deleted, information required by the user may be lost. This operation example is intended to eliminate the disadvantages in such a case.

図６は、本動作例において画像読取装置１００が行う処理を示したフローチャートであり、以下、この図に沿って説明するが、本動作例の一部の処理は上述の動作例２と同様である。そこで、同図においては、動作例２と同様の処理を行うステップには図４と同様の符号を付し、これらのステップの説明を適宜省略する。 FIG. 6 is a flowchart showing processing performed by the image reading apparatus 100 in the present operation example. Hereinafter, a description will be given with reference to FIG. is there. Therefore, in the same figure, steps similar to those of the operation example 2 are denoted by the same reference numerals as those in FIG. 4, and description of these steps will be omitted as appropriate.

ステップＳ２０３において、ページが重複していると判断された場合には（Ｓ２０３；ＹＥＳ）、制御部６はページ番号が同一である画像データどうしの類似度を判断する処理を行う（Ｓ２１１）。具体的な処理は特に限定されないが、例えば、制御部６は２つの画像データを画素単位で比較していき、各画素の色差の程度によって類似度を判断してもよいし、あるいは、画像データに含まれる文字や図形等の形状を認識し、２つの画像データでその類似度を判断してもよい。 If it is determined in step S203 that the pages overlap (S203; YES), the control unit 6 performs a process of determining the similarity between the image data having the same page number (S211). Although specific processing is not particularly limited, for example, the control unit 6 may compare two pieces of image data in units of pixels, and may determine the degree of similarity based on the degree of color difference of each pixel, or the image data May be recognized, and the degree of similarity may be determined using two image data.

この判断結果に基づき、制御部６は画像データが同一であるか否か、換言すれば画像データが同一の原稿から生成されたか否かを判断する（Ｓ２１２）。ここで、２つの画像データが同一であると判断された場合には（Ｓ２１２；ＹＥＳ）、制御部６は同一であると判断された画像データのいずれか一方を削除する（Ｓ２０７）。２つの画像データは同一であるから、削除する画像データはどちらでもよい。 Based on the determination result, the control unit 6 determines whether or not the image data is the same, in other words, whether or not the image data is generated from the same document (S212). If it is determined that the two image data are the same (S212; YES), the control unit 6 deletes one of the image data determined to be the same (S207). Since the two image data are the same, the image data to be deleted may be either.

一方、２つの画像データが同一でないと判断された場合には（Ｓ２１２；ＮＯ）、制御部６は、例えば「ページ番号が同一の画像データがあります。」といった内容の警告をＵＩ部４を介してユーザに通知する（Ｓ２１３）。このときＵＩ部４のタッチパネルに表示されるのは、例えば図７のような画面である。制御部６はページ番号が同一である２つの画像データを表示させ、いずれかの画像データを削除するか、あるいはいずれも削除しないかをユーザに問い合わせる（Ｓ２１４）。 On the other hand, if it is determined that the two image data are not the same (S212; NO), the control unit 6 gives a warning with a content such as “There is image data with the same page number” via the UI unit 4, for example. To the user (S213). At this time, for example, a screen as shown in FIG. 7 is displayed on the touch panel of the UI unit 4. The control unit 6 displays two pieces of image data having the same page number, and inquires of the user whether or not to delete any one of the image data (S214).

このとき、ユーザがいずれか一方の画像データを削除する旨の指示を入力したときには（Ｓ２１４；ＹＥＳ）、制御部６は指示された画像データを削除する（Ｓ２１５）。一方、ユーザが削除しない旨の指示を入力したときには（Ｓ２１４；ＮＯ）、制御部６はページ番号が同一の画像データを削除せずにそのままにしておく。 At this time, when the user inputs an instruction to delete one of the image data (S214; YES), the control unit 6 deletes the instructed image data (S215). On the other hand, when the user inputs an instruction not to delete (S214; NO), the control unit 6 does not delete the image data having the same page number, but leaves it as it is.

このような処理を行うことで、画像読取装置１００は、原稿そのものが同一である画像データについては削除し、ページ番号が同一でありながらもその内容が異なる画像データについては、削除の可否の判断をユーザに委ねることができる。ゆえに、この画像読取装置１００によれば、ユーザに不便を強いることなく、複数ページに渡る文書を出力することが可能となる。 By performing such processing, the image reading apparatus 100 deletes image data with the same original document itself, and determines whether or not to delete image data with the same page number but different contents. Can be left to the user. Therefore, according to the image reading apparatus 100, it is possible to output a document over a plurality of pages without inconvenience to the user.

［２−４．動作例４］
上述した動作例１から３は、いずれも、識別情報がページ番号である場合の例であった。しかし、本実施形態の画像読取装置１００は、章番号や章タイトルについても、ページ番号と同様の要領で抽出することが可能である。そこで以下では、章番号や章タイトルを識別情報とした場合の動作について説明する。 [2-4. Operation example 4]
The above-described operation examples 1 to 3 are all examples in which the identification information is a page number. However, the image reading apparatus 100 of the present embodiment can also extract chapter numbers and chapter titles in the same manner as page numbers. Therefore, the operation when the chapter number or chapter title is used as identification information will be described below.

ここでまず、章番号や章タイトルについて図８および図９を参照しながら説明する。章番号とは、例えば図８（ａ）において「４．」と表示されている文字列のことであり、この文字列は「第４章」であることを意味している。同様に、図８（ｂ）において「５．」と表示されている文字列は「第５章」であることを意味している。また、章タイトルとは、例えば図８（ａ）において「基礎編」と表示されている文字列や、図８（ｂ）において「応用編」と表示されている文字列のことである。情報抽出部２はこのような文字列を識別情報として認識し、抽出する。 First, chapter numbers and chapter titles will be described with reference to FIGS. The chapter number is, for example, a character string displayed as “4.” in FIG. 8A, and this character string means “Chapter 4”. Similarly, the character string displayed as “5.” in FIG. 8B means “Chapter 5”. The chapter title is, for example, a character string displayed as “Basic” in FIG. 8A or a character string displayed as “Application” in FIG. 8B. The information extraction unit 2 recognizes and extracts such a character string as identification information.

なお、文書によっては、例えば図９に示されている「４．１．」、「４．２．」といった文字列のように、同一の章をさらに細かく区分しているものもある。本実施形態の情報抽出部２は、これらの文字列を区別して認識するものとする。すなわち、図９（ａ）の「４．１．」であれば「第４章第１節」、図９（ｂ）の「４．２．」であれば「第４章第２節」というように、それぞれを異なるものとして識別する。そこで以下では、上述した章や節を表す識別情報を「区分情報」と総称する。すなわち区分情報とは、原稿である文書の区分（区切り）を表す情報のことである。 Depending on the document, for example, the same chapter may be further subdivided, such as character strings such as “4.1.” And “4.2.” Shown in FIG. The information extraction unit 2 of the present embodiment shall recognize and recognize these character strings. That is, “4.1.” In FIG. 9A is “Chapter 4 Section 1”, and “4.2.” In FIG. 9B is “Chapter 4 Section 2”. As such, each is identified as different. Therefore, hereinafter, the identification information representing the chapters and sections described above is collectively referred to as “section information”. That is, the division information is information indicating the division (separation) of a document that is a manuscript.

なお、上述の図８および図９においては、章番号や章タイトルがヘッダ領域に表示されている態様を示したが、区分情報の表示されている位置はこのような位置に限定されるわけではない。区分情報はその他の位置にあってもよいし、ページ毎に位置が異なっていてもよい。 8 and 9 described above show a mode in which chapter numbers and chapter titles are displayed in the header area. However, the position where the division information is displayed is not limited to such a position. Absent. The classification information may be at other positions, or the position may be different for each page.

図１０は、本動作例において画像読取装置１００が行う処理を示したフローチャートであり、以下、この図に沿って説明する。はじめに、画像読取装置１００の制御部６は、上述の第１動作例と同様の画像データ記憶処理を行う（Ｓ３０１）。画像データを記憶したら、制御部６は直前のカウント値Ｎが１より大きいか否かを判断する（Ｓ３０２）。直前のカウント値Ｎが１であれば（Ｓ３０２；ＮＯ）、制御部６はＵＩ部４のタッチパネルにその区分情報を表示させ（Ｓ３０６）、画像データを出力する（Ｓ３０７）。その後、制御部６は画像データ記憶処理を再度行う。 FIG. 10 is a flowchart showing processing performed by the image reading apparatus 100 in this operation example, and will be described with reference to this figure. First, the control unit 6 of the image reading apparatus 100 performs the same image data storage process as in the first operation example (S301). After storing the image data, the control unit 6 determines whether or not the immediately preceding count value N is greater than 1 (S302). If the immediately preceding count value N is 1 (S302; NO), the control unit 6 displays the classification information on the touch panel of the UI unit 4 (S306), and outputs image data (S307). Thereafter, the control unit 6 performs the image data storage process again.

直前のカウント値Ｎが１より大きい場合、すなわち２ページ分以上の画像データが生成されている場合には（Ｓ３０２；ＹＥＳ）、Ｎ枚目に読み取られた画像データの区分情報と（Ｎ−１）枚目に読み取られた画像データの区分情報が同一であるか否かを判断する（Ｓ３０３）。互いの区分情報が同一であると判断された場合（Ｓ３０３；ＹＥＳ）、制御部６は文書の区切りに変化がないと判断し、ＵＩ部４のタッチパネルにその区分情報を表示させる（Ｓ３０６）。その後、制御部６は直前に生成された画像データを出力し（Ｓ３０７）、画像データ記憶処理を再度行う。 When the immediately preceding count value N is larger than 1, that is, when image data for two pages or more is generated (S302; YES), the classification information of the image data read for the Nth sheet and (N−1) ) It is determined whether or not the classification information of the image data read on the sheet is the same (S303). If it is determined that the category information is the same (S303; YES), the control unit 6 determines that there is no change in the document segment, and displays the category information on the touch panel of the UI unit 4 (S306). Thereafter, the control unit 6 outputs the image data generated immediately before (S307), and performs the image data storage process again.

一方、互いの区分情報が同一でないと判断された場合には（Ｓ３０３；ＮＯ）、制御部６は文書の区切りに変化が生じたと判断し、例えば「第４章が終了しました。」といった内容の警告をＵＩ部４を介してユーザに通知する（Ｓ３０４）。このときＵＩ部４のタッチパネルに表示されるのは、例えば図１１のような画面である。ここで、制御部６はその後も原稿の読み取りを続けるか否かをユーザに問い合わせる（Ｓ３０５）。 On the other hand, if it is determined that the category information is not the same (S303; NO), the control unit 6 determines that a change has occurred in the document separation, for example, “Chapter 4 has ended”. Is notified to the user via the UI unit 4 (S304). At this time, for example, a screen as shown in FIG. 11 is displayed on the touch panel of the UI unit 4. Here, the control unit 6 inquires of the user whether or not to continue reading the document thereafter (S305).

ユーザが原稿の読み取りを続ける旨の指示を入力した場合には（Ｓ３０５；ＮＯ）、制御部６は直前に生成された画像データを出力し（Ｓ３０７）、画像データ記憶処理を再度行う。一方、ユーザが原稿の読み取りを終了する旨の指示を入力した場合には（Ｓ３０５；ＹＥＳ）、制御部６は画像の読み取りを終了させる。例えば、画像読取部１がＡＤＦを備える構成である場合、このときＡＤＦには読み取り前の原稿が残っていることもあるが、このような場合であっても制御部６は画像の読み取りを終了させる。 When the user inputs an instruction to continue reading the document (S305; NO), the control unit 6 outputs the image data generated immediately before (S307) and performs the image data storage process again. On the other hand, when the user inputs an instruction to end the reading of the document (S305; YES), the control unit 6 ends the reading of the image. For example, when the image reading unit 1 is configured to include an ADF, the original document before reading may remain in the ADF at this time. Even in such a case, the control unit 6 finishes reading the image. Let

このような処理を行うことで、画像読取装置１００は、章番号や章タイトル等の区分情報に基づいて文書の区切りを検出し、これをユーザに通知して判断させることが可能となる。ゆえに、この画像読取装置１００によれば、ページ番号が表示されていない文書であっても、不要な部分を読み取ることなく、必要な部分だけを出力することができる。 By performing such processing, the image reading apparatus 100 can detect a document break based on section information such as a chapter number and a chapter title, and notify the user of this to make a determination. Therefore, according to the image reading apparatus 100, even a document in which no page number is displayed can output only a necessary portion without reading an unnecessary portion.

［３．変形例］
以上においては、一の好適な実施形態を挙げて本発明を説明したが、本発明は上述した形態に限定されるものではなく、種々の変形が可能である。以下にその変形の例を説明する。 [3. Modified example]
In the above, the present invention has been described with reference to one preferred embodiment. However, the present invention is not limited to the above-described embodiment, and various modifications can be made. Examples of such modifications will be described below.

上述の実施形態においては、情報抽出部２はＯＣＲ処理を施して識別情報を抽出すると説明したが、２つの識別情報が同一であるか否かの判定においては、必ずしもＯＣＲ処理である必要はない。例えば、識別情報の位置に相当する画像の差違を検出し、その差違が所定のレベル以内であればこれらの識別情報が同一であると判断してもよい。 In the above-described embodiment, it has been described that the information extraction unit 2 performs the OCR process to extract the identification information. However, it is not always necessary to perform the OCR process in determining whether the two pieces of identification information are the same. . For example, a difference in images corresponding to the position of the identification information may be detected, and if the difference is within a predetermined level, it may be determined that the identification information is the same.

また、上述の実施形態においては、画像読取部１がＡＤＦを備える場合についても説明しているが、もちろん、画像読取部１がＤＡＤＦ（Duplex Auto Document Feeder）を備える構成であってもよい。このような態様であれば、両面コピーされた複数枚の文書の読み取りも行うことが可能となる。 In the above-described embodiment, the case where the image reading unit 1 includes the ADF has been described. Of course, the image reading unit 1 may include a DADF (Duplex Auto Document Feeder). With such an aspect, it is possible to read a plurality of documents copied on both sides.

また、上述の動作例１においては、ページの欠落があった場合に、欠落しているページの読み取りを待ってからソートして出力すると説明したが、ページの欠落を検出した場合には、直前に読み取られた画像データを削除し、動作を一旦停止させてもよい。このような態様であっても、欠落箇所を特定することは可能である。また、欠落しているページがない場合や、欠落しているページより前に複数ページの画像データが存在する場合には、画像データを順次出力していっても問題はない。 Further, in the above-described operation example 1, when there is a missing page, it has been described that the sorted page is output after waiting for the missing page to be read. However, if a missing page is detected, It is also possible to delete the image data read in step 1 and temporarily stop the operation. Even in such an aspect, it is possible to specify the missing part. Further, when there is no missing page or when there are a plurality of pages of image data before the missing page, there is no problem even if the image data is sequentially output.

本発明の一実施形態に係る画像読取装置を示したブロック図である。1 is a block diagram illustrating an image reading apparatus according to an embodiment of the present invention. 画像読取装置が行う画像データ記憶処理を示したフローチャートである。5 is a flowchart illustrating image data storage processing performed by the image reading apparatus. 画像読取装置が行う処理を示したフローチャートである（動作例１）。6 is a flowchart illustrating processing performed by the image reading apparatus (operation example 1). 画像読取装置が行う処理を示したフローチャートである（動作例２）。6 is a flowchart illustrating processing performed by the image reading apparatus (operation example 2). 画像読取装置のＵＩ部に表示される画面を例示した図である。It is the figure which illustrated the screen displayed on UI section of an image reading device. 画像読取装置が行う処理を示したフローチャートである（動作例３）。6 is a flowchart illustrating processing performed by the image reading apparatus (Operation Example 3). 画像読取装置のＵＩ部に表示される画面を例示した図である。It is the figure which illustrated the screen displayed on UI section of an image reading device. 原稿の区分情報を説明するための図である。It is a figure for demonstrating the division information of a document. 原稿の区分情報を説明するための図である。It is a figure for demonstrating the division information of a document. 画像読取装置が行う処理を示したフローチャートである（動作例４）。6 is a flowchart illustrating processing performed by the image reading apparatus (Operation Example 4). 画像読取装置のＵＩ部に表示される画面を例示した図である。It is the figure which illustrated the screen displayed on UI section of an image reading device.

Explanation of symbols

１００…画像読取装置、１…画像読取部、２…情報抽出部、３…記憶部、４…ＵＩ部、５…画像出力部、６…制御部 DESCRIPTION OF SYMBOLS 100 ... Image reading device, 1 ... Image reading part, 2 ... Information extraction part, 3 ... Memory | storage part, 4 ... UI part, 5 ... Image output part, 6 ... Control part

Claims

Reading means for reading a plurality of originals with page numbers displayed one by one and sequentially generating image data representing each original;
Extracting means for extracting a page number from the image data generated by the reading means;
When the extraction means extracts the first page number from the first image data and immediately after that extracts the second page number from the second image data, the first page number and the second page number are extracted. If the difference from the page number is not 1, deletion means for deleting the second image data;
An image reading apparatus comprising: output means for outputting image data generated by the reading means and not deleted by the deleting means.

Reading means for reading a plurality of originals with page numbers displayed one by one and sequentially generating image data representing each original;
Extracting means for extracting a page number from the image data generated by the reading means;
When the extraction means extracts the first page number from the first image data and immediately after that extracts the second page number from the second image data, the first page number and the second page number are extracted. And the difference is not 1, the value is larger than the first page number and smaller than the second page number, or larger than the second page number. Second storage means for storing a value smaller than the page number of
Determination means for determining whether or not the page number of the image data extracted by the extraction means matches the value stored in the second storage means;
When the determination result of the determination means is affirmative, the image data and page number of each page are read from the first storage means, and the image data is output in the order in which the page number increases or decreases An image reading apparatus comprising: means.

Reading means for reading a plurality of originals with page numbers displayed one by one and sequentially generating image data representing each original;
Extracting means for extracting a page number from the image data generated by the reading means;
When the extraction result by the extraction unit is such that N is an integer greater than or equal to 3, d is a positive or negative integer, and P is an integer greater than 2d, the (N-2) th page number is (P- 2d), a deletion means for deleting the Nth image data when the (N-1) th page number is (P-d) and the Nth page number is not P;
An image reading apparatus comprising: output means for outputting image data generated by the reading means and not deleted by the deleting means.

Reading means for reading a plurality of originals with page numbers displayed one by one and sequentially generating image data representing each original;
Extracting means for extracting a page number from the image data generated by the reading means;
First storage means for associating and storing the image data generated by the reading means and the page number of the image data extracted by the extraction means;
When the extraction result by the extraction unit is such that N is an integer greater than or equal to 3, d is a positive or negative integer, and P is an integer greater than 2d, the (N-2) th page number is (P- 2d) When the page number of the (N-1) th sheet is (Pd) and the page number of the Nth sheet is not P, the page number is larger than (Pd) and is larger than the Nth page number. Second storage means for storing a smaller value or a value larger than the page number of the Nth sheet and smaller than (Pd);
Determination means for determining whether or not the page number of the image data extracted by the extraction means matches the value stored in the second storage means;
When the determination result of the determination means is affirmative, the image data and page number of each page are read from the first storage means, and the image data is output in the order in which the page number increases or decreases An image reading apparatus comprising: means.

Reading means for reading a plurality of originals representing a document and displaying originals each of which is displayed with classification information that is a character or a number indicating a document separation, and sequentially generating image data representing each original; ,
Extraction means for extracting the classification information from the image data generated by the reading means;
When N is an integer greater than or equal to 2, if the category information extracted on the Nth page by the extraction means is different from the category information extracted on the (N-1) th page, the change of the document delimiter is changed. A notification means for notifying to,
An image reading apparatus comprising: output means for outputting image data from which the classification information has been extracted by the extraction means.

When the division information extracted on the Nth sheet by the extraction unit is different from the division information extracted on the (N-1) th sheet, the deletion means deletes the Nth image data. The image reading apparatus according to claim 5.

The image reading apparatus according to claim 1, further comprising a stopping unit that stops reading by the reading unit when the deleting unit deletes image data.

The image reading apparatus according to claim 7, further comprising an input unit that allows a user to select whether the stop unit can be stopped.