JP2004178481A

JP2004178481A - File reading device, file reading method, file reading program and storage medium

Info

Publication number: JP2004178481A
Application number: JP2002346835A
Authority: JP
Inventors: Hironori Goto; 裕典後藤
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2002-11-29
Filing date: 2002-11-29
Publication date: 2004-06-24

Abstract

<P>PROBLEM TO BE SOLVED: To easily confirm the contents of a plurality of document files from a file reading program by materializing a file reading device by which an important word showing the contents is extracted from the contents of the document files for display, and by materializing a method, a program, and a recording medium. <P>SOLUTION: The file reading device has: a video watching means; an input means; an arithmetic means; a storage means; a means for extracting the word from the contents of the document files; and a means for displaying the word extracted by means of the means for extracting the word from the contents of the document files. The method, the program and the storage medium are constituted of these means respectively. In addition, the file reading device is provided with a means for assuring the word extracted from the contents of the plurality of document files. <P>COPYRIGHT: (C)2004,JPO

Description

【０００１】
【発明の属する技術分野】
本発明は、ファイル閲覧装置、ファイル閲覧方法、ファイル閲覧プログラムおよび記憶媒体に関するものである。
【０００２】
【従来の技術】
従来、ファイル閲覧アプリケーションは、Ｍｉｃｒｏｓｏｆｔ社製のエクスプローラなど存在している。それらが表示するファイルに関する情報は、ファイル名、ファイルのパス（ファイルシステム内の格納されている位置）、作成日、更新日、アクセス日等である。
【０００３】
また、例えば、特開平１０−２４７１５６号公報で提案されているように、イメージファイルのイメージを縮小した形で表示したり、または文書ファイルの文書全体をイメージとして捉えそれを縮小した形で表示したりする方法があった。
【０００４】
【発明が解決しようとする課題】
しかしながら、上記従来例では、文書ファイルのファイル名が的確に内容を表わす名前である場合はファイル名から内容を確認することが容易であるが、しかし、ファイル名が的確に内容を表わさない場合、例えば、ｔｅｘｔ１．ｔｘｔ、ｄｏｃｕｍｅｎｔ１．ｔｘｔなど内容に関係なくファイル名が付けられていたり、あるいはＳ−Ｐ−１．ｔｘｔのようなファイルを作成した人にしかわからない省略形でファイル名を付けられたりする場合などはファイル名から内容を確認することは難しかった。
【０００５】
また、特開平１０−２４７１５６号公報で提案されているような手段では、文書ファイルの全体をイメージとして捉えそれを縮小した形で表示するため、その表示は小さく、書かれている文字まで判読することが難しく、その文書ファイルに書かれている内容を確認することは難しかった。
【０００６】
そこで、本発明は、文書ファイルの内容からその内容を的確に表す複数の単語を表示することができるファイル閲覧装置、ファイル閲覧方法、ファイル閲覧プログラムおよび記憶媒体を提供することを目的とする。
【０００７】
【課題を解決するための手段】
上記の目的を達成するために、本発明の請求項１に記載のファイル閲覧装置は、映像観察手段、入力手段、演算手段、該演算手段の使用状況を測定する測定手段、記憶手段、文書ファイルの内容から単語を抽出する抽出手段、抽出手段で抽出された単語と文書ファイルの最終更新日時と単語を抽出した日時とを保存する保存手段、前記抽出手段で抽出した単語を表示する表示手段を有するファイル閲覧装置において、入力手段によって複数の文書ファイルが選択されたときに、表示手段にて、各々の文書ファイルの内容から抽出された単語を同時に表示することを特徴とする。
【０００８】
上記の目的を達成するために、本発明の請求項２に記載のファイル閲覧装置は、前記表示手段で表示された複数の文書ファイルの内容から抽出された単語を、ファイル名順に並び替えるファイル名順並び替え手段を有することを特徴とする。
【０００９】
上記の目的を達成するために、本発明の請求項３に記載のファイル閲覧装置は、前記表示手段で表示された複数の文書ファイルの内容から抽出された単語を、単語抽出日時順に並び替える単語抽出日時順並び替え手段を有することを特徴とする。
【００１０】
上記の目的を達成するために、本発明の請求項４に記載のファイル閲覧装置は、前記表示手段で複数の文書ファイルの内容から抽出された単語を表示する際に、通常の並び順をファイル名順にするか単語抽出日時順にするか設定する設定手段を有することを特徴とする。
【００１１】
【発明の実施の形態】
以下、本発明の実施例を図面に基づいて詳細に説明する。
【００１２】
（実施例）
本発明は、ファイル閲覧アプリケーションの一つの機能であり、本説明はファイル閲覧アプリケーションの本発明に関わる、文書ファイルの内容からその内容を的確に表す複数のキーワード（単語）を抽出し表示する機能について説明する。
【００１３】
図１に本実施例のブロック図を示す。１０１はＣＰＵで本実施例全体の制御を行なう。１０２はハードディスクコントローラ（ＨＤＣ）で１０３のハードディスク（ＨＤ）内のデータ・プログラムの制御を行なう。ハードディスク内には、本発明に関わる機能を持ったファイル閲覧プログラム１１３、文書ファイルＡ１１４、文書ファイルＢ１１５、文書ファイルＣ１１６が格納されている。
【００１４】
１０４はキーボード、１０５は例えばマウスやディジタイザなどのポインティングデバイス（ＰＤ）でプログラム開始などの指示を出す。１０６はＲＡＭでプログラムやデータを格納する。１０９は表示コントローラでＶＲＡＭ１０８に格納された映像データを映像信号としてモニター１１０に出力する制御を行なう。
【００１５】
図２は、本発明に関わる機能を持ったファイル閲覧アプリケーションのブロック図である。ただし、本発明に直接関わらない部分は簡略化している。
【００１６】
図２において、メイン処理部２０１は、ファイル閲覧アプリケーションの全体の制御を行なう部分である。ファイルマネージャー処理部２０２は、ＨＤ（１０３）に階層構造に格納されたファイルのファイル名を表示するための処理や階層間の移動などの処理を行なっている。
【００１７】
ＧＵＩ（ＧｒａｐｈｉｃａｌＵｓｅｒＩｎｔｅｒｆａｃｅ）処理部２０３は、ファイルマネージャー処理部２０２で得られたある階層内に存在するファイルの情報等を表示し、また、利用者からのキーボード１０４、あるいはポインティングデバイス１０５などの操作により、ファイルやディレクトリが選択された時にその情報を、メイン処理部２０１に送る。
【００１８】
キーワード抽出処理部２０４では、ファイルが選択された情報をＧＵＩ処理部２０３からメイン処理部２０１を経由して渡されたときに、選択されたファイルが文書ファイルであるか判断し、文書ファイルである場合は、文書読込部２０５で文書を読み込み、キーワード抽出部２０６で文書ファイルの内容を的確に表す複数のキーワードを抽出する。その後キーワード抽出処理部２０４は抽出されたキーワードをキーワードデータベース２０８に格納し、それを管理する。
【００１９】
また、キーワード抽出処理部２０４は抽出されたキーワードの情報をメイン処理部２０１に返す。
【００２０】
さらに、キーワード抽出処理部２０４は、ＣＰＵ監視部２０７にＣＰＵの使用状況を監視させ、同一コンピュータ上で動作している別のアプリケーションに対するＣＰＵの負荷が増大したときにはキーワード抽出部２０６にキーワード抽出処理を中止させ、中止したキーワード抽出処理の情報をバックグラウンド処理登録データベース２０９に格納する。
【００２１】
その後、同一コンピュータ上で動作している別のアプリケーションに対するＣＰＵの負荷が軽減されたときは、ＣＰＵ監視部２０７はキーワード抽出処理部２０４に対しその事を通知する。
【００２２】
その通知を受けたキーワード抽出処理部２０４は、バックグラウンド処理登録データベース２０９に格納してある中止していたキーワード抽出処理の情報を読み出し、その情報をキーワード抽出部２０６に送り、キーワード抽出処理を再開させる。
【００２３】
図３は、本発明の機能を持ったファイル閲覧アプリケーションのＧＵＩ画面である。
【００２４】
３０１には、現在このファイル閲覧アプリケーションがファイルシステム内のどのディレクトリを表示しているかを、パスで示している。３０３は、現在表示しているディレクトリに存在するファイルやディレクトリを示している。図中では、今ｔｅｘｔ１．ｔｘｔ、ｔｅｘｔ２．ｔｘｔが選択されており、その選択されていることを示すために、そのファイル名とアイコン（絵文字）を反転表示で示している。キーワード情報表示部３０２は本発明の機能を使用して、その選択された文書ファイル（ｔｅｘｔ１．ｔｘｔ、ｔｅｘｔ２．ｔｘｔ）のファイル名、その文書ファイルの内容から抽出されたキーワード、およびキーワード抽出日時を表示している。
【００２５】
図４には、図３の３０３で表示しているファイルが、利用者からのキーボード１０４、あるいはマウスなどのポインティングデバイス１０５などの操作によって複数選択されたときに、本発明の機能がキーワード抽出を行い、抽出されたキーワードを表示する流れを示した図であり、以下にその処理を詳細に説明する。
【００２６】
Ｓ４０１にて、選択された全てのファイル名と選択されたファイルの総数などの情報を取得する。また、ここで、ＲＡＭ１０６にある処理回数カウンターを０にリセットする。Ｓ４０２では、キーワード情報表示部３０２に現在表示されている情報を消去する。
【００２７】
以降、Ｓ４０３からＳ４１４までの処理の流れは、各ファイルごとに実施される。Ｓ４０３では、選択されたファイルが、文書ファイルかどうかを判断する。判断方法は、ファイル名の拡張子で、文書ファイルの拡張子（例えば、ｔｘｔ、ｒｔｆ、ｄｏｃ、ｈｔｍｌ）かどうかで判断する。
【００２８】
Ｓ４０４では、キーワード抽出処理部２０４からキーワードデータベース２０８にアクセスし、対象となる文書ファイルのキーワードが登録されているかを確認する。確認した結果、キーワードが登録されていなければ、Ｓ４０５でキーワード抽出処理が未完であるメッセージを図３の３０２に表示する。またＳ４０４で確認した結果、すでにキーワードが登録されている場合、Ｓ４１２に行きそのキーワードを図３の３０２に表示する。さらにＳ４１３で文書ファイルの最終更新日時をキーワードデータベースに登録されている最終更新日時情報と比較することでキーワードが抽出・登録された後に文書ファイルが更新されているかを確認する。更新されていなければＳ４１４に進む。Ｓ４０４で確認した結果キーワードが登録されていない、もしくはＳ４１３で文書ファイルが更新されていると判断された場合は、Ｓ４０６に進む。
【００２９】
Ｓ４０６では、ＲＡＭ１０６にあるバックグラウンド処理終了フラグがＨＩＧＨになっているかチェックする。もし、ＨＩＧＨになっていれば、後述のバックグランド処理の中止命令がすでになされていることなので、Ｓ４０８に進む。ＬＯＷのままの場合、Ｓ４０７にて、バックグラウンド終了フラグをＨＩＧＨにすることで、バックグラウンド処理の中止命令を出す。
【００３０】
バッググラウンド処理を中止させるのは、利用者からのキーボード１０４、あるいはマウス１０５などの操作により、ファイルが選択された時の本説明の処理が優先的に行われなければならないためである。
【００３１】
Ｓ４０８にてキーワードを抽出する。キーワードの抽出方法について図５を基に以下に説明する。
【００３２】
Ｓ５０１でＣＰＵ監視部２０７で取得したＣＰＵ使用率データを確認し、ＣＰＵ使用率が図６のＣＰＵ使用率上限値設定画面であらかじめ設定された値を超えていなければ、Ｓ５０２に進む。もし、超えていれば、文書ファイルのパスをバックグラウンド処理登録データベースに登録し、キーワード抽出処理を終了する。またこのキーワード抽出処理がバックグラウンド処理のＳ７０３であった場合、さらにここで、ＲＡＭ１０６にあるバックグラウンド処理終了フラグを確認し、バックグラウンド処理終了フラグがＨＩＧＨになっている場合、文書ファイルのパスをバックグラウンド処理登録データベースに登録し、キーワード抽出処理を終了する。
【００３３】
Ｓ５０２では文書読込部２０５にて文書ファイルを読込み、ＲＡＭ１０６に格納する。Ｓ５０３ではＳ５０１と同様にＣＰＵ使用率とバックグラウンド処理終了フラグを確認する。
【００３４】
Ｓ５０４では格納した文書ファイルのデータから文書のみを抽出する。これは例えば、文書ファイルがＨＴＭＬファイルであった場合は、その中のＨＴＭＬタグを取り去り、文書だけにする事を意味している。Ｓ５０５ではＳ５０１と同様にＣＰＵ使用率とバックグラウンド処理終了フラグを確認する。
【００３５】
Ｓ５０６では、Ｓ５０４で抽出された文書から、図８のキーワード数設定画面であらかじめ設定された数のキーワードを、キーワード抽出部２０６にて抽出する。キーワードの抽出方法は公知であるので、ここでは詳細な説明は行なわない。Ｓ５０７ではＳ５０１と同様にＣＰＵ使用率とバックグラウンド処理終了フラグを確認する。
【００３６】
Ｓ５０８ではバックグラウンド処理登録データベース２０９に本文書ファイルが登録されていないか確認し、すでに登録されていればそれを抹消する。
【００３７】
Ｓ５０９で抽出されたキーワードを文書ファイルのパス、最終更新日時データ、キーワード抽出日時データと共にキーワードデータベース２０８に登録する。
【００３８】
図４のＳ４０９では、キーワードデータベースにアクセスし、Ｓ４０５にてキーワード抽出が完了したかを登録されている更新日時データと文書ファイルの更新日時情報を比較し一致しているかで確認する。ただし、Ｓ５０１、Ｓ５０３、Ｓ５０５、Ｓ５０７でＣＰＵ利用率が設定値を超えた場合にＳ５１０にてキーワード抽出処理がバックグラウンド処理に移され、抽出が完了してない場合がある。よって完了している、つまり一致している場合はＳ４１０に進み、完了していない、つまり一致していない場合Ｓ４１４に進む。
【００３９】
Ｓ４１０では、Ｓ４０８で抽出されたキーワードを図３の３０２に表示する。Ｓ４１４ではＲＡＭ１０６にある処理回数カウンターを１カウントアップし、その結果Ｓ４０１で取得した選択ファイルの総数に一致するか確認する。一致していれば、すべてのファイルへの処理が終了したことを示すので、Ｓ４１５に進む。まだすべてのファイルへの処理が終了してない場合、Ｓ４０３に戻り、次のファイルへの処理を行う。
【００４０】
最後にＳ４１５でバックグランド処理終了フラグをＬＯＷにしたのち、バックグランド処理を再開させる。
【００４１】
図７は、バックグラウンド処理の流れ図である。
【００４２】
まず、Ｓ７０１ではバックグラウンド処理登録データベース２０９にアクセスし、バックグラウンド処理待ち状態のデータがあるか確認する。もしなければ、Ｓ７０４に進む。
【００４３】
Ｓ７０１でバックグラウンド処理待ちデータがバックグラウンド処理登録データベース２０９に存在するならば、Ｓ７０２に進み、バックグラウンド処理待ちデータをバックグラウンド処理登録データベース２０９から読み込む。その時、読み込んだデータの登録はバックグラウンド処理登録データベース２０９から抹消しておく。
【００４４】
次にＳ７０３でキーワードを抽出する。キーワード抽出処理の説明は、上記において図５を用いた説明に準じるので、ここでは省略する。
【００４５】
最後に、Ｓ７０４にてＲＡＭ１０６にあるバックグラウンド処理終了フラグを確認し、バックグラウンド処理終了フラグがＨＩＧＨになっている場合、処理を終了する。また、ＬＯＷである場合、Ｓ７０１に戻りバックグラウンド処理を繰り返す。
【００４６】
バックグラウンド処理の開始手段は、本発明の係わるファイル閲覧プログラムの起動時であってもかまわない。また、バックグラウンド処理の開始手段は、図３に示す本発明の係わるファイル閲覧プログラムのＧＵＩ画面からキーボード１０４もしくはポインターデバイス１０５による入力、メニュー選択、ボタン押下、等であってもかまわない。
【００４７】
上記説明ではＳ５０１、Ｓ５０３、Ｓ５０５、Ｓ５０７でＣＰＵ監視部２０７が取得したＣＰＵ使用率データをキーワード抽出処理部２０４が確認して処理の終了を判断したが、その判断をＣＰＵ監視部２０７に判断させ、ＣＰＵ使用率が設定値を超えたときにＣＰＵ監視部２０７からキーワード抽出処理部２０４に割り込みをかける形で処理し、割り込みがかかった時点でＳ５１０を実施し、キーワード抽出処理を終了してもかまわない。
【００４８】
図３の３０４、３０５は、それぞれ「名前」ボタン、「キーワード抽出日時」ボタンである。「名前」ボタン３０４を一度押下することで、キーワード情報表示部３０２の表示がファイル名の昇順で表示される。さらに「名前」ボタン３０４を一度押下すると、キーワード情報表示部３０２の表示はファイル名の降順で表示される。「名前」ボタン３０４を押下することで、キーワード情報表示部３０２の表示はファイル名の昇順／降順のトグル式で切り替わる。
【００４９】
これにより、例えば作成した文書ファイルのヴァージョンをファイル名で管理する場合（例えば、「特許１．ｔｘｔ」「特許２．ｔｘｔ」「特許３．ｔｘｔ」．．．）にヴァージョン順の内容の移り変わりを簡単に確認することができる。
【００５０】
「キーワード抽出日時」ボタン３０５を一度押下することで、キーワード情報表示部３０２の表示がキーワード抽出日時の昇順で表示される。さらに「キーワード抽出日時」ボタンを一度押下すると、キーワード情報表示部３０２の表示はキーワード抽出日時の降順で表示される。「キーワード抽出日時」ボタンを押下することで、キーワード情報表示部３０２の表示はキーワード抽出日時の昇順／降順のトグル式で切り替わる。
【００５１】
これにより、例えば選択した文書ファイルのファイル名はそれぞれまったく違っていても書かれている内容は近いと判断されるときに、キーワード抽出日時順の内容の移り変わりを簡単に確認することができる。
【００５２】
図８は、複数ファイル選択時にキーワード情報表示部３０２において、デフォルトでどの順序で表示するかを設定する画面である。図８の８０３〜８０６のチェックボタンのどれかを選択することで名前の昇順／降順・キーワード抽出日時順の昇順／降順のうちのどれかを選択し、「ＯＫ」ボタン８０１を押下することで、デフォルトでどの順序で表示するかを設定することができる。
【００５３】
以上により、本実施例では、ファイル閲覧アプリケーションにて、複数の選択された文書ファイルにおいて、各々の文書ファイルから複数のキーワードを抽出し表示することで、容易に複数の文書ファイルの内容を確認することが可能である。
【００５４】
【発明の効果】
以上説明したように、請求項１のファイル閲覧装置によれば、ファイル閲覧アプリケーションにおいて、複数の文書ファイルが選択されたときに、選択されたすべての文書ファイルに対して、その内容から内容を表すのにふさわしい複数のキーワードを抽出し表示することで、容易に複数の文書ファイルの内容を確認することができる。
【００５５】
また、請求項２のファイル閲覧装置によれば、ファイル閲覧アプリケーションにおいて、複数の文書ファイルの内容から抽出した複数のキーワードの表示をファイル名の昇順および降順で並び替えることができる。
【００５６】
また、請求項３のファイル閲覧装置によれば、ファイル閲覧アプリケーションにおいて、複数の文書ファイルの内容から抽出した複数のキーワードの表示をキーワード抽出日時の昇順および降順で並び替えることができる。
【００５７】
また、請求項４のファイル閲覧装置によれば、ファイル閲覧アプリケーションにおいて、複数の文書ファイルの内容から抽出した複数のキーワードを表示する際に、ファイル名の昇順・降順およびキーワード抽出日時の昇順・降順の中からどの順で表示させるか選択することができる。
【図面の簡単な説明】
【図１】本発明の第１の実施の形態に係るファイル閲覧装置の概略構成を示すブロック図である。
【図２】本発明の第１の実施の形態に係るファイル閲覧装置のプログラムのモジュール関係を示す説明図である。
【図３】本発明の第１の実施の形態に係るファイル閲覧装置のＧＵＩ画面である。
【図４】本発明の第１の実施の形態に係るファイル閲覧装置の処理の流れ図である。
【図５】本発明の第１の実施の形態に係るファイル閲覧装置のキーワード抽出処理の流れ図である。
【図６】本発明の第１の実施の形態に係るファイル閲覧装置のＣＰＵ使用率上限値設定画面である。
【図７】本発明の第１の実施の形態に係るファイル閲覧装置のバックグラウンド処理の流れ図である。
【図８】本発明の第１の実施の形態に係るファイル閲覧装置のキーワード数設定画面である。
【図９】本発明の第１の実施の形態に係るファイル閲覧装置のキーワード表示設定画面である。
【符号の説明】
１０１ＣＰＵ
１０２ハードディスクコントローラ
１０３ハードディスク
１０４キーボード
１０５ポインティングデバイス
１０６ＲＡＭ
１０８ビデオＲＡＭ
１０９表示コントローラ
１１０モニター
１１３プログラム
１１４文書ファイルＡ
１１５文書ファイルＢ
１１６文書ファイルＣ
２０１メイン処理部
２０２ファイルマネージャー処理部
２０３ＧＵＩ処理部
２０４キーワード抽出処理部
２０５文書読込部
２０６キーワード抽出部
２０７ＣＰＵ監視部
２０８キーワードデータベース
２０９バックグラウンド処理登録データベース[0001]
TECHNICAL FIELD OF THE INVENTION
The present invention relates to a file browsing device, a file browsing method, a file browsing program, and a storage medium.
[0002]
[Prior art]
Conventionally, a file browsing application has existed, such as Microsoft Explorer. The information about the files displayed by them is the file name, the path of the file (the location where it is stored in the file system), the creation date, the update date, the access date, and the like.
[0003]
For example, as proposed in Japanese Patent Application Laid-Open No. H10-247156, an image of an image file is displayed in a reduced form, or the entire document of a document file is captured as an image and displayed in a reduced form. Or there was a way to.
[0004]
[Problems to be solved by the invention]
However, in the above conventional example, it is easy to confirm the contents from the file name if the file name of the document file is a name accurately representing the contents. However, if the file name does not accurately represent the contents, For example, text1. txt, document1. txt or the like, regardless of the content, or SP-1. When the file name is given in an abbreviated form such as txt which can be understood only by the person who created the file, it was difficult to confirm the contents from the file name.
[0005]
In the means proposed in Japanese Patent Application Laid-Open No. H10-247156, the entire document file is captured as an image and displayed in a reduced form. Therefore, the display is small, and even the written characters can be read. It was difficult to check the contents of the document file.
[0006]
Accordingly, an object of the present invention is to provide a file browsing apparatus, a file browsing method, a file browsing program, and a storage medium that can display a plurality of words that accurately represent the contents of a document file.
[0007]
[Means for Solving the Problems]
In order to achieve the above object, a file browsing apparatus according to claim 1 of the present invention comprises a video observation unit, an input unit, a calculation unit, a measurement unit for measuring a use situation of the calculation unit, a storage unit, and a document file. Extracting means for extracting words from the contents of the document, storing means for storing the words extracted by the extracting means, the last update date and time of the document file, and the date and time when the words were extracted, and display means for displaying the words extracted by the extracting means. In the file browsing apparatus, when a plurality of document files are selected by the input unit, the display unit displays words extracted from the contents of each document file at the same time.
[0008]
In order to achieve the above object, a file browsing apparatus according to claim 2 of the present invention provides a file name that sorts words extracted from the contents of a plurality of document files displayed by said display means in the order of file names. It is characterized by having an order rearranging means.
[0009]
In order to achieve the above object, a file browsing apparatus according to claim 3 of the present invention is a file browsing apparatus that sorts words extracted from the contents of a plurality of document files displayed by the display unit in the order of word extraction date and time. It is characterized by having an extraction date and time sorting means.
[0010]
In order to achieve the above object, the file browsing apparatus according to claim 4 of the present invention, when displaying words extracted from the contents of a plurality of document files on the display means, displays the words in a normal order. It has a setting means for setting whether to order by name or word extraction date and time.
[0011]
BEST MODE FOR CARRYING OUT THE INVENTION
Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.
[0012]
(Example)
The present invention is one function of a file browsing application, and the description relates to a function of extracting and displaying a plurality of keywords (words) accurately representing the contents from the contents of a document file according to the present invention of the file browsing application. explain.
[0013]
FIG. 1 shows a block diagram of this embodiment. A CPU 101 controls the entire embodiment. A hard disk controller (HDC) 102 controls a data program in the hard disk (HD) 103. The hard disk stores a file browsing program 113 having a function related to the present invention, a document file A114, a document file B115, and a document file C116.
[0014]
104 is a keyboard, and 105 is a pointing device (PD) such as a mouse or a digitizer, for example, to issue an instruction for starting a program. A RAM 106 stores programs and data. Reference numeral 109 denotes a display controller which controls output of video data stored in the VRAM 108 to the monitor 110 as a video signal.
[0015]
FIG. 2 is a block diagram of a file browsing application having functions related to the present invention. However, parts not directly related to the present invention are simplified.
[0016]
In FIG. 2, a main processing unit 201 is a part that controls the entire file browsing application. The file manager processing unit 202 performs processes such as displaying the file names of the files stored in the hierarchical structure on the HD (103), and moving between layers.
[0017]
A GUI (Graphical User Interface) processing unit 203 displays information on files existing in a certain hierarchy obtained by the file manager processing unit 202, and operates a keyboard 104 or a pointing device 105 by a user. When a file or directory is selected, the information is sent to the main processing unit 201.
[0018]
The keyword extraction processing unit 204 determines whether the selected file is a document file when the information of the selected file is passed from the GUI processing unit 203 via the main processing unit 201 and determines that the selected file is a document file. In this case, the document is read by the document reading unit 205, and a plurality of keywords that accurately represent the contents of the document file are extracted by the keyword extraction unit 206. Thereafter, the keyword extraction processing unit 204 stores the extracted keywords in the keyword database 208 and manages them.
[0019]
Also, the keyword extraction processing unit 204 returns information on the extracted keywords to the main processing unit 201.
[0020]
Further, the keyword extraction processing unit 204 causes the CPU monitoring unit 207 to monitor the usage of the CPU, and when the load of the CPU on another application running on the same computer increases, the keyword extraction unit 206 performs the keyword extraction processing. The process is stopped, and information on the stopped keyword extraction process is stored in the background process registration database 209.
[0021]
Thereafter, when the load on the CPU for another application operating on the same computer is reduced, the CPU monitoring unit 207 notifies the keyword extraction processing unit 204 of the fact.
[0022]
Upon receiving the notification, the keyword extraction processing unit 204 reads the information of the suspended keyword extraction processing stored in the background processing registration database 209, sends the information to the keyword extraction unit 206, and resumes the keyword extraction processing. Let it.
[0023]
FIG. 3 is a GUI screen of a file browsing application having the functions of the present invention.
[0024]
A path 301 indicates which directory in the file system this file browsing application is currently displaying. Reference numeral 303 denotes a file or directory existing in the currently displayed directory. In the figure, text1. txt, text2. txt is selected, and its file name and icon (pictogram) are shown in reverse video to indicate that it is selected. The keyword information display unit 302 uses the function of the present invention to display the file name of the selected document file (text1.txt, text2.txt), the keyword extracted from the content of the document file, and the keyword extraction date and time. it's shown.
[0025]
In FIG. 4, when a plurality of files displayed at 303 in FIG. 3 are selected by a user's operation of a keyboard 104 or a pointing device 105 such as a mouse, the function of the present invention extracts keywords. FIG. 11 is a diagram showing a flow of displaying the extracted and extracted keywords, and the processing will be described in detail below.
[0026]
In S401, information such as the names of all selected files and the total number of selected files is acquired. Here, the number-of-processes counter in the RAM 106 is reset to zero. In S402, the information currently displayed on the keyword information display unit 302 is deleted.
[0027]
Thereafter, the processing flow from S403 to S414 is performed for each file. In S403, it is determined whether or not the selected file is a document file. The determination method is based on the extension of the file name and the extension of the document file (for example, txt, rtf, doc, html).
[0028]
In step S404, the keyword extraction processing unit 204 accesses the keyword database 208 to check whether the keyword of the target document file is registered. As a result of the check, if the keyword has not been registered, a message indicating that the keyword extraction processing has not been completed is displayed at 302 in FIG. If it is determined in step S404 that a keyword has already been registered, the process proceeds to step S412, and the keyword is displayed in 302 in FIG. Further, in step S413, it is determined whether the document file has been updated after the keyword is extracted and registered by comparing the last update date and time of the document file with the last update date and time information registered in the keyword database. If not, the process proceeds to S414. If it is determined in S404 that the keyword has not been registered or that the document file has been updated in S413, the process proceeds to S406.
[0029]
In S406, it is checked whether the background processing end flag in the RAM 106 is HIGH. If it is HIGH, it means that an instruction to stop the background processing described later has already been issued, and the process proceeds to S408. If the signal remains LOW, the background end flag is set to HIGH in step S407 to issue a command to stop the background processing.
[0030]
The reason why the background processing is stopped is that the processing described in this description when a file is selected by a user's operation of the keyboard 104 or the mouse 105 must be preferentially performed.
[0031]
In step S408, a keyword is extracted. The keyword extraction method will be described below with reference to FIG.
[0032]
In S501, the CPU usage data acquired by the CPU monitoring unit 207 is confirmed. If the CPU usage does not exceed the value set in advance on the CPU usage upper limit setting screen in FIG. 6, the process proceeds to S502. If it exceeds, the path of the document file is registered in the background processing registration database, and the keyword extraction processing ends. If the keyword extraction processing is S703 of the background processing, the background processing end flag in the RAM 106 is checked. If the background processing end flag is HIGH, the path of the document file is changed. The keyword is registered in the background processing registration database, and the keyword extraction processing ends.
[0033]
In step S <b> 502, the document reading unit 205 reads a document file and stores the document file in the RAM 106. In step S503, the CPU usage rate and the background processing end flag are checked as in step S501.
[0034]
In step S504, only the document is extracted from the stored document file data. This means, for example, that if the document file is an HTML file, the HTML tags in the file are removed and only the document is created. In step S505, the CPU usage rate and the background processing end flag are checked as in step S501.
[0035]
In step S506, the keyword extraction unit 206 extracts a number of keywords set in advance on the keyword number setting screen in FIG. 8 from the document extracted in step S504. Since a method for extracting a keyword is known, a detailed description will not be given here. In S507, the CPU usage rate and the background processing end flag are checked as in S501.
[0036]
In step S508, it is checked whether the document file has been registered in the background processing registration database 209, and if it has been registered, it is deleted.
[0037]
The keyword extracted in S509 is registered in the keyword database 208 together with the path of the document file, the last update date / time data, and the keyword extraction date / time data.
[0038]
In step S409 of FIG. 4, the keyword database is accessed, and whether the keyword extraction has been completed in step S405 is compared with the registered update date / time data and the update date / time information of the document file to determine whether they match. However, when the CPU usage rate exceeds the set value in S501, S503, S505, and S507, the keyword extraction processing is shifted to the background processing in S510, and the extraction may not be completed. Therefore, if the process has been completed, that is, if they match, the process proceeds to S410. If the process has not been completed, that is, if they do not match, the process proceeds to S414.
[0039]
At S410, the keyword extracted at S408 is displayed at 302 in FIG. In S414, the number of times of processing counter in the RAM 106 is incremented by one, and as a result, it is confirmed whether or not the number matches the total number of selected files acquired in S401. If they match, it indicates that the processing for all files has been completed, and the process proceeds to S415. If the processing for all the files has not been completed yet, the process returns to S403, and the processing for the next file is performed.
[0040]
Finally, after the background processing end flag is set to LOW in S415, the background processing is restarted.
[0041]
FIG. 7 is a flowchart of the background processing.
[0042]
First, in step S701, the background processing registration database 209 is accessed to check whether there is data waiting for background processing. If not, the process proceeds to S704.
[0043]
If the data waiting for the background processing exists in the background processing registration database 209 in S701, the process proceeds to S702, and the data waiting for the background processing is read from the background processing registration database 209. At this time, the registration of the read data is deleted from the background processing registration database 209.
[0044]
Next, keywords are extracted in step S703. The description of the keyword extraction process is similar to the description above with reference to FIG.
[0045]
Lastly, in step S704, the background processing end flag in the RAM 106 is checked. If the background processing end flag is HIGH, the processing ends. If it is LOW, the process returns to S701 to repeat the background processing.
[0046]
The means for starting the background processing may be at the time of starting the file browsing program according to the present invention. Further, the means for starting the background processing may be input from the GUI screen of the file browsing program according to the present invention shown in FIG. 3 using the keyboard 104 or the pointer device 105, menu selection, button pressing, or the like.
[0047]
In the above description, the keyword extraction processing unit 204 confirms the CPU usage rate data acquired by the CPU monitoring unit 207 in S501, S503, S505, and S507, and determines the end of the process. When the CPU usage rate exceeds the set value, the CPU monitoring unit 207 performs processing by interrupting the keyword extraction processing unit 204, and when the interruption occurs, executes S510, and terminates the keyword extraction processing. I don't care.
[0048]
Reference numerals 304 and 305 in FIG. 3 denote a “name” button and a “keyword extraction date and time” button, respectively. By pressing the “name” button 304 once, the display of the keyword information display unit 302 is displayed in ascending order of the file names. Further, when the “name” button 304 is pressed once, the display of the keyword information display section 302 is displayed in descending order of the file name. When the “name” button 304 is pressed, the display of the keyword information display unit 302 is switched in a toggle manner of ascending order / descending order of file names.
[0049]
Thus, for example, when the version of the created document file is managed by the file name (for example, “Patent 1.txt”, “Patent 2.txt”, “Patent 3.txt”...), The contents change in the version order. You can easily check.
[0050]
By pressing the “keyword extraction date” button 305 once, the display of the keyword information display unit 302 is displayed in ascending order of the keyword extraction date. Further, when the “keyword extraction date” button is pressed once, the display of the keyword information display unit 302 is displayed in descending order of the keyword extraction date. By pressing the “keyword extraction date and time” button, the display of the keyword information display unit 302 is switched by a toggle type of the keyword extraction date and time in ascending / descending order.
[0051]
Thus, for example, when it is determined that the written contents are close even if the file names of the selected document files are completely different from each other, it is possible to easily confirm the change of the contents in the order of the keyword extraction date and time.
[0052]
FIG. 8 shows a screen for setting a default order in the keyword information display unit 302 when a plurality of files are selected. By selecting any of the check buttons 803 to 806 in FIG. 8, the user can select any of the ascending order / descending order of the names and the ascending order / descending order of the keyword extraction date and time, and press the “OK” button 801. , You can set the order in which they are displayed by default.
[0053]
As described above, in the present embodiment, the contents of a plurality of document files are easily confirmed by extracting and displaying a plurality of keywords from each document file in a plurality of selected document files by a file browsing application. It is possible.
[0054]
【The invention's effect】
As described above, according to the file browsing apparatus of the first aspect, when a plurality of document files are selected in the file browsing application, the contents are displayed from the contents of all the selected document files. By extracting and displaying a plurality of keywords suitable for the user, it is possible to easily confirm the contents of a plurality of document files.
[0055]
According to the file browsing apparatus of the second aspect, in the file browsing application, the display of the plurality of keywords extracted from the contents of the plurality of document files can be rearranged in ascending order and descending order of the file names.
[0056]
According to the file browsing apparatus of the third aspect, in the file browsing application, the display of the plurality of keywords extracted from the contents of the plurality of document files can be rearranged in ascending order and descending order of the keyword extraction date and time.
[0057]
According to the file browsing apparatus of the fourth aspect, when displaying a plurality of keywords extracted from the contents of a plurality of document files in the file browsing application, the file names are ascending or descending and the keyword extraction date and time are ascending or descending. You can select the order in which to display.
[Brief description of the drawings]
FIG. 1 is a block diagram showing a schematic configuration of a file browsing device according to a first embodiment of the present invention.
FIG. 2 is an explanatory diagram showing a module relation of a program of the file browsing device according to the first embodiment of the present invention.
FIG. 3 is a GUI screen of the file browsing device according to the first embodiment of the present invention.
FIG. 4 is a flowchart of a process of the file browsing device according to the first embodiment of the present invention.
FIG. 5 is a flowchart of a keyword extracting process of the file browsing device according to the first embodiment of the present invention.
FIG. 6 is a CPU usage rate upper limit value setting screen of the file browsing device according to the first embodiment of the present invention.
FIG. 7 is a flowchart of a background process of the file browsing device according to the first embodiment of the present invention.
FIG. 8 is a keyword number setting screen of the file browsing device according to the first embodiment of the present invention.
FIG. 9 is a keyword display setting screen of the file browsing device according to the first embodiment of the present invention.
[Explanation of symbols]
101 CPU
102 hard disk controller 103 hard disk 104 keyboard 105 pointing device 106 RAM
108 Video RAM
109 Display controller 110 Monitor 113 Program 114 Document file A
115 Document File B
116 Document File C
201 Main processing unit 202 File manager processing unit 203 GUI processing unit 204 Keyword extraction processing unit 205 Document reading unit 206 Keyword extraction unit 207 CPU monitoring unit 208 Keyword database 209 Background processing registration database

Claims

Image observation means, input means, calculation means, measurement means for measuring the use of the calculation means, storage means, extraction means for extracting words from the contents of the document file, final update of the words extracted by the extraction means and the document file In a file browsing apparatus having storage means for storing a date and time and a date and time when a word is extracted, and a display means for displaying the word extracted by the extraction means, when a plurality of document files are selected by the input means, the display means A file browsing device for simultaneously displaying words extracted from the contents of each document file.

A file browsing apparatus, comprising: file name sorting means for sorting words extracted from the contents of a plurality of document files displayed by the display means in the order of file names.

A file browsing apparatus, comprising: word extraction date and time sorting means for sorting words extracted from the contents of a plurality of document files displayed by the display means in word extraction date and time order.

When displaying words extracted from the contents of a plurality of document files on the display unit, the file browsing unit has a setting unit for setting whether the normal arrangement order is the order of file names or the order of word extraction date and time. apparatus.