JP3515586B2

JP3515586B2 - Document processing method and apparatus

Info

Publication number: JP3515586B2
Application number: JP27885792A
Authority: JP
Inventors: 順一青江
Original assignee: 株式会社ジャストシステム
Priority date: 1992-10-16
Filing date: 1992-10-16
Publication date: 2004-04-05
Anticipated expiration: 2019-04-05
Also published as: JPH06131225A

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は文書処理方法及び装置、
詳しくは作成された文書を所定の規則に従って分類する
文書処理方法及び装置に関するものである。BACKGROUND OF THE INVENTION The present invention relates to a document processing method and apparatus,
More specifically, the present invention relates to a document processing method and apparatus for classifying created documents according to predetermined rules.

【０００２】[0002]

【従来の技術】一般に、ワードプロセッサで代表される
文書編集装置においては、作成した或いは編集中の文書
をフロッピーディスクやハード磁気ディスク等の記憶装
置に保存する。2. Description of the Related Art Generally, in a document editing device represented by a word processor, a created or edited document is stored in a storage device such as a floppy disk or a hard magnetic disk.

【０００３】保存する理由はいろいろあるが、例えば、
文書が未完の場合の再編集を容易にするためや、既に作
成した文書を土台にして別の文書を作成するため等であ
る。There are various reasons for saving, for example,
This is for facilitating re-editing when a document is incomplete, and for creating another document based on the already created document.

【０００４】ところで、記憶装置はその容量が許す限
り、多数の文書を記憶できることが可能になっている。
従って、一旦保存された文書群の中から目的の文書をい
ち早く発見するため、ユーザはその文書を保存する場合
にその文書の特性を最も良く表すファイル名を付けるこ
とで対処していた。By the way, the storage device can store a large number of documents as long as its capacity allows.
Therefore, in order to quickly find the target document in the document group that has been once stored, the user has dealt with this by giving a file name that best represents the characteristics of the document when saving the document.

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、個々の
ファイル名はユーザが勝手に決めるものであって、実際
問題として個々のファイル名を上手に付けることは結構
難しい。フロッピーディスク毎或いはディレクトリ毎に
文書を管理するのも一つの手段ではあるが、これもユー
ザ次第であって、複数の人間が共通に使用する装置の場
合には、個々のユーザ毎に文書管理法が異なるので、問
題点は依然として存在する。However, the individual file names are arbitrarily decided by the user, and as a practical matter, it is quite difficult to give each file name well. One of the means is to manage documents for each floppy disk or for each directory, but this is also up to the user, and in the case of a device commonly used by a plurality of people, a document management method for each individual user. , The problem still exists.

【０００６】[0006]

【課題を解決するための手段】本発明はかかる従来技術
に鑑みなされたものであり、格別意識せずとも文書ファ
イルの分類を行い、その管理を容易にする文書処理方法
及び装置を提供しようとするものである。SUMMARY OF THE INVENTION The present invention has been made in view of the above prior art, and an object of the present invention is to provide a document processing method and apparatus for classifying document files and facilitating their management without special consideration. To do.

【０００７】この課題を達成するため本発明の文書処理
方法は、文書を入力する入力工程と、前記入力工程によ
って入力された文書内の所定の文字列および前記所定の
文字列の前記文書中における存在行に関する情報を含む
存在位置を検出する検出工程と、前記検出工程によって
検出された前記所定の文字列と前記所定の文字列の前記
文書中における存在位置との組み合わせに基づいて前記
文書の種類を判別する文書種類判別工程と、前記文書種
類判別工程によって判別された種類に基づいて前記文書
を分類して登録する登録工程と、を含んだことを特徴と
するIn order to achieve this object, the document processing method of the present invention includes an input step of inputting a document, a predetermined character string in the document input by the input step, and a predetermined character string in the document. Type of the document based on a detection step of detecting an existence position including information about an existence line, and a combination of the predetermined character string detected by the detection step and an existence position of the predetermined character string in the document And a registration step of classifying and registering the document based on the type determined by the document type determination step.

【０００８】また、本発明の文書処理方法は、文書を入
力する入力工程と、前記入力工程によって入力された文
書内の所定の文字列および前記所定の文字列の前記文書
中における存在行に関する情報を含む存在位置を検出す
る検出工程と、前記検出工程によって検出された前記所
定の文字列と前記所定の文字列の前記文書中における存
在位置との組み合わせに基づいて前記文書の種類を判別
する文書種類判別工程と、前記文書種類判別工程によっ
て判別された種類に関する情報を前記文書に付加する種
類情報付加工程と、を含んだことを特徴とする。Further, the document processing method of the present invention includes an input step of inputting a document, a predetermined character string in the document input by the input step, and information regarding the existing line of the predetermined character string in the document. A document for discriminating the type of the document based on a combination of a detection step of detecting a presence position including the predetermined character string detected by the detection step and a position of the predetermined character string in the document. It is characterized by including a type discriminating step and a type information adding step of adding information on the type discriminated by the document type discriminating step to the document.

【０００９】また、本発明の文書処理装置は、文書の入
力を受け付ける入力手段と、前記入力手段によって入力
が受け付けられた文書内の所定の文字列および前記所定
の文字列の前記文書中における存在行に関する情報を含
む存在位置を検出する検出手段と、前記検出手段によっ
て検出された前記所定の文字列と前記所定の文字列の前
記文書中における存在位置との組み合わせに基づいて前
記文書の種類を判別する文書種類判別手段と、前記文書
種類判別手段によって判別された種類に基づいて前記文
書を分類して登録する登録手段と、を備えたことを特徴
とする。Further, the document processing apparatus of the present invention has input means for receiving an input of a document, a predetermined character string in the document input by the input means, and the existence of the predetermined character string in the document. A detecting unit that detects an existing position including information about a line; and a type of the document based on a combination of the predetermined character string detected by the detecting unit and an existing position of the predetermined character string in the document. It is characterized by further comprising document type discriminating means for discriminating and registration means for classifying and registering the document based on the type discriminated by the document type discriminating means.

【００１０】また、本発明の文書処理装置は、文書の入
力を受け付ける入力手段と、前記入力手段によって入力
が受け付けられた文書内の所定の文字列および前記所定
の文字列の前記文書中における存在行に関する情報を含
む存在位置を検出する検出手段と、前記検出手段によっ
て検出された前記所定の文字列と前記所定の文字列の前
記文書中における存在位置との組み合わせに基づいて前
記文書の種類を判別する文書種類判別手段と、前記文書
種類判別手段によって判別された種類に関する情報を前
記文書に付加する種類情報付加手段と、を備えたことを
特徴とする。Further, the document processing apparatus of the present invention has input means for receiving an input of a document, a predetermined character string in the document input by the input means, and the existence of the predetermined character string in the document. A detecting unit that detects an existing position including information about a line; and a type of the document based on a combination of the predetermined character string detected by the detecting unit and an existing position of the predetermined character string in the document. It is characterized by further comprising document type discriminating means for discriminating and type information adding means for adding information on the type discriminated by the document type discriminating means to the document.

【００１１】[0011]

【作用】かかる本発明の文書処理方法及び装置におい
て、入力した文書情報中に文書の種類を特定するための
有為な文字列があるかどうか、ある場合にはその存在位
置を検出する。そして検出された結果に基づいて当該文
書情報を分類する。In the document processing method and apparatus of the present invention, whether or not there is an effective character string for specifying the type of document in the input document information, and if there is, the existence position thereof is detected. Then, the document information is classified based on the detected result.

【００１２】[0012]

【実施例】以下添付図面に従って本発明に係る実施例を
詳細に説明する。Embodiments of the present invention will be described below in detail with reference to the accompanying drawings.

【００１３】実施例における文書処理装置のブロック構
成を図１に示す。図示において、１は本装置全体の制御
を司るＣＰＵ、２はブートプログラムを記憶しているＲ
ＯＭ、３は文書編集に係るプログラム（後述する図３、
４のフローチャートを含む）及び編集中の文書を記憶す
るＲＡＭ、４は文字入力や各種指示コマンドを入力する
ためのキーボードである。５は本装置のＯＳ、前述した
プログラム、文書ファイル、かな漢字変換辞書、更に
は、後述する文書分類解析テーブルを記憶している外部
記憶装置（例えばハードディスク装置やフロッピーディ
スク装置等）である。６は表示される文字を展開するＶ
ＲＡＭ、７はＶＲＡＭ６に展開された文字等を表示する
表示装置である。FIG. 1 shows a block configuration of the document processing apparatus in the embodiment. In the figure, 1 is a CPU that controls the entire apparatus, and 2 is an R that stores a boot program.
OM and 3 are programs related to document editing (see FIG. 3, which will be described later).
(Including the flowchart of FIG. 4) and a RAM for storing the document being edited, and 4 is a keyboard for inputting characters and various instruction commands. An external storage device (for example, a hard disk device or a floppy disk device) 5 stores the OS of the device, the above-mentioned program, the document file, the kana-kanji conversion dictionary, and the document classification analysis table described later. 6 is the V that expands the displayed characters
RAMs 7 are display devices for displaying characters and the like expanded in the VRAM 6.

【００１４】上記構成において、実施例では、本装置上
で作成或いは編集した文書を外部記憶装置５に保存する
とき、当該文書の種類を判別し、その判別情報を付加し
て保存する。In the above configuration, in the embodiment, when the document created or edited on the apparatus is stored in the external storage device 5, the type of the document is discriminated and the discrimination information is added and stored.

【００１５】文書の種類を判別する原理を以下に説明す
る。The principle of determining the document type will be described below.

【００１６】通常、この種の装置では作成しようとする
文書は勿論自由であるが、作成・編集される文書として
手紙や論文等が多いことも事実である。Usually, in this type of apparatus, the document to be created is of course free, but it is a fact that there are many letters and papers as the created and edited documents.

【００１７】手紙文書、特に、個人から個人宛の手紙の
場合には先頭に“拝啓”や“前略”、英文の場合には
“Ｄｅａｒ”がくることが多い。同じ手紙文書でも、業
務で使用される形式では、先頭には日付、その次に相手
先会社名、そしてその次に自身の会社名が続き、その後
に前述した“拝啓”などが続いて以下本文が続く。ま
た、論文の場合には、タイトルがきて、最後には“参考
文献”或いは単に“文献”という見出しがあって、その
後に文献名が列挙されるケースが多い。In the case of a letter document, particularly a letter from one person to another, "Dearing" or "Omission" is often added at the beginning, and in the case of an English sentence, "Dear" is often used. Even in the same letter document, in the format used in business, the first part is the date, then the partner company name, and then the own company name, followed by the above-mentioned "dear" etc. Continues. In addition, in the case of papers, there are many cases in which the title comes, the end is the heading “References” or simply “References”, and the names of the references are listed after that.

【００１８】以上の説明から、“拝啓”などの文字列が
先頭に位置している場合にはその文書は個人的手紙文書
（個人から個人宛の手紙）と判断できることがわかる。
また、同じ“拝啓”等の文字列が文書の中間に位置する
場合（少なくとも先頭位置には存在しない場合）には、
その文書は業務用手紙と判断できる。また、“文献”と
いう文字列が中間位置に存在する場合には、その文書は
論文であると認識できる。From the above description, it can be understood that when a character string such as "dear greetings" is located at the beginning, the document can be judged as a personal letter document (individual to individual letter).
In addition, if the same character string such as "Dear greeting" is located in the middle of the document (at least at the beginning position),
The document can be identified as a business letter. If the character string “reference” exists in the middle position, the document can be recognized as a paper.

【００１９】そこで、実施例では、文書の種類を特定す
るのに有意な文字列及びその文字列の位置に基づいて、
文書の種類を判別する。そして、その判別結果は、保存
する文書に付加する。Therefore, in the embodiment, based on the character string significant for specifying the document type and the position of the character string,
Determine the type of document. Then, the determination result is added to the document to be saved.

【００２０】この文書種類判別を行うため、実施例では
図２に示す文書分類解析テーブルを外部記憶装置５内に
記憶保持しておく。図示の文書分類解析テーブルについ
て簡単に説明すると、以下の通りである。In order to determine this document type, in the embodiment, the document classification analysis table shown in FIG. 2 is stored and held in the external storage device 5. The document classification analysis table shown in the figure will be briefly described as follows.

【００２１】文字列“拝啓”、“前略”が文書の先頭に
位置している文書は個人的文書、同じ文字列“拝啓”や
“前略”が文書の中間に位置している場合には業務用手
紙、文字列“文献”が中間に位置している場合には論文
と判断されることを表す。A document in which the character strings "Dearing" and "Omitted" are located at the beginning of the document is a personal document, and the document when the same character strings "Dearing" and "Omitted" are located in the middle of the document If the letter and the character string "reference" are located in the middle, it means that the letter is judged as a paper.

【００２２】実施例では、上記分類処理を文書を外部記
憶装置５に保存する段階で実行するものである。In the embodiment, the classification process is executed at the stage of storing the document in the external storage device 5.

【００２３】以下、図４のフローチャートに従って説明
する。尚、図示のフローチャートは前述した様に文書を
保存する指示を与えた場合に呼び出されるルーチンを示
している。The operation will be described below with reference to the flowchart of FIG. The flowchart shown in the drawing shows a routine that is called when an instruction to save a document is given as described above.

【００２４】先ず、ステップＳ１では、保存すべきデー
タがＲＡＭ３上に存在するかどうかを判断する。保存す
べきデータがない場合には、当該文書保存処理は無効で
あるとして、メイン処理に復帰する。First, in step S1, it is determined whether or not the data to be stored exists in the RAM 3. If there is no data to be saved, the document saving process is considered invalid, and the process returns to the main process.

【００２５】また、保存すべきデータが存在する場合に
は、ステップＳ２に進んで、文書分類解析テーブル中の
１つの分類文字列を取り出し（初期段階では先頭の分類
文字列）、それが当該文書中に存在するかを検索する。
ステップＳ３では、検索した結果、それが存在したかど
うかを判断する。尚、検索そのものは公知であるので、
ここでの説明は割愛する。If there is data to be stored, the process proceeds to step S2, one classification character string in the document classification analysis table is extracted (the initial classification character string in the initial stage), and that is the document. Search whether it exists inside.
In step S3, it is determined whether or not it exists as a result of the search. Since the search itself is known,
I will omit the explanation here.

【００２６】さて、存在した場合には、ステップＳ４に
進んで、その分類文字列とその存在した位置（何行目か
等）をＲＡＭ３中の所定エリアに記憶し、ステップＳ５
に進む。If it exists, the process proceeds to step S4 to store the classified character string and its existing position (what line etc.) in a predetermined area in the RAM 3, and then step S5.
Proceed to.

【００２７】ステップＳ５では、文書分類解析テーブル
中の全ての分類文字列についての検索が終了したかどう
かを判断する。未完であると判断した場合には、ステッ
プＳ２に戻って、次の分類文字列について上述した処理
を行う。In step S5, it is determined whether the search has been completed for all the classification character strings in the document classification analysis table. When it is determined that the character string is incomplete, the process returns to step S2, and the above-described processing is performed on the next classification character string.

【００２８】尚、１つの検索処理によって、その分類文
字列及びその存在位置が検出されるわけであるから、重
複する分類文字列の検索は行わない。例えば、図２にお
ける個人的手紙文書についての“拝啓”の検索処理を行
った場合には、業務用手紙における同じ分類文字列につ
いての検索は行わない。この為、図３に示すように、分
類文字列のみのテーブルを別個設けた。つまり、ステッ
プＳ２における検索処理は図３に示す分類文字列テーブ
ルについての検索処理のみを行う。Since the classification character string and the position where the classification character string exists are detected by one search process, a search for a duplicate classification character string is not performed. For example, when the "dearing" search process for the personal letter document in FIG. 2 is performed, the search for the same classification character string in the business letter is not performed. Therefore, as shown in FIG. 3, a table containing only the classification character strings is provided separately. That is, in the search process in step S2, only the search process for the classification character string table shown in FIG. 3 is performed.

【００２９】こうして、全ての分類文字列についての検
索処理を行うと、ＲＡＭ３上の所定エリア上には、分類
文字列とその存在位置の情報が記憶されることになる。
但し、必ずしも１組のみが記憶されているとは限らず、
場合によっては２以上の組、或いは全然ない場合が考え
られる。In this way, when the search processing is performed for all the classification character strings, the classification character strings and the information of their existing positions are stored in a predetermined area on the RAM 3.
However, only one set is not always stored,
In some cases, there may be two or more groups or no group at all.

【００３０】ステップＳ６では、こうして検索して得ら
れた分類文字列及びその存在位置から、当該文書を分類
する。In step S6, the document is classified based on the classification character string obtained by the above search and its existing position.

【００３１】先に説明したように、１つの文書に図２に
示すような分類文字列が１つも存在しない場合には、当
該文書は通常の文書とは異なる文書であると認識し、
“その他”という分類を割り当てる。As described above, when there is no classification character string as shown in FIG. 2 in one document, the document is recognized as a document different from a normal document,
Assign the category “Other”.

【００３２】また、２以上の分類文字列が存在する場
合、例えば、個人用手紙にも該当し、論文にも該当する
というような結果になった場合には、“分類不能”とい
う分類を割り当てる。Further, when there are two or more classification character strings, for example, when the result is such that it corresponds to a personal letter and also corresponds to a paper, the classification "unclassifiable" is assigned. .

【００３３】さて、１つの分類文字列しかなかった場合
には、その分類文字列の存在位置に基づいて当該文書を
分類することはできる。If there is only one classification character string, the document can be classified based on the position where the classification character string exists.

【００３４】いずれにしても、上記処理を行うことで、
保存しようとする文書の分類が行われることになる。In any case, by performing the above processing,
The documents to be saved will be classified.

【００３５】ステップＳ７では、上記分類処理によって
決定された分類名を当該文書データの所定位置（書式制
御情報中）に付加させ外部記憶装置５に記憶させる。In step S7, the classification name determined by the classification processing is added to a predetermined position (in the format control information) of the document data and stored in the external storage device 5.

【００３６】次に、実施例の文書処理装置における文書
読み出し処理について説明する。Next, the document reading process in the document processing apparatus of the embodiment will be described.

【００３７】一旦記憶させた文書ファイルを再度編集す
る場合、当然のことながらその文書ファイルをＲＡＭ３
上に読み込むことが必要である。実施例では、読み込み
対象の文書ファイル一覧を画面に表示する場合、読み込
むファイルの分類を操作者が指定する。そして、例え
ば、“業務用手紙”を指定した場合には、その分類のフ
ァイル一覧のみを表示して、目的文書を探し易くする。When the document file once stored is to be edited again, the document file is naturally stored in the RAM 3
Needs to be read on. In the embodiment, when the document file list to be read is displayed on the screen, the operator specifies the classification of the read file. Then, for example, when "business letter" is specified, only the file list of that category is displayed to facilitate searching for the target document.

【００３８】図５のフローチャートに従って説明する。A description will be given according to the flowchart of FIG.

【００３９】先ず、ステップＳ１１では、表示対象の指
定を行う。ここで言う表示対象とは、フロッピーディス
クなどのドライブ名やディレクトリ名を言う。First, in step S11, a display target is designated. The display target here means a drive name or a directory name of a floppy disk or the like.

【００４０】ステップＳ１２では、分類名を指定する。
指定方法にも様々な例が考えられるが、ここでは分類の
一覧を表示し、そのうちの１つ、或いはそれ以上を指定
する。複数指定した場合には、その指定された分類の論
理和の類が指定されたものとして処理を行う。In step S12, a classification name is designated.
Although various examples can be considered as the designation method, here, a list of classifications is displayed and one or more of them are designated. When a plurality of types are designated, the logical sum type of the designated classification is designated.

【００４１】ステップＳ１３に処理が進むと、ステップ
Ｓ１１で指定された対象中にある１つの文書ファイルの
予め決められた位置を調べ、その文書の分類を抽出す
る。そして、次のステップＳ１４では、現在注目してい
る文書ファイルの分類は指定された分類であるかどうか
が判断される。指定された分類であると判断された場合
にはステップＳ１５に進んで、その文書ファイル名を画
面に表示する。When the process proceeds to step S13, the predetermined position of one document file in the target specified in step S11 is examined and the classification of the document is extracted. Then, in the next step S14, it is judged whether or not the classification of the document file of interest is the specified classification. If it is determined that the document is the designated category, the process proceeds to step S15, and the document file name is displayed on the screen.

【００４２】次のステップＳ１６では、指定された対象
中の全ての文書ファイルに対する上述した処理を行った
かどうかを判断し、未完であると判断した場合にはステ
ップＳ１３に戻る。In the next step S16, it is determined whether or not the above-described processing has been performed on all the document files in the designated object. If it is determined that the document files are incomplete, the process returns to step S13.

【００４３】従って、ステップＳ１６の判断で、“ｙｅ
ｓ”となったとき、表示画面には指定された分類の文書
ファイル一覧が表示されていることになる。以下、この
中で所望とする文書ファイルを指定し、その文書ファイ
ルのＲＡＭ３への読み込み処理が行われる。Therefore, in the determination of step S16, "yes"
When "s" is reached, a list of document files of the designated classification is displayed on the display screen. Hereinafter, a desired document file is designated among these, and the document file is read into the RAM3. Processing is performed.

【００４４】以上説明したように実施例によれば、特定
の文字列とその存在位置に基づいて文書を自動的に分類
することができるようになる。従って、文書ファイルの
管理が容易になり、且つ、目的とする文書ファイルを探
し出すことが簡単になる。As described above, according to the embodiment, it becomes possible to automatically classify the documents based on the specific character string and the position where the character string exists. Therefore, it becomes easy to manage the document file and it becomes easy to find the target document file.

【００４５】尚、実施例で示した文書分類解析テーブル
（図２参照）及び分類文字列テーブル（図３参照）は、
ユーザに自由に変更・追加できるようになっている（変
更・追加は通常の文書編集作業で行ってもよいし、特別
なコマンドを指示したときに行っても良い）。The document classification analysis table (see FIG. 2) and the classification character string table (see FIG. 3) shown in the embodiment are:
The user can freely change / add (change / add may be done by normal document editing work, or when a special command is instructed).

【００４６】従って、例えば、業務用手紙としてより高
い識別率で分類させようとする場合、先頭からｘ行目以
前に“○×△□株式会社”という文字列があれば業務用
手紙（或いは報告書）として分類できるようにしてもよ
い。更に、業務用手紙の場合に、相手先の会社毎に分類
させることも可能である。また、社内文書であれば、
“稟議書”や、“回覧”などの文字列を扱っても十分な
分類は行える。Therefore, for example, when trying to classify as a business letter with a higher identification rate, if there is a character string "○ × △ □ Co., Ltd." before the x-th line from the beginning, the business letter (or report) You may be able to classify as a book. Further, in the case of business letters, it is possible to classify the letters by the other company. Also, if it is an internal document,
Sufficient classification can be performed even when dealing with character strings such as “Ryokan” and “circulation”.

【００４７】このように、様々な分類文字列を登録する
ようにすると、図４のステップＳ４における処理で、Ｒ
ＡＭ３上に生成される分類文字列とその存在位置情報が
複数組検出されることが予想される。そこで、図２の文
書分類解析テーブルに、優先順位を付け、複数組が発生
しても分類不能にならないようにしても良い。As described above, if various classification character strings are registered, in the process in step S4 of FIG.
It is expected that a plurality of sets of classification character strings and their existing position information generated on the AM3 will be detected. Therefore, the document classification analysis table of FIG. 2 may be prioritized so that even if a plurality of sets occur, the classification cannot be disabled.

【００４８】更に、１つの文書ファイルの分類として１
つのみを許可するのではなく、複数の分類を許すように
してもよい。つまり、ある文書ファイルの分類として、
“業務用手紙”や、“回覧”などといった複数の分類カ
テゴリーを許すのである。Further, the classification of one document file is 1
Rather than allowing only one, multiple classifications may be allowed. In other words, as a classification of a certain document file,
It allows multiple categories of categories such as "business letter" and "circular".

【００４９】＜第２の実施例の説明＞上記実施例（第１の実施例）では編集中の文書を保存す
る場合にその文書の種類を判別し、その判別結果をその
文書データと共に保存させた。そして、表示する段階に
なってその付加された分類情報を基に表示するしないを
切り替えた。<Explanation of Second Embodiment> In the above embodiment (first embodiment), when the document being edited is to be saved, the type of the document is determined, and the determination result is saved together with the document data. It was Then, at the display stage, the display is switched based on the added classification information.

【００５０】しかしながら、本発明はこれのみに限定さ
れるものではない。However, the present invention is not limited to this.

【００５１】例えば、あるディレクトリ内にある複数の
文書ファイルをまとめて分類するようにしてもよい。For example, a plurality of document files in a certain directory may be classified together.

【００５２】ここでは、一例として、フロッピーディス
クＡ内にある複数文書ファイルをフロッピーディスクＢ
に複写する場合、フロッピーディスクＢに各々の分類名
のディレクトリを作成し、同じ分類の文書ファイルは同
じディレクトリ内に保存管理させる例を説明する。尚、
ここではフロッピーディスクＡ，Ｂは論理的デバイスを
意味するものであって、物理的に異なる２つのドライブ
を含む概念である。つまり、或るディレクトリ内の文書
を整理する場合には、入力対象と出力対象とを同じにす
れば済むからである。Here, as an example, a plurality of document files in the floppy disk A are stored in the floppy disk B.
An example will be described in which a directory of each classification name is created on the floppy disk B and the document files of the same classification are stored and managed in the same directory when copying. still,
Here, the floppy disks A and B mean logical devices, and are concepts including two physically different drives. That is, when organizing the documents in a certain directory, the input target and the output target may be the same.

【００５３】先ず、ステップＳ２１、Ｓ２２ではそれぞ
れ入力対象及び出力対象を指定する。First, in steps S21 and S22, an input target and an output target are designated, respectively.

【００５４】ステップＳ２３では、入力対象位置からフ
ァイルを１つ読み込み、それをＲＡＭ３上にロードす
る。次のステップＳ２４では、ＲＡＭ３上にロードされ
た文書データを調べて、その文書を分類する。分類処理
そのものは先の図４と同じであるのでここでは省略す
る。但し、本第２の実施例では後述するようにディレク
トリ単位に文書の種類を管理するから、文書データその
ものに分類名を付加させる必要はない。In step S23, one file is read from the input target position and loaded in the RAM3. In the next step S24, the document data loaded on the RAM 3 is examined to classify the document. Since the classification process itself is the same as that in FIG. 4 described above, it is omitted here. However, in the second embodiment, since the document type is managed in directory units as described later, it is not necessary to add the classification name to the document data itself.

【００５５】さて、読み込んだ文書ファイルに対する分
類処理が完了すると、ステップＳ２５に出力対象内に該
当する分類名のディレクトリが存在するかどうかを判断
する。When the classification process for the read document file is completed, it is determined in step S25 whether or not a directory having the corresponding classification name exists in the output target.

【００５６】その分類名のディレクトリが存在すると判
断した場合には、ステップＳ２７に進んで、読み込んだ
文書ファイルをそのディレクトリ下に保存する。また、
該当するディレクトリが存在しないと判断した場合に
は、出力対象内にそのサブディレクトリを作成した後、
ステップＳ２７に進む。When it is determined that the directory of the classification name exists, the process proceeds to step S27, and the read document file is saved under that directory. Also,
If it is determined that the corresponding directory does not exist, after creating the subdirectory in the output target,
It proceeds to step S27.

【００５７】こうして、１つの入力対象中の１つの文書
ファイルの分類そして出力処理が済むと、処理はステッ
プＳ２３に戻り、全ての入力文書ファイルに対する処理
が終わるまで繰り返し処理される。In this way, when the classification and output processing of one document file in one input target are completed, the processing returns to step S23 and is repeatedly processed until the processing for all input document files is completed.

【００５８】尚、説明が前後するが、ステップＳ２１、
Ｓ２２で指定した対象が論理的に同じであれば、分類し
て文書ファイルが保存された後、分類する以前に存在し
ていた文書ファイルを削除する。Incidentally, although the explanation goes back and forth, step S21,
If the targets specified in S22 are logically the same, the document files that have been classified and stored are deleted, and then the document files that existed before the classification are deleted.

【００５９】更には、上記第１、第２の実施例では、分
類に有意な文字列を１つとして説明したが、２つ以上と
してもよい。つまり、“ある文字列Ａがａ位置にあっ
て、文字列Ｂがｂ位置にある場合に、この文書を分類Ｃ
とする”というようにする。このようにすれば、より高
い率で分類できるようになる。Furthermore, in the first and second embodiments described above, one significant character string for classification has been described, but two or more character strings may be used. In other words, "when a certain character string A is at the position a and a certain character string B is at the position b, this document is classified into the category C.
By doing this, it becomes possible to classify at a higher rate.

【００６０】[0060]

【発明の効果】以上説明したように本発明によれば、格
別意識せずとも文書ファイルの分類を行い、その管理を
容易にすることができる。As described above, according to the present invention, document files can be classified and their management can be facilitated without special consideration.

[Brief description of drawings]

【図１】実施例の文書処理装置のブロック構成図であ
る。FIG. 1 is a block configuration diagram of a document processing apparatus according to an embodiment.

【図２】実施例における文書分類解析テーブルの内容を
示す図である。FIG. 2 is a diagram showing the contents of a document classification analysis table in the embodiment.

【図３】実施例における分類文字列テーブルの内容を示
す図である。FIG. 3 is a diagram showing the contents of a classified character string table in the embodiment.

【図４】実施例の文書保存時の処理手順を示すフローチ
ャートである。FIG. 4 is a flowchart illustrating a processing procedure when a document is stored according to the embodiment.

【図５】実施例の文書ファイル一覧表示の処理手順を示
すフローチャートである。FIG. 5 is a flowchart showing a processing procedure for displaying a document file list according to the embodiment.

【図６】第２の実施例における文書分類に係るフローチ
ャートである。FIG. 6 is a flowchart related to document classification in the second embodiment.

[Explanation of symbols]

１ＣＰＵ２ＲＯＭ３ＲＡＭ４キーボード５外部記憶装置６ＶＲＡＭ７表示装置 1 CPU 2 ROM 3 RAM 4 keyboard 5 External storage device 6 VRAM 7 Display

───────────────────────────────────────────────────── フロントページの続き (58)調査した分野(Int.Cl.⁷，ＤＢ名) G06F 12/00 G06F 17/20 - 17/26 G06F 17/30 ─────────────────────────────────────────────────── ─── Continuation of front page (58) Fields surveyed (Int.Cl. ⁷ , DB name) G06F 12/00 G06F 17/20-17/26 G06F 17/30

Claims

(57) [Claims]

1. An input step of inputting a document, and detection for detecting an existence position including information on a predetermined character string in the document input by the input step and an existing line of the predetermined character string in the document. And a document type determining step of determining the type of the document based on a combination of the predetermined character string detected by the detecting step and an existing position of the predetermined character string in the document, A document processing method comprising: a registration step of classifying and registering the document based on the type determined by the determination step.

2. An input step of inputting a document, and detection for detecting an existence position including information on a predetermined character string in the document input by the input step and an existing line of the predetermined character string in the document. And a document type determining step of determining the type of the document based on a combination of the predetermined character string detected by the detecting step and an existing position of the predetermined character string in the document, A document processing method comprising: a type information adding step of adding information on the type determined by the determining step to the document.

3. An input unit for receiving an input of a document, and an existence position including a predetermined character string in the document input by the input unit and information about an existing line of the predetermined character string in the document. Detecting means for detecting, and a document type determining means for determining the type of the document based on a combination of the predetermined character string detected by the detecting means and an existing position of the predetermined character string in the document, A document processing apparatus comprising: a registration unit that classifies and registers the document based on the type determined by the document type determination unit.

4. An input unit for receiving an input of a document, and an existence position including a predetermined character string in the document input by the input unit and information about an existing line of the predetermined character string in the document. Detecting means for detecting, and a document type determining means for determining the type of the document based on a combination of the predetermined character string detected by the detecting means and an existing position of the predetermined character string in the document, A document processing apparatus, comprising: type information adding means for adding to the document information related to the type determined by the document type determining means.