JP5235499B2

JP5235499B2 - Information processing apparatus, image forming apparatus, program, and document data configuration method

Info

Publication number: JP5235499B2
Application number: JP2008135892A
Authority: JP
Inventors: 敦鯉沼
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2007-10-12
Filing date: 2008-05-23
Publication date: 2013-07-10
Anticipated expiration: 2028-05-23
Also published as: CN101408875A; JP2009110497A

Description

本発明は、文書データの文書種別を判別して文書データを構成する情報処理装置、画像形成装置、プログラム及び文書データ構成方法に関する。 The present invention relates to an information processing apparatus, an image forming apparatus, a program, and a document data configuration method that configure document data by determining the document type of the document data.

報告書やレポートなどで紙文書や電子文書（以下、単に文書という）を回覧したり提出する場面は極めて多い。ユーザは報告書やレポートなどの使用目的、使用場面等に応じて文書を作成する度に、適切な文章に推敲したりレイアウトを検討する。ところで、文書のレイアウトは文書の種類や使用場面に応じて定まっている場合が多い（例えば、特許文献１参照。）。特許文献１には、スキャンした文書のレイアウトに基づいて、文書の種類を特定しその種類毎に分類して電子化する文書処理装置が記載されている。
特開２００７−０５２６１５号公報 There are many cases in which paper documents and electronic documents (hereinafter simply referred to as documents) are circulated and submitted in reports and reports. Each time a user creates a document according to the purpose of use or usage of a report or report, he or she reviews the appropriate text or examines the layout. By the way, the layout of a document is often determined in accordance with the type of document and usage scene (see, for example, Patent Document 1). Patent Document 1 describes a document processing apparatus that specifies the type of a document based on the layout of the scanned document, classifies the document for each type, and digitizes the document.
JP 2007-052615 A

特許文献１に記載されているように、文書のレイアウトは文書の種類や使用場面等に応じて定まっているが、これまで文書の種類に応じて自動的にレイアウトを定めたり文章を推敲することはできなかった。このため、ユーザは文書の種類や使用場面に応じて適切な言葉やレイアウトを調べて文書を作成する必要があるという問題があった。 As described in Patent Document 1, the layout of a document is determined according to the type of document and usage scene, but until now, the layout is automatically determined according to the type of document and the text is recommended. I couldn't. For this reason, there has been a problem that the user needs to create a document by examining appropriate words and layouts according to the type of document and usage scene.

本発明は、上記課題に鑑み、文書作成における表現の決定や構成の設定を容易にする情報処理装置、画像形成装置、プログラム及び文書データ構成方法を提供することを目的とする。 In view of the above problems, an object of the present invention is to provide an information processing apparatus, an image forming apparatus, a program, and a document data configuration method that facilitate determination of expression and setting of a configuration in document creation.

上記課題に鑑み、本発明は、文書データの文書種別と対応づけて、該文書種別の文書データにて用いられる文字又は文字列を記憶する文書種別情報記憶手段と、文書データを入力する文書データ入力手段と、前記文書データ入力手段により入力された文書データが含む文字又は文字列に対応づけて前記文書種別情報記憶手段に記憶された文書種別に基づき、入力された文書データの文書種別を判定する種別判定手段と、文書種別毎に、文書データの文字のフォント、サイズ、太さ、又は、文字若しくは文字列の配置の一以上を指定する文書構成情報を記憶する文書構成情報記憶手段と、前記種別判定手段が判定した文書データの文書種別に応じて、前記文書構成情報記憶手段に記憶された前記文書構成情報に従い入力された文書データの体裁を整える文書構成手段と、文書種別毎に、誤字、脱字及び文法誤用以外の校正前の文字又は文字列に対応づけて、校正後の文字又は文字列を記憶した校正情報記憶手段と、前記文書データ入力手段により入力された文書データに、前記校正情報記憶手段に記憶された校正前の文字又は文字列が記憶されている場合、前記種別判定手段が判定した、入力された文書データの文書種別に応じて、前記校正情報記憶手段に記憶された校正後の文字又は文字列により校正前の文字又は文字列を校正する校正手段と、を有し、前記文書構成手段は、前記文書構成情報を構成例と共に表示し、文書データの任意の文字列に前記文書構成情報を適用する操作が受け付けられた場合、前記任意の文字列の体裁を整える、ことを特徴とする情報処理装置を提供する。 In view of the above problems, the present invention relates to document type information storage means for storing characters or character strings used in document data of the document type in association with the document type of the document data, and document data for inputting the document data. Based on the document type stored in the document type information storage unit in association with the character or character string included in the document data input by the input unit and the document data input unit, the document type of the input document data is determined. A type determination unit that performs document configuration information storage that stores, for each document type, document configuration information that specifies one or more fonts, sizes, and thicknesses of characters of document data, or an arrangement of characters or character strings; The format of the document data input according to the document configuration information stored in the document configuration information storage unit according to the document type of the document data determined by the type determination unit. A document structure unit to arrange, for each document type, typographical, in association with the caret before calibration other than and grammatical misuse character or string, and calibration information storage means for storing character or character string after calibration, the document data When the pre-proofreading character or character string stored in the proofreading information storage unit is stored in the document data input by the input unit, the document type of the input document data determined by the type determination unit And proofreading means for proofreading the character or character string before proofreading with the proofread character or character string stored in the proofreading information storage means, and the document composition means constitutes the document composition information. displays with examples, when an operation to apply the document structure information in any string of the document data has been accepted, adjust the appearance of the arbitrary character string, to provide an information processing apparatus characterized by .

本発明によれば、文書データの文書種別を判定して、文書種別に適切な文書構成に再構成することができる。
本発明によれば、文書種別に応じて文書データを校正することができる。 According to the present invention, it is possible to determine the document type of the document data and reconfigure the document structure to be appropriate for the document type.
According to the present invention, document data can be calibrated according to the document type.

本発明によれば、文書種別に応じて文書データを校正することができる。 According to the present invention, document data can be calibrated according to the document type.

また、本発明の一形態において、前記種別判定手段は、前記文書データ入力手段により入力された文書データが含む文字又は文字列のうち、前記文書種別情報記憶手段に記憶された数を文書種別毎にカウントし、カウントされた数が最も大きい文書種別の文書データであると判定する、ことを特徴とする。 Further, in one aspect of the present invention, the type determination unit calculates the number stored in the document type information storage unit among the characters or character strings included in the document data input by the document data input unit for each document type. And the document data is determined to be document data of the document type having the largest counted number.

本発明によれば、文書データに文書種別の異なる文字又は文字列が含まれていても、文書種別を適切に判定することができる。 According to the present invention, it is possible to appropriately determine the document type even if the document data includes characters or character strings having different document types.

また、本発明の一形態において、種別判定手段が、文書データ入力手段により入力された文書データを事務や取引等で用いられるビジネス文書と判定した場合、文書構成手段は、ビジネス文書の文書構成情報に従い、文書データの箇条書きの行頭に行頭記号を付加する、ことを特徴とする。 In one embodiment of the present invention, when the type determination unit determines that the document data input by the document data input unit is a business document used in office work or transaction, the document configuration unit includes document configuration information of the business document. According to the above, a bullet is added to the beginning of the bulleted list of the document data.

本発明によれば、ビジネス文書に対し、箇条書きの行頭に自動的に記号を付加できるのでユーザの利便性が向上する。 According to the present invention, since a symbol can be automatically added to a bulleted line of a business document, convenience for the user is improved.

また、本発明の一形態において、前記種別判定手段が、前記文書データ入力手段により入力された文書データを事務や取引等で用いられるビジネス文書と判定した場合、前記文書構成手段は、ビジネス文書の前記文書構成情報に従い、文書データに含まれた日時情報又は場所情報を所定の位置に配置する、ことを特徴とする。 In one embodiment of the present invention, when the type determination unit determines that the document data input by the document data input unit is a business document used in office work or transaction, the document configuration unit includes: According to the document configuration information, date / time information or location information included in the document data is arranged at a predetermined position.

本発明によれば、ビジネス文書に対し、日時や場所を適切な位置に配置できるのでユーザの利便性が向上する。 According to the present invention, the user's convenience is improved because the date and place can be arranged at an appropriate position for the business document.

また、本発明の一形態において、前記種別判定手段が、前記文書データ入力手段により入力された文書データを年賀状と判定した場合、前記文書構成手段は、年賀状の前記文書構成情報に従い、文書データに含まれる新年の挨拶を構成する文字のフォントを毛筆体に変換する、ことを特徴とする。 In one embodiment of the present invention, when the type determination unit determines that the document data input by the document data input unit is a new year card, the document configuration unit converts the document data into document data according to the document configuration information of the new year card. The font of the character which comprises the greeting of the contained New Year is converted into a brush body, It is characterized by the above-mentioned.

本発明によれば、年賀状の新年の挨拶を毛筆体に変換できるのでユーザの利便性が向上する。 According to the present invention, the New Year's greeting on the New Year's card can be converted into a brush brush, which improves user convenience.

また、本発明の一形態において、前記文書構成情報記憶手段は、十二支の干支毎に、各干支に対応した動物の画像データを記憶しており、前記種別判定手段が、前記文書データ入力手段により入力された文書データを年賀状と判定した場合、前記文書構成手段は、文書データを入力した年の次の年の干支に対応した動物の画像データを前記文書構成情報記憶手段から抽出し、年賀状の前記文書構成情報に従い、前記年賀状の所定の位置に配置する、ことを特徴とする。 In one embodiment of the present invention, the document configuration information storage means stores animal image data corresponding to each zodiac for each of the twelve zodiac signs, and the type determination means uses the document data input means. When it is determined that the input document data is a New Year's card, the document composition means extracts image data of animals corresponding to the zodiac of the year following the year in which the document data was input from the document composition information storage means, According to the document configuration information, it is arranged at a predetermined position on the New Year's card.

本発明によれば、年賀状に干支に適した画像データを貼り付けることができるので、ユーザの利便性が向上する。 According to the present invention, since image data suitable for the zodiac signs can be pasted on the New Year's card, the convenience for the user is improved.

また、本発明の一形態において、前記校正手段は、文書データに含まれた校正前の文字又は文字列と共に、複数の校正後の文字又は文字列を表示装置に表示し、複数の校正後の文字又は文字列のうちポインティングデバイスにより選択された文字又は文字列を用いて、文書データの校正前の文字又は文字列を校正する、ことを特徴とする。 In one embodiment of the present invention, the proofreading unit displays a plurality of proofread characters or character strings on a display device together with the proofread characters or character strings included in the document data, and Characters or character strings before proofreading of document data are calibrated using characters or character strings selected by a pointing device among the characters or character strings.

本発明によれば、複数の候補から適切な校正後の文字又は文字列を選択することができる。 According to the present invention, it is possible to select an appropriate proofread character or character string from a plurality of candidates.

文書作成における表現の決定や構成の設定を容易にする情報処理装置、画像形成装置、プログラム及び文書データ構成方法を提供することができる。 It is possible to provide an information processing apparatus, an image forming apparatus, a program, and a document data configuration method that make it easy to determine an expression and set a configuration in document creation.

以下、本発明を実施するための最良の形態について、図面を参照しながら説明する。 The best mode for carrying out the present invention will be described below with reference to the drawings.

本実施形態の情報処理装置は、文書に含まれる文言から文書の種別（以下、文書種別という）を判定し、文書種別に応じて表現を校正すると共に文書構成の体裁を整える。これにより、ユーザが文書の種類に応じて表現を校正したり文書構成を定めることなく、文書種別に応じて適切な表現及び文書構成にすることができるので、ユーザの利便性を向上させることができる。 The information processing apparatus according to the present embodiment determines a document type (hereinafter referred to as a document type) from a word included in the document, corrects the expression according to the document type, and arranges the appearance of the document structure. As a result, the user's convenience can be improved because the user can make an appropriate expression and document structure according to the document type without proofreading the expression according to the document type or determining the document structure. it can.

なお、文書種別とは、文書が使用される場面に応じて定まる文書の種類、例えば、ビジネス文書、年賀状、プロジェクターにより投影されるプレゼンテーション資料、歓送迎会等の案内状、等である。また、ビジネス文書には、企画書、見積書、成果報告書、出張報告書等、種々のものがありこれらを区別することも可能であるが、以下では単にビジネス文書という。 The document type is a document type determined according to the scene where the document is used, for example, a business document, a New Year's card, a presentation material projected by a projector, a greeting card for a welcome party, and the like. In addition, there are various business documents such as a plan document, an estimate document, a result report, a business trip report, and the like, and these can be distinguished.

また、表現を校正するとは、誤字、脱字及び文法誤用以外の適切でない言い回しを修正することをいう。誤字脱字、文法的な誤り、文章の表記ゆれ、等を修正してもよい。また、文書構成とは、例えば段落毎の配置位置、センタリング・左詰め・右詰め、文字のフォント、サイズ、太字・細字の別、文字や背景の色、箇条書き部の行頭記号、段落の枠の有無、等をいう。 Also, proofreading an expression means correcting an inappropriate wording other than typos, omissions, and grammatical misuse. You may correct typographical errors, grammatical errors, text swaying, etc. The document structure includes, for example, the position of each paragraph, centering / left-justified / right-justified, character font, size, bold / thin type, character / background color, bullet point bullet, paragraph frame The presence or absence, etc.

図１は、文書種別を判定する情報処理装置１１１がネットワークＮを介して画像形成装置１１０と接続された印刷システムの概略構成を示す図である。情報処理装置１１１は、例えばＰＣ（パーソナルコンピュータ）、携帯電話、ＰＤＡ（Personal Digital(Data) Assistants）、ＰＨＳ（Personal Handyphone System）等のコンピュータにより構成される。画像形成装置１１０は、プリンタ、ファクシミリ装置、スキャナ装置、ＭＦＰ（ＭｕｌｔｉＦｕｎｃｔｉｏｎＰｒｉｎｔｅｒ）等、文書を印刷、送信、保存する文書の出力装置である。 FIG. 1 is a diagram illustrating a schematic configuration of a printing system in which an information processing apparatus 111 that determines a document type is connected to an image forming apparatus 110 via a network N. The information processing apparatus 111 is configured by a computer such as a PC (personal computer), a mobile phone, a PDA (Personal Digital (Data) Assistants), or a PHS (Personal Handyphone System). The image forming apparatus 110 is a document output apparatus that prints, transmits, and stores a document, such as a printer, a facsimile apparatus, a scanner apparatus, or an MFP (Multi Function Printer).

例えば、ユーザはワープロソフトウェアなどのアプリケーションプログラムを起動した後、情報処理装置１１１のキーボード１１２及びマウス１１３を操作して文字を入力する。入力された文字は順次ディスプレイ１１４に反映され、最終的には文書や表等が生成される。以下、文書データや電子メールなど文字を含むデータをテキストデータという。テキストデータがバイナリファイルに保存されるかテキストファイルに保存されるかは問わない。 For example, after starting an application program such as word processing software, the user inputs characters by operating the keyboard 112 and the mouse 113 of the information processing apparatus 111. The input characters are sequentially reflected on the display 114, and finally a document, a table, or the like is generated. Hereinafter, data including characters such as document data and electronic mail is referred to as text data. It does not matter whether the text data is stored in a binary file or a text file.

本実施形態の情報処理装置１１１は、テキストデータを構成する自立語（主に日本語の場合）や単語（主に英語の場合）を抽出し、文書種別情報データベース(以下、ＤＢという)を参照し、テキストデータの文書種別を判定する。自立語を日本語の構成要素と、単語は英語の構成要素としたが、他の言語であっても自立語や単語に相当する構成要素から文書種別を判定できる。 The information processing apparatus 111 according to the present embodiment extracts independent words (mainly in the case of Japanese) and words (mainly in the case of English) constituting the text data, and refers to a document type information database (hereinafter referred to as DB). Then, the document type of the text data is determined. Although the independent words are Japanese constituent elements and the words are English constituent elements, the document type can be determined from the independent words and the constituent elements corresponding to the words even in other languages.

また、例えば、情報処理装置１１１がテキストデータを受信したり、可搬型の記憶媒体１２８に記憶されたテキストデータを読み出し、記憶装置１２６に記憶した場合、情報処理装置１１１はテキストデータに含まれる自立語や単語に基づき、テキストデータの文書種別を判定する。 Further, for example, when the information processing apparatus 111 receives text data or reads the text data stored in the portable storage medium 128 and stores the text data in the storage apparatus 126, the information processing apparatus 111 is self-supporting included in the text data. The document type of the text data is determined based on the word or word.

図２は、情報処理装置１１１のハードウェア構成例を示す図である。情報処理装置１１１は、バスＢで相互に接続されているＲＡＭ（Random Access Memory ）１２１、ＲＯＭ（Read-Only Memory）１２２、入力装置（図１のキーボード１１２、マウス１１３に相当）１１２，１１３、ＮＩＣ（Network Interface Card）１２３、ドライブ装置１２４、表示制御部１２５、記憶装置１２６及びＣＰＵ１２７とを有する。 FIG. 2 is a diagram illustrating a hardware configuration example of the information processing apparatus 111. The information processing apparatus 111 includes a RAM (Random Access Memory) 121, a ROM (Read-Only Memory) 122, input devices (corresponding to the keyboard 112 and the mouse 113 in FIG. 1) 112, 113, A NIC (Network Interface Card) 123, a drive device 124, a display control unit 125, a storage device 126, and a CPU 127 are included.

ＲＡＭ１２１は、ＯＳやプログラムを実行する作業メモリになり、ＲＯＭ１２２はＢＩＯＳなどＯＳを起動するためのプログラムや設定ファイルを記憶している。入力装置１１２、１１３はキーボードやマウスなど、ユーザからの様々な操作を入力するためのデバイスである。ＮＩＣ１２３は、ネットワークＮに接続するためのインターフェイスであり、ＴＣＰ（Transmission Control Protocol ）/ＩＰ（Internet Protocol）等のプロトコル処理を実行する。ドライブ装置１２４は、ＣＤ−ＲＷやメモリカード等の記憶媒体１２８が着脱可能に構成されており、記憶媒体１２８にプログラムやデータを書き込む際に使用され、また、記憶媒体１２８に記録されたプログラムやデータを読み込み、記憶装置１２６に送出する。 The RAM 121 serves as a working memory for executing the OS and programs, and the ROM 122 stores programs and setting files for starting up the OS such as BIOS. The input devices 112 and 113 are devices for inputting various operations from the user, such as a keyboard and a mouse. The NIC 123 is an interface for connecting to the network N, and executes protocol processing such as TCP (Transmission Control Protocol) / IP (Internet Protocol). The drive device 124 is configured such that a storage medium 128 such as a CD-RW or a memory card is detachable. The drive device 124 is used when a program or data is written to the storage medium 128, and the program recorded on the storage medium 128 Data is read and sent to the storage device 126.

表示制御部１２５は、アプリケーションソフトウェアが指示する画面情報に基づき所定の解像度や色数等で、ＧＵＩ（Graphical User Interface）画面を形成し、操作に必要な各種ウィンドウやデータ等をディスプレイ１１４に表示する。 The display control unit 125 forms a GUI (Graphical User Interface) screen with a predetermined resolution and the number of colors based on screen information instructed by the application software, and displays various windows and data necessary for operation on the display 114. .

記憶装置１２６は、ＨＤＤ（ハードディスクドライブ）やフラッシュメモリなど不揮発性メモリであり、ＯＳ、アプリケーションソフトウェア、プログラム１３４が記憶されている。ＣＰＵ１２７は、ＯＳ、アプリケーションソフトウェア及びプログラム１３４を記憶装置１２６からロードして実行することで種々の機能を提供すると共に、情報処理装置１１１が行う処理を統括的に制御する。 The storage device 126 is a nonvolatile memory such as an HDD (hard disk drive) or a flash memory, and stores an OS, application software, and a program 134. The CPU 127 provides various functions by loading and executing the OS, application software, and program 134 from the storage device 126, and comprehensively controls processing performed by the information processing device 111.

また、記憶装置１２６には、後述する文書種別情報ＤＢ１３１、校正情報ＤＢ１３２及び文書構成情報ＤＢ１３３が記憶されている。 The storage device 126 stores a document type information DB 131, a proofreading information DB 132, and a document configuration information DB 133, which will be described later.

情報処理装置１１１が実行するプログラム１３４は、記録媒体１２８に記憶して配布されるか、所定のサーバからネットワークＮを介して配布される。プログラム１３４を記録した記録媒体１２８がドライブ装置１２４にセットされると、プログラム１３４が記録媒体１２８からドライブ装置１２４を介して記憶装置１２６にインストールされる。また、サーバからプログラムが送信された場合、ＮＩＣ１２３を介して記憶装置１２６にインストールされる。 The program 134 executed by the information processing apparatus 111 is stored in the recording medium 128 and distributed, or distributed from a predetermined server via the network N. When the recording medium 128 recording the program 134 is set in the drive device 124, the program 134 is installed in the storage device 126 from the recording medium 128 via the drive device 124. Further, when a program is transmitted from the server, it is installed in the storage device 126 via the NIC 123.

ＣＰＵ１２７がプログラム１３４を実行することで、テキストデータの種別を判定する文書種別判定手段２２、テキストデータの表現を校正する表現校正手段２３、テキストデータの文書構成の体裁を整える文書構成手段２４、文字列を抽出する文字列抽出手段２５と、が実現される。次述するように、表現校正手段２３はワープロソフトウェアなどのアプリケーションソフトウェア２７の表現を校正し、文書構成手段２４は同様にアプリケーションソフトウェア２７の表示画面を利用するので、アプリケーションソフトウェア２７のアドインなどで構成されることが好ましい。 When the CPU 127 executes the program 134, the document type determination unit 22 that determines the type of text data, the expression correction unit 23 that calibrates the expression of text data, the document configuration unit 24 that arranges the appearance of the text data document structure, and characters Character string extraction means 25 for extracting a string is realized. As will be described below, the expression proofreading means 23 proofreads the expression of the application software 27 such as word processor software, and the document composition means 24 similarly uses the display screen of the application software 27. It is preferred that

図３は、情報処理装置１１１の機能構成図を示す。文書種別判定手段２２はテキストデータ２０から文書種別を判定するので、ＯＳ上ではテキストデータ２０を入力するためのアプリケーションソフトウェア２７が実行されている。アプリケーションソフトウェア２７は、例えば、ワープロソフトウェア、表計算ソフトウェア、電子メールソフトウェア等、テキストデータ２０を入力しうるものであればよい。 FIG. 3 shows a functional configuration diagram of the information processing apparatus 111. Since the document type determination unit 22 determines the document type from the text data 20, application software 27 for inputting the text data 20 is executed on the OS. The application software 27 may be any software that can input the text data 20, such as word processing software, spreadsheet software, or e-mail software.

文字列抽出手段２５及び文書種別判定手段２２は、情報処理装置１１１の起動と共に実行されるか、アプリケーションソフトウェア２７の起動に伴い実行される。また、表現校正手段２３及び文書構成手段２４は、ユーザの所定の操作により起動する。 The character string extraction unit 25 and the document type determination unit 22 are executed when the information processing apparatus 111 is started or when the application software 27 is started. The expression proofreading means 23 and the document composition means 24 are activated by a predetermined operation by the user.

キーボード１１２は指でキーを操作することで情報処理装置１１１に文字を入力する。ユーザがキーを押すと、対応するキーコード（例えば、ＡＳＣＩＩコード）が情報処理装置１１１に送られる。キーコードは例えばＢＩＯＳ（Basic Input Output System）により対応する文字コードに変換される。なお、キーボード１１２はタッチパネルや手書き文字入力、音声入力等、文字に対応したキーコードを発生するものであればよい。 The keyboard 112 inputs characters to the information processing apparatus 111 by operating keys with a finger. When the user presses a key, a corresponding key code (for example, an ASCII code) is sent to the information processing apparatus 111. The key code is converted into a corresponding character code by, for example, BIOS (Basic Input Output System). The keyboard 112 only needs to generate a key code corresponding to characters, such as a touch panel, handwritten character input, and voice input.

テキストを日本語で入力する場合、ユーザの操作によりＩＭ（ＩｎｐｕｔＭｅｔｈｏｄ）２１が起動し、文字コードはＩＭ２１により日本語に変換される。変換せずに直接テキストを入力する場合は、文字コードは直接アプリケーションソフトウェア２７及び文書種別判定手段２２に入力される。 When text is input in Japanese, IM (Input Method) 21 is activated by a user operation, and the character code is converted into Japanese by IM 21. When text is directly input without conversion, the character code is directly input to the application software 27 and the document type determination unit 22.

ＩＭ２１は、一連の文字コードを日本語等の言語に変換する、いわゆる、かな漢字変換システムである。例えば、キーボード１１２から「Ｎ・Ｏ・Ｕ・Ｋ・Ｉ」と入力された場合、ＩＭ２１は辞書を参照して一連の文字コードを「のうき」と変換すると共に、所定の操作や設定に応じて「納期」や「農機」等に変換することを可能にする。「納期」と変換した場合には「納」「期」それぞれに対応する２つの文字コードが生成される。なお、文字コードは、Ｕｎｉｃｏｄｅ、ＪＩＳコード、シフトＪＩＳコード等、いずれであってもよい。以下、文書種別を判定する言葉を種別判定語という。 The IM 21 is a so-called kana-kanji conversion system that converts a series of character codes into a language such as Japanese. For example, when “N, O, U, K, I” is input from the keyboard 112, the IM 21 refers to the dictionary and converts a series of character codes into “noki”, and according to a predetermined operation or setting. It is possible to convert it to “delivery date” or “agricultural machinery”. When converted to “delivery date”, two character codes corresponding to “delivery” and “date” are generated. The character code may be any of Unicode, JIS code, shift JIS code, and the like. Hereinafter, a word for determining the document type is referred to as a type determination word.

文字列抽出手段２５は、所定の文字又は文字列を抽出する。文字列抽出手段２５は、テキストデータ２０の文字コードを１語１語参照し、テキストデータ２０から文書種別情報ＤＢ１３１に登録されている種別判定語を抽出する。また、辞書を参照しながらテキストデータ２０の例えば一文ごとに周知の日本語構文解析を行い係り受け関係を抽出し、文節の区分や自立語を抽出してから種別判定語を抽出してもよい。 The character string extraction means 25 extracts a predetermined character or character string. The character string extraction unit 25 refers to the character code of the text data 20 word by word, and extracts the type determination word registered in the document type information DB 131 from the text data 20. Further, referring to the dictionary, for example, each sentence of the text data 20 may be subjected to well-known Japanese syntax analysis to extract the dependency relationship, and the category determination word may be extracted after extracting the segment classification and independent words. .

そして、文書種別判定手段２２は、文書種別情報ＤＢ１３１に登録された種別判定語に対応つけられた文書種別をテキストデータ２０の文書種別として判定する。 Then, the document type determination unit 22 determines the document type associated with the type determination word registered in the document type information DB 131 as the document type of the text data 20.

図４は、情報処理装置１１１の機能構成図の他の一例を示す。なお、図４において図３と同一構成部には同一の符号を付しその説明は省略する。図３では、キーボード１１２からテキストデータ２０を入力する形態について説明したが、１つのファイルに含まれるテキストデータ２０からも同様に文書種別を判定することができる。 FIG. 4 shows another example of a functional configuration diagram of the information processing apparatus 111. 4 that are the same as those in FIG. 3 are assigned the same reference numerals and descriptions thereof are omitted. In FIG. 3, the form in which the text data 20 is input from the keyboard 112 has been described, but the document type can be similarly determined from the text data 20 included in one file.

情報処理装置１１１が電子メール等で受信したテキストデータ２０はいったん記憶装置１２６に記憶される。アプリケーションソフトウェア２７はテキストデータ２０を読み出しディスプレイ１１４に表示したり音声で読み上げたりするが、その際に、文書種別判定手段２２はテキストデータ２０を抽出し、図３と同様に、種別判定語から文書種別を判定することができる。 The text data 20 received by the information processing device 111 by e-mail or the like is temporarily stored in the storage device 126. The application software 27 reads the text data 20 and displays it on the display 114 or reads it out by voice. At that time, the document type determination means 22 extracts the text data 20 and, like FIG. The type can be determined.

〔文書種別の判定〕
文書種別の判定について説明する。図５は、文書種別情報ＤＢ１３１に記憶される情報の一例を示す。図５では、種々の種別判定語に文書種別が対応づけられている。例えば、「調査」、「資料」、「納期」、「企業」、「成果」にはビジネス文書の文書種別が、「謹賀新年」、「明けまして」、「元旦」、「賀正」、「賀春」には年賀状の文書種別が、「ビール」、「飲み放題」、「パーティ」、「歓迎会」、「歌い放題」には案内状の文書種別が、それぞれ対応づけられている。 [Determination of document type]
The determination of the document type will be described. FIG. 5 shows an example of information stored in the document type information DB 131. In FIG. 5, document types are associated with various type determination words. For example, for "Survey", "Document", "Delivery", "Company", and "Results", the document types of business documents are "Tsuruga New Year", "Happy New Year", "New Year's Day", "Kasho", Is associated with the document type of the New Year's card, and the document type of the guide letter is associated with “beer”, “all-you-can-drink”, “party”, “welcome party”, and “all-you-can sing”.

文書種別判定手段２２はテキストデータ２０が含む、文書種別を特徴づける種別判定語に基づき文書種別情報ＤＢ１３１を参照して、それぞれの種別判定語に対応づけられた文書種別を抽出する。そして、例えば、１つのテキストデータ２０毎に文書種別を判定する。 The document type determination unit 22 refers to the document type information DB 131 based on the type determination word characterizing the document type included in the text data 20, and extracts the document type associated with each type determination word. For example, the document type is determined for each piece of text data 20.

なお、１つのテキストデータ２０から異なる文書種別が検出された場合、検出された回数が最も多い文書種別に属すると判定する。文書種別判定手段２２は、メタデータとして文書種別を示す情報をテキストデータ２０に付加する。 When a different document type is detected from one text data 20, it is determined that it belongs to the document type with the largest number of detections. The document type determination unit 22 adds information indicating the document type as metadata to the text data 20.

図６は、文書種別情報ＤＢ１３１のより詳細な構成を示す図である。図６では文書種別に対応づけて、種別判定語及び種別判定語を構成する文字の文字コードが登録されている。したがって、文書種別判定手段２２は、テキストデータ２０に含まれた種別判定語と文書種別情報ＤＢ１３１の一連の文字コードを比較することで、種別判定語に対応する文書種別を抽出することができる。 FIG. 6 is a diagram showing a more detailed configuration of the document type information DB 131. In FIG. 6, the type determination word and the character code of the characters constituting the type determination word are registered in association with the document type. Therefore, the document type determination unit 22 can extract the document type corresponding to the type determination word by comparing the type determination word included in the text data 20 with a series of character codes in the document type information DB 131.

図７は、文書種別判定手段２２が文書種別を判定する手順を示すフローチャート図である。まず、情報処理装置１１１にテキストデータ２０が入力される。例えば、「競争力のある製品を核にして開拓推進を行う。」というテキストデータ２０が入力される。 FIG. 7 is a flowchart showing a procedure in which the document type determination unit 22 determines the document type. First, text data 20 is input to the information processing apparatus 111. For example, the text data 20 is input “Pursue promotion with competitive products as the core”.

文書種別判定手段２２は、テキストデータ２０の文字コード、８Ｂ２３「競」、９１８８「争」、９７ＣＤ「力」…８Ａ６Ａ「核」…８Ａ４Ａ「開」、９１Ｆ１「拓」…から、文書種別情報ＤＢ１３１に登録された種別判定語に一致する文字列を抽出する（Ｓ２０）。そして、抽出された種別判定語を用いて文書種別を判定する（Ｓ３０）。 The document type determination means 22 uses the character code of the text data 20, 8B23 “competition”, 9188 “conflict”, 97CD “power”... 8A6A “core”... 8A4A “open”, 91F1 “explore”. A character string that matches the type determination word registered in (2) is extracted (S20). Then, the document type is determined using the extracted type determination word (S30).

テキストデータ２０から「核」、「開拓」、「推進」という種別判定語が抽出されるが、これらはビジネス文書の文書種別に対応づけられているので、文書種別判定手段２２はこのテキストデータ２０をビジネス文書の文書種別と判定する。 The type determination words “nuclear”, “development”, and “promotion” are extracted from the text data 20. Since these are associated with the document type of the business document, the document type determination unit 22 uses the text data 20. Is determined as the document type of the business document.

〔表現校正〕
文書種別に応じた表現校正について説明する。表現校正手段２３は文書種別判定手段２２が判定した文書種別に応じて、テキストデータ２０の表現を校正する。 [Proofreading]
The expression proofreading according to the document type will be described. The expression proofreading means 23 proofreads the expression of the text data 20 according to the document type determined by the document type determining means 22.

ビジネス文書、年賀状、プレゼンテーション資料、歓送迎会等の案内状、では好ましい表現が決まっていたり、使用すべきでない表現が知られている（以下、被修正表現という）。本実施形態では、文書種別が検出された後、テキストデータ２０に被修正表現が含まれていた場合、被修正表現を校正する。被修正表現と校正後の表現は、校正情報ＤＢ１３２に記憶されている。 In business documents, New Year's cards, presentation materials, greeting cards such as welcome and farewell parties, expressions that are preferred or should not be used are known (hereinafter referred to as corrected expressions). In the present embodiment, after the document type is detected, if the corrected expression is included in the text data 20, the corrected expression is calibrated. The corrected expression and the expression after proofreading are stored in the proofreading information DB 132.

図８は、校正情報ＤＢ１３２に記憶される情報の一例を示す。校正情報ＤＢ１３２では、文書種別毎に、被修正表現と校正後の表現が対応づけて記憶されている。例えば、ビジネス文書の場合、「開拓を行う」という被修正表現に「開拓推進」という表現が対応づけられており、また、「調査を行う」という被修正表現に「精査」という表現が対応づけられている。このように誤用ではない表現でも、よりビジネスに適した表現に校正することができる。 FIG. 8 shows an example of information stored in the calibration information DB 132. In the proofreading information DB 132, the corrected expression and the proofread expression are stored in association with each document type. For example, in the case of a business document, the expression “promotion of development” is associated with the corrected expression of “exploring”, and the expression “scrutinized” is associated with the corrected expression of “investigating”. It has been. Thus, even expressions that are not misused can be proofread to expressions more suitable for business.

同様に、マナー上、修正した方がよい被修正表現について校正後の表現を対応づけておくことができる。
・お名前を頂戴できますでしょうか → お名前を伺えますでしょうか
・おまちください → おまちいただけますか
・後で → 後ほど
・一応 → 念のため
・今 → ただ今
・すぐ → 早速
・すごく → 非常に
・調査を行う → 精査する
・多分 → 推測するところ
・だれ → どなた
・どこ → どちら
・どう → どのように
・前に → 以前に
また、例えば年賀状の場合、漢字で記載した方が好ましい被修正表現に、校正後の表現を対応づけられている。
・あけまして → 明けまして
また、年賀状では、マナー上、修正した方がよい被修正表現に、校正後の表現が対応づけられている。「迎春」は簡略表現なので目上の人に対しては好ましくない。
・迎春 → 謹賀新年
また、年賀状で使用しがちな二重表現となる被修正表現に、校正後の表現が対応づけられている。「新年あけましておめでとうございます」は、「新年」と「明けまして」の意味が重複する。
・新年あけましておめでとうございます → 新年おめでとうございます
また、年賀状では忌み言葉を避けるのがマナーであるので、忌み言葉に当たる被修正表現には校正後の表現が対応づけられている。
・去年 → 昨年
・枯れる → 乾燥する
・滅びる → なくなる
表現校正手段２３は、文書種別に応じて校正情報ＤＢ１３２を参照して、テキストデータ２０に含まれる被修正表現を校正後の表現に置き換える。図９は、表現校正手段２３が文書種別に応じて表現を校正する手順を示すフローチャート図である。 Similarly, the corrected expression that should be corrected in manners can be associated with the corrected expression.
・ Can you give me your name → Can you ask me for your name ・ Please wait → Can I wait for you ・ Later → Later ・ For the time being → Just in case ・ Now → Right now → Immediately → Very → Very Investigate-> scrutinize-maybe-guess-who-who-who-where-how-how-before-before Also, for example, in the case of New Year cards An expression after proofreading is associated with the expression.
・ New Year → New Year In the New Year's card, the corrected expression should be corrected in the manners and should be corrected. “Welcome Spring” is a simple expression and is not desirable for the superior.
・ New Year's greeting → Tsuruga New Year In addition, the expression after proofreading is associated with the corrected expression that is the double expression that is often used in New Year's cards. “Happy New Year” has the same meaning as “New Year” and “Happy New Year”.
・ Happy New Year → Congratulations on the New Year Also, in New Year's cards, since it is a manner to avoid the abomination, the corrected expression corresponding to the apocalypse is associated with the corrected expression.
Last year → Last year, withering → Drying and ruining → Loss The expression proofreading means 23 refers to the proofreading information DB 132 according to the document type, and replaces the corrected expression included in the text data 20 with the expression after proofreading. FIG. 9 is a flowchart showing a procedure for the expression proofreading means 23 to calibrate the expression according to the document type.

まず、情報処理装置１１１にテキストデータ２０が入力される（Ｓ１０）。例えば、「競争力のある製品を核にして開拓を行う。」というテキストデータ２０が入力される。また、表現校正手段２３は文書種別判定手段２２が判定した文書種別を取得する（Ｓ１１０）。そして、校正情報ＤＢ１３２を参照して、被修正表現を校正後の表現に校正する(Ｓ１２０)。 First, text data 20 is input to the information processing apparatus 111 (S10). For example, the text data 20 is input that “development is based on competitive products”. Further, the expression proofreading unit 23 acquires the document type determined by the document type determining unit 22 (S110). Then, the corrected expression is calibrated to the proofread expression with reference to the proofreading information DB 132 (S120).

なお、表現校正手段２３はアプリケーションソフトウェア２７に、被修正表現を校正後の表現に置き換えるよう要求する。 The expression proofing means 23 requests the application software 27 to replace the corrected expression with the proofread expression.

図１０は、ディスプレイ１１４に表示されるテキストデータ２０の構成例を示す図である。校正前のテキストデータ２０は「競争力のある製品を核にして開拓を行う。」であるが、「開拓を行う」がビジネス文書では被修正表現なので、テキストデータ２０は「競争力のある製品を核にして開拓推進する。」に校正される。校正後の表現は、ユーザが把握できるよう下線を付されたり四角で囲まれたり、反転表示したり、色を変えて表現される。なお、校正後の表現を表示し、ユーザが校正を許可したら校正してもよい。 FIG. 10 is a diagram illustrating a configuration example of the text data 20 displayed on the display 114. The text data 20 before proofreading is “pioneering with competitive products as the core.” However, since “development” is a corrected expression in business documents, the text data 20 is “competitive products. Will be pioneered and promoted. ” The expression after proofreading is underlined, surrounded by a rectangle, displayed in reverse video, or displayed in a different color so that the user can grasp it. Note that the expression after calibration may be displayed, and calibration may be performed if the user permits calibration.

また、被修正表現によっては校正後の表現に複数の候補がある場合があるが、この場合は、複数の候補を選択可能とすることが好ましい。図１１は校正後の表現の複数の候補が表示されたテキストデータ２０の一例を示す。例えば、表現校正手段２３は被修正表現「開拓を行う」に下線を付したり四角で囲む等して表示し、ユーザがマウス１１３で右クリックすると複数の候補を表示する。ユーザは複数の候補の中からテキストデータ２０に適切な表現を選択できる。 Further, depending on the corrected expression, there may be a plurality of candidates in the expression after proofreading. In this case, it is preferable that a plurality of candidates can be selected. FIG. 11 shows an example of text data 20 in which a plurality of candidates for expression after proofreading are displayed. For example, the expression proofing means 23 displays the corrected expression “explore” underlined or surrounded by a square, etc., and displays a plurality of candidates when the user right-clicks with the mouse 113. The user can select an appropriate expression for the text data 20 from a plurality of candidates.

〔文書構成の体裁の調整〕
文書種別に応じた文書構成の体裁の調整について説明する。文書構成手段２４は、文書種別判定手段２２が判定した文書種別に応じて、テキストデータ２０の文書構成の体裁を整える。 [Adjustment of document structure]
The adjustment of the appearance of the document structure according to the document type will be described. The document composition unit 24 arranges the format of the document structure of the text data 20 according to the document type determined by the document type determination unit 22.

ビジネス文書、年賀状、プレゼンテーション資料、歓送迎会等の案内状、では好ましい文書構成が決まっている場合が多い。本実施形態では、文書種別に応じて種々の文書構成を予め用意しておき、その文書構成にテキストデータ２０の文書構成の体裁を整えることで、ユーザが文書構成を設定する煩わしさを低減する。文書種別毎の文書構成を指定する文書構成情報は文書構成情報ＤＢ１３３に記憶されている。 Business documents, New Year's cards, presentation materials, and invitations such as welcome and farewell parties often have favorable document configurations. In the present embodiment, various document configurations are prepared in advance according to document types, and the document configuration of the text data 20 is arranged in the document configuration, thereby reducing the troublesomeness of the user setting the document configuration. . Document configuration information for specifying a document configuration for each document type is stored in the document configuration information DB 133.

＜ビジネス文書＞
図１２は、ビジネス文書の文書構成の一例を示す。図１２の文書構成例は、例えばＡ４の用紙の領域にテキストデータ２０を配置するよう、複数の文字配置欄３１〜３６を有する。 <Business document>
FIG. 12 shows an example of the document configuration of a business document. The document configuration example of FIG. 12 includes a plurality of character arrangement fields 31 to 36 so that the text data 20 is arranged on, for example, an A4 sheet area.

文字配置欄３１〜３６は、用紙に対する左上のコーナの位置が定められていると共に、フォント、文字のサイズ、太字・細字、センタリング、行間隔、文字間隔、文字色、等、予め設定されている。なお、「件名欄」など「」で囲まれた文字列はテキストデータ２０を配置すると自動的に削除される。 In the character arrangement fields 31 to 36, the position of the upper left corner with respect to the paper is determined, and font, character size, bold / thin, centering, line spacing, character spacing, character color, etc. are set in advance. . It should be noted that a character string surrounded by “” such as “subject field” is automatically deleted when the text data 20 is arranged.

例えば、文字配置欄３１は、用紙の右上に配置され、文書構成手段２４がテキストデータ２０の文書構成の体裁を整える日付をＯＳから取得して設定する。また、文字配置欄３１にはテキストデータ２０の作成者の氏名が配置される。作成者の氏名は、情報処理装置１１１にログインしたユーザの氏名が自動的に取得される。 For example, the character arrangement field 31 is arranged on the upper right side of the sheet, and the document composition unit 24 acquires and sets the date when the document composition of the text data 20 is arranged from the OS. Further, the name of the creator of the text data 20 is arranged in the character arrangement column 31. As the name of the creator, the name of the user who has logged into the information processing apparatus 111 is automatically acquired.

文字配置欄３２は、テキストデータ２０の件名を配置する欄で、例えば、ゴシック体で１６〜２０ポイントの文字をセンタリングして配置する欄である。文字配置欄３３は、テキストデータ２０の概略、背景、要約等を配置する欄で、例えば、明朝体で１０．５ポイントの文字を左詰して配置する欄である。また、文字配置欄３４，３５は、テキストデータ２０が伝達する核となる内容を配置する欄で、例えば、明朝体で１０．５ポイントの文字を左詰して配置する欄である。文字配置欄３４、３５は、テキストデータ２０によっては複数存在した方が便利であるため、図示するように複数用意されている（図では２個）。また、箇条書きのテキストデータ２０に対応するため文字配置欄３５には、番号（１）〜（３）が予め設定されている。また、文字配置欄３６は、定型的に通知する内容を配置する欄で、例えば、明朝体で１０．５ポイントの文字をセンタリングして配置する欄である。図ではさらに文字配置欄３６を影つきの四角で囲むことで、通知する内容に視認しやすくしている。なお、この他、テキストデータ２０を提出する宛先を示す欄、テキストデータ２０の内容の問い合わせ先を示す欄、Ｊｐｅｇなど文字以外のオブジェクトを貼り付ける欄、等を設けてもよい。 The character arrangement column 32 is a column in which the subject of the text data 20 is arranged, for example, a column in which characters of 16 to 20 points are centered and arranged in a Gothic style. The character arrangement column 33 is a column in which outlines, backgrounds, summaries, and the like of the text data 20 are arranged. For example, the character arrangement column 33 is a column in which 10.5-point characters are left-justified in the Mincho style. In addition, the character arrangement columns 34 and 35 are columns in which the core contents transmitted by the text data 20 are arranged. For example, the character arrangement columns 34 and 35 are columns in which 10.5 point characters are left-justified and arranged in the Mincho style. Since it is more convenient to have a plurality of character arrangement fields 34 and 35 depending on the text data 20, a plurality of character arrangement columns 34 and 35 are prepared as shown (two in the figure). Further, numbers (1) to (3) are set in advance in the character arrangement column 35 in order to correspond to the itemized text data 20. The character arrangement column 36 is a column for arranging the contents to be notified in a typical manner, for example, a column in which 10.5 point characters are centered and arranged in the Mincho style. In the figure, the character arrangement column 36 is surrounded by a shaded square so that the contents to be notified are easily visible. In addition, a column indicating a destination to which the text data 20 is submitted, a column indicating an inquiry destination of the contents of the text data 20, a column for pasting an object other than a character such as Jpeg, and the like may be provided.

文書構成手段２４は、テキストデータ２０を解析し、文字配置欄３１〜３６にテキストデータ２０を配置する。図１３は、文書構成手段２４が、テキストデータ２０を文書構成情報に従い構成する手順を示すフローチャート図である。 The document construction unit 24 analyzes the text data 20 and arranges the text data 20 in the character arrangement fields 31 to 36. FIG. 13 is a flowchart showing a procedure in which the document composition unit 24 composes the text data 20 according to the document composition information.

すでにユーザは、テキストデータ２０をワープロソフトウェアなどのアプリケーションソフトウェア２７で編集中であり、文書種別判定手段２２により文書種別はビジネス文書であると判定されている。そして、ユーザが所定のメニューから文書構成手段２４を起動すると図１３のフローチャート図がスタートする。なお、文書構成の体裁の調整を容易にするため、テキストデータ２０の所定範囲を選択してから文書構成手段２４を起動してもよい。文書構成手段２４は起動されると、日付とユーザの氏名をＯＳから取得し、文字配置欄３１に配置する。 The user is already editing the text data 20 with application software 27 such as word processing software, and the document type determination unit 22 determines that the document type is a business document. Then, when the user activates the document composition means 24 from a predetermined menu, the flowchart of FIG. 13 starts. In order to facilitate the adjustment of the appearance of the document structure, the document composition means 24 may be activated after a predetermined range of the text data 20 is selected. When the document construction unit 24 is activated, it acquires the date and the name of the user from the OS and places them in the character placement column 31.

ここで、テキストデータ２０は図１４（ａ）に示すように次の文章であったとする。
これからの取組方法
市場調査
競合他社の調査
パンフレット作成
まず、文書構成手段２４は、テキストデータ２０の１行目の末尾に句読点があるか否か判定する（Ｓ２１０）。句読点がない場合は、テキストデータ２０の件名である可能性が高いので、文書構成手段２４は１行目を件名に対応した欄に配置する（Ｓ２２０）。図１２では、文字配置欄３２に「これからの取組方法」が配置される。 Here, it is assumed that the text data 20 is the following sentence as shown in FIG.
Future Approach Method Market Research Competitor Survey Pamphlet Creation First, the document composition means 24 determines whether there is a punctuation mark at the end of the first line of the text data 20 (S210). If there is no punctuation mark, there is a high possibility that the subject of the text data 20 is present, so the document composing means 24 places the first line in the column corresponding to the subject (S220). In FIG. 12, “Future approach” is arranged in the character arrangement column 32.

ついで、文書構成手段２４は、段落に箇条書きがあるか否かを判定する（Ｓ２３０）。なお、段落とは、例えば字下げして始まる行から次に字下げのある行の直前をいう。箇条書きは、各行の最初に、「・」「（１）」「Ｉ」「◆」「Ａ．」等の記号が付されることが多いので、これらの記号が検出された場合、箇条書きがあると判定する。また、箇条書きは行の終わりに句点「。」を付さないので、句点がない場合は箇条書きであると判定する。これらのいずれかを満たす場合に箇条書きであると判定してもよいし、全てを満たす場合に箇条書きであると判定しもよい。 Next, the document composition unit 24 determines whether or not there are bullets in the paragraph (S230). Note that a paragraph refers to, for example, a line immediately before a line with indentation from a line starting with indentation. Bulleted items are often marked with “•”, “(1)”, “I”, “◆”, “A.”, etc. at the beginning of each line. It is determined that there is. In addition, since the bullets do not have a punctuation mark “.” At the end of the line, if there is no punctuation, it is determined that the item is a bullet. When either of these is satisfied, it may be determined that the item is a bullet, and when all of the items are satisfied, it may be determined that the item is a bullet.

また、箇条書きの段落の次の段落は、１行空けて記載されることがあるので、箇条書きの段落は空行の手前までとすることができる。また、箇条書きの次の文が句点「。」で終了している場合は、句点「。」で終了する文の手前の行までを箇条書きを含む段落であると判定する。 In addition, since the paragraph following the bulleted paragraph may be described with one line left blank, the bulleted paragraph may be before the blank line. If the next sentence after the bulleted list ends with a punctuation mark “.”, The line preceding the sentence ending with the punctuation mark “.” Is determined to be a paragraph including the bulleted list.

図１４（ａ）のテキストデータ２０では、「市場調査」「競合他社の調査」「パンフレット作成」に句点「。」がないので、この３行は箇条書きを含む段落と判定される。段落の終了は、テキストデータ２０の終了に一致している。 In the text data 20 of FIG. 14A, since there is no punctuation mark “.” In “market research”, “competitor research”, and “pamphlet creation”, these three lines are determined to be paragraphs including bullets. The end of the paragraph coincides with the end of the text data 20.

段落に箇条書きがある場合（Ｓ２３０のＹｅｓ）、文書構成手段２４はその段落を箇条書きに対応した欄に配置する（Ｓ２４０）。図１２では、箇条書きに対応した文字配置欄３５に「市場調査」「競合他社の調査」「パンフレット作成」が配置される。 If there is a list item in the paragraph (Yes in S230), the document composing unit 24 arranges the paragraph in a column corresponding to the item list (S240). In FIG. 12, “market research”, “competitor research”, and “pamphlet creation” are arranged in the character arrangement column 35 corresponding to the itemized list.

ついで、文書構成手段２４は、「日時」又は「場所」の文字列があるか否かを判定する（Ｓ２５０）。なお、「日時」又は「場所」を含むことに加え、行の終わりに句点「。」が付されていないことを判定基準にくわえてもよい。 Next, the document construction unit 24 determines whether or not there is a character string “date and time” or “location” (S250). In addition to including “date and time” or “location”, it may be added to the determination criterion that a period “.” Is not added at the end of the line.

「日時」又は「場所」の文字列がある場合（Ｓ２５０のＹｅｓ）、日時、場所を含む段落は、定型的に通知する内容の段落であるので、文書構成手段２４は、図１２の文字配置欄３６に配置する（Ｓ２７０）。 If there is a character string “date and time” or “place” (Yes in S250), the paragraph including the date and time and the place is a paragraph with the contents to be notified in a typical manner, and therefore the document composing means 24 performs the character arrangement of FIG. It arranges in the column 36 (S270).

文書構成手段２４は、「日時」の後に連続して含まれる文字列（例えば２０ＸＸ年１月１日）、「場所」の後に連続して含まれる文字列（例えばＸＸ公園）を通知欄に配置する。 The document composing unit 24 arranges a character string (for example, January 1, 20XX) continuously included after “date and time” and a character string (for example, XX park) continuously included after “location” in the notification column. To do.

段落に「日時」又は「場所」の文字列がない場合（Ｓ２５０のＮｏ）、文書構成手段２４は上方の文字配置欄から順番に段落を配置する（Ｓ２６０）。図１２では、文字配置欄３３，３４に配置される。 If there is no “date” or “place” character string in the paragraph (No in S250), the document composing unit 24 arranges the paragraphs in order from the upper character arrangement column (S260). In FIG. 12, they are arranged in the character arrangement fields 33 and 34.

文書構成手段２４は以上の処理を段落毎にテキストデータ２０が終了するまで繰り返す（Ｓ２８０）。 The document composing unit 24 repeats the above processing for each paragraph until the text data 20 is completed (S280).

図１４（ｂ）は、文書構成手段２４が図１４（ａ）のテキストデータ２０を文書構成した結果の一例を示す。「これからの取組方法」は文字配置欄３２に配置されたので、大きめの文字かつ太文字に変更され、「市場調査」「競合他社の調査」「パンフレット作成」は文字配置欄３５に配置されたので、行頭に（１）〜（３）の番号が付与されている。 FIG. 14B shows an example of a result of the document composition unit 24 composing the text data 20 of FIG. 14A. Since “Future approach” has been placed in the character placement field 32, it has been changed to a larger and bolder character, and “market research”, “competitor research”, and “pamphlet creation” have been placed in the character placement field 35. Therefore, the numbers (1) to (3) are given to the beginning of the line.

したがって、ユーザがテキストデータ２０を入力するだけで、文書種別が判定され、表現が校正されると共に、文書種別に応じてテキストデータ２０を適切に文書構成することができる。 Therefore, only by the user inputting the text data 20, the document type is determined, the expression is proofread, and the text data 20 can be appropriately composed according to the document type.

ところで、図１２のように紙面全体の文書構成を決定するのでなく、段落毎やユーザが選択した範囲など、紙面の一部のみの文書構成の体裁を整えてもよい。この場合、文書構成情報ＤＢ１３３には、文字配置欄３２〜３６が個別に登録されていて、ユーザの操作に応じて文字配置欄３２〜３６と同様の文書構成ボックスが一覧表示され、ユーザの選択に応じて、テキストデータ２０が選択された文書構成ボックスのいずれかの文書構成の体裁に整える。
図１５は、ディスプレイ１１４に表示されたテキストデータ２０と文書構成ボックス４１〜４４の一例を示す。ユーザが所定のメニューから文書構成手段２４を起動すると、アプリケーションソフトウェア２７がフレームに分割され、文書構成ボックス４１〜４４が表示される。 By the way, instead of determining the document configuration of the entire page as shown in FIG. 12, the appearance of the document configuration of only a part of the page, such as each paragraph or a range selected by the user, may be arranged. In this case, the character arrangement columns 32 to 36 are individually registered in the document configuration information DB 133, and a document configuration box similar to that of the character arrangement columns 32 to 36 is displayed in a list according to the user's operation. Accordingly, the text data 20 is arranged in the format of any document configuration in the selected document configuration box.
FIG. 15 shows an example of the text data 20 and the document configuration boxes 41 to 44 displayed on the display 114. When the user activates the document composition means 24 from a predetermined menu, the application software 27 is divided into frames and document composition boxes 41 to 44 are displayed.

文書構成ボックス４１〜４４は、「件名用」など概略の用途を表示すると共に、選択を容易にするため各文書構成ボックス４１〜４４のフォント、太字・細字、文字のサイズ、センタリング・左詰め・右詰め、等が表示されている。また、実際にテキストデータ２０の文書構成の体裁を整えた場合の例として、「×××…」で示す文字列が表示されている。ユーザは、文書構成ボックス４１〜４４からテキストデータ２０に適切な文書構成を選択することができる。 The document configuration boxes 41 to 44 display a general use such as “for subject”, and for easy selection, the font, bold / fine font, character size, centering / left justified / Right justified, etc. are displayed. Further, as an example when the document structure of the text data 20 is actually arranged, a character string indicated by “XXX...” Is displayed. The user can select an appropriate document configuration for the text data 20 from the document configuration boxes 41 to 44.

＜年賀状＞
続いて、年賀状の文書構成について説明する。図１６は、年賀状の文書構成の一例を示す。図１６の文書構成例は、例えばハガキ内の領域にテキストデータ２０を配置するよう複数の文字配置欄３７〜３９を有し、また、イラスト欄４０を有する。 <New Year's card>
Next, the document structure of New Year's cards will be described. FIG. 16 shows an example of a document structure for New Year's cards. The document configuration example of FIG. 16 has a plurality of character arrangement columns 37 to 39 and an illustration column 40 so that the text data 20 is arranged in an area in a postcard, for example.

文字配置欄３７〜３９は、ハガキに対する左上のコーナの位置が定められていると共に、フォント、文字のサイズ、太字・細字、行間隔、文字間隔、文字色、等、予め設定されている。 In the character arrangement fields 37 to 39, the position of the upper left corner with respect to the postcard is determined, and font, character size, bold / thin character, line interval, character interval, character color, and the like are set in advance.

例えば、文字配置欄３７は、新年の挨拶を配置する欄で、毛筆体で２０〜２４ポイントの文字を配置する欄である。また、文字配置欄３８は、その他の文章を配置する欄で、例えば、毛筆体で１２ポイントの文字を配置する欄である。文字配置欄３９は、新年の西暦を設定する欄で、文書構成手段２４がＯＳから取得した西暦に１を足して設定する。 For example, the character arrangement column 37 is a column in which a New Year greeting is arranged, and is a column in which characters of 20 to 24 points are arranged with a brush. The character arrangement column 38 is a column for arranging other sentences, for example, a column for arranging 12-point characters with a brush. The character arrangement column 39 is a column for setting the year of the New Year, and is set by adding 1 to the year acquired by the document composition means 24 from the OS.

また、イラスト欄４０は、JPEG、GIF、TIFF等の画像データを配置する欄である。イラスト欄４０に配置する画像データは予め文書構成情報ＤＢ１３３に登録されている。ユーザが優先的にイラスト欄４０に配置する画像データを設定しておいてもよいし、干支に応じて自動的に配置してもよい。文書構成情報ＤＢ１３３には、干支毎の画像データが記憶されている。また、画像データの好みはユーザによって異なるので、年齢に対応づけて干支の画像データが記憶されている。例えば、年齢層が低いユーザ向けに、干支の動物を擬人化したアニメーション的な画像データが記憶されており、年齢層が高いユーザ向けに、干支の動物のイラストに松をモチーフにしたイラストがあしらわれた画像データが記憶されている。なお、ユーザの年齢は情報処理装置１１１に登録されている。 The illustration column 40 is a column for arranging image data such as JPEG, GIF, and TIFF. Image data to be arranged in the illustration column 40 is registered in the document configuration information DB 133 in advance. The user may set image data to be preferentially arranged in the illustration column 40, or may be automatically arranged according to the zodiac signs. The document configuration information DB 133 stores image data for each zodiac. Moreover, since preference of image data changes with users, the zodiac image data is stored in association with the age. For example, animated image data that anthropomorphizes animals of the zodiac are stored for users of lower age groups, and illustrations with pine motifs are used for illustrations of animals of the zodiac for users of older age groups. Stored image data. Note that the user's age is registered in the information processing apparatus 111.

文書構成手段２４は、テキストデータ２０を解析し、文字配置欄３７、３８にテキストデータ２０を配置する。図１７は、文書構成手段２４がテキストデータ２０を文書構成する手順を示すフローチャート図である。 The document construction unit 24 analyzes the text data 20 and arranges the text data 20 in the character arrangement fields 37 and 38. FIG. 17 is a flowchart showing a procedure in which the document construction unit 24 composes the text data 20.

すでにユーザは、テキストデータ２０をワープロソフトウェアなどのアプリケーションソフトウェア２７で編集中であり、文書種別判定手段２２により文書種別は年賀状であると判定されている。そして、ユーザが所定のメニューから文書構成手段２４を起動すると図１７のフローチャート図がスタートする。なお、文書構成の体裁の調整を容易にするため、テキストデータ２０の所定範囲を選択してから文書構成手段２４を起動してもよい。文書構成手段２４は起動されると、西暦をＯＳから取得し、文字配置欄３９に配置する。 The user is already editing the text data 20 with application software 27 such as word processing software, and the document type determination means 22 determines that the document type is a New Year's card. Then, when the user activates the document composing means 24 from a predetermined menu, the flowchart of FIG. 17 starts. In order to facilitate the adjustment of the appearance of the document structure, the document composition means 24 may be activated after a predetermined range of the text data 20 is selected. When the document construction unit 24 is activated, it acquires the year from the OS and places it in the character placement field 39.

ここで、テキストデータ２０は図１８（ａ）に示すように次の文章であったとする。
謹賀新年
旧年中は大変お世話になりました
今年もよろしくお願い致します
まず、文書構成手段２４は、テキストデータ２０から新年の挨拶を検出する（Ｓ３１０）。文書種別が年賀状であるので、謹賀新年、賀正などの新年の挨拶が検出される。 Here, it is assumed that the text data 20 is the following sentence as shown in FIG.
Thank you very much for your help during the old year of Tsuruga New Year. First of all, the document composing means 24 detects the greeting of the New Year from the text data 20 (S310). Since the document type is a New Year's card, New Year greetings such as Tsuruga New Year and Kasho are detected.

そして、文書構成手段２４は検出した新年の挨拶を対応した欄に配置する（Ｓ３２０）。図１６では、文字配置欄３７に「謹賀新年」が配置される。 Then, the document composing unit 24 arranges the detected New Year greeting in the corresponding column (S320). In FIG. 16, “Tsuruga New Year” is arranged in the character arrangement column 37.

ついで、文書構成手段２４は、その他の文章を対応する欄に配置する（Ｓ３３０）。図１８（ａ）のテキストデータ２０では、「旧年中は大変お世話になりました今年もよろしくお願い致します」が、文字配置欄３８に配置される。「謹賀新年」やその他の文のフォント等をユーザの年齢層に適当なフォント等で記載してもよい。 Next, the document construction unit 24 arranges other sentences in the corresponding columns (S330). In the text data 20 of FIG. 18A, “Thank you very much during the old year, thank you again this year” is placed in the character placement column 38. The font of “Tsuruga New Year” and other sentences may be described in a font suitable for the age group of the user.

ついで、文書構成手段２４は、干支に応じた画像データをイラスト欄４０に配置する（Ｓ３４０）。文書構成手段２４は、西暦から対応する干支を算出し、ユーザの年齢層に適当な画像データを文書構成情報ＤＢ１３３から抽出して、イラスト欄４０に配置する。画像データの大きさとイラスト欄４０の大きさが一致しない場合は、拡大又は縮小してもよい。 Next, the document construction unit 24 arranges image data corresponding to the zodiac signs in the illustration column 40 (S340). The document construction unit 24 calculates the corresponding zodiac from the year, extracts image data appropriate for the user's age group from the document construction information DB 133, and places it in the illustration column 40. If the size of the image data does not match the size of the illustration field 40, the image data may be enlarged or reduced.

図１８（ｂ）は、文書構成手段２４が図１８（ａ）のテキストデータ２０を文書構成した結果の一例を示す。「謹賀新年」は文字配置欄３７に配置されたので、毛筆体かつ大きめの文字で配置されている。イラスト欄４０には干支（卯年の場合）にちなんでウサギの画像データが配置されている。 FIG. 18B shows an example of a result of document composition of the text data 20 of FIG. Since “Tsuruga New Year” is arranged in the character arrangement field 37, it is arranged with a brush and a large character. In the illustration column 40, rabbit image data is arranged after the zodiac (in the case of leap years).

〔変形例〕
上述した実施形態では情報処理装置１１１が文書種別を判定し、表現を校正し、また、文書構成の体裁を整えたが、画像形成装置１１０が同様な処理を行ってもよい。 [Modification]
In the embodiment described above, the information processing apparatus 111 determines the document type, proofreads the expression, and arranges the appearance of the document configuration. However, the image forming apparatus 110 may perform the same processing.

画像形成装置１１０はコンピュータを搭載しているのでプログラム１３４を実行することで、文書種別判定手段２２、表現校正手段２３、文書構成手段２４及び文字列抽出手段２５として機能できる。 Since the image forming apparatus 110 is equipped with a computer, it can function as the document type determination means 22, the expression proofreading means 23, the document composition means 24, and the character string extraction means 25 by executing the program 134.

テキストデータ２０を例えば印刷する場合、テキストデータ２０は文字コードのまま画像形成装置１１０に送信される場合と、情報処理装置１１１でラスタデータに変換されてから画像形成装置１１０に送信される場合があるが、オフィスユースでは文字コードのまま画像形成装置１１０に送信されるので、画像形成装置１１０は上述した実施形態と同様に文書種別を判定し、表現を校正し、また、文書構成の体裁を整えることができる。 For example, when printing the text data 20, the text data 20 may be transmitted to the image forming apparatus 110 as a character code, or may be transmitted to the image forming apparatus 110 after being converted into raster data by the information processing apparatus 111. However, in office use, since the character code is transmitted as it is to the image forming apparatus 110, the image forming apparatus 110 determines the document type, calibrates the expression, and makes the format of the document structure as in the above-described embodiment. Can be arranged.

図１９は、画像形成装置１１０が文書種別を判定する手順のシーケンス図を示す。情報処理装置１１１のアプリケーションソフトウェア２７は画像形成装置１１０にテキストデータ２０の印刷を要求する（Ｓ４１０）。テキストデータ２０の送信時、情報処理装置１１１はテキストデータ２０と共に、文書種別に応じた表現の校正及び文書構成の体裁の調整を要求する情報を添付する。 FIG. 19 shows a sequence diagram of a procedure in which the image forming apparatus 110 determines the document type. The application software 27 of the information processing apparatus 111 requests the image forming apparatus 110 to print the text data 20 (S410). When transmitting the text data 20, the information processing apparatus 111 attaches information requesting proofreading of the expression according to the document type and adjustment of the appearance of the document structure together with the text data 20.

この情報に基づき画像形成装置１１０の文字列抽出手段２５は文字列を抽出し（Ｓ４２０）、文書種別判定手段２２はテキストデータ２０の文書種別を判定する（Ｓ４３０）。 Based on this information, the character string extraction unit 25 of the image forming apparatus 110 extracts a character string (S420), and the document type determination unit 22 determines the document type of the text data 20 (S430).

ついで、表現校正手段２３は文書種別に応じて表現を校正し（Ｓ４４０）、文書構成手段２４は文書構成の体裁を整える（Ｓ４５０）。印刷手段は体裁が整えられた文書構成のテキストデータ２０を印刷することができる（Ｓ４６０）。印刷が終了すると、画像形成装置１１０は印刷終了を示す情報を情報処理装置１１１に送信する（Ｓ４７０）。 Next, the expression proofreading means 23 proofreads the expression according to the document type (S440), and the document composition means 24 arranges the appearance of the document structure (S450). The printing means can print the text data 20 of the document structure with the appearance (S460). When printing ends, the image forming apparatus 110 transmits information indicating the end of printing to the information processing apparatus 111 (S470).

なお、印刷の前に、文書構成手段２４が配置したテキストデータ２０の配置をイメージデータにして情報処理装置１１１に送信し、ユーザが印刷を許可した場合に、調整後の文書構成で印刷してもよい。また、ネットワークＮを介して接続されたサーバにより文書構成の体裁の調整を要求し、印刷のみを画像形成装置１１０にて実行してもよい。 Before printing, the arrangement of the text data 20 arranged by the document composition unit 24 is transmitted as image data to the information processing apparatus 111, and when the user permits printing, the document is printed with the adjusted document structure. Also good. Alternatively, the server connected via the network N may request adjustment of the appearance of the document structure, and only the printing may be executed by the image forming apparatus 110.

本変形例によれば、画像形成装置１１０がテキストデータ２０の文書構成の体裁を自動的に調整するので、各情報処理装置１１１が文書種別判定手段２２、表現校正手段２３、文書構成手段２４及び文字列抽出手段２５を備える必要がなく、情報処理装置１１１のコストを低減できる。 According to this modification, since the image forming apparatus 110 automatically adjusts the format of the document structure of the text data 20, each information processing apparatus 111 includes the document type determination unit 22, the expression proofing unit 23, the document configuration unit 24, and the like. It is not necessary to provide the character string extraction means 25, and the cost of the information processing apparatus 111 can be reduced.

本実施例では公序良俗を害するおそれの高い文書種別を判定し、このような文書種別のテキストデータ２０の転送の禁止し、また、公的機関に通報する情報処理装置１１１について説明する。公序良俗を害するおそれの高いテキストデータ２０とは、例えば、読んだ者に羞恥心や不快感を呼び起こさせ、また、世間体を著しく害する単語を含むものである。本実施例では、一例として、迷惑な単語、いじめの単語、ストーカー用単語、反社会性単語を含むテキストデータ２０を、公序良俗を害するおそれがあるものとする。 In this embodiment, an information processing apparatus 111 that determines a document type that is likely to harm public order and morals, prohibits transfer of text data 20 of such document type, and notifies a public organization will be described. The text data 20 having a high possibility of harming public order and morals includes, for example, words that cause the reader to feel shame or discomfort and significantly harm the public body. In this embodiment, as an example, it is assumed that text data 20 including annoying words, bullying words, stalking words, and antisocial words may harm public order and morals.

公序良俗を害するおそれの高いテキストデータ２０は、作成者がテキストデータ２０を作成した以降であれば判定可能となるが、作成者が本実施形態の情報処理装置１１１を適用することは考えにくい。また、公序良俗を害するおそれの高いテキストデータ２０は、主に電子メールで送信されたり、電子掲示板に投稿されることが多い。このため、受信者、電子掲示板の管理人、又は、プロバイダのメールサーバ等が使用する情報処理装置１１１が、文書種別を判定することが考えられる。したがって、電子メールや投稿用のポストデータが作成者の端末から送信された以降であれば、テキストデータ２０の文書種別を判別できる。 The text data 20 that is likely to harm public order and morals can be determined after the creator creates the text data 20, but it is difficult for the creator to apply the information processing apparatus 111 of this embodiment. In addition, the text data 20 that is likely to harm public order and morals is often transmitted mainly by electronic mail or posted on an electronic bulletin board. For this reason, it is conceivable that the information processing apparatus 111 used by the recipient, the administrator of the electronic bulletin board, the mail server of the provider, or the like determines the document type. Therefore, the document type of the text data 20 can be determined after e-mail or post data for posting has been transmitted from the creator's terminal.

ところで、公序良俗を害するおそれが高い文書種別であると判定された場合、受信者に送信する必要性は低く、また、電子掲示板に掲示する必要性も低い。受信者に送信したり、電子掲示板に掲示してしまうと、公序良俗を害するおそれが高いテキストデータ２０を送信者が作成することを助長することにもなる。このため、公序良俗を害するおそれが高いテキストデータ２０は、プロバイダのメールサーバが受信者への送信を禁止することが好ましい。また、公序良俗を害するおそれが高いか否か不明な場合（必ずしも公序良俗を害するとは言えない場合）、例えば、迷惑な単語等を削除して受信者へ送信してもよい。いずれにしても、受信者が公序良俗を害するおそれが高いテキストデータ２０を受信したり、掲示板で見たりして不快な思いをすることを防止できる。 By the way, when it is determined that the document type has a high possibility of harming public order and morals, it is less necessary to transmit to the recipient, and the necessity to post on the electronic bulletin board is also low. If it is sent to the receiver or posted on the electronic bulletin board, it will help the sender to create text data 20 that is likely to harm public order and morals. For this reason, it is preferable that the mail data of the provider prohibit the transmission of the text data 20 that is likely to harm public order and morals to the recipient. In addition, when it is unclear whether there is a high possibility of harming public order and morals (when it cannot be said that public order and morals are harmed), for example, annoying words may be deleted and transmitted to the recipient. In any case, it is possible to prevent the receiver from feeling uncomfortable by receiving the text data 20 that is highly likely to harm public order and morals or looking at the bulletin board.

また、公序良俗を害するおそれが高い電子メール等を送信する送信者を識別する情報（例えば、電子メールの送信者のメールアドレス、ポストデータを送信した端末のＩＰアドレス等）を記録しておけば、犯罪性の高い悪質な電子メールや電子掲示板の送信者の特定に結びつけることができる。なお、テキストデータ２０の文書種別の判定方法は電子メールとポストデータで同じなので、以下では、主に電子メールを例に説明する。 In addition, if information that identifies a sender who sends an e-mail or the like that is likely to harm public order and morals (for example, the e-mail address of the e-mail sender, the IP address of the terminal that sent the post data, etc.) is recorded, It can be linked to the identification of the sender of malicious e-mail or bulletin board with high criminal characteristics. Note that the method for determining the document type of the text data 20 is the same for e-mail and post data, and therefore, e-mail will be mainly described below as an example.

〔機能構成図〕
図２０は、情報処理装置１１１の機能構成図の一例を示す。なお、図２０において図４と同一構成部には同一の符号を付しその説明は省略する。上記のとおり、図２０の情報処理装置１１１は、例えば、プロバイダや携帯電話事業者のＳＭＴＰサーバ、ＰＯＰサーバである。情報処理装置１１１が電子メール等で受信したテキストデータ２０はいったん記憶装置１２６に記憶され、転送する前に文書種別判定手段２２がテキストデータ２０を抽出し、種別判定語から文書種別を判定する。 [Function configuration diagram]
FIG. 20 shows an example of a functional configuration diagram of the information processing apparatus 111. In FIG. 20, the same components as those in FIG. 4 are denoted by the same reference numerals, and the description thereof is omitted. As described above, the information processing apparatus 111 in FIG. 20 is, for example, an SMTP server or a POP server of a provider or a mobile phone operator. The text data 20 received by the information processing device 111 by e-mail or the like is once stored in the storage device 126, and the document type determination means 22 extracts the text data 20 and determines the document type from the type determination word before transferring.

また、転送禁止手段２８は、公序良俗を害するおそれが高いテキストデータ２０の転送を禁止する。転送の禁止とは、テキストデータ２０が電子メールの場合は、例えば、ＳＭＴＰサーバからＰＯＰサーバへの転送の禁止、ＰＯＰサーバから受信者の端末への送信の禁止である。また、テキストデータ２０が電子掲示板への投稿用のポストデータの場合、ポストデータを端末で表示するために端末に送信することを禁止する。 Further, the transfer prohibiting means 28 prohibits the transfer of the text data 20 that is likely to harm public order and morals. When the text data 20 is an e-mail, the prohibition of transfer is, for example, prohibition of transfer from the SMTP server to the POP server, and prohibition of transmission from the POP server to the recipient's terminal. Further, when the text data 20 is post data for posting on the electronic bulletin board, it is prohibited to transmit the post data to the terminal in order to display it on the terminal.

また、公序良俗を害するおそれが高いか否か不明な場合は、表現校正手段２３はテキストデータ２０から種別判定語を削除した後、電子メールの転送を許可する。また、プロパティ情報記録手段２６は、プロパティ情報ＤＢ１３５に公序良俗を害するおそれが高いテキストデータ２０を送信した送信者などのプロパティ情報を記録する。さらに、通報手段２７は、プロパティ情報ＤＢ１３５を参照して、公序良俗を害するおそれが高いテキストデータ２０を多く送信する送信者を警察などの公的機関に通報する。 If it is unclear whether the public order and morals are likely to be harmed, the expression proofreading means 23 deletes the type determination word from the text data 20 and then permits the transfer of the e-mail. Further, the property information recording unit 26 records property information such as a sender who has transmitted the text data 20 having a high possibility of harming public order and morals in the property information DB 135. Further, the reporting unit 27 refers to the property information DB 135 and reports to the public organization such as the police a sender who transmits a large amount of text data 20 that is likely to harm public order and morals.

〔文書種別の判定〕
公序良俗を害するおそれの高いテキストデータ２０の文書種別の判定について説明する。図２１は、文書種別情報ＤＢ１３１に記憶される情報の一例を示す。図２１では文書種別に対応づけて、種別判定語及び種別判定語を構成する文字の文字コードが登録されている。図２１では、公序良俗を害するおそれの高い文書種別として、迷惑文書、いじめ文書、ストーカー文書、反社会文書、を挙げた。 [Determination of document type]
The determination of the document type of the text data 20 that is likely to harm public order and morals will be described. FIG. 21 shows an example of information stored in the document type information DB 131. In FIG. 21, the type determination word and the character code of the character constituting the type determination word are registered in association with the document type. In FIG. 21, nuisance documents, bullying documents, stalking documents, and anti-social documents are listed as document types that are likely to harm public order and morals.

文書種別判定手段２２は、テキストデータ２０に含まれた種別判定語と文書種別情報ＤＢ１３１の一連の文字コードを比較することで、種別判定語に対応する文書種別を抽出することができる。例えば、「エッチ」「淫ら」「人妻」「ホテル直行」には迷惑文書の文書種別が対応づけられており、「死ね」「うざい」「きもい」にはいじめ文書の文書種別が対応づけられており、「会いたい」にはストーカ文書の文書種別が対応づけられており、「拳銃」「ダイナマイト」には反社会文書の文書種別が対応づけられている。 The document type determination unit 22 can extract the document type corresponding to the type determination word by comparing the type determination word included in the text data 20 with a series of character codes in the document type information DB 131. For example, the document type of the nuisance document is associated with “Ecchi”, “Indecent”, “Married Woman”, and “Hotel Direct”, and the document type of the bullying document is associated with “Dead”, “Uzai”, and “Kimoi”. The document type of the stoker document is associated with “I want to meet”, and the document type of the anti-social document is associated with “handgun” and “dynamite”.

文書種別判定手段２２は、種別判定語に基づき文書種別情報ＤＢ１３１を参照して、１つのテキストデータ２０毎に、それぞれの種別判定語に対応づけられた文書種別を抽出する。そして、所定数以上（例えば、３個以上）の種別判定語が抽出された場合、文書種別判定手段２２は、抽出された種別判定語に対応づけられた文書種別であると判定する。なお、１つのテキストデータ２０から異なる文書種別が検出された場合、検出された回数が最も多い文書種別に属すると判定すればよい。本実施例ではどの文書種別と判定しても、それらは公序良俗を害するおそれが高いテキストデータ２０であり、種別判定語が削除されたり、転送が禁止される点で同じであるが、このように厳密に区別しておくことで文書種別毎の処理も可能となる（例えば、通報する公的機関を切り替える）。 The document type determination unit 22 refers to the document type information DB 131 based on the type determination word, and extracts a document type associated with each type determination word for each piece of text data 20. When a predetermined number or more (for example, three or more) of type determination words are extracted, the document type determination unit 22 determines that the document type is associated with the extracted type determination word. When different document types are detected from one text data 20, it may be determined that the document type belongs to the document type with the highest number of detections. In this embodiment, regardless of the document type, they are text data 20 that is highly likely to harm public order and morals, and are the same in that the type determination word is deleted or transfer is prohibited. By strictly distinguishing, processing for each document type can be performed (for example, a public institution to be notified is switched).

また、種別判定語が所定数未満（例えば、３個未満）の場合、文書種別判定手段２２は、公序良俗を害するおそれが高いか否か不明であると判定する。 When the number of type determination words is less than a predetermined number (for example, less than 3), the document type determination unit 22 determines that it is unknown whether there is a high possibility of harming public order and morals.

〔文書種別の判定に応じた処理〕
文書種別判定手段２２が文書種別を判定した結果、テキストデータ２０は、公序良俗を害するおそれがない、公序良俗を害するおそれが高い、又は、公序良俗を害するおそれが高いか否か不明の、３つの態様に区分することができる。公序良俗を害するおそれがない場合、テキストデータ２０はそのまま転送が許可される。 [Processing according to document type determination]
As a result of the document type determination means 22 determining the document type, the text data 20 is classified into three modes that have no possibility of harming public order and morals, high possibility of harming public order and morals, or high possibility of harming public order and morals. Can be classified. If there is no risk of harming public order and morals, the text data 20 is allowed to be transferred as it is.

公序良俗を害するおそれが高いか否か不明の場合、テキストデータ２０から種別判定語が削除される。例えば、「お前うざい、きもい」というテキストデータ２０の場合、テキストデータ２０は受信者は「お前（不適切な表現があるので省略しました）、（不適切な表現があるので省略しました）」というテキストデータ２０を受信することになる。受信者が不快な思いをすることを防止できる。 If it is unclear whether there is a high risk of harming public order and morals, the type determination word is deleted from the text data 20. For example, in the case of the text data 20 of “You are bad”, the recipient of the text data 20 is “You (omitted because there is an inappropriate expression), (Omitted because there is an inappropriate expression)” The text data 20 is received. It is possible to prevent the recipient from feeling uncomfortable.

なお、この場合も、プロパティ情報記録手段２６が、プロパティ情報ＤＢ１３５にこのテキストデータ２０を送信した送信者などのプロパティ情報を記録しておくことができる。公序良俗を害するおそれが高いかどうか不明な場合にも、プロパティ情報ＤＢ１３５に記録することで、例えば悪意のある送信者が、１回に送信するテキストデータ２０に含まれる文書種別判定語の数を少なくし、送信回数を増やしてテキストデータ２０を送信する場合にも、該送信者を抽出することができる。 Also in this case, the property information recording means 26 can record property information such as the sender who transmitted the text data 20 in the property information DB 135. Even if it is unclear whether there is a high possibility of harming public order and morals, by recording in the property information DB 135, for example, a malicious sender can reduce the number of document type determination words included in the text data 20 transmitted at a time. Even when the text data 20 is transmitted by increasing the number of transmissions, the sender can be extracted.

一方、「あの映画きもい、出演者が死ぬ場面も多いし…」というテキストデータ２０の場合、「あの映画（不適切な表現があるので省略しました）、出演者が（不適切な表現があるので省略しました）場面も多いし…」に修正されたテキストデータ２０が受信者に送信される。このテキストデータ２０は、種別判定語はあるが、実際には映画の内容を論評したものであるで、受信者又は受信者から通知された送信者が、メールサーバに元のテキストデータ２０の再送を要求すると、種別判定語が削除されていないテキストデータ２０が受信者に送信される。 On the other hand, in the case of the text data 20 “That movie, there are many scenes where the performer dies…”, “That movie (omitted because there is an inappropriate expression),” the performer (there is an inappropriate expression) The text data 20 corrected to "There are many scenes ..." is transmitted to the recipient. Although this text data 20 has a type determination word, it is actually a comment on the contents of the movie, and the sender or the sender notified from the receiver retransmits the original text data 20 to the mail server. Is requested, the text data 20 from which the type determination word has not been deleted is transmitted to the recipient.

したがって、テキストデータ２０に種別判定語が含まれていても、実際の内容を人間が判別して、元のテキストデータ２０を送信することができる。プロパティ情報をプロパティ情報ＤＢ１３５に記録した場合、再送要求によってプロパティ情報から削除される。 Therefore, even if the text data 20 includes a type determination word, a person can determine the actual contents and transmit the original text data 20. When property information is recorded in the property information DB 135, it is deleted from the property information by a retransmission request.

テキストデータ２０の再送要求が面倒なユーザ、種別判定語を厭わないユーザは、メールサーバに登録される、公序良俗を害するおそれが高いか否か不明なテキストデータ２０の配信にかかるポリシーに、削除せず配信するよう設定することができる。 Users who are troublesome to request re-sending of text data 20 or users who do not care about the type judgment word should be deleted in the policy related to the distribution of text data 20 that is registered in the mail server and has a high possibility of harming public order and morals. It can be set to be delivered.

公序良俗を害するおそれが高い場合、転送禁止手段２８はテキストデータ２０の転送を禁止し、また、プロパティ情報記録手段２６はプロパティ情報ＤＢ１３５にテキストデータ２０のプロパティ情報を記録する。 When there is a high possibility of harming public order and morals, the transfer prohibiting unit 28 prohibits the transfer of the text data 20, and the property information recording unit 26 records the property information of the text data 20 in the property information DB 135.

図２２は、プロパティ情報ＤＢ１３５に記録されるプロパティ情報の一例を示す。図２２に示すように、プロパティ情報ＤＢ１３５に記録されるプロパティ情報は、送信者の電子メールアドレス、受信者（宛先）の電子メールアドレス、文書種別、種別判定語の数、送信日時、等である。したがって、公序良俗を害するおそれが高いテキストデータ２０を送信する送信者及び受信者の電子メールアドレスを検出でき、日時からその頻度、種別判定語の数から悪質さを把握できる。 FIG. 22 shows an example of property information recorded in the property information DB 135. As shown in FIG. 22, the property information recorded in the property information DB 135 includes the sender's email address, the recipient (destination) email address, the document type, the number of type judgment words, the transmission date and time, and the like. . Therefore, it is possible to detect the e-mail addresses of the sender and the receiver who transmit the text data 20 that is likely to harm public order and morals, and to grasp the maliciousness from the frequency and the number of type determination words.

なお、プロパティ情報だけでなく、テキストデータ２０そのものを記録しておくことが好ましい。後述する公的機関への通報時には重要な証拠となるからである。 It is preferable to record not only the property information but also the text data 20 itself. This is because it becomes important evidence when making a report to a public institution described later.

図２３は、文書種別判定手段２２が文書種別を判定する手順を示すフローチャート図である。まず、情報処理装置１１１に電子メールなどのテキストデータ２０が送信される。例えば、テキストデータ２０が「うざい、うざい、うざい、うざい…。」というテキストデータ２０であるとすると、文書種別判定手段２２は、「うざい」の文字コード、８２Ａ４「う」、８２Ｂ４「ざ」、８２Ａ２「い」を検出して、文書種別情報ＤＢ１３１に登録された種別判定語から「うざい」を一致する文字列として抽出する（Ｓ２０）。そして、抽出された種別判定語を用いて文書種別を判定する（Ｓ３０）。 FIG. 23 is a flowchart showing a procedure in which the document type determination unit 22 determines the document type. First, text data 20 such as an electronic mail is transmitted to the information processing apparatus 111. For example, if the text data 20 is the text data 20 of “noisy, noisy, annoying, noisy ...”, the document type determination means 22 uses the character code of “noisy”, 82A4 “us”, 82B4 “za”, 82A2 “I” is detected, and “Uzai” is extracted as a matching character string from the type determination words registered in the document type information DB 131 (S20). Then, the document type is determined using the extracted type determination word (S30).

テキストデータ２０から「うざい」という種別判定語が抽出されるが、これらはいじめ文書の文書種別に対応づけられているので、文書種別判定手段２２はこのテキストデータ２０をいじめ文書の文書種別と判定する。 Although the type determination word “Uzai” is extracted from the text data 20, since these are associated with the document type of the bullying document, the document type determination unit 22 determines that the text data 20 is the document type of the bullying document. To do.

情報処理装置１１１は、公序良俗を害するおそれの程度に応じて、テキストデータ２０を処理する（Ｓ４０）。まず、公序良俗を害するおそれがない場合、文書種別判定手段２２はテキストデータ２０の転送を許可する。 The information processing apparatus 111 processes the text data 20 according to the degree of fear of harming public order and morals (S40). First, when there is no risk of harming public order and morals, the document type determination means 22 permits the transfer of the text data 20.

公序良俗を害するおそれが高い場合、転送禁止手段２８はテキストデータ２０の転送を禁止する（Ｓ６０）。そして、プロパティ情報記録手段２６は、そのテキストデータ２０を送信した送信者の電子メールアドレス、受信者の電子メールアドレス、文書種別、種別判定語の数、送信日時をプロパティ情報ＤＢ１３５に記録する。また、公序良俗を害するおそれが高いかどうか不明な場合、表現校正手段２３はテキストデータ２０から種別判定語を削除する（Ｓ８０）。そして、文書種別判定手段２２はテキストデータ２０の転送を許可する（Ｓ９０）。公序良俗を害するおそれが高いかどうか不明な場合にも、プロパティ情報ＤＢ１３５に記録する。 If there is a high possibility of harming public order and morals, the transfer prohibiting means 28 prohibits the transfer of the text data 20 (S60). Then, the property information recording unit 26 records the sender's email address, the recipient's email address, the document type, the number of type determination words, and the transmission date / time of the sender of the text data 20 in the property information DB 135. If it is unknown whether the public order and morals are likely to be harmed, the expression proofreading means 23 deletes the type determination word from the text data 20 (S80). Then, the document type determination unit 22 permits the transfer of the text data 20 (S90). Even when it is unclear whether there is a high risk of harming public order and morals, it is recorded in the property information DB 135.

〔公的機関への通報〕
図２３のような手順により、公序良俗を害するおそれが高いテキストデータ２０を受信者に送信することを防止できる。しかしながら、送信回数が多い悪質なテキストデータ２０については、警察や管轄省庁などの公的機関へ通報することが好ましい。例えば、電子メールや電子掲示板上の発言でも、刑法上の名誉毀損罪や民法上の不法行為、ストーカー規制法のストーカー行為、等に該当する場合がある。公的機関は種々あるが、文書種別に応じて通報先を切り替えることが好ましい。例えば、迷惑文書、ストーカー文書、反社会文書の場合は警察に、いじめ文書の場合はいじめ相談窓口や文部科学省、教育委員会、校長、等である。 [Reports to public institutions]
With the procedure as shown in FIG. 23, it is possible to prevent the text data 20 that is likely to harm public order and morals from being transmitted to the recipient. However, it is preferable to report the malicious text data 20 having a large number of transmissions to a public institution such as the police or a ministry. For example, a statement on an e-mail or an electronic bulletin board may fall under criminal defamation charges, illegal acts under civil law, stalker acts under the stalker regulation law, and the like. Although there are various public institutions, it is preferable to switch the report destination according to the document type. For example, in the case of nuisance documents, stalker documents, and antisocial documents, it is the police.

通報手段２７は、プロパティ情報ＤＢ１３５に記録された送信者の電子メールアドレス等を解析して、例えば１０回以上の送信のように悪質なテキストデータ２０の送信について、公的機関へ通報する。悪質な送信行為には、例えば次のような態様がある。
ａ）一人の送信者から → 一人の受信者
ｂ）一人の送信者から → 複数の受信者
ｃ）複数の送信者から → 一人の受信者
ｄ）複数の送信者から → 複数の受信者
ａ）の態様は、例えば、迷惑文書、ストーカー文書やいじめ文書のように、特定の加害者Ａが特定の被害者Ｂにテキストデータ２０を送信する態様である。ｂ）の態様は、例えば、特定の加害者Ａが、特定の被害者Ｂの名誉毀損等のため複数の第三者Ｘ〜Ｚにテキストデータ２０を送信する態様、又は、特定の加害者Ａが、反社会文書を複数の第三者Ｘ〜Ｚにテキストデータ２０を送信する態様である。ｃ）の態様は、例えば、いじめ文書のように、特定の複数の加害者Ａ、Ａ'、Ａ''が特定の被害者Ｂにテキストデータ２０を送信する態様である。また、ｄ）の態様は、例えば、特定の加害者Ａ、Ａ'、Ａ''等が、特定の被害者Ｂの名誉毀損等のため複数の第三者Ｘ〜Ｚにテキストデータ２０を送信する態様である。 The reporting means 27 analyzes the sender's e-mail address or the like recorded in the property information DB 135 and reports to the public institution about the transmission of the malicious text data 20 such as 10 or more transmissions. For example, the malicious transmission act has the following modes.
a) From one sender → One recipient b) From one sender → Multiple recipients c) From multiple senders → One recipient d) From multiple senders → Multiple recipients a) This mode is a mode in which the specific perpetrator A transmits the text data 20 to the specific victim B, such as a nuisance document, a stalker document, and a bullying document. The mode of b) is, for example, a mode in which a specific perpetrator A transmits text data 20 to a plurality of third parties X to Z for defamation of a specific victim B or a specific perpetrator A Is a mode in which the text data 20 is transmitted to a plurality of third parties X to Z as an antisocial document. The mode c) is a mode in which a plurality of specific perpetrators A, A ′, A ″ transmit the text data 20 to a specific victim B as in a bullying document, for example. Further, in the aspect of d), for example, the specific perpetrators A, A ′, A ″, etc. send the text data 20 to a plurality of third parties X to Z for defamation of the specific victim B, etc. It is an aspect to do.

公序良俗を害する態様としては、ａ）及びｃ）の態様が最も多いと考えられる。これに対し、ｂ）ｄ）の態様は、第三者Ｘ〜Ｚの電子メールアドレスを特定の加害者Ａ等が取得している必要があり、また、第三者Ｘ〜Ｚは特定の被害者Ｂの知人である必要があるため、公序良俗を害する態様としては少ない。 Aspects a) and c) are thought to be the most common aspects of harming public order and morals. On the other hand, the aspect of b) d) requires that a specific perpetrator A or the like obtains the e-mail address of the third party X to Z, and the third party X to Z Since it is necessary to be an acquaintance of the person B, there are few aspects that harm public order and morals.

したがって、最も簡単に加害者Ａを特定するには、プロパティ情報ＤＢ１３５に記録された送信者の電子メールアドレスの数が多い（例えば１０以上）送信者を抽出すればよい（ａ）ｂ）の態様）。また、送信者が複数の電子メールアドレスを使い分けて、テキストデータ２０を送信する場合でも、テキストデータ２０の宛先になることが多い（例えば、１０回以上）受信者のメールアドレスが特定できる（ｃ）の態様）。この場合でもいじめやストーカー等の行為であるとしてよいので、同じ受信者のメールアドレスに送信した複数の送信者が通報の対象となる。したがって、電子メールを用いて公序良俗を害するおそれが高いテキストデータ２０を送信する態様のほとんど（ａ）〜ｃ））に対し有効である。 Therefore, in order to identify the perpetrator A most easily, it is only necessary to extract a sender having a large number (for example, 10 or more) of the sender's e-mail addresses recorded in the property information DB 135 (a) b) ). Further, even when the sender uses a plurality of e-mail addresses to transmit the text data 20, the e-mail address of the receiver can often be specified (e.g., 10 times or more) (c). Embodiment)). Even in this case, since it may be an action such as bullying or stalking, a plurality of senders transmitted to the same recipient's mail address are subject to notification. Therefore, it is effective for most of the modes (a) to (c)) in which the text data 20 having a high possibility of harming public order and morals is transmitted using electronic mail.

なお、ｂ）やｄ）の態様は、電子メールよりも電子掲示板の態様に近いので、テキストデータ２０を電子掲示板にアップロードした端末のＩＰアドレスが、送信者（アップロードした者）を特定する手がかりとなる。このため、プロパティ情報記録手段２６は、端末のＩＰアドレスをプロパティ情報ＤＢ１３５に記録する。 Since the aspects of b) and d) are closer to those of electronic bulletin boards than e-mails, the IP address of the terminal that uploaded the text data 20 to the electronic bulletin board is a clue to identify the sender (uploader). Become. Therefore, the property information recording unit 26 records the IP address of the terminal in the property information DB 135.

通報手段２７は、プロパティ情報ＤＢ１３５に記録された数の多い送信者の電子メールアドレス、又は、記録された数の多い受信者の電子メールアドレスに送信した送信者の電子メールアドレスを公的機関に通報する。より好ましくは、公的機関への送信時に、プロバイダの電子証明やタイムスタンプを添付することで、通報手段２７の通報の証拠機能が向上する。 The reporting unit 27 sends the sender's email address recorded in the property information DB 135 to the public organization or the sender's email address sent to the recorded recipient's email address. report. More preferably, the reporting proof function of the reporting means 27 is improved by attaching a provider's electronic certificate or time stamp when transmitting to a public institution.

これにより公的機関は、プロバイダ責任制限法及び関連するガイドラインに基づきプロバイダ等に送信者の情報を開示するよう要求でき、送信者の情報から実際の住所、氏名、連絡先等を特定することができるようになる。 This allows public organizations to request the provider to disclose the sender's information based on the Provider Liability Limitation Law and related guidelines, and specify the actual address, name, contact information, etc. from the sender's information. Will be able to.

図２４は、通報手段２７がテキストデータ２０の送信者を抽出する手順を示すフローチャート図の一例を示す。図２４のフローチャート図は、例えば、所定のサイクル時間（１日１回）毎に繰り返し実行される。 FIG. 24 shows an example of a flowchart showing a procedure for the reporting means 27 to extract the sender of the text data 20. The flowchart of FIG. 24 is repeatedly executed, for example, every predetermined cycle time (once a day).

まず、通報手段２７は、プロパティ情報ＤＢ１３５に例えば１０以上記録された送信者の電子メールアドレスを抽出する（Ｓ５１０）。これによりａ）及びｂ）の態様の送信者を検出することができる。 First, the reporting means 27 extracts the sender's e-mail address recorded in the property information DB 135, for example, 10 or more (S510). As a result, it is possible to detect the sender in the aspects a) and b).

ついで、通報手段２７は、プロパティ情報ＤＢ１３５に例えば１０以上記録された受信者に送信する送信者の電子メールアドレスを抽出する（Ｓ５２０）。これによりｃ）の態様の送信者を検出することができる。 Next, the notification means 27 extracts the sender's e-mail address to be transmitted to the receiver recorded in the property information DB 135, for example, 10 or more (S520). Thereby, the sender of the aspect of c) can be detected.

かかる処理により、単に公序良俗を害するおそれの高いテキストデータ２０の送信を禁止するだけでなく公的機関に通報することができるので、自動的に証拠保存され、公的機関も犯罪検挙がしやすくなり、さらなる犯罪を抑止することができるようになる。 This process not only prohibits the transmission of text data 20 that is likely to harm public order and morals, but can also notify the public institution, so that the evidence is automatically preserved, and the public institution becomes easier to criminalize crimes. , Will be able to deter further crimes.

文書種別を判定する情報処理装置がネットワークＮを介して画像形成装置と接続された印刷システムの概略構成例を示す図である。1 is a diagram illustrating a schematic configuration example of a printing system in which an information processing apparatus that determines a document type is connected to an image forming apparatus via a network N. FIG. 情報処理装置のハードウェア構成例を示す図である。It is a figure which shows the hardware structural example of information processing apparatus. 情報処理装置の機能構成図の一例である。It is an example of a functional block diagram of information processing apparatus. 情報処理装置の機能構成図の他の一例を示す図である。It is a figure which shows another example of the function block diagram of information processing apparatus. 文書種別情報ＤＢに記憶される情報の一例を示す図である。It is a figure which shows an example of the information memorize | stored in document classification information DB. 文書種別情報ＤＢに記憶される情報のより詳細な例を示す図である。It is a figure which shows the more detailed example of the information memorize | stored in document classification information DB. 文書種別判定手段が文書種別を判定する手順を示すフローチャート図の一例である。It is an example of the flowchart figure which shows the procedure in which a document type determination means determines a document type. 校正情報ＤＢに記憶される情報の一例を示す図である。It is a figure which shows an example of the information memorize | stored in calibration information DB. 表現校正手段が文書種別に応じて表現を校正する手順を示すフローチャート図の一例である。It is an example of the flowchart figure which shows the procedure in which an expression proofreading means proofreads an expression according to a document type. ディスプレイに表示されるテキストデータの構成例を示す図である。It is a figure which shows the structural example of the text data displayed on a display. 校正後の表現の複数の候補が表示されたテキストデータの一例を示す図である。It is a figure which shows an example of the text data by which the several candidate of the expression after proofread was displayed. ビジネス文書の文書構成の一例を示す図である。It is a figure which shows an example of the document structure of a business document. 文書構成手段が、テキストデータを文書構成情報に従い構成する手順を示すフローチャート図の一例である。It is an example of the flowchart figure which shows the procedure in which a document structure means comprises text data according to document structure information. 構成前と構成後のテキストデータの一例を示す図である。It is a figure which shows an example of the text data before a structure and after a structure. ディスプレイに表示されたテキストデータと文書構成ボックスの一例を示す図である。It is a figure which shows an example of the text data and document structure box which were displayed on the display. 年賀状の文書構成の一例を示す図である。It is a figure which shows an example of the document structure of a New Year's card. 文書構成手段がテキストデータを文書構成する手順を示すフローチャート図である。It is a flowchart figure which shows the procedure in which a document structure means composes text data. 構成前と構成後のテキストデータの一例を示す図である。It is a figure which shows an example of the text data before a structure and after a structure. 画像形成装置が文書種別を判定する手順のシーケンス図の一例である。FIG. 10 is an example of a sequence diagram of a procedure for determining a document type by the image forming apparatus. 情報処理装置の機能構成図の一例である（実施例２）。It is an example of the function block diagram of information processing apparatus (Example 2). 文書種別情報ＤＢに記憶される情報の一例を示す図である（実施例２）。(Example 2) which shows an example of the information memorize | stored in document classification information DB. プロパティ情報ＤＢに記録されるプロパティ情報の一例を示す図である。It is a figure which shows an example of the property information recorded on property information DB. 文書種別判定手段が文書種別を判定する手順を示すフローチャート図の一例である。It is an example of the flowchart figure which shows the procedure in which a document type determination means determines a document type. 通報手段がテキストデータの送信者を抽出して通報する手順を示すフローチャート図の一例である。It is an example of the flowchart figure which shows the procedure in which a report means extracts the sender of text data and reports.

Explanation of symbols

２０テキストデータ
２１ＩＭ（インプットメソッド）
２２文書種別判定手段
２３表現校正手段
２４文書構成手段
２６プロパティ情報記録手段
２７通報手段
２８転送禁止手段
３１〜３９文字配置欄
４０イラスト欄
１１０画像形成装置
１１１情報処理装置
１１２キーボード
１１３マウス
１１４ディスプレイ
１２６記憶装置
１３１文書種別情報ＤＢ
１３２校正情報ＤＢ
１３３文書構成情報ＤＢ
１３４プログラム
１３５プロパティ情報ＤＢ 20 Text data 21 IM (input method)
22 Document type determination unit 23 Expression proofing unit 24 Document composition unit 26 Property information recording unit 27 Notification unit 28 Transfer prohibition unit 31-39 Character arrangement column 40 Illustration column 110 Image forming apparatus 111 Information processing apparatus 112 Keyboard 113 Mouse 114 Display 126 Storage Device 131 Document type information DB
132 Calibration information DB
133 Document configuration information DB
134 Program 135 Property information DB

Claims

Document type information storage means for storing characters or character strings used in the document data of the document type in association with the document type of the document data;
Document data input means for inputting document data;
A type determination unit that determines the document type of the input document data based on the document type stored in the document type information storage unit in association with the character or character string included in the document data input by the document data input unit. When,
Document configuration information storage means for storing document configuration information for designating one or more fonts, sizes, thicknesses of characters of document data or arrangement positions of characters or character strings for each document type,
A document composition unit that arranges the appearance of the input document data based on the document structure information stored in the document structure information storage unit in association with the document type of the document data determined by the type determination unit;
A proofreading information storage means for storing a proofread character or character string in association with a character or character string before proofreading other than typographical error, omission and grammatical misuse for each document type,
When the document data input by the document data input unit stores characters or character strings before calibration stored in the calibration information storage unit,
A calibrating unit that calibrates a character or character string before proofreading with a calibrated character or character string stored in the proofreading information storage unit according to the document type of the input document data determined by the type determining unit. Have
The document configuration means displays the document configuration information together with a configuration example, and when an operation to apply the document configuration information to an arbitrary character string of document data is accepted, arranges the appearance of the arbitrary character string. An information processing apparatus characterized by the above.

The type determination unit counts the number stored in the document type information storage unit among the characters or character strings included in the document data input by the document data input unit, for each document type. It is determined that the document data is the largest document type.
The information processing apparatus according to claim 1 .

When the type determination unit determines that the document data input by the document data input unit is a business document used in office work or transactions,
The document composition means adds a bullet to the beginning of a bulleted list of document data in accordance with the document structure information of a business document.
The information processing apparatus according to claim 1 or 2 .

When the type determination unit determines that the document data input by the document data input unit is a business document used in office work or transactions,
The document configuration means arranges date and time information or location information included in document data at a predetermined position in accordance with the document configuration information of a business document.
The information processing apparatus according to any one of claims 1 to 3 .

When the type determination unit determines that the document data input by the document data input unit is a New Year's card,
The document composition means converts a font of characters constituting a New Year greeting included in the document data into a brush in accordance with the document structure information of a New Year card.
The information processing apparatus according to claim 1 or 2 .

The document configuration information storage means stores image data of animals corresponding to each zodiac for each zodiac,
When the type determination unit determines that the document data input by the document data input unit is a New Year's card,
The document composition means extracts the animal image data corresponding to the zodiac of the year following the year in which the document data was input from the document composition information storage means, and according to the document structure information of the new year card, a predetermined position of the new year card To place in the
The information processing apparatus according to claim 1 or 2 .

The calibration means includes
A plurality of characters or character strings after proofreading are displayed on the display device together with the characters or character strings before proofreading included in the document data,
Using a character or character string selected by a pointing device among a plurality of proofread characters or character strings, the character or character string before proofreading of document data is proofread.
The information processing apparatus according to claim 1 .

Document type information storage means for storing characters or character strings used in the document data of the document type in association with the document type of the document data;
Document data input means for inputting document data;
A type determination unit that determines the document type of the input document data based on the document type stored in the document type information storage unit in association with the character or character string included in the document data input by the document data input unit. When,
Document configuration information storage means for storing document configuration information for designating one or more fonts, sizes, thicknesses of characters of document data or arrangement positions of characters or character strings for each document type,
A document composition unit that arranges the appearance of the input document data based on the document structure information stored in the document structure information storage unit in association with the document type of the document data determined by the type determination unit;
A proofreading information storage means for storing a proofread character or character string in association with a character or character string before proofreading other than typographical error, omission and grammatical misuse for each document type,
When the document data input by the document data input unit stores characters or character strings before calibration stored in the calibration information storage unit,
A calibrating unit that calibrates a character or character string before proofreading with a calibrated character or character string stored in the proofreading information storage unit according to the document type of the input document data determined by the type determining unit. Have
The document configuration means displays the document configuration information together with a configuration example, and when an operation to apply the document configuration information to an arbitrary character string of document data is accepted, arranges the appearance of the arbitrary character string. An image forming apparatus.

Document type information storage means for storing characters or character strings used in the document type in association with the document type of the document data;
A computer that reads information from a document configuration information storage unit that stores document configuration information that specifies one or more fonts, sizes, and thicknesses of characters of document data, or arrangement positions of characters or character strings for each document type ,
Document data input means for inputting document data;
A type determination unit that determines the document type of the input document data based on the document type stored in the document type information storage unit in association with the character or character string included in the document data input by the document data input unit. When,
A document composition unit that arranges the appearance of the input document data based on the document structure information stored in the document structure information storage unit in association with the document type of the document data determined by the type determination unit;
A proofreading information storage means for storing a proofread character or character string in association with a character or character string before proofreading other than typographical error, omission and grammatical misuse for each document type,
When the document data input by the document data input unit stores characters or character strings before calibration stored in the calibration information storage unit,
A calibrating unit that calibrates a character or character string before proofreading with a calibrated character or character string stored in the proofreading information storage unit according to the document type of the input document data determined by the type determining unit. Function as,
A program for displaying the document configuration information together with a configuration example and executing a process for adjusting the appearance of the arbitrary character string when an operation for applying the document configuration information to an arbitrary character string of document data is received .

Inputting document data from the document data input means;
The type determination unit refers to the document type information storage unit that stores the character or character string used in the document type in association with the document type of the document data, and sets the character or character string included in the input document data. Determining the document type of the input document data based on the document type stored in the document type information storage unit in association with the document type;
A document in which the document composition unit designates one or more fonts, sizes, and thicknesses of characters of document data, or arrangement positions of characters or character strings for each document type according to the document type of the determined document data. Referring to the document configuration information storage means for storing the configuration information, and adjusting the format of the input document data in accordance with the document configuration information stored in the document configuration information storage means;
For each document type, the character or character string before proofreading of the proofreading information storage means storing the character or character string after proofreading in correspondence with the character or character string before proofreading other than typographical error, omission and grammatical misuse, When input by the document data input means,
In accordance with the document type of the input document data determined by the type determining unit, the calibrating unit calibrates the character or character string before proofreading with the calibrated character or character string stored in the proofreading information storage unit. And steps to
The document composition means displaying the document structure information together with a configuration example, and when an operation to apply the document structure information to an arbitrary character string of document data is accepted, adjusting the appearance of the arbitrary character string; ,
A document data construction method characterized by comprising: