JPH03233670A

JPH03233670A - Text data conversion system

Info

Publication number: JPH03233670A
Application number: JP2028164A
Authority: JP
Inventors: Yasuo Tanosaki; 康雄田野崎
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1990-02-09
Filing date: 1990-02-09
Publication date: 1991-10-17

Abstract

PURPOSE:To edit a text to a general form without requiring troublesome operation by providing a means which deletes an unnecessary space character code and a means which deletes an unnecessary line feed code. CONSTITUTION:A program part 20 is provided with a space character code deleting part 20c which executes the processing to delete the unnecessary space character code from text data and a line feed code deleting part 20d which executes the processing to delete the unnecessary line feed code. A data managing part 30 is provided with a character string temporary storage buffer 30d where text data inputted from an external storage device 40 is stored and a generated character string storage buffer 30h. Consequently, text data is obtained which has unnecessary space character code and line feed code deleted to have the continuity of contents. Thus, edited results are obtained without requiring troublesome operation though it is necessary to edit data into a general text.

Description

【発明の詳細な説明】［発明の目的］（産業上の利用分野）本発明は、ワードプロセッサ等の文書作成装置等によっ
て作成された文書の文書データ（テキストデータ）につ
いて変換を行なうテキストデータ変換方式に関する。[Detailed Description of the Invention] [Object of the Invention] (Field of Industrial Application) The present invention provides a text data conversion method for converting document data (text data) of a document created by a document creation device such as a word processor. Regarding.

（従来の技術）一般に、ワードプロセッサを用いてテキスト（文書）を
作成する場合、文書を読み易くする目的で、各行の先頭
に空白を表わす文字（スペース）等を挿入して行頭位置
が揃うように左余白を設けたり、行の途中で改行を行な
い１行中の文字数を揃える場合がある。(Prior art) Generally, when creating text (documents) using a word processor, in order to make the document easier to read, spaces are inserted at the beginning of each line so that the beginnings of the lines are aligned. In some cases, a left margin is provided or a line break is added in the middle of a line to equalize the number of characters in one line.

しかし、こうしたテキストを作成した場合には、以下に
示すような点に不具合が生じていた。However, when such texts were created, the following problems occurred.

（１）作成したテキストを修正する場合。(1) When modifying the created text.

ある行に含まれる語句（文字）の削除を行った場合、同
行に意味的に接続している次の行（の語句）が自動的に
追従してこない。このため、同行に含まれる文字数が減
少してしまい、操作者は何らかの操作によって、次の行
の先頭の語句に対する移動等の処理を行なわなければな
らなず、多大な労力を必要としていた。When a word (character) included in a certain line is deleted, the next line (word/phrase) that is semantically connected to the same line does not automatically follow. As a result, the number of characters included in the line decreases, and the operator has to perform some operations such as moving to the first word of the next line, which requires a great deal of effort.

（２）作成したテキストを他の表示系で表示する場合。(2) When displaying the created text on another display system.

作成したテキストを表示する際、テキストを表示しよう
とする表示系の１行あたりの文字数がテキストを作成し
た際の表示系の１行あたりの文字数と異なる場合に、本
来目的としない位置での改行が行われたり、本来は左余
白を表わす目的で入力した空白文字列が意味のない空白
文字列として表示されてしまうことがあった。When displaying the created text, if the number of characters per line of the display system that is trying to display the text is different from the number of characters per line of the display system when the text was created, line breaks may occur at unintended positions. In some cases, blank strings that were originally intended to represent the left margin were displayed as meaningless blank strings.

（３）作成したテキストから単語の検索を行なう場合。(3) When searching for words from the created text.

テキスト中の文字列（単語）の検索を行なう際に、本来
ならひとつの単語であるものか改行、空白文字列によっ
て分割されていると、この単語については目的の単語と
のマツチングが行なわれないため、検索することができ
ない。When searching for a character string (word) in text, if it is originally a single word but is divided by line breaks or blank strings, the word will not be matched with the target word. Therefore, it is not possible to search.

（４）作成したテキストの校閲あるいは翻訳を各処理機
能によって自動的に行なう場合。(4) When the created text is automatically proofread or translated by each processing function.

本来ならひとつの文であるものが改行、あるいは空白文
字列によって分割されていると、校閲あるいは翻訳を自
動的に行なうために必要な形態素解析、構文解析等の処
理を行なうことが困難となる。このため、自動校閲、自
動翻訳を行なう各処理機能の実行も困難となってしまう
。If what is normally a single sentence is divided by line breaks or blank strings, it becomes difficult to perform processes such as morphological analysis and syntactic analysis that are necessary for automatic proofreading or translation. Therefore, it becomes difficult to execute various processing functions such as automatic proofreading and automatic translation.

このため、テキストを読み易くする目的のために、各行
の先頭に空白文字列を挿入したり、行の途中で改行を行
ったテキストについて、作成したテキストを修正する場
合、テキストを他の表示系で表示する場合、単語の検索
を行なう場合、校閲。For this reason, when modifying text that has been created by inserting a blank string at the beginning of each line or by adding a line break in the middle of a line to make the text easier to read, it is necessary to When displaying, searching for words, proofreading.

翻訳を各処理機能によって自動的に行なう場合等には、
各行毎に意味のない空白文字列、改行（コード）を必要
に応じて削除し、テキストを一般的な形式に編集した後
に実行する必要があった。When translation is automatically performed by each processing function,
It was necessary to delete meaningless blank strings and line breaks (codes) from each line as necessary, edit the text into a general format, and then execute it.

（発明が解決しようとする課題）このように、テキストを読み易くする目的のために各行
の先頭に空白文字列を挿入したり、行の途中で改行を行
ったテキストについては、他の処理（前記（１）〜（４
）のような処理）を行なう場合に、空白文字列、改行（
コード）の削除等の作業が必要となり、処理効率を低下
させるという問題があった。(Problem to be Solved by the Invention) In this way, for text that has a blank string inserted at the beginning of each line or a line break in the middle of a line for the purpose of making the text easier to read, other processing ( (1) to (4) above
), blank strings, line breaks (
This requires work such as deleting code), which poses a problem of lowering processing efficiency.

本発明は前記のような点に鑑みてなされたもので、煩わ
しい操作を必要とすることなく、テキスト中の不要な空
白文字、改行を削除することが可能なテキストデータ変
換方式を提供することを目的とする。The present invention has been made in view of the above points, and an object of the present invention is to provide a text data conversion method that can delete unnecessary blank characters and line breaks in text without requiring troublesome operations. purpose.

［発明の目的］（課題を解決するための手段）本発明は、空白文字コード、改行コードを含む各種文字
の文字コードが所定順に配列されたテキストデータを格
納するための第１のテキストデータ格納手段と、前記テ
キストデータ格納手段に格納されたテキストデータから
、不要な空白文字コードを削除する空白文字コード削除
手段と、前記テキストデータ記憶手段に格納されたテキ
ストデータから、前記テキストの文末以外に付された不
要な改行コードを削除する改行コード削除手段と、前記
空白文字コード削除手段、及び前記改行コード削除手段
によって、空白文字コード、改行コードが削除されたテ
キストデータを格納するための第２のテキストデータ格
納手段とを具備し、テキスト中の不要な空白文字コード
、改行コードが削除された内容的に連続するテキストデ
ータを抽出するように構成するものである。[Object of the Invention] (Means for Solving the Problems) The present invention provides a first text data storage for storing text data in which character codes of various characters including blank character codes and line feed codes are arranged in a predetermined order. blank character code deleting means for deleting unnecessary blank character codes from the text data stored in the text data storage means; a second line feed code for storing text data from which blank character codes and line feed codes have been deleted by the line feed code deletion means, the blank character code deletion means, and the line feed code deletion means; The apparatus is configured to extract continuous text data from which unnecessary blank character codes and line feed codes have been deleted from the text.

（作用）このような構成によれば、表示系に表示したり、印刷し
た際に、文書が読み易くなるようにする目的で挿入され
た内容的に不要な空白文字コード、改行コードが削除さ
れるため、テキストデータが連続する（空白文字、改行
コードによって分割されない）条件で実行可能な、単語
の検索、自動的に行なう処理機能による校閲、翻訳等を
行なう場合に、煩わしい操作を伴なう方式によりテキス
ト編集を行なう必要がない。(Function) According to this configuration, unnecessary blank character codes and line feed codes inserted for the purpose of making the document easier to read when displayed on a display system or printed are deleted. Therefore, when performing word searches, proofreading using automatic processing functions, translation, etc., which can be performed under the condition that text data is continuous (not divided by blank characters or line feed codes), cumbersome operations are required. This method eliminates the need for text editing.

（実施例）以下、図面を参照して本発明の一実施例を説明する。第
１図は同実施例に係わるテキストデータ変換方式を適用
する情報処理装置の構成を示すブロック図である。同図
に示すように、テキストデータ変換処理を制御する制御
部ｌＯによって、プログラム部２０、及びデータ格納部
３０が管理される。(Example) Hereinafter, an example of the present invention will be described with reference to the drawings. FIG. 1 is a block diagram showing the configuration of an information processing apparatus to which the text data conversion method according to the embodiment is applied. As shown in the figure, a program section 20 and a data storage section 30 are managed by a control section IO that controls text data conversion processing.

また、同装置は、プログラム、テキストデータ等を格納
するためのハードディスク装置等によって構成される外
部記憶装置４０と接続されている。Further, the device is connected to an external storage device 40 constituted by a hard disk device or the like for storing programs, text data, and the like.

プログラム部２０には、テキストデータ中の不要な空白
文字、改行コードを削除するテキスト編集処理を実行す
る際の初期化処理を行なう初期化部２０ａ、外部記憶装
置４０に格納されたテキストデータを所定のデータ格納
部３０の領域に格納するデータ読込み部２０ｂ、処理対
象とするテキストデータから不要な空白文字コードを削
除する処理を実行する空白文字コード削除部２０ｃ１及
びテキストデータから不要な改行コードを削除する処理
を実行する改行コードさ削除部２０ｄが設けられている
。The program unit 20 includes an initialization unit 20a that performs initialization processing when executing text editing processing to delete unnecessary blank characters and line feed codes in text data, and an initialization unit 20a that performs initialization processing when executing text editing processing to delete unnecessary blank characters and line feed codes in text data. a data reading unit 20b that stores data in the area of the data storage unit 30; a blank character code deletion unit 20c1 that executes processing to delete unnecessary blank character codes from text data to be processed; and a blank character code deletion unit 20c1 that deletes unnecessary line feed codes from text data. A line feed code deleting unit 20d is provided to perform the processing to delete the line feed code.

データ管理部３０には、外部記憶装置４０から入力され
たテキストの総行数の値を格納するための行カウンタバ
ッファ３０ａ、外部記憶装置４０中の文字位置を記憶す
るための入力文字位置記憶用バッファ３０ｂ、後述する
生成文字列格納用バッファ３０ｈのテキストデータを格
納すべき位置を記憶する出力文字位置記憶用バッファ３
０Ｃ１外部記憶装置４０から入力したテキストデータを
一時的に格納するための文字列−時格納用バッファ３０
ｄ、文字列−時格納用バッファ３０ｃの内容を後の処理
のために保存するための文字列保存用バッファ３０ｅ１
文字列−時格納用バッファ３０ｄに格納されている文字
列の文字数を格納するための文字数カウンタ用バッファ
３０ｆ１文字数カウンタ用バッファ３０ｆの内容を後の
処理のために保存するための文字数カウンタ保存用バッ
ファ３０ｇ１及びテキスト編集処理の結果得られる不要
な空白文字、改行が削除された文字列を格納するための
生成文字列格納用バッファ３０ｈが設けられている。The data management unit 30 includes a line counter buffer 30a for storing the value of the total number of lines of text input from the external storage device 40, and a line counter buffer 30a for storing the input character position for storing the character position in the external storage device 40. Buffer 30b, output character position storage buffer 3 that stores the position where text data of generated character string storage buffer 30h, which will be described later, should be stored.
Character string-time storage buffer 30 for temporarily storing text data input from the 0C1 external storage device 40
d. Character string storage buffer 30e1 for storing the contents of the character string-time storage buffer 30c for later processing.
Character count counter buffer 30f for storing the number of characters in the character string stored in the character string-time storage buffer 30d1 Character counter storage buffer for saving the contents of the character counter buffer 30f for later processing 30g1 and a generated character string storage buffer 30h for storing a character string from which unnecessary blank characters and line breaks obtained as a result of text editing processing have been deleted.

次に、同実施例の動作について第２図に示すフローチャ
ートを参照しながら説明する。Next, the operation of this embodiment will be explained with reference to the flowchart shown in FIG.

まず、システムが起動され、テキストデータ中の不要な
空白文字、改行コードを削除するテキスト編集処理の実
行が指示されると、プログラム部２０中の初期化部２０
ａが起動する。初期化部２０ａは、データ格納部３０中
の各種変数・バッファの初期化を行なう（ステップＳｌ
）。ここで、特に行カウンタバッファ３０ａ１人力文字
位置記憶用バッファ３０ｂ、及び出力文字位置記憶バッ
ファ３０ｃに値“０°　（ゼロ）が格納される。初期化
が終了すると、データ読込み部２０ｂが起動し、外部記
憶装置４０中に格納されているテキストデータ（の一部
）を、文字列−時格納用バッファ３０ｄに転送する（ス
テップＳ２）。外部記憶装置４０中でのテキストデータ
の格納形式を第３図に示している。なお、第３図におい
て、ｒ　ＣＲＪは改行コード、ｒｓＰｃＪは空白文字コ
ード（スペース）、その他の文字は文字コードを示すも
のである。すなわち、データ読込み部２０ｂは、入力文
字位置記憶用バッファ３０ｂの内容が示す文字位置を先
頭とし、これに続く改行コードｒ　ＣＲＪまでの単位（
１行分の文字列）を、文字列−時格納用バッファ３０ｄ
の先頭から順に、末尾の改行コードを含めて格納する。First, when the system is started and instructions are given to execute text editing processing to delete unnecessary blank characters and line feed codes in text data, the initialization unit 20 in the program unit 20
a starts. The initialization unit 20a initializes various variables and buffers in the data storage unit 30 (step Sl
). Here, in particular, the value "0° (zero) is stored in the line counter buffer 30a, the manual character position storage buffer 30b, and the output character position storage buffer 30c. When the initialization is completed, the data reading unit 20b is activated, The text data (a part of it) stored in the external storage device 40 is transferred to the character string-time storage buffer 30d (step S2).The storage format of the text data in the external storage device 40 is In FIG. 3, rCRJ is a line feed code, rsPcJ is a blank character code (space), and other characters are character codes.In other words, the data reading unit 20b reads the input character Starting from the character position indicated by the contents of the position storage buffer 30b, the unit (
One line of character string) is stored in the character string-time storage buffer 30d.
are stored sequentially from the beginning, including the trailing newline code.

文字列−時格納用バッファ３０ｄにテキストデータが格
納されると、外部記憶装置４０中における読取られた改
行コードの次の位置を示すように、入力文字位置記憶用
バッファ３０ｂの内容を更新する。さらに、行カウンタ
バッファ３０ａの内容に「１」を加える。When the text data is stored in the character string-time storage buffer 30d, the contents of the input character position storage buffer 30b are updated to indicate the next position of the read line feed code in the external storage device 40. Furthermore, "1" is added to the contents of the row counter buffer 30a.

ステップＳ２において、テキストデータの読込みができ
なかった場合、つまり入力文字位置記憶用バッファ３０
ｂに格納されたデータによって示される文字位置に文字
データ（改行、空白文字コードを含む）が格納されてい
なかった場合には、本方式での処理を終了する（ステッ
プＳ３）。In step S2, if the text data cannot be read, that is, the input character position storage buffer 30
If character data (including line feed and blank character codes) is not stored at the character position indicated by the data stored in b, the process in this method is ended (step S3).

一方、ステップＳ２において、テキストデータの読込み
ができた場合には、空白文字コード削除部２０ｃが起動
する。空白文字コード削除部２０ｃは、行カウンタバッ
ファ３０ａの内容が「１」であるか（対象とする行がテ
キストの第１行目であるか）を判別する（ステップＳ４
）。行カウンタバッファ３０ａの内容が「１」の場合、
つまり外部記憶装置４０からテキストの最初の１行目が
読込まれた直後である場合、空白文字コード削除部２０
ｃは、文字−時格納用バッファ３０ｄに格納された文字
列のデータ（テキストデータ）を、文字列保存用バッフ
ァ３０ｅに転送する（ステップＳ５）。次に、文字−時
格納用バッファ３０ｄ中のテキストデータがら１行中の
文字数（空白文字コードを含む）をカウントして、その
カウント数を文字数カウンタ用バッファ３０ｆに格納す
る。この文字数のカウント結果は、文字数カウンタ用バ
ッファ３０ｆから、文字数カウンタ保存用バッファ３０
ｇに転送され格納される（ステップＳ６．Ｓ７）。この
１行中の文字数のカウント値は、後に実行される不要な
改行コードを削除するための処理に用いられる。空白文
字コード削除部２０ｃは、文字列−時格納用バッファ３
０ｄに格納されたテキストデータの先頭に空白文字コー
ドが存在するか判別する（ステップＳ８）。先頭に空白
文字コードが存在する（連続する空白文字コードを含む
〕場合、文字列保存用バッファ３０ｅに格納された先頭
の空白文字コード（連続する空白文字コードを含む）を
削除する（ステップＳ９）。この空白文字コードが削除
される様子を、第４図に示している。On the other hand, in step S2, if the text data has been successfully read, the blank character code deletion unit 20c is activated. The blank character code deletion unit 20c determines whether the content of the line counter buffer 30a is "1" (whether the target line is the first line of the text) (step S4
). If the content of the row counter buffer 30a is "1",
In other words, immediately after the first line of text is read from the external storage device 40, the blank character code deletion unit 20
c transfers the character string data (text data) stored in the character-time storage buffer 30d to the character string storage buffer 30e (step S5). Next, the number of characters in one line (including blank character codes) is counted from the text data in the character-time storage buffer 30d, and the counted number is stored in the character number counter buffer 30f. The result of counting the number of characters is transferred from the character number counter buffer 30f to the character number counter storage buffer 30f.
g and stored (steps S6 and S7). This count value of the number of characters in one line is used in a process executed later to delete unnecessary line feed codes. The blank character code deletion unit 20c is a character string-hour storage buffer 3.
It is determined whether a blank character code exists at the beginning of the text data stored in 0d (step S8). If a blank character code exists at the beginning (including consecutive blank character codes), delete the leading blank character code (including consecutive blank character codes) stored in the character string storage buffer 30e (step S9). FIG. 4 shows how this blank character code is deleted.

なお、ステップＳ８において、空白文字コードが存在し
ないと判別された場合には、文字列保存用バッファ３０
ｅの内容を操作することなく次の処理に移る。Note that if it is determined in step S8 that there is no blank character code, the character string storage buffer 30
Proceed to the next process without manipulating the contents of e.

次に、ステップＳ２の処理に移り、入力文字位置記憶用
バッファ３０ｂの内容が示す文字位置から改行コードｒ
　ＣＲＪまでの文字列（次行の文字列）を、前記同様に
して文字列−時格納用バッファ３０ｄの先頭から順に、
末尾の改行コードを含めて格納する。ここで、テキスト
データの読込みができた場合には（ステップＳ３）、行
カウンタの値が「１」でないため（ステップＳ４）、空
白文字コード削除部２０ｃが起動され、次行に対する空
白文字コードの削除処理が実行される。まず、空白文字
コード削除部２０ｃは、文字列−時格納用バッファ３（
ｌｄに格納されたテキストデータから１行中の文字数（
空白文字コードを含む）をカウントして、そのカウント
数を文字数カウンタ用バッファ３０ｆに格納する（ステ
ップ５ＩＯ）。この１行中の文字数のカウント値は、文
字数カウンタ保存用パンツ７３０ｇに格納された前行の
文字数を示すカウント値と共に、後に実行されるテキス
トデータ中の不要な改行コードを削除するための処理に
用いられる。空白文字コード削除部２０ｃは、文字列−
時格納用バッファ３０ｄに格納されたテキストデータの
先頭に空白文字コードが存在するか判別する（ステップ
５１１）。先頭に空白文字コードが存在する（連続する
空白文字コードを含む）場合、文字列保存用バッファ３
Ｃ１ｅに格納された先頭の空白文字コード（連続する空
白文字コードを含む）を削除しくステップ５Ｌ２）、そ
の結果を生成文字列格納用バッファ３０ｈの出力文字位
置記憶バッファ３０ｃの内容によって示される位置に転
送し格納する（ステップ８１３）。そして、出力文字位
置記憶用バッファ３０ｃの内容に、生成文字列格納用バ
ッファ３０ｈに格納した文字数を加える。なお、ステッ
プＳｌｌにおいて、空白文字コードが存在しないと判別
された場合には、文字列保存用バッファ３０ｅの内容を
操作することなく次の処理に移る。Next, the process moves to step S2, and the line feed code r is started from the character position indicated by the contents of the input character position storage buffer 30b.
The character strings up to CRJ (character strings in the next line) are processed in the same manner as described above, starting from the beginning of the character string-hour storage buffer 30d.
Store including the trailing newline code. Here, if the text data has been successfully read (step S3), the value of the line counter is not "1" (step S4), so the blank character code deletion unit 20c is activated and the blank character code for the next line is changed. Deletion processing is executed. First, the blank character code deletion unit 20c deletes the character string-hour storage buffer 3 (
The number of characters in one line from the text data stored in ld (
(including blank character codes) and stores the counted number in the character number counter buffer 30f (step 5IO). This count value of the number of characters in one line is used together with the count value indicating the number of characters in the previous line stored in the character counter storage pant 730g, in the process to delete unnecessary line feed codes in the text data that will be executed later. used. The blank character code deletion unit 20c deletes the character string -
It is determined whether a blank character code exists at the beginning of the text data stored in the time storage buffer 30d (step 511). If a blank character code exists at the beginning (including consecutive blank character codes), the character string storage buffer 3
Delete the leading blank character code (including consecutive blank character codes) stored in C1e (Step 5L2), and place the result in the position indicated by the contents of the output character position storage buffer 30c of the generated character string storage buffer 30h. Transfer and store (step 813). Then, the number of characters stored in the generated character string storage buffer 30h is added to the contents of the output character position storage buffer 30c. Note that if it is determined in step Sll that there is no blank character code, the process moves to the next process without operating the contents of the character string storage buffer 30e.

こうして、行頭に付された不要な空白文字コードが削除
されると、改行コード削除部２０ｄが起動する。改行コ
ード削除部２Ｄｄは、文字数カウンタ用バッファ３０ｆ
に格納された最後に読込んだ行（現在処理対象としてい
る行）の文字数と、文字数カウンタ保存用バッファ３０
ｇに格納された前行の文字数のとの差を計算する（ステ
ップ５１４）。When the unnecessary blank character code added to the beginning of the line is thus deleted, the line feed code deletion unit 20d is activated. The line feed code deletion unit 2Dd has a character count counter buffer 30f.
The number of characters in the last read line (the line currently being processed) stored in the buffer 30 for storing the character count counter
The difference between the number of characters in the previous line stored in g is calculated (step 514).

ここで、計算により得られた結果が０でない場合、つま
り直前にステップＳ２において読込んだ行（最後に読込
んだ行）の１行中に含まれる文字数が異なる場合は（ス
テップ５１５）、さらに現在処理対象としている行と前
行の文字数の何れの方が多いかを判別する（ステップ８
１６）。ここで、前行の文字列の文字数のほうが少ない
場合、前行は、文の最後の部分を含む行であるものと判
別する。Here, if the result obtained by the calculation is not 0, that is, if the number of characters included in one line is different from the line read in the previous step S2 (the last line read) (step 515), further Determine which has more characters, the line currently being processed or the previous line (Step 8
16). Here, if the number of characters in the character string in the previous line is smaller than that in the previous line, the previous line is determined to be the line containing the last part of the sentence.

例えば、第５図に示すようなテキストデータを処理対象
とすると、「示される。」の行が文の最後の部分を含む
行とする。このため、改行コード削除部２０ｄは、生成
文字列格納用バッファ３０ｈ中の出力文字位置記憶用バ
ッファ３０ｃの内容によって示される位置に改行コード
を格納する（ステップら５１７）。また、ステップＳｉｔにおいて、前行の文字
列の文字数のほうが多いと判別された場合は、さらに文
字列保存用バッファ３０ｅに格納されたテキストデータ
の最後の文字コードが、文の最後を示す句点コードであ
るか否かを判別する（ステップＳ　１８）。テキストデ
ータの最後が句点コードである場合は、同様にして文の
最後として生成文字列格納用バッファ３０ｈに改行コー
ドを格納する（ステップＳ　１？）。すなわち、テキス
ト中の「見出し文」のように、１行中の文字数が他の一
般文の行の文字数より少ない行を現在処理対象としてい
る行とする場合に、前行に確実に改行コードが付される
ようにするものである。改行コード削除部２０ｄは、生
成文字列格納用バッファ３０ｈに改行コードを格納する
と、出力文字位置記憶用バッファ３０ｃの内容に１を加
え、次の処理（ステップＳ　１９）に移る。For example, if text data as shown in FIG. 5 is to be processed, the line ``shown.'' is the line that includes the last part of the sentence. Therefore, the line feed code deletion unit 20d stores the line feed code at the position indicated by the contents of the output character position storage buffer 30c in the generated character string storage buffer 30h (step 517). Further, if it is determined in step Sit that the number of characters in the character string in the previous line is larger than that in the previous line, the last character code of the text data stored in the character string storage buffer 30e is the period code indicating the end of the sentence. It is determined whether or not (step S18). If the end of the text data is a period code, a line feed code is similarly stored in the generated character string storage buffer 30h as the end of the sentence (step S1?). In other words, when the current line to be processed is a line in which the number of characters in one line is smaller than the number of characters in other general text lines, such as a "headline sentence", it is possible to ensure that the previous line has a line feed code. It is to be attached. When the line feed code deletion unit 20d stores the line feed code in the generated character string storage buffer 30h, it adds 1 to the contents of the output character position storage buffer 30c, and moves on to the next process (step S19).

一方、ステップＳＩ５において、現在処理対象としてい
る行の文字数と、前行の文字数のとの差が等しいと判別
された場合には、テキストの行途中で改行することによ
って１行中の文字数を揃えたものとして、改行コードを
生成文字列格納用バッファ３０ｈに格納しない。従って
、ステップＳ１３において生成文字列格納用バッファ３
０ｈに文字列保存用バッファ３０ｅから改行コードを除
くテキストデータが転送され格納されているため、ここ
で改行コードを格納しないことによって、結果的に改行
コードを削除していることになる。On the other hand, if it is determined in step SI5 that the difference between the number of characters in the current line to be processed and the number of characters in the previous line is equal, the number of characters in one line is equalized by starting a line in the middle of the text line. As a result, the line feed code is not stored in the generated character string storage buffer 30h. Therefore, in step S13, the generated character string storage buffer 3
Since the text data excluding the line feed code is transferred and stored at 0h from the character string storage buffer 30e, by not storing the line feed code here, the line feed code is deleted as a result.

次に、改行コード削除部２０ｄは、新たに１行分のテキ
ストデータを読込んで処理するための準備として、文字
列−時格納用バッファ３０ｄに格納されたテキストデー
タを、文字列保存用バッファ３０已に転送し格納する（
ステップ５１９）。さらに、文こうして、次の処理の準
備が終了すると、ステップＳ２に処理が戻り、データ読
込み部２０ｂが起動し、行カウンタバッファ３０ｇの内
容によって示される行の読込みが行われる。以下、前記
において説明したようにして処理が実行される。Next, the line feed code deletion unit 20d transfers the text data stored in the character string-time storage buffer 30d to the character string storage buffer 30d in preparation for reading and processing one new line of text data. Transfer and store it immediately (
step 519). Furthermore, when preparation for the next process is completed, the process returns to step S2, the data reading section 20b is activated, and the line indicated by the contents of the line counter buffer 30g is read. Thereafter, the processing is executed as described above.

この結果、第５図に示すようなテキストデータは、第６
図に示すような、不要な空白文字、改行コードが削除さ
れたテキストデータに変換される。As a result, the text data as shown in Figure 5 is
The text data is converted to text data with unnecessary blank characters and line feed codes removed, as shown in the figure.

なお、本発明は前記実施例に限定されるものではない。Note that the present invention is not limited to the above embodiments.

例えば、本実施例では、外部記憶装置４０中に格納され
たテキストデータについて処理を実行するものとしたが
、通信回線を介して入力されたデータについて処理を行
なうようにしても良い。For example, in this embodiment, the processing is performed on text data stored in the external storage device 40, but the processing may be performed on data input via a communication line.

また、本発明の要旨を逸脱しない範囲で種々の変更が可
能である。Furthermore, various modifications can be made without departing from the gist of the present invention.

［発明の効果］以上のように本発明によれば、テキストを読み易くする
目的のための左余白を設けるための空白文字列や、１行
中の文字数を揃えるための改行が挿入されたテキストか
ら、不要な空白文字列。[Effects of the Invention] As described above, according to the present invention, text in which a blank character string is inserted to provide a left margin for the purpose of making the text easier to read, and a line break is inserted to equalize the number of characters in one line. From, an unnecessary blank string.

改行を削除して、文や語句を構成する文字列のみが抽出
するので、作成したテキストを修正する際、他の表示系
で表示する際、作成したテキスト中から単語等の検索を
行なう際、作成したテキストの校閲、翻訳を自動的に行
なう機能を用いる際などに、一般のテキストに編集する
必要があるで場合あっても、煩わしい操作を必要とする
ことなく編集結果を得ることができ、作業負担を大幅に
軽減すると共に、作業効率を向上させることが可能とな
るものである。Line breaks are removed and only the character strings that make up a sentence or phrase are extracted, so when editing the created text, displaying it on another display system, or searching for words etc. in the created text, Even if you need to edit the text into regular text when using a function that automatically proofreads or translates the text you have created, you can get the editing results without any troublesome operations. This makes it possible to significantly reduce the work burden and improve work efficiency.

[Brief explanation of drawings]

第１図は本発明の一実施例に係わるテキストデータ変換
方式を適用する情報処理装置の構成を示すブロック図、
第２図は同実施例の動作手順を示すフローチャート、第
３図は外部記憶装置中でのテキストデータの格納形式を
示す図、第４図は空白文字コードを削除する処理を説明
するための図、第５図は処理対象とするテキストデータ
の一例を示す図、第６図は第５図に示すテキストデータ
に対する処理結果を示す図である。ＩＯ・・・制御部、２０・・・プログラム部、２０ａ・
・・初期化部、２０ｂ・・・データ読込み部、２０ｃ・
・・空白文字コード削除部（空白文字コード削除手段）
　、２０ｄ・・・改行コード削除部（改行コード削除手
段）、３ｏ・・・データ格納部、３０ａ・・・行カウン
タバッファ、３０ｂ・・・人力文字位置記憶用バッファ
、３０ｃ・・・出力文学位ｒＩｔ記憶用バッファ、３（
ｌｄ・・・文字列−時格納用バッファ、３０ｅ・・文字
列保存用バッファ、３０ｆ・・・文字数カウンタ用バッ
ファ、３０ｇ・・・文字数カウンタ保存用バッファ、３
０ｈ・・・生成文字列格納用バッファ、４０・・・外部
記憶装置FIG. 1 is a block diagram showing the configuration of an information processing device to which a text data conversion method according to an embodiment of the present invention is applied;
FIG. 2 is a flowchart showing the operating procedure of the same embodiment, FIG. 3 is a diagram showing the storage format of text data in an external storage device, and FIG. 4 is a diagram for explaining the process of deleting blank character codes. , FIG. 5 is a diagram showing an example of text data to be processed, and FIG. 6 is a diagram showing a processing result for the text data shown in FIG. IO...Control unit, 20...Program unit, 20a.
...Initialization section, 20b...Data reading section, 20c.
・Blank character code deletion section (blank character code deletion means)
, 20d...Line feed code deletion unit (line feed code deletion means), 3o...Data storage unit, 30a...Line counter buffer, 30b...Manual character position storage buffer, 30c...Output character position rIt storage buffer, 3(
ld...Buffer for storing character string-time, 30e...Buffer for storing character string, 30f...Buffer for character number counter, 30g...Buffer for storing character number counter, 3
0h...Buffer for storing generated character strings, 40...External storage device

Claims

[Scope of Claims] First text data storage means for storing text data in which character codes of various characters including blank character codes and line feed codes are arranged in a predetermined order; Blank character code deletion means for deleting unnecessary blank character codes from text data; line feed code deletion means for deleting unnecessary line feed codes from text data stored in the text data storage means; and the blank character code deletion means. means, and second text data storage means for storing text data from which blank character codes and line feed codes have been deleted by the line feed code deletion means; A text data conversion method characterized by extracting content-continuous text data from which codes have been deleted.