JP2002297574A

JP2002297574A - System and program for kana putting along side chinese characters

Info

Publication number: JP2002297574A
Application number: JP2001102584A
Authority: JP
Inventors: Shigeki Ota; 繁喜太田; Masayuki Kogo; 正之向後
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2001-03-30
Filing date: 2001-03-30
Publication date: 2002-10-11

Abstract

PROBLEM TO BE SOLVED: To provide an efficient achieving means to automatically put Kana alongside Chinese characters by inputting given document data or document data put Kana by other type set according to pages as a result of type set and conditions of Kana putting to a character string specified by a user. SOLUTION: In a system for Kana putting, a logical section storage means 1 to set and store more than one logical section dividing the document data logically, an extracting means 4 for potential character string of Kana putting to extract potential character strings of Kana putting from the data, a Kana putting judgement means 2 to judge a potential character string appeared firstly as Kana putting subject strings for the strings composed of the same character strings by stored logical section and a Kana putting means 3 to put Kana on the judged subject strings in the data are equipped.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、書式制御コード処
理方式、特に文書処理における清書出力処理を行うシス
テムにおいて、ルビ制御コードを多用する文書データに
おいて、ルビ指示をする回数や時間の負荷を最小化する
とともに、組版ルールと整合性のあるルビ自動付与を可
能とするとともに、異なる基本書式文書間での変換にお
いて適正なルビ自動再付与を可能にしたルビ付与システ
ムに関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a format control code processing system, and more particularly, to a system for performing a clear copy output process in document processing. The present invention relates to a ruby assigning system that enables automatic ruby assigning in conformity with the typesetting rules, and enables appropriate automatic ruby reassignment in conversion between different basic format documents.

【０００２】[0002]

【従来の技術】一般に、データ処理装置の出力データで
ある文書データは、実際に出力すべき文字を示す文字コ
ードと、例えば、改行、改ページ、文字サイズ指示な
ど、文書の形を指示するいわゆる書式制御コードとから
なる。2. Description of the Related Art In general, document data, which is output data of a data processing apparatus, includes a character code indicating a character to be actually output, and a so-called "line feed, page break, character size instruction" or the like which indicates the form of the document. It consists of a format control code.

【０００３】また、例えば１ページの行数や１行の文字
数（行長）などの文書の基本書式に基づいて、書式制御
コードに従い文字コードを実際に出力される形に編集し
て、プリンタやディスプレイ装置用データに変換するも
のは組版と呼ばれる。Further, based on the basic format of a document such as the number of lines per page and the number of characters per line (line length), a character code is edited in accordance with a format control code into a form that is actually output, and a printer or the like is used. Data that is converted into data for a display device is called typesetting.

【０００４】書式制御コードは、多岐にわたり、読み方
の難しい文字列に「フリガナ」をふる「ルビ制御コー
ド」もその一部である。[0004] Format control codes are diversified, and "ruby control codes" for giving "reading" to character strings that are difficult to read are also part of them.

【０００５】特に、難しい古典、例えば、古事記など
を、「まんが・・・」などのように小・中学生向けに出
版されることが増加している。この場合、難しい読みに
は文書の全体にわたって同じ文字列であっても繰り返し
てルビ（フリガナ）がふられる。このように、ルビ組版
の中でも、ルビが多用されるルビ組版を「総ルビ」組版
と一般にいわれる。Particularly, it is increasing that difficult classics such as Kojiki are published for elementary and junior high school students, such as "manga ...". In this case, for the difficult reading, ruby (reading) is repeatedly given even if the same character string is used throughout the entire document. As described above, among the ruby typesettings, the ruby typesetting in which ruby is frequently used is generally referred to as “total ruby” typesetting.

【０００６】図３にルビを付与した文書データとルビ制
御コードの例を示す。図において「●」は前後の文字列
を示す。（ａ）においては漢字文字列「橘郎女」にフリ
ガナ「たちばなのいらつめ」を付与した例であり、ここ
では出現するすべての漢字文字列「橘郎女」にルビ付与
している。FIG. 3 shows an example of ruby-added document data and a ruby control code. In the figure, “●” indicates a character string before and after. (A) is an example in which the reading of the kanji character string "Tachibana-jo" is added to the kanji character string "Tachibana-no-jo". In this case, all the appearing kanji character strings "Tachibana-jo" are added with ruby.

【０００７】（ｂ）には文書データの中に入力してルビ
を付与するようにしたルビ制御コードを示す。ルビを付
与する（ここでは漢字の）文字列「橘郎女」に「ルビ制
御コード」（「＠ルビ；」と「＠」）で挟んでルビ文字
列（フリガナ）「たちばなのいらつめ」を入力してい
る。FIG. 2B shows a ruby control code which is input into document data to add ruby. Enter the ruby character string (furigana) "Tachibana no Irazume" with the "ruby control code"("＠ruby;" and "＠") sandwiched between the character strings "Tachibana woman" to add ruby (in this case, kanji) are doing.

【０００８】この例ではすべての漢字文字列にルビを付
与しているが、実際には１ページに何回も出現する場合
には先頭に出現した文字列にだけ付与するなど全体の読
みやすさを考慮して付与する文字列を選択している。In this example, ruby is added to all kanji character strings. However, in actuality, if the character string appears many times on one page, it is added only to the character string appearing at the beginning, so that the overall readability is improved. Is selected in consideration of the character string to be assigned.

【０００９】[0009]

【発明が解決しようとする課題】このような文書データ
にルビを付与するためには、文書データの入力時にルビ
制御コードを付けたり、編集の過程で追加して入力した
り、また、どの文字列に付与するか否かを人手で選択し
て入力することは総ルビの場合には、その作業負荷が高
く多くの労力を必要とし、作業品質向上や作業効率向上
などの課題があった。In order to add ruby to such document data, a ruby control code is added at the time of inputting the document data, an additional character is input during the editing process, In the case of total ruby, it is necessary to manually select whether or not to assign to a column, and in the case of total ruby, the work load is high and a lot of labor is required, and there are problems such as improvement of work quality and work efficiency.

【００１０】さらに、新聞・雑誌などでの連載ものの単
行本化や単行本の文庫本化をする場合など行長（１行の
文字数）・行数（１ページの行数）などの基本書式情報
が変更されたりするため、基本的に旧基本書式を新基本
書式に組版し直すことになる。こうした文書に総ルビが
ふられていた場合、新たに新組版結果に対応した適切な
総ルビ付与が必要となり、これを効率的に実現すること
も課題である。In addition, basic format information such as line length (number of characters per line) and number of lines (number of lines per page) has been changed, for example, in the case of serializing books serialized in newspapers and magazines, and converting books into paperback books. Basically reformat the old basic format to the new basic format. If total ruby is added to such a document, it is necessary to add a new total ruby appropriate for the new typesetting result, and achieving this efficiently is also an issue.

【００１１】このような課題の解決について、特開平７
−２８２０５０号公報「文書編集方法及びその装置」に
おいて、総ルビの付与するべき文字列の選択方法の技術
が開示されている。For solving such a problem, see Japanese Patent Application Laid-Open
Japanese Patent Application Publication No. 282050 / “Document Editing Method and Apparatus” discloses a technique for selecting a character string to be assigned to total ruby.

【００１２】すなわち、「文書内に存在する同一文字列
の中からルビを付けるべき文字列の適切な位置を自動的
に決定し、ユーザに手間をかけることなく見やすい位置
にルビを表示する」として、文書データ内の任意の文字
列に対するルビ設定の指定が行われると、その文字列の
文書内出現頻度を検出して、出現頻度情報に基づいて文
書内同一文字列のそれぞれに対するルビの表示、非表示
を決定するようにしている。ルビを表示する文字列をそ
の文字列の文書内出現頻度に従い、出現頻度の高い文字
列に対しては表示間隔を大きく、出現頻度の低い文字列
に対しては表示間隔を狭く決定するため、文書の適切な
位置の文字列にルビが表示されるようになると説明され
ている。That is, “the appropriate position of the character string to be added with ruby is automatically determined from the same character string existing in the document, and the ruby is displayed at a position that is easy to see without any trouble for the user”. When the ruby setting is specified for an arbitrary character string in the document data, the appearance frequency of the character string in the document is detected, and the ruby display for each of the same character strings in the document is performed based on the appearance frequency information. Hidden is determined. In order to determine the character string that displays ruby according to the frequency of occurrence of the character string in the document, the display interval is determined to be large for character strings with high frequency of appearance, and the display interval is narrow for character strings with low frequency of appearance. It is described that ruby will be displayed in a character string at an appropriate position in a document.

【００１３】確かに、図３に示したルビを付与した文書
データのように、出現したすべての文字列にルビを付与
すると出現頻度が高い場合には読み方が記憶に新しい文
字列が現れたときにもルビが頻繁に付けられていると煩
わしさを感じることになるので、適切な頻度でルビが付
与されることが望ましくこの課題の１解決方法として提
案されているものである。Indeed, if ruby is added to all character strings that appear, as in the case of ruby-added document data shown in FIG. Since it is annoying if rubies are frequently attached, it is desirable that rubies be given at an appropriate frequency, and this has been proposed as a solution to this problem.

【００１４】上記先行事例における実施例として、「文
字列の出現頻度が３回／ページ以上の場合には、３ペー
ジに１回ルビを表示する」場合に、一つの文字列にルビ
がふられたときには、次に続く２ページに渡って出現す
る同一文字列にはルビをふらないとして説明されてい
る。また、ユーザ指定情報に出現頻度がある値以下の場
合には出現するすべての文字列にルビをふる例も示され
ている。As an example of the above-mentioned prior example, when "the appearance frequency of a character string is three times / page or more, ruby is displayed once on three pages", ruby is applied to one character string. In this case, it is described that ruby is not added to the same character string appearing over the next two pages. Also, an example is shown in which, when the appearance frequency of the user-specified information is equal to or less than a certain value, ruby is added to all character strings that appear.

【００１５】しかしながら、文書データが実際のページ
に組版されるときに、各ページには写真や図表がスペー
スをとり文書データの文字がその他の空きのスペースに
割りつけられるので単純にルビを付与する文字列を頻度
のみによって決定することができない場合もあり、上記
先行事例ではこれを解決するようにはなっていない。However, when the document data is typeset on an actual page, a ruby is simply added to each page since photographs and charts take up space and characters of the document data are allocated to other empty spaces. In some cases, a character string cannot be determined only by frequency, and the preceding case does not solve this.

【００１６】そこで、本発明においては、組版した結果
のページと、利用者が指定した文字列へのルビ付与の条
件に基づいて、与えられた文書データあるいは他の組版
でルビが付与された文書データを入力して自動的にルビ
を付与する効率的な実現手段を提供することを目的とす
る。Therefore, in the present invention, based on a page obtained as a result of typesetting and a ruby setting condition for a character string specified by a user, a document to which ruby has been added with given document data or another typesetting. An object of the present invention is to provide an efficient means for automatically adding ruby by inputting data.

【００１７】[0017]

【課題を解決するための手段】図１に本発明の構成図を
示す。本発明は、文書データを論理的に区分する１以上
の論理区分を設定して記憶する論理区分記憶手段１と、
文書データからルビを付与する候補文字列を抽出するル
ビ付与候補文字列抽出手段４と、記憶された論理区分ご
とに、同一文字列からなる候補文字列については最初に
出現する候補文字列をルビ付与対象文字列として判定す
るルビ付与判定手段２と、文書データの上記判定された
ルビ付与対象文字列にルビを付与するルビ付与手段３と
を備えるようにしている。FIG. 1 is a block diagram of the present invention. The present invention provides a logical division storage unit 1 for setting and storing one or more logical divisions for logically dividing document data;
A ruby-attached candidate character string extracting means 4 for extracting a candidate character string to which ruby is to be added from document data, and a ruby-added candidate character string consisting of the same character string for each of the stored logical divisions. There is provided a ruby addition determining means 2 for determining as a character string to be added, and a ruby adding means 3 for adding ruby to the determined character string for ruby addition in the document data.

【００１８】また、各ページごとの収容文字数が変動す
る文書を編成して文書データを各ページに割り当てて組
版するとき、ルビ付与判定手段２は組版して定まるペー
ジに基づいた論理区分を用いてルビの付与判定をするよ
うにしている。Further, when organizing a document in which the number of characters accommodated in each page fluctuates and assigning document data to each page for typesetting, the ruby addition determining means 2 uses logical divisions based on the pages determined by typesetting. The ruby is determined to be given.

【００１９】さらに、論理区分記憶手段１は複数種類の
論理区分と各論理区分の種類ごとに優先度を記憶するよ
うにして、ルビ付与判定手段２は論理区分記憶手段１に
記憶された複数種類の論理区分ごとの優先度に基づいて
ルビ付与の判定をするようにもしている。Further, the logical partition storage means 1 stores a plurality of types of logical partitions and the priority for each type of logical partition, and the ruby assignment determining means 2 stores the plurality of logical partitions stored in the logical partition storage means 1. The determination of ruby addition is made based on the priority of each logical section.

【００２０】そして、入力する文書データは第一の組版
に基づきルビを付与されたルビ付与済文書データであ
り、第二の組版によって定まるページに基づいてルビを
付与するとき、ルビ付与判定手段２は記憶された論理区
分ごとに同一文字列について、最初に出現する文字列以
外の文字列にはルビを非付与と判定するようにして、ル
ビ付与手段３は文書データの上記ルビ非付与と判定され
た文字列にはルビを付与しないようにしている。The input document data is ruby-added document data to which ruby has been added based on the first typesetting. When adding ruby based on a page determined by the second typesetting, the ruby addition determining means 2 Is such that, for the same character string for each of the stored logical divisions, ruby is determined not to be added to a character string other than the character string that appears first, and the ruby adding means 3 determines that the ruby is not added to the document data. No ruby is added to the given character string.

【００２１】このように、論理区分記憶手段１にページ
単位とか見開き２ページを単位とするとか、あるいは、
章や節を単位としたり、オンライン文書の場合には行数
を単位としたりするように、各種の論理的な区分の種類
が記憶できるようにしてあり、これらの一つ、あるいは
複数の区分単位を同時に、有効にして、有効にされたそ
れぞれの区分の範囲ごとに、同じ文字列が１以上の回数
出現した場合にはその先頭の文字列のみにルビをふるよ
うにしている。As described above, in the logical partition storage means 1, a page unit or a two-page spread unit is used, or
Various types of logical divisions can be stored, such as a chapter or section as a unit, or in the case of an online document, a line number. One or more of these division units can be stored. At the same time, and if the same character string appears one or more times in each of the enabled sections, ruby is applied only to the first character string.

【００２２】複数種の論理区分が有効になっているとき
には、相互の優先度の違いにより両方の論理区分共有効
にしたり、優先度の高い方の判定を有効にしたりを制御
できるようにしている。When a plurality of types of logical divisions are valid, it is possible to control whether both logical divisions are valid or whether the judgment of the higher priority is valid, depending on the difference in priority. .

【００２３】実際のページへの割り付けは、組版を行っ
たときはじめて、そのページの字の大きさや図、表の量
などにより定まり、その割り当てられたページを基にペ
ージ単位の論理区分が定まるので、組版の進行に合わせ
てルビ付与を行うようになっている。The actual assignment to a page is determined only when the typesetting is performed, based on the character size of the page, the figure, the amount of the table, and the like. Since the logical division of each page is determined based on the assigned page, In addition, ruby is added according to the progress of typesetting.

【００２４】また、再組版の場合にはすでにある組版が
実施された結果定まるページへの割り付けを基に論理区
分がなされている場合に、改めて新しい組版でページ構
成をかえる場合には入力データにすでにふられているル
ビは、当然、ルビ入力されたものとして付与される文字
列と付与するべきルビの関係を利用することになるが、
論理区分内で判定されたルビ付与する位置の文字列でな
い場合には、そこにルビを付けない、すなわち、すでに
付けられていたときにはルビを削除するようにしてい
る。Further, in the case of re-formatting, when the logical division is made based on the allocation to the page determined as a result of the already performed type-setting, when the page composition is changed again with a new type, the input data is used. Ruby already touched naturally uses the relationship between the character string given as ruby input and the ruby to be given,
If the character string is not the position of the ruby addition position determined in the logical section, no ruby is added to the character string. That is, if the character string is already attached, the ruby is deleted.

【００２５】[0025]

【発明の実施の形態】図２に本発明の実施の形態の構成
図を示す。本実施の形態においては、パーソナルコンピ
ューター、ワークステーション等の汎用的な目的で使用
される計算機上で実行するコンピュータプログラムによ
り実現する形態を示す。FIG. 2 shows a configuration diagram of an embodiment of the present invention. In the present embodiment, a mode realized by a computer program executed on a computer used for general purposes such as a personal computer and a workstation will be described.

【００２６】本発明のルビ付与システムは、処理装置、
主記憶装置、補助記憶装置、入出力装置などから構成さ
れる計算機上で、コンピュータプログラムを実行して実
現される。また、コンピュータプログラムは、フロッピ
ー（登録商標）ディスクやＣＤ−ＲＯＭ等の可搬型媒体
やネットワーク接続された他の計算機の主記憶装置や補
助記憶装置等に格納されて提供される。本発明の記録媒
体は、上記可搬型媒体、主記憶装置、補助記憶装置に該
当する。The ruby providing system according to the present invention comprises a processing device,
It is realized by executing a computer program on a computer including a main storage device, an auxiliary storage device, an input / output device, and the like. The computer program is provided by being stored in a portable medium such as a floppy (registered trademark) disk or a CD-ROM, or in a main storage device or an auxiliary storage device of another computer connected to a network. The recording medium of the present invention corresponds to the portable medium, the main storage device, and the auxiliary storage device.

【００２７】提供されたコンピュータプログラムは、可
搬型媒体から直接計算機の主記憶装置にロードされ、ま
たは、可搬型媒体から一旦補助記憶装置にコピーまたは
インストール後に、主記憶装置にロードされて実行す
る。また、ネットワーク接続された他の装置に格納され
て提供された場合も、他の装置からネットワークを経由
して受信後に、補助記憶装置にコピー、主記憶装置にロ
ードされ実行するものである。The provided computer program is loaded directly from the portable medium to the main storage device of the computer, or once copied or installed from the portable medium to the auxiliary storage device, and then loaded and executed in the main storage device. Also, when provided by being stored in another device connected to a network, it is also copied to an auxiliary storage device, loaded into a main storage device, and executed after receiving from another device via a network.

【００２８】本発明の実施の形態では、文書データを入
力するときに初めて現れた文字列にルビ制御コードとル
ビ文字を合わせて入力した文書データ、あるいは、すで
にルビがふられて作成された文書からルビに関連した制
御コードのみを残して抽出した文書データを入力とし
て、設定されたルビ変換規則や組版条件に基づいて総ル
ビ文書を組版する総ルビ自動付与組版装置を示す。According to the embodiment of the present invention, a document string is input by adding a ruby control code and a ruby character to a character string that appears for the first time when the document data is input, or a document created by adding ruby. 1 shows an automatic total ruby typesetting device that typesets a total ruby document based on set ruby conversion rules and typesetting conditions by using as input the document data extracted leaving only control codes related to ruby.

【００２９】まず。図２に基づいて装置の構成を説明す
る。２０１は文書データ、ルビ入力機構である。これ
は、利用者が文書データを原稿から読み取った文字列と
して入力する装置であって、この段階でルビをふる必要
のある文字列については初めて出現するときにルビ制御
コードとともにルビを入力しておくようにしている。以
降に示す本装置の構成、動作の説明で分かるように、ル
ビは初めて出現するときに指定しないでも必要に応じて
追加入力することもできる。First, The configuration of the device will be described based on FIG. Reference numeral 201 denotes a document data and ruby input mechanism. This is a device in which a user inputs document data as a character string read from a manuscript. At this stage, when a character string that requires ruby appears for the first time, the ruby is input together with the ruby control code. I keep it. As will be understood from the following description of the configuration and operation of the present apparatus, ruby can be additionally input as necessary without specifying it when it first appears.

【００３０】文書データ部２０５は入力された文書デー
タを蓄積するバッファである。ここには、２０２で示す
旧文書入力機構から、すでに、ある形式で組版されて文
書として構成された文書データを旧文書データ部２０３
に入力して蓄積した後、逆変換機構２０６で組版時に各
種付加された制御コードを除き、ルビ制御コードのみを
残して抽出した文書データも２０１から入力された文書
データと同様の形式として２０５に蓄積して入力データ
とすることができる。The document data section 205 is a buffer for storing input document data. Here, from the old document input mechanism denoted by reference numeral 202, document data that has been formatted and configured as a document in a certain format has already been converted into the old document data section 203.
Then, the document data extracted by leaving only the ruby control code except for the control code added variously at the time of typesetting by the inverse conversion mechanism 206 is also converted to 205 in the same format as the document data input from 201. It can be stored and used as input data.

【００３１】組版処理機構２０８は本装置の基本となる
機構であり、２０５から文書データを順次入力し、組版
のための基本書式指示機構２０４から入力して設定され
た基本書式テーブル２０７の基本書式に基づいて組版処
理を行う機構である。組版は入力された文書データを文
書としての形式を整えて各種制御コードを追加して順次
組版結果出力データ部２１０に出力されていく。A typesetting processing mechanism 208 is a basic mechanism of the present apparatus, and sequentially inputs document data from 205, and sets a basic format of a basic format table 207 set by inputting from a basic format designating mechanism 204 for typesetting. This is a mechanism for performing typesetting processing based on. For the typesetting, the input document data is formatted as a document, various control codes are added, and the output is sequentially output to the typesetting result output data unit 210.

【００３２】このように書式に対応した制御コードを付
加されてページの割り付けがなされた組版結果出力デー
タに対するルビ制御コードの付与の仕組みを以下に説明
する。A mechanism for adding the ruby control code to the formatted output result data to which the control code corresponding to the format is added and the pages are allocated will be described below.

【００３３】ちなみに、書式制御コードの例としては、
ページを構成する条件として、１ページの行数、１行の
文字数、どのような文字の大きさにするかを示すフォン
トの種類、写真や図の配置の条件などにより同じ内容を
示す文字列も印刷あるいは表示出力するときの配置は異
なるものになり、配置の制御コードとして、改行制御コ
ード、先頭文字位置の制御コード、フォント制御コード
などがある。Incidentally, as an example of the format control code,
As a condition for forming a page, a character string indicating the same content according to the number of lines per page, the number of characters per line, a font type indicating what kind of character size is to be used, a layout condition of photographs and drawings, and the like. The layout at the time of printing or display output is different, and the control codes of the layout include a line feed control code, a control code of the first character position, a font control code, and the like.

【００３４】このように、ページの構成が定まるための
条件が与えられ、その条件にしたがって各ページに対応
した文書データの文字列が定まってくる。本装置ではど
の文字列にルビを付与するかを、このページへの文字列
の割り付けが定まってはじめて決定することができる場
合を主として説明する。その他の場合として本装置で扱
うものは、ページ構成によらずルビをふる文字列を決定
する場合であり、文書データの構成として、章、節、段
落などの区分がなされているときこれらを単位としてそ
こに初めて現れる文字列にのみルビをふる場合がこれに
相当する。As described above, the conditions for determining the structure of the page are given, and the character string of the document data corresponding to each page is determined according to the conditions. In the present apparatus, a description will be mainly given of a case where a character string to be given ruby can be determined only after the assignment of the character string to this page is determined. In other cases, this device handles ruby character strings regardless of the page configuration. When document data is divided into chapters, sections, paragraphs, etc., these are used as units. This is the case when ruby is applied only to a character string that first appears there.

【００３５】ページを意識したルビのふり方にも、ペー
ジ数を単位とする場合、例えば、１ページ単位であれば
同じ文字列が何回か出現する場合にはページの最初に出
現した文字列にのみルビをふる。また、２ページ単位、
３ページ単位など、同様である。印刷される文書のペー
ジの綴じ方を意識したルビのふり方としては、見開き２
ページを単位とするのが実用的である。すなわち、見開
きは一見できる範囲であり、単に２ページに１回ルビが
現れるのとは異なり、開いて見える２ページの中に初め
て出現した時にルビがふられていると読みやすくなる効
果が大きい。見開きも当然、縦書で右からページがふら
れる場合と横書きで左からページがふられる場合によっ
て右ページ、左ページは異なるが見開きの２ページの内
のページ番号の若い方のページから順に見て初めて、と
いうことになる。If the number of pages is used as a unit for ruby preserving the page, for example, if the same character string appears several times in one page unit, the character string that appears first in the page Only ruby. Also, in units of two pages,
The same applies to a unit of three pages. Two-page spread is used as ruby pretending to be aware of how to bind pages of a printed document.
It is practical to use a page as a unit. That is, the spread is a range that can be seen at a glance, and unlike the case where ruby appears only once every two pages, the effect that the ruby is touched when first appearing on the two pages that appear open has a large effect of making it easier to read. The right page and left page are different depending on the case where the page is touched from the right in vertical writing and the case where the page is touched from the left in horizontal writing, but the two pages of the two-page spread are viewed in order from the page with the smallest page number. It's the first time.

【００３６】また、ページ構成によるがページ単位での
制御とはならないルビのふりかたも本装置での処理に含
まれる。これは、行数を単位とするルビ付与の文字列判
定方法を採用した場合である。特に、電子文書データに
おいては文書データをページに区切らず画面上でスクロ
ールして表示することが多いが、このようなときは行数
を単位としてルビをふる文字列を判定するようにしてい
る。文字数を単位としても同様な効果が得られる。Also, ruby pretending, which depends on the page configuration but is not controlled on a page basis, is also included in the processing of the present apparatus. This is a case where a ruby-added character string determination method using the number of lines as a unit is adopted. In particular, in the case of electronic document data, the document data is often displayed by scrolling on the screen without dividing the page into pages. In such a case, a character string to be added with ruby is determined in units of the number of lines. A similar effect can be obtained by using the number of characters as a unit.

【００３７】ルビ変換規則指示機構２０９はルビ付与に
関する規則を指示する機構であり、これに利用者が入力
して指示したルビ変換の規則をルビ変換規則テーブル２
１１として記憶する。ルビ対象文字列抽出機構２１２は
このルビ変換規則テーブル２１１を参照しながら、これ
に基づきルビをふる文字列を判定して抽出する。The ruby conversion rule instructing mechanism 209 is a mechanism for instructing a rule relating to ruby addition. The ruby conversion rule input and instructed by the user is stored in the ruby conversion rule table 2.
11 is stored. The ruby target character string extracting mechanism 212 refers to the ruby conversion rule table 211 and determines and extracts a character string to be ruby based on the ruby conversion rule table 211.

【００３８】ルビ変換規則テーブルには、例えば、ルビ
付与の単位の指定がある。すなわち、この単位の範囲内
で同一文字列について初めて出現した文字列にのみルビ
をふるときに指定するもので、複数を重複して指定する
こともできる。この単位は例えば、ページ、見開きペー
ジ、章などがある。また、重複して指定可能なものとし
ては、章と見開きページ、のような組み合わせがある。In the ruby conversion rule table, for example, there is designation of a unit for adding ruby. That is, this is specified when ruby is applied only to a character string that first appears for the same character string within the range of this unit, and a plurality of characters can be specified redundantly. This unit includes, for example, pages, facing pages, chapters, and the like. In addition, combinations that can be specified redundantly include combinations such as chapters and facing pages.

【００３９】このような指定がある時、組版結果出力デ
ータの指定された単位について初めて現れた文字列にル
ビがふられるが、同じ単位内に再度出現する同じ文字列
にはルビをふらないようにする。複数の単位が指定され
たときその優先順位により付与をするか、非付与とする
かを判定する。例えば、章と見開きページの単位が指定
されたとき、見開きページの優先度を高くすると、見開
きページの中で章が変わっても見開きページですでにル
ビがふられれた文字列が新たな章に現れてもそこにはル
ビを付けないようになる。When such a designation is made, ruby is added to a character string that appears for the first time in the designated unit of the formatted result output data, but ruby is not added to the same character string that appears again in the same unit. To When a plurality of units are specified, it is determined whether to give or not to give according to the priority order. For example, when the unit of chapter and facing page is specified, if the priority of facing page is high, even if the chapter changes in the facing page, the character string already rubyed on the facing page will be replaced by a new chapter. Even if it appears, you will not attach ruby to it.

【００４０】また、既存の組版済の旧文書を再組版する
場合のように既に多くのルビ制御コードがふられていて
新たにルビ付与するべき文字列を判定するとき、非付与
とする文字列については既に付与されているルビ制御コ
ードを削除することも可能としている。When a character string to be newly added with ruby is determined when a large number of ruby control codes are already applied, as in the case of re-composing an existing type-composed old document, a character string not to be added For, it is also possible to delete the already assigned ruby control code.

【００４１】ルビ対象文字列抽出機構２１２でルビを付
与する対象文字列が抽出されたときにルビ対象文字列デ
ータ部２１４にそれを記憶すると共に、文書データ部２
０５に格納されたときに既にルビ制御コードが付けられ
ていていたときには、ルビ文字列登録機構２１３でルビ
変換用テーブル２１５に対象文字列と対応してルビ文字
列を記憶するようにして、以降にルビ付与とされた文字
列に付与するルビ文字列を取り出すことができるように
している。When the ruby target character string extraction mechanism 212 extracts the ruby target character string, it stores it in the ruby target character string data section 214 and the document data section 2.
If the ruby control code has already been added when the ruby character string is stored in the ruby character string registration mechanism 213, the ruby character string is stored in the ruby conversion table 215 in correspondence with the target character string. It is possible to extract a ruby character string to be added to a character string that is set to be ruby.

【００４２】ルビ対象文字列データ部２１４に記憶され
たルビ対象文字列はルビ変換機構２１７によりルビ変換
用テーブル２１５に記憶されたルビを付与する文字列と
ルビ文字の対応に基づいてルビ制御コードを順次付与す
る。ルビ対象文字列とされているものがルビ変換用テー
ブル２１５にない場合にはルビ文字列会話入力機構２１
６から利用者に入力を受けてルビ変換機構２１７がルビ
制御コードを付与することができる。このとき、あわせ
て、ルビ変換用テーブル２１５にあらたに対応を記憶す
ることにより以降の同じ文字列のルビ変換時に利用でき
るようにしている。ルビ変換機構２１７でルビ制御コー
ドを付与された文書データは文書データ部２０５にルビ
付与済文書データとして記憶される。ルビを付与された
文書データはまとめて文書として印刷されたり、表示に
使用したりされる。The ruby target character string stored in the ruby target character string data section 214 is converted into a ruby control code based on the correspondence between the ruby character string and the ruby character stored in the ruby conversion table 215 by the ruby conversion mechanism 217. Are sequentially given. If there is no ruby target character string in the ruby conversion table 215, the ruby character string conversation input mechanism 21
6, the ruby conversion mechanism 217 can add a ruby control code upon receiving an input from the user. At this time, the correspondence is newly stored in the ruby conversion table 215 so that it can be used in the subsequent ruby conversion of the same character string. The document data added with the ruby control code by the ruby conversion mechanism 217 is stored in the document data unit 205 as ruby-added document data. The document data to which the ruby is added is printed together as a document or used for display.

【００４３】ルビ対象文字列抽出機構２１２で積極的に
ルビを削除するようにルビ対象文字列データ部に記憶さ
れていたときには、ルビ変換機構２１７において既に付
与されているルビ制御コードを削除して出力するように
している。When the ruby object character string extraction mechanism 212 has stored the ruby object character string data section so as to positively delete ruby, the ruby conversion code 217 deletes the ruby control code that has already been added. Output.

【００４４】さて、上記のような構成の総ルビ自動付与
組版装置のさらに詳細な仕組みと動作について以下に説
明する。図４はルビ付与の論理区分の例の説明図であ
る。ルビを付与する文字列か否かを判定するとき、各種
の区切りについて同じ文字列は区切りの範囲内で最初に
出現した文字列にのみルビをふるようにしている。この
区切りは組版を行ったとき定まるＮページ、見開き２ペ
ージ、章、Ｍ行を単位として設定する。これらをまとめ
て論理区分と言うことにしている。Now, a more detailed structure and operation of the automatic total ruby typesetting apparatus having the above-described configuration will be described below. FIG. 4 is an explanatory diagram of an example of a logical division of ruby addition. When judging whether or not a character string is to be given a ruby, the same character string is used only for the first character string within the range of the delimiter. This delimiter is set in units of N pages, two facing pages, chapters, and M lines determined when the typesetting is performed. These are collectively called logical divisions.

【００４５】図４の例で説明する。図の中央に上から下
に向けて本など、文書のページをイメージして示してい
る。すなわちその中身は第１章、第２章、第３章、・・
・とあり、各ページに区切られていることがわかる。中
身として２ページには図表も配置されている。文書デー
タ構成の欄には各ページに対応してページ番号が１、
２、３、４、５、・・・で示されている。また、左端に
は行番号が１から１８・・・で示されている。This will be described with reference to the example of FIG. In the center of the figure, pages of a document such as a book are imagined from top to bottom. That is, the contents are Chapter 1, Chapter 2, Chapter 3, ...
・ It can be seen that each page is separated. Charts are also arranged on page 2 as contents. In the column of the document data structure, the page number corresponding to each page is 1,
2, 3, 4, 5,... The row numbers are indicated at 1 to 18... At the left end.

【００４６】各ページは文字の大きさ、行数、行内の文
字数が文書の印刷、あるいは表示などの用途に応じて基
本書式としてあらかじめ設定されている。組版ではこれ
らの基本書式の他に、特別な文字列には大きな字体を使
用したり、行の先頭の字下げ、図表、写真等の挿入、段
落などでの改行、等々の具体的適用が行われるので、文
書データが文字列で示されてもページ内での位置関係は
組版の後定まる。In each page, the size of characters, the number of lines, and the number of characters in each line are set in advance as a basic format according to the purpose of printing or displaying a document. In addition to these basic formats, formatting uses special fonts such as large fonts, indentation at the beginning of lines, insertion of figures and tables, photographs, line breaks in paragraphs, etc. Therefore, even if the document data is represented by a character string, the positional relationship within the page is determined after the typesetting.

【００４７】図の右側にルビ付与の論理区分の例に示
す。１ページとしてあるものは、各ページ番号ごとにル
ビ付与を判定する。また、２ページとあるものは先頭か
ら２ページごとに区切ってその範囲で同じ文字列があっ
たときのルビの付与を最初に出現したものとする。見開
き２ページの場合は、単に２ページ単位の区切りと異な
り偶数ページを先頭とする２ページを単位とすることが
図の範囲を示す矢印で比較できる。The right side of the figure shows an example of the logical division of ruby addition. For one page, ruby addition is determined for each page number. Also, it is assumed that the addition of ruby when there is the same character string in the range divided into every two pages from the top for the one with two pages first appears. In the case of two-page spreads, it is possible to compare two pages starting with an even-numbered page as a unit, unlike a two-page unit, using arrows indicating the range of the figure.

【００４８】章を区切りの単位とするものは、物理的な
ページとは異なり文書データの内容に章の区切りを検出
して単位を認識するものである。図の矢印で示すよう
に、１ページの１行目から３ページの２行目までが第１
章、続いて第２章となる。このような区切りは物理的な
区切りであるページに関わる区切りと併用してルビの付
与判定に利用されることがある。In the case where the chapter is used as the unit of division, unlike a physical page, the unit of the chapter is recognized by detecting the division of the chapter in the contents of the document data. As shown by the arrows in the figure, the first line of page 1 to the second line of page 3 are the first line.
Chapter and then Chapter 2. Such a break may be used for judging ruby in combination with a break related to a page which is a physical break.

【００４９】図の右端に示す１０行を区切り単位とする
例は、図の左端に行番号を付けてあるように文書の先頭
から行を数えて文書の位置を示すとき、行数を指定して
範囲を示し、その範囲内でルビの付与を判定することに
なる。これは、画面に順次表示するような形式の文書の
ときに有効になるもので、ページの区切りがないとき一
定行数内に現れた最初の文字列にルビをふるものであ
る。In the example in which the ten lines shown at the right end of the figure are used as a delimiter unit, when lines are counted from the beginning of the document to indicate the position of the document as indicated by the line numbers at the left end of the figure, the number of lines is specified. To indicate the range, and it is determined that the ruby is provided within the range. This is effective for a document in a format that is sequentially displayed on a screen. In the case where there is no page break, ruby is added to the first character string that appears within a certain number of lines.

【００５０】以上、図４で示したようにルビ付与を判定
する範囲はいろいろあるが、統一して判定する範囲を論
理区分とみなすようにしている。ルビの付与は同じ文字
列についてはこの論理区分の単位で最初に出現した文字
列にのみ行うように制御される。なお、図５はページ単
位、見開きページ単位のルビ付与の例を示す。As described above, there are various ranges for judging ruby as shown in FIG. 4, but the unified judging range is regarded as a logical division. The ruby is controlled so that the same character string is assigned only to the character string that first appears in the unit of the logical division. FIG. 5 shows an example of adding ruby on a page basis or on a double-page spread.

【００５１】ルビ付与の動作の説明にはいる前に、ルビ
付与対象文字列の抽出を行うとき論理区分単位で付与を
制御するためのルビ対象文字列データ部の構成を図６に
よって説明する。例として、ルビ付与論理区分として
「章」および「見開きページ」を併記している。これら
を独立して設定する場合には意味がないが、同時に指定
することもでき、そのときにはどちらの論理区分が優先
度が高いかにより両者で異なる文字列を付与対象文字列
として判定したときの採用の仕方を決めるようにしてい
る。ここでは数字が多い方が優先度が高いとしている。Before describing the ruby adding operation, the structure of the ruby target character string data section for controlling the addition in units of logical divisions when extracting the ruby adding character string will be described with reference to FIG. As an example, “chapters” and “spread pages” are also described as ruby assignment logical divisions. It does not make sense to set these independently, but they can also be specified at the same time, in which case a different character string is determined as the target character string depending on which logical section has higher priority. We decide how to recruit. Here, the higher the number, the higher the priority.

【００５２】付与決定文字列はルビをふる対象の文字列
とその発生した位置をページ番号、行番号で組にして構
成している。次に、具体例で説明する。Ａ，Ｂ、Ｃ、
Ｄ、Ｅはルビを付与する対象の文字列を示す。各文字列
の後に括弧で囲んだＰ、Ｌはページ番号、行番号を示
す。The character string to be assigned is composed of a character string to be subjected to ruby and a position where the character string is generated by a page number and a line number. Next, a specific example will be described. A, B, C,
D and E indicate character strings to which ruby is added. P and L enclosed in parentheses after each character string indicate a page number and a line number.

【００５３】ここでは、組版が進んで、論理区分として
「見開き２ページ」については４ページと５ページの範
囲を処理しており処理は５ページの３行目を処理したと
ころである。また、論理区分の「章」は４ページの何行
目かから新たな章が始まった状態でやはり５ページ目の
処理中である。In this case, as the typesetting progresses, the range of 4 pages and 5 pages is processed for the "two-page spread" as a logical division, and the processing has just been performed on the third line of the fifth page. The logical chapter "chapter" is also processing the fifth page in a state where a new chapter has been started from what line of the fourth page.

【００５４】まず、一般的に指定される見開き２ページ
のみの場合としてみると、処理の進行は見開き２ページ
の最初のページである４ページ目において文字列Ａ、
Ｂ、Ｃ、Ｄが少なくとも１回出現したことを示してい
る。すなわち、この２ページの範囲においては４ページ
の１行目で文字列Ａが現れている。その後，文字列Ａが
現れてもこの２ページの範囲ではルビふりは行われない
のでこのルビ対象文字列データ部に記録されていない。
他の文字列Ｂ、Ｃ、Ｄも４ページ目で既に出現している
ように位置が記録されていることが分かる。文字列Ｅは
５ページの２行目で初めて現れていると記録されてい
る。First, assuming that only a two-page spread, which is generally specified, proceeds with the character strings A, A on the fourth page, which is the first page of the two-page spread.
B, C, and D appear at least once. That is, in the range of the two pages, the character string A appears on the first line of the fourth page. After that, even if the character string A appears, ruby is not performed in the range of these two pages, so that it is not recorded in the ruby target character string data portion.
It can be seen that the positions of the other character strings B, C, and D are recorded as if they had already appeared on the fourth page. It is recorded that the character string E first appears on the second line of page 5.

【００５５】この後、５ページ目の３行目以降について
処理が進み、何らかの新たなルビを付与する対象が検出
されたときには、このルビ対象文字列データ部を検索し
て同じ文字列が既にルビ付与されていないかを調べるこ
とになる。既にルビがふられている場合にはそのまま処
理を次に進める。この論理区分において初めて現れた文
字列はこのルビ対象文字列データ部にその出現した位置
とともにあらたに記録することになる。Thereafter, the process proceeds from the third line on the fifth page, and when a target to which some new ruby is to be added is detected, the ruby target character string data portion is searched to find that the same character string is already in ruby. It will be checked whether it has been granted. If ruby has already been added, the process proceeds as it is. The character string that appears for the first time in this logical division is newly recorded in this ruby character string data section together with the position where it appears.

【００５６】同様に、「章」を論理区分としたときの例
を示している。同じ文書データを対象としているが、論
理区分「章」が先に述べたように、４ページの途中行か
ら始まっているので、４ページの１行目から３行目で現
れた文字列Ａ、Ｂ、Ｃは前の章ということで記録されて
いない。４ページ目５行目の文字列Ｄは前記見開きペー
ジを論理区分としたときに記録されているのと同じ位置
で初めて出現したとして記録されている。ページ５の１
行目の文字列Ｃは４ページ目の３行目に既に現れ、見開
きページの例ではそこを初めて出現した位置として記録
されているが、この章としては初めてであり５ページ目
の位置が初めてであると記録されている。Similarly, an example is shown in which “chapter” is a logical division. Although the same document data is targeted, since the logical division “chapter” starts from the middle line of page 4 as described above, the character string A appearing on the first to third lines of page 4 B and C are not recorded as the previous chapter. The character string D on the fourth line and the fifth line is recorded as appearing for the first time at the same position as that recorded when the facing page is set as the logical division. Page 5 of 1
The character string C on the fourth line has already appeared on the third line on the fourth page, and is recorded as the position where it first appeared in the double-page spread example, but this is the first time for this chapter and the position on the fifth page is the first time. Is recorded.

【００５７】文字列Ｅは見開きページと一緒である。文
字列Ａが５ページ目の３行目に章としては初めて現れて
いるので４ページの１行目にも現れているが５ページ目
に記録されている。The character string E is the same as the double-page spread. Since the character string A first appears as a chapter on the third line of the fifth page, it also appears on the first line of the fourth page, but is recorded on the fifth page.

【００５８】以上は各論理区分をひとつだけ設定したと
きの場合を併記したが、これらを二つ同時にルビ付与す
べき文字列を判別する論理区分として設定したときに
は、この説明図では優先順位が章を単位とする付与判定
を優先するようになっているので、見開きページを論理
区分とする場合のルビ付与文字に加えて章を単位とする
論理区分の付与文字列を重ね合わせるようにする。具体
例では、見開きページの付与決定文字列に５ページ１行
目の文字列Ｃ、５ページ３行目の文字列Ａを加えること
になる。In the above description, the case where only one logical section is set is described. However, when these two logical sections are set at the same time as the logical section for judging the character string to be added with ruby, the priority order in this explanatory diagram is changed to a chapter. Is given priority in addition to the ruby-added character in the case where a double-page spread is used as a logical division, and the added character string of the logical division in units of chapters is superimposed. In a specific example, a character string C on the first line of page 5 and a character string A on the third line of page 5 are added to the assignment determination character string of the double-page spread.

【００５９】以上のような構成の装置におけるルビ付与
文字列の判定を行う動作をフローチャートにしたがって
説明する。図７に総ルビ付与の動作を説明するフローチ
ャートを示す。The operation of determining the ruby-added character string in the apparatus having the above configuration will be described with reference to the flowchart. FIG. 7 is a flowchart illustrating the operation of the total ruby addition.

【００６０】図２の本発明の実施の形態の構成図におい
て基本書式、ルビ変換規則が設定され、組版処理機構２
０８が文書データ部２０５から文書データを入力して設
定された基本書式に基づいて組版が行われ、組版結果出
力データ部２１０に出力される。この組版結果出力デー
タを入力として、順次ルビ変換規則に基づいてルビ付与
を行う動作を説明する。In the block diagram of the embodiment of the present invention shown in FIG. 2, a basic format and ruby conversion rules are set,
In step 08, the document data is input from the document data unit 205, the document is formatted based on the set basic format, and output to the formatted result output data unit 210. An operation of sequentially inputting ruby conversion based on the ruby conversion rule using the output of the typesetting result will be described.

【００６１】初期設定については図示していないが、処
理を進めるにあたって組版結果出力データ部２１０から
入力するデータのどの部分を処理の対象としているのか
を示すポインタを初期値に設定するようになっている。Although the initial setting is not shown, a pointer indicating which part of the data input from the typesetting result output data section 210 is to be processed is set to an initial value when the processing is advanced. I have.

【００６２】すなわち、ポインタには、データの文字列
を取り出す文字位置、組版された結果のどのページのど
の行を処理しているのか、また、組版された文書データ
の先頭からの行番号、文書を構成する章、節、段落等の
位置がそれぞれカウントされて管理されるようにしてい
る。これらの位置情報を総称してポインタといってい
る。That is, the pointer indicates the character position at which the character string of the data is to be extracted, the line on which page of the formatted result is being processed, the line number from the beginning of the formatted document data, and the document. Are counted and managed. These pieces of position information are collectively called a pointer.

【００６３】ステップＳ７０２ではポインタ位置から組
版済文書データを順次走査して文字列を取り出す。ルビ
変換規則としてルビをふるべき文字列を判定する単位と
して論理区分を設定しているが、ステップＳ７０５では
文字列を順次取り出すときポインタの移動にともない設
定された論理区分が変化したか否かを検出するようにし
ている。例えば、行番号が変わり、論理区分として設定
された行数の区切りを超えたとき、あるいは、ページ番
号が変わったとき、もし論理区分が見開き２ページであ
れば、奇数ページから偶数ページに変化すれば論理区分
が変化したと判定することになる。In step S702, the formatted document data is sequentially scanned from the pointer position to extract a character string. Although a logical division is set as a unit for determining a character string to be added as ruby conversion rules, it is determined in step S705 whether or not the set logical division has changed with the movement of the pointer when sequentially extracting character strings. I try to detect. For example, when the line number changes and exceeds the line number set as a logical division, or when the page number changes, if the logical division is two-page spread, the page may change from an odd page to an even page. For example, it is determined that the logical division has changed.

【００６４】ステップＳ７０５で変化なしとしたときに
は、ステップＳ７０８で組版済の文書データから文字列
を切り出す。これは、通常の文書処理の技術でひとつな
がりの単語を認識して切り出すもので、例えば、フリガ
ナを付ける対象となる漢字列や外国語文字列を単位とす
るものである。If there is no change in step S705, a character string is cut out from the formatted document data in step S708. This is a method of recognizing and cutting out a series of words by a normal document processing technique, for example, using a kanji character string or a foreign language character string to which a reading is added as a unit.

【００６５】ステップＳ７１０では切り出した文字列が
既にルビ対象文字列データ部２１４の中の現在ポインタ
が示す論理区分内に登録された文字列があるかを検索し
て判定する。このとき、ルビ付与の変換規則として複数
の論理区分が指定されていたときには、論理区分間の優
先度にしたがい検索の順序を決めるようにする。In step S710, it is determined whether or not the cut-out character string is already registered in the logical segment indicated by the current pointer in the ruby target character string data section 214. At this time, when a plurality of logical divisions are specified as the conversion rule for adding ruby, the search order is determined according to the priority between the logical divisions.

【００６６】まだ登録されていないときには、ステップ
Ｓ７１１において、ルビ対象文字列データ部２１４の論
理区分毎に文字列とその位置を登録する。If not registered, in step S711, a character string and its position are registered for each logical division of the ruby target character string data section 214.

【００６７】ステップＳ７１３では、切り出した文字列
に既にルビ制御コードが付けられているか否かを判定す
る。既にルビ制御コードがふられているのは、文書デー
タを入力するときルビをふるべき文字列であるとして付
けられたか、あるいは旧文書入力機構２０２から旧文書
データが入力されその文書の構成においてルビがふられ
た文字列であった場合である。In step S713, it is determined whether a ruby control code has already been added to the extracted character string. The reason why the ruby control code has already been added is that a character string to be added with ruby is added when inputting document data, or that the old document data is input from the old document input mechanism 202 and the ruby is included in the structure of the document. This is the case when the character string is touched.

【００６８】未だルビ制御コードが付けられていないと
きにはステップＳ７１５において、対象とする切り出さ
れた文字列がルビ変換用テーブル２１５にあるか否かを
検索して調べる。ルビ変換用テーブル２１５にあったと
きにはステップＳ７１２に移動する。If the ruby control code has not been added yet, in step S 715, it is checked by searching whether or not the target cut-out character string is in the ruby conversion table 215. If it is in the ruby conversion table 215, the process moves to step S712.

【００６９】未だ登録されていないときにはステップＳ
７１６で新たにどのようなルビを付けるべきかを利用者
に入力させる。If not registered, step S
At 716, the user is prompted to enter what ruby is to be added.

【００７０】ステップＳ７１２において既に登録済ある
いは新たに入力されたルビをルビ制御コードとして文書
データに付与する。In step S712, the already registered or newly input ruby is added to the document data as a ruby control code.

【００７１】ステップＳ７１４において、ルビ未登録の
文字列はルビ変換用テーブルに登録する。これにより、
同じ文字列が検出されたときには同じルビをふることが
できるようになる。In step S714, the character string that has not been registered as ruby is registered in the ruby conversion table. This allows
When the same character string is detected, the same ruby can be played.

【００７２】ステップＳ７１３でルビ制御コードが付け
られていたときにはステップＳ７１４でルビ変換用テー
ブルに反映させた後ステップＳ７０２にもどる。If the ruby control code is added in step S713, the data is reflected in the ruby conversion table in step S714, and the process returns to step S702.

【００７３】ステップＳ７１０において論理区分内に既
にルビを付与した文字列であるとされたときには、ステ
ップＳ７０１で切り出した文字列にルビ制御コードが既
に付与されているか否かを調べる。ルビ制御コードが付
けられていなければステップＳ７０２に移り次の文書デ
ータから新たな文字列の切り出しを行うようにする。If it is determined in step S710 that the character string has already been provided with ruby in the logical section, it is checked whether or not the ruby control code has already been added to the character string cut out in step S701. If the ruby control code has not been added, the process proceeds to step S702, and a new character string is cut out from the next document data.

【００７４】もし、既にルビ制御コードが付与されてい
たとすると、ルビ変換規則ではルビを付与しないとして
いる文字列にルビが付けられないようにしなければなら
ないので、ルビ制御コードを削除しなければならない。
すなわち、ステップＳ７０４において、既に付与されて
いたルビ制御コードを削除してからステップＳ７０２に
戻る。If the ruby control code has already been assigned, the ruby conversion rule must prevent ruby from being attached to a character string for which no ruby is to be assigned, and the ruby control code must be deleted. .
That is, in step S704, the already assigned ruby control code is deleted, and the process returns to step S702.

【００７５】ステップＳ７０５において、新たに文字列
を探したとき論理区分が変化したときには、ステップＳ
７０６で変化した論理区分内の登録文字列をリセットす
る。これにより、変化した新たな区分については文書デ
ータから切り出される文字列は今まで出現していたとし
て記憶された文字列はなくなり、ここから初めて出現し
た文字列にルビが付けられることになる。複数の論理区
分がルビ付与判断の対象となっているときには変化のな
かった論理区分に記憶されている文字列はそのまま継続
してルビ付与の判断に使用される。In step S705, when a new character string is searched and the logical division changes,
In step 706, the registered character string in the changed logical section is reset. As a result, for the changed new section, the character string cut out from the document data does not have the character string stored as having appeared so far, and the character string that first appeared from this point is added with ruby. When a plurality of logical sections are subject to ruby addition determination, the character strings stored in the logical sections that have not changed are continuously used for ruby addition determination.

【００７６】ステップＳ７０９で文書データの最後まで
終わったことが確認されていればルビ付与処理を終了す
る。途中であればステップＳ７０２に戻ってポインタを
進めて、次の文字列の取り出しを行ってルビ付与処理を
継続する。If it is confirmed in step S709 that the end of the document data has been completed, the ruby adding process ends. If it is in the middle, the process returns to step S702 to advance the pointer, take out the next character string, and continue the ruby adding process.

【００７７】以上説明したように、文書全体にルビを付
与する総ルビ付与の処理が組版された文書データに１以
上の論理区分を設定することで適切な位置の文字列にル
ビを付与することができることが分かる。（付記１）文書データを論理的に区分する１以上の論理
区分を設定して記憶する論理区分記憶手段と、文書デー
タからルビを付与する候補文字列を抽出するルビ付与候
補文字列抽出手段と、記憶された論理区分ごとに、同一
文字列からなる候補文字列については最初に出現する候
補文字列をルビ付与対象文字列として判定するルビ付与
判定手段と、文書データの上記判定されたルビ付与対象
文字列にルビを付与するルビ付与手段と、を備えること
を特徴とするルビ付与システム。（付記２）各ページごとの収容文字数が変動する文書を
編成して文書データを各ページに割り当てて組版すると
き、ルビ付与判定手段は組版して定まるページに基づい
た論理区分を用いてルビの付与判定をすることを特徴と
する付記１記載のルビ付与システム。（付記３）論理区分記憶手段は複数種類の論理区分と各
論理区分の種類ごとに優先度を記憶するようにして、ル
ビ付与判定手段は論理区分記憶手段に記憶された複数種
類の論理区分ごとの優先度に基づいてルビ付与の判定を
することを特徴とする付記１乃至付記２のいずれかに記
載のルビ付与システム。（付記４）入力する文書データは第一の組版に基づきル
ビを付与されたルビ付与済文書データであり、第二の組
版によって定まるページに基づいてルビを付与すると
き、ルビ付与判定手段は記憶された論理区分ごとに同一
文字列について、最初に出現する文字列以外の文字列に
はルビを非付与と判定するようにして、ルビ付与手段は
文書データの上記ルビ非付与と判定された文字列にはル
ビを付与しないようにすることを特徴とする付記１乃至
付記３のいずれかに記載のルビ付与システム。（付記５）コンピュータを、文書データを論理的に区分
する１以上の論理区分を設定して記憶する論理区分記憶
手段、文書データからルビを付与する候補文字列を抽出
するルビ付与候補文字列抽出手段、記憶された論理区分
ごとに、同一文字列からなる候補文字列については最初
に出現する候補文字列をルビ付与対象文字列として判定
するルビ付与判定手段、文書データの上記判定されたル
ビ付与対象文字列にルビを付与するルビ付与手段、とし
て機能させるプログラム。（付記６）コンピュータを、文書データを論理的に区分
する１以上の論理区分を設定して記憶する論理区分記憶
手段、文書データからルビを付与する候補文字列を抽出
するルビ付与候補文字列抽出手段、記憶された論理区分
ごとに、同一文字列からなる候補文字列については最初
に出現する候補文字列をルビ付与対象文字列として判定
するルビ付与判定手段、文書データの上記判定されたル
ビ付与対象文字列にルビを付与するルビ付与手段、とし
て機能させるプログラムを記録したコンピュータ読み取
り可能な記録媒体。（付記７）各ページごとの収容文字数が変動する文書を
編成して文書データを各ページに割り当てて組版すると
き、論理区分記憶手段は組版して定まるページを論理区
分として記憶するようにすることを特徴とする付記１記
載のルビ付与システム。（付記８）各ページごとの収容文字数が変動する文書を
編成して文書データを各ページに割り当てて組版すると
き、論理区分記憶手段は組版して定まる見開き２ページ
を論理区分として記憶するようにすることを特徴とする
付記１記載のルビ付与システム。As described above, the ruby is added to the entire document by assigning one or more logical divisions to the formalized document data by adding the ruby to the character string at an appropriate position. You can see that it can be done. (Supplementary Note 1) Logical division storage means for setting and storing one or more logical divisions for logically dividing document data, and ruby adding candidate character string extracting means for extracting a candidate character string to add ruby from the document data For each of the stored logical divisions, for a candidate character string consisting of the same character string, a ruby addition determining unit that determines a candidate character string that appears first as a ruby addition target character string, and the determined ruby addition of the document data. A ruby providing system, comprising: a ruby providing means for adding ruby to a target character string. (Supplementary Note 2) When organizing a document in which the number of characters accommodated in each page fluctuates and assigning document data to each page for typesetting, the ruby addition determination unit uses the logical division based on the page determined by typesetting to determine the ruby. The ruby providing system according to claim 1, wherein the ruby is provided. (Supplementary Note 3) The logical partition storage means stores a plurality of types of logical partitions and the priority for each type of each logical partition, and the ruby addition determining unit stores a plurality of types of logical partitions stored in the logical partition storage means. 3. The ruby providing system according to any one of supplementary notes 1 and 2, wherein a ruby is determined based on the priority of the ruby. (Supplementary Note 4) The input document data is ruby-attached document data to which ruby has been added based on the first typesetting. When adding ruby based on a page determined by the second typesetting, the ruby addition determining means stores For the same character string for each of the logical divisions, ruby is determined not to be added to a character string other than the character string that appears first, and the ruby adding means determines the character of the document data that has been determined to be non-ruby. The ruby providing system according to any one of supplementary notes 1 to 3, wherein ruby is not provided to the columns. (Supplementary Note 5) A logical division storage means for setting and storing one or more logical divisions for logically dividing document data in a computer, and extracting a ruby addition candidate character string for extracting a candidate character string to which ruby is to be added from the document data Means, for each of the stored logical divisions, for a candidate character string consisting of the same character string, ruby addition determining means for determining a candidate character string appearing first as a character string to be added with ruby, and the above determined ruby addition of document data A program that functions as ruby adding means for adding ruby to a target character string. (Supplementary Note 6) Logical division storage means for setting and storing one or more logical divisions for logically dividing document data, and extracting a ruby addition candidate character string for extracting a candidate character string for adding ruby from the document data Means, for each of the stored logical divisions, for a candidate character string consisting of the same character string, ruby addition determining means for determining a candidate character string appearing first as a character string to be added with ruby, and the above determined ruby addition of document data A computer-readable recording medium that records a program that functions as ruby adding means for adding ruby to a target character string. (Supplementary Note 7) When organizing a document in which the number of characters accommodated for each page fluctuates and assigning document data to each page for typesetting, the logical section storage means stores the pages determined by typesetting as logical sections. 3. The ruby providing system according to claim 1, wherein (Supplementary Note 8) When organizing a document in which the number of characters accommodated in each page fluctuates and assigning document data to each page for typesetting, the logical division storage means stores two facing spread pages determined by typesetting as logical divisions. 3. The ruby providing system according to claim 1, wherein

【００７８】[0078]

【発明の効果】上記に説明したように、文書データを組
版によって物理的な文書の配置が決まった状態に対応し
て、文書の読み手にとって読みやすい間隔と位置に、文
字列の読解に役立つルビを文書全体にわたって配置して
付与することが実現できるので、読み手に対しては文書
読み取り効率の向上、すなわち、文書読解の正確さ向上
による各種業務処理品質の向上が図れ、また、読みやす
い文書を少ない工数で間違いなく作成することを容易に
実現することによる文書作成の生産性向上が図れるとい
う効果を奏することができる。As described above, in accordance with the state in which the physical arrangement of the document is determined by the typesetting of the document data, at the intervals and positions that are easy for the reader of the document to read, the ruby that is useful for reading the character string is provided. Can be arranged and applied over the entire document, so that readers can improve the document reading efficiency, that is, improve the quality of various business processes by improving the accuracy of document reading comprehension. It is possible to achieve an effect that productivity of document creation can be improved by easily realizing creation without mistake with a small number of man-hours.

[Brief description of the drawings]

【図１】本発明の構成図FIG. 1 is a block diagram of the present invention.

【図２】本発明の実施の形態の構成図FIG. 2 is a configuration diagram of an embodiment of the present invention.

【図３】ルビを付与した文書データとルビ制御コード
の例FIG. 3 shows an example of ruby-added document data and ruby control code

【図４】ルビ付与の論理区分の例の説明図FIG. 4 is an explanatory diagram of an example of logical division of ruby addition

【図５】ページ単位、見開きページ単位のルビ付与の
例[Fig. 5] Example of adding ruby in page units and spread pages

【図６】ルビ対象文字列データ部の構成例の説明図FIG. 6 is an explanatory diagram of a configuration example of a ruby target character string data section.

【図７】総ルビ付与の動作を説明するフローチャートFIG. 7 is a flowchart illustrating an operation of providing total ruby.

[Explanation of symbols]

１論理区分記憶手段２ルビ付与判定手段３ルビ付与手段４ルビ付与候補文字列抽出手段 DESCRIPTION OF SYMBOLS 1 Logical division storage means 2 Ruby addition determination means 3 Ruby addition means 4 Ruby addition candidate character string extraction means

フロントページの続きＦターム(参考） 5B009 MJ04 Continued on the front page F term (reference) 5B009 MJ04

Claims

[Claims]

1. A logical division storage means for setting and storing one or more logical divisions for logically dividing document data, and a ruby addition candidate character string extraction means for extracting a candidate character string to which ruby is to be added from the document data And for each of the stored logical divisions, for a candidate character string consisting of the same character string, ruby addition determining means for determining a candidate character string that appears first as a ruby addition target character string; A ruby providing system, comprising: a ruby providing unit for adding ruby to a character string to be provided.

2. The method according to claim 1, further comprising: when organizing a document in which the number of characters accommodated in each page varies and allocating document data to each page for typesetting, a ruby addition determination unit uses a logical division based on the page determined by typesetting. 2. The ruby providing system according to claim 1, wherein the determination is made.

3. The logical partition storage means stores a plurality of types of logical partitions and a priority for each type of logical partition, and the ruby assignment determining means stores the plurality of types of logical partitions stored in the logical partition storage means. The ruby adding system according to claim 1, wherein the ruby adding is determined based on the priority of each ruby.

4. The document data to be input is ruby-added document data to which ruby has been added based on the first typesetting. When adding ruby based on a page determined by the second typesetting, the ruby adding determining means includes: With respect to the same character string for each of the stored logical divisions, ruby is determined not to be added to a character string other than the character string that first appears, and the ruby adding means is determined to be the above-described non-ruby addition of the document data. 4. The ruby providing system according to claim 1, wherein ruby is not added to the character string.

5. A logical division storage means for setting and storing one or more logical divisions for logically dividing document data, a ruby addition candidate character string for extracting a candidate character string for adding ruby from the document data. Extracting means, for each of the stored logical divisions, for a candidate character string consisting of the same character string, ruby addition determining means for determining a candidate character string that appears first as a ruby addition target character string, A program that functions as ruby providing means for adding ruby to a character string to be provided.