JP2019057152A

JP2019057152A - Information processing method, information processing device and data structure

Info

Publication number: JP2019057152A
Application number: JP2017181428A
Authority: JP
Inventors: 吉和斎藤; Yoshikazu Saito; 忠雄富松; Tadao Tomimatsu; 直多田; Sunao Tada
Original assignee: Kyodo Printing Co Ltd
Current assignee: Kyodo Printing Co Ltd
Priority date: 2017-09-21
Filing date: 2017-09-21
Publication date: 2019-04-11

Abstract

To provide an information processing device by which it is not easy to extract character codes from generated text data.SOLUTION: An information processing device is provided with a processing part 11 having a container data generation part 11b which generates container data having arrangement information of one or more paragraphs based on arrangement information of one or more paragraphs included in text data having a plurality of character codes associated with fonts, a block data generation part 11c which generates block data having the arrangement information of one or more character codes based on the arrangement information of one or more character codes included in the paragraphs for each of the one or more paragraphs included in the text data, and a character image data generation part 11d which generates character image data in which the character codes are converted into image information based on the fonts associated with the character codes for each of the plurality of character codes included in the text data.SELECTED DRAWING: Figure 3

Description

本発明は、情報処理方法、情報処理装置及びデータ構造に関する。 The present invention relates to an information processing method, an information processing apparatus, and a data structure.

従来、端末を用いて電子書籍を表示して閲覧することが行われている。 Conventionally, an electronic book is displayed and browsed using a terminal.

例えば、ユーザは、端末を用いて、電子書籍データを記憶しているサーバに対して、ネットワークを介して電子書籍データを要求する。サーバは、要求された電子書籍データを、ネットワークを介して端末に送信する。ユーザは、端末を用いて、受信した電子書籍データを表示することにより、電子書籍を閲覧する。 For example, a user uses a terminal to request electronic book data via a network from a server storing electronic book data. The server transmits the requested electronic book data to the terminal via the network. The user browses the electronic book by displaying the received electronic book data using the terminal.

電子書籍データは、複数の文字コード及びそれらの配置情報と、文字コードのフォント等の修飾情報とを有する。文字コードの配置情報は、例えばＨＴＭＬ（ＨｙｐｅｒＴｅｘｔＭａｒｋｕｐＬａｎｇｕａｇｅ：ハイパーテキスト・マークアップ・ランゲージ）で記述されたＨＴＭＬ情報で与えられ、修飾情報は、例えばＣＳＳ（ＣａｓｃａｄｉｎｇＳｔｙｌｅＳｈｅｅｔｓ：カスケーディング・スタイル・シート）で記述されたＣＳＳ情報で与えられる。 The electronic book data includes a plurality of character codes and their arrangement information, and modification information such as character code fonts. The character code arrangement information is given as HTML information described in, for example, HTML (HyperText Markup Language), and the modification information is, for example, CSS (Cascading Style Sheets). It is given by CSS information described in (1).

端末は、ウェブブラウザ等の閲覧機能を実行して、電子書籍データのＣＳＳ情報を解析しながら、ＨＴＭＬ情報に基づいた文字を、所定のフォントで表示する。例えば、端末は、電子書籍データに含まれる文字コードに対して、フォントパッケージを参照して、文字コードと関連づけられたフォント字形データを読み出し、文字画像を生成して表示する。 The terminal executes a browsing function such as a web browser, and displays characters based on the HTML information in a predetermined font while analyzing the CSS information of the electronic book data. For example, the terminal refers to the font package with respect to the character code included in the electronic book data, reads font character form data associated with the character code, generates a character image, and displays the character image.

実開２０１３−２１０５８９号公報Japanese Utility Model Publication No. 2013-210589 特開２０１５−２６１４５号公報Japanese Patent Application Laid-Open No. 2015-26145

端末では、受信した電子書籍データに基づいて、文字コードを抜き取ることが可能である。仮に、電子書籍データから文字コード及び配置情報が抜き取られると、電子書籍の著作権保護の観点から問題となるおそれがある。 The terminal can extract the character code based on the received electronic book data. If the character code and the arrangement information are extracted from the electronic book data, there is a possibility that it may become a problem from the viewpoint of protecting the copyright of the electronic book.

そこで、端末が受信した電子書籍データから文字コードを容易に抜き取ることができないようにすることが好ましい。 Therefore, it is preferable that the character code cannot be easily extracted from the electronic book data received by the terminal.

本明細書では、文字コードを抜き取ることが容易ではない電子書籍のデータを生成する情報処理方法及び情報処理装置を提供することを課題とする。 This specification makes it a subject to provide the information processing method and information processing apparatus which produce | generate the data of the electronic book whose character code is not easy to extract.

また、本明細書では、文字コードを抜き取ることが容易ではない電子書籍のデータ構造を提供することを課題とする。 In addition, an object of the present specification is to provide a data structure of an electronic book in which it is not easy to extract a character code.

本明細書に開示する情報処理方法によれば、フォントと関連づけられた複数の文字コードを有する文章データに含まれる１又は複数の段落の配置情報に基づいて、１又は複数の段落の配置情報を有するコンテナデータを生成することと、文章データに含まれる１又は複数の段落のそれぞれについて、当該段落に含まれる１又は複数の文字コードの配置情報に基づいて、１又は複数の文字コードの配置情報を有するブロックデータを生成することと、文章データに含まれる複数の文字コードのそれぞれについて、当該文字コードと関連付けられているフォントに基づいて、当該文字コードが画像情報に変換された文字画像データを生成することと、を含む。 According to the information processing method disclosed in this specification, the arrangement information of one or more paragraphs is obtained based on the arrangement information of one or more paragraphs included in sentence data having a plurality of character codes associated with a font. Generating container data and, for each of one or more paragraphs included in the text data, arrangement information of one or more character codes based on arrangement information of one or more character codes included in the paragraph The character image data obtained by converting the character code into image information based on the font associated with the character code for each of the plurality of character codes included in the text data. Generating.

また、本明細書に開示する情報処理装置によれば、フォントと関連づけられた複数の文字コードを有する文章データに含まれる１又は複数の段落の配置情報に基づいて、１又は複数の段落の配置情報を有するコンテナデータを生成するコンテナデータ生成部と、文章データに含まれる１又は複数の段落のそれぞれについて、当該段落に含まれる１又は複数の文字コードの配置情報に基づいて、１又は複数の文字コードの配置情報を有するブロックデータを生成するブロックデータ生成部と、文章データに含まれる複数の文字コードのそれぞれについて、当該文字コードと関連付けられているフォントに基づいて、当該文字コードが画像情報に変換された文字画像データを生成する文字画像データ生成部と、を有する処理部を備える。 Further, according to the information processing apparatus disclosed in the present specification, the arrangement of one or more paragraphs based on the arrangement information of one or more paragraphs included in text data having a plurality of character codes associated with a font A container data generation unit that generates container data having information, and one or a plurality of paragraphs included in the sentence data based on arrangement information of one or a plurality of character codes included in the paragraph A block data generation unit that generates block data having character code arrangement information, and for each of a plurality of character codes included in text data, the character code is image information based on a font associated with the character code. And a character image data generation unit that generates character image data converted into a character image data.

更に、本明細書に開示するデータ構造によれば、フォントと関連づけられた複数の文字コードを有する文章データに含まれる１又は複数の段落の配置情報を有するコンテナデータと、文章データに含まれる１又は複数の段落のそれぞれについて、当該段落に含まれる１又は複数の文字コードの配置情報を有するブロックデータと、文章データに含まれる複数の文字コードのそれぞれが、当該文字コードと関連付けられているフォントに基づいて、当該文字コードが画像情報に変換された文字画像データと、を有する。 Furthermore, according to the data structure disclosed in this specification, container data having arrangement information of one or more paragraphs included in text data having a plurality of character codes associated with a font, and 1 included in text data. Alternatively, for each of a plurality of paragraphs, a block data having arrangement information of one or more character codes included in the paragraph and a plurality of character codes included in the sentence data are associated with the character codes. And character image data in which the character code is converted into image information.

上述した本明細書に開示する情報処理方法によれば、生成された文章データから文字コードを抜き取ることは容易ではない。 According to the information processing method disclosed in the present specification described above, it is not easy to extract a character code from the generated text data.

また、上述した本明細書に開示する情報処理装置によれば、生成された文章データから文字コードを抜き取ることは容易ではない。 Moreover, according to the information processing apparatus disclosed in the present specification described above, it is not easy to extract the character code from the generated sentence data.

更に、上述した本明細書に開示するデータ構造によれば、文字コードを抜き取ることは容易ではない。 Furthermore, according to the data structure disclosed in the present specification described above, it is not easy to extract a character code.

本明細書に開示するシステムの一実施形態を示す図である。FIG. 1 illustrates one embodiment of a system disclosed herein. 情報処理装置を示す図である。It is a figure which shows information processing apparatus. （Ａ）は、情報処理装置の処理部を説明する図であり、（Ｂ）は、情報処理装置のメモリを説明する図である。(A) is a figure explaining the process part of information processing apparatus, (B) is a figure explaining the memory of information processing apparatus. サーバを示す図である。It is a figure which shows a server. 端末を示す図である。It is a figure which shows a terminal. 複数の文字コードを有する文章データの例を示す図である。It is a figure which shows the example of the text data which has a some character code. データ構造を示す図である。It is a figure which shows a data structure. コンテナデータの例を示す図である。It is a figure which shows the example of container data. 情報処理装置の動作を説明するフローチャート（その１）である。3 is a flowchart (part 1) illustrating an operation of the information processing apparatus. 入力された文章データを示す図である。It is a figure which shows the input text data. 生成されたレイアウトデータを示す図である。It is a figure which shows the produced | generated layout data. （Ａ）は、コンテナデータ内に生成されるブロックデータを示す図であり、（Ｂ）は、文字コードが文字で表されたブロックデータを示す図である。(A) is a figure which shows the block data produced | generated in container data, (B) is a figure which shows the block data by which the character code was represented by the character. （Ａ）は、ルビ文字処理より生成されたコンテナデータを示す図（その１）であり、（Ｂ）は、ルビ文字処理より生成されたコンテナデータを示す図（その２）である。(A) is a figure (the 1) which shows the container data produced | generated by the ruby character process, (B) is a figure (the 2) which shows the container data produced | generated by the ruby character process. 親文字及びルビ文字処理より生成されたコンテナデータの他の例を示す図である。It is a figure which shows the other example of the container data produced | generated by the parent character and ruby character process. （Ａ）は、禁則文字処理を説明する図（その１）であり、（Ｂ）は、禁則文字処理を説明する図（その２）である。(A) is a diagram (part 1) for explaining prohibited character processing, and (B) is a diagram (part 2) for explaining prohibited character processing. （Ａ）は、欧文文字処理を説明する図（その１）であり、（Ｂ）は、欧文文字処理を説明する図（その２）である。(A) is a diagram (part 1) for explaining European character processing, and (B) is a diagram (part 2) for explaining European character processing. 情報処理装置の動作を説明するフローチャート（その２）である。It is a flowchart (the 2) explaining operation | movement of information processing apparatus. 文字コードの文字画像サイズを求める処理を説明する図である。It is a figure explaining the process which calculates | requires the character image size of a character code. カーニング処理を説明する図である。It is a figure explaining a kerning process. 文字画像データを生成する処理を説明する図である。It is a figure explaining the process which produces | generates character image data. 文章データが端末で表示された図である。It is the figure where text data was displayed with the terminal. 端末のレンダリング処理を説明する図（その１）である。It is FIG. (1) explaining the rendering process of a terminal. 端末のレンダリング処理を説明する図（その２）である。It is FIG. (2) explaining the rendering process of a terminal. 端末のレンダリング処理を説明する図（その３）である。It is FIG. (3) explaining the rendering process of a terminal. 端末のレンダリング処理を説明する図（その４）である。It is FIG. (4) explaining the rendering process of a terminal. 端末のレンダリング処理を説明する図（その５）である。It is FIG. (5) explaining the rendering process of a terminal. 端末のレンダリング処理を説明する図（その６）である。It is FIG. (6) explaining the rendering process of a terminal. 端末のレンダリング処理を説明する図（その７）である。It is FIG. (7) explaining the rendering process of a terminal.

以下、本明細書で開示するシステムの好ましい一実施形態を、図を参照して説明する。但し、本発明の技術範囲はそれらの実施形態に限定されず、特許請求の範囲に記載された発明とその均等物に及ぶものである。 Hereinafter, a preferred embodiment of the system disclosed in the present specification will be described with reference to the drawings. However, the technical scope of the present invention is not limited to these embodiments, but extends to the invention described in the claims and equivalents thereof.

図１は、本明細書に開示するシステムの一実施形態を示す図である。 FIG. 1 is a diagram illustrating one embodiment of a system disclosed in this specification.

本実施形態のシステム１は、情報処理装置１０と、サーバ２０と、端末３０を備える。情報処理装置１０とサーバ２０とは、ネットワークＮを介して、通信可能に接続される。また、サーバ２０と端末３０とは、ネットワークＮを介して、通信可能に接続される。更に、情報処理装置１０と端末３０とは、ネットワークＮを介して、通信可能に接続されていてもよい。 The system 1 according to this embodiment includes an information processing device 10, a server 20, and a terminal 30. The information processing apparatus 10 and the server 20 are communicably connected via the network N. The server 20 and the terminal 30 are connected via a network N so as to be communicable. Furthermore, the information processing apparatus 10 and the terminal 30 may be connected to be communicable via the network N.

システム１では、情報処理装置１０が、所定のフォントと関連づけられた複数の文字コード及びそれらの配置情報と、修飾情報とを有する文章データ（電子書籍データ）に基づいて、文字コードが画像情報に変換されたデータ構造を有する文章データを生成する。文字コードは、文字を識別する識別情報である。情報処理装置１０は、文字コードが画像情報に変換された文章データを、ネットワークＮを介して、サーバ２０へ送信する。サーバ２０は、文字コードが画像情報に変換された文章データを記憶する。 In the system 1, the information processing apparatus 10 converts the character code into image information based on text data (electronic book data) having a plurality of character codes associated with a predetermined font, arrangement information thereof, and modification information. Text data having the converted data structure is generated. The character code is identification information for identifying a character. The information processing apparatus 10 transmits the text data in which the character code is converted to image information to the server 20 via the network N. The server 20 stores text data in which the character code is converted into image information.

サーバ２０は、端末３０からの要求に応じて、文字コードが画像情報に変換された文章データを、ネットワークＮを介して、端末３０へ送信する。端末３０は、文字コードが画像情報に変換された文章データを表示して、ユーザが文章データを閲覧する。 In response to a request from the terminal 30, the server 20 transmits text data in which the character code is converted to image information to the terminal 30 via the network N. The terminal 30 displays the text data in which the character code is converted into the image information, and the user browses the text data.

情報処理装置１０により生成される文章データは、リフロー型に表示可能であってもよいし、フィックス（固定）型に表示可能であってもよい。 The text data generated by the information processing apparatus 10 may be displayed in a reflow type or may be displayed in a fixed type.

図２は、情報処理装置を示す図である。 FIG. 2 is a diagram illustrating the information processing apparatus.

情報処理装置１０は、処理部１１と、メモリ１２と、表示部１３と、入力インタフェース１４と、通信部１５を有する。 The information processing apparatus 10 includes a processing unit 11, a memory 12, a display unit 13, an input interface 14, and a communication unit 15.

処理部１１は、一つまたは複数の中央演算回路と、レジスタと、キャッシュメモリと、インタフェース等の周辺回路とを有する。処理部１１は、メモリ１２に予め記憶されている所定のコンピュータプログラム１２ａに従い、情報処理装置１０の各ハードウェア構成要素の制御及び各種処理を行い、処理中に生じるデータを一時的に保存するためにメモリ１２を利用する。 The processing unit 11 includes one or more central processing circuits, a register, a cache memory, and peripheral circuits such as an interface. The processing unit 11 controls each hardware component of the information processing apparatus 10 and performs various processes according to a predetermined computer program 12a stored in advance in the memory 12, and temporarily stores data generated during the process. The memory 12 is used.

メモリ１２は、ランダムアクセスメモリ（ＲＡＭ）若しくはリードオンリーメモリ（ＲＯＭ）等の半導体メモリ、又は磁気ディスク若しくはフラッシュメモリ等の不揮発性メモリを有していてもよい。また、メモリ１２は、非一時的な記憶媒体１２ｃに記憶されたコンピュータプログラムを、読み出し可能なドライブ（図示せず）を有していてもよい。 The memory 12 may include a semiconductor memory such as a random access memory (RAM) or a read only memory (ROM), or a nonvolatile memory such as a magnetic disk or a flash memory. The memory 12 may have a drive (not shown) that can read out the computer program stored in the non-transitory storage medium 12c.

図３（Ｂ）は、情報処理装置のメモリを説明する図である。 FIG. 3B illustrates the memory of the information processing device.

図３（Ｂ）に示すように、メモリ１２は、所定のコンピュータプログラム１２ａと、フォントパッケージ１２ｂを記憶する。 As shown in FIG. 3B, the memory 12 stores a predetermined computer program 12a and a font package 12b.

フォントパッケージ１２ｂは、１つ又は複数の種類のフォントデータを有する。フォントデータは、文字コードと関連付けた各文字のフォント字形データと、フォント字形の仮想ボディ高さ、フォント字形の仮想ボディ幅の情報を有する、また、フォントデータは、カーニング処理及び合字処理の対象となる文字コードの情報を有する。フォントデータは、フォント字形データをベクタ情報又はラスタ情報として有し得るが、フォント字形データをベクタ情報として有することが、文字が拡大又は縮小して表示されたときに、滑らかな文字を表示する観点から好ましい。 The font package 12b has one or more types of font data. The font data includes font glyph data of each character associated with the character code, information on the font glyph virtual body height and font glyph virtual body width, and the font data is subject to kerning and ligature processing. It has the information of the character code. Font data may have font glyph data as vector information or raster information, but having font glyph data as vector information means that smooth characters are displayed when the characters are displayed enlarged or reduced. To preferred.

表示部１３は、処理部１１に制御されて、情報処理装置１０の動作に伴う各種の情報を画面上に表示可能である。表示部１３として、例えば、液晶ディスプレイを用いることができる。 The display unit 13 is controlled by the processing unit 11 and can display various types of information associated with the operation of the information processing apparatus 10 on the screen. For example, a liquid crystal display can be used as the display unit 13.

入力インタフェース１４は、情報処理装置１０のユーザにより操作されて、操作を入力可能である。入力インタフェース１４として、例えばキーボード又はマウスを用いることができる。 The input interface 14 is operated by a user of the information processing apparatus 10 and can input an operation. For example, a keyboard or a mouse can be used as the input interface 14.

通信部１５は、ネットワークＮを介して、サーバ２０との間で情報の送受信を行う。通信部１５は、信号の送受信を行う通信回路及び通信線を有する。 The communication unit 15 transmits and receives information to and from the server 20 via the network N. The communication unit 15 includes a communication circuit and a communication line that transmit and receive signals.

図３（Ａ）に示すように、上述した処理部１１は、レイアウトデータ生成部１１ａ、コンテナデータ生成部１１ｂ、ブロックデータ生成部１１ｃ、画像データ生成部１１ｄ及び文字画像サイズ生成部１１ｅを有する。 As shown in FIG. 3A, the processing unit 11 described above includes a layout data generation unit 11a, a container data generation unit 11b, a block data generation unit 11c, an image data generation unit 11d, and a character image size generation unit 11e.

処理部１１が有するこれらの各部は、例えば、処理部１１上で動作するコンピュータプログラムにより実現される機能モジュールである。なお、処理部１１が有するこれらの各部は、それぞれ、別個の回路として、情報処理装置１０に実装されてもよい。各部の動作については、後述する。 Each of these units included in the processing unit 11 is, for example, a functional module realized by a computer program that operates on the processing unit 11. Each of these units included in the processing unit 11 may be mounted on the information processing apparatus 10 as a separate circuit. The operation of each part will be described later.

図４は、サーバを示す図である。 FIG. 4 is a diagram illustrating a server.

サーバ２０は、処理部２１と、メモリ２２と、表示部２３と、入力インタフェース２４と、通信部２５を有する。 The server 20 includes a processing unit 21, a memory 22, a display unit 23, an input interface 24, and a communication unit 25.

処理部２１は、一つまたは複数の中央演算回路と、レジスタと、キャッシュメモリと、インタフェース等の周辺回路とを有する。処理部２１は、メモリ２２に予め記憶されている所定のコンピュータプログラムに従い、サーバ２０の各ハードウェア構成要素の制御及び各種処理を行い、処理中に生じるデータを一時的に保存するためにメモリ２２を利用する。 The processing unit 21 includes one or a plurality of central processing circuits, a register, a cache memory, and peripheral circuits such as an interface. The processing unit 21 controls each hardware component of the server 20 and performs various processes according to a predetermined computer program stored in the memory 22 in advance, and temporarily stores data generated during the process. Is used.

メモリ２２は、ランダムアクセスメモリ（ＲＡＭ）若しくはリードオンリーメモリ（ＲＯＭ）等の半導体メモリ、又は磁気ディスク若しくはフラッシュメモリ等の不揮発性メモリを有していてもよい。また、メモリ２２は、非一時的な記憶媒体２２ａに記憶されたコンピュータプログラムを、読み出し可能なドライブ（図示せず）を有していてもよい。 The memory 22 may include a semiconductor memory such as a random access memory (RAM) or a read only memory (ROM), or a nonvolatile memory such as a magnetic disk or a flash memory. The memory 22 may have a drive (not shown) that can read out the computer program stored in the non-transitory storage medium 22a.

また、メモリ２２には、情報処理装置１０から送信された文章データが記憶される。 The memory 22 stores text data transmitted from the information processing apparatus 10.

表示部２３は、処理部２１に制御されて、サーバ２０の動作に伴う各種の情報を画面上に表示可能である。表示部２３として、例えば、液晶ディスプレイを用いることができる。 The display unit 23 is controlled by the processing unit 21 and can display various information associated with the operation of the server 20 on the screen. As the display unit 23, for example, a liquid crystal display can be used.

入力インタフェース２４は、サーバ２０のユーザにより操作されて、操作を入力可能である。入力インタフェース２４として、例えばキーボード又はマウスを用いることができる。 The input interface 24 is operated by a user of the server 20 and can input an operation. For example, a keyboard or a mouse can be used as the input interface 24.

通信部２５は、ネットワークＮを介して、情報処理装置１０又は端末３０との間で情報の送受信を行う。通信部２５は、信号の送受信を行う通信回路及び通信線を有する。 The communication unit 25 transmits / receives information to / from the information processing apparatus 10 or the terminal 30 via the network N. The communication unit 25 includes a communication circuit and a communication line that transmit and receive signals.

図５は、端末を示す図である。 FIG. 5 is a diagram illustrating a terminal.

端末３０は、処理部３１と、メモリ３２と、表示部３３と、入力インタフェース３４と、通信部３５を有する。 The terminal 30 includes a processing unit 31, a memory 32, a display unit 33, an input interface 34, and a communication unit 35.

処理部３１は、一つまたは複数の中央演算回路と、レジスタと、キャッシュメモリと、インタフェース等の周辺回路とを有する。処理部３１は、メモリ３２に予め記憶されている所定のコンピュータプログラムに従い、端末３０の各ハードウェア構成要素の制御及び各種処理を行い、処理中に生じるデータを一時的に保存するためにメモリ３２を利用する。 The processing unit 31 includes one or a plurality of central processing circuits, a register, a cache memory, and peripheral circuits such as an interface. The processing unit 31 controls each hardware component of the terminal 30 and performs various processes according to a predetermined computer program stored in advance in the memory 32, and temporarily stores data generated during the processing. Is used.

処理部３１は、閲覧実行部３１ａを有する。閲覧実行部３１ａは、例えば、処理部３１上で動作するコンピュータプログラムにより実現される機能モジュールである。なお、閲覧実行部３１ａは、処理部３１とは別個の回路として、端末３０に実装されてもよい。閲覧実行部３１ａの動作については、後述する。 The processing unit 31 includes a browsing execution unit 31a. The browsing execution unit 31a is a functional module realized by a computer program that operates on the processing unit 31, for example. Note that the browsing execution unit 31 a may be mounted on the terminal 30 as a separate circuit from the processing unit 31. The operation of the browsing execution unit 31a will be described later.

メモリ３２は、ランダムアクセスメモリ（ＲＡＭ）若しくはリードオンリーメモリ（ＲＯＭ）等の半導体メモリ、又は磁気ディスク若しくはフラッシュメモリ等の不揮発性メモリを有していてもよい。また、メモリ３２は、非一時的な記憶媒体３２ａに記憶されたコンピュータプログラムを、読み出し可能なドライブ（図示せず）を有していてもよい。 The memory 32 may include a semiconductor memory such as a random access memory (RAM) or a read only memory (ROM), or a nonvolatile memory such as a magnetic disk or a flash memory. The memory 32 may have a drive (not shown) that can read out the computer program stored in the non-transitory storage medium 32a.

表示部３３は、処理部３１に制御されて、端末３０の動作に伴う各種の情報を画面上に表示可能である。表示部３３として、例えば、液晶ディスプレイを用いることができる。 The display unit 33 is controlled by the processing unit 31 and can display various information associated with the operation of the terminal 30 on the screen. For example, a liquid crystal display can be used as the display unit 33.

入力インタフェース３４は、端末３０のユーザにより操作されて、操作を入力可能である。入力インタフェース３４として、例えばキーボード又はマウスを用いることができる。また、入力インタフェース３４として、入力インタフェース３４と表示部３３とが一体となったタッチパネルを用いてもよい。 The input interface 34 is operated by a user of the terminal 30 and can input an operation. As the input interface 34, for example, a keyboard or a mouse can be used. As the input interface 34, a touch panel in which the input interface 34 and the display unit 33 are integrated may be used.

通信部３５は、無線通信を用いて、ネットワークＮを介して、サーバ２０との間で情報の送受信を行う。通信部３５は、例えば、３ＧＰＰ（ＴｈｉｒｄＧｅｎｅｒａｔｉｏｎＰａｒｔｎｅｒｓｈｉｐＰｒｏｊｅｃｔ）又はＬＴＥ（ＬｏｎｇＴｅｒｍＥｖｏｌｕｔｉｏｎ）等の所定の通信規格に準拠して、基地局を介して、ネットワークＮと接続する。なお、通信部３５は、有線通信を用いて、ネットワークＮを介して、サーバ２０との間で情報の送受信を行ってもよい。通信部３５は、送受信を行う通信回路及び通信線又はアンテナを有し得る。 The communication unit 35 transmits and receives information to and from the server 20 via the network N using wireless communication. The communication unit 35 is connected to the network N via a base station according to a predetermined communication standard such as 3GPP (Third Generation Partnership Project) or LTE (Long Term Evolution). Note that the communication unit 35 may transmit / receive information to / from the server 20 via the network N using wired communication. The communication unit 35 may include a communication circuit that performs transmission and reception and a communication line or antenna.

図６は、複数の文字コードを有する文章データの例を示す図である。 FIG. 6 is a diagram illustrating an example of sentence data having a plurality of character codes.

文章データ６００は、複数の文字コード及び文字コードの配置情報を有する文字コード配置データ６０１と、文字コード配置データ６０１に含まれる文字コードが文字として表示される時のフォント、文字サイズ、色等の修飾情報を有する修飾データ６０２を有する。文字コード配置データ６０１に含まれる各文字コードは、修飾データ６０２が有するフォント、文字サイズ、文字色等の修飾情報と関連づけられている。なお、図６では、説明を分かり易くするために、文字コードが、文字コードにより識別される文字で示されている。即ち、図６では、文字コードの配置が、文字の配置として示されている。また、文字コード配置データ６０１は、写真又は絵等の画像データ及びこれらの配置情報を有し得る。 The sentence data 600 includes a character code arrangement data 601 having a plurality of character codes and character code arrangement information, and a font, a character size, a color, and the like when the character codes included in the character code arrangement data 601 are displayed as characters. It has modification data 602 having modification information. Each character code included in the character code arrangement data 601 is associated with modification information such as font, character size, and character color included in the modification data 602. In FIG. 6, for easy understanding of the description, the character code is indicated by a character identified by the character code. That is, in FIG. 6, the arrangement of character codes is shown as the arrangement of characters. The character code arrangement data 601 may include image data such as a photograph or a picture and arrangement information thereof.

修飾データ６０２は、修飾情報として、更に、文字が表示されるコンテナサイズ（最小幅、最大幅）、上下左右のマージン、上下左右のパディング、均等揃え又は中央揃え等の行単位の文字配置の揃え、上付き又は下付き等の文字単位の文字配置の揃え、圏点、罫線、下線、背景色等の情報を有し得る。 The decoration data 602 further includes, as decoration information, alignment of character arrangement in line units such as container size (minimum width, maximum width) in which characters are displayed, vertical and horizontal margins, vertical and horizontal padding, uniform alignment, and center alignment. , Information such as alignment of character units such as superscripts or subscripts, mark points, ruled lines, underlines, and background colors.

文字コード配置データ６０１は、例えば、ＨＴＭＬを用いて記述されるＨＴＭＬ情報として与えられ得る。また、修飾データ６０２は、ＣＳＳを用いて記述されるＣＳＳ情報として与えられ得る。なお、文章データ６００において、文字コード配置データ６０１と、修飾データ６０２とは、別個に配置されていてもよいし、又は混在するように配置されていてもよい。 The character code arrangement data 601 can be given as HTML information described using HTML, for example. Further, the modification data 602 can be given as CSS information described using CSS. In the text data 600, the character code arrangement data 601 and the modification data 602 may be arranged separately or arranged so as to be mixed.

図６に示す文字コード配置データ６０１は、第１章及び第２章の２つの章を含む。第１章は、第１段落及び第２段落の２つの段落を含む。なお、図６に示す文字コード配置データ６０１は一例であり、文字コード配置データはこれに制限されるものではない。 The character code arrangement data 601 shown in FIG. 6 includes two chapters, a first chapter and a second chapter. The first chapter includes two paragraphs, a first paragraph and a second paragraph. The character code arrangement data 601 shown in FIG. 6 is an example, and the character code arrangement data is not limited to this.

図７は、図６に示す文章データに基づいて、情報処理装置１０が生成したデータ構造を示す図である。 FIG. 7 is a diagram showing a data structure generated by the information processing apparatus 10 based on the text data shown in FIG.

データ構造７００は、レイアウトデータ７１０と、オブジェクトデータ７２０と、スタイルデータ７３０を有する。 The data structure 700 includes layout data 710, object data 720, and style data 730.

レイアウトデータ７１０は、主に、文章データに含まれる章、段落、行、文字等の配置情報を有する。レイアウトデータ７１０は、コンテナデータ１及びコンテナデータ２、コンテナサイズ情報テーブル７１１、ブロックサイズ情報テーブル７１２及びオブジェクトサイズ情報テーブル７１３を有する。 The layout data 710 mainly includes arrangement information such as chapters, paragraphs, lines, and characters included in the text data. The layout data 710 includes container data 1 and container data 2, a container size information table 711, a block size information table 712, and an object size information table 713.

レイアウトデータ７１０は、文章データに含まれる１又は複数の章の配置情報を、１又は複数のコンテナデータの配置情報として有する。１つのコンテナデータは、文章データに含まれる章に対応して配置される。図６に示す文章データ６００は、２つの章を有するので、レイアウトデータ７１０は、第１章に対応するコンテナデータ１及び第２章に対応するコンテナデータ２を有する。文章データ６００に含まれる章の組方向（縦書き、横書き）に対応して、レイアウトデータ７１０内のコンテナデータ１及びコンテナデータ２の順番及び配置関係が決定される。即ち、レイアウトデータ７１０は、文章データ６００に含まれる１又は複数の章の配置情報を有する。 The layout data 710 includes arrangement information of one or more chapters included in the text data as arrangement information of one or more container data. One container data is arranged corresponding to the chapter included in the text data. Since the text data 600 shown in FIG. 6 has two chapters, the layout data 710 has container data 1 corresponding to the first chapter and container data 2 corresponding to the second chapter. Corresponding to the chapter combination direction (vertical writing, horizontal writing) included in the text data 600, the order and arrangement relationship of the container data 1 and the container data 2 in the layout data 710 are determined. That is, the layout data 710 includes arrangement information for one or more chapters included in the text data 600.

コンテナデータは、複数の文字コードを有する文章データに含まれる１又は複数の段落の配置情報を有する。文章データに含まれる一の段落に対応して、１つのブロックデータが配置される。従って、コンテナデータは、文章データに含まれる１又は複数の段落の配置情報を、１又は複数のブロックデータの配置情報として有する。 The container data includes arrangement information of one or more paragraphs included in sentence data having a plurality of character codes. One block data is arranged corresponding to one paragraph included in the text data. Accordingly, the container data has the arrangement information of one or more paragraphs included in the text data as the arrangement information of one or more block data.

図６に示す文章データ６００では、第１章は、２つの段落を有する。図７において、第１章に対応するコンテナデータ１は、図６に示す文章データ６００に基づいて、第１段落に対応するブロックデータ１及び第２段落に対応するブロックデータ２を有する。ブロックデータは、段落に含まれる文字コードの配置情報を有する。 In the sentence data 600 shown in FIG. 6, the first chapter has two paragraphs. In FIG. 7, the container data 1 corresponding to the first chapter has block data 1 corresponding to the first paragraph and block data 2 corresponding to the second paragraph based on the text data 600 shown in FIG. The block data includes arrangement information of character codes included in the paragraph.

コンテナデータの直下には、ブロックデータのみを配置して、他のコンテナデータは配置しないようにすることが、不要な入れ子構造を許容しない観点から好ましい。 It is preferable from the viewpoint of not allowing unnecessary nesting structure to arrange only block data and not arrange other container data immediately below the container data.

ブロックデータは、文章データに含まれる１又は複数の段落のそれぞれについて、当該段落に含まれる１又は複数の文字コードの配置情報を有する。文章データに含まれる一の段落に対応して、１つのブロックデータが生成される。 The block data includes, for each of one or more paragraphs included in the text data, arrangement information of one or more character codes included in the paragraph. One block data is generated corresponding to one paragraph included in the text data.

ブロックデータの直下には、文字コードの配置情報のみを配置して、他のブロックデータは配置しないようにすることが、不要な入れ子構造を許容しない観点から好ましい。 It is preferable from the viewpoint of not allowing an unnecessary nested structure to arrange only the character code arrangement information immediately below the block data and not to arrange other block data.

図８は、コンテナデータの例を示す図である。 FIG. 8 is a diagram illustrating an example of container data.

情報処理装置１０は、図６に示す文章データ６００の第１章に対応させて、コンテナデータ１を生成する。また、情報処理装置１０は、図６に示す文章データ６００の第１章の第１段落に対応させて、ブロックデータ１を生成する。ブロックデータ１は、本来は、文字コードの配置情報を有するが、図８では、説明を分かり易くするために、文字コードが、文字コードにより識別される文字で示されている。即ち、図８では、文字コードの配置が、文字の配置として示されている。 The information processing apparatus 10 generates container data 1 in association with the first chapter of the text data 600 shown in FIG. Further, the information processing apparatus 10 generates block data 1 corresponding to the first paragraph of the first chapter of the text data 600 shown in FIG. The block data 1 originally has character code arrangement information, but in FIG. 8, the character code is indicated by a character identified by the character code in order to make the explanation easy to understand. That is, in FIG. 8, the arrangement of character codes is shown as the arrangement of characters.

図８に示すブロックデータ１は、コンテナデータ３、コンテナデータ４及びコンテナデータ５を有する。コンテナデータ１内に配置されるコンテナデータ３、コンテナデータ４及びコンテナデータ５は、コンテナデータ１及びコンテナデータ２とは異なり、文章データに含まれる１又は複数の段落の配置情報を有するものではない。 The block data 1 shown in FIG. 8 includes container data 3, container data 4, and container data 5. Unlike the container data 1 and the container data 2, the container data 3, the container data 4, and the container data 5 that are arranged in the container data 1 do not have the arrangement information of one or more paragraphs included in the text data. .

コンテナデータ３は、親文字の文字コードの配置情報と、ルビ文字の文字コードの配置情報とを含むコンテナとして生成されたものである。コンテナデータ４及びコンテナデータ５は、行末禁止文字である「。」の文字コード、及び「。」の直前の文字の文字コードの配置情報を含むコンテナとして生成されたものである。コンテナデータ１が、端末３０で表示される時に、コンテナデータ３内の文字コードを、同じ行に表示することにより、親文字とルビ文字とが、異なる行に表示されないようにすることができる。同様に、コンテナデータ１が、端末３０で表示される時に、コンテナデータ４及びコンテナデータ５内の文字コードを、同じ行に表示することにより、行末禁止文字と直前の文字とが、異なる行に表示されないようにすることができる。 The container data 3 is generated as a container including the arrangement information of the character code of the parent character and the arrangement information of the character code of the ruby character. The container data 4 and the container data 5 are generated as containers including the arrangement information of the character code of “.”, Which is a line ending prohibited character, and the character code of the character immediately before “.”. By displaying the character code in the container data 3 on the same line when the container data 1 is displayed on the terminal 30, it is possible to prevent the parent character and the ruby character from being displayed on different lines. Similarly, when the container data 1 is displayed on the terminal 30, the character codes in the container data 4 and the container data 5 are displayed on the same line, so that the line end prohibited character and the immediately preceding character are on different lines. It can be prevented from being displayed.

上述したように、他のコンテナデータ内に配置されるコンテナデータは、複数の文字が、異なる行に表示されないようにするために用いられる。コンテナデータ内にコンテナデータを生成することについては更に後述する。 As described above, container data arranged in other container data is used to prevent a plurality of characters from being displayed on different lines. The generation of container data in the container data will be further described later.

コンテナサイズ情報テーブル７１１には、コンテナデータのそれぞれに対して、コンテナサイズ（最小幅、最大幅、最小高さ、最大高さ）等に関する情報が、コンテナデータを識別する識別情報（例えば、コンテナデータ１、コンテナデータ２）と関連づけられて登録される。コンテナサイズは、端末３０において、文章データが表示される表示サイズに関する情報である。情報処理装置１０は、主に、文章データ６００の修飾データ６０２に基づいて、コンテナサイズ情報テーブル７１１を生成する。 In the container size information table 711, for each of the container data, information on the container size (minimum width, maximum width, minimum height, maximum height) and the like are identification information for identifying the container data (for example, container data 1. Registered in association with container data 2). The container size is information relating to a display size at which text data is displayed on the terminal 30. The information processing apparatus 10 generates the container size information table 711 mainly based on the modification data 602 of the text data 600.

ブロックサイズ情報テーブル７１２には、ブロックデータのそれぞれに対して、上下左右のマージン、上下左右のパディング、均等揃え又は中央揃え等の行単位の文字配置の揃え等に関する情報が、ブロックデータを識別する識別情報（例えば、ブロックデータ１、ブロックデータ２）と関連づけられて登録される。情報処理装置１０は、主に、文章データ６００の修飾データ６０２に基づいて、ブロックサイズ情報テーブル７１２を生成する。 In the block size information table 712, for each block data, information regarding vertical and horizontal margins, vertical and horizontal padding, alignment of character units in line units such as uniform alignment or center alignment, and the like identifies block data. Registered in association with identification information (for example, block data 1 and block data 2). The information processing apparatus 10 generates the block size information table 712 mainly based on the modification data 602 of the text data 600.

オブジェクトサイズ情報テーブル７１３には、文字コードのそれぞれに対して、文字サイズ、文字の上付き又は下付き等の文字単位の配置等に関する情報が、文字コードを識別する識別情報（例えば、文字コード１、文字コード２）と関連づけられて登録される。情報処理装置１０は、文章データ６００の文字コード配置データ６０１又は修飾データ６０２に基づいて、オブジェクトサイズ情報テーブル７１３を生成する。 In the object size information table 713, for each character code, information regarding the character size, the character unit arrangement such as the superscript or subscript of the character, and the like is identification information (for example, character code 1 Are registered in association with the character code 2). The information processing apparatus 10 generates the object size information table 713 based on the character code arrangement data 601 or the modification data 602 of the text data 600.

オブジェクトデータ７２０は、文字画像テーブル７２１及び画像デーブル７２２を有する。 The object data 720 includes a character image table 721 and an image table 722.

文字画像テーブル７２１には、文章データに含まれる文字コードが画像情報に変換された文字画像データが、文字コードを識別する識別情報（例えば、文字コード１、文字コード２）と関連づけられて登録される。文字画像データは、ベクタ情報又はラスタ情報であり得るが、ベクタ情報であることが、端末３０が、文字を拡大又は縮小して表示するときに、滑らかな文字を表示できる観点から好ましい。情報処理装置１０は、文章データ６００の文字コード配置データ６０１及び修飾データ６０２に基づいて、メモリ１２に記憶されているフォントパッケージ１２ｂを参照しながら、文字画像テーブル７２１を生成する。 In the character image table 721, character image data in which a character code included in text data is converted into image information is registered in association with identification information for identifying the character code (for example, character code 1, character code 2). The The character image data may be vector information or raster information. However, the vector information is preferable from the viewpoint that the terminal 30 can display a smooth character when the character is enlarged or reduced. The information processing apparatus 10 generates the character image table 721 while referring to the font package 12b stored in the memory 12 based on the character code arrangement data 601 and the modification data 602 of the text data 600.

画像テーブル７２２には、文章データに含まれる写真又は絵等の画像データが、画像を識別する識別情報（例えば、画像１、画像２）と関連づけられて登録される。情報処理装置１０は、文章データ６００の文字コード配置データ６０１に基づいて、画像テーブル７２２を生成する。なお、文章データは、写真又は絵等の画像データを含まない場合もある。 In the image table 722, image data such as a photograph or a picture included in the text data is registered in association with identification information (for example, image 1 and image 2) for identifying the image. The information processing apparatus 10 generates the image table 722 based on the character code arrangement data 601 of the text data 600. Note that the text data may not include image data such as photographs or pictures.

スタイルデータ７３０は、罫線テーブル７３１及びスタイル情報テーブル７３２を有する。 The style data 730 includes a ruled line table 731 and a style information table 732.

罫線テーブル７３１には、ブロックデータのそれぞれに対して、文字の囲み、下線、背景色等に関する情報が、文字コードを識別する識別情報（例えば、文字コード１、文字コード２）と関連づけられて登録される。情報処理装置１０は、主に、文章データ６００の修飾データ６０２に基づいて、罫線テーブル７３１を生成する。 In the ruled line table 731, for each block data, information relating to character enclosure, underline, background color, etc. is registered in association with identification information (for example, character code 1, character code 2) for identifying the character code. Is done. The information processing apparatus 10 generates a ruled line table 731 mainly based on the modification data 602 of the text data 600.

スタイル情報テーブル７３２には、文字コードのそれぞれに対して、文字色、圏点の有無、文字の装飾等に関する情報が、文字コードを識別する識別情報（例えば、文字コード１、文字コード２）と関連づけられて登録される。情報処理装置１０は、主に、文章データ６００の修飾データ６０２に基づいて、スタイル情報テーブル７３２を生成する。 In the style information table 732, for each character code, information on the character color, the presence / absence of a mark, character decoration, and the like includes identification information for identifying the character code (for example, character code 1, character code 2). Registered in association. The information processing apparatus 10 generates the style information table 732 mainly based on the modification data 602 of the text data 600.

次に、上述した情報処理装置１０の動作を、図９及び図１７に示すフローチャートを参照しながら、以下に説明する。図９に示すフローチャートの処理は、１次オーサリング処理ともいう。図９に示すフローチャートの処理は、メモリ１２に記憶されているフォントパッケージ１２ｂの情報を使用せずに行われ得る。図１７に示すフローチャートの処理は、２次オーサリング処理ともいう。図１７に示すフローチャートの処理は、メモリ１２に記憶されているフォントパッケージ１２ｂの情報を使用して行われる。 Next, the operation of the information processing apparatus 10 described above will be described below with reference to the flowcharts shown in FIGS. The process of the flowchart shown in FIG. 9 is also referred to as a primary authoring process. The process of the flowchart shown in FIG. 9 can be performed without using the information of the font package 12b stored in the memory 12. The process of the flowchart shown in FIG. 17 is also called a secondary authoring process. The processing of the flowchart shown in FIG. 17 is performed using information of the font package 12b stored in the memory 12.

図９を参照しながら、情報処理装置１０の動作を、以下に説明する。 The operation of the information processing apparatus 10 will be described below with reference to FIG.

まず、ステップＳ９０１において、情報処理装置１０の処理部１１は、図１０に示す文章データ１０００を入力する。情報処理装置１０は、例えば、通信部１５を用いて、文章データを、ネットワークＮを介して受信する。また、情報処理装置１０は、入力インタフェース１４を用いて、文章データを入力してもよい。更に、情報処理装置１０は、記憶媒体１２ｃに記憶された文章データを読み取ることにより、文章データを入力してもよい。 First, in step S901, the processing unit 11 of the information processing apparatus 10 inputs the text data 1000 illustrated in FIG. The information processing apparatus 10 receives text data via the network N using, for example, the communication unit 15. Further, the information processing apparatus 10 may input text data using the input interface 14. Furthermore, the information processing apparatus 10 may input sentence data by reading the sentence data stored in the storage medium 12c.

図１０に示す文章データ１０００の文字コード配置データ１００１は、１つの章（第１章）を有し、この章は、１つの段落（第１段落）を有する。文章データ１０００の修飾データ１００２は、文字コード配置データ１００１に含まれる文字コードが文字として表示される時のフォント、文字サイズ、色等の修飾情報を有する。文字コード配置データ１００１は、ＨＴＭＬ情報として与えられており、修飾データ１００２は、ＣＳＳ情報として与えられている。 The character code arrangement data 1001 of the text data 1000 shown in FIG. 10 has one chapter (first chapter), and this chapter has one paragraph (first paragraph). The modification data 1002 of the sentence data 1000 includes modification information such as a font, a character size, and a color when the character code included in the character code arrangement data 1001 is displayed as a character. The character code arrangement data 1001 is given as HTML information, and the modification data 1002 is given as CSS information.

次に、ステップＳ９０３において、情報処理装置１０の処理部１１のレイアウトデータ生成部１１ａは、文章データ１０００に含まれる章の配置情報に基づいて、図１１に示すように、レイアウトデータ１１１０を生成する。レイアウトデータ１１１０は、文章データ１０００の文字コード配置データ１００１に含まれる第１章の配置情報を有する。なお、レイアウトデータ生成部１１ａは、文章データが複数の章を有する場合には、複数の章の配置情報を有するレイアウトデータを生成する。複数の章の配置情報としては、例えば、各章が配置される順番及び組方向が挙げられる。 Next, in step S903, the layout data generation unit 11a of the processing unit 11 of the information processing apparatus 10 generates layout data 1110 as shown in FIG. 11 based on the chapter arrangement information included in the text data 1000. . The layout data 1110 has chapter 1 placement information included in the character code placement data 1001 of the text data 1000. The layout data generation unit 11a generates layout data having arrangement information of a plurality of chapters when the text data has a plurality of chapters. The arrangement information of a plurality of chapters includes, for example, the order in which each chapter is arranged and the group direction.

レイアウトデータ生成部１１ａは、文章コード配置データ１００１に含まれる＜ｂｏｄｙ＞タグで囲まれた文字コードの配置情報を１つの章として判断して、１つの章の配置情報を有するレイアウトデータ１１１０を生成する。なお、図１１において、鎖線で表示されるデータ又はテーブルは、この段階では、未だ生成されていない。 The layout data generation unit 11a determines the arrangement information of the character code enclosed in the <body> tag included in the text code arrangement data 1001 as one chapter, and generates the layout data 1110 having the arrangement information of one chapter. To do. In FIG. 11, the data or table displayed by the chain line is not yet generated at this stage.

次に、文章データに含まれる全ての章に対して、ステップＳ９０５〜ステップＳ９１９の処理が行われる。本実施形態では、文章データ１０００は、１つの章のみを有するので、ステップＳ９０５〜ステップＳ９１９の処理が１回だけ行われる。 Next, the processing from step S905 to step S919 is performed for all chapters included in the text data. In the present embodiment, since the sentence data 1000 has only one chapter, the processing from step S905 to step S919 is performed only once.

まず、ステップＳ９０７において、処理部１１のコンテナデータ生成部１１ｂは、文章データの各章に含まれる１又は複数の段落の配置情報に基づいて、各章について、当該章に含まれる１又は複数の段落の配置情報を有するコンテナデータを生成する。複数の段落の配置情報としては、例えば、各段落が配置される順番及び組方向が挙げられる。 First, in step S907, the container data generation unit 11b of the processing unit 11 determines, based on the arrangement information of one or more paragraphs included in each chapter of the sentence data, one or more included in the chapter. Container data having paragraph arrangement information is generated. As the arrangement information of a plurality of paragraphs, for example, the order in which each paragraph is arranged and the direction of the combination are listed.

本実施形態では、文章コード配置データ１００１の第１章は１つの段落（第１段落）を含むので、コンテナデータ生成部１１ｂは、第１章に対して、第１章に含まれる第１段落の配置情報を有するコンテナデータ１を、レイアウトデータ１１１０内に生成する。以下、コンテナデータを、単にコンテナともいう。 In the present embodiment, since the first chapter of the sentence code arrangement data 1001 includes one paragraph (first paragraph), the container data generation unit 11b performs the first paragraph included in the first chapter with respect to the first chapter. Is generated in the layout data 1110. Hereinafter, the container data is also simply referred to as a container.

具体的には、コンテナデータ生成部１１ｂは、文章コード配置データ１００１に含まれる第１章に対して、＜ｄｉｖ＞タグ又は＜ｐ＞タグで囲まれた文字コードの配置情報を第１段落と判断して、第１段落の配置情報を有するコンテナデータ１を生成する。なお、文章コード配置データ１００１に含まれる第１章が複数の段落を有している場合には、コンテナデータ生成部１１ｂは、複数の段落の配置情報を有するコンテナデータを生成する。 Specifically, the container data generation unit 11b sets the character code arrangement information enclosed by the <div> tag or the <p> tag as the first paragraph for the first chapter included in the sentence code arrangement data 1001. Determination is made to generate container data 1 having the arrangement information of the first paragraph. When the first chapter included in the sentence code arrangement data 1001 has a plurality of paragraphs, the container data generation unit 11b generates container data having arrangement information of the plurality of paragraphs.

ここで、コンテナデータ生成部１１ｂは、＜ｄｉｖ＞タグ又は＜ｐ＞タグの直下に他の＜ｄｉｖ＞タグ又は＜ｐ＞タグが配置される場合には、１つの段落と見なして、段落の配置情報を決定する。これにより、データ構造に不要な入れ子構造が生成されないようにできる。 Here, when another <div> tag or <p> tag is arranged immediately below the <div> tag or <p> tag, the container data generation unit 11b regards the paragraph as one paragraph. Determine placement information. Thereby, an unnecessary nested structure can be prevented from being generated in the data structure.

そして、コンテナデータ生成部１１ｂは、修飾データ１００２に基づいて、文章コード配置データ１００１の＜ｂｏｄｙ＞タグで囲まれた文字コードが表示される際のコンテナサイズ（最小幅、最大幅、最小高さ、最大高さ）等に関する情報を取得し、コンテナデータを識別する識別情報（例えば、コンテナデータ１）と関連づけて、コンテナサイズ情報テーブル１１１１に登録して、レイアウトデータ１１１０内にコンテナサイズ情報テーブル１１１１を配置する。 The container data generation unit 11b then displays the container size (minimum width, maximum width, minimum height) when the character code enclosed by the <body> tag of the text code arrangement data 1001 is displayed based on the modification data 1002. , The maximum height), etc., is registered in the container size information table 1111 in association with identification information (for example, container data 1) for identifying the container data, and the container size information table 1111 is included in the layout data 1110. Place.

次に、ステップＳ９０９において、処理部１１のブロックデータ生成部１１ｃは、文章データの各章に含まれる１又は複数の段落のそれぞれについて、当該段落に含まれる１又は複数の文字コードの配置情報に基づいて、１又は複数の文字コードの配置情報を有するブロックデータを生成する。複数の文字コードの配置情報として、例えば、各文字コードが配置される順番及び組方向が挙げられる。 Next, in step S909, the block data generation unit 11c of the processing unit 11 uses, for each of one or more paragraphs included in each chapter of the sentence data, the arrangement information of one or more character codes included in the paragraph. Based on this, block data having arrangement information of one or more character codes is generated. As the arrangement information of a plurality of character codes, for example, the order in which each character code is arranged and the composition direction can be mentioned.

本実施形態では、ブロックデータ生成部１１ｃは、文章コード配置データ１００１の第１章に含まれる第１段落について、第１段落に含まれる複数の文字コードの配置情報に基づいて、１又は複数の文字コードの配置情報を有するブロックデータ１を、コンテナデータ１内に生成する。以下、ブロックデータを、単にブロックともいう。 In the present embodiment, the block data generation unit 11c uses the one or more pieces of the first paragraph included in the first chapter of the sentence code arrangement data 1001 based on the arrangement information of the plurality of character codes included in the first paragraph. Block data 1 having character code arrangement information is generated in the container data 1. Hereinafter, the block data is also simply referred to as a block.

ブロックデータ生成部１１ｃは、文章コード配置データ１００１の＜ｐ＞タグで囲まれた文字コードの順番に基づいて、ブロック１に文字コードが配置される順番を決定する。 The block data generation unit 11c determines the order in which the character codes are arranged in the block 1 based on the order of the character codes enclosed in the <p> tag of the sentence code arrangement data 1001.

また、ブロックデータ生成部１１ｃは、修飾データ１００２を参照して、文章コード配置データ１００１の＜ｐ＞タグで囲まれた文字コードの組方向（縦書き、横書き）を取得する。なお、修飾データ１００２には、＜ｂｏｄｙ＞タグで囲まれた文字コードの組方向が規定されている場合もある。この場合、＜ｂｏｄｙ＞タグ内に配置される＜ｐ＞タグで囲まれた文字コードの組方向は、＜ｂｏｄｙ＞タグで囲まれた文字コードの組方向により規定される。 In addition, the block data generation unit 11c refers to the modification data 1002 and acquires the set direction (vertical writing, horizontal writing) of the character code enclosed by the <p> tag of the sentence code arrangement data 1001. Note that the modification data 1002 may define the set direction of the character code enclosed by the <body> tag. In this case, the set direction of the character code surrounded by the <p> tag arranged in the <body> tag is defined by the set direction of the character code surrounded by the <body> tag.

そして、ブロックデータ生成部１１ｃは、取得された文字コードの順番及び組方向（縦書き、横書き）に基づいて、ブロック１内に文字コードが配置される配置情報を決定する。本実施形態では、文字コードは、縦書きでブロック１内に配置される。 Then, the block data generation unit 11c determines arrangement information in which the character code is arranged in the block 1 based on the order of the acquired character code and the composition direction (vertical writing, horizontal writing). In the present embodiment, the character code is arranged in the block 1 in vertical writing.

図１２（Ａ）は、コンテナ１内に生成されるブロック１を示す図である。図１２（Ｂ）は、説明を分かり易くするために、ブロック１内の文字コードが、文字コードで識別される文字で示されたブロック１を示す図である。 FIG. 12A is a diagram showing the block 1 generated in the container 1. FIG. 12B is a diagram showing the block 1 in which the character code in the block 1 is indicated by a character identified by the character code for easy understanding of the description.

ブロックデータ生成部１１ｃは、文章コード配置データ１００１の第１章に含まれる第１段落に含まれる親文字の文字コード及びルビ文字の文字コード１〜６に対して、コンテナ２を生成する。具体的には、ブロックデータ生成部１１ｃは、文章コード配置データ１００１の＜ｒｕｂｙ＞タグで囲まれた文字コード１〜６が配置されるコンテナ２を生成する。 The block data generation unit 11c generates the container 2 for the character code of the parent character and the character codes 1 to 6 of the ruby character included in the first paragraph included in the first chapter of the sentence code arrangement data 1001. Specifically, the block data generation unit 11c generates the container 2 in which the character codes 1 to 6 surrounded by the <ruby> tag of the sentence code arrangement data 1001 are arranged.

そして、ブロックデータ生成部１１ｃは、コンテナ２を文字コード７の直前に配置し、文字コード７に続いて、残りの文字コード８〜２０を配置して、このような順番で文字コードが縦書きに配置される配置情報を有するブロック１を生成する。 Then, the block data generation unit 11c arranges the container 2 immediately before the character code 7, arranges the remaining character codes 8 to 20 after the character code 7, and writes the character code vertically in this order. The block 1 having the placement information placed in is generated.

コンテナ２は、親文字の文字コード５，６及びルビ文字の文字コード１〜４の配置情報を有することになるが、この段階では、未だこれらの配置情報は有していない。コンテナ２が有する配置情報は、後述するステップＳ９１１において生成される。 The container 2 has the arrangement information of the character codes 5 and 6 of the parent characters and the character codes 1 to 4 of the ruby characters, but at this stage, it does not yet have such arrangement information. The arrangement information included in the container 2 is generated in step S911 described later.

そして、ブロックデータ生成部１１ｃは、修飾データ１００２に基づいて、文章コード配置データ１００１の＜ｐ＞タグで囲まれた文字コードが表示される際の上下左右のマージン、上下左右のパディング、均等揃え又は中央揃え等の行単位の文字配置の揃え等に関する情報を取得し、ブロックデータを識別する識別情報（例えば、ブロックデータ１）と関連づけて、ブロックサイズ情報テーブル１１１２に登録して、レイアウトデータ１１１０内にブロックサイズ情報テーブル１１１２を配置する。 Then, the block data generation unit 11c, based on the modification data 1002, the top / bottom / left / right margin, the top / bottom / left / right padding, and the uniform alignment when the character code enclosed by the <p> tag of the text code arrangement data 1001 is displayed. Alternatively, information relating to line-by-line character arrangement alignment, such as center alignment, is acquired, and is associated with identification information (for example, block data 1) for identifying block data, and is registered in the block size information table 1112 to obtain layout data 1110. The block size information table 1112 is arranged in the inside.

また、ブロックデータ生成部１１ｃは、修飾データ１００２に基づいて、文章コード配置データ１００１の＜ｐ＞タグで囲まれた文字コードが表示される際の文字サイズ、文字の上付き又は下付き等の文字単位の配置等に関する情報を取得し、文字コードを識別する識別情報（例えば、文字コード１、文字コード２）と関連づけて、オブジェクトサイズ情報テーブル１１１３に登録して、レイアウトデータ１１１０内にオブジェクトサイズ情報テーブル１１１３を配置する。このようにして、ブロックデータ生成部１１ｃは、文章データに含まれる複数の文字コードのそれぞれについて、当該文字コードと関連づけられた文字の修飾情報を有する文字修飾データを生成する。 In addition, the block data generation unit 11c, based on the modification data 1002, determines the character size when the character code enclosed in the <p> tag of the text code arrangement data 1001 is displayed, the superscript or subscript of the character, etc. Information on the arrangement of character units, etc. is acquired, registered in the object size information table 1113 in association with identification information (for example, character code 1, character code 2) for identifying the character code, and the object size is stored in the layout data 1110. An information table 1113 is arranged. In this way, the block data generation unit 11c generates character modification data having character modification information associated with the character code for each of the plurality of character codes included in the text data.

また、ブロックデータ生成部１１ｃは、修飾データ１００２に基づいて、文章コード配置データ１００１の＜ｐ＞タグで囲まれた文字コードが表示される際の文字の囲み、下線、背景色等に関する情報を取得し、ブロックデータを識別する識別情報（例えば、ブロックデータ１）と関連づけて、罫線テーブル１１３１に登録して、スタイルデータ１１３０内に罫線テーブル１１３１を配置する。 Further, the block data generation unit 11c obtains information on the character enclosure, underline, background color, and the like when the character code enclosed in the <p> tag of the sentence code arrangement data 1001 is displayed based on the modification data 1002. The ruled line table 1131 is arranged in the style data 1130 by acquiring and registering it in the ruled line table 1131 in association with identification information for identifying block data (for example, block data 1).

更に、ブロックデータ生成部１１ｃは、修飾データ１００２に基づいて、文章コード配置データ１００１の＜ｐ＞タグで囲まれた文字コードが表示される際の文字色、圏点の有無、文字の装飾等に関する情報を取得し、文字コードを識別する識別情報（例えば、文字コード１、文字コード２）と関連づけて、スタイル情報テーブル１１３２に登録して、スタイルデータ１１３０内にスタイル情報テーブル１１３２を配置する。 Further, the block data generation unit 11c, based on the modification data 1002, character color when the character code enclosed by the <p> tag of the text code arrangement data 1001 is displayed, presence / absence of a mark, character decoration, etc. The information about the character code is acquired, is associated with identification information for identifying the character code (for example, character code 1, character code 2), is registered in the style information table 1132, and the style information table 1132 is arranged in the style data 1130.

次に、ステップＳ９１１において、処理部１１のコンテナデータ生成部１１ｂは、親文字及びルビ文字処理を実行する。具体的には、コンテナデータ生成部１１ｂは、文章データに含まれる連続して配置される文字コードが、親文字とルビ文字との関係を有する場合、親文字の文字コードの配置情報を有する第１のブロックデータと、ルビ文字の文字コードの配置情報を有する第２のブロックデータとを生成し、第１のブロックデータ及び第２のブロックデータの位置関係を決定して、第１のブロックデータ及び第２のブロックデータの配置情報を有するコンテナデータを生成する。 Next, in step S911, the container data generation unit 11b of the processing unit 11 performs parent character and ruby character processing. Specifically, the container data generation unit 11b includes the character code arrangement information of the parent character when the consecutively arranged character codes included in the sentence data have a relationship between the parent character and the ruby character. 1 block data and second block data having arrangement information of the character code of ruby characters are generated, the positional relationship between the first block data and the second block data is determined, and the first block data And container data having the arrangement information of the second block data.

更に説明すると、コンテナデータ生成部１１ｂは、文章コード配置データ１００１の＜ｒｕｂｙ＞タグで囲まれた連続して配置される６つの文字コードが、親文字とルビ文字との関係を有すると判断する。コンテナデータ生成部１１ｂは、＜ｒｕｂｙ＞タグで囲まれた連続して配置される６つの文字コードの内、＜ｒｔ＞タグで囲まれた文字コードをルビ文字として判断し、そうではない文字コードを親文字として判断する。 More specifically, the container data generation unit 11b determines that the six character codes arranged in succession surrounded by <ruby> tags in the sentence code arrangement data 1001 have a relationship between the parent character and the ruby character. . The container data generation unit 11b determines the character code enclosed by the <rt> tag as a ruby character from among the six character codes arranged consecutively enclosed by the <ruby> tag, and the character code that is not so Is determined as the parent character.

そして、図１３（Ａ）に示すように、コンテナデータ生成部１１ｂは、ルビ文字の文字コード１〜４の配置情報を有するブロック２と、親文字の文字コード５，６の配置情報を有するブロック３を生成する。ここで、ルビ文字の文字コード１〜４は、”わ”、”が”、”は”、”い”に対応し、親文字の文字コード５，６は、”吾”、”輩”に対応する。図１３（Ｂ）は、説明を分かり易くするために、ブロック２及びブロック３内の文字コードが、文字コードで識別される文字で表されたブロック２及びブロック３を示す図である。 Then, as shown in FIG. 13A, the container data generation unit 11b includes a block 2 having arrangement information of ruby character code 1 to 4 and a block having arrangement information of character codes 5 and 6 of the parent character. 3 is generated. Here, the character codes 1 to 4 of the ruby characters correspond to “Wa”, “ga”, “ha”, “i”, and the character codes 5 and 6 of the parent characters are “吾”, “m”. Correspond. FIG. 13B is a diagram illustrating the blocks 2 and 3 in which the character codes in the blocks 2 and 3 are represented by characters identified by the character codes for easy understanding.

そして、コンテナデータ生成部１１ｂは、ルビ文字の文字コード１〜４の配置情報を有するブロックデータ２が、親文字の文字コード５，６の配置情報を有するブロックデータ３の右側に配置されるように、ブロックデータ２及びブロックデータ３の位置関係を決定する。 The container data generation unit 11b then arranges the block data 2 having the arrangement information of the ruby character codes 1 to 4 on the right side of the block data 3 having the arrangement information of the character codes 5 and 6 of the parent character. Then, the positional relationship between the block data 2 and the block data 3 is determined.

そして、コンテナデータ生成部１１ｂは、ブロックデータ２及びブロックデータ３の配置情報を有するコンテナデータを、上述したコンテナデータ２として置き換える。 Then, the container data generation unit 11b replaces the container data having the arrangement information of the block data 2 and the block data 3 with the container data 2 described above.

文章データに複数の＜ｒｕｂｙ＞タグが含まれている場合には、コンテナデータ生成部１１ｂは、複数の＜ｒｕｂｙ＞タグのそれぞれについて、上述した親文字及びルビ文字処理を実行する。 When the text data includes a plurality of <ruby> tags, the container data generation unit 11b executes the above-described parent character and ruby character processing for each of the plurality of <ruby> tags.

また、コンテナデータ生成部１１ｂは、修飾データ１００２を参照して、＜ｒｔ＞タグで囲まれた文字コード（ルビ文字の文字コード）の文字サイズを取得して、親文字の文字コードの２分の１に変更し、文字コードを識別する識別情報と関連づけて、メモリ１２に記憶する。この変更されたルビ文字の文字コードの文字サイズは、後述するステップＳ１７０３において、ルビ文字の文字コードの文字画像サイズを求める時に使用される。 Further, the container data generation unit 11b refers to the modification data 1002, acquires the character size of the character code (ruby character code) enclosed in the <rt> tag, and divides the character code of the parent character by two. And stored in the memory 12 in association with the identification information for identifying the character code. The character size of the changed ruby character code is used when obtaining the character image size of the ruby character code in step S1703, which will be described later.

コンテナデータ生成部１１ｂは、文書データが、横中文字を有する場合にも、上述した親文字及びルビ文字処理と同様の処理を実行する。 The container data generation unit 11b executes the same processing as the above-described parent character and ruby character processing even when the document data has horizontal and middle characters.

図１４は、文書データが横中文字を有する場合のコンテナデータ生成部１１ｂの処理を説明する図である。 FIG. 14 is a diagram for explaining the processing of the container data generation unit 11b when the document data has horizontal and middle characters.

コンテナデータ生成部１１ｂは、文書データが横中文字を有する場合、横中文字の文字コードの配置情報を有するブロック１０及びブロック１１を生成し、ブロック１０の配置情報を有するコンテナ１０及びブロック１１の配置情報を有するコンテナ１１を生成する。そして、コンテナデータ生成部１１ｂは、コンテナ１０を、文字コード”月”の前に配置し、コンテナ１１を、文字コード”月”と文字コード”日”との間に配置して、これらの配置情報を有するブロック１２を生成する。なお、文章データ１０００は、横中文字を有していないので、本実施形態では、横中文字の処理は行われない。 When the document data has horizontal horizontal characters, the container data generation unit 11b generates the blocks 10 and 11 having the arrangement information of the character code of the horizontal middle characters, and the container 10 and the block 11 having the arrangement information of the block 10 A container 11 having arrangement information is generated. The container data generation unit 11b arranges the container 10 before the character code “month”, arranges the container 11 between the character code “month” and the character code “day”, and arranges these arrangements. A block 12 having information is generated. Note that the text data 1000 does not have horizontal horizontal characters, so in the present embodiment, processing of horizontal horizontal characters is not performed.

次に、ステップＳ９１３において、処理部１１のブロックデータ生成部１１ｃは、分割禁止文字処理を実行する。具体的には、ブロックデータ生成部１１ｃは、文章データに含まれる文字コードが、当該文字コードの直前又は直後に位置する文字コードと分割して文字として表示されることが禁止される分割禁止の文字コードである場合、当該文字コードと、当該文字コードの直前又は直後に位置する文字コードとの配置情報を有するブロックデータを生成する。 Next, in step S913, the block data generation unit 11c of the processing unit 11 executes a division prohibited character process. Specifically, the block data generation unit 11c divides the character code included in the text data from the character code located immediately before or after the character code and is prohibited from being displayed as a character. If it is a character code, block data having arrangement information of the character code and the character code located immediately before or after the character code is generated.

分割禁止の文字コードは、”、”、”？”、”。”、”」”、”）”のような行頭に配置されることが禁止される行頭禁則文字の文字コードを含む。行頭禁則文字は、直前に位置する文字と分割して表示されることが禁止される。また、分割禁止の文字コードは、”「”、”（”のような行末に配置されることが禁止される行末禁則文字の文字コードを含む。行末禁則文字は、直後に位置する文字と分割して表示されることが禁止される。 The character codes that cannot be divided include the character codes of prohibited characters that cannot be placed at the beginning of a line such as “,”, “?”, “.”, “” ”,“) ”. Characters are prohibited from being displayed separately from the immediately preceding character, and character codes that are prohibited from being divided are prohibited from being placed at the end of lines such as "" "or" (". Contains the character code of the end-of-line prohibited character, which is prohibited from being displayed separately from the character immediately after it.

更に説明すると、ブロックデータ生成部１１ｃは、まず、文章コード配置データ１００１が分割禁止の文字コードを含むか否かを判断する。 More specifically, the block data generation unit 11c first determines whether or not the sentence code arrangement data 1001 includes a character code that cannot be divided.

そして、ブロックデータ生成部１１ｃは、文章データに含まれる文字コードが、当該文字コードの直前に位置する文字コードと分割して表示されることが禁止される行頭禁則文字の文字コードの場合、当該文字コードと、当該文字コードの直前に位置する文字コードとの配置情報を有するブロックデータを生成する。 Then, the block data generation unit 11c, when the character code included in the sentence data is a character code of a forbidden character that is prohibited from being displayed separately from the character code positioned immediately before the character code, Block data having arrangement information of the character code and the character code located immediately before the character code is generated.

図１５（Ａ）及び図１５（Ｂ）に示すように、ブロックデータ生成部１１ｃは、文章コード配置データ１００１に含まれる文字コード１２が、行頭禁則文字である”。”の場合、文字コード１２と、当該文字コードの直前に位置する文字コード１１との配置情報を有するブロック４を生成する。図１５（Ａ）では、説明を分かり易くするために、図１５（Ｂ）に示されるブロック内の文字コードが、文字コードにより識別される文字で示されている。 As shown in FIGS. 15A and 15B, the block data generation unit 11c causes the character code 12 when the character code 12 included in the sentence code arrangement data 1001 is a forbidden character. And a block 4 having arrangement information with the character code 11 positioned immediately before the character code. In FIG. 15A, for easy understanding of the description, the character code in the block shown in FIG. 15B is indicated by a character identified by the character code.

また、図１５（Ａ）及び図１５（Ｂ）に示すように、ブロックデータ生成部１１ｃは、文章コード配置データ１００１に含まれる文字コード２０が、行頭禁則文字である”。”の場合、文字コード２０と、当該文字コードの直前に位置する文字コード１９との配置情報を有するブロック５を生成する。 Further, as shown in FIGS. 15A and 15B, the block data generation unit 11c causes the character code 20 included in the sentence code arrangement data 1001 to be a character when the character is a forbidden character. A block 5 having arrangement information of the code 20 and the character code 19 positioned immediately before the character code is generated.

そして、コンテナデータ生成部１１ｂは、ブロック４の配置情報を有するコンテナ３を生成して、コンテナ３を、文字コード１０と文字コード１３の間に位置するようにブロック１内に配置する。また、コンテナデータ生成部１１ｂは、ブロック５の配置情報を有するコンテナ４を生成して、コンテナ４が文字コード１８の後ろに位置するようにブロック１内に配置する。これにより、ブロック１の直下に、ブロック４及びブロック５が配置されることを回避される。 Then, the container data generation unit 11 b generates the container 3 having the arrangement information of the block 4 and arranges the container 3 in the block 1 so as to be positioned between the character code 10 and the character code 13. In addition, the container data generation unit 11 b generates the container 4 having the arrangement information of the block 5 and arranges the container 4 in the block 1 so that the container 4 is positioned behind the character code 18. Thereby, it is avoided that the block 4 and the block 5 are arranged immediately below the block 1.

また、ブロックデータ生成部１１ｃは、文章データに含まれる文字コードが、当該文字コードの直後に位置する文字コードと分割して表示されることが禁止される行末禁則文字の場合、当該文字コードと、当該文字コードの直後に位置する文字コードとの配置情報を有するブロックデータを生成する。なお、ブロックデータ生成部１１ｃが、文章コード配置データ１００１が分割禁止の文字コードを含まないと判断した場合、分割禁止文字処理は実行されない。 Further, the block data generation unit 11c, when the character code included in the text data is a line ending prohibited character that is prohibited from being displayed separately from the character code positioned immediately after the character code, Then, block data having arrangement information with the character code located immediately after the character code is generated. Note that when the block data generation unit 11c determines that the text code arrangement data 1001 does not include a character code that cannot be divided, the division-prohibited character processing is not executed.

次に、ステップＳ９１５において、処理部１１のブロックデータ生成部１１ｃは、欧文文字処理を実行する。ブロックデータ生成部１１ｃは、まず、文章データ内に連続して配置される複数の欧文の文字コードが含まれるか否かを判断する。本実施形態の文章データ１０００の処理では、コンテナデータ生成部１１ｂは、文章データ内に連続して配置される複数の欧文の文字コードが含まれないと判断する。 Next, in step S915, the block data generation unit 11c of the processing unit 11 executes a European character process. The block data generation unit 11c first determines whether or not a plurality of European character codes arranged continuously in the sentence data are included. In the processing of the text data 1000 according to the present embodiment, the container data generation unit 11b determines that a plurality of European character codes arranged continuously in the text data are not included.

ここでは、図１６（Ａ）及び図１６（Ｂ）に示す例を用いて、欧文文字処理について、以下に説明する。 Here, using the example shown in FIGS. 16A and 16B, the Western character processing will be described below.

図１６（Ａ）に示す例では、文章データに基づいて、ブロック２０が生成されている。ブロック２０は、文字コード”Ｉ”と、半角スペースと、連続する４つの文字コード”ｈ”、”ａ”、”ｖ”、”ｅ”と、半角スペースと、文字コード”ａ”と、連続する３つの文字コード”ｐ”、”ｅ”、”ｎ”を有する。なお、図１６（Ａ）及び図１６（Ｂ）では、説明を分かり易くするために、文字コードが、文字コードにより識別される文字で示されている。 In the example shown in FIG. 16A, the block 20 is generated based on the text data. The block 20 includes a character code “I”, a single-byte space, and four consecutive character codes “h”, “a”, “v”, “e”, a single-byte space, and a character code “a”. Has three character codes “p”, “e”, and “n”. In FIG. 16A and FIG. 16B, for easy understanding of the description, the character code is indicated by a character identified by the character code.

ブロックデータ生成部１１ｃは、文章データ内に連続して配置される複数の欧文の文字コードが含まれる場合、連続する複数の欧文の文字コードの配置情報に基づいて、連続する複数の欧文の文字コードの配置情報を有するブロックデータを生成する。複数の欧文の文字コードの配置情報として、例えば、各文字コードが配置される順番及び組方向が挙げられる。 The block data generation unit 11c, when the text data includes a plurality of continuous character codes arranged in the text data, the block data generation unit 11c, based on the arrangement information of the continuous plurality of European character codes, Block data having code arrangement information is generated. The arrangement information of a plurality of European character codes includes, for example, the order in which the character codes are arranged and the composition direction.

図１６（Ｂ）に示すように、ブロックデータ生成部１１ｃは、連続する４つの欧文の文字コード”ｈ”、”ａ”、”ｖ”、”ｅ”の配置情報を有するブロック２１を生成する。また、ブロックデータ生成部１１ｃは、連続する３つの欧文の文字コード”ｐ”、”ｅ”、”ｎ”の配置情報を有するブロック２２を生成する。 As shown in FIG. 16B, the block data generation unit 11c generates a block 21 having arrangement information of four consecutive European character codes “h”, “a”, “v”, and “e”. . Further, the block data generation unit 11c generates a block 22 having arrangement information of three consecutive European character codes “p”, “e”, and “n”.

そして、コンテナデータ生成部１１ｂは、ブロック２１の配置情報を有するコンテナ２０を生成して、コンテナ２０が文字コード”Ｉ”に続く半角スペースの後に位置するようにブロック２０内に配置する。また、コンテナデータ生成部１１ｂは、ブロック２２の配置情報を有するコンテナ２１を生成して、コンテナ２１が文字コード”ａ”に続く半角スペースの後に位置するようにブロック２０内に配置する。これにより、ブロック２０の直下に、ブロック２１及びブロック２２が配置されることを回避される。 Then, the container data generation unit 11b generates the container 20 having the arrangement information of the block 21, and arranges the container 20 in the block 20 so as to be positioned after the half-width space following the character code “I”. Further, the container data generation unit 11b generates the container 21 having the arrangement information of the block 22, and arranges the container 21 in the block 20 so as to be positioned after the half-width space following the character code “a”. Thereby, it is avoided that the block 21 and the block 22 are arranged immediately below the block 20.

次に、ステップＳ９１７において、処理部１１の画像データ生成部１１ｄは、文章コード配置データ１００１に含まれる画像データを抽出し、画像データを識別する識別情報（例えば、画像データ１、画像データ２）と関連づけて、画像テーブル１１２２に登録して、オブジェクトデータ１１２０内に画像テーブル１１２２を配置する。なお、文章コード配置データ１００１が画像データを含まない場合には、この処理は行われない。 Next, in step S917, the image data generation unit 11d of the processing unit 11 extracts image data included in the text code arrangement data 1001, and identification information for identifying the image data (for example, image data 1, image data 2). Is registered in the image table 1122, and the image table 1122 is arranged in the object data 1120. Note that this processing is not performed when the text code arrangement data 1001 does not include image data.

処理部１１は、文章データに含まれる他の章に対しても、上述した各処理を行う。本実施形態では、上述したように、文章データ１０００は、１つの章のみを有するので、ステップＳ９０５〜ステップＳ９１９の処理が１回だけ行われる。 The processing unit 11 performs the above-described processes on other chapters included in the text data. In the present embodiment, as described above, the sentence data 1000 includes only one chapter, and therefore, the processing from step S905 to step S919 is performed only once.

次に、図１７を参照しながら、情報処理装置１０の動作を、以下に説明する。 Next, the operation of the information processing apparatus 10 will be described below with reference to FIG.

文章データに含まれる全ての章に対して、ステップＳ１７０１〜ステップＳ１７０７の処理が行われる。本実施形態では、文章データ１０００は、１つの章のみを有するので、ステップＳ１７０１〜ステップＳ１７０７の処理は１回だけ行われる。 Steps S1701 to S1707 are performed on all chapters included in the text data. In the present embodiment, since the sentence data 1000 includes only one chapter, the processing from step S1701 to step S1707 is performed only once.

まず、ステップＳ１７０３において、処理部１１の文字画像サイズ生成部１１ｅは、文章データに含まれる複数の文字コードのそれぞれについて、当該文字コードと関連づけられたフォント及び文字サイズに基づいて、当該文字コードの文字画像のサイズを示す文字画像サイズを求める。文字コードの文字画像は、文字コードと関連付けられているフォントに基づいて、文字コードが画像情報に変換された画像である。文字画像サイズは、文字画像の縦のサイズ及び文字画像の横のサイズを有する。この文字画像を生成する処理については後述する。 First, in step S1703, the character image size generation unit 11e of the processing unit 11 determines the character code of each of a plurality of character codes included in the sentence data based on the font and the character size associated with the character code. A character image size indicating the size of the character image is obtained. The character image of the character code is an image obtained by converting the character code into image information based on the font associated with the character code. The character image size has a vertical size of the character image and a horizontal size of the character image. The process for generating the character image will be described later.

文字画像サイズ生成部１１ｅは、文章データに含まれる複数の文字コードのそれぞれについて、修飾データ１００２を参照して、当該文字コードと関連づけられたフォントを取得する。そして、文字画像サイズ生成部１１ｅは、メモリ１２に記憶されたフォントパッケージ１２ｂを参照して、当該文字コードと関連付けられたフォント字形の仮想ボディ高さ及びフォント字形の仮想ボディ幅を取得する。 The character image size generation unit 11e refers to the modification data 1002 for each of a plurality of character codes included in the text data, and acquires a font associated with the character code. Then, the character image size generation unit 11e refers to the font package 12b stored in the memory 12, and acquires the font body virtual body height and the font body virtual body width associated with the character code.

また、文字画像サイズ生成部１１ｅは、文章データに含まれる複数の文字コードのそれぞれについて、修飾データ１００２を参照して、当該文字コードと関連づけられた文字サイズを取得する。ここで、文字コードがルビ文字の場合には、文字画像サイズ生成部１１ｅは、メモリ１２を参照して、当該文字コードと関連づけられた文字サイズを取得する。 Further, the character image size generation unit 11e refers to the modification data 1002 for each of the plurality of character codes included in the text data, and acquires the character size associated with the character code. Here, when the character code is a ruby character, the character image size generation unit 11e refers to the memory 12 and acquires the character size associated with the character code.

図１８に示すように、文字画像サイズ生成部１１ｅは、文章データに含まれる複数の文字コードのそれぞれについて、当該文字コードの文字画像の縦のサイズを、下記式（１）により求める。 As shown in FIG. 18, the character image size generation unit 11 e obtains the vertical size of the character image of the character code for each of the plurality of character codes included in the text data by the following equation (1).

文字画像の縦のサイズ＝文字サイズ×フォント字形の仮想ボディ高さ÷フォント字形の仮想ボディ幅（１） Vertical size of character image = Character size × Virtual body height of font character shape ÷ Virtual body width of font character shape (1)

また、文字画像サイズ生成部１１ｅは、文章データに含まれる複数の文字コードのそれぞれについて、当該文字コードの文字画像の横のサイズを、下記式（２）により求める。 Further, the character image size generation unit 11e obtains the horizontal size of the character image of the character code for each of the plurality of character codes included in the sentence data by the following equation (2).

文字画像の横のサイズ＝文字サイズ（２） Horizontal size of character image = Character size (2)

そして、文字画像サイズ生成部１１ｅは、文章データに含まれる複数の文字コードのそれぞれについて、文字画像の縦のサイズ及び文字画像の横のサイズを、文字コードを識別する識別情報（例えば、文字コード１、文字コード２）と関連づけて、オブジェクトサイズ情報テーブル１１１３に登録する。 Then, the character image size generation unit 11e determines, for each of the plurality of character codes included in the text data, the vertical size of the character image and the horizontal size of the character image by identifying information (for example, a character code) 1 and associated with the character code 2) and registered in the object size information table 1113.

上述した文字画像の縦のサイズ及び文字画像の横のサイズの求め方は、組方向が縦書きの場合である。一方、組方向が横書きの場合には、文字画像の縦のサイズ及び文字画像の横のサイズの求め方が入れ替わる。 The above-described method for obtaining the vertical size of the character image and the horizontal size of the character image is when the writing direction is vertical writing. On the other hand, when the writing direction is horizontal writing, the method for obtaining the vertical size of the character image and the horizontal size of the character image are interchanged.

次に、ステップＳ１７０５において、処理部１１の文字画像サイズ生成部１１ｅは、カーニング処理を実行する。まず、文字画像サイズ生成部１１ｅは、文章データに含まれる複数の文字コードのそれぞれについて、メモリ１２に記憶されたフォントパッケージ１２ｂを参照して、当該文字コードがカーニング処理の対象となる文字コードであるか否かを判断する。文字画像サイズ生成部１１ｅは、カーニング処理の対象となる文字コードと判断した文字コードに対して、文字画像サイズを変更する処理を行う。 Next, in step S1705, the character image size generation unit 11e of the processing unit 11 performs a kerning process. First, the character image size generation unit 11e refers to the font package 12b stored in the memory 12 for each of a plurality of character codes included in the sentence data, and the character code is a character code to be subjected to kerning processing. Judge whether there is. The character image size generation unit 11e performs a process of changing the character image size for the character code determined as the character code to be subjected to the kerning process.

図１９は、組方向が縦書きの場合のカーニング処理を説明する図である。 FIG. 19 is a diagram for explaining kerning processing when the writing direction is vertical writing.

図１９に示す例では、文字画像サイズ生成部１１ｅは、２つの連続する文字コードである”Ｖ””Ａ”を、カーニング処理の対象と判断する。 In the example illustrated in FIG. 19, the character image size generation unit 11e determines that two consecutive character codes “V” and “A” are to be subjected to kerning processing.

図１９（Ｂ）に示すように、文字画像サイズ生成部１１ｅは、文字コード”Ａ”の文字画像の横のサイズを、所定の割合に縮小して、新たな文字画像の横のサイズを求める。文字画像の横のサイズを縮小する割合は、カーニング処理の対象となる文字コードごとにフォントパッケージ１２ｂに登録されている。 As shown in FIG. 19B, the character image size generation unit 11e reduces the horizontal size of the character image with the character code “A” to a predetermined ratio to obtain the horizontal size of a new character image. . The ratio of reducing the horizontal size of the character image is registered in the font package 12b for each character code to be subjected to kerning processing.

これにより、２つの連続する文字コード”Ｖ””Ａ”が表示される時には、図１９（Ａ）に示すように、カーニング処理後の文字コード”Ａ”の字形領域は、カーニング処理前よりも、文字コード”Ｖ”に近づいて表示される。 As a result, when two consecutive character codes “V” and “A” are displayed, as shown in FIG. 19A, the character-shaped area of the character code “A” after the kerning process is more than that before the kerning process. The character code “V” is displayed.

文字画像サイズ生成部１１ｅは、カーニング処理の対象となった文字コードの新たな文字画像の横のサイズを、文字コードを識別する識別情報（例えば、文字コード１、文字コード２）と関連づけて、オブジェクトサイズ情報テーブル１１１３に登録する。また、文字画像サイズ生成部１１ｅは、合字処理の対象となる文字コードに対しても、カーニング処理と同様の処理を行う。 The character image size generation unit 11e associates the horizontal size of the new character image of the character code subjected to kerning processing with identification information (for example, character code 1, character code 2) for identifying the character code, Register in the object size information table 1113. Further, the character image size generation unit 11e performs the same process as the kerning process on the character code that is the target of the ligature process.

処理部１１は、文章データに含まれる他の章に対しても、上述した各処理を行う。本実施形態では、上述したように、文章データ１０００は、１つの章のみを有するので、ステップＳ１７０１〜ステップＳ１７０７の処理が１回だけ行われる。 The processing unit 11 performs the above-described processes on other chapters included in the text data. In the present embodiment, as described above, the sentence data 1000 includes only one chapter, and therefore, the processing from step S1701 to step S1707 is performed only once.

次に、ステップＳ１７０９において、処理部１１の画像データ生成部１１ｄは、文章データに含まれる複数の文字コードのそれぞれについて、当該文字コードと関連付けられているフォントに基づいて、当該文字コードが画像情報に変換された文字画像データを生成する。 Next, in step S1709, the image data generation unit 11d of the processing unit 11 converts the character code into image information based on the font associated with the character code for each of the plurality of character codes included in the text data. Character image data converted into is generated.

画像データ生成部１１ｄは、文章データに含まれる複数の文字コードのそれぞれについて、修飾データ１００２を参照して、当該文字コードと関連づけられたフォントを取得する。そして、文字画像サイズ生成部１１ｅは、メモリ１２に記憶されたフォントパッケージ１２ｂを参照して、当該文字コードと関連付けられたフォント字形データを取得する。 The image data generation unit 11d refers to the modification data 1002 for each of a plurality of character codes included in the text data, and acquires a font associated with the character code. Then, the character image size generation unit 11e refers to the font package 12b stored in the memory 12, and acquires font character form data associated with the character code.

そして、文字画像サイズ生成部１１ｅは、図２０に示すように、フォント字形データに基づいて、文字コードが画像情報に変換された文字画像データを生成する。ここで、文字画像サイズ生成部１１ｅは、文字画像データをベクタ情報として生成することが好ましい。画像データ生成部１１ｄは、文字画像データを、文字コードを識別する識別情報（例えば、文字コード１、文字コード２）と関連づけて、文字画像テーブル１１２１に登録して、オブジェクトデータ１１２０内に文字画像テーブル１１２１を配置する。 Then, as shown in FIG. 20, the character image size generation unit 11e generates character image data in which the character code is converted into image information based on the font character shape data. Here, the character image size generation unit 11e preferably generates character image data as vector information. The image data generation unit 11d registers the character image data in the character image table 1121 in association with identification information (for example, character code 1, character code 2) for identifying the character code, and stores the character image in the object data 1120. A table 1121 is arranged.

このようにして、情報処理装置１０は、図１１に示すデータ構造１１００を有する文章データを生成する。 In this way, the information processing apparatus 10 generates text data having the data structure 1100 shown in FIG.

情報処理装置１０は、データ構造１１００を有する文章データを、ネットワークＮを介して、サーバ２０へ送信する。サーバ２０は、データ構造１１００を有する文章データを記憶する。 The information processing apparatus 10 transmits text data having the data structure 1100 to the server 20 via the network N. The server 20 stores sentence data having a data structure 1100.

次に、端末３０の動作を、図２１〜図２８を参照しながら、以下に説明する。 Next, the operation of the terminal 30 will be described below with reference to FIGS.

まず、端末３０のユーザは、端末３０を用いて、文書データを記憶しているサーバ２０に対して、ネットワークＮを介して、所定の文書データを要求する。サーバ２０は、要求された文書データを、ネットワークＮを介して端末３０に送信する。 First, the user of the terminal 30 uses the terminal 30 to request predetermined document data from the server 20 storing the document data via the network N. The server 20 transmits the requested document data to the terminal 30 via the network N.

端末３０は、サーバ２０から、図７又は図１１に示すようなデータ構造を有する文書データを受信する。 The terminal 30 receives document data having a data structure as shown in FIG.

図２１は、端末３０が、受信した文書データに基づいて表示する文章情報を生成して、表示部３３に表示される文書を示す図である。端末３０は、文書データに基づいて生成された文章を、１ページごとに表示部３３に表示する。 FIG. 21 is a diagram illustrating a document displayed on the display unit 33 by the terminal 30 generating text information to be displayed based on received document data. The terminal 30 displays the text generated based on the document data on the display unit 33 page by page.

以下、端末３０が、図７又は図１１に示すようなデータ構造を有する文書データに基づいて、表示部３３に表示する文章情報を生成して、文章を表示する処理を以下に説明する。 Hereinafter, a process in which the terminal 30 generates sentence information to be displayed on the display unit 33 based on document data having a data structure as illustrated in FIG. 7 or FIG. 11 and displays the sentence will be described.

まず、端末３０の処理部３１の閲覧実行部３１ａは、レイアウトデータ内のコンテナサイズ情報テーブルを参照して、コンテナサイズ（最小幅、最大幅、最小高さ、最大高さ）が登録されていれば、これらの情報を取得する。閲覧実行部３１ａは、文章データをリフロー表示するか、又はフィックス表示するかに応じて、コンテナデータの表示サイズを決定する。リフロー表示するか、又はフィックス表示するかについては、コンテナサイズ情報テーブルにより指定され得る。 First, the browsing execution unit 31a of the processing unit 31 of the terminal 30 refers to the container size information table in the layout data, and the container size (minimum width, maximum width, minimum height, maximum height) is registered. For example, the information is acquired. The browsing execution unit 31a determines the display size of the container data depending on whether the text data is reflow-displayed or fixed-displayed. Whether to perform reflow display or fix display can be specified by the container size information table.

まず、文章データをリフロー表示する場合の処理を以下に説明する。文章データがリフロー表示される場合には、コンテナサイズ情報テーブルには、コンテナサイズ（最小幅、最大幅、最小高さ、最大高さ）が登録されていない。閲覧実行部３１ａは、縦書き表示する場合には、図２２に示すように、閲覧実行部３１ａが文章データを表示する表示枠の高さを、コンテナデータの縦方向の表示サイズとして決定する。縦書き表示する場合、コンテナデータの横方向の表示サイズは、文章データに含まれる情報量に基づいて決定される。 First, the process for reflow display of text data will be described below. When the text data is reflow-displayed, the container size (minimum width, maximum width, minimum height, maximum height) is not registered in the container size information table. When performing vertical writing display, the browsing execution unit 31a determines the height of the display frame in which the browsing execution unit 31a displays text data as the vertical display size of the container data, as shown in FIG. When displaying vertically, the display size of the container data in the horizontal direction is determined based on the amount of information included in the text data.

また、閲覧実行部３１ａは、横書き表示する場合には、図２２に示すように、閲覧実行部３１ａが文章データを表示する表示枠の幅を、コンテナデータの横方向の表示サイズとして決定する。横書き表示する場合には、コンテナデータの縦方向の表示サイズは、文章データに含まれる情報量に基づいて決定される。 Moreover, when performing horizontal writing display, the browsing execution unit 31a determines the width of the display frame in which the browsing execution unit 31a displays text data as the horizontal display size of the container data, as shown in FIG. When displaying horizontally, the vertical display size of the container data is determined based on the amount of information included in the text data.

次に、文章データをフィックス表示する場合の処理を以下に説明する。閲覧実行部３１ａは、コンテナサイズの最大高さと、閲覧実行部３１ａが文章データを表示する表示枠の高さとを比較して、２つの高さの内の小さい方の高さを、コンテナデータの縦方向の表示サイズとして決定する。また、閲覧実行部３１ａは、コンテナサイズの最大幅と、閲覧実行部３１ａが文章データを表示する表示枠の幅とを比較して、２つの幅の内の小さい方の幅を、コンテナデータの幅方向の表示サイズとして決定する。これにより、コンテナデータの配置可能範囲が決定される。 Next, the processing when the text data is displayed as a fix will be described below. The browsing execution unit 31a compares the maximum height of the container size with the height of the display frame on which the browsing execution unit 31a displays the text data, and determines the smaller one of the two heights of the container data. Determined as the vertical display size. Further, the browsing execution unit 31a compares the maximum width of the container size with the width of the display frame on which the browsing execution unit 31a displays the text data, and determines the smaller one of the two widths of the container data. It is determined as the display size in the width direction. Thereby, the possible arrangement range of the container data is determined.

なお、文章データがリフロー表示される場合でも、写真又は絵等の画像データを含むコンテナデータに対しては、コンテナサイズ（最小幅、最大幅、最小高さ、最大高さ）が登録されている場合がある。この場合、閲覧実行部３１ａは、フィックス表示する場合と同様の処理により、コンテナデータの縦方向及び幅方向の表示サイズを決定する。 Even when text data is reflow-displayed, the container size (minimum width, maximum width, minimum height, maximum height) is registered for container data including image data such as photographs or pictures. There is a case. In this case, the browsing execution unit 31a determines the display size of the container data in the vertical direction and the width direction by the same processing as in the case of displaying the fix.

本実施形態では、閲覧実行部３１ａが、文章データを縦書きでリフロー表示する場合について、以下に説明する。 In the present embodiment, the case where the browsing execution unit 31a performs reflow display of text data in vertical writing will be described below.

次に、閲覧実行部３１ａは、レイアウトデータ内のブロックサイズ情報テーブルを参照して、ブロックデータのそれぞれに対して、上下左右のマージン、上下左右のパディングを取得する。本実施形態では、閲覧実行部３１ａは、文章データを縦書きでリフロー表示するので、上下のマージン及び上下のパディングを取得する。 Next, the browsing execution unit 31a refers to the block size information table in the layout data, and acquires vertical and horizontal margins and vertical and horizontal padding for each block data. In the present embodiment, the browsing execution unit 31a reflow-displays the text data in vertical writing, and thus acquires the upper and lower margins and the upper and lower padding.

図２３に示すように、閲覧実行部３１ａは、ブロックデータのそれぞれに対して、上下のマージンに基づいて、ブロックデータの配置可能範囲を決定する。また、閲覧実行部３１ａは、ブロックデータのそれぞれに対して、上下のパディングに基づいて、文字画像データの配置可能範囲を決定する。左右のマージン及び左右のパディングは、後述する処理において使用される。 As illustrated in FIG. 23, the browsing execution unit 31a determines the possible arrangement range of block data based on the upper and lower margins for each block data. In addition, the browsing execution unit 31a determines the possible arrangement range of the character image data based on the upper and lower padding for each of the block data. The left and right margins and the left and right padding are used in the processing described later.

なお、文章データを縦書きでリフロー表示する場合には、閲覧実行部３１ａは、ブロックデータのそれぞれに対して、左右のマージンに基づいて、ブロックデータの配置可能範囲を決定し、左右のパディングに基づいて、文字画像データの配置可能範囲を決定する。 When the text data is reflow-displayed in vertical writing, the browsing execution unit 31a determines the possible arrangement range of the block data based on the left and right margins for each of the block data, and performs left and right padding. Based on this, the possible arrangement range of the character image data is determined.

次に、閲覧実行部３１ａは、ブロックデータのそれぞれに対して、レイアウトデータ内のオブジェクトサイズ情報テーブルを参照して、文字コードのそれぞれについて、文字画像の縦のサイズ及び文字画像の横のサイズを取得する。そして、閲覧実行部３１ａは、文字コードの文字画像が配置される領域を、当該文字コードの文字画像の縦のサイズ及び文字画像の横のサイズに基づいて決定する。 Next, the browsing execution unit 31a refers to the object size information table in the layout data for each of the block data, and sets the vertical size of the character image and the horizontal size of the character image for each character code. get. And the browsing execution part 31a determines the area | region where the character image of a character code is arrange | positioned based on the vertical size of the character image of the said character code, and the horizontal size of a character image.

そして、閲覧実行部３１ａは、図２４に示すように、文字コードのそれぞれに対して、文字コードの文字画像が配置される領域を、文字画像データの配置可能範囲に配置する。図２４において、文字コードの文字画像が配置される領域は、矩形の領域である。以下、文字コードの文字画像が配置される領域を、単に文字コードの領域ともいう。 Then, as shown in FIG. 24, the browsing execution unit 31a arranges an area in which the character image of the character code is arranged in the character image data arrangement possible range for each character code. In FIG. 24, the area where the character image of the character code is arranged is a rectangular area. Hereinafter, a region where a character image of a character code is arranged is also simply referred to as a character code region.

また、閲覧実行部３１ａは、ブロックデータ内に配置されるコンテナデータ（他のコンテナデータ内に配置されるコンテナデータ）に含まれる文字コードは、同じ行に含まれるように、文字コードの領域を決定する。これにより、親文字とルビ文字、又は分割禁止文字等が、分離して表示されないようにする。 In addition, the browsing execution unit 31a sets the character code area so that the character codes included in the container data (container data arranged in other container data) arranged in the block data are included in the same line. decide. This prevents the parent character and the ruby character or the division prohibited character from being displayed separately.

上述した処理により、各ブロックに含まれる行数が決定される。本実施形態では、２つのブロック１及びブロック２が、文字画像データの配置可能範囲に配置される。この段階では、ページ区切りは配置されていない。 The number of rows included in each block is determined by the processing described above. In the present embodiment, two blocks 1 and 2 are arranged in the arrangement range of the character image data. At this stage, no page breaks are placed.

次に、閲覧実行部３１ａは、図２５に示すように、閲覧実行部３１ａが文章データを表示するページ幅の位置にページ区切りを配置する。なお、ページ区切りは、実際には表示さない。 Next, as shown in FIG. 25, the browsing execution unit 31a arranges page breaks at page width positions where the browsing execution unit 31a displays text data. Note that page breaks are not actually displayed.

そして、閲覧実行部３１ａは、ブロックデータのそれぞれに対して、レイアウトデータ内のブロックサイズ情報テーブルから取得した左右のマージンに基づいて、文字画像データの配置可能範囲の幅を決定する。左右のマージンは、ページ区切りの位置と、文字画像データの配置可能範囲との間の左右の距離を規定する。 And the browsing execution part 31a determines the width | variety of the arrangement | positioning possible range of character image data based on the left-right margin acquired from the block size information table in layout data with respect to each of block data. The left and right margins define the left and right distance between the page break position and the character image data arrangement possible range.

また、閲覧実行部３１ａは、ブロックデータのそれぞれに対して、レイアウトデータ内のブロックサイズ情報テーブルから取得した左右のパディングに基づいて、文字画像データの配置可能範囲に配置された文字コードの領域が配置される範囲を決定する。左右のパディングは、文字画像データの配置可能範囲の左右の境界と、内部に配置される文字コードの領域との間の左右の距離を規定する。 In addition, the browsing execution unit 31a has, for each block data, character code areas arranged in the arrangement range of the character image data based on the left and right padding acquired from the block size information table in the layout data. Determine the range to be placed. The left and right padding defines the left and right distances between the left and right boundaries of the character image data arrangement range and the character code area arranged inside.

そして、閲覧実行部３１ａは、１つのページ幅内に収まらない文字コードを、次ページへ移動する。 Then, the browsing execution unit 31a moves character codes that do not fit within one page width to the next page.

そして、閲覧実行部３１ａは、レイアウトデータ内のブロックサイズ情報テーブルを参照して、ブロックデータのそれぞれに対して、均等揃え又は中央揃え等の行単位の文字配置の揃えに関する情報を取得する。そして、閲覧実行部３１ａは、ブロックデータのそれぞれに対して、行単位の文字配置の揃えに関する情報に基づいて、文字コードの領域を配置する位置を決定する。 Then, the browsing execution unit 31a refers to the block size information table in the layout data, and acquires information regarding the alignment of character arrangement in units of lines such as uniform alignment or center alignment with respect to each of the block data. And the browsing execution part 31a determines the position which arrange | positions the area | region of a character code based on the information regarding the alignment of the character arrangement of a line unit with respect to each of block data.

更に、閲覧実行部３１ａは、レイアウトデータ内のオブジェクトサイズ情報テーブルを参照して、文字コードのそれぞれに対して、文字の上付き又は下付き等の文字単位の配置に関する情報を取得する。そして、閲覧実行部３１ａは、文字コードのそれぞれに対して、文字単位の配置に関する情報に基づいて、文字コードの領域を配置する位置を決定する。 Further, the browsing execution unit 31a refers to the object size information table in the layout data, and acquires information regarding the arrangement of character units such as a superscript or subscript for each character code. And the browsing execution part 31a determines the position which arrange | positions the area | region of a character code based on the information regarding arrangement | positioning of a character unit with respect to each of a character code.

図２６は、上述した処理により生成された文字コードの配置を示す。 FIG. 26 shows the arrangement of character codes generated by the above-described processing.

次に、閲覧実行部３１ａは、スタイルデータの罫線テーブルを参照して、ブロックデータのそれぞれに対して、文字の囲み、下線、背景色等に関する情報を取得する。そして、閲覧実行部３１ａは、ブロックデータのそれぞれに対して、文字の囲み、下線、背景色等に関する情報に基づいて、修飾情報の画像を生成して、文字画像データの配置可能範囲の所定の位置に配置する。 Next, the browsing execution unit 31a refers to the ruled line table of the style data, and acquires information related to character enclosure, underline, background color, and the like for each of the block data. And the browsing execution part 31a produces | generates the image of modification information based on the information regarding the surrounding of a character, an underline, a background color, etc. with respect to each of block data, The predetermined | prescribed range of arrangement | positioning of character image data Place in position.

図２７に示す例では、３つの文字コードに対して背景色の領域が描画されている。 In the example shown in FIG. 27, background color areas are drawn for three character codes.

次に、閲覧実行部３１ａは、文字コードのそれぞれについて、オブジェクトデータの文字画像テーブルを参照して、文字画像データを取得する。そして、閲覧実行部３１ａは、図２８に示すように、文字コードのそれぞれについて、文字画像データと、当該文字コードの文字画像の縦のサイズ及び文字画像の横のサイズとに基づいて、文字画像を生成して、当該文字コードの領域に配置する。 Next, the browsing execution unit 31a refers to the character image table of the object data for each character code and acquires character image data. Then, as shown in FIG. 28, the browsing execution unit 31a, for each character code, based on the character image data, the vertical size of the character image of the character code, and the horizontal size of the character image. Is generated and placed in the area of the character code.

また、閲覧実行部３１ａは、文字コードのそれぞれについて、オブジェクトデータの画像テーブルを参照して、画像データを取得する。そして、閲覧実行部３１ａは、画像データに基づいて、画像を生成して、文字画像データの配置可能範囲の所定の位置に生成した画像を配置する。なお、図２８に示す例では、このような画像は配置されていない。 In addition, the browsing execution unit 31a refers to the image table of object data for each character code, and acquires image data. And the browsing execution part 31a produces | generates an image based on image data, and arrange | positions the produced | generated image in the predetermined position of the arrangement | positioning possible range of character image data. In the example shown in FIG. 28, such an image is not arranged.

次に、閲覧実行部３１ａは、文字コードのそれぞれについて、スタイルデータのスタイル情報テーブルを参照して、文字色、圏点の有無、文字の装飾等に関する情報を取得する。そして、閲覧実行部３１ａは、文字色、圏点の有無、文字の装飾等に関する情報に基づいて、当該文字コードを修飾する情報の画像を生成して、文字画像データの配置可能範囲の所定の位置に配置する。図２１に示す例では、３つの文字に対して圏点が描画されている。 Next, for each character code, the browsing execution unit 31a refers to the style information table of the style data, and acquires information regarding the character color, the presence / absence of a mark, character decoration, and the like. And the browsing execution part 31a produces | generates the image of the information which modifies the said character code based on the information regarding a character color, the presence or absence of a mark, the decoration of a character, etc., and the predetermined | prescribed range of arrangement | positioning of character image data is possible. Place in position. In the example shown in FIG. 21, the sphere is drawn for three characters.

以上の処理により、端末３０が表示する文章データの表示情報が得られる。 Through the above processing, display information of the text data displayed by the terminal 30 is obtained.

上述した本実施形態のシステムによれば、情報処理装置により生成された文章データは、文字コードではなく、文字コードが画像情報に変換された文字画像データを有するので、情報処理装置により生成された文章データから文字コードを抜き取ることは容易ではない。 According to the system of the present embodiment described above, the text data generated by the information processing apparatus has character image data in which the character code is converted into image information instead of the character code. It is not easy to extract character codes from text data.

また、情報処理装置により生成された文章データは、文字コードを有する元の文章データに対して、文字コードが文字画像データに変換され、且つ文字の修飾情報が、文字コート等と関連づけられて所定のテーブルに登録されている。そのため、端末が、文字コードを有する元の文章データに基づいて、文章を表示する場合よりも、情報処理装置により生成された文章データに基づいて、文章を表示する処理に要する時間の方が短くなる。具体的には、端末側で、電子書籍データに含まれる文字コードに対して、文字コードと関連づけられたフォント字形データを読み出したり、カーニング処理及び合字処理、禁則文字処理、欧文文字処理等の対象となる文字コードの情報を動的に処理し、文字画像を生成して表示するのではなく、予め情報処理装置１０が生成したデータ構造の中にカーニング処理及び合字処理、禁則文字処理、欧文文字処理等が行われた情報が組み込まれているため、端末側で動的に処理を行う必要がなく、文章を表示する処理に要する時間が短くなる。従って、端末を用いて文書を閲覧するユーザに対して、快適な閲覧環境を提供することができる。 In addition, the text data generated by the information processing apparatus has a character code converted into character image data with respect to the original text data having a character code, and character modification information is associated with a character code or the like. Registered in the table. Therefore, the time required for the process of displaying the sentence based on the sentence data generated by the information processing device is shorter than when the terminal displays the sentence based on the original sentence data having the character code. Become. Specifically, on the terminal side, for the character code included in the electronic book data, font character data associated with the character code is read, kerning processing and ligature processing, forbidden character processing, European character processing, etc. Rather than dynamically processing target character code information and generating and displaying a character image, kerning processing and ligature processing, forbidden character processing in the data structure generated in advance by the information processing apparatus 10, Since information that has been subjected to European character processing or the like is incorporated, it is not necessary to dynamically perform processing on the terminal side, and the time required for processing to display a sentence is shortened. Therefore, it is possible to provide a comfortable browsing environment for users who browse documents using the terminal.

本発明では、上述した実施形態の情報処理方法、情報処理装置及びデータ構造は、本発明の趣旨を逸脱しない限り適宜変更が可能である。また、一の実施形態が有する構成要件は、他の実施形態にも適宜適用することができる。 In the present invention, the information processing method, the information processing apparatus, and the data structure of the above-described embodiment can be appropriately changed without departing from the gist of the present invention. In addition, the configuration requirements of one embodiment can be applied to other embodiments as appropriate.

例えば、上述した実施形態では、１次オーサリング及び２次オーサリングを、情報処理装置が行っていたが、サーバが、２次オーサリングを行ってもよい。この場合、情報処理装置は、１次オーサリングが終了した状態の文書データを、サーバに送信する。そして、サーバは、フォントパッケージを参照して、受信した文書データに対して、２次オーサリングを実行する。 For example, in the embodiment described above, the primary authoring and the secondary authoring are performed by the information processing apparatus, but the server may perform the secondary authoring. In this case, the information processing apparatus transmits document data in a state where the primary authoring is completed to the server. Then, the server refers to the font package and executes secondary authoring on the received document data.

１システム
１０情報処理装置
１１処理部
１１ａレイアウトデータ生成部
１１ｂコンテナデータ生成部
１１ｃブロックデータ生成部
１１ｄ画像データ生成部（文字画像データ生成部）
１１ｅ文字画像サイズ生成部
１２メモリ
１２ａプログラム
１２ｂフォントパッケージ
１２ｃ記憶媒体
１３表示部
１４入力インタフェース
１５通信部
２０サーバ
２１処理部
２２メモリ
２２ａ記憶媒体
２３表示部
２４入力インタフェース
２５通信部
３０端末
３１処理部
３１ａ閲覧実行部
３２メモリ
３２ａ記憶媒体
３３表示部
３４入力インタフェース
３５通信部
Ｎネットワーク DESCRIPTION OF SYMBOLS 1 System 10 Information processing apparatus 11 Processing part 11a Layout data generation part 11b Container data generation part 11c Block data generation part 11d Image data generation part (character image data generation part)
11e Character image size generation unit 12 Memory 12a Program 12b Font package 12c Storage medium 13 Display unit 14 Input interface 15 Communication unit 20 Server 21 Processing unit 22 Memory 22a Storage medium 23 Display unit 24 Input interface 25 Communication unit 30 Terminal 31 Processing unit 31a Browsing execution unit 32 Memory 32a Storage medium 33 Display unit 34 Input interface 35 Communication unit N Network

Claims

Generating container data having arrangement information of one or more paragraphs based on arrangement information of one or more paragraphs included in sentence data having a plurality of character codes associated with a font;
For each of one or more paragraphs included in the sentence data, generating block data having arrangement information of one or more character codes based on arrangement information of one or more character codes included in the paragraph When,
For each of a plurality of character codes included in the text data, generating character image data in which the character code is converted into image information based on a font associated with the character code;
An information processing method including:

Generating the block data includes
When the character code included in the sentence data is a character code that is prohibited from being divided and displayed as a character divided with a character code located immediately before or after the character code, the character code, The information processing method according to claim 1, further comprising: generating the block data having arrangement information with a character code positioned immediately before or after the character code.

Generating the container data includes
When the consecutively arranged character codes included in the sentence data have a relationship between a parent character and a ruby character, the first block data having arrangement information of the character code of the parent character, and a ruby character Generating the second block data having code arrangement information, determining the positional relationship between the first block data and the second block data, and the first block data and the second block The information processing method according to claim 1, further comprising generating the container data having data arrangement information.

Generating the block data includes
In the case where a plurality of European character codes that are continuously arranged are included in the sentence data, the arrangement information of the plurality of consecutive European character codes is based on the arrangement information of the plurality of consecutive European character codes. The information processing method according to any one of claims 1 to 3, further comprising: generating the block data having:

The sentence data has a character size associated with a character code included in the sentence data;
The method includes: obtaining, for each of a plurality of character codes included in the text data, a character image size indicating a size of the character image of the character code based on a font and a character size associated with the character code. The information processing method as described in any one of 1-4.

The sentence data has character modification information associated with a character code included in the sentence data,
The information according to any one of claims 1 to 5, comprising generating, for each of a plurality of character codes included in the text data, character modification data having character modification information associated with the character code. Processing method.

The information processing method according to any one of claims 1 to 6, further comprising transmitting the container data, the block data, and the character image data.

A container data generation unit that generates container data having arrangement information of one or more paragraphs based on arrangement information of one or more paragraphs included in sentence data having a plurality of character codes associated with a font;
A block that generates block data having one or more character code arrangement information for each of one or more paragraphs included in the sentence data based on the arrangement information of one or more character codes included in the paragraph A data generator;
For each of a plurality of character codes included in the sentence data, a character image data generation unit that generates character image data in which the character code is converted into image information based on a font associated with the character code;
An information processing apparatus comprising a processing unit having

Container data having arrangement information of one or more paragraphs included in sentence data having a plurality of character codes associated with a font;
For each of one or more paragraphs included in the sentence data, block data having arrangement information of one or more character codes included in the paragraph;
Each of the plurality of character codes included in the sentence data is character image data obtained by converting the character code into image information based on a font associated with the character code;
A data structure with