JP2022107570A

JP2022107570A - Electronic device for viewing electronic document, and display method

Info

Publication number: JP2022107570A
Application number: JP2021024699A
Authority: JP
Inventors: 多一石川; Taichi Ishikawa
Original assignee: Individual
Current assignee: Individual
Priority date: 2021-01-11
Filing date: 2021-01-11
Publication date: 2022-07-22

Abstract

To provide an electronic device which reduces eye fatigue when reading an electronized document and improves a reader's reading speed and a display method of electronic document data.SOLUTION: An electronic device 10 for viewing electronic documents performs syntactic analysis using a morphological analyzer on character string data of a target electronic document, and generates noun phrases replaceable by demonstrative pronouns and second string data divided by other units, and displays the divided second string data to a reader sequentially in a sequence from front. In a case of sequential display, the display speed can be changed, and a format of the second character string data can be adjusted as appropriate by making a font size of noun phrases larger than the others, by coloring the characters, and the like.SELECTED DRAWING: Figure 2

Description

本発明は、電子文書を閲覧する際に利用される電子機器、および電子文書データの表示方法に関する。The present invention relates to an electronic device used when browsing electronic documents and a method of displaying electronic document data.

産業界では、電子文書の需要、供給が日々増大している。電子文書の例としてはインターネット上で公開される論文、電子書籍等が挙げられる。本明細書のような特許公報もその一例である。また、最終的に紙に印刷する場合でも、推敲段階では電子データであることが多い。In industry, the demand and supply of electronic documents are increasing day by day. Examples of electronic documents include papers and electronic books published on the Internet. Patent publications such as this specification are one example. Moreover, even when the data is finally printed on paper, it is often electronic data in the elaboration stage.

一方で電子文書を高速で読むという需要も日々増大している。スマートフォン等の電子機器が普及し、いつでもどこでも電子文書を読むことが可能となっている。On the other hand, demand for reading electronic documents at high speed is increasing day by day. With the spread of electronic devices such as smartphones, electronic documents can be read anytime and anywhere.

人が文章を読む際には眼球を上下左右に動かす。文書を早く読むために、眼球運動で用いる筋肉を鍛えるという速読法が存在したりする。When people read text, they move their eyeballs up, down, left, and right. In order to read documents faster, there is a speed reading method that trains the muscles used for eye movement.

ただし、前記筋肉を鍛えるのは容易ではない。さらに電子文書の場合は表示装置が発光しており、紙の本と比べると眼球が疲れる傾向にある。眼球を鍛えたものであっても疲労が蓄積し、読書速度が損なわれるおそれがある。However, training these muscles is not easy. Furthermore, in the case of an electronic document, the display device emits light, which tends to tire the eyes as compared to a paper book. Fatigue accumulates even with trained eyeballs and can impair reading speed.

本発明は、上記を鑑みてなされたものであって、電子化された文書を読む際の眼球の疲れを軽減し、読者の読書の速度を向上させることを目的とする。SUMMARY OF THE INVENTION The present invention has been made in view of the above, and it is an object of the present invention to reduce eye strain when reading electronic documents and to improve a reader's reading speed.

本発明である電子文書の閲覧用電子機器１０は以下を有する。すなわち
第一の文字列データ１２ｘを入力する文字列データ入力装置１２と、
前記第一の文字列データ１２ｘを構文解析し、指示代名詞により置き換え可能な名詞句とそれ以外という単位で分割された第二の文字列データ１４ｘを出力する構文解析装置１４と、
前記第二の文字列データ１４ｘを前から順に逐次的に表示する表示する表示装置１６と、
である。The electronic document browsing electronic device 10 of the present invention has the following. That is, the character string data input device 12 for inputting the first character string data 12x;
a syntactic analysis device 14 for parsing the first character string data 12x and outputting second character string data 14x divided into units of noun phrases replaceable by demonstrative pronouns and others;
a display device 16 for sequentially displaying the second character string data 14x from the front;
is.

さらに、本発明である電子文書データの表示方法は以下の工程を有する。すなわち、
第一の文字列データ１２ｘを入力する工程と、
前記第一の文字列データ１２ｘを構文解析し、指示代名詞により置き換え可能な名詞句とそれ以外という単位で分割された第二の文字列データ１４ｘを出力する工程と、
前記第二の文字列データ１４ｘを前から順に逐次的に示する工程と、
である。Further, the method for displaying electronic document data according to the present invention has the following steps. i.e.
inputting first string data 12x;
parsing the first character string data 12x and outputting second character string data 14x divided into units of noun phrases replaceable by demonstrative pronouns and others;
a step of sequentially showing the second character string data 14x from the front;
is.

ここで、名詞句とは指示代名詞により置き換えることが可能な単位を指すものとする。Here, a noun phrase refers to a unit that can be replaced by a demonstrative pronoun.

文書の文書を区切って逐次表示することで眼球運動をなくし、疲労を軽減している。そのため、読書に集中することができる。By dividing the documents and displaying them sequentially, eye movement is eliminated and fatigue is reduced. Therefore, you can concentrate on reading.

さらに予め文書の文章を構文解析して名詞句を単位として文書の一部を逐次表示するため、表示が終わった部分と現在表示されている部分とを結合する手間が省ける。これにより脳の疲れが軽減されることで読書に集中することができる。Furthermore, since the text of the document is syntactically analyzed in advance and the part of the document is sequentially displayed in units of noun phrases, it is possible to save the trouble of combining the displayed part with the currently displayed part. As a result, the fatigue of the brain is reduced, and the user can concentrate on reading.

図１は、本発明の特徴を最もよく表す代表図であり、ある和文に対し本発明を適用したものである。FIG. 1 is a representative diagram that best represents the features of the present invention, in which the present invention is applied to a certain Japanese sentence. 図２は、本発明の構成要素を本発明の工程の流れとともに示した図である。FIG. 2 is a diagram showing the constituent elements of the present invention together with the process flow of the present invention. 図３は、前記和文に本発明領域の従来技術を適用したものである。FIG. 3 shows the application of the prior art in the area of the present invention to the Japanese text.

以下、本発明を実施するための形態について詳細を説明する。EMBODIMENT OF THE INVENTION Hereinafter, the form for implementing this invention is demonstrated in detail.

まず、第一の文字列データ１２ｘを構文解析装置１４に入力する文字列データ入力装置１２を用意する。第一の文字列データ１２ｘは例えばＭｉｃｒｏｓｏｆｔＷｏｒｄのｄｏｃｘ形式やｐｄｆ、ｈｔｍｌ等の形で与えられる。他にも、それらの情報が予め得られていないもの（例えば紙の文書）に対しては、光学的文字認識（ＯＣＲ）を適用することにより、前記情報を得ることも可能である。文字列データ入力装置１２は、ＨＤＤやＳＳＤなどの記憶装置や、サーバーを想定しているが、文字列データを与えるものであれば何でも良い。First, the character string data input device 12 for inputting the first character string data 12x to the parsing device 14 is prepared. The first character string data 12x is provided, for example, in Microsoft Word docx format, pdf, html, or the like. Alternatively, the information can be obtained by applying optical character recognition (OCR) to those for which such information is not previously obtained (eg, paper documents). The character string data input device 12 is assumed to be a storage device such as an HDD or SSD, or a server, but any device that provides character string data may be used.

次に前記文字列データ１２ｘを構文解析し、名詞句とそれ以外という単位で分割された第二の文字列データ１４ｘを与える構文解析装置１４を用意する。なお、名詞句とは、指示代名詞により置き換え可能な単位を指すものとする。以下、日本語と英語の適用例を挙げる。Next, a syntactic analysis device 14 is prepared for parsing the character string data 12x and providing second character string data 14x divided into units of noun phrases and others. It should be noted that a noun phrase refers to a unit that can be replaced by a demonstrative pronoun. Examples of application in Japanese and English are given below.

日本語文書の場合、構文解析装置１４として例えばＭｅｃａｂなどの形態素解析器を利用することができる。Ｍｅｃａｂの出力結果から名詞句を抽出する簡素な方法として、例えば、助詞や句読点で挟まれた部分を抽出し、そこから動詞的要素を含むものは除く、という操作が一例としてあげられる。ただし、このやり方に限られるものではない。例えば、付加的要素、特許公報であれば例えば、「第一の○○」について「第一」と「○○」の２つの名詞句があるのではなく、「第一の○○」という一つの名詞句として扱う、と例外ルールを設ける等の工夫ができる。後者の場合も、名詞句は指示代名詞で置き換えられるものというルールに合致している。Ｍｅｃａｂによる形態素解析の結果から、このように名詞句をさまざまな態様で抽出するのは、本発明の技術の分野における通常の知識を有するものならば容易である。For Japanese documents, a morphological analyzer such as Mecab can be used as the parser 14 . As a simple method for extracting noun phrases from Mecab's output results, for example, an operation of extracting parts between particles and punctuation marks and excluding those containing verb-like elements can be mentioned. However, it is not limited to this method. For example, in the case of an additional element, a patent publication, for example, instead of having two noun phrases, "first" and "○○", for "first XX", It can be devised such as setting an exception rule to treat it as a single noun phrase. The latter case also conforms to the rule that noun phrases are replaced by demonstrative pronouns. It is easy for those who have ordinary knowledge in the technical field of the present invention to extract noun phrases in various forms from the results of morphological analysis by Mecab.

英語文書であれば、構文解析器１４として例えばＳｔａｎｆｏｒｄＰａｒｓｅｒを利用して名詞句（ＮｏｕｎＰｈｒａｓｅ）を抽出することができる。なお、名詞句としての抽出方法は日本語の場合と同様、複数ありうる。例えば前置詞で分節する場合としない場合、などの違いが存在する。In the case of an English document, for example, Stanford Parser can be used as the parser 14 to extract noun phrases. As in the case of Japanese, there are multiple possible extraction methods for noun phrases. For example, there are differences such as when segmenting with a preposition and when not.

最後に前記第二の文字列データ１４ｘを表示装置１６へ入力し、分割された文字列データを前から順番に逐次的に読者に表示する。表示装置としては、例えばパソコンやタブレット、スマートフォンのグラフィックボードとディスプレーの構成が挙げられる。Finally, the second character string data 14x is input to the display device 16, and the divided character string data are sequentially displayed to the reader in order from the front. Examples of the display device include the configuration of a graphic board and a display of a personal computer, a tablet, or a smartphone.

以上が本発明の実施の形態である。次に具体的な実施例としてある和文に対して本発明を適用した結果を示す。The above is the embodiment of the present invention. Next, the result of applying the present invention to a certain Japanese sentence will be shown as a specific example.

（第１実施形態）
図１は、「この発明は新規的かつ進歩的です」という文章１２ｘに対して、本発明を適用したものの一例である。以下、これらを得るまでの手順を具体的に説明する。(First embodiment)
FIG. 1 is an example of applying the present invention to a sentence 12x saying "This invention is new and innovative." The procedure for obtaining these will be specifically described below.

前記文章１２ｘを例えばＰＣ、スマートフォンなどの入力装置１２へ入力する。The text 12x is input to the input device 12 such as a PC or smart phone.

次に前記文章１２ｘを構文解析する。構文解析の結果、前記文章１２ｘは名詞句およびそれ以外という文字列データ１４ｘに変換される。なお前記文字列データ１４ｘは元の文章１２ｘでの順番を保っているものとする。すなわち前記文字列データ１４ｘを結合すると概ね前記文章１２ｘを復元できるものとする。概ねというのは一部全角が半角になったりなど変換がなされる可能性を想定している。The sentence 12x is then parsed. As a result of syntactic analysis, the sentence 12x is converted into character string data 14x of noun phrases and others. It is assumed that the character string data 14x maintains the order in the original sentence 12x. In other words, it is assumed that the text 12x can be roughly restored by combining the character string data 14x. Generally, I assume the possibility that some full-width characters will be converted to half-width characters.

次に前記文字列データ１４ｘを読者に対して逐次表示する。表示装置１６としては、前記ＰＣやスマートフォンを用いることができる。Next, the character string data 14x are sequentially displayed to the reader. As the display device 16, the PC or smart phone can be used.

前記逐次表示の際には、表示速度を変更できるようにしておくと読者の速読能力に合わせた形で表示できて好ましい。さらに前記文字列データ１４ｘの書式を適宜変更するのもよい。例えば、名詞句のフォントサイズをそれ以外よりも大きくしたり、文字に着色を施すなどである。In the case of the sequential display, it is preferable that the display speed can be changed so that the display can be made in accordance with the reader's ability to read quickly. Furthermore, the format of the character string data 14x may be appropriately changed. For example, the font size of noun phrases may be made larger than that of others, or the characters may be colored.

図３は本発明の従来技術である。上から順にある速度で表示される。こちらは形態素解析器Ｍｅｃａｂに前記文章１２ｘを入力して分かち書きされたものを逐次表示している。こちらは不自然な単位、かつ分割されたあとの部分数が多くなり、逐次表示を読み取る際に脳が疲れたり、表示数が多くなることで読書時間が伸びてしまうという欠点がある。FIG. 3 shows the prior art of the present invention. Displayed at a certain speed from top to bottom. Here, the text 12x is input to the morphological analyzer Mecab and the words written in spaces are sequentially displayed. This is an unnatural unit, and the number of parts after division increases, so there are disadvantages that the brain gets tired when reading the sequential display, and the reading time increases as the number of displays increases.

図１が本発明の実施した例である。こちらは名詞句という単位で分割されたものを表示するため、脳が疲れない。さらに分割後の部分数も少なくなるため、読書時間が短くなる。FIG. 1 shows an embodiment of the present invention. This display is divided into units called noun phrases, so the brain doesn't get tired. Furthermore, since the number of parts after division is reduced, the reading time is shortened.

１０電子文書の閲覧用電子機器
１２文字列データ入力装置
１２ｘ第一の文字列データ
１４構文解析装置
１４ｘ第二の文字列データ
１６表示装置10 electronic device for viewing electronic documents 12 character string data input device 12x first character string data 14 parsing device 14x second character string data 16 display device

Claims

An electronic device for viewing electronic documents,
a character string data input device for inputting first character string data;
a syntactic analysis device for parsing the first character string data and outputting second character string data divided into units of noun phrases replaceable by demonstrative pronouns and others;
a display device that sequentially displays the second character string data from the front,
An electronic device for viewing electronic documents.

A method of displaying electronic document data,
inputting first string data;
syntactically parsing the first character string data and outputting second character string data divided into units of noun phrases replaceable by demonstrative pronouns and others;
and sequentially showing the second character string data from the front,
How to display electronic document data.

2. The electronic device for viewing electronic documents according to claim 1, further comprising a changing device for changing the speed of said sequential display.

3. The method of displaying electronic document data according to claim 2, further comprising the step of changing the speed of said sequential display.

2. The electronic device for viewing electronic documents according to claim 1, further comprising a second changing device for changing the format of said second character string.

3. The method of displaying electronic document data according to claim 2, further comprising the step of changing the format of said second character string.