JP2010092323A

JP2010092323A - Document display system

Info

Publication number: JP2010092323A
Application number: JP2008262507A
Authority: JP
Inventors: Susumu Yasunaga; 晋安永
Original assignee: Konica Minolta Inc
Current assignee: Konica Minolta Inc
Priority date: 2008-10-09
Filing date: 2008-10-09
Publication date: 2010-04-22

Abstract

<P>PROBLEM TO BE SOLVED: To enable a user to browse his or her desired document by displaying relationship between respective documents in a prescribed document group. <P>SOLUTION: A document display system facilitates a user to recognize differences between respective documents included in a document group (comprising a plurality of documents) designated by the user when the user downloads documents stored in a document server A to a mobile phone B to browse the documents on the mobile phone B. Specifically, feature information showing features of respective documents included in the document group designated by the user is extracted, and differences between the respective documents are displayed by using the feature information. <P>COPYRIGHT: (C)2010,JPO&INPIT

Description

本発明は文書を表示する文書表示システムに関する。 The present invention relates to a document display system for displaying a document.

近年、手軽に持ち運び、画面を通じて文書を閲覧出来る携帯端末が実用化されている。この携帯端末を通じて文書を閲覧する場合は、例えば文書が保存されているサーバにアクセスして携帯端末に文書データをダウンロードし、その上で携帯端末の画面で文書を閲覧する。 In recent years, portable terminals that can be easily carried and browsed through a screen have been put into practical use. When browsing a document through this portable terminal, for example, the server in which the document is stored is accessed, the document data is downloaded to the portable terminal, and the document is browsed on the screen of the portable terminal.

また、文書サーバには多数の文書が保存されており、この文書サーバ内をキーワード等により検索し、検索結果をもとにユーザーが指定した文書データを携帯端末にダウンロードするような場合もある。文書を検索する技術としては例えば特許文献１や特許文献２に記載された技術があり、当該技術を応用することにより、検索結果をもとにユーザーが指定した文書データを携帯端末にダウンロードすることが出来る。
特開２００２−１４９９９号公報特開２００４−６２８０６号公報 In addition, a large number of documents are stored in the document server, and the document server may be searched using keywords or the like, and document data designated by the user based on the search result may be downloaded to the mobile terminal. As a technique for searching for a document, for example, there are techniques described in Patent Document 1 and Patent Document 2, and by applying the technique, document data designated by a user based on the search result is downloaded to a mobile terminal. I can do it.
JP 2002-14999 A JP 2004-62806 A

ところで、文書サーバ内の所定の文書群に含まれる各々の文書の違いをユーザーが簡易に認識した上で、所定の文書をユーザーが指定して携帯端末で閲覧したいというニーズもある。しかし、特許文献１や特許文献２のような検索技術では、検索した各々の文書の違いを簡易な形態で示すことは出来ない。 Incidentally, there is a need for the user to specify a predetermined document and view it on a portable terminal after the user easily recognizes the difference between the documents included in the predetermined document group in the document server. However, the retrieval techniques such as Patent Document 1 and Patent Document 2 cannot show the difference between retrieved documents in a simple form.

そこで、本発明の目的は、所定の文書群における各文書の関係を表示し、所望の文書を閲覧出来る文書表示システムを提供することになる。 Accordingly, an object of the present invention is to provide a document display system that displays the relationship between documents in a predetermined document group and can browse a desired document.

上記目的を達成するため、本発明に係る文書表示システムは、
文書を保存する保存部と、
当該保存部に保存されている文書から構成された所定の文書群において、当該文書群に含まれた各文書の特徴情報を抽出する抽出部と、
当該抽出部により抽出された前記特徴情報に対し、前記文書群に含まれた全文書における共通度合を判断する判断部と、
前記文書群に含まれた各文書を前記特徴情報を用いて表示し、且つ前記共通度合により前記特徴情報の表示方法を変化させる表示部と、
を有することを特徴とするものである。 In order to achieve the above object, a document display system according to the present invention provides:
A storage unit for storing the document;
An extraction unit that extracts feature information of each document included in the document group in a predetermined document group composed of documents stored in the storage unit;
A determination unit that determines the degree of commonality among all documents included in the document group with respect to the feature information extracted by the extraction unit;
A display unit that displays each document included in the document group using the feature information and changes a display method of the feature information according to the degree of commonality;
It is characterized by having.

また、本発明に係る文書表示システムは、
文書を保存する保存部と、
当該保存部に保存されている文書から構成された所定の文書群において、当該文書群に含まれた各文書の特徴情報を抽出する抽出部と、
前記文書群に含まれた基準文書の特徴情報と、前記文書群に含まれた前記基準文書以外の文書の特徴情報と、を比較し、前記基準文書以外の文書の特徴情報に対し、前記基準文書との共通度合を判断する判断部と、
前記文書群に含まれた各文書を前記特徴情報を用いて表示し、且つ前記共通度合により前記特徴情報の表示方法を変化させる表示部と、
を有することを特徴とするものである。 Further, the document display system according to the present invention includes:
A storage unit for storing the document;
An extraction unit that extracts feature information of each document included in the document group in a predetermined document group composed of documents stored in the storage unit;
The feature information of the reference document included in the document group is compared with the feature information of the document other than the reference document included in the document group, and the feature information of the document other than the reference document is compared with the reference information. A determination unit for determining the degree of commonality with a document;
A display unit that displays each document included in the document group using the feature information and changes a display method of the feature information according to the degree of commonality;
It is characterized by having.

本発明に係る文書表示システムによれば、所定の文書群における各文書の関係を表示し、所望の文書を閲覧することが出来る。 According to the document display system of the present invention, it is possible to display the relationship between each document in a predetermined document group and browse a desired document.

［文書表示システムの概要］図１は本発明に係る文書表示システムの概略図である。 [Outline of Document Display System] FIG. 1 is a schematic view of a document display system according to the present invention.

文書表示システムは文書サーバＡと携帯端末Ｂから構成されている。図１に示す文書表示システムにおいて文書サーバＡと携帯端末Ｂは別体であるが、一体化されたものであっても良い。 The document display system includes a document server A and a portable terminal B. In the document display system shown in FIG. 1, the document server A and the portable terminal B are separate bodies, but may be integrated.

文書サーバＡは大容量のデータを保存可能であり、文書サーバＡ内に複数の文書データ（書籍データ等）が保存されている。文書サーバＡはオフィス等に設置され、必要に応じて携帯端末Ｂと通信可能である。 The document server A can store a large amount of data, and a plurality of document data (such as book data) is stored in the document server A. The document server A is installed in an office or the like, and can communicate with the portable terminal B as necessary.

携帯端末Ｂは、ユーザーが手軽に持ち運び可能な端末（電子ペーパーやモバイルパソコン等）である。携帯端末Ｂは内部に複数の文書を保存可能であり、携帯端末Ｂの表示部２０４において保存された文書が表示される。携帯端末Ｂに保存された文書のうち表示部２０４に表示させる文書は、選択部２０５を通じてユーザーが選択する。 The mobile terminal B is a terminal (such as electronic paper or a mobile personal computer) that can be easily carried by the user. The mobile terminal B can store a plurality of documents therein, and the stored document is displayed on the display unit 204 of the mobile terminal B. A document to be displayed on the display unit 204 among documents stored in the portable terminal B is selected by the user through the selection unit 205.

また、携帯端末Ｂは文書サーバＡと通信可能であり、文書サーバＡに保存されている文書を必要に応じてダウンロードする。ダウンロードした文書は携帯端末Ｂ内に保存され、必要な場合にユーザーの選択により表示される。 The mobile terminal B can communicate with the document server A, and downloads a document stored in the document server A as necessary. The downloaded document is stored in the portable terminal B, and is displayed by the user's selection when necessary.

なお、携帯端末ＢをＵＳＢやＩＥＥＥ１３９４で文書サーバＡに直に接続し、文書サーバＡに保存されている文書を文書サーバＡから携帯端末Ｂに転送させるようにしてもよい。また、無線ＬＡＮやＢｌｕｅｔｏｏｔｈを通じて文書を文書サーバＡから携帯端末Ｂに転送させても良い。 Note that the portable terminal B may be directly connected to the document server A by USB or IEEE 1394, and the document stored in the document server A may be transferred from the document server A to the portable terminal B. Further, the document may be transferred from the document server A to the portable terminal B through a wireless LAN or Bluetooth.

図２は図１に示す文書表示システムのブロック図であり、代表的な構成を示している。 FIG. 2 is a block diagram of the document display system shown in FIG. 1, and shows a typical configuration.

まず、文書サーバＡの代表的な構成について説明する。 First, a typical configuration of the document server A will be described.

制御部１０１は文書サーバＡにおける各部の動作を制御するものであり、具体的にはＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）等から構成される。 The control unit 101 controls the operation of each unit in the document server A. Specifically, the control unit 101 includes a CPU (Central Processing Unit), a ROM (Read Only Memory), a RAM (Random Access Memory), and the like.

通信部１０２は、携帯端末Ｂとの間でデータを送信又は受信する。具体的には保存部１０３に保存された文書のデータを携帯端末Ｂに送信したり、携帯端末Ｂにおいて文書サーバＡからダウンロードしたい文書の情報を受信したりする。 The communication unit 102 transmits or receives data to / from the mobile terminal B. Specifically, the document data stored in the storage unit 103 is transmitted to the mobile terminal B, and the mobile terminal B receives information on the document to be downloaded from the document server A.

保存部１０３は、複数の文書を保存しており、ハードディスク等、データを保存出来るものであれば良い。 The storage unit 103 stores a plurality of documents and may be any device that can store data, such as a hard disk.

抽出部１０４は、保存部１０３に保存されている各文書の特徴を示す特徴情報を抽出するものである。特徴情報を抽出する方法については後述する。 The extraction unit 104 extracts feature information indicating features of each document stored in the storage unit 103. A method for extracting feature information will be described later.

判断部１０５は、抽出部１０４により抽出された特徴情報が、所定の文書群に含まれた全文書において、どれだけ共通の情報となっているかを示す共通度合を判断するものである。特徴情報を抽出する方法と同様に、共通度合を判断する方法についても後述する。 The determination unit 105 determines a degree of commonness that indicates how common the feature information extracted by the extraction unit 104 is in all documents included in a predetermined document group. Similar to the method of extracting feature information, a method of determining the degree of commonality will be described later.

制御部２０１は携帯端末Ｂにおける各部の動作を制御するものであり、具体的には文書サーバＡの制御部１０１と同様にＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）等から構成される。 The control unit 201 controls the operation of each unit in the portable terminal B, and specifically, like the control unit 101 of the document server A, a CPU (Central Processing Unit), a ROM (Read Only Memory), and a RAM (Random Access). Memory) and the like.

通信部２０２は、文書サーバＡとの間でデータを送信又は受信する。 The communication unit 202 transmits or receives data to / from the document server A.

保存部２０３は、文書サーバＡからダウンロードした文書等を保存するものであり、ハードディスク等、情報を保存出来るものであれば良い。 The storage unit 203 stores a document downloaded from the document server A, and may be any device that can store information, such as a hard disk.

表示部２０４は、文書サーバＡからダウンロードした文書を表示する。表示部２０４はタッチパネル形式となっており、表示部２０４に表示された文書に対して書き込み等を行うことが出来る。表示部２０４は液晶、有機ＥＬ、プラズマ等、公知の表示手段である。 The display unit 204 displays the document downloaded from the document server A. The display unit 204 has a touch panel format, and writing or the like can be performed on a document displayed on the display unit 204. The display unit 204 is a known display unit such as liquid crystal, organic EL, or plasma.

選択部２０５は、表示部２０４に表示させる文書を選択したり、携帯端末Ｂにおいて実行させる動作を選択したりする。 The selection unit 205 selects a document to be displayed on the display unit 204 or selects an operation to be executed on the mobile terminal B.

［特徴情報・共通度合を用いた文書表示］文書サーバＡに保存されている文書を携帯端末Ｂにダウンロードし、その文書を携帯端末Ｂで閲覧する場合、ユーザーが指定した文書群（複数の文書から構成されたもの）に含まれる各々の文書の違いをユーザーが簡易に認識した上で、所定の文書をユーザーが指定して携帯端末Ｂで閲覧したいというニーズがある。そこで、本発明に係る文書表示システムでは、ユーザーが指定した文書群に含まれた各文書の特徴を示す特徴情報を抽出し、各々の文書の違いをユーザーが簡易に携帯端末上で認識出来るように各文書を表示する。以下、この点について詳しく説明する。 [Document Display Using Feature Information / Common Level] When a document stored in the document server A is downloaded to the portable terminal B and the document is viewed on the portable terminal B, a document group specified by the user (a plurality of documents There is a need for the user to easily recognize the difference between the documents included in the document) and to view the specified document on the portable terminal B by the user. Therefore, in the document display system according to the present invention, feature information indicating the characteristics of each document included in the document group designated by the user is extracted so that the user can easily recognize the difference between the documents on the portable terminal. To display each document. Hereinafter, this point will be described in detail.

図３は、文書の特徴情報と、その特徴情報の共通度合に基づき、文書群に含まれた各文書を携帯端末Ｂで表示するための文書サーバＡにおける動作を示すフローチャート図である。なお、図３における判断ステップ（ステップＳ１、Ｓ５、Ｓ７）は、文書サーバＡにおける制御部１０１により実行される。 FIG. 3 is a flowchart showing an operation in the document server A for displaying each document included in the document group on the portable terminal B based on the document feature information and the degree of commonality of the feature information. 3 is executed by the control unit 101 in the document server A.

まず、携帯端末Ｂにおいて、文書サーバＡの保存部１０３に保存されている複数の文書から所定の文書群がユーザーによって指定される。所定の文書群を指定する方法としては、例えば文書群が含まれたフォルダを指定する方法、書誌情報により文書群を絞り込んで指定する方法、キーワード検索により指定する方法等がある。 First, in the portable terminal B, a predetermined document group is designated by the user from a plurality of documents stored in the storage unit 103 of the document server A. As a method for specifying a predetermined document group, for example, there are a method for specifying a folder including the document group, a method for specifying a document group by narrowing down by bibliographic information, a method for specifying by keyword search, and the like.

携帯端末Ｂにおいて所定の文書群がユーザーによって指定されると、文書サーバＡでは携帯端末Ｂから指定された情報を受信し、所定の文書群が指定されたか否か判断する（ステップＳ１）。例えば、図４に示すように文書サーバＡの保存部１０３には複数の文書が保存されており、その中で破線Ｘに示すように所定の範囲に含まれる文書群が携帯端末Ｂのユーザーによって指定される。 When a predetermined document group is designated by the user in the portable terminal B, the document server A receives the information designated from the portable terminal B, and determines whether or not the predetermined document group is designated (step S1). For example, as shown in FIG. 4, a plurality of documents are stored in the storage unit 103 of the document server A, and among them, a document group included in a predetermined range as indicated by a broken line X is determined by the user of the mobile terminal B. It is specified.

ステップＳ１において所定の文書群が指定されたと判断した場合（ステップＳ１；Ｙｅｓ）、次に文書群に含まれた各文書の特徴情報を抽出する（ステップＳ２）。 If it is determined in step S1 that a predetermined document group has been designated (step S1; Yes), then feature information of each document included in the document group is extracted (step S2).

文書群に含まれた各文書の特徴情報は、例えば文書中に出現する特徴語である。特徴語とは、その文書の内容を示すものに成り得る語として認識されたものであり、助詞等はその文書の内容を示すものに成り得ないため、特徴語として認識されない。 The feature information of each document included in the document group is, for example, a feature word that appears in the document. A feature word is recognized as a word that can indicate the content of the document, and a particle or the like cannot be recognized as a feature word because it cannot indicate the content of the document.

文書中に出現する特徴語は、文書の内容に対して所定のプログラムにより形態素解析を実行し、また所定のプログラムにより文書に出現する語の品詞を解析することにより抽出される。文書に出現する語の品詞を解析することにより、助詞等は特徴語として認識されない。 The characteristic words appearing in the document are extracted by executing morphological analysis on the contents of the document by a predetermined program and analyzing the part of speech of the word appearing in the document by the predetermined program. By analyzing the part of speech of words appearing in a document, particles and the like are not recognized as feature words.

例えば、ユーザーによって指定された所定の文書群に含まれる文書が以下に示す文書Ａ〜文書Ｄの４つであったとする。
文書Ａ：
「文書間の類似度を算出し、これにその他の関連情報を加味して関連度を求め、結果を表形式で出力する」
文書Ｂ：
「文書間の関連度を、あらかじめ類似度やその他の関連情報を組み合わせて計算しておき、サーバに保持する」
文書Ｃ：
「文書間の関連度をもとに、多数の文書に対して自動的にクラスタリングを行い、見やすくグラフで表示する」
文書Ｄ：
「文書間の類似度を、語の出現頻度を用いて計算しておく」
このような文書について形態素解析及び品詞解析を実行し、各文書の特徴語を抽出すると、以下のようになる。
文書Ａの特徴語：
「文書間」「類似度」「関連情報」「関連度」「表形式」
文書Ｂの特徴語：
「文書間」「類似度」「関連情報」「関連度」「サーバ」
文書Ｃの特徴語：
「文書間」「関連度」「自動的」「クラスタリング」「グラフ」
文書Ｄの特徴語：
「文書間」「類似度」「出現頻度」
以上のように各文書の特徴語を抽出すると、次に各々の特徴語に対して文書群に含まれた全文書における共通度合を判断する（ステップＳ３）。共通度合とは、例えば文書群に含まれた全文書における特徴語の出現文書数に基づいた度合である。例えば、文書群に含まれる全文書が上記文書Ａ〜文書Ｄの４つである場合、共通度合は、４つの文書における特徴語の出現文書数に基づいた度合となる。 For example, it is assumed that the documents included in the predetermined document group designated by the user are the following four documents A to D.
Document A:
“Calculate the similarity between documents, add other related information to this to determine the degree of association, and output the results in tabular form”
Document B:
“Calculate the degree of association between documents in advance by combining similarity and other relevant information and store it on the server.”
Document C:
“Based on the degree of association between documents, clustering is automatically performed on a large number of documents and displayed in an easy-to-read graph”
Document D:
“Calculate the similarity between documents using the frequency of words”
When morphological analysis and part-of-speech analysis are performed on such a document and feature words of each document are extracted, the following results are obtained.
Feature words of document A:
"Between documents""Similarity""Relatedinformation""Relevance""Tableformat"
Feature words of document B:
“Between documents” “Similarity” “Related information” “Relevance” “Server”
Feature words of document C:
"Inter-document""Relevance""Automatic""Clustering""Graph"
Feature words of document D:
"Inter-document""Similarity""Appearancefrequency"
When the feature words of each document are extracted as described above, the degree of commonality among all documents included in the document group is determined for each feature word (step S3). The common degree is, for example, a degree based on the number of feature word appearance documents in all documents included in the document group. For example, when all the documents included in the document group are four documents A to D, the common degree is a degree based on the number of appearance documents of feature words in the four documents.

図５（ａ）は、文書Ａ〜文書Ｄにおける特徴語と、その特徴語の出現文書数を示す説明図である。特徴語として抽出された「文書間」という語は、文書Ａ〜文書Ｄの全てにおいて出現しているため出現文書数は「４」となり、特徴語として抽出された「類似度」いう語は、文書Ａ、文書Ｂ、文書Ｄにおいて出現しているため、出現文書数は「３」となる。「表形式」、「サーバ」の特徴語の出現文書数は各々「１」である。 FIG. 5A is an explanatory diagram showing the feature words in the documents A to D and the number of appearance documents of the feature words. Since the word “between documents” extracted as the feature word appears in all of the documents A to D, the number of appearing documents is “4”, and the word “similarity” extracted as the feature word is Since it appears in the document A, the document B, and the document D, the number of appearing documents is “3”. The number of appearance documents of feature words “table format” and “server” is “1”, respectively.

共通度合は、以下の式により算出することが出来る。
共通度合＝ｆ（ｔ）／Ｎ
ｆ（ｔ）：ｔという特徴語の出現文書数
Ｎ：全文書数
「文書間」という特徴語の共通度合は「１（＝４／４）」であり、「類似度」という特徴語の共通度合は「０．７５（＝３／４）」である。図５（ｂ）に文書Ａ〜文書Ｄにおける特徴語と、その特徴語の共通度合を示す。「表形式」、「サーバ」の特徴語の出現文書数は「０．２５（＝１／４）」である。 The degree of commonality can be calculated by the following formula.
Common degree = f (t) / N
f (t): Number of appearance documents of feature word t: Number of all documents The common degree of the feature word “between documents” is “1 (= 4/4)”, and the common common feature word “similarity” The degree is “0.75 (= 3/4)”. FIG. 5B shows the feature words in the documents A to D and the common degree of the feature words. The number of appearance documents of feature words “table format” and “server” is “0.25 (= 1/4)”.

共通度合は、各文書の履歴情報を考慮して決定されてもよい。多くの閲覧者が見ている文書はより重要な文書であると判断でき、あまり多くの閲覧者が見ていない文書は重要ではない文書であると判断できる。従って、多くの閲覧者が見ている文書に対しては全体的に共通度合を低めに決定し、他の文書と共通しない独自性のある文書であるようにする。また、あまり多くの閲覧者が見ていない文書に対しては全体的に共通度合を高めに決定し、他の文書と共通する傾向の文書であるようにする。 The degree of commonality may be determined in consideration of history information of each document. A document viewed by many viewers can be determined to be a more important document, and a document that is not viewed by many viewers can be determined to be an unimportant document. Accordingly, the document viewed by many viewers is determined to have a low degree of commonality so that the document is unique and not shared with other documents. In addition, a document that is not viewed by too many viewers is determined to have a high degree of commonality so that the document has a tendency common to other documents.

ステップＳ３において共通度合を判断すると、文書サーバＡから携帯端末Ｂへ特徴情報（特徴語）と共通度合が送信される（ステップＳ４）。そして携帯端末Ｂの表示部２０４では受信した特徴情報を用いて各文書が表示される。 When the common degree is determined in step S3, the feature information (feature word) and the common degree are transmitted from the document server A to the portable terminal B (step S4). Then, each document is displayed on the display unit 204 of the portable terminal B using the received feature information.

図６は、特徴語を用いて各文書を表示した表示部２０４の説明図である。図６では上述した文書Ａ〜文書Ｄが特徴語を用いて携帯端末Ｂの表示部２０４に表示されている。 FIG. 6 is an explanatory diagram of the display unit 204 that displays each document using feature words. In FIG. 6, the documents A to D described above are displayed on the display unit 204 of the portable terminal B using feature words.

図６で示すように携帯端末Ｂの表示部２０４では文書サーバＡから受信した共通度合により特徴語の表示方法を変化させている。即ち、共通度合の数値毎に特徴語の表示色を異ならせている。共通度合の数値が「１」のものは特徴語をＲｅｄ（赤）で表示し、共通度合の数値が「０．２５」のものは特徴語をＢｌｕｅ（青）で表示する。その他の数値のものは特徴語をＢｌａｃｋ（黒）で表示する。このように共通度合により特徴語の表示方法を異ならせることにより、各々の文書の違いをユーザーが簡易に携帯端末上で認識出来る。 As shown in FIG. 6, the display unit 204 of the portable terminal B changes the display method of the feature word according to the degree of commonness received from the document server A. That is, the display color of the feature word is made different for each numerical value of the common degree. When the commonness value is “1”, the feature word is displayed in red (red), and when the commonness value is “0.25”, the feature word is displayed in blue (blue). For other numerical values, feature words are displayed in black (black). Thus, by changing the display method of the feature word according to the degree of commonality, the user can easily recognize the difference between the documents on the portable terminal.

なお、特徴語の表示色を変化させるだけでなく、共通度合が高い特徴語の表示サイズを小さくし、共通度合が低い特徴語の表示サイズを大きくするなど、特徴情報の表示サイズを変化させても良い。 In addition to changing the display color of feature words, changing the display size of feature information, such as reducing the display size of feature words with a high degree of commonality and increasing the display size of feature words with a low degree of commonality Also good.

図６に示すような表示に基づき、ユーザーは選択部２０５を通じてダウンロードしたい文書を選択する。タッチペンにより表示部２０４に表示されている文書を選択してもよい。 Based on the display as shown in FIG. 6, the user selects a document to be downloaded through the selection unit 205. You may select the document currently displayed on the display part 204 with a touch pen.

図３におけるステップＳ４において特徴情報及び共通度合を送信した後、携帯端末Ｂでは図６に示すような形態で各文書が表示部２０４に表示され、文書サーバＡでは携帯端末Ｂより何れかの文書が選択されたという選択情報を通信部１０２が受信したか否か判断する（ステップＳ５）。ステップＳ５において文書の選択情報を受信したと判断すると、文書を携帯端末Ｂに対して送信する（ステップＳ６）。 After transmitting the feature information and the common degree in step S4 in FIG. 3, each document is displayed on the display unit 204 in the form shown in FIG. 6 on the portable terminal B, and any document from the portable terminal B on the document server A. It is determined whether or not the communication unit 102 has received selection information indicating that has been selected (step S5). If it is determined in step S5 that the document selection information has been received, the document is transmitted to the portable terminal B (step S6).

ステップＳ４において特徴情報及び共通度合を送信した後、所定時間経過しても文書が選択されたという選択情報を受信しなければ動作を終了する。 After transmitting the feature information and the degree of commonness in step S4, the operation is terminated if selection information indicating that a document has been selected is not received even after a predetermined time has elapsed.

以上説明したように、本発明によれば特徴情報と共通度合により各々の文書の違いを表示することにより、所定の文書群における各文書の関係を認識した上で、必要な場合に所望の文書を携帯端末上で閲覧することが出来る。 As described above, according to the present invention, the difference between each document is displayed according to the feature information and the degree of commonality, so that the relationship between the documents in a predetermined document group is recognized, and the desired document is obtained when necessary. Can be viewed on a mobile device.

［基準文書との関係を示した文書表示］携帯端末Ｂのユーザーが指定した基準文書に対して、所定の文書群に含まれる他の文書がどれだけ共通する特徴情報を有するかを携帯端末Ｂに表示することも出来る。以下、この点について詳しく説明する。 [Document Display Showing Relationship with Reference Document] For the reference document designated by the user of mobile terminal B, the mobile terminal B shows how much common information other documents included in the predetermined document group have. Can also be displayed. Hereinafter, this point will be described in detail.

図７は、基準文書が指定された場合に、文書群に含まれた各文書を携帯端末Ｂで表示するための文書サーバＡにおける動作を示すフローチャート図である。なお、図７における判断ステップ（ステップＳ１１、Ｓ１３、Ｓ１６、Ｓ１８）は、文書サーバＡにおける制御部１０１により実行される。 FIG. 7 is a flowchart showing an operation in the document server A for displaying each document included in the document group on the portable terminal B when the reference document is designated. Note that the determination steps (steps S11, S13, S16, and S18) in FIG. 7 are executed by the control unit 101 in the document server A.

携帯端末Ｂにおいて所定の文書群がユーザーによって指定されると、文書サーバＡでは携帯端末Ｂから指定された情報を受信し、所定の文書群が指定されたか否か判断する（ステップＳ１１）。 When a predetermined document group is designated by the user in the portable terminal B, the document server A receives information designated from the portable terminal B and determines whether or not the predetermined document group is designated (step S11).

ステップＳ１１において所定の文書群が指定されたと判断した場合（ステップＳ１１；Ｙｅｓ）、次に文書群に含まれた各文書の特徴情報を抽出する（ステップＳ２）。 If it is determined in step S11 that a predetermined document group has been designated (step S11; Yes), then feature information of each document included in the document group is extracted (step S2).

文書群に含まれた各文書の特徴情報は、前述したように、例えば文書中に出現する特徴語である。 As described above, the feature information of each document included in the document group is, for example, a feature word that appears in the document.

例えば、ユーザーによって指定された所定の文書群に含まれる文書が以下に示す文書Ａ〜文書Ｃの３つであったとする。
文書Ａ：
「文書間の類似度を算出し、これにその他の関連情報を加味して関連度を求め、結果を表形式で出力する」
文書Ｂ：
「文書間の関連度を、あらかじめ類似度やその他の関連情報を組み合わせて計算しておき、サーバに保持する」
文書Ｃ：
「文書間の関連度をもとに、多数の文書に対して自動的にクラスタリングを行い、見やすくグラフで表示する」
このような文書について形態素解析及び品詞解析を実行し、各文書の特徴語を抽出すると、以下のようになる。
文書Ａの特徴語：
「文書間」「類似度」「関連情報」「関連度」「表形式」
文書Ｂの特徴語：
「文書間」「類似度」「関連情報」「関連度」「サーバ」
文書Ｃの特徴語：
「文書間」「関連度」「自動的」「クラスタリング」「グラフ」
ステップＳ１２において所定の文書群に含まれた各文書の特徴情報（特徴語）を抽出すると、次に携帯端末Ｂから文書サーバＡに送信される情報により、基準文書が指定されたか否か判断する（ステップＳ１３）。ここでは文書Ａが基準文書として指定されたものとする。 For example, it is assumed that the documents included in a predetermined document group designated by the user are the following three documents A to C.
Document A:
“Calculate the similarity between documents, add other related information to this to determine the degree of association, and output the results in tabular form”
Document B:
“Calculate the degree of association between documents in advance by combining similarity and other relevant information and store it on the server.”
Document C:
“Based on the degree of association between documents, clustering is automatically performed on a large number of documents and displayed in an easy-to-read graph”
When morphological analysis and part-of-speech analysis are performed on such a document and feature words of each document are extracted, the following results are obtained.
Feature words of document A:
"Between documents""Similarity""Relatedinformation""Relevance""Tableformat"
Feature words of document B:
“Between documents” “Similarity” “Related information” “Relevance” “Server”
Feature words of document C:
"Inter-document""Relevance""Automatic""Clustering""Graph"
When the feature information (feature word) of each document included in the predetermined document group is extracted in step S12, it is determined whether or not the reference document is designated by the information transmitted from the portable terminal B to the document server A next. (Step S13). Here, it is assumed that document A is designated as the reference document.

次に基準文書（文書Ａ）の特徴情報と、所定の文書群に含まれた基準文書以外の文書（文書Ｂ及び文書Ｃ）の特徴情報と、を比較し、基準文書以外の文書の特徴情報に対し、基準文書との共通度合を判断する（ステップＳ１４）。ステップＳ１４における共通度合とは、基準文書以外の文書の特徴語が基準文書の特徴語となっているかどうかの結果である。 Next, the feature information of the reference document (document A) is compared with the feature information of the documents (document B and document C) other than the reference document included in the predetermined document group, and the feature information of the documents other than the reference document is compared. On the other hand, the degree of commonality with the reference document is determined (step S14). The degree of commonality in step S14 is a result of whether or not a feature word of a document other than the reference document is a feature word of the reference document.

ステップＳ１４において共通度合を判断すると、文書サーバＡから携帯端末Ｂへ特徴情報（特徴語）と共通度合が送信される（ステップＳ１５）。そして携帯端末Ｂの表示部２０４では受信した特徴情報を用いて基準文書以外の文書が表示される。 When the common degree is determined in step S14, the feature information (feature word) and the common degree are transmitted from the document server A to the portable terminal B (step S15). Then, the display unit 204 of the portable terminal B displays a document other than the reference document using the received feature information.

図８は、特徴語を用いて各文書を表示した表示部２０４の説明図である。図６では上述した基準文書（文書Ａ）以外の文書である文書Ｂ、文書Ｃが特徴語を用いて携帯端末Ｂの表示部２０４に表示されている。 FIG. 8 is an explanatory diagram of the display unit 204 that displays each document using feature words. In FIG. 6, documents B and C, which are documents other than the reference document (document A) described above, are displayed on the display unit 204 of the portable terminal B using feature words.

図８で示すように携帯端末Ｂの表示部２０４では文書サーバＡから受信した共通度合により特徴語の表示方法を変化させている。即ち、基準文書である文書Ａの特徴語にもなっている特徴語をＲｅｄ（赤）で表示し、基準文書である文書Ａの特徴語にはなっていない特徴語はＢｌｕｅ（青）で表示する。このように基準文書との共通度合により特徴語の表示方法を異ならせることにより、基準文書と共通するかどうかをユーザーが簡易に携帯端末上で認識出来る。なお、特徴語の表示色を変化させるだけでなく、特徴語の表示サイズを変化させても良い。 As shown in FIG. 8, the display unit 204 of the portable terminal B changes the display method of the feature word according to the common degree received from the document server A. That is, feature words that are also feature words of the document A that is the reference document are displayed in red (red), and feature words that are not feature words of the document A that is the reference document are displayed in blue (blue). To do. In this way, by changing the display method of the feature word depending on the degree of commonality with the reference document, the user can easily recognize on the portable terminal whether or not it is common with the reference document. In addition to changing the display color of the feature word, the display size of the feature word may be changed.

図８に示すような表示に基づき、ユーザーは選択部２０５を通じてダウンロードしたい文書を選択する。タッチペンにより表示部２０４に表示されている文書を選択してもよい。 Based on the display as shown in FIG. 8, the user selects a document to be downloaded through the selection unit 205. You may select the document currently displayed on the display part 204 with a touch pen.

図７におけるステップＳ１５において特徴情報及び共通度合を送信した後、携帯端末Ｂでは図８に示すような形態で各文書が表示部２０４に表示され、文書サーバＡでは携帯端末Ｂより何れかの文書が選択されたという選択情報を通信部１０２が受信したか否か判断する（ステップＳ１６）。ステップＳ１６において文書の選択情報を受信したと判断すると、文書を携帯端末Ｂに対して送信する（ステップＳ１７）。 After transmitting the feature information and the common degree in step S15 in FIG. 7, each document is displayed on the display unit 204 in the form shown in FIG. 8 on the mobile terminal B, and any document from the mobile terminal B on the document server A. It is determined whether or not the communication unit 102 has received selection information indicating that has been selected (step S16). If it is determined in step S16 that the document selection information has been received, the document is transmitted to the portable terminal B (step S17).

ステップＳ１６において特徴情報及び共通度合を送信した後、所定時間経過しても文書が選択されたという選択情報を受信しなければ動作を終了する。 After transmitting the feature information and the degree of commonality in step S16, the operation is terminated if selection information indicating that a document has been selected is not received even after a predetermined time has elapsed.

なお、本発明は当該実施の形態に限定されるものではなく、本発明の要旨を逸脱しない範囲における変更や追加があっても本発明に含まれる。 In addition, this invention is not limited to the said embodiment, Even if there exists a change and addition in the range which does not deviate from the summary of this invention, it is contained in this invention.

本実施の形態では、特徴情報として文書中に出現する特徴語を用いているが、必ずしもこれに限らず、例えば、文書の作成者、作成日時、作成者が定めた文書のカテゴリといった書誌的事項、文書中に使用されている図・表・グラフなどのオブジェクト、これらのオブジェクトの工数や文書のサイズに対する割合、文書のフォーマット（箇条書きなのか論文形式なのかプレゼン用使用なのか等）と特徴情報としてもよい。 In the present embodiment, feature words appearing in the document are used as the feature information. However, the present invention is not limited to this. For example, bibliographic items such as the document creator, the creation date and time, and the document category defined by the creator. , Objects such as diagrams, tables, and graphs used in the document, the man-hours of these objects and the ratio to the size of the document, the format of the document (whether it is bulleted, paper-based or used for presentations) and features It may be information.

本発明に係る文書表示システムの概略図である。1 is a schematic diagram of a document display system according to the present invention. 図１に示す文書表示システムのブロック図である。It is a block diagram of the document display system shown in FIG. 文書の特徴情報と、その特徴情報の共通度合に基づき、文書群に含まれた各文書を携帯端末で表示するための文書サーバにおける動作を示すフローチャート図である。It is a flowchart figure which shows the operation | movement in the document server for displaying each document contained in a document group with a portable terminal based on the feature information of a document, and the common degree of the feature information. 文書における特徴語と、その特徴語の出現文書数や共通度合を示す説明図である。It is explanatory drawing which shows the feature word in a document, the number of appearance documents of the feature word, and a common degree. 特徴語を用いて各文書を表示した表示部の説明図である。It is explanatory drawing of the display part which displayed each document using the feature word. 特徴語を用いて各文書を表示した表示部の説明図である。It is explanatory drawing of the display part which displayed each document using the feature word. 基準文書が指定された場合に、文書群に含まれた各文書を携帯端末で表示するための文書サーバにおける動作を示すフローチャート図である。It is a flowchart figure which shows the operation | movement in the document server for displaying each document contained in a document group with a portable terminal, when a reference | standard document is designated. 特徴語を用いて各文書を表示した表示部の説明図である。It is explanatory drawing of the display part which displayed each document using the feature word.

Explanation of symbols

Ａ文書サーバ
Ｂ携帯端末
１０１、２０１制御部
１０２、２０２通信部
１０３、２０３保存部
１０４抽出部
１０５判断部
２０４表示部
２０５選択部 A Document server B Mobile terminal 101, 201 Control unit 102, 202 Communication unit 103, 203 Storage unit 104 Extraction unit 105 Judgment unit 204 Display unit 205 Selection unit

Claims

A storage unit for storing the document;
An extraction unit that extracts feature information of each document included in the document group in a predetermined document group composed of documents stored in the storage unit;
A determination unit that determines the degree of commonality among all documents included in the document group with respect to the feature information extracted by the extraction unit;
A display unit that displays each document included in the document group using the feature information and changes a display method of the feature information according to the degree of commonality;
A document display system comprising:

The feature information is a feature word that appears in each document,
The document display system according to claim 1, wherein the common degree is a degree based on the number of appearance documents of the feature word in all documents included in the document group.

The document display system according to claim 2, wherein the extraction unit extracts the feature word by analyzing a part of speech of a word appearing in each document.

The document display system according to claim 1, wherein the determination unit determines the degree of commonness in consideration of history information of each document.

The document display system according to claim 4, wherein the display unit changes a display color of the feature information according to the degree of commonality.

The document display system according to claim 4, wherein the display unit changes a display size of the feature information depending on the degree of commonality.

A storage unit for storing the document;
An extraction unit that extracts feature information of each document included in the document group in a predetermined document group composed of documents stored in the storage unit;
The feature information of the reference document included in the document group is compared with the feature information of the document other than the reference document included in the document group, and the feature information of the document other than the reference document is compared with the reference information. A determination unit for determining the degree of commonality with a document;
A display unit that displays each document included in the document group using the feature information and changes a display method of the feature information according to the degree of commonality;
A document display system comprising: