JP3570970B2

JP3570970B2 - Simple image display system

Info

Publication number: JP3570970B2
Application number: JP2000201182A
Authority: JP
Inventors: 健次上田
Original assignee: NEC Software Kyushu Ltd
Current assignee: NEC Software Kyushu Ltd
Priority date: 2000-07-03
Filing date: 2000-07-03
Publication date: 2004-09-29
Anticipated expiration: 2020-07-03
Also published as: JP2002024253A

Description

【０００１】
【発明の属する技術分野】
本発明は、ＷｅｂクライアントにおいてＷｅｂサーバから取り出したＸＭＬやＨＴＭＬテキスト形式文書を参照するに関し、特に当該文書を解析して予め定義した危険度に関する数値から当該文書のグラフ化を行い、文書内容の把握に便宜を与え危険度の判断を容易にするインターネットにおける簡易画像表示システムに関する。
【０００２】
【従来の技術】
インターネットにおけるＷｅｂサーバの利用において、インターネットを通しての業務の遂行上、利用者の個人情報を必要とする企業等が、利用者の当該個人情報を取得した際にその情報をどの様に守秘したりどの範囲に公開するか等を規定し予め利用者の使用に先だって提示するプライバシーポリシーや、又、オンラインショッピングにおける販売業者と購入者間の契約書などは、ＸＭＬ（ｅＸｔｅｎｓｉｂｌｅＭａｒｋｕｐＬａｎｇｕａｇｅ）やＨＴＭＬ（ＨｙｐｅｒＴｅｘｔＭａｒｋｕｐＬａｎｇｕａｇｅ）などのテキスト形式で説明がされていた。
【０００３】
利用者は、パーソナルコンピュータや携帯電話などのＷｅｂクライアントから、Ｗｅｂブラウザ等のソフトウェアを使用し、Ｗｅｂサーバ上に登録されたプライバシーポリシーや契約書などを参照していた。
【０００４】
【発明が解決しようとする課題】
しかし、この従来技術には、次のような問題点があった。
【０００５】
第１の問題点は、インターネットを使ったオンラインショッピング利用時に、プライバシーポリシーや契約書などが表示されても、それをゆっくり精読する利用者は少ないか、あるいは目を通しても分かりにくい内容であることが多いため、十分な理解のないまま契約を行ってしまい、後に契約相手との間でトラブル発生の原因となるという問題があった。
【０００６】
第２の問題点は、プライバシーポリシーや契約書などは、含まれる情報が多すぎるために、利用者にとって危険となる項目を読み飛ばしたりする可能性があった。
【０００７】
第３の問題点は、情報提供者が提供し、インターネット上に多数存在するＷｅｂサーバに掲載されるプライバシーポリシーや契約書などの形式や内容がＷｅｂサーバによって異なっていたり、情報提供者毎に異なっていた。そのため、利用者はプライバシーポリシーや契約書などが表示される度に全てをくまなく読む面倒さがあった。
【０００８】
本発明は、情報提供者がＷｅｂサーバ上で公開しているプライバシーポリシーや契約内容等のテキストデータをもとに、その内容をグラフ化する等によって利用者がひと目でその記述内容を把握できるようにすることで、以上述べた問題点の解決を図ることを目的とした簡易画像表示システムを提供するものである。
【０００９】
【課題を解決するための手段】
本願の第１の発明の簡易画像表示システムは、Ｗｅｂクライアントにおいて、ＷｅｂサーバからＨＴＭＬテキストを受信すると、前記ＨＴＭＬテキストに記載されその内容を特徴付けるキーワードを定義した識別子定義部と且つ前記キーワードに関連する項目と該項目に与える数値とを対応付けて定義したデータ定義部とからなる素情報定義部と、前記ＨＴＭＬテキストの持つキーワードと前記素情報定義部に定義されたキーワードとを比較する識別子判断部と、前記識別子判断部による比較が一致すると前記ＨＴＭＬテキストに対する形態素解析等による構文解析の結果と前記素情報定義部の一致したキーワードについての前記データ定義部とを照合し前記項目と同じ項目を前記ＨＴＭＬテキスト中に検出すると前記データ定義部において該項目に与えた数値を適用し前記ＨＴＭＬテキストのグラフ化等に必要なデータを採取する構文解析部と、前記ＨＴＭＬテキストのデータ部にＨＴＭＬリンク情報を検出するとリンク先の静止画又は動画等からなるＨＴＭＬ情報を取り出しＷｅｂクライアントが保有する画像情報とを比較し一致すると該ＨＴＭＬ情報に関連する前記項目の数値の補正を行う類似画像解析部と、前記構文解析部から受け取ったグラフ化等に必要なデータを円グラフやレーダーチャートなどの簡易画像に変換する画像作成部と、を備える。
【００１１】
本願の第２の発明の簡易画像表示システムは、第１の発明において、前記素情報定義部は、前記ＨＴＭＬテキストのキーワードを定義した識別子定義部と、前記ＨＴＭＬテキスト又は前記ＸＭＬテキストの内容を表す項目の項目名と各項目に関連する内容を表す関連項目名とその関連項目に対して与えた数値とからなるデータ定義部と、から構成されることを備える。
【００１２】
本願の第３の発明の簡易画像表示システムは、第１の発明において、前記素情報定義部は、前記ＨＴＭＬテキストの１又は複数のキーワードを定義した識別子定義部と、それぞれのキーワードに関する前記ＨＴＭＬテキスト又は前記ＸＭＬテキストの内容を表す項目の項目名と各項目に関連する内容を表す関連項目名とその関連項目に対して与えた数値とからなるデータ定義部と、から構成されることを備える。
【００１４】
本願の第４の発明の簡易画像表示システムは、第１の発明において、前記識別子判断部による前記識別子部のキーワードと前記識別子定義部のキーワードとの比較の結果、一致しない場合はグラフ化などを行わずテキスト形式で表示することを備える。
【００１５】
本願の第５の発明の簡易画像表示システムは、第１の発明において、前記素情報定義部と、前記識別子判断部と、前記構文解析部と、前記画像表示部と、を前記ＨＴＭＬテキストを表示するＷｅｂブラウザが備え、キーボード等の入力装置からの指示に従いテキストによる表示と、前記グラフ等による表示との切り替えを行うことを備える。
【００１６】
【発明の実施の形態】
次に、本発明の実施の形態について図面を参照して詳細に説明する。
【００１７】
本発明の簡易画像表示システムの第１の実施例は、Ｗｅｂクライアント１０と、Ｗｅｂサーバ３０と、これを接続するネットワーク０１と、から構成されている。
【００１８】
Ｗｅｂサーバ３０は、インターネットサービスプロバイダなどによって提供されるサーバであり、Ｗｅｂクライアント１０からの参照要求によりクライアントに送信するＸＭＬテキストａ３４をサーバ上に蓄積し要求に応じてその提供を行うものである。
【００１９】
Ｗｅｂサーバ３０の保有するＸＭＬテキストａ３４は、識別子部４８と、データ部４９と、から構成されており、識別子部４８は、ＸＭＬテキストａ３４の内容を特徴づける１又は複数のキーワード文字列（以降、簡略化してキーワードと称する。）からなる。これに対して、Ｗｅｂクライアント１０の識別子判断部１３は、識別子定義部２１に定義されたキーワードによって識別を行う。データ部４９は、Ｗｅｂクライアント１０の構文解析部１４により解析が行われる。
【００２０】
ＸＭＬテキストａ３４は、パーソナルコンピュータ等の情報処理装置におけるファイル形式の構造であっても、あるいはＷｅｂサーバ３０上のプログラムの処理結果として返信されるような構造であってもよい。
【００２１】
また、ＸＭＬテキストａ３４の識別子部４８は、Ｗｅｂクライアント１０の素情報定義部２０の識別子定義部２１に定義されたキーワードを考慮したキーワードであってもよいし、あるいは識別子定義部２１とデータ定義部２２を意識することなく記述されたキーワードであってもよい。このキーワードはＸＭＬテキストａ３４の内容を特徴付けるようなもので文書のキーワードとして捉えて良い。
【００２２】
ＸＭＬテキストａ３４の識別子部４８は、一般にはＷｅｂクライアント１０の素情報定義部２０を意識することなく記述されたテキストであるので、素情報定義部２０の識別子定義部２１においては、識別子部４８に出現する可能性の高い複数のキーワードの定義を行なうことによってＸＭＬテキストａ３４に対する解析能力を高めることができる。この場合データ定義部２２においてもそれぞれのキーワードに対応する内容を持ったデータ定義が存在することになる。
【００２３】
Ｗｅｂクライアント１０は、パーソナルコンピュータや携帯電話などの情報処理装置である。
【００２４】
Ｗｅｂクライアント１０は、その利用者がアクセスするＷｅｂサーバ３０を指定するなどの情報の入力を行うマウス、キーボード等からなる入力部１１と、ネットワーク０１との通信を制御する通信制御部１２と、Ｗｅｂサーバ３０から受信したＸＭＬテキストａ３４の識別子部４８を識別するための識別子定義部２１と同じＸＭＬテキストａ３４のデータ部４９の解析に使用する当該識別子が持つ項目情報を定義したデータ定義部２２とを含む素情報定義部２０と、識別子定義部２１に定義されたキーワードにより識別子部４８を識別する識別子判断部１３と、データ定義部２２の設定情報によりＸＭＬテキストａ３４のデータ部４９を解析し項目分けし数値化する構文解析部１４と、構文解析部１４の結果により画像を作成する画像作成部１５と、作成された画像の表示の制御を行う表示制御部１６と、表示される画面である表示部１７と、から構成されている。
【００２５】
識別子定義部２１とデータ定義部２２からなる素情報定義部２０は、Ｗｅｂクライアント１０のディスク装置等の記憶媒体上に１つのテキスト形式ファイルとして記述されており、予め初期設定されたものであっても、利用者が適宜設定するようになっていてもよい。運用の経過に従い、内容を変更していくことも可能である。また、画像作成部１５が作成し、表示制御部１６が表示部１７に表示する画像は、円グラフやレーダーチャートを含み、利用者が容易にその内容を把握できれば、どのような画像形式であっても構わない。
【００２６】
ネットワーク０１は、携帯電話における無線電波やインターネットなどであり、通信する手段を問わない。
【００２７】
次に、図１から図５を参照して、本実施例の動作について詳細に説明する。
【００２８】
図１は、本発明の第１の実施の形態の全体構成図である。
【００２９】
図２は、図１における識別子部４８とデータ部４９を含むＸＭＬテキストａ３４の例を示している。
【００３０】
図３は、図１における識別子定義部２１と、データ定義部２２と、を含む素情報定義部２０の例を示している。
【００３１】
図４は、本実施例が動作した結果、ＸＭＬテキストａ３４の処理結果を表示部１７に円グラフで表示した例を示している。
【００３２】
図５は、本実施例が動作した結果、ＸＭＬテキストａ３４の処理結果を表示部１７にレーダーチャートで表示した例を示している。
【００３３】
図１の全体構成図を参照すると、利用者は、Ｗｅｂクライアント１０の入力部１１を使ってＷｅｂサーバ３０に対するＸＭＬテキストａ３４の表示要求を作成し、通信制御部１２にこれを渡す。通信制御部１２は、この要求を受け、ネットワーク０１を介してＷｅｂサーバ３０にこの表示要求を送る。
【００３４】
Ｗｅｂサーバ３０は、受け取った表示要求に該当するＸＭＬテキストａ３４をネットワーク０１を介してＷｅｂクライアント１０に送る。
【００３５】
Ｗｅｂクライアント１０の通信制御部１２は、Ｗｅｂサーバ３０から送られてきたＸＭＬテキストａ３４を受け取り、これを識別子判断部１３に渡す。
【００３６】
識別子判断部１３は、ＸＭＬテキストａ３４に含まれる識別子部４８のキーワードを元に素情報定義部２０の識別子定義部２１で定義されたキーワードと逐次比較し、一致しなければ本発明によるグラフなどを利用する簡易画像表示はせず、表示制御部１６に当該ＸＭＬテキストａ３４を渡し、表示制御部１６は表示部１７にこれを表示することで、今までと同じテキストによる表示を行う。
【００３７】
識別子判断部１３は、ＸＭＬテキストａ３４の識別子部４８のキーワードを元に識別子定義部２１のキーワードとを逐次比較した結果、一致するキーワードが識別子定義部２１に定義されていれば、簡易画像表示するべきと判断し、構文解析部１４にＸＭＬテキストａ３４を渡す。
【００３８】
構文解析部１４は、素情報定義部２０の一致したキーワードに関するデータ定義部２２の定義に従って、ＸＭＬテキストａ３４のデータ部４９を解析し、当該キーワードに関連する項目について、その項目の持つ１乃至複数の詳細項目に定義された数値から各項目について危険度の数値化を行い、その結果を画像作成部１５に渡す。画像作成部１５は、その項目と数値に基づき、円グラフやレーダーチャートなどの画像データを作成し、これを表示制御部１６に渡す。表示制御部１６は、表示部１７にグラフ表示して当該テキストの記述から危険度をわかりやすく表現する。この場合、表示制御部１６にＸＭＬテキストａ３４を渡すか否かは、規定しない。尚、円グラフで表示するかレーダーチャートで表示するかその他の形式で表示するかの表示形式については、素情報定義部２０に表示形式の定義部を設け、そこで定義を行っても良いし入力部１１のキーボードのファンクションキーの押下によって円グラフからレーダーチャート等の表示形式の切り替えを可能にするように構成しても良い。
【００３９】
ここで、識別子判断部１３における比較処理の１例をＸＭＬテキストａ３４の１例である図２と、素情報定義部２０の１例である図３と、を使用して具体的に示す。
図３における識別子定義部２１の例では、”Ｄｉｓｔｉｎｃｔｉｏｎ”行には”ＰｒｉｖａｃｙＰｏｌｉｃｙ”文字列が記述されていることを識別子判断部１３が読み込む。そして、比較処理を行うため図２におけるＸＭＬテキスト中の識別子部４８の”Ｄｉｓｔｉｎｃｔｉｏｎ”行を読み込むと、これが”ＰｒｉｖａｃｙＰｏｌｉｃｙ”文字列となっている。これらの２つの文字列を比較することにより、同一文字列であれば簡易画像表示を行うこと、逆に、同一でなければ簡易画像表示を行わないと判断する。尚、ここでは例として英大文字と英小文字で示したが、漢字シフト演算などを行ってもよい。
【００４０】
次に、構文解析部１４における解析処理の具体例をＸＭＬテキストａ３４の例である図２と素情報定義部２０の例である図３とを使って説明する。
【００４１】
まず、構文解析部１４は、図３におけるデータ定義部２２の情報を読み込み、＜Ｐｕｒｐｏｓｅ＞、＜Ｄｉｓｃｌｏｓｕｒｅ＞、＜Ｄａｔａ＞といったグラフ化する際の項目を表す文字を元に、図２におけるデータ部４９を解析する。例えば、データ部４９の１行目の”Ｐｕｒｐｏｓｅ”には”ｍａｒｋｅｔｉｎｇ”と”ｕｓｅｒｐｒｏｃｅｓｓ”とが定義されており、これを図３から項目として得た＜Ｐｕｒｐｏｓｅ＞の定義内容ブロックを見ると、”ｍａｒｋｅｔｉｎｇ”＝１、”ｕｓｅｒｐｒｏｃｅｓｓ”＝２とあることから、図２の項目”Ｐｕｒｐｏｓｅ”については、１＋２の合計３であると危険度を数値化する。つまり、図２はＸＭＬテキストａ３４の例であるから、ＸＭＬテキストａ３４の１つの項目である”Ｐｕｒｐｏｓｅ”について解析した結果は３であると数値化できる訳である。同様に＜Ｄｉｓｃｌｏｓｕｒｅ＞、＜Ｄａｔａ＞といった項目で数値化を行う。ここで得た項目と数値とを画像作成部１５に渡した後は、”目的”を意味する”Ｐｕｒｐｏｓｅ”が３であることから相当する画像の数値として図４や図５を作成するものである。
【００４２】
ＸＭＬテキストａ３４の処理結果を表示部１７に表示を行う。円グラフで表示した例は、図４に示している。ＸＭＬテキストａ３４の処理結果をレーダーチャートで表示した例は、図５に示している。尚、以上の説明では、図３のデータ定義部２２の各詳細項目が持つ数値は、大きいほど危険度が高いと想定しているが逆でも構わない。
【００４３】
図４の円グラフの表示処理について図３を使用して、さらに細かく説明を行う。
尚、以降のグラフの詳細表示処理についての説明は、実施例２や実施例３にも共通する内容のものである。
【００４４】
例えば、図３のデータ定義部２２の例では、＜Ｐｕｒｐｏｓｅ＞、＜Ｄｉｓｃｌｏｓｕｒｅ＞・・・等の項目があるが、それらの項目についてそれぞれ均等に１００点ずつ前もって与えこれを各項目についての安全度とする。尚、項目ごとに重みをつけて必ずしも全ての項目の安全度を均等にしなくても構わない。
【００４５】
すべての項目が６つあるとすると１００ｘ６で６００点となりこれが全体の安全度となる。この時前述した評価の結果、例えば＜Ｐｕｒｐｏｓｅ＞については、”ｍａｒｋｅｔｉｎｇ”＝１と”ｕｓｅｒｐｒｏｓｅｓｓ”＝２が該当したものとする。その時＜ｐｕｒｐｏｓｅ＞についての危険値は、１＋２＝３となり、安全度は、危険値を取り除いて
（１−３／６）ｘ１００＝５０の５０点が与えられ、円グラフにおける項目＜ｐｕｒｐｏｓｅ＞の占める割合は、５０／６００＝８．３％となる。
尚、上式での数値６は、図３のデータ定義部２２の＜Ｐｕｒｐｏｓｅ＞で定義されたｎｏｎｅ、ｍａｒｋｅｔｉｎｇ、ｕｓｅｒｐｒｏｓｅｓｓ、ｏｔｈｅｒに対して与えられた数値の合計値である。これを６つの項目すべてに対して合計
して、仮に４５０点となったとする。その時の危険度は、
６００−４５０＝１５０点となり、円グラフにおける危険度の占める割合は、１５０／６００＝２５％となる。図４における危険度の表示は予め規定されたシキイ値に対応して危険度の評価（例えば安全、やや危険、危険、非常に危険等）を下すようにしてある。
【００４６】
次に、本発明の第２の実施例について図面を参照して詳細に説明する。
【００４７】
本発明の第２の実施例は、図１におけるＷｅｂサーバ３０から返されるテキストがＸＭＬテキストａ３４であったのに対して、図６におけるテキストは、ＨＴＭＬ形式のＨＴＭＬテキストａ３１またはＨＴＭＬテキストｂ３２である点で第１の実施例とは異なる。図２に示す第１の実施例におけるＸＭＬテキストａ３４と比較して、ＨＴＭＬテキストａ３１とＨＴＭＬテキストｂ３２の具体例は、図７、図８に示すとおりである。また、第１の実施例おける図３に示す素情報定義部２０の例と対比して、第２の実施例である素情報定義部１８の具体例は、図９に示すとおりである。
【００４８】
本実施例における構文解析部１４は、素情報定義部１８のデータ定義部２４の定義に従い、Ｗｅｂサーバ３０から受け取ったＨＴＭＬ形式のテキストの解析を最長一致法を使用した形態素解析機能によって行うという特徴を持つ。
【００４９】
次に、本実施例の動作について図６から図１３を用いて詳細に説明する。
【００５０】
図６は、本発明の第２の実施の形態の全体構成図である。
【００５１】
図７は、図６における識別子部４１とデータ部４２を含むＨＴＭＬテキストａ３１の例を示している。この例は、図８の例と比較すると、危険度が低い例である。
【００５２】
図８は、図６における識別子部４３とデータ部４４を含むＨＴＭＬテキストｂ３２の例を示している。この例は、図７にあげた例と比較すると危険度が高い例である。
【００５３】
図９は、図６における識別子定義部２３とデータ定義部２４を含む素情報定義部１８の例を示している。
【００５４】
図１０は、本実施例が動作した結果、ＨＴＭＬテキストａ３１の処理結果を表示部１７に円グラフで表示した例を示している。このテキストの解析の結果、危険度は少ないという例である。
【００５５】
図１１は、本実施例が動作した結果、ＨＴＭＬテキストａ３１の処理結果を表示部１７にレーダーチャートで表示した例を示している。このテキストの解析の結果、図１０と同様に危険度は少ないという例である。
【００５６】
図１２は、本実施例が動作した結果、ＨＴＭＬテキストｂ３２の処理結果を表示部１７に円グラフで表示した例を示している。このテキストの解析の結果、危険度は大きいという例である。
【００５７】
図１３は、本実施例が動作した結果、ＨＴＭＬテキストｂ３２の処理結果を表示部１７にレーダーチャート円グラフで表示した例を示している。このテキストの解析の結果、図１２と同様に危険度は大きいという例である。
【００５８】
図６を参照すると、利用者は、Ｗｅｂクライアント１０の入力部１１を使ってＷｅｂサーバ３０のＨＴＭＬテキストａ３１またはＨＴＭＬテキストｂ３２の表示要求を行い、通信制御部１２がこの表示要求を受けネットワーク０１を介してＷｅｂサーバ３０にこの表示要求を送る。
【００５９】
Ｗｅｂサーバ３０は、受け取った表示要求に該当するＨＴＭＬテキストａ３１またはＨＴＭＬテキストｂ３２をネットワーク０１を介してＷｅｂクライアント１０に送る。
【００６０】
Ｗｅｂクライアント１０の通信制御部１２は、Ｗｅｂサーバ３０から送られてきたＨＴＭＬテキストａ３１またはＨＴＭＬテキストｂ３２を受け取り、これを識別子判断部１３に渡す。識別子判断部１３は、ＨＴＭＬテキストａ３１またはＨＴＭＬテキストｂ３２の識別子部４１または、識別子部４３と素情報定義部１８の識別子定義部２３のキーワードとを比較し、同一なものでなければ簡易画像表示は行わず、表示制御部１６にＨＴＭＬテキストａ３１またはＨＴＭＬテキストｂ３２を渡し、表示制御部１６は、表示部１７にこれをテキスト形式で表示する。
【００６１】
識別子判断部１３は、ＨＴＭＬテキストａ３１またはＨＴＭＬテキストｂ３２の識別子部４１または識別子部４３と、素情報定義部１８の識別子定義部２３のキーワードとを比較し、同一であれば、簡易画像表示するべきと判断する。そして、構文解析部１４にＨＴＭＬテキストａ３１またはＨＴＭＬテキストｂ３２を渡す。構文解析部１４は、素情報定義部１８のデータ定義部２４の定義に従って、ＨＴＭＬテキストａ３１またはＨＴＭＬテキストｂ３２のデータ部４２またはデータ部４４を形態素解析などの手法によって解析してグラフ表示における項目に分類し、数値化を行った結果を画像作成部１５に渡す。画像作成部１５はその項目と数値に基づき円グラフやレーダーチャートなどの画像データを作成し、これを表示制御部１６に渡し、表示制御部１６は表示部１７に表示する。
【００６２】
ここで、識別子判断部１３における比較処理の例をＨＴＭＬテキストａ３１の例である図７、および素情報定義部１８の例である図９を使って具体的に示す。図９における識別子部４３の例では、”Ｄｉｓｔｉｎｃｔｉｏｎ”行には”プライバシーポリシー”文字列が記述されていることを識別子判断部１３が読み込む。そして、比較処理を行うため図７におけるファイルを先頭から順次比較すると”プライバシーポリシー”を含む文字列がこのファイルに存在することが分かる。この文字列が存在する場合は、簡易画像表示するべきであると判断する。逆に、一致する文字列が存在しない場合は、簡易画像表示しないと判断する。又、ここでは例として示したが、大文字や小文字、漢字シフト演算などを行ってもよい。また、図９におけるデータ定義部２４における項目である”使用目的”、”開示範囲”といった文字列とのＡＮＤ検索を行っても良い。
【００６３】
次に、構文解析部１４における解析処理の例をＨＴＭＬテキストａ３１の例である図７、および素情報定義部１８の例である図９を使って具体的に示す。
【００６４】
まず、構文解析部１４は、図９におけるデータ定義部２４の情報を読み込み、”使用目的”、”開示範囲”、”収集する情報”といった項目を表す文字列、およびデータ定義部２４の”なし”、”マーケティング”、”ユーザ要求の遂行”といった文字列をキーワードとし、図７におけるデータ部４２を最長一致法による形態素解析を使用した構文解析を行い、辞書引きされた候補単語の中から表記の長さが一番長い単語によって分割する。
【００６５】
例えば、データ部４２の１行目の＜ｃｅｎｔｅｒ＞行の後ろからの解析を具体的に示すと、先頭にある”情報の使用目的は、マーケティングとユーザ要求の遂行のためです。”という１行は、構文解析により、”使用目的”、”マーケティング”、”ユーザ要求の遂行”と切り出される。ここで、図９のデータ定義部２４における＜使用目的＞には”マーケティング”＝１、”ユーザ要求の遂行”＝２とあることから、図７の”使用目的”は、１＋２の合計３であると数値化する。つまり、図７は、ＨＴＭＬテキストａ３１の例であるから、ＨＴＭＬテキストａ３１を”使用目的”の項目について解析した結果は３であると数値化できる訳である。
【００６６】
同様に”開示範囲”、”収集する情報”・・・といった項目で数値化を行う。これらを画像作成部１５に渡した後は、”使用目的”が３であることから相当する画像の数値として図１０や図１１を作成するものである。
【００６７】
ＨＴＭＬテキストａ３１の処理結果を表示部１７に円グラフで表示した例を図１０に示している。このテキストの解析結果、危険度は少ないという例である。ＨＴＭＬテキストａ３１の処理結果を表示部１７にレーダーチャートで表示した例を図１１に示している。このテキストの解析結果も図１０と同じなので危険度は少ないという例である。
【００６８】
ＨＴＭＬテキストｂ３２の処理結果を表示部１７に円グラフで表示した例は、図１２に示している。このテキストの解析結果、危険度は大きいという例である。ＨＴＭＬテキストｂ３２の処理結果を表示部１７にレーダーチャートで表示した例を図１３に示している。このテキストの解析結果も図１２と同じなので危険度は大きいという例である。
【００６９】
次に、本発明の第３の実施例について図面を参照して詳細に説明する。
【００７０】
本発明の第３の実施例は、図６におけるＷｅｂサーバ３０から返されるテキストがＨＴＭＬテキストａ３１またはＨＴＭＬテキストｂ３２であったのに対して、図１４におけるテキストは、データ部４６の中にＨＴＭＬのリンク情報を含むＨＴＭＬテキストｃ３３である点で第２の実施例とは異なる。図７または図８に示すＨＴＭＬテキストａ３１またはＨＴＭＬテキストｂ３２の具体例と比較して、ＨＴＭＬテキストｃ３３の具体例は、図１６に示すとおりである。また、図１６に示すＨＴＭＬテキストｃ３３のデータ部４６のＨＴＭＬリンク先の情報を示す画像データ部４７の具体例は、図１５に示すとおりである。
【００７１】
また、本実施例の構成をブロック図で表した図１４においては、第２の実施例の構成を示す図６においては存在しなかった類似画像解析部１９が存在する点で第２の実施例とは異なる。
【００７２】
したがって、構文解析部１４によるＨＴＭＬテキストｃ３３の解析後、ＨＴＭＬテキストｃ３３の一部からリンクする画像データ部４７を類似画像のマッチング処理をすることにより、保証機関からのマークであると認識した場合は、”保証機関”という項目の数値を下げて危険度を下げ、危険度の少ない意味付けをして、画像作成部１５が画像を作成することが可能となる。
【００７３】
次に、本実施例の動作について図１４、図１５、図１６を用いて詳細に説明する。
【００７４】
図１４は、本発明の第３の実施の形態の全体構成図である。
【００７５】
図１５は、図１４における画像データ部４７の例を示している。これは保証機関によるマークの例であり、静止画像であるか音楽データであるか動画データであるかは問わない。
【００７６】
図１６は、図１４における識別子部４５とデータ部４６を含むＨＴＭＬテキストｃ３３の例を示している。このデータ部４６の記述の一部には画像データ部４７をリンクするための情報を含んでいる。
【００７７】
図１４を参照すると、利用者は、Ｗｅｂクライアント１０の入力部１１を使ってＷｅｂサーバ３０のＨＴＭＬテキストｃ３３の表示要求を行い、通信制御部１２がこの表示要求を受けネットワーク０１を介してＷｅｂサーバ３０にこの表示要求を送る。
【００７８】
Ｗｅｂサーバ３０は、受け取った表示要求に該当するＨＴＭＬテキストｃ３３をネットワーク０１を介してＷｅｂクライアント１０に送る。
【００７９】
Ｗｅｂクライアント１０の通信制御部１２は、Ｗｅｂサーバ３０から送られてきたＨＴＭＬテキストｃ３３を受け取り、これを識別子判断部１３に渡す。識別子判断部１３はＨＴＭＬテキストｃ３３に含まれる識別子部４５と素情報定義部１８に含まれる識別子定義部２３の情報とを比較し、同一でなければ先の実施例と同様に簡易画像表示は行わず、表示制御部１６にＨＴＭＬテキストｃ３３を渡し、表示制御部１６は表示部１７にこれをテキスト形式で表示する。
【００８０】
識別子判断部１３は、ＨＴＭＬテキストｃ３３の識別子部４５と素情報定義部１８の識別子定義部２３のキーワードとを比較し、同一であれば簡易画像表示を実行し、構文解析部１４にＨＴＭＬテキストｃ３３を渡し、構文解析部１４は、素情報定義部１８に含まれるデータ定義部２４の定義に従って、ＨＴＭＬテキストｃ３３に含まれるデータ部４２またはデータ部４４を形態素解析などの手法をもって解析し、項目分けする。次に類似画像解析部１９は、ＨＴＭＬテキストｃ３３のデータ部４６の一部に記載された画像データ部４７へのリンクを参照してＷｅｂサーバ３０にある画像データ部４７をネットワーク０１を介して受け取る。受け取った画像データ部４７に対し、Ｗｅｂクライアント１０が保有する保証機関についての画像情報を元に類似する画像の検索を行い、類似度のチェックの結果、保証機関による保証マーク画像であると判定した場合は、構文解析部１４で項目分けした内容の保証機関の項目の危険度を下げる数値を設定し、その他の構文解析部１４が処理した項目と数値化を行った結果を画像作成部１５に渡し、画像作成部１５は、その項目と数値に基づき円グラフやレーダーチャートなどの画像データを作成し、これを表示制御部１６に渡し、表示制御部１６は表示部１７に表示する。
【００８１】
画像類似検索の方法は特に規定しない。また、本実施例では保証機関による保証マーク画像を類似画像検索により識別する例を示したが、暗号処理技術を使った電子署名による識別する方法であってもかまわない。
【００８２】
次に、本発明の第４の実施例について、図１７を参照して詳細に説明する。
【００８３】
本実施例では、実施例１におけるＷｅｂクライアント１０のグラフなどによるＸＭＬテキストの表示の構成をＷｅｂブラウザ２５に組み込んでいる。
【００８４】
Ｗｅｂクライアント１０の通信制御部１２は、Ｗｅｂサーバ３０から受信したＸＭＬテキストａ３４を受信すると、これをＷｅｂブラウザ２５に渡す。Ｗｅｂブラウザ２５によるテキスト表示状態において、入力部１１のキーボード上のあるファンクションキー等の押下をすると、これを認知したＷｅｂブラウザ２４は、当該ＸＭＬテキストａ３４を引数として識別子判断部１３を呼び出す。以降、先の実施例で説明したのと同様の処理によって、当該ＸＭＬテキストａ３４のグラフ表示が行われる。尚、ＸＭＬテキストａ３４の識別子部４８のキーワードが素情報定義部２０の識別子定義部２１の持つキーワードと一致しなかった場合は、Ｗｅｂブラウザ２５に戻り、再度テキストによる表示が行われる。
【００８５】
又、グラフなどによる表示中に、先のファンクションキーを再び押下するとＷｅｂブラウザ２５に復帰することになる。これによって、利用者は、テキスト形式で表示される内容から必要に応じてＷｅｂブラウザ２５の持つグラフ等による表示処理を呼び出しテキストの内容をグラフ化して簡易な表示情報を参照することができる。尚、ＸＭＬテキストについて説明をしたが、ＨＴＭＬテキストについても同様である。
【００８６】
【発明の効果】
第１の効果は、利用者は本システムにより簡易化された画像を見ることにより、サーバから送られてくるテキストをすべて読む手間を必要としないことである。
【００８７】
その理由は、テキストを構文解析し容易に把握できるような画像を作成する機能を持つようにしたためである。
【００８８】
第２の効果は、Ｗｅｂブラウザに本発明の内容を組み込むことでＷｅｂクライアントにおける既存の操作性を保証した上で、さらに簡易化された画像を見るというＷｅｂブラウザの機能性の向上を図ることが可能となる。
【図面の簡単な説明】
【図１】本発明の第１の実施例の構成を説明するブロック図である。
【図２】本発明の第１の実施例の詳細な構成を説明するブロック図である。
【図３】本発明の第１の実施例の詳細な構成を説明するブロック図である。
【図４】本発明の第１の実施例の詳細な構成を説明するブロック図である。
【図５】本発明の第１の実施例の詳細な構成を説明するブロック図である。
【図６】本発明の第２の実施例の構成を説明するブロック図である。
【図７】本発明の第２の実施例の詳細な構成を説明するブロック図である。
【図８】本発明の第２の実施例の詳細な構成を説明するブロック図である。
【図９】本発明の第２の実施例の詳細な構成を説明するブロック図である。
【図１０】本発明の第２の実施例の詳細な構成を説明するブロック図である。
【図１１】本発明の第２の実施例の詳細な構成を説明するブロック図である。
【図１２】本発明の第２の実施例の詳細な構成を説明するブロック図である。
【図１３】本発明の第２の実施例の詳細な構成を説明するブロック図である。
【図１４】本発明の第３の実施例の構成を説明するブロック図である。
【図１５】本発明の第３の実施例の詳細な構成を説明するブロック図である。
【図１６】本発明の第３の実施例の詳細な構成を説明するブロック図である。
【図１７】本発明の第４の実施例の構成を説明するブロック図である。
【符号の説明】
０１ネットワーク
１０Ｗｅｂクライアント
１１入力部
１２通信制御部
１３識別子判断部
１４構文解析部
１５画像作成部
１６表示制御部
１７表示部
２０素情報定義部
２１識別子定義部
２２データ定義部
３０Ｗｅｂサーバ
３４ＸＭＬテキストａ
４８識別子部
４９データ部
[0001]
TECHNICAL FIELD OF THE INVENTION
The present invention relates to referencing, at a Web client, an XML or HTML text-format document retrieved from a Web server. In particular, it relates to a simplified image display system for the Internet that analyzes the document, graphs it using predefined numerical values relating to risk level, and thereby makes the contents of the document easier to grasp and its risk level easier to judge.
[0002]
[Prior art]
In the use of Web servers on the Internet, a company or other party that needs a user's personal information to conduct business over the Internet presents a privacy policy before the user uses the service. The policy specifies how the acquired personal information will be kept confidential and to what extent it will be disclosed. Such privacy policies, as well as contracts between sellers and purchasers in online shopping, have been described in text formats such as XML (eXtensible Markup Language) and HTML (HyperText Markup Language).
[0003]
Users have referred to privacy policies and contracts registered on the Web server using software such as a Web browser from a Web client such as a personal computer or a mobile phone.
[0004]
[Problems to be solved by the invention]
However, this conventional technique has the following problems.
[0005]
The first problem is that, even when a privacy policy or contract is displayed during online shopping on the Internet, few users read it carefully, and those who do skim it often find the content hard to understand. Users therefore conclude contracts without sufficient understanding, which can later cause trouble with the other party to the contract.
[0006]
The second problem is that, because privacy policies and contracts contain too much information, users may skip over items that are dangerous to them.
[0007]
The third problem is that the format and content of privacy policies and contracts provided by information providers and posted on the many Web servers on the Internet differ from one Web server to another and from one information provider to another. The user therefore faces the burden of reading everything thoroughly each time a privacy policy or contract is displayed.
[0008]
The present invention provides a simplified image display system that aims to solve the problems described above. Based on text data such as privacy policies and contract terms that an information provider publishes on a Web server, it graphs or otherwise visualizes the content so that the user can grasp what is described at a glance.
[0009]
[Means for Solving the Problems]
The simplified image display system according to the first invention of the present application comprises, in a Web client that receives HTML text from a Web server: an elementary information definition unit consisting of an identifier definition unit that defines keywords appearing in the HTML text and characterizing its content, and a data definition unit that defines items related to each keyword in association with numerical values given to those items; an identifier determination unit that compares the keyword of the HTML text with the keywords defined in the elementary information definition unit; a syntax analysis unit that, when the comparison by the identifier determination unit yields a match, collates the result of syntax analysis of the HTML text (by morphological analysis or the like) against the data definition unit for the matched keyword and, on detecting in the HTML text an item identical to a defined item, applies the numerical value given to that item in the data definition unit, thereby collecting the data needed for graphing the HTML text; a similar image analysis unit that, on detecting HTML link information in the data part of the HTML text, retrieves the linked HTML information (a still image, moving image, or the like), compares it with image information held by the Web client, and, when they match, corrects the numerical value of the item related to that HTML information; and an image creation unit that converts the data needed for graphing received from the syntax analysis unit into a simple image such as a pie chart or radar chart.
[0011]
In the simplified image display system according to the second invention of the present application, in the first invention, the elementary information definition unit consists of an identifier definition unit that defines a keyword of the HTML text, and a data definition unit comprising the item names of items representing the content of the HTML text or the XML text, related item names representing content related to each item, and numerical values given to those related items.
[0012]
In the simplified image display system according to the third invention of the present application, in the first invention, the elementary information definition unit consists of an identifier definition unit that defines one or more keywords of the HTML text, and a data definition unit comprising, for each keyword, the item names of items representing the content of the HTML text or the XML text, related item names representing content related to each item, and numerical values given to those related items.
[0014]
In the simplified image display system according to the fourth invention of the present application, in the first invention, when the comparison by the identifier determination unit between the keyword of the identifier part and the keywords of the identifier definition unit finds no match, the text is displayed in text format without graphing or the like.
[0015]
In the simplified image display system according to the fifth invention of the present application, in the first invention, the Web browser that displays the HTML text includes the elementary information definition unit, the identifier determination unit, the syntax analysis unit, and the image display unit, and switches between display as text and display as a graph or the like in accordance with instructions from an input device such as a keyboard.
[0016]
BEST MODE FOR CARRYING OUT THE INVENTION
Next, embodiments of the present invention will be described in detail with reference to the drawings.
[0017]
The first embodiment of the simplified image display system according to the present invention includes a Web client 10, a Web server 30, and a network 01 connecting the Web client 10 and the Web server 30.
[0018]
The Web server 30 is a server provided by an Internet service provider or the like. It stores the XML text a34 and sends it to the Web client 10 in response to a reference request from that client.
[0019]
The XML text a34 held by the Web server 30 consists of an identifier part 48 and a data part 49. The identifier part 48 consists of one or more keyword character strings (hereinafter simply called keywords) that characterize the content of the XML text a34. The identifier determination unit 13 of the Web client 10 identifies the identifier part 48 using the keywords defined in the identifier definition unit 21. The data part 49 is analyzed by the syntax analysis unit 14 of the Web client 10.
[0020]
The XML text a34 may have a file format structure in an information processing device such as a personal computer, or may have a structure returned as a processing result of a program on the Web server 30.
[0021]
The identifier part 48 of the XML text a34 may contain a keyword written with the keywords defined in the identifier definition unit 21 of the elementary information definition unit 20 of the Web client 10 in mind, or a keyword written without any awareness of the identifier definition unit 21 and the data definition unit 22. This keyword characterizes the content of the XML text a34 and may be regarded as the keyword of the document.
[0022]
The identifier part 48 of the XML text a34 is generally written without awareness of the elementary information definition unit 20 of the Web client 10. Defining in the identifier definition unit 21 of the elementary information definition unit 20 a plurality of keywords that are likely to appear in the identifier part 48 therefore increases the ability to analyze the XML text a34. In this case, the data definition unit 22 also holds a data definition whose content corresponds to each keyword.
[0023]
The Web client 10 is an information processing device such as a personal computer or a mobile phone.
[0024]
The Web client 10 comprises: an input unit 11, consisting of a mouse, a keyboard, and the like, with which the user enters information such as the Web server 30 to be accessed; a communication control unit 12 that controls communication with the network 01; an elementary information definition unit 20 that includes an identifier definition unit 21 for identifying the identifier part 48 of the XML text a34 received from the Web server 30 and a data definition unit 22 that defines the item information belonging to that identifier, used to analyze the data part 49 of the same XML text a34; an identifier determination unit 13 that identifies the identifier part 48 using the keywords defined in the identifier definition unit 21; a syntax analysis unit 14 that analyzes the data part 49 of the XML text a34 according to the settings in the data definition unit 22, classifies it into items, and converts it to numerical values; an image creation unit 15 that creates an image from the result of the syntax analysis unit 14; a display control unit 16 that controls display of the created image; and a display unit 17, the screen on which the image is displayed.
[0025]
The elementary information definition unit 20, consisting of the identifier definition unit 21 and the data definition unit 22, is described as a single text-format file on a storage medium such as a disk device of the Web client 10. It may be preset with initial values or set by the user as appropriate, and its content may be changed as operation proceeds. The image created by the image creation unit 15 and displayed on the display unit 17 by the display control unit 16 may be a pie chart, a radar chart, or any other image format, as long as the user can easily grasp the content.
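As a rough illustration of what the elementary information definition unit 20 holds, the sketch below models it as an in-memory structure. The class name `ElementaryInfoDefinition`, its field names, and the `add_keyword` helper are hypothetical, and the text-file layout of FIG. 3 is not reproduced here. The values for `marketing` and `userprocess` come from the worked example later in this description; `none` = 0 and `other` = 3 are assumptions chosen so that the four values sum to 6, as that example states.

```python
from dataclasses import dataclass, field


@dataclass
class ElementaryInfoDefinition:
    """Hypothetical in-memory model of the elementary information definition unit 20."""

    # Identifier definition unit 21: keywords that mark a document as analyzable.
    keywords: list[str] = field(default_factory=list)
    # Data definition unit 22: item name -> {detail item name -> numerical value}.
    items: dict[str, dict[str, int]] = field(default_factory=dict)

    def add_keyword(self, keyword: str) -> None:
        """Let the user extend the definition as operation proceeds."""
        if keyword not in self.keywords:
            self.keywords.append(keyword)


definition = ElementaryInfoDefinition(
    keywords=["PrivacyPolicy"],
    items={
        # marketing=1 and userprocess=2 follow the description;
        # none=0 and other=3 are assumed values.
        "Purpose": {"none": 0, "marketing": 1, "userprocess": 2, "other": 3},
    },
)
definition.add_keyword("Contract")  # "Contract" is an illustrative keyword
```

Keeping the keywords and the per-item value tables in one structure mirrors the statement that both definition units live in a single file that the user may edit.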
[0026]
The network 01 is, for example, the radio link of a mobile phone or the Internet; the means of communication is not limited.
[0027]
Next, the operation of the present embodiment will be described in detail with reference to FIGS. 1 to 5.
[0028]
FIG. 1 is an overall configuration diagram of a first embodiment of the present invention.
[0029]
FIG. 2 shows an example of the XML text a34 including the identifier part 48 and the data part 49 in FIG. 1.
[0030]
FIG. 3 shows an example of the elementary information definition unit 20 including the identifier definition unit 21 and the data definition unit 22 in FIG. 1.
[0031]
FIG. 4 shows an example in which the processing result of the XML text a34 is displayed as a pie chart on the display unit 17 as a result of the operation of the present embodiment.
[0032]
FIG. 5 shows an example in which the processing result of the XML text a34 is displayed on the display unit 17 in a radar chart as a result of the operation of the present embodiment.
[0033]
Referring to the overall configuration diagram of FIG. 1, the user creates a display request of the XML text a34 to the Web server 30 using the input unit 11 of the Web client 10 and passes the request to the communication control unit 12. The communication control unit 12 receives the request and sends the display request to the Web server 30 via the network 01.
[0034]
The Web server 30 sends the XML text a34 corresponding to the received display request to the Web client 10 via the network 01.
[0035]
The communication control unit 12 of the Web client 10 receives the XML text a34 sent from the Web server 30, and passes it to the identifier determination unit 13.
[0036]
The identifier determination unit 13 sequentially compares the keyword of the identifier part 48 included in the XML text a34 with the keywords defined in the identifier definition unit 21 of the elementary information definition unit 20. If none matches, the simplified image display using graphs or the like according to the present invention is not performed. Instead, the XML text a34 is passed to the display control unit 16, which displays it on the display unit 17, giving the same text display as before.
[0037]
If the sequential comparison of the keyword of the identifier part 48 of the XML text a34 with the keywords of the identifier definition unit 21 finds that a matching keyword is defined in the identifier definition unit 21, the identifier determination unit 13 determines that the simplified image should be displayed and passes the XML text a34 to the syntax analysis unit 14.
[0038]
The syntax analysis unit 14 analyzes the data part 49 of the XML text a34 according to the definitions in the data definition unit 22 for the matched keyword of the elementary information definition unit 20. For each item related to the keyword, it quantifies the degree of risk from the numerical values defined for the one or more detailed items of that item, and passes the result to the image creation unit 15. The image creation unit 15 creates image data such as a pie chart or radar chart based on the items and numerical values, and passes the image data to the display control unit 16. The display control unit 16 displays the graph on the display unit 17 so that the degree of risk implied by the text is expressed in an easily understood way. Whether the XML text a34 is also passed to the display control unit 16 in this case is not specified. The display format (pie chart, radar chart, or another format) may be defined in a display format definition part provided in the elementary information definition unit 20, or the system may be configured so that pressing a function key on the keyboard of the input unit 11 switches the display format, for example from pie chart to radar chart.
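The branch between text display and simplified image display, together with the optional switching of chart formats, can be sketched as follows. This is a minimal illustration under stated assumptions: the function names `display_mode` and `next_chart_format`, the `CHART_FORMATS` list, and the keyword "Contract" are hypothetical and do not appear in the description.

```python
# Chart formats named in the description; the cycling order is an assumption.
CHART_FORMATS = ["pie", "radar"]


def display_mode(doc_keywords, defined_keywords):
    """Return "chart" if any document keyword is defined, otherwise "text"."""
    for keyword in doc_keywords:  # sequential comparison, one keyword at a time
        if keyword in defined_keywords:
            return "chart"
    return "text"


def next_chart_format(current):
    """Cycle to the next chart format, as a function-key press might do."""
    index = CHART_FORMATS.index(current)
    return CHART_FORMATS[(index + 1) % len(CHART_FORMATS)]


defined = ["PrivacyPolicy", "Contract"]  # "Contract" is illustrative only
mode_for_policy = display_mode(["PrivacyPolicy"], defined)
mode_for_other = display_mode(["News"], defined)
```

A document whose keyword is not defined falls through to ordinary text display, so the client never shows a chart for content it has no definitions for.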
[0039]
Here, an example of the comparison processing in the identifier determination unit 13 is described concretely using FIG. 2, an example of the XML text a34, and FIG. 3, an example of the elementary information definition unit 20.
In the example of the identifier definition unit 21 in FIG. 3, the identifier determination unit 13 reads that the character string “PrivacyPolicy” is described on the “Distinction” line. Then, to perform the comparison, it reads the “Distinction” line of the identifier part 48 in the XML text of FIG. 2, which also contains the character string “PrivacyPolicy”. It compares the two character strings: if they are identical, it determines that the simplified image display is to be performed, and if they are not, that it is not. Although the example here uses uppercase and lowercase Latin letters, a kanji shift operation or the like may also be performed.
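The comparison described above can be sketched in a few lines. Because FIG. 2 is not reproduced in the text, the XML layout below (a `Policy` root with `Distinction` and `Purpose` child elements) is an assumption made only for illustration, as is the function name `identifier_matches`.

```python
import xml.etree.ElementTree as ET

# Assumed shape of the XML text a34; the real layout of FIG. 2 may differ.
SAMPLE_XML = """<Policy>
  <Distinction>PrivacyPolicy</Distinction>
  <Purpose>marketing userprocess</Purpose>
</Policy>"""


def identifier_matches(xml_text, defined_keyword):
    """Compare the document's Distinction string with the defined keyword."""
    root = ET.fromstring(xml_text)
    element = root.find("Distinction")
    if element is None or element.text is None:
        return False  # no identifier part, so no simplified image display
    return element.text.strip() == defined_keyword


show_chart = identifier_matches(SAMPLE_XML, "PrivacyPolicy")
```

The comparison here is an exact string match; any normalization, such as case folding, would be an additional design choice that the description leaves open.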
[0040]
Next, a specific example of the analysis processing in the syntax analysis unit 14 will be described with reference to FIG. 2 which is an example of the XML text a34 and FIG. 3 which is an example of the elementary information definition unit 20.
[0041]
First, the syntax analysis unit 14 reads the information of the data definition unit 22 in FIG. 3 and, using the strings that represent the items to be graphed, such as <Purpose>, <Disclosure>, and <Data>, analyzes the data part 49 in FIG. 2. For example, “marketing” and “userprocess” are specified for “Purpose” on the first line of the data part 49. The <Purpose> definition block obtained from FIG. 3 gives “marketing” = 1 and “userprocess” = 2, so the risk of the item “Purpose” in FIG. 2 is quantified as 1 + 2 = 3. In other words, since FIG. 2 is an example of the XML text a34, analyzing the item “Purpose” of the XML text a34 yields the value 3. The items <Disclosure> and <Data> are quantified in the same way. Once the items and values obtained here are passed to the image creation unit 15, FIG. 4 or FIG. 5 is created with 3 as the value plotted for “Purpose”.
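The quantification step can be illustrated with a small sketch. The values `marketing` = 1 and `userprocess` = 2 are taken from the description; the dictionary layout, the function name `quantify`, and the representation of the data part 49 as a mapping from item to detail items are assumptions.

```python
# Data definition for one item. marketing=1 and userprocess=2 are taken
# from the description; the dictionary layout itself is an assumption.
DATA_DEFINITION = {
    "Purpose": {"marketing": 1, "userprocess": 2},
}


def quantify(data_part, data_definition):
    """Sum the defined values of the detail items found under each item."""
    risks = {}
    for item, details in data_part.items():
        value_table = data_definition.get(item)
        if value_table is None:
            continue  # item not defined, so it is not graphed
        risks[item] = sum(value_table.get(detail, 0) for detail in details)
    return risks


# Data part 49 reduced to item -> detail items (assumed representation).
parsed_data_part = {"Purpose": ["marketing", "userprocess"]}
risks = quantify(parsed_data_part, DATA_DEFINITION)  # {"Purpose": 3}
```

The resulting mapping of item names to risk values is what would be handed to the image creation unit 15.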
[0042]
The processing result of the XML text a34 is displayed on the display unit 17. FIG. 4 shows an example displayed as a pie chart, and FIG. 5 shows an example displayed as a radar chart. The above description assumes that a larger numerical value for a detailed item in the data definition unit 22 of FIG. 3 means a higher risk, but the reverse convention is also possible.
[0043]
The display processing of the pie chart in FIG. 4 will be described in more detail with reference to FIG. 3.
Note that the following description of the graph detailed display processing is common to the second and third embodiments.
[0044]
For example, the data definition unit 22 in FIG. 3 contains items such as <Purpose> and <Disclosure>. Each of these items is given 100 points in advance, equally, and this is taken as the safety level of that item. The items may instead be weighted, so the safety levels need not be equal for all items.
[0045]
Assuming that there are six items in all, the full score is 100 × 6 = 600 points, which is the overall safety level. Suppose that, as a result of the evaluation described above, "marketing" = 1 and "user process" = 2 were found for <Purpose>. The danger value for <Purpose> is then 1 + 2 = 3, and the safety level is calculated by removing the danger value: (1 - 3/6) × 100 = 50 points are given, so the share of the item <Purpose> in the pie chart is 50/600 = 8.3%.
The value 6 in this equation is the total of the numerical values given to none, marketing, user process, and other, as defined under <Purpose> in the data definition unit 22 in FIG. 3.
Suppose that summing the safety points over all six items yields 450 points. The danger is then 600 - 450 = 150 points, and the share of danger in the pie chart is 150/600 = 25%. In the danger display of FIG. 4, the degree of danger is rated (for example, safe, somewhat dangerous, dangerous, extremely dangerous) according to predetermined thresholds.
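The arithmetic above can be checked with a short sketch. The function name `safety_points` and the variable names are illustrative assumptions; the numbers (100 points per item, six items, a danger value of 3 out of a possible 6, and an assumed overall total of 450 points) are taken from the description.

```python
def safety_points(danger_value, danger_total, full_points=100):
    """Safety points of one item: the full score minus the danger share."""
    return (1 - danger_value / danger_total) * full_points


ITEM_COUNT = 6
FULL_SCORE = 100 * ITEM_COUNT              # 600 points overall

purpose_points = safety_points(3, 6)       # (1 - 3/6) x 100 = 50 points
purpose_share = purpose_points / FULL_SCORE

overall_safety = 450                       # total assumed in the description
overall_danger = FULL_SCORE - overall_safety
danger_share = overall_danger / FULL_SCORE

print(f"<Purpose>: {purpose_points:.0f} points, {purpose_share:.1%} of the chart")
print(f"Danger: {overall_danger} points, {danger_share:.0%} of the chart")
```

Per-item weighting, which the description allows, would amount to passing a different `full_points` value for each item.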
[0046]
Next, a second embodiment of the present invention will be described in detail with reference to the drawings.
[0047]
The second embodiment of the present invention differs from the first embodiment in that, whereas the text returned from the Web server 30 in FIG. 1 is the XML text a34, the text returned in FIG. 6 is the HTML text a31 or the HTML text b32 in HTML format. In contrast to the XML text a34 of the first embodiment shown in FIG. 2, specific examples of the HTML text a31 and the HTML text b32 are as shown in FIGS. 7 and 8. Further, in contrast to the example of the elementary information definition unit 20 of the first embodiment shown in FIG. 3, a specific example of the elementary information definition unit 18 of the second embodiment is as shown in FIG. 9.
[0048]
In the present embodiment, the syntax analysis unit 14 has a function of analyzing the HTML-format text received from the Web server 30 by morphological analysis using the longest match method, in accordance with the definition of the data definition unit 24 of the elementary information definition unit 18.
[0049]
Next, the operation of this embodiment will be described in detail with reference to FIGS.
[0050]
FIG. 6 is an overall configuration diagram of the second embodiment of the present invention.
[0051]
FIG. 7 shows an example of the HTML text a31 in FIG. 6, which includes the identifier part 41 and the data part 42. This is an example in which the degree of risk is lower than in the example of FIG. 8.
[0052]
FIG. 8 shows an example of the HTML text b32 in FIG. 6, which includes the identifier part 43 and the data part 44. This is an example in which the degree of risk is higher than in the example of FIG. 7.
[0053]
FIG. 9 shows an example of the elementary information definition unit 18 in FIG. 6, which includes the identifier definition unit 23 and the data definition unit 24.
[0054]
FIG. 10 shows an example in which the processing result of the HTML text a31 is displayed as a pie chart on the display unit 17 as a result of the operation of this embodiment. In this example, as a result of analyzing the text, the risk is low.
[0055]
FIG. 11 shows an example in which the processing result of the HTML text a31 is displayed on the display unit 17 as a radar chart as a result of the operation of this embodiment. In this example, as a result of analyzing the text, the degree of risk is low, as in FIG. 10.
[0056]
FIG. 12 shows an example in which the processing result of the HTML text b32 is displayed as a pie chart on the display unit 17 as a result of the operation of this embodiment. In this example, as a result of analyzing the text, the risk is high.
[0057]
FIG. 13 shows an example in which the processing result of the HTML text b32 is displayed on the display unit 17 as a radar chart as a result of the operation of this embodiment. In this example, as a result of analyzing the text, the degree of risk is high, as in FIG. 12.
[0058]
Referring to FIG. 6, the user issues a display request for the HTML text a31 or the HTML text b32 of the Web server 30 using the input unit 11 of the Web client 10. The communication control unit 12 receives this display request and sends it to the Web server 30 via the network 01.
[0059]
The Web server 30 sends the HTML text a31 or the HTML text b32 corresponding to the received display request to the Web client 10 via the network 01.
[0060]
The communication control unit 12 of the Web client 10 receives the HTML text a31 or the HTML text b32 sent from the Web server 30 and passes it to the identifier determination unit 13. The identifier determination unit 13 compares the identifier part 41 or the identifier part 43 of the HTML text a31 or the HTML text b32 with the keyword of the identifier definition unit 23 of the elementary information definition unit 18. If they do not match, the simple image display is not performed; instead, the HTML text a31 or the HTML text b32 is passed to the display control unit 16, which displays it in text format on the display unit 17.
[0061]
The identifier determination unit 13 compares the identifier part 41 or the identifier part 43 of the HTML text a31 or the HTML text b32 with the keyword of the identifier definition unit 23 of the elementary information definition unit 18 and, if they match, determines that a simple image is to be displayed. The HTML text a31 or the HTML text b32 is then passed to the syntax analysis unit 14. The syntax analysis unit 14 analyzes the data part 42 or the data part 44 of the HTML text a31 or the HTML text b32 by a method such as morphological analysis, in accordance with the definition of the data definition unit 24 of the elementary information definition unit 18, classifies the content into the items used for the graph display, and quantifies them. The resulting items and numerical values are passed to the image creation unit 15. The image creation unit 15 creates image data such as a pie chart or a radar chart based on the items and the numerical values and passes it to the display control unit 16, which displays it on the display unit 17.
[0062]
Here, an example of the comparison process in the identifier determination unit 13 will be specifically described with reference to FIG. 7, which is an example of the HTML text a31, and FIG. 9, which is an example of the elementary information definition unit 18. In the example of the identifier definition unit 23 shown in FIG. 9, the identifier determination unit 13 reads that the character string "Privacy Policy" is described in the "Distraction" line. It then scans the file in FIG. 7 sequentially from the beginning and finds that a character string containing "Privacy Policy" exists in this file. If this character string exists, it is determined that a simple image is to be displayed. Conversely, if no matching character string exists, it is determined that a simple image is not to be displayed. Although only one example is shown here, handling of uppercase and lowercase characters, kanji shift operations, and the like may also be performed. Further, an AND search with character strings such as "purpose of use" and "disclosure range", which are items in the data definition unit 24 in FIG. 9, may be performed.
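The comparison described above can be sketched as a simple substring test. The function name `should_display_simple_image`, its parameters, and the sample HTML are assumptions made for illustration; only the "Privacy Policy" keyword and the optional AND search with item names come from the description. Case folding stands in for the uppercase and lowercase handling mentioned there.

```python
def should_display_simple_image(html_text, keyword, required_items=()):
    """Return True when the keyword, and every optional item name, occurs in the text."""
    haystack = html_text.casefold()      # compare without regard to letter case
    if keyword.casefold() not in haystack:
        return False                     # no match: fall back to plain text display
    # Optional AND search with item names from the data definition unit.
    return all(item.casefold() in haystack for item in required_items)


# Hypothetical stand-in for the HTML text a31 of FIG. 7.
SAMPLE_HTML = """<html><head><title>Privacy Policy</title></head>
<body><center>The purpose of use of information is marketing.</center>
The disclosure range is limited to our company.</body></html>"""

print(should_display_simple_image(SAMPLE_HTML, "privacy policy"))
print(should_display_simple_image(
    SAMPLE_HTML, "Privacy Policy", ["purpose of use", "disclosure range"]))
print(should_display_simple_image(SAMPLE_HTML, "Sales Contract"))
```

The first two calls report a match and the third does not, which corresponds to the text-format fallback handled by the display control unit 16.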
[0063]
Next, an example of the analysis processing in the syntax analysis unit 14 will be specifically described using FIG. 7 which is an example of the HTML text a31 and FIG. 9 which is an example of the elementary information definition unit 18.
[0064]
First, the syntax analysis unit 14 reads the information of the data definition unit 24 in FIG. 9. Using as keywords the character strings that represent items, such as "purpose of use", "disclosure range", and "information to be collected", and the character strings defined in the data definition unit 24, such as "none", "marketing", and "performance of user request", it parses the data part 42 in FIG. 7 by morphological analysis with the longest match method, which segments the text at the longest word among the candidate words extracted from the dictionary.
[0065]
As a specific example, consider the analysis starting from the end of the <center> line, that is, from the first line of the data part 42. From this first line, "The purpose of use of information is for marketing and fulfilling user requests.", the strings "purpose of use", "marketing", and "performance of user request" are extracted by parsing. Since the definition for "purpose of use" in the data definition unit 24 in FIG. 9 gives "marketing" = 1 and "performance of user request" = 2, the risk of the item "purpose of use" in FIG. 7 is quantified as the total 1 + 2 = 3. That is, since FIG. 7 is an example of the HTML text a31, the result of analyzing the HTML text a31 for the item "purpose of use" is quantified as 3.
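A minimal sketch of longest-match segmentation followed by quantification is given below. It is not a full morphological analyzer: the dictionary is just the set of item names and value names, and the `DEFINITION` table, the sample sentence, and both function names are assumptions for illustration (only "marketing" = 1 and "performance of user request" = 2 come from the description).

```python
# Hypothetical stand-in for the data definition unit 24 (FIG. 9).
DEFINITION = {
    "purpose of use": {"none": 0, "marketing": 1, "performance of user request": 2},
}


def longest_match_tokens(text, vocabulary):
    """Scan left to right, always taking the longest dictionary word that fits."""
    words = sorted(vocabulary, key=len, reverse=True)  # longest candidates first
    lowered = text.lower()
    tokens, position = [], 0
    while position < len(lowered):
        for word in words:
            if lowered.startswith(word, position):
                tokens.append(word)
                position += len(word)
                break
        else:
            position += 1  # no dictionary word starts here; skip one character
    return tokens


def quantify(text, definition):
    """Add up the values of the keywords that follow each item name."""
    vocabulary = set(definition)
    for table in definition.values():
        vocabulary.update(table)
    risk, current_item = {}, None
    for token in longest_match_tokens(text, vocabulary):
        if token in definition:
            current_item = token
            risk.setdefault(current_item, 0)
        elif current_item is not None and token in definition[current_item]:
            risk[current_item] += definition[current_item][token]
    return risk


LINE = "The purpose of use of information is marketing and performance of user request."
print(longest_match_tokens(LINE, ["purpose of use", "marketing", "performance of user request"]))
print(quantify(LINE, DEFINITION))  # {'purpose of use': 3}
```

Sorting the candidates by length is what makes the match "longest": if the dictionary also held a shorter word such as "user request", the longer entry would still be chosen at the position where both fit.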
[0066]
Similarly, quantification is performed on items such as "disclosure range", "information to be collected", and so on. After these are passed to the image creation unit 15, since the “purpose of use” is 3, the values of FIGS. 10 and 11 are created as numerical values of the corresponding images.
[0067]
FIG. 10 shows an example in which the processing result of the HTML text a31 is displayed on the display unit 17 as a pie chart. In this example, the analysis result of the text indicates that the risk is low. FIG. 11 shows an example in which the processing result of the HTML text a31 is displayed on the display unit 17 as a radar chart. Since the analysis result of this text is the same as that in FIG. 10, the risk is low.
[0068]
FIG. 12 shows an example in which the processing result of the HTML text b32 is displayed as a pie chart on the display unit 17. In this example, as a result of analyzing the text, the risk is high. FIG. 13 shows an example in which the processing result of the HTML text b32 is displayed on the display unit 17 in a radar chart. Since the analysis result of this text is the same as that in FIG. 12, the risk is high.
[0069]
Next, a third embodiment of the present invention will be described in detail with reference to the drawings.
[0070]
The third embodiment of the present invention differs from the second embodiment in that, whereas the text returned from the Web server 30 in FIG. 6 is the HTML text a31 or the HTML text b32, the text returned in FIG. 14 is the HTML text c33, which contains link information. In contrast to the specific examples of the HTML text a31 and the HTML text b32 shown in FIG. 7 and FIG. 8, a specific example of the HTML text c33 is as shown in FIG. 16. Further, a specific example of the image data section 47, which is the information at the HTML link destination given in the data part 46 of the HTML text c33 shown in FIG. 16, is as shown in FIG. 15.
[0071]
FIG. 14 is a block diagram showing the configuration of the present embodiment. It differs from the configuration of the second embodiment shown in FIG. 6 in that it includes a similar image analysis unit 19, which is not present in FIG. 6.
[0072]
Therefore, after the syntax analysis unit 14 parses the HTML text c33, if similar image matching recognizes the image data section 47 linked from part of the HTML text c33 as a mark issued by a guarantee organization, the value of the item "guarantee organization" is adjusted so as to lower the risk, and the image creation unit 15 can create an image that conveys the lower risk.
[0073]
Next, the operation of this embodiment will be described in detail with reference to FIG. 14, FIG. 15, and FIG.
[0074]
FIG. 14 is an overall configuration diagram of the third embodiment of the present invention.
[0075]
FIG. 15 shows an example of the image data section 47 in FIG. 14. This is an example of a mark issued by a guarantee organization, and it does not matter whether it is a still image, music data, or moving image data.
[0076]
FIG. 16 shows an example of the HTML text c33 in FIG. 14, which includes the identifier part 45 and the data part 46. Part of the description in the data part 46 contains information that links to the image data section 47.
[0077]
Referring to FIG. 14, the user issues a display request for the HTML text c33 of the Web server 30 using the input unit 11 of the Web client 10. The communication control unit 12 receives this display request and sends it to the Web server 30 via the network 01.
[0078]
The Web server 30 sends the HTML text c33 corresponding to the received display request to the Web client 10 via the network 01.
[0079]
The communication control unit 12 of the Web client 10 receives the HTML text c33 sent from the Web server 30 and passes it to the identifier determination unit 13. The identifier determination unit 13 compares the identifier part 45 of the HTML text c33 with the information of the identifier definition unit 23 of the elementary information definition unit 18. If they do not match, the simple image display is not performed, as in the previous embodiment; instead, the HTML text c33 is passed to the display control unit 16, which displays it in text format on the display unit 17.
[0080]
The identifier determination unit 13 compares the identifier part 45 of the HTML text c33 with the keyword of the identifier definition unit 23 of the elementary information definition unit 18 and, if they match, proceeds with the simple image display. The syntax analysis unit 14 analyzes the data part 46 of the HTML text c33 by a method such as morphological analysis, in accordance with the definition of the data definition unit 24 of the elementary information definition unit 18, and classifies the content into items. Next, the similar image analysis unit 19 follows the link to the image data section 47 described in part of the data part 46 of the HTML text c33 and receives the image data section 47 from the Web server 30 via the network 01. The received image data section 47 is compared, by similar image search, with the image information on guarantee organizations held by the Web client 10. If the similarity check determines that the image is a guarantee mark issued by a guarantee organization, a numerical value that lowers the risk is set for the guarantee organization item among the contents classified by the syntax analysis unit 14, and this item, together with the other items processed by the syntax analysis unit 14 and their quantified results, is passed to the image creation unit 15. The image creation unit 15 creates image data such as a pie chart or a radar chart based on the items and the numerical values and passes it to the display control unit 16, which displays it on the display unit 17.
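The risk correction described above can be sketched as follows. The description leaves the similarity search method open, so the average-hash comparison used here is a swapped-in technique chosen only for illustration. Images are modeled as flat lists of grayscale pixel values, and the names `average_hash`, `similarity`, `correct_risk`, the threshold of 0.9, and the reduction of 1 are all assumptions.

```python
def average_hash(pixels):
    """One bit per pixel: True where the pixel is brighter than the image mean."""
    mean = sum(pixels) / len(pixels)
    return [value > mean for value in pixels]


def similarity(image_a, image_b):
    """Fraction of hash bits on which two equally sized images agree."""
    hash_a, hash_b = average_hash(image_a), average_hash(image_b)
    return sum(a == b for a, b in zip(hash_a, hash_b)) / len(hash_a)


def correct_risk(risk, linked_image, known_marks,
                 item="guarantee organization", threshold=0.9, reduction=1):
    """Lower the risk of the guarantee item when the linked image matches a held mark."""
    matched = any(
        len(mark) == len(linked_image) and similarity(linked_image, mark) >= threshold
        for mark in known_marks
    )
    corrected = dict(risk)  # leave the caller's dictionary untouched
    if matched:
        corrected[item] = max(0, corrected.get(item, 0) - reduction)
    return corrected


# Hypothetical 2x2 grayscale images: a mark held by the Web client and a linked image.
KNOWN_MARKS = [[0, 255, 0, 255]]
LINKED_IMAGE = [10, 240, 5, 250]
RISK = {"purpose of use": 3, "guarantee organization": 2}

print(correct_risk(RISK, LINKED_IMAGE, KNOWN_MARKS))
```

As the description notes, identification by an electronic signature could replace the similarity check; in this sketch that would only change how `matched` is computed.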
[0081]
The method of similar image search is not particularly limited. Further, although this embodiment has been described using an example in which the guarantee mark image of a guarantee organization is identified by similar image search, an identification method based on an electronic signature using encryption technology may be used instead.
[0082]
Next, a fourth embodiment of the present invention will be described in detail with reference to FIG. 17.
[0083]
In the present embodiment, the configuration by which the Web client 10 of the first embodiment displays XML text as a graph or the like is incorporated into the Web browser 25.
[0084]
On receiving the XML text a34 from the Web server 30, the communication control unit 12 of the Web client 10 passes it to the Web browser 25. When a certain function key or the like on the keyboard of the input unit 11 is pressed while the Web browser 25 is displaying the text, the Web browser 25 recognizes this and calls the identifier determination unit 13 with the XML text a34 as an argument. Thereafter, the XML text a34 is displayed as a graph by the same processing as described in the previous embodiments. If the keyword of the identifier part 48 of the XML text a34 does not match the keyword of the identifier definition unit 21 of the elementary information definition unit 20, processing returns to the Web browser 25 and the text display resumes.
[0085]
When the function key is pressed again while the graph or the like is displayed, the display returns to that of the Web browser 25. As a result, starting from content displayed in text format, the user can invoke the graph display processing of the Web browser 25 as needed, graph the content of the text, and refer to the simplified display information. Although the description here uses XML text, the same applies to HTML text.
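The switching behaviour described above amounts to a two-state toggle guarded by the identifier check. The sketch below is an assumption-laden illustration: the class name `BrowserView`, its attributes, and the sample keyword "POLICY" are hypothetical, and the substring test stands in for the identifier determination unit 13.

```python
class BrowserView:
    """Toggles between text display and graph display on a function key press."""

    def __init__(self, text, keyword):
        self.text = text
        self.keyword = keyword   # keyword from the identifier definition unit
        self.mode = "text"       # the browser starts in text display

    def press_function_key(self):
        if self.mode == "graph":
            self.mode = "text"   # a second press returns to the text display
        elif self.keyword in self.text:
            self.mode = "graph"  # identifier matched: show the simple image
        # Otherwise the keyword did not match and the text display remains.
        return self.mode


view = BrowserView("<POLICY><Purpose>marketing</Purpose></POLICY>", "POLICY")
print(view.press_function_key())   # graph
print(view.press_function_key())   # text
```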
[0086]
[Effects of the invention]
The first effect is that the user does not need to read all the text sent from the server by viewing the image simplified by the present system.
[0087]
The reason is that the system has a function of parsing the text and creating an image from which the content can be grasped easily.
[0088]
The second effect is that, by incorporating the present invention into the Web browser, the existing operability of the Web client is preserved while the Web browser gains the additional ability to display a simplified image.
[Brief description of the drawings]
FIG. 1 is a block diagram illustrating a configuration of a first exemplary embodiment of the present invention.
FIG. 2 is a block diagram illustrating a detailed configuration of the first embodiment of the present invention.
FIG. 3 is a block diagram illustrating a detailed configuration of the first exemplary embodiment of the present invention.
FIG. 4 is a block diagram illustrating a detailed configuration of the first embodiment of the present invention.
FIG. 5 is a block diagram illustrating a detailed configuration of the first embodiment of the present invention.
FIG. 6 is a block diagram illustrating a configuration of a second exemplary embodiment of the present invention.
FIG. 7 is a block diagram illustrating a detailed configuration of a second embodiment of the present invention.
FIG. 8 is a block diagram illustrating a detailed configuration of a second embodiment of the present invention.
FIG. 9 is a block diagram illustrating a detailed configuration of a second embodiment of the present invention.
FIG. 10 is a block diagram illustrating a detailed configuration of a second example of the present invention.
FIG. 11 is a block diagram illustrating a detailed configuration of a second example of the present invention.
FIG. 12 is a block diagram illustrating a detailed configuration of a second embodiment of the present invention.
FIG. 13 is a block diagram illustrating a detailed configuration of a second example of the present invention.
FIG. 14 is a block diagram illustrating a configuration of a third exemplary embodiment of the present invention.
FIG. 15 is a block diagram illustrating a detailed configuration of a third embodiment of the present invention.
FIG. 16 is a block diagram illustrating a detailed configuration of a third example of the present invention.
FIG. 17 is a block diagram illustrating a configuration of a fourth exemplary embodiment of the present invention.
[Explanation of symbols]
01 Network
10 Web client
11 Input unit
12 Communication control unit
13 Identifier determination unit
14 Syntax analysis unit
15 Image creation unit
16 Display control unit
17 Display unit
20 Elementary information definition unit
21 Identifier definition unit
22 Data definition unit
30 Web server
34 XML text a
48 Identifier part
49 Data part

Claims

A simple image display system comprising, in a Web client that receives HTML text from a Web server: an elementary information definition unit consisting of an identifier definition unit that defines keywords which are described in the HTML text and characterize its content, and a data definition unit that defines items related to each keyword in association with numerical values given to those items; an identifier determination unit that compares a keyword of the HTML text with the keywords defined in the elementary information definition unit; a syntax analysis unit that, when the comparison by the identifier determination unit yields a match, collates the result of syntax analysis of the HTML text by morphological analysis or the like with the data definition unit for the matched keyword of the elementary information definition unit and, on detecting in the HTML text an item identical to a defined item, applies the numerical value given to that item in the data definition unit and collects the data necessary for graphing or the like of the HTML text; a similar image analysis unit that, on detecting HTML link information in the data part of the HTML text, retrieves the HTML information at the link destination, consisting of a still image, a moving image, or the like, compares it with image information held by the Web client and, when they match, corrects the numerical value of the item related to that HTML information; and an image creation unit that converts the data necessary for graphing or the like received from the syntax analysis unit into a simple image such as a pie chart or a radar chart.

The simple image display system according to claim 1, wherein the elementary information definition unit is composed of an identifier definition unit that defines the keywords possessed by the HTML text, and a data definition unit comprising item names of items indicating content related to the keywords, related item names of related items representing content associated with each item, and numerical values given to the related items.

The simple image display system according to claim 1, wherein the HTML text is composed of an identifier part containing the keyword that characterizes the content of the text, and a data part comprising document body information that includes the items and the related items.

The simple image display system according to claim 1, wherein, when the comparison by the identifier determination unit between the keyword of the identifier part and the keywords of the identifier definition unit finds no match, the text is displayed in text format without graphing or the like.

The simple image display system according to claim 1, wherein the elementary information definition unit, the identifier determination unit, the syntax analysis unit, and the image display unit are provided in a Web browser that displays the HTML text, and switching between text display and display by a graph or the like is performed according to an instruction from an input device such as a keyboard.