JP5338298B2

JP5338298B2 - Page browsing device and program

Info

Publication number: JP5338298B2
Application number: JP2008324227A
Authority: JP
Inventors: 高弘冨田
Original assignee: Casio Computer Co Ltd
Current assignee: Casio Computer Co Ltd
Priority date: 2008-12-19
Filing date: 2008-12-19
Publication date: 2013-11-13
Anticipated expiration: 2028-12-19
Also published as: JP2010146381A

Abstract

<P>PROBLEM TO BE SOLVED: To provide a web page browsing apparatus to easily find a link text to be linked to a web page desired by a user without the need of jumping to the linking destination of a hyper link and opening the web page. <P>SOLUTION: When an optional link text Ltxtn in the hyper text of the web page displayed at present is selectively focused, the hyper text of the linking destination web page is obtained according to a linking destination URL corresponding to the link text Ltxtn, and tag elements having the same (or similar) character string as the link text Ltxtn are listed up. Then, on the basis of the tag elements having the same character string as the link text Ltxtn in the hyper text of the linking destination web page, the text Htxt included in the succeeding tag elements is extracted, and the read-out voice is voice-synthesized and output. <P>COPYRIGHT: (C)2010,JPO&INPIT

Description

本発明は、ページ閲覧装置およびプログラムに関する。 The present invention relates to a page browsing apparatus and a program.

従来から、ＬＡＮ(Local Area Network)やＷＡＮ(Wide Area Network)、インターネットなど、通信ネットワークにおけるサーバ・クライアント・システムでは、クライアント装置が備えるＷｅｂブラウザにより、サーバ装置が生成保存している種々のＷｅｂページを取得表示して閲覧することが行われる。 Conventionally, in a server client system in a communication network such as a LAN (Local Area Network), a WAN (Wide Area Network), and the Internet, various Web pages generated and stored by the server device by a Web browser provided in the client device. Is obtained, displayed, and browsed.

一般に、Ｗｅｂページは、ＨＴＭＬ(Hyper Text Markup Language)のソースコードにより記述されており、そのドキュメント構造をＷｅｂブラウザが解析して表示するものである。 In general, a Web page is described by HTML (Hyper Text Markup Language) source code, and a Web browser analyzes and displays the document structure.

このＷｅｂページには、ハイパーテキストとして、記述されたテキストを見出しとし、当該テキストの内容に対応する他のページへ遷移するためのハイパーリンクが設定されたテキスト部分（リンクテキスト）が複数箇所存在することが多い。 In this Web page, there are a plurality of text portions (link text) in which the described text is used as a headline and a hyperlink is set for transition to another page corresponding to the content of the text. There are many cases.

ユーザが、前記ハイパーテキストのリンクテキストにフォーカスして決定操作することにより、そのリンク先のＵＲＬへアクセスされて対応するＷｅｂページが取得され、画面展開され表示されるものである。 When the user focuses on the hypertext link text and performs a determination operation, the URL of the link destination is accessed, the corresponding Web page is acquired, and the screen is expanded and displayed.

この際、前記Ｗｅｂページ上でユーザがフォーカスするリンクテキストは、あくまで見出しであり、その内容の詳細はリンク先のＷｅｂページが取得され画面展開されないと把握できないので、ユーザが所望のリンク先のＷｅｂページを開くまでに、幾つかの必要としないリンク先のＷｅｂページを開いてしまうことが多々ある。 At this time, the link text focused on by the user on the Web page is just a headline, and the details of the content cannot be grasped unless the linked Web page is acquired and expanded on the screen. Before opening a page, a number of unnecessary linked Web pages are often opened.

この場合、リンク先のＷｅｂページが取得され画面展開される都度、その画面展開（レンダリング）の処理に時間が掛かり、しかも画面展開されたＷｅｂページの内容を見て、ユーザ所望のリンク先であったかを判断するので、元のＷｅｂページに戻ったり、別のリンクテキストにフォーカスして新たなＷｅｂページを開いたりする繰り返し操作が多くなる。このため、特に高速にレンダリング処理できない携帯端末のＷｅｂブラウザにおいては、所望のリンク先のＷｅｂページを見つけて開くまでに、無駄な待ち時間を要してしまう。 In this case, each time a linked Web page is acquired and the screen is expanded, it takes time to perform the screen expansion (rendering), and the contents of the Web page expanded on the screen are viewed to determine whether the link destination is desired by the user. Therefore, there are many repeated operations of returning to the original Web page or opening a new Web page by focusing on another link text. For this reason, a web browser of a portable terminal that cannot perform rendering processing at high speed requires a wasteful waiting time until a desired linked web page is found and opened.

一方で、Ｗｅｂページのハイパーテキストを音声合成して読み上げたり、当該ハイパーテキストに設定されたリンクテキストのみを順次音声合成して読み上げたりするのに伴い、リンク先へのジャンプを指示するユーザ操作が行われると、その読み上げ位置に応じてフォーカスされているリンクテキストのリンク先Ｗｅｂページへジャンプする機能を備えたハイパーテキスト制御装置が考えられている（例えば、特許文献１参照。）。
特開平１０−０７８９５２号公報 On the other hand, as the hypertext of a web page is synthesized by speech and read out or only the link text set in the hypertext is sequentially synthesized by speech and read out, a user operation for instructing a jump to a link destination is performed. When performed, a hypertext control device having a function of jumping to a linked Web page of a link text focused according to the reading position has been considered (for example, see Patent Document 1).
Japanese Patent Laid-Open No. 10-078952

前記従来のハイパーテキスト制御装置では、ハイパーテキストを、ユーザが読まずしてその読み上げ音声を聞くことにより知ることができ、任意のテキスト読み上げ位置でのジャンプ指示により、関連のあるリンク先Ｗｅｂページへジャンプすることが可能である。しかしながら、リンク先のＷｅｂページがユーザ所望の内容であるか否かは、当該リンク先Ｗｅｂページが画面展開されてそれを確認するか、同リンク先Ｗｅｂページのハイパーテキスト先頭からの読み上げ音声を全て聞いていないと判断できない。このため、結局は各リンクテキストにフォーカスしてリンク先Ｗｅｂページを開く処理と元のＷｅｂページへ戻る処理とを所望のリンク先Ｗｅｂページが見つかるまで繰り返すことになり、ユーザの手間が多く時間的効率も悪い問題がある。 In the conventional hypertext control device, the user can know the hypertext by listening to the read-out voice without reading the hypertext, and by jump instruction at an arbitrary text-reading position, the user can go to the related linked web page. It is possible to jump. However, whether or not the linked web page has the user-desired content is determined by expanding the screen of the linked web page and confirming it, or by reading out all of the reading voice from the hypertext head of the linked web page. I can't judge without listening. For this reason, after all, the process of opening the link destination web page by focusing on each link text and the process of returning to the original web page are repeated until a desired link destination web page is found. There is also a problem with poor efficiency.

本発明は、このような課題に鑑みなされたもので、ページを閲覧する際に、ハイパーリンクのリンク先へジャンプしてそのページを開く必要なく、ユーザ所望のページにリンクするリンクテキストを容易に見つけることが可能になるページ閲覧装置およびプログラムを提供することを目的とする。 The present invention has been made in view of such problems, when viewing a page, without having to open the page to jump to the hyperlink destination, easily link text that links to a user desired page It is an object to provide a page browsing device and a program that can be found.

請求項１は、ページ情報を受信するページ受信手段と、前記ページ受信手段により受信されたページ情報を表示画面上に画面展開して表示するページ表示手段と、前記ページ表示手段により表示されたページ情報に含まれるリンクテキストの中で、任意のリンクテキストにフォーカスが当たった段階で、そのフォーカスが当たっているリンクテキストに対応するリンク先ページ情報を取得するページ取得手段と、前記ページ取得手段により取得されたリンク先ページ情報の中から、前記リンクテキストと同一あるいは類似の文字列を含む要素を抽出する要素抽出手段と、前記要素抽出手段により抽出された要素以降に含まれる本文テキストを抽出するテキスト抽出手段と、前記テキスト抽出手段により抽出された本文テキストを出力する出力手段と、を備えたことを特徴としている。 The first aspect of the present invention provides a page receiving means for receiving page information, a page display means for displaying the page information received by the page receiving means on a display screen, and a page displayed by the page display means. A page acquisition unit that acquires link destination page information corresponding to a link text that is in focus at a stage where an arbitrary link text is focused in the link text included in the information, and the page acquisition unit From the acquired linked page information, an element extraction unit that extracts an element including a character string that is the same as or similar to the link text, and a body text that is included after the element extracted by the element extraction unit is extracted. Text extraction means and output for outputting the body text extracted by the text extraction means Is characterized by comprising: a stage, a.

本発明によれば、ページを閲覧する際に、ハイパーリンクのリンク先へジャンプしてそのページを開く必要なく、ユーザ所望のページにリンクするリンクテキストを容易に見つけることが可能になる。 According to the present invention, when viewing the page, without the need to open the page to jump to hyper-link to point, to be able to easily find the link text that you want to link to the user's desired page ing.

以下図面により本発明の実施の形態について説明する。 Embodiments of the present invention will be described below with reference to the drawings.

図１は、本発明のＷｅｂページ閲覧装置の実施形態に係るサーバ・クライアント・システムの構成を示すブロック図である。 FIG. 1 is a block diagram showing a configuration of a server / client system according to an embodiment of a Web page browsing apparatus of the present invention.

このサーバ・クライアント・システムは、インターネット、ＷＡＮ、ＬＡＮなどからなるネットワークＮ上に接続された複数のサーバ装置１０，…および複数のクライアント装置２０，…を備える。 This server / client system includes a plurality of server devices 10,... And a plurality of client devices 20,... Connected on a network N composed of the Internet, WAN, LAN, and the like.

サーバ装置１０は、Ｗｅｂコンテンツ生成処理プログラム，登録ユーザ管理処理プログラム，Ｗｅｂページ配信処理プログラムなど、当該サーバ装置１０の本体操作により機能する複数のアプリケーションプログラムを有し、例えば本サーバ装置１０にユーザ登録されたクライアント装置２０，…からの指定のＷｅｂサイトへのアクセス要求に応じて当該要求されたＷｅｂサイトにおけるＷｅｂコンテンツ１５ａのページを各クライアント装置２０，…へ配信する。 The server device 10 has a plurality of application programs that function by operating the main body of the server device 10 such as a Web content generation processing program, a registered user management processing program, and a Web page distribution processing program. In response to an access request to the designated website from the requested client apparatus 20,..., The page of the web content 15a in the requested website is distributed to each client apparatus 20,.

クライアント装置２０，…は、携帯電話，ＰＤＡ(Personal Digital Assistant)，ＰＣなどからなり、インターネット接続処理プログラム、Ｗｅｂブラウザプログラム２３ａ，Ｗｅｂ印刷プログラムなど、当該端末装置２０の本体操作により機能する複数のアプリケーションプログラムを有する。そして、例えば所望のＷｅｂサイト[http://www.sight_a.co.jp/]のサーバ装置（Ａ）１０にアクセスしてそのＷｅｂコンテンツＡ１５ａを取得し、当該ＷｅｂコンテンツＡ１５ａのＨＴＭＬ(Hyper Text Markup Language)のタグ要素から成るツリー構造を解析してＷｅｂページとして画面展開し表示したり印刷したりする。 The client device 20,... Includes a mobile phone, a PDA (Personal Digital Assistant), a PC, and the like. Have a program. Then, for example, the server device (A) 10 of a desired website [http://www.sight_a.co.jp/] is accessed to acquire the web content A15a, and HTML (Hyper Text Markup of the web content A15a) is acquired. Analyzes the tree structure consisting of the (Language) tag elements, expands the screen as a Web page, and displays or prints it.

図２は、前記サーバ・クライアント・システムにおけるサーバ装置１０の回路構成を示すブロック図である。 FIG. 2 is a block diagram showing a circuit configuration of the server apparatus 10 in the server / client system.

サーバ装置１０は、コンピュータとしてのＣＰＵ１１を備え、このＣＰＵ１１には、バス１２を介してＲＯＭ１３、ＲＡＭ１４、外付けハードディスクなどの外部記憶装置１５が接続される。 The server device 10 includes a CPU 11 as a computer, and an external storage device 15 such as a ROM 13, a RAM 14, and an external hard disk is connected to the CPU 11 via a bus 12.

また、ＣＰＵ１１には、バス１２を介してキーボード，マウスなどの入力装置１６、ＬＣＤ(Liquid Crystal Display)などの表示装置１７、クライアント装置２０，…との通信Ｉ／Ｆ（インターフェイス）１８が接続される。 Further, an input device 16 such as a keyboard and a mouse, a display device 17 such as an LCD (Liquid Crystal Display), and a communication I / F (interface) 18 with the client devices 20 are connected to the CPU 11 via the bus 12. The

ＣＰＵ１１は、ＲＯＭ１３に予め記憶されているシステムプログラムや種々のアプリケーションプログラムに従ってＲＡＭ１４を作業用メモリとし回路各部の動作を制御するもので、入力装置１６からのキー入力信号や通信Ｉ／Ｆ１８を介して受信されるクライアント装置２０からのユーザ操作に応じたＷｅｂコンテンツ取得要求信号などに応じて前記種々のプログラムが起動・実行される。 The CPU 11 controls the operation of each part of the circuit by using the RAM 14 as a working memory in accordance with a system program or various application programs stored in advance in the ROM 13, and via a key input signal from the input device 16 or a communication I / F 18. The various programs are activated and executed in response to a Web content acquisition request signal or the like corresponding to a user operation from the client device 20 received.

前記Ｗｅｂコンテンツ１５ａは、例えば外部記憶装置１５に適宜更新生成されて記憶されている。 The web content 15a is updated and generated as appropriate and stored in the external storage device 15, for example.

図３は、前記サーバ・クライアント・システムにおけるクライアント装置２０の回路構成を示すブロック図である。 FIG. 3 is a block diagram showing a circuit configuration of the client device 20 in the server client system.

クライアント装置２０は、コンピュータとしてのＣＰＵ２１を備え、このＣＰＵ２１には、バス２２を介してＲＯＭ２３、ＲＡＭ２４、メモリカードや光ディスク読み書き部などの外部記憶装置２５が接続される。 The client device 20 includes a CPU 21 as a computer, and a ROM 23, a RAM 24, and an external storage device 25 such as a memory card and an optical disk read / write unit are connected to the CPU 21 via a bus 22.

また、ＣＰＵ２１には、バス２２を介してキーボード，マウスなどの入力装置２６、ＬＣＤからなる表示装置２７、前記サーバ装置１０との通信Ｉ／Ｆ（インターフェイス）２８、Ｗｅｂページなどに記述されているテキストデータを音声データに変換する音声合成処理部２９ａ、音声データを出力する音声出力部２９ｂおよびスピーカＳＰが接続される。 Further, the CPU 21 is described via the bus 22 in an input device 26 such as a keyboard and a mouse, a display device 27 including an LCD, a communication I / F (interface) 28 with the server device 10, a Web page, and the like. A speech synthesis processing unit 29a that converts text data into speech data, a speech output unit 29b that outputs speech data, and a speaker SP are connected.

ＣＰＵ２１は、ＲＯＭ２３に予め記憶されているシステムプログラムおよび種々のアプリケーションプログラムに従ってＲＡＭ２４を作業用メモリとし回路各部の動作を制御するもので、入力装置２７からの入力信号に応じて前記種々のアプリケーションプログラムが起動され実行される。 The CPU 21 controls the operation of each part of the circuit by using the RAM 24 as a working memory in accordance with a system program and various application programs stored in advance in the ROM 23. The various application programs are executed according to input signals from the input device 27. It is activated and executed.

前記サーバ装置１０…をインターネット（Ｎ）上のＷｅｂサイト、前記クライアント装置２０，…を前記Ｗｅｂサイトにアクセス可能なユーザ端末とした場合、ユーザ端末（２０）からＷｅｂサイト（１０）へのアクセス要求に応じて、当該Ｗｅｂサイト（１０）においてＨＴＭＬにより記述生成されたＷｅｂコンテンツ１５ａがアクセス要求元のユーザ端末（２０）へ配信され、そのＷｅｂブラウザプログラム２３ａによりＷｅｂページに展開されて表示装置２７に表示される。 When the server apparatus 10 is a website on the Internet (N) and the client apparatus 20 is a user terminal that can access the website, an access request to the website (10) from the user terminal (20). Accordingly, the Web content 15a described and generated in HTML on the Web site (10) is distributed to the user terminal (20) that is the access request source, developed into a Web page by the Web browser program 23a, and displayed on the display device 27. Is displayed.

このクライアント装置（ユーザ端末）２０のＷｅｂブラウザプログラム２３ａは、ユーザ指定のＷｅｂサイトのサーバ装置１０へのアクセスに伴い、そのＷｅｂコンテンツ１５ａを取得し、当該Ｗｅｂコンテンツ１５ｂのＨＴＭＬのタグから成るツリー構造を解析して、ＷｅｂページとしてＲＡＭ２４内のフレームバッファＦＢに画面展開し表示する機能を有する。そして、表示中のＷｅｂページのハイパーテキストにおけるリンクテキストがフォーカスにより指示されると、リンク先のＷｅｂページのハイパーテキストから前記リンクテキストに対応する内容の本文テキストを抽出し、その読み上げ音声を音声合成して出力する機能を有する。 The Web browser program 23a of the client device (user terminal) 20 acquires the Web content 15a in accordance with access to the server device 10 of the Web site specified by the user, and has a tree structure composed of HTML tags of the Web content 15b. And has a function of developing and displaying a screen as a Web page in the frame buffer FB in the RAM 24. Then, when the link text in the hypertext of the Web page being displayed is instructed by the focus, the body text of the content corresponding to the link text is extracted from the hypertext of the Web page of the link destination, and the speech read out is synthesized with speech And has a function of outputting.

そして、前記ＲＡＭ２４には、リンクテキスト類似要素メモリ２４ａおよび読み上げ対象テキストメモリ２４ｂが備えられる。 The RAM 24 includes a link text similar element memory 24a and a reading target text memory 24b.

前記リンクテキスト類似要素メモリ２４ａには、前記リンク先Ｗｅｂページのハイパーテキストから前記リンクテキストと同一あるいは類似の文字列を含むタグ要素がリストアップされて記憶される。 In the link text similar element memory 24a, tag elements including character strings identical or similar to the link text are listed and stored from the hypertext of the link destination Web page.

前記読み上げ対象テキストメモリ２４ｂには、前記リンクテキストの類似文字列を含むタグ要素の中で最も強いスタイルの文字列を含むタグ要素が判断され、それ以降のタグ要素から、設定数以上の読点を有し且つ設定数以上の文字からなるテキストが、前記リンクテキストに対応する本文テキストとして抽出されて記憶される。 In the reading target text memory 24b, a tag element including a character string of the strongest style among tag elements including similar character strings of the link text is determined, and more than a set number of reading marks are determined from the tag elements thereafter. The text having more than the set number of characters is extracted and stored as the body text corresponding to the link text.

このようなリンク先本文テキストの抽出・音声出力機能は、例えば前記Ｗｅｂブラウザプログラム２３ａにプラグインあるいはアドオンするプログラムにより実現される。 Such link destination body text extraction / speech output function is realized, for example, by a program plug-in or add-on to the Web browser program 23a.

図４は、前記クライアント装置２０においてサーバ装置（Ａ）１０[http://www.sight.a.co.jp]から取得されたＷｅｂページＰの画面表示例を示す図である。 FIG. 4 is a diagram showing a screen display example of the Web page P acquired from the server device (A) 10 [http://www.sight.a.co.jp] in the client device 20.

図５は、前記図４におけるＷｅｂページＰを記述したＨＴＭＬソースＰhtmを示す図である。 FIG. 5 is a diagram showing an HTML source Phtm describing the Web page P in FIG.

図４に示すように、クライアント装置２０の表示装置２７に表示させたＷｅｂページＰはハイパーテキストであり、タイトルＴ「今日の速報ニュース」で示される５項目の見出しが何れもリンクテキストＬtxt1〜Ｌtxt5に設定されている。そして、当該各リンクテキストＬtxt1〜Ｌtxt5は、図５に示すＨＴＭＬソースＰhtmにおいて、何れも“Ａ”で括られるタグ要素に記述され、リンク先ＷｅｂページのＵＲＬが対応付けられている。 As shown in FIG. 4, the Web page P displayed on the display device 27 of the client device 20 is hypertext, and the headings of the five items indicated by the title T “Today's breaking news” are all link texts Ltxt1 to Ltxt5. Is set to Each of the link texts Ltxt1 to Ltxt5 is described in a tag element surrounded by “A” in the HTML source Phtm shown in FIG. 5 and is associated with the URL of the link destination Web page.

例えば表示装置２７に表示されたＷｅｂページＰにおいて、３番目のリンクテキスト「首相、内閣支持率に注文」Ｌtxt3にフォーカスすると、当該リンクテキストＬtxt3を記述したＨＴＭＬソースＰhtmから、破線ａで囲んで示すように、リンク先ＷｅｂページのＵＲＬ[http://www.sight_b.co.jp/news002.html]が取得される。 For example, in the Web page P displayed on the display device 27, when the third link text “Prime Minister, Order to Cabinet Support Rate” Ltxt3 is focused, the HTML source Phtm describing the link text Ltxt3 is surrounded by a broken line a. As described above, the URL [http://www.sight_b.co.jp/news002.html] of the link destination Web page is acquired.

図６は、前記図５におけるリンクテキストＬtxt3のリンク先ＵＲＬに対応するＷｅｂページＬＰのＨＴＭＬソースＬＰhtmを示す図である。 FIG. 6 is a diagram showing the HTML source LPhtm of the Web page LP corresponding to the link destination URL of the link text Ltxt3 in FIG.

図７は、前記図６におけるＨＴＭＬソースＬＰhtmにより記述されたリンク先ＷｅｂページＬＰの画面表示例を示す図である。 FIG. 7 is a diagram showing a screen display example of the linked Web page LP described by the HTML source LPhtm in FIG.

図６に示すように、リンク先ＷｅｂページＬＰのＨＴＭＬソースＬＰhtmには、前記リンク元ＷｅｂページＰにてフォーカスしたリンクテキスト「首相、内閣支持率に注文」Ｌtxt3と同一（あるいは類似）の見出しテキスト「首相、内閣支持率に注文」Ｍtxtを含んでいる“Ｈ１”で括られるタグ要素が存在する。 As shown in FIG. 6, the HTML source LPhtm of the link destination web page LP includes the same (or similar) heading text as the link text “Prime Minister, Order to Cabinet Support Rate” Ltxt3 focused on the link source web page P. There is a tag element enclosed in “H1” that contains Mtxt, “Prime Minister, Order to Cabinet Support”.

そして、前記見出しテキスト「首相、内閣支持率に注文」Ｍtxtに対応する本文テキストＨtxtは、当該見出しテキスト「首相、内閣支持率に注文」Ｍtxtを含むタグ要素“Ｈ１”以降のタグ要素“Ｐ”において、幾つもの読点を有する比較的長い文字列として記述されている。 The body text Htxt corresponding to the heading text “Prime Minister, Order with Cabinet Support Rate” Mtxt is a tag element “P” after the tag element “H1” including the heading text “Order with Prime Minister, Cabinet Support Rate” Mtxt. Are described as a relatively long character string having several readings.

このため、リンク先ＷｅｂページＬＰのＨＴＭＬソースＬＰhtmから、前記リンク元ＷｂページＰにてフォーカスしたリンクテキストＬtxtnに対応する本文テキストＨtxtを抽出するには、当該リンクテキストＬtxtnと同一（あるいは類似）の見出しテキストＭtxtを含んでいるタグ要素“Ｈ１”を検索し、これ以降のタグ要素“Ｐ”において、読点を設定個数以上含み且つ設定数以上の文字数からなるテキストを抽出すればよい。 For this reason, in order to extract the body text Htxt corresponding to the link text Ltxtn focused on the link source Wb page P from the HTML source LPhtm of the link destination Web page LP, the same (or similar) as the link text Ltxtn. The tag element “H1” including the headline text Mtxt is searched, and in the subsequent tag element “P”, the text including the set number of readings and the number of characters exceeding the set number may be extracted.

図８は、リンク元ＷｅｂページＰにてフォーカスしたリンクテキスト「首相、内閣支持率に注文」Ｌtxtと同一（あるいは類似）の複数の見出しテキストＭtxt1,Ｍtxt2を含んでいるリンク先ＷｅｂページＬＰ′の画面表示例を示す図である。 FIG. 8 shows the linked web page LP ′ that includes a plurality of headline texts Mtxt1, Mtxt2 that are the same as (or similar to) Ltxt that is the link text “Prime Minister, Order to Cabinet Support Rate” Ltxt focused on the link source web page P. It is a figure which shows the example of a screen display.

このような、リンクテキスト「首相、内閣支持率に注文」Ｌtxtと同一（あるいは類似）の複数の見出しテキストＭtxt1,Ｍtxt2を含んでいるリンク先ＷｅｂページＬＰ′の場合には、当該リンクテキストＬtxtと同一（あるいは類似）の複数の見出しテキストＭtxt1,Ｍtxt2から、最も「強い」文字列（フォントサイズ・フォントスタイル）からなる見出しテキストＭtxt2を判断する。そして、この最も「強い」文字列からなる見出しテキストＭtxt2を含んでいるタグ要素“Ｈ１”以降のタグ要素“Ｐ”において、読点を設定個数以上含み且つ設定数以上の文字数からなるテキストを抽出すればよい。 In the case of a linked Web page LP ′ including a plurality of headline texts Mtxt1 and Mtxt2 that are the same (or similar) as the link text “Prime Minister, Order from the Cabinet's Support Rate” Ltxt, From a plurality of the same (or similar) headline texts Mtxt1, Mtxt2, the headline text Mtxt2 composed of the most “strong” character string (font size / font style) is determined. Then, in the tag element “P” after the tag element “H1” including the heading text Mtxt2 composed of the most “strong” character string, the text including the set number of readings and the number of characters exceeding the set number is extracted. That's fine.

次に、前記構成のクライアント装置２０におけるＷｅｂページ閲覧機能について説明する。 Next, the Web page browsing function in the client device 20 having the above configuration will be described.

図９は、前記クライアント装置２０によるＷｅｂページの閲覧に伴いリンク先Ｗｅｂページの本文テキストを取得してその読み上げ音声を出力するための読み上げ対象テキスト取得処理を示すフローチャートである。 FIG. 9 is a flowchart showing the text-to-speech acquisition process for acquiring the body text of the linked Web page and outputting the text to be read out as the client device 20 browses the Web page.

例えばサーバ装置（Ａ）１０のＷｅｂサイト[http://www.sight_a.co.jp/]から取得されたユーザ所望のＷｅｂコンテンツ（Ａ）１５ａのページＰ（図４参照）が表示装置２７に表示されると、図９における読み上げ対象テキスト取得処理が起動され、先ず、ＲＡＭ２４ｂ内の読み上げ対象テキストメモリ２４ｂの内容が初期化によりクリアされる（ステップＳ１）。 For example, the page P (see FIG. 4) of the Web content (A) 15a desired by the user acquired from the Web site [http://www.sight_a.co.jp/] of the server device (A) 10 is displayed on the display device 27. When displayed, the reading target text acquisition process in FIG. 9 is started, and first, the contents of the reading target text memory 24b in the RAM 24b are cleared by initialization (step S1).

そして、この表示中のＷｅｂページＰにおいて、所望のリンクテキスト（例えば「首相、内閣支持率に注文」Ｌtxt3）にフォーカスが移動されると、当該フォーカスの当たっているリンクテキストＬtxt3がＲＡＭ２４に記憶される（ステップＳ２）。 Then, when the focus is moved to the desired link text (for example, “Order by Prime Minister, Cabinet Support Rate” Ltxt3) on the displayed Web page P, the focused link text Ltxt3 is stored in the RAM 24. (Step S2).

すると、前記表示中のＷｅｂページＰのＨＴＭＬソースＰhtm（図５参照）から、前記フォーカスされたリンクテキストＬtxt3に対応するリンク先ＷｅｂページのＵＲＬ[http://www.sight_b.co.jp/news002.html]が取得され、当該リンク先のＨＴＭＬソースＬＰhtm（図６参照）が取得される（ステップＳ３）。 Then, the URL [http://www.sight_b.co.jp/news002] of the linked web page corresponding to the focused link text Ltxt3 from the HTML source Phtm (see FIG. 5) of the web page P being displayed. .html] is acquired, and the linked HTML source LPhtm (see FIG. 6) is acquired (step S3).

すると、前記取得されたリンク先ＷｅｂページのＨＴＭＬソースＬＰhtmから、前記リンクテキスト「首相、内閣支持率に注文」Ｌtxt3と同じ（あるいは類似の）文字列を含むタグ要素を検索してリストアップする処理が、次の図１０のフローチャートに従い実行される。 Then, a process of searching for and listing a tag element including the same (or similar) character string as the link text “Prime Minister, Order to Cabinet Support” Ltxt3 from the HTML source LPhtm of the acquired link destination web page Is executed according to the flowchart of FIG.

図１０は、前記クライアント装置２０の読み上げ対象テキスト取得処理に伴うリンクテキストと同じ文字列を含む要素のリストアップ処理を示すフローチャートである。 FIG. 10 is a flowchart showing a list-up process of elements including the same character string as the link text accompanying the reading-target text acquisition process of the client device 20.

このリストアップ処理が起動されると、ＲＡＭ２４内のリンクテキスト類似要素メモリ２４ａが初期化によりクリアされ（ステップＡ１）、前記ステップＳ３にて取得されたリンク先ＷｅｂページのＨＴＭＬソースＬＰhtm（図６参照）において、テキストの記述を含むタグ要素が存在するか否か、当該ＨＴＭＬソースＬＰhtmの先頭から各タグ要素毎に判断される（ステップＡ２）。 When this list-up process is started, the link text similar element memory 24a in the RAM 24 is cleared by initialization (step A1), and the HTML source LPhtm (see FIG. 6) of the link destination Web page acquired in step S3. ), It is determined for each tag element from the head of the HTML source LPhtm whether or not there is a tag element including a text description (step A2).

ここで、テキストの記述を含むタグ要素が存在すると判断された場合には（ステップＡ２（ｙｅｓ））、当該タグ要素に含まれるテキストが、前記ステップＳ２にて記憶されたリンクテキスト「首相、内閣支持率に注文」Ｌtxt3と同じ（あるいは類似の）文字列を含むか否か判断される（ステップＡ３）。 If it is determined that there is a tag element including a description of the text (step A2 (yes)), the text included in the tag element is stored in the link text “Prime Minister, Cabinet” stored in step S2. It is determined whether or not the support rate includes the same (or similar) character string as “order” Ltxt3 (step A3).

そして、図６におけるリンク先ＷｅｂページのＨＴＭＬソースＬＰhtmの“Ｈ１”で括られるタグ要素において、リンクテキスト「首相、内閣支持率に注文」Ｌtxt3と同一（あるいは類似）の文字列（見出しテキスト「首相、内閣支持率に注文」Ｍtxt）を含んでいると判断されると（ステップＡ３（ｙｅｓ））、このタグ要素“Ｈ１”がＲＡＭ２４内のリンクテキスト類似要素メモリ２４ａに登録される（ステップＡ４）。 Then, in the tag element enclosed by “H1” in the HTML source LPhtm of the linked Web page in FIG. 6, the same (or similar) character string (headline text “Prime Minister” If it is determined that the order “Mtxt) is included in the cabinet support rate (step A3 (yes)), this tag element“ H1 ”is registered in the link text similar element memory 24a in the RAM 24 (step A4). .

前記ステップＡ２において、テキストの記述を含まないタグ要素と判断された場合（ステップＡ２（ｎｏ））、又は前記ステップＡ３において、テキストの記述を含んでいても前記リンクテキストＬtxt3と同一（あるいは類似）の文字列を含まないと判断された場合（ステップＡ３（ｎｏ））、又は前記ステップＡ４においてリンクテキストＬtxt3と同一（あるいは類似）の文字列を含むタグ要素が登録されると、当該各ステップにおいて処理対象となったタグ要素がリンク先ＷｅｂページのＨＴＭＬソースＬＰhtmにおける最後のタグ要素であるか否か判断される（ステップＡ５）。 If it is determined in step A2 that the tag element does not include a text description (step A2 (no)), or even if the text description is included in step A3, it is the same as (or similar to) the link text Ltxt3. If the tag element including the same (or similar) character string as the link text Ltxt3 is registered in the step A4, it is determined that the character string is not included (step A3 (no)). It is determined whether or not the tag element to be processed is the last tag element in the HTML source LPhtm of the link destination Web page (step A5).

ここで、リンク先ＷｅｂページのＨＴＭＬソースＬＰhtmにおける最後のタグ要素でないと判断された場合には（ステップＡ５（ｎｏ））、次のタグ要素を処理対象として前記ステップＡ２以降の処理が繰り返される（ステップＡ６→Ａ２）。 Here, when it is determined that it is not the last tag element in the HTML source LPhtm of the link destination Web page (step A5 (no)), the process after the step A2 is repeated with the next tag element as a processing target ( Step A6 → A2).

こうした図１０におけるリストアップ処理に基づき（ステップＳＡ）、１以上のタグ要素がリストアップされたと判断されると（ステップＳ４（ｙｅｓ））、当該リストアップされたタグ要素は１つのみか否か判断される（ステップＳ５）。 Based on the list-up process in FIG. 10 (step SA), if it is determined that one or more tag elements are listed (step S4 (yes)), it is determined whether there is only one tag element listed. (Step S5).

ここで、前記リンクテキストＬtxtnと同一（あるいは類似）の文字列を含むタグ要素が１つのみでなく複数リストアップされたと判断された場合には（ステップＳ５（ｎｏ））、当該リストアップされた複数のタグ要素にそれぞれ含まれる文字列のうち、最も「強い」文字列（フォントサイズ・フォントスタイル）を、リンク先Ｗｅｂページの本文テキストＨtxtの直前に位置する文字列として抽出するための、図１１における文字列の「強さ」の比較処理に移行される（ステップＳＢ）。 Here, when it is determined that a plurality of tag elements including the same (or similar) character string as the link text Ltxtn are listed instead of only one (step S5 (no)), the list is listed. A diagram for extracting the most “strong” character string (font size / font style) from among character strings included in a plurality of tag elements as a character string located immediately before the body text Htxt of the linked Web page. 11 shifts to the comparison processing of the “strength” of the character string (step SB).

一方、前記図６で示したように、「首相、内閣支持率に注文」Ｌtxt3と同一（あるいは類似）の文字列（見出しテキストＭtxt）を含むタグ要素が、“Ｈ１”の１つのみであると判断された場合には（ステップＳ５（ｙｅｓ））、当該１つのタグ要素“Ｈ１”に基づきリンク先Ｗｅｂページの本文テキストＨtxtを抽出するための、図１２における本文テキストの抽出処理に移行される（ステップＳＣ）。 On the other hand, as shown in FIG. 6, there is only one tag element “H1” that includes the same (or similar) character string (heading text Mtxt) as “Prime, Order from Cabinet Support Rate” Ltxt3. If it is determined (step S5 (yes)), the process proceeds to the body text extraction process in FIG. 12 for extracting the body text Htxt of the linked web page based on the one tag element “H1”. (Step SC).

図１１は、前記クライアント装置２０の読み上げ対象テキスト取得処理に伴う文字列の「強さ」比較処理を示すフローチャートである。 FIG. 11 is a flowchart showing a character string “strength” comparison process associated with the reading-target text acquisition process of the client device 20.

前記リンクテキスト「首相、内閣支持率に注文」Ｌtxt3に対応するリンク先Ｗｅｂページが、例えば図８で示したように、当該リンクテキストＬtxt3と同一（あるいは類似）の文字列（見出しテキストＭtxt1，Ｍtxt2）を含むタグ要素が複数リストアップされたＷｅｂページＬＰ′である場合（ステップＳ５（ｎｏ））、図１１における文字列の「強さ」比較処理に移行され（ステップＳＢ）、先ず比較対象のタグ要素を管理するための変数ｘ，ｙが、それぞれｘ＝２，ｙ＝１として初期化される（ステップＢ１）。 As shown in FIG. 8, for example, the linked Web page corresponding to the link text “Prime Minister, Order to Cabinet Support Rate” Ltxt3 is the same (or similar) character string (headline text Mtxt1, Mtxt2). ) Is a Web page LP ′ listed in plural (step S5 (no)), the process proceeds to the character string “strength” comparison process in FIG. 11 (step SB). Variables x and y for managing tag elements are initialized as x = 2 and y = 1, respectively (step B1).

また、変数ｚに、前記リストアップ処理（ステップＳＡ）に従いＲＡＭ２４内のリンクテキスト類似要素メモリ２４ａに登録されたタグ要素の数（図８で示すリンク先Ｗｅｂページの場合は“２”）が代入される（ステップＢ２）。 Also, the number of tag elements registered in the link text similar element memory 24a in the RAM 24 in accordance with the list-up process (step SA) is substituted for the variable z (“2” in the case of the linked Web page shown in FIG. 8). (Step B2).

すると、前記テキスト類似要素メモリ２４ａに登録されたリスト上のｘ番目（ｘ＝２）のタグ要素に記述された文字列について、そのフォントサイズおよびフォントスタイルが取得される（ステップＢ３）。 Then, the font size and font style of the character string described in the xth (x = 2) tag element on the list registered in the text similar element memory 24a are acquired (step B3).

そして、前記リスト上ｘ番目（ｘ＝２）のタグ要素の文字列のフォントサイズＳＺｘとｙ番目（ｙ＝１）のタグ要素の文字列のフォントサイズＳＺｙとが比較され（ステップＢ４）、等しいか否か判断される（ステップＢ５）。 Then, the font size SZx of the character string of the x-th (x = 2) tag element on the list is compared with the font size SZy of the character string of the y-th (y = 1) tag element (step B4) and are equal. Is determined (step B5).

ここで、ｘ番目（ｘ＝２）のタグ要素の文字列のフォントサイズＳＺｘとｙ番目（ｙ＝１）のタグ要素の文字列のフォントサイズＳＺｙとが等しくないと判断され（ステップＢ５（ｎｏ））、ｙ番目（ｙ＝１）よりｘ番目（ｘ＝２）が大きいと判断された場合には（ステップＢ６（ｙｅｓ））、当該変数ｘの値“２”が変数ｙに代入される（ステップＢ７）。 Here, it is determined that the font size SZx of the character string of the xth (x = 2) tag element is not equal to the font size SZy of the character string of the yth (y = 1) tag element (step B5 (no )), When it is determined that the xth (x = 2) is larger than the yth (y = 1) (step B6 (yes)), the value “2” of the variable x is assigned to the variable y. (Step B7).

逆に、ｙ番目（ｙ＝１）よりｘ番目（ｘ＝２）が小さいと判断された場合には（ステップＢ６（ｎｏ））、当該変数ｙの値はそのまま維持される。 Conversely, if it is determined that the xth (x = 2) is smaller than the yth (y = 1) (step B6 (no)), the value of the variable y is maintained as it is.

一方、前記ステップＢ５において、ｘ番目（ｘ＝２）のタグ要素の文字列のフォントサイズＳＺｘとｙ番目（ｙ＝１）のタグ要素の文字列のフォントサイズＳＺｙとが等しいと判断された場合には（ステップＢ５（ｙｅｓ））、さらに当該ｘ番目（ｘ＝２）のタグ要素の文字列のフォントスタイルＳＴｘとｙ番目（ｙ＝１）のタグ要素の文字列のフォントスタイルＳＴｙとが比較され（ステップＢ８）、ｘ番目のフォントスタイルＳＴｘのみボールドか（ステップＢ９）、またはｙ番目のフォントスタイルＳＴｙのみボールドか（ステップＢ１０）、または何れのフォントスタイルも同じであるか（ステップＢ１０（ｎｏ））が判断される。 On the other hand, when it is determined in step B5 that the font size SZx of the character string of the xth (x = 2) tag element is equal to the font size SZy of the character string of the yth (y = 1) tag element. (Step B5 (yes)), the font style STx of the character string of the x-th (x = 2) tag element is compared with the font style STy of the character string of the y-th (y = 1) tag element. (Step B8), whether only the xth font style STx is bold (step B9), only the yth font style STy is bold (step B10), or is any font style the same (step B10 (no) )) Is determined.

ここで、ｘ番目（ｘ＝２）のタグ要素の文字列のフォントスタイルＳＴｘのみボールドであると判断された場合には（ステップＢ９（ｙｅｓ））、当該変数ｘの値“２”が変数ｙに代入される（ステップＢ７）。 When it is determined that only the font style STx of the character string of the xth (x = 2) tag element is bold (step B9 (yes)), the value “2” of the variable x is set to the variable y. (Step B7).

逆に、ｙ番目（ｙ＝１）のタグ要素の文字列のフォントスタイルＳＴｙのみボールドであると判断された場合には（ステップＢ１０（ｙｅｓ））、当該変数ｙの値はそのまま維持される。 Conversely, when it is determined that only the font style STy of the character string of the y-th (y = 1) tag element is bold (step B10 (yes)), the value of the variable y is maintained as it is.

そして、前記フォントサイズＳＺが大きい方、または当該フォンサイズＳＺが等しくてもそのフォンスタイルＳＴがボールドである方の文字列を含むタグ要素の出現番号が変数ｙに設定されると、変数ｘと変数ｚが等しい、つまり変数ｘが前記リンクテキスト類似要素メモリ２４ａに登録されたタグ要素の総数に達したと判断されるまで（ステップＢ１１
)、当該変数ｘがインクリメントされ（ステップＢ１２）、前記ステップＢ３以降の処理が繰り返される（ステップＢ１２→Ｂ３）。 When the appearance number of the tag element including the character string whose font size SZ is larger or whose font style SZ is equal but whose phone style ST is bold is set in the variable y, the variable x Until it is determined that the variables z are equal, that is, the variable x has reached the total number of tag elements registered in the link text similar element memory 24a (step B11).
), The variable x is incremented (step B12), and the processing after step B3 is repeated (step B12 → B3).

一方、ｘ番目のタグ要素の文字列のフォントサイズＳＺｘとｙ番目のタグ要素の文字列のフォントサイズＳＺｙとが等しく（ステップＢ５（ｙｅｓ））、しかも何れのフォントスタイルＳＴｘ，ＳＴｙも同じであると判断された場合には（ステップＢ１０（ｎｏ））、変数ｘと変数ｚが等しい、つまり変数ｘが前記リンクテキスト類似要素メモリ２４ａに登録されたタグ要素の数に達したと判断されるまでは（ステップＢ１３）、変数ｘの値が変数ｙに代入された後（ステップＢ１４）、当該変数ｘがインクリメントされ（ステップＢ１２）、前記ステップＢ３以降の処理が繰り返される（ステップＢ１２→Ｂ３）。 On the other hand, the font size SZx of the character string of the xth tag element is equal to the font size SZy of the character string of the yth tag element (step B5 (yes)), and both font styles STx and STy are the same. (Step B10 (no)), the variable x is equal to the variable z, that is, until it is determined that the variable x has reached the number of tag elements registered in the link text similar element memory 24a. (Step B13) After the value of the variable x is substituted into the variable y (Step B14), the variable x is incremented (Step B12), and the processing after Step B3 is repeated (Step B12 → B3).

なお、前記ステップＢ１３において、変数ｘと変数ｚが等しく、当該変数ｘが前記リンクテキスト類似要素メモリ２４ａに登録されたタグ要素の総数に達したと判断された場合には（ステップＢ１３（ｙｅｓ））、リンク先ＷｅｂページのＨＴＭＬソースにおいて、前記リンクテキストＬtxtnと同一（あるいは類似）の文字列を含む複数のタグ要素のうち、最も「強い」文字列含むタグ要素への絞り込みは不可としてエラー処理される（ステップＢ１５）。 If it is determined in step B13 that the variable x is equal to the variable z and the variable x has reached the total number of tag elements registered in the link text similar element memory 24a (step B13 (yes)). ) In the HTML source of the linked Web page, it is impossible to narrow down to a tag element including the most “strong” character string among a plurality of tag elements including the same (or similar) character string as the link text Ltxtn. (Step B15).

そして、前記ステップＢ１１において、変数ｘと変数ｚが等しく、当該変数ｘが前記リンクテキスト類似要素メモリ２４ａに登録されたタグ要素の総数に達したと判断されると（ステップＢ１１（ｙｅｓ））、当該登録されたリスト上のｙ番目のタグ要素が、最も「強い」文字列を含むタグ要素として設定される（ステップＢ１６）。 In Step B11, when it is determined that the variable x is equal to the variable z and the variable x has reached the total number of tag elements registered in the link text similar element memory 24a (Step B11 (yes)), The y-th tag element on the registered list is set as a tag element including the most “strong” character string (step B16).

すなわち、前記リンクテキスト「首相、内閣支持率に注文」Ｌtxt3に対応するリンク先Ｗｅｂページが、例えば図８で示したＷｅｂページＬＰ′であって、当該リンクテキストＬtxt3と同一（あるいは類似）の文字列（見出しテキストＭtxt1，Ｍtxt2）を含む２つのタグ要素がリストアップされた場合には、フォントサイズＳＺの大きい方の見出しテキストＭtxt2を含むｙ番目（ｙ＝２）のタグ要素が、最も「強い」文字列を含むタグ要素として設定される。 That is, the linked Web page corresponding to the link text “Prime Minister, Order to Cabinet Support Rate” Ltxt3 is, for example, the Web page LP ′ shown in FIG. 8, and the same (or similar) character as the link text Ltxt3. When two tag elements including a column (heading text Mtxt1, Mtxt2) are listed, the y-th (y = 2) tag element including the heading text Mtxt2 having the larger font size SZ is the strongest. "Is set as a tag element containing a character string.

こうした前記一連の文字列の「強さ」比較処理を経て、リンクテキストＬtxtnと同一（あるいは類似）の最も「強い」文字列を含むタグ要素が抽出されたと判断されると（ステップＳ６（ｙｅｓ））、当該最も「強い」文字列を含むタグ要素に基づきリンク先Ｗｅｂページの本文テキストＨtxtを抽出するための、図１２における本文テキストの抽出処理に移行される（ステップＳＣ）。 When it is determined that the tag element including the most “strong” character string that is the same (or similar) to the link text Ltxtn is extracted through the “strength” comparison process of the series of character strings (step S6 (yes)). ), The process proceeds to the body text extraction process in FIG. 12 for extracting the body text Htxt of the linked Web page based on the tag element including the most “strong” character string (step SC).

図１２は、前記クライアント装置２０の読み上げ対象テキスト取得処理に伴う本文テキストの抽出処理を示すフローチャートである。 FIG. 12 is a flowchart showing a body text extraction process associated with the reading target text acquisition process of the client device 20.

この本文テキスト抽出処理が起動されると、先ず、前記リンクテキストＬtxtnに対応するリンク先ＵＲＬに従い取得されたＷｅｂページのＨＴＭＬソースにおいて、前記ステップＳＡにてリストアップされたリンクテキストＬtxtnと同一（あるいは類似）の文字列を含む１つのタグ要素の次のタグ要素か、または前記ステップＳＢにて抽出されたリンクテキストＬtxtnと同一（あるいは類似）で且つ最も「強い」文字列を含むタグ要素の次のタグ要素に注目する（ステップＣ１）。 When this body text extraction process is started, first, in the HTML source of the Web page acquired according to the link destination URL corresponding to the link text Ltxtn, the same as the link text Ltxtn listed in the step SA (or The next tag element after one tag element including a similar character string, or the next tag element that is the same (or similar) to the link text Ltxtn extracted in step SB and includes the most “strong” character string. Note the tag element (step C1).

そして、前記注目したタグ要素について、テキスト情報を含むタグ要素であるか否か判断される（ステップＣ２）。 Then, it is determined whether or not the noted tag element is a tag element including text information (step C2).

ここで、前記注目したタグ要素が、テキスト情報を含むタグ要素であると判断された場合には（ステップＣ２（ｙｅｓ））、当該テキスト情報は読点を設定個数Ｎpunc以上含むか否か判断される（ステップＣ３）。 Here, when it is determined that the noted tag element is a tag element including text information (step C2 (yes)), it is determined whether or not the text information includes a set number Npunc of reading points. (Step C3).

そして、前記注目したタグ要素のテキスト情報が、読点を設定個数Ｎpunc以上含むテキスト情報であると判断された場合には（ステップＣ３（ｙｅｓ））、さらに当該テキスト情報は設定文字数Ｍlen以上の長さであるか判断される（ステップＣ４）。 When it is determined that the text information of the noted tag element is text information including a set number of readings Npunc or more (step C3 (yes)), the text information has a length of the set number of characters Mlen or more. Is determined (step C4).

そして、前記注目したタグ要素のテキスト情報が、設定文字数Ｍlen以上の長さであると判断された場合には（ステップＣ４（ｙｅｓ））、当該テキスト情報が前記リンクテキストＬtxtnに対応するリンク先ＷｅｂページＬＰの読み上げ対象の本文テキストＨtxtであるとして設定される（ステップＣ５）。 If it is determined that the text information of the noted tag element is longer than the set number of characters Mlen (step C4 (yes)), the link destination Web corresponding to the link text Ltxtn corresponds to the text information. It is set as the body text Htxt to be read out of the page LP (step C5).

すると、前記リンクテキストＬtxtnに対応するリンク先ＷｅｂページＬＰの本文テキストＨtxtが抽出されたと判断され（ステップＳ７（ｙｅｓ））、当該リンクテキストＬtxtnと抽出された本文テキストＨtxtとがそれぞれＲＡＭ２４内の読み上げ対象テキストメモリ２４ｂに記憶される（ステップＳ８）。そして、この読み上げ対象テキストメモリ２４ｂに記憶されたリンクテキストＬtxtnと本文テキストＨtxtとが、前記音声合成処理部２９ａにより音声信号に変換され、音声出力部２９ｂを介してスピーカＳＰから音声出力される。 Then, it is determined that the body text Htxt of the link destination Web page LP corresponding to the link text Ltxtn is extracted (step S7 (yes)), and the link text Ltxtn and the extracted body text Htxt are read out in the RAM 24, respectively. It is stored in the target text memory 24b (step S8). The link text Ltxtn and the body text Htxt stored in the text-to-speech text memory 24b are converted into voice signals by the voice synthesis processing unit 29a and output from the speaker SP via the voice output unit 29b.

すなわち、前記図４で示したＷｅｂページＰにおいて、リンクテキスト「首相、内閣支持率に注文」Ｌtxt3にフォーカスを移動させると、当該リンクテキストtxt3に対応するリンク先ＷｅｂページＬＰのハイパーテキスト（図６参照）から本文テキスト「政権発足を受け、…と冷静に受け止めた。」Ｈtxtが抽出され、前記リンクテキスト「首相、内閣支持率に注文」Ｌtxt3と共にその読み上げ音声が出力される。これにより、リンクテキストＬtxtからそのリンク先ＷｅｂページＬＰを実際に開いて表示させ、その内容を確認する必要なく、当該リンクテキストＬtxtに対応する本文テキストＨtxtの内容を簡単に知ることができ、所望のリンク先を時間のロスなく効率的に見つけて表示させることができる。 That is, in the Web page P shown in FIG. 4, when the focus is moved to the link text “Prime Minister, Order to the Cabinet Support Rate” Ltxt3, the hypertext of the link destination Web page LP corresponding to the link text txt3 (FIG. 6). Htxt is extracted from the text of the text “Refer to the administration, and received it calmly.” From the text, and the read-out voice is output together with the link text “Prime Minister, Order from the Cabinet” Ltxt3. As a result, it is possible to easily know the contents of the body text Htxt corresponding to the link text Ltxt without having to confirm the contents by actually opening and displaying the linked web page LP from the link text Ltxt. Can be found and displayed efficiently without loss of time.

一方、現在注目しているタグ要素がテキスト情報を含んでいないと判断された場合（ステップＣ２（ｎｏ））、またはテキスト情報を含んでいると判断されても、当該テキスト情報は設定個数Ｎpunc以上の読点を含まないと判断された場合（ステップＣ３（ｎｏ））、または当該テキスト情報が設定個数Ｎpunc以上の読点を含んでいても、設定文字数Ｍlen以上の長さがないと判断された場合には（ステップＣ４（ｎｏ））、注目中のタグ要素がリンク先ＷｅｂページのＨＴＭＬソースにおける最後のタグ要素か否か判断される（ステップＣ６）。 On the other hand, if it is determined that the tag element currently focused on does not include text information (step C2 (no)), or if it is determined that it includes text information, the text information is equal to or greater than the set number Npunc. If it is determined that it does not include a punctuation mark (step C3 (no)), or if it is determined that the text information includes a punctuation mark greater than or equal to the set number Npunc but does not have a length equal to or greater than the set character number Mlen (Step C4 (no)), it is determined whether or not the tag element under attention is the last tag element in the HTML source of the linked web page (step C6).

ここで、最後のタグ要素でないと判断された場合には（ステップＣ６（ｎｏ））、次のタグ要素に注目し（ステップＣ７）、前記ステップＣ２以降の処理が繰り返される（ステップＣ７→Ｃ２）。 If it is determined that the tag element is not the last tag element (step C6 (no)), the next tag element is noticed (step C7), and the processes after step C2 are repeated (step C7 → C2). .

そして、前記ステップＣ６において、最後のタグ要素であると判断された場合には（ステップＣ６（ｙｅｓ））、前記リンクテキストtxtnに対応するリンク先ＷｅｂページＬＰのハイパーテキストから本文テキストとしてのテキスト情報は抽出されなかったと判断され（ステップＳ７（ｎｏ））、エラー処理される。 If it is determined in step C6 that it is the last tag element (step C6 (yes)), the text information as the body text from the hypertext of the linked web page LP corresponding to the link text txtn. Is not extracted (step S7 (no)), and error processing is performed.

したがって、前記構成のクライアント装置２０におけるＷｅｂページ閲覧機能によれば、現在表示中のＷｅｂページのハイパーテキストにおける任意のリンクテキストＬtxtnに、カーソル操作などによって選択的にフォーカスを当てると、当該リンクテキストＬtxtnに対応するリンク先ＵＲＬに従い、そのリンク先Ｗｅｂページのハイパーテキスト（ＨＴＭＬソース）が取得されると共に、前記リンクテキストＬtxtnと同一（あるいは類似）の文字列を有するタグ要素がリストアップされる。そして、前記リンク先Ｗｅｂページのハイパーテキスト（ＨＴＭＬソース）における、前記リンクテキストＬtxtnと同一（あるいは類似）の文字列を有するタグ要素に基づき、それ以降のタグ要素に含まれる本文テキストＨtxtが抽出され、その読み上げ音声が音声合成されて出力される。このため、リンクテキストＬtxtnからそのリンク先ＷｅｂページＬＰを実際に開いて表示させ、その内容を確認する必要なく、当該リンクテキストＬtxtに対応する本文テキストＨtxtの内容を簡単に知ることができ、所望のリンク先を時間のロスなく効率的に見つけて表示させることができる。 Therefore, according to the Web page browsing function in the client device 20 having the above-described configuration, when an arbitrary link text Ltxtn in the hypertext of the currently displayed Web page is selectively focused by a cursor operation or the like, the link text Ltxtn The hypertext (HTML source) of the link destination Web page is acquired according to the link destination URL corresponding to, and tag elements having the same (or similar) character string as the link text Ltxtn are listed. Then, based on the tag element having the same (or similar) character string as the link text Ltxtn in the hypertext (HTML source) of the link destination Web page, the body text Htxt included in the subsequent tag elements is extracted. The read-out speech is synthesized and output. For this reason, it is possible to easily know the content of the body text Htxt corresponding to the link text Ltxt without having to actually open and display the linked Web page LP from the link text Ltxtn and confirm the content. Can be found and displayed efficiently without loss of time.

また、前記構成のクライアント装置２０におけるＷｅｂページ閲覧機能によれば、リンク先Ｗｅｂページのハイパーテキスト（ＨＴＭＬソース）から、リンクテキストＬtxtnと同一（あるいは類似）の文字列を有するタグ要素が複数リストアップされた場合には、当該複数のタグ要素のうち、フォンサイズＳＺやフォントスタイルＳＴにおいて最も「強い」文字列を含むタグ要素が抽出され、この抽出されたタグ要素以降のタグ要素に含まれるテキスト情報から本文テキストＨtxtが判断抽出され、その読み上げ音声が出力される。このため、リンク先Ｗｅｂページのハイパーテキスト（ＨＴＭＬソース）に、リンクテキストＬtxtnと同一（あるいは類似）の文字列を有するタグ要素が複数存在していても、当該リンクテキストＬtxtnに対応する本文テキストＨtxtを含んでいるタグ要素を従えたタグ要素を確実に抽出できる。 Further, according to the Web page browsing function in the client device 20 having the above-described configuration, a plurality of tag elements having the same (or similar) character string as the link text Ltxtn are listed from the hypertext (HTML source) of the link destination Web page. If the tag element is selected, a tag element including the most “strong” character string in the phone size SZ or the font style ST is extracted from the plurality of tag elements, and the text included in the tag elements after the extracted tag element is extracted. The body text Htxt is judged and extracted from the information, and the reading voice is output. For this reason, even if a plurality of tag elements having the same (or similar) character string as the link text Ltxtn exist in the hypertext (HTML source) of the link destination Web page, the body text Htxt corresponding to the link text Ltxtn It is possible to reliably extract a tag element that follows a tag element that includes.

また、前記構成のクライアント装置２０におけるＷｅｂページ閲覧機能によれば、リンクテキストＬtxtnと同一（あるいは類似）の文字列を有するタグ要素以降のタグ要素に含まれるテキスト情報を本文テキストＨtxtとして判断抽出するには、当該テキスト情報が設定個数Ｎpunc以上の読点を含み、且つ設定文字数Ｍlen以上の長さであるかを判断して抽出する。このため、リンクテキストＬtxtnに対応する本文テキストＨtxtを正しく抽出してその読み上げ音声を出力できる。 Further, according to the Web page browsing function in the client device 20 having the above configuration, the text information included in the tag elements after the tag element having the same (or similar) character string as the link text Ltxtn is determined and extracted as the body text Htxt. Is extracted by determining whether or not the text information includes reading points equal to or greater than the set number Npunc and has a length equal to or greater than the set number of characters Mlen. Therefore, it is possible to correctly extract the body text Htxt corresponding to the link text Ltxtn and output the read-out voice.

なお、前記実施形態では、通常のサーバ・クライアント・システムにおけるクライアント装置２０のＷｅｂブラウザ２３ａに対し前記Ｗｅｂページ閲覧機能を搭載して、表示中のＷｅｂページＰのリンクテキストＬtxtnに対応するリンク先サイトの本文テキストＨtxtを抽出しその読み上げ音声を出力する場合について説明した。これに対し、サーバベース・コンピューティング・システムにおけるシン・クライアント端末にて表示中のＷｅｂページＰのリンクテキストＬtxtnに対応するリンク先サイトの本文テキストＨtxtを抽出しその読み上げ音声を出力する場合には、当該シン・クライアント端末からの入力イベントによって起動するサーバ装置のＷｅｂブラウザに対し、前記同様のＷｅｂページ閲覧機能を搭載すればよい。 In the embodiment, the Web page browsing function is installed in the Web browser 23a of the client device 20 in a normal server / client system, and the link destination site corresponding to the link text Ltxtn of the Web page P being displayed. The case where the body text Htxt is extracted and the reading voice is output has been described. On the other hand, when extracting the text text Htxt of the link destination site corresponding to the link text Ltxtn of the Web page P being displayed on the thin client terminal in the server-based computing system, and outputting the reading voice A web page browsing function similar to that described above may be installed in the web browser of the server device activated by an input event from the thin client terminal.

なお、前記各実施形態において記載したＷｅｂページ閲覧装置による各処理の手法、すなわち、図９のフローチャートに示す読み上げ対象テキスト取得処理、図１０のフローチャートに示す同読み上げ対象テキスト取得処理に伴うリンクテキストと同一文字列を含むタグ要素のリストアップ処理、図１１のフローチャートに示す同読み上げ対象テキスト取得処理に伴う文字列の「強さ」比較処理、図１２のフローチャートに示す同読み上げ対象テキスト取得処理に伴う本文テキスト抽出処理などの各手法は、何れもコンピュータに実行させることができるプログラムとして、メモリカード（ＲＯＭカード、ＲＡＭカード等）、磁気ディスク（フロッピディスク、ハードディスク等）、光ディスク（ＣＤ−ＲＯＭ、ＤＶＤ等）、半導体メモリ等の外部記憶装置２５（１５）の媒体に格納して配布することができる。そして、Ｗｅｂページ閲覧装置のコンピュータ（ＣＰＵ２１（１１））は、この外部記憶装置２５（１５）の媒体に記憶されたプログラムを記憶装置（フラッシュＲＯＭ２３（１３）やＲＡＭ２４（１４））に読み込み、この読み込んだプログラムによって動作が制御されることにより、前記各実施形態において説明したＷｅｂページ閲覧機能を実現し、前述した手法による同様の処理を実行することができる。 It should be noted that each processing method by the Web page browsing apparatus described in each of the above embodiments, that is, the reading target text acquisition process shown in the flowchart of FIG. 9, the link text accompanying the reading target text acquisition process shown in the flowchart of FIG. A process for listing tag elements including the same character string, a “strength” comparison process for character strings associated with the text-to-speech target text acquisition process shown in the flowchart of FIG. 11, and a text-to-speech target text acquisition process shown in the flowchart of FIG. Each method such as text extraction processing is a program that can be executed by a computer, such as a memory card (ROM card, RAM card, etc.), magnetic disk (floppy disk, hard disk, etc.), optical disc (CD-ROM, DVD, etc.). Etc.), semiconductor memory, etc. It may be distributed and stored in the medium of the external storage device 25 (15). Then, the computer (CPU 21 (11)) of the Web page browsing device reads the program stored in the medium of the external storage device 25 (15) into the storage device (flash ROM 23 (13) or RAM 24 (14)), and this By controlling the operation by the read program, the Web page browsing function described in each of the above embodiments can be realized, and the same processing by the above-described method can be executed.

また、前記各手法を実現するためのプログラムのデータは、プログラムコードの形態として通信ネットワーク（Ｎ）上を伝送させることができ、この通信ネットワーク（Ｎ）に接続されたコンピュータ装置（プログラムサーバ）から前記のプログラムデータを取り込んで記憶装置（フラッシュＲＯＭ２３（１３）やＲＡＭ２４（１４））に記憶させ、前述したＷｅｂページ閲覧機能を実現することもできる。 Further, program data for realizing each of the above methods can be transmitted on the communication network (N) in the form of a program code, and from a computer device (program server) connected to the communication network (N). It is also possible to capture the program data and store it in a storage device (flash ROM 23 (13) or RAM 24 (14)) to realize the Web page browsing function described above.

なお、本願発明は、前記各実施形態に限定されるものではなく、実施段階ではその要旨を逸脱しない範囲で種々に変形することが可能である。さらに、前記各実施形態には種々の段階の発明が含まれており、開示される複数の構成要件における適宜な組み合わせにより種々の発明が抽出され得る。例えば、各実施形態に示される全構成要件から幾つかの構成要件が削除されたり、幾つかの構成要件が異なる形態にして組み合わされても、発明が解決しようとする課題の欄で述べた課題が解決でき、発明の効果の欄で述べられている効果が得られる場合には、この構成要件が削除されたり組み合わされた構成が発明として抽出され得るものである。 Note that the present invention is not limited to the above-described embodiments, and various modifications can be made without departing from the scope of the invention at the stage of implementation. Further, each of the embodiments includes inventions at various stages, and various inventions can be extracted by appropriately combining a plurality of disclosed constituent elements. For example, even if some constituent elements are deleted from all the constituent elements shown in each embodiment or some constituent elements are combined in different forms, the problems described in the column of the problem to be solved by the invention If the effects described in the column “Effects of the Invention” can be obtained, a configuration in which these constituent requirements are deleted or combined can be extracted as an invention.

本発明のＷｅｂページ閲覧装置の実施形態に係るサーバ・クライアント・システムの構成を示すブロック図。The block diagram which shows the structure of the server client system which concerns on embodiment of the web page browsing apparatus of this invention. 前記サーバ・クライアント・システムにおけるサーバ装置１０の回路構成を示すブロック図。The block diagram which shows the circuit structure of the server apparatus 10 in the said server client system. 前記サーバ・クライアント・システムにおけるクライアント装置２０の回路構成を示すブロック図。The block diagram which shows the circuit structure of the client apparatus 20 in the said server client system. 前記クライアント装置２０においてサーバ装置（Ａ）１０[http://www.sight.a.co.jp]から取得されたＷｅｂページＰの画面表示例を示す図。The figure which shows the example of a screen display of the web page P acquired from server apparatus (A) 10 [http://www.sight.a.co.jp] in the said client apparatus 20. FIG. 前記図４におけるＷｅｂページＰを記述したＨＴＭＬソースＰhtmを示す図。The figure which shows the HTML source Phtm which described the web page P in the said FIG. 前記図５におけるリンクテキストＬtxt3のリンク先ＵＲＬに対応するＷｅｂページＬＰのＨＴＭＬソースＬＰhtmを示す図。The figure which shows HTML source LPhtm of Web page LP corresponding to the link destination URL of the link text Ltxt3 in the said FIG. 前記図６におけるＨＴＭＬソースＬＰhtmにより記述されたリンク先ＷｅｂページＬＰの画面表示例を示す図。The figure which shows the example of a screen display of the link destination web page LP described by the HTML source LPhtm in the said FIG. リンク元ＷｅｂページＰにてフォーカスしたリンクテキスト「首相、内閣支持率に注文」Ｌtxtと同一（あるいは類似）の複数の見出しテキストＭtxt1,Ｍtxt2を含んでいるリンク先ＷｅｂページＬＰ′の画面表示例を示す図。Example of screen display of linked web page LP 'containing multiple headline texts Mtxt1 and Mtxt2 that are the same as (or similar to) Ltxt, the link text focused on the link source web page P FIG. 前記クライアント装置２０によるＷｅｂページの閲覧に伴いリンク先Ｗｅｂページの本文テキストを取得してその読み上げ音声を出力するための読み上げ対象テキスト取得処理を示すフローチャート。7 is a flowchart showing a text-to-speech acquisition process for acquiring the body text of a linked Web page and outputting the text to be read when the client device 20 browses the Web page. 前記クライアント装置２０の読み上げ対象テキスト取得処理に伴うリンクテキストと同じ文字列を含む要素のリストアップ処理を示すフローチャート。The flowchart which shows the list-up process of the element containing the same character string as the link text accompanying the reading-target text acquisition process of the said client apparatus. 前記クライアント装置２０の読み上げ対象テキスト取得処理に伴う文字列の「強さ」比較処理を示すフローチャート。The flowchart which shows the "strength" comparison process of the character string accompanying the reading target text acquisition process of the said client apparatus. 前記クライアント装置２０の読み上げ対象テキスト取得処理に伴う本文テキストの抽出処理を示すフローチャート。The flowchart which shows the extraction process of the body text accompanying the reading-target text acquisition process of the said client apparatus.

Explanation of symbols

１０ …サーバ装置
２０ …クライアント装置
１１，２１…ＣＰＵ
１２，２２…バス
１３，２３…ＲＯＭ
２３ａ…Ｗｅｂブラウザプログラム
１４，２４…ＲＡＭ
２４ａ…リンクテキスト類似要素メモリ
２４ｂ…読み上げ対象テキストメモリ
１５，２５…外部記憶装置
１５ａ…Ｗｅｂコンテンツ
１６，２６…入力装置
１７，２７…表示装置
１８，２８…通信Ｉ／Ｆ
２９ａ…音声合成処理部
２９ｂ…音声出力部
ＳＰ …スピーカ
Ｎ …通信ネットワーク
ＦＢ …フレームバッファ
Ｐ …Ｗｅｂページ
Ｐhtm…ＷｅｂページのＨＴＭＬソース
ＬＰ …リンク先Ｗｅｂページ
ＬＰhtm…リンク先ＷｅｂページのＨＴＭＬソース
Ｌtxtn…リンクテキスト
Ｍtxt…見出しテキスト
Ｈtxt…本文テキスト DESCRIPTION OF SYMBOLS 10 ... Server apparatus 20 ... Client apparatus 11, 21 ... CPU
12, 22 ... Bus 13, 23 ... ROM
23a ... Web browser program 14, 24 ... RAM
24a ... Link text similar element memory 24b ... Read target text memory 15, 25 ... External storage device 15a ... Web content 16,26 ... Input device 17,27 ... Display device 18,28 ... Communication I / F
29a ... speech synthesis processing unit 29b ... speech output unit SP ... speaker N ... communication network FB ... frame buffer P ... web page Phtm ... html source of web page LP ... link destination web page LPhtm ... HTML source of link destination web page Ltxtn ... Link text Mtxt ... Headline text Htxt ... Body text

Claims

Page receiving means for receiving page information;
Page display means for expanding and displaying the page information received by the page receiving means on a display screen;
A page that obtains linked page information corresponding to the link text that is in focus when any link text is focused on among the link text included in the page information displayed by the page display means. Acquisition means;
Element extraction means for extracting an element including a character string that is the same as or similar to the link text from the linked page information acquired by the page acquisition means;
Text extraction means for extracting the body text included after the element extracted by the element extraction means;
Output means for outputting the body text extracted by the text extraction means;
A page browsing device comprising:

The page acquisition means performs a jump instruction operation for opening link destination page information corresponding to the link text by selecting the link text from the link text included in the page information displayed by the page display means. Before doing, when the focus is on any link text, get the linked page information corresponding to the link text that has the focus,
The output means performs a jump instruction operation to open link destination page information corresponding to the link text by selecting the link text from the link text included in the page information displayed by the page display means. Before, output the body text extracted by the text extraction means,
The page browsing apparatus according to claim 1.

An emphasis element extraction means for extracting an element including an emphasized character string among character strings included in the plurality of elements when a plurality of elements are extracted by the element extraction means;
When one element is extracted by the element extracting unit, the text extracting unit extracts a body text included after the element, and when a plurality of elements are extracted by the element extracting unit, Extracting the body text included after the element extracted by the emphasized element extraction means;
The page browsing apparatus according to claim 1 , wherein the page browsing apparatus is characterized.

The emphasis element extraction unit extracts an element including the emphasized character string by using a character string having a large font size or a character string having a font style as an emphasis style as the emphasized character string.
The page browsing apparatus of Claim 3 characterized by the above-mentioned.

The output means converts the body text into speech and outputs the speech.
The page browsing apparatus in any one of Claims 1-4 characterized by the above-mentioned.

The output means expands the body text on the display screen for display output;
The page browsing apparatus in any one of Claims 1-5 characterized by the above-mentioned.

A program for controlling a computer of a page browsing device,
The computer,
Page receiving means for receiving page information,
Page display means for expanding and displaying the page information received by the page receiving means on the display screen;
When the link text included in the page information displayed by this page display means is focused on any link text, the page that obtains the linked page information corresponding to the focused link text Acquisition means,
Element extraction means for extracting an element including a character string that is the same as or similar to the link text from the linked page information acquired by the page acquisition means;
Text extraction means for extracting the body text included after the element extracted by the element extraction means;
Output means for outputting the body text extracted by the text extraction means;
A computer-readable program designed to function as a computer.