JP4598800B2

JP4598800B2 - Content conversion apparatus, content conversion method, and content conversion program

Info

Publication number: JP4598800B2
Application number: JP2007132416A
Authority: JP
Inventors: 大介朝井; 昌洋渡辺; 陽子浅野
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2007-05-18
Filing date: 2007-05-18
Publication date: 2010-12-15
Anticipated expiration: 2027-05-18
Also published as: JP2008287538A

Description

本発明は、音声ブラウザ、テキストブラウザ、白黒モニタ、白黒印刷等、特定の環境下でもコンテンツ情報が欠落しないように変換する装置、方法およびプログラムに関し、特に、コンテンツ内で装飾が付与された箇所を、音声ブラウザ、テキストブラウザ、白黒モニタ、白黒印刷環境下で利用しても、上記装飾の存在が読み上げられるように、代替情報を付与して変換する技術に関する。
The present invention relates to an apparatus, a method, and a program for converting content information so as not to be lost even in a specific environment such as a voice browser, a text browser, a black and white monitor, and black and white printing. Further, the present invention relates to a technique for adding and converting alternative information so that the presence of the decoration is read out even when used in a voice browser, text browser, monochrome monitor, and monochrome printing environment.

インターネットの普及は進展し、平成１７年度末の時点で、インターネットの人口普及率は６６．８％に達している。自治体から企業まで、多岐にわたるウェブサイトが公開され、ウェブサイトは、日常生活において重要な情報源の１つである。さらに、近年において、ｗｅｂｌｏｇ、ＳＮＳ等、個人が情報発信し、ウェブという世界の中で、社会的コミュニティを形成することも可能である。社会生活における重要な基盤ともなりつつあり、誰でも使えること、つまり、アクセシビリティは、重要な要素である。 The spread of the Internet has progressed, and as of the end of 2005, the population penetration rate of the Internet has reached 66.8%. A wide variety of websites are released from local governments to companies, and the websites are one of the important information sources in daily life. Furthermore, in recent years, it is also possible for individuals such as web log and SNS to send out information and form a social community in the world of the web. It is becoming an important foundation in social life, and anyone can use it, that is, accessibility is an important factor.

一般的に、多くの利用者は、ウェブブラウザに表示された画面を目で見ることによって、視覚的に情報を取得し、マウスを使用し、画面内を選択しながら操作することが多い。その一方で、視覚的に情報を得ることができない利用者もウェブサイトを利用している。その方法は、ウェブコンテンツの中身を音声で読み上げる音声読み上げソフトを利用し、情報を音によって取得している（たとえば、非特許文献１参照）。 In general, many users often obtain information visually by viewing a screen displayed on a web browser and use a mouse to select and operate the screen. On the other hand, users who cannot obtain information visually also use websites. The method uses voice reading software that reads the contents of web content by voice, and acquires information by sound (for example, see Non-Patent Document 1).

しかし、ウェブコンテンツから視覚的に得られる情報を、全て音声読み上げソフトによって取得できるわけではなく、音声読み上げソフト利用者は、ウェブコンテンツ利用時に情報の欠落という問題に直面しているのが現状である。また、音声読み上げソフトに限らず、白黒ディスプレイやテキストのみを表示するブラウザ、白黒印刷でも、ウェブコンテンツ利用時に情報が欠落するという問題が発生する。
北村浩三、浅川智恵子、伊藤隆、伊東伸泰、西村雅史著「音声認識・合成によるホームページの閲覧方式」情報処理学会研究報告、音声言語情報処理２５−２、１９９９年、ｐ．７−１２ However, not all the information that can be obtained visually from web content can be obtained by voice reading software, and users of voice reading software are currently facing the problem of missing information when using web content. . In addition to voice reading software, a black and white display, a browser that displays only text, and black and white printing also have a problem that information is lost when using web content.
Kitamura Kozo, Asakawa Chieko, Ito Takashi, Ito Nobuyasu, Nishimura Masafumi “Homepage Browsing Method by Speech Recognition / Synthesis” Information Processing Society of Japan, Spoken Language Information Processing 25-2, 1999, p. 7-12

欠落する情報の１つとして、コンテンツ内の任意要素に付与されている装飾が挙げられる。たとえば、コンテンツ内のテキスト列の一部が、強調表示のために太字表示されていたとする。しかし、音声読み上げソフトでは、文字装飾を読み上げることができないので、太字文字以外と同様に読み上げる。したがって、音声読み上げソフト利用者には、強調部分が伝わらないという問題がある。 As one piece of missing information, there is a decoration given to an arbitrary element in the content. For example, it is assumed that a part of the text string in the content is displayed in bold for highlighting. However, since the text-to-speech software cannot read out the character decorations, it is read out in the same manner as for the non-bold characters. Therefore, there is a problem that the emphasized part is not transmitted to the voice reading software user.

また、たとえば、「赤字の地区は、月曜日、青字の地区は、火曜日にごみ収集」のように、文字の色別で情報を分類していたとする。音声読み上げソフトは、文字の色情報も伝えることができないので、利用者は、どの地区が赤字であるのかが、分からないという問題がある。 In addition, for example, suppose that information is classified by character color, such as “Trash collection is on Monday for red letter districts and Tuesday for blue letter districts”. Since the voice-to-speech software cannot convey the color information of characters, there is a problem that the user does not know which district is in red.

本発明は、ウェブコンテンツ等の所定記述言語で記述されているコンテンツ内の装飾が付与されている箇所を、特定の環境下で利用しても、装飾の存在を欠落させないようにすることができるコンテンツ変換装置、コンテンツ変換方法およびコンテンツ変換プログラムを提供することを目的とする。
The present invention can prevent the presence of a decoration from being lost even if a portion provided with a decoration in content described in a predetermined description language such as web content is used in a specific environment. It is an object to provide a content conversion device, a content conversion method, and a content conversion program.

本発明は、
所定記述言語で記述されているコンテンツを取得するコンテンツ取得手段と；
上記取得したコンテンツを解析し、上記コンテンツに含まれるテキストを装飾する装飾内容と装飾対象とを示す情報である装飾情報を抽出する装飾情報抽出手段と；
上記装飾内容を表現する音声読み上げ用の説明文を、上記取得したコンテンツに付与する代替情報付与手段と；
上記代替情報付与手段によって付与された後のコンテンツを出力するコンテンツ出力手段と；
を有し、上記説明文が付与されたコンテンツは、視覚的にコンテンツを表示するテキストブラウザで表示した際に付加部分であることが分かるように、上記説明文が特定の記号で囲まれているコンテンツ変換装置である。
The present invention
Content acquisition means for acquiring content described in a predetermined description language;
A decoration information extraction unit that analyzes the acquired content and extracts decoration information that is information indicating a decoration content and a decoration target that decorates the text included in the content;
Alternative information giving means for giving an explanation for reading aloud expressing the decoration content to the acquired content;
Content output means for outputting the content provided by the alternative information providing means;
The content to which the description is added is surrounded by a specific symbol so that it can be seen that the content is an additional part when displayed with a text browser that visually displays the content . It is a content conversion device.

本発明によれば、ウェブコンテンツ等の所定記述言語で記述されたコンテンツにおいて、文字色や太字等、コンテンツに付加されている装飾を、音声読み上げソフトによって読み上げることができるという効果を奏する。
ADVANTAGE OF THE INVENTION According to this invention, there exists an effect that the decoration added to content, such as a character color and bold type, can be read out by audio | voice reading software in the content described in predetermined description languages, such as web content.

発明を実施するための最良の形態は、以下の実施例である。 The best mode for carrying out the invention is the following examples.

（クライアント型）
図１は、本発明の実施例１であるコンテンツ変換装置１０を有するコンテンツ変換システム１００を示すブロック図である。 (Client type)
FIG. 1 is a block diagram showing a content conversion system 100 having a content conversion apparatus 10 that is Embodiment 1 of the present invention.

実施例１は、ウェブコンテンツ内において、任意のテキストに装飾された文字色を音声読み上げソフトでも読めるように、装飾情報を、上記ウェブコンテンツに付与するように変換する実施例である。音声読み上げソフトによってウェブコンテンツを閲覧する者が、利用する上記音声読み上げソフト内において実施される。なお、「装飾」は、テキストに関連付けされた情報であり、文字の太さ、フォント、文字色、文字サイズ、文字網かけ、下線等である。また、「装飾情報」は、コンテンツを装飾する装飾内容と装飾対象とを示す情報である。 Embodiment 1 is an embodiment in which decoration information is converted so as to be given to the web content so that the text color decorated in an arbitrary text can be read by voice reading software in the web content. A person who browses web contents with the voice reading software is implemented in the voice reading software used. Note that “decoration” is information associated with text, and includes character thickness, font, character color, character size, character shading, underline, and the like. The “decoration information” is information indicating the decoration contents that decorate the content and the decoration target.

コンテンツ変換装置１０は、コンテンツ取得部１１と、装飾情報抽出部１２と、コンテンツ変換部１３と、コンテンツ出力部１４とを有し、たとえば、コンピュータによって実現される。 The content conversion apparatus 10 includes a content acquisition unit 11, a decoration information extraction unit 12, a content conversion unit 13, and a content output unit 14, and is realized by a computer, for example.

コンテンツ取得部１１は、変換する対象であるコンテンツを取得するコンテンツ取得手段である。 The content acquisition unit 11 is content acquisition means for acquiring content to be converted.

装飾情報抽出部１２は、コンテンツ取得部１１が取得したコンテンツを解析し、そのコンテンツを装飾する装飾内容と装飾対象とを示す情報である装飾情報を抽出する装飾情報抽出手段の例である。実施例１において、上記装飾対象は、色が設定されているテキスト箇所である。 The decoration information extraction unit 12 is an example of a decoration information extraction unit that analyzes the content acquired by the content acquisition unit 11 and extracts decoration information that is information indicating a decoration content and a decoration target that decorates the content. In the first embodiment, the decoration target is a text portion where a color is set.

コンテンツ変換部１３は、装飾情報抽出部１２が抽出した装飾箇所の前後に、音声読み上げソフトで上記装飾内容（実施例１では、文字色）を読み上げることができるように、代替情報を付与し、コンテンツを変換するコンテンツ変換手段である。 The content conversion unit 13 assigns alternative information so that the decoration content (character color in the first embodiment) can be read out by the voice reading software before and after the decoration part extracted by the decoration information extraction unit 12. Content conversion means for converting content.

コンテンツ出力部１４は、コンテンツ変換部１３が変換したコンテンツを出力するコンテンツ出力手段である。 The content output unit 14 is a content output unit that outputs the content converted by the content conversion unit 13.

図２は、コンテンツ変換装置１０の動作を示すフローチャートである。 FIG. 2 is a flowchart showing the operation of the content conversion apparatus 10.

Ｓ１１で、コンテンツ取得部１１が、変換の対象であるコンテンツを取得する。実施例１では、コンテンツとして、マークアップの言語の１つであり、文書と共に構造・装飾等を示す表現が付加されているＨＴＭＬ文書を取得する場合について説明する。 In S11, the content acquisition unit 11 acquires content to be converted. In the first embodiment, a case will be described in which an HTML document, which is one of markup languages and to which an expression indicating structure / decoration is added together with a document, is acquired as content.

図３は、実施例１における変換対象のコンテンツであるＨＴＭＬ文書の例を示す図である。 FIG. 3 is a diagram illustrating an example of an HTML document that is content to be converted in the first embodiment.

図４は、実施例１において、変換対象のコンテンツをブラウザで表示している例を示す図である。 FIG. 4 is a diagram illustrating an example in which the content to be converted is displayed on the browser in the first embodiment.

実施例１では、図３に示すようなＨＴＭＬ文書が取得され、一般的なウェブブラウザを利用し、図４に示すように表示される。 In the first embodiment, an HTML document as shown in FIG. 3 is acquired and displayed using a general web browser as shown in FIG.

図４に示すコンテンツでは、ゴミ収集日についての情報を表示している。コンテンツ中に、「赤字の地区は火曜、木曜、土曜の昼の回収」と記述されている。視覚的に情報を得られる場合には、赤字で記述された「中央１区」、「中央２区」、「西光町２区」、「南光町」、「岩戸区」、「丘区」が、「火曜、木曜、土曜の昼の回収」であることが分かる。しかし、従来の音声読み上げソフトでは、文字への装飾情報を読み上げず、テキスト内容のみを読み上げるので、どの区・町が赤字であるのかを判別することができず、ゴミ回収日が分からないという問題がある。 In the content shown in FIG. 4, information about the garbage collection date is displayed. In the content, it is described that “the deficit area is the daytime collection on Tuesday, Thursday and Saturday”. If the information can be obtained visually, the “Chuo 1 Ward”, “Chuo 2 Ward”, “Seikocho 2 Ward”, “Nankocho”, “Iwato Ward”, “Oka Ward” written in red , "Tuesday, Thursday, Saturday lunch collection". However, the conventional speech-to-speech software does not read out the decoration information on the characters, but only the text content, so it is impossible to determine which ward / town is in red and the garbage collection date is unknown There is.

次に、Ｓ１２で、装飾箇所取得部１２が取得したコンテンツに基づいて、装飾が施されている箇所を抽出する。 Next, in S12, a location where decoration is applied is extracted based on the content acquired by the decoration location acquisition unit 12.

図５は、実施例１における装飾リストを示す図である。 FIG. 5 is a diagram illustrating a decoration list according to the first embodiment.

実施例１では、装飾対象として、文字色が設定されているテキスト箇所と、その箇所に設定されている文字色とを抽出し、記憶装置に記憶すると、図５に示すように、抽出した装飾対象（抽出箇所）と、その行数と、文字色とを持つ装飾リストが抽出される。ここで、行数は、抽出箇所を判定するものであるが、ＨＴＭＬ文書内の位置を示すことができるものであれば、行数以外のものでもよい。 In the first embodiment, as a decoration target, a text portion in which a character color is set and a character color set in the portion are extracted and stored in a storage device. As shown in FIG. A decoration list having an object (extraction location), the number of lines, and a character color is extracted. Here, the number of lines is used to determine the extraction location, but may be other than the number of lines as long as it can indicate the position in the HTML document.

上記設定されている文字色は、色のコードと、その色についての読み上げ語句との２つから構成されている。ＨＴＭＬ文書規則では、＃に続く１６進数を用いて、色を表現していることが多い。色と１６進数とは、対応付けられ、読み上げ語句は、１６進数によって、対応する色の呼び名を読み上げる語句である。 The set character color is composed of two colors: a color code and a reading phrase for the color. In HTML document rules, colors are often expressed using hexadecimal numbers following #. A color and a hexadecimal number are associated with each other, and the read-out phrase is a phrase that reads out the name of the corresponding color in hexadecimal.

次に、Ｓ１３では、コンテンツ変換部１３が、Ｓ１２で抽出した装飾リストに基づいて、装飾内容を読み上げるように、ＨＴＭＬ文書を変換し、記憶装置に記憶する。 Next, in S13, the content conversion unit 13 converts the HTML document so as to read out the decoration content based on the decoration list extracted in S12, and stores it in the storage device.

実施例１では、装飾のうちで、文字色を読み上げるように、装飾箇所の前後に読み上げ用のテキストを付加する。つまり、装飾対象の前に付与する情報であって、装飾対象の始まり部分であることを音声で表現する情報と、装飾対象の後に付与する情報であって、装飾対象の終わり部分であることを音声で表現する情報とである代替情報を付加する。 In the first embodiment, a text to be read out is added before and after the decoration portion so as to read out the character color among the decorations. In other words, it is information that is given before the decoration target, that is, information that expresses by voice that it is the start part of the decoration target, and information that is given after the decoration target and that is the end part of the decoration target. Alternative information that is information expressed by voice is added.

図６は、読み上げテキストを付加した例を示す図である。 FIG. 6 is a diagram illustrating an example in which the text to be read is added.

装飾箇所の始まりで「（ここからあかじ）」のテキストを挿入し、装飾箇所の終わりで「（ここまであかじ）」のテキストを挿入することによって、音声読み上げにおいても、赤字箇所が分かる。また、カッコを付けることによって、テキストブラウザで表示した際に、カッコ内が付加部分であることを示す。 By inserting the text “(from here)” at the beginning of the decoration part, and inserting the text “(from here)” at the end of the decoration part, the red part can be recognized even when reading aloud. In addition, by adding parentheses, it indicates that the portion inside the parentheses is an additional part when displayed in a text browser.

Ｓ１４では、Ｓ１３で変換されたＨＴＭＬ文書を、コンテンツ出力部１４が出力する。表示装置や記憶装置、ネットワークを介した他のシステム等、システムによって、変換されたＨＴＭＬ文書の出力先は、異なる。 In S14, the content output unit 14 outputs the HTML document converted in S13. The output destination of the converted HTML document differs depending on the system, such as a display device, a storage device, or another system via a network.

コンテンツ取得部１１は、コンテンツを構成するＨＴＭＬ文書を取得するが、ＨＴＭＬ文書の代わりに、スタイルを記述するＣＳＳ（Cascade Style Sheet）や、他のマークアップ言語を取得するようにしてもよい。 The content acquisition unit 11 acquires an HTML document constituting the content. However, instead of the HTML document, a CSS (Cascade Style Sheet) describing a style or another markup language may be acquired.

装飾情報抽出部１２が抽出する装飾は、テキストに設定されている文字色であるが、文字色を抽出する代わりに、表の背景色、下線、太字の設定等を抽出するようにしてもよい。また、テキストに限らず、表、画像を、抽出するようにしてもよい。 The decoration extracted by the decoration information extraction unit 12 is the character color set in the text, but instead of extracting the character color, the background color, underline, bold setting, etc. of the table may be extracted. . In addition to text, tables and images may be extracted.

コンテンツ変換部１３は、装飾箇所の前後に、音声読み上げ用のテキストを挿入するが、音声読み上げ用のテキストを挿入する代わりに、＜ＩＭＧ＞属性としてテキストを挿入してもよく、また、ブザー音等の効果音を発するようにしてもよい。 The content conversion unit 13 inserts text for speech reading before and after the decoration part. Instead of inserting text for speech reading, the content conversion unit 13 may insert text as an <IMG> attribute, and a buzzer sound. And so on.

図７は、実施例１において、変換した後のＨＴＭＬ文書の一例を示す図である。 FIG. 7 is a diagram illustrating an example of an HTML document after conversion in the first embodiment.

たとえば、図７に示すように、装飾箇所の前に、「＜ｉｍｇｓｒｃ＝“ｃｌｅａｒ．ｇｉｆ” ａｌｔ＝“ここからあかじ”＞」を挿入し、装飾箇所の後に、「＜ｉｍｇｓｒｃ＝“ｃｌｅａｒ．ｇｉｆ” ａｌｔ＝“ここまであかじ”＞」を挿入するようにしてもよい。そして、視覚的にコンテンツを表示する一般的なブラウザで表示するようにしてもよい。この場合、音声読み上げ用のテキストを表示せずに、音声読み上げソフトが、「ここからあかじ」、「ここまであかじ」と読み上げるので、コンテンツの見た目を崩すことなく表示することができる。
For example, as shown in FIG. 7, “<img src =“ clear. gif ”alt =“ Akaji from here ”>” is inserted, and “<img src =“ clear. You may make it insert gif "alt =" this time ">". And you may make it display with the general browser which displays a content visually. In this case, since the text-to-speech software reads out “Akaji from here” and “Akaji from here” without displaying the text for reading aloud, it can be displayed without breaking the appearance of the content.

（サーバクライアント型）
実施例１は、サーバ上で実行する実施例である。 (Server client type)
Example 1 is an example executed on a server.

図８は、本発明の実施例２であるコンテンツ変換装置４０を有するコンテンツ変換システム２００を示す図である。 FIG. 8 is a diagram showing a content conversion system 200 having a content conversion apparatus 40 that is Embodiment 2 of the present invention.

図１に示す部分と同一部分には、同一符号を付与し、その説明を省略する。 The same reference numerals are given to the same parts as those shown in FIG. 1, and the description thereof is omitted.

コンテンツ変換装置４０は、ネットワークＮＷ２を介して、クライアント装置３０、コンテンツサーバ５０と接続されている。 The content conversion device 40 is connected to the client device 30 and the content server 50 via the network NW2.

クライアント装置３０は、コンテンツ利用者が使用し、通信部３１と、コンテンツ指定部３２と、コンテンツ出力部３３とを有し、たとえばコンピュータ端末で構成されている。 The client device 30 is used by a content user and includes a communication unit 31, a content designating unit 32, and a content output unit 33, and is configured by a computer terminal, for example.

クライアント装置３０によって指定されたコンテンツを、コンテンツ変換装置４０がコンテンツサーバ５０から取得し、この取得したコンテンツ内の装飾情報（装飾箇所等）を抽出し、コンテンツ変換装置４０が音声読み上げ可能な代替情報を付与し、クライアント装置３０に送信する。 The content conversion device 40 acquires the content specified by the client device 30 from the content server 50, extracts the decoration information (decoration location, etc.) in the acquired content, and the content conversion device 40 can read out the alternative information that can be read aloud. Is transmitted to the client device 30.

コンテンツ変換装置４０は、コンテンツ取得部１１と、装飾情報抽出部１２と、コンテンツ変換部１３と、コンテンツ出力部１４と、通信部４５とを有する。 The content conversion device 40 includes a content acquisition unit 11, a decoration information extraction unit 12, a content conversion unit 13, a content output unit 14, and a communication unit 45.

コンテンツサーバ５０は、変換する対象であるコンテンツを蓄積し、この蓄積されたコンテンツは、クライアント装置３０内のコンテンツ出力部３３が出力可能である。実施例２では、コンテンツは、ＨＴＭＬ文書であり、出力部３３は、ＨＴＭＬ文書を音声で読み上げる音声読み上げソフトであるとする。 The content server 50 accumulates content to be converted, and the accumulated content can be output by the content output unit 33 in the client device 30. In the second embodiment, it is assumed that the content is an HTML document, and the output unit 33 is voice reading software that reads the HTML document by voice.

図９は、実施例２におけるクライアント装置３０、コンテンツ変換装置４０の処理を示すフローチャートである。 FIG. 9 is a flowchart illustrating processing of the client device 30 and the content conversion device 40 according to the second embodiment.

Ｓ２１で、クライアント装置３０が、コンテンツを指定する。すなわち、文書利用者が、クライアント装置３０内のコンテンツ指定部３２を介して、変換する対象であるコンテンツを、ＵＲＬ等によって指定し、この指定された情報は、ネットワークＮＷ２を介して、通信部３１からコンテンツ変換装置４０に送信される。 In S21, the client device 30 specifies content. That is, the document user designates the content to be converted via the content designation unit 32 in the client device 30 by using a URL or the like, and the designated information is transmitted via the network NW2 to the communication unit 31. To the content conversion device 40.

Ｓ２２で、コンテンツ変換装置４０がコンテンツの情報を取得する。すなわち、コンテンツ変換装置４０内の通信部４５は、クライアント装置３０内の通信部３１から受信したＵＲＬ情報を受信し、指定されたＵＲＬに基づいて、コンテンツサーバ５０上にあるコンテンツを取得する。実施例２では、コンテンツサーバ５０からＨＴＭＬ文書を取得する。コンテンツ変換装置４０の通信部４５は、取得したコンテンツを、コンテンツ取得部１１に送信する。 In S22, the content conversion device 40 acquires content information. That is, the communication unit 45 in the content conversion device 40 receives the URL information received from the communication unit 31 in the client device 30, and acquires content on the content server 50 based on the designated URL. In the second embodiment, an HTML document is acquired from the content server 50. The communication unit 45 of the content conversion device 40 transmits the acquired content to the content acquisition unit 11.

次に、Ｓ２３で、装飾情報抽出部１２は、コンテンツ取得部１１が取得したコンテンツ内の装飾情報（装飾箇所等）を抽出する。装飾情報を抽出する方法は、実施例１と同様でもよい。Ｓ２４で、コンテンツ変換部１３は、実施例１と同様に、コンテンツに読み上げ用の付加情報を付与するように変換する。 Next, in S <b> 23, the decoration information extraction unit 12 extracts decoration information (decoration part and the like) in the content acquired by the content acquisition unit 11. The method for extracting the decoration information may be the same as in the first embodiment. In S24, the content conversion unit 13 performs conversion so that additional information for reading is added to the content, as in the first embodiment.

次に、Ｓ２５で、上記変換されたコンテンツを、コンテンツ変換装置４０の通信部４５に送り、変換されたコンテンツを、コンテンツ変換装置４０の通信部４５が、ネットワークＮＷ２を介して、クライアント装置３０に送信する。 Next, in S25, the converted content is sent to the communication unit 45 of the content conversion device 40, and the communication unit 45 of the content conversion device 40 sends the converted content to the client device 30 via the network NW2. Send.

クライアント装置３０の通信部３１は、コンテンツ変換装置４０が変換したコンテンツを取得し、コンテンツ出力部３３に送る。Ｓ２６で、コンテンツ出力部３３が、通信部３１から受信したコンテンツを解釈し、音声として出力する。 The communication unit 31 of the client device 30 acquires the content converted by the content conversion device 40 and sends it to the content output unit 33. In S26, the content output unit 33 interprets the content received from the communication unit 31 and outputs it as sound.

クライアント装置３０におけるコンテンツ指定部３２が、ＵＲＬによってコンテンツを指定するが、ＵＲＬを使用する代わりに、文書名等、コンテンツサーバ５０内の文書を一意に特定する内容を使用するようにしてもよい。 The content specifying unit 32 in the client device 30 specifies the content by the URL, but instead of using the URL, content that uniquely identifies the document in the content server 50 such as a document name may be used.

クライアント装置３０におけるコンテンツ出力部３３において、音声読み上げソフトによって、コンテンツを音声で出力するが、一般的なブラウザのように、音声出力する代わりに、画像で出力する等、利用者に何らかの手段で出力すればよい。音声出力する代わりに出力する画像の例として、装飾箇所の前に、「（ここから赤字）」を挿入し、装飾対象の後に、「（ここまで赤字）」を挿入して画像表示する例や、「赤⇒」を画像として挿入する例が考えられる。なお、「赤⇒」をテキストとして挿入するようにしてもよい。 In the content output unit 33 in the client device 30, the content is output by voice using voice reading software. Instead of outputting the voice as in a general browser, the content is output to the user by some means such as an image. do it. As an example of an image to be output instead of outputting audio, an example of displaying an image by inserting “(red here)” before the decoration part, and inserting “(red here)” after the decoration target, , “Red ⇒” can be inserted as an image. “Red⇒” may be inserted as text.

上記実施例によれば、ウェブコンテンツ等の所定記述言語で記述されているコンテンツにおいて、文字色や太字等、コンテンツに付加されている装飾について、音声読み上げソフトによって読み上げることができる。 According to the above embodiment, in contents described in a predetermined description language such as web contents, decorations added to the contents such as character color and bold can be read out by the voice reading software.

また、従来技術では、音声読み上げソフト利用者は得ることができなかった情報（たとえば、強調部分等）を、上記実施例によれば取得可能である。これによって、音声読み上げソフト利用者は、利用できるウェブコンテンツが増加するので、多くの情報を取得することができる。 In addition, according to the above-described embodiment, information (for example, an emphasis part) that cannot be obtained by the voice reading software user can be obtained according to the conventional technology. As a result, the user of the voice reading software increases the amount of web content that can be used, and thus can acquire a large amount of information.

さらに、テキストブラウザや白黒ディスプレイ利用者も、従来であれば、欠落して取得できなかった装飾の情報を取得することができる。 Furthermore, a text browser and a monochrome display user can also acquire decoration information that could not be acquired due to loss in the past.

つまり、上記実施例は、所定記述言語で記述されているコンテンツを取得するコンテンツ取得手段と、上記取得したコンテンツを解析し、上記コンテンツを装飾する装飾内容と装飾対象とを示す情報である装飾情報を抽出する装飾情報抽出手段と、上記装飾内容を表現する読み上げ可能な代替情報を、上記取得したコンテンツに付与する代替情報付与手段と、上記代替情報付与手段によって付与された後のコンテンツを出力するコンテンツ出力手段とを有するコンテンツ変換装置である。 In other words, in the above embodiment, content acquisition means for acquiring content described in a predetermined description language, and decoration information that is information indicating the decoration content and the decoration target for analyzing the acquired content and decorating the content Information extracting means for extracting the content, alternative information giving means for giving the read-out alternative information representing the decoration content to the acquired content, and the content after being given by the alternative information giving means A content conversion apparatus having content output means.

この場合、上記代替情報は、上記装飾対象の前に付与する情報であって、上記装飾対象の始まり部分であることを音声で表現する情報と、上記装飾対象の後に付与する情報であって、上記装飾対象の終わり部分であることを音声で表現する情報とである。 In this case, the alternative information is information that is given before the decoration target, information that expresses by voice that it is the beginning of the decoration target, and information that is given after the decoration target, It is information that expresses by voice that it is the end part of the decoration object.

また、上記装飾内容を示す情報は、テキストに装飾されている色情報である。 The information indicating the decoration content is color information decorated in the text.

また、上記実施例を方法の発明として把握するウことができる．つまり、上記実施例は、所定記述言語で記述されているコンテンツを取得し、記憶装置に記憶するコンテンツ取得工程と、上記取得したコンテンツを解析し、上記コンテンツを装飾する装飾内容と装飾対象とを示す情報である装飾情報を抽出し、記憶手段に記憶する装飾情報抽出工程と、上記装飾内容を表現する読み上げ可能な代替情報を、上記取得したコンテンツに付与し、記憶装置に記憶する代替情報付与工程と、上記代替情報付与工程で付与された後のコンテンツを出力するコンテンツ出力工程とを有するコンテンツ変換方法の例である。 In addition, the above embodiment can be grasped as a method invention. In other words, in the above embodiment, the content described in the predetermined description language is acquired and stored in the storage device, the acquired content is analyzed, and the decoration content and the decoration target for decorating the content are determined. The decoration information extraction process which extracts the decoration information which is the information shown, and memorize | stores it in a memory | storage means, and the alternative information provision which gives the read-out alternative information expressing the said decoration content to the said acquired content, and memorize | stores it in a memory | storage device It is an example of the content conversion method which has a process and the content output process which outputs the content after provided in the said alternative information provision process.

さらに、上記実施例をプログラムの発明として把握することができる。つまり、上記実施例は、請求項４記載の方法をコンピュータに実行させるコンテンツ変換プログラムである。 Further, the above embodiment can be grasped as a program invention. That is, the said Example is a content conversion program which makes a computer perform the method of Claim 4.

また、上記プログラムを記録した記録媒体を、システムまたは装置に供給し、そのシステムまたは装置のＣＰＵ（ＭＰＵ）が記録媒体に格納されたプログラムを読み出し実行するようにしてもよい。この場合、上記記録媒体は、ＣＤ、ＤＶＤ、ＨＤ、半導体メモリ等である。
Alternatively, a recording medium in which the program is recorded may be supplied to a system or apparatus, and a CPU (MPU) of the system or apparatus may read and execute the program stored in the recording medium. In this case, the recording medium is a CD, DVD, HD, semiconductor memory, or the like.

本発明の実施例１であるコンテンツ変換装置１０を有するコンテンツ変換システム１００を示すブロック図である。1 is a block diagram illustrating a content conversion system 100 including a content conversion apparatus 10 that is Embodiment 1 of the present invention. コンテンツ変換装置１０の動作を示すフローチャートである。3 is a flowchart showing the operation of the content conversion apparatus 10. 実施例１における変換対象のコンテンツであるＨＴＭＬ文書の例を示す図である。6 is a diagram illustrating an example of an HTML document that is a content to be converted in Embodiment 1. FIG. 実施例１において、変換対象のコンテンツをブラウザで表示している例を示す図である。In Example 1, it is a figure which shows the example which is displaying the content of conversion object with a browser. 実施例１における装飾リストを示す図である。It is a figure which shows the decoration list | wrist in Example 1. FIG. 読み上げテキストを付加した例を示す図である。It is a figure which shows the example which added the reading text. 実施例１において、変換した後のＨＴＭＬ文書の一例を示す図である。In Example 1, it is a figure which shows an example of the HTML document after converting. 本発明の実施例２であるコンテンツ変換装置４０を有するコンテンツ変換システム２００を示す図である。It is a figure which shows the content conversion system 200 which has the content conversion apparatus 40 which is Example 2 of this invention. 実施例２におけるクライアント装置３０、コンテンツ変換装置４０の処理を示すフローチャートである。10 is a flowchart illustrating processing of a client device 30 and a content conversion device 40 in Embodiment 2.

Explanation of symbols

１０…コンテンツ変換装置、
１１…コンテンツ取得部、
１２…装飾抽出部、
１３…コンテンツ変換部、
１４…コンテンツ出力部、
２０…コンテンツサーバ、
３０…クライアント装置、
３１…通信部、
３２…コンテンツ指定部、
３３…コンテンツ出力部、
４０…コンテンツ変換装置、
４５…通信部、
５０…コンテンツサーバ。 10: Content conversion device,
11 ... content acquisition unit,
12 ... decoration extraction part,
13 ... content conversion part,
14 ... content output section,
20 ... Content server,
30: Client device,
31 ... communication section,
32 ... Content designation part,
33. Content output unit,
40. Content conversion device,
45. Communication part,
50: Content server.

Claims

Content acquisition means for acquiring content described in a predetermined description language;
A decoration information extraction unit that analyzes the acquired content and extracts decoration information that is information indicating a decoration content and a decoration target that decorates the text included in the content;
Alternative information giving means for giving an explanation for reading aloud expressing the decoration content to the acquired content;
Content output means for outputting the content provided by the alternative information providing means;
The content to which the description is added is surrounded by a specific symbol so that it can be seen that the content is an additional part when displayed with a text browser that visually displays the content . The content conversion apparatus characterized by the above-mentioned.

In claim 1,
The description is given before the decoration object and expresses the beginning part of the decoration object, and the sentence is given after the decoration object and expresses the end part of the decoration object A content conversion apparatus characterized by the above.

In claim 2,
The content conversion apparatus, wherein the information indicating the decoration content is color information decorated in a text.

The content conversion program for functioning a computer as each means which comprises the content conversion apparatus of any one of Claim 1 thru | or 3.