JP2019175245A

JP2019175245A - Speech synthesizer

Info

Publication number: JP2019175245A
Application number: JP2018064313A
Authority: JP
Inventors: Kiyotaka Takami; Yoichi Ogido; Akemi Tomioka
Original assignee: Shinbiyo Shupan Co Ltd
Current assignee: Shinbiyo Shupan Co Ltd
Priority date: 2018-03-29
Filing date: 2018-03-29
Publication date: 2019-10-10
Anticipated expiration: 2038-03-29
Also published as: JP6506438B1

Abstract

PROBLEM TO BE SOLVED: To provide a speech synthesizer capable of reproducing a desired sentence at a desired speed and a desired voice pitch. SOLUTION: The speech synthesizer includes: a speech synthesizing unit that synthesizes speech from a predetermined text at a predetermined speed and a predetermined voice pitch and outputs it; a text memory that stores the text to be input to the speech synthesizing unit; a reading speed memory that stores the reading speed used by the speech synthesizing unit; and a voice pitch memory that stores the voice pitch used by the speech synthesizing unit. The speech synthesizing unit synthesizes and outputs the text stored in the text memory at the reading speed stored in the reading speed memory and the voice pitch stored in the voice pitch memory. SELECTED DRAWING: Figure 5

Description

The present invention relates to a speech synthesizer.

A known conventional audio playback device includes: an audio information storage means that stores, as foreign-language audio information, words and example sentences illustrating the use of those words, each as a playback unit; a playback processing unit that plays back the stored foreign-language audio information one playback unit at a time at a changeable playback speed; and a repeat control unit that repeats playback of the foreign-language audio information played back by the playback processing unit. When the playback unit is an example sentence, the repeat control unit further repeats playback of that unit in a plurality of stages and, after a stage played at standard speed, controls the playback processing unit so that the number of repeats and the playback speed can be set for each stage (see Patent Document 1).

Patent Document 1: JP 2017-045052 A

However, with the conventional audio playback device, a user could not have a desired sentence played back at a desired speed or a desired voice pitch.

An object of the present invention is to provide a speech synthesizer that allows a user to have a desired sentence played back at a desired speed and a desired voice pitch.

A speech synthesizer according to the present invention includes: a speech synthesizing unit that synthesizes speech from a predetermined text at a predetermined speed and a predetermined pitch and outputs it; a text memory that stores the text to be input to the speech synthesizing unit; a reading speed memory that stores the reading speed used by the speech synthesizing unit; and a voice pitch memory that stores the voice pitch used by the speech synthesizing unit. The speech synthesizing unit synthesizes and outputs the text stored in the text memory at the reading speed stored in the reading speed memory and the voice pitch stored in the voice pitch memory.

The speech synthesizer described above may further include a display unit that displays, on a display screen, a text button showing the text to be input to the speech synthesizing unit and a control button for setting the reading speed and the voice pitch. Even when the display position of the text button changes within the display screen, the display position of the control button may remain fixed at a predetermined position in the display screen.

In the speech synthesizer described above, the values of the reading speed memory and the voice pitch memory may be reset when the display moves outside the page containing the text button, and may be maintained as long as a text button within that page is displayed.

As described above, the present invention provides a speech synthesizing unit that synthesizes speech from a predetermined text at a predetermined speed and a predetermined pitch and outputs it, a text memory that stores the text to be input to the speech synthesizing unit, a reading speed memory that stores the reading speed used by the speech synthesizing unit, and a voice pitch memory that stores the voice pitch used by the speech synthesizing unit. Because the speech synthesizing unit synthesizes and outputs the text stored in the text memory at the reading speed stored in the reading speed memory and the voice pitch stored in the voice pitch memory, a desired sentence can be played back at a desired speed and a desired voice pitch.

FIG. 1 is a diagram (part 1) showing a book associated with a speech synthesizer according to an embodiment of the present invention.
FIG. 2 is a diagram (part 2) showing the book associated with the speech synthesizer according to the embodiment of the present invention.
FIG. 3 is a diagram (part 1) showing a website that serves as the operation screen of the speech synthesizer according to the embodiment of the present invention.
FIG. 4 is a diagram (part 2) showing the website that serves as the operation screen of the speech synthesizer according to the embodiment of the present invention.
FIG. 5 is a block diagram showing the speech synthesizer according to the embodiment of the present invention.
FIG. 6 is a flowchart (part 1) showing a processing method of the speech synthesizer according to the embodiment of the present invention.
FIG. 7 is a flowchart (part 2) showing the processing method of the speech synthesizer according to the embodiment of the present invention.
FIG. 8 is a flowchart (part 3) showing the processing method of the speech synthesizer according to the embodiment of the present invention.
FIG. 9 is a flowchart (part 4) showing the processing method of the speech synthesizer according to the embodiment of the present invention.

[One Embodiment]
A speech synthesizer according to an embodiment of the present invention will be described with reference to FIGS. 1 to 9.

(English conversation book)
The speech synthesizer according to this embodiment is provided as a supplement to an English conversation book. FIGS. 1 and 2 show the main parts of the English conversation book associated with the speech synthesizer of this embodiment.

The cover of the English conversation book (FIG. 1(a)) shows the title "Kurukuru CONVERSATION" in decorative lettering, together with the publisher "SHINBIYO CO., LTD".

The table of contents of the English conversation book is shown in FIG. 1(b). It shows that the book gives example English conversations for hairdressers, organized by treatment.

The body of the English conversation book (FIG. 1(c), FIG. 2(a), FIG. 2(b)) gives example English conversations for the basic customer service chapter. FIG. 2(a) gives English examples for the Japanese phrases meaning "Hello, this is SHINBIYO." (as said when answering the telephone) and "What day and time would you like?". FIG. 2(b) gives English examples for the Japanese phrases meaning "Would you like to request a particular stylist?" and "The request fee is XX yen."

The page facing page 83 of the English conversation book (page 84 in the table of contents) (FIG. 2(c)) describes the "Communication Tools" provided as a supplement to the book. A "listening tool" is introduced as one of the "Communication Tools". The page states, "You can listen to audio of the phrases in this book on our website," and a QR code (registered trademark) for accessing the website is printed. For readers accessing from a PC, the website URL "https://www.shinbiyo.com/books/other/kurukuruc/listen/" is also given.

(Website)
FIGS. 3 and 4 show the website that serves as the operation screen of the speech synthesizer according to the embodiment of the present invention.

FIG. 3(a) is the login screen for the operation screen of the speech synthesizer of this embodiment. The operator enters a password in the text input field IW of the login screen.

When the predetermined password is entered in the text input field IW of the login screen, the first English conversation display screen (FIG. 3(b)) appears, showing treatment-specific English conversation examples equivalent to the content of the English conversation book.

The English conversation display screen of FIG. 3(b) is headed "Kurukuru conversation Listening Tool". Below the heading, a table of contents of the treatment-specific English conversation examples lists "Customer service basics", "Cut", "Finish", "Color", "Perm", "Straight perm", and "Head spa". Clicking an entry in this table of contents jumps to the page listing the English conversation examples for the clicked treatment.

After the table of contents, the English conversation examples of the first chapter, customer service basics, are displayed. Each example is shown together with the corresponding page number of the English conversation book, such as "P.4".

The Japanese sentence meaning "Hello, this is SHINBIYO." is displayed, together with the corresponding English sentence "Hello, this is Shinbiyo. How may I help you?" and a katakana rendering of its pronunciation.

The English sentence "Hello, this is Shinbiyo. How may I help you?" is displayed inside a clickable English sentence button TB1.

When the operator clicks the English sentence button TB1, the English sentence "Hello, this is Shinbiyo. How may I help you?" displayed in the button is output as speech.

A control button CB for setting the reading speed and the voice pitch is displayed at a predetermined position at the upper right of the English conversation display screen of FIG. 3(b).

Under the label "voice speed" (reading speed), a "slow <<" button, a "reset" button, and a ">> fast" button are displayed.

Under the label "voice pitch", a "low <<" button, a "reset" button, and a ">> high" button are displayed.

By clicking the buttons within the control button CB, the operator can set the reading speed and the voice pitch.

When the operator clicks the "reset" button under "voice speed", the reading speed is set to the standard value.

When the operator clicks the "slow <<" button under "voice speed", the reading speed is set to a slower value.

When the operator clicks the ">> fast" button under "voice speed", the reading speed is set to a faster value.

When the operator clicks the "reset" button under "voice pitch", the voice pitch is set to the standard value.

When the operator clicks the "low <<" button under "voice pitch", the voice pitch is set to a lower value.

When the operator clicks the ">> high" button under "voice pitch", the voice pitch is set to a higher value.

Scrolling the English conversation display screen of FIG. 3(b) downward brings up the screen of FIG. 4(a). Following the English conversation example and English sentence button TB1 of FIG. 3(b), the screen of FIG. 4(a) shows the English conversation example and English sentence button TB2 for page "P.4" of the English conversation book, and the English conversation example and English sentence button TB3 for page "P.5".

Scrolling the English conversation display screen of FIG. 4(a) further downward brings up the screen of FIG. 4(b). Following the content of FIG. 4(a), the screen of FIG. 4(b) shows the English conversation example and English sentence button TB4 for page "P.5" of the English conversation book, and the English conversation example and English sentence button TB5 for page "P.6". Scrolling further downward displays the subsequent English conversation examples and English sentence buttons in order.

The control button CB for setting the reading speed and the voice pitch is also displayed on the English conversation display screens of FIGS. 4(a) and 4(b), at the same predetermined position at the upper right of the screen as in FIG. 3(b). Even when the English conversation display screen is scrolled and the displayed examples change, the control button CB always stays at that same position. The operator can therefore operate the control button CB easily, because its position on the screen does not change when the displayed content changes.

In this embodiment, even when the English conversation display screen is scrolled, as from FIG. 3(b) to FIG. 4(a) or from FIG. 4(a) to FIG. 4(b), the set values of the reading speed and the voice pitch are maintained unless the control button CB is clicked.

Furthermore, in this embodiment, all the English conversation examples, including those for every treatment, are contained in a single page. Therefore, even when the operator clicks an entry in the table of contents shown in FIG. 3(b) and jumps to the English conversation examples for that treatment, the set values of the reading speed and the voice pitch are maintained unless the control button CB is clicked.

As a result, the operator can keep listening to English conversation examples at the reading speed and voice pitch that suit the operator.

When the operator jumps to a page entirely separate from the page displaying the English conversation examples, for example a book introduction page, it is judged that this operator has finished using the English conversation examples, and the set values of the reading speed and the voice pitch are reset to the standard value (1.0).

As a result, when, for example, a new user opens the English conversation examples, that user can freely set a suitable reading speed and voice pitch without being confused by the previous user's settings.

(Speech synthesizer)
FIG. 5 shows a block diagram of the speech synthesizer according to the embodiment of the present invention.

The speech synthesizer of this embodiment is implemented by a program that includes the Web Speech API, which makes it possible to incorporate speech data into web applications. The program causes a computer to function as the speech synthesizer of this embodiment, and is recorded on a computer-readable recording medium.

The Web Speech API (https://developer.mozilla.org/ja/docs/Web/API/Web_Speech_API) consists of two parts: speech synthesis (Text-to-Speech) and speech recognition (Asynchronous Speech Recognition). This embodiment uses speech synthesis (Text-to-Speech).
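Since only the synthesis half of the Web Speech API is used, a page would typically check that this half is present before offering playback. The following is a minimal sketch of such a check and not the applicant's code. The helper name `hasSpeechSynthesis` is hypothetical; `speechSynthesis` and `SpeechSynthesisUtterance` are the globals that the synthesis part of the API exposes in supporting browsers.

```typescript
// Hypothetical helper: reports whether a host object exposes the synthesis
// half of the Web Speech API (the `speechSynthesis` controller object and the
// `SpeechSynthesisUtterance` constructor). Recognition is not checked because
// the embodiment does not use it.
function hasSpeechSynthesis(host: Record<string, unknown>): boolean {
  return (
    typeof host["speechSynthesis"] === "object" &&
    host["speechSynthesis"] !== null &&
    typeof host["SpeechSynthesisUtterance"] === "function"
  );
}

// In a browser this inspects `window`; in other runtimes it is simply false.
const synthesisAvailable = hasSpeechSynthesis(
  globalThis as unknown as Record<string, unknown>
);
console.log("speech synthesis available:", synthesisAvailable);
```

Passing the host object in as a parameter, rather than reading the globals directly, keeps the check usable outside a browser.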

As shown in FIG. 5, the speech synthesizer 10 is provided with a speech synthesis API 12. When the operator clicks an English sentence button 14, the English sentence shown on that button is stored in an English text memory 16.

A control button 20 is provided for setting the reading speed and the voice pitch.

For setting the reading speed, the control button 20 includes a slow button 22, a reset button 23, and a fast button 24. For setting the voice pitch, it includes a low button 26, a reset button 27, and a high button 28.

The operator sets the reading speed value and the voice pitch value by operating the control button 20.

The reading speed value set with the control button 20 is stored in a reading speed memory 30. The voice pitch value set with the control button 20 is stored in a voice pitch memory 32.

The speech synthesis API 12 outputs, from an audio output unit 34, synthesized speech of the English sentence stored in the English text memory 16, at the reading speed stored in the reading speed memory 30 and the voice pitch stored in the voice pitch memory 32.
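The data flow of FIG. 5 can be sketched as follows, under stated assumptions. The `SynthMemories` object, the function names, and the `"en-US"` locale are hypothetical and do not come from the source. The `text`, `rate`, `pitch`, and `lang` properties and `speechSynthesis.speak()` are part of the Web Speech API.

```typescript
// The three memories of FIG. 5, modelled as one plain object (hypothetical shape).
interface SynthMemories {
  text: string;  // English text memory 16
  rate: number;  // reading speed memory 30
  pitch: number; // voice pitch memory 32
}

// Minimal structural view of the utterance properties this sketch touches.
interface UtteranceLike {
  text: string;
  rate: number;
  pitch: number;
  lang: string;
}

// Copies the stored values onto an utterance. Kept separate from the browser
// call so that it can be exercised without a speech engine.
function applyMemories<T extends UtteranceLike>(utterance: T, m: SynthMemories): T {
  utterance.text = m.text;
  utterance.rate = m.rate;
  utterance.pitch = m.pitch;
  utterance.lang = "en-US"; // assumption: the source does not name a locale
  return utterance;
}

// Speaks the stored text when the host provides speech synthesis (the role of
// the audio output unit 34); returns false when it does not.
function speakFromMemories(m: SynthMemories): boolean {
  const host = globalThis as any;
  if (typeof host.SpeechSynthesisUtterance !== "function" || !host.speechSynthesis) {
    return false;
  }
  const utterance = applyMemories(new host.SpeechSynthesisUtterance(m.text), m);
  host.speechSynthesis.speak(utterance);
  return true;
}
```

In a browser, a click handler on an English sentence button could store the button text in `text` and then call `speakFromMemories`, so that every sentence is spoken with whatever speed and pitch were last set.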

(Reading speed setting)
FIG. 6 shows a flowchart of the reading speed setting method in the speech synthesizer according to the embodiment of the present invention.

First, it is determined whether the reading speed reset button 23 has been pressed (step S10). If it has, 1.0 is stored in the reading speed memory 30 as the reading speed value (step S11), and processing returns to the initial step S10. If it has not, processing proceeds to step S12.

Next, it is determined whether the reading speed slow button 22 has been pressed (step S12). If it has, 0.1 is subtracted from the reading speed value stored in the reading speed memory 30 (step S13), and processing returns to the initial step S10. If the stored reading speed value is already at the lower limit (0.1), no further subtraction is performed. If the slow button 22 has not been pressed, processing proceeds to step S14.

Next, it is determined whether the reading speed fast button 24 has been pressed (step S14). If it has, 0.1 is added to the reading speed value stored in the reading speed memory 30 (step S15). If the stored reading speed value is already at the upper limit (10.0), no further addition is performed. If the fast button 24 has not been pressed, processing returns to the initial step S10.
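The reading speed flow of FIG. 6 amounts to a small update function with clamping. The sketch below is one possible rendering, not the applicant's code. The names are hypothetical, and rounding to one decimal place is an added assumption to keep repeated 0.1 steps from accumulating floating-point drift. The limits 0.1 and 10.0 and the standard value 1.0 are the ones stated above.

```typescript
type SpeedButton = "reset" | "slow" | "fast";

const SPEED_STANDARD = 1.0; // value stored in step S11
const SPEED_MIN = 0.1;      // lower limit stated for step S13
const SPEED_MAX = 10.0;     // upper limit stated for step S15
const SPEED_STEP = 0.1;

// Returns the next content of the reading speed memory 30 for one button press.
function nextSpeed(current: number, pressed: SpeedButton): number {
  if (pressed === "reset") {
    return SPEED_STANDARD; // S10 -> S11
  }
  // S12 -> S13 (slow) or S14 -> S15 (fast)
  const delta = pressed === "slow" ? -SPEED_STEP : SPEED_STEP;
  const clamped = Math.min(SPEED_MAX, Math.max(SPEED_MIN, current + delta));
  // Assumption: round to one decimal so the stored value stays on the 0.1 grid.
  return Math.round(clamped * 10) / 10;
}

console.log(nextSpeed(1.0, "slow")); // one step slower than the standard speed
```

The flowchart polls the three buttons in a loop. In an event-driven page the same function would be called once per click, which gives the same stored values.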

(Voice pitch setting)
FIG. 7 shows a flowchart of the voice pitch setting method in the speech synthesizer according to the embodiment of the present invention.

First, it is determined whether the voice pitch reset button 27 has been pressed (step S20). If it has, 1.0 is stored in the voice pitch memory 32 as the voice pitch value (step S21), and processing returns to the initial step S20. If it has not, processing proceeds to step S22.

Next, it is determined whether the voice pitch low button 26 has been pressed (step S22). If it has, 0.1 is subtracted from the voice pitch value stored in the voice pitch memory 32 (step S23), and processing returns to the initial step S20. If the stored voice pitch value is already at the lower limit (0.0), no further subtraction is performed. If the low button 26 has not been pressed, processing proceeds to step S24.

Next, it is determined whether the voice pitch high button 28 has been pressed (step S24). If it has, 0.1 is added to the voice pitch value stored in the voice pitch memory 32 (step S25). If the stored voice pitch value is already at the upper limit (2.0), no further addition is performed. If the high button 28 has not been pressed, processing returns to the initial step S20.
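The voice pitch flow of FIG. 7 has the same shape as a bounded stepper, with its own limits (0.0 to 2.0). As an illustration only, the sketch below expresses the stepper over a parameterized range and replays a sequence of presses. The names `ValueRange`, `step`, and `replay` are hypothetical, and the rounding to one decimal place is an added assumption.

```typescript
interface ValueRange {
  min: number;
  max: number;
  standard: number;
}

type Press = "reset" | "down" | "up";

// Limits and standard value stated for the voice pitch memory 32.
const PITCH_RANGE: ValueRange = { min: 0.0, max: 2.0, standard: 1.0 };

// One pass through the flowchart for a single button press.
function step(value: number, press: Press, range: ValueRange): number {
  if (press === "reset") {
    return range.standard; // S20 -> S21
  }
  const moved = value + (press === "down" ? -0.1 : 0.1); // S23 or S25
  const clamped = Math.min(range.max, Math.max(range.min, moved));
  return Math.round(clamped * 10) / 10; // assumption: stay on the 0.1 grid
}

// Applies a sequence of presses, starting from the standard value.
function replay(presses: Press[], range: ValueRange): number {
  return presses.reduce((value, press) => step(value, press, range), range.standard);
}

console.log(replay(["up", "up", "down"], PITCH_RANGE));
```

Parameterizing the range is a design choice of this sketch; the source describes the speed and pitch flows as two separate flowcharts.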

(Control of set values when the display screen is scrolled and when the page is changed)
In the speech synthesizer according to the embodiment of the present invention, the set values of the reading speed and the voice pitch are maintained even when the English conversation display screen is scrolled. They are also maintained when the operator clicks an entry in the table of contents of the treatment-specific English conversation examples, jumps to the examples for that treatment, and the English conversation display screen scrolls as a result. In contrast, when the operator jumps to a page entirely separate from the page displaying the English conversation examples, it is judged that this operator has finished using the examples, and the set values of the reading speed and the voice pitch are reset.

FIG. 8 shows a flowchart of the processing performed when the English conversation display screen is scrolled in the speech synthesizer according to the embodiment of the present invention.

When the operator scrolls the English conversation display screen, the Y coordinate of the top of the control button 20 is acquired (step S30). The Y coordinate here is the vertical coordinate within the single tall page that contains all the English conversation examples.

Next, it is determined whether the Y coordinate of the top of the display screen after scrolling matches the Y coordinate of the top of the control button (step S31).

If the two do not match, the Y coordinate of the top of the display screen and the Y coordinate of the top of the control button are made to coincide (step S32), and the scroll processing ends.

If the two already match, the scroll processing ends without further action.
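The scroll handling of FIG. 8 can be sketched as a pure function on page coordinates. This is an interpretation and not the applicant's code: it assumes that the control button is the element repositioned in step S32, so that its top follows the top of the visible area. The type and function names are hypothetical.

```typescript
// Page-coordinate snapshot taken after a scroll (hypothetical shape).
interface ScrollState {
  viewportTopY: number; // Y of the top of the visible area within the tall page
  controlTopY: number;  // Y of the top of the control button 20 (acquired in S30)
}

// One pass through the flowchart for a single scroll event.
function onScroll(state: ScrollState): ScrollState {
  if (state.controlTopY === state.viewportTopY) {
    return state; // S31: already aligned, nothing to do
  }
  // S32: make the two coordinates coincide (assumption: the button is moved).
  return { ...state, controlTopY: state.viewportTopY };
}

console.log(onScroll({ viewportTopY: 480, controlTopY: 0 }));
```

In a real page the same visual result is commonly obtained without script by giving the control element `position: fixed` in CSS. The explicit comparison here only mirrors the steps of the flowchart.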

FIG. 9 shows a flowchart of the processing performed at a page jump in the speech synthesizer according to the embodiment of the present invention.

For example, when the operator clicks a link (step S40), it is determined whether the jump destination of that link lies within the same page that contains the English conversation examples of this embodiment (step S41).

If the jump destination is within the same page, the jump is made while the contents of the English text memory 16, the reading speed memory 30, the voice pitch memory 32, and the other memories are retained (step S42), and the jump processing ends.

If the jump destination is a different page, outside the page that contains the English conversation examples, the contents of the English text memory 16, the reading speed memory 30, the voice pitch memory 32, and the other memories are reset (step S43), the jump to the specified page is made (step S44), and the jump processing ends.
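The decision in steps S41 to S43 can be sketched as follows. The sketch assumes that destinations inside the page are addressed by URL fragments on the same absolute URL, which the source does not state. The function names, the `#cut` fragment, the second URL, and the empty string used for the reset text are likewise hypothetical.

```typescript
interface Memories {
  text: string;  // English text memory 16
  rate: number;  // reading speed memory 30
  pitch: number; // voice pitch memory 32
}

// Reset contents: 1.0 is the standard value given in the description;
// the empty text is an assumption.
const RESET_MEMORIES: Memories = { text: "", rate: 1.0, pitch: 1.0 };

// Drops a trailing "#fragment" so that two in-page targets compare equal.
function stripFragment(url: string): string {
  const hashIndex = url.indexOf("#");
  return hashIndex === -1 ? url : url.slice(0, hashIndex);
}

// Returns the memory contents to carry across the jump.
function memoriesAfterJump(currentUrl: string, targetUrl: string, m: Memories): Memories {
  const samePage = stripFragment(currentUrl) === stripFragment(targetUrl); // S41
  return samePage ? m : { ...RESET_MEMORIES }; // S42: keep, S43: reset
}

const listenPage = "https://www.shinbiyo.com/books/other/kurukuruc/listen/";
const current: Memories = { text: "Hello", rate: 0.7, pitch: 1.3 };
console.log(memoriesAfterJump(listenPage, listenPage + "#cut", current));
console.log(memoriesAfterJump(listenPage, "https://www.shinbiyo.com/books/", current));
```

If the memories are ordinary script variables of the page, leaving the page discards them anyway. The explicit reset in this sketch simply follows the order of steps in the flowchart.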

[Modified Embodiments]
The present invention is not limited to the above embodiment, and various modifications are possible.

For example, in the above embodiment the present invention is applied to speech output of the English conversation examples in an English conversation book, but it is not limited to this. The present invention may be applied to speech output of conversation examples in other languages, such as Chinese, or to speech output of other text, such as the reading aloud of a book.

10: speech synthesizer
12: speech synthesis API
14: English sentence button
16: English text memory
20: control button
22: slow button
23: reset button
24: fast button
26: low button
27: reset button
28: high button
30: reading speed memory
32: voice pitch memory
34: audio output unit
IW: text input field
TB1 to TB5: English sentence buttons
CB: control button

A speech synthesizer according to the present invention includes: a speech synthesizing unit that synthesizes speech from a predetermined text at a predetermined speed and a predetermined pitch and outputs it; a text memory that stores the text to be input to the speech synthesizing unit; a reading speed memory that stores the reading speed used by the speech synthesizing unit; a voice pitch memory that stores the voice pitch used by the speech synthesizing unit; and a display unit that displays, on a display screen, a text button showing the text to be input to the speech synthesizing unit and a control button for setting the reading speed and the voice pitch. The speech synthesizing unit synthesizes and outputs the text stored in the text memory at the reading speed stored in the reading speed memory and the voice pitch stored in the voice pitch memory. Even when the display position of the text button changes within the display screen, the display position of the control button is fixed at a predetermined position in the display screen. The values of the reading speed memory and the voice pitch memory are reset when the display moves outside the page containing the text button, and are maintained as long as a text button within that page is displayed.

Claims

A speech synthesizer comprising:
a speech synthesizing unit that synthesizes speech from a predetermined text at a predetermined speed and a predetermined pitch and outputs it;
a text memory that stores the text to be input to the speech synthesizing unit;
a reading speed memory that stores the reading speed used by the speech synthesizing unit; and
a voice pitch memory that stores the voice pitch used by the speech synthesizing unit,
wherein the speech synthesizing unit synthesizes and outputs the text stored in the text memory at the reading speed stored in the reading speed memory and the voice pitch stored in the voice pitch memory.

The speech synthesizer according to claim 1, further comprising
a display unit that displays, on a display screen, a text button showing the text to be input to the speech synthesizing unit and a control button for setting the reading speed and the voice pitch,
wherein, even when the display position of the text button changes within the display screen, the display position of the control button is fixed at a predetermined position in the display screen.

The speech synthesizer according to claim 2,
wherein the values of the reading speed memory and the voice pitch memory are reset when the display moves outside the page containing the text button, and are maintained as long as the text button within the page is displayed.

A computer-readable recording medium on which a program for causing a computer to function as the speech synthesizer according to any one of claims 1 to 3 is recorded.