JP2005077873A

JP2005077873A - Method and system for providing speech content

Info

Publication number: JP2005077873A
Application number: JP2003309587A
Authority: JP
Inventors: Hajime Kudo; 哉工藤; Nagatoshi So; 永敏宋; Takayuki Kamogawa; 高之加茂川
Original assignee: Hitachi Software Engineering Co Ltd
Current assignee: Hitachi Software Engineering Co Ltd
Priority date: 2003-09-02
Filing date: 2003-09-02
Publication date: 2005-03-24

Abstract

<P>PROBLEM TO BE SOLVED: To provide a method and a system for providing speech contents that can realize guidance broadcast of voice quality, and to provide a tone matching use environment. <P>SOLUTION: A speech-providing service device has the steps of: receiving document data, corresponding to speech contents that a requester needs from a requester's device through a communication line, such as Web; selecting a vocal actor who voices the received document data out of vocal actors previously registered in a vocal actor table of the speech providing service device and sending document data to be vocalized to a terminal device of the selected vocal actor; receiving speech contents, corresponding to the document data that the vocal actor voices through the terminal device of the vocal actor by the speech providing service device; and transmitting the speech contents of the vocal actor received from the terminal device of the vocal actor to the requester's device of the requester by the speech providing device. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

本発明は、依頼主から依頼された文章を声優に発声させた音声コンテンツをＷｅｂ等の通信回線経由で依頼主に提供する音声コンテンツの提供方法およびシステムに係り、例えばデパートや各種イベント会場における案内放送などに用いて好適な音声コンテンツの提供方法およびシステムに関するものである。 The present invention relates to an audio content providing method and system for providing audio content in which a voice requested by a client is uttered by a voice actor to the client via a communication line such as the Web, for example, guidance in department stores and various event venues. The present invention relates to a method and system for providing audio content suitable for use in broadcasting or the like.

従来、音声データの販売を行う方法として、例えば特開２００２−３５８０９２公報および特開２００２−５５６８６公報に記載のものが知られている。
特開２００２−３５８０９２公報に記載の技術は、任意の脚本データに自分の好きな俳優や声優の合成音声を配し、脚本データの朗読を提供するものである。
また、特開２００２−５５６８６公報に記載の技術は、キャラクタの声を表情や行動態様に合わせて一定の品質でユーザに提供するものである。
特開２００２−３５８０９２公報特開２００２−５５６８６公報 Conventionally, as a method for selling audio data, for example, methods described in Japanese Patent Application Laid-Open Nos. 2002-358092 and 2002-55686 are known.
The technique described in Japanese Patent Laid-Open No. 2002-358092 provides read-out of script data by arranging synthesized speech of a favorite actor or voice actor in arbitrary script data.
The technique described in Japanese Patent Laid-Open No. 2002-55686 provides a user's voice with a certain quality according to facial expressions and behaviors.
JP 2002-358092 A JP 2002-55686 A

ところで、デパート等の店内放送では、通常、専門の係員が各種の案内放送を行うようにしているが、係員が不在の場合、デパートの店内放送にふさわしい口調で案内放送をできなくなる。
そこで、前記の従来技術に記載された方法を適用し、案内放送を行うことが考えられる。
しかし、特開２００２−３５８０９２公報および特開２００２−５５６８６公報に記載された技術では、予め案内放送用の音声データまたは音素データを記憶させておく必要があるため、迷子の案内放送などのように臨時の案内放送を行うことができないという問題がある。
また、音声データまたは音素データを合成してユーザから要求された音声データを作成するものであるため、自然な発音に聞こえず、違和感がある。特に、例えばデパートの案内放送と遊園地の案内放送とでは口調が異なっているが、合成音では使用環境にふさわしい口調の案内放送を実現するのが難しいという問題がある。また、デパートの案内放送であっても、ＡデパートとＢデパートでは案内放送の口調に微妙な相違があり、合成音ではそれぞれのデパートの個性に適した声質や口調の案内放送を実現するのが難しいという問題がある。 By the way, in-store broadcasting of department stores and the like, a professional staff usually performs various types of guidance broadcasting. However, if there is no staff, guidance broadcasting cannot be performed in a tone suitable for in-store broadcasting of department stores.
Therefore, it is conceivable to perform guidance broadcasting by applying the method described in the prior art.
However, in the techniques described in Japanese Patent Laid-Open Nos. 2002-358092 and 2002-55686, it is necessary to store voice data or phoneme data for guidance broadcasting in advance. There is a problem that temporary information broadcasting cannot be performed.
Moreover, since voice data requested by the user is created by synthesizing voice data or phoneme data, it does not sound natural and is uncomfortable. In particular, for example, department store guidance broadcasts and amusement park guidance broadcasts have different tone, but there is a problem that it is difficult to achieve a tone guidance broadcast suitable for the use environment with synthesized sounds. In addition, even for department store guidance broadcasts, there are subtle differences in the tone of guidance broadcasts between department stores A and B, and with synthesized sounds, it is possible to realize guidance broadcasts with voice quality and tone suitable for the individuality of each department store. There is a problem that it is difficult.

本発明の目的は、使用環境にふさわしい声質や口調の案内放送などを実現することができる音声コンテンツの提供方法およびシステムを提供することにある。 An object of the present invention is to provide an audio content providing method and system capable of realizing voice quality and tone guidance broadcasting suitable for the use environment.

上記目的を達成するため、本発明に係る音声コンテンツの提供方法は、依頼主から依頼された文章を声優に発声させた音声コンテンツをインターネット等の通信回線経由で依頼主に提供する音声コンテンツの提供方法であって、
音声提供サービス装置において依頼主装置から依頼主が必要としている音声コンテンツに対応する文章データをインターネット等の通信回線経由で受付ける第１のステップと、受付けた文章データを発声させる声優を前記音声提供サービス装置の声優テーブルに予め登録されている声優の中から選択し、選択した声優の端末装置に対し発声対象の文章データをインターネット等の通信回線経由で送信する第２のステップと、声優の端末装置から当該声優が発声した前記文章データに対応する音声コンテンツを前記音声提供サービス装置においてインターネット等の通信回線経由で受信する第３のステップと、前記音声提供サービス装置において声優の端末装置から受信した声優の音声コンテンツを依頼元の依頼主装置にインターネット等の通信回線経由で送信する第４のステップとを備えることを特徴とする。
また、前記第２のステップにおいて、依頼主装置から受信した声優の固有名詞、性別、声質、言語などの指定情報に基づいて文章データを発声させる声優を前記声優テーブルに予め登録されている声優の中から選択することを特徴とする。
また、前記第２のステップにおいて、依頼主装置から受信した依頼主の業種、音声コンテンツの用途、使用環境の情報に基づいて文章データを発声させる声優を前記声優テーブルに予め登録されている声優の中から選択することを特徴とする。
また、声優の音声コンテンツを依頼元の依頼主装置に送信した後、依頼元に対し音声コンテンツの利用料金を課金する第５のステップをさらに備えることを特徴とする。 In order to achieve the above object, the audio content providing method according to the present invention provides audio content in which a voice request is made by a voice actor to speak to a client via a communication line such as the Internet. A method,
In the voice providing service device, the voice providing service includes a first step of receiving text data corresponding to the voice content required by the client from the client main device via a communication line such as the Internet, and a voice actor that utters the received text data. A second step of selecting voice actors pre-registered in the voice actor table of the apparatus, and transmitting the sentence data to be spoken to the selected voice actor terminal device via a communication line such as the Internet; and a voice actor terminal device A third step of receiving audio content corresponding to the sentence data uttered by the voice actor from the voice providing service device via a communication line such as the Internet; and a voice actor received from the voice actor terminal device in the voice providing service device. The audio content of the Characterized in that it comprises a fourth step of transmitting over the wire.
Further, in the second step, voice actors that utter text data based on designation information such as proper names, gender, voice quality, language, etc. of voice actors received from the client apparatus are stored in the voice actor table in advance. It is characterized by selecting from among them.
In the second step, voice actors who utter text data based on information about the client's business type, usage of audio content, and usage environment received from the client device are stored in advance in the voice actor table. It is characterized by selecting from among them.
Further, the present invention is further characterized by further comprising a fifth step of charging the request source for the usage fee of the audio content after transmitting the voice content of the voice actor to the requesting main apparatus.

また、依頼主から依頼された文章を声優に発声させた音声コンテンツをインターネット等の通信回線経由で依頼主に提供する音声コンテンツの提供方法であって、
音声提供サービス装置において依頼主装置から依頼主が必要としている音声コンテンツに対応する文章データをインターネット等の通信回線経由で受付ける第１のステップと、受付けた文章データを発声させる声優を前記音声提供サービス装置の声優テーブルに予め登録されている声優の中から選択し、選択した１乃至複数の声優の端末装置に対し発声対象の文章データをインターネット等の通信回線経由で送信する第２のステップと、１乃至複数の声優の端末装置から当該声優が発声した前記文章データに対応する音声コンテンツを前記音声提供サービス装置においてインターネット等の通信回線経由で受信する第３のステップと、前記音声提供サービス装置において１乃至複数の声優の端末装置から受信した声優の音声コンテンツを１組にして依頼元の依頼主装置にインターネット等の通信回線経由で送信する第４のステップと、依頼主装置において受信した１乃至複数の声優の音声コンテンツを試聴用に発音させる第５のステップと、発音された試聴用の１乃至複数の音声コンテンツの中から依頼主が選択した声優の音声コンテンツの選択情報を依頼主装置から前記音声提供サービス装置にインターネット等の通信回線経由で送信する第６のステップと、音声提供サービス装置において依頼主が選択した声優の音声コンテンツの選択情報に基づき、選択された声優の端末装置に対し前記文章データの発声録音依頼をインターネット等の通信回線経由で送信する第７のステップと、発声録音依頼を受けた声優が発声録音した前記文章データの録音音声コンテンツを当該声優の前記端末装置または他の装置から前記音声提供サービス装置にインターネット等の通信回線経由で送信する第８のステップと、音声提供サービス装置において受信した声優の前記録音音声コンテンツを依頼元の依頼主装置にインターネット等の通信回線経由で送信する第９のステップとを備えることを特徴とする。
また、前記第２のステップにおいて、依頼主装置から受信した声優の固有名詞、性別、声質、言語などの指定情報に基づいて文章データを発声させる声優を前記声優テーブルに予め登録されている声優の中から選択することを特徴とする。
また、前記第２のステップにおいて、依頼主装置から受信した依頼主の業種、音声コンテンツの用途、使用環境の情報に基づいて文章データを発声させる声優を前記声優テーブルに予め登録されている声優の中から選択することを特徴とする。
また、前記第２〜第４のステップおよび第７、第８のステップにおいて、声優の端末装置として携帯電話機機を使用して文章データの受信、視聴用の音声コンテンツの送信、録音音声コンテンツの送信を行うことを特徴とする。
また、声優の録音音声コンテンツを依頼元の依頼主装置に送信した後、依頼元に対し録音音声コンテンツの利用料金を課金する第１０のステップをさらに備えることを特徴とする。 In addition, the audio content providing method for providing the client with the voice content in which the voice requested by the client is uttered by the voice actor via a communication line such as the Internet,
In the voice providing service device, the voice providing service includes a first step of receiving text data corresponding to the voice content required by the client from the client main device via a communication line such as the Internet, and a voice actor that utters the received text data. A second step of selecting from among voice actors registered in advance in the voice actor table of the device, and transmitting the sentence data to be spoken to the selected one or more voice actor terminal devices via a communication line such as the Internet; A third step of receiving audio content corresponding to the sentence data uttered by the voice actor from one or more voice actor terminal devices in the audio providing service device via a communication line such as the Internet; and in the audio providing service device One set of voice content of voice actors received from one or more voice actor terminal devices A fourth step of transmitting the requesting main apparatus of the request source via a communication line such as the Internet, and a fifth step of causing the audio content of one or more voice actors received by the requesting main apparatus to sound for trial listening; The sixth information is transmitted from the requesting main apparatus to the voice providing service apparatus via the communication line such as the Internet from the requesting main apparatus. A voice recording request for the sentence data is transmitted to the selected voice actor terminal device via a communication line such as the Internet based on the voice information selection information of the voice actor selected by the client in the voice providing service device. Step 7 and the recorded voice content of the sentence data recorded by the voice actor who received the voice recording request An eighth step of transmitting from the terminal device or another device to the voice providing service device via a communication line such as the Internet; and the recorded voice content of the voice actor received by the voice providing service device is sent to the requesting requesting client device via the Internet And a ninth step of transmitting via a communication line such as the above.
Further, in the second step, voice actors that utter text data based on designation information such as proper names, gender, voice quality, language, etc. of voice actors received from the client apparatus are stored in the voice actor table in advance. It is characterized by selecting from among them.
In the second step, voice actors who utter text data based on information about the client's business type, usage of audio content, and usage environment received from the client device are stored in advance in the voice actor table. It is characterized by selecting from among them.
In the second to fourth steps and the seventh and eighth steps, the mobile phone is used as a voice actor terminal device to receive text data, to transmit audio content for viewing, and to transmit recorded audio content. It is characterized by performing.
In addition, the method further comprises a tenth step of charging the request source for the usage fee of the recorded audio content after transmitting the recorded audio content of the voice actor to the requesting main apparatus.

本発明に係る音声コンテンツの提供システムは、依頼主が使用する依頼主装置と、声優が使用する端末装置と、音声コンテンツの提供サービスを行う音声提供サービス装置とから成り、依頼主から依頼された文章を声優に発声させた音声コンテンツをインターネット等の通信回線経由で依頼主に提供する音声コンテンツの提供システムであって、
音声提供サービス装置が、
前記依頼主装置から依頼主が必要としている音声コンテンツに対応する文章データをインターネット等の通信回線経由で受付ける第１の手段と、受付けた文章データを発声させる声優を声優テーブルに予め登録されている声優の中から選択し、選択した声優の端末装置に対し発声対象の文章データをインターネット等の通信回線経由で送信する第２の手段と、声優の端末装置から当該声優が発声した前記文章データに対応する音声コンテンツをインターネット等の通信回線経由で受信する第３の手段と、声優の端末装置から受信した声優の音声コンテンツを依頼元の依頼主装置にインターネット等の通信回線経由で送信する第４の手段とを備え、
前記依頼主装置が、依頼主から文章データを受付け、前記音声提供サービス装置にインターネット等の通信回線経由で送信する第５の手段と、前記音声提供サービス装置から前記文章データの音声コンテンツをインターネット等の通信回線経由で受信する第６の手段とを備え、
前記声優の端末装置が、前記音声提供サービス装置から文章データをインターネット等の通信回線経由で受信する第７の手段と、受信した文章データを当該声優が発声した音声コンテンツを受付け、前記音声提供サービス装置にインターネット等の通信回線経由で送信する第８の手段とを備えることを特徴とする。
また、前記依頼主装置が、文章データを発音させる声優の固有名詞、性別、声質、言語などの指定情報を送信する手段をさらに備え、
前記音声提供サービス装置の前記第２の手段は、依頼主装置から受信した声優の固有名詞、性別、声質、言語などの前記指定情報に基づいて文章データを発声させる声優を前記声優テーブルに予め登録されている声優の中から選択することを特徴とする。
また、前記依頼主装置が、依頼主の業種、音声コンテンツの用途、使用環境の情報を送信する手段をさらに備え、
前記音声提供サービス装置の前記第２の手段は、依頼主装置から受信した依頼主の業種、音声コンテンツの用途、使用環境の情報に基づいて文章データを発声させる声優を前記声優テーブルに予め登録されている声優の中から選択することを特徴とする。
また、前記音声提供サービス装置が、声優の音声コンテンツを依頼元の依頼主装置に送信した後、依頼元に対し音声コンテンツの利用料金を課金する課金手段をさらに備えることを特徴とする。 An audio content providing system according to the present invention includes a request main device used by a client, a terminal device used by a voice actor, and an audio providing service device that provides an audio content providing service. A voice content providing system that provides voice content to a client via a communication line such as the Internet, wherein the voice content is produced by a voice actor.
Voice providing service device
First means for receiving sentence data corresponding to the audio content required by the requester from the requester apparatus via a communication line such as the Internet, and voice actors for uttering the received sentence data are registered in advance in the voice actor table. A second means for selecting the voice actor from the voice actor terminal device, and transmitting the text data to be spoken to the selected voice actor terminal device via a communication line such as the Internet; and the sentence data uttered by the voice actor from the voice actor terminal device. Third means for receiving the corresponding audio content via a communication line such as the Internet, and fourth means for transmitting the voice content of the voice actor received from the voice actor terminal device to the requesting main apparatus of the request source via the communication line such as the Internet. With the means of
A fifth means for receiving text data from the requester and transmitting it to the voice providing service device via a communication line such as the Internet; and a voice content of the text data from the voice providing service device on the Internet. And 6th means for receiving via the communication line of
The voice actor terminal device receives seventh voice data received from the voice providing service device via a communication line such as the Internet, and voice content uttered by the voice actor for the received text data. And an eighth means for transmitting to the apparatus via a communication line such as the Internet.
In addition, the requester device further comprises means for transmitting designation information such as a proper noun, gender, voice quality, and language of a voice actor that pronounces sentence data,
The second means of the voice providing service device pre-registers in the voice actor table a voice actor that utters sentence data based on the designation information such as the proper noun, gender, voice quality, language, etc. of the voice actor received from the client main device. It is characterized by selecting from voice actors.
Further, the requester device further comprises means for transmitting information on the client's type of business, usage of audio content, and usage environment,
The second means of the voice providing service device is pre-registered in the voice actor table for a voice actor that utters text data based on information of the client's business type, usage of voice content, and usage environment received from the client main device. It is characterized by selecting from voice actors.
The voice providing service device may further include a billing unit that charges the request source for the usage fee of the voice content after transmitting the voice content of the voice actor to the requesting master device.

また、依頼主が使用する依頼主装置と、声優が使用する端末装置と、音声コンテンツの提供サービスを行う音声提供サービス装置とから成り、依頼主から依頼された文章を声優に発声させた音声コンテンツを依頼主にインターネット等の通信回線経由で提供する音声コンテンツの提供システムであって、
音声提供サービス装置が、
依頼主装置から依頼主が必要としている音声コンテンツに対応する文章データをインターネット等の通信回線経由で受付ける第１の手段と、受付けた文章データを発声させる声優を声優テーブルに予め登録されている声優の中から選択し、選択した１乃至複数の声優の端末装置に対し発声対象の文章データをインターネット等の通信回線経由で送信する第２の手段と、１乃至複数の声優の端末装置から当該声優が発声した前記文章データに対応する音声コンテンツをインターネット等の通信回線経由で受信する第３の手段と、１乃至複数の声優の端末装置から受信した声優の試聴用の音声コンテンツを１組にして依頼元の依頼主装置にインターネット等の通信回線経由で送信する第４の手段と、依頼主が選択した声優の視聴用の音声コンテンツの選択情報に基づき、選択された声優の端末装置に対し前記文章データの発声録音依頼をインターネット等の通信回線経由で送信する第５の手段と、録音依頼した声優の端末装置または他の装置から前記文章データの録音音声コンテンツをインターネット等の通信回線経由で受信し、依頼元の依頼主装置にインターネット等の通信回線経由で送信する第６の手段とを備え、
前記依頼主装置が、依頼主から文章データを受付け、前記音声提供サービス装置にインターネット等の通信回線経由で送信する第７の手段と、前記音声提供サービス装置から前記文章データに対応する音声コンテンツをインターネット等の通信回線経由で受信する第８の手段と、受信した１乃至複数の声優の音声コンテンツを試聴用に発音させる第９の手段と、発音された試聴用の１乃至複数の音声コンテンツの中から依頼主が選択した声優の音声コンテンツの選択情報を前記音声提供サービス装置にインターネット等の通信回線経由で送信する第１０の手段とを備え、
前記声優の端末装置が、前記音声提供サービス装置から文章データをインターネット等の通信回線経由で受信する第１１の手段と、受信した文章データを当該声優が発声した音声コンテンツを受付け、試聴用の音声コンテンツとして前記音声提供サービス装置にインターネット等の通信回線経由で送信する第１２の手段と、前記音声提供サービス装置から前記文章データの発声録音依頼を受信し、前記文章データの録音音声コンテンツを前記音声提供サービス装置にインターネット等の通信回線経由で送信する第１３の手段とを備えることを特徴とする。
また、前記音声提供サービス装置が、声優の録音音声コンテンツを依頼元の依頼主装置に送信した後、依頼元に対し録音音声コンテンツの利用料金を課金する課金手段をさらに備えることを特徴とする。 Also, the audio content that consists of the requester device used by the client, the terminal device used by the voice actor, and the audio providing service device that provides the audio content providing service, the voice requested by the client A system for providing audio content to a client via a communication line such as the Internet,
Voice providing service device
A first means for receiving text data corresponding to the audio content required by the requester from the requester apparatus via a communication line such as the Internet, and a voice actor in which voice actors for speaking the received text data are registered in the voice actor table in advance. A second means for transmitting text data to be uttered to the selected terminal device of one or more voice actors via a communication line such as the Internet, and the voice actors from the one or more voice actor terminal devices. A third means for receiving audio content corresponding to the sentence data uttered by the voice data via a communication line such as the Internet, and a set of audio content for audition of voice actors received from one or more terminal devices of voice actors. A fourth means for transmitting to the requester apparatus of the request source via a communication line such as the Internet, and an audio content for viewing the voice actor selected by the requester Based on the selected information, the fifth means for transmitting the voice data recording request to the selected voice actor terminal device via a communication line such as the Internet, and the voice actor terminal device or other device that requested the recording. Receiving the recorded audio content of the text data via a communication line such as the Internet, and transmitting to the requesting main apparatus of the request source via a communication line such as the Internet,
The requesting apparatus receives text data from the requester and transmits to the voice providing service apparatus via a communication line such as the Internet, and a voice content corresponding to the text data from the voice providing service apparatus. An eighth means for receiving via a communication line such as the Internet; a ninth means for producing the received audio content of one or more voice actors for trial listening; and one or more audio contents for the produced trial listening. Tenth means for transmitting selection information of the voice content of the voice actor selected by the client from among the voice providing service device via a communication line such as the Internet,
The voice actor's terminal device receives eleventh means for receiving text data from the voice providing service device via a communication line such as the Internet, and voice content produced by the voice actor for the received text data. A twelfth means for transmitting the content to the voice providing service device via a communication line such as the Internet; and receiving a voice recording request for the text data from the voice providing service device; And a thirteenth means for transmitting to the provided service device via a communication line such as the Internet.
The voice providing service device may further include a billing unit that charges the requester for a usage fee of the recorded voice content after the voice actor recorded voice content is transmitted to the requesting client main device.

本発明によれば、予め音声データを登録しておく必要がないため、迷子案内などの臨時の案内放送を容易に実現することができる。
また、自然な発音で、使用環境にふさわしい声質や口調の案内放送などを実現することができる。 According to the present invention, since it is not necessary to register voice data in advance, it is possible to easily realize temporary guidance broadcasting such as lost child guidance.
In addition, with natural pronunciation, it is possible to achieve voice quality and tone guidance broadcasting suitable for the usage environment.

以下、本発明を実施する場合の一形態を図面を参照して具体的に説明する。
図１は、本発明の音声コンテンツの提供システムの基本構成を示すシステム構成図である。
ここで図示するシステムでは、音声提供サーバ装置を管理する事業主体１がインターネット等の通信回線でユーザ（依頼主）３の情報端末３ａから声優の指定情報、文章データを受付ける。事業主体１の音声提供サーバ装置は、受付情報を基に声優を選択し、その選択した声優２の端末装置（例えば携帯電話機やパーソナルコンピュータ）に対し、文章データの発声依頼を送信する。発声依頼を受けた声優２は文章データを読み上げる形式で発声し、事業主体１の音声提供サーバ装置に送信する。この場合、文章データは声優が所有する携帯電話機の送話器から入力し、録音せずに事業主体１の音声提供サーバ装置に送信するようにしてもよいし、録音スタジオ等の適切な環境で録音させ、携帯電話機のデータ通信機能を用いて送信させる、あるいは録音した音声コンテンツを声優が使用するパーソナルコンピュータなどから送信させる形態を採用することができる。
また、テレビコマーシャル等の映像に合わせて複数の声優を用いる場合、音声を加える映像および文章データと共にユーザの情報端末３ａから事業主体１の音声提供サーバ装置に送信させる。そして、複数の声優の端末装置に対し、それぞれの声優の発声部分を指定して映像および文章データを送信し、映像の進行に合せて文章データを発声させる。そして、その発声した音声コンテンツを事業主体１の音声提供サーバ装置に返信させる。音声提供サーバ装置では、それぞれの声優から受信した音声コンテンツをユーザ３の情報端末３ａに返信する。
ユーザ３の情報端末３ａに音声コンテンツを送信した後または送信に先立ち、決済会社４により利用料金の精算を行う。ユーザ３では受信した音声コンテンツを用いて案内放送などを行う。 Hereinafter, an embodiment for carrying out the present invention will be specifically described with reference to the drawings.
FIG. 1 is a system configuration diagram showing a basic configuration of an audio content providing system according to the present invention.
In the system shown here, the business entity 1 that manages the voice providing server device receives voice actor designation information and text data from the information terminal 3a of the user (client) 3 via a communication line such as the Internet. The voice providing server device of the business entity 1 selects a voice actor based on the received information, and transmits a voice data voice request to the selected voice actor 2 terminal device (for example, a mobile phone or a personal computer). The voice actor 2 that has received the utterance request utters the text data in a read-out format and transmits it to the voice providing server device of the business entity 1. In this case, the text data may be input from the handset of the mobile phone owned by the voice actor and transmitted to the voice providing server device of the business entity 1 without recording, or in an appropriate environment such as a recording studio. It is possible to adopt a form in which recording is performed and transmitted using the data communication function of the mobile phone, or the recorded audio content is transmitted from a personal computer used by the voice actor.
In addition, when using a plurality of voice actors in accordance with a video such as a television commercial, it is transmitted from the user information terminal 3a to the voice providing server device of the business entity 1 together with the video and text data to which the voice is added. Then, to the terminal devices of a plurality of voice actors, the voice part of each voice actor is designated and the video and text data are transmitted, and the text data is uttered as the video progresses. Then, the voice content uttered is returned to the voice providing server device of the business entity 1. In the voice providing server device, the voice content received from each voice actor is returned to the information terminal 3a of the user 3.
After the audio content is transmitted to the information terminal 3a of the user 3 or prior to the transmission, the payment fee is settled by the settlement company 4. The user 3 performs guidance broadcasting using the received audio content.

図２は、本発明の一実施形態を示すシステム構成図である。
図１に示すシステムは、音声コンテンツの購入依頼主が使用するクライアントコンピュータ１１、１２、１３、および声優が録音するために使用するクライアントコンピュータ１４，１５と、声優が試聴用の音声を発声するための携帯電話機１６，１７とを備え、さらにインターネット１９を介してクライアントコンピュータ１１〜１３から受信した音声配信依頼情報をサーバ装置１８にて声優別に振り分け、クライアントコンピュータ１４〜１５または携帯電話機１６，１７を用いて発声させ、音声コンテンツを作成させる。作成した音声コンテンツは、再びインターネット１９を介してサーバ装置１８に返信させる。音声コンテンツを受け取ったサーバ装置１８はインターネット１９を介し依頼元のクライアントコンピュータ１１〜１３へ音声コンテンツを送信する。音声コンテンツを受け取ったクライアントコンピュータ１１〜１３は、音声コンテンツを取り出し、案内放送などに使用する。 FIG. 2 is a system configuration diagram showing an embodiment of the present invention.
The system shown in FIG. 1 is for client computers 11, 12, 13 used by a client who purchases audio content, client computers 14, 15 used for recording by a voice actor, and voice actors uttering audio for trial listening. The voice distribution request information received from the client computers 11 to 13 through the Internet 19 is sorted by voice actor by the server device 18, and the client computers 14 to 15 or the mobile phones 16 and 17 are connected. Use it to utter and create audio content. The created audio content is sent back to the server device 18 via the Internet 19 again. The server device 18 that has received the audio content transmits the audio content to the requesting client computers 11 to 13 via the Internet 19. Upon receiving the audio content, the client computers 11 to 13 take out the audio content and use it for guidance broadcasting or the like.

サーバ装置１８の構成について、図３と図４を参照して説明する。
サーバ装置１８は、例えば通常のパーソナルコンピュータと同様な構成を有するものであり、依頼主のクライアントコンピュータ１１〜１３から送信された文章データ、声優が発声した音声コンテンツを受信する受信部３０１と、声優の選択を行う声優選択部３０２と、サーバ装置１８により音声コンテンツの提供サービスを受けることができるユーザの情報を登録したユーザ情報データベース３０３と、声優の固有名刺や声質、特徴などの情報を蓄える声優情報データベース３０４と、依頼元から受付けた文章データなどの依頼情報を蓄える受付情報データベース３０５と、試聴用の音声を携帯電話機から受信して記録する試聴用録音部３０６と、試聴用音声をＷｅｂに登録する試聴画面アップロード部３０７と、声優へ音声録音用文章や映像のデータを送信したり、試聴用音声コンテンツをユーザへ送信する送信部３０８と、声優へ音声録音の依頼を行う音声録音依頼部３０９と、音声コンテンツや文章データを蓄える登録音声情報データベース３１０と、声優の録音した音声コンテンツを配信用のコンテンツとして受け取る配信データ受付部３１１と、音声コンテンツを送付する配信データ送付部３１２と、課金処理を行う課金処理部３１３と、課金結果を蓄える料金情報データベース３１４と、決済を外部の決済会社に委託する決済処理部３１５からなる。 The configuration of the server device 18 will be described with reference to FIGS. 3 and 4.
The server device 18 has a configuration similar to, for example, an ordinary personal computer, and includes a receiving unit 301 that receives sentence data transmitted from the client computers 11 to 13 of the requester, and audio content uttered by the voice actor, and a voice actor. A voice actor selection unit 302, a user information database 303 in which information of users who can receive a voice content providing service is registered by the server device 18, and a voice actor that stores information such as a unique name card, voice quality, and characteristics of the voice actor. Information database 304, reception information database 305 that stores request information such as text data received from the request source, trial recording unit 306 that receives and records audio for trial listening from a mobile phone, and audio for trial on the Web Audition screen upload section 307 to be registered and voice recordings to voice actors A transmission unit 308 that transmits image data or audio content for trial listening to the user, an audio recording request unit 309 that requests audio recording to the voice actor, and a registered audio information database 310 that stores audio content and sentence data , A distribution data reception unit 311 that receives audio content recorded by a voice actor as distribution content, a distribution data transmission unit 312 that transmits audio content, a charging processing unit 313 that performs charging processing, and a charge information database that stores charging results 314 and a settlement processing unit 315 entrusting settlement to an external settlement company.

図４はサーバ装置１８の詳細構成を示すブロック構成図であり、キーボード４０１、マウス４０２、携帯電話機に録音機能がない場合に使用する録音装置４０３、回線接続部４０４、ＣＰＵ４０５、図３の各種のデータベース群を保持する外部記憶装置４０６、表示装置４０７、記憶装置４０８を備えている。記憶装置４０８内には、図２の声優選択部３０２と、試聴用録音部３０６と、音声録音依頼部３０９と、配信データ受付部３１１と、配信データ送付部３１２と、課金処理部３１３と、決済処理部３１５に相当するプログラムが格納されている。 FIG. 4 is a block diagram showing the detailed configuration of the server device 18. The keyboard 401, the mouse 402, the recording device 403 used when the mobile phone does not have a recording function, the line connection unit 404, the CPU 405, and the various types shown in FIG. An external storage device 406 holding a database group, a display device 407, and a storage device 408 are provided. In the storage device 408, the voice actor selection unit 302, the audition recording unit 306, the voice recording request unit 309, the distribution data reception unit 311, the distribution data transmission unit 312, the billing processing unit 313, and the like shown in FIG. A program corresponding to the settlement processing unit 315 is stored.

次に、依頼者のクライアントコンピュータ１１〜１３の構成について、図５と図６を参照して説明する。
図５は依頼者のクライアントコンピュータ１１〜１３の構成を示すブロック構成図であり、例えば通常のパーソナルコンピュータと同様な構成有するものである。その内部には、利用者管理を行うログイン部５０１と、音声として利用する言語を選択する言語選択部５０２と、音声として購入する文章を入力する文章入力部５０３と、文章を読む声優を選択する声優選択部５０４と、音声コンテンツの依頼、または音声録音の依頼を送信する送信部５０５と、試聴用音声コンテンツまたは録音音声コンテンツを受信する受信部５０６と、声優を選択する場合に試聴するための試聴部５０７と録音依頼送信部５０８と、サーバ装置１８から受信した配信データ（音声コンテンツ）を保存する音声受信保存部５０９と、受信した音声コンテンツを放送する放送設備５１０からなる。 Next, the configuration of the client computers 11 to 13 of the requester will be described with reference to FIGS.
FIG. 5 is a block diagram showing the configuration of the client computers 11 to 13 of the client, and has the same configuration as that of an ordinary personal computer, for example. Inside, a login unit 501 that performs user management, a language selection unit 502 that selects a language to be used as speech, a text input unit 503 that inputs a text to be purchased as speech, and a voice actor that reads the text are selected. A voice actor selection unit 504, a transmission unit 505 that transmits a request for audio content or a request for audio recording, a reception unit 506 that receives audio content for audition or audio recording, and a sample for listening when selecting a voice actor The listening part 507, the recording request transmission part 508, the audio | voice reception preservation | save part 509 which preserve | saves the delivery data (audio | voice content) received from the server apparatus 18, and the broadcast equipment 510 which broadcasts the received audio | voice content.

図６は、依頼者のクライアントコンピュータ１１〜１３の詳細構成を示すブロック構成図であり、キーボード６０１と、マウス６０２と、回線接続部６０３と、図５の放送設備５１０と接続する入出力部６０４と、ＣＰＵ６０５と、外部記憶装置６０６と、表示装置６０７と、記憶装置６０８を備えている。記憶装置６０８内には、図５のログイン部５０１と、言語選択部５０２と、文章入力部５０３と、声優選択部５０４と、送信部５０５と、試聴部５０７と、録音依頼送信部５０８、音声受信保存部５０９に相当するプログラムが格納されている。 FIG. 6 is a block diagram showing a detailed configuration of the client computers 11 to 13 of the requester. The keyboard 601, the mouse 602, the line connection unit 603, and the input / output unit 604 connected to the broadcasting facility 510 in FIG. 5. A CPU 605, an external storage device 606, a display device 607, and a storage device 608. In the storage device 608, the login unit 501, the language selection unit 502, the text input unit 503, the voice actor selection unit 504, the transmission unit 505, the audition unit 507, the recording request transmission unit 508, and the voice in FIG. A program corresponding to the reception storage unit 509 is stored.

次に、声優が使用するクライアントコンピュータ１４，１５の構成について、図７と図８を参照して説明する。
図７は、クライアントコンピュータ１４，１５の構成を示すブロック図であり、例えば通常のパーソナルコンピュータと同様な構成を有するものである。その内部には録音依頼を受信する受信部７０１と文章や映像にあわせ録音を行う録音部７０２と、音声を配信用の音声コンテンツに変換する配信データ作成部７０４と、配信データを送信する送信部７０５と、録音のため外部から音声の入力を行う録音機器７０３からなる。
声優が使用する携帯電話機１６，１７は、インターネット接続機能を備えた汎用の携帯電話機を使用することができる。 Next, the configuration of the client computers 14 and 15 used by the voice actor will be described with reference to FIGS.
FIG. 7 is a block diagram showing the configuration of the client computers 14 and 15 and has the same configuration as that of a normal personal computer, for example. Inside, a receiving unit 701 that receives a recording request, a recording unit 702 that records in accordance with text and video, a distribution data creation unit 704 that converts audio into audio content for distribution, and a transmission unit that transmits distribution data 705 and a recording device 703 for inputting voice from outside for recording.
As the cellular phones 16 and 17 used by the voice actor, general-purpose cellular phones having an Internet connection function can be used.

図８はクライアントコンピュータ１４、１５の詳細構成を示すブロック図であり、キーボード８０１と、マウス８０２と、回線接続部８０３と、録音機器７０３との入出力を行う入出力部８０４と、ＣＰＵ８０５と、録音した音声を蓄える外部記憶装置８０６と、表示装置８０７と、記憶装置８０８からなる。記憶装置８０８内には、図７の受信部７０１と、録音部７０２と、配信データ作成部７０４と送信部７０５に相当するプログラムが格納されている。 FIG. 8 is a block diagram showing a detailed configuration of the client computers 14 and 15, and includes a keyboard 801, a mouse 802, a line connection unit 803, an input / output unit 804 that performs input / output with the recording device 703, a CPU 805, It consists of an external storage device 806 for storing the recorded voice, a display device 807, and a storage device 808. In the storage device 808, programs corresponding to the reception unit 701, the recording unit 702, the distribution data creation unit 704, and the transmission unit 705 of FIG. 7 are stored.

次に、音声コンテンツの提供方法の一連の手順について、図９を参照して説明する。
図９は音声コンテンツを依頼主に提供する一連の手順を示すフローチャートである。
ステップＳ１において、依頼主であるユーザは自身が所有するクライアントコンピュータ１１にログインする。次に、ステップＳ２においてユーザは使用する言語（声優に発声させる言語）を選択する。次に、ステップＳ３において声優を選択し、さらにステップＳ４において声優に発声させる音声の文章データを入力する。
次に、ステップＳ５、Ｓ６において、ステップＳ２，Ｓ３で選択した言語、声優の指定情報とステップＳ４で入力した文章データを発声依頼データとしてサーバ装置１８にインターネット１９経由で送信する。
サーバ装置１８は、ステップＳ７において発声依頼データを受信する。そして、ステップＳ８において発声依頼データを基に発声依頼データ中の声優および言語の指定情報に基づき、文章データを発声させる声優を決定する。
なお、声優は声優情報データベース２４の声優情報テーブル（後述）に予め登録されており、この登録された声優の中から選択する。
選択の方法は、（１）ユーザから受信した声優の固有名詞、性別、声質、言語などの指定情報に基づいて声優情報テーブル内に登録されている声優を選択する、（２）ユーザから受信したユーザの業種、音声コンテンツの用途、使用環境の情報に基づき、声優情報テーブル内に登録されている声優を選択する方法がある。
この場合、ユーザが特定の１人の声優のみを指定しない限り、複数人の声優を選択する。但し、条件に該当する声優が１人しか登録されていない場合は、この限りでない。本発明では、上記（１）、（２）のいずれか、または両方を適宜に組合わせて声優を選択する。
なお、声優登録テーブルに声優を登録する場合、事業主体１は登録料を徴収する。 Next, a series of procedures of the audio content providing method will be described with reference to FIG.
FIG. 9 is a flowchart showing a series of procedures for providing audio content to the client.
In step S1, the user who is the client logs in to the client computer 11 owned by the user. Next, in step S2, the user selects a language to be used (a language to be spoken by a voice actor). Next, a voice actor is selected in step S3, and voice sentence data to be uttered by the voice actor is input in step S4.
Next, in steps S5 and S6, the language and voice actor designation information selected in steps S2 and S3 and the text data input in step S4 are transmitted to the server device 18 via the Internet 19 as utterance request data.
The server device 18 receives the utterance request data in step S7. In step S8, based on the voice request data, based on the voice actor in the voice request data and the language designation information, the voice actor to utter the sentence data is determined.
The voice actor is registered in advance in a voice actor information table (described later) of the voice actor information database 24, and is selected from the registered voice actors.
The selection method is as follows: (1) Select a voice actor registered in the voice actor information table based on the specified information such as proper name, gender, voice quality, language, etc. of the voice actor received from the user, (2) received from the user There is a method of selecting a voice actor registered in a voice actor information table based on information on a user's business type, usage of audio content, and usage environment.
In this case, unless the user designates only one specific voice actor, a plurality of voice actors are selected. However, this is not the case when only one voice actor corresponding to the condition is registered. In the present invention, a voice actor is selected by appropriately combining either or both of the above (1) and (2).
In addition, when registering a voice actor in the voice actor registration table, the business entity 1 collects a registration fee.

声優を選択したならば、次に、ステップＳ９において声優の携帯電話機１６，１７に連絡し、当該声優に発声させる文章データをインターネット１９を経由して送信し、その文章データを発声させる。発声された音声コンテンツは携帯電話機１６，１７の基地局およびインターネット１９を介してサーバ装置１８で受信される。
サーバ装置１８では、回線接続部４４により受信した音声コンテンツを試聴用録音部３０６に渡し、外部記憶装置４０６内の登録音声情報データベース３１０内の試聴用音声データテーブル（図２９参照）に記録する。 If the voice actor is selected, next, in step S9, the voice actor's mobile telephones 16 and 17 are contacted, sentence data to be uttered by the voice actor is transmitted via the Internet 19, and the sentence data is uttered. The voice content uttered is received by the server device 18 via the base stations of the mobile phones 16 and 17 and the Internet 19.
In the server device 18, the audio content received by the line connection unit 44 is transferred to the trial recording unit 306 and recorded in the trial audio data table (see FIG. 29) in the registered audio information database 310 in the external storage device 406.

次に、ステップＳ１０において試聴用画面に試聴用の音声コンテンツをアップロードする。
ステップＳ１１においてユーザは、サーバ装置１８のＷｅｂページにアクセスし、試聴用の音声コンテンツを選択し、ダウンロードする。ダウンロードするコンテンツは複数の声優の試聴用コンテンツを選択順に１つにまとめたものである。
なお、サーバ装置１８のＷｅｂページからダウンロードする代わりに、メール形式で依頼元のユーザに送信するようにしてもよい。 Next, in step S10, audio content for trial listening is uploaded to the trial listening screen.
In step S11, the user accesses the Web page of the server device 18, selects audio content for trial listening, and downloads it. The content to be downloaded is a collection of trial listening content for a plurality of voice actors in a selected order.
Instead of downloading from the Web page of the server device 18, it may be transmitted to the requesting user in the mail format.

続いて、サーバ装置１８から送信された音声コンテンツは依頼元ユーザのクライアントコンピュータ１１内の試聴部５０７により、再生される。
ステップＳ１２においてユーザは再生された試聴用の音声を試聴する。そして、ステップＳ１３、Ｓ１４において、試聴した声優の中で気に入った声優を選択し、その声優に対する録音依頼のデータをインターネット１９経由でサーバ装置１８に送信する。
録音依頼データを受信したサーバ装置１８は、ステップＳ１５，Ｓ１６において、ユーザが選択した声優のクライアントコンピュータ１４，１５に対し、録音依頼を音声またはメール形式で送信する。なお、携帯電話機１６，１７が録音機能を備え、かつ案内放送などで要求される音質を確保できるものであれば、携帯電話機１６、１７に録音依頼を送信するようにすることができる。 Subsequently, the audio content transmitted from the server device 18 is reproduced by the audition unit 507 in the client computer 11 of the requesting user.
In step S12, the user auditiones the reproduced audio for trial listening. In steps S 13 and S 14, a favorite voice actor is selected from the sampled voice actors, and recording request data for the voice actor is transmitted to the server device 18 via the Internet 19.
The server device 18 that has received the recording request data transmits the recording request in voice or mail format to the client computers 14 and 15 of the voice actor selected by the user in steps S15 and S16. Note that if the mobile phones 16 and 17 have a recording function and can ensure the sound quality required by the guidance broadcast or the like, the recording request can be transmitted to the mobile phones 16 and 17.

ステップＳ１７において、声優のクライアントコンピュータ１４，１５は録音依頼を受信する。録音依頼を受けた声優は、ステップＳ１８において、自身が使用するクライアントコンピュータ１４，１５に付属した録音機器７０３を用いて録音依頼された文章データを発声する。発声された文章データの音声コンテンツは録音部７０２により外部記憶装置８０６に記録（録音）される。
この場合、録音機器７０３の代わりに、専用の録音スタジオで録音した音声コンテンツを録音部７０２により外部記憶装置８０６内に記録するようにしてもよい。テレビコマーシャルなどの高品質の音声が要求される場合には、録音スタジオで録音したものを用いるのが望ましい。 In step S17, the voice actor client computers 14 and 15 receive the recording request. In step S18, the voice actor who has received the recording request utters the sentence data requested to be recorded using the recording device 703 attached to the client computers 14 and 15 used by the voice actor. The audio content of the spoken text data is recorded (recorded) in the external storage device 806 by the recording unit 702.
In this case, instead of the recording device 703, audio content recorded in a dedicated recording studio may be recorded in the external storage device 806 by the recording unit 702. When high quality sound such as a TV commercial is required, it is desirable to use the one recorded in the recording studio.

ステップＳ１９において、配信データ作成部７０４により、録音が終了した音声コンテンツの配信データを作成し、ステップＳ２０、Ｓ２１において配信データを送信部７０５からサーバ装置１８に送信する。
ステップＳ２２において、サーバ装置１８は声優のクライアントコンピュータ１４、１５から配信データを受信する。
サーバ装置１８は、ステップＳ２３において配信データを受信部３０１で受信し、ステップＳ２３において配信データ受付部３１１により登録音声情報データベース３１０に記録する。続く、ステップＳ２４において依頼元のユーザに対する課金処理を行う。課金処理はクレジットカードによる決済であり、外部の決済会社により決済を行う。このとき決済会社により声優に報酬が支払われる。
サーバ装置１８は、ステップＳ２５において配信データ、すなわち依頼元のユーザが依頼した文章データをユーザが選択した声優に発声させた録音音声コンテンツをインターネット１９経由で依頼元のクライアントコンピュータ１１へ送信する。 In step S19, the distribution data creation unit 704 creates distribution data of the audio content that has been recorded. In steps S20 and S21, the distribution data is transmitted from the transmission unit 705 to the server device 18.
In step S <b> 22, the server device 18 receives distribution data from the voice actor client computers 14 and 15.
In step S23, the server device 18 receives the distribution data by the reception unit 301, and records it in the registered voice information database 310 by the distribution data reception unit 311 in step S23. In step S24, billing processing is performed for the requesting user. The billing process is a credit card settlement, and settlement is performed by an external settlement company. At this time, a compensation is paid to the voice actor by the settlement company.
In step S25, the server device 18 transmits the recorded data, which is the voice data selected by the user to the distribution data, that is, the text data requested by the requesting user, to the requesting client computer 11 via the Internet 19.

依頼元のクライアントコンピュータ１１は、ステップＳ２６において録音音声コンテンツを受信し、ステップＳ２７において音声受信保存部５０９により外部記憶装置６０６に保存する。
依頼元のクライアントコンピュータ１１は、ステップＳ２８において保存した音声コンテンツを放送設備５１０に渡し、案内放送として出力させる。 The requesting client computer 11 receives the recorded audio content in step S26 and stores it in the external storage device 606 by the audio reception storage unit 509 in step S27.
The requesting client computer 11 passes the audio content stored in step S28 to the broadcasting facility 510 and outputs it as a guidance broadcast.

次に、サーバ装置１８の外部記憶装置４６に保持されている各種のデータベースについて図１０〜図１４を用いて説明する。
図１０は、ユーザ情報データベース３０３内のユーザ情報テーブルの構成例を示す図である。ユーザ情報テーブルには、音声配信サービスを利用可能なユーザのユーザＩＤ、パスワード、個人情報（氏名、住所、電話番号、業務種別または音声コンテンツの用途や使用環境等）が登録されており、音声コンテンツの配信サービスを受ける場合には、このユーザ情報テーブルの登録情報に基づき、配信サービスが提供可能であるかのユーザ認証を実施する。このユーザ認証は、例えば図９のステップＳ７で行う。
個人情報の中の業務種別または音声コンテンツの用途や使用環境としては、例えば、デパート、イベント運営、遊園地、携帯電話機用音声コンテンツ作成、コマーシャル作成などの業務内容、場内放送用、場外放送用などの用途、静寂、騒音大などの環境の情報が登録される。 Next, various databases held in the external storage device 46 of the server device 18 will be described with reference to FIGS.
FIG. 10 is a diagram illustrating a configuration example of a user information table in the user information database 303. In the user information table, the user ID, password, and personal information (name, address, telephone number, business type, usage of the audio content, usage environment, etc.) of the user who can use the audio distribution service are registered. When receiving the distribution service, user authentication is performed to determine whether the distribution service can be provided based on the registration information in the user information table. This user authentication is performed, for example, in step S7 in FIG.
Business type in personal information or usage and usage environment of audio content include, for example, department store, event management, amusement park, business content such as mobile phone audio content creation, commercial creation, in-field broadcasting, out-of-field broadcasting, etc. Information on the environment such as usage, silence, and loud noise is registered.

図１１は、受付情報データベース３０５内に保持されている受付情報テーブルの構成例を示す図である。受付情報テーブルには、ユーザから依頼された情報として、受付日、ユーザＩＤ、声優、文章データが登録され、また、依頼元に配信した音声コンテンツの識別情報（音声１２など）とその利用金額が登録されるようになっている。
文章データは、依頼元のユーザから受信した文章データが登録される。声優の情報は、依頼元のユーザから受信した発声依頼データの中に、声優を指定する情報が含まれていた場合には、指定された条件を満たす声優を後述の声優情報テーブルから検索し、その検索結果の声優の固有名詞または識別情報が１〜複数人分登録される。
声優を選択する方法は、（１）ユーザから受信した声優の固有名詞、性別、声質、言語などの指定情報に基づいて声優情報テーブル内に登録されている声優を選択する、（２）ユーザ情報テーブル３０３に登録されているユーザの業種、音声コンテンツの用途、使用環境の情報に基づき、声優情報テーブル内に登録されている声優を選択する方法がある。
この場合、ユーザが特定の１人の声優のみを指定しない限り、複数人の声優を選択する。但し、条件に該当する声優が１人しか登録されていない場合は、この限りでない。本発明では、上記（１）、（２）のいずれか、または両方を適宜に組合わせて声優を選択する。
発声依頼データの中に、声優の指定情報が含まれていなかった場合には、（２）の方法により、適切な声優を選択する。 FIG. 11 is a diagram illustrating a configuration example of a reception information table held in the reception information database 305. In the reception information table, the reception date, user ID, voice actor, and text data are registered as information requested by the user, and the identification information (such as sound 12) of the audio content distributed to the request source and the usage amount thereof are stored. It is supposed to be registered.
As the text data, text data received from the requesting user is registered. If the voice request data received from the requesting user includes information specifying the voice actor, the voice actor information is searched for a voice actor satisfying the specified condition from the voice actor information table described below, The proper noun or identification information of the voice actor of the search result is registered for one to a plurality of people.
The method for selecting a voice actor is (1) selecting a voice actor registered in the voice actor information table based on designation information such as the proper noun, gender, voice quality and language of the voice actor received from the user. (2) User information There is a method of selecting a voice actor registered in the voice actor information table based on information on the user's business type, usage of audio content, and usage environment registered in the table 303.
In this case, unless the user designates only one specific voice actor, a plurality of voice actors are selected. However, this is not the case when only one voice actor corresponding to the condition is registered. In the present invention, a voice actor is selected by appropriately combining either or both of the above (1) and (2).
If the voice request data does not include voice actor designation information, an appropriate voice actor is selected by the method (2).

図１２は、声優情報データベース３０４内に保持される声優情報テーブルの構成例を示す図である。声優情報テーブルには、文章データの発声を依頼する声優について、声優の識別情報、名前、性別、年齢、個人情報（本名、住所、電話番号、メールアドレス、言語、声質、声の特徴等）が登録されている。
言語は、それぞれの声優が発声可能な言語を示すものであり、例えば日本語の他に英語、ドイツ語が発声できる場合には、日本語、英語、ドイツ語といった言語種別の情報が登録される。声質には、例えば透明感がある、さわやかな感じ、優しい温和な感じ、迫力がある、などの情報が登録される。声の特徴には、デパート向け、イベント会場向け、コマーシャル向けなどの情報が登録される。 FIG. 12 is a diagram illustrating a configuration example of a voice actor information table held in the voice actor information database 304. The voice actor information table contains voice actor identification information, name, gender, age, and personal information (real name, address, phone number, email address, language, voice quality, voice characteristics, etc.) for voice actors requesting voice data. It is registered.
The language indicates the language that each voice actor can speak. For example, when English and German can be spoken in addition to Japanese, information on the language type such as Japanese, English, and German is registered. . In the voice quality, for example, information such as transparency, refreshing feeling, gentle and gentle feeling, and powerfulness is registered. Voice features include information for department stores, event venues, and commercials.

図１３は、登録音声情報データベース３１０内の登録音声情報テーブルの構成例を示す図である。登録音声情報テーブルには、ユーザに配信した音声コンテンツおよび声優の識別情報、文章データが登録されている。なお、配信済みの音声コンテンツを再利用する要求があった場合、サーバ装置１８は、登録音声情報テーブルの登録情報の一覧を依頼元に配信し、試聴させ、その中から再利用する音声コンテンツを提供することができる。なお、図示していないが、それぞれの音声コンテンツには、試聴用か否かを示すフラグが設定されている。 FIG. 13 is a diagram illustrating a configuration example of a registered voice information table in the registered voice information database 310. In the registered voice information table, voice content distributed to the user, voice actor identification information, and sentence data are registered. When there is a request to reuse the distributed audio content, the server device 18 distributes the list of registered information in the registered audio information table to the request source, and listens to the audio content to be reused. Can be provided. Although not shown, each audio content is set with a flag indicating whether or not it is for trial listening.

図１４は、料金情報データベース３１４内に保持されている料金情報テーブルの構成例を示す図である。料金情報テーブルには、それぞれのユーザが購入した音声コンテンツについて、ユーザＩＤ、声優、音声、文章データの識別情報及び金額が登録されるようになっている。 FIG. 14 is a diagram showing a configuration example of a fee information table held in the fee information database 314. In the fee information table, the user ID, voice actor, voice, text data identification information and amount are registered for the audio content purchased by each user.

図１５は依頼主のユーザが使用するクライアントコンピュータ１１〜１３のログイン画面１５０１の例を示す図である。ログイン画面１５０１におけるユーザ操作の手順を、図１６を参照して説明する。図１６はログイン画面例におけるユーザ操作の一連の流れを示すフローチャートである。
ユーザは、自身が使用するクライアントコンピュータ１１〜１３において、ログイン操作を行い、図１５のログイン画面１５０１を表示装置６０７に表示させる。そして、ステップＳ１６０１においてログイン画面１５０１にユーザＩＤを入力し、続くステップＳ１６０２においてパスワードを入力する。入力後、ＯＫボタン１５０２を押下し、サーバ装置１８が提供する音声提供配信サービスにログインする。 FIG. 15 is a diagram showing an example of the login screen 1501 of the client computers 11 to 13 used by the client user. A user operation procedure on the login screen 1501 will be described with reference to FIG. FIG. 16 is a flowchart showing a series of user operations in the login screen example.
The user performs a login operation on the client computers 11 to 13 used by the user, and displays the login screen 1501 of FIG. In step S1601, the user ID is input to the login screen 1501, and in the subsequent step S1602, a password is input. After the input, the user presses an OK button 1502 to log in to the voice providing distribution service provided by the server device 18.

図１７は、ユーザのクライアントコンピュータ１１〜１３における言語選択画面１７０１の例を示す図である。図１８は言語選択画面１７０１におけるユーザ操作の一連の流れを示すフローチャートである。
ユーザは、ステップＳ１８０１において、言語選択画面１７０１に表示された言語一覧から声優に発声させる音声の言語を選択し、続くステップＳ１８０２においてＯＫボタン１７０２を押して言語を決定する。 FIG. 17 is a diagram illustrating an example of the language selection screen 1701 on the user client computers 11 to 13. FIG. 18 is a flowchart showing a series of user operations on the language selection screen 1701.
In step S1801, the user selects the language of the voice to be uttered by the voice actor from the language list displayed on the language selection screen 1701, and in step S1802, presses the OK button 1702 to determine the language.

図１９は、ユーザのクライアントコンピュータ１１〜１３における声優選択画面１９０１の例を示す図である。図２０は声優選択画面１９０１におけるユーザ操作の一連の流れを示すフローチャートである。
ユーザは、ステップＳ２００１において、声優選択画面１９０１に表示された声優一覧から、氏名、年齢、性別、声質、特徴などの情報を参考にして声優を選択し、続くステップＳ２００２においてＯＫボタン１９０２を押下して声優を決定する。この声優選択画面１９０１では、複数の声優を同時に選択することができる。また、適切な声優が見当たらない場合には、声優を選択しないでＯＫボタン１９０２を押下する。その場合には、ユーザ情報テーブルに登録されているユーザの業種や音声コンテンツの用途などの情報によって適切な声優がサーバ装置１８で選択される。
なお、声優選択画面に表示される声優一覧は、図１２の声優情報テーブルに登録された情報を元にサーバ装置１８が作成し、ユーザ登録を行ったユーザのクライアントコンピュータ１１〜１３に配信したものである。 FIG. 19 is a diagram showing an example of a voice actor selection screen 1901 in the user client computers 11 to 13. FIG. 20 is a flowchart showing a series of user operations on the voice actor selection screen 1901.
In step S2001, the user selects a voice actor from the list of voice actors displayed on the voice actor selection screen 1901 with reference to information such as name, age, sex, voice quality, and characteristics, and presses an OK button 1902 in subsequent step S2002. Determine the voice actor. On this voice actor selection screen 1901, a plurality of voice actors can be selected simultaneously. If an appropriate voice actor is not found, the OK button 1902 is pressed without selecting a voice actor. In that case, an appropriate voice actor is selected by the server device 18 according to information such as the user's business type and the usage of the audio content registered in the user information table.
The voice actor list displayed on the voice actor selection screen is created by the server device 18 based on the information registered in the voice actor information table of FIG. 12 and distributed to the client computers 11 to 13 of the user who performed user registration. It is.

図２１は声優が使用する携帯電話機１６，１７に表示される試聴用録音画面２１０１の例を示す図である。図２２は試聴用録音画面２１０１における声優の操作手順を示すフローチャートである。
声優は、ステップＳ２２０１で携帯電話機１６、１７からサーバ装置１８の試聴用録音画面２１０１に接続し、ステップＳ２２０２で録音開始ボタン２１０２を押下し、続くステップＳ２２０３で携帯電話機１６、１７の送話器に文章データを発声音声を試聴用として入力し、サーバ装置１８に送信する。サーバ装置１８では受信した音声を視聴用の音声コンテンツとして図１３の登録音声情報テーブルに記録する。 FIG. 21 is a diagram showing an example of a test recording screen 2101 displayed on the mobile phones 16 and 17 used by the voice actor. FIG. 22 is a flowchart showing a voice actor operating procedure on the trial recording screen 2101.
In step S2201, the voice actor connects to the test recording screen 2101 of the server device 18 from the cellular phones 16 and 17, presses the recording start button 2102 in step S2202, and then becomes a transmitter of the cellular phones 16 and 17 in step S2203. Sentence data is input for listening as sentence data and transmitted to the server device 18. The server device 18 records the received audio as audio content for viewing in the registered audio information table of FIG.

図２３はユーザのクライアントコンピュータ１１〜１３に表示される試聴画面２３０１の例を示す図である。図２４は試聴画面２３０１におけるユーザ操作の流れを示すフローチャートである。
ユーザは、ステップＳ２４０１において試聴画面２３０１から試聴したい声優を選択し、続くステップＳ２４０２でＯＫボタン２３０２を押下して試聴する声優の音声コンテンツを決定する。
なお、試聴画面２３０１は、サーバ装置１８の試聴画面に接続することによってユーザのクライアントコンピュータ１１〜１３の表示装置６０７に表示される。 FIG. 23 is a diagram showing an example of a preview screen 2301 displayed on the user's client computers 11 to 13. FIG. 24 is a flowchart showing the flow of user operations on the trial listening screen 2301.
In step S2401, the user selects a voice actor to be auditioned from the audition screen 2301, and in step S2402, the user presses an OK button 2302 to determine the audio content of the voice actor to be auditioned.
Note that the trial listening screen 2301 is displayed on the display device 607 of the user client computer 11 to 13 by connecting to the trial listening screen of the server device 18.

図２５は、音声コンテンツの試聴を終えたユーザが所望の声優に対する音声コンテンツの録音依頼を行う場合の録音依頼画面２５０１の例を示す図である。図２６は録音依頼を行う場合のユーザ操作の流れを示すフローチャートである。
ユーザは、ステップＳ２６０１において録音依頼画面２５０１のプルダウンメニュー２５０２に表示された複数の選択候補の声優の中から希望する声優を選択し、続くステップＳ２６０２において録音依頼する文章データを文章入力欄２５０３に入力する。入力後、ステップＳ２６０３においてＯＫボタン２５０４を押下して録音依頼を行う。
なお、試聴用に発声依頼を行う文章データと、案内放送等で実際に使用するために録音依頼を行う文章データとは同じであることが多いが、試聴用の文章データは声優を決定するための参考にするものであるため、録音依頼を行う文章データの一部分を省略したものにすることができる。 FIG. 25 is a diagram illustrating an example of a recording request screen 2501 when a user who has finished listening to audio content requests to record audio content for a desired voice actor. FIG. 26 is a flowchart showing the flow of user operation when a recording request is made.
In step S2601, the user selects a desired voice actor from a plurality of selection candidate voice actors displayed in the pull-down menu 2502 of the recording request screen 2501, and in the subsequent step S2602, the sentence data to be requested for recording is input to the sentence input field 2503. To do. After the input, an OK button 2504 is pressed in step S2603 to request recording.
In many cases, the text data that is requested for trial listening is the same as the text data that is requested for recording for actual use in guidance broadcasts, but the text data for audition determines the voice actor. Therefore, it is possible to omit a part of sentence data for requesting recording.

図２７は録音依頼を受けた声優のクライアントコンピュータ１４、１５における録音画面２７０１の例を示す図である。図２８は録音画面を使用した声優の操作手順を示すフローチャートである。
声優は、ステップＳ２８０１において自身が使用するクライアントコンピュータ装置１４の録音部７０２を起動し、図２７の録音画面２７０１を表示装置８０７に表示させる。
次に、ステップＳ２８０２において録音開始ボタン２７０２を押下し、次のステップＳ２８０３で録音機器７０３を使用して録音依頼を受けた文章データの録音を行う。 FIG. 27 is a diagram showing an example of a recording screen 2701 on the client computers 14 and 15 of a voice actor who has received a recording request. FIG. 28 is a flowchart showing the voice actor operation procedure using the recording screen.
In step S2801, the voice actor activates the recording unit 702 of the client computer device 14 used by the voice actor, and displays the recording screen 2701 of FIG.
Next, in step S2802, the recording start button 2702 is pressed, and in the next step S2803, the sentence data requested for recording is recorded using the recording device 703.

図２９は、登録音声情報データベース３１０の試聴用音声データテーブルの構成例を示す図である。試聴用音声データテーブルは、ユーザから依頼された試聴用の文章データに対する複数の声優の音声データを声優の選択順に１つにまとめたものであり、複数の試聴用音声コンテンツ２９０２，２９０４、２９０６の間には声優の名称が「音声１」などの音声識別情報２９０１，２９０３、２９０５を付して登録される。
この試聴用音声データテーブルは、ユーザからの発声依頼を受ける都度、発声依頼別に作成される。 FIG. 29 is a diagram showing a configuration example of a trial audio data table of the registered audio information database 310. The audition audio data table is a collection of audio data of a plurality of voice actors for the test audition sentence data requested by the user in the order of selection of voice actors, and includes a plurality of audio contents for audition 2902, 2904, 2906. In the meantime, the name of the voice actor is registered with voice identification information 2901, 2903, 2905 such as “voice 1”.
This audition audio data table is created for each utterance request each time an utterance request is received from the user.

なお、上記説明において、声優が使用するクライアントコンピュータ１４，１５を無線回線によってインターネット１９に接続可能なノート型パーソナルコンピュータなどで構成することにより、携帯電話機１６、１７を使用せずに試聴用音声コンテンツの送受信、案内放送等で実際に使用する録音音声コンテンツの送受信を行うことができる。要するに、インターネットに接続可能で、携帯可能な端末装置を用いて試聴用音声コンテンツの送受信、案内放送等で実際に使用する録音音声コンテンツの送受信を行うことができる。 In the above description, the client computers 14 and 15 used by the voice actor are composed of a notebook personal computer or the like that can be connected to the Internet 19 via a wireless line, so that the audio content for trial listening can be used without using the mobile phones 16 and 17. The recorded audio content that is actually used in the transmission / reception, guidance broadcasting, etc. can be transmitted / received. In short, it is possible to send and receive audio content for trial listening, and to transmit and receive recorded audio content that is actually used in guidance broadcasting, etc., using a portable terminal device that can be connected to the Internet.

以上のように上記説明した実施の形態によれば、依頼主が必要とする案内放送等の音声の文章データをサーバ装置で複数の声優の端末装置に配信し、配信先の声優に文章データを発生させ、その音声コンテンツをサーバ装置から依頼主の端末装置に配信するように構成したことにより、依頼主側では、臨時に必要になった案内放送などの音声を容易に取得し、使用することが可能になる。
また、依頼主側が期待している口調や声質の声優の音声で案内放送を行うことが可能になる。 As described above, according to the embodiment described above, voice text data such as guidance broadcasting required by the client is distributed to a plurality of voice actor terminal devices by the server device, and the text data is distributed to the voice actors of the distribution destinations. By generating and delivering the audio content from the server device to the requester's terminal device, the requester can easily acquire and use audio such as guidance broadcasts that are temporarily required Is possible.
In addition, it is possible to perform the guidance broadcast with the voice of the voice and voice quality expected by the client.

本発明に係る音声コンテンツの提供方法の基本構成を示すシステム構成図である。1 is a system configuration diagram showing a basic configuration of an audio content providing method according to the present invention. FIG. 本発明の一実施の形態を示すシステム構成図である。1 is a system configuration diagram showing an embodiment of the present invention. サーバ装置の構成を示すブロック図である。It is a block diagram which shows the structure of a server apparatus. サーバ装置の詳細構成を示すブロック図である。It is a block diagram which shows the detailed structure of a server apparatus. 依頼主のクライアントコンピュータの構成を示すブロック図である。It is a block diagram which shows the structure of the client computer of a requester. 依頼主のクライアントコンピュータの詳細構成を示すブロック図である。It is a block diagram which shows the detailed structure of a client computer of a requester. 声優が使用するクライアントコンピュータの構成を示すブロック図である。It is a block diagram which shows the structure of the client computer which a voice actor uses. 声優が使用するクライアントコンピュータの詳細構成を示すブロック図である。It is a block diagram which shows the detailed structure of the client computer which a voice actor uses. 音声コンテンツの提供方法の手順を示すフローチャートである。It is a flowchart which shows the procedure of the provision method of an audio | voice content. ユーザ情報テーブルの構成例を示す図である。It is a figure which shows the structural example of a user information table. 受付情報テーブルの構成例を示す図である。It is a figure which shows the structural example of a reception information table. 登録音声情報テーブルの構成例を示す図である。It is a figure which shows the structural example of a registration audio | voice information table. 声優情報テーブルの構成例を示す図である。It is a figure which shows the structural example of a voice actor information table. 料金情報テーブルの構成例を示す図である。It is a figure which shows the structural example of a charge information table. ユーザのクライアントコンピュータにおけるログイン画面例を示す図である。It is a figure which shows the example of a login screen in a user's client computer. ユーザのクライアントコンピュータにおけるログイン操作の手順を示すフローチャートである。It is a flowchart which shows the procedure of login operation in a user's client computer. ユーザのクライアントコンピュータにおける言語選択画面例を示す図である。It is a figure which shows the example of a language selection screen in a user's client computer. ユーザのクライアントコンピュータにおける言語選択操作の手順を示すフローチャートである。It is a flowchart which shows the procedure of language selection operation in a user's client computer. ユーザのクライアントコンピュータにおける声優選択画面例を示す図である。It is a figure which shows the example of a voice actor selection screen in a user's client computer. ユーザのクライアントコンピュータにおける声優選択操作の手順を示すフローチャートである。It is a flowchart which shows the procedure of the voice actor selection operation in a user's client computer. 声優が使用する携帯電話機における試聴用録音画面の例を示す図である。It is a figure which shows the example of the recording screen for trial listening in the mobile telephone which a voice actor uses. 声優が使用する携帯電話機における試聴用録音操作の手順を示すフローチャートである。It is a flowchart which shows the procedure of the recording operation for trial listening in the mobile telephone which a voice actor uses. ユーザのクライアントコンピュータにおける試聴画面の画面例を示す図である。It is a figure which shows the example of a screen of the trial listening screen in a user's client computer. ユーザのクライアントコンピュータにおける試聴画面に対する操作手順を示すフローチャートである。It is a flowchart which shows the operation procedure with respect to the preview screen in a user's client computer. ユーザのクライアントコンピュータにおける録音依頼画面の画面例を示す図である。It is a figure which shows the example of a screen of the recording request screen in a user's client computer. ユーザのクライアントコンピュータにおける録音依頼画面の操作手順を示すフローチャートである。It is a flowchart which shows the operation procedure of the recording request screen in a user's client computer. 声優が使用するクライアントコンピュータおける録音画面の画面例を示す図である。It is a figure which shows the example of a recording screen in the client computer which a voice actor uses. 声優が使用するクライアントコンピュータおけ録音画面の操作手順を示すフローチャートである。It is a flowchart which shows the operation procedure of the recording screen in the client computer which a voice actor uses. 試聴用音声データテーブルの構成例を示す図である。It is a figure which shows the structural example of the audio | voice data table for trial listening.

Explanation of symbols

１事業主体
２声優
３ユーザ
３ａ情報端末
４決済会社
１１、１２、１３依頼者のクライアントコンピュータ
１４、１５声優クライアントコンピュータ
１６、１７声優の持つ携帯電話機
１８サーバ装置
１９インターネット
３０１受信部
３０２声優選択部
３０３ユーザ情報データベース
３０４声優情報データベース
３０５受付情報データベース
３０６試聴用録音部
３０８送信部
３０９音声録音依頼部
３１０登録音声情報データベース
３１１配信データ受付部
３１２配信データ送付部
３１３課金処理部
３１４料金情報データベース
３１５決済処理部
５０１ログイン部
５０２言語選択部
５０３文章入力部
５０４声優選択部
５０５送信部
５０６受信部
５０７試聴部
５０８録音依頼送信部
５０９音声受信保存部
５１０放送設備 DESCRIPTION OF SYMBOLS 1 Business entity 2 Voice actor 3 User 3a Information terminal 4 Settlement company 11, 12, 13 Requester's client computer 14, 15 Voice actor client computer 16, 17 Mobile phone of voice actor 18 Server device 19 Internet 301 Receiving unit 302 Voice actor selecting unit 303 User information database 304 Voice actor information database 305 Reception information database 306 Audition recording unit 308 Transmission unit 309 Audio recording request unit 310 Registered audio information database 311 Distribution data reception unit 312 Distribution data transmission unit 313 Charge processing unit 314 Charge information database 315 Payment processing Section 501 Login section 502 Language selection section 503 Text input section 504 Voice actor selection section 505 Transmission section 506 Reception section 507 Audition section 508 Recording request transmission section 509 510 broadcasting equipment

Claims

A method of providing audio content in which audio content in which a voice requested by a client is uttered by a voice actor is provided to the client via a communication line such as the Internet,
A first step of receiving text data corresponding to the audio content required by the client from the client in the voice providing service device via a communication line such as the Internet;
A voice actor that utters the received text data is selected from voice actors registered in the voice actor table of the voice providing service device, and the text data to be uttered is transmitted to the selected voice actor's terminal device via a communication line such as the Internet. A second step of transmitting at
A third step of receiving audio content corresponding to the sentence data uttered by the voice actor from a voice actor terminal device via a communication line such as the Internet in the voice providing service device;
The audio providing service device further comprises a fourth step of transmitting the audio content of the voice actor received from the voice actor terminal device to the requesting main apparatus via a communication line such as the Internet. Method.

In the second step, a voice actor that utters text data based on designation information such as a proper name, sex, voice quality, language, etc. of the voice actor received from the client apparatus is selected from voice actors registered in the voice actor table in advance. The audio content providing method according to claim 1, wherein the audio content is selected.

In the second step, a voice actor that utters text data based on information of the client's business type, usage of audio content, and usage environment received from the client apparatus is selected from voice actors registered in the voice actor table in advance. The audio content providing method according to claim 1, wherein the audio content is selected.

4. The method according to claim 1, further comprising a fifth step of charging the request source with a usage fee of the audio content after transmitting the voice content of the voice actor to the requesting main apparatus. The audio content providing method described in 1.

A method of providing audio content in which audio content in which a voice requested by a client is uttered by a voice actor is provided to the client via a communication line such as the Internet,
A first step of receiving text data corresponding to the audio content required by the client from the client in the voice providing service device via a communication line such as the Internet;
A voice actor to utter the received text data is selected from voice actors registered in advance in the voice actor table of the voice providing service device, and the text data to be uttered is selected for the selected one or more voice actor terminal devices via the Internet or the like. A second step of transmitting via the communication line of
A third step of receiving audio content corresponding to the text data uttered by the voice actor from one or more voice actor terminal devices via a communication line such as the Internet in the voice providing service device;
A fourth step of transmitting the voice content of voice actors received from one or more voice actor terminal devices as a set to the requesting main apparatus of the request source via a communication line such as the Internet in the voice providing service device;
A fifth step of generating the audio content of one or more voice actors received at the requesting apparatus for trial listening;
The sixth information is transmitted from the requesting main apparatus to the voice providing service apparatus via the communication line such as the Internet from the requesting main apparatus. Steps,
A seventh step of transmitting a voice recording request for the sentence data to a selected voice actor terminal device via a communication line such as the Internet, based on selection information of the voice content of the voice actor selected by the client in the voice providing service device When,
An eighth step of transmitting the recorded voice content of the sentence data recorded by the voice actor who has received a voice recording request from the terminal device or other device of the voice actor to the voice providing service device via a communication line such as the Internet; ,
A voice content providing method comprising: a ninth step of transmitting the recorded voice content of a voice actor received at a voice providing service device to a requesting main device of a requester via a communication line such as the Internet.

In the second step, voice actors that utter text data based on designation information such as proper nouns, gender, voice quality, language, etc. of voice actors received from the client apparatus are selected from voice actors registered in the voice actor table in advance. 6. The audio content providing method according to claim 5, wherein the audio content providing method is selected.

In the second step, a voice actor that utters text data based on information of the client's business type, usage of audio content, and usage environment received from the client apparatus is selected from voice actors registered in the voice actor table in advance. 6. The audio content providing method according to claim 5, wherein the audio content providing method is selected.

In the second to fourth steps and the seventh and eighth steps, a cellular phone is used as a voice actor terminal device to receive text data, transmit audio content for trial listening, and transmit recorded audio content. The audio content providing method according to claim 5, wherein the audio content is provided.

9. The method according to claim 5, further comprising: a tenth step of charging the request source for a usage fee of the recorded audio content after transmitting the recorded audio content of the voice actor to the requesting main apparatus. The audio content providing method according to one item.

Voice content that consists of a client device used by a client, a terminal device used by a voice actor, and a voice providing service device that provides a voice content providing service. Audio content providing system provided to the client via a communication line such as
Voice providing service device
First means for receiving sentence data corresponding to the audio content required by the requester from the requester apparatus via a communication line such as the Internet, and voice actors for uttering the received sentence data are registered in advance in the voice actor table. A second means for selecting the voice actor from the voice actor terminal device, and transmitting the text data to be spoken to the selected voice actor terminal device via a communication line such as the Internet; and the sentence data uttered by the voice actor from the voice actor terminal device. Third means for receiving the corresponding audio content via a communication line such as the Internet, and fourth means for transmitting the voice content of the voice actor received from the voice actor terminal device to the requesting main apparatus of the request source via the communication line such as the Internet. With the means of
A fifth means for receiving text data from the requester and transmitting it to the voice providing service device via a communication line such as the Internet; and a voice content of the text data from the voice providing service device on the Internet. And 6th means for receiving via the communication line of
The voice actor terminal device receives seventh voice data received from the voice providing service device via a communication line such as the Internet, and voice content uttered by the voice actor for the received text data. An audio content providing system comprising: an eighth means for transmitting to a device via a communication line such as the Internet.

The requesting apparatus further comprises means for transmitting designation information such as a proper name of a voice actor that pronounces sentence data, gender, voice quality, language,
The second means of the voice providing service device pre-registers in the voice actor table a voice actor that utters sentence data based on the designation information such as the proper noun, gender, voice quality, language, etc. of the voice actor received from the client main device. 11. The audio content providing system according to claim 10, wherein the voice content is selected from voice actors.

The client apparatus further comprises means for transmitting information on the client's business type, usage of audio content, and usage environment,
The second means of the voice providing service device is pre-registered in the voice actor table for a voice actor that utters text data based on information of the client's business type, usage of voice content, and usage environment received from the client main device. The audio content providing system according to claim 10, wherein the voice content is selected from voice actors.

13. The voice providing service device further comprises billing means for billing the request source for the usage fee of the voice content after the voice content of the voice actor is transmitted to the request source device of the request source. The audio content providing system according to any one of the above.

A requester device used by a client, a terminal device used by a voice actor, and a voice providing service device that provides a voice content providing service. An audio content providing system mainly provided via a communication line such as the Internet,
Voice providing service device
A first means for receiving text data corresponding to the audio content required by the requester from the requester apparatus via a communication line such as the Internet, and a voice actor in which voice actors for speaking the received text data are registered in the voice actor table in advance. A second means for transmitting text data to be uttered to the selected terminal device of one or more voice actors via a communication line such as the Internet, and the voice actors from the one or more voice actor terminal devices. The third means for receiving the voice content of the sentence data uttered by the voice via a communication line such as the Internet and the voice content for audition of the voice actor received from the terminal device of one or more voice actors as one set The fourth means for transmitting to the client main apparatus via a communication line such as the Internet, and selection of audio content for viewing the voice actor selected by the client Based on the information, a fifth means for transmitting the voice data recording request to the selected voice actor terminal device via a communication line such as the Internet, and the sentence from the voice actor terminal device or other device requested to record. Receiving a recorded audio content of the data via a communication line such as the Internet, and transmitting the content to the requesting main apparatus of the request source via a communication line such as the Internet,
A seventh means for receiving text data from the requester and transmitting the text data to the voice providing service device via a communication line such as a Web; and voice content corresponding to the text data from the voice providing service device. An eighth means for receiving via a communication line such as the Internet; a ninth means for producing the received audio content of one or more voice actors for trial listening; and one or more audio contents for the produced trial listening. Tenth means for transmitting selection information of the voice content of the voice actor selected by the client from among the voice providing service device via a communication line such as the Internet,
The voice actor's terminal device receives eleventh means for receiving text data from the voice providing service device via a communication line such as the Internet, and voice content produced by the voice actor for the received text data. A twelfth means for transmitting the content to the voice providing service device via a communication line such as the Internet; and receiving a voice recording request for the text data from the voice providing service device; An audio content providing system comprising: a thirteenth means for transmitting to a providing service device via a communication line such as the Internet.

15. The voice providing service device further comprises billing means for billing the request source for the usage fee of the recorded voice content after transmitting the voice actor's recorded voice content to the requesting master device. The audio content providing system described in 1.