JP2021107873A5 - - Google Patents

Download PDF

Info

Publication number
JP2021107873A5
JP2021107873A5 JP2019239264A JP2019239264A JP2021107873A5 JP 2021107873 A5 JP2021107873 A5 JP 2021107873A5 JP 2019239264 A JP2019239264 A JP 2019239264A JP 2019239264 A JP2019239264 A JP 2019239264A JP 2021107873 A5 JP2021107873 A5 JP 2021107873A5
Authority
JP
Japan
Prior art keywords
customer
receiver
operator
server
uttered voice
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2019239264A
Other languages
Japanese (ja)
Other versions
JP2021107873A (en
Filing date
Publication date
Application filed filed Critical
Priority to JP2019239264A priority Critical patent/JP2021107873A/en
Priority claimed from JP2019239264A external-priority patent/JP2021107873A/en
Publication of JP2021107873A publication Critical patent/JP2021107873A/en
Publication of JP2021107873A5 publication Critical patent/JP2021107873A5/ja
Pending legal-status Critical Current

Links

Description

また、本開示は、映像およびオペレータの発話音声をオペレータ端末から受信して出力する受信機と、サーバとにより構成される音声特性変更システムにより実行される音声特性変更方法であって、前記受信機により、前記映像および前記発話音声を視聴する顧客を撮像するカメラを有し、前記カメラにより撮像された前記顧客の撮像画像を取得するステップと、前記サーバにより、前記受信機から送られた前記顧客の撮像画像に基づいて、前記顧客の前記映像および前記発話音声に対する感情を示す感情データを導出するステップと、前記サーバにより、前記顧客の前記感情データの導出結果に基づいて、前記オペレータの発話音声の特性の変更に関する処理指示を生成して前記受信機に送るステップと、前記受信機により、前記サーバから送られた前記処理指示に基づいて、前記オペレータの発話音声の特性を変更して出力するステップと、を有する、音声特性変更方法を提供する。また、本開示は、映像およびオペレータの発話音声をオペレータ端末から受信して出力する受信機と通信可能に接続される音声特性変更装置であって、前記映像および前記発話音声を視聴する顧客を撮像するカメラと接続された前記受信機から、前記カメラにより撮像された前記顧客の撮像画像を取得し、前記受信機から送られた前記顧客の撮像画像に基づいて、前記顧客の前記映像および前記発話音声に対する感情を示す感情データを導出し、前記顧客の前記感情データの導出結果に基づいて、前記オペレータの発話音声の特性の変更に関する処理指示を生成して前記受信機に送る、音声特性変更装置を提供する。 Further, the present disclosure is an audio characteristic changing method executed by an audio characteristic changing system configured by a receiver that receives and outputs video and an operator's uttered voice from an operator terminal, and a server, wherein the receiver a step of obtaining a captured image of the customer captured by the camera, and the customer sent from the receiver by the server; a step of deriving emotion data indicating the emotion of the customer with respect to the video and the uttered voice based on the captured image of the operator; a step of generating a processing instruction for changing the characteristics of the operator and sending it to the receiver, and changing and outputting the characteristics of the operator's uttered voice by the receiver based on the processing instruction sent from the server and a method for modifying audio characteristics. Further, the present disclosure is an audio characteristic changing device communicably connected to a receiver that receives and outputs video and an operator's uttered voice from an operator terminal, and captures a customer viewing the video and the uttered voice. a captured image of the customer captured by the camera is acquired from the receiver connected to the camera connected to the receiver, and based on the captured image of the customer sent from the receiver, the image and the utterance of the customer are obtained. A voice characteristic changing device for deriving emotion data indicating an emotion toward voice, and based on the derivation result of the emotion data of the customer, generating a processing instruction for changing the characteristic of the voice uttered by the operator and transmitting the processing instruction to the receiver. I will provide a.

Claims (12)

映像およびオペレータの発話音声をオペレータ端末から受信して出力する受信機と、サーバとが通信可能に接続される音声特性変更システムであって、
前記受信機は、
前記映像および前記発話音声を視聴する顧客を撮像するカメラと接続され、前記カメラにより撮像された前記顧客の撮像画像を取得して前記サーバに送り、
前記サーバは、
前記受信機から送られた前記顧客の撮像画像に基づいて、前記顧客の前記映像および前記発話音声に対する感情を示す感情データを導出し、
前記顧客の前記感情データの導出結果に基づいて、前記オペレータの発話音声の特性の変更に関する処理指示を生成して前記受信機に送り、
前記受信機は、
前記サーバから送られた前記処理指示に基づいて、前記オペレータの発話音声の特性を変更して出力する、
音声特性変更システム。
An audio characteristic changing system in which a receiver that receives and outputs video and an operator's uttered voice from an operator terminal and a server are communicably connected,
The receiver is
connected to a camera that captures an image of the customer viewing the video and the uttered voice, acquires an image of the customer captured by the camera, and sends the captured image to the server;
The server is
deriving emotion data indicating the customer's emotion toward the video and the uttered voice based on the captured image of the customer sent from the receiver;
Based on the derivation result of the emotion data of the customer, generating a processing instruction for changing characteristics of the operator's uttered voice and sending it to the receiver;
The receiver is
Based on the processing instruction sent from the server, the characteristics of the operator's uttered voice are changed and output.
Voice characteristic change system.
前記受信機は、
前記顧客の発話音声を収音するマイクと接続され、前記マイクにより収音された前記顧客の発話音声を取得して前記サーバに送り、
前記サーバは、
前記受信機から送られた前記顧客の撮像画像または前記顧客の発話音声に基づいて、前記顧客の前記感情データを導出する、
請求項1に記載の音声特性変更システム。
The receiver is
connected to a microphone that picks up the customer's uttered voice, acquires the customer's uttered voice picked up by the microphone, and sends it to the server;
The server is
Deriving the emotional data of the customer based on the customer's captured image or the customer's uttered voice sent from the receiver;
2. A system for modifying audio characteristics according to claim 1.
前記サーバは、
前記顧客の前記感情データが怒りを示すと判定した場合に、前記オペレータの発話音声の語尾部分のピッチを下げる旨の前記処理指示を生成する、
請求項1に記載の音声特性変更システム。
The server is
when it is determined that the emotional data of the customer indicates anger, generating the processing instruction to lower the pitch of the ending part of the operator's uttered voice;
2. A system for modifying audio characteristics according to claim 1.
前記サーバは、
前記顧客の前記感情データが怒りを示すと判定した場合に、前記オペレータによる発話の継続の中止を促すアドバイス情報を生成して前記オペレータ端末に送信し、
前記オペレータ端末は、
前記サーバから送られた前記アドバイス情報を受信して表示する、
請求項1に記載の音声特性変更システム。
The server is
when determining that the emotion data of the customer indicates anger, generating advice information prompting the operator to stop continuation of speech and transmitting the advice information to the operator terminal;
The operator terminal is
receiving and displaying the advice information sent from the server;
2. A system for modifying audio characteristics according to claim 1.
前記サーバは、
前記顧客の前記感情データが悩みを示すと判定した場合に、前記オペレータの発話音声のボリュームを上げる旨の前記処理指示を生成する、
請求項1に記載の音声特性変更システム。
The server is
generating the processing instruction to increase the volume of the operator's uttered voice when it is determined that the emotion data of the customer indicates distress;
2. A system for modifying audio characteristics according to claim 1.
前記サーバは、
前記受信機から送られた前記顧客の撮像画像および前記顧客の発話音声の両方に基づいて、前記顧客の前記感情データを導出する、
請求項2に記載の音声特性変更システム。
The server is
Deriving the emotional data of the customer based on both the customer's captured image and the customer's spoken voice sent from the receiver;
3. A system for modifying audio characteristics according to claim 2.
前記受信機は、前記オペレータとの間の対話を支援する対面型情報提供装置である、
請求項1~6のうちいずれか一項に記載の音声特性変更システム。
The receiver is a face-to-face information providing device that supports dialogue with the operator,
A system for modifying audio characteristics according to any one of claims 1-6.
前記受信機は、家庭内に配置されるテレビジョン受像機である、
請求項1~5のうちいずれか一項に記載の音声特性変更システム。
The receiver is a television receiver placed in the home,
A system for modifying audio characteristics according to any one of claims 1-5.
前記受信機は、複数の前記家庭内のそれぞれに少なくとも1台が配置され、
前記サーバは、前記家庭内の受信機ごとに、前記オペレータの発話音声の特性の変更に関する異なる処理指示を生成して対応する前記受信機に送る、
請求項8に記載の音声特性変更システム。
At least one receiver is arranged in each of the plurality of homes,
wherein the server generates different processing instructions for changing characteristics of the operator's spoken voice for each receiver in the home and sends them to the corresponding receiver;
9. A system for modifying audio characteristics according to claim 8.
前記受信機は、
前記受信機から出力される前記映像および前記発話音声を視聴する顧客が複数名である場合、所定の前記感情データの導出結果に基づいて、前記オペレータの発話音声の特性の
変更に関する処理指示を生成する、
請求項8に記載の音声特性変更システム。
The receiver is
When there are a plurality of customers viewing the video and the uttered voice output from the receiver, a processing instruction for changing the characteristics of the operator's uttered voice is generated based on the derivation result of the predetermined emotion data. do,
9. A system for modifying audio characteristics according to claim 8.
映像およびオペレータの発話音声をオペレータ端末から受信して出力する受信機と、サーバとにより構成される音声特性変更システムにより実行される音声特性変更方法であって、
前記受信機により、前記映像および前記発話音声を視聴する顧客を撮像するカメラを有し、前記カメラにより撮像された前記顧客の撮像画像を取得するステップと、
前記サーバにより、前記受信機から送られた前記顧客の撮像画像に基づいて、前記顧客の前記映像および前記発話音声に対する感情を示す感情データを導出するステップと、
前記サーバにより、前記顧客の前記感情データの導出結果に基づいて、前記オペレータの発話音声の特性の変更に関する処理指示を生成して前記受信機に送るステップと、
前記受信機により、前記サーバから送られた前記処理指示に基づいて、前記オペレータの発話音声の特性を変更して出力するステップと、を有する、
音声特性変更方法。
An audio characteristic changing method executed by an audio characteristic changing system composed of a receiver that receives and outputs video and an operator's uttered voice from an operator terminal and a server,
a step of obtaining a captured image of the customer captured by the camera, wherein the receiver has a camera that captures an image of the customer viewing the video and the uttered voice;
a step of deriving, by the server, emotion data indicating the customer's emotion toward the video and the uttered voice based on the captured image of the customer sent from the receiver;
a step of generating, by the server, a processing instruction for changing characteristics of the operator's uttered voice based on the derivation result of the emotion data of the customer, and transmitting the processing instruction to the receiver;
a step of changing and outputting characteristics of the operator's uttered voice by the receiver based on the processing instruction sent from the server;
How to change voice characteristics.
映像およびオペレータの発話音声をオペレータ端末から受信して出力する受信機と通信可能に接続される音声特性変更装置であって、An audio characteristic changing device communicatively connected to a receiver that receives and outputs video and an operator's uttered voice from an operator terminal,
前記映像および前記発話音声を視聴する顧客を撮像するカメラと接続された前記受信機から、前記カメラにより撮像された前記顧客の撮像画像を取得し、obtaining an image of the customer captured by the camera from the receiver connected to the camera that captures the customer viewing the video and the uttered voice;
前記受信機から送られた前記顧客の撮像画像に基づいて、前記顧客の前記映像および前記発話音声に対する感情を示す感情データを導出し、deriving emotion data indicating the customer's emotion toward the video and the uttered voice based on the captured image of the customer sent from the receiver;
前記顧客の前記感情データの導出結果に基づいて、前記オペレータの発話音声の特性の変更に関する処理指示を生成して前記受信機に送る、Based on the derivation result of the emotion data of the customer, a processing instruction for changing characteristics of the operator's uttered voice is generated and sent to the receiver.
音声特性変更装置。Voice characteristic modifier.
JP2019239264A 2019-12-27 2019-12-27 Voice characteristic change system and voice characteristic change method Pending JP2021107873A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2019239264A JP2021107873A (en) 2019-12-27 2019-12-27 Voice characteristic change system and voice characteristic change method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2019239264A JP2021107873A (en) 2019-12-27 2019-12-27 Voice characteristic change system and voice characteristic change method

Publications (2)

Publication Number Publication Date
JP2021107873A JP2021107873A (en) 2021-07-29
JP2021107873A5 true JP2021107873A5 (en) 2022-12-23

Family

ID=76967866

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2019239264A Pending JP2021107873A (en) 2019-12-27 2019-12-27 Voice characteristic change system and voice characteristic change method

Country Status (1)

Country Link
JP (1) JP2021107873A (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP4145444A1 (en) * 2021-09-07 2023-03-08 Avaya Management L.P. Optimizing interaction results using ai-guided manipulated speech

Similar Documents

Publication Publication Date Title
JP6791356B2 (en) Control method of voice terminal, voice command generation system, and voice command generation system
JP5533854B2 (en) Speech recognition processing system and speech recognition processing method
KR101825569B1 (en) Technologies for audiovisual communication using interestingness algorithms
JP6400445B2 (en) Conversation analyzer, conversation analysis system, conversation analysis method, and conversation analysis program
JP2018156044A (en) Voice recognition device, voice recognition method, and voice recognition program
US10089980B2 (en) Sound reproduction method, speech dialogue device, and recording medium
US10283114B2 (en) Sound conditioning
JP7427408B2 (en) Information processing device, information processing method, and information processing program
US20190385589A1 (en) Speech Processing Device, Teleconferencing Device, Speech Processing System, and Speech Processing Method
JP2019220848A (en) Data processing apparatus, data processing method and program
JP2013042356A (en) Image processor, image processing method and program
KR101376292B1 (en) Method and apparatus for providing emotion analysis service during telephone conversation
JP2021107873A5 (en)
KR101874836B1 (en) Display apparatus, hearing level control apparatus and method for correcting sound
JP2011205353A (en) Viewing situation recognition device, and viewing situation recognition system
WO2017067319A1 (en) Information transmission method and apparatus, and terminal
JP2016206646A (en) Voice reproduction method, voice interactive device, and voice interactive program
US20170289712A1 (en) A method for operating a hearing system as well as a hearing system
JP2019176375A (en) Moving image output apparatus, moving image output method, and moving image output program
CN114400013A (en) Speaker prediction method, speaker prediction device, and communication system
KR101892268B1 (en) method and apparatus for controlling mobile in video conference and recording medium thereof
JP2018081147A (en) Communication device, server, control method and information processing program
JP2010164992A (en) Speech interaction device
WO2018088210A1 (en) Information processing device and method, and program
KR20210054246A (en) Electorinc apparatus and control method thereof