TWI555349B

TWI555349B - Voice chat device

Info

Publication number: TWI555349B
Application number: TW103116799A
Authority: TW
Inventors: 安砥中
Original assignee: 嘯天科技股份有限公司
Priority date: 2014-05-13
Filing date: 2014-05-13
Publication date: 2016-10-21
Also published as: TW201543842A

Description

Voice chat device

本發明係關於一種語音聊天裝置；特別是指供使用者透過網路與另一語音聊天裝置之使用者通訊之裝置者。 The present invention relates to a voice chat device; and more particularly to a device for a user to communicate with a user of another voice chat device over a network.

習知的語音聊天裝置10請參閱圖1所示，其係包括一輸入模組11，一編碼器12、一解碼器13、一輸出模組14以及一迴音消除模組15，藉由該語音聊天裝置10，使用者得以透過網路16而與另一使用者(對話方)通訊。使用者之聲音係由該輸入模組11接收為一類比音訊111，並透過該編碼器12將該類比音訊111轉換為一數位音訊121後，經由網路16傳送至對話方，而對話方之聲音亦經同樣流程而傳送至使用者之語音聊天裝置10，透過該解碼器13接收該對話方之數位訊號161，並轉換為一類比訊號131後，透過該輸出模組14播出；而為了避免該輸出模組14播出之聲音141被該輸入模組再次接收產生迴音而影響通訊之品質，故習知之語音聊天裝置10會透過該迴音消除模組15對該使用者之類比音訊111進行迴音消除，藉此得到良好的通訊品質。 As shown in FIG. 1 , the conventional voice chat device 10 includes an input module 11 , an encoder 12 , a decoder 13 , an output module 14 , and an echo cancellation module 15 . The chat device 10 allows the user to communicate with another user (interviewer) via the network 16. The user's voice is received by the input module 11 as an analog audio 111, and the analog audio 111 is converted into a digital audio 121 through the encoder 12, and then transmitted to the dialogue party via the network 16, and the dialogue party The voice is also transmitted to the voice chat device 10 of the user through the same process, and the digital signal 161 of the dialogue party is received by the decoder 13 and converted into an analog signal 131, and then broadcasted through the output module 14; The sound 141 broadcasted by the output module 14 is prevented from being received by the input module to generate an echo, which affects the quality of the communication. Therefore, the conventional voice chat device 10 transmits the analog audio 111 to the user through the echo cancellation module 15. Echo cancellation, which leads to good communication quality.

然而，由於網路16的速度並非穩定於定值，且使用者方及對話方之網路速度亦不一定一致，因此，迴音消除模組15難以準確地針對迴音進行消除。 However, since the speed of the network 16 is not stable at a fixed value, and the network speeds of the user side and the conversation party are not necessarily the same, the echo cancellation Except for the module 15, it is difficult to accurately eliminate the echo.

有鑑於此，本發明人潛心構思並更深入研究，終於發明出一種語音聊天裝置。 In view of this, the inventors have conceived and further studied, and finally invented a voice chat device.

本發明提供一種語音聊天裝置，其主要目的是使語音聊天裝置中之迴音消除模組能適應網路延遲之時間，藉此可精準地針對迴音進行消除。 The invention provides a voice chat device, the main purpose of which is to enable the echo cancellation module in the voice chat device to adapt to the time of the network delay, thereby accurately eliminating the echo.

為達前述目的，本發明提供一種語音聊天裝置，用以供使用者端透過網路與至少一遠端裝置進行通訊，該語音聊天裝置包括：一輸入模組，用以接收該使用者端之聲音並產生一輸入類比音訊；一編碼器，與該輸入模組連接，用以將該輸入模組所產生之輸入類比音訊轉換為一輸入數位訊號，並將該輸入數位訊號經由網路傳送至該至少一遠端裝置；一解碼器，用以經由網路接收該至少一遠端裝置所發出之一輸出數位訊號，並將該輸出數位訊號轉換為一輸出類比音訊；一輸出模組，與該解碼器連接，用以於該使用者端播放該輸出類比音訊；一延遲模組，擷取該輸入數位訊號及該輸出數位訊號，以偵測網路之延遲時間，並得出一延遲參考值；以及一迴音消除模組，接受該輸出類比音訊及該延遲模組發出之延遲參考值，並根據該輸出類比音訊及該延遲參考值而對該輸入類比音訊進行迴音消除。 To achieve the foregoing objective, the present invention provides a voice chat device for a user to communicate with at least one remote device through a network, the voice chat device comprising: an input module for receiving the user terminal The sound generates an input analog audio; an encoder is coupled to the input module for converting the input analog audio generated by the input module into an input digital signal, and transmitting the input digital signal to the network via the network to The at least one remote device; a decoder for receiving, by the network, one of the output digital signals sent by the at least one remote device, and converting the output digital signal into an output analog audio; an output module, and The decoder is connected to play the output analog audio to the user end; a delay module captures the input digital signal and the output digital signal to detect a network delay time and obtain a delay reference And an echo cancellation module that accepts the output analog audio and the delay reference value sent by the delay module, and based on the output analog audio and the delay reference value The analog audio input performs echo cancellation.

本發明利用所提供的語音聊天裝置，可以獲得的功效在於：藉由於該語音聊天裝置裝設置一延遲模組，該延遲模組係用以擷取該輸入數位訊號及該輸出數位訊號，以偵測網路之延遲時間，並得出一延遲參考值後，將該延遲參考值送至迴音消除模組，該迴音消除模組則根據該輸出類比音訊及該延遲參考值而對該輸入類比音訊進行迴音消除，藉此可精準地針對目標迴音進行消除。 The invention can be obtained by using the provided voice chat device The effect is that the delay module is configured to capture the input digital signal and the output digital signal to detect the delay time of the network and obtain a delay reference. After the value is sent, the delay reference value is sent to the echo cancellation module, and the echo cancellation module performs echo cancellation on the input analog audio signal according to the output analog audio signal and the delay reference value, thereby accurately performing the target echo eliminate.

有關本發明為達成上述目的，所採用之技術、手段及其他之功效，茲舉一較佳可行實施例並配合圖式詳細說明如后。 The present invention has been described in connection with the preferred embodiments of the present invention in accordance with the accompanying drawings.

[study]

10‧‧‧語音聊天裝置 10‧‧‧Voice chat device

11‧‧‧輸入模組 11‧‧‧Input module

111‧‧‧類比音訊 111‧‧‧ analog audio

12‧‧‧編碼器 12‧‧‧Encoder

121‧‧‧數位音訊 121‧‧‧Digital audio

13‧‧‧解碼器 13‧‧‧Decoder

131‧‧‧類比訊號 131‧‧‧ analog signal

14‧‧‧輸出模組 14‧‧‧Output module

141‧‧‧聲音 141‧‧‧ Sound

15‧‧‧迴音消除模組 15‧‧‧Echo Cancellation Module

16‧‧‧網路 16‧‧‧Network

161‧‧‧數位訊號 161‧‧‧ digital signal

〔this invention〕

2‧‧‧使用者端 2‧‧‧User side

20‧‧‧語音聊天裝置 20‧‧‧Voice chat device

21‧‧‧輸入模組 21‧‧‧ Input Module

211‧‧‧輸入類比音訊 211‧‧‧ Input analog audio

22‧‧‧編碼器 22‧‧‧Encoder

221‧‧‧輸入數位訊號 221‧‧‧Enter digital signal

23‧‧‧解碼器 23‧‧‧Decoder

231‧‧‧輸出類比音訊 231‧‧‧ Output analog audio

24‧‧‧輸出模組 24‧‧‧Output module

241‧‧‧播放聲音 241‧‧‧Play sound

25‧‧‧延遲模組 25‧‧‧Delay module

251‧‧‧延遲參考值 251‧‧‧Delay reference value

26‧‧‧迴音消除模組 26‧‧‧Echo Cancellation Module

3‧‧‧網路 3‧‧‧Network

4‧‧‧遠端裝置 4‧‧‧ Remote device

411‧‧‧輸出數位訊號 411‧‧‧Output digital signal

圖1係習知語音聊天裝置之示意圖。 1 is a schematic diagram of a conventional voice chat device.

圖2係本發明實施例之示意圖。 2 is a schematic view of an embodiment of the present invention.

圖3係本發明實施例之示意圖，係顯示出多方通訊時之態樣。 FIG. 3 is a schematic diagram of an embodiment of the present invention, showing a state of multi-party communication.

在本發明被詳細描述之前，要注意的是在以下的說明內容中，類似的元件是以相同的編號來表示。 Before the present invention is described in detail, it is noted that in the following description, similar elements are denoted by the same reference numerals.

為使貴審查委員對本發明之目的、特徵及功效能夠有更進一步之瞭解與認識，以下茲請配合【圖式簡單說明】詳述如后：本發明語音聊天裝置的較佳實施例如圖2所示，其係用以供使用者端2透過網路3與一遠端裝置4進行通訊，該使用者端2及該遠端裝置4係均為一電腦系統，該語音聊天裝置20係設於該使用者端2，包括：一輸入模組21、一編碼器22、一解碼器23、一輸出模組24、一延遲模組25以及一迴音消除模組26，其中：該輸入模組21，如麥克風，用以接收該使用者端2之聲音並產生一輸入類比音訊211，值得注意的是，該使用者端2之聲音除使用者講話的聲音、使用者所處環境之聲音外，更包含了由該輸出模組24所發出之來自該遠端裝置4之聲音，而該輸出模組24所發出之來自該遠端裝置4之聲音，則為該迴音消除模組26所欲消除之迴音，藉以避免回授之發生；該編碼器22係使用OPUS格式或AAC(Advanced Audio Coding，AAC，進階聲音編碼)格式，其中，OPUS格式是指一種本領域熟知的完全開放與無版權費及多用途的聲音編碼方式；該編碼器22係與該輸入模組21連接，用以將該輸入模組21所產生之輸入類比音訊211轉換為一輸入數位訊號221，並將該輸入數位訊號經221由網路3傳送至該遠端裝置4；該解碼器23係使用OPUS格式或AAC格式，用以經由網路3接收該遠端裝置4所發出之一輸出數位訊號411，並將該輸出數位訊號411轉換為一輸出類比音訊231；該輸出模組24，如喇叭，係與該解碼器23連接，用以於該使用者端2播放該輸出類比音訊231以成為一播放聲音241；該延遲模組25擷取該輸入數位訊號221及該輸出數位訊號411，以偵測網路3之延遲時間，並得出一延遲參考值251，該延遲參考值251係與該網路3之延遲時間長短相關；迴音消除模組26接受該輸出類比音訊231及該延遲模組25發出之延遲參考值251，並根據該輸出類比音訊231及該延遲參考值251而對該輸入類比音訊231進行迴音消除。 In order to enable the reviewing committee to have a better understanding and understanding of the purpose, features and functions of the present invention, the following is a detailed description of the following: a preferred embodiment of the voice chat device of the present invention is as shown in FIG. The user terminal 2 is configured to communicate with a remote device 4 through the network 3, The user terminal 2 and the remote device 4 are both a computer system. The voice chat device 20 is disposed on the user terminal 2, and includes an input module 21, an encoder 22, and a decoder 23. An output module 24, a delay module 25, and an echo cancellation module 26, wherein the input module 21, such as a microphone, receives the sound of the user terminal 2 and generates an input analog audio 211. The sound of the user terminal 2 includes the sound from the remote device 4 emitted by the output module 24 in addition to the sound of the user's speech and the sound of the user's environment, and the output is output. The sound from the remote device 4 sent by the module 24 is the echo that the echo cancellation module 26 wants to eliminate, so as to avoid the occurrence of feedback; the encoder 22 uses the OPUS format or AAC (Advanced Audio Coding). , AAC, Advanced Voice Coding) format, wherein the OPUS format refers to a completely open and royalty-free and versatile voice coding method well known in the art; the encoder 22 is connected to the input module 21 for The input generated by the input module 21 The analog audio 211 is converted into an input digital signal 221, and the input digital signal is transmitted from the network 3 to the remote device 4 via 221; the decoder 23 uses the OPUS format or the AAC format for receiving via the network 3. One of the remote devices 4 outputs a digital signal 411, and converts the output digital signal 411 into an output analog audio 231; the output module 24, such as a speaker, is coupled to the decoder 23. The output analog audio 231 is played by the user terminal 2 to be a playback sound 241. The delay module 25 captures the input digital signal 221 and the output digital signal 411 to detect the delay time of the network 3. And obtaining a delay reference value 251, wherein the delay reference value 251 is related to the delay time of the network 3; the echo cancellation module 26 receives the output analog audio 231 and the delay reference value 251 sent by the delay module 25, And the input analog audio 231 is echo-cancelled according to the output analog audio 231 and the delay reference value 251.

以上所述為本發明實施例主要構件及其組態說明。至於本發明實施例的使用方式及功效，請參閱圖2所示，通常根據本發明，為了使該語音聊天裝置20得到網路3的延遲數值，因此，需先取得該語音聊天裝置20上傳及接收之網路封包以進行比對。該語音聊天裝置20上傳至網路3之封包係由該輸入裝置21透過該編碼器22所得出之輸入數位訊號221，而該接收之網路封包則來自該網路3傳送至該解碼器23之輸出數位訊號411，藉此該延遲模組25可得出一延遲參考值251供迴音消除模組26參考以精準地針對目標迴音進行消除。 The above description is the main components of the embodiment of the present invention and their configuration description. As shown in FIG. 2, in accordance with the present invention, in order to enable the voice chat device 20 to obtain the delay value of the network 3, the voice chat device 20 needs to be uploaded first. The received network packets are compared for comparison. The packet that the voice chat device 20 uploads to the network 3 is the input digital signal 221 obtained by the input device 21 through the encoder 22, and the received network packet is transmitted from the network 3 to the decoder 23. The digital signal 411 is output, whereby the delay module 25 can derive a delay reference value 251 for the echo cancellation module 26 to reference to accurately eliminate the target echo.

需說明的是，前述之目標迴音係指由該輸出模組所播出之播放聲音241，而又被該輸入模組21接收者，該迴音消除模組26之目的即在於將該輸入模組21所產生之輸入類比音訊211中，將該播放聲音241消除。 It should be noted that the foregoing target echo refers to the playing sound 241 broadcasted by the output module, and is received by the input module 21, the back The purpose of the tone cancellation module 26 is to eliminate the playback sound 241 from the analog analog audio 211 generated by the input module 21.

除此之外，請參閱圖3所示，其係顯示出本發明於多方通訊時示意圖，其中，使用者端2之語音聊天裝置20係透過網路3而與複數個遠端裝置4通訊，然而，雖然是與複數個遠端裝置4通訊，但由於延遲模組25係設於該使用者端2，且擷取該網路3送自該使用者端之輸出數位訊號411，以偵測網路3之延遲時間，因此可知，本發明無論在與單一個遠端裝置4或複數個遠端裝置4通訊時，均可正常地使用。 In addition, please refer to FIG. 3, which shows a schematic diagram of the present invention in multi-party communication, wherein the voice chat device 20 of the user terminal 2 communicates with a plurality of remote devices 4 through the network 3. However, although it is in communication with a plurality of remote devices 4, the delay module 25 is disposed at the user terminal 2, and the network 3 is sent from the output signal 411 of the user terminal to detect The delay time of the network 3, therefore, it can be seen that the present invention can be used normally when communicating with a single remote device 4 or a plurality of remote devices 4.

由上述得知本發明確實符合「具有產業可利用性」、「新穎性」、「進步性」，爰依法提出發明專利申請，祈請惠予審查並早日賜准專利，實感德便。 From the above, it is known that the present invention truly conforms to "industrial availability," "novelty," and "progressiveness", and submits an invention patent application in accordance with the law, praying for review and early granting of a patent, and it is truly sensible.

2‧‧‧使用者端 2‧‧‧User side

20‧‧‧語音聊天裝置 20‧‧‧Voice chat device

21‧‧‧輸入模組 21‧‧‧ Input Module

211‧‧‧輸入類比音訊 211‧‧‧ Input analog audio

22‧‧‧編碼器 22‧‧‧Encoder

221‧‧‧輸入數位訊號 221‧‧‧Enter digital signal

23‧‧‧解碼器 23‧‧‧Decoder

231‧‧‧輸出類比音訊 231‧‧‧ Output analog audio

24‧‧‧輸出模組 24‧‧‧Output module

241‧‧‧播放聲音 241‧‧‧Play sound

25‧‧‧延遲模組 25‧‧‧Delay module

251‧‧‧延遲參考值 251‧‧‧Delay reference value

26‧‧‧迴音消除模組 26‧‧‧Echo Cancellation Module

3‧‧‧網路 3‧‧‧Network

4‧‧‧遠端裝置 4‧‧‧ Remote device

411‧‧‧輸出數位訊號 411‧‧‧Output digital signal

Claims

A voice chat device, configured for the user end to communicate with at least one remote device through the network, the voice chat device comprising: an input module for receiving the sound of the user end and generating an input analog audio; An encoder is coupled to the input module for converting the input analog audio generated by the input module into an input digital signal, and transmitting the input digital signal to the at least one remote device via the network; The decoder is configured to receive, by the network, one of the output digital signals sent by the at least one remote device, and convert the output digital signal into an output analog audio; an output module is connected to the decoder for The user terminal plays the output analog audio; a delay module captures the input digital signal and the output digital signal to detect a network delay time and obtain a delay reference value; and an echo cancellation module Receiving the output analog audio and the delay reference value sent by the delay module, and returning the input analog audio according to the output analog audio and the delay reference value Eliminated.

The voice chat device of claim 1, wherein the encoder and the decoder use an OPUS format of a voice coding method.

The voice chat device of claim 1, wherein the encoder and the decoder use an Advanced Voice Coding (AAC) format.