CN110351690B - Intelligent voice system and voice processing method thereof - Google Patents

Intelligent voice system and voice processing method thereof Download PDF

Info

Publication number
CN110351690B
CN110351690B CN201810300017.7A CN201810300017A CN110351690B CN 110351690 B CN110351690 B CN 110351690B CN 201810300017 A CN201810300017 A CN 201810300017A CN 110351690 B CN110351690 B CN 110351690B
Authority
CN
China
Prior art keywords
voice
intelligent
audio signal
microphone array
a2dp
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810300017.7A
Other languages
Chinese (zh)
Other versions
CN110351690A (en
Inventor
陶永耀
肖凯平
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Actions Technology Co Ltd
Original Assignee
Actions Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Actions Technology Co Ltd filed Critical Actions Technology Co Ltd
Priority to CN201810300017.7A priority Critical patent/CN110351690B/en
Publication of CN110351690A publication Critical patent/CN110351690A/en
Application granted granted Critical
Publication of CN110351690B publication Critical patent/CN110351690B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/34Adaptation of a single recogniser for parallel processing, e.g. by use of multiple processors or cloud computing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W4/00Services specially adapted for wireless communication networks; Facilities therefor
    • H04W4/80Services using short range communication, e.g. near-field communication [NFC], radio-frequency identification [RFID] or low energy communication
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W76/00Connection management
    • H04W76/10Connection setup
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02166Microphone arrays; Beamforming

Abstract

The embodiment of the invention provides an intelligent voice system, which comprises a Bluetooth terminal and intelligent equipment; the Bluetooth terminal comprises a microphone array, a voice preprocessing device, a first transmission device and a playback device; the intelligent device comprises a second transmission device and an intelligent processor; the first transmitting device comprises a first BLE communications device and a first A2DP communications device, the second transmitting device comprises a second BLE communications device and a second A2DP communications device; the voice preprocessing device is connected with the microphone array and used for preprocessing a first audio signal acquired by the microphone array and then sending the preprocessed first audio signal to the second BLE communication device through the first BLE communication device; the playback device is connected to the first A2DP communication device for receiving a second audio signal transmitted by the second A2DP communication device via the first A2DP communication device. And the non-delay voice acquisition can be realized.

Description

Intelligent voice system and voice processing method thereof
Technical Field
The invention relates to the technical field of multimedia, in particular to an intelligent voice system and a voice processing method thereof.
Background
With the advent of voice human-computer interaction interfaces, more and more products are required to interact with intelligent voice. The intelligent voice interaction products in the current market are all products based on wifi, but wifi power consumption is large, and portability is poor. Moreover, the development of smart phones has formed a portable computing center. Therefore, an intelligent voice interaction product which relies on the Bluetooth technology to realize portability can be an important development trend. And the traditional mode of transmitting voice between the Bluetooth and the mobile phone is the classic Bluetooth mode, so that the problem of frequent switching of the classic Bluetooth mode of the mobile phone can be caused, and the experience is poor.
The Audio transmission of the classic bluetooth is realized by two bluetooth profiles, HFP (Hands-free Profile) and A2DP (Advanced Audio Distribution Profile, bluetooth Audio transmission model protocol). HFP is used in a telephone call scenario featuring real-time two-way voice communication, while A2DP is used in a listening stereo music scenario featuring one-way audio push. The two bluetooth audio communications occupy the classic bluetooth channel and in the current implementation are switched from one to the other depending on the scene. Therefore, the current voice requirement of the bluetooth audio product is to perform voice data acquisition by switching the mode from the A2DP scene listening to songs to the HFP scene, which has two problems, 1, establishing a new system link requires time, and needs to wait for approximately 2 seconds, which is very poor in experience. 2, the voice data needs to be collected and transmitted immediately after a collection instruction is sent, and the former data is lost due to the old mode switching mode. Since the access mode of the smart voice of the old handset is accessed by HFP of the classic bluetooth, the A2DP mode is used when music is played. When a smart voice input requirement is initiated during music playing, the mode needs to be switched from the A2DP mode to the HFP mode, and the time for the mobile phone and the bluetooth terminal to establish HFP switching needs to be close to 2 seconds. And old implementation mode can't gather pronunciation and upload data when establishing the bluetooth connection, waits for 2 seconds the link to be established, and this 2 seconds's pronunciation data is not gathered, so produce the data loss, and experience the not good condition.
A new speech processing technique is needed to address the deficiencies of the prior art.
Disclosure of Invention
Based on the above problems, the embodiment of the present invention provides an intelligent voice system, which directly implements a function of returning voice without switching bluetooth services on an original bluetooth audio channel of A2DP, and implements non-delay voice acquisition.
The embodiment of the invention is realized in such a way that the intelligent voice system comprises a Bluetooth terminal and intelligent equipment; the Bluetooth terminal comprises a microphone array, a voice preprocessing device, a first transmission device and a playback device; the intelligent device comprises a second transmission device and an intelligent processor; the first transmission device comprises a first BLE (Bluetooth Low Energy) communication device and a first A2DP communication device, and the second transmission device comprises a second BLE communication device and a second A2DP communication device; the voice preprocessing device is connected with the microphone array and used for preprocessing a first audio signal acquired by the microphone array and then sending the preprocessed first audio signal to the second BLE communication device through the first BLE communication device; the playback device is connected to the first A2DP communication device for receiving a second audio signal transmitted by the second A2DP communication device via the first A2DP communication device.
Further, the first BLE communication device transmits a first audio signal with the second BLE communication device while the first A2DP communication device transmits a second audio signal with the second A2DP communication device.
Furthermore, the intelligent voice system further comprises a voice cloud server, and the voice cloud server is in remote communication with the intelligent device and acquires the first audio signal sent by the intelligent device, so as to process the first audio signal.
Further, data transmission is carried out between the voice cloud server and the intelligent device through a wireless network.
Further, the microphone array is an analog microphone array or a digital microphone array, and the microphone array comprises 1-8 microphones.
Further, the intelligent device is a smart phone, a tablet computer, a smart television or a smart set-top box.
Further, the voice preprocessing apparatus further includes:
the awakening device is connected with the microphone array and used for awakening the voice preprocessing device and the first transmission device;
the noise reduction device is connected between the microphone array and the first transmission device and is used for carrying out noise reduction processing on the acquired audio signals;
the beam forming device is connected with the microphone array and used for enhancing the voice acquisition in a specific direction;
and the echo cancellation device is connected between the noise reduction device and the first transmission device and is used for carrying out echo cancellation processing on the acquired audio signal.
Further, the smart device further includes:
the awakening device is used for awakening the voice preprocessing device and the first transmission device;
the noise reduction device is used for carrying out noise reduction processing on the acquired audio signal;
and the beam forming device is used for enhancing the voice acquisition in a specific direction.
Further, the first audio signal is a voice control signal.
According to another aspect of the embodiments of the present invention, an embodiment of the present invention further provides a voice processing method for an intelligent voice system, which directly implements a function of returning voice without performing bluetooth service switching on an original bluetooth audio path of A2DP, and implements non-delay voice acquisition.
The embodiment of the invention is realized in such a way that a voice processing method in an intelligent voice system comprises the following steps: (1) a microphone array of the Bluetooth terminal acquires a first audio signal and sends the first audio signal to a voice preprocessing device; (2) the voice preprocessing device transmits the first audio signal to intelligent equipment through a BLE transmission protocol after preprocessing; (3) and the intelligent equipment processes the first audio signal and then returns a control signal to the voice preprocessing device.
Further, the first audio signal is transmitted to the smart device through the BLE transmission protocol, and the second audio signal is transmitted to the bluetooth terminal through the A2DP protocol by the smart device.
By adopting the technical scheme, the method has the following beneficial effects: the embodiment of the invention provides an intelligent voice transmission technology based on ble, which does not conflict with classic Bluetooth and does not need the old system switching working mode of the classic Bluetooth to realize the function of voice acquisition. The product form of realization is bluetooth audio equipment such as bluetooth speaker and bluetooth headset, makes it have more smooth interactive experience.
Drawings
FIG. 1 is a block diagram of an intelligent speech system provided in accordance with one embodiment of the present invention;
FIG. 2 is a block diagram of an intelligent speech system according to another embodiment of the present invention;
fig. 3 is a flowchart of a speech processing method in an intelligent speech system according to another embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
An embodiment of the present invention provides an intelligent voice system, and referring to fig. 1, which is a structural block diagram of the intelligent voice system provided in the embodiment of the present invention, and the intelligent voice system includes a bluetooth terminal 1 and an intelligent device 2; the Bluetooth terminal 1 comprises a microphone array 101, a voice preprocessing device 103, a first transmission device 102 and a playback device 104; the intelligent device 2 comprises a second transmission device 201 and an intelligent processor 202; the first transmission device 102 comprises a first BLE communications device 2021 and a first A2DP communications device 1022, and the second transmission device 201 comprises a second BLE communications device 2011 and a second A2DP communications device 2012; the voice preprocessing device 103 is connected to the microphone array 101, and is configured to preprocess a first audio signal acquired by the microphone array 101 and send the preprocessed first audio signal to the second BLE communication device 2011 through the first BLE communication device 1021; the playback device 104 is connected to the first A2DP communication device 1022 for receiving a second audio signal transmitted by the second A2DP communication device 2012 through the first A2DP communication device 1022. While the first A2DP communication device 1022 is in second audio signal transmission with the second A2DP communication device 2012, the first BLE communication device 1021 is in first audio signal transmission with the second BLE communication device 2011.
The intelligent device is an intelligent mobile phone, a tablet personal computer, an intelligent television or an intelligent set top box. The following description will be made in detail by taking a mobile phone as an example.
Specifically, when the mobile phone terminal is connected to a bluetooth terminal (e.g., a bluetooth sound box) in an A2DP mode and plays a song, the mobile phone terminal performs a communication connection with a first A2DP communication device of the bluetooth terminal through a second A2DP communication device to transmit the song (i.e., the second audio signal) of the mobile phone terminal to the bluetooth terminal, and the song is played by a playback device of the bluetooth terminal, where the connection established between the two devices is an A2DP protocol. When the Bluetooth terminal receives a voice control request, namely a first audio signal, the first audio signal is not required to be sent to the mobile phone end through classical Bluetooth transmission at the moment, but data transmission is carried out through a BLE channel, and a first BLE communication device of the Bluetooth terminal is connected with a second BLE communication device of the mobile phone end after preprocessing the first audio signal and sends the first audio signal to the mobile phone end. Because the BLE Bluetooth mode does not conflict with the classic Bluetooth transmission, when the multi-array microphone receives the first audio signal, the first audio signal can be directly transmitted through the BLE channel without waiting for the switching of the classic mode Bluetooth transmission protocol.
The embodiment of the present invention is specifically described below with a specific example, when the mobile phone end is connected to the bluetooth terminal and the bluetooth terminal plays a song in the mobile phone, the connection is established between the mobile phone end and the bluetooth terminal through the A2DP transmission device, the second audio data is transmitted, the audio data is unidirectionally transmitted from the mobile phone end to the bluetooth terminal, and the playback device of the bluetooth terminal decodes the audio data and plays the audio data. At this time, when the user sends a voice instruction to the bluetooth terminal: the voice command is a first audio signal described in the embodiment of the present invention, and at this time, when the multi-array microphone of the bluetooth terminal receives the voice command, since the voice preprocessing device of the bluetooth terminal cannot recognize the voice command, the voice command needs to be transmitted to the mobile phone end or the voice cloud server for recognition processing, so that the bluetooth terminal immediately sends the voice command to the second BLE communication device of the mobile phone end through the first BLE communication device of the bluetooth terminal after performing voice preprocessing through a BLE protocol established between the mobile phone end and the bluetooth terminal, after the intelligent processor of the mobile phone end processes the first audio signal, the instruction is fed back to the bluetooth terminal, and the bluetooth terminal receives the instruction to start playing the song of dunlijun. During the whole BLE transmission and feedback command, the A2DP protocol connection is still established and not terminated, that is, the song is still playing. It will be appreciated that if the voice command is a relatively simple command, for example: playing, stopping, increasing the volume, reducing the volume, answering the call, hanging up the call and the like, and directly carrying out identification processing by the Bluetooth terminal. Only voice commands are complex, such as: please help me find where the nearest gasoline station is? Today is the weather? And when the instruction is received, the instruction needs to be transmitted to a mobile phone end or a voice cloud server for processing.
The invention provides another embodiment, when the intelligent processor at the mobile phone end cannot meet the requirement of voice recognition processing, the voice recognition function of the voice cloud server 3 needs to be further used, the embodiment of the invention further comprises the voice cloud server on the basis of the system, and the voice cloud server is in remote communication with the intelligent equipment and acquires the first audio signal sent by the intelligent equipment for processing the first audio signal. And data transmission is carried out between the voice cloud server and the intelligent equipment through a wireless network. According to the size of the voice calculation amount, generally, simple voice processing can be completed at the mobile phone end intelligent processor, the voice recognition function application of most front-end equipment can be met, and the voice processing can be completed by using the voice cloud server under the condition that the mobile phone end intelligent processor cannot meet the operation or processing.
According to the embodiment of the invention, the microphone array is an analog microphone array or a digital microphone array, and usually the microphone array comprises 1-8 microphones, which is a standard far-field voice acquisition configuration, and usually 2 microphones are used to form the microphone array, and only 1 microphone may be used in some quiet situations.
In another embodiment of the present invention, the voice pre-processing apparatus further comprises:
a wake-up device 1031 connected to the microphone array for waking up the voice preprocessing device and the first transmission device;
a noise reduction device 1032 connected between the microphone array and the first transmission device, for performing noise reduction processing on the acquired audio signal;
a beam forming device 1033 connected with the microphone array for enhancing the voice collection in a specific direction;
an echo cancellation device 1034 connected between the noise reduction device and the first transmission device, and configured to perform echo cancellation processing on the acquired audio signal.
In order to provide further advanced speech processing, as shown in fig. 2, the speech preprocessing unit 103 further comprises a wake-up unit 1031 connected to the microphone array 101 for waking up the speech preprocessing unit 103 and the first transmission unit 102. The voice preprocessing device 103 further includes a noise reduction device 1032 connected between the microphone array 101 and the first transmission device 102 for performing noise reduction processing on the acquired audio signal. The voice preprocessing device 103 further includes a beam forming device 1033 connected to the microphone array 101 for enhancing the voice acquisition in a specific direction by the microphone array 101. The voice pre-processing device 103 further comprises an echo cancellation device 1034 connected between the noise reduction device 1032 and the encoding device 1035, and configured to perform echo cancellation processing on the acquired first audio signal. The awakening device is used for voice signals collected by a microphone array, determining an algorithm for starting voice awakening according to energy or characteristics (zero crossing point detection, spectrum analysis and the like) of human voice, comparing input voice with a maximum likelihood algorithm of a large batch of training sequences in advance, determining whether the voice input is an awakening word, and starting subsequent processing if the voice input is the awakening word. The beam forming device 1033 is used for determining the input direction of the sound signal relative to the microphone array according to the time delay and phase difference of the voice data of each microphone when the multi-microphone voice is input, and determining the parameter of the noise reduction device according to the information. The noise reducer 1032 strengthens or weakens signals in different directions according to noise reduction parameters of a beam forming algorithm or a preset noise reduction directional diagram curve, and highlights the signal strength in the latest direction. Meanwhile, according to the frequency spectrum difference of human voice and environmental sound (periodic noise, music) and the difference of time domain correlation, the signal is processed in frequency domain or time domain, and the human voice is extracted and enhanced from background sound or noise. When the playback module exists, the echo cancellation device 1034 adds the predetermined or predicted transfer function to the decoded data, and cancels the reflected part of the sound emitted from the loudspeaker in the data collected by the microphone, so as to obtain the clean voice without echo.
It can be understood that, when the voice pre-processing device has insufficient computing power and is not suitable for performing the above processing, the wake-up device, the noise reduction device, and the beam forming device may be disposed in the smart device 2 for processing, so as to reduce the computation of the voice pre-processing device 103. The smart device further includes: the awakening device is used for awakening the voice preprocessing device and the first transmission device; the noise reduction device is used for carrying out noise reduction processing on the acquired audio signal; and the beam forming device is used for enhancing the voice acquisition in a specific direction. The above apparatus may be provided in an intelligent processor of an intelligent device.
According to another aspect of the embodiments of the present invention, an embodiment of the present invention further provides a voice processing method for an intelligent voice system, which directly implements a function of returning voice without performing bluetooth service switching on an original bluetooth audio path of A2DP, and implements non-delay voice acquisition.
The embodiment of the present invention is implemented as follows, and a speech processing method in an intelligent speech system, as shown in fig. 3, includes the following steps: (S101) a microphone array of the Bluetooth terminal acquires a first audio signal and sends the first audio signal to a voice preprocessing device; (S102) the voice preprocessing device transmits the first audio signal to intelligent equipment through a BLE transmission protocol after preprocessing; (S103) the intelligent equipment processes the first audio signal and then returns a control signal to the voice preprocessing device.
The first audio signal is transmitted to the intelligent device through a BLE transmission protocol, and the second audio signal is transmitted to the Bluetooth terminal through an A2DP protocol by the intelligent device.
The following description will be made in detail by taking a mobile phone as an example.
Specifically, when the mobile phone terminal is connected to the bluetooth terminal in the A2DP mode and plays a song, the mobile phone terminal performs communication with the first A2DP communication device of the bluetooth terminal through the second A2DP communication device to transmit the song (i.e., the second audio signal) of the mobile phone terminal to the bluetooth terminal and play the song by the playback device of the bluetooth terminal, and at this time, the connection established between the two is the A2DP protocol. A microphone array of the Bluetooth terminal acquires a first audio signal and sends the first audio signal to a voice preprocessing device; the voice preprocessing device transmits the first audio signal to a mobile phone end through a BLE transmission protocol after preprocessing; and the mobile phone end intelligent processor identifies the first audio signal and then returns a control signal to the voice preprocessing device, and the voice preprocessing device acquires the returned control signal and then controls the Bluetooth terminal according to the first audio signal. When the Bluetooth terminal receives a voice control request, namely a first audio signal, the first audio signal is not required to be sent to the mobile phone end through classical Bluetooth transmission at the moment, but data transmission is carried out through a BLE channel, and a first BLE communication device of the Bluetooth terminal is connected with a second BLE communication device of the mobile phone end after preprocessing the first audio signal and sends the first audio signal to the mobile phone end. Because the BLE Bluetooth mode does not conflict with the classic Bluetooth transmission, when the multi-array microphone receives the first audio signal, the first audio signal can be directly transmitted through the BLE channel without waiting for the switching of the classic mode Bluetooth transmission protocol.
Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims (11)

1. An intelligent voice system is characterized by comprising a Bluetooth terminal and intelligent equipment; the Bluetooth terminal comprises a microphone array, a voice preprocessing device, a first transmission device and a playback device; the intelligent device comprises a second transmission device and an intelligent processor; the first transmitting device comprises a first BLE communications device and a first A2DP communications device, the second transmitting device comprises a second BLE communications device and a second A2DP communications device; the voice preprocessing device is connected with the microphone array and used for preprocessing a first audio signal acquired by the microphone array and then sending the preprocessed first audio signal to the second BLE communication device through the first BLE communication device; the playback device is connected to the first A2DP communication device for receiving a second audio signal transmitted by the second A2DP communication device via the first A2DP communication device.
2. The smart speech system of claim 1, wherein the first BLE communication device performs a first audio signal transmission with the second BLE communication device while the first A2DP communication device performs a second audio signal transmission with the second A2DP communication device.
3. The intelligent voice system according to claim 2, further comprising a voice cloud server, wherein the voice cloud server is in remote communication with the intelligent device and obtains the first audio signal sent by the intelligent device for processing the first audio signal.
4. The intelligent voice system according to claim 3, wherein data transmission is performed between the voice cloud server and the intelligent device through a wireless network.
5. The intelligent speech system of claim 1, wherein the microphone array is an analog microphone array or a digital microphone array, and wherein the microphone array comprises 1-8 microphones.
6. The intelligent voice system of claim 1, wherein the intelligent device is a smartphone, a tablet, a smart television, or a smart set-top box.
7. The intelligent voice system according to any one of claims 1 to 6, wherein the voice preprocessing device further comprises:
the awakening device is connected with the microphone array and used for awakening the voice preprocessing device and the first transmission device;
the noise reduction device is connected between the microphone array and the first transmission device and is used for carrying out noise reduction processing on the acquired audio signals;
the beam forming device is connected with the microphone array and used for enhancing the voice acquisition in a specific direction;
and the echo cancellation device is connected between the noise reduction device and the first transmission device and is used for carrying out echo cancellation processing on the acquired audio signal.
8. The intelligent voice system according to any one of claims 1 to 6, wherein the intelligent device further comprises:
the awakening device is used for awakening the voice preprocessing device and the first transmission device;
the noise reduction device is used for carrying out noise reduction processing on the acquired audio signal;
and the beam forming device is used for enhancing the voice acquisition in a specific direction.
9. The intelligent voice system according to any one of claims 1 to 6, wherein the first audio signal is a voice control signal.
10. A speech processing method for use in an intelligent speech system, comprising the steps of:
(1) a microphone array of the Bluetooth terminal acquires a first audio signal and sends the first audio signal to a voice preprocessing device;
(2) the voice preprocessing device transmits the first audio signal to intelligent equipment through a BLE transmission protocol after preprocessing;
(3) and the intelligent equipment processes the first audio signal and then returns a control signal to the voice preprocessing device.
11. The speech processing method according to claim 10, wherein the first audio signal is transmitted to the smart device via a BLE transmission protocol, and the second audio signal is transmitted from the smart device to the bluetooth terminal via an A2DP protocol.
CN201810300017.7A 2018-04-04 2018-04-04 Intelligent voice system and voice processing method thereof Active CN110351690B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810300017.7A CN110351690B (en) 2018-04-04 2018-04-04 Intelligent voice system and voice processing method thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810300017.7A CN110351690B (en) 2018-04-04 2018-04-04 Intelligent voice system and voice processing method thereof

Publications (2)

Publication Number Publication Date
CN110351690A CN110351690A (en) 2019-10-18
CN110351690B true CN110351690B (en) 2022-04-15

Family

ID=68173757

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810300017.7A Active CN110351690B (en) 2018-04-04 2018-04-04 Intelligent voice system and voice processing method thereof

Country Status (1)

Country Link
CN (1) CN110351690B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112151057A (en) * 2020-11-04 2020-12-29 苏州思必驰信息科技有限公司 Echo cancellation method and system

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101499824A (en) * 2008-02-01 2009-08-05 中兴通讯股份有限公司 Method for connecting Bluetooth audio equipment
CN102857610A (en) * 2011-06-30 2013-01-02 株式会社电装 Short range wireless communication apparatus
CN102948250A (en) * 2010-05-17 2013-02-27 株式会社电装 Short-range wireless communication apparatus
WO2016050298A1 (en) * 2014-10-01 2016-04-07 Binauric SE Audio terminal
CN205249263U (en) * 2015-09-07 2016-05-18 深圳楼兰辉煌科技有限公司 Bluetooth routing equipment based on bluetooth car kit
CN107710148A (en) * 2015-09-30 2018-02-16 华为技术有限公司 A kind for the treatment of method and apparatus of Voice command

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101499824A (en) * 2008-02-01 2009-08-05 中兴通讯股份有限公司 Method for connecting Bluetooth audio equipment
CN102948250A (en) * 2010-05-17 2013-02-27 株式会社电装 Short-range wireless communication apparatus
CN102857610A (en) * 2011-06-30 2013-01-02 株式会社电装 Short range wireless communication apparatus
WO2016050298A1 (en) * 2014-10-01 2016-04-07 Binauric SE Audio terminal
CN205249263U (en) * 2015-09-07 2016-05-18 深圳楼兰辉煌科技有限公司 Bluetooth routing equipment based on bluetooth car kit
CN107710148A (en) * 2015-09-30 2018-02-16 华为技术有限公司 A kind for the treatment of method and apparatus of Voice command

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"基于BC03蓝牙模组的无线有源音箱设计";杨本全;《CNKI》;20110930;全文 *

Also Published As

Publication number Publication date
CN110351690A (en) 2019-10-18

Similar Documents

Publication Publication Date Title
US11153678B1 (en) Two-way wireless headphones
US11501779B2 (en) Bluetooth speaker base, method and system for controlling thereof
JP6489563B2 (en) Volume control method, system, device and program
KR101786533B1 (en) Multi-level speech recofnition
CN108141498B (en) Translation method and terminal
KR20200139198A (en) Earphone wireless communication method, master earphone, slave earphone and earphone system
CN206819732U (en) Intelligent music player
CN110189753B (en) Bluetooth sound box and control method, system and storage medium thereof
CN109379490B (en) Audio playing method and device, electronic equipment and computer readable medium
CN110351419B (en) Intelligent voice system and voice processing method thereof
CN107731231A (en) A kind of method for supporting more high in the clouds voice services and a kind of storage device
CN112017652A (en) Interaction method and terminal equipment
CN113542960A (en) Audio signal processing method, system, device, electronic equipment and storage medium
CN205829977U (en) A kind of wireless speech dialogue being applicable to tradition sound equipment and control device
CN110351690B (en) Intelligent voice system and voice processing method thereof
CN213906675U (en) Portable wireless bluetooth recording equipment
CN108877799A (en) A kind of phonetic controller and method
CN111081238B (en) Bluetooth sound box voice interaction control method, device and system
US20230110708A1 (en) Intelligent speech control for two way radio
CN112312368A (en) Data processing method, device, equipment and storage medium
US11056106B2 (en) Voice interaction system and information processing apparatus
CN111524512A (en) Method for starting one-shot voice conversation with low delay, peripheral equipment and voice interaction device with low delay response
KR20090027817A (en) Method for output background sound and mobile communication terminal using the same
CN113709906B (en) Wireless audio system, wireless communication method and equipment
CN113271385A (en) Call forwarding method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information
CB02 Change of applicant information

Address after: Zone C, floor 1, plant 1, No.1, Keji 4th Road, Tangjiawan Town, high tech Zone, Zhuhai City, Guangdong Province 519085

Applicant after: ACTIONS TECHNOLOGY Co.,Ltd.

Address before: 519085 High-tech Zone, Tangjiawan Town, Zhuhai City, Guangdong Province

Applicant before: ACTIONS (ZHUHAI) TECHNOLOGY Co.,Ltd.

GR01 Patent grant
GR01 Patent grant