WO2017071183A1 - Voice processing method and device, and pickup circuit - Google Patents

Voice processing method and device, and pickup circuit Download PDF

Info

Publication number
WO2017071183A1
WO2017071183A1 PCT/CN2016/082426 CN2016082426W WO2017071183A1 WO 2017071183 A1 WO2017071183 A1 WO 2017071183A1 CN 2016082426 W CN2016082426 W CN 2016082426W WO 2017071183 A1 WO2017071183 A1 WO 2017071183A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice
signal
recognition processor
voice signal
speech
Prior art date
Application number
PCT/CN2016/082426
Other languages
French (fr)
Chinese (zh)
Inventor
石武
Original Assignee
北京云知声信息技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京云知声信息技术有限公司 filed Critical 北京云知声信息技术有限公司
Publication of WO2017071183A1 publication Critical patent/WO2017071183A1/en

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems

Definitions

  • the present invention relates to the field of voice recognition technology, and in particular, to a voice processing method, apparatus, and sound pickup circuit.
  • Voice input is an input method for converting a person's spoken content into text by voice recognition.
  • the user can perform the corresponding command by means of voice input instead of manual input, and at the same time, the electronic device feeds back in the form of voice announcement. For example, if the user voice inputs "play the next song", the electronic device that receives the voice input recognizes the voice content input by the user, and switches the currently played song to the next song according to the voice command. It can be seen that the voice input can bring great convenience and fun to the user's life and work.
  • the embodiment of the invention provides a voice processing method, a device and a sound collecting circuit, which are used for eliminating noise in a user's voice input signal, thereby improving the accuracy of voice recognition.
  • a speech processing method for use in a sound pickup circuit, the method comprising the steps of:
  • the voice signal including a voice input signal input by the user and a voice feedback signal broadcast by the voice recognition processor;
  • the canceling the voice feedback signal in the voice signal to obtain the voice input signal includes:
  • the speech signal and the speech feedback signal are subtracted to obtain the speech input signal.
  • a "clean" voice input signal can be obtained, so that the collected voice signal is not interfered by the feedback signal.
  • the transmitting the voice input signal to the voice recognition processor comprises:
  • the preset process including any one or more of a noise reduction process, a canceling reverberation process, and an echo cancellation process;
  • the speech recognition signal received by the speech recognition processor has no noise interference, thereby enabling more accurate recognition.
  • Voice input signal by performing analog-to-digital conversion on the voice input signal, and processing such as noise reduction, reverberation elimination, and echo cancellation, the speech recognition signal received by the speech recognition processor has no noise interference, thereby enabling more accurate recognition.
  • the collecting a voice signal includes:
  • the voice recognition processor Receiving an acquisition instruction sent by the voice recognition processor, where the acquisition instruction includes a sampling clock of a reset signal and a voice signal;
  • the acquisition of the voice signal is started according to the acquisition instruction sent by the voice recognition processor, and the reset control of the circuit is realized.
  • the method before the acquiring the voice signal, the method further includes:
  • the execution instruction is used to instruct the voice recognition processor to issue an acquisition instruction, where the collection instruction is used to indicate the The sound pickup circuit collects a voice signal.
  • the speech recognition processor wakes up the pickup circuit, thereby avoiding picking up
  • the tone circuit receives an invalid voice signal.
  • a speech processing apparatus for use in a sound pickup circuit, the apparatus comprising:
  • An acquisition module configured to collect a voice signal, where the voice signal includes a voice input signal input by a user and a voice feedback signal broadcast by the voice recognition processor;
  • a cancellation module configured to cancel a voice feedback signal in the voice signal, to obtain the voice input signal
  • a transmission module configured to transmit the voice input signal to the voice recognition processor, where the voice recognition processor is configured to identify the voice input signal and broadcast the voice feedback signal.
  • the elimination module comprises:
  • an operation submodule configured to perform subtraction on the voice signal and the voice feedback signal to obtain the voice input signal.
  • the transmission module comprises:
  • a conversion submodule configured to perform analog-to-digital conversion on the voice input signal to obtain a digital voice input signal
  • a processing submodule configured to perform preset processing on the digital voice input signal, where the preset processing includes any one or more of a noise reduction process, a reverberation elimination process, and an echo cancellation process;
  • a transmission submodule configured to transmit the processed digital voice input signal to the voice recognition processor.
  • the acquisition module comprises:
  • a receiving submodule configured to receive an acquisition instruction sent by the voice recognition processor, where the collection instruction includes a sampling clock of a reset signal and a voice signal;
  • the collecting submodule is configured to collect a voice signal according to the collecting instruction.
  • the apparatus further includes:
  • An identification module configured to identify the voice signal before acquiring the voice signal
  • a determining module configured to determine, according to a preset effective voice signal feature, whether the voice signal is a valid voice signal
  • An output module configured to output an execution instruction to the voice recognition processor when the voice signal is the valid voice signal, where the execution instruction is used to instruct the voice recognition processor to issue an acquisition instruction, the collection instruction And used to instruct the sound collecting circuit to collect a voice signal.
  • the above device collects the voice input signal input by the user and the voice feedback signal broadcasted by the voice recognition processor, and eliminates the voice feedback signal therein, so that the finally collected voice signal is not interfered by the feedback signal, thereby obtaining “clean”.
  • the voice input signal improves the accuracy of speech recognition.
  • a pickup circuit including a microphone array, a digital signal processor, a voice broadcast interface, and a language Sound output interface; among them,
  • the microphone array is connected to the digital signal processor and configured to collect a voice signal, where the voice signal includes a voice input signal input by a user and a voice feedback reported by a voice recognition processor received through the voice broadcast interface. signal;
  • the digital signal processor is connected to the microphone array, the voice broadcast interface, and the voice output interface, and is configured to cancel a voice feedback signal in the voice signal to obtain the voice input signal;
  • the voice broadcast interface is connected to the digital signal processor and configured to receive a voice feedback signal broadcast by the voice recognition processor;
  • the voice output interface is coupled to the digital signal processor for transmitting the voice input signal to the voice recognition processor, and the voice recognition processor is configured to identify and report the voice input signal The speech feedback signal.
  • the sound collecting circuit further includes:
  • control unit coupled to the digital signal processor, configured to receive an acquisition instruction sent by the speech recognition processor, the acquisition instruction includes a sampling clock of a reset signal and a voice signal; and control the microphone according to the acquisition instruction
  • the array collects speech signals.
  • the above-mentioned sound collecting circuit simultaneously collects the voice input signal input by the user and the voice feedback signal broadcasted by the voice recognition processor through the microphone array, and eliminates the voice feedback signal therein by the digital signal processor, so that the finally collected voice signal is not subjected to feedback.
  • Signal interference resulting in a "clean" voice input signal, improving the accuracy of speech recognition.
  • a voice processing device which is applied to a sound pickup circuit, and the device includes:
  • a memory for storing the processor executable instructions
  • processor is configured to:
  • the voice signal including a voice input signal input by the user and a voice feedback signal broadcast by the voice recognition processor;
  • the above processor is also configured to:
  • the speech signal and the speech feedback signal are subtracted to obtain the speech input signal.
  • the above processor is also configured to:
  • the preset process including any one or more of a noise reduction process, a canceling reverberation process, and an echo cancellation process;
  • the above processor is also configured to:
  • the voice recognition processor Receiving an acquisition instruction sent by the voice recognition processor, where the acquisition instruction includes a sampling clock of a reset signal and a voice signal;
  • the above processor is also configured to:
  • the execution instruction is used to instruct the voice recognition processor to issue an acquisition instruction, where the collection instruction is used to indicate the The sound pickup circuit collects a voice signal.
  • a non-transitory computer readable recording medium having recorded thereon a computer program, the program comprising instructions for performing the method of the first aspect of the embodiment of the present invention.
  • a computer program comprising: instructions for performing the method of the first aspect of the embodiments of the invention when the program is executed by a computer.
  • FIG. 1 is a flowchart of a voice processing method according to an embodiment of the present invention
  • FIG. 2 is a circuit block diagram of a sound pickup circuit and a voice recognition processor according to an embodiment of the present invention
  • 3 is a timing diagram of signals in an embodiment of the present invention.
  • step S13 is a flowchart of step S13 in a voice processing method according to an embodiment of the present invention.
  • FIG. 4(a) is a timing diagram of a reset signal in an embodiment of the present invention.
  • FIG. 5 is a flowchart of a voice processing method according to an embodiment of the present invention.
  • FIG. 5(a) is a timing diagram of a wake-up signal according to an embodiment of the present invention.
  • FIG. 6 is a block diagram of a voice processing apparatus according to an embodiment of the present invention.
  • FIG. 7 is a block diagram of a cancellation module in a voice processing device according to an embodiment of the present invention.
  • FIG. 8 is a block diagram of a transmission module in a voice processing device according to an embodiment of the present invention.
  • FIG. 9 is a block diagram of an acquisition module in a voice processing device according to an embodiment of the present invention.
  • FIG. 10 is a block diagram of a voice processing apparatus according to an embodiment of the present invention.
  • FIG. 11 is a block diagram of a sound pickup circuit according to an embodiment of the present invention.
  • FIG. 12 is a block diagram of a voice processing system according to an embodiment of the present invention.
  • FIG. 13 is a block diagram of an apparatus for performing a voice processing method according to an embodiment of the present invention.
  • FIG. 1 is a flowchart of a voice processing method according to an embodiment of the present invention. As shown in FIG. 1, the method is applied to a sound pickup circuit, and includes the following steps S11-S13:
  • step S11 a voice signal is collected, where the voice signal includes a voice input signal input by the user and a voice feedback signal broadcast by the voice recognition processor.
  • the voice feedback signal broadcast by the voice recognition processor is a feedback signal to the voice input signal.
  • Figure 2 shows a circuit block diagram of the pickup circuit and the speech recognition processor.
  • the backend speech recognition processor 22 broadcasts the voice feedback signal through the speaker 23.
  • the speaker 23 forms a loop with the microphone array 211 that collects the voice signal, as shown in the dotted circuit of FIG.
  • the voice feedback signal broadcast by the recognition processor 22 through the speaker 23 causes interference to the voice signal. Therefore, the voice recognition processor 22 outputs the voice feedback signal to the sound pickup circuit 21 while the voice is being broadcast through the speaker 23.
  • the circuit 21 eliminates the speech feedback signal causing the interference, thereby eliminating the interference of the speech feedback signal on the speech signal.
  • the sound collecting circuit 21 receives the voice feedback signal output by the voice recognition processor 23 through the voice broadcast interface 22, and the voice broadcast interface 22 can be an IIS (Inter-IC Sound) integrated bus mode.
  • IIS Inter-IC Sound
  • the signal name is defined as shown in Table 1, and the corresponding timing diagram is shown in Figure 3.
  • Step S12 canceling the voice feedback signal in the voice signal to obtain a voice input signal.
  • the step may be specifically implemented by: subtracting the voice signal and the voice feedback signal to obtain a voice input signal.
  • step S13 the voice input signal is transmitted to the voice recognition processor, and the voice recognition processor is configured to identify the voice input signal and broadcast the voice feedback signal.
  • the sound collecting circuit transmits the voice input signal to the voice recognition processor through the voice output interface, wherein the voice output interface can be an IIS (Inter-IC Sound) bus mode, and the signal of the interface
  • IIS Inter-IC Sound
  • the voice input signal input by the user and the voice feedback signal broadcast by the voice recognition processor are simultaneously collected, and the voice feedback signal is eliminated, so that the finally collected voice signal is not affected by the feedback signal. Interference, resulting in a "clean" voice input signal, improving the accuracy of speech recognition.
  • step S13 may be implemented as the following steps S131-S133:
  • Step S131 performing analog-to-digital conversion on the voice input signal to obtain a digital voice input signal.
  • Step S132 performing preset processing on the digital voice input signal, and the preset processing includes any one or more of noise reduction processing, elimination of reverberation processing, and cancellation of echo processing.
  • Step S133 the processed digital voice input signal is transmitted to the voice recognition processor.
  • the sound pickup circuit transmits the processed digital voice input signal to the voice recognition processor through the voice output interface.
  • the sound pickup circuit performs analog-to-digital conversion on the voice input signal, as well as processing such as noise reduction, reverberation elimination, and echo cancellation, so that the voice recognition signal received by the voice recognition processor has no noise interference, thereby more accurately recognizing the voice input. signal.
  • step S11 may be implemented as: receiving an acquisition instruction sent by the speech recognition processor, the acquisition instruction comprising a reset signal and a sampling clock of the speech signal; and acquiring the speech signal according to the acquisition instruction.
  • the voice signal can be collected according to the acquisition instruction sent by the voice recognition processor, and the reset control of the circuit is realized. That is, after the system is powered on, the speech recognition processor of the back end generates an acquisition instruction including a reset signal, and after receiving the acquisition instruction, the pickup circuit starts to collect the speech signal, that is, it is in a normal working state.
  • the timing diagram of the reset signal RESET is shown in Figure 4(a).
  • the above method further includes the following steps S51-S53:
  • step S51 the voice signal is identified.
  • Step S52 Determine whether the voice signal is a valid voice signal according to a preset effective voice signal feature. If the voice signal is a valid voice signal, step S53 is performed; otherwise, returning to step S51, the newly received voice signal is continuously identified.
  • the effective voice signal feature can be set according to the voice service performed by the current system. For example, if the current system performs banking, the effective voice signal feature may be set to include keywords involved in the banking service.
  • the sound collecting circuit identifies the keyword according to the received voice signal.
  • the voice signal can be determined to be a valid voice signal.
  • Step S53 outputting an execution instruction to the voice recognition processor, the execution instruction is used to instruct the voice recognition processor to issue an acquisition instruction, and the collection instruction is used to instruct the sound pickup circuit to collect the voice signal.
  • the issuance of the execution instruction actually implements the wake-up function of the sound pickup circuit to the voice recognition processor, that is, equivalent A wake-up signal is issued, and the timing chart of the wake-up signal WAKE_INT is as shown in FIG. 5(a).
  • the voice recognition processor wakes up the sound pickup circuit to avoid picking up
  • the tone circuit receives an invalid voice signal.
  • the present invention further provides a voice processing device for performing the above method.
  • FIG. 6 is a block diagram of a voice processing apparatus according to an embodiment of the present invention. As shown in FIG. 6, the device is applied to a sound pickup circuit, and includes:
  • the acquiring module 61 is configured to collect a voice signal, where the voice signal includes a voice input signal input by the user and a voice feedback signal broadcast by the voice recognition processor;
  • the eliminating module 62 is configured to cancel the voice feedback signal in the voice signal to obtain a voice input signal
  • the transmission module 63 is configured to transmit the voice input signal to the voice recognition processor, and the voice recognition processor is configured to identify the voice input signal and broadcast the voice feedback signal.
  • the elimination module 62 includes:
  • the operation sub-module 621 is configured to perform a subtraction operation on the voice signal and the voice feedback signal to obtain a voice input signal.
  • the transmission module 63 includes:
  • a conversion sub-module 631 configured to perform analog-to-digital conversion on the voice input signal to obtain a digital voice input signal
  • the processing sub-module 632 is configured to perform preset processing on the digital voice input signal, where the preset processing includes any one or more of noise reduction processing, anti-reverberation processing, and echo cancellation processing;
  • the transmission sub-module 633 is configured to transmit the processed digital voice input signal to the voice recognition processor.
  • the acquisition module 61 includes:
  • the receiving sub-module 611 is configured to receive an acquisition instruction sent by the voice recognition processor, where the collection instruction includes a sampling clock of the reset signal and the voice signal;
  • the collecting sub-module 612 is configured to collect a voice signal according to the acquisition instruction.
  • the foregoing apparatus further includes:
  • the identification module 64 is configured to identify the voice signal before acquiring the voice signal
  • the determining module 65 is configured to determine, according to the preset effective voice signal feature, whether the voice signal is a valid voice signal
  • the output module 66 is configured to output an execution instruction to the voice recognition processor when the voice signal is a valid voice signal, the execution instruction is used to instruct the voice recognition processor to issue an acquisition instruction, and the collection instruction is used to instruct the sound collection circuit to collect the voice signal.
  • the voice input signal input by the user and the voice feedback signal broadcast by the voice recognition processor are simultaneously collected, and the voice feedback signal is eliminated, so that the finally collected voice signal is not interfered by the feedback signal.
  • the accuracy of speech recognition is improved.
  • FIG. 11 is a block diagram of a sound pickup circuit in accordance with an embodiment of the present invention.
  • the sound collecting circuit includes a microphone array 111, a digital signal processor 112, a voice broadcast interface 113, and a voice output interface 114;
  • the microphone array 111 is connected to the digital signal processor 112 for collecting a voice signal.
  • the voice signal includes a voice input signal input by the user and a voice feedback signal broadcast by the voice recognition processor received through the voice broadcast interface 113.
  • the microphone array 111 uses an array of multiple microphones to eliminate interference noise in different directions. It can usually be composed of 2 mics, 4 mics or 6 mics, and the mic array 111 is uniformly distributed according to the corresponding angles.
  • the digital signal processor 112 is connected to the microphone array 111, the voice broadcast interface 113, and the voice output interface 114. It is used to cancel the speech feedback signal in the speech signal and obtain the speech input signal.
  • the digital signal processor 112 includes an analog to digital conversion component for analog to digital conversion of the received speech input signal to obtain a digital speech input signal.
  • the digital signal processor 112 performs a preset process on the digital voice input signal, the preset process including any one or more of a noise reduction process, a reverberation canceling process, and an echo cancellation process, and the processing is performed through the voice output interface 114.
  • the digital voice input signal is transmitted to the speech recognition processor.
  • the voice broadcast interface 113 is connected to the digital signal processor 112 for receiving the voice feedback signal broadcast by the voice recognition processor.
  • the voice broadcast interface 113 may be in the form of an IIS (Inter-IC Sound) bus.
  • IIS Inter-IC Sound
  • the voice output interface 114 is coupled to the digital signal processor 112 for transmitting the voice input signal to the voice recognition processor, the voice recognition processor for identifying the voice input signal and broadcasting the voice feedback signal.
  • the voice output interface 114 can be in the form of an IIS (Inter-IC Sound) bus.
  • the working principle of the sound collecting circuit is as follows: First, the microphone array 111 collects the voice input signal input by the user, and simultaneously collects the voice feedback signal output by the voice recognition processor through the voice broadcast interface 113; secondly, the digital signal processor 112 pairs the microphone array 111. The collected voice signal is processed, the voice feedback signal is eliminated, a "clean" voice input signal is obtained, and the "clean" voice input signal is transmitted through the voice output interface 114 to the voice recognition processor at the back end to make the voice
  • the recognition processor is capable of recognizing "clean" speech input signals, thereby improving the accuracy of speech recognition.
  • the sound collecting circuit further includes:
  • the control unit is connected to the digital signal processor 112 for receiving an acquisition instruction sent by the voice recognition processor, and the acquisition instruction includes a sampling clock of the reset signal and the voice signal; and controlling the microphone array 111 to collect the voice signal according to the acquisition instruction.
  • the control component can also be used for transmission of control commands between the digital signal processor 112 and the speech recognition processor, for example, to increase the front end gain of the microphone array, to adjust the speech output format, and the like.
  • the interface of the control component may be an IIC (Inter-Integrated Circuit) bus interface, an SPI (Serial Peripheral Interface) or a UART (Universal Asynchronous Receiver/Transmitter) bus interface.
  • IIC Inter-Integrated Circuit
  • SPI Serial Peripheral Interface
  • UART Universal Asynchronous Receiver/Transmitter
  • the digital signal processor 112 is further configured to identify the voice signal; determine whether the voice signal is a valid voice signal according to a preset effective voice signal feature; and pass the control component when the voice signal is a valid voice signal And outputting an execution instruction to the voice recognition processor, the execution instruction is used to instruct the voice recognition processor to issue an acquisition instruction, and the collection instruction is used to instruct the sound collection circuit to acquire the voice signal.
  • FIG. 12 is a block diagram of a voice processing system according to an embodiment of the present invention.
  • the voice processing system package A sound pickup circuit 121 and a voice recognition processor 122 are included.
  • the sound collecting circuit 121 is configured to collect a voice signal, process the voice signal, and then transmit the processed voice signal to the voice recognition processor 122, and the voice recognition processor 122 performs a recognition process on the voice signal.
  • the voice recognition processor 122 After the voice processing system is powered on, the voice recognition processor 122 generates an acquisition instruction, and sends the collection instruction to the sound collection circuit 121, where the collection instruction includes a sampling clock of the reset signal and the voice signal. .
  • the reset signal is used to control the sound collecting circuit 121 to enter an operating state, and start collecting voice signals.
  • the speech recognition processor 122 sends an acquisition instruction to the pickup circuit 121 through a corresponding hardware interface to implement reset control.
  • the voice recognition processor 122 when a valid voice signal is not received, the voice recognition processor 122 is in a sleep state, and the collected voice signal is matched by the digital signal processor in the sound pickup circuit 121 to determine the collected voice. Whether the signal is a valid speech signal, when it is a valid signal, the pickup circuit 121 issues an execution instruction to the speech recognition processor 122 to wake up the speech recognition processor 122. The pickup circuit 121 transmits an execution instruction to the voice recognition processor 122 through a corresponding hardware interface to implement a wake-up function to the voice recognition processor 122.
  • FIG. 13 is a block diagram of an apparatus for performing a voice processing method, according to an exemplary embodiment.
  • device 1600 can be a mobile phone, a computer, a digital broadcast terminal, a messaging device, a gaming console, a tablet device, a medical device, a fitness device, a personal digital assistant, and the like.
  • device 1600 can include one or more of the following components: processor 1601, memory 1602, and communication component 1603.
  • the processor 1601 typically controls the overall operation of the device 1600, such as operations associated with display, telephone calls, data communications, camera operations, and recording operations.
  • the processor 1601 can execute instructions to perform all or part of the steps of the above method.
  • Memory 1602 is configured to store various types of data to support operation at device 1600. Examples of such data include instructions for any application or method operating on device 1600, contact data, phone book data, messages, pictures, videos, and the like.
  • the memory 1602 can be implemented by any type of volatile or non-volatile storage device, or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read only memory (EEPROM), erasable.
  • SRAM static random access memory
  • EEPROM electrically erasable programmable read only memory
  • EPROM Electrically erasable programmable read only memory
  • PROM Programmable Read Only Memory
  • ROM Read Only Memory
  • Magnetic Memory Flash Memory
  • Disk Disk or Optical Disk.
  • Communication component 1603 is configured to facilitate wired or wireless communication between device 1600 and other devices.
  • the device 1600 can access a wireless network based on a communication standard, such as Wi-Fi, 2G or 3G, or a combination thereof.
  • the communication component 1603 receives a broadcast signal or broadcast associated information from an external broadcast management system via a broadcast channel.
  • communication component 1603 also includes a near field communication (NFC) module to facilitate short Cheng Communication.
  • NFC near field communication
  • the NFC module can be implemented based on radio frequency identification (RFID) technology, infrared data association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.
  • RFID radio frequency identification
  • IrDA infrared data association
  • UWB ultra-wideband
  • Bluetooth Bluetooth
  • device 1600 may be implemented by one or more application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable A gate array (FPGA), controller, microcontroller, microprocessor or other electronic component implementation for performing the voice processing method described above.
  • ASICs application specific integrated circuits
  • DSPs digital signal processors
  • DSPDs digital signal processing devices
  • PLDs programmable logic devices
  • FPGA field programmable A gate array
  • controller microcontroller, microprocessor or other electronic component implementation for performing the voice processing method described above.
  • non-transitory computer readable storage medium comprising instructions, such as a memory 1602 comprising instructions executable by processor 1601 of apparatus 1600 to perform the voice processing method described above.
  • the non-transitory computer readable storage medium can be a ROM, a random access memory (RAM), a CD-ROM, a magnetic tape, a floppy disk, and an optical data storage device.
  • the present invention also provides a non-transitory computer readable recording medium having recorded thereon a computer program including instructions for executing the voice processing method according to the above-described embodiments of the present invention.
  • the present invention also provides a computer program comprising: instructions for executing a speech processing method according to the above-described embodiments of the present invention when the program is executed by a computer.
  • embodiments of the present invention can be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment, or a combination of software and hardware. Moreover, the invention can take the form of a computer program product embodied on one or more computer-usable storage media (including but not limited to disk storage and optical storage, etc.) including computer usable program code.
  • the computer program instructions can also be stored in a computer readable memory that can direct a computer or other programmable data processing device to operate in a particular manner, such that the instructions stored in the computer readable memory produce an article of manufacture comprising the instruction device.
  • the apparatus implements the functions specified in one or more blocks of a flow or a flow and/or block diagram of the flowchart.
  • These computer program instructions can also be loaded onto a computer or other programmable data processing device such that the computer Or performing a series of operational steps on other programmable devices to produce computer-implemented processing such that instructions executed on a computer or other programmable device are provided for implementing a block in a flow or a flow and/or block diagram of the flowchart Or the steps of the function specified in multiple boxes.

Abstract

A voice processing method and device, and a pickup circuit. The method comprises: collecting a voice signal, the voice signal comprising a voice incoming signal inputted by a user, and a voice feedback signal broadcast by a voice recognition processor (S11); eliminating the voice feedback signal in the voice signal to obtain the voice incoming signal (S12); and transmitting the voice incoming signal to the voice recognition processor which is used for identifying the voice incoming signal and broadcasting the voice feedback signal (S13). By means of simultaneously collecting the voice incoming signal inputted by the user and the voice feedback signal outputted by the voice recognition processor and eliminating the voice feedback signal therein, the final collected voice signal is not interfered by the feedback signal, allowing for a ''clean'' voice incoming signal to obtained, and thereby improving voice recognition accuracy.

Description

一种语音处理方法、装置及拾音电路Voice processing method, device and sound collecting circuit
本申请基于申请日为2015年10月29日、申请号为CN201510719799.4、题为“一种语音处理方法、装置及拾音电路”的发明专利申请提出,并要求该发明专利申请的优先权,该发明专利申请的全部内容在此引入本申请作为参考。This application is based on an invention patent application filed on October 29, 2015, the application number is CN201510719799.4, entitled "A Voice Processing Method, Apparatus, and Sound Picking Circuit", and claims the priority of the invention patent application. The entire disclosure of this patent application is incorporated herein by reference.
技术领域Technical field
本发明涉及语音识别技术领域,尤其涉及一种语音处理方法、装置及拾音电路。The present invention relates to the field of voice recognition technology, and in particular, to a voice processing method, apparatus, and sound pickup circuit.
背景技术Background technique
随着人工智能技术的发展,语音作为一种很好的人机交互的模式,逐渐被应用到很多智能电子设备中。语音输入是通过语音识别将人说话的内容转换为文本的一种输入方式。在很多领域,用户可通过语音输入的方式代替手动输入来执行相应命令,同时,电子设备以语音播报的形式进行反馈。例如,用户语音输入“播放下一首歌”,那么接收语音输入的电子设备就会识别用户输入的语音内容,并根据该语音命令将当前播放的歌曲切换至下一首。可见,语音输入能够为用户的生活、工作带来极大的方便和乐趣,然而,在语音输入的过程中,喇叭通过语音播报反馈前端时,会和前端的麦克形成一个回路,使得用户语音输入时,麦克接收到的语音信号就会混合反馈信号的干扰,从而导致语音识别的准确率下降。With the development of artificial intelligence technology, voice as a good human-computer interaction mode has gradually been applied to many intelligent electronic devices. Voice input is an input method for converting a person's spoken content into text by voice recognition. In many fields, the user can perform the corresponding command by means of voice input instead of manual input, and at the same time, the electronic device feeds back in the form of voice announcement. For example, if the user voice inputs "play the next song", the electronic device that receives the voice input recognizes the voice content input by the user, and switches the currently played song to the next song according to the voice command. It can be seen that the voice input can bring great convenience and fun to the user's life and work. However, in the process of voice input, when the speaker broadcasts the feedback front end through the voice broadcast, it will form a loop with the front end microphone, so that the user voice input When the voice signal received by the microphone is mixed with the interference of the feedback signal, the accuracy of the voice recognition is lowered.
发明内容Summary of the invention
本发明实施例提供一种语音处理方法、装置及拾音电路,用于实现消除用户语音输入信号中的杂音,从而提高语音识别的准确率。The embodiment of the invention provides a voice processing method, a device and a sound collecting circuit, which are used for eliminating noise in a user's voice input signal, thereby improving the accuracy of voice recognition.
第一方面,提供一种语音处理方法,应用于拾音电路,所述方法包括以下步骤:In a first aspect, a speech processing method is provided for use in a sound pickup circuit, the method comprising the steps of:
采集语音信号,所述语音信号包括用户输入的语音输入信号以及语音识别处理器所播报的语音反馈信号;Acquiring a voice signal, the voice signal including a voice input signal input by the user and a voice feedback signal broadcast by the voice recognition processor;
消除所述语音信号中的语音反馈信号,获得所述语音输入信号;Eliminating a voice feedback signal in the voice signal to obtain the voice input signal;
将所述语音输入信号传输至所述语音识别处理器,所述语音识别处理器用于对所述语音输入信号进行识别并播报所述语音反馈信号。Transmitting the speech input signal to the speech recognition processor, the speech recognition processor for identifying the speech input signal and broadcasting the speech feedback signal.
本发明实施例的一些有益效果可以包括:Some beneficial effects of embodiments of the present invention may include:
上述技术方案,通过同时采集用户输入的语音输入信号以及语音识别处理器播报的语 音反馈信号,并消除其中的语音反馈信号,使得最终采集到的语音信号不受反馈信号的干扰,从而得到“干净”的语音输入信号,提高语音识别的准确率。The above technical solution, by simultaneously collecting the voice input signal input by the user and the language broadcasted by the voice recognition processor The sound feedback signal, and the speech feedback signal is eliminated, so that the finally collected speech signal is not interfered by the feedback signal, thereby obtaining a "clean" speech input signal, thereby improving the accuracy of speech recognition.
在一个实施例中,所述消除所述语音信号中的语音反馈信号,获得所述语音输入信号,包括:In one embodiment, the canceling the voice feedback signal in the voice signal to obtain the voice input signal includes:
将所述语音信号和所述语音反馈信号进行减法运算,获得所述语音输入信号。The speech signal and the speech feedback signal are subtracted to obtain the speech input signal.
该实施例中,通过对语音信号和语音反馈信号进行减法运算,能够得到“干净”的语音输入信号,从而使采集到的语音信号不受反馈信号的干扰。In this embodiment, by subtracting the voice signal and the voice feedback signal, a "clean" voice input signal can be obtained, so that the collected voice signal is not interfered by the feedback signal.
在一个实施例中,所述将所述语音输入信号传输至所述语音识别处理器,包括:In one embodiment, the transmitting the voice input signal to the voice recognition processor comprises:
将所述语音输入信号进行模数转换,获得数字语音输入信号;Performing analog-to-digital conversion on the voice input signal to obtain a digital voice input signal;
对所述数字语音输入信号执行预设处理,所述预设处理包括降噪处理、消除混响处理、消除回音处理中的任一种或多种;Performing a preset process on the digital voice input signal, the preset process including any one or more of a noise reduction process, a canceling reverberation process, and an echo cancellation process;
将所述处理后的数字语音输入信号传输至所述语音识别处理器。Transmitting the processed digital speech input signal to the speech recognition processor.
该实施例中,通过对语音输入信号进行模数转换,以及降噪、消除混响、消除回音等处理,使得语音识别处理器接收到的语音输入信号中没有杂音干扰,从而能够更加准确地识别语音输入信号。In this embodiment, by performing analog-to-digital conversion on the voice input signal, and processing such as noise reduction, reverberation elimination, and echo cancellation, the speech recognition signal received by the speech recognition processor has no noise interference, thereby enabling more accurate recognition. Voice input signal.
在一个实施例中,所述采集语音信号,包括:In one embodiment, the collecting a voice signal includes:
接收所述语音识别处理器发送的采集指令,所述采集指令包括复位信号和语音信号的采样时钟;Receiving an acquisition instruction sent by the voice recognition processor, where the acquisition instruction includes a sampling clock of a reset signal and a voice signal;
根据所述采集指令采集语音信号。Acquiring a voice signal according to the acquisition instruction.
该实施例中,根据语音识别处理器发送的采集指令开始采集语音信号,实现了电路的复位控制。In this embodiment, the acquisition of the voice signal is started according to the acquisition instruction sent by the voice recognition processor, and the reset control of the circuit is realized.
在一个实施例中,所述采集语音信号之前,所述方法还包括:In an embodiment, before the acquiring the voice signal, the method further includes:
对所述语音信号进行识别;Identifying the voice signal;
根据预设的有效语音信号特征,判断所述语音信号是否为有效语音信号;Determining whether the voice signal is a valid voice signal according to a preset effective voice signal feature;
当所述语音信号为所述有效语音信号时,向所述语音识别处理器输出执行指令,所述执行指令用于指示所述语音识别处理器发出采集指令,所述采集指令用于指示所述拾音电路采集语音信号。And when the voice signal is the valid voice signal, outputting an execution instruction to the voice recognition processor, the execution instruction is used to instruct the voice recognition processor to issue an acquisition instruction, where the collection instruction is used to indicate the The sound pickup circuit collects a voice signal.
该实施例中,通过对接收到的语音信号进行识别,并在语音信号为有效语音信号时指示语音识别处理器发出采集指令,实现了语音识别处理器对拾音电路的唤醒控制,从而避免拾音电路接收无效的语音信号。 In this embodiment, by recognizing the received speech signal and instructing the speech recognition processor to issue an acquisition instruction when the speech signal is a valid speech signal, the speech recognition processor wakes up the pickup circuit, thereby avoiding picking up The tone circuit receives an invalid voice signal.
第二方面,提供一种语音处理装置,应用于拾音电路,所述装置包括:In a second aspect, a speech processing apparatus is provided for use in a sound pickup circuit, the apparatus comprising:
采集模块,用于采集语音信号,所述语音信号包括用户输入的语音输入信号以及语音识别处理器所播报的语音反馈信号;An acquisition module, configured to collect a voice signal, where the voice signal includes a voice input signal input by a user and a voice feedback signal broadcast by the voice recognition processor;
消除模块,用于消除所述语音信号中的语音反馈信号,获得所述语音输入信号;a cancellation module, configured to cancel a voice feedback signal in the voice signal, to obtain the voice input signal;
传输模块,用于将所述语音输入信号传输至所述语音识别处理器,所述语音识别处理器用于对所述语音输入信号进行识别并播报所述语音反馈信号。And a transmission module, configured to transmit the voice input signal to the voice recognition processor, where the voice recognition processor is configured to identify the voice input signal and broadcast the voice feedback signal.
在一个实施例中,所述消除模块包括:In one embodiment, the elimination module comprises:
运算子模块,用于将所述语音信号和所述语音反馈信号进行减法运算,获得所述语音输入信号。And an operation submodule, configured to perform subtraction on the voice signal and the voice feedback signal to obtain the voice input signal.
在一个实施例中,所述传输模块包括:In an embodiment, the transmission module comprises:
转换子模块,用于将所述语音输入信号进行模数转换,获得数字语音输入信号;a conversion submodule, configured to perform analog-to-digital conversion on the voice input signal to obtain a digital voice input signal;
处理子模块,用于对所述数字语音输入信号执行预设处理,所述预设处理包括降噪处理、消除混响处理、消除回音处理中的任一种或多种;a processing submodule, configured to perform preset processing on the digital voice input signal, where the preset processing includes any one or more of a noise reduction process, a reverberation elimination process, and an echo cancellation process;
传输子模块,用于将所述处理后的数字语音输入信号传输至所述语音识别处理器。And a transmission submodule, configured to transmit the processed digital voice input signal to the voice recognition processor.
在一个实施例中,所述采集模块包括:In an embodiment, the acquisition module comprises:
接收子模块,用于接收所述语音识别处理器发送的采集指令,所述采集指令包括复位信号和语音信号的采样时钟;a receiving submodule, configured to receive an acquisition instruction sent by the voice recognition processor, where the collection instruction includes a sampling clock of a reset signal and a voice signal;
采集子模块,用于根据所述采集指令采集语音信号。The collecting submodule is configured to collect a voice signal according to the collecting instruction.
在一个实施例中,所述装置还包括:In one embodiment, the apparatus further includes:
识别模块,用于所述采集语音信号之前,对所述语音信号进行识别;An identification module, configured to identify the voice signal before acquiring the voice signal;
判断模块,用于根据预设的有效语音信号特征,判断所述语音信号是否为有效语音信号;a determining module, configured to determine, according to a preset effective voice signal feature, whether the voice signal is a valid voice signal;
输出模块,用于当所述语音信号为所述有效语音信号时,向所述语音识别处理器输出执行指令,所述执行指令用于指示所述语音识别处理器发出采集指令,所述采集指令用于指示所述拾音电路采集语音信号。An output module, configured to output an execution instruction to the voice recognition processor when the voice signal is the valid voice signal, where the execution instruction is used to instruct the voice recognition processor to issue an acquisition instruction, the collection instruction And used to instruct the sound collecting circuit to collect a voice signal.
本发明实施例的一些有益效果可以包括:Some beneficial effects of embodiments of the present invention may include:
上述装置,通过同时采集用户输入的语音输入信号以及语音识别处理器播报的语音反馈信号,并消除其中的语音反馈信号,使得最终采集到的语音信号不受反馈信号的干扰,从而得到“干净”的语音输入信号,提高语音识别的准确率。The above device collects the voice input signal input by the user and the voice feedback signal broadcasted by the voice recognition processor, and eliminates the voice feedback signal therein, so that the finally collected voice signal is not interfered by the feedback signal, thereby obtaining “clean”. The voice input signal improves the accuracy of speech recognition.
第三方面,提供一种拾音电路,包括麦克阵列、数字信号处理器、语音播报接口和语 音输出接口;其中,In a third aspect, a pickup circuit is provided, including a microphone array, a digital signal processor, a voice broadcast interface, and a language Sound output interface; among them,
所述麦克阵列,与所述数字信号处理器连接,用于采集语音信号,所述语音信号包括用户输入的语音输入信号以及通过所述语音播报接口接收到的语音识别处理器所播报的语音反馈信号;The microphone array is connected to the digital signal processor and configured to collect a voice signal, where the voice signal includes a voice input signal input by a user and a voice feedback reported by a voice recognition processor received through the voice broadcast interface. signal;
所述数字信号处理器,与所述麦克阵列、所述语音播报接口以及所述语音输出接口连接,用于消除所述语音信号中的语音反馈信号,获得所述语音输入信号;The digital signal processor is connected to the microphone array, the voice broadcast interface, and the voice output interface, and is configured to cancel a voice feedback signal in the voice signal to obtain the voice input signal;
所述语音播报接口,与所述数字信号处理器连接,用于接收所述语音识别处理器所播报的语音反馈信号;The voice broadcast interface is connected to the digital signal processor and configured to receive a voice feedback signal broadcast by the voice recognition processor;
所述语音输出接口,与所述数字信号处理器连接,用于将所述语音输入信号传输至所述语音识别处理器,所述语音识别处理器用于对所述语音输入信号进行识别并播报所述语音反馈信号。The voice output interface is coupled to the digital signal processor for transmitting the voice input signal to the voice recognition processor, and the voice recognition processor is configured to identify and report the voice input signal The speech feedback signal.
在一个实施例中,所述拾音电路还包括:In an embodiment, the sound collecting circuit further includes:
控制部件,与所述数字信号处理器连接,用于接收所述语音识别处理器发送的采集指令,所述采集指令包括复位信号和语音信号的采样时钟;并根据所述采集指令控制所述麦克阵列采集语音信号。a control unit, coupled to the digital signal processor, configured to receive an acquisition instruction sent by the speech recognition processor, the acquisition instruction includes a sampling clock of a reset signal and a voice signal; and control the microphone according to the acquisition instruction The array collects speech signals.
本发明实施例的一些有益效果可以包括:Some beneficial effects of embodiments of the present invention may include:
上述拾音电路,通过麦克阵列同时采集用户输入的语音输入信号以及语音识别处理器播报的语音反馈信号,并通过数字信号处理器消除其中的语音反馈信号,使得最终采集到的语音信号不受反馈信号的干扰,从而得到“干净”的语音输入信号,提高语音识别的准确率。The above-mentioned sound collecting circuit simultaneously collects the voice input signal input by the user and the voice feedback signal broadcasted by the voice recognition processor through the microphone array, and eliminates the voice feedback signal therein by the digital signal processor, so that the finally collected voice signal is not subjected to feedback. Signal interference, resulting in a "clean" voice input signal, improving the accuracy of speech recognition.
第四方面,提供一种语音处理装置,其特征在于,应用于拾音电路,所述装置包括:According to a fourth aspect, a voice processing device is provided, which is applied to a sound pickup circuit, and the device includes:
处理器;processor;
用于存储所述处理器可执行指令的存储器;a memory for storing the processor executable instructions;
其中,所述处理器被配置为:Wherein the processor is configured to:
采集语音信号,所述语音信号包括用户输入的语音输入信号以及语音识别处理器所播报的语音反馈信号;Acquiring a voice signal, the voice signal including a voice input signal input by the user and a voice feedback signal broadcast by the voice recognition processor;
消除所述语音信号中的语音反馈信号,获得所述语音输入信号;Eliminating a voice feedback signal in the voice signal to obtain the voice input signal;
将所述语音输入信号传输至所述语音识别处理器,所述语音识别处理器用于对所述语音输入信号进行识别并播报所述语音反馈信号。Transmitting the speech input signal to the speech recognition processor, the speech recognition processor for identifying the speech input signal and broadcasting the speech feedback signal.
上述处理器还被配置为: The above processor is also configured to:
将所述语音信号和所述语音反馈信号进行减法运算,获得所述语音输入信号。The speech signal and the speech feedback signal are subtracted to obtain the speech input signal.
上述处理器还被配置为:The above processor is also configured to:
将所述语音输入信号进行模数转换,获得数字语音输入信号;Performing analog-to-digital conversion on the voice input signal to obtain a digital voice input signal;
对所述数字语音输入信号执行预设处理,所述预设处理包括降噪处理、消除混响处理、消除回音处理中的任一种或多种;Performing a preset process on the digital voice input signal, the preset process including any one or more of a noise reduction process, a canceling reverberation process, and an echo cancellation process;
将所述处理后的数字语音输入信号传输至所述语音识别处理器。Transmitting the processed digital speech input signal to the speech recognition processor.
上述处理器还被配置为:The above processor is also configured to:
接收所述语音识别处理器发送的采集指令,所述采集指令包括复位信号和语音信号的采样时钟;Receiving an acquisition instruction sent by the voice recognition processor, where the acquisition instruction includes a sampling clock of a reset signal and a voice signal;
根据所述采集指令采集语音信号。Acquiring a voice signal according to the acquisition instruction.
上述处理器还被配置为:The above processor is also configured to:
对所述语音信号进行识别;Identifying the voice signal;
根据预设的有效语音信号特征,判断所述语音信号是否为有效语音信号;Determining whether the voice signal is a valid voice signal according to a preset effective voice signal feature;
当所述语音信号为所述有效语音信号时,向所述语音识别处理器输出执行指令,所述执行指令用于指示所述语音识别处理器发出采集指令,所述采集指令用于指示所述拾音电路采集语音信号。And when the voice signal is the valid voice signal, outputting an execution instruction to the voice recognition processor, the execution instruction is used to instruct the voice recognition processor to issue an acquisition instruction, where the collection instruction is used to indicate the The sound pickup circuit collects a voice signal.
第五方面,提供一种非暂时性计算机可读记录介质,所述介质上记录有计算机程序,所述程序包括用于执行如本发明实施例的第一方面所述的方法的指令。In a fifth aspect, there is provided a non-transitory computer readable recording medium having recorded thereon a computer program, the program comprising instructions for performing the method of the first aspect of the embodiment of the present invention.
第六方面,提供一种计算机程序,所述程序包括:用于在所述程序由计算机执行时执行如本发明实施例的第一方面所述的方法的指令。In a sixth aspect, a computer program is provided, the program comprising: instructions for performing the method of the first aspect of the embodiments of the invention when the program is executed by a computer.
本发明的其它特征和优点将在随后的说明书中阐述,并且,部分地从说明书中变得显而易见,或者通过实施本发明而了解。本发明的目的和其他优点可通过在所写的说明书、权利要求书、以及附图中所特别指出的结构来实现和获得。Other features and advantages of the invention will be set forth in the description which follows, The objectives and other advantages of the invention may be realized and obtained by means of the structure particularly pointed in the appended claims.
下面通过附图和实施例,对本发明的技术方案做进一步的详细描述。The technical solution of the present invention will be further described in detail below through the accompanying drawings and embodiments.
附图说明DRAWINGS
附图用来提供对本发明的进一步理解,并且构成说明书的一部分,与本发明的实施例一起用于解释本发明,并不构成对本发明的限制。在附图中:The drawings are intended to provide a further understanding of the invention, and are intended to be a In the drawing:
图1为本发明实施例中一种语音处理方法的流程图;FIG. 1 is a flowchart of a voice processing method according to an embodiment of the present invention;
图2为本发明实施例中拾音电路和语音识别处理器构成的回路框图;2 is a circuit block diagram of a sound pickup circuit and a voice recognition processor according to an embodiment of the present invention;
图3为本发明实施例中的信号时序图; 3 is a timing diagram of signals in an embodiment of the present invention;
图4为本发明实施例中一种语音处理方法中步骤S13的流程图;4 is a flowchart of step S13 in a voice processing method according to an embodiment of the present invention;
图4(a)为本发明实施例中复位信号的时序图;4(a) is a timing diagram of a reset signal in an embodiment of the present invention;
图5为本发明实施例中一种语音处理方法的流程图;FIG. 5 is a flowchart of a voice processing method according to an embodiment of the present invention;
图5(a)为本发明实施例中唤醒信号的时序图;FIG. 5(a) is a timing diagram of a wake-up signal according to an embodiment of the present invention; FIG.
图6为本发明实施例中的一种语音处理装置的框图;FIG. 6 is a block diagram of a voice processing apparatus according to an embodiment of the present invention; FIG.
图7为本发明实施例中的一种语音处理装置中消除模块的框图;7 is a block diagram of a cancellation module in a voice processing device according to an embodiment of the present invention;
图8为本发明实施例中的一种语音处理装置中传输模块的框图;8 is a block diagram of a transmission module in a voice processing device according to an embodiment of the present invention;
图9为本发明实施例中的一种语音处理装置中采集模块的框图;FIG. 9 is a block diagram of an acquisition module in a voice processing device according to an embodiment of the present invention; FIG.
图10为本发明实施例中的一种语音处理装置的框图;FIG. 10 is a block diagram of a voice processing apparatus according to an embodiment of the present invention; FIG.
图11为本发明实施例中一种拾音电路的框图;11 is a block diagram of a sound pickup circuit according to an embodiment of the present invention;
图12为本发明实施例中一种语音处理系统的框图;FIG. 12 is a block diagram of a voice processing system according to an embodiment of the present invention; FIG.
图13为本发明实施例中一种可执行语音处理方法的装置的框图。FIG. 13 is a block diagram of an apparatus for performing a voice processing method according to an embodiment of the present invention.
具体实施方式detailed description
以下结合附图对本发明的优选实施例进行说明,应当理解,此处所描述的优选实施例仅用于说明和解释本发明,并不用于限定本发明。The preferred embodiments of the present invention are described with reference to the accompanying drawings, which are intended to illustrate and illustrate the invention.
图1为本发明实施例中一种语音处理方法的流程图。如图1所示,该方法应用于拾音电路中,包括以下步骤S11-S13:FIG. 1 is a flowchart of a voice processing method according to an embodiment of the present invention. As shown in FIG. 1, the method is applied to a sound pickup circuit, and includes the following steps S11-S13:
步骤S11,采集语音信号,语音信号包括用户输入的语音输入信号以及语音识别处理器所播报的语音反馈信号。In step S11, a voice signal is collected, where the voice signal includes a voice input signal input by the user and a voice feedback signal broadcast by the voice recognition processor.
该步骤中,语音识别处理器所播报的语音反馈信号即为对语音输入信号的反馈信号。图2所示为拾音电路和语音识别处理器构成的回路框图。由于采集语音信号的同时,后端语音识别处理器22通过喇叭23播报语音反馈信号,喇叭23会和采集语音信号的麦克阵列211形成一个回路,如图2中所示的虚线回路,因此,语音识别处理器22通过喇叭23播报的语音反馈信号会造成对语音信号的干扰,因此,语音识别处理器通22通过喇叭23播报语音的同时,将语音反馈信号输出给拾音电路21,由拾音电路21对造成干扰的语音反馈信号进行消除,即可消除语音反馈信号对语音信号的干扰。In this step, the voice feedback signal broadcast by the voice recognition processor is a feedback signal to the voice input signal. Figure 2 shows a circuit block diagram of the pickup circuit and the speech recognition processor. As the voice signal is collected, the backend speech recognition processor 22 broadcasts the voice feedback signal through the speaker 23. The speaker 23 forms a loop with the microphone array 211 that collects the voice signal, as shown in the dotted circuit of FIG. The voice feedback signal broadcast by the recognition processor 22 through the speaker 23 causes interference to the voice signal. Therefore, the voice recognition processor 22 outputs the voice feedback signal to the sound pickup circuit 21 while the voice is being broadcast through the speaker 23. The circuit 21 eliminates the speech feedback signal causing the interference, thereby eliminating the interference of the speech feedback signal on the speech signal.
其中,拾音电路21通过语音播报接口22接收语音识别处理器23输出的语音反馈信号,该语音播报接口22的方式可以是IIS(Inter-IC Sound,集成电路内置音频)总线方式,该接口的信号名称定义如表1所示,对应的时序图如图3所示。 The sound collecting circuit 21 receives the voice feedback signal output by the voice recognition processor 23 through the voice broadcast interface 22, and the voice broadcast interface 22 can be an IIS (Inter-IC Sound) integrated bus mode. The signal name is defined as shown in Table 1, and the corresponding timing diagram is shown in Figure 3.
表1Table 1
信号名称Signal name 信号方向Signal direction 信号描述Signal description
SCLKSCLK 输出Output 位采样时钟Bit sampling clock
LRCKLRCK 输出Output 左右声道同步时钟Left and right channel sync clock
SDISDI 输入Input 输入信号input signal
SDOSDO 输出Output 输出信号output signal
步骤S12,消除语音信号中的语音反馈信号,获得语音输入信号。Step S12, canceling the voice feedback signal in the voice signal to obtain a voice input signal.
在一个实施例中,该步骤可具体实施为:将语音信号和语音反馈信号进行减法运算,获得语音输入信号。In an embodiment, the step may be specifically implemented by: subtracting the voice signal and the voice feedback signal to obtain a voice input signal.
步骤S13,将语音输入信号传输至语音识别处理器,语音识别处理器用于对语音输入信号进行识别并播报语音反馈信号。In step S13, the voice input signal is transmitted to the voice recognition processor, and the voice recognition processor is configured to identify the voice input signal and broadcast the voice feedback signal.
该步骤中,拾音电路通过语音输出接口将语音输入信号传输至语音识别处理器,其中,语音输出接口的方式可以是IIS(Inter-IC Sound,集成电路内置音频)总线方式,该接口的信号名称定义如表1所示,对应的时序图如图3所示。In this step, the sound collecting circuit transmits the voice input signal to the voice recognition processor through the voice output interface, wherein the voice output interface can be an IIS (Inter-IC Sound) bus mode, and the signal of the interface The name definition is shown in Table 1, and the corresponding timing diagram is shown in Figure 3.
采用本发明实施例提供的技术方案,通过同时采集用户输入的语音输入信号以及语音识别处理器播报的语音反馈信号,并消除其中的语音反馈信号,使得最终采集到的语音信号不受反馈信号的干扰,从而得到“干净”的语音输入信号,提高语音识别的准确率。According to the technical solution provided by the embodiment of the present invention, the voice input signal input by the user and the voice feedback signal broadcast by the voice recognition processor are simultaneously collected, and the voice feedback signal is eliminated, so that the finally collected voice signal is not affected by the feedback signal. Interference, resulting in a "clean" voice input signal, improving the accuracy of speech recognition.
在一个实施例中,如图4所示,步骤S13可实施为以下步骤S131-S133:In an embodiment, as shown in FIG. 4, step S13 may be implemented as the following steps S131-S133:
步骤S131,将语音输入信号进行模数转换,获得数字语音输入信号。Step S131, performing analog-to-digital conversion on the voice input signal to obtain a digital voice input signal.
步骤S132,对数字语音输入信号执行预设处理,预设处理包括降噪处理、消除混响处理、消除回音处理中的任一种或多种。Step S132, performing preset processing on the digital voice input signal, and the preset processing includes any one or more of noise reduction processing, elimination of reverberation processing, and cancellation of echo processing.
步骤S133,将处理后的数字语音输入信号传输至语音识别处理器。Step S133, the processed digital voice input signal is transmitted to the voice recognition processor.
本实施例中,拾音电路通过语音输出接口将处理后的数字语音输入信号传输至语音识别处理器。拾音电路通过对语音输入信号进行模数转换,以及降噪、消除混响、消除回音等处理,使得语音识别处理器接收到的语音输入信号中没有杂音干扰,从而能够更加准确地识别语音输入信号。 In this embodiment, the sound pickup circuit transmits the processed digital voice input signal to the voice recognition processor through the voice output interface. The sound pickup circuit performs analog-to-digital conversion on the voice input signal, as well as processing such as noise reduction, reverberation elimination, and echo cancellation, so that the voice recognition signal received by the voice recognition processor has no noise interference, thereby more accurately recognizing the voice input. signal.
在一个实施例中,步骤S11可实施为以下步骤:接收语音识别处理器发送的采集指令,该采集指令包括复位信号和语音信号的采样时钟;根据采集指令采集语音信号。本实施例能够根据语音识别处理器发送的采集指令开始采集语音信号,实现了电路的复位控制。即,系统上电之后,后端的语音识别处理器就会生成包含复位信号的采集指令,拾音电路接收到采集指令后,开始采集语音信号,即处于正常工作状态。复位信号RESET的时序图如图4(a)所示。In an embodiment, step S11 may be implemented as: receiving an acquisition instruction sent by the speech recognition processor, the acquisition instruction comprising a reset signal and a sampling clock of the speech signal; and acquiring the speech signal according to the acquisition instruction. In this embodiment, the voice signal can be collected according to the acquisition instruction sent by the voice recognition processor, and the reset control of the circuit is realized. That is, after the system is powered on, the speech recognition processor of the back end generates an acquisition instruction including a reset signal, and after receiving the acquisition instruction, the pickup circuit starts to collect the speech signal, that is, it is in a normal working state. The timing diagram of the reset signal RESET is shown in Figure 4(a).
在一个实施例中,如图5所示,在步骤S11之前,上述方法还包括以下步骤S51-S53:In an embodiment, as shown in FIG. 5, before step S11, the above method further includes the following steps S51-S53:
步骤S51,对语音信号进行识别。In step S51, the voice signal is identified.
步骤S52,根据预设的有效语音信号特征,判断语音信号是否为有效语音信号。如果语音信号为有效语音信号,则执行步骤S53;否则,返回步骤S51,继续对新接收到的语音信号进行识别。Step S52: Determine whether the voice signal is a valid voice signal according to a preset effective voice signal feature. If the voice signal is a valid voice signal, step S53 is performed; otherwise, returning to step S51, the newly received voice signal is continuously identified.
其中,有效语音信号特征可根据当前系统执行的语音业务来设定。例如,当前系统执行银行业务,那么可设定有效语音信号特征为包含银行业务所涉及的关键词,当接收到语音信号时,拾音电路根据接收到的语音信号中包含的关键词来识别,当该关键词为银行业务所涉及的关键词时,即可确定该语音信号为有效语音信号。The effective voice signal feature can be set according to the voice service performed by the current system. For example, if the current system performs banking, the effective voice signal feature may be set to include keywords involved in the banking service. When the voice signal is received, the sound collecting circuit identifies the keyword according to the received voice signal. When the keyword is a keyword involved in banking, the voice signal can be determined to be a valid voice signal.
步骤S53,向语音识别处理器输出执行指令,执行指令用于指示语音识别处理器发出采集指令,采集指令用于指示拾音电路采集语音信号。Step S53, outputting an execution instruction to the voice recognition processor, the execution instruction is used to instruct the voice recognition processor to issue an acquisition instruction, and the collection instruction is used to instruct the sound pickup circuit to collect the voice signal.
由于在没有接收到有效的语音信号时,语音识别处理器会处于睡眠状态,因此在本实施例中,执行指令的发出实际上实现了拾音电路对语音识别处理器的唤醒功能,即相当于发出一个唤醒信号,该唤醒信号WAKE_INT的时序图如图5(a)所示。Since the voice recognition processor is in a sleep state when no valid voice signal is received, in the embodiment, the issuance of the execution instruction actually implements the wake-up function of the sound pickup circuit to the voice recognition processor, that is, equivalent A wake-up signal is issued, and the timing chart of the wake-up signal WAKE_INT is as shown in FIG. 5(a).
本实施例中,通过对接收到的语音信号进行识别,并在语音信号为有效语音信号时指示语音识别处理器发出采集指令,实现了语音识别处理器对拾音电路的唤醒控制,从而避免拾音电路接收无效的语音信号。In this embodiment, by recognizing the received voice signal, and instructing the voice recognition processor to issue an acquisition instruction when the voice signal is a valid voice signal, the voice recognition processor wakes up the sound pickup circuit to avoid picking up The tone circuit receives an invalid voice signal.
对应于上述实施例中的语音处理方法,本发明还提供一种语音处理装置,用以执行上述方法。Corresponding to the voice processing method in the above embodiment, the present invention further provides a voice processing device for performing the above method.
图6为本发明实施例中的一种语音处理装置的框图。如图6所示,该装置应用于拾音电路中,包括:FIG. 6 is a block diagram of a voice processing apparatus according to an embodiment of the present invention. As shown in FIG. 6, the device is applied to a sound pickup circuit, and includes:
采集模块61,用于采集语音信号,语音信号包括用户输入的语音输入信号以及语音识别处理器所播报的语音反馈信号;The acquiring module 61 is configured to collect a voice signal, where the voice signal includes a voice input signal input by the user and a voice feedback signal broadcast by the voice recognition processor;
消除模块62,用于消除语音信号中的语音反馈信号,获得语音输入信号; The eliminating module 62 is configured to cancel the voice feedback signal in the voice signal to obtain a voice input signal;
传输模块63,用于将语音输入信号传输至语音识别处理器,语音识别处理器用于对语音输入信号进行识别并播报语音反馈信号。The transmission module 63 is configured to transmit the voice input signal to the voice recognition processor, and the voice recognition processor is configured to identify the voice input signal and broadcast the voice feedback signal.
在一个实施例中,如图7所示,消除模块62包括:In one embodiment, as shown in FIG. 7, the elimination module 62 includes:
运算子模块621,用于将语音信号和语音反馈信号进行减法运算,获得语音输入信号。The operation sub-module 621 is configured to perform a subtraction operation on the voice signal and the voice feedback signal to obtain a voice input signal.
在一个实施例中,如图8所示,传输模块63包括:In one embodiment, as shown in FIG. 8, the transmission module 63 includes:
转换子模块631,用于将语音输入信号进行模数转换,获得数字语音输入信号;a conversion sub-module 631, configured to perform analog-to-digital conversion on the voice input signal to obtain a digital voice input signal;
处理子模块632,用于对数字语音输入信号执行预设处理,预设处理包括降噪处理、消除混响处理、消除回音处理中的任一种或多种;The processing sub-module 632 is configured to perform preset processing on the digital voice input signal, where the preset processing includes any one or more of noise reduction processing, anti-reverberation processing, and echo cancellation processing;
传输子模块633,用于将处理后的数字语音输入信号传输至语音识别处理器。The transmission sub-module 633 is configured to transmit the processed digital voice input signal to the voice recognition processor.
在一个实施例中,如图9所示,采集模块61包括:In an embodiment, as shown in FIG. 9, the acquisition module 61 includes:
接收子模块611,用于接收语音识别处理器发送的采集指令,采集指令包括复位信号和语音信号的采样时钟;The receiving sub-module 611 is configured to receive an acquisition instruction sent by the voice recognition processor, where the collection instruction includes a sampling clock of the reset signal and the voice signal;
采集子模块612,用于根据采集指令采集语音信号。The collecting sub-module 612 is configured to collect a voice signal according to the acquisition instruction.
在一个实施例中,如图10所示,上述装置还包括:In an embodiment, as shown in FIG. 10, the foregoing apparatus further includes:
识别模块64,用于采集语音信号之前,对语音信号进行识别;The identification module 64 is configured to identify the voice signal before acquiring the voice signal;
判断模块65,用于根据预设的有效语音信号特征,判断语音信号是否为有效语音信号;The determining module 65 is configured to determine, according to the preset effective voice signal feature, whether the voice signal is a valid voice signal;
输出模块66,用于当语音信号为有效语音信号时,向语音识别处理器输出执行指令,执行指令用于指示语音识别处理器发出采集指令,采集指令用于指示拾音电路采集语音信号。The output module 66 is configured to output an execution instruction to the voice recognition processor when the voice signal is a valid voice signal, the execution instruction is used to instruct the voice recognition processor to issue an acquisition instruction, and the collection instruction is used to instruct the sound collection circuit to collect the voice signal.
采用本发明实施例提供的装置,通过同时采集用户输入的语音输入信号以及语音识别处理器播报的语音反馈信号,并消除其中的语音反馈信号,使得最终采集到的语音信号不受反馈信号的干扰,从而得到“干净”的语音输入信号,提高语音识别的准确率。By using the device provided by the embodiment of the present invention, the voice input signal input by the user and the voice feedback signal broadcast by the voice recognition processor are simultaneously collected, and the voice feedback signal is eliminated, so that the finally collected voice signal is not interfered by the feedback signal. In order to get a "clean" voice input signal, the accuracy of speech recognition is improved.
图11为本发明实施例中一种拾音电路的框图。如图11所示,该拾音电路包括麦克阵列111、数字信号处理器112、语音播报接口113和语音输出接口114;其中,Figure 11 is a block diagram of a sound pickup circuit in accordance with an embodiment of the present invention. As shown in FIG. 11, the sound collecting circuit includes a microphone array 111, a digital signal processor 112, a voice broadcast interface 113, and a voice output interface 114;
麦克阵列111,与数字信号处理器112连接,用于采集语音信号,语音信号包括用户输入的语音输入信号以及通过语音播报接口113接收到的语音识别处理器所播报的语音反馈信号。The microphone array 111 is connected to the digital signal processor 112 for collecting a voice signal. The voice signal includes a voice input signal input by the user and a voice feedback signal broadcast by the voice recognition processor received through the voice broadcast interface 113.
麦克阵列111即采用多个麦克组成的阵列,可消除不同方向的干扰噪声。通常可由2个麦克、4个麦克或6个麦克组成,根据相应的角度均匀分布组成麦克阵列111。The microphone array 111 uses an array of multiple microphones to eliminate interference noise in different directions. It can usually be composed of 2 mics, 4 mics or 6 mics, and the mic array 111 is uniformly distributed according to the corresponding angles.
数字信号处理器112,与麦克阵列111、语音播报接口113以及语音输出接口114连接, 用于消除语音信号中的语音反馈信号,获得语音输入信号。The digital signal processor 112 is connected to the microphone array 111, the voice broadcast interface 113, and the voice output interface 114. It is used to cancel the speech feedback signal in the speech signal and obtain the speech input signal.
数字信号处理器112中包括模数转换部件,该模数转换部件用于将接收到的语音输入信号进行模数转换,获得数字语音输入信号。数字信号处理器112对数字语音输入信号执行预设处理,该预设处理包括降噪处理、消除混响处理、消除回音处理中的任一种或多种,并通过语音输出接口114将处理后的数字语音输入信号传输至语音识别处理器。The digital signal processor 112 includes an analog to digital conversion component for analog to digital conversion of the received speech input signal to obtain a digital speech input signal. The digital signal processor 112 performs a preset process on the digital voice input signal, the preset process including any one or more of a noise reduction process, a reverberation canceling process, and an echo cancellation process, and the processing is performed through the voice output interface 114. The digital voice input signal is transmitted to the speech recognition processor.
语音播报接口113,与数字信号处理器112连接,用于接收语音识别处理器所播报的语音反馈信号。The voice broadcast interface 113 is connected to the digital signal processor 112 for receiving the voice feedback signal broadcast by the voice recognition processor.
语音播报接口113的方式可以是IIS(Inter-IC Sound,集成电路内置音频)总线方式。The voice broadcast interface 113 may be in the form of an IIS (Inter-IC Sound) bus.
语音输出接口114,与数字信号处理器112连接,用于将语音输入信号传输至语音识别处理器,语音识别处理器用于对语音输入信号进行识别并播报语音反馈信号。The voice output interface 114 is coupled to the digital signal processor 112 for transmitting the voice input signal to the voice recognition processor, the voice recognition processor for identifying the voice input signal and broadcasting the voice feedback signal.
语音输出接口114的方式可以是IIS(Inter-IC Sound,集成电路内置音频)总线方式。The voice output interface 114 can be in the form of an IIS (Inter-IC Sound) bus.
该拾音电路的工作原理如下:首先,麦克阵列111采集用户输入的语音输入信号,同时通过语音播报接口113采集语音识别处理器输出的语音反馈信号;其次,数字信号处理器112对麦克阵列111采集到的语音信号进行处理,消除其中的语音反馈信号,获得“干净”的语音输入信号,并将“干净”的语音输入信号通过语音输出接口114传输至后端的语音识别处理器,以使语音识别处理器能够对“干净”的语音输入信号进行识别,从而提高语音识别的准确率。The working principle of the sound collecting circuit is as follows: First, the microphone array 111 collects the voice input signal input by the user, and simultaneously collects the voice feedback signal output by the voice recognition processor through the voice broadcast interface 113; secondly, the digital signal processor 112 pairs the microphone array 111. The collected voice signal is processed, the voice feedback signal is eliminated, a "clean" voice input signal is obtained, and the "clean" voice input signal is transmitted through the voice output interface 114 to the voice recognition processor at the back end to make the voice The recognition processor is capable of recognizing "clean" speech input signals, thereby improving the accuracy of speech recognition.
在一个实施例中,上述拾音电路还包括:In an embodiment, the sound collecting circuit further includes:
控制部件,与数字信号处理器112连接,用于接收语音识别处理器发送的采集指令,采集指令包括复位信号和语音信号的采样时钟;并根据采集指令控制麦克阵列111采集语音信号。此外,控制部件还可用于数字信号处理器112和语音识别处理器之间的控制命令的传输,例如,调增麦克阵列的前端增益、调整语音输出格式等。The control unit is connected to the digital signal processor 112 for receiving an acquisition instruction sent by the voice recognition processor, and the acquisition instruction includes a sampling clock of the reset signal and the voice signal; and controlling the microphone array 111 to collect the voice signal according to the acquisition instruction. In addition, the control component can also be used for transmission of control commands between the digital signal processor 112 and the speech recognition processor, for example, to increase the front end gain of the microphone array, to adjust the speech output format, and the like.
控制部件的接口可以是IIC(Inter-Integrated Circuit,集成电路)总线接口、SPI(Serial Peripheral Interface,串行外设接口)或者UART(Universal Asynchronous Receiver/Transmitter,通用异步收发传输器)总线接口。The interface of the control component may be an IIC (Inter-Integrated Circuit) bus interface, an SPI (Serial Peripheral Interface) or a UART (Universal Asynchronous Receiver/Transmitter) bus interface.
在一个实施例中,数字信号处理器112还用于对语音信号进行识别;根据预设的有效语音信号特征,判断语音信号是否为有效语音信号;当语音信号为有效语音信号时,通过控制部件向语音识别处理器输出执行指令,执行指令用于指示语音识别处理器发出采集指令,采集指令用于指示拾音电路采集语音信号。In one embodiment, the digital signal processor 112 is further configured to identify the voice signal; determine whether the voice signal is a valid voice signal according to a preset effective voice signal feature; and pass the control component when the voice signal is a valid voice signal And outputting an execution instruction to the voice recognition processor, the execution instruction is used to instruct the voice recognition processor to issue an acquisition instruction, and the collection instruction is used to instruct the sound collection circuit to acquire the voice signal.
图12为本发明实施例中一种语音处理系统的框图。如图12所示,该语音处理系统包 括拾音电路121和语音识别处理器122。其中,拾音电路121用于采集语音信号,并对语音信号进行处理,然后将处理后的语音信号传输给语音识别处理器122,由语音识别处理器122对语音信号进行识别处理。FIG. 12 is a block diagram of a voice processing system according to an embodiment of the present invention. As shown in Figure 12, the voice processing system package A sound pickup circuit 121 and a voice recognition processor 122 are included. The sound collecting circuit 121 is configured to collect a voice signal, process the voice signal, and then transmit the processed voice signal to the voice recognition processor 122, and the voice recognition processor 122 performs a recognition process on the voice signal.
在一个实施例中,上述语音处理系统一经上电,语音识别处理器122就会生成采集指令,并将该采集指令发送给拾音电路121,该采集指令中包括复位信号和语音信号的采样时钟。其中,复位信号用于控制拾音电路121进入工作状态,开始采集语音信号。语音识别处理器122通过相应的硬件接口将采集指令发送给拾音电路121,实现复位控制。In an embodiment, after the voice processing system is powered on, the voice recognition processor 122 generates an acquisition instruction, and sends the collection instruction to the sound collection circuit 121, where the collection instruction includes a sampling clock of the reset signal and the voice signal. . The reset signal is used to control the sound collecting circuit 121 to enter an operating state, and start collecting voice signals. The speech recognition processor 122 sends an acquisition instruction to the pickup circuit 121 through a corresponding hardware interface to implement reset control.
在一个实施例中,当没有接收到有效的语音信号时,语音识别处理器122处于睡眠状态,由拾音电路121中的数字信号处理器对采集到的语音信号进行匹配,判断采集到的语音信号是否为有效语音信号,当为有效信号时,拾音电路121就会向语音识别处理器122发出执行指令,用以唤醒语音识别处理器122。拾音电路121通过相应的硬件接口将执行指令发送给语音识别处理器122,实现对语音识别处理器122的唤醒功能。In one embodiment, when a valid voice signal is not received, the voice recognition processor 122 is in a sleep state, and the collected voice signal is matched by the digital signal processor in the sound pickup circuit 121 to determine the collected voice. Whether the signal is a valid speech signal, when it is a valid signal, the pickup circuit 121 issues an execution instruction to the speech recognition processor 122 to wake up the speech recognition processor 122. The pickup circuit 121 transmits an execution instruction to the voice recognition processor 122 through a corresponding hardware interface to implement a wake-up function to the voice recognition processor 122.
图13是根据一示例性实施例示出的一种可执行语音处理方法的装置的框图。例如,装置1600可以是移动电话,计算机,数字广播终端,消息收发设备,游戏控制台,平板设备,医疗设备,健身设备,个人数字助理等。FIG. 13 is a block diagram of an apparatus for performing a voice processing method, according to an exemplary embodiment. For example, device 1600 can be a mobile phone, a computer, a digital broadcast terminal, a messaging device, a gaming console, a tablet device, a medical device, a fitness device, a personal digital assistant, and the like.
参照图13,装置1600可以包括以下一个或多个组件:处理器1601,存储器1602以及通信组件1603。Referring to Figure 13, device 1600 can include one or more of the following components: processor 1601, memory 1602, and communication component 1603.
处理器1601通常控制装置1600的整体操作,诸如与显示,电话呼叫,数据通信,相机操作和记录操作相关联的操作。处理器1601可以执行指令,以完成上述的方法的全部或部分步骤。The processor 1601 typically controls the overall operation of the device 1600, such as operations associated with display, telephone calls, data communications, camera operations, and recording operations. The processor 1601 can execute instructions to perform all or part of the steps of the above method.
存储器1602被配置为存储各种类型的数据以支持在装置1600的操作。这些数据的示例包括用于在装置1600上操作的任何应用程序或方法的指令,联系人数据,电话簿数据,消息,图片,视频等。存储器1602可以由任何类型的易失性或非易失性存储设备或者它们的组合实现,如静态随机存取存储器(SRAM),电可擦除可编程只读存储器(EEPROM),可擦除可编程只读存储器(EPROM),可编程只读存储器(PROM),只读存储器(ROM),磁存储器,快闪存储器,磁盘或光盘。 Memory 1602 is configured to store various types of data to support operation at device 1600. Examples of such data include instructions for any application or method operating on device 1600, contact data, phone book data, messages, pictures, videos, and the like. The memory 1602 can be implemented by any type of volatile or non-volatile storage device, or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read only memory (EEPROM), erasable. Programmable Read Only Memory (EPROM), Programmable Read Only Memory (PROM), Read Only Memory (ROM), Magnetic Memory, Flash Memory, Disk or Optical Disk.
通信组件1603被配置为便于装置1600和其他设备之间有线或无线方式的通信。装置1600可以接入基于通信标准的无线网络,如Wi-Fi,2G或3G,或它们的组合。在一个示例性实施例中,通信组件1603经由广播信道接收来自外部广播管理系统的广播信号或广播相关信息。在一个示例性实施例中,通信组件1603还包括近场通信(NFC)模块,以促进短 程通信。例如,在NFC模块可基于射频识别(RFID)技术,红外数据协会(IrDA)技术,超宽带(UWB)技术,蓝牙(BT)技术和其他技术来实现。 Communication component 1603 is configured to facilitate wired or wireless communication between device 1600 and other devices. The device 1600 can access a wireless network based on a communication standard, such as Wi-Fi, 2G or 3G, or a combination thereof. In an exemplary embodiment, the communication component 1603 receives a broadcast signal or broadcast associated information from an external broadcast management system via a broadcast channel. In an exemplary embodiment, communication component 1603 also includes a near field communication (NFC) module to facilitate short Cheng Communication. For example, the NFC module can be implemented based on radio frequency identification (RFID) technology, infrared data association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.
在示例性实施例中,装置1600可以被一个或多个应用专用集成电路(ASIC)、数字信号处理器(DSP)、数字信号处理设备(DSPD)、可编程逻辑器件(PLD)、现场可编程门阵列(FPGA)、控制器、微控制器、微处理器或其他电子元件实现,用于执行上述语音处理方法。In an exemplary embodiment, device 1600 may be implemented by one or more application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable A gate array (FPGA), controller, microcontroller, microprocessor or other electronic component implementation for performing the voice processing method described above.
在示例性实施例中,还提供了一种包括指令的非临时性计算机可读存储介质,例如包括指令的存储器1602,上述指令可由装置1600的处理器1601执行以完成上述语音处理方法。例如,非临时性计算机可读存储介质可以是ROM、随机存取存储器(RAM)、CD-ROM、磁带、软盘和光数据存储设备等。In an exemplary embodiment, there is also provided a non-transitory computer readable storage medium comprising instructions, such as a memory 1602 comprising instructions executable by processor 1601 of apparatus 1600 to perform the voice processing method described above. For example, the non-transitory computer readable storage medium can be a ROM, a random access memory (RAM), a CD-ROM, a magnetic tape, a floppy disk, and an optical data storage device.
本发明还提供一种非暂时性计算机可读记录介质,所述介质上记录有计算机程序,所述程序包括用于执行如本发明上述实施例所述的语音处理方法的指令。The present invention also provides a non-transitory computer readable recording medium having recorded thereon a computer program including instructions for executing the voice processing method according to the above-described embodiments of the present invention.
本发明还提供一种计算机程序,所述程序包括:用于在所述程序被计算机执行时执行如本发明上述实施例所述的语音处理方法的指令。The present invention also provides a computer program comprising: instructions for executing a speech processing method according to the above-described embodiments of the present invention when the program is executed by a computer.
本领域内的技术人员应明白,本发明的实施例可提供为方法、系统、或计算机程序产品。因此,本发明可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且,本发明可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器和光学存储器等)上实施的计算机程序产品的形式。Those skilled in the art will appreciate that embodiments of the present invention can be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment, or a combination of software and hardware. Moreover, the invention can take the form of a computer program product embodied on one or more computer-usable storage media (including but not limited to disk storage and optical storage, etc.) including computer usable program code.
本发明是参照根据本发明实施例的方法、设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理设备的处理器以产生一个机器,使得通过计算机或其他可编程数据处理设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。The present invention has been described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (system), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flowchart illustrations and/or FIG. These computer program instructions can be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing device to produce a machine for the execution of instructions for execution by a processor of a computer or other programmable data processing device. Means for implementing the functions specified in one or more of the flow or in a block or blocks of the flow chart.
这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理设备以特定方式工作的计算机可读存储器中,使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品,该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。The computer program instructions can also be stored in a computer readable memory that can direct a computer or other programmable data processing device to operate in a particular manner, such that the instructions stored in the computer readable memory produce an article of manufacture comprising the instruction device. The apparatus implements the functions specified in one or more blocks of a flow or a flow and/or block diagram of the flowchart.
这些计算机程序指令也可装载到计算机或其他可编程数据处理设备上,使得在计算机 或其他可编程设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或其他可编程设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。These computer program instructions can also be loaded onto a computer or other programmable data processing device such that the computer Or performing a series of operational steps on other programmable devices to produce computer-implemented processing such that instructions executed on a computer or other programmable device are provided for implementing a block in a flow or a flow and/or block diagram of the flowchart Or the steps of the function specified in multiple boxes.
显然,本领域的技术人员可以对本发明进行各种改动和变型而不脱离本发明的精神和范围。这样,倘若本发明的这些修改和变型属于本发明权利要求及其等同技术的范围之内,则本发明也意图包含这些改动和变型在内。 It is apparent that those skilled in the art can make various modifications and variations to the invention without departing from the spirit and scope of the invention. Thus, it is intended that the present invention cover the modifications and modifications of the invention

Claims (18)

  1. 一种语音处理方法,其特征在于,应用于拾音电路,所述方法包括:A voice processing method is characterized in that it is applied to a sound pickup circuit, and the method includes:
    采集语音信号,所述语音信号包括用户输入的语音输入信号以及语音识别处理器所播报的语音反馈信号;Acquiring a voice signal, the voice signal including a voice input signal input by the user and a voice feedback signal broadcast by the voice recognition processor;
    消除所述语音信号中的语音反馈信号,获得所述语音输入信号;Eliminating a voice feedback signal in the voice signal to obtain the voice input signal;
    将所述语音输入信号传输至所述语音识别处理器,所述语音识别处理器用于对所述语音输入信号进行识别并播报所述语音反馈信号。Transmitting the speech input signal to the speech recognition processor, the speech recognition processor for identifying the speech input signal and broadcasting the speech feedback signal.
  2. 根据权利要求1所述的方法,其特征在于,所述消除所述语音信号中的语音反馈信号,获得所述语音输入信号,包括:The method according to claim 1, wherein the canceling the voice feedback signal in the voice signal to obtain the voice input signal comprises:
    将所述语音信号和所述语音反馈信号进行减法运算,获得所述语音输入信号。The speech signal and the speech feedback signal are subtracted to obtain the speech input signal.
  3. 根据权利要求1所述的方法,其特征在于,所述将所述语音输入信号传输至所述语音识别处理器,包括:The method of claim 1, wherein the transmitting the voice input signal to the voice recognition processor comprises:
    将所述语音输入信号进行模数转换,获得数字语音输入信号;Performing analog-to-digital conversion on the voice input signal to obtain a digital voice input signal;
    对所述数字语音输入信号执行预设处理,所述预设处理包括降噪处理、消除混响处理、消除回音处理中的任一种或多种;Performing a preset process on the digital voice input signal, the preset process including any one or more of a noise reduction process, a canceling reverberation process, and an echo cancellation process;
    将所述处理后的数字语音输入信号传输至所述语音识别处理器。Transmitting the processed digital speech input signal to the speech recognition processor.
  4. 根据权利要求1所述的方法,其特征在于,所述采集语音信号,包括:The method according to claim 1, wherein the acquiring the voice signal comprises:
    接收所述语音识别处理器发送的采集指令,所述采集指令包括复位信号和语音信号的采样时钟;Receiving an acquisition instruction sent by the voice recognition processor, where the acquisition instruction includes a sampling clock of a reset signal and a voice signal;
    根据所述采集指令采集语音信号。Acquiring a voice signal according to the acquisition instruction.
  5. 根据权利要求1所述的方法,其特征在于,所述采集语音信号之前,所述方法还包括:The method according to claim 1, wherein before the acquiring the voice signal, the method further comprises:
    对所述语音信号进行识别;Identifying the voice signal;
    根据预设的有效语音信号特征,判断所述语音信号是否为有效语音信号;Determining whether the voice signal is a valid voice signal according to a preset effective voice signal feature;
    当所述语音信号为所述有效语音信号时,向所述语音识别处理器输出执行指令,所述执行指令用于指示所述语音识别处理器发出采集指令,所述采集指令用于指示所述拾音电路采集语音信号。And when the voice signal is the valid voice signal, outputting an execution instruction to the voice recognition processor, the execution instruction is used to instruct the voice recognition processor to issue an acquisition instruction, where the collection instruction is used to indicate the The sound pickup circuit collects a voice signal.
  6. 根据权利要求4所述的方法,其特征在于,所述采集语音信号之前,所述方法还包括: The method according to claim 4, wherein before the acquiring the voice signal, the method further comprises:
    对所述语音信号进行识别;Identifying the voice signal;
    根据预设的有效语音信号特征,判断所述语音信号是否为有效语音信号;Determining whether the voice signal is a valid voice signal according to a preset effective voice signal feature;
    当所述语音信号为所述有效语音信号时,向所述语音识别处理器输出执行指令,所述执行指令用于指示所述语音识别处理器发出采集指令,所述采集指令用于指示所述拾音电路采集语音信号。And when the voice signal is the valid voice signal, outputting an execution instruction to the voice recognition processor, the execution instruction is used to instruct the voice recognition processor to issue an acquisition instruction, where the collection instruction is used to indicate the The sound pickup circuit collects a voice signal.
  7. 一种拾音电路,其特征在于,包括麦克阵列、数字信号处理器、语音播报接口和语音输出接口;其中,A sound pickup circuit, comprising: a microphone array, a digital signal processor, a voice broadcast interface, and a voice output interface; wherein
    所述麦克阵列,与所述数字信号处理器连接,用于采集语音信号,所述语音信号包括用户输入的语音输入信号以及通过所述语音播报接口接收到的语音识别处理器所播报的语音反馈信号;The microphone array is connected to the digital signal processor and configured to collect a voice signal, where the voice signal includes a voice input signal input by a user and a voice feedback reported by a voice recognition processor received through the voice broadcast interface. signal;
    所述数字信号处理器,与所述麦克阵列、所述语音播报接口以及所述语音输出接口连接,用于消除所述语音信号中的语音反馈信号,获得所述语音输入信号;The digital signal processor is connected to the microphone array, the voice broadcast interface, and the voice output interface, and is configured to cancel a voice feedback signal in the voice signal to obtain the voice input signal;
    所述语音播报接口,与所述数字信号处理器连接,用于接收所述语音识别处理器所播报的语音反馈信号;The voice broadcast interface is connected to the digital signal processor and configured to receive a voice feedback signal broadcast by the voice recognition processor;
    所述语音输出接口,与所述数字信号处理器连接,用于将所述语音输入信号传输至所述语音识别处理器,所述语音识别处理器用于对所述语音输入信号进行识别并播报所述语音反馈信号。The voice output interface is coupled to the digital signal processor for transmitting the voice input signal to the voice recognition processor, and the voice recognition processor is configured to identify and report the voice input signal The speech feedback signal.
  8. 根据权利要求7所述的拾音电路,其特征在于,所述数字信号处理器被配置为:将所述语音信号和所述语音反馈信号进行减法运算,获得所述语音输入信号。The sound pickup circuit according to claim 7, wherein said digital signal processor is configured to subtract said speech signal and said speech feedback signal to obtain said speech input signal.
  9. 根据权利要求8所述的拾音电路,其特征在于,所述语音输出接口被配置为:The sound pickup circuit according to claim 8, wherein said voice output interface is configured to:
    将所述语音输入信号进行模数转换,获得数字语音输入信号;Performing analog-to-digital conversion on the voice input signal to obtain a digital voice input signal;
    对所述数字语音输入信号执行预设处理,所述预设处理包括降噪处理、消除混响处理、消除回音处理中的任一种或多种;Performing a preset process on the digital voice input signal, the preset process including any one or more of a noise reduction process, a canceling reverberation process, and an echo cancellation process;
    将所述处理后的数字语音输入信号传输至所述语音识别处理器。Transmitting the processed digital speech input signal to the speech recognition processor.
  10. 根据权利要求8所述的拾音电路,其特征在于,所述麦克阵列被配置为:The sound pickup circuit according to claim 8, wherein said microphone array is configured to:
    接收所述语音识别处理器发送的采集指令,所述采集指令包括复位信号和语音信号的采样时钟;Receiving an acquisition instruction sent by the voice recognition processor, where the acquisition instruction includes a sampling clock of a reset signal and a voice signal;
    根据所述采集指令采集语音信号。Acquiring a voice signal according to the acquisition instruction.
  11. 根据权利要求8所述的拾音电路,其特征在于,在采集语音信号之前,所述拾音电路还对所述语音信号进行识别;根据预设的有效语音信号特征,判断所述语音信号是否为有效语音信号;当所述语音信号为所述有效语音信号时,向所述语音识别处理器输出执 行指令,所述执行指令用于指示所述语音识别处理器发出采集指令,所述采集指令用于指示所述拾音电路采集语音信号。The sound collecting circuit according to claim 8, wherein the sound collecting circuit further identifies the voice signal before acquiring the voice signal; and determining whether the voice signal is based on a preset effective voice signal characteristic Is an effective voice signal; when the voice signal is the valid voice signal, outputting to the voice recognition processor a line instruction, the execution instruction is used to instruct the voice recognition processor to issue an acquisition instruction, and the collection instruction is used to instruct the sound collection circuit to acquire a voice signal.
  12. 根据权利要求10所述的拾音电路,其特征在于,在采集语音信号之前,所述拾音电路还对所述语音信号进行识别;根据预设的有效语音信号特征,判断所述语音信号是否为有效语音信号;当所述语音信号为所述有效语音信号时,向所述语音识别处理器输出执行指令,所述执行指令用于指示所述语音识别处理器发出采集指令,所述采集指令用于指示所述拾音电路采集语音信号。The sound collecting circuit according to claim 10, wherein the sound collecting circuit further identifies the voice signal before acquiring the voice signal; and determining whether the voice signal is based on a preset effective voice signal characteristic An effective voice signal; when the voice signal is the valid voice signal, outputting an execution instruction to the voice recognition processor, the execution instruction being used to instruct the voice recognition processor to issue an acquisition instruction, the collection instruction And used to instruct the sound collecting circuit to collect a voice signal.
  13. 一种非暂时性计算机可读记录介质,所述介质上记录有计算机程序,所述程序包括用于执行语音处理方法的指令,所述方法包括:A non-transitory computer readable recording medium having recorded thereon a computer program, the program comprising instructions for executing a voice processing method, the method comprising:
    采集语音信号,所述语音信号包括用户输入的语音输入信号以及语音识别处理器所播报的语音反馈信号;Acquiring a voice signal, the voice signal including a voice input signal input by the user and a voice feedback signal broadcast by the voice recognition processor;
    消除所述语音信号中的语音反馈信号,获得所述语音输入信号;Eliminating a voice feedback signal in the voice signal to obtain the voice input signal;
    将所述语音输入信号传输至所述语音识别处理器,所述语音识别处理器用于对所述语音输入信号进行识别并播报所述语音反馈信号。Transmitting the speech input signal to the speech recognition processor, the speech recognition processor for identifying the speech input signal and broadcasting the speech feedback signal.
  14. 根据权利要求13所述的计算机可读记录介质,其特征在于,所述消除所述语音信号中的语音反馈信号,获得所述语音输入信号,包括:The computer readable recording medium according to claim 13, wherein the canceling the voice feedback signal in the voice signal to obtain the voice input signal comprises:
    将所述语音信号和所述语音反馈信号进行减法运算,获得所述语音输入信号。The speech signal and the speech feedback signal are subtracted to obtain the speech input signal.
  15. 根据权利要求13所述的计算机可读记录介质,其特征在于,所述将所述语音输入信号传输至所述语音识别处理器,包括:The computer readable recording medium according to claim 13, wherein said transmitting said voice input signal to said voice recognition processor comprises:
    将所述语音输入信号进行模数转换,获得数字语音输入信号;Performing analog-to-digital conversion on the voice input signal to obtain a digital voice input signal;
    对所述数字语音输入信号执行预设处理,所述预设处理包括降噪处理、消除混响处理、消除回音处理中的任一种或多种;Performing a preset process on the digital voice input signal, the preset process including any one or more of a noise reduction process, a canceling reverberation process, and an echo cancellation process;
    将所述处理后的数字语音输入信号传输至所述语音识别处理器。Transmitting the processed digital speech input signal to the speech recognition processor.
  16. 根据权利要求13所述的计算机可读记录介质,其特征在于,所述采集语音信号,包括:The computer readable recording medium according to claim 13, wherein the acquiring the voice signal comprises:
    接收所述语音识别处理器发送的采集指令,所述采集指令包括复位信号和语音信号的采样时钟;Receiving an acquisition instruction sent by the voice recognition processor, where the acquisition instruction includes a sampling clock of a reset signal and a voice signal;
    根据所述采集指令采集语音信号。Acquiring a voice signal according to the acquisition instruction.
  17. 根据权利要求13所述的计算机可读记录介质,其特征在于,所述采集语音信号之前,所述方法还包括:The computer readable recording medium according to claim 13, wherein before the acquiring the voice signal, the method further comprises:
    对所述语音信号进行识别; Identifying the voice signal;
    根据预设的有效语音信号特征,判断所述语音信号是否为有效语音信号;Determining whether the voice signal is a valid voice signal according to a preset effective voice signal feature;
    当所述语音信号为所述有效语音信号时,向所述语音识别处理器输出执行指令,所述执行指令用于指示所述语音识别处理器发出采集指令,所述采集指令用于指示所述拾音电路采集语音信号。And when the voice signal is the valid voice signal, outputting an execution instruction to the voice recognition processor, the execution instruction is used to instruct the voice recognition processor to issue an acquisition instruction, where the collection instruction is used to indicate the The sound pickup circuit collects a voice signal.
  18. 根据权利要求16所述的计算机可读记录介质,其特征在于,所述采集语音信号之前,所述方法还包括:The computer readable recording medium according to claim 16, wherein before the acquiring the voice signal, the method further comprises:
    对所述语音信号进行识别;Identifying the voice signal;
    根据预设的有效语音信号特征,判断所述语音信号是否为有效语音信号;Determining whether the voice signal is a valid voice signal according to a preset effective voice signal feature;
    当所述语音信号为所述有效语音信号时,向所述语音识别处理器输出执行指令,所述执行指令用于指示所述语音识别处理器发出采集指令,所述采集指令用于指示所述拾音电路采集语音信号。 And when the voice signal is the valid voice signal, outputting an execution instruction to the voice recognition processor, the execution instruction is used to instruct the voice recognition processor to issue an acquisition instruction, where the collection instruction is used to indicate the The sound pickup circuit collects a voice signal.
PCT/CN2016/082426 2015-10-29 2016-05-18 Voice processing method and device, and pickup circuit WO2017071183A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510719799.4 2015-10-29
CN201510719799.4A CN105427866A (en) 2015-10-29 2015-10-29 Voice processing method and device, and pickup circuit

Publications (1)

Publication Number Publication Date
WO2017071183A1 true WO2017071183A1 (en) 2017-05-04

Family

ID=55506021

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/082426 WO2017071183A1 (en) 2015-10-29 2016-05-18 Voice processing method and device, and pickup circuit

Country Status (2)

Country Link
CN (1) CN105427866A (en)
WO (1) WO2017071183A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108022593A (en) * 2018-01-16 2018-05-11 成都福兰特电子技术股份有限公司 A kind of high sensitivity speech recognition system and its control method
CN108663942A (en) * 2017-04-01 2018-10-16 青岛有屋科技有限公司 A kind of speech recognition apparatus control method, speech recognition apparatus and control server

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105427866A (en) * 2015-10-29 2016-03-23 北京云知声信息技术有限公司 Voice processing method and device, and pickup circuit
CN106453761B (en) * 2016-10-31 2019-10-15 北京小米移动软件有限公司 The processing method and processing device of voice signal
CN107566874A (en) * 2017-09-22 2018-01-09 百度在线网络技术(北京)有限公司 Far field speech control system based on television equipment
CN109389976A (en) * 2018-09-27 2019-02-26 珠海格力电器股份有限公司 Intelligent appliance apparatus control method, device, intelligent appliance equipment and storage medium
CN110085223A (en) * 2019-04-02 2019-08-02 北京云知声信息技术有限公司 A kind of voice interactive method of cloud interaction
CN111009239A (en) * 2019-11-18 2020-04-14 北京小米移动软件有限公司 Echo cancellation method, echo cancellation device and electronic equipment

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009066401A1 (en) * 2007-11-22 2009-05-28 Mitsubishi Electric Corporation Sound recognition device for audio apparatus
US20110238417A1 (en) * 2010-03-26 2011-09-29 Kabushiki Kaisha Toshiba Speech detection apparatus
US20120232890A1 (en) * 2011-03-11 2012-09-13 Kabushiki Kaisha Toshiba Apparatus and method for discriminating speech, and computer readable medium
CN102831897A (en) * 2012-08-15 2012-12-19 歌尔声学股份有限公司 Multimedia device and multimedia signal processing method
CN203165458U (en) * 2012-08-15 2013-08-28 歌尔声学股份有限公司 Multimedia device
CN103971681A (en) * 2014-04-24 2014-08-06 百度在线网络技术(北京)有限公司 Voice recognition method and system
CN204681444U (en) * 2015-04-30 2015-09-30 乐视致新电子科技(天津)有限公司 There is the equipment of alarm clock function
CN105427866A (en) * 2015-10-29 2016-03-23 北京云知声信息技术有限公司 Voice processing method and device, and pickup circuit

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101719199A (en) * 2009-11-26 2010-06-02 中山大学 Device and method used for digital home identity multi-recognition
JP2013050605A (en) * 2011-08-31 2013-03-14 Nippon Hoso Kyokai <Nhk> Language model switching device and program for the same
CN103219006A (en) * 2012-01-18 2013-07-24 北京德信互动网络技术有限公司 Man-machine interaction system and method
CN103024117A (en) * 2012-11-29 2013-04-03 广东欧珀移动通信有限公司 System, method and mobile terminal for entering contact person through speech recognition
CN104347072A (en) * 2013-08-02 2015-02-11 广东美的制冷设备有限公司 Remote-control unit control method and device and remote-control unit
CN203721182U (en) * 2013-12-25 2014-07-16 安徽科大讯飞信息科技股份有限公司 A vehicle-mounted speech processing system

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009066401A1 (en) * 2007-11-22 2009-05-28 Mitsubishi Electric Corporation Sound recognition device for audio apparatus
US20110238417A1 (en) * 2010-03-26 2011-09-29 Kabushiki Kaisha Toshiba Speech detection apparatus
US20120232890A1 (en) * 2011-03-11 2012-09-13 Kabushiki Kaisha Toshiba Apparatus and method for discriminating speech, and computer readable medium
CN102831897A (en) * 2012-08-15 2012-12-19 歌尔声学股份有限公司 Multimedia device and multimedia signal processing method
CN203165458U (en) * 2012-08-15 2013-08-28 歌尔声学股份有限公司 Multimedia device
CN103971681A (en) * 2014-04-24 2014-08-06 百度在线网络技术(北京)有限公司 Voice recognition method and system
CN204681444U (en) * 2015-04-30 2015-09-30 乐视致新电子科技(天津)有限公司 There is the equipment of alarm clock function
CN105427866A (en) * 2015-10-29 2016-03-23 北京云知声信息技术有限公司 Voice processing method and device, and pickup circuit

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108663942A (en) * 2017-04-01 2018-10-16 青岛有屋科技有限公司 A kind of speech recognition apparatus control method, speech recognition apparatus and control server
CN108663942B (en) * 2017-04-01 2021-12-07 青岛有屋科技有限公司 Voice recognition equipment control method, voice recognition equipment and central control server
CN108022593A (en) * 2018-01-16 2018-05-11 成都福兰特电子技术股份有限公司 A kind of high sensitivity speech recognition system and its control method

Also Published As

Publication number Publication date
CN105427866A (en) 2016-03-23

Similar Documents

Publication Publication Date Title
WO2017071183A1 (en) Voice processing method and device, and pickup circuit
CN107454508B (en) TV set and TV system of microphone array
CN107509153B (en) Detection method and device of sound playing device, storage medium and terminal
CN103259898B (en) The method of Automatic adjusument frequency response and terminal
CN107995360B (en) Call processing method and related product
US10277750B2 (en) Method and system for improving echo in hands-free call of mobile terminal
CN106463112A (en) Voice recognition method, voice wake-up device, voice recognition device and terminal
CN107452395B (en) Voice signal echo cancellation device and television
CN107452398B (en) Echo acquisition method, electronic device and computer readable storage medium
CN110349582A (en) Display device and far field speech processing circuit
CN112037825B (en) Audio signal processing method and device and storage medium
CN111741394A (en) Data processing method and device and readable medium
CN105049802A (en) Speech recognition law-enforcement recorder and recognition method thereof
CN207603881U (en) A kind of intelligent sound wireless sound box
CN103117083B (en) A kind of audio-frequency information harvester and method
US9779731B1 (en) Echo cancellation based on shared reference signals
CN103139688A (en) Method, device and hearing-aid for eliminating environmental noise
WO2016177204A1 (en) Noise processing method and apparatus
CN103559878A (en) Method for eliminating noise in audio information and device thereof
JP2019184809A (en) Voice recognition device and voice recognition method
CN106210290B (en) A kind of voice communication method and mobile terminal
US11227423B2 (en) Image and sound pickup device, sound pickup control system, method of controlling image and sound pickup device, and method of controlling sound pickup control system
CN105244037B (en) Audio signal processing method and device
CN107370898B (en) Ring tone playing method, terminal and storage medium thereof
CN113053371A (en) Voice control system and method, voice suite, bone conduction and voice processing device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16858629

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 16/08/2018)

122 Ep: pct application non-entry in european phase

Ref document number: 16858629

Country of ref document: EP

Kind code of ref document: A1