CN108200492A - Voice control optimization method and device, earphone with integrated in-ear microphone, and wearable device - Google Patents
- Publication number
- CN108200492A (application number CN201711448752.4A)
- Authority
- CN
- China
- Prior art keywords
- audio
- voice
- digital
- acoustic
- asr
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R1/00—Details of transducers, loudspeakers or microphones
- H04R1/10—Earpieces; Attachments therefor ; Earphones; Monophonic headphones
- H04R1/1083—Reduction of ambient noise
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2201/00—Details of transducers, loudspeakers or microphones covered by H04R1/00 but not provided for in any of its subgroups
- H04R2201/10—Details of earpieces, attachments therefor, earphones or monophonic headphones covered by H04R1/10 but not provided for in any of its subgroups
Landscapes
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Circuit For Audible Band Transducer (AREA)
- Soundproofing, Sound Blocking, And Sound Damping (AREA)
Abstract
The present invention provides a voice control optimization method and device, an earphone with an integrated in-ear microphone, and a wearable device, and relates to the technical field of smart wearable devices. The voice control optimization method includes: receiving an audio signal and, before the audio signal enters the speech recognition system (ASR), applying acoustic filtering to it to obtain preprocessed audio; capturing the preprocessed audio with a microphone, filtering it with an analog circuit, and generating digital audio by analog-to-digital conversion; processing the digital audio through the speech recognition system (ASR) into a valid voice command; and triggering the corresponding function of the smart device according to the valid voice command. Targeting the in-ear microphone use case, the invention proposes a series of optimizations that effectively suppress noise components, yielding cleaner voice data that is then passed to the relevant back-end service modules to ensure high-quality service.
Description
Technical field
The present invention relates to the technical field of smart wearable devices, and in particular to a voice control optimization method and device, an earphone with an integrated in-ear microphone, and a wearable device.
Background
Listening and speaking are basic attributes of human communication. Smart wearable devices likewise need basic communication abilities: they must be able to speak and to listen. These abilities are provided by two kinds of acoustic transducers, one for input and one for output. For input, sound is captured by a microphone; for output, sound is played back by a loudspeaker. Sound captured by an ordinary microphone is generally not a single clean source but a mixture of speech, music, and various ambient sounds (noise). Yet many of the services a smart wearable device provides (voice wake-up, speech recognition, voice calls, music, and so on) depend only on the speech component of that mixture, and specifically on the voice of the device wearer. Other ambient sound, the voices of non-wearers, and other noise components can hinder the normal operation of some services. A smart wearable device therefore needs an acoustic front-end subsystem that captures and processes external sound and delivers audio data that is as usable as possible, so that the back-end service modules can operate normally.
Summary of the invention
The present invention aims to solve at least one of the technical problems in the prior art or related art. It proposes a voice control optimization method and device, an earphone with an integrated in-ear microphone, and a wearable device that capture and process external sound and deliver audio data that is as usable as possible, so that the back-end service modules can operate normally.
To this end, one object of the present invention is to provide a voice control optimization method.
Another object of the present invention is to provide a corresponding voice control optimization device.
A further object of the present invention is to provide an earphone with an integrated in-ear microphone.
A further object of the present invention is to provide a wearable device.
To achieve the above objects, a first aspect of the present invention provides a voice control optimization method in which a speech recognition system (ASR) processes the sound signal captured by a sound pickup device to obtain a command signal, and voice control of a smart device is performed according to the command signal. The method includes:
receiving an audio signal and, before the audio signal enters the speech recognition system (ASR), applying acoustic filtering to the audio signal to obtain preprocessed audio;
capturing the preprocessed audio with a microphone, filtering it with an analog circuit, and generating digital audio by analog-to-digital conversion;
processing the digital audio through the speech recognition system (ASR) into a valid voice command;
triggering the corresponding function of the smart device according to the valid voice command.
Preferably, the acoustic filtering performs noise reduction on the sound signal in the spatial domain by means of a mechanical structure and/or an acoustic structure.
In any of the above schemes, preferably, processing the digital audio through the speech recognition system (ASR) into a valid voice command specifically includes: processing the digital audio with the ear canal adaptation algorithm, dereverberation algorithm, and noise suppression algorithm of a DSP module; outputting the processed digital signal to the speech recognition system (ASR); converting the received digital signal into text information in the speech recognition system (ASR); and processing the text information through a natural language processing system (NLP) into the valid voice command.
In any of the above schemes, preferably, the microphone is a bone-conduction microphone and/or an in-ear microphone.
In any of the above schemes, preferably, the preprocessed audio signal is filter-shaped according to the system transfer function of the acoustic structure, which is determined by the Helmholtz resonance center frequency $f_c$:
$$f_c = \frac{c}{2\pi}\sqrt{\frac{A_H}{V_C L_H}}$$
where $c$ is the speed of sound in air, $A_H$ is the cross-sectional area of the acoustic port, $L_H$ is the length of the acoustic port, and $V_C$ is the volume of the cavity.
In any of the above schemes, preferably, generating digital audio by analog-to-digital conversion after analog circuit filtering specifically includes: performing the filtering according to the system transfer function of the analog filter circuit.
In any of the above schemes, preferably, before the digital audio is processed through the speech recognition system (ASR) into a valid voice command, the method further includes: digitally filtering the digital audio according to the system transfer function of a digital filter.
In any of the above schemes, preferably, the ear canal adaptation algorithm specifically includes: obtaining the transfer function of the ear canal system relative to an acoustic microphone system from the measured spectral response of the wearer reading a fixed speech sequence aloud, and applying ear canal adaptation calibration to the digital audio accordingly.
A second aspect of the present invention relates to a voice control optimization device in which a speech recognition system (ASR) processes the sound signal captured by a sound pickup device to obtain a command signal, and voice control of a smart device is performed according to the command signal. The device is characterized in that it includes:
an acoustic filtering unit, which receives an audio signal and, before the audio signal enters the speech recognition system (ASR), applies acoustic filtering to the audio signal to obtain preprocessed audio;
an analog-to-digital conversion unit, in which the preprocessed audio is captured by a microphone, filtered by an analog circuit, and converted to digital audio by analog-to-digital conversion;
a speech recognition unit, in which the digital audio is processed through the speech recognition system (ASR) into a valid voice command;
an instruction execution unit, which triggers the corresponding function of the smart device according to the valid voice command.
Preferably, the acoustic filtering performs noise reduction on the sound signal in the spatial domain by means of a mechanical structure and/or an acoustic structure.
In any of the above schemes, preferably, the speech recognition unit specifically includes: a digital processing subunit, which processes the digital audio with the ear canal adaptation algorithm, dereverberation algorithm, and noise suppression algorithm of a DSP module and outputs the processed digital signal to the speech recognition system (ASR); a text conversion subunit, in which the speech recognition system (ASR) converts the received digital signal into text information; and an instruction generation unit, in which the text information is processed through a natural language processing system (NLP) into the valid voice command.
In any of the above schemes, preferably, the microphone is a bone-conduction microphone and/or an in-ear microphone.
In any of the above schemes, preferably, the device further includes: a shaping filter unit, which filter-shapes the preprocessed audio signal according to the system transfer function of the acoustic structure, which is determined by the Helmholtz resonance center frequency $f_c$:
$$f_c = \frac{c}{2\pi}\sqrt{\frac{A_H}{V_C L_H}}$$
where $c$ is the speed of sound in air, $A_H$ is the cross-sectional area of the acoustic port, $L_H$ is the length of the acoustic port, and $V_C$ is the volume of the cavity.
In any of the above schemes, preferably, the analog-to-digital conversion unit specifically includes: an analog filtering subunit, which performs the filtering according to the system transfer function of the analog filter circuit.
In any of the above schemes, preferably, the device further includes: a digital filtering subunit, which digitally filters the digital audio according to the system transfer function of a digital filter.
In any of the above schemes, preferably, the ear canal adaptation algorithm specifically includes: obtaining the transfer function of the ear canal system relative to an acoustic microphone system from the measured spectral response of the wearer reading a fixed speech sequence aloud, and applying ear canal adaptation calibration to the digital audio accordingly.
A third aspect of the present invention relates to an earphone with an integrated in-ear microphone, including a housing, an analog audio processing module, and a digital audio processing module, and further including:
the voice control optimization device according to any of the above; and an acoustic processing module which, as the pre-processing stage of the analog audio processing module, performs acoustic filtering by means of a mechanical structure and/or an acoustic structure;
an in-ear sound pickup tube, connected to the acoustic processing module and the analog audio processing module, arranged inside the housing and positioned in the ear canal to collect sound waves.
A fourth aspect of the present invention relates to a wearable device including the earphone with an integrated in-ear microphone described in the above technical solution.
The beneficial effects of the present invention are as follows: through the cooperative processing of the acoustic module in the spatial domain, the circuit module in the analog domain, and the algorithms of the DSP module in the digital domain, filtering, shaping, and noise suppression are applied to the useful sound components. The wearer's voice is extracted while ambient noise, non-wearer voices, and other noise components are filtered out, which improves the smart device's recognition rate for voice commands.
Additional aspects and advantages of the present invention will be set forth in part in the description that follows, and in part will become apparent from that description or be learned through practice of the invention.
Description of the drawings
The above and/or additional aspects and advantages of the present invention will become apparent and readily understood from the following description of the embodiments taken together with the drawings, in which:
Fig. 1 is a flow chart of a preferred embodiment of the voice control optimization method according to the invention;
Fig. 2 is a schematic block diagram of a preferred embodiment of the voice control optimization device according to the invention;
Fig. 3 is a schematic block diagram of a preferred embodiment of the earphone with an integrated in-ear microphone according to the invention;
Fig. 4 is a schematic block diagram of a preferred embodiment of a wearable device according to the invention;
Fig. 5 is a schematic diagram of the hardware system of a preferred embodiment of the wearable device according to the invention;
Fig. 6 is a schematic diagram of the signal processing flow of a preferred embodiment of the voice control optimization method according to the invention.
Detailed description
To make the above objects, features, and advantages of the present invention easier to understand, the invention is described in further detail below with reference to the drawings and specific embodiments. It should be noted that, where no conflict arises, the embodiments of the present application and the features in those embodiments may be combined with one another.
Many specific details are set forth in the following description to give a thorough understanding of the present invention. However, the invention can also be implemented in ways other than those described here, so its scope of protection is not limited by the specific embodiments described below.
As shown in Fig. 1, a voice control optimization method includes: step 102, receiving an audio signal and, before the audio signal enters the speech recognition system (ASR), applying acoustic filtering to the audio signal to obtain preprocessed audio; step 104, capturing the preprocessed audio with a microphone, filtering it with an analog circuit, and generating digital audio by analog-to-digital conversion; step 106, processing the digital audio through the speech recognition system (ASR) into a valid voice command; step 108, triggering the corresponding function of the smart device according to the valid voice command.
According to the above embodiment of the present invention, preferably, the acoustic filtering performs noise reduction on the sound signal in the spatial domain by means of a mechanical structure and/or an acoustic structure.
According to the above embodiment of the present invention, preferably, processing the digital audio through the speech recognition system (ASR) into a valid voice command specifically includes: processing the digital audio with the ear canal adaptation algorithm, dereverberation algorithm, and noise suppression algorithm of a DSP module; outputting the processed digital signal to the speech recognition system (ASR); converting the received digital signal into text information in the speech recognition system (ASR); and processing the text information through a natural language processing (NLP) system into the valid voice command.
According to the above embodiment of the present invention, preferably, the microphone is a bone-conduction microphone and/or an in-ear microphone.
According to the above embodiment of the present invention, preferably, the method further includes: filter-shaping the preprocessed audio signal according to the system transfer function of the acoustic structure, which is determined by the Helmholtz resonance center frequency $f_c$:
$$f_c = \frac{c}{2\pi}\sqrt{\frac{A_H}{V_C L_H}}$$
where $c$ is the speed of sound in air, $A_H$ is the cross-sectional area of the acoustic port, $L_H$ is the length of the acoustic port, and $V_C$ is the volume of the cavity.
According to the above embodiment of the present invention, preferably, generating digital audio by analog-to-digital conversion after analog circuit filtering specifically includes: performing the filtering according to the system transfer function of the analog filter circuit.
According to the above embodiment of the present invention, preferably, before the digital audio is processed through the speech recognition system (ASR) into a valid voice command, the method further includes: digitally filtering the digital audio according to the system transfer function of a digital filter.
According to the above embodiment of the present invention, preferably, the ear canal adaptation algorithm specifically includes: obtaining the transfer function of the ear canal system relative to an acoustic microphone system from the measured spectral response of the wearer reading a fixed speech sequence aloud, and applying ear canal adaptation calibration to the digital audio accordingly.
As shown in Fig. 2, the second aspect of the present invention proposes a voice control optimization device 200 in which a speech recognition system (ASR) processes the sound signal captured by a sound pickup device to obtain a command signal, and voice control of a smart device is performed according to the command signal. The device includes: an acoustic filtering unit 202, which receives an audio signal and, before the audio signal enters the speech recognition system (ASR), applies acoustic filtering to the audio signal to obtain preprocessed audio; an analog-to-digital conversion unit 204, in which the preprocessed audio is captured by a microphone, filtered by an analog circuit, and converted to digital audio by analog-to-digital conversion; a speech recognition unit 206, in which the digital audio is processed through the speech recognition system (ASR) into a valid voice command; and an instruction execution unit 208, which triggers the corresponding function of the smart device according to the valid voice command.
According to the above embodiment of the present invention, preferably, the acoustic filtering performs noise reduction on the sound signal in the spatial domain by means of a mechanical structure and/or an acoustic structure.
According to the above embodiment of the present invention, preferably, the speech recognition unit 206 specifically includes: a digital processing subunit 2062, which processes the digital audio with the ear canal adaptation algorithm, dereverberation algorithm, and noise suppression algorithm of a DSP module and outputs the processed digital signal to the speech recognition system (ASR); a text conversion subunit 2064, in which the speech recognition system (ASR) converts the received digital signal into text information; and an instruction generation unit 2066, in which the text information is processed through a natural language processing system (NLP) into the valid voice command.
According to the above embodiment of the present invention, preferably, the microphone is a bone-conduction microphone and/or an in-ear microphone.
According to the above embodiment of the present invention, preferably, the device further includes: a shaping filter unit 212, which filter-shapes the preprocessed audio signal according to the system transfer function of the acoustic structure, which is determined by the Helmholtz resonance center frequency $f_c$:
$$f_c = \frac{c}{2\pi}\sqrt{\frac{A_H}{V_C L_H}}$$
where $c$ is the speed of sound in air, $A_H$ is the cross-sectional area of the acoustic port, $L_H$ is the length of the acoustic port, and $V_C$ is the volume of the cavity.
According to the above embodiment of the present invention, preferably, the analog-to-digital conversion unit 204 specifically includes: an analog filtering subunit 2042, which performs the filtering according to the system transfer function of the analog filter circuit.
According to the above embodiment of the present invention, preferably, the device further includes: a digital filtering subunit 210, which digitally filters the digital audio according to the system transfer function of a digital filter.
As shown in Fig. 3, the earphone 300 with an integrated in-ear microphone includes a housing 302, an analog audio processing module 304, and a digital audio processing module 306, and further includes: the voice control optimization device 200 of any of the above embodiments; an acoustic processing module 308 which, as the pre-processing stage of the analog audio processing module, performs acoustic filtering by means of a mechanical structure and/or an acoustic structure; and an in-ear sound pickup tube 310, connected to the acoustic processing module and the analog audio processing module, arranged inside the housing and positioned in the ear canal to collect sound waves.
As shown in Fig. 4, a wearable device 400 includes the earphone 300 with an integrated in-ear microphone described in the above embodiment.
As shown in Fig. 5, the sound signal processing flow of the acoustic front-end system is as follows:
When a person speaks, the acoustic vibration is conducted through bone to all parts of the head and into the ear canal, where the acoustic processing module (mechanical structure/acoustic structure) performs acoustic filtering and related processing;
The bone-conduction/in-ear microphone converts the vibration/sound signal processed by the acoustic module into an analog audio electrical signal;
The analog circuit (analog audio processing module) performs analog filtering and related processing on the analog audio signal;
The A/D converter converts the electrical signal processed by the analog module into a storable digital signal;
The algorithms loaded on the digital signal processor (DSP) process the digital signal into valid audio data for the various back-end services;
The processed audio data activates the relevant back-end service modules, which push, fetch, and present the various end-user services through the cloud;
In summary, the acoustic front-end subsystem processes the wearer's voice signal so as to activate the back-end modules effectively and provide high-quality service.
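The ordering of the stages in this flow can be sketched as a simple chain of functions. This is only an illustrative sketch: `run_front_end`, `attenuate`, `quantize`, and the stage gains are hypothetical placeholders, not the actual acoustic, analog, or DSP processing described above.

```python
from typing import Callable, List, Sequence, Tuple

# A stage is a (name, function) pair; each function maps samples to samples.
Stage = Tuple[str, Callable[[List[float]], List[float]]]


def run_front_end(samples: Sequence[float], stages: Sequence[Stage]):
    """Pass the samples through every stage in order and record the order."""
    data = list(samples)
    trace = []
    for name, stage in stages:
        data = stage(data)
        trace.append(name)
    return data, trace


def attenuate(factor: float):
    """Placeholder 'filter': scales every sample by a constant factor."""
    return lambda xs: [factor * x for x in xs]


def quantize(step: float):
    """Placeholder A/D converter: rounds every sample to a multiple of step."""
    return lambda xs: [round(x / step) * step for x in xs]


# Hypothetical stand-ins for the four stages named in the text.
STAGES = [
    ("acoustic filtering", attenuate(0.5)),
    ("analog filtering", attenuate(0.5)),
    ("A/D conversion", quantize(0.25)),
    ("DSP processing", attenuate(2.0)),
]

output, trace = run_front_end([1.0, 2.0, -1.0], STAGES)
```

The point of the structure is that each domain (spatial, analog, digital) only sees the output of the previous one, so the acoustic and analog stages run before any sample exists in digital form.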
According to a further embodiment of the present invention, the performance of the acoustic front-end subsystem depends heavily on the raw data obtained through the sensor (microphone), and different sensors provide raw audio data with different characteristics:
A bone-conduction microphone (piezoelectric ceramic microphone) picks up the ear/bone vibration produced when the wearer speaks and is physically isolated from non-contact ambient noise and non-wearer voices. However, because the sound conduction path and medium differ, and because of the microphone's own acoustic characteristics, the captured voice differs greatly from that of an ordinary air-conduction microphone and must be shaped and corrected before it is useful to the back end.
An in-ear microphone (electret microphone) sits in the ear canal. When the wearer speaks, bone-conducted vibration reaches the ear canal and drives the air inside it, carrying the wearer's voice to the microphone. The strong passive noise isolation of the earplug likewise blocks ambient noise and non-wearer voices effectively. However, the in-ear microphone also suffers data distortion caused by the sound conduction path and medium. In addition, the occlusion effect that arises once the earplug seals the ear canal alters the low-frequency amplitude of the voice by as much as 20 to 30 dB, and reverberation inside the ear canal degrades the clarity of the voice data. All of these cases require corresponding shaping and correction to provide more useful data to the back end. The acoustic front-end subsystem therefore needs the acoustic module in the spatial domain, the circuit module in the analog domain, and the algorithms of the DSP module in the digital domain to work together to filter, shape, and denoise the useful sound components. The corresponding signal processing flow is shown in the block diagram of Fig. 6:
Because the algorithm models of the back-end modules (speech recognition/voice wake-up) are built on data captured by standard acoustic microphones, whereas the data obtained by the present system comes from a bone-conduction/in-ear microphone, the system transfer function $H_{RBE}(n)$ of the bone-conduction/in-ear microphone relative to the acoustic microphone must be obtained experimentally:
$$H_{RBE}(n) = H_{BE}(n) / H_{Ref}(n)$$
where the system transfer function $H_{Ref}(n)$ of the standard acoustic microphone is determined by the measured amplitude-frequency and phase-frequency responses of the standard microphone, and the system transfer function $H_{BE}(n)$ of the bone-conduction/in-ear microphone is determined by the measured amplitude-frequency and phase-frequency responses of the bone-conduction/in-ear microphone.
When filter-shaping the bone-conduction/in-ear microphone data, the inverse $H_{IRBE}(n)$ of $H_{RBE}(n)$ is used, so that the processed data agrees more closely with standard microphone data:
$$H_{IRBE}(n) = [H_{RBE}(n)]^{-1}$$
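As a minimal sketch of these two formulas, the relative response and its inverse can be computed bin by bin from measured complex frequency responses. The function names, the three-bin sample values, and the magnitude floor are assumptions made for illustration; the source does not say how near-zero bins are handled.

```python
from typing import List


def relative_response(h_be: List[complex], h_ref: List[complex]) -> List[complex]:
    """Return H_RBE(n) = H_BE(n) / H_Ref(n) for each frequency bin n."""
    if len(h_be) != len(h_ref):
        raise ValueError("responses must cover the same frequency bins")
    return [be / ref for be, ref in zip(h_be, h_ref)]


def inverse_response(h_rbe: List[complex], floor: float = 1e-3) -> List[complex]:
    """Return H_IRBE(n) = 1 / H_RBE(n).

    The magnitude floor is an assumption of this sketch, not part of the
    source: it keeps near-zero bins from producing unbounded gain.
    """
    inverse = []
    for h in h_rbe:
        magnitude = abs(h)
        if magnitude < floor:
            # Clamp the magnitude but keep the phase (zero phase if h == 0).
            h = floor * (h / magnitude) if magnitude > 0 else complex(floor, 0.0)
        inverse.append(1.0 / h)
    return inverse


# Hypothetical three-bin measurements, for illustration only.
h_ref = [1 + 0j, 1 + 0j, 2 + 0j]
h_be = [2 + 0j, 0.5j, 0.5 + 0j]

h_rbe = relative_response(h_be, h_ref)
h_irbe = inverse_response(h_rbe)
# Applying the inverse to the in-ear response recovers the reference response.
shaped = [be * inv for be, inv in zip(h_be, h_irbe)]
```

In this toy case every bin of `shaped` matches `h_ref`, which is the stated goal of the shaping: make bone-conduction/in-ear data look like standard microphone data. The reference response is assumed to be non-zero in every bin.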
Furthermore, the acoustic filtering module and the analog filtering module through which the bone-conduction/in-ear microphone data passes are subject to various constraints of acoustic structure design and analog hardware circuit design. In practice, $H_{IRBE}(n)$ is therefore split into three parts, $H_{Acoustic}(n)$, $H_{Analog}(n)$, and $H_{Digital}(n)$: the constraints of the acoustic structure and the analog hardware circuit are satisfied first, stage by stage, and the flexibility of the digital filtering module is then used to meet the overall filter-shaping requirement.
$$H_{IRBE}(n) = H_{Acoustic}(n)\,H_{Analog}(n)\,H_{Digital}(n)$$
$H_{Acoustic}(n)$ is the system transfer function of the acoustic structure; its key characteristic is determined by the Helmholtz resonance center frequency formula:
$$f_c = \frac{c}{2\pi}\sqrt{\frac{A_H}{V_C L_H}}$$
where $c$ is the speed of sound in air, $A_H$ is the cross-sectional area of the acoustic port, $L_H$ is the length of the acoustic port, and $V_C$ is the volume of the cavity.
This formula assumes that the resonator is a simple structure consisting of a cavity connected to a duct of uniform cross-section.
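For a quick feel for the numbers, the standard Helmholtz resonator formula $f_c = \frac{c}{2\pi}\sqrt{A_H / (V_C L_H)}$ can be evaluated directly. The sketch below assumes SI units; the port and cavity dimensions are hypothetical values picked for illustration and do not come from the patent.

```python
import math


def helmholtz_center_frequency(c: float, area: float, length: float, volume: float) -> float:
    """Return f_c = (c / (2*pi)) * sqrt(A_H / (V_C * L_H)) in Hz.

    c      : speed of sound in air (m/s)
    area   : cross-sectional area of the acoustic port A_H (m^2)
    length : length of the acoustic port L_H (m)
    volume : volume of the cavity V_C (m^3)
    """
    if min(c, area, length, volume) <= 0:
        raise ValueError("all parameters must be positive")
    return (c / (2.0 * math.pi)) * math.sqrt(area / (volume * length))


# Hypothetical dimensions, chosen only for illustration.
SPEED_OF_SOUND = 343.0                 # m/s, air at roughly 20 degrees C
port_area = math.pi * (0.5e-3) ** 2    # circular port, 1 mm diameter
port_length = 2.0e-3                   # 2 mm
cavity_volume = 50.0e-9                # 50 mm^3

f_c = helmholtz_center_frequency(SPEED_OF_SOUND, port_area, port_length, cavity_volume)
print(f"Helmholtz center frequency: {f_c:.0f} Hz")
```

Because the frequency scales with the square root of $A_H / (V_C L_H)$, quadrupling the port area doubles $f_c$, while quadrupling the cavity volume or the port length halves it. That gives the acoustic designer three geometric knobs for placing the resonance.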
$H_{Analog}(n)$ is the system transfer function of the analog hardware filter circuit, and $H_{Digital}(n)$ is the system transfer function of the digital filter.
In general, the analog and digital filtering modules can achieve similar filtering effects, but the analog filtering module offers a larger dynamic range while the digital filtering module offers greater flexibility; combining the two gives the system design better effectiveness and reliability.
Once the decomposition of $H_{IRBE}(n)$ has been determined, the algorithmic processing can be carried out stage by stage following the flow in Fig. 2.
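Since the acoustic and analog stages are fixed by hardware constraints first, one way to read the decomposition is that the digital stage absorbs whatever remains of the overall target. The sketch below solves the product formula for the digital part, bin by bin; `digital_target` and the sample values are hypothetical, and the fixed stages are assumed to have a non-zero response in every bin.

```python
from typing import List


def digital_target(h_irbe: List[complex],
                   h_acoustic: List[complex],
                   h_analog: List[complex]) -> List[complex]:
    """Return H_Digital(n) = H_IRBE(n) / (H_Acoustic(n) * H_Analog(n))."""
    if not (len(h_irbe) == len(h_acoustic) == len(h_analog)):
        raise ValueError("all responses must cover the same frequency bins")
    return [t / (a * g) for t, a, g in zip(h_irbe, h_acoustic, h_analog)]


# Hypothetical per-bin responses, for illustration only.
h_irbe = [0.5 + 0j, -2j, 4 + 0j]        # overall shaping target
h_acoustic = [1 + 0j, 0.5 + 0j, 2 + 0j]  # fixed by the acoustic structure
h_analog = [0.5 + 0j, 1j, 1 + 0j]        # fixed by the analog circuit

h_digital = digital_target(h_irbe, h_acoustic, h_analog)
# The cascade of the three stages reproduces the overall target.
cascade = [a * g * d for a, g, d in zip(h_acoustic, h_analog, h_digital)]
```

Turning the resulting per-bin response into actual filter coefficients is a separate filter-design step that this sketch does not cover.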
The audio is first processed by the acoustic module according to the characteristics of $H_{Acoustic}(n)$ (at this stage the sound has not yet been captured by the microphone) and then enters the bone-conduction/in-ear microphone for capture. It is then processed by the analog hardware module according to the characteristics of $H_{Analog}(n)$, converted by the A/D converter, and stored as a digital signal.
After the digital filtering module processes the digital signal according to the characteristics of $H_{Digital}(n)$, the effect of the bone-conduction/in-ear microphone enhancement algorithm is essentially achieved.
However, because individuals wear the device differently and have different ear canal structures, $H_{IRBE}(n)$ varies somewhat from wearer to wearer. An ear canal adaptation module with characteristic $H_{comp}(n)$ is needed to compensate for this difference.
The ear canal adaptation module has two modes: a calibration mode and a correction mode.
In calibration mode, the module obtains $H_{comp}(n)$ by measuring the spectral response of the wearer reading a fixed speech sequence aloud.
In correction mode, the module applies $H_{comp}(n)$ directly to the voice data to match the individual's characteristics.
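The two modes can be sketched as a small class. The source does not give the formula for $H_{comp}(n)$; this sketch assumes, purely for illustration, a magnitude-only compensation equal to a stored reference spectrum of the fixed sequence divided by the wearer's measured spectrum. The class name, method names, and the small floor value are hypothetical.

```python
from typing import List, Sequence


class EarCanalAdapter:
    """Magnitude-only sketch of the calibration and correction modes."""

    def __init__(self, reference_magnitudes: Sequence[float]):
        # Expected spectrum of the fixed speech sequence (assumed to be known).
        self.reference = list(reference_magnitudes)
        # Identity compensation until calibration has been run.
        self.h_comp: List[float] = [1.0] * len(self.reference)

    def calibrate(self, measured_magnitudes: Sequence[float], floor: float = 1e-6) -> List[float]:
        """Calibration mode: derive H_comp(n) from the wearer's measured spectrum.

        Assumption of this sketch: H_comp(n) = reference(n) / measured(n).
        The floor avoids division by zero in silent bins.
        """
        if len(measured_magnitudes) != len(self.reference):
            raise ValueError("measurement must cover the same frequency bins")
        self.h_comp = [ref / max(m, floor)
                       for ref, m in zip(self.reference, measured_magnitudes)]
        return self.h_comp

    def correct(self, spectrum: Sequence[float]) -> List[float]:
        """Correction mode: apply the stored H_comp(n) to new voice data."""
        if len(spectrum) != len(self.h_comp):
            raise ValueError("spectrum must cover the same frequency bins")
        return [h * x for h, x in zip(self.h_comp, spectrum)]


# Hypothetical three-bin spectra, for illustration only.
adapter = EarCanalAdapter([1.0, 1.0, 1.0])
h_comp = adapter.calibrate([2.0, 1.0, 0.5])
corrected = adapter.correct([2.0, 1.0, 0.5])
```

Calibration runs once per wearer (or per fitting), after which correction is a cheap per-bin multiplication applied to every frame.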
Then, for the in-ear microphone use case, general-purpose dereverberation and noise suppression modules are also needed to further suppress the relevant noise components, yielding cleaner voice data that is then passed to the relevant back-end service modules to ensure high-quality service.
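The source does not specify which noise suppression algorithm is used. As one common, generic possibility, the sketch below shows magnitude spectral subtraction with a spectral floor; it is a stand-in for illustration, not the patent's method, and the function name, parameters, and sample values are assumptions.

```python
from typing import List, Sequence


def spectral_subtract(magnitudes: Sequence[float],
                      noise_estimate: Sequence[float],
                      over_subtraction: float = 1.0,
                      floor_ratio: float = 0.05) -> List[float]:
    """Subtract an estimated noise magnitude from each frequency bin.

    Each output bin is max(|X| - alpha * |N|, beta * |X|), where alpha is
    over_subtraction and beta is floor_ratio. The floor keeps bins from
    going negative after subtraction.
    """
    if len(magnitudes) != len(noise_estimate):
        raise ValueError("spectra must cover the same frequency bins")
    return [max(m - over_subtraction * n, floor_ratio * m)
            for m, n in zip(magnitudes, noise_estimate)]


# Hypothetical magnitude spectrum and noise estimate, for illustration only.
noisy = [1.0, 0.5, 0.1]
noise = [0.25, 0.25, 0.25]
cleaned = spectral_subtract(noisy, noise, floor_ratio=0.5)
```

In the third bin the noise estimate exceeds the signal, so the floor takes over instead of producing a negative magnitude. A real system would estimate the noise spectrum from frames without wearer speech.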
According to the above embodiments, the present invention first preprocesses the captured sound signal by acoustic filtering and then applies further filtering through analog and/or digital circuits, effectively suppressing the captured noise components. The ASR/NLP system then identifies valid voice commands from the cleaner voice signal, which greatly improves speech recognition accuracy and ensures high-quality service.
The above are only preferred embodiments of the present invention and are not intended to limit it. Those skilled in the art may make various modifications and changes to the invention. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within its scope of protection.
Claims (10)
1. A voice control optimization method, in which a voice signal collected by a sound pickup device is processed by a speech recognition system (ASR) to obtain a command signal, and voice control of a smart device is realized according to the command signal, characterized by comprising:
receiving an audio signal and, before the audio signal enters the speech recognition system (ASR), performing acoustic filtering on the audio signal to obtain preprocessed audio;
acquiring the preprocessed audio with a microphone, filtering it with an analog circuit, and then generating digital audio by analog-to-digital conversion;
processing the digital audio via the speech recognition system (ASR) into a valid voice command;
triggering the corresponding function of the smart device according to the valid voice command.
2. The voice control optimization method according to claim 1, characterized in that the acoustic filtering performs noise reduction on the voice signal in the spatial domain by means of a mechanical structure and/or an acoustic structure.
3. The voice control optimization method according to claim 1, characterized in that processing the digital audio via the speech recognition system (ASR) into a valid voice command specifically comprises: processing the digital audio with an ear canal adaptation algorithm, a dereverberation algorithm, and a noise suppression algorithm of a DSP module, and outputting the processed digital signal to the speech recognition system (ASR); the speech recognition system (ASR) converts the received digital signal into text information, and the text information is processed by a natural language processing system (NLP) into the valid voice command.
4. The voice control optimization method according to claim 1, characterized in that the microphone is a bone-conduction microphone and/or an in-ear microphone.
5. The voice control optimization method according to claim 4, characterized by further comprising: filtering and shaping the preprocessed audio signal according to the system transfer function of the acoustic structure, the system transfer function of the acoustic structure being determined by the center frequency f_c of the Helmholtz resonance:
$$f_c = \frac{c}{2\pi}\sqrt{\frac{A_H}{V_C L_H}}$$
where c is the speed of sound in air, A_H is the cross-sectional area of the sound hole, L_H is the length of the sound hole, and V_C is the volume of the cavity.
6. The voice control optimization method according to claim 4, characterized in that generating digital audio by analog-to-digital conversion after filtering by the analog circuit specifically comprises: performing the filtering according to the system transfer function of the analog filter circuit.
7. The voice control optimization method according to claim 4, characterized by further comprising, before the digital audio is processed via the speech recognition system (ASR) into a valid voice command: digitally filtering the digital audio according to the system transfer function of a digital filter.
8. A voice control optimization device, in which a voice signal collected by a sound pickup device is processed by a speech recognition system (ASR) to obtain a command signal, and voice control of a smart device is realized according to the command signal, characterized by comprising:
an acoustic filtering unit, which receives an audio signal and, before the audio signal enters the speech recognition system (ASR), performs acoustic filtering on the audio signal to obtain preprocessed audio;
an analog-to-digital conversion unit, in which the preprocessed audio is acquired by a microphone, filtered by an analog circuit, and then converted into digital audio by analog-to-digital conversion;
a voice recognition unit, in which the digital audio is processed via the speech recognition system (ASR) into a valid voice command;
an instruction execution unit, which triggers the corresponding function of the smart device according to the valid voice command.
9. An earphone integrating an in-ear microphone, comprising a housing, an analog audio processing module, and a digital audio processing module, characterized by further comprising:
the voice control optimization device according to claim 8; and
an acoustic processing module, serving as a pre-processing module for the analog audio processing module, which performs acoustic filtering by means of a mechanical structure and/or an acoustic structure;
an in-ear sound pickup tube, connected to the acoustic processing module and the analog audio processing module and arranged inside the housing, which collects sound waves in the ear canal.
10. A wearable device, comprising a processor and a wireless communication module, characterized by further comprising: the earphone integrating an in-ear microphone according to claim 9.
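The Helmholtz resonance center frequency referred to in claim 5 can be evaluated numerically. The sketch below assumes the textbook form f_c = (c / 2π) · sqrt(A_H / (V_C · L_H)) with no end correction for the sound hole length; the function name `helmholtz_center_frequency` and its parameter names are hypothetical and chosen only to mirror the symbols in the claim. All quantities are taken to be in SI units.

```python
import math


def helmholtz_center_frequency(c: float, a_h: float, l_h: float, v_c: float) -> float:
    """Center frequency (Hz) of an ideal Helmholtz resonator.

    c    speed of sound in air (m/s)
    a_h  cross-sectional area of the sound hole (m^2)
    l_h  length of the sound hole (m)
    v_c  volume of the cavity (m^3)

    Assumes the uncorrected textbook formula; real earphone geometries
    usually need an effective (end-corrected) hole length.
    """
    if min(c, a_h, l_h, v_c) <= 0:
        raise ValueError("all parameters must be positive")
    return (c / (2.0 * math.pi)) * math.sqrt(a_h / (v_c * l_h))


if __name__ == "__main__":
    # Illustrative geometry only: 1 mm diameter hole, 2 mm long, 50 mm^3 cavity.
    area = math.pi * (0.5e-3) ** 2
    print(helmholtz_center_frequency(343.0, area, 2.0e-3, 50.0e-9))
```

Under this formula, enlarging the cavity or lengthening the sound hole lowers the center frequency, while widening the hole raises it, which is how the acoustic structure can be tuned to shape the signal before it reaches the microphone.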
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710566244X | 2017-07-12 | | |
CN201710566244 | 2017-07-12 | | |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108200492A (en) | 2018-06-22 |
Family
ID=62584633
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711448752.4A (Pending; published as CN108200492A) | Voice control optimization method, device and the earphone and wearable device that integrate In-Ear microphone | 2017-07-12 | 2017-12-27 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108200492A (en) |
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103444208A (en) * | 2011-01-05 | 2013-12-11 | 皇家飞利浦电子股份有限公司 | Seal-quality estimation for a seal for an ear canal |
CN103096197A (en) * | 2011-11-08 | 2013-05-08 | 富准精密工业(深圳)有限公司 | Microphone module |
CN206181373U (en) * | 2013-11-20 | 2017-05-17 | 索诺瓦公司 | A sound duct, sound duct are arranged, earphone and hearing equipment for earphone |
CN104768115A (en) * | 2013-12-31 | 2015-07-08 | Gn瑞声达A/S | Earmold for active occlusion cancellation |
CN105785767A (en) * | 2014-12-26 | 2016-07-20 | 北京奇虎科技有限公司 | Intelligent earphone capable of controlling intelligent household environment |
CN106920548A (en) * | 2015-12-25 | 2017-07-04 | 比亚迪股份有限公司 | Phonetic controller, speech control system and sound control method |
CN106937195A (en) * | 2015-12-29 | 2017-07-07 | 哈曼国际工业有限公司 | Noise reduction external ear headphone |
CN106601245A (en) * | 2016-12-15 | 2017-04-26 | 北京塞宾科技有限公司 | Vehicle-mounted intelligent audio device and audio processing method |
CN106604181A (en) * | 2016-12-15 | 2017-04-26 | 北京塞宾科技有限公司 | Distributed microphone smart home system |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109345817A (en) * | 2018-10-09 | 2019-02-15 | 中天智领(北京)科技有限公司 | Large screen system control method, device and electronic equipment |
CN110766573A (en) * | 2019-07-06 | 2020-02-07 | 杨林 | Interactive reading system for tissue life and corresponding tissue life activity room |
CN113096665A (en) * | 2019-12-23 | 2021-07-09 | 上海浩宜信息科技有限公司 | Intelligent microphone array |
CN113259793A (en) * | 2020-02-07 | 2021-08-13 | 杭州智芯科微电子科技有限公司 | Intelligent microphone and signal processing method thereof |
CN113259793B (en) * | 2020-02-07 | 2022-12-06 | 杭州智芯科微电子科技有限公司 | Intelligent microphone and signal processing method thereof |
WO2023087572A1 (en) * | 2021-11-19 | 2023-05-25 | 深圳市韶音科技有限公司 | Acoustic apparatus and transfer function determination method therefor |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11671773B2 (en) | Hearing aid device for hands free communication | |
CN108200492A (en) | Voice control optimization method, device and the earphone and wearable device that integrate In-Ear microphone | |
CN106664498B (en) | For generating the artificial ear device and its correlation technique of head relevant to audio frequency transmission function | |
US10803857B2 (en) | System and method for relative enhancement of vocal utterances in an acoustically cluttered environment | |
US20060159297A1 (en) | Ear canal signal converting method, ear canal transducer and headset | |
JPH09503889A (en) | Voice canceling transmission system | |
US20200213776A1 (en) | External ear insert for hearing enhancement | |
CN209642968U (en) | Bone conduction conductance double type hearing aid | |
JP2002125298A (en) | Microphone device and earphone microphone device | |
JP2022531363A (en) | Auditory device with bone conduction sensor | |
CN112116918A (en) | Speech signal enhancement processing method and earphone | |
CN109729448A (en) | Neck wears the voice control optimization method and device of formula interactive voice earphone | |
US11589173B2 (en) | Hearing aid comprising a record and replay function | |
CN207518797U (en) | Neck wears the voice control optimization device of formula interactive voice earphone | |
CN109511070A (en) | A kind of hearing aid gain processing method | |
CN113194374A (en) | Earphone based on bone voiceprint recognition technology | |
EP4351171A1 (en) | A hearing aid comprising a speaker unit | |
EP4297436A1 (en) | A hearing aid comprising an active occlusion cancellation system and corresponding method | |
US20120250918A1 (en) | Method for improving the comprehensibility of speech with a hearing aid, together with a hearing aid | |
CN115942190A (en) | ENC noise reduction method and system for simulating sound generation in auditory canal based on artificial head frame device | |
TW202406367A (en) | Earphone device, compensation method thereof and computer program product | |
CN116709116A (en) | Sound signal processing method and earphone device | |
EP3318071A1 (en) | External ear insert for hearing enhancement |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | Application publication date: 20180622 |