CN109493877A - A kind of sound enhancement method and device of auditory prosthesis - Google Patents


Info

Publication number
CN109493877A
Authority
CN
China
Prior art keywords
audio data
sound
subband
parameter
speech
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710817728.7A
Other languages
Chinese (zh)
Other versions
CN109493877B (en)
Inventor
王志华
孙卓异
姜汉钧
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tsinghua University
Original Assignee
Tsinghua University
Application filed by Tsinghua University
Priority to CN201710817728.7A
Publication of CN109493877A
Application granted
Publication of CN109493877B
Legal status: Active


Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00: Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02: Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208: Noise filtering
    • G10L21/0216: Noise filtering characterised by the method used for estimating noise
    • G10L21/0224: Processing in the time domain
    • G10L21/0232: Processing in the frequency domain
    • G10L21/0316: Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04R: LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R25/00: Deaf-aid sets, i.e. electro-acoustic or electro-mechanical hearing aids; Electric tinnitus maskers providing an auditory perception
    • H04R25/50: Customised settings for obtaining desired overall acoustical characteristics

Landscapes

  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Quality & Reliability (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • General Health & Medical Sciences (AREA)
  • Neurosurgery (AREA)
  • Otolaryngology (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Prostheses (AREA)

Abstract

The embodiments of the invention disclose a speech enhancement method and device for an auditory prosthesis, relating to the fields of medical electronics and audio signal processing. The method comprises: obtaining four-channel audio data from the auditory prosthesis; extracting acoustic environment features from the obtained audio data to determine the acoustic scene corresponding to the audio data; performing per-channel sound compensation and speech enhancement on the obtained audio data according to the acoustic scene; and outputting two channels of enhanced audio data. The portable terminal performs the speech enhancement processing on the audio data it obtains and finally outputs two channels of real-time audio data. Sound quality is improved intelligently, hearing aid fitting becomes far more accessible, and a better hearing-aid and enhancement effect can be achieved. In addition, because the audio data processing is not fixed in the processor of the auditory prosthesis but runs on the general-purpose processor chip of the portable terminal, future system upgrades and refinements of the speech enhancement method are easier.

Description

A kind of sound enhancement method and device of auditory prosthesis
Technical field
The present invention relates to the fields of medical electronics and audio signal processing, and in particular to a speech enhancement method and device for an auditory prosthesis.
Background
China has entered an era of accelerated population ageing. With rising life expectancy among the elderly and the overuse of electronic products, the number of people with reduced or damaged hearing is trending upward, and as health care has improved in recent years, a growing proportion of elderly and hearing-impaired people wear hearing aids. Today, hearing aid technology is based on advanced digital signal processing, wireless communication and artificial intelligence. As the technology develops rapidly, hearing aids keep getting smaller and their functions more comprehensive, for example multi-channel wide dynamic range compression, active noise reduction, adaptive directionality, sound field analysis, and wireless connection to other audio or communication systems.
An important requirement for a hearing aid is that it compensates for the patient's hearing loss and improves audio quality without further damaging the patient's hearing. The built-in algorithms of existing hearing aids are fixed in the processor and cannot be upgraded automatically as processors change.
Summary of the invention
To solve the above technical problem, the embodiments of the invention provide a speech enhancement method and device for an auditory prosthesis, in which the speech enhancement function associated with hearing assistance is implemented on a portable smart terminal (such as a mobile phone).
In a first aspect, the invention provides a speech enhancement method for an auditory prosthesis, comprising:
obtaining four-channel audio data from the auditory prosthesis;
extracting acoustic environment features from the obtained audio data to determine the acoustic scene corresponding to the audio data;
performing per-channel sound compensation and speech enhancement on the obtained audio data according to the acoustic scene;
outputting two channels of enhanced audio data.
Preferably, extracting acoustic environment features from the obtained audio data to determine the acoustic scene corresponding to the audio data comprises:
extracting the acoustic environment features of the audio data;
matching the extracted acoustic environment features against preset acoustic environments to determine the environment mode the user is in.
Preferably, performing per-channel sound compensation and speech enhancement on the obtained audio data according to the acoustic scene comprises:
pre-processing the audio data and filtering it per channel;
decomposing the per-channel filtered audio data into subbands, and performing spectral analysis on each subband of the audio data to obtain the signal-to-noise ratio of that subband;
gating the sound sources corresponding to the audio data according to the determined environment mode, and calculating the angle of the sound source position;
performing noise reduction and howling (acoustic feedback) cancellation on each subband of the audio data according to the determined angle of the sound source position and the signal-to-noise ratio of the subband;
performing dynamic compression and sound intensity amplification on each subband of the noise-reduced audio data;
performing time-frequency conversion on the frequency-domain signal of each subband of the compressed and amplified audio data, and performing linear phase compensation;
merging the subbands of the audio data into a time-domain speech signal.
Preferably, pre-processing the audio data comprises:
applying first-order high-pass filtering to the components of the audio data whose frequency is above a preset value.
Preferably, after extracting acoustic environment features from the obtained audio data to determine the acoustic scene corresponding to the audio data, the method further comprises:
obtaining at least one of the following parameters of the environment mode:
a modulation amplitude parameter, a directionality control parameter, a compression-amplification ratio parameter and a noise suppression parameter.
Preferably, gating the sound sources corresponding to the audio data according to the determined environment mode and calculating the angle of the sound source position comprises:
gating the sound sources in all directions around the auditory prosthesis according to the directionality control parameter;
calculating the angle of the sound source position.
Preferably, performing noise reduction on each subband of the audio data comprises:
identifying whether the audio data is noise according to the modulation amplitude parameter, based on the envelope modulation characteristics and the spectral analysis result of the audio data;
suppressing the noise according to the determined signal-to-noise ratio and the noise suppression parameter.
Preferably, performing time-frequency conversion on the frequency-domain signal of each subband of the compressed and amplified audio data and performing linear phase compensation comprises:
performing time-frequency conversion on the frequency-domain signal of each subband of the compressed and amplified audio data;
performing phase compensation to a corresponding degree according to the compression-amplification ratio coefficient.
In a second aspect, the invention further provides a speech enhancement device for an auditory prosthesis, comprising:
a sound pickup module, configured to obtain four-channel audio data from the auditory prosthesis;
an acoustic environment monitoring module, configured to extract acoustic environment features from the obtained audio data to determine the acoustic scene corresponding to the audio data;
a sound processing module, configured to perform per-channel sound compensation and speech enhancement on the obtained audio data according to the acoustic scene;
an output module, configured to output two channels of enhanced audio data.
Preferably, the acoustic environment monitoring module extracting acoustic environment features from the obtained audio data to determine the acoustic scene corresponding to the audio data comprises:
extracting the acoustic environment features of the audio data;
matching the extracted acoustic environment features against preset acoustic environments to determine the environment mode the user is in.
Preferably, the sound processing module comprises:
a pre-processing unit, configured to pre-process the audio data and filter it per channel;
a subband decomposition unit, configured to decompose the per-channel filtered audio data into subbands, and to perform spectral analysis on each subband of the audio data to obtain the signal-to-noise ratio of that subband;
a sound source localization unit, configured to gate the sound sources corresponding to the audio data according to the determined environment mode and to calculate the angle of the sound source position;
a noise suppression and feedback cancellation unit, configured to perform noise reduction and howling cancellation on each subband of the audio data according to the determined angle of the sound source position and the signal-to-noise ratio of the subband;
a compression and amplification unit, configured to perform dynamic compression and sound intensity amplification on each subband of the noise-reduced audio data;
a sound compensation unit, configured to perform time-frequency conversion on the frequency-domain signal of each subband of the compressed and amplified audio data and to perform phase compensation;
a sound synthesis unit, configured to merge the subbands of the audio data into a time-domain speech signal.
Preferably, the pre-processing unit pre-processing the audio data comprises:
applying first-order high-pass filtering to the components of the audio data whose frequency is above a preset value.
Preferably, the acoustic environment monitoring module is further configured to:
obtain at least one of the following parameters of the environment mode:
a modulation amplitude parameter, a directionality control parameter, a compression-amplification ratio parameter and a noise suppression parameter.
Preferably, the sound source localization unit gating the sound sources corresponding to the audio data according to the determined environment mode and calculating the angle of the sound source position comprises:
gating the sound sources in all directions around the auditory prosthesis according to the directionality control parameter;
calculating the angle of the sound source position.
Preferably, the noise suppression and feedback cancellation unit performing noise reduction on each subband of the audio data comprises:
identifying whether the audio data is noise according to the modulation amplitude parameter, based on the envelope modulation characteristics and the spectral analysis result of the audio data;
suppressing the noise according to the determined signal-to-noise ratio and the noise suppression parameter.
Preferably, the sound compensation unit performing time-frequency conversion on the frequency-domain signal of each subband of the compressed and amplified audio data and performing phase compensation comprises:
performing time-frequency conversion on the frequency-domain signal of each subband of the compressed and amplified audio data;
performing phase compensation to a corresponding degree according to the compression-amplification ratio coefficient.
In a third aspect, the invention further provides a speech enhancement device, comprising a memory and a processor;
the memory is configured to store executable instructions;
the processor is configured to execute the executable instructions stored in the memory to perform the following operations:
obtaining four-channel audio data from the auditory prosthesis;
extracting acoustic environment features from the obtained audio data to determine the acoustic scene corresponding to the audio data;
performing per-channel sound compensation and speech enhancement on the obtained audio data according to the acoustic scene;
outputting two channels of enhanced audio data.
In a fourth aspect, the invention further provides a computer-readable storage medium storing computer-executable instructions which, when executed, perform the following operations:
obtaining four-channel audio data from the auditory prosthesis;
extracting acoustic environment features from the obtained audio data to determine the acoustic scene corresponding to the audio data;
performing per-channel sound compensation and speech enhancement on the obtained audio data according to the acoustic scene;
outputting two channels of enhanced audio data.
In the speech enhancement method and device for an auditory prosthesis provided by the embodiments of the invention, four channels of audio data are picked up by the ear-side auditory prosthesis and transmitted to a portable terminal, the portable terminal performs speech enhancement on the audio data, and two channels of real-time audio data are finally output. Unlike the hearing-assistance system built into a conventional auditory prosthesis, the embodiments take the differing performance of portable terminal processors fully into account and propose an automatically upgradable hearing-aid speech enhancement method that can be used on different portable terminals. Sound quality is improved intelligently, hearing aid fitting becomes far more accessible, and a better hearing-aid and enhancement effect can be achieved. In addition, because the audio data processing is not fixed in the processor of the auditory prosthesis but runs on the general-purpose processor chip of the portable terminal, future system upgrades and refinements of the speech enhancement method are easier.
Brief description of the drawings
The accompanying drawings are provided for a further understanding of the technical solution of the invention and form part of the specification. Together with the embodiments of the application they serve to explain the technical solution of the invention and do not limit it.
Fig. 1 is a flow chart of a speech enhancement method for an auditory prosthesis provided by an embodiment of the invention;
Fig. 2 is a structural diagram of a speech enhancement device for an auditory prosthesis provided by an embodiment of the invention;
Fig. 3 is a structural diagram of the sound processing module provided by an embodiment of the invention.
Detailed description of the embodiments
To make the objectives, technical solutions and advantages of the invention clearer, the embodiments of the invention are described in detail below with reference to the accompanying drawings. It should be noted that, where no conflict arises, the embodiments of the application and the features of those embodiments may be combined with one another in any way.
Portable smart terminals (such as mobile phones) are increasingly widespread, and the computing power of their general-purpose processors keeps growing. However, most current hearing aids have no complete speech enhancement implementation on a portable terminal to match their hearing-aid function; the built-in algorithms of existing hearing aids are fixed in the processor and cannot be upgraded automatically as processors change. As shown in Fig. 1, an embodiment of the invention provides a speech enhancement method for an auditory prosthesis, implemented by the processor of a portable terminal, comprising:
S101, obtaining four-channel audio data from the auditory prosthesis;
S102, extracting acoustic environment features from the obtained audio data to determine the acoustic scene corresponding to the audio data;
S103, performing per-channel sound compensation and speech enhancement on the obtained audio data according to the acoustic scene;
S104, outputting two channels of enhanced audio data.
In the embodiments of the invention, the four-channel audio data refers to the sound inputs that the portable terminal acquires from the auditory prosthesis: the front microphone on the left ear side, the rear microphone on the left ear side, the front microphone on the right ear side and the rear microphone on the right ear side.
The embodiments of the invention implement intelligent speech enhancement on a portable terminal. Without updating the ear-side auditory prosthesis, the hearing loss of a hearing-impaired patient can be compensated using the portable terminal alone: four channels of audio data are picked up by the ear-side auditory prosthesis and transmitted to the portable terminal, the portable terminal performs speech enhancement on the audio data, and two channels of real-time audio data are finally output. Sound quality is improved intelligently and hearing aid fitting becomes far more accessible. Intelligent speech enhancement is performed in real time while the basic hearing-aid function is still provided, which is convenient for the user and makes the product easy to upgrade.
In step S102, extracting acoustic environment features from the obtained audio data to determine the acoustic scene corresponding to the audio data comprises:
extracting the acoustic environment features of the audio data;
matching the extracted acoustic environment features against preset acoustic environments to determine the environment mode the user is in.
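The matching step above can be illustrated with a nearest-prototype lookup. This is only a sketch under stated assumptions: the text does not say which features are extracted or how matching is done, so the three features (level, modulation depth, spectral centroid), the preset names, the prototype values and the scaling factors below are all hypothetical.

```python
import math

# Hypothetical preset environments, each with a prototype feature vector:
# (level in dB, envelope modulation depth 0..1, spectral centroid in kHz).
# The numbers are illustrative placeholders, not values from the patent.
PRESET_ENVIRONMENTS = {
    "quiet": (40.0, 0.2, 0.8),
    "speech_in_quiet": (60.0, 0.7, 1.2),
    "speech_in_noise": (70.0, 0.4, 1.5),
    "noise": (75.0, 0.1, 2.0),
}

# Per-feature scales so that no single feature dominates the distance.
FEATURE_SCALES = (10.0, 0.2, 0.5)


def match_environment(features, presets=PRESET_ENVIRONMENTS, scales=FEATURE_SCALES):
    """Return the name of the preset whose prototype is closest to `features`."""
    def distance(prototype):
        # Scaled Euclidean distance between the measured and prototype vectors.
        return math.sqrt(sum(((f - p) / s) ** 2
                             for f, p, s in zip(features, prototype, scales)))
    return min(presets, key=lambda name: distance(presets[name]))
```

The returned name would then select the parameter set (modulation amplitude, directionality control, compression-amplification ratio, noise suppression) for that environment mode.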
In step S103, performing per-channel sound compensation and speech enhancement on the obtained audio data according to the acoustic scene comprises:
S1031, pre-processing the audio data and filtering it per channel;
S1032, decomposing the per-channel filtered audio data into subbands, and performing spectral analysis on each subband of the audio data to obtain the signal-to-noise ratio of that subband;
S1033, gating the sound sources corresponding to the audio data according to the determined environment mode, and calculating the angle of the sound source position;
S1034, performing noise reduction and howling (acoustic feedback) cancellation on each subband of the audio data according to the determined angle of the sound source position and the signal-to-noise ratio of the subband;
S1035, performing dynamic compression and sound intensity amplification on each subband of the noise-reduced audio data;
S1036, performing time-frequency conversion on the frequency-domain signal of each subband of the compressed and amplified audio data, and performing phase compensation;
S1037, merging the subbands of the audio data into a time-domain speech signal.
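As an illustration of the subband signal-to-noise ratio obtained in S1032, the sketch below estimates the SNR of one subband from frames that have already been labelled as speech or noise-only. The text does not give the estimator, so the power-difference estimate, the function names and the `is_speech` labels (assumed to come from some endpoint detector) are assumptions.

```python
import math


def frame_power(frame):
    """Mean squared amplitude of one frame of samples."""
    return sum(x * x for x in frame) / len(frame)


def subband_snr_db(frames, is_speech, floor=1e-12):
    """Estimate the SNR (dB) of one subband.

    frames: list of frames (lists of samples) of the subband signal.
    is_speech: list of booleans, True where the frame contains speech.
    """
    noise = [frame_power(f) for f, s in zip(frames, is_speech) if not s]
    active = [frame_power(f) for f, s in zip(frames, is_speech) if s]
    if not noise or not active:
        raise ValueError("need at least one noise-only and one speech frame")
    noise_power = sum(noise) / len(noise)
    noisy_power = sum(active) / len(active)
    # Speech frames contain speech plus noise, so subtract the noise estimate.
    signal_power = max(noisy_power - noise_power, floor)
    return 10.0 * math.log10(signal_power / max(noise_power, floor))
```

A per-subband value of this kind is what the later noise reduction step would consume when choosing how aggressively to suppress each band.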
For the per-channel filtering in step S1031 of this embodiment, the background noise is estimated from the detected speech endpoints and a first filtering pass is performed by spectral subtraction, yielding four speech signals from which noise has been preliminarily removed.
Pre-processing the audio data comprises:
applying first-order high-pass filtering to the components of the audio data whose frequency is above a preset value.
In the embodiments of the invention, pre-processing mainly refers to pre-emphasis: the high-frequency components are passed through a first-order high-pass filter to increase the high-frequency resolution of the speech.
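A first-order pre-emphasis filter of the kind described can be sketched as follows. The difference equation y[n] = x[n] - a·x[n-1] is the usual textbook form; the coefficient 0.97 is a commonly used value assumed here, not one given in the text.

```python
def pre_emphasis(samples, coeff=0.97):
    """First-order high-pass (pre-emphasis): y[n] = x[n] - coeff * x[n-1]."""
    out = []
    prev = 0.0  # x[-1] is taken to be zero
    for x in samples:
        out.append(x - coeff * prev)
        prev = x
    return out
```

With `coeff=0.5`, a constant input `[1.0, 1.0, 1.0]` is attenuated to `[1.0, 0.5, 0.5]`, while a rapidly alternating input `[1.0, -1.0, 1.0]` is boosted to `[1.0, -1.5, 1.5]`, which is the high-frequency emphasis the step relies on.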
The embodiments of the invention perform the per-channel filtering with Gammatone filters, as follows:
Because of the structure of the human ear, different positions along the basilar membrane respond to different characteristic frequencies. This frequency selectivity can be represented by N-th order Gammatone filters of unequal bandwidth, whose time-domain expression satisfies:
$$g(t) = A\,t^{N-1} e^{-2\pi b t} \cos(2\pi f_c t + \varphi), \quad t \ge 0$$
where $\varphi$ is the phase, $f_c$ is the center frequency, $b$ is the bandwidth, $N$ is the order of the filter, $t$ is time and $A$ is the amplitude.
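The Gammatone impulse response in its standard form, g(t) = A·t^(N-1)·exp(-2πbt)·cos(2πf_c·t + φ), can be sampled directly. The sketch below assumes a 16 kHz sampling rate, a fourth-order filter, and the widely used Glasberg-Moore equivalent rectangular bandwidth (ERB) with b = 1.019·ERB for choosing the bandwidth; none of these values are stated in the text.

```python
import math


def erb_bandwidth(fc):
    """Glasberg-Moore equivalent rectangular bandwidth (Hz) at center frequency fc (Hz)."""
    return 24.7 * (4.37 * fc / 1000.0 + 1.0)


def gammatone_ir(fc, b, order=4, fs=16000, num_samples=400, amplitude=1.0, phase=0.0):
    """Sample g(t) = A * t**(N-1) * exp(-2*pi*b*t) * cos(2*pi*fc*t + phase)."""
    ir = []
    for i in range(num_samples):
        t = i / fs
        ir.append(amplitude * t ** (order - 1)
                  * math.exp(-2.0 * math.pi * b * t)
                  * math.cos(2.0 * math.pi * fc * t + phase))
    return ir


# Example: one channel centered at 1 kHz (an assumed center frequency).
ir_1k = gammatone_ir(1000.0, 1.019 * erb_bandwidth(1000.0))
```

Convolving a microphone signal with one such impulse response per center frequency gives the bank of unequal-bandwidth channels; the unit amplitude here leaves the response unnormalized.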
In step S1034 of the embodiments of the invention, noise is reduced with a per-channel subband noise reduction technique. Unlike existing per-channel noise reduction techniques, the embodiments apply different noise reduction schemes according to the frequency band in which each subband lies in order to improve speech quality and thereby achieve speech enhancement. The noise contained in the four-channel audio data generally lies in the low-frequency band; it is attenuated by spectral subtraction with a variable noise subtraction parameter, which keeps the degree of speech distortion controllable. For the speech signal in the high-frequency band, the cross-correlation function method is used to remove the noise spectral components of the high band, preserving the parameters needed for localization without attenuating the speech signal. The variable noise subtraction parameter can be determined according to the following formula:
[formula not reproduced in this text]
where k is the subband index, l is the subband frame number, the starting value of α is chosen at random, SNRp is the posterior signal-to-noise ratio, and σ is a positive integer that controls the range of the subband noise spectral subtraction. β and the extreme values of αi(k) are evaluation parameters related to the estimate of the prior SNR. β prevents the denominator from being zero (which could otherwise happen when the posterior SNR tends to zero) and is computed as the reciprocal of the difference between the maximum and minimum values of α obtained over speech segments.
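Since the patent's own formula for the variable noise subtraction parameter is not available here, the following sketch substitutes a different, well-known rule: the Berouti-style over-subtraction factor, which decreases linearly with the subband SNR. It only illustrates the general idea of subtracting more noise when the SNR is low; the constants (α0 = 4, slope 3/20 per dB, clamp range, spectral floor) are the usual textbook defaults, not values from the text.

```python
def oversubtraction_factor(snr_db, alpha0=4.0, slope=3.0 / 20.0,
                           alpha_min=1.0, alpha_max=4.75):
    """Berouti-style rule (a stand-in): subtract more aggressively at low SNR."""
    alpha = alpha0 - slope * snr_db
    return min(max(alpha, alpha_min), alpha_max)


def spectral_subtract(noisy_power, noise_power, snr_db, floor=0.01):
    """Subtract alpha * noise power in each bin of one subband.

    A spectral floor (floor * noise power) keeps bins from going negative
    and limits musical noise.
    """
    alpha = oversubtraction_factor(snr_db)
    cleaned = []
    for y, n in zip(noisy_power, noise_power):
        cleaned.append(max(y - alpha * n, floor * n))
    return cleaned
```

For example, at 0 dB SNR the factor is 4, so `spectral_subtract([10.0, 1.0], [1.0, 1.0], 0.0)` gives `[6.0, 0.01]`: the strong bin keeps most of its power and the noise-dominated bin is clamped to the floor.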
The split between the high band and the low band is computed from the noise power spectrum of each subband's output signal; the split frequency is usually chosen at around 800 Hz to 1000 Hz.
After step S102 extracts acoustic environment features from the obtained audio data to determine the acoustic scene corresponding to the audio data, the method further comprises:
obtaining at least one of the following parameters of the environment mode:
a modulation amplitude parameter, a directionality control parameter, a compression-amplification ratio parameter and a noise suppression parameter.
The directionality control parameter includes parameters such as the interaural time difference, the interaural intensity difference, the interaural phase difference, and the phase difference between the front and rear microphones of each ear.
Gating the sound sources corresponding to the audio data according to the determined environment mode and calculating the angle of the sound source position comprises:
gating the sound sources in all directions around the auditory prosthesis according to the directionality control parameter;
calculating the angle of the sound source position.
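One common way to compute a source angle from two microphones is to estimate the inter-channel time difference by cross-correlation and convert it with a far-field model, sin(θ) = c·τ/d. The sketch below follows that approach as an assumption; the text does not give the localization formula, and the microphone spacing (0.18 m, roughly ear to ear), sampling rate and speed of sound are illustrative values.

```python
import math


def estimate_delay(left, right, max_lag):
    """Lag (in samples) that maximizes the cross-correlation of the two channels.

    A positive lag means the right channel is delayed relative to the left,
    i.e. the sound reached the left microphone first.
    """
    best_lag, best_score = 0, float("-inf")
    n = len(left)
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i in range(n):
            j = i + lag
            if 0 <= j < n:
                score += left[i] * right[j]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def source_angle_deg(lag, fs=16000, mic_distance=0.18, speed_of_sound=343.0):
    """Far-field model: sin(theta) = c * tau / d, theta measured from straight ahead."""
    s = speed_of_sound * (lag / fs) / mic_distance
    s = max(-1.0, min(1.0, s))  # clamp against estimation error
    return math.degrees(math.asin(s))
```

With these assumed values a delay of two samples corresponds to a source about 14 degrees off the frontal direction, toward the left microphone.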
Performing noise reduction on each subband of the audio data comprises:
identifying whether the audio data is noise according to the modulation amplitude parameter, based on the envelope modulation characteristics and the spectral analysis result of the audio data;
suppressing the noise according to the determined signal-to-noise ratio and the noise suppression parameter.
In the embodiments of the invention, the modulation amplitude parameter is determined by the environment. Because the envelope of a speech signal is modulated, after spectral analysis the magnitude of the modulation rate can be used to identify whether the input acoustic signal is speech or noise. The noise suppression parameter likewise depends on the environment: the noise spectrum differs between noisy and quiet environments, so the computed input signal-to-noise ratio also differs, and this parameter is used to calculate the variable noise subtraction parameter.
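The idea that speech has a strongly modulated envelope while steady noise does not can be sketched with a simple modulation-depth test. The envelope follower, the depth measure (max - min)/(max + min), the threshold of 0.5 (standing in for the modulation amplitude parameter) and the start-up skip are all assumptions made for illustration.

```python
def envelope(samples, smooth=0.9):
    """Rectify the signal, then smooth it with a one-pole low-pass filter."""
    env, state = [], 0.0
    for x in samples:
        state = smooth * state + (1.0 - smooth) * abs(x)
        env.append(state)
    return env


def modulation_depth(env):
    """(max - min) / (max + min): near 0 for steady noise, near 1 for bursty speech."""
    hi, lo = max(env), min(env)
    return 0.0 if hi + lo == 0.0 else (hi - lo) / (hi + lo)


def is_noise(samples, modulation_threshold=0.5, skip=50):
    """Classify a segment as noise when its envelope is only weakly modulated."""
    env = envelope(samples)[skip:]  # drop the filter's start-up transient
    return modulation_depth(env) < modulation_threshold
```

A constant-amplitude signal yields a flat envelope and is classified as noise, whereas a signal with bursts and pauses yields a deep modulation and is treated as speech.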
Performing time-frequency conversion on the frequency-domain signal of each subband of the compressed and amplified audio data and performing linear phase compensation comprises:
performing time-frequency conversion on the frequency-domain signal of each subband of the compressed and amplified audio data;
performing phase compensation to a corresponding degree according to the compression-amplification ratio coefficient.
The compression-amplification ratio parameter is determined by the patient's hearing loss. After audiometry, an audiogram records the patient's hearing at different frequencies, and the compression-amplification ratio parameter is determined from these data so that sound is amplified to a normal hearing level. During sound compensation, the compression-amplification ratio coefficient differs between environments, so different degrees of compensation are applied.
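How an audiogram might translate into per-band gain and compression can be sketched as below. The text does not give the fitting rule, so two common stand-ins are assumed: the half-gain rule of thumb for the linear gain in each band, and a static wide dynamic range compression curve with an assumed knee point of 50 dB and ratio of 2:1.

```python
def prescribe_linear_gains(hearing_loss_db, fraction=0.5):
    """Half-gain rule of thumb: gain of about half the hearing loss in each band.

    hearing_loss_db: dict mapping frequency (Hz) to hearing loss (dB HL).
    """
    return {freq: fraction * loss for freq, loss in hearing_loss_db.items()}


def wdrc_gain_db(input_level_db, linear_gain_db, knee_db=50.0, ratio=2.0):
    """Static compression curve for one band.

    Below the knee the band gets the full linear gain; above it, each extra
    dB of input adds only 1/ratio dB of output, so the gain shrinks.
    """
    if input_level_db <= knee_db:
        return linear_gain_db
    return linear_gain_db - (input_level_db - knee_db) * (1.0 - 1.0 / ratio)


# Hypothetical audiogram with a sloping high-frequency loss.
gains = prescribe_linear_gains({500: 20.0, 2000: 40.0, 4000: 60.0})
```

Here the 4 kHz band receives 30 dB of gain for soft sounds, and `wdrc_gain_db(70.0, gains[4000])` reduces that to 20 dB for a 70 dB input, so loud sounds are amplified less than soft ones.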
The embodiments of the invention first apply pre-emphasis filtering, then per-channel Gammatone filtering, then spectral-subtraction filtering, and integrate the speech signal after the spectral subtraction and subsequent processing. This achieves a better hearing-aid and enhancement effect.
As shown in Fig. 2, an embodiment of the invention further provides a speech enhancement device for an auditory prosthesis, arranged on the portable terminal side and comprising:
a sound pickup module 11, configured to obtain four-channel audio data from the auditory prosthesis;
an acoustic environment monitoring module 12, configured to extract acoustic environment features from the obtained audio data to determine the acoustic scene corresponding to the audio data;
a sound processing module 13, configured to perform per-channel sound compensation and speech enhancement on the obtained audio data according to the acoustic scene;
an output module 14, configured to output two channels of enhanced audio data.
The acoustic environment monitoring module 12 extracting acoustic environment features from the obtained audio data to determine the acoustic scene corresponding to the audio data comprises:
extracting the acoustic environment features of the audio data;
matching the extracted acoustic environment features against preset acoustic environments to determine the environment mode the user is in.
The sound processing module 13 includes:
Preprocessing unit 131, configured to perform preprocessing and per-channel filtering on the audio data;
Subband decomposition unit 132, configured to decompose the channel-filtered audio data into subbands, and to perform spectrum analysis on the subbands of each audio data to obtain the signal-to-noise ratio of the subbands of the audio data;
Sound source localization unit 133, configured to gate the sound sources corresponding to the audio data according to the determined environment mode, and to calculate the angle of the sound source position;
Noise suppression and feedback cancellation unit 134, configured to perform noise reduction and howling elimination on each subband of the audio data according to the determined angle of the sound source position and the signal-to-noise ratio of the subbands;
Compression and amplification unit 135, configured to perform dynamic compression and sound intensity amplification on each subband of the noise-reduced audio data;
Sound compensation unit 136, configured to perform time-frequency conversion on the frequency-domain signal corresponding to each subband of the compressed and amplified audio data, and to perform phase compensation;
Sound synthesis unit 137, configured to merge the subbands of the audio data into a time-domain speech signal.
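Of the units listed above, the dynamic compression and amplification step lends itself to a short sketch. The text gives no compression curve, so the static curve below (constant gain up to a threshold, then a fixed compression ratio) and all parameter names are assumptions, not the method of the embodiment.

```python
def compressed_gain_db(level_db, threshold_db, ratio, makeup_gain_db):
    """Gain in dB to apply to one subband whose input level is `level_db`.

    Below the threshold the subband receives the full makeup gain.
    Above it, the output level rises by only 1/ratio dB per input dB,
    so the applied gain shrinks as the input gets louder.
    """
    if level_db <= threshold_db:
        return makeup_gain_db
    excess_db = level_db - threshold_db
    return makeup_gain_db - excess_db * (1.0 - 1.0 / ratio)
```

With a 50 dB threshold, a 2:1 ratio and 20 dB of makeup gain, a 40 dB input receives the full 20 dB, while a 70 dB input receives 10 dB.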
The preprocessing module preprocessing the audio data includes:
Performing first-order high-pass filtering on the components of the audio data whose frequency is greater than a preset value.
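A first-order high-pass filter of the kind mentioned above can be sketched as follows. The discrete RC form, the function name and the cutoff and sample-rate arguments are assumptions. The text does not give the filter coefficients or the preset frequency.

```python
import math


def first_order_highpass(samples, cutoff_hz, sample_rate_hz):
    """Discrete RC high-pass: y[n] = a * (y[n-1] + x[n] - x[n-1])."""
    if not samples:
        return []
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)  # analog time constant
    dt = 1.0 / sample_rate_hz               # sampling interval
    a = rc / (rc + dt)                      # smoothing coefficient, 0 < a < 1
    out = [samples[0]]
    for n in range(1, len(samples)):
        out.append(a * (out[-1] + samples[n] - samples[n - 1]))
    return out
```

A constant (DC) input decays toward zero at the output, while a rapidly alternating input passes almost unchanged.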
The acoustic environment monitoring module is further configured to:
Obtain at least one of the following parameters of the environment mode:
Modulation amplitude parameter, directionality control parameter, compression amplification ratio parameter and noise reduction parameter.
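One way to picture the four parameters is as a small per-mode lookup table. The sketch below is only an illustration: the mode names, field types and numeric values are hypothetical, and the text does not specify how the parameters are stored or what values they take.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModeParameters:
    modulation_amplitude: float  # threshold on envelope modulation depth
    directionality: str          # hypothetical values: "omni" or "front"
    compression_ratio: float     # compression amplification ratio
    noise_reduction_db: float    # maximum noise attenuation in dB


# Hypothetical table mapping an environment mode to its parameters.
MODE_TABLE = {
    "quiet": ModeParameters(0.3, "omni", 1.5, 6.0),
    "speech_in_noise": ModeParameters(0.5, "front", 2.0, 12.0),
}


def parameters_for(mode):
    """Look up the parameter set of the given environment mode."""
    return MODE_TABLE[mode]
```

Downstream units would then read only the fields they need, for example `parameters_for("speech_in_noise").directionality`.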
The sound source localization unit 133 gating the sound sources corresponding to the audio data according to the determined environment mode and calculating the angle of the sound source position includes:
Gating the sound sources in all directions of the hearing aid device according to the directionality control parameter;
Calculating the angle of the sound source position.
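The text does not say how the angle is computed. A common approach, shown here purely as an assumed example, estimates the time delay between two microphones by cross-correlation and converts it to an angle under a far-field model. The microphone spacing, the 45 degree gate width and the "omni"/"front" values are hypothetical.

```python
import math

SPEED_OF_SOUND_M_S = 343.0


def best_lag(a, b, max_lag):
    """Lag d (in samples) maximizing sum(a[n] * b[n + d]).

    A positive result means signal b is delayed relative to signal a.
    """
    best, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for n in range(len(a)):
            m = n + lag
            if 0 <= m < len(b):
                score += a[n] * b[m]
        if score > best_score:
            best, best_score = lag, score
    return best


def source_angle_deg(a, b, mic_distance_m, sample_rate_hz):
    """Far-field angle estimate from delay = d * sin(theta) / c."""
    max_lag = int(math.ceil(mic_distance_m * sample_rate_hz / SPEED_OF_SOUND_M_S))
    lag = best_lag(a, b, max_lag)
    s = lag * SPEED_OF_SOUND_M_S / (sample_rate_hz * mic_distance_m)
    s = max(-1.0, min(1.0, s))  # clamp against rounding before asin
    return math.degrees(math.asin(s))


def passes_gate(angle_deg, directionality):
    """Hypothetical gate: "omni" passes all directions, "front" keeps |angle| <= 45."""
    if directionality == "omni":
        return True
    return abs(angle_deg) <= 45.0
```

The brute-force correlation is quadratic in the search range, which is acceptable for the few samples of delay possible between closely spaced microphones.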
The noise suppression and feedback cancellation unit performing noise reduction on each subband of the audio data includes:
Identifying, according to the modulation amplitude parameter, whether the audio data is noise, based on the envelope modulation characteristics of the audio data and the spectrum analysis result;
Suppressing the noise according to the determined signal-to-noise ratio and the noise reduction parameter.
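The two steps above can be sketched under stated assumptions. The modulation-depth measure, the threshold comparison and the Wiener-style gain with an attenuation floor are common textbook choices picked for illustration. The text does not specify the actual decision rule or gain function.

```python
def modulation_depth(envelope):
    """(max - min) / (max + min) of a non-negative subband envelope.

    Speech envelopes tend to be deeply modulated; steady noise is not.
    """
    hi, lo = max(envelope), min(envelope)
    return 0.0 if hi + lo == 0 else (hi - lo) / (hi + lo)


def is_noise(envelope, modulation_threshold):
    """Treat a weakly modulated subband as noise (assumed decision rule)."""
    return modulation_depth(envelope) < modulation_threshold


def suppression_gain(snr_db, max_attenuation_db):
    """Wiener-style linear gain snr / (1 + snr), floored at the maximum attenuation."""
    snr = 10.0 ** (snr_db / 10.0)
    floor = 10.0 ** (-max_attenuation_db / 20.0)
    return max(snr / (1.0 + snr), floor)
```

Here `modulation_threshold` stands in for the modulation amplitude parameter and `max_attenuation_db` for the noise reduction parameter. The floor limits how strongly a low-SNR subband is attenuated.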
The sound compensation unit performing time-frequency conversion on the frequency-domain signal corresponding to each subband of the compressed and amplified audio data, and performing phase compensation, includes:
Performing time-frequency conversion on the frequency-domain signal corresponding to each subband of the compressed and amplified audio data;
Performing phase compensation of a corresponding degree according to the compression amplification ratio coefficient.
An embodiment of the present invention also provides a speech enhancement device, comprising a memory and a processor;
The memory is configured to store executable instructions;
The processor is configured to execute the executable instructions stored in the memory to perform the following operations:
Acquiring four-channel audio data of the hearing aid device;
Extracting acoustic environment features from the acquired audio data to obtain the acoustic scene corresponding to the audio data;
Performing per-channel sound compensation and speech enhancement on the acquired audio data according to the acoustic scene;
Outputting two-channel enhanced audio data.
An embodiment of the present invention also provides a computer-readable storage medium storing computer-executable instructions; when a processor executes the computer-executable instructions, the following operations are performed:
Acquiring four-channel audio data of the hearing aid device;
Extracting acoustic environment features from the acquired audio data to obtain the acoustic scene corresponding to the audio data;
Performing per-channel sound compensation and speech enhancement on the acquired audio data according to the acoustic scene;
Outputting two-channel enhanced audio data.
Those of ordinary skill in the art will appreciate that all or some of the steps of the above method can be completed by a program instructing related hardware (such as a processor), and that the program can be stored in a computer-readable storage medium such as a read-only memory, a magnetic disk or an optical disc. Optionally, all or some of the steps of the above embodiments can also be implemented using one or more integrated circuits. Correspondingly, each module/unit in the above embodiments can be implemented in hardware, for example by an integrated circuit that realizes its function, or as a software function module, for example by a processor executing programs/instructions stored in a memory to realize its function. The embodiments of the present invention are not limited to any particular combination of hardware and software.
Although the embodiments disclosed herein are as described above, they are adopted only to facilitate understanding of the present invention and are not intended to limit it. Any person skilled in the art to which the present invention belongs may make modifications and changes in the form and details of implementation without departing from the spirit and scope disclosed by the present invention, but the scope of patent protection of the present invention shall still be subject to the scope defined by the appended claims.

Claims (18)

1. A speech enhancement method for a hearing aid device, characterized by comprising:
Acquiring four-channel audio data of the hearing aid device;
Extracting acoustic environment features from the acquired audio data to obtain the acoustic scene corresponding to the audio data;
Performing per-channel sound compensation and speech enhancement on the acquired audio data according to the acoustic scene;
Outputting two-channel enhanced audio data.
2. The speech enhancement method according to claim 1, characterized in that extracting acoustic environment features from the acquired audio data to obtain the acoustic scene corresponding to the audio data comprises:
Extracting the acoustic environment features of the audio data;
Matching the extracted acoustic environment features against preset sound environments to determine the environment mode in which the user is located.
3. The speech enhancement method according to claim 2, characterized in that performing per-channel sound compensation and speech enhancement on the acquired audio data according to the acoustic scene comprises:
Performing preprocessing and per-channel filtering on the audio data;
Decomposing the channel-filtered audio data into subbands, and performing spectrum analysis on the subbands of each audio data to obtain the signal-to-noise ratio of the subbands of the audio data;
Gating the sound sources corresponding to the audio data according to the determined environment mode, and calculating the angle of the sound source position;
Performing noise reduction and howling elimination on each subband of the audio data according to the determined angle of the sound source position and the signal-to-noise ratio of the subbands;
Performing dynamic compression and sound intensity amplification on each subband of the noise-reduced audio data;
Performing time-frequency conversion on the frequency-domain signal corresponding to each subband of the compressed and amplified audio data, and performing phase compensation;
Merging the subbands of the audio data into a time-domain speech signal.
4. The speech enhancement method according to claim 3, characterized in that preprocessing the audio data comprises:
Performing first-order high-pass filtering on the components of the audio data whose frequency is greater than a preset value.
5. The speech enhancement method according to claim 3, characterized in that, after extracting acoustic environment features from the acquired audio data to obtain the acoustic scene corresponding to the audio data, the method further comprises:
Obtaining at least one of the following parameters of the environment mode:
Modulation amplitude parameter, directionality control parameter, compression amplification ratio parameter and noise reduction parameter.
6. The speech enhancement method according to claim 5, characterized in that gating the sound sources corresponding to the audio data according to the determined environment mode and calculating the angle of the sound source position comprises:
Gating the sound sources in all directions of the hearing aid device according to the directionality control parameter;
Calculating the angle of the sound source position.
7. The speech enhancement method according to claim 5, characterized in that performing noise reduction on each subband of the audio data comprises:
Identifying, according to the modulation amplitude parameter, whether the audio data is noise, based on the envelope modulation characteristics of the audio data and the spectrum analysis result;
Suppressing the noise according to the determined signal-to-noise ratio and the noise reduction parameter.
8. The speech enhancement method according to claim 5, characterized in that performing time-frequency conversion on the frequency-domain signal corresponding to each subband of the compressed and amplified audio data, and performing phase compensation, comprises:
Performing time-frequency conversion on the frequency-domain signal corresponding to each subband of the compressed and amplified audio data;
Performing phase compensation of a corresponding degree according to the compression amplification ratio coefficient.
9. A speech enhancement device for a hearing aid device, characterized by comprising:
Sound pickup module, configured to acquire four-channel audio data of the hearing aid device;
Acoustic environment monitoring module, configured to extract acoustic environment features from the acquired audio data and obtain the acoustic scene corresponding to the audio data;
Sound processing module, configured to perform per-channel sound compensation and speech enhancement on the acquired audio data according to the acoustic scene;
Output module, configured to output two-channel enhanced audio data.
10. The speech enhancement device according to claim 9, characterized in that the acoustic environment monitoring module extracting acoustic environment features from the acquired audio data and obtaining the acoustic scene corresponding to the audio data comprises:
Extracting the acoustic environment features of the audio data;
Matching the extracted acoustic environment features against preset sound environments to determine the environment mode in which the user is located.
11. The speech enhancement device according to claim 10, characterized in that the sound processing module comprises:
Preprocessing unit, configured to perform preprocessing and per-channel filtering on the audio data;
Subband decomposition unit, configured to decompose the channel-filtered audio data into subbands, and to perform spectrum analysis on the subbands of each audio data to obtain the signal-to-noise ratio of the subbands of the audio data;
Sound source localization unit, configured to gate the sound sources corresponding to the audio data according to the determined environment mode, and to calculate the angle of the sound source position;
Noise suppression and feedback cancellation unit, configured to perform noise reduction and howling elimination on each subband of the audio data according to the determined angle of the sound source position and the signal-to-noise ratio of the subbands;
Compression and amplification unit, configured to perform dynamic compression and sound intensity amplification on each subband of the noise-reduced audio data;
Sound compensation unit, configured to perform time-frequency conversion on the frequency-domain signal corresponding to each subband of the compressed and amplified audio data, and to perform phase compensation;
Sound synthesis unit, configured to merge the subbands of the audio data into a time-domain speech signal.
12. The speech enhancement device according to claim 11, characterized in that the preprocessing module preprocessing the audio data comprises:
Performing first-order high-pass filtering on the components of the audio data whose frequency is greater than a preset value.
13. The speech enhancement device according to claim 11, characterized in that the acoustic environment monitoring module is further configured to:
Obtain at least one of the following parameters of the environment mode:
Modulation amplitude parameter, directionality control parameter, compression amplification ratio parameter and noise reduction parameter.
14. The speech enhancement device according to claim 13, characterized in that the sound source localization unit gating the sound sources corresponding to the audio data according to the determined environment mode and calculating the angle of the sound source position comprises:
Gating the sound sources in all directions of the hearing aid device according to the directionality control parameter;
Calculating the angle of the sound source position.
15. The speech enhancement device according to claim 13, characterized in that the noise suppression and feedback cancellation unit performing noise reduction on each subband of the audio data comprises:
Identifying, according to the modulation amplitude parameter, whether the audio data is noise, based on the envelope modulation characteristics of the audio data and the spectrum analysis result;
Suppressing the noise according to the determined signal-to-noise ratio and the noise reduction parameter.
16. The speech enhancement device according to claim 13, characterized in that the sound compensation unit performing time-frequency conversion on the frequency-domain signal corresponding to each subband of the compressed and amplified audio data, and performing phase compensation, comprises:
Performing time-frequency conversion on the frequency-domain signal corresponding to each subband of the compressed and amplified audio data;
Performing phase compensation of a corresponding degree according to the compression amplification ratio coefficient.
17. A speech enhancement device, characterized by comprising a memory and a processor;
The memory is configured to store executable instructions;
The processor is configured to execute the executable instructions stored in the memory to perform the following operations:
Acquiring four-channel audio data of the hearing aid device;
Extracting acoustic environment features from the acquired audio data to obtain the acoustic scene corresponding to the audio data;
Performing per-channel sound compensation and speech enhancement on the acquired audio data according to the acoustic scene;
Outputting two-channel enhanced audio data.
18. A computer-readable storage medium, characterized in that the computer-readable storage medium stores computer-executable instructions, and when the computer-executable instructions are executed, the following operations are performed:
Acquiring four-channel audio data of the hearing aid device;
Extracting acoustic environment features from the acquired audio data to obtain the acoustic scene corresponding to the audio data;
Performing per-channel sound compensation and speech enhancement on the acquired audio data according to the acoustic scene;
Outputting two-channel enhanced audio data.
CN201710817728.7A 2017-09-12 2017-09-12 Voice enhancement method and device of hearing aid device Active CN109493877B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710817728.7A CN109493877B (en) 2017-09-12 2017-09-12 Voice enhancement method and device of hearing aid device


Publications (2)

Publication Number Publication Date
CN109493877A true CN109493877A (en) 2019-03-19
CN109493877B CN109493877B (en) 2022-01-28

Family

ID=65688095

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710817728.7A Active CN109493877B (en) 2017-09-12 2017-09-12 Voice enhancement method and device of hearing aid device

Country Status (1)

Country Link
CN (1) CN109493877B (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110022514A (en) * 2019-05-17 2019-07-16 深圳市湾区通信技术有限公司 Noise-reduction method, device, system and the computer storage medium of audio signal
CN110728970A (en) * 2019-09-29 2020-01-24 华声设计研究院(深圳)有限公司 Method and device for digital auxiliary sound insulation treatment
CN112492495A (en) * 2019-09-11 2021-03-12 西万拓私人有限公司 Method for operating a hearing device and hearing device
CN112562265A (en) * 2020-12-23 2021-03-26 江苏集萃智能集成电路设计技术研究所有限公司 Intelligent monitoring system and monitoring method based on hearing aid
CN112954569A (en) * 2021-02-20 2021-06-11 深圳市智听科技有限公司 Multi-core hearing aid chip, hearing aid method and hearing aid
WO2021189946A1 (en) * 2020-03-24 2021-09-30 青岛罗博智慧教育技术有限公司 Speech enhancement system and method, and handwriting board
CN113825082A (en) * 2021-09-19 2021-12-21 武汉左点科技有限公司 Method and device for relieving hearing aid delay
WO2022017424A1 (en) * 2020-07-24 2022-01-27 华为技术有限公司 Active noise control method and apparatus, and audio playback device
CN115314824A (en) * 2022-10-12 2022-11-08 深圳市婕妤达电子有限公司 Signal processing method and device for hearing aid, electronic equipment and storage medium
CN113949955B (en) * 2020-07-16 2024-04-09 Oppo广东移动通信有限公司 Noise reduction processing method and device, electronic equipment, earphone and storage medium

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1333994A (en) * 1998-11-16 2002-01-30 伊利诺伊大学评议会 Binaural signal processing techniques
US20060206320A1 (en) * 2005-03-14 2006-09-14 Li Qi P Apparatus and method for noise reduction and speech enhancement with microphones and loudspeakers
CN1967659A (en) * 2005-11-14 2007-05-23 北京大学科技开发部 Speech enhancement method applied to deaf-aid
CN101447190A (en) * 2008-06-25 2009-06-03 北京大学深圳研究生院 Voice enhancement method employing combination of nesting-subarray-based post filtering and spectrum-subtraction
US20130195302A1 (en) * 2010-12-08 2013-08-01 Widex A/S Hearing aid and a method of enhancing speech reproduction
CN103686575A (en) * 2013-11-28 2014-03-26 清华大学 Hearing aid
CN104038880A (en) * 2014-06-26 2014-09-10 南京工程学院 Method for enhancing voice of double-ear hearing-aid device
CN105741849A (en) * 2016-03-06 2016-07-06 北京工业大学 Voice enhancement method for fusing phase estimation and human ear hearing characteristics in digital hearing aid


Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110022514A (en) * 2019-05-17 2019-07-16 深圳市湾区通信技术有限公司 Noise-reduction method, device, system and the computer storage medium of audio signal
CN112492495A (en) * 2019-09-11 2021-03-12 西万拓私人有限公司 Method for operating a hearing device and hearing device
US11388514B2 (en) 2019-09-11 2022-07-12 Sivantos Pte. Ltd. Method for operating a hearing device, and hearing device
CN110728970B (en) * 2019-09-29 2022-02-25 东莞市中光通信科技有限公司 Method and device for digital auxiliary sound insulation treatment
CN110728970A (en) * 2019-09-29 2020-01-24 华声设计研究院(深圳)有限公司 Method and device for digital auxiliary sound insulation treatment
WO2021189946A1 (en) * 2020-03-24 2021-09-30 青岛罗博智慧教育技术有限公司 Speech enhancement system and method, and handwriting board
CN113949955B (en) * 2020-07-16 2024-04-09 Oppo广东移动通信有限公司 Noise reduction processing method and device, electronic equipment, earphone and storage medium
WO2022017424A1 (en) * 2020-07-24 2022-01-27 华为技术有限公司 Active noise control method and apparatus, and audio playback device
CN112562265A (en) * 2020-12-23 2021-03-26 江苏集萃智能集成电路设计技术研究所有限公司 Intelligent monitoring system and monitoring method based on hearing aid
CN112954569B (en) * 2021-02-20 2022-10-25 深圳市智听科技有限公司 Multi-core hearing aid chip, hearing aid method and hearing aid
CN112954569A (en) * 2021-02-20 2021-06-11 深圳市智听科技有限公司 Multi-core hearing aid chip, hearing aid method and hearing aid
CN113825082A (en) * 2021-09-19 2021-12-21 武汉左点科技有限公司 Method and device for relieving hearing aid delay
CN115314824A (en) * 2022-10-12 2022-11-08 深圳市婕妤达电子有限公司 Signal processing method and device for hearing aid, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN109493877B (en) 2022-01-28

Similar Documents

Publication Publication Date Title
CN109493877A (en) A kind of sound enhancement method and device of auditory prosthesis
US11245993B2 (en) Hearing device comprising a noise reduction system
US11812223B2 (en) Electronic device using a compound metric for sound enhancement
CN109121057B (en) Intelligent hearing aid method and system
CN107479030B (en) Frequency division and improved generalized cross-correlation based binaural time delay estimation method
US20230352038A1 (en) Voice activation detecting method of earphones, earphones and storage medium
US20060206320A1 (en) Apparatus and method for noise reduction and speech enhancement with microphones and loudspeakers
US8504360B2 (en) Automatic sound recognition based on binary time frequency units
US10154353B2 (en) Monaural speech intelligibility predictor unit, a hearing aid and a binaural hearing system
CN109195042B (en) Low-power-consumption efficient noise reduction earphone and noise reduction system
CN108235181B (en) Method for noise reduction in an audio processing apparatus
CN110708625A (en) Intelligent terminal-based environment sound suppression and enhancement adjustable earphone system and method
EP3340657B1 (en) A hearing device comprising a dynamic compressive amplification system and a method of operating a hearing device
US11689869B2 (en) Hearing device configured to utilize non-audio information to process audio signals
CN112367600A (en) Voice processing method and hearing aid system based on mobile terminal
US20220124444A1 (en) Hearing device comprising a noise reduction system
WO2013067145A1 (en) Systems and methods for enhancing place-of-articulation features in frequency-lowered speech
WO2022256577A1 (en) A method of speech enhancement and a mobile computing device implementing the method
CN112367599B (en) Hearing aid system with cloud background support
CN213462323U (en) Hearing aid system based on mobile terminal
CN115314823A (en) Hearing aid method, system and equipment based on digital sounding chip
CN117392994B (en) Audio signal processing method, device, equipment and storage medium
WO2021239254A1 (en) A own voice detector of a hearing device
CN116405822A (en) Bass enhancement system and method applied to open Bluetooth headset
CN117097371A (en) Bluetooth hearing assistance system capable of being autonomously tested, matched and designed and implementation method thereof

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant