WO2022227982A1 - Tws earphone and playing method and device of tws earphone - Google Patents

Tws earphone and playing method and device of tws earphone Download PDF

Info

Publication number
WO2022227982A1
WO2022227982A1 PCT/CN2022/083464 CN2022083464W WO2022227982A1 WO 2022227982 A1 WO2022227982 A1 WO 2022227982A1 CN 2022083464 W CN2022083464 W CN 2022083464W WO 2022227982 A1 WO2022227982 A1 WO 2022227982A1
Authority
WO
WIPO (PCT)
Prior art keywords
signal
filter
speaker
frequency bands
ear canal
Prior art date
Application number
PCT/CN2022/083464
Other languages
French (fr)
Chinese (zh)
Inventor
熊伟
仇存收
恽毅
许超
郭琴
李岩
田立生
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Priority to EP22794455.0A priority Critical patent/EP4297428A1/en
Publication of WO2022227982A1 publication Critical patent/WO2022227982A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/10Earpieces; Attachments therefor ; Earphones; Monophonic headphones
    • H04R1/1083Reduction of ambient noise
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K11/00Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/16Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/175Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound
    • G10K11/178Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase
    • G10K11/1785Methods, e.g. algorithms; Devices
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K11/00Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/16Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/175Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound
    • G10K11/178Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase
    • G10K11/1787General system configurations
    • G10K11/17879General system configurations using both a reference signal and an error signal
    • G10K11/17881General system configurations using both a reference signal and an error signal the reference signal being an acoustic signal, e.g. recorded with a microphone
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K11/00Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/16Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/175Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound
    • G10K11/178Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase
    • G10K11/1787General system configurations
    • G10K11/17885General system configurations additionally using a desired external signal, e.g. pass-through audio such as music or speech
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R23/00Transducers other than those covered by groups H04R9/00 - H04R21/00
    • H04R23/02Transducers using more than one principle simultaneously
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/12Circuits for transducers, loudspeakers or microphones for distributing signals to two or more loudspeakers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/12Circuits for transducers, loudspeakers or microphones for distributing signals to two or more loudspeakers
    • H04R3/14Cross-over networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K2210/00Details of active noise control [ANC] covered by G10K11/178 but not provided for in any of its subgroups
    • G10K2210/10Applications
    • G10K2210/108Communication systems, e.g. where useful sound is kept and noise is cancelled
    • G10K2210/1081Earphones, e.g. for telephones, ear protectors or headsets
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/10Earpieces; Attachments therefor ; Earphones; Monophonic headphones
    • H04R1/1016Earpieces of the intra-aural type
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/20Arrangements for obtaining desired frequency or directional characteristics
    • H04R1/22Arrangements for obtaining desired frequency or directional characteristics for obtaining desired frequency characteristic only 
    • H04R1/24Structural combinations of separate transducers or of two parts of the same transducer and responsive respectively to two or more frequency ranges
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2430/00Signal processing covered by H04R, not provided for in its groups
    • H04R2430/03Synergistic effects of band splitting and sub-band processing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2460/00Details of hearing devices, i.e. of ear- or headphones covered by H04R1/10 or H04R5/033 but not provided for in any of their subgroups, or of hearing aids covered by H04R25/00 but not provided for in any of its subgroups
    • H04R2460/01Hearing devices using active noise cancellation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R5/00Stereophonic arrangements
    • H04R5/033Headphones for stereophonic communication

Definitions

  • the present application relates to audio processing technology, and in particular, to a true wireless stereo (true wireless stereo, TWS) headset and a TWS headset playback method and device.
  • true wireless stereo true wireless stereo, TWS
  • the general demand is to achieve stable active noise cancellation (active noise cancellation or active noise control, ANC) or transparent transmission (hear through, HT) function while having high-quality music or broadband HD calls .
  • TWS earphones implement the ANC function, they will also eliminate some music or call sounds, which will affect the sound quality and call clarity.
  • the present application provides a TWS earphone and a TWS earphone playing method and device, which can not only reflect high sound quality in various frequency bands of an audio source, but also support ultra-bandwidth voice calls.
  • the present application provides a TWS earphone, comprising: an audio signal processing path, a frequency divider and at least two speakers; wherein the output end of the audio signal processing path is connected to the input end of the frequency divider; the frequency divider The output end of the frequency converter is connected to the at least two speakers; the audio signal processing path is configured to output a speaker drive signal after noise reduction or transparent transmission processing is performed on the audio source; the audio source is original music or call voice; or , the audio source includes the voice signal enhanced by the human voice and the original music or call voice; the frequency divider is configured to divide the speaker drive signal into sub-audio signals of at least two frequency bands, the at least two frequency bands The frequency bands correspond to the main working frequency bands of the at least two speakers; the adjacent frequency bands in the at least two frequency bands partially overlap, or the adjacent frequency bands in the at least two frequency bands do not overlap; the at least two speakers are configured with To play the corresponding sub audio signal.
  • the frequency band of the processed speaker driving signal corresponds to the frequency band of the audio source, and may include the whole frequency range of low, medium and high. However, because the main working frequency band of a single speaker may only cover part of the low, middle and high frequency bands, a single speaker cannot operate in the entire frequency band. reflect high-quality sound.
  • the present application controls the frequency divider to divide the frequency of the speaker drive signal in a preset manner by setting the parameters of the frequency divider.
  • the frequency divider may be configured to divide the frequency of the speaker drive signal based on the main working frequency bands of the at least two speakers to obtain sub-audio signals of at least two frequency bands corresponding to the main working frequency bands of the at least two speakers, and then let each speaker Play the sub audio signals of the corresponding frequency bands respectively, so that the speaker maintains the best frequency response when playing the sub audio signals transmitted to it.
  • At least two speakers are set in the TWS earphone of the present application, and the main working frequency bands of the at least two speakers are not identical.
  • the frequency divider can divide the speaker drive signal into sub-audio signals of at least two frequency bands, and the adjacent frequency bands in the at least two frequency bands partially overlap or do not overlap, so that each sub-audio signal is respectively transmitted to the speakers with matching frequency bands,
  • the aforementioned frequency band matching can mean that the main working frequency band of the speaker covers the frequency band of the sub-audio signal transmitted to it, so that the speaker maintains the best frequency response when playing the sub-audio signal transmitted to it. It reflects high sound quality and supports ultra-bandwidth voice calls.
  • the audio signal processing path includes: a secondary path SP filter configured to prevent the noise reduction or pass-through processing when the noise reduction or pass-through processing is concurrent with the audio source Cancellation of sound from this audio source.
  • the audio signal processing path further includes: a feedback FB microphone and a feedback filter; wherein the FB microphone is configured to pick up an ear canal signal, and the ear canal signal includes residual noise inside the ear canal signal and the music or the voice of the conversation; the SP filter is configured to input the audio source, process the audio source, and transmit the output signal and the ear canal signal to the feedback filter after superimposing; the feedback filter, is configured to generate a signal for the noise reduction or pass-through processing, the noise reduction or pass-through processed signal being one of the superimposed signals for generating the speaker drive signal.
  • the feedback FB microphone, the feedback filter and the SP filter are set in the codec CODEC.
  • the SP filter is configured to input the audio source, process the audio source, and the output signal is one of the superimposed signals of the speaker driving signal.
  • the SP filter is set in a digital signal processing DSP chip.
  • the SP filter (including fixed SP filter or adaptive SP filter) is determined by the above method, which can not only realize the function of noise reduction or transparent transmission, but also prevent the transparent transmission or noise reduction technology in the process of playing music or passing through. Also remove music or call sounds.
  • it further includes: a first digital-to-analog converter DAC; the input end of the first DAC is connected to the output end of the audio signal processing path, and the output end of the first DAC is connected to the frequency divider
  • the first DAC is configured to convert the speaker driving signal from a digital form to an analog form; correspondingly, the frequency divider is an analog frequency divider.
  • the speaker drive signal After the speaker drive signal is obtained, if the signal does not pass through the DAC, it is in digital form, but the signal played by the speaker needs to be in analog form, so the digital speaker drive signal can be converted into analog form through the first DAC
  • the speaker driving signal in the analog form is then divided into sub audio signals of at least two frequency bands through an analog frequency divider.
  • This embodiment adopts the structure of first conversion and then frequency division.
  • the method further includes: at least two second DACs; the input ends of the at least two second DACs are connected to the output ends of the frequency divider, and the output ends of the at least two second DACs are connected to each other. are respectively connected with one of the at least two speakers; the second DAC is configured to convert one of the sub-audio signals of the at least two frequency bands from a digital form to an analog form; correspondingly, the frequency divider is a digital divider.
  • This embodiment adopts the structure of frequency division first and then conversion.
  • the main working frequency bands of the at least two speakers are not identical.
  • the at least two speakers include a moving coil speaker and a moving iron speaker.
  • Two speakers are set in the TWS earphone of this embodiment, namely a moving coil speaker and a moving iron speaker.
  • the main working frequency band of the moving coil speaker is below 8.5kHz, and the main working frequency band of the moving iron speaker is above 8.5kHz.
  • the frequency divider divides the analog speaker drive signal into a sub audio signal below 8.5kHz and a sub audio signal above 8.5kHz.
  • the moving coil speaker can maintain the best frequency response by playing the sub audio signal below 8.5kHz.
  • the sub-audio signal above 8.5kHz can maintain the best frequency response, so that the TWS headset can not only reflect the high sound quality in each frequency band of the audio source, but also support ultra-bandwidth voice calls.
  • the at least two speakers include a moving coil speaker, a moving iron speaker, a MEMS speaker and a planar vibration diaphragm.
  • the main working frequency band of MEMS speakers depends on the application form.
  • the main working frequency band of in-ear headphones is the full frequency band
  • the main working frequency band of headphone is weaker below 7kHz
  • the main working frequency band is high frequency above 7kHz.
  • the main working frequency band of the flat vibrating diaphragm is 10kHz ⁇ 20kHz, in this way, the frequency divider can be set to divide the analog speaker driving signal into four sub-frequency bands, and the sub-audio signal below 8.5kHz can be played by the moving coil speaker to maintain the best frequency response.
  • the moving iron speaker can maintain the best frequency response when playing sub-audio signals above 8.5kHz
  • the MEMS speaker can maintain the best frequency response when playing sub-audio signals above 7kHz
  • the flat vibrating diaphragm can maintain the best frequency response when playing sub-audio signals above 10kHz frequency response, so that the TWS headset can not only reflect high sound quality in various frequency bands of the audio source, but also support ultra-bandwidth voice calls.
  • the present application provides a method for playing a TWS headset, which is applied to the TWS headset according to any one of the first aspects above; the method includes: acquiring an audio source, where the audio source is original music or call voice, or, the audio source includes a voice signal enhanced by human voice and the original music or call voice; noise reduction or transparent transmission processing is performed on the audio source to obtain a speaker drive signal; The driving signal is divided into sub-audio signals of at least two frequency bands, and the adjacent frequency bands in the at least two frequency bands partially overlap, or the adjacent frequency bands in the at least two frequency bands do not overlap; Playing one of the sub audio signals of the at least two frequency bands.
  • performing noise reduction or transparent transmission processing on the audio source to obtain the speaker drive signal includes: obtaining a fixed secondary path SP filter through a codec CODEC; filtering according to the fixed SP The device processes the audio source to obtain a filtered signal; performs noise reduction or transparent transmission processing on the filtered signal to obtain the speaker drive signal.
  • the obtaining the fixed secondary path SP filter through the codec CODEC includes: obtaining an estimated SP filter according to a preset speaker drive signal and the ear canal signal picked up by the feedback FB microphone , the ear canal signal includes the residual noise signal inside the ear canal; when the difference signal between the signal obtained by the estimated SP filter and the ear canal signal is within the set range, the estimated SP filter Determined as the fixed SP filter.
  • the method further includes: when the signal obtained by the estimated SP filter When the difference signal with the ear canal signal is within the set range, the parameters of the cascaded second-order filter are obtained according to the target frequency response of the estimated SP filter and the preset frequency division requirements; The parameters of the second-order filter are obtained from the SP cascaded second-order filter, and the SP cascaded second-order filter is used as the fixed SP filter.
  • performing noise reduction or transparent transmission processing on the audio source to obtain the speaker drive signal includes: obtaining an adaptive SP filter through a digital signal processing DSP chip; filtering according to the adaptive SP The device processes the audio source to obtain a filtered signal; performs noise reduction or transparent transmission processing on the filtered signal to obtain the speaker drive signal.
  • obtaining the adaptive SP filter through a digital signal processing DSP chip includes: obtaining a real-time noise signal; obtaining an estimated SP filter according to the audio source and the real-time noise signal; When the difference signal between the signal obtained by the estimated SP filter and the real-time noise signal is within a set range, the estimated SP filter is determined as the adaptive SP filter.
  • the acquiring a real-time noise signal includes: acquiring an external signal picked up by a feed-forward FF microphone and an ear canal signal picked up by a feedback FB microphone, where the external signal includes an external noise signal and the music Or call voice, the ear canal signal includes the residual noise signal inside the ear canal and the music or the call voice; obtain the voice signal picked up by the main microphone; subtract the external signal and the ear canal from the voice signal The signal obtains the real-time noise signal.
  • the main working frequency bands of the at least two speakers are not identical.
  • the at least two speakers include a moving coil speaker and a moving iron speaker.
  • the at least two speakers include a moving coil speaker, a moving iron speaker, a microelectromechanical system MEMS speaker, and a planar vibrating diaphragm.
  • the present application provides a playback device for a TWS headset, which is applied to the TWS headset in the first aspect;
  • the device includes: an acquisition module for acquiring an audio source, where the audio source is original music or call voice, or the audio source includes the voice signal enhanced by human voice and the original music or call voice;
  • the processing module is used to perform noise reduction or transparent transmission processing on the audio source to obtain a speaker drive signal;
  • a frequency dividing module configured to divide the speaker drive signal into sub-audio signals of at least two frequency bands, and adjacent frequency bands in the at least two frequency bands partially overlap, or, adjacent frequency bands in the at least two frequency bands The frequency bands do not overlap;
  • a playing module is configured to play one of the sub audio signals of the at least two frequency bands through at least two speakers respectively.
  • the processing module is specifically configured to obtain a fixed secondary path SP filter through a codec CODEC; process the audio source according to the fixed SP filter to obtain a filtered signal; The filtered signal is subjected to noise reduction or transparent transmission processing to obtain the speaker driving signal.
  • the processing module is specifically configured to obtain an estimated SP filter according to a preset speaker drive signal and an ear canal signal picked up by the feedback FB microphone, and the ear canal signal includes an internal ear canal signal.
  • the processing module is further configured to, when the difference signal between the signal obtained by the estimated SP filter and the ear canal signal is within a set range, according to the estimated SP
  • the target frequency response of the filter and the preset frequency division requirements obtain the parameters of the cascaded second-order filter; according to the parameters of the cascaded second-order filter, the SP cascaded second-order filter is obtained, and the SP cascaded two order filter as the fixed SP filter.
  • the processing module is specifically configured to obtain an adaptive SP filter through a digital signal processing DSP chip; process the audio source according to the adaptive SP filter to obtain a filtered signal; The filtered signal is subjected to noise reduction or transparent transmission processing to obtain the speaker driving signal.
  • the processing module is specifically configured to obtain a real-time noise signal; obtain an estimated SP filter according to the audio source and the real-time noise signal; when the signal obtained through the estimated SP filter When the difference signal with the real-time noise signal is within a set range, the estimated SP filter is determined as the adaptive SP filter.
  • the processing module is specifically configured to acquire an external signal picked up by a feedforward FF microphone and an ear canal signal picked up by a feedback FB microphone, where the external signal includes an external noise signal and the music or Call voice, the ear canal signal includes the residual noise signal inside the ear canal and the music or the call voice; obtain the voice signal picked up by the main microphone; subtract the external signal and the ear canal signal from the voice signal Obtain a signal difference; obtain the estimated SP filter according to the audio source and the signal difference.
  • the main working frequency bands of the at least two speakers are not identical.
  • the at least two speakers include a moving coil speaker and a moving iron speaker.
  • the at least two speakers include a moving coil speaker, a moving iron speaker, a microelectromechanical system MEMS speaker, and a planar vibrating diaphragm.
  • the present application provides a computer-readable storage medium, comprising a computer program, which, when executed on a computer, causes the computer to execute the method of any one of the above-mentioned second aspects.
  • the present application provides a computer program, when the computer program is executed by a computer, for performing the method of any one of the above-mentioned second aspects.
  • FIG. 1 is an exemplary structural schematic diagram of a related art TWS headset
  • FIG. 2a is an exemplary structural schematic diagram of a related art TWS headset
  • FIG. 2b is an exemplary structural schematic diagram of a related art TWS headset
  • FIG. 3 is a schematic structural diagram of an exemplary TWS headset of the present application.
  • FIG. 4 is a schematic structural diagram of an exemplary TWS headset of the present application.
  • Fig. 5a is an exemplary acquisition flow chart of the fixed SP filter of the present application
  • Fig. 5b is an exemplary acquisition flow chart of the fixed SP filter of the present application
  • FIG. 6 is a schematic structural diagram of an exemplary TWS headset of the present application.
  • FIG. 7a is an exemplary schematic diagram of the signal frequency division of the present application.
  • FIG. 7b is an exemplary schematic diagram of signal frequency division of the present application.
  • FIG. 7c is an exemplary schematic diagram of signal frequency division of the present application.
  • FIG. 7d is an exemplary schematic diagram of signal frequency division of the present application.
  • FIG. 8a is an exemplary structural schematic diagram of the TWS headset of the present application.
  • FIG. 8b is an exemplary structural schematic diagram of the TWS headset of the present application.
  • FIG. 8c is an exemplary structural schematic diagram of the TWS headset of the present application.
  • FIG. 8d is an exemplary structural schematic diagram of the TWS headset of the present application.
  • FIG. 8e is an exemplary structural schematic diagram of the TWS headset of the present application.
  • FIG. 9a is an exemplary structural schematic diagram of the TWS headset of the present application.
  • FIG. 9b is an exemplary structural schematic diagram of the TWS headset of the present application.
  • FIG. 9c is an exemplary structural schematic diagram of the TWS headset of the present application.
  • FIG. 9d is an exemplary structural schematic diagram of the TWS headset of the present application.
  • FIG. 9e is an exemplary structural schematic diagram of the TWS headset of the present application.
  • FIG. 10 is an exemplary flowchart of the playback method of the TWS headset of the present application.
  • FIG. 11 is an exemplary structural diagram of the playback device of the TWS earphone of the present application.
  • At least one (item) refers to one or more, and "a plurality” refers to two or more.
  • “And/or” is used to describe the relationship between related objects, indicating that there can be three kinds of relationships, for example, “A and/or B” can mean: only A, only B, and both A and B exist , where A and B can be singular or plural.
  • the character “/” generally indicates that the associated objects are an “or” relationship.
  • At least one item(s) below” or similar expressions thereof refer to any combination of these items, including any combination of single item(s) or plural items(s).
  • At least one (a) of a, b or c can mean: a, b, c, "a and b", “a and c", “b and c", or "a and b and c" ", where a, b, c can be single or multiple.
  • FIG. 1 is a schematic diagram of an exemplary structure of a related art TWS headset.
  • the TWS headset includes three types of microphones, namely a main microphone, a feedforward (FF) microphone and a feedback (FB) microphone, wherein the main microphone is used to pick up the human voice in the call, and the FF microphone It is used to pick up the external noise signal, and the FB microphone is used to pick up the residual noise signal inside the ear canal; the TWS headset also includes a moving coil speaker, which is used to play the processed music or call voice.
  • FF feedforward
  • FB feedback
  • FIG. 2 a is a schematic structural diagram of an exemplary TWS headset in the related art.
  • the main microphone and the FF microphone are respectively connected to the input end of the vocal enhancement filter, and the output end of the vocal enhancement filter and the audio source (including music and call voice) are superimposed by the stacker 1 to obtain a downlink link signal; the downlink signal is transmitted to one input of the stacker 2; the downlink signal is also transmitted to the input of the secondary path (SP) filter; the input of the FF microphone and the feedforward filter
  • SP secondary path
  • the output end of the feedforward filter is connected to the other input end of the stacker 2;
  • the FB microphone is connected to one input end of the stacker 3, the output end of the SP filter is connected to the other input end of the stacker 3,
  • the output end of the stacker 3 is connected to the input end of the feedback filter, and the output end of the feedback filter is connected to the third input end of the stacker 2; the output end of the stacker 2 is connected to the digital to analog
  • AHA joint controllers are respectively connected with vocal enhancement filter, feedforward filter, feedback filter and SP filter.
  • the role of the AHA joint controller In order to ensure the normal and stable operation of the TWS headset, some abnormal situations need to be handled by the TWS headset itself, so an AHA joint controller is required.
  • the AHA joint controller analyzes multiple signals to determine the current state of ANC, HT or augmented hearing (AH), and then determines whether abnormal conditions such as whistling and clipping occur, so as to implement corresponding processing.
  • the aforementioned filter parameter values are used to realize the control of the system.
  • the vocal enhancement filter, the audio source and the AHA joint controller are arranged in the digital signal processing (digital signal process, DSP) chip, and the feedforward filter, the feedback filter, the SP filter and the DAC are arranged in the codec. in the encoder (coder-decoder, CODEC).
  • DSP digital signal processing
  • CODEC encoder-decoder
  • FIG. 2 b is a schematic structural diagram of an exemplary TWS headset of the related art. As shown in Figure 2b, the difference from the structure shown in Figure 2a is that the vocal enhancement filter is moved from the DSP chip to the CODEC.
  • the TWS headset can implement the following functions:
  • the FF microphone picks up the external noise signal and generates the feedforward noise reduction signal through the feedforward filter; the FB microphone picks up the residual noise signal inside the ear canal, and generates the feedback noise reduction signal through the feedback filter.
  • the feedforward noise reduction signal, the feedback noise reduction signal and the downlink signal are superimposed to form the final speaker drive signal, and the analog speaker drive signal is generated after digital-to-analog conversion by the DAC.
  • the analog loudspeaker driving signal is played in a reverse manner in the moving coil loudspeaker, the analog audio signal can be obtained by canceling the audio signal in the space. In this way, low-frequency noise within a certain frequency band can be eliminated, thereby achieving the purpose of noise reduction.
  • the FF microphone picks up the external noise signal and generates the feedforward compensation signal through the feedforward filter; the FB microphone picks up the acoustic signal inside the ear canal and generates the feedback suppression signal through the feedback filter.
  • the feedforward compensation signal, the feedback suppression signal and the downlink signal are superimposed to form the final speaker drive signal, and the analog speaker drive signal is generated after digital-to-analog conversion by the DAC.
  • the analog speaker driving signal is played in the moving coil speaker, which can weaken or suppress low-frequency noise and compensate high-frequency signals, so as to obtain an analog audio signal that realizes acoustic compensation.
  • the "occlusion" effect of active voice listening (the wearer's own voice heard by himself) or the "stethoscope” effect of body part vibration (such as walking, chewing, scratching your head, etc. with headphones) is weakened or suppressed, passive
  • the high-frequency sound within a certain frequency band of listening sounds (such as human voice or music in the environment) is compensated, so as to achieve the purpose of transparent transmission.
  • the superimposed signal may include the feedforward compensation signal, the feedback suppression signal and the downlink signal, or may include the feedforward compensation signal and the feedback suppression signal. and either or both of the downlink signal.
  • the superimposed signal may include the aforementioned three; when the user just wants to perform noise reduction through headphones so as to be in a quiet environment, the superimposed signal may include the feed compensation signal and feedback suppression signals, excluding downlink signals.
  • Vocal enhancement and music/call concurrency When the vocal enhancement function is implemented on the DSP side, the FF microphone and main microphone signals are sent to the vocal enhancement filter for processing to obtain a signal in which ambient noise is suppressed and the human voice is preserved. Then the signal is down-streamed to the CODEC, superimposed with the signals output by the feedforward filter and the feedback filter respectively to obtain the speaker drive signal, and after digital-to-analog conversion by the DAC, the analog speaker drive signal is output for the moving coil speaker to play.
  • the estimated signal played by the moving coil speaker can be subtracted from the signal picked up by the FB microphone, In this way, this part of the signal will not be noise-reduced by the feedback filter, thus realizing the concurrency of vocal enhancement and music/calling, and the ANC/HT function can also be realized on this basis.
  • the realization of vocal enhancement in DSP is a guarantee of computational power overhead.
  • the noise reduction effect and the playback effect after processing are stable, but the delay is long, and it sounds like a reverberation.
  • the human voice enhancement is implemented in the CODEC.
  • the algorithm is relatively fixed and simple, the time delay is short, and the reverberation sense is low, but the noise reduction effect is limited.
  • the structure of the TWS earphone shown in Figure 1 to Figure 2b for high-frequency music, especially for music signals in the frequency band above 8.5kHz, the playback sound quality may be seriously damaged, affecting the playback effect of music; for high-frequency speech, especially for 8.5kHz Voice signals in the above frequency bands are not supported, resulting in limited bandwidth for voice calls.
  • the present application provides a speaker structure of a TWS earphone, which can improve the above technical problems.
  • FIG. 3 is a schematic structural diagram of an exemplary TWS headset of the present application.
  • the TWS headset 30 includes three types of microphones, namely the main microphone 31 , the FF microphone 32 and the FB microphone 33 , wherein the main microphone 31 is used to pick up the human voice during the call, and the FF microphone 32 is used to pick up the external noise signal , the FB microphone 33 is used to pick up the residual noise signal inside the ear canal.
  • the TWS earphone 30 also includes a moving coil speaker 34a and a moving iron speaker 34b.
  • the main working frequency band of the moving coil speaker 34a is less than 8.5kHz, and the main working frequency band of the moving iron speaker 34b is greater than 8.5kHz. It should be noted that this application does not specifically limit the number of speakers, as long as there are at least two speakers, and the at least two speakers may have different main working frequency bands of each speaker, or may also have the main operating frequency bands of some speakers.
  • the working frequency band is the same, and the main working frequency band of another part of the speakers is different from the main working frequency band of the aforementioned part of the speakers.
  • the main working frequency band of speaker 1 is the same as that of speaker 2, and the main working frequency band of speaker 3 is different from that of speaker 1 and speaker 2.
  • the main working frequency bands of the speaker 1, the speaker 2 and the speaker 3 are all different.
  • the at least two speakers may also include a moving coil speaker, a moving iron speaker, a micro-electro-mechanical systems (MEMS) MEMS speaker and a planar vibrating diaphragm, wherein the main working frequency band of the moving coil speaker is Less than 8.5kHz, the main working frequency band of the moving iron speaker is greater than 8.5kHz, the main working frequency band of the MEMS speaker depends on the application form, the main working frequency band of the in-ear headphone is the full frequency band, and the main working frequency band of the headphone is below 7kHz. Weak, the main working frequency band is high frequency above 7kHz, and the main working frequency band of the plane vibrating diaphragm is 10kHz ⁇ 20kHz.
  • MEMS micro-electro-mechanical systems
  • FIG. 4 is a schematic structural diagram of an exemplary TWS headset of the present application.
  • the TWS earphone 40 includes: an audio signal processing path 41, a frequency divider 42 and at least two speakers 43; wherein, the output end of the audio signal processing path 41 is connected to the input end of the frequency divider 42; frequency division The output end of the speaker 42 is connected to at least two speakers 43 .
  • the audio signal processing path 41 is configured to output a speaker drive signal after performing noise reduction or transparent transmission processing on the audio source; the audio source is the original music or call voice; music or call voice.
  • the frequency divider 42 is configured to divide the speaker drive signal into sub-audio signals of at least two frequency bands, the at least two frequency bands correspond to the main working frequency bands of the at least two speakers 43; the adjacent frequency band parts in the at least two frequency bands Overlapping, or at least two adjacent bands in the two bands do not overlap. At least two speakers 43 are configured to play corresponding sub audio signals.
  • the frequency band of the processed speaker driving signal corresponds to the frequency band of the audio source, and may include the whole frequency range of low, medium and high. However, because the main working frequency band of a single speaker may only cover part of the low, middle and high frequency bands, a single speaker cannot operate in the entire frequency band. reflect high-quality sound.
  • the frequency divider 42 of the present application may be configured to divide the frequency of the speaker drive signal based on the main operating frequency bands of the at least two speakers, to obtain sub-audio signals of at least two frequency bands corresponding to the main operating frequency bands of the at least two speakers, respectively, Then let each speaker play the sub audio signal of the corresponding frequency band, so that the speaker maintains the best frequency response when playing the sub audio signal transmitted to it.
  • the frequency divider 42 can be controlled to divide the frequency of the speaker driving signal in a preset manner by setting the parameters of the frequency divider 42 .
  • the above-mentioned audio signal processing path 41 may adopt the structure shown in FIG. 2 a or FIG. 2 b .
  • the SP filter is set in the codec (coder-decoder, CODEC), and the fixed value is obtained through the CODEC. SP filter.
  • the CODEC can obtain the estimated SP filter according to the preset speaker drive signal and the ear canal signal picked up by the FB microphone.
  • the ear canal signal includes the residual noise signal inside the ear canal; when the signal obtained by the estimated SP filter and the ear canal signal When the difference signal of is within the set range, the estimated SP filter is determined as a fixed SP filter.
  • Fig. 5a is an exemplary acquisition flow chart of the fixed SP filter of the present application.
  • a speaker driving signal x[n] is preset, and after passing through the frequency divider, at least two speakers are pushed to emit sound.
  • the sound is transmitted to the FB microphone, picked up by the FB microphone, and converted into a digital signal y[n].
  • the transfer function of the SP filter can be assumed to be a high-order FIR filter, which can be iteratively modeled by the least mean square error (LMS) algorithm.
  • LMS least mean square error
  • the transfer function S(z) of the real SP filter is unknown, but its All the information is contained in the speaker drive signal x[n] and the digital signal y[n] picked up by the FB microphone, so it can be passed through a higher order FIR filter Model S(z) as an estimated SP filter.
  • input x[n] get for y[n] and Calculate the difference to get the error signal e[n].
  • Fig. 5b is an exemplary acquisition flow chart of the fixed SP filter of the present application. As shown in Fig. 5b, when the difference signal between the signal obtained by the estimated SP filter and the ear canal signal is within the set range, according to the estimation The parameters of the cascaded second-order filter are obtained from the target frequency response of the SP filter and the preset frequency division requirements; the SP cascaded second-order filter is obtained according to the parameters of the cascaded second-order filter, and the SP cascaded second-order filter is as a fixed SP filter.
  • FIR finite-length unit impulse response
  • IIR infinite impulse response
  • CODEC is based on the FIR filter that has been obtained Calculate the target frequency response.
  • N the filter order
  • k the frequency point number.
  • the IIR filter parameters are obtained from the conversion process of the FIR filter to the IIR filter, and the process requires the target responses of the two to be as consistent as possible.
  • B[k] and A[k] are the complex frequency responses of the IIR filter coefficients b and a respectively, and the calculation method is the same as that of formula (1).
  • a set of IIR filter parameters under the minimum mean square can be found, and the IIR filter obtained from this set of IIR filter parameters can be used as the final fixed SP filter and put into the CODEC for hardening implementation.
  • the above-mentioned audio signal processing path 41 may adopt the structure shown in FIG. 6 .
  • the SP filter is set in a digital signal processing (digital signal process, DSP) chip, and the adaptive filter is obtained through the DSP chip. SP filter.
  • DSP digital signal processing
  • the DSP chip can obtain the real-time noise signal, and obtain the estimated SP according to the audio source (the audio source is the original music or voice of the call; or, the audio source includes the voice signal that has been enhanced by the human voice and the original music or voice of the call) and the real-time noise signal.
  • the filter determines the estimated SP filter as an adaptive SP filter when the difference signal between the signal obtained by the estimated SP filter and the real-time noise signal is within the set range.
  • the above-mentioned acquisition of the real-time noise signal may be to acquire the external signal picked up by the FF microphone (the external signal includes the external noise signal and music or voice of a call) and the ear canal signal picked up by the FB microphone (the ear canal signal includes the residual noise signal inside the ear canal and music. Or call voice), and then obtain the voice signal picked up by the main microphone, and subtract the external signal and the ear canal signal from the voice signal to obtain a real-time noise signal.
  • x[n] represents an audio source
  • y[n] fb[n]-ff[n]*A(z)-fb[n-1]*C(z)
  • fb[n] represents the voice signal picked up by the main microphone
  • ff[n]*A(z) It represents the external signal picked up by the FF microphone
  • fb[n-1]*C(z) represents the ear canal signal picked up by the FB microphone.
  • the noise reduction or transparent transmission function of the TWS headset will be concurrent with the music or the voice of the call.
  • the signal picked up by the FB microphone cannot be directly estimated by the SP filter, and it is necessary to remove the influence of other signals.
  • Modeling analysis is to obtain real-time noise signal, so the SP filter obtained at this time can be adapted to the real-time situation of the audio source and noise signal instead of being fixed.
  • the SP filter (including fixed SP filter or adaptive SP filter) is determined by the above method, which can not only realize the function of noise reduction or transparent transmission, but also prevent the transparent transmission or noise reduction technology in the process of playing music or passing through. Also remove music or call sounds.
  • the TWS earphone includes two speakers: a moving coil speaker and a moving iron speaker.
  • the dotted line represents the frequency response curve of the moving coil speaker
  • the single-dotted line represents the frequency response curve of the moving iron speaker
  • the solid line represents the crossover line.
  • FIG. 7a is an exemplary schematic diagram of signal frequency division of the present application.
  • the frequency divider 42 is configured to attenuate the dynamic speaker in the non-main operating frequency band, and keep the dynamic speaker in the main operating frequency band.
  • the power of the moving iron speaker is attenuated in the non-main working frequency band of the moving iron speaker, the power of the moving iron speaker in the main working frequency band is retained, and the frequency is divided at the intersection frequency of the attenuation frequency band of the moving coil speaker and the moving iron speaker. Get the sub audio signal of the two frequency bands.
  • FIG. 7b is an exemplary schematic diagram of signal frequency division of the application.
  • the frequency divider 42 is configured to align and attenuate in the non-main working frequency band of the moving iron speaker, and retain the moving iron speaker in the main working frequency band. power, and divide the frequency at a certain frequency point in the attenuation frequency band of the moving iron speaker to obtain sub-audio signals of two frequency bands.
  • Fig. 7c is an exemplary schematic diagram of the frequency division of the signal of the present application.
  • the frequency divider 42 is configured to perform two-stage attenuation in the non-main operating frequency band of the moving coil speaker, leaving the moving coil speaker at The power of the main working frequency band is attenuated in two stages in the non-main working frequency band of the moving iron speaker, retaining the power of the moving iron speaker in the main working frequency band, and attenuating the first and second attenuation bands of the moving iron speaker
  • a frequency division is performed at the transition frequency point of the frequency band, and a frequency division is performed at the transition frequency point of the first attenuation frequency band and the second attenuation frequency band of the moving coil speaker to obtain sub audio signals of three frequency bands.
  • the TWS earphone includes three speakers: a moving coil speaker, a moving iron speaker, and a MEMS speaker.
  • the dotted line represents the frequency response curve of the moving coil speaker
  • the single-dotted line represents the frequency response curve of the moving iron speaker
  • the double-dotted line represents the frequency response curve of the MEMS speaker
  • the solid line represents the crossover line.
  • Fig. 7d is an exemplary schematic diagram of the frequency division of the signal of the present application.
  • the frequency divider 42 is configured to attenuate the MEMS speaker in the non-main operating frequency band , retain the power of the MEMS speaker in the main working frequency band, and perform a frequency division at the intersection frequency point in the respective attenuation frequency bands of the moving iron speaker and the MEMS speaker, and obtain a total of four frequency bands of sub-audio signals.
  • FIGS. 7 a to 7 d are several examples in which the frequency divider 42 divides the speaker driving signal, and the present application does not limit the specific frequency division method of the frequency divider 42 .
  • At least two speakers are set in the TWS earphone of the present application, and the main working frequency bands of the at least two speakers are not identical.
  • the frequency divider can divide the speaker drive signal into sub-audio signals of at least two frequency bands, and the adjacent frequency bands in the at least two frequency bands partially overlap or do not overlap, so that each sub-audio signal is respectively transmitted to the speakers with matching frequency bands,
  • the aforementioned frequency band matching can mean that the main working frequency band of the speaker covers the frequency band of the sub-audio signal transmitted to it, so that the speaker maintains the best frequency response when playing the sub-audio signal transmitted to it. It reflects high sound quality and supports ultra-bandwidth voice calls.
  • FIG. 8a is a schematic structural diagram of an exemplary TWS headset of the present application.
  • the TWS earphone 40 further includes: a first DAC 44 .
  • the input end of the first DAC 44 is connected with the output end of the audio signal processing path 41, and the output end of the first DAC 44 is connected with the input end of the frequency divider 42.
  • the first DAC 44 is configured to convert the speaker drive signal from digital to analog.
  • the frequency divider 42 is an analog frequency divider.
  • the speaker driving signal is obtained, if the signal does not pass through the DAC, it is in digital form, but the signal played by the speaker needs to be in analog form, so it can pass the first DAC first.
  • the speaker driving signal in digital form is converted into a speaker driving signal in analog form, and then the analog frequency divider is used to divide the frequency of the speaker driving signal in analog form into sub audio signals of at least two frequency bands.
  • This embodiment adopts a structure of converting first and then dividing the frequency.
  • FIG. 8b is an exemplary structural schematic diagram of the TWS headset of the present application. As shown in Fig. 8b, the structure of this embodiment is a more detailed implementation of the structure shown in Fig. 8a.
  • the main microphone 601 and the FF microphone 602 in the TWS earphone 60 are respectively connected to the input end of the vocal enhancement filter 603 , and the output end of the vocal enhancement filter 603 and the audio source 604 (including music and call speech)
  • the downlink signal is obtained after superposition; the downlink signal is transmitted to an input of the stacker 2; the downlink signal is also transmitted to the input of the SP filter 605; connected, the output end of the feedforward filter 606 is connected to the other input end of the stacker 2; the FB microphone 607 is connected to one input end of the stacker 3, and the output end of the SP filter 605 is connected to the other input end of the stacker 3 connected, the output end of the stacker 3 is connected to the input end of the feedback filter 608, and the output end of the feedback filter 608 is connected to the third input end of the stacker 2; the output end of the stacker 2 is connected to the digital-to-analog converter (DAC) ) 609 is connected to the input end, the output end of the DAC
  • the TWS earphone 60 in this embodiment is provided with two speakers, namely a moving coil speaker 611a and a moving iron speaker 611b.
  • the main working frequency band of the moving coil speaker 611a is below 8.5 kHz, and the main working frequency band of the moving iron speaker 611b is above 8.5 kHz.
  • the frequency divider 42 can be set to divide the analog speaker drive signal into a sub audio signal below 8.5kHz and a sub audio signal above 8.5kHz, and the moving coil speaker 611a can maintain the optimal playback of the sub audio signal below 8.5kHz Frequency response, the moving iron speaker 611b can maintain the best frequency response by playing sub-audio signals above 8.5kHz, so that the TWS earphone 60 can not only reflect high sound quality in various frequency bands of the audio source, but also support ultra-bandwidth voice calls.
  • FIG. 8c is an exemplary schematic structural diagram of the TWS headset of the present application. As shown in FIG. 8c, the structure of this embodiment is another more detailed implementation of the structure shown in FIG. 8a.
  • the difference from the structure shown in FIG. 8b is that the output end of the analog frequency divider 610 is connected to the moving coil speaker 611a, the moving iron speaker 611b, the MEMS speaker 611c and the planar vibrating diaphragm 611d.
  • the TWS earphone 60 in this embodiment is provided with four speakers, namely a moving coil speaker 611a, a moving iron speaker 611b, a MEMS speaker 611c, and a plane vibration diaphragm 611d.
  • the main working frequency of the moving coil speaker 611a is below 8.5 kHz, and the moving iron
  • the main working frequency band of the speaker 611b is above 8.5kHz, and the main working frequency band of the MEMS speaker 611c depends on the application form.
  • the main working frequency band of the planar vibrating diaphragm 611d is 10 kHz to 20 kHz.
  • the frequency divider 42 can be set to divide the analog speaker driving signal into four sub-frequency bands, and the moving coil speaker 611a plays 8.5 kHz.
  • the following sub-audio signals can maintain the best frequency response
  • the moving iron speaker 611b can maintain the best frequency response by playing the sub-audio signal above 8.5kHz
  • the MEMS speaker 611c can maintain the best frequency response by playing the sub-audio signal above 7kHz
  • the plane vibrates The diaphragm 611d can maintain the best frequency response by playing the sub-audio signal above 10 kHz, so that the TWS earphone 60 can not only reflect the high sound quality in each frequency band of the audio source, but also support ultra-bandwidth voice calls.
  • the vocal enhancement filter 603, the audio source 604 and the AHA joint controller 612 are arranged in a digital signal processing (digital signal process, DSP) chip, the feedforward filter 606, the feedback filter 608, and the SP filter 605.
  • DSP digital signal processing
  • the DAC 609 is set in the codec (coder-decoder, CODEC).
  • FIG. 8d is an exemplary schematic structural diagram of the TWS headset of the present application. As shown in Fig. 8d, the structure of this embodiment is a more detailed implementation of the structure shown in Fig. 8a.
  • the difference from the structure shown in Fig. 8b is that the vocal enhancement filter 603 is moved from the DSP chip to the CODEC.
  • FIG. 8e is a schematic structural diagram of an exemplary TWS headset of the present application. As shown in FIG. 8e, the structure of this embodiment is a more detailed implementation of the structure shown in FIG. 8a.
  • the difference from the structure shown in Fig. 8c is that the vocal enhancement filter 603 is moved from the DSP chip to the CODEC.
  • FIG. 9a is a schematic structural diagram of an exemplary TWS headset of the present application.
  • the TWS earphone 40 further includes: at least two second DACs 45 .
  • the input ends of the at least two second DACs 45 are connected to the output ends of the frequency divider 42, and the output ends of the at least two second DACs 45 are respectively connected to one of the at least two speakers 43.
  • the second DAC 45 is configured to convert one of the sub-audio signals of at least two frequency bands from a digital form to an analog form.
  • the frequency divider 42 is a digital frequency divider.
  • the speaker driving signal After the speaker driving signal is obtained, if the signal does not pass through the DAC, it is in digital form, but the signal played by the speaker needs to be in analog form, so it can be divided by digital frequency first.
  • the device divides the speaker driving signal in digital form into sub audio signals in at least two frequency bands, and then converts the sub audio signals in digital form transmitted to it into sub audio in analog form through at least two second DACs respectively. Signal.
  • This embodiment adopts the structure of frequency division first and then conversion.
  • FIG. 9b is an exemplary schematic structural diagram of the TWS headset of the present application. As shown in FIG. 9b, the structure of this embodiment is a more detailed implementation of the structure shown in FIG. 9a.
  • the main microphone 701 and the FF microphone 702 in the TWS earphone 70 are respectively connected to the input end of the vocal enhancement filter 703 , and the output end of the vocal enhancement filter 703 and the audio source 704 (including music and talking voice) are connected to the
  • the downlink signal is obtained after superposition; the downlink signal is transmitted to one input of the stacker 2; the downlink signal is also transmitted to the input of the SP filter 705; connected, the output end of the feedforward filter 706 is connected to the other input end of the stacker 2; the FB microphone 707 is connected to one input end of the stacker 3, and the output end of the SP filter 705 is connected to the other input end of the stacker 3 connected, the output end of the superimposed device 3 is connected to the input end of the feedback filter 708, and the output end of the feedback filter 708 is connected to the third input end of the superimposed device 2; The input end is connected, and the output end of the digital frequency divider 709 is connected to the input ends of the two DACs 710a and 7
  • FIG. 9c is an exemplary schematic structural diagram of the TWS headset of the present application. As shown in Fig. 9c, the structure of this embodiment is a more detailed implementation of the structure shown in Fig. 9a.
  • the TWS earphone 70 in this embodiment is provided with four speakers, namely a moving coil speaker 711a, a moving iron speaker 711b, a MEMS speaker 711c, and a plane vibration diaphragm 711d.
  • the main working frequency of the moving coil speaker 711a is below 8.5 kHz, and the moving iron
  • the main working frequency band of the speaker 711b is above 8.5kHz, and the main working frequency band of the MEMS speaker 711c depends on the application form.
  • the main working frequency band of the planar vibrating diaphragm 711d is 10 kHz to 20 kHz.
  • the frequency divider 42 can be set to divide the analog speaker driving signal into four sub-frequency bands, and the moving coil speaker 711a plays 8.5 kHz.
  • the following sub-audio signals can maintain the best frequency response
  • the moving iron speaker 711b can maintain the best frequency response by playing the sub-audio signal above 8.5kHz
  • the MEMS speaker 711c can maintain the best frequency response by playing the sub-audio signal above 7kHz
  • the plane vibrates The diaphragm 711d can maintain the best frequency response by playing sub-audio signals above 10 kHz, so that the TWS earphone 70 can not only reflect high sound quality in various frequency bands of the audio source, but also support ultra-bandwidth voice calls.
  • the vocal enhancement filter 703, the audio source 704 and the AHA joint controller 712 are arranged in the DSP chip, and the feedforward filter 706, the feedback filter 708, the SP filter 705 and the DAC 709 are arranged in the CODEC.
  • FIG. 9d is an exemplary schematic structural diagram of the TWS headset of the present application. As shown in FIG. 9d, the structure of this embodiment is a more detailed implementation of the structure shown in FIG. 9a.
  • the difference from the structure shown in Fig. 9b is that the vocal enhancement filter 703 is moved from the DSP chip to the CODEC.
  • FIG. 9e is an exemplary schematic structural diagram of the TWS headset of the present application. As shown in FIG. 9e, the structure of this embodiment is a more detailed implementation of the structure shown in FIG. 9a.
  • the difference from the structure shown in Fig. 9c is that the vocal enhancement filter 703 is moved from the DSP chip to the CODEC.
  • FIG. 10 is an exemplary flowchart of a method for playing a TWS headset of the present application. As shown in FIG. 10 , the method of this embodiment may be applied to the TWS headset in the above-mentioned embodiments. The method can include:
  • Step 1001 Acquire an audio source.
  • the audio source is original music or call voice, that is, the audio source may be music, video sound, etc. that the user is listening to with the headset, or the call voice when the user is making a call with the headset.
  • the audio source may come from the player of the electronic device.
  • the audio source includes the voice signal enhanced by the human voice and the original music or voice of the call, that is, in addition to the music or voice of the call in the above two cases, the audio source can also superimpose the voice of the outside world enhanced by the human voice. Signal.
  • the external speech signal processed by the human voice enhancement can be obtained by the human voice enhancement filter in the structure shown in FIG. 2 a or FIG. 2 b , which will not be repeated here.
  • Step 1002 performing noise reduction or transparent transmission processing on the audio source to obtain a speaker driving signal.
  • the fixed secondary path SP filter can be obtained through the CODEC, then the audio source is processed according to the fixed SP filter to obtain a filtered signal, and then the filtered signal is subjected to noise reduction or transparent transmission processing to obtain the speaker driver Signal.
  • the CODEC can obtain the estimated SP filter according to the preset speaker drive signal and the ear canal signal picked up by the feedback FB microphone.
  • the ear canal signal includes the residual noise signal inside the ear canal.
  • the cascaded second-order filter is obtained according to the target frequency response of the estimated SP filter and the preset frequency division requirement. According to the parameters of the cascaded second-order filter, the SP cascaded second-order filter is obtained, and the SP cascaded second-order filter is used as a fixed SP filter.
  • the adaptive SP filter can be obtained through the DSP chip, and then the audio source is processed according to the adaptive SP filter to obtain a filtered signal, and then the filtered signal is subjected to noise reduction or transparent transmission processing to obtain the speaker driver Signal.
  • the DSP chip can obtain the real-time noise signal, and obtain the estimated SP filter according to the audio source and the real-time noise signal.
  • the estimated SP The filter is determined to be an adaptive SP filter.
  • the DSP chip can first obtain the external signal picked up by the FF microphone and the ear canal signal picked up by the FB microphone.
  • the external signal includes the external noise signal and music or call voice
  • the ear canal signal includes the residual noise signal inside the ear canal and music. Or call voice, and then obtain the voice signal picked up by the main microphone, and finally subtract the external signal and the ear canal signal from the voice signal to obtain a real-time noise signal.
  • Step 1003 Divide the speaker driving signal into sub-audio signals of at least two frequency bands.
  • the frequency band of the processed speaker driving signal corresponds to the frequency band of the audio source, and may include the whole frequency range of low, medium and high. However, because the main working frequency band of a single speaker may only cover part of the low, middle and high frequency bands, a single speaker cannot operate in the entire frequency band. reflect high-quality sound.
  • the frequency divider of the present application may be configured to divide the frequency of the speaker drive signal based on the main working frequency bands of the at least two speakers, so as to obtain sub-audio signals of at least two frequency bands corresponding to the main working frequency bands of the at least two speakers, and then Let each speaker play the sub audio signal of the corresponding frequency band, so that the speaker maintains the best frequency response when playing the sub audio signal transmitted to it.
  • Adjacent frequency bands in the aforementioned at least two frequency bands partially overlap, or adjacent frequency bands in the at least two frequency bands do not overlap. For example, a crossover divides the speaker drive signal into high and low frequency bands, the high and low frequency bands are completely separated, and there is no overlap; or, the high and low frequency bands partially overlap.
  • the frequency divider divides the speaker driving signal into three frequency bands: high, middle and low frequency.
  • the high frequency band and the middle frequency band are completely separated without overlapping.
  • the present application can control the frequency divider 42 to divide the frequency of the speaker driving signal in a preset manner by setting the parameters of the frequency divider 42 .
  • frequency division reference may be made to FIG. 6a to FIG. 6d , which will not be repeated here.
  • Step 1004 Play one of the sub audio signals of at least two frequency bands through at least two speakers respectively.
  • At least two speakers are set in the TWS earphone of the present application, and the main working frequency bands of the at least two speakers are not identical.
  • the frequency divider can divide the speaker drive signal into sub-audio signals of at least two frequency bands, and the adjacent frequency bands in the at least two frequency bands partially overlap or do not overlap, so that each sub-audio signal is respectively transmitted to the speakers with matching frequency bands,
  • the aforementioned frequency band matching can mean that the main working frequency band of the speaker covers the frequency band of the sub-audio signal transmitted to it, so that the speaker maintains the best frequency response when playing the sub-audio signal transmitted to it. It reflects high sound quality and supports ultra-bandwidth voice calls.
  • FIG. 11 is an exemplary structural diagram of a playback device for a TWS headset of the present application.
  • the device 1100 of this embodiment can be applied to the TWS headset in the above-mentioned embodiment, and the device 1100 includes: an acquisition module 1101, Processing module 1102 , frequency dividing module 1103 and playing module 1104 . in,
  • the acquisition module 1101 is used to acquire an audio source, the audio source is the original music or call voice, or, the audio source includes the voice signal enhanced by the human voice and the original music or call voice; processing module 1102 , used to perform noise reduction or transparent transmission processing on the audio source to obtain a speaker drive signal; the frequency division module 1103 is used to divide the speaker drive signal into sub-audio signals of at least two frequency bands, the at least two Adjacent frequency bands in the frequency bands partially overlap, or, adjacent frequency bands in the at least two frequency bands do not overlap; the playing module 1104 is configured to play the sub audio signals of the at least two frequency bands through at least two speakers respectively. one.
  • the processing module 1102 is specifically configured to obtain a fixed secondary path SP filter through a codec CODEC; process the audio source according to the fixed SP filter to obtain a filtered signal;
  • the speaker driving signal is obtained by performing noise reduction or transparent transmission processing on the filtered signal.
  • the processing module 1102 is specifically configured to obtain an estimated SP filter according to a preset speaker drive signal and an ear canal signal picked up by the feedback FB microphone, where the ear canal signal includes the ear canal Internal residual noise signal and the music or talking speech; when the difference signal between the signal obtained by the estimated SP filter and the ear canal signal is within a set range, the estimated SP filter is determined as the fixed SP filter.
  • the processing module 1102 is further configured to, when the difference signal between the signal obtained by the estimated SP filter and the ear canal signal is within a set range, according to the estimation
  • the parameters of the cascaded second-order filter are obtained from the target frequency response of the SP filter and the preset frequency division requirements; the SP cascaded second-order filter is obtained according to the parameters of the cascaded second-order filter, and the SP is cascaded A second order filter acts as the fixed SP filter.
  • the processing module 1102 is specifically configured to obtain an adaptive SP filter through a digital signal processing DSP chip; process the audio source according to the adaptive SP filter to obtain a filtered signal;
  • the speaker driving signal is obtained by performing noise reduction or transparent transmission processing on the filtered signal.
  • the processing module 1102 is specifically configured to obtain a real-time noise signal; obtain an estimated SP filter according to the audio source and the real-time noise signal; When the difference signal between the signal and the real-time noise signal is within a set range, the estimated SP filter is determined as the adaptive SP filter.
  • the processing module 1102 is specifically configured to acquire the external signal picked up by the feedforward FF microphone and the ear canal signal picked up by the feedback FB microphone, where the external signal includes external noise signals and the music Or call voice, the ear canal signal includes the residual noise signal inside the ear canal and the music or the call voice; obtain the voice signal picked up by the main microphone; subtract the external signal and the ear canal from the voice signal The signal obtains a signal difference; the estimated SP filter is obtained from the audio source and the signal difference.
  • the main working frequency bands of the at least two speakers are not identical.
  • the at least two speakers include a moving coil speaker and a moving iron speaker.
  • the at least two speakers include a moving coil speaker, a moving iron speaker, a microelectromechanical system MEMS speaker, and a planar vibrating diaphragm.
  • the apparatus of this embodiment can be used to execute the technical solution of the method embodiment shown in FIG. 10 , and its implementation principle and technical effect are similar, and details are not repeated here.
  • each step of the above method embodiments may be completed by a hardware integrated logic circuit in a processor or an instruction in the form of software.
  • the processor can be a general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field programmable gate array (FPGA), or other Programming logic devices, discrete gate or transistor logic devices, discrete hardware components.
  • DSP digital signal processor
  • ASIC application-specific integrated circuit
  • FPGA field programmable gate array
  • a general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
  • the steps of the methods disclosed in the embodiments of the present application may be directly embodied as executed by a hardware coding processor, or executed by a combination of hardware and software modules in the coding processor.
  • the software modules may be located in random access memory, flash memory, read-only memory, programmable read-only memory or electrically erasable programmable memory, registers and other storage media mature in the art.
  • the storage medium is located in the memory, and the processor reads the information in the memory, and completes the steps of the above method in combination with its hardware.
  • the memory mentioned in the above embodiments may be volatile memory or non-volatile memory, or may include both volatile and non-volatile memory.
  • the non-volatile memory may be read-only memory (ROM), programmable read-only memory (PROM), erasable programmable read-only memory (EPROM), electrically programmable Erase programmable read-only memory (electrically EPROM, EEPROM) or flash memory.
  • Volatile memory may be random access memory (RAM), which acts as an external cache.
  • RAM random access memory
  • DRAM dynamic random access memory
  • SDRAM synchronous DRAM
  • SDRAM double data rate synchronous dynamic random access memory
  • ESDRAM enhanced synchronous dynamic random access memory
  • SLDRAM synchronous link dynamic random access memory
  • direct rambus RAM direct rambus RAM
  • the disclosed system, apparatus and method may be implemented in other manners.
  • the apparatus embodiments described above are only illustrative.
  • the division of the units is only a logical function division. In actual implementation, there may be other division methods.
  • multiple units or components may be combined or Can be integrated into another system, or some features can be ignored, or not implemented.
  • the shown or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, indirect coupling or communication connection of devices or units, and may be in electrical, mechanical or other forms.
  • the units described as separate components may or may not be physically separated, and components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution in this embodiment.
  • each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit.
  • the functions, if implemented in the form of software functional units and sold or used as independent products, may be stored in a computer-readable storage medium.
  • the technical solution of the present application can be embodied in the form of a software product in essence, or the part that contributes to the prior art or the part of the technical solution.
  • the computer software product is stored in a storage medium, including Several instructions are used to cause a computer device (personal computer, server, or network device, etc.) to execute all or part of the steps of the methods described in the various embodiments of the present application.
  • the aforementioned storage medium includes: U disk, mobile hard disk, read-only memory (ROM), random access memory (RAM), magnetic disk or optical disk and other media that can store program codes .

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Soundproofing, Sound Blocking, And Sound Damping (AREA)

Abstract

The present application provides a TWS earphone and a playing method and device of the TWS earphone. The present TWS earphone comprises an audio signal processing path, a frequency divider, and at least two loudspeakers; an output end of the audio signal processing path is connected to an input end of the frequency divider; an output end of the frequency divider is connected to the at least two loudspeakers; the audio signal processing path is configured to output a loudspeaker driving signal; the frequency divider is configured to perform frequency division on the loudspeaker driving signal to obtain sub-audio signals of at least two frequency bands, the at least two frequency bands corresponding to main working frequency bands of the at least two loudspeakers; adjacent frequency bands among the at least two frequency bands partially overlap, or adjacent frequency bands among the at least two frequency bands do not overlap; and the at least two loudspeakers are configured to play the corresponding sub-audio signals. The present application can not only reflect high sound quality in frequency bands of an audio source, but also support an ultra-bandwidth voice call.

Description

TWS耳机和TWS耳机的播放方法及装置TWS earphone and TWS earphone playing method and device
本申请要求于2021年4月28日提交中国专利局、申请号为202110467311.9、申请名称为“TWS耳机和TWS耳机的播放方法及装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application with the application number of 202110467311.9 and the application name "TWS earphone and TWS earphone playback method and device" filed with the Chinese Patent Office on April 28, 2021, the entire contents of which are incorporated by reference in in this application.
技术领域technical field
本申请涉及音频处理技术,尤其涉及一种真无线立体声(true wireless stereo,TWS)耳机和TWS耳机的播放方法及装置。The present application relates to audio processing technology, and in particular, to a true wireless stereo (true wireless stereo, TWS) headset and a TWS headset playback method and device.
背景技术Background technique
在用户的耳机使用场景中,普遍需求是在拥有高品质音乐或宽带高清通话的同时,实现稳定的主动降噪(active noise cancellation or active noise control,ANC)或透传(hear through,HT)功能。In the user's headset usage scenario, the general demand is to achieve stable active noise cancellation (active noise cancellation or active noise control, ANC) or transparent transmission (hear through, HT) function while having high-quality music or broadband HD calls .
目前的各类TWS耳机虽然实现ANC功能,但同时也会消除部分音乐或通话声音,影响音质和通话清晰度。Although the current various TWS earphones implement the ANC function, they will also eliminate some music or call sounds, which will affect the sound quality and call clarity.
发明内容SUMMARY OF THE INVENTION
本申请提供一种TWS耳机和TWS耳机的播放方法及装置,既可以在音频源的各个频段上体现出高音质,又可以支持超带宽的语音通话。The present application provides a TWS earphone and a TWS earphone playing method and device, which can not only reflect high sound quality in various frequency bands of an audio source, but also support ultra-bandwidth voice calls.
第一方面,本申请提供一种TWS耳机,包括:音频信号处理通路、分频器和至少两个扬声器;其中,该音频信号处理通路的输出端和该分频器的输入端连接;该分频器的输出端和该至少两个扬声器连接;该音频信号处理通路,被配置为对音频源进行降噪或透传处理后输出扬声器驱动信号;该音频源为原始的音乐或通话语音;或者,该音频源包括经人声增强处理的语音信号和该原始的音乐或通话语音;该分频器,被配置为将该扬声器驱动信号分频成至少两个频段的子音频信号,该至少两个频段和该至少两个扬声器的主要工作频段对应;该至少两个频段中的相邻频段部分重叠,或者,该至少两个频段中的相邻频段不重叠;该至少两个扬声器,被配置为播放对应的子音频信号。In a first aspect, the present application provides a TWS earphone, comprising: an audio signal processing path, a frequency divider and at least two speakers; wherein the output end of the audio signal processing path is connected to the input end of the frequency divider; the frequency divider The output end of the frequency converter is connected to the at least two speakers; the audio signal processing path is configured to output a speaker drive signal after noise reduction or transparent transmission processing is performed on the audio source; the audio source is original music or call voice; or , the audio source includes the voice signal enhanced by the human voice and the original music or call voice; the frequency divider is configured to divide the speaker drive signal into sub-audio signals of at least two frequency bands, the at least two frequency bands The frequency bands correspond to the main working frequency bands of the at least two speakers; the adjacent frequency bands in the at least two frequency bands partially overlap, or the adjacent frequency bands in the at least two frequency bands do not overlap; the at least two speakers are configured with To play the corresponding sub audio signal.
经处理后的扬声器驱动信号的频段与音频源的频段相对应,可能包含低中高整个频段,但由于单个扬声器的主要工作频段可能只能覆盖低中高的其中部分频段,导致单个扬声器无法在整个频段体现出高音质来。The frequency band of the processed speaker driving signal corresponds to the frequency band of the audio source, and may include the whole frequency range of low, medium and high. However, because the main working frequency band of a single speaker may only cover part of the low, middle and high frequency bands, a single speaker cannot operate in the entire frequency band. reflect high-quality sound.
本申请通过设置分频器的参数以控制分频器按照预设的方式对扬声器驱动信号进行分频。分频器可以被配置为基于至少两个扬声器的主要工作频段对扬声器驱动信号进行分频,得到与至少两个扬声器的主要工作频段分别对应的至少两个频段的子音频信号,然后让各个扬声器分别播放对应频段的子音频信号,从而使扬声器在播放传输给其的子音频信号时维持最佳频响。通过设置分频器的参数以控制分频器按照预设的方式对扬声器驱动信号进行分频。The present application controls the frequency divider to divide the frequency of the speaker drive signal in a preset manner by setting the parameters of the frequency divider. The frequency divider may be configured to divide the frequency of the speaker drive signal based on the main working frequency bands of the at least two speakers to obtain sub-audio signals of at least two frequency bands corresponding to the main working frequency bands of the at least two speakers, and then let each speaker Play the sub audio signals of the corresponding frequency bands respectively, so that the speaker maintains the best frequency response when playing the sub audio signals transmitted to it. By setting the parameters of the frequency divider to control the frequency divider to divide the frequency of the speaker driving signal according to the preset method.
本申请的TWS耳机中设置了至少两个扬声器,该至少两个扬声器的主要工作频段不完全相同。分频器可以将扬声器驱动信号分频成至少两个频段的子音频信号,该至少两个 频段中的相邻频段部分重叠或者不重叠,这样将各个子音频信号分别传输至频段匹配的扬声器,前述频段匹配可以是指扬声器的主要工作频段覆盖了传输给其的子音频信号的频段,这样扬声器在播放传输给其的子音频信号时维持最佳频响,既可以在音频源的各个频段上体现出高音质,又可以支持超带宽的语音通话。At least two speakers are set in the TWS earphone of the present application, and the main working frequency bands of the at least two speakers are not identical. The frequency divider can divide the speaker drive signal into sub-audio signals of at least two frequency bands, and the adjacent frequency bands in the at least two frequency bands partially overlap or do not overlap, so that each sub-audio signal is respectively transmitted to the speakers with matching frequency bands, The aforementioned frequency band matching can mean that the main working frequency band of the speaker covers the frequency band of the sub-audio signal transmitted to it, so that the speaker maintains the best frequency response when playing the sub-audio signal transmitted to it. It reflects high sound quality and supports ultra-bandwidth voice calls.
在一种可能的实现方式中,该音频信号处理通路包括:次级路径SP滤波器,被配置为在降噪或透传处理与该音频源并发的情况下,防止该降噪或透传处理对该音频源的声音的消除。In a possible implementation, the audio signal processing path includes: a secondary path SP filter configured to prevent the noise reduction or pass-through processing when the noise reduction or pass-through processing is concurrent with the audio source Cancellation of sound from this audio source.
在一种可能的实现方式中,该音频信号处理通路还包括:反馈FB麦克风、反馈滤波器;其中,该FB麦克风,被配置为拾取耳道信号,该耳道信号包括耳道内部的残留噪声信号和该音乐或通话语音;该SP滤波器,被配置为输入该音频源,对该音频源进行处理,输出的信号与该耳道信号叠加后传输至该反馈滤波器;该反馈滤波器,被配置为生成用于该降噪或透传处理的信号,该降噪或透传处理的信号是生成该扬声器驱动信号的其中一个叠加信号。In a possible implementation manner, the audio signal processing path further includes: a feedback FB microphone and a feedback filter; wherein the FB microphone is configured to pick up an ear canal signal, and the ear canal signal includes residual noise inside the ear canal signal and the music or the voice of the conversation; the SP filter is configured to input the audio source, process the audio source, and transmit the output signal and the ear canal signal to the feedback filter after superimposing; the feedback filter, is configured to generate a signal for the noise reduction or pass-through processing, the noise reduction or pass-through processed signal being one of the superimposed signals for generating the speaker drive signal.
在一种可能的实现方式中,该反馈FB麦克风、该反馈滤波器和该SP滤波器设置于编译码器CODEC中。In a possible implementation manner, the feedback FB microphone, the feedback filter and the SP filter are set in the codec CODEC.
在一种可能的实现方式中,该SP滤波器,被配置为输入该音频源,对该音频源进行处理,输出的信号是该扬声器驱动信号的其中一个叠加信号。In a possible implementation manner, the SP filter is configured to input the audio source, process the audio source, and the output signal is one of the superimposed signals of the speaker driving signal.
在一种可能的实现方式中,该SP滤波器设置于数字信号处理DSP芯片中。In a possible implementation manner, the SP filter is set in a digital signal processing DSP chip.
本申请通过上述方法确定SP滤波器(包括固定SP滤波器或者自适应SP滤波器),可以在播放音乐或通过过程中,既实现降噪或透传功能,还可以防止透传或降噪技术将音乐或通话声音也消除掉。In the present application, the SP filter (including fixed SP filter or adaptive SP filter) is determined by the above method, which can not only realize the function of noise reduction or transparent transmission, but also prevent the transparent transmission or noise reduction technology in the process of playing music or passing through. Also remove music or call sounds.
在一种可能的实现方式中,还包括:第一数模转换器DAC;该第一DAC的输入端和该音频信号处理通路的输出端连接,该第一DAC的输出端和该分频器的输入端连接;该第一DAC,被配置为将该扬声器驱动信号从数字形式转换成模拟形式;相应的,该分频器为模拟分频器。In a possible implementation, it further includes: a first digital-to-analog converter DAC; the input end of the first DAC is connected to the output end of the audio signal processing path, and the output end of the first DAC is connected to the frequency divider The first DAC is configured to convert the speaker driving signal from a digital form to an analog form; correspondingly, the frequency divider is an analog frequency divider.
在得到扬声器驱动信号后,如果该信号没有经过DAC之前,均是数字形式的,但扬声器播放的信号需要是模拟形式的,因此可以先通过第一DAC将数字形式的扬声器驱动信号转换成模拟形式的扬声器驱动信号,然后通过模拟分频器将模拟形式的扬声器驱动信号分频成至少两个频段的子音频信号。After the speaker drive signal is obtained, if the signal does not pass through the DAC, it is in digital form, but the signal played by the speaker needs to be in analog form, so the digital speaker drive signal can be converted into analog form through the first DAC The speaker driving signal in the analog form is then divided into sub audio signals of at least two frequency bands through an analog frequency divider.
本实施例采用的是先转换后分频的结构。This embodiment adopts the structure of first conversion and then frequency division.
在一种可能的实现方式中,还包括:至少两个第二DAC;该至少两个第二DAC的输入端均与该分频器的输出端连接,该至少两个第二DAC的输出端分别和该至少两个扬声器的其中之一连接;该第二DAC,被配置为将该至少两个频段的子音频信号的其中之一从数字形式转换成模拟形式;相应的,该分频器为数字分频器。In a possible implementation manner, the method further includes: at least two second DACs; the input ends of the at least two second DACs are connected to the output ends of the frequency divider, and the output ends of the at least two second DACs are connected to each other. are respectively connected with one of the at least two speakers; the second DAC is configured to convert one of the sub-audio signals of the at least two frequency bands from a digital form to an analog form; correspondingly, the frequency divider is a digital divider.
本实施例采用的是先分频后转换的结构。This embodiment adopts the structure of frequency division first and then conversion.
在一种可能的实现方式中,该至少两个扬声器的主要工作频段不完全相同。In a possible implementation manner, the main working frequency bands of the at least two speakers are not identical.
在一种可能的实现方式中,该至少两个扬声器包括动圈扬声器和动铁扬声器。In a possible implementation manner, the at least two speakers include a moving coil speaker and a moving iron speaker.
本实施例的TWS耳机中设置了两个扬声器,即动圈扬声器和动铁扬声器,动圈扬声器的主要工作频段在8.5kHz以下,动铁扬声器的主要工作频段在8.5kHz以上,这样可以 设置分频器将模拟形式的扬声器驱动信号分频成8.5kHz以下的子音频信号和8.5kHz以上的子音频信号,动圈扬声器播放8.5kHz以下的子音频信号可以维持最佳频响,动铁扬声器播放8.5kHz以上的子音频信号可以维持最佳频响,从而使得TWS耳机既可以在音频源的各个频段上体现出高音质,又可以支持超带宽的语音通话。Two speakers are set in the TWS earphone of this embodiment, namely a moving coil speaker and a moving iron speaker. The main working frequency band of the moving coil speaker is below 8.5kHz, and the main working frequency band of the moving iron speaker is above 8.5kHz. The frequency divider divides the analog speaker drive signal into a sub audio signal below 8.5kHz and a sub audio signal above 8.5kHz. The moving coil speaker can maintain the best frequency response by playing the sub audio signal below 8.5kHz. The sub-audio signal above 8.5kHz can maintain the best frequency response, so that the TWS headset can not only reflect the high sound quality in each frequency band of the audio source, but also support ultra-bandwidth voice calls.
在一种可能的实现方式中,该至少两个扬声器包括动圈扬声器、动铁扬声器、微机电系统MEMS扬声器和平面振动膜片。In a possible implementation manner, the at least two speakers include a moving coil speaker, a moving iron speaker, a MEMS speaker and a planar vibration diaphragm.
本实施例的TWS耳机中设置了四个扬声器,即动圈扬声器、动铁扬声器、MEMS扬声器和平面振动膜片,动圈扬声器的主要工作频段在8.5kHz以下,动铁扬声器的主要工作频段在8.5kHz以上,MEMS扬声器的主要工作频段取决于应用形态,入耳式耳机的主要工作频段是全频带,头戴式耳机的主要工作频段在7kHz以下偏弱,主要作用频带在7kHz以上的高频,平面振动膜片的主要工作频段是10kHz~20kHz,这样可以设置分频器将模拟形式的扬声器驱动信号分频成四个子频段,动圈扬声器播放8.5kHz以下的子音频信号可以维持最佳频响,动铁扬声器播放8.5kHz以上的子音频信号可以维持最佳频响,MEMS扬声器播放7kHz以上的子音频信号可以维持最佳频响,平面振动膜片播放10kHz以上的子音频信号可以维持最佳频响,从而使得TWS耳机既可以在音频源的各个频段上体现出高音质,又可以支持超带宽的语音通话。Four speakers are set in the TWS earphone of this embodiment, namely, a moving coil speaker, a moving iron speaker, a MEMS speaker and a plane vibration diaphragm. Above 8.5kHz, the main working frequency band of MEMS speakers depends on the application form. The main working frequency band of in-ear headphones is the full frequency band, the main working frequency band of headphone is weaker below 7kHz, and the main working frequency band is high frequency above 7kHz. The main working frequency band of the flat vibrating diaphragm is 10kHz~20kHz, in this way, the frequency divider can be set to divide the analog speaker driving signal into four sub-frequency bands, and the sub-audio signal below 8.5kHz can be played by the moving coil speaker to maintain the best frequency response. , the moving iron speaker can maintain the best frequency response when playing sub-audio signals above 8.5kHz, the MEMS speaker can maintain the best frequency response when playing sub-audio signals above 7kHz, and the flat vibrating diaphragm can maintain the best frequency response when playing sub-audio signals above 10kHz frequency response, so that the TWS headset can not only reflect high sound quality in various frequency bands of the audio source, but also support ultra-bandwidth voice calls.
第二方面,本申请提供一种TWS耳机的播放方法,该方法应用于上述第一方面中任一项所述的TWS耳机;所述方法包括:获取音频源,所述音频源为原始的音乐或通话语音,或者,所述音频源包括经人声增强处理的语音信号和所述原始的音乐或通话语音;对所述音频源进行降噪或透传处理得到扬声器驱动信号;将所述扬声器驱动信号分频成至少两个频段的子音频信号,所述至少两个频段中的相邻频段部分重叠,或者,所述至少两个频段中的相邻频段不重叠;通过至少两个扬声器分别播放所述至少两个频段的子音频信号的其中之一。In a second aspect, the present application provides a method for playing a TWS headset, which is applied to the TWS headset according to any one of the first aspects above; the method includes: acquiring an audio source, where the audio source is original music or call voice, or, the audio source includes a voice signal enhanced by human voice and the original music or call voice; noise reduction or transparent transmission processing is performed on the audio source to obtain a speaker drive signal; The driving signal is divided into sub-audio signals of at least two frequency bands, and the adjacent frequency bands in the at least two frequency bands partially overlap, or the adjacent frequency bands in the at least two frequency bands do not overlap; Playing one of the sub audio signals of the at least two frequency bands.
在一种可能的实现方式中,所述对所述音频源进行降噪或透传处理得到扬声器驱动信号,包括:通过编译码器CODEC获取固定次级路径SP滤波器;根据所述固定SP滤波器对所述音频源进行处理得到滤波信号;对所述滤波信号进行降噪或透传处理得到所述扬声器驱动信号。In a possible implementation manner, performing noise reduction or transparent transmission processing on the audio source to obtain the speaker drive signal includes: obtaining a fixed secondary path SP filter through a codec CODEC; filtering according to the fixed SP The device processes the audio source to obtain a filtered signal; performs noise reduction or transparent transmission processing on the filtered signal to obtain the speaker drive signal.
在一种可能的实现方式中,所述通过编译码器CODEC获取固定次级路径SP滤波器,包括:根据预先设定的扬声器驱动信号和反馈FB麦克风拾取到的耳道信号获取估计SP滤波器,所述耳道信号包括耳道内部的残留噪声信号;当经所述估计SP滤波器得到的信号和所述耳道信号的差值信号处于设定范围内时,将所述估计SP滤波器确定为所述固定SP滤波器。In a possible implementation manner, the obtaining the fixed secondary path SP filter through the codec CODEC includes: obtaining an estimated SP filter according to a preset speaker drive signal and the ear canal signal picked up by the feedback FB microphone , the ear canal signal includes the residual noise signal inside the ear canal; when the difference signal between the signal obtained by the estimated SP filter and the ear canal signal is within the set range, the estimated SP filter Determined as the fixed SP filter.
在一种可能的实现方式中,所述根据预先设定的扬声器驱动信号和反馈FB麦克风拾取到的耳道信号获取估计SP滤波器之后,还包括:当经所述估计SP滤波器得到的信号和所述耳道信号的差值信号处于设定范围内时,根据所述估计SP滤波器的目标频响和预先设置的分频需求获取级联二阶滤波器的参数;根据所述级联二阶滤波器的参数获取SP级联二阶滤波器,将所述SP级联二阶滤波器作为所述固定SP滤波器。In a possible implementation manner, after obtaining the estimated SP filter according to the preset speaker drive signal and the ear canal signal picked up by the feedback FB microphone, the method further includes: when the signal obtained by the estimated SP filter When the difference signal with the ear canal signal is within the set range, the parameters of the cascaded second-order filter are obtained according to the target frequency response of the estimated SP filter and the preset frequency division requirements; The parameters of the second-order filter are obtained from the SP cascaded second-order filter, and the SP cascaded second-order filter is used as the fixed SP filter.
在一种可能的实现方式中,所述对所述音频源进行降噪或透传处理得到扬声器驱动信号,包括:通过数字信号处理DSP芯片获取自适应SP滤波器;根据所述自适应SP滤波 器对所述音频源进行处理得到滤波信号;对所述滤波信号进行降噪或透传处理得到所述扬声器驱动信号。In a possible implementation manner, performing noise reduction or transparent transmission processing on the audio source to obtain the speaker drive signal includes: obtaining an adaptive SP filter through a digital signal processing DSP chip; filtering according to the adaptive SP The device processes the audio source to obtain a filtered signal; performs noise reduction or transparent transmission processing on the filtered signal to obtain the speaker drive signal.
在一种可能的实现方式中,所述通过数字信号处理DSP芯片获取自适应SP滤波器,包括:获取实时噪音信号;根据所述音频源和所述实时噪音信号获取估计SP滤波器;当经所述估计SP滤波器得到的信号和所述实时噪音信号的差值信号处于设定范围内时,将所述估计SP滤波器确定为所述自适应SP滤波器。In a possible implementation manner, obtaining the adaptive SP filter through a digital signal processing DSP chip includes: obtaining a real-time noise signal; obtaining an estimated SP filter according to the audio source and the real-time noise signal; When the difference signal between the signal obtained by the estimated SP filter and the real-time noise signal is within a set range, the estimated SP filter is determined as the adaptive SP filter.
在一种可能的实现方式中,所述获取实时噪音信号,包括:获取前馈FF麦克风拾取的外界信号和反馈FB麦克风拾取到的耳道信号,所述外界信号包括外界噪声信号和所述音乐或通话语音,所述耳道信号包括耳道内部的残留噪声信号和所述音乐或通话语音;获取主麦克风拾取到的语音信号;将所述语音信号减去所述外界信号和所述耳道信号得到所述实时噪音信号。In a possible implementation manner, the acquiring a real-time noise signal includes: acquiring an external signal picked up by a feed-forward FF microphone and an ear canal signal picked up by a feedback FB microphone, where the external signal includes an external noise signal and the music Or call voice, the ear canal signal includes the residual noise signal inside the ear canal and the music or the call voice; obtain the voice signal picked up by the main microphone; subtract the external signal and the ear canal from the voice signal The signal obtains the real-time noise signal.
在一种可能的实现方式中,所述至少两个扬声器的主要工作频段不完全相同。In a possible implementation manner, the main working frequency bands of the at least two speakers are not identical.
在一种可能的实现方式中,所述至少两个扬声器包括动圈扬声器和动铁扬声器。In a possible implementation manner, the at least two speakers include a moving coil speaker and a moving iron speaker.
在一种可能的实现方式中,所述至少两个扬声器包括动圈扬声器、动铁扬声器、微机电系统MEMS扬声器和平面振动膜片。In a possible implementation manner, the at least two speakers include a moving coil speaker, a moving iron speaker, a microelectromechanical system MEMS speaker, and a planar vibrating diaphragm.
第三方面,本申请提供一种TWS耳机的播放装置,该装置应用于上述第一方面中的TWS耳机;该装置包括:获取模块,用于获取音频源,所述音频源为原始的音乐或通话语音,或者,所述音频源包括经人声增强处理的语音信号和所述原始的音乐或通话语音;处理模块,用于对所述音频源进行降噪或透传处理得到扬声器驱动信号;分频模块,用于将所述扬声器驱动信号分频成至少两个频段的子音频信号,所述至少两个频段中的相邻频段部分重叠,或者,所述至少两个频段中的相邻频段不重叠;播放模块,用于通过至少两个扬声器分别播放所述至少两个频段的子音频信号的其中之一。In a third aspect, the present application provides a playback device for a TWS headset, which is applied to the TWS headset in the first aspect; the device includes: an acquisition module for acquiring an audio source, where the audio source is original music or call voice, or the audio source includes the voice signal enhanced by human voice and the original music or call voice; the processing module is used to perform noise reduction or transparent transmission processing on the audio source to obtain a speaker drive signal; A frequency dividing module, configured to divide the speaker drive signal into sub-audio signals of at least two frequency bands, and adjacent frequency bands in the at least two frequency bands partially overlap, or, adjacent frequency bands in the at least two frequency bands The frequency bands do not overlap; a playing module is configured to play one of the sub audio signals of the at least two frequency bands through at least two speakers respectively.
在一种可能的实现方式中,所述处理模块,具体用于通过编译码器CODEC获取固定次级路径SP滤波器;根据所述固定SP滤波器对所述音频源进行处理得到滤波信号;对所述滤波信号进行降噪或透传处理得到所述扬声器驱动信号。In a possible implementation manner, the processing module is specifically configured to obtain a fixed secondary path SP filter through a codec CODEC; process the audio source according to the fixed SP filter to obtain a filtered signal; The filtered signal is subjected to noise reduction or transparent transmission processing to obtain the speaker driving signal.
在一种可能的实现方式中,所述处理模块,具体用于根据预先设定的扬声器驱动信号和反馈FB麦克风拾取到的耳道信号获取估计SP滤波器,所述耳道信号包括耳道内部的残留噪声信号和所述音乐或通话语音;当经所述估计SP滤波器得到的信号和所述耳道信号的差值信号处于设定范围内时,将所述估计SP滤波器确定为所述固定SP滤波器。In a possible implementation manner, the processing module is specifically configured to obtain an estimated SP filter according to a preset speaker drive signal and an ear canal signal picked up by the feedback FB microphone, and the ear canal signal includes an internal ear canal signal. The residual noise signal and the music or talking speech; when the difference signal between the signal obtained by the estimated SP filter and the ear canal signal is within the set range, the estimated SP filter is determined as the The fixed SP filter described above.
在一种可能的实现方式中,所述处理模块,还用于当经所述估计SP滤波器得到的信号和所述耳道信号的差值信号处于设定范围内时,根据所述估计SP滤波器的目标频响和预先设置的分频需求获取级联二阶滤波器的参数;根据所述级联二阶滤波器的参数获取SP级联二阶滤波器,将所述SP级联二阶滤波器作为所述固定SP滤波器。In a possible implementation manner, the processing module is further configured to, when the difference signal between the signal obtained by the estimated SP filter and the ear canal signal is within a set range, according to the estimated SP The target frequency response of the filter and the preset frequency division requirements obtain the parameters of the cascaded second-order filter; according to the parameters of the cascaded second-order filter, the SP cascaded second-order filter is obtained, and the SP cascaded two order filter as the fixed SP filter.
在一种可能的实现方式中,所述处理模块,具体用于通过数字信号处理DSP芯片获取自适应SP滤波器;根据所述自适应SP滤波器对所述音频源进行处理得到滤波信号;对所述滤波信号进行降噪或透传处理得到所述扬声器驱动信号。In a possible implementation manner, the processing module is specifically configured to obtain an adaptive SP filter through a digital signal processing DSP chip; process the audio source according to the adaptive SP filter to obtain a filtered signal; The filtered signal is subjected to noise reduction or transparent transmission processing to obtain the speaker driving signal.
在一种可能的实现方式中,所述处理模块,具体用于获取实时噪音信号;根据所述音频源和所述实时噪音信号获取估计SP滤波器;当经所述估计SP滤波器得到的信号和所述实时噪音信号的差值信号处于设定范围内时,将所述估计SP滤波器确定为所述自适应 SP滤波器。In a possible implementation manner, the processing module is specifically configured to obtain a real-time noise signal; obtain an estimated SP filter according to the audio source and the real-time noise signal; when the signal obtained through the estimated SP filter When the difference signal with the real-time noise signal is within a set range, the estimated SP filter is determined as the adaptive SP filter.
在一种可能的实现方式中,所述处理模块,具体用于获取前馈FF麦克风拾取的外界信号和反馈FB麦克风拾取到的耳道信号,所述外界信号包括外界噪声信号和所述音乐或通话语音,所述耳道信号包括耳道内部的残留噪声信号和所述音乐或通话语音;获取主麦克风拾取到的语音信号;将所述语音信号减去所述外界信号和所述耳道信号得到信号差;根据所述音频源和所述信号差获取所述估计SP滤波器。In a possible implementation manner, the processing module is specifically configured to acquire an external signal picked up by a feedforward FF microphone and an ear canal signal picked up by a feedback FB microphone, where the external signal includes an external noise signal and the music or Call voice, the ear canal signal includes the residual noise signal inside the ear canal and the music or the call voice; obtain the voice signal picked up by the main microphone; subtract the external signal and the ear canal signal from the voice signal Obtain a signal difference; obtain the estimated SP filter according to the audio source and the signal difference.
在一种可能的实现方式中,所述至少两个扬声器的主要工作频段不完全相同。In a possible implementation manner, the main working frequency bands of the at least two speakers are not identical.
在一种可能的实现方式中,所述至少两个扬声器包括动圈扬声器和动铁扬声器。In a possible implementation manner, the at least two speakers include a moving coil speaker and a moving iron speaker.
在一种可能的实现方式中,所述至少两个扬声器包括动圈扬声器、动铁扬声器、微机电系统MEMS扬声器和平面振动膜片。In a possible implementation manner, the at least two speakers include a moving coil speaker, a moving iron speaker, a microelectromechanical system MEMS speaker, and a planar vibrating diaphragm.
第四方面,本申请提供一种计算机可读存储介质,包括计算机程序,该计算机程序在计算机上被执行时,使得该计算机执行上述第二方面中任一项该的方法。In a fourth aspect, the present application provides a computer-readable storage medium, comprising a computer program, which, when executed on a computer, causes the computer to execute the method of any one of the above-mentioned second aspects.
第五方面,本申请提供一种计算机程序,当该计算机程序被计算机执行时,用于执行上述第二方面中任一项该的方法。In a fifth aspect, the present application provides a computer program, when the computer program is executed by a computer, for performing the method of any one of the above-mentioned second aspects.
附图说明Description of drawings
图1为相关技术TWS耳机的一个示例性的结构示意图;FIG. 1 is an exemplary structural schematic diagram of a related art TWS headset;
图2a为相关技术TWS耳机的一个示例性的结构示意图;FIG. 2a is an exemplary structural schematic diagram of a related art TWS headset;
图2b为相关技术TWS耳机的一个示例性的结构示意图;FIG. 2b is an exemplary structural schematic diagram of a related art TWS headset;
图3为本申请TWS耳机的一个示例性的结构示意图;3 is a schematic structural diagram of an exemplary TWS headset of the present application;
图4为本申请TWS耳机的一个示例性的结构示意图;4 is a schematic structural diagram of an exemplary TWS headset of the present application;
图5a为本申请固定SP滤波器的一个示例性的获取流程图Fig. 5a is an exemplary acquisition flow chart of the fixed SP filter of the present application
图5b为本申请固定SP滤波器的一个示例性的获取流程图Fig. 5b is an exemplary acquisition flow chart of the fixed SP filter of the present application
图6为本申请TWS耳机的一个示例性的结构示意图FIG. 6 is a schematic structural diagram of an exemplary TWS headset of the present application
图7a为本申请信号分频的一个示例性的示意图;FIG. 7a is an exemplary schematic diagram of the signal frequency division of the present application;
图7b为本申请信号分频的一个示例性的示意图;FIG. 7b is an exemplary schematic diagram of signal frequency division of the present application;
图7c为本申请信号分频的一个示例性的示意图;FIG. 7c is an exemplary schematic diagram of signal frequency division of the present application;
图7d为本申请信号分频的一个示例性的示意图;FIG. 7d is an exemplary schematic diagram of signal frequency division of the present application;
图8a为本申请TWS耳机的一个示例性的结构示意图;FIG. 8a is an exemplary structural schematic diagram of the TWS headset of the present application;
图8b为本申请TWS耳机的一个示例性的结构示意图;FIG. 8b is an exemplary structural schematic diagram of the TWS headset of the present application;
图8c为本申请TWS耳机的一个示例性的结构示意图;FIG. 8c is an exemplary structural schematic diagram of the TWS headset of the present application;
图8d为本申请TWS耳机的一个示例性的结构示意图;FIG. 8d is an exemplary structural schematic diagram of the TWS headset of the present application;
图8e为本申请TWS耳机的一个示例性的结构示意图;FIG. 8e is an exemplary structural schematic diagram of the TWS headset of the present application;
图9a为本申请TWS耳机的一个示例性的结构示意图;FIG. 9a is an exemplary structural schematic diagram of the TWS headset of the present application;
图9b为本申请TWS耳机的一个示例性的结构示意图;FIG. 9b is an exemplary structural schematic diagram of the TWS headset of the present application;
图9c为本申请TWS耳机的一个示例性的结构示意图;FIG. 9c is an exemplary structural schematic diagram of the TWS headset of the present application;
图9d为本申请TWS耳机的一个示例性的结构示意图;FIG. 9d is an exemplary structural schematic diagram of the TWS headset of the present application;
图9e为本申请TWS耳机的一个示例性的结构示意图;FIG. 9e is an exemplary structural schematic diagram of the TWS headset of the present application;
图10为本申请TWS耳机的播放方法的一个示例性的流程图;FIG. 10 is an exemplary flowchart of the playback method of the TWS headset of the present application;
图11为本申请TWS耳机的播放装置的一个示例性的结构图。FIG. 11 is an exemplary structural diagram of the playback device of the TWS earphone of the present application.
具体实施方式Detailed ways
为使本申请的目的、技术方案和优点更加清楚,下面将结合本申请中的附图,对本申请中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。In order to make the purpose, technical solutions and advantages of the present application clearer, the technical solutions in the present application will be described clearly and completely below with reference to the accompanying drawings in the present application. Obviously, the described embodiments are part of the embodiments of the present application. , not all examples. Based on the embodiments in the present application, all other embodiments obtained by those of ordinary skill in the art without creative work fall within the protection scope of the present application.
本申请的说明书实施例和权利要求书及附图中的术语“第一”、“第二”等仅用于区分描述的目的,而不能理解为指示或暗示相对重要性,也不能理解为指示或暗示顺序。此外,术语“包括”和“具有”以及他们的任何变形,意图在于覆盖不排他的包含,例如,包含了一系列步骤或单元。方法、系统、产品或设备不必限于清楚地列出的那些步骤或单元,而是可包括没有清楚地列出的或对于这些过程、方法、产品或设备固有的其它步骤或单元。The terms "first", "second", etc. in the description, embodiments and claims of the present application and the drawings are only used for the purpose of distinguishing and describing, and should not be construed as indicating or implying relative importance, nor should they be construed as indicating or implied order. Furthermore, the terms "comprising" and "having" and any variations thereof, are intended to cover non-exclusive inclusion, eg, comprising a series of steps or elements. A method, system, product or device is not necessarily limited to those steps or units expressly listed, but may include other steps or units not expressly listed or inherent to the process, method, product or device.
应当理解,在本申请中,“至少一个(项)”是指一个或者多个,“多个”是指两个或两个以上。“和/或”,用于描述关联对象的关联关系,表示可以存在三种关系,例如,“A和/或B”可以表示:只存在A,只存在B以及同时存在A和B三种情况,其中A,B可以是单数或者复数。字符“/”一般表示前后关联对象是一种“或”的关系。“以下至少一项(个)”或其类似表达,是指这些项中的任意组合,包括单项(个)或复数项(个)的任意组合。例如,a,b或c中的至少一项(个),可以表示:a,b,c,“a和b”,“a和c”,“b和c”,或“a和b和c”,其中a,b,c可以是单个,也可以是多个。It should be understood that, in this application, "at least one (item)" refers to one or more, and "a plurality" refers to two or more. "And/or" is used to describe the relationship between related objects, indicating that there can be three kinds of relationships, for example, "A and/or B" can mean: only A, only B, and both A and B exist , where A and B can be singular or plural. The character "/" generally indicates that the associated objects are an "or" relationship. "At least one item(s) below" or similar expressions thereof refer to any combination of these items, including any combination of single item(s) or plural items(s). For example, at least one (a) of a, b or c, can mean: a, b, c, "a and b", "a and c", "b and c", or "a and b and c" ", where a, b, c can be single or multiple.
图1为相关技术TWS耳机的一个示例性的结构示意图。如图1所示,该TWS耳机包括三类麦克风,即主麦克风、前馈(feedforward,FF)麦克风和反馈(feedback,FB)麦克风,其中,主麦克风用于拾取通话中的人声,FF麦克风用于拾取外界噪声信号,FB麦克风用于拾取耳道内部的残留噪声信号;TWS耳机还包括动圈扬声器,该动圈扬声器用于播放处理后的音乐或通话语音。FIG. 1 is a schematic diagram of an exemplary structure of a related art TWS headset. As shown in FIG. 1 , the TWS headset includes three types of microphones, namely a main microphone, a feedforward (FF) microphone and a feedback (FB) microphone, wherein the main microphone is used to pick up the human voice in the call, and the FF microphone It is used to pick up the external noise signal, and the FB microphone is used to pick up the residual noise signal inside the ear canal; the TWS headset also includes a moving coil speaker, which is used to play the processed music or call voice.
基于图1所示的结构,图2a为相关技术TWS耳机的一个示例性的结构示意图。如图2a所示,主麦克风、FF麦克风分别和人声增强滤波器的输入端连接,人声增强滤波器的输出端和音频源(包括音乐和通话语音)经叠加器1的叠加后得到下行链路信号;下行链路信号传输至叠加器2的一个输入端;下行链路信号还传输至次级路径(secondary path,SP)滤波器的输入端;FF麦克风还和前馈滤波器的输入端连接,前馈滤波器的输出端连接至叠加器2的另一个输入端;FB麦克风和叠加器3的一个输入端连接,SP滤波器的输出端和叠加器3的另一个输入端连接,叠加器3的输出端和反馈滤波器的输入端连接,反馈滤波器的输出端连接至叠加器2的第三个输入端;叠加器2的输出端和数模转换器(digital to analog converter,DAC)的输入端连接,DAC的输出端和动圈扬声器连接。Based on the structure shown in FIG. 1 , FIG. 2 a is a schematic structural diagram of an exemplary TWS headset in the related art. As shown in Figure 2a, the main microphone and the FF microphone are respectively connected to the input end of the vocal enhancement filter, and the output end of the vocal enhancement filter and the audio source (including music and call voice) are superimposed by the stacker 1 to obtain a downlink link signal; the downlink signal is transmitted to one input of the stacker 2; the downlink signal is also transmitted to the input of the secondary path (SP) filter; the input of the FF microphone and the feedforward filter The output end of the feedforward filter is connected to the other input end of the stacker 2; the FB microphone is connected to one input end of the stacker 3, the output end of the SP filter is connected to the other input end of the stacker 3, The output end of the stacker 3 is connected to the input end of the feedback filter, and the output end of the feedback filter is connected to the third input end of the stacker 2; the output end of the stacker 2 is connected to the digital to analog converter (digital to analog converter, The input end of the DAC) is connected, and the output end of the DAC is connected to the moving coil speaker.
主动降噪&透传&增强听力(ANC&HT&AH,AHA)联合控制器分别和人声增强滤波器、前馈滤波器、反馈滤波器以及SP滤波器连接。AHA联合控制器的作用:为了保证TWS耳机正常稳定的运行,需要通过TWS耳机本身来处理一些异常情况,因此需要一个AHA联合控制器。AHA联合控制器通过对多个信号进行分析,判断当前ANC、HT或增强听力(augmented hearing,AH)的状态,然后判断是否发生啸叫、削波等异常情况,从而实施相应的处理,通过控制前述各个滤波器参数值来实现系统的控制。Active Noise Cancellation & Transparent Transmission & Enhanced Hearing (ANC&HT&AH, AHA) joint controllers are respectively connected with vocal enhancement filter, feedforward filter, feedback filter and SP filter. The role of the AHA joint controller: In order to ensure the normal and stable operation of the TWS headset, some abnormal situations need to be handled by the TWS headset itself, so an AHA joint controller is required. The AHA joint controller analyzes multiple signals to determine the current state of ANC, HT or augmented hearing (AH), and then determines whether abnormal conditions such as whistling and clipping occur, so as to implement corresponding processing. The aforementioned filter parameter values are used to realize the control of the system.
上述结构中,人声增强滤波器、音频源和AHA联合控制器设置于数字信号处理(digital signal process,DSP)芯片中,前馈滤波器、反馈滤波器、SP滤波器和DAC设置于编译码器(coder-decoder,CODEC)中。In the above structure, the vocal enhancement filter, the audio source and the AHA joint controller are arranged in the digital signal processing (digital signal process, DSP) chip, and the feedforward filter, the feedback filter, the SP filter and the DAC are arranged in the codec. in the encoder (coder-decoder, CODEC).
基于图1所示的结构,图2b为相关技术TWS耳机的一个示例性的结构示意图。如图2b所示,与图2a所示的结构区别在于,人声增强滤波器从DSP芯片移到了CODEC中。Based on the structure shown in FIG. 1 , FIG. 2 b is a schematic structural diagram of an exemplary TWS headset of the related art. As shown in Figure 2b, the difference from the structure shown in Figure 2a is that the vocal enhancement filter is moved from the DSP chip to the CODEC.
基于图1~图2b所示的结构,TWS耳机可以实现如下功能:Based on the structures shown in Figures 1 to 2b, the TWS headset can implement the following functions:
主动降噪:FF麦克风拾取外界噪声信号,通过前馈滤波器生成前馈降噪信号;FB麦克风拾取耳道内部的残留噪声信号,通过反馈滤波器生成反馈降噪信号。前馈降噪信号、反馈降噪信号和下行链路信号叠加成最终的扬声器驱动信号,通过DAC进行数模转换后生成模拟扬声器驱动信号。该模拟扬声器驱动信号在动圈扬声器中以反向方式播放时,与空间中的音频信号抵消,即可得到抵消噪声的模拟音频信号。这样一定频带范围内的低频噪声可以被消除,从而实现降噪的目的。Active noise reduction: The FF microphone picks up the external noise signal and generates the feedforward noise reduction signal through the feedforward filter; the FB microphone picks up the residual noise signal inside the ear canal, and generates the feedback noise reduction signal through the feedback filter. The feedforward noise reduction signal, the feedback noise reduction signal and the downlink signal are superimposed to form the final speaker drive signal, and the analog speaker drive signal is generated after digital-to-analog conversion by the DAC. When the analog loudspeaker driving signal is played in a reverse manner in the moving coil loudspeaker, the analog audio signal can be obtained by canceling the audio signal in the space. In this way, low-frequency noise within a certain frequency band can be eliminated, thereby achieving the purpose of noise reduction.
透传:FF麦克风拾取外界噪声信号,通过前馈滤波器生成前馈补偿信号;FB麦克风拾取耳道内部的声信号,通过反馈滤波器生成反馈抑制信号。前馈补偿信号、反馈抑制信号和下行链路信号叠加成最终的扬声器驱动信号,通过DAC进行数模转换后生成模拟扬声器驱动信号。该模拟扬声器驱动信号在动圈扬声器中播放,可以对低频噪声进行减弱或抑制、对高频信号进行补偿,即可得到实现声学补偿的模拟音频信号。这样主动发声听音(佩戴者自己听到自己说话的声音)的“闭塞”效应或身体部位振动发声(例如配戴耳机走路、咀嚼、挠头等声音)的“听诊器”效应被减弱或抑制,被动听音(例如环境中的人声或音乐等)在一定频带范围内的高频声音被补偿,从而实现透传目的。Transparent transmission: The FF microphone picks up the external noise signal and generates the feedforward compensation signal through the feedforward filter; the FB microphone picks up the acoustic signal inside the ear canal and generates the feedback suppression signal through the feedback filter. The feedforward compensation signal, the feedback suppression signal and the downlink signal are superimposed to form the final speaker drive signal, and the analog speaker drive signal is generated after digital-to-analog conversion by the DAC. The analog speaker driving signal is played in the moving coil speaker, which can weaken or suppress low-frequency noise and compensate high-frequency signals, so as to obtain an analog audio signal that realizes acoustic compensation. In this way, the "occlusion" effect of active voice listening (the wearer's own voice heard by himself) or the "stethoscope" effect of body part vibration (such as walking, chewing, scratching your head, etc. with headphones) is weakened or suppressed, passive The high-frequency sound within a certain frequency band of listening sounds (such as human voice or music in the environment) is compensated, so as to achieve the purpose of transparent transmission.
需要说明的是,上述叠加得到最终的扬声器驱动信号的步骤中,被叠加的信号可以包括前馈补偿信号、反馈抑制信号和下行链路信号三者,也可以包括前馈补偿信号、反馈抑制信号和下行链路信号的其中之二或之一。例如,在用户听音乐的过程中进行降噪处理,被叠加的信号可以包括前述三者;用户只是想通过耳机进行降噪使得处于一个安静的环境中时,被叠加的信号可以包括馈补偿信号和反馈抑制信号,而不包括下行链路信号。It should be noted that in the above step of superimposing to obtain the final speaker driving signal, the superimposed signal may include the feedforward compensation signal, the feedback suppression signal and the downlink signal, or may include the feedforward compensation signal and the feedback suppression signal. and either or both of the downlink signal. For example, when noise reduction is performed while the user is listening to music, the superimposed signal may include the aforementioned three; when the user just wants to perform noise reduction through headphones so as to be in a quiet environment, the superimposed signal may include the feed compensation signal and feedback suppression signals, excluding downlink signals.
人声增强和音乐/通话并发:在DSP侧实现人声增强功能时,FF麦克风和主麦克风信号被送到人声增强滤波器中处理,得到环境噪声被抑制且人声得到保留的信号。然后将该信号下行到CODEC,与前馈滤波器和反馈滤波器分别输出的信号叠加得到扬声器驱动信号,再经过DAC数模转换后输出模拟扬声器驱动信号供动圈扬声器进行播放。由于动圈扬声器播放出来的声音会被FB麦克风拾取到,为了防止播放出来的声音被反馈滤波器降噪处理掉,可以将估计的动圈扬声器播放的信号从FB麦克风拾取的信号中减掉,这样这部分信号也就不会被反馈滤波器降噪处理,从而实现了人声增强和音乐/通话的并发,还可以在此基础上实现ANC/HT功能。Vocal enhancement and music/call concurrency: When the vocal enhancement function is implemented on the DSP side, the FF microphone and main microphone signals are sent to the vocal enhancement filter for processing to obtain a signal in which ambient noise is suppressed and the human voice is preserved. Then the signal is down-streamed to the CODEC, superimposed with the signals output by the feedforward filter and the feedback filter respectively to obtain the speaker drive signal, and after digital-to-analog conversion by the DAC, the analog speaker drive signal is output for the moving coil speaker to play. Since the sound played by the moving coil speaker will be picked up by the FB microphone, in order to prevent the played sound from being denoised by the feedback filter, the estimated signal played by the moving coil speaker can be subtracted from the signal picked up by the FB microphone, In this way, this part of the signal will not be noise-reduced by the feedback filter, thus realizing the concurrency of vocal enhancement and music/calling, and the ANC/HT function can also be realized on this basis.
图2a中,在DSP中实现人声增强,是有算力开销的保证,降噪效果和处理后的播放效果平稳性较好,但是时延较长,听起来有混响感。图2b中,在CODEC中实现人声增强,算法相对固定和简单,时延较短,混响感较低,但降噪效果有限。In Figure 2a, the realization of vocal enhancement in DSP is a guarantee of computational power overhead. The noise reduction effect and the playback effect after processing are stable, but the delay is long, and it sounds like a reverberation. In Figure 2b, the human voice enhancement is implemented in the CODEC. The algorithm is relatively fixed and simple, the time delay is short, and the reverberation sense is low, but the noise reduction effect is limited.
图1~图2b所示的TWS耳机结构,针对高频音乐,尤其是8.5kHz以上频段的音乐信号,播放音质可能会严重受损,影响音乐的播放效果;针对高频语音,尤其是8.5kHz以上频段的语音信号,是不支持的,导致语音通话的带宽受限。The structure of the TWS earphone shown in Figure 1 to Figure 2b, for high-frequency music, especially for music signals in the frequency band above 8.5kHz, the playback sound quality may be seriously damaged, affecting the playback effect of music; for high-frequency speech, especially for 8.5kHz Voice signals in the above frequency bands are not supported, resulting in limited bandwidth for voice calls.
本申请提供了一种TWS耳机的扬声器结构,可以改善上述技术问题。The present application provides a speaker structure of a TWS earphone, which can improve the above technical problems.
图3为本申请TWS耳机的一个示例性的结构示意图。如图3所示,TWS耳机30包括三类麦克风,即主麦克风31、FF麦克风32和FB麦克风33,其中,主麦克风31用于拾取通话中的人声,FF麦克风32用于拾取外界噪声信号,FB麦克风33用于拾取耳道内部的残留噪声信号。FIG. 3 is a schematic structural diagram of an exemplary TWS headset of the present application. As shown in FIG. 3 , the TWS headset 30 includes three types of microphones, namely the main microphone 31 , the FF microphone 32 and the FB microphone 33 , wherein the main microphone 31 is used to pick up the human voice during the call, and the FF microphone 32 is used to pick up the external noise signal , the FB microphone 33 is used to pick up the residual noise signal inside the ear canal.
TWS耳机30还包括动圈扬声器34a和动铁扬声器34b,动圈扬声器34a的主要工作频段是小于8.5kHz,动铁扬声器34b的主要工作频段是大于8.5kHz。需要说明的是,本申请对扬声器的个数不做具体限定,只要是至少两个即可,该至少两个扬声器,可以各个扬声器的主要工作频段均不相同,或者也可以有部分扬声器的主要工作频段是相同的,另一部分扬声器的主要工作频段与前述部分扬声器的主要工作频段不相同。例如,三个扬声器(1~3),扬声器1的主要工作频段和扬声器2的主要工作频段相同,扬声器3的主要工作频段与扬声器1和扬声器2的主要工作频段不相同。又例如,三个扬声器(1~3),扬声器1、扬声器2和扬声器3的主要工作频段均不相同。可选的,至少两个扬声器也可以包括动圈扬声器、动铁扬声器、微机电系统(micro-electro-mechanical systems,MEMS)MEMS扬声器和平面振动膜片,其中,动圈扬声器的主要工作频段是小于8.5kHz,动铁扬声器的主要工作频段是大于8.5kHz,MEMS扬声器的主要工作频段取决于应用形态,入耳式耳机的主要工作频段是全频带,头戴式耳机的主要工作频段在7kHz以下偏弱,主要作用频带在7kHz以上的高频,平面振动膜片的主要工作频段是10kHz~20kHz。The TWS earphone 30 also includes a moving coil speaker 34a and a moving iron speaker 34b. The main working frequency band of the moving coil speaker 34a is less than 8.5kHz, and the main working frequency band of the moving iron speaker 34b is greater than 8.5kHz. It should be noted that this application does not specifically limit the number of speakers, as long as there are at least two speakers, and the at least two speakers may have different main working frequency bands of each speaker, or may also have the main operating frequency bands of some speakers. The working frequency band is the same, and the main working frequency band of another part of the speakers is different from the main working frequency band of the aforementioned part of the speakers. For example, for three speakers (1-3), the main working frequency band of speaker 1 is the same as that of speaker 2, and the main working frequency band of speaker 3 is different from that of speaker 1 and speaker 2. For another example, among the three speakers (1-3), the main working frequency bands of the speaker 1, the speaker 2 and the speaker 3 are all different. Optionally, the at least two speakers may also include a moving coil speaker, a moving iron speaker, a micro-electro-mechanical systems (MEMS) MEMS speaker and a planar vibrating diaphragm, wherein the main working frequency band of the moving coil speaker is Less than 8.5kHz, the main working frequency band of the moving iron speaker is greater than 8.5kHz, the main working frequency band of the MEMS speaker depends on the application form, the main working frequency band of the in-ear headphone is the full frequency band, and the main working frequency band of the headphone is below 7kHz. Weak, the main working frequency band is high frequency above 7kHz, and the main working frequency band of the plane vibrating diaphragm is 10kHz~20kHz.
基于图3所示的TWS耳机,图4为本申请TWS耳机的一个示例性的结构示意图。如图4所示,TWS耳机40包括:音频信号处理通路41、分频器42和至少两个扬声器43;其中,音频信号处理通路41的输出端和分频器42的输入端连接;分频器42的输出端和至少两个扬声器43连接。Based on the TWS headset shown in FIG. 3 , FIG. 4 is a schematic structural diagram of an exemplary TWS headset of the present application. As shown in FIG. 4 , the TWS earphone 40 includes: an audio signal processing path 41, a frequency divider 42 and at least two speakers 43; wherein, the output end of the audio signal processing path 41 is connected to the input end of the frequency divider 42; frequency division The output end of the speaker 42 is connected to at least two speakers 43 .
音频信号处理通路41,被配置为对音频源进行降噪或透传处理后输出扬声器驱动信号;音频源为原始的音乐或通话语音;或者,音频源包括经人声增强处理的语音信号和原始的音乐或通话语音。The audio signal processing path 41 is configured to output a speaker drive signal after performing noise reduction or transparent transmission processing on the audio source; the audio source is the original music or call voice; music or call voice.
分频器42,被配置为将扬声器驱动信号分频成至少两个频段的子音频信号,至少两个频段和至少两个扬声器43的主要工作频段对应;至少两个频段中的相邻频段部分重叠,或者,至少两个频段中的相邻频段不重叠。至少两个扬声器43,被配置为播放对应的子音频信号。The frequency divider 42 is configured to divide the speaker drive signal into sub-audio signals of at least two frequency bands, the at least two frequency bands correspond to the main working frequency bands of the at least two speakers 43; the adjacent frequency band parts in the at least two frequency bands Overlapping, or at least two adjacent bands in the two bands do not overlap. At least two speakers 43 are configured to play corresponding sub audio signals.
经处理后的扬声器驱动信号的频段与音频源的频段相对应,可能包含低中高整个频段,但由于单个扬声器的主要工作频段可能只能覆盖低中高的其中部分频段,导致单个扬声器无法在整个频段体现出高音质来。The frequency band of the processed speaker driving signal corresponds to the frequency band of the audio source, and may include the whole frequency range of low, medium and high. However, because the main working frequency band of a single speaker may only cover part of the low, middle and high frequency bands, a single speaker cannot operate in the entire frequency band. reflect high-quality sound.
本申请的分频器42可以被配置为基于至少两个扬声器的主要工作频段对扬声器驱动信号进行分频,得到与至少两个扬声器的主要工作频段分别对应的至少两个频段的子音频信号,然后让各个扬声器分别播放对应频段的子音频信号,从而使扬声器在播放传输给其的子音频信号时维持最佳频响。可以通过设置分频器42的参数以控制分频器42按照预设的方式对扬声器驱动信号进行分频。The frequency divider 42 of the present application may be configured to divide the frequency of the speaker drive signal based on the main operating frequency bands of the at least two speakers, to obtain sub-audio signals of at least two frequency bands corresponding to the main operating frequency bands of the at least two speakers, respectively, Then let each speaker play the sub audio signal of the corresponding frequency band, so that the speaker maintains the best frequency response when playing the sub audio signal transmitted to it. The frequency divider 42 can be controlled to divide the frequency of the speaker driving signal in a preset manner by setting the parameters of the frequency divider 42 .
在一种可能的实现方式中,上述音频信号处理通路41可以采用图2a或图2b所示的结构,此时SP滤波器设置于编译码器(coder-decoder,CODEC)中,通过CODEC获取 固定SP滤波器。In a possible implementation manner, the above-mentioned audio signal processing path 41 may adopt the structure shown in FIG. 2 a or FIG. 2 b . At this time, the SP filter is set in the codec (coder-decoder, CODEC), and the fixed value is obtained through the CODEC. SP filter.
CODEC可以根据预先设定的扬声器驱动信号和FB麦克风拾取到的耳道信号获取估计SP滤波器,耳道信号包括耳道内部的残留噪声信号;当经估计SP滤波器得到的信号和耳道信号的差值信号处于设定范围内时,将估计SP滤波器确定为固定SP滤波器。The CODEC can obtain the estimated SP filter according to the preset speaker drive signal and the ear canal signal picked up by the FB microphone. The ear canal signal includes the residual noise signal inside the ear canal; when the signal obtained by the estimated SP filter and the ear canal signal When the difference signal of is within the set range, the estimated SP filter is determined as a fixed SP filter.
图5a为本申请固定SP滤波器的一个示例性的获取流程图,如图5a所示,预先设定一个扬声器驱动信号x[n],经过分频器后推动至少两个扬声器发出声音,该声音传到FB麦克风,被FB麦克风拾取到,这些声音转换为数字信号y[n]。Fig. 5a is an exemplary acquisition flow chart of the fixed SP filter of the present application. As shown in Fig. 5a, a speaker driving signal x[n] is preset, and after passing through the frequency divider, at least two speakers are pushed to emit sound. The sound is transmitted to the FB microphone, picked up by the FB microphone, and converted into a digital signal y[n].
通常SP滤波器的传递函数可以假设是一个高阶FIR滤波器,可以利用最小均方误差(LMS)算法进行迭代建模,真实的SP滤波器的传递函数S(z)是未知的,但是其所有的信息都包含在扬声器驱动信号x[n]和FB麦克风拾取的数字信号y[n]中,因此可以通过一个高阶FIR滤波器
Figure PCTCN2022083464-appb-000001
作为一个估计SP滤波器去模拟S(z)。将x[n]输入
Figure PCTCN2022083464-appb-000002
得到
Figure PCTCN2022083464-appb-000003
对y[n]和
Figure PCTCN2022083464-appb-000004
求差得到误差信号e[n],当e[n]处于设定范围内时,就认为算法收敛到了令人满意的状态,可以认为此时的高阶FIR滤波器
Figure PCTCN2022083464-appb-000005
与真实的S(z)近似相等,可以将其确定为是固定SP滤波器的传递函数。上述过程可以利用最小均方算法(least mean square,LMS)进行迭代建模。
Usually, the transfer function of the SP filter can be assumed to be a high-order FIR filter, which can be iteratively modeled by the least mean square error (LMS) algorithm. The transfer function S(z) of the real SP filter is unknown, but its All the information is contained in the speaker drive signal x[n] and the digital signal y[n] picked up by the FB microphone, so it can be passed through a higher order FIR filter
Figure PCTCN2022083464-appb-000001
Model S(z) as an estimated SP filter. input x[n]
Figure PCTCN2022083464-appb-000002
get
Figure PCTCN2022083464-appb-000003
for y[n] and
Figure PCTCN2022083464-appb-000004
Calculate the difference to get the error signal e[n]. When e[n] is within the set range, it is considered that the algorithm has converged to a satisfactory state, and it can be considered that the high-order FIR filter at this time
Figure PCTCN2022083464-appb-000005
It is approximately equal to the true S(z), which can be determined to be the transfer function of the fixed SP filter. The above process can be iteratively modeled using a least mean square (Least mean square, LMS).
进一步的,除了利用有限长单位冲激响应(finite impulse response,FIR)滤波器模拟得到固定SP滤波器外,还可以利用CODEC侧的无限脉冲响应(infinite impulse response,IIR)滤波器模拟得到固定SP滤波器。图5b为本申请固定SP滤波器的一个示例性的获取流程图,如图5b所示,当经估计SP滤波器得到的信号和耳道信号的差值信号处于设定范围内时,根据估计SP滤波器的目标频响和预先设置的分频需求获取级联二阶滤波器的参数;根据级联二阶滤波器的参数获取SP级联二阶滤波器,将SP级联二阶滤波器作为固定SP滤波器。Further, in addition to using a finite-length unit impulse response (FIR) filter to simulate a fixed SP filter, the infinite impulse response (IIR) filter on the CODEC side can also be used to simulate a fixed SP filter. filter. Fig. 5b is an exemplary acquisition flow chart of the fixed SP filter of the present application. As shown in Fig. 5b, when the difference signal between the signal obtained by the estimated SP filter and the ear canal signal is within the set range, according to the estimation The parameters of the cascaded second-order filter are obtained from the target frequency response of the SP filter and the preset frequency division requirements; the SP cascaded second-order filter is obtained according to the parameters of the cascaded second-order filter, and the SP cascaded second-order filter is as a fixed SP filter.
(1)CODEC根据已经求取的FIR滤波器
Figure PCTCN2022083464-appb-000006
计算目标频响。
(1) CODEC is based on the FIR filter that has been obtained
Figure PCTCN2022083464-appb-000006
Calculate the target frequency response.
s=[s 0 s 1…s N-1],N表示滤波器阶数,k表示频点数。采用公式(1)计算对应的目标响应: s=[s 0 s 1 …s N-1 ], N represents the filter order, and k represents the frequency point number. The corresponding target response is calculated using formula (1):
Figure PCTCN2022083464-appb-000007
Figure PCTCN2022083464-appb-000007
(2)对目标频响进行分频处理,并在不同的频带设置不同的权重系数ω[k]。(2) Perform frequency division processing on the target frequency response, and set different weight coefficients ω[k] in different frequency bands.
(3)从FIR滤波器到IIR滤波器的转化处理获取IIR滤波器参数,该处理过程要求二者的目标响应尽可能的一致。(3) The IIR filter parameters are obtained from the conversion process of the FIR filter to the IIR filter, and the process requires the target responses of the two to be as consistent as possible.
从数学角度来讲就是要求最优化公式(2)的目标函数,以求取多个IIR滤波器的系数:From a mathematical point of view, it is required to optimize the objective function of formula (2) to obtain the coefficients of multiple IIR filters:
Figure PCTCN2022083464-appb-000008
Figure PCTCN2022083464-appb-000008
其中,B[k]、A[k]分别是IIR滤波器系数b、a的复频响应,其计算方法和公式(1)相同。Among them, B[k] and A[k] are the complex frequency responses of the IIR filter coefficients b and a respectively, and the calculation method is the same as that of formula (1).
(4)IIR滤波器实现(4) IIR filter implementation
利用最优化算法,即可找到一组最小均方下的IIR滤波器参数,由这组IIR滤波器参 数得到的IIR滤波器即可当做最终的固定SP滤波器,放到CODEC中进行硬化实现。Using the optimization algorithm, a set of IIR filter parameters under the minimum mean square can be found, and the IIR filter obtained from this set of IIR filter parameters can be used as the final fixed SP filter and put into the CODEC for hardening implementation.
在一种可能的实现方式中,上述音频信号处理通路41可以采用图6所示的结构,此时SP滤波器设置于数字信号处理(digital signal process,DSP)芯片中,通过DSP芯片获取自适应SP滤波器。In a possible implementation manner, the above-mentioned audio signal processing path 41 may adopt the structure shown in FIG. 6 . At this time, the SP filter is set in a digital signal processing (digital signal process, DSP) chip, and the adaptive filter is obtained through the DSP chip. SP filter.
DSP芯片可以获取实时噪音信号,根据音频源(音频源为原始的音乐或通话语音;或者,音频源包括经人声增强处理的语音信号和原始的音乐或通话语音)和实时噪音信号获取估计SP滤波器,当经估计SP滤波器得到的信号和实时噪音信号的差值信号处于设定范围内时,将估计SP滤波器确定为自适应SP滤波器。The DSP chip can obtain the real-time noise signal, and obtain the estimated SP according to the audio source (the audio source is the original music or voice of the call; or, the audio source includes the voice signal that has been enhanced by the human voice and the original music or voice of the call) and the real-time noise signal. The filter determines the estimated SP filter as an adaptive SP filter when the difference signal between the signal obtained by the estimated SP filter and the real-time noise signal is within the set range.
上述获取实时噪音信号可以是获取FF麦克风拾取的外界信号(外界信号包括外界噪声信号和音乐或通话语音)和FB麦克风拾取到的耳道信号(耳道信号包括耳道内部的残留噪声信号和音乐或通话语音),再获取主麦克风拾取到的语音信号,将语音信号减去外界信号和耳道信号得到实时噪音信号。The above-mentioned acquisition of the real-time noise signal may be to acquire the external signal picked up by the FF microphone (the external signal includes the external noise signal and music or voice of a call) and the ear canal signal picked up by the FB microphone (the ear canal signal includes the residual noise signal inside the ear canal and music. Or call voice), and then obtain the voice signal picked up by the main microphone, and subtract the external signal and the ear canal signal from the voice signal to obtain a real-time noise signal.
本实施例与上述获取固定SP滤波器的实施例的区别在于:x[n]表示音频源,而y[n]表示实时噪音信号,即x[n]=dnlink[n],y[n]=fb[n]-ff[n]*A(z)-fb[n-1]*C(z),fb[n]表示主麦克风拾取到的语音信号,ff[n]*A(z)表示FF麦克风拾取的外界信号,fb[n-1]*C(z)表示FB麦克风拾取到的耳道信号。本实施例也可以采用图5a所示的流程获取一个对应于最小误差信号e[n]时的高阶FIR滤波器
Figure PCTCN2022083464-appb-000009
由于音频源是实时的,TWS耳机的降噪或透传功能会与音乐或者通话语音并发,此时FB麦克风拾音的信号不能直接进行SP滤波器的估计,需要去掉其他信号的影响之后再进行建模分析,即获取实时噪音信号,因此此时获取的SP滤波器可以自适应于音频源和噪音信号的实时情况而非固定的。
The difference between this embodiment and the above-mentioned embodiment for obtaining a fixed SP filter is that: x[n] represents an audio source, and y[n] represents a real-time noise signal, that is, x[n]=dnlink[n], y[n] =fb[n]-ff[n]*A(z)-fb[n-1]*C(z), fb[n] represents the voice signal picked up by the main microphone, ff[n]*A(z) It represents the external signal picked up by the FF microphone, and fb[n-1]*C(z) represents the ear canal signal picked up by the FB microphone. In this embodiment, the process shown in FIG. 5a can also be used to obtain a high-order FIR filter corresponding to the minimum error signal e[n]
Figure PCTCN2022083464-appb-000009
Since the audio source is real-time, the noise reduction or transparent transmission function of the TWS headset will be concurrent with the music or the voice of the call. At this time, the signal picked up by the FB microphone cannot be directly estimated by the SP filter, and it is necessary to remove the influence of other signals. Modeling analysis is to obtain real-time noise signal, so the SP filter obtained at this time can be adapted to the real-time situation of the audio source and noise signal instead of being fixed.
本申请通过上述方法确定SP滤波器(包括固定SP滤波器或者自适应SP滤波器),可以在播放音乐或通过过程中,既实现降噪或透传功能,还可以防止透传或降噪技术将音乐或通话声音也消除掉。In the present application, the SP filter (including fixed SP filter or adaptive SP filter) is determined by the above method, which can not only realize the function of noise reduction or transparent transmission, but also prevent the transparent transmission or noise reduction technology in the process of playing music or passing through. Also remove music or call sounds.
示例性的,假设TWS耳机包括两个扬声器:动圈扬声器和动铁扬声器。虚线表示动圈扬声器的频响曲线,单点线表示动铁扬声器的频响曲线,实线表示分频线。Exemplarily, it is assumed that the TWS earphone includes two speakers: a moving coil speaker and a moving iron speaker. The dotted line represents the frequency response curve of the moving coil speaker, the single-dotted line represents the frequency response curve of the moving iron speaker, and the solid line represents the crossover line.
图7a为本申请信号分频的一个示例性的示意图,如图7a所示,分频器42被配置为在动圈扬声器的非主要工作频段对其进行衰减,保留动圈扬声器在主要工作频段的功率,在动铁扬声器的非主要工作频段对其进行衰减,保留动铁扬声器在主要工作频段的功率,并且在动圈扬声器和动铁扬声器各自衰减频段内的交点频点处进分频,得到两个频段的子音频信号。FIG. 7a is an exemplary schematic diagram of signal frequency division of the present application. As shown in FIG. 7a, the frequency divider 42 is configured to attenuate the dynamic speaker in the non-main operating frequency band, and keep the dynamic speaker in the main operating frequency band. The power of the moving iron speaker is attenuated in the non-main working frequency band of the moving iron speaker, the power of the moving iron speaker in the main working frequency band is retained, and the frequency is divided at the intersection frequency of the attenuation frequency band of the moving coil speaker and the moving iron speaker. Get the sub audio signal of the two frequency bands.
图7b为本申请信号分频的一个示例性的示意图,如图7b所示,分频器42被配置为在动铁扬声器的非主要工作频段对齐进行衰减,保留动铁扬声器在主要工作频段的功率,并且在动铁扬声器的衰减频段内的某频点处进分频,得到两个频段的子音频信号。FIG. 7b is an exemplary schematic diagram of signal frequency division of the application. As shown in FIG. 7b, the frequency divider 42 is configured to align and attenuate in the non-main working frequency band of the moving iron speaker, and retain the moving iron speaker in the main working frequency band. power, and divide the frequency at a certain frequency point in the attenuation frequency band of the moving iron speaker to obtain sub-audio signals of two frequency bands.
图7c为本申请信号分频的一个示例性的示意图,如图7c所示,分频器42被配置为在动圈扬声器的非主要工作频段对其进行两段式衰减,保留动圈扬声器在主要工作频段的功率,在动铁扬声器的非主要工作频段对其进行两段式衰减,保留动铁扬声器在主要工作频段的功率,并且在动铁扬声器的第一段衰减频段和第二段衰减频段的转变频点处进行一次分频,在动圈扬声器的第一段衰减频段和第二段衰减频段的转变频点处进行一次分频, 得到三个频段的子音频信号。Fig. 7c is an exemplary schematic diagram of the frequency division of the signal of the present application. As shown in Fig. 7c, the frequency divider 42 is configured to perform two-stage attenuation in the non-main operating frequency band of the moving coil speaker, leaving the moving coil speaker at The power of the main working frequency band is attenuated in two stages in the non-main working frequency band of the moving iron speaker, retaining the power of the moving iron speaker in the main working frequency band, and attenuating the first and second attenuation bands of the moving iron speaker A frequency division is performed at the transition frequency point of the frequency band, and a frequency division is performed at the transition frequency point of the first attenuation frequency band and the second attenuation frequency band of the moving coil speaker to obtain sub audio signals of three frequency bands.
示例性的,假设TWS耳机包括三个扬声器:动圈扬声器、动铁扬声器和MEMS扬声器。虚线表示动圈扬声器的频响曲线,单点线表示动铁扬声器的频响曲线,双点线表示MEMS扬声器的频响曲线,实线表示分频线。Exemplarily, it is assumed that the TWS earphone includes three speakers: a moving coil speaker, a moving iron speaker, and a MEMS speaker. The dotted line represents the frequency response curve of the moving coil speaker, the single-dotted line represents the frequency response curve of the moving iron speaker, the double-dotted line represents the frequency response curve of the MEMS speaker, and the solid line represents the crossover line.
图7d为本申请信号分频的一个示例性的示意图,如图7d所示,在图7c所示示例的基础上,分频器42被配置为在MEMS扬声器的非主要工作频段对其进行衰减,保留MEMS扬声器在主要工作频段的功率,并且在动铁扬声器和MEMS扬声器各自衰减频段内的交点频点处进一次分频,共得到四个频段的子音频信号。Fig. 7d is an exemplary schematic diagram of the frequency division of the signal of the present application. As shown in Fig. 7d, on the basis of the example shown in Fig. 7c, the frequency divider 42 is configured to attenuate the MEMS speaker in the non-main operating frequency band , retain the power of the MEMS speaker in the main working frequency band, and perform a frequency division at the intersection frequency point in the respective attenuation frequency bands of the moving iron speaker and the MEMS speaker, and obtain a total of four frequency bands of sub-audio signals.
需要说明的是,图7a~图7d所示的示例是分频器42对扬声器驱动信号进行分频的几个示例,本申请并不限定分频器42的具体分频方式。It should be noted that the examples shown in FIGS. 7 a to 7 d are several examples in which the frequency divider 42 divides the speaker driving signal, and the present application does not limit the specific frequency division method of the frequency divider 42 .
本申请的TWS耳机中设置了至少两个扬声器,该至少两个扬声器的主要工作频段不完全相同。分频器可以将扬声器驱动信号分频成至少两个频段的子音频信号,该至少两个频段中的相邻频段部分重叠或者不重叠,这样将各个子音频信号分别传输至频段匹配的扬声器,前述频段匹配可以是指扬声器的主要工作频段覆盖了传输给其的子音频信号的频段,这样扬声器在播放传输给其的子音频信号时维持最佳频响,既可以在音频源的各个频段上体现出高音质,又可以支持超带宽的语音通话。At least two speakers are set in the TWS earphone of the present application, and the main working frequency bands of the at least two speakers are not identical. The frequency divider can divide the speaker drive signal into sub-audio signals of at least two frequency bands, and the adjacent frequency bands in the at least two frequency bands partially overlap or do not overlap, so that each sub-audio signal is respectively transmitted to the speakers with matching frequency bands, The aforementioned frequency band matching can mean that the main working frequency band of the speaker covers the frequency band of the sub-audio signal transmitted to it, so that the speaker maintains the best frequency response when playing the sub-audio signal transmitted to it. It reflects high sound quality and supports ultra-bandwidth voice calls.
在一种可能的实现方式中,图8a为本申请TWS耳机的一个示例性的结构示意图。如图8a所示,在图4所示结构的基础上,TWS耳机40还包括:第一DAC 44。该第一DAC44的输入端和音频信号处理通路41的输出端连接,第一DAC 44的输出端和分频器42的输入端连接。In a possible implementation manner, FIG. 8a is a schematic structural diagram of an exemplary TWS headset of the present application. As shown in FIG. 8a , on the basis of the structure shown in FIG. 4 , the TWS earphone 40 further includes: a first DAC 44 . The input end of the first DAC 44 is connected with the output end of the audio signal processing path 41, and the output end of the first DAC 44 is connected with the input end of the frequency divider 42.
第一DAC 44,被配置为将扬声器驱动信号从数字形式转换成模拟形式。相应的,分频器42为模拟分频器。The first DAC 44 is configured to convert the speaker drive signal from digital to analog. Correspondingly, the frequency divider 42 is an analog frequency divider.
图2a或图2b所示的结构中,在得到扬声器驱动信号后,如果该信号没有经过DAC之前,均是数字形式的,但扬声器播放的信号需要是模拟形式的,因此可以先通过第一DAC将数字形式的扬声器驱动信号转换成模拟形式的扬声器驱动信号,然后通过模拟分频器将模拟形式的扬声器驱动信号分频成至少两个频段的子音频信号。In the structure shown in Figure 2a or Figure 2b, after the speaker driving signal is obtained, if the signal does not pass through the DAC, it is in digital form, but the signal played by the speaker needs to be in analog form, so it can pass the first DAC first. The speaker driving signal in digital form is converted into a speaker driving signal in analog form, and then the analog frequency divider is used to divide the frequency of the speaker driving signal in analog form into sub audio signals of at least two frequency bands.
本实施例采用的是先转换后分频的结构。This embodiment adopts a structure of converting first and then dividing the frequency.
示例性的,图8b为本申请TWS耳机的一个示例性的结构示意图。如图8b所示,本实施例的结构是图8a所示结构的一种更详细的实施方式。Exemplarily, FIG. 8b is an exemplary structural schematic diagram of the TWS headset of the present application. As shown in Fig. 8b, the structure of this embodiment is a more detailed implementation of the structure shown in Fig. 8a.
TWS耳机60中的主麦克风601、FF麦克风602分别和人声增强滤波器603的输入端连接,人声增强滤波器603的输出端和音频源604(包括音乐和通话语音)经叠加器1的叠加后得到下行链路信号;下行链路信号传输至叠加器2的一个输入端;下行链路信号还传输至SP滤波器605的输入端;FF麦克风602还和前馈滤波器606的输入端连接,前馈滤波器606的输出端连接至叠加器2的另一个输入端;FB麦克风607和叠加器3的一个输入端连接,SP滤波器605的输出端和叠加器3的另一个输入端连接,叠加器3的输出端和反馈滤波器608的输入端连接,反馈滤波器608的输出端连接至叠加器2的第三个输入端;叠加器2的输出端和数模转换器(DAC)609的输入端连接,DAC 609的输出端和模拟分频器610的输入端连接,模拟分频器610的输出端连接动圈扬声器611a和动铁扬声器611b。AHA联合控制器612分别和人声增强滤波器603、前馈滤波器606、反馈滤波 器608以及SP滤波器605连接。The main microphone 601 and the FF microphone 602 in the TWS earphone 60 are respectively connected to the input end of the vocal enhancement filter 603 , and the output end of the vocal enhancement filter 603 and the audio source 604 (including music and call speech) The downlink signal is obtained after superposition; the downlink signal is transmitted to an input of the stacker 2; the downlink signal is also transmitted to the input of the SP filter 605; connected, the output end of the feedforward filter 606 is connected to the other input end of the stacker 2; the FB microphone 607 is connected to one input end of the stacker 3, and the output end of the SP filter 605 is connected to the other input end of the stacker 3 connected, the output end of the stacker 3 is connected to the input end of the feedback filter 608, and the output end of the feedback filter 608 is connected to the third input end of the stacker 2; the output end of the stacker 2 is connected to the digital-to-analog converter (DAC) ) 609 is connected to the input end, the output end of the DAC 609 is connected to the input end of the analog frequency divider 610, and the output end of the analog frequency divider 610 is connected to the moving coil speaker 611a and the moving iron speaker 611b. The AHA joint controller 612 is connected to the vocal enhancement filter 603, the feedforward filter 606, the feedback filter 608 and the SP filter 605, respectively.
本实施例的TWS耳机60中设置了两个扬声器,即动圈扬声器611a和动铁扬声器611b,动圈扬声器611a的主要工作频段在8.5kHz以下,动铁扬声器611b的主要工作频段在8.5kHz以上,这样可以设置分频器42将模拟形式的扬声器驱动信号分频成8.5kHz以下的子音频信号和8.5kHz以上的子音频信号,动圈扬声器611a播放8.5kHz以下的子音频信号可以维持最佳频响,动铁扬声器611b播放8.5kHz以上的子音频信号可以维持最佳频响,从而使得TWS耳机60既可以在音频源的各个频段上体现出高音质,又可以支持超带宽的语音通话。The TWS earphone 60 in this embodiment is provided with two speakers, namely a moving coil speaker 611a and a moving iron speaker 611b. The main working frequency band of the moving coil speaker 611a is below 8.5 kHz, and the main working frequency band of the moving iron speaker 611b is above 8.5 kHz. , so that the frequency divider 42 can be set to divide the analog speaker drive signal into a sub audio signal below 8.5kHz and a sub audio signal above 8.5kHz, and the moving coil speaker 611a can maintain the optimal playback of the sub audio signal below 8.5kHz Frequency response, the moving iron speaker 611b can maintain the best frequency response by playing sub-audio signals above 8.5kHz, so that the TWS earphone 60 can not only reflect high sound quality in various frequency bands of the audio source, but also support ultra-bandwidth voice calls.
示例性的,图8c为本申请TWS耳机的一个示例性的结构示意图。如图8c所示,本实施例的结构是图8a所示结构的另一种更详细的实施方式。Exemplarily, FIG. 8c is an exemplary schematic structural diagram of the TWS headset of the present application. As shown in FIG. 8c, the structure of this embodiment is another more detailed implementation of the structure shown in FIG. 8a.
与图8b所示结构的区别在于:模拟分频器610的输出端连接动圈扬声器611a、动铁扬声器611b、MEMS扬声器611c和平面振动膜片611d。The difference from the structure shown in FIG. 8b is that the output end of the analog frequency divider 610 is connected to the moving coil speaker 611a, the moving iron speaker 611b, the MEMS speaker 611c and the planar vibrating diaphragm 611d.
本实施例的TWS耳机60中设置了四个扬声器,即动圈扬声器611a、动铁扬声器611b、MEMS扬声器611c和平面振动膜片611d,动圈扬声器611a的主要工作频段在8.5kHz以下,动铁扬声器611b的主要工作频段在8.5kHz以上,MEMS扬声器611c的主要工作频段取决于应用形态,入耳式耳机的主要工作频段是全频带,头戴式耳机的主要工作频段在7kHz以下偏弱,主要作用频带在7kHz以上的高频,平面振动膜片611d的主要工作频段是10kHz~20kHz,这样可以设置分频器42将模拟形式的扬声器驱动信号分频成四个子频段,动圈扬声器611a播放8.5kHz以下的子音频信号可以维持最佳频响,动铁扬声器611b播放8.5kHz以上的子音频信号可以维持最佳频响,MEMS扬声器611c播放7kHz以上的子音频信号可以维持最佳频响,平面振动膜片611d播放10kHz以上的子音频信号可以维持最佳频响,从而使得TWS耳机60既可以在音频源的各个频段上体现出高音质,又可以支持超带宽的语音通话。The TWS earphone 60 in this embodiment is provided with four speakers, namely a moving coil speaker 611a, a moving iron speaker 611b, a MEMS speaker 611c, and a plane vibration diaphragm 611d. The main working frequency of the moving coil speaker 611a is below 8.5 kHz, and the moving iron The main working frequency band of the speaker 611b is above 8.5kHz, and the main working frequency band of the MEMS speaker 611c depends on the application form. For high frequencies above 7 kHz, the main working frequency band of the planar vibrating diaphragm 611d is 10 kHz to 20 kHz. In this way, the frequency divider 42 can be set to divide the analog speaker driving signal into four sub-frequency bands, and the moving coil speaker 611a plays 8.5 kHz. The following sub-audio signals can maintain the best frequency response, the moving iron speaker 611b can maintain the best frequency response by playing the sub-audio signal above 8.5kHz, and the MEMS speaker 611c can maintain the best frequency response by playing the sub-audio signal above 7kHz, and the plane vibrates The diaphragm 611d can maintain the best frequency response by playing the sub-audio signal above 10 kHz, so that the TWS earphone 60 can not only reflect the high sound quality in each frequency band of the audio source, but also support ultra-bandwidth voice calls.
上述结构中,人声增强滤波器603、音频源604和AHA联合控制器612设置于数字信号处理(digital signal process,DSP)芯片中,前馈滤波器606、反馈滤波器608、SP滤波器605和DAC 609设置于编译码器(coder-decoder,CODEC)中。In the above structure, the vocal enhancement filter 603, the audio source 604 and the AHA joint controller 612 are arranged in a digital signal processing (digital signal process, DSP) chip, the feedforward filter 606, the feedback filter 608, and the SP filter 605. And the DAC 609 is set in the codec (coder-decoder, CODEC).
示例性的,图8d为本申请TWS耳机的一个示例性的结构示意图。如图8d所示,本实施例的结构是图8a所示结构的一种更详细的实施方式。Exemplarily, FIG. 8d is an exemplary schematic structural diagram of the TWS headset of the present application. As shown in Fig. 8d, the structure of this embodiment is a more detailed implementation of the structure shown in Fig. 8a.
与图8b所示结构的区别在于:人声增强滤波器603从DSP芯片移到了CODEC中。The difference from the structure shown in Fig. 8b is that the vocal enhancement filter 603 is moved from the DSP chip to the CODEC.
示例性的,图8e为本申请TWS耳机的一个示例性的结构示意图。如图8e所示,本实施例的结构是图8a所示结构的一种更详细的实施方式。Exemplarily, FIG. 8e is a schematic structural diagram of an exemplary TWS headset of the present application. As shown in FIG. 8e, the structure of this embodiment is a more detailed implementation of the structure shown in FIG. 8a.
与图8c所示结构的区别在于:人声增强滤波器603从DSP芯片移到了CODEC中。The difference from the structure shown in Fig. 8c is that the vocal enhancement filter 603 is moved from the DSP chip to the CODEC.
在一种可能的实现方式中,图9a为本申请TWS耳机的一个示例性的结构示意图。如图9a所示,在图4所示结构的基础上,TWS耳机40还包括:至少两个第二DAC 45。该至少两个第二DAC 45的输入端均与分频器42的输出端连接,至少两个第二DAC 45的输出端分别和至少两个扬声器43的其中之一连接。In a possible implementation manner, FIG. 9a is a schematic structural diagram of an exemplary TWS headset of the present application. As shown in FIG. 9a , on the basis of the structure shown in FIG. 4 , the TWS earphone 40 further includes: at least two second DACs 45 . The input ends of the at least two second DACs 45 are connected to the output ends of the frequency divider 42, and the output ends of the at least two second DACs 45 are respectively connected to one of the at least two speakers 43.
第二DAC 45,被配置为将至少两个频段的子音频信号的其中之一从数字形式转换成模拟形式。相应的,分频器42为数字分频器。The second DAC 45 is configured to convert one of the sub-audio signals of at least two frequency bands from a digital form to an analog form. Correspondingly, the frequency divider 42 is a digital frequency divider.
图2a或图2b所示的结构中,在得到扬声器驱动信号后,如果该信号没有经过DAC 之前,均是数字形式的,但扬声器播放的信号需要是模拟形式的,因此可以先通过数字分频器将数字形式的扬声器驱动信号分频成至少两个频段的子音频信号,然后通过至少两个第二DAC分别将传输给其的数字形式的子音频信号转换成模拟形式的模拟形式的子音频信号。In the structure shown in Figure 2a or Figure 2b, after the speaker driving signal is obtained, if the signal does not pass through the DAC, it is in digital form, but the signal played by the speaker needs to be in analog form, so it can be divided by digital frequency first. The device divides the speaker driving signal in digital form into sub audio signals in at least two frequency bands, and then converts the sub audio signals in digital form transmitted to it into sub audio in analog form through at least two second DACs respectively. Signal.
本实施例采用的是先分频后转换的结构。This embodiment adopts the structure of frequency division first and then conversion.
示例性的,图9b为本申请TWS耳机的一个示例性的结构示意图。如图9b所示,本实施例的结构是图9a所示结构的一种更详细的实施方式。Exemplarily, FIG. 9b is an exemplary schematic structural diagram of the TWS headset of the present application. As shown in FIG. 9b, the structure of this embodiment is a more detailed implementation of the structure shown in FIG. 9a.
TWS耳机70中的主麦克风701、FF麦克风702分别和人声增强滤波器703的输入端连接,人声增强滤波器703的输出端和音频源704(包括音乐和通话语音)经叠加器1的叠加后得到下行链路信号;下行链路信号传输至叠加器2的一个输入端;下行链路信号还传输至SP滤波器705的输入端;FF麦克风702还和前馈滤波器706的输入端连接,前馈滤波器706的输出端连接至叠加器2的另一个输入端;FB麦克风707和叠加器3的一个输入端连接,SP滤波器705的输出端和叠加器3的另一个输入端连接,叠加器3的输出端和反馈滤波器708的输入端连接,反馈滤波器708的输出端连接至叠加器2的第三个输入端;叠加器2的输出端和数字分频器709的输入端连接,数字分频器709的输出端连接两个DAC 710a和710b的输入端;DAC 710a的输出端和动扬声器711a连接,DAC 710b的输出端和动铁扬声器711b连接。AHA联合控制器712分别和人声增强滤波器703、前馈滤波器706、反馈滤波器708以及SP滤波器705连接。The main microphone 701 and the FF microphone 702 in the TWS earphone 70 are respectively connected to the input end of the vocal enhancement filter 703 , and the output end of the vocal enhancement filter 703 and the audio source 704 (including music and talking voice) are connected to the The downlink signal is obtained after superposition; the downlink signal is transmitted to one input of the stacker 2; the downlink signal is also transmitted to the input of the SP filter 705; connected, the output end of the feedforward filter 706 is connected to the other input end of the stacker 2; the FB microphone 707 is connected to one input end of the stacker 3, and the output end of the SP filter 705 is connected to the other input end of the stacker 3 connected, the output end of the superimposed device 3 is connected to the input end of the feedback filter 708, and the output end of the feedback filter 708 is connected to the third input end of the superimposed device 2; The input end is connected, and the output end of the digital frequency divider 709 is connected to the input ends of the two DACs 710a and 710b; the output end of the DAC 710a is connected to the moving speaker 711a, and the output end of the DAC 710b is connected to the moving iron speaker 711b. The AHA joint controller 712 is respectively connected to the vocal enhancement filter 703 , the feedforward filter 706 , the feedback filter 708 and the SP filter 705 .
示例性的,图9c为本申请TWS耳机的一个示例性的结构示意图。如图9c所示,本实施例的结构是图9a所示结构的一种更详细的实施方式。Exemplarily, FIG. 9c is an exemplary schematic structural diagram of the TWS headset of the present application. As shown in Fig. 9c, the structure of this embodiment is a more detailed implementation of the structure shown in Fig. 9a.
本实施例的TWS耳机70中设置了四个扬声器,即动圈扬声器711a、动铁扬声器711b、MEMS扬声器711c和平面振动膜片711d,动圈扬声器711a的主要工作频段在8.5kHz以下,动铁扬声器711b的主要工作频段在8.5kHz以上,MEMS扬声器711c的主要工作频段取决于应用形态,入耳式耳机的主要工作频段是全频带,头戴式耳机的主要工作频段在7kHz以下偏弱,主要作用频带在7kHz以上的高频,平面振动膜片711d的主要工作频段是10kHz~20kHz,这样可以设置分频器42将模拟形式的扬声器驱动信号分频成四个子频段,动圈扬声器711a播放8.5kHz以下的子音频信号可以维持最佳频响,动铁扬声器711b播放8.5kHz以上的子音频信号可以维持最佳频响,MEMS扬声器711c播放7kHz以上的子音频信号可以维持最佳频响,平面振动膜片711d播放10kHz以上的子音频信号可以维持最佳频响,从而使得TWS耳机70既可以在音频源的各个频段上体现出高音质,又可以支持超带宽的语音通话。The TWS earphone 70 in this embodiment is provided with four speakers, namely a moving coil speaker 711a, a moving iron speaker 711b, a MEMS speaker 711c, and a plane vibration diaphragm 711d. The main working frequency of the moving coil speaker 711a is below 8.5 kHz, and the moving iron The main working frequency band of the speaker 711b is above 8.5kHz, and the main working frequency band of the MEMS speaker 711c depends on the application form. For high frequencies above 7 kHz, the main working frequency band of the planar vibrating diaphragm 711d is 10 kHz to 20 kHz. In this way, the frequency divider 42 can be set to divide the analog speaker driving signal into four sub-frequency bands, and the moving coil speaker 711a plays 8.5 kHz. The following sub-audio signals can maintain the best frequency response, the moving iron speaker 711b can maintain the best frequency response by playing the sub-audio signal above 8.5kHz, and the MEMS speaker 711c can maintain the best frequency response by playing the sub-audio signal above 7kHz, and the plane vibrates The diaphragm 711d can maintain the best frequency response by playing sub-audio signals above 10 kHz, so that the TWS earphone 70 can not only reflect high sound quality in various frequency bands of the audio source, but also support ultra-bandwidth voice calls.
上述结构中,人声增强滤波器703、音频源704和AHA联合控制器712设置于DSP芯片中,前馈滤波器706、反馈滤波器708、SP滤波器705和DAC 709设置于CODEC中。In the above structure, the vocal enhancement filter 703, the audio source 704 and the AHA joint controller 712 are arranged in the DSP chip, and the feedforward filter 706, the feedback filter 708, the SP filter 705 and the DAC 709 are arranged in the CODEC.
示例性的,图9d为本申请TWS耳机的一个示例性的结构示意图。如图9d所示,本实施例的结构是图9a所示结构的一种更详细的实施方式。Exemplarily, FIG. 9d is an exemplary schematic structural diagram of the TWS headset of the present application. As shown in FIG. 9d, the structure of this embodiment is a more detailed implementation of the structure shown in FIG. 9a.
与图9b所示结构的区别在于:人声增强滤波器703从DSP芯片移到了CODEC中。The difference from the structure shown in Fig. 9b is that the vocal enhancement filter 703 is moved from the DSP chip to the CODEC.
示例性的,图9e为本申请TWS耳机的一个示例性的结构示意图。如图9e所示,本实施例的结构是图9a所示结构的一种更详细的实施方式。Exemplarily, FIG. 9e is an exemplary schematic structural diagram of the TWS headset of the present application. As shown in FIG. 9e, the structure of this embodiment is a more detailed implementation of the structure shown in FIG. 9a.
与图9c所示结构的区别在于:人声增强滤波器703从DSP芯片移到了CODEC中。The difference from the structure shown in Fig. 9c is that the vocal enhancement filter 703 is moved from the DSP chip to the CODEC.
图10为本申请TWS耳机的播放方法的一个示例性的流程图,如图10所示,本实施例的方法可以应用于上述实施例中的TWS耳机。该方法可以包括:FIG. 10 is an exemplary flowchart of a method for playing a TWS headset of the present application. As shown in FIG. 10 , the method of this embodiment may be applied to the TWS headset in the above-mentioned embodiments. The method can include:
步骤1001、获取音频源。Step 1001: Acquire an audio source.
可选的,音频源为原始的音乐或通话语音,即音频源可以是用户正在用耳机听的音乐、视频声音等,也可以是用户正在用耳机打电话时的通话语音。该音频源可以来自于电子设备的播放器。可选的,音频源包括经人声增强处理的语音信号和原始的音乐或通话语音,即除了上述两种情况下的音乐或通话语音外,音频源还可以叠加经人声增强处理的外界语音信号。该经人声增强处理的外界语音信号可以通过图2a或图2b所示的结构中的人声增强滤波器获得,此处不再赘述。Optionally, the audio source is original music or call voice, that is, the audio source may be music, video sound, etc. that the user is listening to with the headset, or the call voice when the user is making a call with the headset. The audio source may come from the player of the electronic device. Optionally, the audio source includes the voice signal enhanced by the human voice and the original music or voice of the call, that is, in addition to the music or voice of the call in the above two cases, the audio source can also superimpose the voice of the outside world enhanced by the human voice. Signal. The external speech signal processed by the human voice enhancement can be obtained by the human voice enhancement filter in the structure shown in FIG. 2 a or FIG. 2 b , which will not be repeated here.
步骤1002、对音频源进行降噪或透传处理得到扬声器驱动信号。 Step 1002 , performing noise reduction or transparent transmission processing on the audio source to obtain a speaker driving signal.
在一种可能的实现方式中,可以通过CODEC获取固定次级路径SP滤波器,然后根据固定SP滤波器对音频源进行处理得到滤波信号,再对滤波信号进行降噪或透传处理得到扬声器驱动信号。其中,CODEC可以根据预先设定的扬声器驱动信号和反馈FB麦克风拾取到的耳道信号获取估计SP滤波器,耳道信号包括耳道内部的残留噪声信号,当经估计SP滤波器得到的信号和耳道信号的差值信号处于设定范围内时,将估计SP滤波器确定为固定SP滤波器。可选的,当经估计SP滤波器得到的信号和耳道信号的差值信号处于设定范围内时,根据估计SP滤波器的目标频响和预先设置的分频需求获取级联二阶滤波器的参数,根据级联二阶滤波器的参数获取SP级联二阶滤波器,将SP级联二阶滤波器作为固定SP滤波器。In a possible implementation, the fixed secondary path SP filter can be obtained through the CODEC, then the audio source is processed according to the fixed SP filter to obtain a filtered signal, and then the filtered signal is subjected to noise reduction or transparent transmission processing to obtain the speaker driver Signal. Among them, the CODEC can obtain the estimated SP filter according to the preset speaker drive signal and the ear canal signal picked up by the feedback FB microphone. The ear canal signal includes the residual noise signal inside the ear canal. When the signal obtained by the estimated SP filter and the When the difference signal of the ear canal signal is within the set range, the estimated SP filter is determined as a fixed SP filter. Optionally, when the difference signal between the signal obtained by the estimated SP filter and the ear canal signal is within the set range, the cascaded second-order filter is obtained according to the target frequency response of the estimated SP filter and the preset frequency division requirement. According to the parameters of the cascaded second-order filter, the SP cascaded second-order filter is obtained, and the SP cascaded second-order filter is used as a fixed SP filter.
在一种可能的实现方式中,可通过DSP芯片获取自适应SP滤波器,然后根据自适应SP滤波器对音频源进行处理得到滤波信号,再对滤波信号进行降噪或透传处理得到扬声器驱动信号。其中,DSP芯片可以获取实时噪音信号,根据音频源和实时噪音信号获取估计SP滤波器,当经估计SP滤波器得到的信号和实时噪音信号的差值信号处于设定范围内时,将估计SP滤波器确定为自适应SP滤波器。可选的,DSP芯片可以先获取FF麦克风拾取的外界信号和FB麦克风拾取到的耳道信号,外界信号包括外界噪声信号和音乐或通话语音,耳道信号包括耳道内部的残留噪声信号和音乐或通话语音,再获取主麦克风拾取到的语音信号,最后将语音信号减去外界信号和耳道信号得到实时噪音信号。In a possible implementation, the adaptive SP filter can be obtained through the DSP chip, and then the audio source is processed according to the adaptive SP filter to obtain a filtered signal, and then the filtered signal is subjected to noise reduction or transparent transmission processing to obtain the speaker driver Signal. Among them, the DSP chip can obtain the real-time noise signal, and obtain the estimated SP filter according to the audio source and the real-time noise signal. When the difference signal between the signal obtained by the estimated SP filter and the real-time noise signal is within the set range, the estimated SP The filter is determined to be an adaptive SP filter. Optionally, the DSP chip can first obtain the external signal picked up by the FF microphone and the ear canal signal picked up by the FB microphone. The external signal includes the external noise signal and music or call voice, and the ear canal signal includes the residual noise signal inside the ear canal and music. Or call voice, and then obtain the voice signal picked up by the main microphone, and finally subtract the external signal and the ear canal signal from the voice signal to obtain a real-time noise signal.
本申请中的降噪处理和透传处理可以参照上述实施例,此处不再赘述。For noise reduction processing and transparent transmission processing in this application, reference may be made to the foregoing embodiments, which will not be repeated here.
步骤1003、将扬声器驱动信号分频成至少两个频段的子音频信号。Step 1003: Divide the speaker driving signal into sub-audio signals of at least two frequency bands.
经处理后的扬声器驱动信号的频段与音频源的频段相对应,可能包含低中高整个频段,但由于单个扬声器的主要工作频段可能只能覆盖低中高的其中部分频段,导致单个扬声器无法在整个频段体现出高音质来。The frequency band of the processed speaker driving signal corresponds to the frequency band of the audio source, and may include the whole frequency range of low, medium and high. However, because the main working frequency band of a single speaker may only cover part of the low, middle and high frequency bands, a single speaker cannot operate in the entire frequency band. reflect high-quality sound.
本申请的分频器可以被配置为基于至少两个扬声器的主要工作频段对扬声器驱动信号进行分频,得到与至少两个扬声器的主要工作频段分别对应的至少两个频段的子音频信号,然后让各个扬声器分别播放对应频段的子音频信号,从而使扬声器在播放传输给其的子音频信号时维持最佳频响。前述至少两个频段中的相邻频段部分重叠,或者,至少两个频段中的相邻频段不重叠。例如,分频器将扬声器驱动信号分成高低两个频段,高频段和低频段完全分开,没有重叠;或者,高频段和低频段有部分频段是重叠的。又例如,分频 器将扬声器驱动信号分成高中低三个频段,高频段和中频段完全分开,没有重叠,中频段和低频段有部分频段是重叠的;或者,高频段、中频段和低频段完全分开,没有重叠;或者,高频段和中频段有部分频段是重叠的,中频段和低频段完全分开,没有重叠。The frequency divider of the present application may be configured to divide the frequency of the speaker drive signal based on the main working frequency bands of the at least two speakers, so as to obtain sub-audio signals of at least two frequency bands corresponding to the main working frequency bands of the at least two speakers, and then Let each speaker play the sub audio signal of the corresponding frequency band, so that the speaker maintains the best frequency response when playing the sub audio signal transmitted to it. Adjacent frequency bands in the aforementioned at least two frequency bands partially overlap, or adjacent frequency bands in the at least two frequency bands do not overlap. For example, a crossover divides the speaker drive signal into high and low frequency bands, the high and low frequency bands are completely separated, and there is no overlap; or, the high and low frequency bands partially overlap. For another example, the frequency divider divides the speaker driving signal into three frequency bands: high, middle and low frequency. The high frequency band and the middle frequency band are completely separated without overlapping. Some frequency bands of the middle frequency band and the low frequency band overlap; or, the high frequency band, the middle frequency band and the low frequency band Complete separation, no overlap; or, high frequency and mid frequency bands are partially overlapping, mid frequency and low frequency bands are completely separated, no overlap.
本申请可以通过设置分频器42的参数以控制分频器42按照预设的方式对扬声器驱动信号进行分频。分频实施方式可以参照图6a~图6d,此处不再赘述。The present application can control the frequency divider 42 to divide the frequency of the speaker driving signal in a preset manner by setting the parameters of the frequency divider 42 . For the implementation of frequency division, reference may be made to FIG. 6a to FIG. 6d , which will not be repeated here.
步骤1004、通过至少两个扬声器分别播放至少两个频段的子音频信号的其中之一。Step 1004: Play one of the sub audio signals of at least two frequency bands through at least two speakers respectively.
本申请的TWS耳机中设置了至少两个扬声器,该至少两个扬声器的主要工作频段不完全相同。分频器可以将扬声器驱动信号分频成至少两个频段的子音频信号,该至少两个频段中的相邻频段部分重叠或者不重叠,这样将各个子音频信号分别传输至频段匹配的扬声器,前述频段匹配可以是指扬声器的主要工作频段覆盖了传输给其的子音频信号的频段,这样扬声器在播放传输给其的子音频信号时维持最佳频响,既可以在音频源的各个频段上体现出高音质,又可以支持超带宽的语音通话。At least two speakers are set in the TWS earphone of the present application, and the main working frequency bands of the at least two speakers are not identical. The frequency divider can divide the speaker drive signal into sub-audio signals of at least two frequency bands, and the adjacent frequency bands in the at least two frequency bands partially overlap or do not overlap, so that each sub-audio signal is respectively transmitted to the speakers with matching frequency bands, The aforementioned frequency band matching can mean that the main working frequency band of the speaker covers the frequency band of the sub-audio signal transmitted to it, so that the speaker maintains the best frequency response when playing the sub-audio signal transmitted to it. It reflects high sound quality and supports ultra-bandwidth voice calls.
图11为本申请TWS耳机的播放装置的一个示例性的结构图,如图11所示,本实施例的装置1100可以应用于上述实施例中的TWS耳机,该装置1100包括:获取模块1101、处理模块1102、分频模块1103和播放模块1104。其中,FIG. 11 is an exemplary structural diagram of a playback device for a TWS headset of the present application. As shown in FIG. 11 , the device 1100 of this embodiment can be applied to the TWS headset in the above-mentioned embodiment, and the device 1100 includes: an acquisition module 1101, Processing module 1102 , frequency dividing module 1103 and playing module 1104 . in,
获取模块1101,用于获取音频源,所述音频源为原始的音乐或通话语音,或者,所述音频源包括经人声增强处理的语音信号和所述原始的音乐或通话语音;处理模块1102,用于对所述音频源进行降噪或透传处理得到扬声器驱动信号;分频模块1103,用于将所述扬声器驱动信号分频成至少两个频段的子音频信号,所述至少两个频段中的相邻频段部分重叠,或者,所述至少两个频段中的相邻频段不重叠;播放模块1104,用于通过至少两个扬声器分别播放所述至少两个频段的子音频信号的其中之一。The acquisition module 1101 is used to acquire an audio source, the audio source is the original music or call voice, or, the audio source includes the voice signal enhanced by the human voice and the original music or call voice; processing module 1102 , used to perform noise reduction or transparent transmission processing on the audio source to obtain a speaker drive signal; the frequency division module 1103 is used to divide the speaker drive signal into sub-audio signals of at least two frequency bands, the at least two Adjacent frequency bands in the frequency bands partially overlap, or, adjacent frequency bands in the at least two frequency bands do not overlap; the playing module 1104 is configured to play the sub audio signals of the at least two frequency bands through at least two speakers respectively. one.
在一种可能的实现方式中,所述处理模块1102,具体用于通过编译码器CODEC获取固定次级路径SP滤波器;根据所述固定SP滤波器对所述音频源进行处理得到滤波信号;对所述滤波信号进行降噪或透传处理得到所述扬声器驱动信号。In a possible implementation manner, the processing module 1102 is specifically configured to obtain a fixed secondary path SP filter through a codec CODEC; process the audio source according to the fixed SP filter to obtain a filtered signal; The speaker driving signal is obtained by performing noise reduction or transparent transmission processing on the filtered signal.
在一种可能的实现方式中,所述处理模块1102,具体用于根据预先设定的扬声器驱动信号和反馈FB麦克风拾取到的耳道信号获取估计SP滤波器,所述耳道信号包括耳道内部的残留噪声信号和所述音乐或通话语音;当经所述估计SP滤波器得到的信号和所述耳道信号的差值信号处于设定范围内时,将所述估计SP滤波器确定为所述固定SP滤波器。In a possible implementation manner, the processing module 1102 is specifically configured to obtain an estimated SP filter according to a preset speaker drive signal and an ear canal signal picked up by the feedback FB microphone, where the ear canal signal includes the ear canal Internal residual noise signal and the music or talking speech; when the difference signal between the signal obtained by the estimated SP filter and the ear canal signal is within a set range, the estimated SP filter is determined as the fixed SP filter.
在一种可能的实现方式中,所述处理模块1102,还用于当经所述估计SP滤波器得到的信号和所述耳道信号的差值信号处于设定范围内时,根据所述估计SP滤波器的目标频响和预先设置的分频需求获取级联二阶滤波器的参数;根据所述级联二阶滤波器的参数获取SP级联二阶滤波器,将所述SP级联二阶滤波器作为所述固定SP滤波器。In a possible implementation manner, the processing module 1102 is further configured to, when the difference signal between the signal obtained by the estimated SP filter and the ear canal signal is within a set range, according to the estimation The parameters of the cascaded second-order filter are obtained from the target frequency response of the SP filter and the preset frequency division requirements; the SP cascaded second-order filter is obtained according to the parameters of the cascaded second-order filter, and the SP is cascaded A second order filter acts as the fixed SP filter.
在一种可能的实现方式中,所述处理模块1102,具体用于通过数字信号处理DSP芯片获取自适应SP滤波器;根据所述自适应SP滤波器对所述音频源进行处理得到滤波信号;对所述滤波信号进行降噪或透传处理得到所述扬声器驱动信号。In a possible implementation manner, the processing module 1102 is specifically configured to obtain an adaptive SP filter through a digital signal processing DSP chip; process the audio source according to the adaptive SP filter to obtain a filtered signal; The speaker driving signal is obtained by performing noise reduction or transparent transmission processing on the filtered signal.
在一种可能的实现方式中,所述处理模块1102,具体用于获取实时噪音信号;根据所述音频源和所述实时噪音信号获取估计SP滤波器;当经所述估计SP滤波器得到的信号和所述实时噪音信号的差值信号处于设定范围内时,将所述估计SP滤波器确定为所述自适应SP滤波器。In a possible implementation manner, the processing module 1102 is specifically configured to obtain a real-time noise signal; obtain an estimated SP filter according to the audio source and the real-time noise signal; When the difference signal between the signal and the real-time noise signal is within a set range, the estimated SP filter is determined as the adaptive SP filter.
在一种可能的实现方式中,所述处理模块1102,具体用于获取前馈FF麦克风拾取的外界信号和反馈FB麦克风拾取到的耳道信号,所述外界信号包括外界噪声信号和所述音乐或通话语音,所述耳道信号包括耳道内部的残留噪声信号和所述音乐或通话语音;获取主麦克风拾取到的语音信号;将所述语音信号减去所述外界信号和所述耳道信号得到信号差;根据所述音频源和所述信号差获取所述估计SP滤波器。In a possible implementation manner, the processing module 1102 is specifically configured to acquire the external signal picked up by the feedforward FF microphone and the ear canal signal picked up by the feedback FB microphone, where the external signal includes external noise signals and the music Or call voice, the ear canal signal includes the residual noise signal inside the ear canal and the music or the call voice; obtain the voice signal picked up by the main microphone; subtract the external signal and the ear canal from the voice signal The signal obtains a signal difference; the estimated SP filter is obtained from the audio source and the signal difference.
在一种可能的实现方式中,所述至少两个扬声器的主要工作频段不完全相同。In a possible implementation manner, the main working frequency bands of the at least two speakers are not identical.
在一种可能的实现方式中,所述至少两个扬声器包括动圈扬声器和动铁扬声器。In a possible implementation manner, the at least two speakers include a moving coil speaker and a moving iron speaker.
在一种可能的实现方式中,所述至少两个扬声器包括动圈扬声器、动铁扬声器、微机电系统MEMS扬声器和平面振动膜片。In a possible implementation manner, the at least two speakers include a moving coil speaker, a moving iron speaker, a microelectromechanical system MEMS speaker, and a planar vibrating diaphragm.
本实施例的装置,可以用于执行图10所示方法实施例的技术方案,其实现原理和技术效果类似,此处不再赘述。The apparatus of this embodiment can be used to execute the technical solution of the method embodiment shown in FIG. 10 , and its implementation principle and technical effect are similar, and details are not repeated here.
在实现过程中,上述方法实施例的各步骤可以通过处理器中的硬件的集成逻辑电路或者软件形式的指令完成。处理器可以是通用处理器、数字信号处理器(digital signal processor,DSP)、特定应用集成电路(application-specific integrated circuit,ASIC)、现场可编程门阵列(field programmable gate array,FPGA)或其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。本申请实施例公开的方法的步骤可以直接体现为硬件编码处理器执行完成,或者用编码处理器中的硬件及软件模块组合执行完成。软件模块可以位于随机存储器,闪存、只读存储器,可编程只读存储器或者电可擦写可编程存储器、寄存器等本领域成熟的存储介质中。该存储介质位于存储器,处理器读取存储器中的信息,结合其硬件完成上述方法的步骤。In the implementation process, each step of the above method embodiments may be completed by a hardware integrated logic circuit in a processor or an instruction in the form of software. The processor can be a general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field programmable gate array (FPGA), or other Programming logic devices, discrete gate or transistor logic devices, discrete hardware components. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like. The steps of the methods disclosed in the embodiments of the present application may be directly embodied as executed by a hardware coding processor, or executed by a combination of hardware and software modules in the coding processor. The software modules may be located in random access memory, flash memory, read-only memory, programmable read-only memory or electrically erasable programmable memory, registers and other storage media mature in the art. The storage medium is located in the memory, and the processor reads the information in the memory, and completes the steps of the above method in combination with its hardware.
上述各实施例中提及的存储器可以是易失性存储器或非易失性存储器,或可包括易失性和非易失性存储器两者。其中,非易失性存储器可以是只读存储器(read-only memory,ROM)、可编程只读存储器(programmable ROM,PROM)、可擦除可编程只读存储器(erasable PROM,EPROM)、电可擦除可编程只读存储器(electrically EPROM,EEPROM)或闪存。易失性存储器可以是随机存取存储器(random access memory,RAM),其用作外部高速缓存。通过示例性但不是限制性说明,许多形式的RAM可用,例如静态随机存取存储器(static RAM,SRAM)、动态随机存取存储器(dynamic RAM,DRAM)、同步动态随机存取存储器(synchronous DRAM,SDRAM)、双倍数据速率同步动态随机存取存储器(double data rate SDRAM,DDR SDRAM)、增强型同步动态随机存取存储器(enhanced SDRAM,ESDRAM)、同步连接动态随机存取存储器(synchlink DRAM,SLDRAM)和直接内存总线随机存取存储器(direct rambus RAM,DR RAM)。应注意,本文描述的系统和方法的存储器旨在包括但不限于这些和任意其它适合类型的存储器。The memory mentioned in the above embodiments may be volatile memory or non-volatile memory, or may include both volatile and non-volatile memory. The non-volatile memory may be read-only memory (ROM), programmable read-only memory (PROM), erasable programmable read-only memory (EPROM), electrically programmable Erase programmable read-only memory (electrically EPROM, EEPROM) or flash memory. Volatile memory may be random access memory (RAM), which acts as an external cache. By way of example and not limitation, many forms of RAM are available, such as static random access memory (SRAM), dynamic random access memory (DRAM), synchronous DRAM, SDRAM), double data rate synchronous dynamic random access memory (double data rate SDRAM, DDR SDRAM), enhanced synchronous dynamic random access memory (enhanced SDRAM, ESDRAM), synchronous link dynamic random access memory (synchlink DRAM, SLDRAM) ) and direct memory bus random access memory (direct rambus RAM, DR RAM). It should be noted that the memory of the systems and methods described herein is intended to include, but not be limited to, these and any other suitable types of memory.
本领域普通技术人员可以意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本申请的范围。Those of ordinary skill in the art can realize that the units and algorithm steps of each example described in conjunction with the embodiments disclosed herein can be implemented in electronic hardware, or a combination of computer software and electronic hardware. Whether these functions are performed in hardware or software depends on the specific application and design constraints of the technical solution. Skilled artisans may implement the described functionality using different methods for each particular application, but such implementations should not be considered beyond the scope of this application.
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的系统、装 置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Those skilled in the art can clearly understand that, for the convenience and brevity of description, the specific working process of the above-described systems, devices and units can refer to the corresponding processes in the foregoing method embodiments, and will not be repeated here.
在本申请所提供的几个实施例中,应该理解到,所揭露的系统、装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided in this application, it should be understood that the disclosed system, apparatus and method may be implemented in other manners. For example, the apparatus embodiments described above are only illustrative. For example, the division of the units is only a logical function division. In actual implementation, there may be other division methods. For example, multiple units or components may be combined or Can be integrated into another system, or some features can be ignored, or not implemented. On the other hand, the shown or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, indirect coupling or communication connection of devices or units, and may be in electrical, mechanical or other forms.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution in this embodiment.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。In addition, each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit.
所述功能如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(个人计算机,服务器,或者网络设备等)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(read-only memory,ROM)、随机存取存储器(random access memory,RAM)、磁碟或者光盘等各种可以存储程序代码的介质。The functions, if implemented in the form of software functional units and sold or used as independent products, may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present application can be embodied in the form of a software product in essence, or the part that contributes to the prior art or the part of the technical solution. The computer software product is stored in a storage medium, including Several instructions are used to cause a computer device (personal computer, server, or network device, etc.) to execute all or part of the steps of the methods described in the various embodiments of the present application. The aforementioned storage medium includes: U disk, mobile hard disk, read-only memory (ROM), random access memory (RAM), magnetic disk or optical disk and other media that can store program codes .
以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以所述权利要求的保护范围为准。The above are only specific embodiments of the present application, but the protection scope of the present application is not limited to this. should be covered within the scope of protection of this application. Therefore, the protection scope of the present application should be subject to the protection scope of the claims.

Claims (33)

  1. 一种真无线立体声TWS耳机,其特征在于,包括:音频信号处理通路、分频器和至少两个扬声器;其中,A true wireless stereo TWS earphone, characterized in that it includes: an audio signal processing path, a frequency divider and at least two speakers; wherein,
    所述音频信号处理通路的输出端和所述分频器的输入端连接;所述分频器的输出端和所述至少两个扬声器连接;The output end of the audio signal processing path is connected with the input end of the frequency divider; the output end of the frequency divider is connected with the at least two speakers;
    所述音频信号处理通路,被配置为对音频源进行降噪或透传处理后输出扬声器驱动信号;所述音频源为原始的音乐或通话语音;或者,所述音频源包括经人声增强处理的语音信号和所述原始的音乐或通话语音;The audio signal processing path is configured to output a speaker drive signal after performing noise reduction or transparent transmission processing on the audio source; the audio source is original music or voice of a call; the voice signal and the original music or call voice;
    所述分频器,被配置为将所述扬声器驱动信号分频成至少两个频段的子音频信号,所述至少两个频段和所述至少两个扬声器的主要工作频段对应;所述至少两个频段中的相邻频段部分重叠,或者,所述至少两个频段中的相邻频段不重叠;The frequency divider is configured to divide the speaker drive signal into sub-audio signals of at least two frequency bands, the at least two frequency bands correspond to the main working frequency bands of the at least two speakers; the at least two frequency bands are Adjacent frequency bands in the frequency bands partially overlap, or adjacent frequency bands in the at least two frequency bands do not overlap;
    所述至少两个扬声器,被配置为播放对应的子音频信号。The at least two speakers are configured to play corresponding sub audio signals.
  2. 根据权利要求1所述的TWS耳机,其特征在于,所述音频信号处理通路包括:The TWS headset according to claim 1, wherein the audio signal processing path comprises:
    次级路径SP滤波器,被配置为在降噪或透传处理与所述音频源并发的情况下,防止所述降噪或透传处理对所述音频源的声音的消除。A secondary path SP filter configured to prevent the noise reduction or passthrough process from canceling the sound of the audio source in the event that the noise reduction or passthrough process is concurrent with the audio source.
  3. 根据权利要求2所述的TWS耳机,其特征在于,所述音频信号处理通路还包括:反馈FB麦克风、反馈滤波器;其中,The TWS headset according to claim 2, wherein the audio signal processing path further comprises: a feedback FB microphone and a feedback filter; wherein,
    所述FB麦克风,被配置为拾取耳道信号,所述耳道信号包括耳道内部的残留噪声信号和所述音乐或通话语音;the FB microphone is configured to pick up the ear canal signal, the ear canal signal includes the residual noise signal inside the ear canal and the music or the voice of the call;
    所述SP滤波器,被配置为输入所述音频源,对所述音频源进行处理,输出的信号与所述耳道信号叠加后传输至所述反馈滤波器;The SP filter is configured to input the audio source, process the audio source, and transmit the output signal to the feedback filter after superimposing the ear canal signal;
    所述反馈滤波器,被配置为生成用于所述降噪或透传处理的信号,所述降噪或透传处理的信号是生成所述扬声器驱动信号的其中一个叠加信号。The feedback filter is configured to generate a signal for the noise reduction or transparent transmission processing, and the noise reduction or transparent transmission processed signal is one of the superimposed signals for generating the speaker driving signal.
  4. 根据权利要求3所述的TWS耳机,其特征在于,所述反馈FB麦克风、所述反馈滤波器和所述SP滤波器设置于编译码器CODEC中。The TWS headset according to claim 3, wherein the feedback FB microphone, the feedback filter and the SP filter are set in a codec CODEC.
  5. 根据权利要求2所述的TWS耳机,其特征在于,所述SP滤波器,被配置为输入所述音频源,对所述音频源进行处理,输出的信号是所述扬声器驱动信号的其中一个叠加信号。The TWS earphone according to claim 2, wherein the SP filter is configured to input the audio source, process the audio source, and the output signal is a superposition of the speaker driving signal Signal.
  6. 根据权利要求5所述的TWS耳机,其特征在于,所述SP滤波器设置于数字信号处理DSP芯片中。The TWS earphone according to claim 5, wherein the SP filter is arranged in a digital signal processing DSP chip.
  7. 根据权利要求1-6中任一项所述的TWS耳机,其特征在于,还包括:第一数模转换器DAC;所述第一DAC的输入端和所述音频信号处理通路的输出端连接,所述第一DAC的输出端和所述分频器的输入端连接;The TWS headset according to any one of claims 1-6, further comprising: a first digital-to-analog converter DAC; an input end of the first DAC is connected to an output end of the audio signal processing path , the output end of the first DAC is connected with the input end of the frequency divider;
    所述第一DAC,被配置为将所述扬声器驱动信号从数字形式转换成模拟形式;the first DAC configured to convert the speaker drive signal from a digital form to an analog form;
    相应的,所述分频器为模拟分频器。Correspondingly, the frequency divider is an analog frequency divider.
  8. 根据权利要求1-6中任一项所述的TWS耳机,其特征在于,还包括:至少两个第二DAC;所述至少两个第二DAC的输入端均与所述分频器的输出端连接,所述至少两个第二DAC的输出端分别和所述至少两个扬声器的其中之一连接;The TWS headset according to any one of claims 1-6, further comprising: at least two second DACs; the input ends of the at least two second DACs are both connected to the output of the frequency divider terminals are connected, and the output terminals of the at least two second DACs are respectively connected to one of the at least two speakers;
    所述第二DAC,被配置为将所述至少两个频段的子音频信号的其中之一从数字形式转换成模拟形式;the second DAC configured to convert one of the sub-audio signals of the at least two frequency bands from a digital form to an analog form;
    相应的,所述分频器为数字分频器。Correspondingly, the frequency divider is a digital frequency divider.
  9. 根据权利要求1-8中任一项所述的TWS耳机,其特征在于,所述至少两个扬声器的主要工作频段不完全相同。The TWS earphone according to any one of claims 1-8, wherein the main working frequency bands of the at least two speakers are not identical.
  10. 根据权利要求9所述的TWS耳机,其特征在于,所述至少两个扬声器包括动圈扬声器和动铁扬声器。The TWS earphone according to claim 9, wherein the at least two speakers include a moving coil speaker and a moving iron speaker.
  11. 根据权利要求9所述的TWS耳机,其特征在于,所述至少两个扬声器包括动圈扬声器、动铁扬声器、微机电系统MEMS扬声器和平面振动膜片。The TWS earphone according to claim 9, wherein the at least two speakers include a moving coil speaker, a moving iron speaker, a micro-electromechanical system MEMS speaker, and a planar vibrating diaphragm.
  12. 一种真无线立体声TWS耳机的播放方法,其特征在于,所述方法应用于权利要求1-11中任一项所述的TWS耳机;所述方法包括:A method for playing a true wireless stereo TWS headset, wherein the method is applied to the TWS headset according to any one of claims 1-11; the method comprises:
    获取音频源,所述音频源为原始的音乐或通话语音,或者,所述音频源包括经人声增强处理的语音信号和所述原始的音乐或通话语音;Acquiring an audio source, where the audio source is original music or call voice, or, the audio source includes a voice signal enhanced by human voice and the original music or call voice;
    对所述音频源进行降噪或透传处理得到扬声器驱动信号;Perform noise reduction or transparent transmission processing on the audio source to obtain a speaker drive signal;
    将所述扬声器驱动信号分频成至少两个频段的子音频信号,所述至少两个频段中的相邻频段部分重叠,或者,所述至少两个频段中的相邻频段不重叠;frequency-dividing the speaker drive signal into sub-audio signals of at least two frequency bands, and adjacent frequency bands in the at least two frequency bands partially overlap, or adjacent frequency bands in the at least two frequency bands do not overlap;
    通过至少两个扬声器分别播放所述至少两个频段的子音频信号的其中之一。One of the sub audio signals of the at least two frequency bands is played respectively through at least two speakers.
  13. 根据权利要求12所述的方法,其特征在于,所述对所述音频源进行降噪或透传处理得到扬声器驱动信号,包括:The method according to claim 12, characterized in that, performing noise reduction or transparent transmission processing on the audio source to obtain a speaker driving signal, comprising:
    通过编译码器CODEC获取固定次级路径SP滤波器;Obtain the fixed secondary path SP filter through the codec CODEC;
    根据所述固定SP滤波器对所述音频源进行处理得到滤波信号;The audio source is processed according to the fixed SP filter to obtain a filtered signal;
    对所述滤波信号进行降噪或透传处理得到所述扬声器驱动信号。The speaker driving signal is obtained by performing noise reduction or transparent transmission processing on the filtered signal.
  14. 根据权利要求13所述的方法,其特征在于,所述通过编译码器CODEC获取固定次级路径SP滤波器,包括:The method according to claim 13, wherein the obtaining the fixed secondary path SP filter through a codec CODEC comprises:
    根据预先设定的扬声器驱动信号和反馈FB麦克风拾取到的耳道信号获取估计SP滤波器,所述耳道信号包括耳道内部的残留噪声信号;Obtain the estimated SP filter according to the preset speaker drive signal and the ear canal signal picked up by the feedback FB microphone, where the ear canal signal includes the residual noise signal inside the ear canal;
    当经所述估计SP滤波器得到的信号和所述耳道信号的差值信号处于设定范围内时,将所述估计SP滤波器确定为所述固定SP滤波器。When the difference signal between the signal obtained by the estimated SP filter and the ear canal signal is within a set range, the estimated SP filter is determined as the fixed SP filter.
  15. 根据权利要求14所述的方法,其特征在于,所述根据预先设定的扬声器驱动信号和反馈FB麦克风拾取到的耳道信号获取估计SP滤波器之后,还包括:The method according to claim 14, wherein after obtaining the estimated SP filter according to the preset speaker drive signal and the ear canal signal picked up by the feedback FB microphone, the method further comprises:
    当经所述估计SP滤波器得到的信号和所述耳道信号的差值信号处于设定范围内时,根据所述估计SP滤波器的目标频响和预先设置的分频需求获取级联二阶滤波器的参数;When the difference signal between the signal obtained by the estimated SP filter and the ear canal signal is within the set range, the cascade two is obtained according to the target frequency response of the estimated SP filter and the preset frequency division requirement. parameters of the order filter;
    根据所述级联二阶滤波器的参数获取SP级联二阶滤波器,将所述SP级联二阶滤波器作为所述固定SP滤波器。The SP cascaded second-order filter is obtained according to the parameters of the cascaded second-order filter, and the SP cascaded second-order filter is used as the fixed SP filter.
  16. 根据权利要求12所述的方法,其特征在于,所述对所述音频源进行降噪或透传处理得到扬声器驱动信号,包括:The method according to claim 12, characterized in that, performing noise reduction or transparent transmission processing on the audio source to obtain a speaker driving signal, comprising:
    通过数字信号处理DSP芯片获取自适应SP滤波器;Obtain adaptive SP filter through digital signal processing DSP chip;
    根据所述自适应SP滤波器对所述音频源进行处理得到滤波信号;The audio source is processed according to the adaptive SP filter to obtain a filtered signal;
    对所述滤波信号进行降噪或透传处理得到所述扬声器驱动信号。The speaker driving signal is obtained by performing noise reduction or transparent transmission processing on the filtered signal.
  17. 根据权利要求16所述的方法,其特征在于,所述通过数字信号处理DSP芯片获取自适应SP滤波器,包括:The method according to claim 16, wherein the acquiring the adaptive SP filter through a digital signal processing DSP chip comprises:
    获取实时噪音信号;Get real-time noise signal;
    根据所述音频源和所述实时噪音信号获取估计SP滤波器;Obtaining an estimated SP filter from the audio source and the real-time noise signal;
    当经所述估计SP滤波器得到的信号和所述实时噪音信号的差值信号处于设定范围内时,将所述估计SP滤波器确定为所述自适应SP滤波器。When the difference signal between the signal obtained by the estimated SP filter and the real-time noise signal is within a set range, the estimated SP filter is determined as the adaptive SP filter.
  18. 根据权利要求17所述的方法,其特征在于,所述获取实时噪音信号,包括:The method according to claim 17, wherein the acquiring a real-time noise signal comprises:
    获取前馈FF麦克风拾取的外界信号和反馈FB麦克风拾取到的耳道信号,所述外界信号包括外界噪声信号和所述音乐或通话语音,所述耳道信号包括耳道内部的残留噪声信号和所述音乐或通话语音;Obtain the external signal picked up by the feedforward FF microphone and the ear canal signal picked up by the feedback FB microphone, the external signal includes the external noise signal and the music or the voice of the call, and the ear canal signal includes the residual noise signal inside the ear canal and the music or voice of the call;
    获取主麦克风拾取到的语音信号;Get the voice signal picked up by the main microphone;
    将所述语音信号减去所述外界信号和所述耳道信号得到所述实时噪音信号。The real-time noise signal is obtained by subtracting the external signal and the ear canal signal from the speech signal.
  19. 根据权利要求12-18中任一项所述的方法,其特征在于,所述至少两个扬声器的主要工作频段不完全相同。The method according to any one of claims 12-18, wherein the main operating frequency bands of the at least two speakers are not identical.
  20. 根据权利要求19所述的方法,其特征在于,所述至少两个扬声器包括动圈扬声器和动铁扬声器。The method of claim 19, wherein the at least two speakers comprise a moving coil speaker and a moving iron speaker.
  21. 根据权利要求19所述的方法,其特征在于,所述至少两个扬声器包括动圈扬声器、动铁扬声器、微机电系统MEMS扬声器和平面振动膜片。The method of claim 19, wherein the at least two loudspeakers comprise moving coil loudspeakers, moving iron loudspeakers, MEMS loudspeakers and planar vibrating diaphragms.
  22. 一种真无线立体声TWS耳机的播放装置,其特征在于,所述装置应用于权利要求1-11中任一项所述的TWS耳机;所述装置包括:A playback device for a true wireless stereo TWS headset, wherein the device is applied to the TWS headset according to any one of claims 1-11; the device comprises:
    获取模块,用于获取音频源,所述音频源为原始的音乐或通话语音,或者,所述音频源包括经人声增强处理的语音信号和所述原始的音乐或通话语音;an acquisition module, configured to acquire an audio source, where the audio source is original music or call voice, or, the audio source includes a voice signal enhanced by human voice and the original music or call voice;
    处理模块,用于对所述音频源进行降噪或透传处理得到扬声器驱动信号;a processing module, configured to perform noise reduction or transparent transmission processing on the audio source to obtain a speaker driving signal;
    分频模块,用于将所述扬声器驱动信号分频成至少两个频段的子音频信号,所述至少两个频段中的相邻频段部分重叠,或者,所述至少两个频段中的相邻频段不重叠;A frequency dividing module, configured to divide the speaker drive signal into sub-audio signals of at least two frequency bands, and adjacent frequency bands in the at least two frequency bands partially overlap, or, adjacent frequency bands in the at least two frequency bands frequency bands do not overlap;
    播放模块,用于通过至少两个扬声器分别播放所述至少两个频段的子音频信号的其中之一。The playing module is used for playing one of the sub audio signals of the at least two frequency bands through the at least two speakers respectively.
  23. 根据权利要求22所述的装置,其特征在于,所述处理模块,具体用于通过编译码器CODEC获取固定次级路径SP滤波器;根据所述固定SP滤波器对所述音频源进行处理得到滤波信号;对所述滤波信号进行降噪或透传处理得到所述扬声器驱动信号。The device according to claim 22, wherein the processing module is specifically configured to obtain a fixed secondary path SP filter through a codec CODEC; and the audio source is obtained by processing the fixed SP filter according to the fixed SP filter. Filter the signal; perform noise reduction or transparent transmission processing on the filtered signal to obtain the speaker drive signal.
  24. 根据权利要求23所述的装置,其特征在于,所述处理模块,具体用于根据预先设定的扬声器驱动信号和反馈FB麦克风拾取到的耳道信号获取估计SP滤波器,所述耳道信号包括耳道内部的残留噪声信号和所述音乐或通话语音;当经所述估计SP滤波器得到的信号和所述耳道信号的差值信号处于设定范围内时,将所述估计SP滤波器确定为所述固定SP滤波器。The device according to claim 23, wherein the processing module is specifically configured to obtain an estimated SP filter according to a preset speaker drive signal and an ear canal signal picked up by a feedback FB microphone, the ear canal signal Including the residual noise signal inside the ear canal and the music or talking speech; when the difference signal between the signal obtained by the estimated SP filter and the ear canal signal is within a set range, the estimated SP is filtered is determined as the fixed SP filter.
  25. 根据权利要求24所述的装置,其特征在于,所述处理模块,还用于当经所述估计SP滤波器得到的信号和所述耳道信号的差值信号处于设定范围内时,根据所述估计SP滤波器的目标频响和预先设置的分频需求获取级联二阶滤波器的参数;根据所述级联二阶滤波器的参数获取SP级联二阶滤波器,将所述SP级联二阶滤波器作为所述固定SP滤波 器。The device according to claim 24, wherein the processing module is further configured to: when the difference signal between the signal obtained by the estimated SP filter and the ear canal signal is within a set range, according to The target frequency response of the estimated SP filter and the preset frequency division requirements are obtained to obtain the parameters of the cascaded second-order filter; according to the parameters of the cascaded second-order filter, the SP cascaded second-order filter is obtained, and the An SP cascaded second-order filter acts as the fixed SP filter.
  26. 根据权利要求22所述的装置,其特征在于,所述处理模块,具体用于通过数字信号处理DSP芯片获取自适应SP滤波器;根据所述自适应SP滤波器对所述音频源进行处理得到滤波信号;对所述滤波信号进行降噪或透传处理得到所述扬声器驱动信号。The device according to claim 22, wherein the processing module is specifically configured to obtain an adaptive SP filter through a digital signal processing DSP chip; and the audio source is obtained by processing the adaptive SP filter according to the adaptive SP filter. Filter the signal; perform noise reduction or transparent transmission processing on the filtered signal to obtain the speaker drive signal.
  27. 根据权利要求26所述的装置,其特征在于,所述处理模块,具体用于获取实时噪音信号;根据所述音频源和所述实时噪音信号获取估计SP滤波器;当经所述估计SP滤波器得到的信号和所述实时噪音信号的差值信号处于设定范围内时,将所述估计SP滤波器确定为所述自适应SP滤波器。The device according to claim 26, wherein the processing module is specifically configured to obtain a real-time noise signal; obtain an estimated SP filter according to the audio source and the real-time noise signal; When the difference signal between the signal obtained by the detector and the real-time noise signal is within a set range, the estimated SP filter is determined as the adaptive SP filter.
  28. 根据权利要求27所述的装置,其特征在于,所述处理模块,具体用于获取前馈FF麦克风拾取的外界信号和反馈FB麦克风拾取到的耳道信号,所述外界信号包括外界噪声信号和所述音乐或通话语音,所述耳道信号包括耳道内部的残留噪声信号和所述音乐或通话语音;获取主麦克风拾取到的语音信号;将所述语音信号减去所述外界信号和所述耳道信号得到信号差;根据所述音频源和所述信号差获取所述估计SP滤波器。The device according to claim 27, wherein the processing module is specifically configured to acquire external signals picked up by the feedforward FF microphone and ear canal signals picked up by the feedback FB microphone, the external signals comprising external noise signals and For the music or call voice, the ear canal signal includes the residual noise signal inside the ear canal and the music or call voice; obtain the voice signal picked up by the main microphone; subtract the external signal and all the voice signals from the voice signal; Obtain the signal difference from the ear canal signal; obtain the estimated SP filter according to the audio source and the signal difference.
  29. 根据权利要求22-28中任一项所述的装置,其特征在于,所述至少两个扬声器的主要工作频段不完全相同。The device according to any one of claims 22-28, wherein the main working frequency bands of the at least two speakers are not identical.
  30. 根据权利要求29所述的装置,其特征在于,所述至少两个扬声器包括动圈扬声器和动铁扬声器。30. The apparatus of claim 29, wherein the at least two speakers comprise a moving coil speaker and a moving iron speaker.
  31. 根据权利要求29所述的装置,其特征在于,所述至少两个扬声器包括动圈扬声器、动铁扬声器、微机电系统MEMS扬声器和平面振动膜片。The apparatus of claim 29, wherein the at least two speakers comprise a moving coil speaker, a moving iron speaker, a MEMS speaker, and a planar vibrating diaphragm.
  32. 一种计算机可读存储介质,其特征在于,包括计算机程序,所述计算机程序在计算机上被执行时,使得所述计算机执行权利要求12-21中任一项所述的方法。A computer-readable storage medium, characterized by comprising a computer program, which, when executed on a computer, causes the computer to execute the method of any one of claims 12-21.
  33. 一种计算机程序,其特征在于,当所述计算机程序被计算机执行时,用于执行权利要求12-21中任一项所述的方法。A computer program, characterized in that, when the computer program is executed by a computer, it is used to execute the method of any one of claims 12-21.
PCT/CN2022/083464 2021-04-28 2022-03-28 Tws earphone and playing method and device of tws earphone WO2022227982A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP22794455.0A EP4297428A1 (en) 2021-04-28 2022-03-28 Tws earphone and playing method and device of tws earphone

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110467311.9 2021-04-28
CN202110467311.9A CN115250397A (en) 2021-04-28 2021-04-28 TWS earphone and playing method and device thereof

Publications (1)

Publication Number Publication Date
WO2022227982A1 true WO2022227982A1 (en) 2022-11-03

Family

ID=83697287

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/083464 WO2022227982A1 (en) 2021-04-28 2022-03-28 Tws earphone and playing method and device of tws earphone

Country Status (3)

Country Link
EP (1) EP4297428A1 (en)
CN (1) CN115250397A (en)
WO (1) WO2022227982A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116156385A (en) * 2023-04-19 2023-05-23 深圳市汇顶科技股份有限公司 Filtering method, filtering device, chip and earphone

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2024092478A1 (en) * 2022-11-01 2024-05-10 深圳市韶音科技有限公司 Audio processing device and method

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090225990A1 (en) * 2005-04-25 2009-09-10 Niigata Seimitsu Co., Ltd. Clock generating circuit and audio system
US20130243216A1 (en) * 2012-03-15 2013-09-19 Jerry Harvey Crossover based canalphone system
US20160171962A1 (en) * 2014-12-15 2016-06-16 Honeywell International Inc. Active Noise Reduction Earcup With Speaker Array
CN208850008U (en) * 2018-11-16 2019-05-10 深圳市星科启电子商务有限公司 A kind of more vibration unit TWS earphones of embedded frequency dividing circuit
CN110401901A (en) * 2019-08-22 2019-11-01 杭州聚声科技有限公司 A kind of parametric array sonification system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090225990A1 (en) * 2005-04-25 2009-09-10 Niigata Seimitsu Co., Ltd. Clock generating circuit and audio system
US20130243216A1 (en) * 2012-03-15 2013-09-19 Jerry Harvey Crossover based canalphone system
US20160171962A1 (en) * 2014-12-15 2016-06-16 Honeywell International Inc. Active Noise Reduction Earcup With Speaker Array
CN208850008U (en) * 2018-11-16 2019-05-10 深圳市星科启电子商务有限公司 A kind of more vibration unit TWS earphones of embedded frequency dividing circuit
CN110401901A (en) * 2019-08-22 2019-11-01 杭州聚声科技有限公司 A kind of parametric array sonification system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116156385A (en) * 2023-04-19 2023-05-23 深圳市汇顶科技股份有限公司 Filtering method, filtering device, chip and earphone
CN116156385B (en) * 2023-04-19 2023-07-07 深圳市汇顶科技股份有限公司 Filtering method, filtering device, chip and earphone

Also Published As

Publication number Publication date
CN115250397A (en) 2022-10-28
EP4297428A1 (en) 2023-12-27

Similar Documents

Publication Publication Date Title
JP7008806B2 (en) Parallel Active Noise Reduction (ANR) and Hear-Through Signal Transduction Paths for Acoustic Devices
JP6964581B2 (en) Feedback Adaptive Noise Cancellation (ANC) Controllers and Methods with Feedback Responses Partially Provided by Fixed Response Filters
CN109600698B (en) Noise reduced sound reproduction system and method
JP6069829B2 (en) Ear hole mounting type sound collecting device, signal processing device, and sound collecting method
CN110166865B (en) Earphone with noise cancellation of acoustic noise from haptic vibration driver
WO2022227982A1 (en) Tws earphone and playing method and device of tws earphone
JP2020510240A5 (en)
US20150332662A1 (en) Anc noise active control audio headset with prevention of the effects of a saturation of the feedback microphone signal
JP2013110746A (en) Audio head set having non-adaptive active noise control function for listening audio sound source and/or "hands-free" telephone function
JP2015204627A (en) Anc active noise control audio headset reducing electrical hiss
JP2012231468A (en) Combined microphone and earphone audio headset having means for denoising near speech signal, in particular for "hands-free" telephony system
CN108140380B (en) Adaptive noise cancellation feedback controller and method with feedback response provided in part by fixed response filter
KR20210141585A (en) own voice occlusion compensation
US11062687B2 (en) Compensation for microphone roll-off variation in acoustic devices
EP2337020A1 (en) A device for and a method of processing an acoustic signal
EP3155826B1 (en) Self-voice feedback in communications headsets
CN114787911A (en) Noise elimination system and signal processing method of ear-wearing type playing device
CN111629313B (en) Hearing device comprising loop gain limiter
JP6197930B2 (en) Ear hole mounting type sound collecting device, signal processing device, and sound collecting method
US11335315B2 (en) Wearable electronic device with low frequency noise reduction
JP5228647B2 (en) Noise canceling system, noise canceling signal forming method, and noise canceling signal forming program
US11074903B1 (en) Audio device with adaptive equalization
TWI549525B (en) With three-dimensional effect and to enhance the noise reduction of the environment noise loudspeakers active noise reduction headphones
TWI840100B (en) Active noise reduction audio device, method, and storage medium
WO2022227992A1 (en) Audio playback method and audio playback device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22794455

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2022794455

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 2022794455

Country of ref document: EP

Effective date: 20230920

NENP Non-entry into the national phase

Ref country code: DE