WO2018010375A1 - Procédé et dispositif pour réaliser une fonction de karaoké via un écouteur, et écouteur - Google Patents

Procédé et dispositif pour réaliser une fonction de karaoké via un écouteur, et écouteur Download PDF

Info

Publication number
WO2018010375A1
WO2018010375A1 PCT/CN2016/110995 CN2016110995W WO2018010375A1 WO 2018010375 A1 WO2018010375 A1 WO 2018010375A1 CN 2016110995 W CN2016110995 W CN 2016110995W WO 2018010375 A1 WO2018010375 A1 WO 2018010375A1
Authority
WO
WIPO (PCT)
Prior art keywords
signal
digital
voice signal
earphone
analog
Prior art date
Application number
PCT/CN2016/110995
Other languages
English (en)
Chinese (zh)
Inventor
赵燕鹏
徐敏龙
王东
Original Assignee
歌尔股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 歌尔股份有限公司 filed Critical 歌尔股份有限公司
Publication of WO2018010375A1 publication Critical patent/WO2018010375A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/02Circuits for transducers, loudspeakers or microphones for preventing acoustic reaction, i.e. acoustic oscillatory feedback
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/7243User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/7243User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
    • H04M1/72433User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages for voice messaging, e.g. dictaphones
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L2021/02087Noise filtering the noise being separate speech, e.g. cocktail party

Definitions

  • the present invention relates to the field of earphone technology, and more particularly, to a method, device and earphone for implementing a karaoke function through an earphone.
  • a method for implementing a karaoke function through a headset including:
  • the method further includes:
  • the digital voice signal includes a portion whose amplitude exceeds the first set value, and if so, filtering out the vocal signal in the digital music signal received by the earphone to obtain a background music signal; if not, the The digital music signal is subjected to digital-to-analog conversion processing to obtain an analog music signal, and the analog music signal is transmitted to the earpiece of the earphone.
  • the method further includes:
  • the voice signal collected by the microphone of the receiving headset includes:
  • performing beamforming processing on the first digital voice signal and the second digital voice signal, and obtaining the digital voice signal includes:
  • the digital speech signal is obtained by eliminating a portion of the signal obtained by the beamforming process that has an amplitude exceeding a second set value.
  • an apparatus for implementing a karaoke function through a headset including:
  • a receiving module configured to receive a voice signal collected by a microphone of the earphone
  • An analog-to-digital conversion module configured to perform analog-to-digital conversion processing on the voice signal to obtain a digital voice signal
  • a superimposing module configured to superimpose the background music signal and the digital speech signal to obtain a digital superimposed signal
  • the first digital-to-analog conversion module is configured to perform digital-to-analog conversion processing on the digital superimposed signal to obtain an analog superimposed signal, and send the analog superimposed signal to an earpiece of the earphone.
  • the device further includes a detection module and a second digital-to-analog conversion module, the detection module is configured to detect whether the digital voice signal includes detecting whether the digital voice signal includes an amplitude exceeding a first setting. a portion of the value, if yes, controlling the filtered vocal signal module to filter out a vocal signal in the digital music signal received by the earphone to obtain a background music signal; if not, controlling the second digital to analog conversion module And performing digital-to-analog conversion processing on the digital music signal to obtain an analog music signal, and transmitting the analog music signal to an earpiece of the earphone.
  • the detection module is configured to detect whether the digital voice signal includes detecting whether the digital voice signal includes an amplitude exceeding a first setting. a portion of the value, if yes, controlling the filtered vocal signal module to filter out a vocal signal in the digital music signal received by the earphone to obtain a background music signal; if not, controlling the second digital to analog conversion module And performing digital-to-analog conversion processing on the digital music
  • the device further includes:
  • a first reverberation module configured to add a reverberation signal corresponding to the background music signal to the background music signal
  • a second plus reverberation module configured to add a reverberation signal corresponding to the digital voice signal to the digital voice signal.
  • the receiving module includes:
  • a first receiving unit configured to receive a voice signal collected by a microphone mounted on a earphone as a first voice signal
  • a second receiving unit configured to receive a voice signal collected by a microphone mounted on another earphone as a second voice signal
  • the first digital to analog conversion module includes:
  • a digital-to-analog conversion unit configured to perform analog-to-digital conversion processing on the first voice signal and the second voice signal, respectively, to obtain a corresponding first digital voice signal and a second digital voice signal;
  • a beam forming unit configured to perform beamforming processing on the first digital voice signal and the second digital voice signal to obtain the digital voice signal.
  • the beam forming unit includes:
  • a beamforming subunit configured to perform beamforming processing on the first digital voice signal and the second digital voice signal
  • a eliminating subunit configured to cancel a portion of the signal obtained by the beamforming process and having an amplitude exceeding a second set value, to obtain the digital voice signal
  • an earphone comprising the aforementioned means for realizing a karaoke function through a headphone.
  • an earphone comprising a microphone, a processor for collecting a voice signal, and a memory for storing the instruction for storing an instruction
  • the processor operates to perform the aforementioned method of implementing a karaoke function through a headset.
  • the inventors of the present invention found that in the prior art, there is a problem that the karaoke software cannot realize the karaoke effect in real time.
  • the karaoke effect can be realized only by the earphone in real time, thereby improving the user experience. Therefore, the technical task to be achieved by the present invention or the technical problem to be solved is not thought of or expected by those skilled in the art, so the present invention is a new technical solution.
  • FIG. 1 is a flow chart of an embodiment of a method of implementing a karaoke function through a headset in accordance with the present invention
  • FIG. 2 is a flow chart of another embodiment of a method for implementing a karaoke function through a headset in accordance with the present invention
  • FIG. 3 is a block schematic diagram of an implementation structure of an apparatus for implementing a karaoke function through an earphone according to the present invention
  • Figure 4 is a schematic illustration of an embodiment of an earphone in accordance with the present invention.
  • the invention solves the problem that the existing K-song software can not realize the karaoke effect in real time, and provides a new technical solution for realizing the karaoke function through the earphone.
  • FIG. 1 is a flow chart of an embodiment of a method of implementing a karaoke function through a headset of the present invention.
  • the method of the invention comprises the following steps:
  • Step S101 receiving a voice signal collected by a microphone.
  • the voice signal collected by the microphone is received for subsequent processing steps.
  • Step S102 performing analog-to-digital conversion processing on the voice signal to obtain a digital voice signal.
  • Converting the analog voice signal into a digital voice signal can realize the karaoke function through the DSP processor in the earphone, while the analog signal cannot realize the karaoke function through the processor.
  • Step S103 filtering out the human voice signal in the digital music signal received by the earphone to obtain a background music signal.
  • the characteristics of the original vocal are roughly divided into two types: the sound image position of the human voice is in the center of the entire sound field (the left and right channels are balancedly distributed); the sound frequency of the human voice is concentrated in the intermediate frequency and the high frequency portion. Therefore, the vocal signal in the digital music signal received by the earphone is filtered out, specifically, the background music signal can be obtained by filtering out the equal sound of the left and right channels in the digital music signal and the frequency is concentrated in the intermediate frequency and the high frequency part. .
  • the music file played by the electronic device includes two tracks: the original vocal track and the vocal track, and the background music is obtained by switching the audio track by controlling the electronic device connected to the earphone to switch between the original vocal and the accompaniment. signal.
  • Step S104 superimposing the background music signal and the digital voice signal to obtain a digital superposition signal.
  • step S105 the digital superimposed signal is subjected to digital-to-analog conversion processing, and then sent to the earphone.
  • the digital superimposed signal needs to be digital-to-analog converted to obtain an analog superimposed signal and sent to the earpiece.
  • the headset realizes the karaoke effect in real time.
  • the opening of the karaoke function can be controlled by a button on the earphone, or the earphone can be automatically turned on.
  • the method of the present invention further includes: detecting whether the digital voice signal includes a portion whose amplitude exceeds the first set value, and if so, performing step S103; if not, digitizing the digital music signal The analog conversion process obtains an analog music signal and sends the analog music signal to the earpiece of the earphone.
  • the amplitude of the singing signal is the largest, so by detecting whether the digital voice signal contains a portion whose amplitude exceeds the first set value, It is judged whether the wearer is singing, and if so, the karaoke function is turned on, and if not, the music signal received by the earphone is played. In this way, the function of automatically turning on karaoke is realized.
  • the present invention can also implement a karaoke function through an earphone having two microphones
  • FIG. 2 is another embodiment of a method for implementing a karaoke function through a headphone according to the present invention. flow chart.
  • the method includes the following steps:
  • Step S201 receiving a first voice signal and a second voice signal.
  • the earphones can also realize the functions of active noise reduction and call noise reduction through the two microphones, wherein the active noise reduction is to process the inverse of the microphone signal collected by the microphone.
  • the active noise reduction is to process the inverse of the microphone signal collected by the microphone.
  • the call noise reduction may specifically eliminate the noise signal in the first voice signal, and send the first voice signal that cancels the noise signal to the earphone Connected electronic devices.
  • two microphones collect the voice of the person and the surrounding noise.
  • the mouth is located in the middle of the ears in the vertical direction.
  • the distance from the first microphone to the mouth is the same.
  • the distance between the two microphones and the mouth is equal, so that the time when the first microphone and the second microphone receive the same sound signal from the human mouth is the same; in general, the distance between the first microphone and the noise source is the same.
  • the distance between the two microphones to the noise source is not equal, thus causing a time difference between the time when the two microphones receive the same noise signal from the noise source.
  • the distance from the first microphone to the mouth is not equal to the distance from the second microphone to the mouth.
  • the mouth is offset from the vertical direction of the center point of the ears by a certain angle, and the mouth is in the angular range.
  • the time difference between the two microphones receiving the sound signal from the mouth should not exceed the set value ⁇ T.
  • the microphone signal received from the first microphone is extracted from the microphone signal received by the second microphone, but the time difference is not greater than the set value ⁇ T component, as the sound signal emitted by the mouth;
  • the microphone signal received by the microphone extracts a component of the same signal as the microphone signal received by the second microphone but has a received time difference greater than the set value ⁇ T, and as the noise signal, inverts the noise signal to obtain the noise.
  • the noise reduction signal with the same signal amplitude and opposite phase is superimposed and processed by the noise reduction signal and transmitted to the electronic device connected to the earphone. In this way, the headset can achieve call noise reduction through two microphones.
  • Earphones with two microphones can also be implemented while enabling karaoke Active noise reduction and call noise reduction.
  • Step S202 performing analog-to-digital conversion processing on the first speech signal and the second speech signal, respectively, to obtain a first digital speech signal and a second digital speech signal.
  • the first speech signal is subjected to analog-to-digital conversion processing to obtain a first digital speech signal
  • the second speech signal is subjected to analog-to-digital conversion processing to obtain a second digital speech signal.
  • Step S203 performing beamforming processing on the first digital voice signal and the second digital voice signal.
  • the beamforming process is performed on the first digital voice signal and the second digital voice signal, so that the voice sung by the wearer in the voice signal collected by the microphone can be made clearer, wherein the beamforming process can mainly suppress the first voice signal and the second voice signal.
  • the sound signal in the left and right direction during the wearing of the earphone outputs a sound signal in the front and rear direction, that is, the sound of the wearer singing.
  • beamforming refers to a method of forming spatial directivity by processing (eg, weighting, delay, summation, etc.) of each array element of a multi-array array arranged with a certain geometric shape (straight line, cylinder, arc, etc.).
  • a wide-side microphone array refers to a series of microphones arranged in a direction perpendicular to the direction of the sound waves to be picked. d is the spacing of the two microphone elements in the array. The sound from the wide side of the array is usually the sound to pick up.
  • the wide-edge array can be implemented by basic processing, and the microphones in the array are simply added. The disadvantage of this type of array is that it only attenuates the sound from the sides of the array.
  • the rear response is always consistent with the front response because the array is axisymmetric and cannot distinguish between sound pressure waves arriving from the front and from the rear to the microphone.
  • the minimum response occurs at 90° and 270°.
  • the signal attenuation at these points is highly dependent on the frequency.
  • the half wavelength of the incident frequency approaches the pitch of the microphone, the response approaches cancellation completely.
  • the frequency is about 2.3 kHz (343 m/s ⁇ (0.075 m x 2) ⁇ 2.3 kHz)
  • the response is completely cancelled.
  • the frequencies will alias and the polar coordinate response will begin to show zero at other angles. At this point, the side attenuation begins to decrease again.
  • the omnidirectional microphone since the omnidirectional microphone is mostly used in the earphone, it is capable of equally responding to sounds from all directions.
  • the two microphones can be configured in an array to form a directional response or beam pattern. Beamformed microphone arrays are designed to be more sensitive to sound from one or more specific directions. Thus, by the beamforming process, the noise signals in the first speech signal and the second speech signal can be filtered out to obtain a clearer singing voice signal.
  • Step S204 the portion of the signal obtained by the beamforming process whose amplitude exceeds the second set value is removed, and a digital voice signal is obtained.
  • the portion of the signal obtained by removing the beamforming process that exceeds the second set value is a howling, and since the microphone is mounted on the earpiece of the earphone, a harsh whistling may occur, in order to prevent the human ear from hearing howling, resulting in a comparison.
  • a poor user experience can eliminate the portion of the signal resulting from the beamforming process that exceeds the set value.
  • the conditions for howling are required to satisfy three points: the microphone and the speaker are used at the same time; the sound reproduced by the sound system can be transmitted to the microphone through the space; the sound energy emitted by the speaker is large enough, and the pickup sensitivity of the microphone is sufficiently high.
  • the microphone should be non-directional.
  • the directional microphone especially the sharp directional microphone
  • the adjustment distance has little effect on increasing the sound volume and preventing whistling.
  • the sound reinforcement system is easy to whistle is not directly related to the sensitivity of the microphone.
  • Only the high-sensitivity microphones are sharply directional, which is prone to whistling. Shortening the distance between the sounding device and the listener can actually increase the loudness of the sound. The total gain of the system can be appropriately reduced. If it is supplemented by a near-field speaker with wide directivity, the microphone can be screamed away from the far point.
  • the farther the microphone is from the speaker the closer the speaker is to the listener.
  • the microphone should be placed on the back of the speaker's radiation direction. If the microphone is likely to be held around, the speaker should be placed where the microphone is not close enough.
  • the frequency equalizer can be used to compensate the sound reinforcement curve, and the frequency response of the system is adjusted to an approximate straight line, so that the gains of the respective frequency bands are basically the same, and the sound transmission gain of the system is improved.
  • the audio feedback automatic suppression device is commonly used.
  • This device can automatically track the feedback point frequency, automatically adjust the Q-value bandwidth, automatically eliminate the sound feedback and maximize the protection. Sound quality.
  • the principle is to suppress howling by notching.
  • Sabine's FBX series feedback suppressor which is a 9-segment narrow-band automatic compression device controlled by a microcomputer, can better distinguish between feedback self-excited signals and music signals, and can react quickly when the system is self-excited.
  • set a very narrow digital filter at the feedback frequency the notch depth is also automatically set, the filtering bandwidth is only 1/3 octave, such a narrow notch band, almost no loudness and The tone has an effect.
  • Inverting cancellation prevents self-excitation in high frequency amplification circuits.
  • two microphones of the same specification can be used to respectively pick up the direct sound and the reflected sound, and the inverted sound circuit cancels the phase of the reflected sound signal before entering the power amplifier, which can effectively prevent the howling self-excitation.
  • the self-excited howling of the sound reinforcement system, the feedback loop is positive feedback, if the microphone signal is phase-modulated, it will destroy the self-excited phase condition, thus preventing the self-excited howling of the system. It is shown that the stability is best when the phase deviation value is 140°; and the higher the frequency of modulation, the better the stability of the system. In order to prevent the processed sound quality from being too distorted, the maximum allowable value of the phase modulation frequency is 4 Hz.
  • the method for removing the howling may also be to eliminate the portion of the signal obtained by the beamforming process whose amplitude exceeds the second set value, so that the generated howling is filtered out and played out without the earphone. This makes the human ear not hear the howling and enhance the user experience.
  • Step S205 adding a reverberation signal corresponding to the digital voice signal to the digital voice signal.
  • the reverberation signal corresponding to the digital voice signal may be added to the digital voice signal.
  • the method for obtaining the reverberation signal of the digital speech signal can be:
  • the digital speech signal is subjected to a first delay processing to generate a first signal; the digital speech signal is subjected to a second delay processing and a frequency domain compression filtering process to generate a second signal; and the digital speech signal is subjected to a third delay processing and frequency
  • the domain compression filtering process generates a third signal; the first sound signal, the second signal, and the third signal are combined into a fourth signal as a reverberation signal.
  • the number of delay processing and the number of reflections are the same, and may be more than three times, and may be more.
  • Adding a reverberation signal corresponding to the digital voice signal to the digital voice signal enables the earphone wearer to experience the stereo sound effect and enhance the user experience.
  • Step S206 detecting whether the digital speech signal includes a portion whose amplitude exceeds the first set value, and if yes, executing step S207; if not, executing step S211.
  • Step S207 filtering out the human voice signal in the digital music signal received by the earphone to obtain a background music signal.
  • Step S208 adding a reverberation signal corresponding to the background music signal to the background music signal.
  • the reverberation corresponding to the background music signal may be added to the background music signal.
  • the specific method is similar to the method of adding the reverberation corresponding to the digital voice signal in the digital voice signal, and details are not described herein again.
  • step S209 the background music signal and the digital voice signal are superimposed to obtain a digital superimposed signal.
  • step S210 the digital superimposed signal is subjected to digital-to-analog conversion processing, and then sent to the earphone receiver.
  • step S211 the digital music signal is subjected to digital-to-analog conversion processing, and then sent to the earphone.
  • the present invention also provides a K song processing device for an earphone, and FIG. 3 is a block schematic diagram of an implementation structure of the device.
  • the apparatus 300 includes a receiving module 301, an analog to digital conversion module 302, a filtered vocal signal module 303, a superimposing module 304, and a first digital to analog conversion module 305.
  • the receiving module 301 is configured to receive a voice signal collected by a microphone of the earphone.
  • the analog-to-digital conversion module 302 is configured to perform analog-to-digital conversion processing on the voice signal to obtain a digital voice signal.
  • the ergonomic signal module 303 is configured to filter out a vocal signal in the digital music signal received by the earphone to obtain a background music signal.
  • the superimposing module 304 is configured to superimpose the background music signal and the digital speech signal to obtain a digital superimposed signal.
  • the first digital-to-analog conversion module 305 is configured to perform digital-to-analog conversion processing on the digital superimposed signal to obtain an analog superimposed signal, and send the analog superimposed signal to an earpiece of the earphone.
  • the apparatus 300 further includes a detecting module and a second digital-to-analog conversion module, configured to detect whether the digital voice signal includes detecting whether the digital voice signal includes a portion whose amplitude exceeds the first set value, and if yes, Controlling the ergonomic signal module to filter out the vocal signal in the digital music signal received by the earphone to obtain a background music signal; if not, controlling the second digital-to-analog conversion module to perform digital-to-analog conversion processing on the digital music signal to obtain analog music Signal and send an analog music signal to the handset of the headset.
  • a detecting module and a second digital-to-analog conversion module configured to detect whether the digital voice signal includes detecting whether the digital voice signal includes a portion whose amplitude exceeds the first set value, and if yes, Controlling the ergonomic signal module to filter out the vocal signal in the digital music signal received by the earphone to obtain a background music signal; if not, controlling the second digital-to-analog conversion module to perform digital-to-ana
  • the receiving module includes a first receiving unit and a second receiving unit, where the first receiving unit is configured to receive a voice signal collected by a microphone mounted on a earphone as a first voice signal; the second receiving unit is configured to: Receiving a voice signal collected by a microphone mounted on another earphone as a second voice signal.
  • the first digital-to-analog conversion module includes a digital-to-analog conversion unit and a beamforming unit, and the digital-to-analog conversion unit is configured to respectively perform analog-to-digital conversion processing on the first voice signal and the second voice signal to obtain a corresponding first digital voice signal and a first And a second digital voice signal; the beam forming unit is configured to perform beamforming processing on the first digital voice signal and the second digital voice signal to obtain a digital voice signal.
  • the beamforming unit includes a beamforming subunit and a cancellation subunit for performing beamforming processing on the first digital speech signal and the second digital speech signal; the cancellation subunit is configured to eliminate beamforming processing A portion of the obtained signal whose amplitude exceeds the second set value yields a digital speech signal.
  • the apparatus 300 further includes a first reverberation module and a second reverberation module, where the first reverberation module is configured to add a reverberation signal corresponding to the background music signal to the background music signal;
  • the second reverberation module is configured to add a reverberation signal corresponding to the digital speech signal to the digital speech signal.
  • the present invention also provides an earphone, which in one aspect comprises the aforementioned apparatus 300 for implementing a karaoke function through a headset.
  • the earphone includes a memory and a processor, wherein the memory is for storing instructions that control the processor to perform the aforementioned method of implementing a karaoke function through the earphone.
  • the earphone includes an earphone earpiece 1, a headphone cable 2, a control box 3, and a headphone plug 4, as shown in FIG. 4, wherein the earphone is a digital earphone, and the earphone plug 4 is a USB plug.
  • the earphone further includes a microphone for collecting a voice signal, and if the earphone has a microphone, the microphone can be mounted on the control box 3 or on the earphone 1; if the earphone has two microphones, each microphone can be installed differently Headphones on the handset 1.
  • the processor can be, for example, a DSP digital processor.
  • the memory includes, for example, a ROM (Read Only Memory), a RAM (Random Access Memory), a nonvolatile memory such as a hard disk, and the like.

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • General Health & Medical Sciences (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Otolaryngology (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

L'invention concerne un procédé pour réaliser une fonction de karaoké via un écouteur, et un écouteur. Le procédé consiste à : recevoir un signal vocal collecté par un microphone d'un écouteur ; exécuter un traitement de conversion analogique-numérique sur le signal vocal pour obtenir un signal vocal numérique ; filtrer un signal vocal humain dans un signal musical numérique reçu par l'écouteur pour obtenir un signal musical de fond ; superposer le signal musical de fond et le signal vocal numérique pour obtenir un signal numérique superposé ; et exécuter un traitement de conversion numérique-analogique sur le signal numérique superposé pour obtenir un signal superposé simulé, et envoyer le signal superposé simulé à un récepteur de l'écouteur. Dans la présente invention, un signal vocal chanté, collecté par un microphone, d'un chanteur, et un signal musical de fond obtenu par élimination d'un signal vocal humain dans un signal musical sont superposés par un écouteur, de sorte qu'un effet de karaoké est réalisé en temps réel via l'écouteur et que l'expérience de l'utilisateur est améliorée.
PCT/CN2016/110995 2016-07-11 2016-12-20 Procédé et dispositif pour réaliser une fonction de karaoké via un écouteur, et écouteur WO2018010375A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610541669.0 2016-07-11
CN201610541669.0A CN106210983A (zh) 2016-07-11 2016-07-11 一种通过耳机实现卡拉ok功能的方法、装置及耳机

Publications (1)

Publication Number Publication Date
WO2018010375A1 true WO2018010375A1 (fr) 2018-01-18

Family

ID=57473618

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/110995 WO2018010375A1 (fr) 2016-07-11 2016-12-20 Procédé et dispositif pour réaliser une fonction de karaoké via un écouteur, et écouteur

Country Status (2)

Country Link
CN (1) CN106210983A (fr)
WO (1) WO2018010375A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110460924A (zh) * 2018-05-08 2019-11-15 张德明 一种有线去人声k歌耳机

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106210983A (zh) * 2016-07-11 2016-12-07 歌尔股份有限公司 一种通过耳机实现卡拉ok功能的方法、装置及耳机
CN107018461A (zh) * 2017-03-22 2017-08-04 广东欧珀移动通信有限公司 耳机及其降噪方法
WO2019042459A1 (fr) * 2017-09-04 2019-03-07 深圳市硕泰华科技有限公司 Casque d'écoute numérique
CN107343242A (zh) * 2017-09-04 2017-11-10 深圳市硕泰华科技有限公司 一种数字耳机
CN107742517A (zh) * 2017-10-10 2018-02-27 广东中星电子有限公司 一种对异常声音的检测方法及装置
CN109729448A (zh) * 2017-10-27 2019-05-07 北京金锐德路科技有限公司 脖戴式语音交互耳机的语音控制优化方法和装置
CN107911777B (zh) * 2017-11-15 2019-12-10 维沃移动通信有限公司 一种耳返功能的处理方法、装置及移动终端
CN109905789A (zh) * 2017-12-10 2019-06-18 张德明 一种k歌话筒
CN110544487A (zh) * 2019-10-18 2019-12-06 珠海格力智能装备有限公司 基于麦克风的语音检测方法及装置

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN201479213U (zh) * 2009-07-30 2010-05-19 盟讯实业股份有限公司 具伴唱功能的移动电话
CN201910901U (zh) * 2010-11-25 2011-07-27 刘梅 一种多功能卡拉ok耳机
CN103686535A (zh) * 2013-12-02 2014-03-26 Tcl集团股份有限公司 自动识别耳机麦克风与卡拉ok设备的终端及识别方法
CN106210983A (zh) * 2016-07-11 2016-12-07 歌尔股份有限公司 一种通过耳机实现卡拉ok功能的方法、装置及耳机

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7262358B2 (en) * 2003-11-03 2007-08-28 Supply Unlimited, Inc. Portable voice studio system and method
JP2012252240A (ja) * 2011-06-06 2012-12-20 Sony Corp 再生装置、信号処理装置、信号処理方法
US9892729B2 (en) * 2013-05-07 2018-02-13 Qualcomm Incorporated Method and apparatus for controlling voice activation
CN104866067A (zh) * 2015-05-11 2015-08-26 联想(北京)有限公司 一种低功耗控制方法及电子设备
CN105245991A (zh) * 2015-10-26 2016-01-13 汪国印 一种头戴式k歌耳机及k歌处理方法

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN201479213U (zh) * 2009-07-30 2010-05-19 盟讯实业股份有限公司 具伴唱功能的移动电话
CN201910901U (zh) * 2010-11-25 2011-07-27 刘梅 一种多功能卡拉ok耳机
CN103686535A (zh) * 2013-12-02 2014-03-26 Tcl集团股份有限公司 自动识别耳机麦克风与卡拉ok设备的终端及识别方法
CN106210983A (zh) * 2016-07-11 2016-12-07 歌尔股份有限公司 一种通过耳机实现卡拉ok功能的方法、装置及耳机

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110460924A (zh) * 2018-05-08 2019-11-15 张德明 一种有线去人声k歌耳机

Also Published As

Publication number Publication date
CN106210983A (zh) 2016-12-07

Similar Documents

Publication Publication Date Title
WO2018010375A1 (fr) Procédé et dispositif pour réaliser une fonction de karaoké via un écouteur, et écouteur
US8184823B2 (en) Headphone device, sound reproduction system, and sound reproduction method
KR100638960B1 (ko) 음향 지향 방법 및 장치
JP5654513B2 (ja) 音識別方法および装置
JP4984683B2 (ja) 放収音装置
JP4752403B2 (ja) 拡声システム
WO2011118595A1 (fr) Casque d'écoute
JP2008543143A (ja) 音響変換器のアセンブリ、システムおよび方法
KR20090065255A (ko) 지향성 음향 생성 장치 및 그를 이용한 휴대용 단말기
JP2008543144A (ja) 音響信号用装置、システム、方法
CN114051738A (zh) 可操纵扬声器阵列、系统及其方法
JP2007274131A (ja) 拡声システム及び集音装置
JP2008017469A (ja) 音声処理システムおよび方法
JP2018518893A (ja) 状況認識を備えたスポーツヘッドフォン
JP4835151B2 (ja) オーディオシステム
JP2010177891A (ja) スピーカアレイ装置、信号処理方法およびプログラム
US10643597B2 (en) Method and device for generating and providing an audio signal for enhancing a hearing impression at live events
US10885896B2 (en) Real-time detection of feedforward instability
JP2020102834A (ja) バイノーラル補聴器でのビームフォーミングのための方法
JP2011155500A (ja) モニタ制御装置及び音響システム
JP2007318521A (ja) 放収音装置
JP2006246312A (ja) 拡声システム
JPH05316588A (ja) 実耳指向特性制御装置
Goldin Close talking autodirective dual microphone
JP4530423B2 (ja) 立体音用マイクロフォンおよび立体音再生装置。

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16908697

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16908697

Country of ref document: EP

Kind code of ref document: A1