WO2018010375A1 - Procédé et dispositif pour réaliser une fonction de karaoké via un écouteur, et écouteur - Google Patents
Procédé et dispositif pour réaliser une fonction de karaoké via un écouteur, et écouteur Download PDFInfo
- Publication number
- WO2018010375A1 WO2018010375A1 PCT/CN2016/110995 CN2016110995W WO2018010375A1 WO 2018010375 A1 WO2018010375 A1 WO 2018010375A1 CN 2016110995 W CN2016110995 W CN 2016110995W WO 2018010375 A1 WO2018010375 A1 WO 2018010375A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- signal
- digital
- voice signal
- earphone
- analog
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 54
- 238000006243 chemical reaction Methods 0.000 claims abstract description 54
- 238000001914 filtration Methods 0.000 claims abstract description 15
- 230000006870 function Effects 0.000 claims description 25
- 230000001755 vocal effect Effects 0.000 claims description 21
- 230000008569 process Effects 0.000 claims description 15
- 230000000694 effects Effects 0.000 abstract description 14
- 230000009467 reduction Effects 0.000 description 10
- 230000004044 response Effects 0.000 description 9
- 230000005236 sound signal Effects 0.000 description 7
- 230000006835 compression Effects 0.000 description 3
- 238000007906 compression Methods 0.000 description 3
- 230000002787 reinforcement Effects 0.000 description 3
- 238000013459 approach Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 210000005069 ears Anatomy 0.000 description 2
- 230000035945 sensitivity Effects 0.000 description 2
- OXSYGCRLQCGSAQ-UHFFFAOYSA-N CC1CCC2N(C1)CC3C4(O)CC5C(CCC6C(O)C(O)CCC56C)C4(O)CC(O)C3(O)C2(C)O Chemical compound CC1CCC2N(C1)CC3C4(O)CC5C(CCC6C(O)C(O)CCC56C)C4(O)CC(O)C3(O)C2(C)O OXSYGCRLQCGSAQ-UHFFFAOYSA-N 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 230000002238 attenuated effect Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000014509 gene expression Effects 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
- 230000005855 radiation Effects 0.000 description 1
- 238000004904 shortening Methods 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/02—Circuits for transducers, loudspeakers or microphones for preventing acoustic reaction, i.e. acoustic oscillatory feedback
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/72—Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
- H04M1/724—User interfaces specially adapted for cordless or mobile telephones
- H04M1/72403—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
- H04M1/7243—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/72—Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
- H04M1/724—User interfaces specially adapted for cordless or mobile telephones
- H04M1/72403—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
- H04M1/7243—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
- H04M1/72433—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages for voice messaging, e.g. dictaphones
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L2021/02087—Noise filtering the noise being separate speech, e.g. cocktail party
Definitions
- the present invention relates to the field of earphone technology, and more particularly, to a method, device and earphone for implementing a karaoke function through an earphone.
- a method for implementing a karaoke function through a headset including:
- the method further includes:
- the digital voice signal includes a portion whose amplitude exceeds the first set value, and if so, filtering out the vocal signal in the digital music signal received by the earphone to obtain a background music signal; if not, the The digital music signal is subjected to digital-to-analog conversion processing to obtain an analog music signal, and the analog music signal is transmitted to the earpiece of the earphone.
- the method further includes:
- the voice signal collected by the microphone of the receiving headset includes:
- performing beamforming processing on the first digital voice signal and the second digital voice signal, and obtaining the digital voice signal includes:
- the digital speech signal is obtained by eliminating a portion of the signal obtained by the beamforming process that has an amplitude exceeding a second set value.
- an apparatus for implementing a karaoke function through a headset including:
- a receiving module configured to receive a voice signal collected by a microphone of the earphone
- An analog-to-digital conversion module configured to perform analog-to-digital conversion processing on the voice signal to obtain a digital voice signal
- a superimposing module configured to superimpose the background music signal and the digital speech signal to obtain a digital superimposed signal
- the first digital-to-analog conversion module is configured to perform digital-to-analog conversion processing on the digital superimposed signal to obtain an analog superimposed signal, and send the analog superimposed signal to an earpiece of the earphone.
- the device further includes a detection module and a second digital-to-analog conversion module, the detection module is configured to detect whether the digital voice signal includes detecting whether the digital voice signal includes an amplitude exceeding a first setting. a portion of the value, if yes, controlling the filtered vocal signal module to filter out a vocal signal in the digital music signal received by the earphone to obtain a background music signal; if not, controlling the second digital to analog conversion module And performing digital-to-analog conversion processing on the digital music signal to obtain an analog music signal, and transmitting the analog music signal to an earpiece of the earphone.
- the detection module is configured to detect whether the digital voice signal includes detecting whether the digital voice signal includes an amplitude exceeding a first setting. a portion of the value, if yes, controlling the filtered vocal signal module to filter out a vocal signal in the digital music signal received by the earphone to obtain a background music signal; if not, controlling the second digital to analog conversion module And performing digital-to-analog conversion processing on the digital music
- the device further includes:
- a first reverberation module configured to add a reverberation signal corresponding to the background music signal to the background music signal
- a second plus reverberation module configured to add a reverberation signal corresponding to the digital voice signal to the digital voice signal.
- the receiving module includes:
- a first receiving unit configured to receive a voice signal collected by a microphone mounted on a earphone as a first voice signal
- a second receiving unit configured to receive a voice signal collected by a microphone mounted on another earphone as a second voice signal
- the first digital to analog conversion module includes:
- a digital-to-analog conversion unit configured to perform analog-to-digital conversion processing on the first voice signal and the second voice signal, respectively, to obtain a corresponding first digital voice signal and a second digital voice signal;
- a beam forming unit configured to perform beamforming processing on the first digital voice signal and the second digital voice signal to obtain the digital voice signal.
- the beam forming unit includes:
- a beamforming subunit configured to perform beamforming processing on the first digital voice signal and the second digital voice signal
- a eliminating subunit configured to cancel a portion of the signal obtained by the beamforming process and having an amplitude exceeding a second set value, to obtain the digital voice signal
- an earphone comprising the aforementioned means for realizing a karaoke function through a headphone.
- an earphone comprising a microphone, a processor for collecting a voice signal, and a memory for storing the instruction for storing an instruction
- the processor operates to perform the aforementioned method of implementing a karaoke function through a headset.
- the inventors of the present invention found that in the prior art, there is a problem that the karaoke software cannot realize the karaoke effect in real time.
- the karaoke effect can be realized only by the earphone in real time, thereby improving the user experience. Therefore, the technical task to be achieved by the present invention or the technical problem to be solved is not thought of or expected by those skilled in the art, so the present invention is a new technical solution.
- FIG. 1 is a flow chart of an embodiment of a method of implementing a karaoke function through a headset in accordance with the present invention
- FIG. 2 is a flow chart of another embodiment of a method for implementing a karaoke function through a headset in accordance with the present invention
- FIG. 3 is a block schematic diagram of an implementation structure of an apparatus for implementing a karaoke function through an earphone according to the present invention
- Figure 4 is a schematic illustration of an embodiment of an earphone in accordance with the present invention.
- the invention solves the problem that the existing K-song software can not realize the karaoke effect in real time, and provides a new technical solution for realizing the karaoke function through the earphone.
- FIG. 1 is a flow chart of an embodiment of a method of implementing a karaoke function through a headset of the present invention.
- the method of the invention comprises the following steps:
- Step S101 receiving a voice signal collected by a microphone.
- the voice signal collected by the microphone is received for subsequent processing steps.
- Step S102 performing analog-to-digital conversion processing on the voice signal to obtain a digital voice signal.
- Converting the analog voice signal into a digital voice signal can realize the karaoke function through the DSP processor in the earphone, while the analog signal cannot realize the karaoke function through the processor.
- Step S103 filtering out the human voice signal in the digital music signal received by the earphone to obtain a background music signal.
- the characteristics of the original vocal are roughly divided into two types: the sound image position of the human voice is in the center of the entire sound field (the left and right channels are balancedly distributed); the sound frequency of the human voice is concentrated in the intermediate frequency and the high frequency portion. Therefore, the vocal signal in the digital music signal received by the earphone is filtered out, specifically, the background music signal can be obtained by filtering out the equal sound of the left and right channels in the digital music signal and the frequency is concentrated in the intermediate frequency and the high frequency part. .
- the music file played by the electronic device includes two tracks: the original vocal track and the vocal track, and the background music is obtained by switching the audio track by controlling the electronic device connected to the earphone to switch between the original vocal and the accompaniment. signal.
- Step S104 superimposing the background music signal and the digital voice signal to obtain a digital superposition signal.
- step S105 the digital superimposed signal is subjected to digital-to-analog conversion processing, and then sent to the earphone.
- the digital superimposed signal needs to be digital-to-analog converted to obtain an analog superimposed signal and sent to the earpiece.
- the headset realizes the karaoke effect in real time.
- the opening of the karaoke function can be controlled by a button on the earphone, or the earphone can be automatically turned on.
- the method of the present invention further includes: detecting whether the digital voice signal includes a portion whose amplitude exceeds the first set value, and if so, performing step S103; if not, digitizing the digital music signal The analog conversion process obtains an analog music signal and sends the analog music signal to the earpiece of the earphone.
- the amplitude of the singing signal is the largest, so by detecting whether the digital voice signal contains a portion whose amplitude exceeds the first set value, It is judged whether the wearer is singing, and if so, the karaoke function is turned on, and if not, the music signal received by the earphone is played. In this way, the function of automatically turning on karaoke is realized.
- the present invention can also implement a karaoke function through an earphone having two microphones
- FIG. 2 is another embodiment of a method for implementing a karaoke function through a headphone according to the present invention. flow chart.
- the method includes the following steps:
- Step S201 receiving a first voice signal and a second voice signal.
- the earphones can also realize the functions of active noise reduction and call noise reduction through the two microphones, wherein the active noise reduction is to process the inverse of the microphone signal collected by the microphone.
- the active noise reduction is to process the inverse of the microphone signal collected by the microphone.
- the call noise reduction may specifically eliminate the noise signal in the first voice signal, and send the first voice signal that cancels the noise signal to the earphone Connected electronic devices.
- two microphones collect the voice of the person and the surrounding noise.
- the mouth is located in the middle of the ears in the vertical direction.
- the distance from the first microphone to the mouth is the same.
- the distance between the two microphones and the mouth is equal, so that the time when the first microphone and the second microphone receive the same sound signal from the human mouth is the same; in general, the distance between the first microphone and the noise source is the same.
- the distance between the two microphones to the noise source is not equal, thus causing a time difference between the time when the two microphones receive the same noise signal from the noise source.
- the distance from the first microphone to the mouth is not equal to the distance from the second microphone to the mouth.
- the mouth is offset from the vertical direction of the center point of the ears by a certain angle, and the mouth is in the angular range.
- the time difference between the two microphones receiving the sound signal from the mouth should not exceed the set value ⁇ T.
- the microphone signal received from the first microphone is extracted from the microphone signal received by the second microphone, but the time difference is not greater than the set value ⁇ T component, as the sound signal emitted by the mouth;
- the microphone signal received by the microphone extracts a component of the same signal as the microphone signal received by the second microphone but has a received time difference greater than the set value ⁇ T, and as the noise signal, inverts the noise signal to obtain the noise.
- the noise reduction signal with the same signal amplitude and opposite phase is superimposed and processed by the noise reduction signal and transmitted to the electronic device connected to the earphone. In this way, the headset can achieve call noise reduction through two microphones.
- Earphones with two microphones can also be implemented while enabling karaoke Active noise reduction and call noise reduction.
- Step S202 performing analog-to-digital conversion processing on the first speech signal and the second speech signal, respectively, to obtain a first digital speech signal and a second digital speech signal.
- the first speech signal is subjected to analog-to-digital conversion processing to obtain a first digital speech signal
- the second speech signal is subjected to analog-to-digital conversion processing to obtain a second digital speech signal.
- Step S203 performing beamforming processing on the first digital voice signal and the second digital voice signal.
- the beamforming process is performed on the first digital voice signal and the second digital voice signal, so that the voice sung by the wearer in the voice signal collected by the microphone can be made clearer, wherein the beamforming process can mainly suppress the first voice signal and the second voice signal.
- the sound signal in the left and right direction during the wearing of the earphone outputs a sound signal in the front and rear direction, that is, the sound of the wearer singing.
- beamforming refers to a method of forming spatial directivity by processing (eg, weighting, delay, summation, etc.) of each array element of a multi-array array arranged with a certain geometric shape (straight line, cylinder, arc, etc.).
- a wide-side microphone array refers to a series of microphones arranged in a direction perpendicular to the direction of the sound waves to be picked. d is the spacing of the two microphone elements in the array. The sound from the wide side of the array is usually the sound to pick up.
- the wide-edge array can be implemented by basic processing, and the microphones in the array are simply added. The disadvantage of this type of array is that it only attenuates the sound from the sides of the array.
- the rear response is always consistent with the front response because the array is axisymmetric and cannot distinguish between sound pressure waves arriving from the front and from the rear to the microphone.
- the minimum response occurs at 90° and 270°.
- the signal attenuation at these points is highly dependent on the frequency.
- the half wavelength of the incident frequency approaches the pitch of the microphone, the response approaches cancellation completely.
- the frequency is about 2.3 kHz (343 m/s ⁇ (0.075 m x 2) ⁇ 2.3 kHz)
- the response is completely cancelled.
- the frequencies will alias and the polar coordinate response will begin to show zero at other angles. At this point, the side attenuation begins to decrease again.
- the omnidirectional microphone since the omnidirectional microphone is mostly used in the earphone, it is capable of equally responding to sounds from all directions.
- the two microphones can be configured in an array to form a directional response or beam pattern. Beamformed microphone arrays are designed to be more sensitive to sound from one or more specific directions. Thus, by the beamforming process, the noise signals in the first speech signal and the second speech signal can be filtered out to obtain a clearer singing voice signal.
- Step S204 the portion of the signal obtained by the beamforming process whose amplitude exceeds the second set value is removed, and a digital voice signal is obtained.
- the portion of the signal obtained by removing the beamforming process that exceeds the second set value is a howling, and since the microphone is mounted on the earpiece of the earphone, a harsh whistling may occur, in order to prevent the human ear from hearing howling, resulting in a comparison.
- a poor user experience can eliminate the portion of the signal resulting from the beamforming process that exceeds the set value.
- the conditions for howling are required to satisfy three points: the microphone and the speaker are used at the same time; the sound reproduced by the sound system can be transmitted to the microphone through the space; the sound energy emitted by the speaker is large enough, and the pickup sensitivity of the microphone is sufficiently high.
- the microphone should be non-directional.
- the directional microphone especially the sharp directional microphone
- the adjustment distance has little effect on increasing the sound volume and preventing whistling.
- the sound reinforcement system is easy to whistle is not directly related to the sensitivity of the microphone.
- Only the high-sensitivity microphones are sharply directional, which is prone to whistling. Shortening the distance between the sounding device and the listener can actually increase the loudness of the sound. The total gain of the system can be appropriately reduced. If it is supplemented by a near-field speaker with wide directivity, the microphone can be screamed away from the far point.
- the farther the microphone is from the speaker the closer the speaker is to the listener.
- the microphone should be placed on the back of the speaker's radiation direction. If the microphone is likely to be held around, the speaker should be placed where the microphone is not close enough.
- the frequency equalizer can be used to compensate the sound reinforcement curve, and the frequency response of the system is adjusted to an approximate straight line, so that the gains of the respective frequency bands are basically the same, and the sound transmission gain of the system is improved.
- the audio feedback automatic suppression device is commonly used.
- This device can automatically track the feedback point frequency, automatically adjust the Q-value bandwidth, automatically eliminate the sound feedback and maximize the protection. Sound quality.
- the principle is to suppress howling by notching.
- Sabine's FBX series feedback suppressor which is a 9-segment narrow-band automatic compression device controlled by a microcomputer, can better distinguish between feedback self-excited signals and music signals, and can react quickly when the system is self-excited.
- set a very narrow digital filter at the feedback frequency the notch depth is also automatically set, the filtering bandwidth is only 1/3 octave, such a narrow notch band, almost no loudness and The tone has an effect.
- Inverting cancellation prevents self-excitation in high frequency amplification circuits.
- two microphones of the same specification can be used to respectively pick up the direct sound and the reflected sound, and the inverted sound circuit cancels the phase of the reflected sound signal before entering the power amplifier, which can effectively prevent the howling self-excitation.
- the self-excited howling of the sound reinforcement system, the feedback loop is positive feedback, if the microphone signal is phase-modulated, it will destroy the self-excited phase condition, thus preventing the self-excited howling of the system. It is shown that the stability is best when the phase deviation value is 140°; and the higher the frequency of modulation, the better the stability of the system. In order to prevent the processed sound quality from being too distorted, the maximum allowable value of the phase modulation frequency is 4 Hz.
- the method for removing the howling may also be to eliminate the portion of the signal obtained by the beamforming process whose amplitude exceeds the second set value, so that the generated howling is filtered out and played out without the earphone. This makes the human ear not hear the howling and enhance the user experience.
- Step S205 adding a reverberation signal corresponding to the digital voice signal to the digital voice signal.
- the reverberation signal corresponding to the digital voice signal may be added to the digital voice signal.
- the method for obtaining the reverberation signal of the digital speech signal can be:
- the digital speech signal is subjected to a first delay processing to generate a first signal; the digital speech signal is subjected to a second delay processing and a frequency domain compression filtering process to generate a second signal; and the digital speech signal is subjected to a third delay processing and frequency
- the domain compression filtering process generates a third signal; the first sound signal, the second signal, and the third signal are combined into a fourth signal as a reverberation signal.
- the number of delay processing and the number of reflections are the same, and may be more than three times, and may be more.
- Adding a reverberation signal corresponding to the digital voice signal to the digital voice signal enables the earphone wearer to experience the stereo sound effect and enhance the user experience.
- Step S206 detecting whether the digital speech signal includes a portion whose amplitude exceeds the first set value, and if yes, executing step S207; if not, executing step S211.
- Step S207 filtering out the human voice signal in the digital music signal received by the earphone to obtain a background music signal.
- Step S208 adding a reverberation signal corresponding to the background music signal to the background music signal.
- the reverberation corresponding to the background music signal may be added to the background music signal.
- the specific method is similar to the method of adding the reverberation corresponding to the digital voice signal in the digital voice signal, and details are not described herein again.
- step S209 the background music signal and the digital voice signal are superimposed to obtain a digital superimposed signal.
- step S210 the digital superimposed signal is subjected to digital-to-analog conversion processing, and then sent to the earphone receiver.
- step S211 the digital music signal is subjected to digital-to-analog conversion processing, and then sent to the earphone.
- the present invention also provides a K song processing device for an earphone, and FIG. 3 is a block schematic diagram of an implementation structure of the device.
- the apparatus 300 includes a receiving module 301, an analog to digital conversion module 302, a filtered vocal signal module 303, a superimposing module 304, and a first digital to analog conversion module 305.
- the receiving module 301 is configured to receive a voice signal collected by a microphone of the earphone.
- the analog-to-digital conversion module 302 is configured to perform analog-to-digital conversion processing on the voice signal to obtain a digital voice signal.
- the ergonomic signal module 303 is configured to filter out a vocal signal in the digital music signal received by the earphone to obtain a background music signal.
- the superimposing module 304 is configured to superimpose the background music signal and the digital speech signal to obtain a digital superimposed signal.
- the first digital-to-analog conversion module 305 is configured to perform digital-to-analog conversion processing on the digital superimposed signal to obtain an analog superimposed signal, and send the analog superimposed signal to an earpiece of the earphone.
- the apparatus 300 further includes a detecting module and a second digital-to-analog conversion module, configured to detect whether the digital voice signal includes detecting whether the digital voice signal includes a portion whose amplitude exceeds the first set value, and if yes, Controlling the ergonomic signal module to filter out the vocal signal in the digital music signal received by the earphone to obtain a background music signal; if not, controlling the second digital-to-analog conversion module to perform digital-to-analog conversion processing on the digital music signal to obtain analog music Signal and send an analog music signal to the handset of the headset.
- a detecting module and a second digital-to-analog conversion module configured to detect whether the digital voice signal includes detecting whether the digital voice signal includes a portion whose amplitude exceeds the first set value, and if yes, Controlling the ergonomic signal module to filter out the vocal signal in the digital music signal received by the earphone to obtain a background music signal; if not, controlling the second digital-to-analog conversion module to perform digital-to-ana
- the receiving module includes a first receiving unit and a second receiving unit, where the first receiving unit is configured to receive a voice signal collected by a microphone mounted on a earphone as a first voice signal; the second receiving unit is configured to: Receiving a voice signal collected by a microphone mounted on another earphone as a second voice signal.
- the first digital-to-analog conversion module includes a digital-to-analog conversion unit and a beamforming unit, and the digital-to-analog conversion unit is configured to respectively perform analog-to-digital conversion processing on the first voice signal and the second voice signal to obtain a corresponding first digital voice signal and a first And a second digital voice signal; the beam forming unit is configured to perform beamforming processing on the first digital voice signal and the second digital voice signal to obtain a digital voice signal.
- the beamforming unit includes a beamforming subunit and a cancellation subunit for performing beamforming processing on the first digital speech signal and the second digital speech signal; the cancellation subunit is configured to eliminate beamforming processing A portion of the obtained signal whose amplitude exceeds the second set value yields a digital speech signal.
- the apparatus 300 further includes a first reverberation module and a second reverberation module, where the first reverberation module is configured to add a reverberation signal corresponding to the background music signal to the background music signal;
- the second reverberation module is configured to add a reverberation signal corresponding to the digital speech signal to the digital speech signal.
- the present invention also provides an earphone, which in one aspect comprises the aforementioned apparatus 300 for implementing a karaoke function through a headset.
- the earphone includes a memory and a processor, wherein the memory is for storing instructions that control the processor to perform the aforementioned method of implementing a karaoke function through the earphone.
- the earphone includes an earphone earpiece 1, a headphone cable 2, a control box 3, and a headphone plug 4, as shown in FIG. 4, wherein the earphone is a digital earphone, and the earphone plug 4 is a USB plug.
- the earphone further includes a microphone for collecting a voice signal, and if the earphone has a microphone, the microphone can be mounted on the control box 3 or on the earphone 1; if the earphone has two microphones, each microphone can be installed differently Headphones on the handset 1.
- the processor can be, for example, a DSP digital processor.
- the memory includes, for example, a ROM (Read Only Memory), a RAM (Random Access Memory), a nonvolatile memory such as a hard disk, and the like.
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- General Health & Medical Sciences (AREA)
- Business, Economics & Management (AREA)
- General Business, Economics & Management (AREA)
- Computer Networks & Wireless Communication (AREA)
- Otolaryngology (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
L'invention concerne un procédé pour réaliser une fonction de karaoké via un écouteur, et un écouteur. Le procédé consiste à : recevoir un signal vocal collecté par un microphone d'un écouteur ; exécuter un traitement de conversion analogique-numérique sur le signal vocal pour obtenir un signal vocal numérique ; filtrer un signal vocal humain dans un signal musical numérique reçu par l'écouteur pour obtenir un signal musical de fond ; superposer le signal musical de fond et le signal vocal numérique pour obtenir un signal numérique superposé ; et exécuter un traitement de conversion numérique-analogique sur le signal numérique superposé pour obtenir un signal superposé simulé, et envoyer le signal superposé simulé à un récepteur de l'écouteur. Dans la présente invention, un signal vocal chanté, collecté par un microphone, d'un chanteur, et un signal musical de fond obtenu par élimination d'un signal vocal humain dans un signal musical sont superposés par un écouteur, de sorte qu'un effet de karaoké est réalisé en temps réel via l'écouteur et que l'expérience de l'utilisateur est améliorée.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610541669.0 | 2016-07-11 | ||
CN201610541669.0A CN106210983A (zh) | 2016-07-11 | 2016-07-11 | 一种通过耳机实现卡拉ok功能的方法、装置及耳机 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2018010375A1 true WO2018010375A1 (fr) | 2018-01-18 |
Family
ID=57473618
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2016/110995 WO2018010375A1 (fr) | 2016-07-11 | 2016-12-20 | Procédé et dispositif pour réaliser une fonction de karaoké via un écouteur, et écouteur |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN106210983A (fr) |
WO (1) | WO2018010375A1 (fr) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110460924A (zh) * | 2018-05-08 | 2019-11-15 | 张德明 | 一种有线去人声k歌耳机 |
Families Citing this family (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106210983A (zh) * | 2016-07-11 | 2016-12-07 | 歌尔股份有限公司 | 一种通过耳机实现卡拉ok功能的方法、装置及耳机 |
CN107018461A (zh) * | 2017-03-22 | 2017-08-04 | 广东欧珀移动通信有限公司 | 耳机及其降噪方法 |
WO2019042459A1 (fr) * | 2017-09-04 | 2019-03-07 | 深圳市硕泰华科技有限公司 | Casque d'écoute numérique |
CN107343242A (zh) * | 2017-09-04 | 2017-11-10 | 深圳市硕泰华科技有限公司 | 一种数字耳机 |
CN107742517A (zh) * | 2017-10-10 | 2018-02-27 | 广东中星电子有限公司 | 一种对异常声音的检测方法及装置 |
CN109729448A (zh) * | 2017-10-27 | 2019-05-07 | 北京金锐德路科技有限公司 | 脖戴式语音交互耳机的语音控制优化方法和装置 |
CN107911777B (zh) * | 2017-11-15 | 2019-12-10 | 维沃移动通信有限公司 | 一种耳返功能的处理方法、装置及移动终端 |
CN109905789A (zh) * | 2017-12-10 | 2019-06-18 | 张德明 | 一种k歌话筒 |
CN110544487A (zh) * | 2019-10-18 | 2019-12-06 | 珠海格力智能装备有限公司 | 基于麦克风的语音检测方法及装置 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN201479213U (zh) * | 2009-07-30 | 2010-05-19 | 盟讯实业股份有限公司 | 具伴唱功能的移动电话 |
CN201910901U (zh) * | 2010-11-25 | 2011-07-27 | 刘梅 | 一种多功能卡拉ok耳机 |
CN103686535A (zh) * | 2013-12-02 | 2014-03-26 | Tcl集团股份有限公司 | 自动识别耳机麦克风与卡拉ok设备的终端及识别方法 |
CN106210983A (zh) * | 2016-07-11 | 2016-12-07 | 歌尔股份有限公司 | 一种通过耳机实现卡拉ok功能的方法、装置及耳机 |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7262358B2 (en) * | 2003-11-03 | 2007-08-28 | Supply Unlimited, Inc. | Portable voice studio system and method |
JP2012252240A (ja) * | 2011-06-06 | 2012-12-20 | Sony Corp | 再生装置、信号処理装置、信号処理方法 |
US9892729B2 (en) * | 2013-05-07 | 2018-02-13 | Qualcomm Incorporated | Method and apparatus for controlling voice activation |
CN104866067A (zh) * | 2015-05-11 | 2015-08-26 | 联想(北京)有限公司 | 一种低功耗控制方法及电子设备 |
CN105245991A (zh) * | 2015-10-26 | 2016-01-13 | 汪国印 | 一种头戴式k歌耳机及k歌处理方法 |
-
2016
- 2016-07-11 CN CN201610541669.0A patent/CN106210983A/zh active Pending
- 2016-12-20 WO PCT/CN2016/110995 patent/WO2018010375A1/fr active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN201479213U (zh) * | 2009-07-30 | 2010-05-19 | 盟讯实业股份有限公司 | 具伴唱功能的移动电话 |
CN201910901U (zh) * | 2010-11-25 | 2011-07-27 | 刘梅 | 一种多功能卡拉ok耳机 |
CN103686535A (zh) * | 2013-12-02 | 2014-03-26 | Tcl集团股份有限公司 | 自动识别耳机麦克风与卡拉ok设备的终端及识别方法 |
CN106210983A (zh) * | 2016-07-11 | 2016-12-07 | 歌尔股份有限公司 | 一种通过耳机实现卡拉ok功能的方法、装置及耳机 |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110460924A (zh) * | 2018-05-08 | 2019-11-15 | 张德明 | 一种有线去人声k歌耳机 |
Also Published As
Publication number | Publication date |
---|---|
CN106210983A (zh) | 2016-12-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2018010375A1 (fr) | Procédé et dispositif pour réaliser une fonction de karaoké via un écouteur, et écouteur | |
US8184823B2 (en) | Headphone device, sound reproduction system, and sound reproduction method | |
KR100638960B1 (ko) | 음향 지향 방법 및 장치 | |
JP5654513B2 (ja) | 音識別方法および装置 | |
JP4984683B2 (ja) | 放収音装置 | |
JP4752403B2 (ja) | 拡声システム | |
WO2011118595A1 (fr) | Casque d'écoute | |
JP2008543143A (ja) | 音響変換器のアセンブリ、システムおよび方法 | |
KR20090065255A (ko) | 지향성 음향 생성 장치 및 그를 이용한 휴대용 단말기 | |
JP2008543144A (ja) | 音響信号用装置、システム、方法 | |
CN114051738A (zh) | 可操纵扬声器阵列、系统及其方法 | |
JP2007274131A (ja) | 拡声システム及び集音装置 | |
JP2008017469A (ja) | 音声処理システムおよび方法 | |
JP2018518893A (ja) | 状況認識を備えたスポーツヘッドフォン | |
JP4835151B2 (ja) | オーディオシステム | |
JP2010177891A (ja) | スピーカアレイ装置、信号処理方法およびプログラム | |
US10643597B2 (en) | Method and device for generating and providing an audio signal for enhancing a hearing impression at live events | |
US10885896B2 (en) | Real-time detection of feedforward instability | |
JP2020102834A (ja) | バイノーラル補聴器でのビームフォーミングのための方法 | |
JP2011155500A (ja) | モニタ制御装置及び音響システム | |
JP2007318521A (ja) | 放収音装置 | |
JP2006246312A (ja) | 拡声システム | |
JPH05316588A (ja) | 実耳指向特性制御装置 | |
Goldin | Close talking autodirective dual microphone | |
JP4530423B2 (ja) | 立体音用マイクロフォンおよび立体音再生装置。 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 16908697 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 16908697 Country of ref document: EP Kind code of ref document: A1 |