WO2017071045A1 - 录音方法及装置 - Google Patents

录音方法及装置 Download PDF

Info

Publication number
WO2017071045A1
WO2017071045A1 PCT/CN2015/098845 CN2015098845W WO2017071045A1 WO 2017071045 A1 WO2017071045 A1 WO 2017071045A1 CN 2015098845 W CN2015098845 W CN 2015098845W WO 2017071045 A1 WO2017071045 A1 WO 2017071045A1
Authority
WO
WIPO (PCT)
Prior art keywords
signal
channel
sound
channel signal
sound signal
Prior art date
Application number
PCT/CN2015/098845
Other languages
English (en)
French (fr)
Inventor
史润宇
熊达蔚
李巍山
Original Assignee
小米科技有限责任公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 小米科技有限责任公司 filed Critical 小米科技有限责任公司
Priority to KR1020167004126A priority Critical patent/KR101848458B1/ko
Priority to JP2017547041A priority patent/JP6364130B2/ja
Priority to RU2016107764A priority patent/RU2635838C2/ru
Priority to MX2016002669A priority patent/MX361094B/es
Publication of WO2017071045A1 publication Critical patent/WO2017071045A1/zh

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/005Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/307Frequency adjustment, e.g. tone control
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/20Arrangements for obtaining desired frequency or directional characteristics
    • H04R1/32Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
    • H04R1/326Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only for microphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2430/00Signal processing covered by H04R, not provided for in its groups
    • H04R2430/20Processing of the output signals of the acoustic transducers of an array for obtaining a desired directivity characteristic
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2499/00Aspects covered by H04R or H04S not otherwise provided for in their subgroups
    • H04R2499/10General applications
    • H04R2499/11Transducers incorporated or for use in hand-held devices, e.g. mobile phones, PDA's, camera's
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/01Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/05Generation or adaptation of centre channel in multi-channel audio systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/07Generation or adaptation of the Low Frequency Effect [LFE] channel, e.g. distribution or signal processing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/09Electronic reduction of distortion of stereophonic sound systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/13Aspects of volume control, not necessarily automatic, in stereophonic sound systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/15Aspects of sound capture and related signal processing for recording or reproduction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S5/00Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation 
    • H04S5/005Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation  of the pseudo five- or more-channel type, e.g. virtual surround

Definitions

  • the present disclosure relates to the field of multimedia processing, and in particular, to a recording method and apparatus.
  • a microphone is provided on a mobile terminal such as a smartphone, tablet or palmtop. The user can record with the microphone.
  • the recorded audio data is mono data or two-channel data, and the recorded audio data has a poor sound field range and presence.
  • the present disclosure provides a recording method and apparatus.
  • the technical solution is as follows:
  • a recording method for a mobile terminal provided with three microphones comprising:
  • the center channel signal, the left channel signal, the right channel signal, the rear left channel signal, the rear right channel signal, and the subwoofer channel signal are combined to obtain a 5.1 channel sound signal.
  • the above three microphones include a first microphone in a 5.1 channel center channel direction, a 5.1 channel rear left channel direction second microphone, and a 5.1 channel rear right channel direction.
  • the third microphone is calculated according to the three-way sound signal, and the center channel signal, the left channel signal, the right channel signal, the rear left channel signal, and the rear right channel signal in the 5.1 channel are calculated, including:
  • the first sound signal collected by the first microphone is used as a center channel signal
  • the second sound signal collected by the second microphone is used as a rear left channel signal
  • the third sound signal collected by the third microphone is used as a rear right channel signal
  • the fourth sound signal is used as a left channel signal
  • the fifth sound signal is obtained by weighting and averaging the amplitudes of the first sound signal and the third sound signal at the same time, and the fifth sound signal is used as the right channel signal.
  • the three microphones are dispersedly arranged with respect to the origin; each of the 5.1 channels has two microphones closest to each other in the three microphones; and 5.1 channels are calculated according to the three sound signals.
  • the center channel signal, the left channel signal, the right channel signal, the rear left channel signal, and the rear right channel signal including:
  • the arrival phase difference is the difference between the initial phase angles when the sounds from the channel respectively reach the two microphones, and the sound signals corresponding to the channel are the center channel signal, the left channel signal, and the right channel signal. Any of the rear left channel signal and the rear right channel signal.
  • separating the sound signal corresponding to the channel from the two sound signals according to the arrival phase difference corresponding to the channel including:
  • the same portion of the first filtered data and the second filtered data is extracted as a sound signal corresponding to the channel.
  • the subwoofer channel signal in the 5.1 channel is calculated according to the three-way sound signal, including:
  • the average sound signal is obtained by averaging the amplitudes of the three sound signals at the same time;
  • the method further includes:
  • the three-way sound signal is subjected to noise reduction processing.
  • a recording apparatus in which three microphones are provided, the apparatus comprising:
  • An acquisition module configured to acquire three sound signals collected by three microphones
  • a first calculating module configured to calculate a center channel signal, a left channel signal, a right channel signal, a rear left channel signal, and a rear right channel signal in the 5.1 channel according to the three channel sound signals;
  • a second calculating module which calculates a subwoofer channel signal in the 5.1 channel according to the three-way sound signal
  • the combination module combines the center channel signal, the left channel signal, the right channel signal, the rear left channel signal, the rear right channel signal, and the subwoofer channel signal to obtain a 5.1 channel sound signal.
  • the above three microphones include a first microphone in a 5.1 channel center channel direction, a 5.1 channel rear left channel direction second microphone, and a 5.1 channel rear right channel direction.
  • the third microphone; the first computing module includes:
  • a first submodule configured to use the first sound signal collected by the first microphone as a center channel signal
  • a second submodule configured to use the second sound signal collected by the second microphone as a rear left channel signal
  • a third submodule configured to use the third sound signal collected by the third microphone as a rear right channel signal
  • a first averaging sub-module configured to weight-average the amplitudes of the first sound signal and the second sound signal at the same time to obtain a fourth sound signal, and use the fourth sound signal as a left channel signal;
  • the second averaging sub-module is configured to weight-average the amplitudes of the first sound signal and the third sound signal at the same time to obtain a fifth sound signal, and use the fifth sound signal as a right channel signal.
  • the three microphones are dispersedly disposed with respect to the origin; each of the 5.1 channels has two microphones closest to each other in the three microphones; and the first calculating module includes:
  • a sub-module configured to acquire, for any of the 5.1 channels, two sound signals collected by two microphones closest to the channel;
  • a ion-dividing module configured to separate a sound signal corresponding to the channel from the two-way sound signal according to an arrival phase difference corresponding to the channel
  • the arrival phase difference is a phase difference corresponding to when the sound from the channel reaches the two microphones respectively, and the sound signal corresponding to the channel is a center channel signal, a left channel signal, a right channel signal, and a rear Any of the left channel signal and the rear right channel signal.
  • the separation sub-module includes:
  • a filtering submodule configured to filter the first sound data according to a phase difference corresponding to the channel to obtain first filtered data; and filter the second sound data according to a phase difference corresponding to the channel to obtain second filtered data ;
  • the extraction submodule is configured to extract the same portion of the first filtered data and the second filtered data as the sound signal corresponding to the channel.
  • the second computing module includes:
  • An averaging submodule configured to average the amplitudes of the three sound signals at the same time to obtain an average sound signal
  • the low pass filtering sub-module is configured to perform low pass filtering on the average sound signal to obtain a subwoofer channel signal.
  • the device further includes:
  • the noise reduction module is configured to perform noise reduction processing on the three sound signals.
  • a recording apparatus in which three microphones are provided, the apparatus comprising:
  • a memory for storing processor executable instructions
  • processor is configured to:
  • the bass channel signal combination gives a 5.1 channel sound signal.
  • the three channels of sound signals are collected by three microphones in the terminal, and the center channel signal, the left channel signal, the right channel signal, the rear left channel signal, and the rear right channel signal are established and calculated according to the three channel sound signals.
  • the subwoofer channel signal, and the 6 channel signals are composed into a 5.1 channel sound signal; solving the related art, the audio data recorded by the user can only be mono data or two channel data and cause recording. The problem of poor sound field range and presence is achieved; the user can also record 5.1 channel data without changing the hardware configuration of the terminal, thereby improving the sound quality of the recorded file.
  • FIG. 1A is a schematic diagram of a channel distribution of a 5.1 channel system according to various embodiments of the present disclosure
  • FIG. 1B is a schematic diagram of an implementation environment according to an exemplary embodiment of the present disclosure.
  • FIG. 1C is a schematic diagram of an implementation environment involved in another exemplary embodiment of the present disclosure.
  • FIG. 1D is a schematic diagram of an implementation environment involved in another exemplary embodiment of the present disclosure.
  • FIG. 2 is a flow chart showing a recording method according to an exemplary embodiment
  • FIG. 3 is a flowchart of a recording method according to another exemplary embodiment
  • FIG. 4 is a flowchart of a recording method according to still another exemplary embodiment
  • FIG. 5 is a block diagram of a recording apparatus according to an exemplary embodiment
  • FIG. 6 is a block diagram of a recording apparatus according to another exemplary embodiment
  • FIG. 7 is a block diagram of a recording apparatus according to still another exemplary embodiment.
  • FIG. 8 is a block diagram of an apparatus, according to an exemplary embodiment.
  • the 5.1 channel system may include: a center channel C, a left channel L, and a right sound. Lane R, rear left channel LS, rear right channel RS, and subwoofer channel LFE.
  • each channel with The center point where the user is located is the same distance and in the same plane.
  • the center channel C is directly in front of the user's facing direction.
  • the left channel L and the right channel R are respectively located on both sides of the center channel C, and are respectively placed at an angle of 30 degrees with the user's facing direction, and are symmetrically arranged.
  • the rear left channel LS and the rear right channel RS are respectively located on the opposite sides of the user facing direction, and are respectively arranged at an angle of 100-120 degrees with the user's facing direction, and are symmetrically arranged.
  • the placement position of the subwoofer channel LFE is not strictly required, and the angle between the user and the facing direction of the user is different, which causes the change of the bass signal in the 5.1 channel sound signal.
  • the user can adjust the placement position of the subwoofer channel LFE as needed.
  • the present disclosure does not limit the angle between the subwoofer channel LFE and the user's face orientation, only exemplarily identified in FIG. 1A.
  • each channel and the facing direction of the user in the 5.1 channel system is merely exemplary, and the distance between each channel and the user may be different.
  • the height of each channel can also be different, that is, the channel can also be not in a plane. The user can adjust it according to the needs. The difference in the position of each channel will cause different sound signals. limited.
  • FIG. 1B is a schematic diagram of a terminal according to various embodiments of the present disclosure.
  • the terminal 110 may include: a first microphone 120 , a second microphone 130 , and a third microphone 140 .
  • the terminal 110 may be a mobile terminal provided with three microphones such as a mobile phone, a tablet, or the like.
  • the first microphone 120, the second microphone 130, and the third microphone 140 are three microphones disposed in the terminal 110 for collecting sound signals.
  • the first microphone 120, the second microphone 130, and the third microphone 140 are arranged in two ways:
  • the first microphone 120 faces forward
  • the second microphone 130 faces the left and is at an angle of 100-120 degrees with the first microphone 120
  • the third microphone 140 faces the right and It is at an angle of 100-120 degrees with the first microphone 120. That is, the placement position of the first microphone 120 corresponds to the direction of the center channel in the 5.1 channel, the placement position of the second microphone 130 corresponds to the direction of the rear left channel, and the placement position of the third microphone 140 is rearward. Set the right channel direction.
  • FIG. 1D Another arrangement of the three microphones is shown in Figure 1D.
  • the three microphones are freely dispersed, and each of the 5.1 channel systems has the closest two microphones among the three microphones.
  • the closest to the center channel C is the first microphone 120 and the second microphone 130;
  • the closest to the left channel L is the first microphone 120 and the second microphone 130;
  • the closest to the channel R is the first microphone 120 and the third microphone 140;
  • the closest to the rear left channel LS is the second microphone 120 and the third microphone 140;
  • the closest to the rear right channel RS is the first A microphone 120 and a third microphone 140.
  • the positions of the three microphones may be other positions, and may be dispersed as much as possible. This embodiment does not limit this.
  • FIG. 2 is a flowchart of a recording method according to an exemplary embodiment. As shown in FIG. 2, the recording method is applied.
  • FIG. 1B and FIG. 1C referring to the 5.1 channel system shown in FIG. 1A, the following steps are included:
  • step 202 three sound signals acquired by three microphones are acquired.
  • the three sound signals collected by the three microphones are from the same sound source, and the distance between the three microphones and the sound source is different. Since the time when the sound reaches each microphone is different, in the embodiment of the present disclosure, it can be considered that the three sound signals collected by the three microphones at the same time have the same frequency but different amplitudes.
  • step 204 the center channel signal, the left channel signal, the right channel signal, the rear left channel signal, and the rear right channel signal in the 5.1 channel are calculated from the three channel sound signals.
  • step 206 the subwoofer channel signal in 5.1 channel is calculated from the three-way sound signal.
  • step 204 and step 206 are juxtaposed, and there is no specific order.
  • step 208 the center channel signal, the left channel signal, the right channel signal, the rear left channel signal, the rear right channel signal, and the subwoofer channel signal are combined to obtain a 5.1 channel sound signal.
  • the recording method collects three sound signals through three microphones in the terminal, and establishes and calculates a center channel signal, a left channel signal, and a right channel according to the three sound signals.
  • the signal, the rear left channel signal, the rear right channel signal and the subwoofer channel signal, and the 6 channel signals are combined into a 5.1 channel sound signal; the audio data recorded by the user in the related art is solved only.
  • step 204 can alternatively be implemented as the steps included in FIG. 331 ⁇ 335.
  • step 204 can alternatively be implemented as step 338, step 339a in FIG. Step 339b.
  • FIG. 3 is a flowchart of a recording method according to another exemplary embodiment. As shown in FIG. 3, the embodiment is illustrated by using the recording method in the first setting manner shown in FIG. 1B, and includes the following steps. :
  • step 310 three sound signals acquired by three microphones are acquired.
  • the terminal acquires three sound signals collected by three microphones respectively.
  • the sound signals collected by the first microphone, the second microphone, and the third microphone are respectively recorded as A_mic1, A_mic2, and A_mic3.
  • the sound signal obtained by the terminal is an analog signal, and after the terminal obtains the sound signal, the analog signal can be converted into a digital signal for subsequent processing, or the collected analog signal can be directly used for processing, which is not limited in this embodiment.
  • This embodiment is described by taking an example of converting the collected sound signal into a digital signal.
  • step 320 the three-way sound signal is subjected to noise reduction processing.
  • the terminal performs noise reduction processing on the acquired three-way sound, and the noise-reduced first microphone, second microphone, and third
  • the sound signals of the microphones are denoted as A_mic1', A_mic2' and A_mic3', respectively.
  • a method for noise reduction is based on the noise in the wavelet removal signal, and the collected first sound signal A_mic1 is decomposed into a multi-layer wavelet signal, and an appropriate threshold is selected to process the high frequency coefficients of each layer of the wavelet signal, and the processing is performed. After the signal is wavelet reconstructed, the output signal is A_mic1'. Both the second sound signal and the third sound signal can be noise-reduced using the method to obtain the noise-reduced sound signals as A_mic2' and A_mic3'.
  • this step is not necessary, and only to improve the quality of the sound signal, this step is optional.
  • step 331 the first sound signal collected by the first microphone is used as the center channel signal.
  • step 332 the second sound signal collected by the second microphone is used as a rear left channel signal.
  • step 333 the third sound signal collected by the third microphone is used as a rear right channel signal.
  • step 334 the amplitude of the first sound signal and the second sound signal at the same time is averaged to obtain a fourth sound signal, and the fourth sound signal is used as a left channel signal.
  • the fourth sound signal obtained by the terminal after the A_mic1' obtained by denoising the first sound signal and the A_mic2' obtained by denoising the second sound signal at the same time is used as the left channel signal, and is recorded as A_L' , that is, the left channel signal is A_L',
  • A_L' a1*A_mic1’+b1*A_mic2’
  • a1 is the weight of A_mic1'
  • b1 is the weight of A_mic2'.
  • the specific values of a1 and b1 may be preset according to the positions of the three microphones and the positions of the respective channels, or may be set by the user.
  • a1+b1 1 in the above possible value mode. In other possible value modes, a1+b1 may not
  • the setting manner of a1 and b1 and the specific value thereof are not limited in the embodiment of the present disclosure.
  • step 335 the amplitudes of the first sound signal and the third sound signal at the same time are weighted and averaged to obtain a fifth sound signal, and the fifth sound signal is used as a right channel signal.
  • the fifth sound signal obtained by the terminal after the A_mic1' obtained by denoising the first sound signal and the A_mic3' obtained by denoising the third sound signal at the same time is used as the right channel signal, and is recorded as A_R' , that is, the right channel signal is A_R',
  • A_R' a2*A_mic1’+b2*A_mic3’
  • a2 is the weight of A_mic1'
  • b2 is the weight of A_mic3'
  • the specific values of a2 and b2 may be According to the position of the three microphones and the position of each channel, it can also be set by the user.
  • the setting manner of a2 and b2 and the specific value thereof are not limited in the embodiment of the present disclosure.
  • the subwoofer channel signal in the 5.1 channel is calculated according to the three-way sound signal.
  • the implementation process of this step is:
  • step 341 the average sound signal is obtained by averaging the amplitudes of the three sound signals at the same time.
  • the terminal averages the amplitudes of A_mic1', A_mic2' and A_mic3' obtained by denoising the three-way sound signals at the same time to obtain an average sound signal, which is denoted as A_LFE, that is, the average sound signal is A_LFE,
  • A_LFE (A_mic1'+A_mic2'+A_mic3’)/3
  • step 342 the average sound signal is low pass filtered to obtain a subwoofer channel signal.
  • the terminal performs low-pass filtering on the average sound signal obtained in step 341 to obtain a subwoofer channel signal.
  • the cut-off frequency of the low-pass filter is optional. Generally, the cutoff frequency is set to a value between 80 Hz and 120 Hz, which is not limited in this embodiment.
  • step 341 and steps 331-335 are also juxtaposed, and there is no specific order.
  • step 350 the center channel signal, the left channel signal, the right channel signal, the rear left channel signal, the rear right channel signal, and the subwoofer channel signal are combined to obtain a 5.1 channel signal.
  • A_C' The central channel signal A_C', the left channel signal A_L', the right channel signal A_R', the rear left channel signal A_LS', the rear right channel A_RS', and the subwoofer channel signal obtained by the terminal through the above steps.
  • A_LFE' is combined to obtain a 5.1-channel signal, which is denoted as A_5.1ch.
  • A_5.1ch The optional combination is well understood by those skilled in the art, and will not be further described in this embodiment.
  • step 360 the combined 5.1 channel signal is saved in memory.
  • the terminal saves the combined 5.1 channel signal in the terminal's own memory or an external storage device.
  • the terminal When the terminal stores a 5.1 channel signal, it can be in a format such as uncompressed PCM or WAV.
  • the terminal may also adopt 5.1 channel support such as Dolby Digital (Dolby Digital Technology), AAC (Advanced Audio Coding), DTS (Digital Theatre System), 3D-Audio, etc. Compression format.
  • Dolby Digital Dolby Digital Technology
  • AAC Advanced Audio Coding
  • DTS Digital Theatre System
  • 3D-Audio etc. Compression format.
  • the method provided in this embodiment collects three sound signals through three microphones in the terminal, and establishes and calculates a center channel signal, a left channel signal, and a right channel signal according to the three sound signals.
  • the left channel signal, the rear right channel signal and the subwoofer channel signal are set, and the 6 channel signals are composed into 5.1 channel sound signals;
  • the audio data recorded by the user in the related art can only be single Channel data or two-channel data resulting in a sound field The problem of poor range and presence; the user can also record 5.1 channel data without changing the hardware configuration of the terminal, thereby greatly improving the sound quality of the recording and the user's listening experience.
  • the recording method provided by the embodiment can display three sound signals collected by three microphones into 5.1 channel data with less calculation amount by placing three microphones according to a predetermined position.
  • the user can record the effects of 5.1 channel data without changing the terminal hardware configuration and using less computation.
  • FIG. 4 is a flowchart of a recording method according to an embodiment of the present invention. As shown in FIG. 4, the recording method is applied to the second setting mode shown in FIG. 1D to illustrate, and includes the following steps. :
  • step 310 three sound signals acquired by three microphones are acquired.
  • the terminal acquires three sound signals collected by three microphones respectively.
  • the sound signals collected by the first microphone, the second microphone, and the third microphone are respectively recorded as A_mic1, A_mic2, and A_mic3.
  • the sound signal obtained by the terminal is an analog signal, and after the terminal obtains the sound signal, the analog signal can be converted into a digital signal for subsequent processing, or the collected analog signal can be directly used for processing, which is not limited in this embodiment.
  • This embodiment is described by taking an example of converting the collected sound signal into a digital signal.
  • step 320 the three-way sound signal is subjected to noise reduction processing.
  • the terminal performs noise reduction processing on the acquired three-way sound, and records the sound signals of the noise-reduced first microphone, the second microphone, and the third microphone as A_mic1', A_mic2', and A_mic3', respectively.
  • a method for noise reduction is based on the noise in the wavelet removal signal, and the collected first sound signal A_mic1 is decomposed into a multi-layer wavelet signal, and an appropriate threshold is selected to process the high frequency coefficients of each layer of the wavelet signal, and the processing is performed. After the signal is wavelet reconstructed, the output signal is A_mic1'. Both the second sound signal and the third sound signal can be noise-reduced using the method to obtain the noise-reduced sound signals as A_mic2' and A_mic3'.
  • this step is not necessary, and only to improve the quality of the sound signal, this step is optional.
  • step 338 for any of the 5.1 channels, two sound signals acquired by the two microphones closest to the channel are acquired.
  • the terminal acquires position information of three microphones relative to the origin.
  • the origin here refers to the center point 10 position of the 5.1 channel system, and the terminal establishes a coordinate system according to the origin.
  • a method for establishing a coordinate system is: taking a center point of the 5.1 channel system as an origin, a direction of the center point pointing to the center channel is a positive direction of the y axis, and a direction perpendicular to the y axis pointing to the right side is an x axis
  • the present embodiment is exemplified by this coordinate system in conjunction with FIG. 1A. This embodiment does not limit the method of establishing a coordinate system.
  • the terminal records the positions of the first microphone, the second microphone, and the third microphone in this coordinate system as P_mic1(x1, y1), P_mic2(x2, y2), and P_mic3(x3, y3), respectively.
  • the channels in the 5.1 channel system have different directions. As shown in Figure 1A, the center channel direction is the y-axis direction, the left channel direction is the y-axis positive direction 30 degrees to the left, and the right channel direction is the y-axis. Positive direction 30 degrees to the right, rear left channel direction The positive direction of the y-axis is 100-120 degrees to the left, and the direction of the rear right channel is the positive direction of the y-axis to the right 100-120 direction.
  • the terminal For any of the 5.1 channels, the terminal first acquires two sound signals collected by the two microphones closest to the channel, and then from the two sound signals according to the arrival phase difference corresponding to the channel. The sound signal corresponding to the channel is separated.
  • the center channel is taken as an example.
  • the two microphones closest to the center channel are the first microphone and the second microphone, and then the two microphones are acquired and noise-reduced.
  • the two sound signals are A_mic1' and A_mic2', respectively.
  • the terminal separates the sound signal corresponding to the channel from the two sound signals according to the arrival phase difference corresponding to the channel, and may include the following two sub-steps:
  • step 339a the first path sound signal is filtered according to the arrival phase difference corresponding to the channel to obtain the first path data; the two channel sound signals are obtained according to the arrival phase difference corresponding to the channel. The second sound signal is filtered to obtain second filtered data.
  • each microphone Since each microphone receives sound signals from all directions, the arrival phase of the sound signals in each direction reaching the three microphones is different.
  • the terminal can extract the sound signal from a certain channel according to the phase difference of arrival with each channel.
  • the first microphone and the second microphone are closest to the center channel, and the first sound signal is the first road sound signal, and the second sound signal is the second road sound signal. Since the distance between the center channel and the first microphone and the second microphone that are closest to it is different, the sound in the direction of the center channel arrives at the first microphone and arrives at the second microphone, and there is a fixed arrival phase difference, which will reach the phase. The difference is ⁇ .
  • the sound signals of the first road sound signal and the second road sound signal are divided into a plurality of sub-signals in the same manner, and each of the first road sound signals usually has another one belonging to the same time in the second road sound signal Sub signal. Then, the terminal compares the arrival phase difference between the pair of sub-signals belonging to the same time in the first road sound signal and the second road sound signal, and when the phase difference is ⁇ , it is considered to be a signal in the direction of the center channel. It is retained; when the phase difference is not ⁇ , it is considered that it is not a signal in the direction of the center channel, and it is filtered out. In this way, the first path sound signal is filtered to obtain first filter data, and the second path sound signal is filtered to obtain second filter data.
  • each audio frame can be used as a sub-signal according to the encoding protocol, but the manner of dividing each sub-signal is not limited in this embodiment.
  • the arrival phase difference corresponding to each channel is calculated by the terminal in advance based on the coordinate position of each microphone.
  • step 339b the same portion of the first filtered data and the second filtered data is extracted as a sound signal corresponding to the channel.
  • the terminal extracts the same portion of the obtained first filtered signal and the second filtered signal as a sound signal with the channel.
  • the channel here may be any one of a center channel signal, a left channel signal, a right channel signal, a rear left channel signal, and a rear right channel signal.
  • Every channel processing method can be To adopt a processing method similar to the center channel in the above example.
  • the terminal obtains the sound signal of each channel, the extracted sound signals of these channels are respectively recorded as the center channel signal A_C', the left channel signal A_L', the right channel signal A_R', and the rear left sound.
  • Channel signal A_LS' and post-right channel signal A_RS' are respectively recorded as the center channel signal A_C', the left channel signal A_L', the right channel signal A_R', and the rear left sound.
  • step 341 the average sound signal is obtained by averaging the amplitudes of the three sound signals at the same time.
  • the terminal averages the amplitudes of the noise-reduced first sound signal A_mic1', the second sound signal A_mic2', and the third sound signal A_mic3' at the same time to obtain an average sound signal, which is denoted as A_LFE, that is, the average sound signal is A_LFE,
  • A_LFE (A_mic1'+A_mic2'+A_mic3’)/3
  • step 342 the average sound signal is low pass filtered to obtain a subwoofer channel signal.
  • the terminal performs low-pass filtering on the average sound signal obtained in step 341 to obtain a subwoofer channel signal.
  • the cut-off frequency of the low-pass filter is optional. Generally, the cutoff frequency is set to a value between 80 Hz and 120 Hz, which is not limited in this embodiment.
  • step 341 and step 338 are juxtaposed, and there is no specific order.
  • step 350 the center channel signal, the left channel signal, the right channel signal, the rear left channel signal, the rear right channel signal, and the subwoofer channel signal are combined to obtain a 5.1 channel signal.
  • A_C' The central channel signal A_C', the left channel signal A_L', the right channel signal A_R', the rear left channel signal A_LS', the rear right channel A_RS', and the subwoofer channel signal obtained by the terminal through the above steps.
  • A_LFE' is combined to obtain a 5.1-channel signal, which is denoted as A_5.1ch.
  • A_5.1ch The optional combination is well understood by those skilled in the art, and will not be further described in this embodiment.
  • step 360 the combined 5.1 channel signal is saved in memory.
  • the terminal saves the combined 5.1 channel signal in the terminal's own memory or an external storage device.
  • the terminal When the terminal stores a 5.1 channel signal, it can be in a format such as uncompressed PCM or WAV.
  • the terminal may also adopt a 5.1-channel compression format such as Dolby Digital, AAC, DTS, 3D-Audio.
  • the recording method provided by the embodiment collects three sound signals through three microphones in the terminal, and establishes and calculates a center channel signal, a left channel signal, and a right channel signal according to the three sound signals.
  • the rear left channel signal, the rear right channel signal and the subwoofer channel signal, and the 6 channel signals are composed into a 5.1 channel sound signal;
  • the audio data recorded by the user in the related art can only be Mono-channel data or two-channel data leads to a problem of poor sound field range and presence; it is possible to record 5.1-channel data without changing the hardware configuration of the terminal, thereby improving the recording file. The effect of the sound quality.
  • the recording method provided by the embodiment provides three microphones freely set according to the actual space in the terminal by placing the three microphones in a free position, and then recording the three sound signals collected by the three microphones as The 5.1 channel data achieves the effect of recording 5.1 channel data without changing the hardware configuration of the terminal and not requiring a more demanding microphone placement position.
  • FIG. 5 is a block diagram of a recording apparatus according to an exemplary embodiment. As shown in FIG. 5, the recording apparatus is applied to the implementation environment shown in FIG. 1B, and relates to the 5.1 channel system shown in FIG. 1A, and the apparatus includes However, it is not limited to: the acquisition module 500, the first calculation module 520, the second calculation module 540, and the combination module 560.
  • the obtaining module 500 is configured to acquire three sound signals collected by three microphones.
  • the first calculating module 520 is configured to calculate a center channel signal, a left channel signal, a right channel signal, a rear left channel signal, and a rear right channel signal in the 5.1 channel according to the three channel sound signals. .
  • the second calculating module 540 calculates the subwoofer channel signal in the 5.1 channel according to the three-way sound signal.
  • the combining module 560 combines the center channel signal, the left channel signal, the right channel signal, the rear left channel signal, the rear right channel signal, and the subwoofer channel signal to obtain a 5.1 channel sound signal.
  • the recording method provided in the embodiment of the present disclosure collects three sound signals through three microphones in the terminal, and establishes and calculates a center channel signal, a left channel signal, and a right channel according to the three sound signals.
  • the signal, the rear left channel signal, the rear right channel signal and the subwoofer channel signal, and the 6 channel signals are composed into 5.1 channel sound signals; the audio data recorded by the user in the related art is solved only.
  • the sound quality of the file can be mono data or two-channel data, resulting in poor sound field range and presence; it is possible to record 5.1 channel data without changing the hardware configuration of the terminal, thus improving the recording.
  • the sound quality of the file can be mono data or two-channel data, resulting in poor sound field range and presence; it is possible to record 5.1 channel data without changing the hardware configuration of the terminal, thus improving the recording.
  • FIG. 6 is a block diagram of a recording apparatus according to another exemplary embodiment. As shown in FIG. 6, the embodiment is exemplified by the recording method applied to the first setting manner shown in FIG. 1B.
  • the method includes an acquisition module 500, a noise reduction module 510, a first calculation module 520, a second calculation module 540, a combination module 560, and a storage module 580.
  • the obtaining module 500 is configured to acquire three sound signals collected by three microphones.
  • the noise reduction module 510 is configured to perform noise reduction processing on the three-way sound signal.
  • the first calculating module 520 is configured to calculate a center channel signal, a left channel signal, a right channel signal, a rear left channel signal, and a rear right channel signal in the 5.1 channel according to the three channel sound signals. .
  • the first calculation module 520 specifically includes: a first sub-module 521, a second sub-module 522, a third sub-module 523, a first average sub-module 524, and a second average sub-module 525.
  • the first sub-module 521 is configured to use the first sound signal collected by the first microphone as a center channel signal.
  • the second sub-module 522 is configured to use the second sound signal collected by the second microphone as the rear left channel signal.
  • the third sub-module 523 is configured to use the third sound signal collected by the third microphone as the rear right channel signal.
  • a first averaging sub-module 524 configured to amplitude the first sound signal and the second sound signal at the same time A weighted average is obtained to obtain a fourth sound signal, and the fourth sound signal is used as a left channel signal.
  • the second averaging sub-module 525 is configured to weight-average the amplitudes of the first sound signal and the third sound signal at the same time to obtain a fifth sound signal, and use the fifth sound signal as a right channel signal.
  • the second calculating module 540 is configured to calculate the subwoofer channel signal in the 5.1 channel according to the three-way sound signal, and the second calculating module comprises: an average sub-module 541 and a low-pass filtering sub-module 542.
  • the averaging sub-module 541 is configured to average the amplitudes of the three-way sound signals at the same time to obtain an average sound signal.
  • the low pass filtering sub-module 542 is configured to perform low pass filtering on the average sound signal to obtain a subwoofer channel signal.
  • the combination module 560 is configured to combine the center channel signal, the left channel signal, the right channel signal, the rear left channel signal, the rear right channel signal, and the subwoofer channel signal to obtain a 5.1 channel sound signal. .
  • the storage module 580 is configured to save the combined 5.1 channel signal in a memory.
  • the apparatus collects three sound signals through three microphones in the terminal, and establishes and calculates a center channel signal, a left channel signal, and a right channel signal according to the three sound signals.
  • the left channel signal, the rear right channel signal and the subwoofer channel signal are set, and the 6 channel signals are composed into 5.1 channel sound signals;
  • the audio data recorded by the user in the related art can only be single
  • the channel data or the two-channel data causes the sound field range and the sense of presence of the recording to be poor; the user can also record the 5.1 channel data without changing the hardware configuration of the terminal, thereby improving the sound quality of the recorded file. Effect.
  • An exemplary embodiment of the present disclosure provides a recording apparatus for a mobile terminal provided with three microphones, capable of implementing the recording method provided by the present disclosure, the apparatus comprising: a processor, and a processor for storing processor executable instructions Memory
  • processor is configured to:
  • the center channel signal, the left channel signal, the right channel signal, the rear left channel signal, the rear right channel signal, and the subwoofer channel signal are combined to obtain a 5.1 channel sound signal.
  • the above three microphones include a first microphone in the center channel direction of the 5.1 channel, a second microphone in the rear left channel direction of the 5.1 channel, and a rear right channel in the 5.1 channel.
  • the processor is configured to:
  • the first sound signal collected by the first microphone is used as a center channel signal
  • the second sound signal collected by the second microphone is used as a rear left channel signal
  • the third sound signal collected by the third microphone is used as a rear right channel signal
  • the fifth sound signal is obtained by weighting and averaging the amplitudes of the first sound signal and the third sound signal at the same time, and the fifth sound signal is used as the right channel signal.
  • the processor is configured to:
  • the arrival phase difference is a phase difference corresponding to when the sound from the channel reaches the two microphones respectively, and the sound signal corresponding to the channel is a center channel signal, a left channel signal, a right channel signal, and a rear Any of the left channel signal and the rear right channel signal.
  • the processor is configured to:
  • the average sound signal is obtained by averaging the amplitudes of the three sound signals at the same time;
  • the processor is configured to:
  • the three-way sound signal is subjected to noise reduction processing.
  • FIG. 7 is a block diagram of a recording apparatus according to still another exemplary embodiment. As shown in FIG. 7, the embodiment is illustrated by the second recording mode shown in FIG. 1D.
  • the method includes an acquisition module 500, a noise reduction module 510, a first calculation module 520, a second calculation module 540, a combination module 560, and a storage module 580.
  • the obtaining module 500 is configured to acquire three sound signals collected by three microphones.
  • the noise reduction module 510 is configured to perform noise reduction processing on the three-way sound signal.
  • the first calculating module 520 is configured to calculate a center channel signal, a left channel signal, a right channel signal, a rear left channel signal, and a rear right channel signal in the 5.1 channel according to the three channel sound signals. .
  • the first calculation module 520 specifically includes: an acquisition submodule 528 and a separation submodule 529.
  • the acquisition sub-module 528 is configured to acquire, for any of the 5.1 channels, two sound signals acquired by two microphones closest to the channel.
  • the ion splitting module 529 is configured to be from the two sound signals according to the arrival phase difference corresponding to the channel The sound signal corresponding to the channel is separated.
  • the separation sub-module 529 further includes a first separation sub-module 529a and a filtering sub-module 529b.
  • the first separating sub-module 529a is configured to filter the first sound data according to a phase difference corresponding to the channel to obtain first filtered data; and filter the second sound data according to a phase difference corresponding to the channel. Two filtered data.
  • the extraction sub-module 529b is configured to extract the same portion of the first filtered data and the second filtered data as a sound signal corresponding to the channel.
  • the second calculating module 540 is configured to calculate the subwoofer channel signal in the 5.1 channel according to the three-way sound signal, and the second calculating module comprises: an average sub-module 541 and a low-pass filtering sub-module 542.
  • the averaging sub-module 541 is configured to average the amplitudes of the three-way sound signals at the same time to obtain an average sound signal.
  • the low pass filtering sub-module 542 is configured to perform low pass filtering on the average sound signal to obtain a subwoofer channel signal.
  • the combination module 560 is configured to combine the center channel signal, the left channel signal, the right channel signal, the rear left channel signal, the rear right channel signal, and the subwoofer channel signal to obtain a 5.1 channel sound signal. .
  • the storage module 580 is configured to save the combined 5.1 channel signal in a memory.
  • the apparatus collects three sound signals through three microphones in the terminal, and establishes and calculates a center channel signal, a left channel signal, and a right channel signal according to the three sound signals.
  • the left channel signal, the rear right channel signal and the subwoofer channel signal are set, and the 6 channel signals are composed into 5.1 channel sound signals;
  • the audio data recorded by the user in the related art can only be single
  • the channel data or the two-channel data causes the sound field range and the sense of presence of the recording to be poor; the user can also record the 5.1 channel data without changing the hardware configuration of the terminal, thereby improving the sound quality of the recorded file. Effect.
  • FIG. 8 is a block diagram of an apparatus 800 for recording, according to an exemplary embodiment.
  • device 800 can be a mobile phone, a computer, a digital broadcast terminal, a messaging device, a gaming console, a tablet device, a medical device, a fitness device, a personal digital assistant, and the like.
  • apparatus 800 can include one or more of the following components: processing component 802, memory 804, power component 806, multimedia component 808, audio component 810, input/output (I/O) interface 812, sensor component 814, and Communication component 816.
  • Processing component 802 typically controls the overall operation of device 800, such as operations associated with display, telephone calls, data communications, camera operations, and recording operations.
  • Processing component 802 can include one or more processors 818 to execute instructions to perform all or part of the steps described above.
  • processing component 802 can include one or more modules to facilitate interaction between component 802 and other components.
  • processing component 802 can include a multimedia module to facilitate interaction between multimedia component 808 and processing component 802.
  • Memory 804 is configured to store various types of data to support operation at device 800. Examples of these data Instructions including any application or method for operation on device 800, contact data, phone book data, messages, pictures, videos, and the like.
  • the memory 804 can be implemented by any type of volatile or non-volatile storage device, or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read only memory (EEPROM), erasable.
  • SRAM static random access memory
  • EEPROM electrically erasable programmable read only memory
  • EPROM Programmable Read Only Memory
  • PROM Programmable Read Only Memory
  • ROM Read Only Memory
  • Magnetic Memory Flash Memory
  • Disk Disk or Optical Disk.
  • Power component 806 provides power to various components of device 800.
  • Power component 806 can include a power management system, one or more power sources, and other components associated with generating, managing, and distributing power for device 800.
  • the multimedia component 808 includes a screen between the device 800 and the user that provides an output interface.
  • the screen can include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen can be implemented as a touch screen to receive input signals from the user.
  • the touch panel includes one or more touch sensors to sense touches, slides, and gestures on the touch panel. The touch sensor can sense not only the boundaries of the touch or sliding action, but also the duration and pressure associated with the touch or slide operation.
  • the multimedia component 808 includes a front camera and/or a rear camera. When the device 800 is in an operation mode, such as a shooting mode or a video mode, the front camera and/or the rear camera can receive external multimedia data. Each front and rear camera can be a fixed optical lens system or have focal length and optical zoom capabilities.
  • the audio component 810 is configured to output and/or input an audio signal.
  • the audio component 810 includes a microphone (MIC) that is configured to receive an external audio signal when the device 800 is in an operational mode, such as a call mode, a recording mode, and a voice recognition mode.
  • the received audio signal may be further stored in memory 804 or transmitted via communication component 816.
  • the audio component 810 also includes a speaker for outputting an audio signal.
  • the I/O interface 812 provides an interface between the processing component 802 and the peripheral interface module, which may be a keyboard, a click wheel, a button, or the like. These buttons may include, but are not limited to, a home button, a volume button, a start button, and a lock button.
  • Sensor assembly 814 includes one or more sensors for providing device 800 with a status assessment of various aspects.
  • sensor assembly 814 can detect an open/closed state of device 800, a relative positioning of components, such as a display and a keypad of device 800, and sensor component 814 can also detect a change in position of a component of device 800 or device 800, the user The presence or absence of contact with device 800, device 800 orientation or acceleration/deceleration and temperature variation of device 800.
  • Sensor assembly 814 can include a proximity sensor configured to detect the presence of nearby objects without any physical contact.
  • Sensor assembly 814 may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications.
  • the sensor assembly 814 can also include an acceleration sensor, a gyro sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
  • Communication component 816 is configured to facilitate wired or wireless communication between device 800 and other devices.
  • the device 800 can access a wireless network based on a communication standard, such as Wi-Fi, 2G or 3G, or a combination thereof.
  • communication component 816 receives broadcast signals or broadcast associated information from an external broadcast management system via a broadcast channel.
  • communication component 816 also includes a near field communication (NFC) module to facilitate Short-range communication.
  • NFC near field communication
  • the NFC module can be implemented based on radio frequency identification (RFID) technology, infrared data association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.
  • RFID radio frequency identification
  • IrDA infrared data association
  • UWB ultra-wideband
  • Bluetooth Bluetooth
  • device 800 may be implemented by one or more application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable A gate array (FPGA), controller, microcontroller, microprocessor or other electronic component implementation for performing the above recording method.
  • ASICs application specific integrated circuits
  • DSPs digital signal processors
  • DSPDs digital signal processing devices
  • PLDs programmable logic devices
  • FPGA field programmable A gate array
  • controller microcontroller, microprocessor or other electronic component implementation for performing the above recording method.
  • non-transitory computer readable storage medium comprising instructions, such as a memory 804 comprising instructions executable by processor 818 of apparatus 800 to perform the recording method described above.
  • the non-transitory computer readable storage medium can be a ROM, a random access memory (RAM), a CD-ROM, a magnetic tape, a floppy disk, and an optical data storage device.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • Otolaryngology (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Stereophonic Arrangements (AREA)
  • Stereophonic System (AREA)
  • Obtaining Desirable Characteristics In Audible-Bandwidth Transducers (AREA)

Abstract

一种录音方法及装置,属于多媒体处理领域。所述方法用于设备有三个麦克风的移动终端中,所述方法包括:获取三个麦克风(120,130,140)采集的三路声音信号(202);根据三路声音信号计算得到5.1声道中的中央声道信号(C)、左声道(L)信号、右声道(R)信号、后置左声道(LS)信号和后置右声道(RS)信号(204);根据三路声音信号计算得出5.1声道中的重低音声道(LFE)信号(206);将中央声道信号(C)、左声道(L)信号、右声道(R)信号、后置左声道(LS)信号、后置右声道(RS)信号和重低音声道(LFE)信号组合得到5.1声道的声音信号(208)。同时,解决了相关技术中用户录制的音频数据只能是单声道数据或双声道数据而导致录音的声场范围和临场感较差的问题;达到了在不改变终端的硬件配置的情况下,用户也能录制5.1声道数据,从而提高了录音文件的音质的效果。

Description

录音方法及装置
本申请基于申请号为201510719339.1、申请日为2015年10月29日的中国专利申请提出,并要求该中国专利申请的优先权,该中国专利申请的全部内容在此引入本申请作为参考。
技术领域
本公开涉及多媒体处理领域,特别涉及一种录音方法及装置。
背景技术
诸如智能手机、平板电脑或掌上电脑之类的移动终端上都设置有麦克风。用户通过麦克风可以进行录音。
由于移动终端上通常只设置有1-3个麦克风,所以录制的音频数据是单声道数据或双声道数据,所录制的音频数据的声场范围和临场感都较差。
发明内容
为了解决由于硬件限制只能录制单声道或双声道的音频数据,而导致声场范围和临场感较差的问题,本公开提供一种录音方法及装置。该技术方案如下:
根据本公开实施例的第一方面,提供一种录音方法,用于设置有三个麦克风的移动终端中,该方法包括:
获取三个麦克风采集的三路声音信号;
根据该三路声音信号计算得到5.1声道中的中央声道信号、左声道信号、右声道信号、后置左声道信号和后置右声道信号;
根据三路声音信号计算得出5.1声道中的重低音声道信号;
将中央声道信号、左声道信号、右声道信号、后置左声道信号、后置右声道信号和重低音声道信号组合得到5.1声道的声音信号。
可选的,上述三个麦克风包括位于5.1声道的中央声道方向的第一麦克风、位于5.1声道的后置左声道方向的第二麦克风和位于5.1声道的后置右声道方向的第三麦克风;则根据该三路声音信号计算得到5.1声道中的中央声道信号、左声道信号、右声道信号、后置左声道信号和后置右声道信号,包括:
将第一麦克风采集到的第一声音信号作为中央声道信号;
将第二麦克风采集到的第二声音信号作为后置左声道信号;
将第三麦克风采集到的第三声音信号作为后置右声道信号;
将第一声音信号和第二声音信号在相同时刻上的幅度进行加权平均后得到第四声音 信号,将第四声音信号作为左声道信号;
将第一声音信号和第三声音信号在相同时刻上的幅度进行加权平均后得到第五声音信号,将第五声音信号作为右声道信号。
可选的,上述三个麦克风相对于原点分散设置;5.1声道中的每个声道在该三个麦克风中都存在距离最近的两个麦克风;则根据该三路声音信号计算得到5.1声道中的中央声道信号、左声道信号、右声道信号、后置左声道信号和后置右声道信号,包括:
对于5.1声道中的任一声道,获取与该声道最近的两个麦克风所采集的两路声音信号;
根据与该声道对应的到达相位差,从上述两路声音信号中分离出该声道对应的声音信号;
其中,到达相位差是来自该声道的声音分别抵达上述两个麦克风时所对应的初相角之差,该声道对应的声音信号是中央声道信号、左声道信号、右声道信号、后置左声道信号和后置右声道信号中的任意一种。
可选的,根据与该声道对应的到达相位差,从上述两路声音信号中分离出该声道对应的声音信号,包括:
根据与该声道对应的相位差对第一声音数据进行滤波得到第一滤波数据;根据与该声道对应的相位差对第二声音数据进行滤波得到第二滤波数据;
将第一滤波数据和第二滤波数据中的相同部分提取为该声道对应的声音信号。
可选的,根据三路声音信号计算得出5.1声道中的重低音声道信号,包括:
将三路声音信号在相同时刻上的幅度进行平均后得到平均声音信号;
将平均声音信号进行低通滤波后,得到重低音声道信号。
可选的,该方法还包括:
对该三路声音信号进行降噪处理。
根据本公开实施例的第二方面,提供一种录音装置,该装置中设置有三个麦克风,该装置包括:
获取模块,被配置为获取三个麦克风采集的三路声音信号;
第一计算模块,被配置为根据该三路声音信号计算得到5.1声道中的中央声道信号、左声道信号、右声道信号、后置左声道信号和后置右声道信号;
第二计算模块,根据三路声音信号计算得出5.1声道中的重低音声道信号;
组合模块,将中央声道信号、左声道信号、右声道信号、后置左声道信号、后置右声道信号和重低音声道信号组合得到5.1声道的声音信号。
可选的,上述三个麦克风包括位于5.1声道的中央声道方向的第一麦克风、位于5.1声道的后置左声道方向的第二麦克风和位于5.1声道的后置右声道方向的第三麦克风;则上述第一计算模块包括:
第一子模块,被配置为将第一麦克风采集到的第一声音信号作为中央声道信号;
第二子模块,被配置为将第二麦克风采集到的第二声音信号作为后置左声道信号;
第三子模块,被配置为将第三麦克风采集到的第三声音信号作为后置右声道信号;
第一平均子模块,被配置为将第一声音信号和第二声音信号在相同时刻上的幅度进行加权平均后得到第四声音信号,将第四声音信号作为左声道信号;
第二平均子模块,被配置为将第一声音信号和第三声音信号在相同时刻上的幅度进行加权平均后得到第五声音信号,将第五声音信号作为右声道信号。
可选的,上述三个麦克风相对于原点分散设置;5.1声道中的每个声道在该三个麦克风中都存在距离最近的两个麦克风;则上述第一计算模块包括:
获取子模块,被配置为对于5.1声道中的任一声道,获取与该声道最近的两个麦克风所采集的两路声音信号;
分离子模块,被配置为根据与该声道对应的到达相位差,从上述两路声音信号中分离出该声道对应的声音信号;
其中,到达相位差是来自该声道的声音分别抵达上述两个麦克风时所对应的相位差,该声道对应的声音信号是中央声道信号、左声道信号、右声道信号、后置左声道信号和后置右声道信号中的任意一种。
可选的,分离子模块包括:
滤波子模块,被配置为根据与该声道对应的相位差对第一声音数据进行滤波得到第一滤波数据;根据与该声道对应的相位差对第二声音数据进行滤波得到第二滤波数据;
提取子模块,被配置为将第一滤波数据和第二滤波数据中的相同部分提取为该声道对应的声音信号。
可选的,第二计算模块包括:
平均子模块,被配置为将三路声音信号在相同时刻上的幅度进行平均后得到平均声音信号;
低通滤波子模块,被配置为将平均声音信号进行低通滤波后,得到重低音声道信号。
可选的,该装置还包括:
降噪模块,被配置为对该三路声音信号进行降噪处理。
根据本公开实施例的第三方面,提供一种录音装置,该装置中设置有三个麦克风,该装置包括:
处理器;
用于存储处理器可执行指令的存储器;
其中,该处理器被配置为:
获取三个麦克风采集的三路声音信号;
根据该三路声音信号计算得到5.1声道中的中央声道信号、左声道信号、右声道信号、后置左声道信号和后置右声道信号;
根据三路声音信号计算得出5.1声道中的重低音声道信号;
将中央声道信号、左声道信号、右声道信号、后置左声道信号、后置右声道信号和重 低音声道信号组合得到5.1声道的声音信号。
本公开的实施例提供的技术方案可以包括以下有益效果:
通过终端中的三个麦克风采集三路声音信号,根据这三路声音信号建立并计算中央声道信号、左声道信号、右声道信号、后置左声道信号、后置右声道信号和重低音声道信号,并将这6个声道信号组成为5.1声道的声音信号;解决了相关技术中用户录制的音频数据只能是单声道数据或双声道数据而导致录音的声场范围和临场感较差的问题;达到了在不改变终端的硬件配置的情况下,用户也能录制5.1声道数据,从而提高了录音文件的音质的效果。
应当理解的是,以上的一般描述和后文的细节描述仅是示例性的,并不能限制本公开。
附图说明
此处的附图被并入说明书中并构成本说明书的一部分,示出了符合本公开的实施例,并于说明书一起用于解释本公开的原理。
图1A是本公开各个实施例所涉及的一种5.1声道系统的声道分布的示意图;
图1B是本公开一示例性实施例所涉及的一种实施环境的示意图;
图1C是本公开另一示例性实施例所涉及的一种实施环境的示意图;
图1D是本公开另一示例性实施例所涉及的一种实施环境的示意图;
图2是根据一示例性实施例示出的一种录音方法的流程图;
图3是根据另一示例性实施例示出的一种录音方法的流程图;
图4是根据又一示例性实施例示出的一种录音方法的流程图;
图5是根据一示例性实施例示出的一种录音装置的框图;
图6是根据另一示例性实施例示出的一种录音装置的框图;
图7是根据又一示例性实施例示出的一种录音装置的框图;
图8是根据一示例性实施例示出的一种装置的框图。
具体实施方式
这里将详细地对示例性实施例进行说明,其示例表示在附图中。下面的描述涉及附图时,除非另有表示,不同附图中的相同数字表示相同或相似的要素。以下示例性实施例中所描述的实施方式并不代表与本公开相一致的所有实施方式。相反,它们仅是与如所附权利要求书中所详述的、本公开的一些方面相一致的装置和方法的例子。
图1A是本公开各个实施例所涉及的一种5.1声道系统的声道分布的示意图,如图1A所示,该5.1声道系统可以包括:中央声道C、左声道L、右声道R、后置左声道LS、后置右声道RS和重低音声道LFE。
假设用户的位置处于图1A中的中心点10并朝向中央声道C所在位置,每个声道与 用户所在的中心点的距离相等且处于同一平面。
中央声道C处于用户的面对方向的正前方。
左声道L和右声道R分别处于中央声道C的两侧,分别与用户的面对方向呈30度夹角,呈对称设置。
后置左声道LS和后置右声道RS分别处于用户面对方向的两侧靠后,分别与用户的面对方向呈100-120度夹角,呈对称设置。
由于重低音的方向感较弱,重低音声道LFE的摆放位置没有严格要求,其与用户的面对方向之间呈现的角度不同,会引起5.1声道的声音信号中低音信号的变化,用户可以根据需要调整重低音声道LFE的摆放位置。本公开不对重低音声道LFE与用户的面对方向的夹角作出限定,仅示例性的在图1A中进行标识。
需要说明的一点:本公开实施例所涉及的5.1声道系统中每个声道与用户的面对方向的夹角仅是示例性的,另外,每个声道与用户之间的距离可以不同,每个声道所在的高度也可以不同,即声道也可以不处于一个平面,用户可以根据需要自行调整,每个声道摆放位置的不同都会引起声音信号的不同,本公开对此不作限定。
图1B是本公开各个实施例所涉及的一种终端的示意图,如图1B所示,该终端110可以包括:第一麦克风120、第二麦克风130和第三麦克风140。
终端110可以是诸如手机、平板电脑之类的设置有三个麦克风的移动终端。
第一麦克风120、第二麦克风130、第三麦克风140是终端110中设置的三个麦克风,用于采集声音信号。
可选地,第一麦克风120、第二麦克风130和第三麦克风140的设置方式有两种:
三个麦克风的一种设置方式如图1C所示,第一麦克风120朝向前方,第二麦克风130朝向左方且与第一麦克风120呈100-120度夹角,第三麦克风140朝向右方且与第一麦克风120呈100-120度夹角。也即,第一麦克风120的摆放位置与5.1声道中的中央声道方向对应,第二麦克风130的摆放位置与后置左声道方向对应,第三麦克风140的摆放位置与后置右声道方向对应。
三个麦克风的另一种设置方式如图1D所示,三个麦克风自由的分散设置,则5.1声道系统中的每个声道都在这三个麦克风中存在距离最近的两个麦克风。以图1D所示为例进行说明,与中央声道C距离最近的是第一麦克风120和第二麦克风130;与左声道L距离最近的是第一麦克风120和第二麦克风130;与右声道R距离最近的是第一麦克风120和第三麦克风140;与后置左声道LS距离最近的是第二麦克风120和第三麦克风140;与后置右声道RS距离最近的是第一麦克风120和第三麦克风140。当然,三个麦克风的位置还可以是其它位置,尽量分散即可,本实施例对此不作限定。
图2是根据一示例性实施例示出的录音方法的流程图,如图2所示,该录音方法应用 于图1B和图1C所示的实施环境中,涉及图1A所示的5.1声道系统,包括以下步骤:
在步骤202中,获取三个麦克风采集的三路声音信号。
通常,三个麦克风采集的三路声音信号来自同一个声源,且三个麦克风与声源之间的距离不同。由于声音到达每个麦克风的时间不同,在本公开实施例中,可以认为三个麦克风在同一时刻采集到的三路声音信号的频率相同但幅度不同。
在步骤204中,根据三路声音信号计算得到5.1声道中的中央声道信号、左声道信号、右声道信号、后置左声道信号和后置右声道信号。
在步骤206中,根据三路声音信号计算得出5.1声道中的重低音声道信号。
需要说明的是,步骤204与步骤206是并列的,没有特定的先后顺序之分。
在步骤208中,将中央声道信号、左声道信号、右声道信号、后置左声道信号、后置右声道信号和重低音声道信号组合得到5.1声道的声音信号。
综上所述,本公开实施例中提供的录音方法,通过终端中的三个麦克风采集三路声音信号,根据这三路声音信号建立并计算中央声道信号、左声道信号、右声道信号、后置左声道信号、后置右声道信号和重低音声道信号,并将这6个声道信号组合成为5.1声道的声音信号;解决了相关技术中用户录制的音频数据只能是单声道数据或双声道数据而导致录音的声场范围和临场感较差的问题;达到了在不改变终端硬件配置的情况下,用户也能录制5.1声道数据,从而大大提高了录音的音质和用户的收听体验的效果。
由于终端110中的三个麦克风的设置方式有两种,对应于每种设置方式,上述步骤204中的计算声道信号的具体实现方式也不同。
对应于图1B所示的第一种设置方式,即三个麦克风与5.1声道系统对应,具体实现方式如图3的流程图所示,上述步骤204可替代地实现成为图3中的包括步骤331~335。
对应于图1D所示的第二种设置方式,即三个麦克风自由放置,具体实现方式如图4的流程图所示,上述步骤204可替代地实现成为图4中的步骤338、步骤339a和步骤339b。
图3是根据另一示例性实施例示出的录音方法的流程图,如图3所示,本实施例以该录音方法应用于图1B示出的第一种设置方式来举例说明,包括以下步骤:
在步骤310中,获取三个麦克风采集的三路声音信号。
终端获取三个麦克风分别采集到的三路声音信号。在本实施例中,将第一麦克风、第二麦克风和第三麦克风采集的声音信号分别记为A_mic1、A_mic2和A_mic3。
终端获取的声音信号为模拟信号,终端获取到声音信号后,可以将模拟信号转换成数字信号进行后续处理,也可以直接使用采集到的模拟信号进行处理,本实施例对此不作限定。本实施例以将采集到的声音信号转换成数字信号为例进行说明。
在步骤320中,对三路声音信号进行降噪处理。
终端对获取到的三路声音进行降噪处理,将降噪后的第一麦克风、第二麦克风和第三 麦克风的声音信号分别记为A_mic1’、A_mic2’和A_mic3’。
一种降噪的方法是基于小波去除信号中的噪声,将采集到的第一声音信号A_mic1做多层小波信号分解,选取合适的阈值对每一层小波信号的高频系数进行处理,对处理后的信号做小波重建,输出信号为A_mic1’。第二声音信号和第三声音信号都可以使用该方法进行降噪获得降噪后的声音信号为A_mic2’和A_mic3’。
本领域的技术人员可以理解的是,此步骤进行的降噪处理不是必须的,仅为了提高声音信号的质量,此步骤是可选的。另外,降低噪声的方法有很多种,可以经过各种信号处理的方法将三路声音信号中的噪声滤除,本实施例对此不作限定。
在步骤331中,将第一麦克风采集到的第一声音信号作为中央声道信号。
终端将第一麦克风采集到的第一声音信号进行降噪后得到的A_mic1’作为中央声道信号,记为A_C’,即中央声道信号为A_C’,A_C’=A_mic1’。
在步骤332中,将第二麦克风采集到的第二声音信号作为后置左声道信号。
终端将第二麦克风采集到的第二声音信号进行降噪后得到的A_mic2’作为后置左声道信号,记为A_LS’,即后置左声道信号为A_LS’,A_LS’=A_mic2’。
在步骤333中,将第三麦克风采集到的第三声音信号作为后置右声道信号。
终端将第三麦克风采集到的第三声音信号进行降噪后得到的A_mic3’作为后置右声道信号,记为A_RS’,即后置右声道信号为A_RS’,A_RS’=A_mic3’。
在步骤334中,将第一声音信号和第二声音信号在相同时刻上的幅度进行加权平均后得到第四声音信号,将第四声音信号作为左声道信号。
终端将第一声音信号降噪后得到的A_mic1’和第二声音信号降噪后得到的A_mic2’在相同时刻上的幅度加权平均后得到的第四声音信号作为左声道信号,记为A_L’,即左声道信号为A_L’,
A_L’=a1*A_mic1’+b1*A_mic2’
其中,a1为A_mic1’的权重,b1为A_mic2’的权重,a1和b1的具体取值可以是根据三个麦克风的位置和各声道的位置预先设定的,也可以由用户进行设定,一种可能的取值方式为:a1=0.375,b1=0.625,需要说明的是,上述可能的取值方式中a1+b1=1,在其他可能的取值方式中,a1+b1也可以不为1,本公开实施例对a1、b1的设定方式以及其具体取值不作限定。
在步骤335中,将第一声音信号和第三声音信号在相同时刻上的幅度进行加权平均后得到第五声音信号,将第五声音信号作为右声道信号。
终端将第一声音信号降噪后得到的A_mic1’和第三声音信号降噪后得到的A_mic3’在相同时刻上的幅度加权平均后得到的第五声音信号作为右声道信号,记为A_R’,即右声道信号为A_R’,
A_R’=a2*A_mic1’+b2*A_mic3’
其中,a2为A_mic1’的权重,b2为A_mic3’的权重,a2和b2的具体取值可以是 根据三个麦克风的位置和各声道的位置预先设定,也可以由用户进行设定,一种可能的取值方式为:a2=0.375,b2=0.625,需要说明的是,上述可能的取值方式中a2+b2=1,在其他可能的取值方式中,a2+b2也可以不为1,本公开实施例对a2、b2的设定方式以及其具体取值不作限定。
需要说明的是,上述步骤331~335是并列的,没有特定的先后顺序之分。
根据三路声音信号计算得到5.1声道中的重低音声道信号,可选地,该步骤的实现过程为:
在步骤341中,将三路声音信号在相同时刻上的幅度进行平均后得到平均声音信号。
终端将三路声音信号降噪后得到的A_mic1’、A_mic2’和A_mic3’在相同时刻上的幅度平均后得到平均声音信号,记为A_LFE,即平均声音信号为A_LFE,
A_LFE=(A_mic1’+A_mic2’+A_mic3’)/3
在步骤342中,将平均声音信号进行低通滤波后,得到重低音声道信号。
终端将在步骤341中得到的平均声音信号进行低通滤波,得到重低音声道信号。低通滤波器的截止频率是可选的,一般将截止频率设为80Hz至120Hz之间的值,本实施例对此不作限定。
将低通滤波后得到的重低音声道信号记为A_LFE’,即重低音声道信号为A_LFE’,A_LFE’=LPASS(A_LFE)。
其中,y=LPASS(x)函数表示信号y是信号x经过低通滤波器后得到的信号。
需要说明的是,步骤341与步骤331~335也是并列的,没有特定的先后顺序之分。
在步骤350中,将中央声道信号、左声道信号、右声道信号、后置左声道信号、后置右声道信号和重低音声道信号组合得到5.1声道信号。
终端将通过上述步骤得到的中央声道信号A_C’、左声道信号A_L’、右声道信号A_R’、后置左声道信号A_LS’、后置右声道A_RS’和重低音声道信号A_LFE’组合得到5.1声道信号,记为A_5.1ch,可选的组合方式是本领域技术人员都能理解的,本实施例对此不再赘述。
在步骤360中,将组合得到的5.1声道信号保存在存储器中。
终端将组合得到的5.1声道信号保存在终端自身的存储器中或者外接的存储设备中。
终端存储5.1声道信号时,可以采用非压缩的PCM或WAV等格式。
可选地,终端也可以采用诸如DolbyDigital(杜比数字技术)、AAC(Advanced Audio Coding,高级音频编码)、DTS(Digital Theatre System,数字化影院系统)、3D-Audio之类的支持5.1声道的压缩格式。
综上所述,本实施例提供的方法,通过终端中的三个麦克风采集三路声音信号,根据这三路声音信号建立并计算中央声道信号、左声道信号、右声道信号、后置左声道信号、后置右声道信号和重低音声道信号,并将这6个声道信号组成为5.1声道的声音信号;解决了相关技术中用户录制的音频数据只能是单声道数据或双声道数据而导致录音的声场 范围和临场感较差的问题;达到了在不改变终端硬件配置的情况下,用户也能录制5.1声道数据,从而大大提高了录音的音质和用户的收听体验的效果。
本实施例提供的录音方法,通过将三个麦克风按照预定位置进行摆放,从而能够以较少的计算量就能够将三个麦克风采集到的三路声音信号录制为5.1声道数据,达到了在不改变终端硬件配置且使用较少的计算量,用户就能够录制5.1声道数据的效果。
图4是根据又一实施例实施例示出的录音方法的流程图,如图4所示,本实施例以该录音方法应用于图1D示出的第二种设置方式来举例说明,包括以下步骤:
在步骤310中,获取三个麦克风采集的三路声音信号。
终端获取三个麦克风分别采集到的三路声音信号。在本实施例中,将第一麦克风、第二麦克风和第三麦克风采集的声音信号分别记为A_mic1、A_mic2和A_mic3。
终端获取的声音信号为模拟信号,终端获取到声音信号后,可以将模拟信号转换成数字信号进行后续处理,也可以直接使用采集到的模拟信号进行处理,本实施例对此不作限定。本实施例以将采集到的声音信号转换成数字信号为例进行说明。
在步骤320中,对三路声音信号进行降噪处理。
终端对获取到的三路声音进行降噪处理,将降噪后的第一麦克风、第二麦克风和第三麦克风的声音信号分别记为A_mic1’、A_mic2’和A_mic3’。
一种降噪的方法是基于小波去除信号中的噪声,将采集到的第一声音信号A_mic1做多层小波信号分解,选取合适的阈值对每一层小波信号的高频系数进行处理,对处理后的信号做小波重建,输出信号为A_mic1’。第二声音信号和第三声音信号都可以使用该方法进行降噪获得降噪后的声音信号为A_mic2’和A_mic3’。
本领域的技术人员可以理解的是,此步骤进行的降噪处理不是必须的,仅为了提高声音信号的质量,此步骤是可选的。另外,降低噪声的方法有很多种,可以经过各种信号处理的方法将三路声音信号中的噪声滤除,本实施例对此不作限定。
在步骤338中,对于5.1声道中的任一声道,获取与该声道最近的两个麦克风所采集的两路声音信号。
终端获取三个麦克风相对原点的位置信息。这里的原点指的是5.1声道系统的中心点10位置,终端根据该原点建立坐标系统。
可选地,一种建立坐标系统的方法为:以5.1声道系统的中心点为原点,中心点指向中央声道的方向为y轴正方向,与y轴垂直指向右侧的方向为x轴正方向,本实施例以此坐标系统结合图1A进行举例说明。本实施例对建立坐标系统的方法不作限定。
终端将第一麦克风、第二麦克风和第三麦克风在这个坐标系统中的位置分别记为P_mic1(x1,y1)、P_mic2(x2,y2)和P_mic3(x3,y3)。
5.1声道系统中的声道都有不同的方向,如图1A所示,中央声道方向为y轴方向,左声道方向为y轴正方向偏左30度,右声道方向为y轴正方向偏右30度,后置左声道方向 为y轴正方向偏左100-120度,后置右声道方向为y轴正方向偏右100-120方向。
对于5.1声道中的任一声道,终端先获取与该声道最近的两个麦克风所采集的两路声音信号,然后再根据与该声道对应的到达相位差,从两路声音信号中分离出与该声道对应的声音信号。
本实施例以中央声道为例进行说明,如图1D所示,与中央声道距离最近的两个麦克风为第一麦克风和第二麦克风,则获取这两个麦克风采集并降噪后得到的两路声音信号分别为A_mic1’和A_mic2’。
可选的,终端根据与该声道对应的到达相位差,从两路声音信号中分离出与该声道对应的声音信号,可以包括以下两个子步骤:
在步骤339a中,根据与该声道对应的到达相位差对两路声音信号中的第一路声音信号进行滤波得到第一滤波数据;根据与该声道对应的到达相位差对两路声音信号中的第二路声音信号进行滤波得到第二滤波数据。
由于每个麦克风都会接收到来自各个方向的声音信号,但是每个方向上的声音信号到达三个麦克风上的到达相位是不同。终端可以根据与每个声道的到达相位差来提取来自某一个声道上的声音信号。
以中央声道为例,与中央声道距离最近的是第一麦克风和第二麦克风,则第一声音信号为上述第一路声音信号,第二声音信号为上述第二路声音信号。由于中央声道与距离它最近的第一麦克风和第二麦克风之间的距离不同,所以中央声道方向上的声音抵达第一麦克风和抵达第二麦克风时存在固定的到达相位差,将到达相位差记为Δ。
将第一路声音信号和第二路声音信号的声音信号以相同的方式划分为多个子信号,第一路声音信号中的每个子信号通常在第二路声音信号中存在属于相同时刻的另一个子信号。然后,终端比较第一路声音信号和第二路声音信号中属于相同时刻的一对子信号之间的到达相位差,当到达相位差为Δ时,认为是属于中央声道方向上的信号,将其保留;当到达相位差不为Δ时,认为不是属于中央声道方向上的信号,将其滤除。通过这种方法,对第一路声音信号进行滤波得到第一滤波数据,对第二路声音信号进行滤波得到第二滤波数据。
终端将声音信号划分为多个子信号时,可以按照编码协议将每个音频帧作为一个子信号,但本实施例对每个子信号的划分方式不作限定。
另外,每个声道对应的到达相位差由终端预先根据每个麦克风的坐标位置计算得到。
在步骤339b中,将第一滤波数据和第二滤波数据中的相同部分提取为该声道对应的声音信号。
终端将得到的第一滤波信号和第二滤波信号中的相同部分提取为与该声道的声音信号。
本领域技术人员可以理解的是,这里的声道可以是中央声道信号、左声道信号、右声道信号、后置左声道信号和后置右声道信号中的任意一种。每一个声道的处理方法,都可 以采用类似于上述举例中对中央声道的处理方法。在终端获得每个声道的声音信号后,将提取出来的这些声道的声音信号分别记为中央声道信号A_C’、左声道信号A_L’、右声道信号A_R’、后置左声道信号A_LS’和后置右声道信号A_RS’。
在步骤341中,将三路声音信号在相同时刻上的幅度进行平均后得到平均声音信号。
终端将降噪后的第一声音信号A_mic1’、第二声音信号A_mic2’和第三声音信号A_mic3’在相同时刻上的幅度平均后得到平均声音信号,记为A_LFE,即平均声音信号为A_LFE,
A_LFE=(A_mic1’+A_mic2’+A_mic3’)/3
在步骤342中,将平均声音信号进行低通滤波后,得到重低音声道信号。
终端将在步骤341中得到的平均声音信号进行低通滤波,得到重低音声道信号。
低通滤波器的截止频率是可选的,一般将截止频率设为80Hz至120Hz之间的值,本实施例对此不作限定。
将低通滤波后得到的重低音声道信号记为A_LFE’,即重低音声道信号为A_LFE’,A_LFE’=LPASS(A_LFE)。
其中,y=LPASS(x)函数表示信号y是信号x经过低通滤波器后得到的信号。
需要说明的是,步骤341与步骤338是并列的,没有特定的先后顺序之分。
在步骤350中,将中央声道信号、左声道信号、右声道信号、后置左声道信号、后置右声道信号和重低音声道信号组合得到5.1声道信号。
终端将通过上述步骤得到的中央声道信号A_C’、左声道信号A_L’、右声道信号A_R’、后置左声道信号A_LS’、后置右声道A_RS’和重低音声道信号A_LFE’组合得到5.1声道信号,记为A_5.1ch,可选的组合方式是本领域技术人员都能理解的,本实施例对此不再赘述。
在步骤360中,将组合得到的5.1声道信号保存在存储器中。
终端将组合得到的5.1声道信号保存在终端自身的存储器中或者外接的存储设备中。
终端存储5.1声道信号时,可以采用非压缩的PCM或WAV等格式。
可选地,终端也可以采用诸如DolbyDigital、AAC、DTS、3D-Audio之类的支持5.1声道的压缩格式。
综上所述,本实施例提供的录音方法,通过终端中的三个麦克风采集三路声音信号,根据这三路声音信号建立并计算中央声道信号、左声道信号、右声道信号、后置左声道信号、后置右声道信号和重低音声道信号,并将这6个声道信号组成为5.1声道的声音信号;解决了相关技术中用户录制的音频数据只能是单声道数据或双声道数据而导致录音的声场范围和临场感较差的问题;达到了在不改变终端的硬件配置的情况下,用户也能录制5.1声道数据,从而提高了录音文件的音质的效果。
本实施例提供的录音方法,通过将三个麦克风按照自由位置进行摆放,从而可以按照终端中实际空间而自由设置三个麦克风,然后将三个麦克风采集到的三路声音信号录制为 5.1声道数据,达到了在不改变终端硬件配置且不要求较为苛刻的麦克风摆放位置,也能够录制5.1声道数据的效果。
下述为本公开装置实施例,可以用于执行本公开方法实施例。对于本公开装置实施例中未披露的细节,请参照本公开方法实施例。
图5是根据一示例性实施例示出的录音装置的框图,如图5所示,该录音装置应用于图1B所示的实施环境中,涉及图1A所示的5.1声道系统,该装置包括但不限于:获取模块500、第一计算模块520、第二计算模块540和组合模块560。
获取模块500,被配置为获取三个麦克风采集的三路声音信号。
第一计算模块520,被配置为根据该三路声音信号计算得到5.1声道中的中央声道信号、左声道信号、右声道信号、后置左声道信号和后置右声道信号。
第二计算模块540,根据三路声音信号计算得出5.1声道中的重低音声道信号。
组合模块560,将中央声道信号、左声道信号、右声道信号、后置左声道信号、后置右声道信号和重低音声道信号组合得到5.1声道的声音信号。
综上所述,本公开实施例中提供的录音方法,通过终端中的三个麦克风采集三路声音信号,根据这三路声音信号建立并计算中央声道信号、左声道信号、右声道信号、后置左声道信号、后置右声道信号和重低音声道信号,并将这6个声道信号组成为5.1声道的声音信号;解决了相关技术中用户录制的音频数据只能是单声道数据或双声道数据而导致录音的声场范围和临场感较差的问题;达到了在不改变终端硬件配置的情况下,用户也能录制5.1声道数据,从而提高了录音文件的音质的效果。
图6是根据另一示例性实施例示出的一种录音装置的框图,如图6所示,本实施例以该录音方法应用于图1B示出的第一种设置方式来举例说明,该装置包括但不限于:获取模块500、降噪模块510、第一计算模块520、第二计算模块540、组合模块560和存储模块580。
获取模块500,被配置为获取三个麦克风采集的三路声音信号。
降噪模块510,被配置为对三路声音信号进行降噪处理。
第一计算模块520,被配置为根据该三路声音信号计算得到5.1声道中的中央声道信号、左声道信号、右声道信号、后置左声道信号和后置右声道信号。
第一计算模块520具体包括:第一子模块521、第二子模块522、第三子模块523、第一平均子模块524和第二平均子模块525。
第一子模块521,被配置为将第一麦克风采集到的第一声音信号作为中央声道信号。
第二子模块522,被配置为将第二麦克风采集到的第二声音信号作为后置左声道信号。
第三子模块523,被配置为将第三麦克风采集到的第三声音信号作为后置右声道信号。
第一平均子模块524,被配置为将第一声音信号和第二声音信号在相同时刻上的幅度 进行加权平均后得到第四声音信号,将第四声音信号作为左声道信号。
第二平均子模块525,被配置为将第一声音信号和第三声音信号在相同时刻上的幅度进行加权平均后得到第五声音信号,将第五声音信号作为右声道信号。
第二计算模块540,被配置为根据三路声音信号计算得出5.1声道中的重低音声道信号,第二计算模块包括:平均子模块541和低通滤波子模块542。
平均子模块541,被配置为将三路声音信号在相同时刻上的幅度进行平均后得到平均声音信号。
低通滤波子模块542,被配置为将平均声音信号进行低通滤波后,得到重低音声道信号。
组合模块560,被配置为将中央声道信号、左声道信号、右声道信号、后置左声道信号、后置右声道信号和重低音声道信号组合得到5.1声道的声音信号。
存储模块580,被配置为将组合得到的5.1声道信号保存在存储器中。
综上所述,本实施例提供的装置,通过终端中的三个麦克风采集三路声音信号,根据这三路声音信号建立并计算中央声道信号、左声道信号、右声道信号、后置左声道信号、后置右声道信号和重低音声道信号,并将这6个声道信号组成为5.1声道的声音信号;解决了相关技术中用户录制的音频数据只能是单声道数据或双声道数据而导致录音的声场范围和临场感较差的问题;达到了在不改变终端硬件配置的情况下,用户也能录制5.1声道数据,从而提高了录音文件的音质的效果。
关于上述实施例中的装置,其中各个模块执行操作的具体方式已经在有关该方法的实施例中进行了详细描述,此处将不做详细阐述说明。
本公开一示例性实施例提供了一种录音装置,用于设置有三个麦克风的移动终端中,能够实现本公开提供的录音方法,该装置包括:处理器、用于存储处理器可执行指令的存储器;
其中,处理器被配置为:
获取三个麦克风采集的三路声音信号;
根据该三路声音信号计算得到5.1声道中的中央声道信号、左声道信号、右声道信号、后置左声道信号和后置右声道信号;
根据三路声音信号计算得出5.1声道中的重低音声道信号;
将中央声道信号、左声道信号、右声道信号、后置左声道信号、后置右声道信号和重低音声道信号组合得到5.1声道的声音信号。
可选地,在上述三个麦克风包括位于5.1声道的中央声道方向的第一麦克风、位于5.1声道的后置左声道方向的第二麦克风和位于5.1声道的后置右声道方向的第三麦克风时,处理器被配置为:
将第一麦克风采集到的第一声音信号作为中央声道信号;
将第二麦克风采集到的第二声音信号作为后置左声道信号;
将第三麦克风采集到的第三声音信号作为后置右声道信号;
将第一声音信号和第二声音信号在相同时刻上的幅度进行加权平均后得到第四声音信号,将第四声音信号作为左声道信号;
将第一声音信号和第三声音信号在相同时刻上的幅度进行加权平均后得到第五声音信号,将第五声音信号作为右声道信号。
可选地,在上述三个麦克风相对于原点分散设置时,处理器被配置为:
对于5.1声道中的任一声道,获取与该声道最近的两个麦克风所采集的两路声音信号;
从上述两路声音信号中,分离出符合与该声道对应的到达相位差的第一声音数据和第二声音数据;
根据与该声道对应的相位差对第一声音数据进行滤波得到第一滤波数据;根据与该声道对应的相位差对第二声音数据进行滤波得到第二滤波数据;
将第一滤波数据和第二滤波数据中的相同部分提取为该声道对应的声音信号;
其中,到达相位差是来自该声道的声音分别抵达上述两个麦克风时所对应的相位差,该声道对应的声音信号是中央声道信号、左声道信号、右声道信号、后置左声道信号和后置右声道信号中的任意一种。
可选地,处理器被配置为:
将三路声音信号在相同时刻上的幅度进行平均后得到平均声音信号;
将平均声音信号进行低通滤波后,得到重低音声道信号。
可选地,处理器被配置为:
对该三路声音信号进行降噪处理。
图7是根据又一示例性实施例示出的一种录音装置的框图,如图7所示,本实施例以该录音方法应用于图1D示出的第二种设置方式来举例说明,该装置包括但不限于:获取模块500、降噪模块510、第一计算模块520、第二计算模块540、组合模块560和存储模块580。
获取模块500,被配置为获取三个麦克风采集的三路声音信号。
降噪模块510,被配置为对三路声音信号进行降噪处理。
第一计算模块520,被配置为根据该三路声音信号计算得到5.1声道中的中央声道信号、左声道信号、右声道信号、后置左声道信号和后置右声道信号。
第一计算模块520具体包括:获取子模块528和分离子模块529。
获取子模块528,被配置为对于5.1声道中的任一声道,获取与该声道最近的两个麦克风所采集的两路声音信号。
分离子模块529,被配置为根据与该声道对应的到达相位差,从上述两路声音信号中 分离出该声道对应的声音信号。
上述分离子模块529还包括:第一分离子模块529a和滤波子模块529b。
第一分离子模块529a,被配置为根据与该声道对应的相位差对第一声音数据进行滤波得到第一滤波数据;根据与该声道对应的相位差对第二声音数据进行滤波得到第二滤波数据。
提取子模块529b,被配置为将第一滤波数据和第二滤波数据中的相同部分提取为该声道对应的声音信号。
第二计算模块540,被配置为根据三路声音信号计算得出5.1声道中的重低音声道信号,第二计算模块包括:平均子模块541和低通滤波子模块542。
平均子模块541,被配置为将三路声音信号在相同时刻上的幅度进行平均后得到平均声音信号。
低通滤波子模块542,被配置为将平均声音信号进行低通滤波后,得到重低音声道信号。
组合模块560,被配置为将中央声道信号、左声道信号、右声道信号、后置左声道信号、后置右声道信号和重低音声道信号组合得到5.1声道的声音信号。
存储模块580,被配置为将组合得到的5.1声道信号保存在存储器中。
综上所述,本实施例提供的装置,通过终端中的三个麦克风采集三路声音信号,根据这三路声音信号建立并计算中央声道信号、左声道信号、右声道信号、后置左声道信号、后置右声道信号和重低音声道信号,并将这6个声道信号组成为5.1声道的声音信号;解决了相关技术中用户录制的音频数据只能是单声道数据或双声道数据而导致录音的声场范围和临场感较差的问题;达到了在不改变终端硬件配置的情况下,用户也能录制5.1声道数据,从而提高了录音文件的音质的效果。
图8是根据一示例性实施例示出的一种用于录音的装置800的框图。例如,装置800可以是移动电话,计算机,数字广播终端,消息收发设备,游戏控制台,平板设备,医疗设备,健身设备,个人数字助理等。
参照图8,装置800可以包括以下一个或多个组件:处理组件802,存储器804,电源组件806,多媒体组件808,音频组件810,输入/输出(I/O)接口812,传感器组件814,以及通信组件816。
处理组件802通常控制装置800的整体操作,诸如与显示,电话呼叫,数据通信,相机操作和记录操作相关联的操作。处理组件802可以包括一个或多个处理器818来执行指令,以完成上述的方法的全部或部分步骤。此外,处理组件802可以包括一个或多个模块,便于处理组件802和其他组件之间的交互。例如,处理组件802可以包括多媒体模块,以方便多媒体组件808和处理组件802之间的交互。
存储器804被配置为存储各种类型的数据以支持在装置800的操作。这些数据的示例 包括用于在装置800上操作的任何应用程序或方法的指令,联系人数据,电话簿数据,消息,图片,视频等。存储器804可以由任何类型的易失性或非易失性存储设备或者它们的组合实现,如静态随机存取存储器(SRAM),电可擦除可编程只读存储器(EEPROM),可擦除可编程只读存储器(EPROM),可编程只读存储器(PROM),只读存储器(ROM),磁存储器,快闪存储器,磁盘或光盘。
电源组件806为装置800的各种组件提供电力。电源组件806可以包括电源管理系统,一个或多个电源,及其他与为装置800生成、管理和分配电力相关联的组件。
多媒体组件808包括在装置800和用户之间的提供一个输出接口的屏幕。在一些实施例中,屏幕可以包括液晶显示器(LCD)和触摸面板(TP)。如果屏幕包括触摸面板,屏幕可以被实现为触摸屏,以接收来自用户的输入信号。触摸面板包括一个或多个触摸传感器以感测触摸、滑动和触摸面板上的手势。触摸传感器可以不仅感测触摸或滑动动作的边界,而且还检测与触摸或滑动操作相关的持续时间和压力。在一些实施例中,多媒体组件808包括一个前置摄像头和/或后置摄像头。当装置800处于操作模式,如拍摄模式或视频模式时,前置摄像头和/或后置摄像头可以接收外部的多媒体数据。每个前置摄像头和后置摄像头可以是一个固定的光学透镜系统或具有焦距和光学变焦能力。
音频组件810被配置为输出和/或输入音频信号。例如,音频组件810包括一个麦克风(MIC),当装置800处于操作模式,如呼叫模式、记录模式和语音识别模式时,麦克风被配置为接收外部音频信号。所接收的音频信号可以被进一步存储在存储器804或经由通信组件816发送。在一些实施例中,音频组件810还包括一个扬声器,用于输出音频信号。
I/O接口812为处理组件802和外围接口模块之间提供接口,上述外围接口模块可以是键盘,点击轮,按钮等。这些按钮可包括但不限于:主页按钮、音量按钮、启动按钮和锁定按钮。
传感器组件814包括一个或多个传感器,用于为装置800提供各个方面的状态评估。例如,传感器组件814可以检测到装置800的打开/关闭状态,组件的相对定位,例如组件为装置800的显示器和小键盘,传感器组件814还可以检测装置800或装置800一个组件的位置改变,用户与装置800接触的存在或不存在,装置800方位或加速/减速和装置800的温度变化。传感器组件814可以包括接近传感器,被配置用来在没有任何的物理接触时检测附近物体的存在。传感器组件814还可以包括光传感器,如CMOS或CCD图像传感器,用于在成像应用中使用。在一些实施例中,该传感器组件814还可以包括加速度传感器,陀螺仪传感器,磁传感器,压力传感器或温度传感器。
通信组件816被配置为便于装置800和其他设备之间有线或无线方式的通信。装置800可以接入基于通信标准的无线网络,如Wi-Fi,2G或3G,或它们的组合。在一个示例性实施例中,通信组件816经由广播信道接收来自外部广播管理系统的广播信号或广播相关信息。在一个示例性实施例中,通信组件816还包括近场通信(NFC)模块,以促进 短程通信。例如,在NFC模块可基于射频识别(RFID)技术,红外数据协会(IrDA)技术,超宽带(UWB)技术,蓝牙(BT)技术和其他技术来实现。
在示例性实施例中,装置800可以被一个或多个应用专用集成电路(ASIC)、数字信号处理器(DSP)、数字信号处理设备(DSPD)、可编程逻辑器件(PLD)、现场可编程门阵列(FPGA)、控制器、微控制器、微处理器或其他电子元件实现,用于执行上述录音方法。
在示例性实施例中,还提供了一种包括指令的非临时性计算机可读存储介质,例如包括指令的存储器804,上述指令可由装置800的处理器818执行以完成上述录音方法。例如,非临时性计算机可读存储介质可以是ROM、随机存取存储器(RAM)、CD-ROM、磁带、软盘和光数据存储设备等。
本领域技术人员在考虑说明书及实践这里公开的发明后,将容易想到本公开的其它实施方案。本申请旨在涵盖本公开的任何变型、用途或者适应性变化,这些变型、用途或者适应性变化遵循本公开的一般性原理并包括本公开未公开的本技术领域中的公知常识或惯用技术手段。说明书和实施例仅被视为示例性的,本公开的真正范围和精神由下面的权利要求指出。
应当理解的是,本公开并不局限于上面已经描述并在附图中示出的精确结构,并且可以在不脱离其范围进行各种修改和改变。本公开的范围仅由所附的权利要求来限制。

Claims (13)

  1. 一种录音方法,其特征在于,用于设置有三个麦克风的移动终端中,所述方法包括:
    获取所述三个麦克风采集的三路声音信号;
    根据所述三路声音信号计算得到5.1声道中的中央声道信号、左声道信号、右声道信号、后置左声道信号和后置右声道信号;
    根据所述三路声音信号计算得出所述5.1声道中的重低音声道信号;
    将所述中央声道信号、左声道信号、右声道信号、后置左声道信号、后置右声道信号和所述重低音声道信号组合得到所述5.1声道的声音信号。
  2. 根据权利要求1所述的方法,其特征在于,所述三个麦克风包括位于所述5.1声道的中央声道方向的第一麦克风、位于所述5.1声道的后置左声道方向的第二麦克风和位于所述5.1声道的后置右声道方向的第三麦克风;
    所述根据所述三路声音信号计算得到5.1声道中的中央声道信号、左声道信号、右声道信号、后置左声道信号和后置右声道信号,包括:
    将所述第一麦克风采集到的第一声音信号作为所述中央声道信号;
    将所述第二麦克风采集到的第二声音信号作为所述后置左声道信号;
    将所述第三麦克风采集到的第三声音信号作为所述后置右声道信号;
    将所述第一声音信号和所述第二声音信号在相同时刻上的幅度进行加权平均后得到第四声音信号,将所述第四声音信号作为所述左声道信号;
    将所述第一声音信号和所述第三声音信号在相同时刻上的幅度进行加权平均后得到第五声音信号,将所述第五声音信号作为所述右声道信号。
  3. 根据权利要求1所述的方法,其特征在于,所述三个麦克风相对于原点分散设置;所述5.1声道中的每个声道在所述三个麦克风中都存在距离最近的两个麦克风;
    所述根据所述三路声音信号计算得到5.1声道中的中央声道信号、左声道信号、右声道信号、后置左声道信号和后置右声道信号,包括:
    对于所述5.1声道中的任一声道,获取与所述声道最近的两个麦克风所采集的两路声音信号;
    根据与所述声道对应的到达相位差,从所述两路声音信号中分离出所述声道对应的声音信号;
    其中,所述到达相位差是来自所述声道的声音分别抵达所述两个麦克风时所对应的初相角之差,所述声道对应的声音信号是中央声道信号、左声道信号、右声道信号、后置左声道信号和后置右声道信号中的任意一种。
  4. 根据权利要求3所述的方法,其特征在于,所述根据与所述声道对应的到达相位差,从所述两路声音信号中分离出所述声道对应的声音信号,包括:
    根据与所述声道对应的到达相位差对所述两路声音信号中的第一路声音信号进行滤波得到第一滤波数据;根据与所述声道对应的到达相位差对所述两路声音信号中的第二路声音信号进行滤波得到第二滤波数据;
    将所述第一滤波数据和所述第二滤波数据中的相同部分提取为所述声道对应的声音信号。
  5. 根据权利要求2至4任一所述的方法,其特征在于,所述根据所述三路声音信号计算得出所述5.1声道中的重低音声道信号,包括:
    将所述三路声音信号在相同时刻上的幅度进行平均后得到平均声音信号;
    将所述平均声音信号进行低通滤波后,得到所述重低音声道信号。
  6. 根据权利要求1至5任一所述的方法,其特征在于,所述方法还包括:
    对所述三路声音信号进行降噪处理。
  7. 一种录音装置,其特征在于,所述装置中设置有三个麦克风,所述装置包括:
    获取模块,被配置为获取所述三个麦克风采集的三路信号;
    第一计算模块,被配置为根据所述三路声音信号计算得到5.1声道中的中央声道信号、左声道信号、右声道信号、后置左声道信号和后置右声道信号;
    第二计算模块,被配置为根据所述三路声音信号计算得出所述5.1声道中的重低音声道信号;
    组合模块,被配置为将所述中央声道信号、左声道信号、右声道信号、后置左声道信号、后置右声道信号和所述重低音声道信号组合得到所述5.1声道的声音信号。
  8. 根据权利要求7所述的装置,其特征在于,所述三个麦克风包括位于所述5.1声道的中央声道方向的第一麦克风、位于所述5.1声道的后置左声道方向的第二麦克风和位于所述5.1声道的后置右声道方向的第三麦克风;
    所述第一计算模块包括:
    第一子模块,被配置为将所述第一麦克风采集到的第一声音信号作为所述中央声道信号;
    第二子模块,被配置为将所述第二麦克风采集到的第二声音信号作为所述后置左声道信号;
    第三子模块,被配置为将所述第三麦克风采集到的第三声音信号作为所述后置右声道信号;
    第一平均子模块,被配置为将所述第一声音信号和所述第二声音信号在相同时刻上的幅度进行加权平均后得到第四声音信号,将所述第四声音信号作为所述左声道信号;
    第二平均子模块,被配置为将所述第一声音信号和所述第三声音信号在相同时刻上的幅度进行加权平均后得到第五声音信号,将所述第五声音信号作为所述右声道信号。
  9. 根据权利要求7所述的装置,其特征在于,所述三个麦克风相对于原点分散设置;所述5.1声道中的每个声道在所述三个麦克风中都存在距离最近的两个麦克风;
    所述第一计算模块包括:
    获取子模块,被配置为对于所述5.1声道中的任一声道,获取与所述声道最近的两个麦克风所采集的两路声音信号;
    分离子模块,被配置为根据与所述声道对应的到达相位差,从所述两路声音信号中分离出所述声道对应的声音信号;
    其中,所述到达相位差是来自所述声道的声音分别抵达所述两个麦克风时所对应的初相角之差,所述声道对应的声音信号是中央声道信号、左声道信号、右声道信号、后置左声道信号和后置右声道信号中的任意一种。
  10. 根据权利要求9所述的装置,其特征在于,所述分离子模块包括:
    滤波子模块,被配置为根据与所述声道对应的相位差对所述第一声音数据进行滤波得到第一滤波数据;根据与所述声道对应的相位差对所述第二声音数据进行滤波得到第二滤波数据;
    提取子模块,被配置为将所述第一滤波数据和所述第二滤波数据中的相同部分提取为所述声道对应的声音信号。
  11. 根据权利要求8至10任一所述的装置,其特征在于,所述第二计算模块包括:
    平均子模块,被配置为将所述三路声音信号在相同时刻上的幅度进行平均后得到平均声音信号;
    低通滤波子模块,被配置为将所述平均声音信号进行低通滤波后,得到所述重低音声道信号。
  12. 根据权利要求7至11任一所述的装置,其特征在于,所述装置还包括:
    降噪模块,被配置为对所述三路声音信号进行降噪处理。
  13. 一种录音装置,其特征在于,所述装置中设置有三个麦克风,所述装置包括:
    处理器;
    用于存储所述处理器可执行指令的存储器;
    其中,所述处理器被配置为:
    获取所述三个麦克风采集的三路声音信号;
    根据所述三路声音信号计算得到5.1声道中的中央声道信号、左声道信号、右声道信号、后置左声道信号和后置右声道信号;
    根据所述三路声音信号计算得出所述5.1声道中的重低音声道信号;
    将所述中央声道信号、左声道信号、右声道信号、后置左声道信号、后置右声道信号和所述重低音声道信号组合得到所述5.1声道的声音信号。
PCT/CN2015/098845 2015-10-29 2015-12-25 录音方法及装置 WO2017071045A1 (zh)

Priority Applications (4)

Application Number Priority Date Filing Date Title
KR1020167004126A KR101848458B1 (ko) 2015-10-29 2015-12-25 레코딩 방법 및 그 장치
JP2017547041A JP6364130B2 (ja) 2015-10-29 2015-12-25 レコーディング方法、装置、プログラム及び記録媒体
RU2016107764A RU2635838C2 (ru) 2015-10-29 2015-12-25 Способ и устройство для звукозаписи
MX2016002669A MX361094B (es) 2015-10-29 2015-12-25 Método y dispositivo para grabación de sonido.

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510719339.1A CN105407443B (zh) 2015-10-29 2015-10-29 录音方法及装置
CN201510719339.1 2015-10-29

Publications (1)

Publication Number Publication Date
WO2017071045A1 true WO2017071045A1 (zh) 2017-05-04

Family

ID=55472640

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/098845 WO2017071045A1 (zh) 2015-10-29 2015-12-25 录音方法及装置

Country Status (8)

Country Link
US (1) US9930467B2 (zh)
EP (1) EP3163904A1 (zh)
JP (1) JP6364130B2 (zh)
KR (1) KR101848458B1 (zh)
CN (1) CN105407443B (zh)
MX (1) MX361094B (zh)
RU (1) RU2635838C2 (zh)
WO (1) WO2017071045A1 (zh)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106297815B (zh) * 2016-07-27 2017-09-01 武汉诚迈科技有限公司 一种语音识别场景中回音消除的方法
CN110881164B (zh) * 2018-09-06 2021-01-26 宏碁股份有限公司 增益动态调节的音效控制方法及音效输出装置
CN115474117B (zh) * 2022-11-03 2023-01-10 深圳黄鹂智能科技有限公司 基于三麦克风的收音方法和收音装置

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1091889A (zh) * 1992-12-31 1994-09-07 德斯帕产品公司 用于声象增强的立体声控制装置和方法
CN101138275A (zh) * 2005-04-05 2008-03-05 索尼株式会社 音频信号处理装置以及音频信号处理方法
CN101902679A (zh) * 2009-05-31 2010-12-01 比亚迪股份有限公司 立体声音频信号模拟5.1声道音频信号的处理方法
CN102082991A (zh) * 2010-11-24 2011-06-01 蔡庸成 一种专为耳机试听设计的模拟现场全息音频的方法
EP2530957A2 (en) * 2011-05-30 2012-12-05 Sony Mobile Communications AB Sensor-based placement of sound in video recording
CN103905960A (zh) * 2012-11-08 2014-07-02 Dsp集团有限公司 手持装置中的增强立体声音频记录

Family Cites Families (43)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5386473A (en) * 1994-01-21 1995-01-31 Harrison; Robert W. Passive surround sound circuit
US5796843A (en) * 1994-02-14 1998-08-18 Sony Corporation Video signal and audio signal reproducing apparatus
JP4478220B2 (ja) * 1997-05-29 2010-06-09 ソニー株式会社 音場補正回路
JP3344647B2 (ja) * 1998-02-18 2002-11-11 富士通株式会社 マイクロホンアレイ装置
JP4089020B2 (ja) * 1998-07-09 2008-05-21 ソニー株式会社 オーディオ信号処理装置
DE69904822T2 (de) * 1999-10-07 2003-11-06 Zlatan Ribic Verfahren und Anordnung zur Aufnahme von Schallsignalen
JP2002232988A (ja) * 2001-01-30 2002-08-16 Matsushita Electric Ind Co Ltd マルチチャンネル収音装置
JP2002232989A (ja) * 2001-02-01 2002-08-16 Matsushita Electric Ind Co Ltd マルチチャンネル音場収音装置
US6804565B2 (en) * 2001-05-07 2004-10-12 Harman International Industries, Incorporated Data-driven software architecture for digital sound processing and equalization
JP4062905B2 (ja) * 2001-10-24 2008-03-19 ヤマハ株式会社 ディジタル・ミキサ
JP4428552B2 (ja) * 2001-10-24 2010-03-10 ヤマハ株式会社 ディジタル・ミキサ
EP1370115B1 (en) * 2002-06-07 2009-07-15 Panasonic Corporation Sound image control system
JP2005311604A (ja) * 2004-04-20 2005-11-04 Sony Corp 情報処理装置及び情報処理装置に用いるプログラム
JP2006060720A (ja) * 2004-08-24 2006-03-02 Hitachi Ltd 集音システム
JP4685106B2 (ja) * 2005-07-29 2011-05-18 ハーマン インターナショナル インダストリーズ インコーポレイテッド オーディオ調整システム
JP2007068021A (ja) * 2005-09-01 2007-03-15 Matsushita Electric Ind Co Ltd マルチチャンネルオーディオ信号の補正装置
JP4670682B2 (ja) * 2006-02-28 2011-04-13 日本ビクター株式会社 オーディオ装置及び指向音生成方法
JP4850628B2 (ja) * 2006-08-28 2012-01-11 キヤノン株式会社 記録装置
JP4367484B2 (ja) * 2006-12-25 2009-11-18 ソニー株式会社 音声信号処理装置、音声信号処理方法及び撮像装置
US8068620B2 (en) * 2007-03-01 2011-11-29 Canon Kabushiki Kaisha Audio processing apparatus
DE102008029352A1 (de) * 2008-06-20 2009-12-31 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung, Verfahren und Computerprogramm zum Lokalisieren einer Schallquelle
CN102273233B (zh) * 2008-12-18 2015-04-15 杜比实验室特许公司 音频通道空间转换
JP5691130B2 (ja) * 2009-03-11 2015-04-01 ヤマハ株式会社 聴者を取り囲むように配置される複数のスピーカで音響再生を行う際にクロストークをキャンセルする装置、方法、プログラム、およびシステム
US8351623B2 (en) * 2009-03-27 2013-01-08 Yamaha Corporation Audio mixing apparatus
US8559655B2 (en) * 2009-05-18 2013-10-15 Harman International Industries, Incorporated Efficiency optimized audio system
US8654997B2 (en) * 2010-03-18 2014-02-18 Donald Eugene Meehan, Sr. Personal miniaturized loudspeaker placement platform
US8638951B2 (en) * 2010-07-15 2014-01-28 Motorola Mobility Llc Electronic apparatus for generating modified wideband audio signals based on two or more wideband microphone signals
CN102438200A (zh) * 2010-09-29 2012-05-02 联想(北京)有限公司 音频信号输出的方法及终端设备
US9552840B2 (en) * 2010-10-25 2017-01-24 Qualcomm Incorporated Three-dimensional sound capturing and reproducing with multi-microphones
US9055371B2 (en) * 2010-11-19 2015-06-09 Nokia Technologies Oy Controllable playback system offering hierarchical playback options
US9313599B2 (en) * 2010-11-19 2016-04-12 Nokia Technologies Oy Apparatus and method for multi-channel signal playback
US9031268B2 (en) * 2011-05-09 2015-05-12 Dts, Inc. Room characterization and correction for multi-channel audio
CN102802112B (zh) * 2011-05-24 2014-08-13 鸿富锦精密工业(深圳)有限公司 具有音频文件格式转换功能的电子装置
EP2530956A1 (en) * 2011-06-01 2012-12-05 Tom Van Achte Method for generating a surround audio signal from a mono/stereo audio signal
WO2013028393A1 (en) * 2011-08-23 2013-02-28 Dolby Laboratories Licensing Corporation Method and system for generating a matrix-encoded two-channel audio signal
US9253567B2 (en) * 2011-08-31 2016-02-02 Stmicroelectronics S.R.L. Array microphone apparatus for generating a beam forming signal and beam forming method thereof
US20130343549A1 (en) * 2012-06-22 2013-12-26 Verisilicon Holdings Co., Ltd. Microphone arrays for generating stereo and surround channels, method of operation thereof and module incorporating the same
JP2014017645A (ja) * 2012-07-09 2014-01-30 Sony Corp 音声信号処理装置、音声信号処理方法、プログラム及び記録媒体
KR20140016780A (ko) * 2012-07-31 2014-02-10 인텔렉추얼디스커버리 주식회사 오디오 신호 처리 방법 및 장치
AU2014236850C1 (en) * 2013-03-14 2017-02-16 Apple Inc. Robust crosstalk cancellation using a speaker array
JP6430506B2 (ja) * 2013-11-22 2018-11-28 アップル インコーポレイテッドApple Inc. ハンズフリー・ビームパターン構成
CN105451151B (zh) * 2014-08-29 2018-09-21 华为技术有限公司 一种处理声音信号的方法及装置
CN104581512A (zh) * 2014-11-21 2015-04-29 广东欧珀移动通信有限公司 一种立体声录制方法及装置

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1091889A (zh) * 1992-12-31 1994-09-07 德斯帕产品公司 用于声象增强的立体声控制装置和方法
CN101138275A (zh) * 2005-04-05 2008-03-05 索尼株式会社 音频信号处理装置以及音频信号处理方法
CN101902679A (zh) * 2009-05-31 2010-12-01 比亚迪股份有限公司 立体声音频信号模拟5.1声道音频信号的处理方法
CN102082991A (zh) * 2010-11-24 2011-06-01 蔡庸成 一种专为耳机试听设计的模拟现场全息音频的方法
EP2530957A2 (en) * 2011-05-30 2012-12-05 Sony Mobile Communications AB Sensor-based placement of sound in video recording
CN103905960A (zh) * 2012-11-08 2014-07-02 Dsp集团有限公司 手持装置中的增强立体声音频记录

Also Published As

Publication number Publication date
MX2016002669A (es) 2017-05-30
KR20170061098A (ko) 2017-06-02
US20170127207A1 (en) 2017-05-04
JP2018500858A (ja) 2018-01-11
KR101848458B1 (ko) 2018-04-13
US9930467B2 (en) 2018-03-27
RU2016107764A (ru) 2017-09-07
CN105407443A (zh) 2016-03-16
RU2635838C2 (ru) 2017-11-16
EP3163904A1 (en) 2017-05-03
MX361094B (es) 2018-11-26
CN105407443B (zh) 2018-02-13
JP6364130B2 (ja) 2018-07-25

Similar Documents

Publication Publication Date Title
US9966084B2 (en) Method and device for achieving object audio recording and electronic apparatus
US10585486B2 (en) Gesture interactive wearable spatial audio system
EP2594087B1 (en) Electronic apparatus for generating modified wideband audio signals based on two or more wideband microphone signals
CN109155135B (zh) 用于降噪的方法、装置和计算机程序
US9071900B2 (en) Multi-channel recording
US9344826B2 (en) Method and apparatus for communicating with audio signals having corresponding spatial characteristics
WO2016105620A1 (en) Binaural recording for processing audio signals to enable alerts
US20130106997A1 (en) Apparatus and method for generating three-dimension data in portable terminal
US20170195817A1 (en) Simultaneous Binaural Presentation of Multiple Audio Streams
CN107113496B (zh) 移动设备的环绕声记录
WO2017071045A1 (zh) 录音方法及装置
US11632643B2 (en) Recording and rendering audio signals
US20210037336A1 (en) An apparatus and associated methods for telecommunications
WO2023151526A1 (zh) 音频采集方法、装置、电子设备及外设组件
EP2941770A1 (en) Method for determining a stereo signal
WO2023231787A1 (zh) 音频处理方法和装置
WO2018064883A1 (zh) 一种录音方法、装置、设备及计算机存储介质
US10419851B2 (en) Retaining binaural cues when mixing microphone signals
US20230099275A1 (en) Method and system for context-dependent automatic volume compensation
US11823706B1 (en) Voice activity detection in audio signal

Legal Events

Date Code Title Description
ENP Entry into the national phase

Ref document number: 20167004126

Country of ref document: KR

Kind code of ref document: A

ENP Entry into the national phase

Ref document number: 2017547041

Country of ref document: JP

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: MX/A/2016/002669

Country of ref document: MX

ENP Entry into the national phase

Ref document number: 2016107764

Country of ref document: RU

Kind code of ref document: A

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15907106

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15907106

Country of ref document: EP

Kind code of ref document: A1