WO2013094103A1 - 音響処理装置および音響処理方法 - Google Patents
音響処理装置および音響処理方法 Download PDFInfo
- Publication number
- WO2013094103A1 WO2013094103A1 PCT/JP2012/006816 JP2012006816W WO2013094103A1 WO 2013094103 A1 WO2013094103 A1 WO 2013094103A1 JP 2012006816 W JP2012006816 W JP 2012006816W WO 2013094103 A1 WO2013094103 A1 WO 2013094103A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- signal
- sound
- level
- delay
- directivity
- Prior art date
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/005—Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R1/00—Details of transducers, loudspeakers or microphones
- H04R1/20—Arrangements for obtaining desired frequency or directional characteristics
- H04R1/32—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
- H04R1/326—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only for microphones
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R1/00—Details of transducers, loudspeakers or microphones
- H04R1/20—Arrangements for obtaining desired frequency or directional characteristics
- H04R1/32—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
- H04R1/40—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
- H04R1/406—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers microphones
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2201/00—Details of transducers, loudspeakers or microphones covered by H04R1/00 but not provided for in any of its subgroups
- H04R2201/40—Details of arrangements for obtaining desired directional characteristic by combining a number of identical transducers covered by H04R1/40 but not provided for in any of its subgroups
Definitions
- the present invention relates to an acoustic processing apparatus and an acoustic processing method for performing directivity synthesis processing on sound collected signals output from at least two sound collectors.
- a device that enables directional sound collection is, for example, a remote conference system including a sound collection device, a digital video camera, or a digital still camera (DSC: Digital Still Camera).
- DSC Digital Still Camera
- a device capable of collecting directional sound (hereinafter, also referred to as “sound collecting device”), a device unit (hereinafter referred to as “acoustic processing device”) that performs directivity synthesis processing uses sound waves for directivity synthesis processing.
- the phase difference of is used.
- the sound processing device requires a delay process for the collected sound signal.
- the delay amount used for the delay process is set based on the distance between the acoustic terminals.
- the distance between acoustic terminals refers to an acoustic distance between two terminals that collect sound (here, a microphone, hereinafter also referred to as “sound collector”). More specifically, the distance between acoustic terminals refers to the difference between arrival times of sound waves between terminals multiplied by the speed of sound when the sound source exists on a linear axis connecting the terminals.
- the delay amount needs to be an appropriate value corresponding to the actual distance between the acoustic terminals.
- the sound processing device sets a delay amount corresponding to the actual distance between the sound terminals, for example, when collecting sound, suppresses sound from a specific direction such as speech sound, ambient noise, and the like. It is possible to pick up sound in the state.
- the actual distance between the acoustic terminals may deviate from the actually measured distance (mechanical design value) between the terminals due to the influence of the structure around the terminals such as the housing in which the microphone is incorporated.
- the sound processing apparatus may use an inappropriate delay amount.
- Patent Document 1 (hereinafter referred to as “conventional technique”) describes a technique for setting an appropriate delay amount.
- the position of a sound source is estimated based on the known distance between acoustic terminals from the collected sound signals of two microphones whose distance between acoustic terminals is known among the four microphones.
- the related art estimates the position of the other microphone based on the estimated position of the sound source from the collected sound signal of the other microphone.
- the prior art reduces the amount of delay between two microphones whose distance between acoustic terminals calculated from the position of the sound source is unknown and the square error between the measured values of the delay amounts.
- the estimated value of the sound source position and the position of each microphone is adjusted.
- the sound source is arranged in a predetermined position in one direction in a direction on a straight line connecting the two microphones of the sound collection device (hereinafter referred to as “axial direction”) in the anechoic chamber. Then, by applying the above-described conventional technique, the estimated value of the microphone position is adjusted so that the square error is minimized.
- the sound processing device to which the conventional technology is applied can accurately estimate the actual distance between the sound terminals from the angle of the sound source direction and the delay amount of the directivity synthesis processing, and can realize an arbitrary directivity pattern with high accuracy. Can do.
- the sound processing apparatus to which the conventional technology is applied is used for the sound collecting device of the teleconference system, and the sound collecting device is embedded in a large object such as a desk.
- restricting the microphone mounting structure itself can be a restriction on the structure of the mounting side and the design of the device.
- the position of the sound source relative to the sound processing device is not always constant, and in situations where the sound source position changes or multiple sound sources exist at the same time, the follow-up capability of the sound source direction search deteriorates and delay estimation is performed. It is difficult to do correctly.
- the conventional technique has a problem that correct delay estimation cannot be performed when acoustic changes occur in the microphone mounting structure and mounting position, the structure around the microphone, and the like.
- An object of the present invention is to accurately adjust a delay amount in an actual environment even if an acoustic change occurs in a microphone mounting structure, a mounting position, and a structure around the microphone.
- the acoustic processing device is directed to the first sound collection signal output from the first sound collector and the second sound collection signal output from the second sound collector.
- a sound processing device for performing a sex synthesis process wherein a first directional sound pickup signal is generated by delaying the second sound pickup signal with respect to the first sound pickup signal, and the second sound pickup signal is generated.
- a directivity synthesis processing unit that generates a second directional sound collection signal obtained by delaying and synthesizing the first sound collection signal with respect to the first sound collection signal, the first directional sound collection signal, and the first A directional level signal indicating the level of a signal obtained by adding the two directional sound pickup signals, a first level signal indicating the level of the first directional sound pickup signal, and the second directivity.
- Comparison signal calculation for generating a directivity level signal obtained by adding the second level signal indicating the level of the characteristic sound pickup signal A level comparison unit that obtains a level difference between the omnidirectional level signal and the directivity level signal, and a delay that adjusts the amount of delay in the directivity synthesis processing unit so that the level difference is reduced And an operation unit.
- the acoustic processing method is directed to the first sound collection signal output from the first sound collector and the second sound collection signal output from the second sound collector.
- a sound processing method in a sound processing device that performs sex synthesis processing, wherein a first directional sound pickup signal is generated by delaying and synthesizing the second sound pickup signal with respect to the first sound pickup signal.
- the first directional sound collection unit generates a second directional sound collection signal that is generated by delaying and synthesizing the first sound collection signal with respect to the second sound collection signal.
- An omnidirectional signal indicating a level of a signal obtained by adding the signal and the second directional sound collection signal, and the first directional sound collection signal and the second directional sound collection signal.
- the present invention can accurately determine the distance between the acoustic terminals in the real space even if an acoustic change occurs in the microphone mounting structure and mounting position and the structure around the microphone.
- the block diagram which shows an example of a structure of the sound processing apparatus which concerns on Embodiment 1 of this invention.
- the block diagram which shows an example of a structure of the sound collection apparatus containing the sound processing apparatus based on Embodiment 2 of this invention.
- the figure which shows the definition of the direction in Embodiment 2 of this invention The figure which shows the simulation result of the polar pattern of the 1st directional sound collection signal in case the delay amount of the 2nd delay device in Embodiment 2 of this invention is small.
- the figure which shows the simulation result of the polar pattern of the 1st directivity sound collection signal in case the delay amount of the 2nd delay device is an appropriate value in Embodiment 2 of this invention.
- the figure which shows the simulation result of the polar pattern of the 1st directivity sound collection signal in case the delay amount of the 2nd delay device is large in Embodiment 2 of this invention.
- the figure which shows the simulation result of the polar pattern of a non-directional level signal and the polar pattern of a directivity level signal in case the delay amount of the 2nd delay device is small in Embodiment 2 of this invention.
- the figure which shows the simulation result of the polar pattern of a non-directional level signal and the polar pattern of a directivity level signal in case the delay amount of a 2nd delay device is an appropriate value in Embodiment 2 of this invention.
- the figure which shows the simulation result of the polar pattern of a non-directional level signal and the polar pattern of a directivity level signal in case the delay amount of the 2nd delay device is large in Embodiment 2 of this invention.
- the block diagram which shows an example of a structure of the sound collection apparatus containing the sound processing apparatus which concerns on Embodiment 3 of this invention.
- the flowchart which shows an example of operation
- the block diagram which shows an example of a structure of the sound processing apparatus which concerns on Embodiment 4 of this invention.
- the flowchart which shows an example of operation
- the block diagram which shows an example of a structure of the sound processing apparatus which concerns on Embodiment 5 of this invention.
- the figure which shows an example of the relationship between the microphone for obtaining the designated directivity pattern, and the designated direction angle (theta) in Embodiment 5 of this invention.
- the flowchart which shows an example of operation
- Embodiment 1 of the present invention is an example of a basic aspect of the present invention.
- FIG. 1 is a block diagram showing an example of the configuration of the sound processing apparatus according to the present embodiment.
- the sound processing apparatus 400 includes a first sound collection signal output from a first sound collector (not shown) and a second sound output from a second sound collector (not shown). It is a device that performs directivity synthesis processing on a collected sound signal.
- the sound processing apparatus 400 includes a directivity synthesis processing unit 410, a comparison signal calculation unit 440, a level comparison unit 451, and a delay operation unit 452.
- the directivity synthesis processing unit 410 generates a first directional sound collection signal synthesized by delaying the second sound collection signal with respect to the first sound collection signal. That is, the directivity synthesis processing unit 410 synthesizes the first sound pickup signal by delaying the second sound collection signal in the first direction, which is the first sound collector side direction. Try to have directivity.
- the directivity synthesis processing unit 410 generates a second directional sound collection signal obtained by synthesizing the second sound collection signal by delaying the first sound collection signal. That is, the directivity synthesis processing unit 410 delays and synthesizes the first sound collection signal with respect to the second sound collection signal, thereby causing the second sound collection side to move in the second direction. Try to have directivity.
- the comparison signal calculation unit 440 generates an omnidirectional level signal indicating the level of the signal obtained by adding the first directional sound collection signal and the second directional sound collection signal. Further, the comparison signal calculation unit 440 adds the first level signal indicating the level of the first directional sound collection signal and the second level signal indicating the level of the second directional sound collection signal. The resulting directivity level signal is generated.
- the level comparison unit 451 acquires a level difference between the omnidirectional level signal and the directional level signal.
- the delay operation unit 452 adjusts the amount of delay in the directivity synthesis processing unit 410 so that the level difference becomes small.
- the sound processing apparatus 400 includes, for example, a CPU (Central Processing Unit), a storage medium such as a ROM (Read Only Memory) storing a control program, and a working memory such as a RAM (Random Access Memory), although not shown.
- a CPU Central Processing Unit
- ROM Read Only Memory
- RAM Random Access Memory
- the sound processing apparatus 400 adjusts the delay amount so that phase inversion does not occur with respect to the directional sound pickup signal having directivity in the direction of at least one sound collector side.
- the sound processing device 400 can accurately realize an arbitrary directivity pattern by adjusting the amount of delay to the minimum value at which phase inversion does not occur, and obtains necessary sound with high quality. be able to. In other words, the sound processing apparatus 400 according to the present embodiment can correctly calculate the distance between the sound terminals and process the sound collection signal.
- the sound processing device 400 specifically adjusts the adjustment of the delay amount so that the level difference between the omnidirectional level signal and the directional level signal becomes small. Thereby, the sound processing apparatus 400 can easily adjust the delay amount so that phase inversion does not occur. This adjustment is possible if there is any sound source in the axial direction. Therefore, the acoustic processing device 400 can more easily realize an arbitrary directivity pattern with high accuracy, and can more easily acquire a necessary sound (sound, sound) with high quality.
- the sound processing device 400 can accurately adjust the delay amount by adjusting the delay amount.
- the acoustic processing device 400 can easily prevent phase reversal in an actual environment even if an acoustic change occurs in the microphone and the surrounding structures and the distance between the acoustic terminals changes.
- the delay amount can be adjusted. This adjustment is possible if there is any sound source in the axial direction. Therefore, the acoustic processing device 400 can accurately adjust the delay amount in an actual environment even if an acoustic change occurs in the microphone mounting structure and mounting position, the structure around the microphone, and the like.
- the second embodiment of the present invention is an example of a specific mode when the present invention is applied to a sound collection device such as a digital camera provided with two microphones.
- the sound collection device performs stereo sound collection with cardioid directional characteristics extending in both directions (axial direction) on a straight line connecting two microphones.
- a general stereo microphone is provided with a frequency characteristic correction unit (equalizer) for amplifying a low frequency at the output of the subtraction unit.
- a frequency characteristic correction unit (equalizer) for amplifying a low frequency at the output of the subtraction unit.
- circuit noise is superimposed and adversely affects the delay correction process, a configuration in which the frequency characteristic correction unit is omitted will be described here.
- Each unit of the sound processing device described below includes, for example, hardware including two microphones arranged inside the housing of the sound collection device, a CPU, and a storage medium such as a ROM storing a control program. It is realized by.
- FIG. 2 is a block diagram showing an example of the configuration of a sound collection device including the sound processing apparatus according to the present embodiment.
- the sound collection device 100 includes a first microphone 200, a second microphone 300, and the acoustic processing device 400 according to the present embodiment.
- the first microphone 200, the second microphone 300, and the sound processing device 400 are arranged, for example, inside a housing (not shown) of the sound collection device 100. Further, the first microphone 200 and the second microphone 300 are arranged at different positions at a distance from each other.
- the first microphone 200 is an omnidirectional microphone (first sound collector).
- the first microphone 200 collects sound and outputs a sound collection signal.
- the sound collection signal output by the first microphone 200 is referred to as a “first sound collection signal”.
- the second microphone 300 is an omnidirectional microphone (second sound collector).
- the second microphone 300 collects sound and outputs a sound collection signal.
- the sound collection signal output by the second microphone 300 is referred to as a “second sound collection signal”.
- the actual distance between the acoustic terminals between the first microphone 200 and the second microphone 300 is 10 mm (millimeters). This is an initially unknown value.
- the sound processing device 400 inputs the first sound collection signal and the second sound collection signal. Then, the sound processing device 400 performs directivity synthesis processing on the first sound collection signal and the second sound collection signal.
- the sound processing device 400 includes a directivity synthesis processing unit 410, a first signal output unit 421, a second signal output unit 422, a first band limiting unit 431, and a second band limiting unit 432. , A comparison signal calculation unit 440, a level comparison unit 451, and a delay operation unit 452.
- the directivity synthesis processing unit 410 delays and synthesizes the second sound collection signal with respect to the first sound collection signal, thereby directing the directivity in the first direction that is the direction of the first sound collector.
- a first directional sound pickup signal having the above is generated.
- the directivity synthesis processing unit 410 synthesizes the second sound collection signal by delaying and synthesizing the first sound collection signal in the second direction, which is the direction on the second sound collector side.
- a second directional sound pickup signal having directivity is generated. That is, the directivity synthesis processing unit 410 generates two directional sound collection signals having directivity characteristics that are paired in the axial direction from the first sound collection signal and the second sound collection signal.
- the directivity synthesis processing unit 410 includes a first delay unit 411, a second delay unit 412, a first adder 413, and a second adder 414.
- the first delay unit 411 inputs the first sound pickup signal.
- the first delay unit 411 outputs a first delayed sound collection signal obtained by delaying the first sound collection signal.
- the second delay device 412 inputs the second sound collection signal. Then, the second delay unit 412 outputs a second delayed sound collection signal obtained by delaying the second sound collection signal.
- the delay amount of the first delayed sound pickup signal with respect to the first sound pickup signal and the delay amount of the second delay sound pickup signal with respect to the second sound pickup signal are respectively determined by a delay operation unit 452 described later. It is adjustable.
- the first adder 413 inputs the first sound collection signal and the second delayed sound collection signal with the polarity reversed. Then, the first adder 413 adds the first sound collection signal and the second delayed sound collection signal whose polarity is inverted, and outputs a first directional sound collection signal as a result of the addition.
- the second adder 414 inputs the second sound pickup signal and the first delayed sound pickup signal with the polarity reversed. Then, the second adder 414 adds the second sound pickup signal and the first delayed sound pickup signal whose polarity is inverted, and outputs a second directional sound pickup signal as a result of the addition. .
- the first signal output unit 421 receives the first directional sound pickup signal and outputs it to the outside of the sound processing device 400.
- the second signal output unit 422 inputs the second directional sound pickup signal and outputs it to the outside of the sound processing device 400.
- the first band limiting unit 431 inputs the first directional sound pickup signal. Then, the first band limiting unit 431 outputs a signal obtained by performing band limitation on the first directional sound pickup signal to the comparison signal calculating unit 440. That is, the first band limiting unit 431 has a frequency at which spatial aliasing does not occur even if the amount of delay is changed with respect to the first directional sound pickup signal input to the comparison signal calculation unit 440. Limit the bandwidth to the bandwidth.
- the second band limiting unit 432 inputs the second directional sound pickup signal. Then, the second band limiting unit 432 outputs the signal obtained by performing the band limitation to the comparison signal calculating unit 440. That is, the second band limiting unit 432 converts the second directional sound pickup signal input to the comparison signal calculation unit 440 to a frequency band in which spatial aliasing does not occur even if the amount of delay is changed. Perform bandwidth limitation.
- Spatial aliasing occurs due to phase interference of incident waves having a relatively high frequency when performing directivity synthesis processing, and is a phenomenon having a directivity gain in an unintended direction.
- Bandwidth limiting method is not limited to a specific one.
- Such band limitation can be realized by, for example, a band-pass filter that performs time-domain filtering.
- windowing is performed while overlapping every certain number of samples, and frequency decomposition is performed by FFT (Fast Fourier Transform).
- FFT Fast Fourier Transform
- the band limitation can be realized by extracting a complex spectrum signal corresponding to a desired frequency. Details of the limited frequency bands in the first band limiting unit 431 and the second band limiting unit 432 will be described later.
- the comparison signal calculation unit 440 includes the first directional sound pickup signal after the band limitation is performed by the first band limitation unit 431 and the first band after the band limitation is performed by the second band limitation unit 432. 2 directional sound pickup signals are input.
- the first directional sound collection signal after the band restriction is performed by the first band restriction unit 431 is referred to as a “band-limited first directional sound collection signal”.
- the second directional sound collection signal after the band restriction by the second band restriction unit 432 is referred to as a “band-limited second directional sound collection signal”.
- the comparison signal calculation unit 440 generates two types of omnidirectional level signals and directivity level signals from the band-limited first directional sound collection signal and the band-limited second directional sound collection signal. Generate and output a level signal.
- the omnidirectional level signal is a signal indicating the level of a signal obtained by adding the band-limited first directional sound collection signal and the band-limited second directional sound collection signal.
- the directivity level signal includes a first level signal indicating the level of the band-limited first directional sound collection signal and a second level signal indicating the level of the band-limited second directional sound collection signal. Is a signal obtained by adding.
- the comparison signal calculation unit 440 includes a third adder 441, a first level signal calculation unit 442, a second level signal calculation unit 443, a third level signal calculation unit 444, and a fourth level signal calculation unit.
- the adder 445 is included.
- the third adder 441 inputs the band-limited first directional sound collection signal and the band-limited second directional sound collection signal.
- the third adder 441 adds the band-limited first directional sound collection signal and the band-limited second directional sound collection signal.
- the first level signal calculation unit 442 inputs the output signal of the third adder 441. Then, the first level signal calculation unit 442 extracts level information from the output signal of the third adder 441 and converts the output signal of the third adder 441 into an omnidirectional level signal.
- the second level signal calculation unit 443 inputs the first directional sound pickup signal whose band is limited. Then, the second level signal calculation unit 443 extracts level information from the band-limited first directional sound pickup signal, and the band-limited first directional sound pickup signal is converted into the first level signal. Convert to
- the third level signal calculation unit 444 inputs the second directional sound pickup signal whose band is limited. Then, the third level signal calculation unit 444 extracts level information from the band-limited second directional sound pickup signal, and converts the band-limited second directional sound pickup signal into the second level signal. Convert to
- the fourth adder 445 receives the first level signal and the second level signal. Then, the fourth adder 445 adds the first level signal and the second level signal, and outputs a directivity level signal as a result of the addition.
- the first to third level signal calculation units 442 to 444 extract the absolute value or square value of the input signal as level information, respectively, when the input signal is a waveform signal such as the output of a bandpass filter. .
- the first to third level signal calculation units 442 to 444 respectively extract the amplitude spectrum of the input signal or the power spectrum of the input signal as level information when the input signal is a complex spectrum signal such as FFT.
- the first to third level signal calculation units 442 to 444 may extract the amplitude spectrum or power spectrum as level information as it is. Further, when inputting frequency spectrum signals of a plurality of bands, the first to third level signal calculation units 442 to 444 calculate the average value of the amplitude for each frequency bin or the average value of the power spectrum for each frequency bin. What is necessary is just to extract as level information.
- the level comparison unit 451 inputs the omnidirectional level signal and the directional level signal, and acquires the level difference between them.
- the level difference is, for example, a level ratio between the omnidirectional level signal and the directional level signal, or a difference between the omnidirectional level signal and the directional level signal.
- the delay operation unit 452 adjusts the delay amounts of the first delay unit 411 and the second delay unit 412 in the directivity synthesis processing unit 410 so that the level difference becomes small. Specifically, the delay operation unit 452 increases the delay amounts of the first delay device 411 and the second delay device 412 step by step from a sufficiently small value. The delay operation unit 452 fixes the delay amounts of the first delay device 411 and the second delay device 412 with the delay amount when the level difference becomes a predetermined value. Details of the relationship between the delay amount and the first directional sound pickup signal, the level difference, and the predetermined value serving as the reference will be described later.
- FIG. 3 is a diagram showing a simulation result of the frequency amplitude characteristic of the first directional sound pickup signal.
- FIG. 4 is a diagram showing a simulation result of frequency amplitude characteristics of the second directional sound pickup signal.
- the delay amount equivalent to 6 mm is a delay amount corresponding to the distance between acoustic terminals of 6 mm, and is a value smaller than a value corresponding to an actual distance between acoustic terminals (hereinafter referred to as “appropriate value”).
- the 10 mm delay amount is a delay amount corresponding to a distance of 10 mm between the acoustic terminals, and is an appropriate value.
- the delay amount equivalent to 14 mm is a delay amount corresponding to the distance between acoustic terminals of 14 mm, and is a value larger than an appropriate value.
- lines 511 to 514 indicate the frequency amplitude characteristics of the first directional sound pickup signal in order of a delay amount equivalent to 2 mm, a delay amount equivalent to 6 mm, a delay amount equivalent to 10 mm, and a delay amount equivalent to 14 mm, respectively. .
- lines 521 to 524 indicate the frequency amplitude characteristics of the second directional sound pickup signal in the order of 2 mm equivalent delay amount, 6 mm equivalent delay amount, 10 mm equivalent delay amount, and 14 mm equivalent delay amount, respectively. Indicates.
- the first microphone 200 and the second microphone 300 are used in a state where the sensitivity is corrected, but it is difficult to avoid the inclusion of residual sensitivity errors in actual use. Therefore, here, the case where the second sound collection signal includes a sensitivity error of the microphone output of ⁇ 0.087 dB (0.99 times) with respect to the first sound collection signal is shown as an example.
- the output level of the second directional sound pickup signal becomes zero in terms of the amplitude value regardless of the frequency, as shown by a line 523 in FIG. A close value.
- the logarithmic amplitude is ⁇ 40 dB due to the influence of the sensitivity difference between the microphones.
- the output level of the second directional sound pickup signal is high frequency band as shown by lines 521, 522, and 524 in FIG. In almost all cases, the value is high.
- the output level of the first directional sound pickup signal depends on the effect of spatial aliasing in the highest frequency band (7 kHz or higher) among the high frequency bands. Disturbance of characteristics (decrease in output level) occurs. Spatial aliasing involves the distance between microphones and the range of adjustment delay values.
- the sound processing apparatus 400 limits the signal to be subjected to delay processing to a frequency band in which the polar pattern is not disturbed in the first band limiting unit 431 and the second band limiting unit 432.
- the example in which the sound source is arranged in the axial direction shown in FIG. 3 and FIG. 4 corresponds to a condition in which the distance between the acoustic terminals is maximum, that is, a condition in which the frequency restriction condition is strictest. Therefore, it is desirable that the limited frequency bands in the first band limiting unit 431 and the second band limiting unit 432 are set so as to reduce the influence of spatial aliasing that occurs when the sound source is arranged in the axial direction. . In other words, it is desirable that the limited frequency band is set in a range in which the subsequent signal comparison is suitably performed. Therefore, the pass band is set in a frequency region where spatial aliasing does not occur in a frequency region where the output level increases as the frequency increases.
- FIG. 5 is a diagram showing the definition of directions in the following description.
- the direction is defined by defining the direction on the first microphone 200 side as 0 ° (degrees) in the axial direction that is the direction on the straight line connecting the first microphone 200 and the second microphone 300. ).
- the angle is defined clockwise in the normal use state as viewed from above.
- the microphone sensitivity of the first microphone 200 and the microphone sensitivity of the second microphone 300 are equal.
- 6 to 8 are diagrams showing the simulation results of the polar pattern (directivity pattern) of the first directional sound collection signal when the delay amount of the second delay device 412 is changed.
- FIG. 6 shows a polar pattern in the case where the delay amount of the second delay device 412 is a delay amount equivalent to 8 mm.
- FIG. 7 shows a polar pattern when the delay amount of the second delay device 412 is a delay amount equivalent to 10 mm (that is, an appropriate value).
- FIG. 8 shows a polar pattern when the delay amount of the second delay unit 412 is a delay amount equivalent to 12 mm.
- lines 561 to 564 indicate polar patterns of the first directional sound collection signal at 500 Hz (Hertz), 1000 Hz, 4000 Hz, and 12000 Hz, respectively.
- lines 571 to 574 indicate polar patterns of the first directional sound pickup signal at 500 Hz, 1000 Hz, 4000 Hz, and 12000 Hz, respectively.
- lines 581 to 584 indicate polar patterns of the first directional sound pickup signal at 500 Hz, 1000 Hz, 4000 Hz, and 12000 Hz, respectively.
- the polar pattern extends in the 180 ° direction in addition to the main lobe 565 extending in the 0 ° direction.
- side lobe 566 With side lobe 566. That is, the directivity is different from the cardioid characteristics described later.
- the phase of the side lobe 566 is inverted with respect to the phase of the main lobe 565.
- Such a side lobe having a negative phase is hereinafter referred to as a “negative lobe”.
- the polar pattern has only a main lobe without a negative lobe. Further, the value of the main lobe in the 180 ° direction is almost zero in terms of amplitude value ( ⁇ in terms of logarithmic amplitude).
- the polar pattern has only a main lobe without a negative lobe.
- the value of the main lobe in the 180 ° direction is not zero in terms of amplitude value ( ⁇ in terms of logarithmic amplitude).
- delay amount of the first delay unit 411 and the delay amount of the second delay unit 412 are set to the same value, and are simply referred to as “delay amount”.
- FIG. 9 shows a polar pattern when the delay amount of the second delay device 412 is a delay amount equivalent to 8 mm.
- FIG. 10 shows a polar pattern when the delay amount of the second delay device 412 is a delay amount equivalent to 10 mm (that is, an appropriate value).
- FIG. 11 shows a polar pattern when the delay amount of the second delay unit 412 is a delay amount equivalent to 12 mm.
- lines 611 to 614 indicate the polar pattern of the first directional sound collection signal, the polar pattern of the second directional sound collection signal, the polar pattern of the directional level signal, and the omnidirectional level signal, respectively.
- a polar pattern is shown.
- lines 621 to 624 indicate the polar pattern of the first directional sound collection signal, the polar pattern of the second directional sound collection signal, the polar pattern of the directional level signal, and the non-directional level signal. A polar pattern is shown.
- lines 631 to 634 indicate the polar pattern of the first directional sound collection signal, the polar pattern of the second directional sound collection signal, the polar pattern of the directional level signal, and the omnidirectional level signal. A polar pattern is shown.
- the first directional sound collection signal and the second directional sound collection signal have no negative lobe. Therefore, as indicated by lines 623 and 624 in FIG. 10, the polar pattern of the directional level signal and the polar pattern of the omnidirectional level signal match in all directions.
- the first directional sound collection signal and the second directional sound collection signal do not have negative lobes even when the delay amount is larger than the appropriate value. Therefore, as indicated by lines 633 and 634 in FIG. 11, the polar pattern of the directional level signal and the polar pattern of the omnidirectional level signal match in all directions. However, the first directional sound collection signal and the second directional sound collection signal are slightly omnidirectional directional characteristics due to cardioid characteristics.
- a delay amount equal to or greater than the distance between the acoustic terminals is given to the second delay device 412, a negative lobe is not substantially generated. If a smaller delay amount is given to the second delay device 412, sharper directivity is maintained. Conversely, it can be said that a delay amount having a value as small as possible within a range in which a negative lobe does not occur is an appropriate value for the delay amount of the second delay device 412.
- Whether or not a negative lobe has occurred can be determined based on whether or not the omnidirectional level signal matches the directional level signal, as is apparent from FIGS. 9 to 11. it can.
- the acoustic processing device 400 gradually increases the delay amount from a value sufficiently smaller than the value corresponding to the assumed minimum value of the distance between the acoustic terminals in a state where some sound source exists in the axial direction. . Then, the sound processing device 400 fixes the delay amount when the omnidirectional level signal and the directivity level signal match. As a result, the sound processing device 400 can set the delay amount to an appropriate value corresponding to the actual distance between the sound terminals.
- the level comparison unit 451 when using the level ratio between the nondirectional level signal and the directional level signal, sets the level difference cmp_inf to, for example, the following formula (1 ) To calculate.
- sum_abs indicates the value of the directional level signal
- omni_abs indicates the value of the omnidirectional level signal.
- the delay operation unit 452 fixes the delay amount when the level difference cmp_inf becomes zero.
- the level comparison part 451 calculates level difference cmp_inf using the following formula
- the value sum_abs of the directivity level signal and the value omni_abs of the omnidirectional level signal coincide with each other in both the directivity characteristics of the first directivity sound collection signal and the directivity characteristics of the second directivity sound collection signal. It is synonymous with the absence of negative lobes. That is, the fact that the directional level signal value sum_abs and the omnidirectional level signal value omni_abs coincide with each other indicates that for all frequencies ⁇ and all directions (sound incident angles) ⁇ , It is equivalent to satisfying (4).
- a ( ⁇ , ⁇ ) represents the output characteristic of the first directional sound collection signal
- B ( ⁇ , ⁇ ) represents the output characteristic B ( ⁇ ) of the second directional sound collection signal.
- sgn () indicates the sign of the value in parentheses.
- the configuration of the directivity synthesis processing unit 410 includes an omnidirectional level signal corresponding to the left side of Expression (3) and a directivity level signal corresponding to the right side of Expression (3). It has a configuration to generate.
- the first microphone 200 and the second microphone 300 actually have a sensitivity error. For this reason, even if the delay amount is an appropriate value, the omnidirectional level signal and the directional level signal often do not completely match.
- the sensitivity error for example, there is a sensitivity difference between the first microphone 200 and the second microphone 300, or uncorrelated noise existing between the first sound collection signal and the second sound collection signal. Can be mentioned.
- the uncorrelated noise is, for example, circuit noise, wind noise, vibration noise, or the like.
- FIG. 12 is a diagram showing the influence of sensitivity error on the relationship between delay amount and level difference.
- the horizontal axis indicates the delay amount, and the distance between acoustic terminals (electrical distance) [m] corresponding to the delay amount.
- the vertical axis represents the level difference cmp_inf [dB] calculated by the above equation (1).
- the relationship between the delay amount at the frequency of 1 kHz and the level difference when the actual distance between the acoustic terminals is 10 mm (0.01 m) and the sound source is positioned in the direction of 0 ° is shown.
- a line 661 indicates the relationship between the delay amount and the level difference when there is no sensitivity error between the first microphone 200 and the second microphone 300.
- a line 662 indicates the relationship between the delay amount and the level difference when the second microphone 300 has a sensitivity error of ⁇ 0.087 dB with respect to the first microphone 200.
- the level difference decreases as the delay amount increases, and decreases to 0 dB when the delay amount reaches a value corresponding to the distance between the acoustic terminals of 10 mm.
- the level difference is not completely 0 dB even when the delay amount is a value corresponding to the distance between the acoustic terminals of 10 mm.
- the threshold value serving as a criterion for fixing the delay amount is determined in consideration of the sensitivity error.
- the second microphone 300 has an amplitude gain that is a times that of the first microphone 200.
- the output characteristic A ( ⁇ ) of the first directional sound collection signal and the output characteristic B ( ⁇ ) of the second directional sound collection signal are expressed by the following expressions (5) and (6). Can do. Note that ⁇ represents the frequency of the input signal, and ⁇ represents the delay amount [sec] of the first delay device 411 and the second delay device 412.
- the directivity level signal value sum_abs ( ⁇ ) and the omnidirectional level signal value omni_abs ( ⁇ ) can be expressed by the following equations (7) and (8).
- FIG. 13 is a diagram illustrating the relationship between the residual gain error and the level difference.
- the horizontal axis indicates the residual gain error between the first microphone 200 and the second microphone 300 by 20 log 10 (a) [dB] using the amplitude gain a described above.
- the vertical axis indicates the level difference cmp_inf [dB] calculated by the above equation (1).
- a line 671 indicates a level difference cmp_inf at 1 kHz when the above formulas (5) to (8) are substituted into the above formula (1).
- the level difference cmp_inf is 0.2 or less. Therefore, in this case, if the threshold value as a criterion for fixing the delay amount is about 0.2, it is considered that the sensitivity error can be absorbed and the delay amount can be corrected.
- the delay operation unit 452 adjusts the delay amount using a threshold value (threshold value) set based on the above method. More specifically, the delay operation unit 452 increases the delay amount while, for example, the level difference cmp_info is 0.2 or more. Then, the delay operation unit 452 stops increasing the delay amount when the level difference cmp_info becomes 0.2. Thereby, the delay amount is fixed at an appropriate value. Then, the first signal output unit 421 and the second signal output unit 422 output a first directional sound collection signal and a second directional sound collection signal having cardioid directivity characteristics.
- a threshold value threshold value
- dist_term is expressed by, for example, the following equation (9) using the delay amount ⁇ opt [sec] when the delay amount stops increasing.
- c is the speed of sound [m / sec].
- FIG. 14 is a flowchart showing an example of the operation of the sound processing apparatus 400.
- the sound processing device 400 starts the operation illustrated in FIG. 14 when the power switch or the directional sound collection function is turned on. Further, it is assumed that the first microphone 200 and the second microphone 300 are continuously collecting sound while the operation shown in FIG. 14 is being performed.
- step S1000 the directivity synthesis processing unit 410 acquires the first sound collection signal and the second sound collection signal from the first microphone 200 and the second microphone 300.
- step S1010 the directivity synthesis processing unit 410 acquires the first directivity sound collection signal and the second directivity sound collection signal by directivity synthesis processing.
- step S ⁇ b> 1020 the first signal output unit 421 and the second signal output unit 422 output the first directional sound collection signal and the second directional sound collection signal to the outside of the sound processing device 400.
- the first band limiting unit 431 and the second band limiting unit 432 are input to the frequency band of the first directional sound pickup signal input to the comparison signal calculation unit 440 and the comparison signal calculation unit 440.
- the frequency band of the second directional sound pickup signal is limited.
- step S1030 the comparison signal calculation unit 440 calculates the directivity level signal value sum_abs and the omnidirectional level signal value omni_abs.
- step S1040 the level comparison unit 451 calculates a level difference cmp_inf between the directivity level signal value sum_abs and the non-directional level signal value omni_abs.
- step S1050 the delay operation unit 452 determines whether or not the level difference cmp_inf is greater than or equal to a predetermined threshold value thr.
- step S1060 If the level difference cmp_inf is greater than or equal to the predetermined threshold thr (S1050: YES), the delay operation unit 452 proceeds to step S1060. When the level difference cmp_inf is less than the predetermined threshold thr (S1050: NO), the delay operation unit 452 skips step S1060 and proceeds to step S1070 described later.
- step S1060 the delay operation unit 452 increases the delay amount ⁇ opt that the directivity synthesis processing unit 410 uses for the directivity synthesis processing.
- the initial value of the delay amount ⁇ opt is a sufficiently small value. Further, increment of the delay tau opt, the time and the processing load until convergence to the proper value of the delay amount tau opt, and is a value determined in relation to the accuracy required for the directivity pattern.
- step S1070 the directivity synthesis processing unit 410 determines whether or not the end of the directivity synthesis processing is instructed by a user operation or the like. This instruction is, for example, input of a signal indicating that the power switch is turned off or that the directional sound collection function is turned off.
- the sound processing apparatus 400 can repeat the directivity synthesis process. Then, the sound processing device 400 adjusts the delay amount used for the directivity synthesis processing based on the first directional sound collection signal and the second directional sound collection signal so that phase inversion does not occur in these signals. be able to. Finally, the sound processing apparatus 400 performs directivity synthesis processing with the delay amount set to an appropriate value.
- the sound processing apparatus 400 can output a first directional sound pickup signal having a directivity characteristic close to a cardioid and a second directivity sound pickup signal having a directivity characteristic close to a cardioid.
- the sound collection device 100 including the sound processing apparatus 400 according to the present embodiment has a directivity synthesis process so that phase inversion does not occur in a directional sound collection signal having directivity in the axial direction. It is possible to adjust the amount of delay used for.
- the sound collection device 100 can easily set the delay amount used in the directivity synthesis process so that the cardioid directivity is realized as long as some sound source exists in the axial direction.
- the sound collection device 100 performs measurement in an anechoic room or the like by an acoustic design engineer every time the casing in which the microphone is installed changes, and directivity synthesis is performed. There is no need to adjust the amount of processing delay.
- the sound collection device 100 calculates an appropriate value of the delay amount without using a conventional method such as correlation, and thus malfunctions even in an actual environment with reflection and ambient noise. Can be avoided.
- the sound collection device 100 has a sound source direction search followability even in an acoustic change around the microphone or in a situation where a plurality of sound sources exist simultaneously. It won't get worse.
- the sound collecting device 100 according to the present embodiment has a microphone mounting structure and mounting position, and a structure around the microphone, even in the real environment, even in the actual environment, as compared with the related art.
- the amount of delay can be adjusted accurately.
- the sound collection device 100 according to the present embodiment can realize an arbitrary directivity pattern with high accuracy, and can easily obtain a necessary sound with high quality.
- the present invention is suitable for such a sound collecting device 100.
- the method for adjusting the delay amount is not limited to the above example.
- the delay operation unit 452 may continue adjusting the delay amount without fixing the delay amount even after the level difference cmp_inf becomes less than a predetermined threshold. That is, the delay operation unit 452 may readjust the delay amount. Specifically, for example, the delay operation unit 452 holds the minimum value of the level difference cmp_inf, and when the held minimum value is updated within a certain time, the delay amount is monotonously decreased. Also good.
- the delay operation unit 452 may adjust the delay amount by limiting to a predetermined range so that the delay amount does not change greatly due to the influence of uncorrelated components between microphones.
- the sound processing apparatus has a component (hereinafter referred to as “non-correlated component”) having no correlation between the first sound pickup signal and the second sound pickup signal.
- a function is added to prevent delay amount correction when it is detected.
- circuit noise has no correlation between the first sound collection signal and the second sound collection signal, but is always present, so that it is distinguished from an uncorrelated component.
- the vibration source that vibrates the vibration version of the microphone is not a sound wave, such as mechanical vibration during zooming or wind pressure due to wind when shooting outdoors in a digital still camera that can be zoomed during recording. There is a case.
- the wind is turbulent with different characteristics near the microphone. For this reason, the vibration due to the wind appears as an uncorrelated component on the sound pickup signals of the two microphones.
- Such a non-correlated component greatly disturbs the polar pattern that should be obtained by sound waves when the directivity synthesis processing is performed while being included in the first and second collected sound signals. For this reason, when the delay amount adjustment described in the second embodiment is performed even though many uncorrelated components are included, there is a possibility that an incorrect value is set or the value converges to an appropriate value. May take longer.
- the sound processing apparatus is configured not to adjust the delay amount based on the directional sound collection signal when many uncorrelated components are included.
- FIG. 15 is a block diagram illustrating an example of a configuration of a sound collection device including the sound processing device according to the present embodiment, and corresponds to FIG. 2 of the second embodiment.
- the same parts as those in FIG. 2 are denoted by the same reference numerals, and description thereof will be omitted.
- the sound processing device 400a of the sound collection device 100a includes a comparison signal calculation unit 440a and a delay operation unit 452a instead of the comparison signal calculation unit 440 and the delay operation unit 452 illustrated in FIG.
- the acoustic processing device 400a further includes an uncorrelated level signal output unit 461a, an uncorrelated component detection unit 462a, and an OR circuit 463a.
- the comparison signal calculation unit 440a outputs a value obtained by subtracting the omnidirectional level signal from the directional level signal as an uncorrelated level signal indicating the level of the uncorrelated component. More specifically, the comparison signal calculation unit 440a includes a fifth adder 446a in addition to the configuration described in the second embodiment.
- the fifth adder 446a adds the directional level signal and the omnidirectional level signal whose polarity is inverted, and outputs an uncorrelated level signal as a result of the addition.
- the band-limited first directional sound pickup signal from the first band limiter 431 and the band-limited second directional sound pickup signal from the second band limiter 432 are mechanically transmitted to the device.
- the signals include vibration components that are uncorrelated with each other.
- the first directional sound collection signal and the second directional sound collection signal are converted into information having only amplitude without phase information and added to each other, thereby being uncorrelated with a correlated sound wave component.
- a directivity level signal with enhanced both vibration components is obtained.
- the uncorrelated level signal output unit 461a receives the uncorrelated level signal from the comparison signal calculation unit 440a and outputs a determination result signal indicating whether or not an uncorrelated component is included.
- the uncorrelated component detection unit 462a determines whether or not there is an uncorrelated component between the first sound collection signal and the second sound collection signal. More specifically, the uncorrelated component detection unit 462a receives an uncorrelated level signal from the uncorrelated level signal output unit 461a, and includes a large amount of uncorrelated components when the uncorrelated level signal exceeds a predetermined threshold. It is determined that
- the uncorrelated component detection unit 462a sequentially outputs a determination result signal indicating the determination result to the logical sum circuit 463a.
- the determination result signal takes a value of 0 when it is determined that there is no uncorrelated component, and takes a value of 1 when it is determined that many uncorrelated components are included.
- the OR circuit 463a receives the determination result signal output from the uncorrelated component detection unit 462a and the instruction signal input from the outside of the sound processing device 400a.
- the instruction signal is a signal that specifies whether or not to adjust the delay amount.
- the instruction signal takes a value of 0 when it is designated to perform delay amount adjustment, and takes a value of 1 when it is designated not to perform delay amount adjustment.
- the logical sum circuit 463a takes a logical sum of the determination result signal and the instruction signal, and outputs the obtained signal as a control signal. That is, the control signal takes a value of 0 when it is specified to adjust the delay amount and it is determined that there is no uncorrelated component, and takes a value of 1 in other cases.
- the instruction signal is a signal generated by a user operation, for example.
- the instruction signal may be a detection signal of a sensor that detects wind noise. In this case, for example, the instruction signal takes a value of 1 while the wind noise is detected, and takes a value of 0 while the wind noise is not detected.
- the delay operation unit 452a performs the delay amount adjustment described in the second embodiment on the condition that the delay amount adjustment is specified and it is determined that there is no uncorrelated component. That is, the delay operation unit 452a receives the control signal from the OR circuit 463a, and adjusts the delay amount when the control signal is zero. On the other hand, when the input control signal is 1, the delay operation unit 452a does not adjust the delay amount.
- FIG. 16 is a flowchart showing an example of the operation of the sound processing apparatus 400a, and corresponds to FIG. 14 of the second embodiment.
- the same parts as those in FIG. 14 are denoted by the same step numbers, and description thereof will be omitted.
- steps S1000 to S1040 are the same as those in the second embodiment.
- step S1041a the comparison signal calculation unit 440a subtracts the non-directional level signal value omni_abs from the directional level signal value sum_abs. Then, the comparison signal calculation unit 440a outputs the obtained signal as an uncorrelated level signal (uncorr_fact). Note that step S1041a may be performed after step S1030.
- step S1051a If the level difference cmp_inf is greater than or equal to the predetermined threshold value thr (S1050: YES), the delay operation unit 452 proceeds to step S1051a.
- step S1051a the uncorrelated component detection unit 462a compares the uncorrelated level signal value uncorr_fact with a predetermined threshold value thr_uncorr, and outputs a determination result signal in_uncorr_det indicating the comparison result.
- step S1052a the logical sum circuit 463a calculates the logical sum of the determination result signal in_uncorr_det and the instruction signal ext_uncorr_det, and calculates the control signal uncorr_det that is the result of the logical sum.
- step S1053a the delay operation unit 452a determines whether or not the value of the control signal uncorr_det is 1.
- Step S1060 When the value of the control signal uncorr_det is 0 (S1053a: NO), the delay operation unit 452a proceeds to Step S1060.
- the delay operation unit 452a proceeds to Step S1070.
- the sound processing apparatus 400a determines whether or not a lot of uncorrelated components are included in the collected sound signal from the difference between the directivity level signal and the omnidirectional level signal. Can do. Then, the sound processing device 400a can be configured not to adjust the delay amount when many uncorrelated components are included in the collected sound signal.
- the sound processing device 400a can reduce the influence on the delay amount adjustment due to mechanical vibration or noise such as wind pressure, and can easily realize an arbitrary directivity pattern with high accuracy. can do.
- the method of extracting the uncorrelated component is not limited to the above example.
- the acoustic processing apparatus 400a may use the uncorrelated component extraction method described in Patent Document 2.
- the content of the uncorrelated level signal that is the output of the comparison signal calculation unit 440a is synonymous with the content of the expression (2) in the second embodiment. Therefore, the level comparison unit 451 may use an uncorrelated level signal instead of calculating the level difference cmp_inf. Furthermore, the level comparison unit 451 may not be provided, and an uncorrelated level signal may be directly input to the delay operation unit 452a as a level difference.
- the fourth embodiment of the present invention is an example in which an audio signal having an arbitrary directivity pattern is output using the adjusted delay amount.
- FIG. 17 is a block diagram showing an example of the configuration of the sound processing apparatus according to the present embodiment, and corresponds to FIG. 15 of the third embodiment.
- the same parts as those in FIG. 15 are denoted by the same reference numerals, and description thereof will be omitted.
- the sound processing device 400b of the sound collection device 100b has a configuration in which other functional units are added to the configuration shown in FIG.
- the acoustic processing device 400b includes a delay calculation unit 470b, an output directivity synthesis processing unit 410b, a first equalizer (EQ) 481b, a second equalizer (EQ) 482b, a first audio signal output unit 491b, A second audio signal output unit 492b is included.
- the delay calculation unit 470b receives the designation of the directivity direction, and performs directivity synthesis processing in the output directivity synthesis processing unit 410b, which will be described later, based on the distance between the sound terminals corresponding to the delay amount adjusted by the delay operation unit 452a. Control. Specifically, the delay calculation unit 470b calculates the distance between the sound terminals from the delay amount adjusted by the delay operation unit 452a using, for example, the above-described equation (9). Then, the delay calculation unit 470b calculates and outputs an optimum delay amount based on the value of the directivity instruction signal input from the outside of the sound processing device 400b and the calculated distance between the sound terminals.
- the directivity instruction signal is a signal generated by a user operation, for example.
- the instruction signal may be a detection signal of a sensor that detects a direction in which the user's conversation partner is located.
- the output directivity synthesis processing unit 410b has, for example, the same configuration as the directivity synthesis processing unit 410, and includes a first delay unit 411b, a second delay unit 412b, a first adder 413b, and a second adder 413b.
- Adder 414b correspond to the first delay device 411, the second delay device 412, the first adder 413, and the second adder 414 of the second embodiment. That is, the first adder 413b outputs a first output directional sound collection signal, and the second adder 414b outputs a second output directional sound collection signal.
- the output directivity synthesis processing unit 410b uses the delay amount output from the delay calculation unit 470b (hereinafter referred to as “output delay amount”) and outputs the first output directivity sound collection signal and the second output directivity sound collection signal. An output directional sound pickup signal is generated.
- the first equalizer 481b inputs the first output directional sound pickup signal and corrects its frequency characteristic. Then, the first equalizer 481b outputs a first equivalent directional sound pickup signal that is a correction result.
- the second equalizer 482b receives the second output directional sound pickup signal and corrects its frequency characteristic. Then, the second equalizer 482b outputs a second equivalent directional sound pickup signal that is a correction result.
- the frequency characteristic is corrected by using the first output directional sound collection signal and the second output directional sound collection signal as shown in FIG. 3 and FIG. Is a correction to make the frequency characteristic opposite.
- the frequency amplitude characteristic is equivalent to 0 dB.
- the first audio signal output unit 491b receives the first output directional sound pickup signal. And the 1st audio
- the second audio signal output unit 492b inputs the second output directional sound collection signal. And the 2nd audio
- the first audio signal output unit 491b and the second audio signal output unit 492b are arranged, the first signal output unit 421 and the second signal output of the third embodiment.
- the part 422 is unnecessary, it is not limited to this.
- FIG. 18 is a diagram illustrating an example of a relationship between a microphone for obtaining a specified directivity pattern and an incident angle ⁇ .
- the acoustic processing apparatus 400b according to the present embodiment also forms a blind spot in the direction of the angle ⁇ correspondingly.
- the delay calculation unit 470b first calculates the actual distance between acoustic terminals dist_term from the delay amount ⁇ opt output from the delay operation unit 452a using the above-described equation (9). Then, the delay calculation unit 470b calculates the output delay amount ⁇ act from the specified angle ⁇ and the calculated inter-acoustic terminal distance dist_term using, for example, the following equation (10).
- the sound processing apparatus 400b uses the output delay amount ⁇ act calculated from the actual distance between acoustic terminals dist_term in this way, thereby accurately generating a sound having a directivity pattern having a blind spot in the ⁇ direction (and ⁇ direction). A signal can be output.
- FIG. 19 is a flowchart showing an example of the operation of the sound processing apparatus 400b, and corresponds to FIG. 16 of the third embodiment.
- the same parts as those in FIG. 16 are denoted by the same step numbers, and description thereof will be omitted.
- steps S1000 to S1041a is the same as that in the third embodiment.
- step S1042b the output directivity synthesis processing unit 410b outputs the first output directivity sound collection signal and the second output directivity sound collection signal by output directivity synthesis processing. get.
- step S1043b the first equalizer 481b and the second equalizer 482b perform frequency equivalent processing on the first output directional sound collection signal and the second output directional sound collection signal. Then, the first audio signal output unit 491b and the second audio signal output unit 492b have the first output directional sound collection signal and the second output directional sound collection after the frequency equivalent processing is performed. Output a signal.
- timing which performs the process of step S1042b and S1043b is not limited to the said timing.
- step S1050 the delay operation unit 452a determines whether the level difference cmp_inf is greater than or equal to a predetermined threshold value thr and the value of the control signal uncorr_det is 1.
- the delay operation unit 452a proceeds to step S1061b through steps S1051a to 1060.
- step S1061b the delay calculation unit 470b calculates the output delay amount ⁇ act from the directivity instruction signal, sets it in the output directivity synthesis processing unit 410b, and proceeds to step S1070.
- the acoustic processing device 400b generates an arbitrary directivity pattern from the delay amount corresponding to the actual distance between the acoustic terminals calculated in response to the acoustic change around the microphone. Can be realized accurately. As a result, the acoustic processing device 400b can accurately adjust the delay amount in the actual environment even if acoustic changes occur in the microphone mounting structure and mounting position, the structure around the microphone, and the like. As a result, the sound processing device 400b can easily and accurately realize directional sound collection having an arbitrary directional pattern, and can acquire necessary sound with high quality.
- the output directivity synthesis processing forms a blind spot by subtraction, but is not limited to this.
- the output directivity synthesis process may be based on an addition type (Delay_And_Sum). Even in this case, since the actual distance between the acoustic terminals is required, it is possible to obtain desired directivity with high accuracy.
- the delay amount of the first sound pickup signal and the delay amount of the second sound pickup signal are adjusted and set to the same value.
- the acoustic path may be significantly different due to the difference in the surrounding environment where the two microphones are installed.
- the delay amount of the first sound pickup signal and the delay amount of the second sound pickup signal may be adjusted and set to different values.
- the present invention is not limited to this.
- the delay amount correction according to the present invention is performed for each pair of two microphones, and when there are three or more microphones, they may be performed for each pair. Therefore, the present invention can also be applied to the case where directivity synthesis processing is performed on the collected sound signals output from a plurality of three or more microphones.
- the target of sound output to the user may be the first directional sound collection signal and the second directional sound collection signal output from the directivity synthesis processing unit 410.
- the low frequency level is insufficient in the frequency characteristics compared to the high frequency level (see FIGS. 3 and 4).
- the equivalents of the first equalizer 481b and the second equalizer 482b are added, and correction is performed to amplify the low band or attenuate the high band. desirable.
- Embodiment 5 of the present invention is an example of a specific mode when the present invention is applied to a sound collecting device in a remote conference system or the like that includes four microphones.
- the sound collection device performs delay-and-sum addition (Delay And Sum) on the sound pickup signals of the four microphones, and performs directional sound collection for the speaker in the designated direction.
- Delay And Sum delay-and-sum addition
- FIG. 20 is a block diagram showing an example of a processing configuration in the microphone array according to the present embodiment, and corresponds to FIG. 2 of the second embodiment.
- the same parts as those in FIG. 2 are denoted by the same reference numerals, and description thereof will be omitted.
- a hyphen and a serial number are added.
- the sound collection device 100c includes a third microphone 301 and a fourth microphone 302 in addition to the extended sound processing device 400c, the first microphone 200, and the second microphone 300 shown in FIG.
- the first microphone 200, the second microphone 300, the third microphone 301, and the fourth microphone 302 are arranged at different positions at a distance from each other. Here, for the sake of simplicity, it is assumed that the microphones are aligned.
- the first microphone 200, the second microphone 300, the third microphone 301, the fourth microphone 302, and the extended sound processing device 400c are, for example, inside the housing (not shown) of the sound collection device 100c. Is arranged.
- the third microphone 301 is an omnidirectional microphone (third sound collector).
- the third microphone 301 collects sound and outputs a sound collection signal.
- the sound collection signal output by the third microphone 301 is referred to as a “third sound collection signal”.
- the fourth microphone 302 is an omnidirectional microphone (fourth sound collector).
- the fourth microphone 302 collects sound and outputs a sound collection signal.
- the sound collection signal output by the fourth microphone 302 is referred to as a “fourth sound collection signal”.
- the extended sound processing device 400c inputs the first sound collection signal, the second sound collection signal, the third sound collection signal, and the fourth sound collection signal. Then, the extended sound processing device 400c performs directional sound collection in the direction indicated by the directivity instruction signal that is an external signal of the extended sound processing device 400c.
- the extended sound processing device 400c includes first to third sound processing devices (400-1, 400-2, 400-3), a delay calculation unit 470c, and an output device.
- a directivity synthesis unit 410c and an audio signal output unit 491c are included.
- the first sound processing device 400-1 inputs the first sound collection signal and the second sound collection signal. Then, the first acoustic processing device 400-1 has an amount of delay corresponding to the distance between the acoustic terminals between the first microphone 200 and the second microphone 300 (hereinafter referred to as “first delay amount”). Then, the first sound processing device 400-1 outputs the calculated first delay amount to the delay calculation unit 470c.
- the second sound processing device 400-2 inputs the second sound collection signal and the third sound collection signal. Then, the second acoustic processing device 400-2 has a delay amount corresponding to the distance between the acoustic terminals between the second microphone 300 and the third microphone 301 (hereinafter referred to as “second acoustic terminal distance”). (Hereinafter referred to as “second delay amount”). Then, the second sound processing device 400-2 outputs the calculated second delay amount to the delay calculation unit 470c.
- the third sound processing device 400-3 inputs the third sound collection signal and the fourth sound collection signal. Then, the third acoustic processing device 400-3 has an amount of delay corresponding to the distance between the acoustic terminals between the third microphone 301 and the fourth microphone 302 (hereinafter referred to as “third delay amount”). Then, the third sound processing device 400-3 outputs the calculated third delay amount to the delay calculation unit 470c.
- the delay calculation unit 470c multiplies each of the first to third delay amounts output from the first to third acoustic processing devices 400-1 to 400-3 by the speed of sound to obtain first to third acoustic terminals. Calculate the distance.
- the delay calculation unit 470c uses the first to third output directivity synthesis units 410c based on the sound collection direction angle ⁇ specified by the directivity instruction signal and the calculated first to third distances between the sound terminals.
- the delay amounts of the four delay units 411c to 414c are calculated. Then, the delay calculation unit 470c outputs the first output delay amount to the first delay unit 411c, and outputs the second output delay amount to the second delay unit 412c.
- the delay calculation unit 470c outputs the third output delay amount to the third delay unit 413c, and outputs the fourth output delay amount to the fourth delay unit 414c.
- the directivity instruction signal is a signal generated by a user operation, for example, and is a signal indicating an operation angle when directivity synthesis is performed.
- the operation angle is, for example, an angle between the front direction of the sound processing apparatus of the conference system and the direction with respect to the position of the speaker.
- the directivity direction of the sound collection designated by the directivity instruction signal may be automatically calculated.
- the direction specified by the directivity instruction signal may be a speaker direction that is automatically specified based on a detection signal of a sensor that detects the speaker direction.
- the audio signal output unit 491c receives the output directivity synthesis signal output from the output directivity synthesis unit 410, and outputs the output directivity synthesis signal to the outside of the extended sound processing apparatus 400c as a target of sound output to the user. More specifically, the sound is output as sound input by the sound collection device 100c (here, the conference system main body (not shown)).
- the output directivity synthesis unit 410c includes a first delay unit 411c, a second delay unit 412c, a third delay unit 413c, a fourth delay unit 414c, and an adder 415c.
- the first delay unit 411c performs a delay operation on the first sound collection signal output from the first microphone 200 based on the first output delay amount output from the delay calculation unit 470c. Then, the first delay unit 411c outputs a first delayed sound collection signal obtained by delaying the first sound collection signal by the first output delay amount to the adder 415c.
- the second delay unit 412c performs a delay operation on the second sound collection signal output from the second microphone 300 based on the second output delay amount output from the delay calculation unit 470c. Then, the second delay device 412c outputs the second delayed sound pickup signal obtained by delaying the second sound pickup signal by the second output delay amount to the adder 415c.
- the third delay unit 413c performs a delay operation on the third sound collection signal output from the third microphone 301 based on the third output delay amount output from the delay calculation unit 470c. Then, the third delay device 413c outputs a third delayed sound pickup signal obtained by delaying the third sound pickup signal by the third output delay amount to the adder 415c.
- the fourth delay unit 414c performs a delay operation on the fourth sound collection signal output from the fourth microphone 302 based on the fourth output delay amount output from the delay calculation unit 470c. Then, the fourth delay unit 414c outputs a fourth delayed sound collection signal obtained by delaying the fourth sound collection signal by the fourth output delay amount to the adder 415c.
- the adder 415c adds the first delayed sound pickup signal, the second delay sound pickup signal, the third delay sound pickup signal, and the fourth delay sound pickup signal to generate an output directivity composite signal,
- the audio signal is output to the audio signal output unit 491c.
- FIG. 21 is a diagram illustrating an example of a relationship between a microphone for obtaining a designated directivity pattern and a designated direction angle ⁇ .
- a directivity pattern having a directivity angle in the direction of the designated angle ⁇ is formed by the directivity instruction signal with the positional relationship as shown in FIG. Note that when the pointing angle is set in the direction of angle ⁇ , extended sound processing apparatus 400c according to the present embodiment forms a pointing angle in the direction of angle ⁇ 180 + ⁇ correspondingly.
- ⁇ opt [i] indicates the i-th delay amount described above.
- the delay calculation unit 470c calculates the i-th output delay amount ⁇ act [i], for example, It calculates using the following formula
- the delay calculation unit 470c calculates the fourth output delay amount ⁇ act [4] using, for example, the following equation (13).
- the delay calculation unit 470c sets the i-th output delay amount ⁇ act [i] as follows: For example, it calculates using the following formula
- the delay calculation unit 470c calculates the fourth output delay amount ⁇ act [1] using, for example, the following equation (15).
- the extended sound processing device 400c thus calculates the actual distance between the sound terminals for each pair of microphones, and gives the output delay amount for each delay device. Thereby, the extended sound processing apparatus 400c can output a sound signal having a directivity pattern having a directivity angle in the ⁇ direction (and in the ⁇ 180 + ⁇ direction) accurately.
- FIG. 22 is a flowchart showing an example of the operation of the extended sound processing apparatus 400c, and corresponds to FIG. 14 of the second embodiment.
- the same parts as those in FIG. 14 are denoted by the same step numbers, and description thereof will be omitted.
- the extended sound processing apparatus 400c performs the same processing as FIG. 14 in a loop three times. Therefore, in this embodiment, for convenience, “i” used in the above description is used as an index of the number of loops.
- step S1001c the delay calculation unit 470c initializes the index i to 1.
- step S1002c the directivity synthesis processing unit 410-i (not shown) of the i-th acoustic processing device 400-i performs directivity synthesis processing.
- the directivity synthesis processing unit 410- (i + 1) (not shown) of the (i + 1) th acoustic processing device 400- (i + 1) performs directivity synthesis processing.
- the extended sound processing apparatus 400c acquires the i-th directional sound collection signal and the (i + 1) -th directional sound collection signal.
- steps S1010 to S1040 is the same as that in the second embodiment, and is executed for each index i.
- step S1061c the delay operation unit 452-i (not shown) of the i-th acoustic processing device 400-i determines whether or not the level difference cmp_inf is greater than or equal to a predetermined threshold value thr.
- step S1062c If the level difference cmp_inf is greater than or equal to the predetermined threshold value thr (S1061c: YES), the delay operation unit 452 proceeds to step S1062c. If the level difference cmp_inf is less than the predetermined threshold thr (S1061c: NO), the delay operation unit 452 skips step S1062c and proceeds to step S1063c described later.
- step S1062c for each index i, the delay operation unit 452-i (not shown) of the i-th acoustic processing device 400-i is used by the directivity synthesis processing unit 410-i (not shown). Increase the delay amount ⁇ opt [i].
- the initial value of the i-th delay amount ⁇ opt [i] is a sufficiently small value. Further, increment of the delay tau opt [i] of the i-th time and the processing load until convergence to delay tau opt proper value of [i] of the i, as well as the accuracy required for the directivity pattern This value is determined based on the relationship.
- step S1063c the delay calculation unit 470c increments the loop count index i by one in order to perform processing for the next microphone pair.
- step S1064c the delay calculation unit 470c checks whether the index i has exceeded a predetermined number, that is, whether the loop has been rotated a predetermined number of times. In the present embodiment, since there are four microphones and three adjacent microphone pairs exist, the upper limit value of the index i is 3. Therefore, the delay calculation unit 470c determines whether the index i is greater than 3.
- the delay calculation unit 470c When the index i is 3 or less (S1064c: NO), the delay calculation unit 470c returns to step S1002c. If the index i is greater than 3 (S1064c: YES), the delay calculation unit 470c proceeds to step S1064c.
- step S1065c the delay calculation unit 470c transmits the directivity instruction signal indicating the directivity angle designated from the outside, the first delay amount ⁇ opt [1], the second ⁇ opt [2], and the third ⁇ opt.
- the output delay amount is calculated using [3]. That is, the delay calculation unit 470c includes the first to fourth output delay amounts ⁇ act [1] and ⁇ act [2] ⁇ act [, which are the delay amounts used by the first to fourth delay units 411c to 414c. 3] Calculate ⁇ act [4].
- the directivity synthesis processing unit 410c performs output directivity synthesis processing to obtain an output directivity synthesis signal, and the process proceeds to step S1070.
- the extended sound processing apparatus 400c has an arbitrary directivity from the delay amount corresponding to the actual distance between the acoustic terminals calculated each time in response to the acoustic change around the actual microphone. Sex patterns can be realized accurately.
- the acoustic processing device 400b can accurately adjust the delay amount in the actual environment even if an acoustic change occurs in the microphone mounting structure and mounting position, the structure around the microphone, and the like. That is, the acoustic processing device 400b can easily and accurately realize directional sound collection having an arbitrary directional pattern even in a real environment, and can acquire necessary sound with high quality. it can.
- the output directivity synthesis processing forms the directivity angle by addition, but is not limited to this.
- the output directivity synthesis processing may be based on a sound pressure gradient type (Sound Pressure Gradient) by subtraction processing. Even in this case, since the actual distance between the acoustic terminals is required, it is possible to obtain desired directivity with high accuracy.
- the microphone array is linear, but the present invention is not limited to this. If the distance between the acoustic terminals of the pair related to the directivity synthesis is obtained in a square shape, accurate directivity sound collection is possible as well.
- the number of microphones is four, it is not limited to this as long as two or more microphones can be paired.
- an arbitrary directivity pattern can be obtained by accurately adjusting a delay amount in a real environment even if an acoustic change occurs in a microphone mounting structure, a mounting position, and a structure around the microphone. It can be realized with high accuracy. That is, the present invention is useful as a sound processing apparatus and a sound processing method that can more easily acquire a required sound with high quality.
- the present invention is suitable for a digital still camera having a video shooting function, a digital video camera, a sound collector, a sound collecting device in a remote conference system, or various stereo recording devices.
Landscapes
- Health & Medical Sciences (AREA)
- Otolaryngology (AREA)
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- General Health & Medical Sciences (AREA)
- Circuit For Audible Band Transducer (AREA)
- Obtaining Desirable Characteristics In Audible-Bandwidth Transducers (AREA)
Abstract
Description
本発明の実施の形態1は、本発明の基本的態様の一例である。
本発明の実施の形態2は、本発明を、2個のマイクロホンを備えたデジタルカメラなどの収音機器に適用した場合の具体的態様の一例である。
まず、本実施の形態に係る音響処理装置を含む収音機器の構成について説明する。
次に、第1の帯域制限部431および第2の帯域制限部432における制限周波数帯域の詳細について説明する。かかる帯域制限は、上述の通り、エイリアジング現象の遅延量調整への影響を低減するために行われるものである。
次に、遅延量と第1の指向性収音信号(および第2の指向性収音信号)との関係について説明する。
次に、レベル差異およびその基準となる所定の値について説明する。
次に、音響処理装置400の動作について説明する。
本発明の実施の形態3は、実施の形態2の音響処理装置に、第1の収音信号と第2の収音信号との間で相関の無い成分(以下「無相関成分」という)が検出された場合に、遅延量補正を行わないようにする機能を追加したものである。なお、回路ノイズは、第1の収音信号と第2の収音信号との間で相関がないが、常に存在することから、無相関成分とは区別される。
まず、無相関成分の発生原因と、無相関成分が遅延量の調整に与える影響について説明する。
図15は、本実施の形態に係る音響処理装置を含む収音機器の構成の一例を示すブロック図であり、実施の形態2の図2に対応するものである。図2と同一部分には、同一符号を付し、これについての説明を省略する。
図16は、音響処理装置400aの動作の一例を示すフローチャートであり、実施の形態2の図14に対応するものである。図14と同一部分には同一ステップ番号を付し、これについての説明を省略する。
本発明の実施の形態4は、調整された遅延量を用いて、任意の指向性パターンの音声信号を出力するようにした例である。
図17は、本実施の形態に係る音響処理装置の構成の一例を示すブロック図であり、実施の形態3の図15に対応するものである。図15と同一部分には同一符号を付し、これについての説明を省略する。
ここで、任意の指向性パターンを得るための出力用遅延量の演算手法について説明する。
図19は、音響処理装置400bの動作の一例を示すフローチャートであり、実施の形態3の図16に対応するものである。図16と同一部分には同一ステップ番号を付し、これについての説明を省略する。
本発明の実施の形態5は、本発明を、4個のマイクロホンを備えた、遠隔会議システムなどにおける収音機器に適用した場合の、具体的様態の一例である。
まず、本実施の形態に係る音響処理装置を含む収音機器の構成について説明する。
ここで、指向性合成部410cにおいて、任意の方向に対して指向性合成処理を行うための、第1~第4の出力用遅延量の算出方法について説明する。
図22は、拡張音響処理装置400cの動作の一例を示すフローチャートであり、実施の形態2の図14に対応する。図14と同一部分には、同一ステップ番号を付し、これについての説明を省略する。
200 第1のマイクロホン
300 第2のマイクロホン
301 第3のマイクロホン
302 第4のマイクロホン
400、400a、400b 音響処理装置
400-1 第1の音響処理装置
400-2 第2の音響処理装置
400-3 第3の音響処理装置
400c 拡張音響処理装置
410 指向性合成処理部
410b、410c 出力用指向性合成処理部
411、411b、411c 第1の遅延器
412、412b、412c 第2の遅延器
413c 第3の遅延器
414c 第4の遅延器
413、413b 第1の加算器
414、414b 第2の加算器
415c 加算器
421 第1の信号出力部
422 第2の信号出力部
431 第1の帯域制限部
432 第2の帯域制限部
440、440a 比較信号算出部
441 第3の加算器
442 第1のレベル信号算出部
443 第2のレベル信号算出部
444 第3のレベル信号算出部
445 第4の加算器
446a 第5の加算器
451 レベル比較部
452、452a 遅延操作部
461a 無相関レベル信号出力部
462a 無相関成分検出部
463a 論理和回路
470b、470c 遅延算出部
481b 第1の等価器
482b 第2の等価器
491b 第1の音声信号出力部
491c 音声信号出力部
492b 第2の音声信号出力部
Claims (10)
- 第1の収音器から出力される第1の収音信号および第2の収音器から出力される第2の収音信号に対して、指向性合成処理を行う音響処理装置であって、
前記第1の収音信号に対して前記第2の収音信号を遅延させて合成した第1の指向性収音信号を生成し、前記第2の収音信号に対して前記第1の収音信号を遅延させて合成した第2の指向性収音信号を生成する指向性合成処理部と、
前記第1の指向性収音信号と前記第2の指向性収音信号とを加算して得られる信号のレベルを示す無指向性レベル信号と、前記第1の指向性収音信号のレベルを示す第1のレベル信号と前記第2の指向性収音信号のレベルを示す第2のレベル信号とを加算して得られる指向性レベル信号と、を生成する比較信号算出部と、
前記無指向性レベル信号と前記指向性レベル信号とのレベル差異を取得するレベル比較部と、
前記レベル差異が小さくなるように、前記指向性合成処理部における前記遅延の量を調整する遅延操作部と、を有する、
音響処理装置。 - 前記比較信号算出部は、
前記第1の指向性収音信号と前記第2の指向性収音信号とを加算する第3の加算器と、
前記第3の加算器の出力信号からレベル情報を抽出して前記無指向性レベル信号に変換する第1のレベル信号算出部と、
前記第1の指向性収音信号からレベル情報を抽出して前記第1のレベル信号に変換する第2のレベル信号算出部と、
前記第2の指向性収音信号からレベル情報を抽出して前記第2のレベル信号に変換する第3のレベル信号算出部と、
前記第1のレベル信号と前記第2のレベル信号とを加算して前記指向性レベル信号を出力する第4の加算器と、を有する、
請求項1記載の音響処理装置。 - 前記比較信号算出部に入力される前記第1の指向性収音信号に対して、前記遅延の量を変化させても空間エイリアジングが生じない周波数帯域への帯域制限を行う第1の帯域制限部と、
前記比較信号算出部に入力される前記第2の指向性収音信号に対して、前記遅延の量を変化させても空間エイリアジングが生じない周波数帯域への帯域制限を行う第2の帯域制限部と、を更に有する、
請求項1記載の音響処理装置。 - 前記遅延操作部は、
前記遅延の量を十分に小さい値から段階的に増大させていき、前記レベル差異が所定の値となったときに前記遅延の量を固定する、
請求項1記載の音響処理装置。 - 前記遅延操作部は、
前記レベル差異の最小値をホールドし、ホールドした最小値の更新が一定時間内に行われた場合には、遅延量を単調減少させる、
請求項4記載の音響処理装置。 - 前記遅延操作部は、
予め定められた範囲に制限して、前記遅延量の調整を行う、
請求項1記載の音響処理装置。 - 前記第1の収音信号と前記第2の収音信号との間に無相関成分が多く含まれているか否かを判定する無相関成分検出部、を更に有し、
前記遅延操作部は、
前記無相関成分が多く含まれていると判定されたとき、前記第1の指向性収音信号に基づいて前記遅延の量を調整しない、
請求項1記載の音響処理装置。 - 前記比較信号算出部は、
前記指向性レベル信号から前記無指向性レベル信号を減算して得られる値を、無相関レベル信号として出力し、
前記無相関レベル信号成分が所定の閾値を超えているとき、前記無相関成分が多く含まれていると判定する、
請求項7記載の音響処理装置。 - 指向方向の指定を受け付け、前記遅延操作部により調整された前記遅延の量に相当する音響端子間距離に基づいて、前記指向性合成処理を制御する遅延算出部、を更に有する、
請求項1記載の音響処理装置。 - 第1の収音器から出力される第1の収音信号および第2の収音器から出力される第2の収音信号に対して、指向性合成処理を行う音響処理装置における音響処理方法であって、
前記第1の収音信号に対して前記第2の収音信号を遅延させて合成した第1の指向性収音信号を生成し、前記第2の収音信号に対して前記第1の収音信号を遅延させて合成した第2の指向性収音信号を生成する指向性合成処理部から、前記第1の指向性収音信号および前記第2の指向性収音信号を取得するステップと、
前記第1の指向性収音信号と前記第2の指向性収音信号とを加算して得られる信号のレベルを示す無指向性レベル信号を生成するステップと、
前記第1の指向性収音信号のレベルを示す第1のレベル信号と前記第2の指向性収音信号のレベルを示す第2のレベル信号とを加算して得られる指向性レベル信号を生成するステップと、
前記無指向性レベル信号と前記指向性レベル信号とのレベル差異を取得するステップと、
前記レベル差異が小さくなるように、前記指向性合成処理部における前記遅延の量を調整するステップと、を有する、
音響処理方法。
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2013550081A JP6041244B2 (ja) | 2011-12-20 | 2012-10-24 | 音響処理装置および音響処理方法 |
US14/366,702 US9326065B2 (en) | 2011-12-20 | 2012-10-24 | Sound processing device, and sound processing method |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2011-278242 | 2011-12-20 | ||
JP2011278242 | 2011-12-20 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2013094103A1 true WO2013094103A1 (ja) | 2013-06-27 |
Family
ID=48668023
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2012/006814 WO2013094102A1 (ja) | 2011-12-20 | 2012-10-24 | 音響処理装置および音響処理方法 |
PCT/JP2012/006816 WO2013094103A1 (ja) | 2011-12-20 | 2012-10-24 | 音響処理装置および音響処理方法 |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2012/006814 WO2013094102A1 (ja) | 2011-12-20 | 2012-10-24 | 音響処理装置および音響処理方法 |
Country Status (3)
Country | Link |
---|---|
US (2) | US9326065B2 (ja) |
JP (2) | JP6041244B2 (ja) |
WO (2) | WO2013094102A1 (ja) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2013094102A1 (ja) * | 2011-12-20 | 2013-06-27 | パナソニック株式会社 | 音響処理装置および音響処理方法 |
US9800981B2 (en) * | 2014-09-05 | 2017-10-24 | Bernafon Ag | Hearing device comprising a directional system |
US11209985B2 (en) * | 2019-04-23 | 2021-12-28 | Macronix International Co., Ltd. | Input/output delay optimization method, electronic system and memory device using the same |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2003153372A (ja) * | 2001-11-14 | 2003-05-23 | Matsushita Electric Ind Co Ltd | マイクロホン装置 |
JP2004187283A (ja) * | 2002-11-18 | 2004-07-02 | Matsushita Electric Ind Co Ltd | マイクロホン装置および再生装置 |
JP2005136628A (ja) * | 2003-10-29 | 2005-05-26 | Sharp Corp | ステレオマイクロホン装置 |
JP2007005849A (ja) * | 2005-06-21 | 2007-01-11 | Sony Corp | 記録装置、記録方法、再生装置、再生方法、記録方法のプログラム及び記録方法のプログラムを記録した記録媒体 |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3132529B2 (ja) | 1992-09-07 | 2001-02-05 | ソニー株式会社 | 音声入力装置 |
US7577262B2 (en) | 2002-11-18 | 2009-08-18 | Panasonic Corporation | Microphone device and audio player |
JP4422662B2 (ja) | 2005-09-09 | 2010-02-24 | 日本電信電話株式会社 | 音源位置・受音位置推定方法、その装置、そのプログラム、およびその記録媒体 |
US20070255816A1 (en) | 2006-05-01 | 2007-11-01 | Schuyler Quackenbush | System and method for processing data signals |
US9473850B2 (en) * | 2007-07-19 | 2016-10-18 | Alon Konchitsky | Voice signals improvements in compressed wireless communications systems |
US8340316B2 (en) * | 2007-08-22 | 2012-12-25 | Panasonic Corporation | Directional microphone device |
CN101569209B (zh) | 2007-10-04 | 2013-08-21 | 松下电器产业株式会社 | 噪声抽取装置和方法、麦克风装置、集成电路以及摄像机 |
WO2013094102A1 (ja) * | 2011-12-20 | 2013-06-27 | パナソニック株式会社 | 音響処理装置および音響処理方法 |
-
2012
- 2012-10-24 WO PCT/JP2012/006814 patent/WO2013094102A1/ja active Application Filing
- 2012-10-24 JP JP2013550081A patent/JP6041244B2/ja not_active Expired - Fee Related
- 2012-10-24 WO PCT/JP2012/006816 patent/WO2013094103A1/ja active Application Filing
- 2012-10-24 US US14/366,702 patent/US9326065B2/en active Active
- 2012-10-24 US US14/364,508 patent/US9319788B2/en active Active
- 2012-10-24 JP JP2013550080A patent/JP6025068B2/ja not_active Expired - Fee Related
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2003153372A (ja) * | 2001-11-14 | 2003-05-23 | Matsushita Electric Ind Co Ltd | マイクロホン装置 |
JP2004187283A (ja) * | 2002-11-18 | 2004-07-02 | Matsushita Electric Ind Co Ltd | マイクロホン装置および再生装置 |
JP2005136628A (ja) * | 2003-10-29 | 2005-05-26 | Sharp Corp | ステレオマイクロホン装置 |
JP2007005849A (ja) * | 2005-06-21 | 2007-01-11 | Sony Corp | 記録装置、記録方法、再生装置、再生方法、記録方法のプログラム及び記録方法のプログラムを記録した記録媒体 |
Also Published As
Publication number | Publication date |
---|---|
JP6025068B2 (ja) | 2016-11-16 |
US20150124997A1 (en) | 2015-05-07 |
JPWO2013094102A1 (ja) | 2015-04-27 |
JP6041244B2 (ja) | 2016-12-07 |
US9326065B2 (en) | 2016-04-26 |
JPWO2013094103A1 (ja) | 2015-04-27 |
WO2013094102A1 (ja) | 2013-06-27 |
US9319788B2 (en) | 2016-04-19 |
US20140321665A1 (en) | 2014-10-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8098844B2 (en) | Dual-microphone spatial noise suppression | |
US11245976B2 (en) | Earphone signal processing method and system, and earphone | |
KR101415026B1 (ko) | 마이크로폰 어레이를 이용한 다채널 사운드 획득 방법 및장치 | |
US8675890B2 (en) | Speaker localization | |
CA2407855C (en) | Interference suppression techniques | |
EP2633699B1 (en) | Systems, methods, apparatus, and computer-readable media for orientation-sensitive recording control | |
US20140003635A1 (en) | Audio signal processing device calibration | |
CN110140359B (zh) | 使用波束形成的音频捕获 | |
EP2882170A1 (en) | Audio information processing method and apparatus | |
RU2759715C2 (ru) | Звукозапись с использованием формирования диаграммы направленности | |
EP3384684A2 (en) | Conference system with a microphone array system and a method of speech acquisition in a conference system | |
EP3704873B1 (en) | Method of operating a hearing aid system and a hearing aid system | |
JP2013543987A (ja) | 遠距離場マルチ音源追跡および分離のためのシステム、方法、装置およびコンピュータ可読媒体 | |
JP2004289762A (ja) | 音声信号処理方法と装置及びプログラム | |
US9781509B2 (en) | Signal processing apparatus and signal processing method | |
EP2868117A1 (en) | Systems and methods for surround sound echo reduction | |
KR20140089580A (ko) | 근거리 널 및 빔 형성 | |
WO2007123052A1 (ja) | 適応アレイ制御装置、方法、プログラム、及び適応アレイ処理装置、方法、プログラム | |
CN112492445B (zh) | 利用罩耳式耳机实现信号均衡的方法及处理器 | |
WO2007059255A1 (en) | Dual-microphone spatial noise suppression | |
JP6041244B2 (ja) | 音響処理装置および音響処理方法 | |
CN108735228B (zh) | 语音波束形成方法及系统 | |
CN115932733A (zh) | 声源定位及语音增强方法、装置 | |
WO2022047606A1 (en) | Method and system for authentication and compensation | |
TWI700004B (zh) | 減少干擾音影響之方法及聲音播放裝置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 12860643 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2013550081 Country of ref document: JP Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: 14366702 Country of ref document: US |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 12860643 Country of ref document: EP Kind code of ref document: A1 |