CN111757239A - Audio processing method and audio processing system - Google Patents

Audio processing method and audio processing system Download PDF

Info

Publication number
CN111757239A
CN111757239A CN201910244713.5A CN201910244713A CN111757239A CN 111757239 A CN111757239 A CN 111757239A CN 201910244713 A CN201910244713 A CN 201910244713A CN 111757239 A CN111757239 A CN 111757239A
Authority
CN
China
Prior art keywords
signal
left channel
channel
right channel
panning
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910244713.5A
Other languages
Chinese (zh)
Other versions
CN111757239B (en
Inventor
虞登翔
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Realtek Semiconductor Corp
Original Assignee
Realtek Semiconductor Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Realtek Semiconductor Corp filed Critical Realtek Semiconductor Corp
Priority to CN201910244713.5A priority Critical patent/CN111757239B/en
Publication of CN111757239A publication Critical patent/CN111757239A/en
Application granted granted Critical
Publication of CN111757239B publication Critical patent/CN111757239B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S1/00Two-channel systems

Abstract

The invention provides an audio processing method and an audio processing system. The audio processing system comprises a classification module, a conversion module, a translation module, a broadening module and an inverse conversion module. In the audio processing method, a sound signal is first provided. Next, a plurality of categories are provided. Then, the sound signal is subjected to a classification step according to the categories. Then, a conversion step is performed on the sound signal to convert the sound signal into a frequency domain. Then, the amplitude signal of the sound signal is subjected to a panning step and an addition step to obtain an added amplitude signal. Next, the phase signal of the sound signal is subjected to a separation step and an addition step to obtain an addition phase signal. The summed amplitude signal and the summed phase signal are then subjected to an inverse conversion step to obtain an optimized sound signal corresponding to the time domain.

Description

Audio processing method and audio processing system
Technical Field
The present invention relates to an audio processing method and an audio processing system, and more particularly, to an audio processing method and an audio processing system for widening sound effect and making sound effect stereoscopic.
Background
When a person hears a sound signal generated from a sound source, the sound signal usually reaches the left ear and the right ear of the person at two different times, and has different volume levels. The human brain interprets these differences in time and volume levels to generate an auditory scene (auditory scene). Stereo (stereo) is a method of generating an auditory scene by providing an audio signal to a plurality of speakers through a plurality of independent audio channels, the speakers being arranged in a symmetrical manner such that the speakers can generate the auditory scene. In general, stereo is implemented by two channels.
Disclosure of Invention
An aspect of the present invention is to provide an audio processing method and an audio processing system to optimize an auditory scene of a stereo.
According to some embodiments of the present invention, in the audio processing method, an input sound signal is provided first. Next, a plurality of categories are provided. The categories correspond to a plurality of processing parameter sets one to one, and each processing parameter set comprises a translation angle curve, a separation curve and a weight parameter. Then, the sound signals are classified according to the categories to obtain an input sound category corresponding to the input sound signal, and a translation angle curve, a separation curve and a weight parameter corresponding to the input sound category, wherein the input sound category is one of the categories. Then, a conversion step is performed on the input sound signal to convert the input sound signal into a frequency domain, and an amplitude signal and a phase signal corresponding to the input sound signal are obtained. Then, according to the input sound type of the input sound signal, the translation angle curve and the weight parameter corresponding to the input sound type, the translation step is carried out on the amplitude signal corresponding to the input sound signal, so as to obtain a weighted translation amplitude signal of the input sound signal. The weighted shifted amplitude signals are then summed to obtain a summed amplitude signal. Then, according to the input sound type of the input sound signal and the separation curve and the weight parameter corresponding to the input sound type, a separation step is carried out on the phase signal corresponding to the input sound signal so as to obtain a weighted separation phase signal of the input sound signal. When the number of weighted shifted amplitude signals and the number of weighted separated phase signals are one, the weighted shifted amplitude signals and the weighted separated phase signals are subjected to an inverse conversion step to obtain optimized sound signals corresponding to the time domain.
According to an embodiment of the present invention, in the panning step, a panning curve is first calculated according to the panning angle curve corresponding to the input sound type. Then, the translation curve corresponding to the input sound type is multiplied by the weight parameter corresponding to the sub-sound type to obtain the weighted translation curve corresponding to the input sound signal. Then, the amplitude signal corresponding to the input sound signal is multiplied by the corresponding weighted translation curve to obtain the weighted translation amplitude signal.
According to an embodiment of the present invention, in the separating step, the phase signal corresponding to the input audio signal is added to the corresponding separating curve to obtain a separated phase signal corresponding to the input audio signal. The split phase signals are then multiplied by corresponding weight parameters to obtain the weighted split phase signals described above.
According to an embodiment of the present invention, when the number of weighted shifted amplitude signals and the number of weighted separated phase signals are greater than one, the weighted shifted amplitude signals are summed to obtain a summed amplitude signal, and the weighted separated phase signals are summed to obtain a summed phase signal; and performing an inverse conversion step on the summed amplitude signal and the summed phase signal to obtain an optimized audio signal corresponding to the time domain.
According to an embodiment of the present invention, the transforming step is Fourier Transform (Fourier Transform), and the Inverse transforming step is Inverse Fourier Transform (Inverse Fourier Transform).
According to some embodiments of the present invention, in the audio processing method, an input sound signal is first provided, wherein the input sound signal includes a left channel input signal and a right channel input signal. Next, a plurality of categories are provided. The classes are corresponding to a plurality of processing parameter sets one-to-one, each processing parameter set includes a panning angle curve, a first separation curve, a second separation curve and a weight parameter, wherein the first separation curve corresponds to a left channel, and the second separation curve corresponds to a right channel. Then, a first classification step is performed on the left channel input signals according to the classes to obtain a left channel sound class corresponding to the left channel input signals, and a left channel panning angle curve, a left channel separation curve and a left channel weight parameter corresponding to the left channel input signals are obtained according to the left channel sound class. Then, a second classification step is performed on the right channel input signal according to the above-mentioned classification to obtain a right channel sound classification corresponding to the right channel input signal, and a right channel panning angle curve, a right channel separation curve and a right channel weight parameter are obtained according to the right channel sound classification corresponding to the right channel input signal. The left channel sound category is one of the above categories, and the right channel sound category is one of the above categories. Then, a left channel audio adjusting step is performed. In the step of adjusting the audio of the left channel, a first conversion step is first performed to convert the input signal of the left channel to the frequency domain, and obtain a left channel amplitude signal and a left channel phase signal corresponding to the input signal of the left channel. Then, according to the left channel panning angle curve and the left channel weighting parameter corresponding to the left channel input signal, a first panning step is performed on the left channel amplitude signal corresponding to the left channel input signal to obtain a left channel weighted panning amplitude signal of the left channel input signal. Then, according to the left channel separation curve and the left channel weight parameter corresponding to the left channel input signal, a first separation step is performed on the left channel phase signal corresponding to the left channel input signal to obtain a left channel weighted separation phase signal of the left channel input signal. Then, when the number of the left channel weighted panning amplitude signals and the number of the left channel weighted separating phase signals are one, a first inverse conversion step is performed on the left channel weighted panning amplitude signals and the left channel weighted separating phase signals to obtain optimized left channel sound signals corresponding to a time domain. Then, a right channel audio adjustment step is performed. In the step of adjusting the right channel audio, a second conversion step is first performed to convert the right channel input signal to a frequency domain, and obtain a right channel amplitude signal and a right channel phase signal corresponding to the right channel input signal. Then, according to the right channel panning angle curve and the right channel weighting parameter corresponding to the right channel input signal, a second panning step is performed on the right channel amplitude signal corresponding to the right channel input signal to obtain a right channel weighted panning amplitude signal of the right channel input signal. Then, according to the right channel separation curve and the right channel weight parameter corresponding to the right channel input signal, a second separation step is performed on the right channel phase signal corresponding to the right channel input signal to obtain a right channel weighted separation phase signal of the right channel input signal. Then, when the number of the right channel weighted panning amplitude signals and the number of the right channel weighted separating phase signals are one, a second inverse conversion step is performed on the right channel weighted panning amplitude signals and the right channel weighted separating phase signals to obtain an optimized right channel sound signal corresponding to the time domain.
According to an embodiment of the present invention, in the first panning step, a left channel panning curve is first calculated according to the left channel panning angle curve. Then, the left channel panning curve is multiplied by the left channel weighting parameter to obtain a left channel weighted panning curve corresponding to the left channel input signal. The left channel amplitude signal is then multiplied by a corresponding left channel weighted panning curve to obtain the left channel weighted panning amplitude signal.
According to an embodiment of the present invention, in the first separating step, a left channel phase signal corresponding to the left channel input signal is added to the corresponding left channel separation curve to obtain a left channel separated phase signal corresponding to the left channel input signal. The left channel separated phase signal is then multiplied by a corresponding left channel weight parameter to obtain a left channel weighted separated phase signal.
According to an embodiment of the present invention, in the second panning step, a right channel panning curve is first calculated according to the right channel panning angle curve. Then, the right channel panning curve is multiplied by the right channel weighting parameter to obtain a right channel weighting panning curve corresponding to the right channel input signal. The right channel amplitude signal is then multiplied by a corresponding right channel weighted panning curve to obtain the right channel weighted panning amplitude signal described above.
According to an embodiment of the present invention, when the number of the right channel sound types is one, in the second separating step, the right channel phase signal corresponding to the right channel input signal and the corresponding right channel separation curve are added to obtain a right channel separation phase signal corresponding to the right channel input signal. Then, the right channel separation phase signal is multiplied by the corresponding right channel weight parameter to obtain the right channel weighted separation phase signal.
According to an embodiment of the present invention, when the number of left channel weighted panning amplitude signals and the number of left channel weighted separating phase signals are greater than one, summing the left channel weighted panning amplitude signals to obtain a left channel summed amplitude signal, and summing the left channel weighted separating phase signals to obtain a left channel summed phase signal; and performing a first inverse conversion step on the left channel sum amplitude signal and the left channel sum phase signal to obtain an optimized left channel sound signal corresponding to the time domain.
According to an embodiment of the present invention, when the number of right channel weighted panning amplitude signals and the number of right channel weighted separating phase signals are greater than one, the right channel weighted panning amplitude signals are summed to obtain a right channel summed amplitude signal, and the right channel weighted separating phase signals are summed to obtain a right channel summed phase signal; and performing a second inverse conversion step on the right channel sum amplitude signal and the right channel sum phase signal to obtain an optimized right channel sound signal corresponding to the time domain.
According to an embodiment of the present invention, the first transforming step and the second transforming step are fourier transforms, and the first inverse transforming step and the second inverse transforming step are inverse fourier transforms.
According to some embodiments of the present invention, the audio processing system includes a classification module, a conversion module, a left channel panning module, a right channel panning module, a left channel widening module, a right channel widening module, and an inverse conversion module. The classification module is used for storing a plurality of processing parameter groups. The processing parameter sets correspond to a plurality of categories one to one, and each processing parameter set comprises a translation angle curve, a first separation curve corresponding to the left channel, a second separation curve corresponding to the right channel and a weight parameter. The classification module is further configured to perform a first classification step and a second classification step on the left channel input signal and the right channel input signal according to the categories to obtain a left channel sound category, a left channel panning angle curve, a left channel separation curve and a left channel weight parameter corresponding to the left channel input signal, and obtain a right channel sound category, a right channel panning curve, a right channel separation curve and a right channel weight parameter corresponding to the right channel input signal, where the left channel sound category is one of the categories, and the right channel sound category is one of the categories. The conversion module is used for performing a conversion step on the left channel input signal and the right channel input signal to convert the left channel input signal and the right channel input signal to a frequency domain, obtain a left channel amplitude signal and a left channel phase signal corresponding to the left channel input signal, and obtain a right channel amplitude signal and a right channel phase signal corresponding to the right channel input signal. The left channel translation module is used for performing a first translation step on a left channel amplitude signal corresponding to the left channel input signal according to a left channel translation angle curve and a left channel weight parameter corresponding to the left channel input signal so as to obtain a left channel weighted translation amplitude signal of the left channel input signal. The right channel translation module is used for performing a second translation step on a right channel amplitude signal corresponding to the right channel input signal according to a right channel translation angle curve and a right channel weight parameter corresponding to the right channel input signal so as to obtain a right channel weighted translation amplitude signal of the right channel input signal. The left channel widening module is used for carrying out a first separation step on a left channel phase signal corresponding to the left channel input signal according to a left channel separation curve corresponding to the left channel input signal and a left channel weight parameter so as to obtain a left channel weighted separation phase signal of the left channel input signal. The right channel widening module is used for carrying out a second separation step on a right channel phase signal corresponding to the right channel input signal according to a right channel separation curve corresponding to the right channel input signal and a right channel weight parameter so as to obtain a right channel weighted separation phase signal of the right channel input signal. The inverse conversion module is used for carrying out a first inverse conversion step on the left channel weighted translation amplitude signal and the left channel weighted separation phase signal when the number of the left channel weighted translation amplitude signals and the number of the left channel weighted separation phase signals are one, so as to obtain an optimized left channel sound signal corresponding to a time domain. The inverse conversion module is also configured to perform a second inverse conversion step on the right channel weighted panning amplitude signal and the right channel weighted separating phase signal when the number of the right channel weighted panning amplitude signals and the number of the right channel weighted separating phase signals are one, so as to obtain an optimized right channel sound signal corresponding to the time domain.
According to an embodiment of the present invention, in the first panning step, when the number of the left channel sound categories is one, the left channel panning module is further configured to calculate a left channel panning curve according to the left channel panning angle curve; multiplying the left channel translation curve by the left channel weight parameter to obtain a left channel weighted translation curve corresponding to the left channel input signal; and multiplying the left channel amplitude signal by a corresponding left channel weighted panning curve to obtain the left channel weighted panning amplitude signal.
According to an embodiment of the present invention, in the first separating step, when the number of the left channel sound types is one, the left channel widening module is further configured to add the left channel phase signal and the left channel separation curve to obtain a left channel separation phase signal corresponding to the left channel input signal; and multiplying the left channel separated phase signal by the left channel weight parameter to obtain the left channel weighted separated phase signal.
According to an embodiment of the present invention, in the second panning step, when the number of the right channel sound categories is one, the right channel panning module is further configured to calculate a right channel panning curve according to the right channel panning angle curve; multiplying the right channel translation curve by the right channel weight parameter to obtain a right channel weighted translation curve corresponding to the right channel input signal; and multiplying the right channel amplitude signal by a corresponding right channel weighted panning curve to obtain the right channel weighted panning amplitude signal.
According to an embodiment of the present invention, in the second separating step, when the number of the right channel sound types is one, the right channel widening module is further configured to add the right channel phase signal and the right channel separation curve to obtain a right channel separation phase signal corresponding to the right channel input signal; and multiplying the right channel separated phase signal by the corresponding right channel weight parameter to obtain the right channel weighted separated phase signal.
According to an embodiment of the present invention, the inverse conversion module is further configured to sum the left channel weighted panning amplitude signals to obtain left channel summed amplitude signals and sum the left channel weighted separating phase signals to obtain left channel summed phase signals when the number of the left channel weighted panning amplitude signals and the number of the left channel weighted separating phase signals are greater than one; a first inverse conversion step is performed on the left channel summed amplitude signal and the left channel summed phase signal to obtain an optimized left channel sound signal corresponding to the time domain.
According to an embodiment of the present invention, the inverse conversion module is further configured to sum the right channel weighted panning amplitude signals to obtain right channel summed amplitude signals and sum the right channel weighted separating phase signals to obtain right channel summed phase signals, when the number of the right channel weighted panning amplitude signals and the number of the right channel weighted separating phase signals are greater than one; a second inverse conversion step is performed on the right channel summed amplitude signal and the right channel summed phase signal to obtain an optimized right channel sound signal corresponding to the time domain.
Drawings
The foregoing and other objects, features, and advantages of the invention will be apparent from the following more particular description of the embodiments of the invention, as illustrated in the accompanying drawings in which:
FIG. 1 shows a functional block schematic of an audio processing system according to an embodiment of the invention;
FIG. 2a shows a translation curve corresponding to a class according to an embodiment of the invention;
FIG. 2b shows a translation curve corresponding to a class according to an embodiment of the invention;
FIG. 2c shows a left channel separation curve and a right channel separation curve according to an embodiment of the invention;
FIG. 3 shows a schematic flow diagram of an audio processing method according to an embodiment of the invention;
FIG. 4 is a flow chart illustrating the left channel adjustment step according to an embodiment of the present invention; and
fig. 5 is a flowchart illustrating a right channel adjusting step according to an embodiment of the present invention.
Detailed Description
As used herein, the terms first, second, …, etc. do not denote any order or sequence, but rather are used to distinguish one element or operation from another element or operation described in the same technical language.
Referring to fig. 1, a functional block diagram of an audio processing system 100 according to an embodiment of the invention is shown. The audio processing system 100 is used for externally inputting sound signals to optimize the sound performance. The sound signal includes a left channel signal and a right channel signal. In embodiments of the present invention, the sound signal may be composed of a plurality of different sound signals. For convenience of explanation, the input sound signals of the following embodiments include two different sound signals of speech and music, but the embodiments of the present invention are not limited thereto.
The audio processing system 100 includes a classification module 110, a conversion module 120, a left channel panning module 130, a right channel panning module 140, a left channel widening module 150, a right channel widening module 160, and an inverse conversion module 170. The classification module 110 is used to perform a classification step on the left channel signal and the right channel signal. In an embodiment of the present invention, the classification module 110 stores a plurality of processing parameter sets and a plurality of class labels C1-CnWherein the processing parameter sets are one-to-one corresponding to the class labels C1-CnAnd each class label represents a class of sound signal, such as speech or music. In an embodiment of the present invention, the classification module 110 is implemented by Machine Learning (ML) technology, but embodiments of the present invention are not limited thereto.
Each processing parameter set includes a panning angle (panning angle) curve, a separation curve corresponding to the left channel, a separation curve corresponding to the right channel, and a weight parameter. Referring to fig. 2a and 2b together, fig. 2a shows a panning angle curve PC1 for a music category, and fig. 2b shows a panning angle curve PC2 for a lecture category. In fig. 2a and 2b, the panning angle curves PC1 and PC2 are representative of the relation of time to panning angle (panningangle), which is representative of the angle of the sound signal in the left-right direction to indicate the directivity of the sound signal. In the present embodiment, the panning angle curve PC1 represents panning angle curves corresponding to music categories, wherein the panning angle curve PC1 can be expressed by the following formula:
θ1=0.01x sin70t (1)
where θ 1 represents the translation angle and t represents time. The pan angle curve PC2 is a curve representing pan angles corresponding to lecture categories, where the pan angle curve PC2 can be expressed as follows:
θ2=0.1x sin50t (2)
where θ 2 represents the translation angle. In this embodiment, the units of θ 1 and θ 2 are rad.
As can be seen from the above equations (1) and (2), the panning angle curves PC1 corresponding to the music category and the panning angle curves PC2 corresponding to the lecture category are sinusoidal functions in the present embodiment, but the embodiment of the present invention is not limited thereto.
Referring to fig. 2c, a separation curve SC1 of the left channel and a separation curve SC2 of the right channel corresponding to the lecture category are shown. As shown in fig. 2c, the separation curve SC1 for the left channel and the separation curve SC2 for the right channel are relationships between angles representing the separation phase angles and the spectral frequencies S, where the separation phase angles are phase angle differences between the phase angles corresponding to different frequencies in the sound signal. In the present embodiment, the separation curve SC1 for the left channel and the separation curve SC2 for the right channel correspond to the lecture category. The separation curve SC1 for the left channel may be represented by:
Figure BDA0002010743160000101
wherein
Figure BDA0002010743160000102
Representing the split phase angle of the left channel,
Figure BDA0002010743160000103
representing the maximum phase angle of separation, f1And f2The default frequency value can be adjusted according to the user's requirement. The separation curve SC2 for the right channel may be represented by:
Figure BDA0002010743160000104
wherein
Figure BDA0002010743160000105
Representing the split phase angle of the right channel. In one embodiment of the present invention, the first and second electrodes are,
Figure BDA0002010743160000106
f1=700,f20.5, embodiments of the invention are not limited thereto.
As can be seen from the above equations (3) and (4), in the present embodiment, the separation curve SC1 for the left channel and the separation curve SC2 for the right channel are inverted from each other, but the embodiments of the present invention are not limited thereto. In addition, in the present embodiment, the separation curve of the left channel and the separation curve of the right channel corresponding to the music genre are constant functions, and the constants thereof are 0.
Thus, the classification module 110 stores the class label C1-CnCurve Sh of translation angle1-ShnLeft channel separation curve LSe1-LSenSeparation curve RSe for right channel1-RSenAnd a weight parameter W1-WnWhere the curve of translation angle Sh1Left channel separation curve LSe1Separation curve RSe for right channel1And a weight parameter W1Form a processing parameter set and corresponding to the class label C1(ii) a Curve Sh of translation angle2Left channel separation curve LSe2Separation curve RSe for right channel2And a weight parameter W2Form a processing parameter set and corresponding to the class label C2(ii) a Curve Sh of translation anglenLeft channel separation curve LSenSeparation curve RSe for right channelnAnd a weight parameter WnForm a processing parameter set and corresponding to the class label Cn
When the classification module 110 performs the classification step on the left channel input signal and the right channel input signal, the classification module 110 will classify the left channel input signal and the right channel input signal according to the class label C1-CnTo classify the left channel input signal and the right channel input signal. For example, the left channel input signal is classified to correspond to a speech category and a sound category. In other words, the left channel input signal contains audio components of the speech class as well as audio components of the sound class. For another example, the right channel input signal is classified to correspond to a speech category and a sound category. In other words, the right channel input signal contains an audio component of the speech class as well as an audio component of the sound class.
In an embodiment of the invention, the classification module 110 classifies audio features of the left channel input signal and the right channel input signal, and provides different weight values for different classes. These weighting values are the weighting parameters W1-Wn
Thus, after the classification module 110 performs the classification step on the left channel input signal, at least one class (hereinafter referred to as a left channel sound class) corresponding to the left channel input signal, a panning angle curve (hereinafter referred to as a left channel panning angle curve) corresponding to the left channel sound class, a separation curve (hereinafter referred to as a left channel separation curve), and a weight parameter (hereinafter referred to as a left channel weight parameter) corresponding to the left channel sound class can be obtained. Similarly, when the classification module 110 performs the classification step on the right channel input signal, at least one class (hereinafter referred to as a right channel sound class) corresponding to the right channel input signal, a panning angle curve (hereinafter referred to as a right channel panning angle curve) corresponding to the right channel sound class, a separation curve (hereinafter referred to as a right channel separation curve), and a weight parameter (hereinafter referred to as a right channel weight parameter) can be obtained.
For example, the left channel input signal of the present embodiment corresponds to the lecture class label C1And music category label C2. Through speech category label C1The left channel input signal isCorresponding to the left channel translation angle curve Sh1Left channel separation curve LSe1And left channel weight parameter W1. By music category label C2The left channel input signal is corresponding to the left channel panning angle curve Sh2Left channel separation curve LSe2And left channel weight parameter W2. For another example, the right channel input signal of the present embodiment corresponds to the lecture class label C1And music category label C2. Through speech category label C1The right channel input signal is corresponding to the right channel translation angle curve Sh1Right channel separation curve RSe1And a right channel weight parameter W1. By music category label C2The right channel input signal is corresponding to the right channel translation angle curve Sh2Right channel separation curve RSe2And a right channel weight parameter W2
The converting module 120 is configured to perform a converting step on the left channel input signal and the right channel input signal, so as to convert the left channel input signal and the right channel input signal into a frequency domain, obtain a left channel amplitude signal and a left channel phase signal corresponding to the left channel input signal, and obtain a right channel amplitude signal and a right channel phase signal corresponding to the right channel input signal. For example, the left channel input signal is converted into a left channel amplitude signal LSA and a left channel phase signal LSP. For another example, the right channel input signal is converted into a right channel amplitude signal RSA and a right channel phase signal RSP. In the present embodiment, the converting module 120 uses Fourier Transform (Fourier Transform) to convert the left channel input signal and the right channel input signal into the frequency domain, but the embodiments of the present invention are not limited thereto.
The left channel panning module 130 is used to perform a first panning step on the left channel amplitude signal LSA to adjust the directivity of the left channel input signal according to the category of the left channel input signal. In an embodiment of the present invention, after the classification step of the classification module 110, the left channel input signal is a left channel panning angle curve and a left channel weight parameter corresponding to at least one class label. In the first translation step, the left channel is flatThe shift module 130 calculates a left channel panning angle curve corresponding to the left channel input signal according to the left channel panning angle curve. Left track panning curve PL(θ) may be represented by the following formula:
Figure BDA0002010743160000121
where θ is the aforementioned translation angle, e.g., θ 1 or θ 2.
Then, a left channel panning curve corresponding to the left channel input signal is multiplied by the corresponding left channel weighting parameter to obtain a left channel weighted panning curve. Then, the left channel panning module 130 multiplies the left channel amplitude signal LSA by the corresponding left channel weighted panning curve to obtain a left channel weighted panning amplitude signal. After the first panning step, the left channel panning module 130 further performs a first summing step to sum all the left channel weighted panning amplitude signals to obtain a left channel summed amplitude signal.
For example, the left channel input signal corresponds to the lecture class label C1Then the left channel panning module 130 first follows the left channel panning angle curve Sh1To calculate the left channel panning curve PL(Sh1) Then, the left channel panning curve and the left channel weighting parameter W are used1Multiplying to obtain a left channel weighted panning curve (W)1*PL(Sh1)). Then, the left channel amplitude signal LSA is multiplied by the left channel weighted panning curve to obtain a left channel weighted panning amplitude signal (LSA W)1*PL(Sh1)). As another example, the left channel input signal also corresponds to the music genre label C2Then the left channel panning module 130 first follows the left channel panning angle curve Sh2To calculate the left channel panning curve PL(Sh2) Then, the left channel panning curve and the left channel weighting parameter W are used2Multiplying to obtain a left channel weighted panning curve (W)2*PL(Sh2)). Then, the left channel amplitude signal LSA is multiplied by the left channel weighted panning curve to obtain another left channel weighted panning amplitude signal (LSA W)2*PL(Sh2)). The left channel panning module 130 then sums the left channel weighted panning amplitude signals to obtain a left channel summed amplitude signal (LSA W)1*PL(Sh1)+LSA*W2*PL(Sh2))。
In other embodiments of the present invention, the left channel panning module 130 may first multiply the left channel panning curve by the left channel amplitude signal LSA, and then multiply the product by the left channel weighting parameter. In addition, if the left channel input signal only corresponds to one category, it means that the left channel panning module 130 only generates one left channel weighted panning amplitude signal. Thus, the left channel panning module 130 omits the summation step.
The right channel panning module 140 functions similarly to the left channel panning module 130. The right channel panning module 140 is used for performing a second panning step on a right channel amplitude signal RSA corresponding to the right channel input signal, so as to adjust the directionality of the right channel input signal according to the type of the right channel input signal. In an embodiment of the present invention, after the classifying step of the classifying module 110, the right channel input signal is a right channel panning angle curve and a right channel weighting parameter corresponding to at least one class label. In the second panning step, the right channel panning module 140 calculates a right channel panning curve according to the right channel panning angle curve. Translation curve P of right trackR(θ) may be represented by the following formula:
Figure BDA0002010743160000141
where θ is the aforementioned translation angle, e.g., θ 1 or θ 2.
Then, a right channel panning curve corresponding to the right channel input signal is multiplied by the corresponding right channel weighting parameter to obtain a corresponding right channel weighting panning curve. Next, the right channel panning module 140 multiplies the right channel amplitude signal RSA corresponding to the right channel input signal by the corresponding right channel weighted panning curve to obtain a right channel weighted panning amplitude signal. After the second panning step, the right channel panning module 140 further performs a second summing step to sum all of the right channel weighted panned amplitude signals to obtain a right channel summed amplitude signal.
For example, the right channel input signal corresponds to the lecture class label C1Then the right channel panning module 140 first pans the angle curve Sh according to the right channel1To calculate the translation curve P of the right channelR(Sh1) Then, the right channel panning curve and the right channel weighting parameter W are used1Multiplying to obtain a right channel weighted panning curve (W)1*PR(Sh1)). Then, the right channel amplitude signal RSA is multiplied by the right channel weighted panning curve to obtain the right channel weighted panning amplitude signal (RSA W)1*PR(Sh1)). As another example, the right channel input signal also corresponds to music class C2Then the right channel panning module 140 first pans the angle curve Sh according to the right channel2To calculate the translation curve P of the right channelR(Sh2) Then, the right channel panning curve and the right channel weighting parameter W are used2Multiplying to obtain a right channel weighted panning curve (W)2*PR(Sh2)). Then, the right channel amplitude signal RSA is multiplied by the right channel weighted panning curve to obtain the right channel weighted panning amplitude signal (RSA W)2*PR(Sh2)). The right channel panning module 140 then sums the right channel weighted panning amplitude signals to obtain a right channel summed amplitude signal (RSA W)1*PR(Sh1)+RSA*W2*PR(Sh2))。
In other embodiments of the present invention, the right channel panning module 140 may first multiply the right channel panning curve by the right channel amplitude signal RSA, and then multiply the product by the right channel weight parameter. In addition, if the right channel input signal only corresponds to one category, it means that the right channel panning module 140 only generates one right channel weighted panning amplitude signal. Thus, the right channel panning module 140 omits the summation step.
The left channel widening module 150 is configured to perform a first separation step on a left channel phase signal corresponding to the left channel input signal, so as to adjust the widening degree of the left channel input signal according to the type of the left channel input signal. In an embodiment of the present invention, the left channel input signal is a left channel separation curve and a left channel weight parameter corresponding to at least one class label. In the first separation step, the left channel widening module 150 first adds the left channel phase signal LSP corresponding to the left channel input signal to the left channel separation curve to obtain a left channel separation phase signal corresponding to the left channel input signal. Then, the left channel widening module 150 multiplies the left channel separation phase signal corresponding to the left channel input signal by the corresponding left channel weight parameter to obtain a left channel weighted separation phase signal. After the first separation step, the left channel widening module 150 further performs a third summation step to sum all the left channel weighted separation phase signals to obtain a left channel summed phase signal.
For example, the left channel input signal corresponds to the lecture class label C1Then the left channel widening module 150 separates the left channel phase signal LSP and the left channel separation curve LSe1Adding to obtain a left channel split phase signal (LSP + LSe)1). Then, the left channel separation phase signal is multiplied by a left channel weight parameter to obtain a left channel weighted separation phase signal ((LSP + LSe)1)*W1). As another example, the left channel input signal also corresponds to the music genre label C2Then the left channel widening module 150 separates the left channel phase signal LSP and the left channel separation curve LSe2Adding to obtain a left channel split phase signal (LSP + LSe)2). Then, the left channel separation phase signal is multiplied by a left channel weight parameter to obtain a left channel weighted separation phase signal ((LSP + LSe)2)*W2). The left channel widening module 150 then sums the left channel weighted split phase signals described above to obtain a left channel summed phase signal ((LSP + LSe)1)*W1+(LSP+LSe2)*W2)。
In addition, if the left channel input signal only corresponds to one category, it means that the left channel widening module 150 only generates one left channel weighted separation phase signal. Thus, the left channel widening module 150 omits the summing step.
The right channel widening module 160 is similar to the left channel widening module 150. The right channel widening module 160 is configured to perform a second separation step on the right channel phase signal corresponding to the right channel input signal, so as to correspondingly adjust the widening degree of the right channel input signal according to the category of the right channel input signal. In an embodiment of the invention, the right channel input signal is a right channel separation curve and a right-left channel weight parameter corresponding to at least one class label. In the second separation step, the right channel widening module 160 first adds the right channel phase signal RSP corresponding to the right channel input signal to the right channel separation curve to obtain a right channel separation phase signal corresponding to the right channel input signal. Then, the right channel widening module 160 multiplies the right channel separation phase signal corresponding to the right channel input signal by the corresponding right channel weight parameter to obtain a right channel weighted separation phase signal. After the second separation step, the right channel widening module 160 further performs a fourth summation step to sum all the right channel weighted separation phase signals to obtain a right channel summed phase signal.
For example, the right channel input signal corresponds to the lecture class label C1Then the right channel widening module 160 separates the right channel phase signal RSP and the right channel separation curve RSe1Adding to obtain a right channel split phase signal (RSP + RSe)1). Then, the right channel separated phase signal is multiplied by a right channel weight parameter to obtain a right channel weighted separated phase signal ((RSP + RSe)1)*W1). As another example, the right channel input signal corresponds to a music genre label C2Then the right channel widening module 160 separates the right channel phase signal RSP and the right channel separation curve RSe2Adding to obtain a right channel split phase signal (RSP + RSe)2). Then, the right channel separated phase signal is multiplied by a right channel weight parameter to obtain a right channel weighted separated phase signal ((RSP + RSe)2)*W2). Then, the right channel widening module 160 separates the right channel weightingThe phase signals are summed to obtain a right channel summed phase signal ((RSP + RSe)1)*W1+(RSP+RSe2)*W2)。
In addition, if the right channel input signal only corresponds to one category, it means that the right channel broadening module 160 only generates one right channel weighted separation phase signal. Thus, the right channel widening module 160 does not perform the summation step.
The inverse transform module 170 is used for performing an inverse transform step on the left channel sum amplitude signal, the left channel sum phase signal, the right channel sum amplitude signal, and the right channel sum phase signal to obtain an optimized left channel sound signal and an optimized right channel sound signal corresponding to the time domain. For example, the inverse conversion module 170 performs an inverse conversion step on the left channel sum amplitude signal and the left channel sum phase signal to obtain an optimized left channel sound signal. For another example, the inverse transform module 170 performs an inverse transform step on the right channel sum amplitude signal and the right channel sum phase signal to obtain an optimized right channel sound signal. In the present embodiment, the Inverse transformation step is Inverse fourier transform (Inverse fourier transform), but the embodiment of the present invention is not limited thereto.
In one embodiment of the present invention, when the left channel input signal corresponds to only one category, it means that there is only one left channel weighted panning amplitude signal and one left channel weighted separating phase signal in this embodiment. Thus, the inverse transform module 170 performs the inverse transform on the left channel weighted panning amplitude signal and the left channel weighted separating phase signal. Similarly, in another embodiment of the present invention, when the right channel input signal corresponds to only one class, it means that there is only one right channel weighted panning amplitude signal and one right channel weighted separating phase signal in this embodiment. Thus, the inverse transform module 170 performs the inverse transform on the right channel weighted panning amplitude signal and the right channel weighted separating phase signal.
In another embodiment of the present invention, the audio output module 180 is used to output the optimized left channel sound signal and the optimized right channel sound signal. In the embodiment, the audio output module 180 is a sound card (sound card), but the implementation of the invention is not limited thereto.
As can be seen from the above embodiments, the audio processing system 100 classifies the input sound signals to process different classes of sub-sound signals with different processing parameter sets to optimize the sound effect of the input sound signals. Since the processing parameter set includes the panning curve, the separation curve and the weight parameter, the audio processing system 100 can make the stereo sound effect and the wide effect of the input sound signal more obvious, and can make the switching of the left and right channels smoother.
Referring to fig. 3, a flow chart of an audio processing method 300 corresponding to the audio processing system 100 according to the embodiment of the invention is shown. In the audio processing method 300, a step 310 is first performed to provide an input sound signal. Next, step 320 is performed to provide a plurality of categories (i.e., category labels) and sets of processing parameters. In the embodiment of the present invention, the categories and the processing parameter sets are preset in the classification module 110. Then, step 330 is performed to classify the input sound signal according to the category. In an embodiment of the present invention, step 330 is performed using classification module 110. Next, a left channel adjustment step 340 and a right channel adjustment step 350 are performed, respectively, to obtain an optimized left channel sound signal and an optimized right channel sound signal. Then, step 360 is performed to output the optimized left channel sound signal and the optimized right channel sound signal.
Referring to fig. 4, a flowchart of the left channel adjusting step 340 according to the embodiment of the invention is shown. In the left channel adjusting step 340, step 341 is performed first to perform the aforementioned converting step by the converting module 120 to convert the left channel input signal into the frequency domain. Then, steps 342-. In step 342, a first panning step is performed on the left channel amplitude signals to obtain a plurality of left channel weighted panned amplitude signals. Next, in step 343, the left channel weighted pan amplitude signals are summed to obtain a left channel summed amplitude signal. In the embodiment of the present invention, the step 342-343 is performed by using the left channel panning module 130. In step 344, a first separation step is performed on the left channel phase signals to obtain a plurality of left channel weighted separated phase signals. Next, in step 345, the left channel weighted split phase signals are summed to obtain a left channel summed phase signal. In the embodiment of the present invention, the step 344-345 is performed by using the left channel widening module 150. After step 342-. In an embodiment of the present invention, step 346 is performed using inverse transform module 170.
In addition, when the left channel input signal corresponds to only one category, the number of the aforementioned left channel weighted panning amplitude signals and left channel weighted separating phase signals may be only one. Thus, the steps 343 and 345 can be omitted and the step 346 can perform an inverse transform on the left channel weighted panning amplitude signal and the left channel weighted separating phase signal.
Referring to fig. 5, a flowchart of the right channel adjusting step 350 according to the embodiment of the invention is shown. In the right channel adjusting step 350, step 351 is first performed to perform the aforementioned converting step by the converting module 120, so as to convert the right channel input signal into the frequency domain. Then, steps 352-. In step 352, a second panning step is performed on the right channel amplitude signals to obtain a plurality of right channel weighted panned amplitude signals. Next, in step 353, the right channel weighted pan amplitude signals are summed to obtain a right channel summed amplitude signal. In the embodiment of the present invention, the steps 352-353 are performed by using the right channel panning module 140. In step 354, a second separation step is performed on the right channel phase signals to obtain a plurality of right channel weighted separated phase signals. Next, in step 355, the right channel weighted split phase signals are summed to obtain a right channel summed phase signal. In an embodiment of the present invention, steps 354 and 355 are performed by the right channel widening module 160. After step 352 and 355, a step 356 follows to perform an inverse conversion step on the left channel sum amplitude signal and the left channel sum phase signal to obtain an optimized right channel sound signal corresponding to the time domain. In an embodiment of the present invention, step 356 is performed using the inverse transform module 170.
In addition, when the right channel input signal corresponds to only one category, the number of the aforementioned right channel weighted panning amplitude signals and right channel weighted separating phase signals may be only one. Thus, the steps 353 and 355 can be omitted, and the step 356 can perform an inverse transform on the right channel weighted panned amplitude signal and the right channel weighted split phase signal.
Although the present invention has been described with reference to the above embodiments, it should be understood that the invention is not limited to the embodiments, and various changes and modifications can be made by one skilled in the art without departing from the spirit and scope of the invention.
[ notation ] to show
100: audio processing system
110: classification module
120: conversion module
130: left sound track translation module
140: right track translation module
150: left sound track broadening module
160: right sound channel broadening module
170: reverse conversion module
180: audio output module
300: audio processing method
310-360: step (ii) of
341-346: step (ii) of
351-356: step (ii) of
C1-Cn: category label
LSe1-LSen: left channel separation curve
PC1, PC 2: curve of translation angle
SC 1: left channel separation curve
SC 2: right track separation curve
RSe1-RSen: right track separation curve
Sh1-Shn: curve of translation
W1-Wn: weight parameter

Claims (10)

1. An audio processing method, comprising:
providing an input sound signal;
providing a plurality of categories, wherein the categories correspond to a plurality of processing parameter sets one to one, and each processing parameter set comprises a translation angle curve, a separation curve and a weight parameter;
classifying the input sound signal according to the plurality of classes to obtain at least one input sound class corresponding to the input sound signal, and the panning angle curve, the separation curve and the weight parameter corresponding to the at least one input sound class, wherein the at least one input sound class is at least one of the plurality of classes;
performing a conversion step on the input sound signal to convert the input sound signal into a frequency domain and obtain an amplitude signal and a phase signal corresponding to the input sound signal;
performing a panning step on the amplitude signal according to the at least one input sound type of the input sound signal, the panning angle curve corresponding to the at least one input sound type, and the weighting parameter, so as to obtain at least one weighted panning amplitude signal of the input sound signal;
performing a separation step on the phase signal according to the at least one input sound category of the input sound signal and the separation curve and the weight parameter corresponding to the at least one input sound category to obtain at least one weighted separation phase signal of the input sound signal;
wherein, when the number of the at least one weighted translational amplitude signal and the number of the at least one weighted separation phase signal are one, an inverse transformation step is performed on the weighted translational amplitude signal and the weighted separation phase signal to obtain an optimized sound signal corresponding to the time domain.
2. The audio processing method of claim 1, wherein the panning step comprises:
calculating a translation curve according to the translation angle curve corresponding to the at least one input sound category;
multiplying the panning curve corresponding to the at least one input sound type by the weighting parameter corresponding to the at least one input sound type to obtain a weighted panning curve corresponding to the input sound signal; and
the amplitude signal is multiplied by the corresponding weighted shift curve to obtain the at least one weighted shifted amplitude signal.
3. The audio processing method of claim 1, wherein the separating step comprises:
adding the phase signal and the corresponding separation curve to obtain a separation phase signal corresponding to the input sound signal; and
multiplying the split phase signal by the corresponding weight parameter to obtain the at least one weighted split phase signal.
4. The audio processing method of claim 1, wherein:
summing the plurality of weighted shifted amplitude signals to obtain a summed amplitude signal and summing the plurality of weighted separated phase signals to obtain a summed phase signal when the number of the at least one weighted shifted amplitude signal and the number of the at least one weighted separated phase signal are greater than one; and
an inverse transformation step is performed on the summed amplitude signal and the summed phase signal to obtain an optimized sound signal corresponding to the time domain.
5. The audio processing method according to claim 1, wherein the transforming step is a fourier transform and the inverse transforming step is an inverse fourier transform.
6. An audio processing method, comprising:
providing an input sound signal, wherein the input sound signal comprises a left channel input signal and a right channel input signal;
providing a plurality of classes, wherein the classes correspond to a plurality of processing parameter sets one-to-one, each processing parameter set comprises a panning angle curve, a first separation curve, a second separation curve and a weight parameter, wherein the first separation curve corresponds to a left channel, and the second separation curve corresponds to a right channel;
performing a first classification step on the left channel input signal according to the plurality of classes to obtain at least one left channel sound class corresponding to the left channel input signal, and obtaining at least one left channel panning angle curve, at least one left channel separation curve and at least one left channel weight parameter corresponding to the left channel input signal according to the at least one left channel sound class;
performing a second classification step on the right channel input signal according to the plurality of categories to obtain at least one right channel sound category corresponding to the right channel input signal, and obtaining at least one right channel panning angle curve, at least one right channel separation curve and at least one right channel weight parameter corresponding to the right channel input signal according to the at least one right channel sound category, wherein the at least one left channel sound category is at least one of the plurality of categories, and the at least one right channel sound category is at least one of the plurality of categories;
performing a left channel audio adjustment step, comprising:
performing a first conversion step to convert the left channel input signal to a frequency domain and obtain a left channel amplitude signal and a left channel phase signal corresponding to the left channel input signal;
performing a first panning step on the left channel amplitude signal according to the at least one left channel panning angle curve and the at least one left channel weighting parameter to obtain at least one left channel weighted panning amplitude signal of the left channel input signal;
performing a first separation step on the left channel phase signal according to the at least one left channel separation curve and the at least one left channel weight parameter to obtain at least one left channel weighted separation phase signal of the left channel input signal;
when the number of the at least one left channel weighted panning amplitude signal and the number of the at least one left channel weighted separating phase signal are one, performing a first inverse transformation step on the left channel weighted panning amplitude signal and the left channel weighted separating phase signal to obtain an optimized left channel sound signal corresponding to a time domain; and
performing a right channel audio adjustment step, comprising:
performing a second conversion step to convert the right channel input signal to a frequency domain and obtain a right channel amplitude signal and a right channel phase signal corresponding to the right channel input signal;
performing a second panning step on the right channel amplitude signal according to the at least one right channel panning angle curve and the at least one right channel weighting parameter corresponding to the right channel input signal to obtain at least one right channel weighted panning amplitude signal of the right channel input signal;
performing a second separation step on the right channel phase signal corresponding to the right channel input signal according to the at least one right channel separation curve corresponding to the right channel input signal and the at least one right channel weight parameter, so as to obtain at least one right channel weighted separation phase signal of the right channel input signal;
when the number of the at least one right channel weighted panning amplitude signal and the number of the at least one right channel weighted separating phase signal are one, a second inverse transformation step is performed on the right channel weighted panning amplitude signal and the right channel weighted separating phase signal to obtain an optimized right channel sound signal corresponding to the time domain.
7. The audio processing method of claim 6, wherein when the number of the at least one left channel sound category is one, the first panning step comprises:
calculating a left channel panning curve according to the at least one left channel panning angle curve;
multiplying the left channel panning curve by the at least one left channel weighting parameter to obtain a left channel weighting panning curve corresponding to the left channel input signal; and
multiplying the left channel amplitude signal by the corresponding left channel weighted panning curve to obtain the at least one left channel weighted panning amplitude signal.
8. The audio processing method of claim 6, wherein when the number of the at least one left channel sound category is one, the first separating step comprises:
adding the left channel phase signal and the at least one left channel separation curve to obtain a left channel separation phase signal corresponding to the left channel input signal; and
multiplying the left channel separation phase signal by the corresponding left channel weight parameter to obtain the at least one left channel weighted separation phase signal.
9. The audio processing method of claim 6, wherein:
summing the plurality of left channel weighted panning amplitude signals to obtain a left channel summed amplitude signal and summing the plurality of left channel weighted separating phase signals to obtain a left channel summed phase signal when the number of the at least one left channel weighted panning amplitude signal and the number of the at least one left channel weighted separating phase signal are greater than one; and
the first inverse conversion step is performed on the left channel sum amplitude signal and the left channel sum phase signal to obtain an optimized left channel sound signal corresponding to the time domain.
10. An audio processing system for processing an input sound signal, wherein the input sound signal comprises a left channel input signal and a right channel input signal, the audio processing system comprising:
a classification module for storing a plurality of processing parameter sets, wherein the plurality of processing parameter sets correspond to a plurality of classes one-to-one, each processing parameter set comprises a panning angle curve, a first separation curve corresponding to a left channel, a second separation curve corresponding to a right channel, and a weight parameter, the classification module is further for performing a first classification step and a second classification step on the left channel input signal and the right channel input signal according to the classes to obtain at least one left channel panning angle curve, at least one left channel separation curve, and at least one left channel weight parameter of at least one left channel sound class corresponding to the left channel input signal, and to obtain at least one right channel sound class, at least one right channel panning curve, at least one right channel separation curve, and at least one right channel weight parameter corresponding to the right channel input signal, wherein the at least one left channel sound category is one of the plurality of categories and the at least one right channel sound category is one of the plurality of categories;
a conversion module, configured to perform a conversion step on the left channel input signal and the right channel input signal, so as to convert the left channel input signal and the right channel input signal into a frequency domain, obtain a left channel amplitude signal and a left channel phase signal corresponding to the left channel input signal, and obtain a right channel amplitude signal and a right channel phase signal corresponding to the right channel input signal;
a left channel panning module, configured to perform a first panning step on the left channel amplitude signal according to the at least one left channel panning angle curve and the at least one left channel weighting parameter, so as to obtain at least one left channel weighted panning amplitude signal of the left channel input signal;
a right channel panning module, configured to perform a second panning step on the right channel amplitude signal according to the at least one right channel panning angle curve and the at least one right channel weighting parameter, so as to obtain at least one right channel weighted panning amplitude signal of the right channel input signal;
a left channel widening module, configured to perform a first separation step on the left channel phase signal according to the at least one left channel separation curve and the at least one left channel weight parameter, so as to obtain at least one left channel weighted separation phase signal of the left channel input signal;
a right channel widening module, configured to perform a second separation step on the right channel phase signal according to the at least one right channel separation curve and the at least one right channel weight parameter, so as to obtain at least one right channel weighted separation phase signal of the right channel input signal; and
an inverse transform module, wherein:
the inverse conversion module is configured to perform a first inverse conversion step on the left channel weighted panning amplitude signal and the left channel weighted separating phase signal when the number of the at least one left channel weighted panning amplitude signal and the number of the at least one left channel weighted separating phase signal are one, so as to obtain an optimized left channel sound signal corresponding to a time domain; and
the inverse transform module is configured to perform a second inverse transform step on the right channel weighted panning amplitude signal and the right channel weighted separating phase signal when the number of the at least one right channel weighted panning amplitude signal and the number of the at least one right channel weighted separating phase signal are one, so as to obtain an optimized right channel sound signal corresponding to a time domain.
CN201910244713.5A 2019-03-28 2019-03-28 Audio processing method and audio processing system Active CN111757239B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910244713.5A CN111757239B (en) 2019-03-28 2019-03-28 Audio processing method and audio processing system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910244713.5A CN111757239B (en) 2019-03-28 2019-03-28 Audio processing method and audio processing system

Publications (2)

Publication Number Publication Date
CN111757239A true CN111757239A (en) 2020-10-09
CN111757239B CN111757239B (en) 2021-11-19

Family

ID=72672350

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910244713.5A Active CN111757239B (en) 2019-03-28 2019-03-28 Audio processing method and audio processing system

Country Status (1)

Country Link
CN (1) CN111757239B (en)

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080205659A1 (en) * 2007-02-22 2008-08-28 Siemens Audiologische Technik Gmbh Method for improving spatial perception and corresponding hearing apparatus
US20100157726A1 (en) * 2006-01-19 2010-06-24 Nippon Hoso Kyokai Three-dimensional acoustic panning device
CN101884065A (en) * 2007-10-03 2010-11-10 创新科技有限公司 The spatial audio analysis that is used for binaural reproduction and format conversion is with synthetic
CN103329571A (en) * 2011-01-04 2013-09-25 Dts有限责任公司 Immersive audio rendering system
US20140362998A1 (en) * 2013-06-11 2014-12-11 Harman Becker Automotive Systems Gmbh Directional coding conversion
CN104919822A (en) * 2012-11-15 2015-09-16 弗兰霍菲尔运输应用研究公司 Segment-wise adjustment of spatial audio signal to different playback loudspeaker setup
CN104995681A (en) * 2013-02-15 2015-10-21 高通股份有限公司 Video analysis assisted generation of multi-channel audio data
US9197977B2 (en) * 2007-03-01 2015-11-24 Genaudio, Inc. Audio spatialization and environment simulation
CN108206978A (en) * 2016-12-16 2018-06-26 大北欧听力公司 Binaural listening apparatus system with ears pulse environmental detector

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100157726A1 (en) * 2006-01-19 2010-06-24 Nippon Hoso Kyokai Three-dimensional acoustic panning device
US20080205659A1 (en) * 2007-02-22 2008-08-28 Siemens Audiologische Technik Gmbh Method for improving spatial perception and corresponding hearing apparatus
US9197977B2 (en) * 2007-03-01 2015-11-24 Genaudio, Inc. Audio spatialization and environment simulation
CN101884065A (en) * 2007-10-03 2010-11-10 创新科技有限公司 The spatial audio analysis that is used for binaural reproduction and format conversion is with synthetic
CN103329571A (en) * 2011-01-04 2013-09-25 Dts有限责任公司 Immersive audio rendering system
CN104919822A (en) * 2012-11-15 2015-09-16 弗兰霍菲尔运输应用研究公司 Segment-wise adjustment of spatial audio signal to different playback loudspeaker setup
CN104995681A (en) * 2013-02-15 2015-10-21 高通股份有限公司 Video analysis assisted generation of multi-channel audio data
US20140362998A1 (en) * 2013-06-11 2014-12-11 Harman Becker Automotive Systems Gmbh Directional coding conversion
CN108206978A (en) * 2016-12-16 2018-06-26 大北欧听力公司 Binaural listening apparatus system with ears pulse environmental detector

Also Published As

Publication number Publication date
CN111757239B (en) 2021-11-19

Similar Documents

Publication Publication Date Title
US10595144B2 (en) Method and apparatus for generating audio content
US8160270B2 (en) Method and apparatus for acquiring multi-channel sound by using microphone array
CN102907120B (en) For the system and method for acoustic processing
US10412226B2 (en) Audio signal processing apparatus and method
CN103811023A (en) Audio processing device, method and program
CN111757239B (en) Audio processing method and audio processing system
TWI692719B (en) Audio processing method and audio processing system
CN109036455B (en) Direct sound and background sound extraction method, loudspeaker system and sound reproduction method thereof
EP3787311B1 (en) Sound image reproduction device, sound image reproduction method and sound image reproduction program
US11863946B2 (en) Method, apparatus and computer program for processing audio signals
Härmä Classification of Time–Frequency Regions in Stereo Audio
CN111757240B (en) Audio processing method and audio processing system
Stefanakis et al. Foreground suppression for capturing and reproduction of crowded acoustic environments
Kim et al. Multi-channel audio recording based on superdirective beamforming for portable multimedia recording devices
Hwang et al. On-Device Intelligence for Real-Time Audio Classification and Enhancement
TWI719429B (en) Audio processing method and audio processing system
KR101607334B1 (en) Method for decoding multi-channel audio signals and multi-channel audio codec
Moon et al. Multi-channel audio source separation using azimuth-frequency analysis and convolutional neural network
Härmä Estimation of the energy ratio between primary and ambience components in stereo audio data
Tsai et al. Neural Network-Based Low-Frequency Perception Enhancement Used in Biomedical Hearing Applications
JP6684651B2 (en) Channel number converter and its program
Miyauchi et al. Depth Estimation of Sound Images Using Directional Clustering and Activation-Shared Nonnegative Matrix Factorization
CN117198314A (en) Voice processing method, device, electronic equipment and storage medium
Unoki et al. A model for selective segregation of a target instrument sound from the mixed sound of various instruments.

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant