CN1581292A - Non-linear overlapping method for sequence switch - Google Patents

Non-linear overlapping method for sequence switch Download PDF

Info

Publication number
CN1581292A
CN1581292A CN 03127827 CN03127827A CN1581292A CN 1581292 A CN1581292 A CN 1581292A CN 03127827 CN03127827 CN 03127827 CN 03127827 A CN03127827 A CN 03127827A CN 1581292 A CN1581292 A CN 1581292A
Authority
CN
China
Prior art keywords
value
maximum index
index value
critical value
predetermined number
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN 03127827
Other languages
Chinese (zh)
Other versions
CN1244901C (en
Inventor
吴俊德
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ali Corp
Original Assignee
Ali Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ali Corp filed Critical Ali Corp
Priority to CN 03127827 priority Critical patent/CN1244901C/en
Publication of CN1581292A publication Critical patent/CN1581292A/en
Application granted granted Critical
Publication of CN1244901C publication Critical patent/CN1244901C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Landscapes

  • Complex Calculations (AREA)

Abstract

The invention discloses non-linear superimposed sequence conversion method for synthesizing S3[n] from S1[n] and S2[n]. S1[n] contains N1 pieces of signals, and S2[n] contains N2 pieces of signals. The method includes following steps: (1) S2[n] is delayed a preconcerted certain number in order to form S5[n]; (2) building correlation table between S1[n] and S5[n]; (3) setting up S3[n] as following: S1[n], when 0 is less than or equal to n<(preconcerted number + maximum index value corresponding to maximum correlation value + first critical value); synthesized S4[n] from weighted S1[n], when (preconcerted certain number + the maximum index value + the first critical value) is less than or equal to n<(N1-second critical value); S4[n-(preconcerted certain number + the maximum index value)], when (N1- second critical value) 0 is less than or equal to n is less than or equal to N2 + preconcerted certain number + the maximum index value; Where the first critical value and second critical value are not equal to zero at same time, S4[n] is as S5[n] delayed a maximum index value.

Description

The non-linear method of superposition that is used for the sequential conversion
Technical field
The present invention relates to provide a kind of signal synthesis method, relate in particular to a kind of non-linear overlapping (nonlinear overlap) method that is applied to sequential conversion (timescaling).
Background technology
Along with the progress of science and technology, some are also more and more as the function that the video-audio playing device of Karaoke and so on can provide, and for example similarly are that audio purifies (audio clean-up), dreamlike sound field (dream), reaches sequential conversion functions such as (time scaling).So-called sequential conversion (being called time stretching, time compression/expansion or time correction again) is under the situation that does not influence tone (pitch), change the length of sound signal, that is change the playback rate (tempo) of this sound signal.
At present, AV device on the market mostly is to see through three kinds of following methods to finish the sequential conversion, and one is that another then is Time Domain Harmonic Scaling (TDHS) to Phase Vocoder, for MPEX (Minimum Perceived Loss TimeExpansion/Compression).To be the mode of utilizing STFT (Short Time Fourier Transform) earlier convert a sound signal frequency-region signal (complex Fourierrepresentation) of a Fourier pattern to Phase vocoder, and the mode of utilizing interpolation and iSTFT (inverse) again converts this frequency-region signal to one and changed the sound signal of (time scaled) corresponding to the sequential of this sound signal.MPEX is developed by Prosoniq recently, and MPEX is a kind of method of simulating human auditory properties, is similar to artificial neural network (artificial neural network).MPEX is the sound signal of being included according in the specific time sequence, and and then " study " this specific period in the various characteristics of sound signal, to attempt to prolong or shorten this sound signal.TDHS then is the method that a kind of more general sequential is changed, it is each correlation (magnitudes of a autocorrelation function) that calculates earlier in the correlation table (autocorrelogram) of first sound signal, then postpone this first sound signal to produce second sound signal according to the pairing maximum index value of the maximum related value in this correlation table, and then with this first sound signal with overlapping addition (synchronized overlap-add, SOLA) mode is replicated on this second sound signal, to produce the 3rd sound signal long than first sound signal.
Generally speaking, above-mentioned correlation table is to see through numerical digit signal processor (DSP) to set up, and DSP is specially as the calculating (convolution) of handling as circle round, fast fourier transform (fast Fouriertransform, the usefulness of complex mathematical computing such as FFT).Even so, DSP and also there is no need with regard to a certain degree all are overlapped in that the part of this second sound signal is all overlapping synthesizes in this second sound signal not only tediously longly with the process that forms the 3rd sound signal in this first sound signal.
Summary of the invention
Therefore fundamental purpose of the present invention is to provide a kind of non-linear method of superposition that is used for the sequential conversion, this method is unlikely to influence significantly the quality of the 3rd sound signal again apace this first sound signal and this second sound signal being synthesized in the 3rd sound signal.
According to claims of the present invention, the present invention discloses a kind of being used for S 1[n] and S 2[n] synthesizes S 3The non-linear overlapping sequential conversion of [n], wherein S 1[n] comprises N 1Individual signal, and S 2[n] comprises N 2Individual signal, this method comprises the following step: (a) with S 2[n] postpones a predetermined number to form S 5[n] (b) sets up S 1[n] and S 5The correlation table of [n], and (c) with S 3[n] sets for:
S 1[n] is when 0<=n<(the pairing maximum index value of maximum related value+first critical value in this predetermined number+this correlation table);
S 1[n] weighting is synthesized in S 4[n] is as (this predetermined number+this maximum index value+this first critical value)<=n<(N 1-the second critical value) time;
S 4[n-(this predetermined number+this maximum index value)] is as (N 1-this second critical value)<=n<=N 2+ this predetermined number+this maximum index value;
Wherein this first, second critical value is not zero simultaneously, and S 4[n] is S 5[n] postpones this maximum index value.
Method of the present invention is only a part of weighting that is overlapped in this first sound signal in the part of this second sound signal to be synthesized in this second sound signal to produce the 3rd sound signal, therefore, can increase the operational effectiveness of the computer at the DSP place that is used for handling the sequential conversion.
Description of drawings
Fig. 1 is the process flow diagram of the inventive method.
Fig. 2 is that the inventive method is with S 1[n] and S 2[n] synthesizes S 3The synoptic diagram of [n].
Fig. 3 increases the synoptic diagram of sound signal for the inventive method.
Fig. 4 shortens the synoptic diagram of sound signal for the inventive method.
Graphic symbol description
Δ predetermined number τ MaxMaximum index value
Th 1The first critical value th 2Second critical value
Embodiment
Behind the correlation table of setting up corresponding to first sound signal and second sound signal (or postponing in the sound signal of this second sound signal), the method 100 in the preferred embodiment of the present invention is to calculate the 3rd sound signal according to the pairing maximum index value of the maximum related value in this correlation table, first critical value, second critical value and this first sound signal and this second sound signal.Specifically, in order to save in order to synthetic this first sound signal and this second sound signal computing time with the DSP that produces the 3rd sound signal, method 100 calculate this maximum index value and with this this maximum index value of second delayed audio signal after, be not with all parts that are overlapped in this second sound signal in this first sound signal all weighting synthesize in this second sound signal, be only to synthesize in this second sound signal to produce the 3rd sound signal on the contrary with being overlapped in a part in the part of this second sound signal (that is be positioned in this lap between this first critical value and this second critical value lap) weighting in this first sound signal.
See also Fig. 1, Fig. 1 is the process flow diagram of method 100 in the preferred embodiment of the present invention.Method 100 comprises the following step:
Step 102: beginning;
(S 1[n] and S 2[n] will be synthesized and be S 3[n] supposes S 1[n] and S 2[n] comprises N respectively 1And N 2Individual signal)
Step 104: with S 2[n] postpones a predetermined number Δ to form S 5[n];
(optical read head (pickuphead) in video-audio playing device is reading S 3The phenomenon of reading of data deficiency (run-in) takes place when [n], so method of the present invention 100 is earlier with S 2After [n] postpones the predetermined number Δ, just calculate synthetic S 1[n] and S 5The maximum index value τ that [n] is required MaxIn a preferred embodiment of the invention, the predetermined number Δ is to equal [N 1/ 3])
Step 106: set up S 1[n] and S 5The correlation table of [n] (crosscorrelogram) and according to the pairing maximum index value τ of the maximum related value in this correlation table MaxPostpone S 5[n] is to form S 4[n];
(comprising a plurality of correlations (magnitudes of a crosscorrelationfunction) in this correlation table, all corresponding index value of each correlation)
Step 108: with S 1[n] and S 4[n] synthesizes in S 3[n];
(S 3[n] is configured to:
S 1[n] is when 0<=n<(predetermined number Δ+maximum index value τ Max+ the first critical value th 1) time;
S 1[n] weighting is synthesized in S 4[n] is when (predetermined number Δ+maximum index value τ Max+ the first critical value th 1)<=n<(N 1-the second critical value th 2) time;
S 4[n-(predetermined number Δ+maximum index value τ Max)], as (N 1-the second critical value th 2)<=n<=N 2+ predetermined number Δ+maximum index value τ Max
The first critical value th wherein 1And the second critical value th 2Be not zero simultaneously)
Step 110: finish.
See also Fig. 2, Fig. 2 is the S in the preferred embodiments of the present invention 1[n] and S 2[n] synthesizes S 3The synoptic diagram of [n].First 401 among Fig. 4 is the S in the step 102 of display packing 100 1[n] and S 2[n], second portion 402 are the S in the step 104 of display packing 100 1[n] and S 5[n], third part 403 are the τ that calculated in the step 106 of display packing 100 MaxAnd S 4[n] the 4th part 404 and the 5th part 405 then in the step 108 of display packing 100 by S 1[n] and S 4The S that [n] synthesized 3[n].
Shown S in the 4th part 404 of Fig. 2 3[n] is at (predetermined number Δ+maximum index value τ Max+ the first critical value th 1)<=n<(N 1-the second critical value th 2) time be to equal:
( N 1 - th 2 - n ) ( N 1 - ( &Delta; + &tau; max + th 1 + th 2 ) ) * S 1 [ n ] + n - ( &Delta; + th 1 + &tau; max ) ( N 1 - ( &Delta; + &tau; max + th 1 + th 2 ) ) * S 4 [ n - ( &Delta; + &tau; max ) ]
And shown S in the 5th part 405 of Fig. 2 3[n] is at (predetermined number Δ+maximum index value τ Max+ the first critical value th 1)<=n<(N 1-the second critical value th 2) time be to equal:
( N 1 - n ) ( N 1 - ( &Delta; + &tau; max ) ) * S 1 [ n ] + n - ( &Delta; + &tau; max ) ( N 1 - ( &Delta; + &tau; max ) ) * S 4 [ n - ( &Delta; + &tau; max ) ]
Above-mentioned S 1[n] is if be congruent to S 2[n], that is S 1[n] and S 2[n] separates from S[n] same position, as shown in Figure 3, then method 100 is to increase S 1[n].On the contrary, S 1[n] and S 2[n] is as if unequal, that is S 1[n] and S 2[n] separates from S[n] diverse location, as shown in Figure 4, then method 100 is with S 1[n], S 6[n] (being rejected), and S 2[n] shortens to S 3[n].
Compare with known TDHS, method of the present invention is to be used for reducing S according to the pairing maximum index value of the maximum related value in the correlation table and two 1[n] and S 2First and second critical value of the lap of [n] is calculated and is synthesized in S 1[n] and S 2The S of [n] 3[n].Because the present invention after calculating this maximum index value, does not need to calculate one by one S 1[n] is overlapped in S 2Whole numerical value of [n], that is only need calculate S 3Therefore part numerical value in [n] between between this first and second critical value can be saved and is used for according to S 1[n] and S 2[n] is with synthetic S 3The DSP of [n] calculates S 3The time of [n] required cost, jointly, also increase the operational effectiveness of the computer at this DSP place.
The above only is the preferred embodiments of the present invention, and all equalizations of making according to claim of the present invention change and revise, and all should belong to the covering scope of patent of the present invention.

Claims (19)

1. a non-linear method of superposition that is used for the sequential conversion is used for S 1[n] and S 2[n] synthesizes S 3[n], S 1[n] comprises N 1Individual signal, and S 2[n] comprises N 2Individual signal, this method comprises the following step:
(a) with S 2[n] postpones a predetermined number to form S 5[n];
(b) set up S 1[n] and S 5The correlation table of [n] comprises a plurality of correlations in this correlation table, all corresponding index value of each correlation; And
(c) according to the pairing maximum index value of the maximum related value in this correlation table, with S 3[n] sets for:
S 1[n] is when 0<=n<(this predetermined number+this maximum index value+first critical value);
S 1[n] weighting is synthesized in S 4[n] is as (this predetermined number+this maximum index value+this first critical value)<=(N 1-the second critical value) time;
S 4[n-(this predetermined number+this maximum index value)] is as (N 1-this second critical value)<=n<=N 2+ this predetermined number+this maximum index value;
Wherein this first, second critical value is not zero simultaneously, and S 4[n] is S 5[n] postpones this maximum index value.
2. the method for claim 1 is wherein worked as (this predetermined number+this maximum index value+this first critical value)<=n<(N 1-the second critical value) time, S 3[n] equals (N 1-this second critical value-n)/(N 1-(this predetermined number+this maximum index value+this first critical value+this second critical value)) * S 1[n]+(n-(this predetermined number+this maximum index value+this first critical value))/(N 1-(this predetermined number+this maximum index value+this first critical value+this second critical value)) * S 4[n-(this predetermined number+this maximum index value)].
3. the method for claim 1 is wherein worked as (this predetermined number+this maximum index value+this first critical value)<=n<(N 1-the second critical value) time, S 3[n] equals (N 1-n)/(N 1-(this predetermined number+this maximum index value)) * S 1[n]+(n-(this predetermined number+this maximum index value))/(N 1-(this predetermined number+this maximum index value)) * S 4[n-(this predetermined number+this maximum index value)].
4. the method for claim 1, wherein S 1[n] and S 2[n] takes a sample from S respectively 1(t) and S 2(t).
5. method as claimed in claim 3, wherein S 1(t) and S 2(t) be to separate from an original signal.
6. method as claimed in claim 5, wherein this original signal is a sound signal.
7. method as claimed in claim 5, wherein this original signal is a vision signal.
8. method as claimed in claim 4, wherein S 1(t) be to equal S 2(t).
9. method as claimed in claim 4, wherein S 1(t) be to be not equal to S 2(t).
10. the method for claim 1, wherein this predetermined number is to equal [N 1/ 3].
11. a non-linear method of superposition that is used for the sequential conversion is used for S 1[n] and S 2[n] synthesizes S 3[n], S 1[n] comprises N 1Individual signal, and S 2[n] comprises N 2Individual signal, this method comprises the following step:
(a) set up S 1[n] and S 2The correlation table of [n] comprises a plurality of correlations in this correlation table, all corresponding index value of each correlation; And
(b) according to the pairing maximum index value of the maximum related value in this correlation table, with S 3[n] sets for:
S 1[n] is when 0<=n<(this maximum index value+first critical value);
S 1[n] weighting is synthesized in S 4[n] is as (this maximum index value+this first critical value)<=n<(N 1-the second critical value) time;
S 4[this maximum index value of n-]], as (N 1-this second critical value)<=n<=(N 2+ this maximum index value);
Wherein this first, second critical value is not zero simultaneously, and S 4[n] is S 2[n] postpones this maximum index value.
12. method as claimed in claim 11 is wherein as (this maximum index value+this first critical value)<=n<(N 1-the second critical value) time, S 3[n] equals (N 1-this second critical value-n)/(N 1-(this maximum index value+this first critical value+this second critical value)) * S 1[n]+(n-(this maximum index value+this first critical value))/(N 1-(this maximum index value+this first critical value+this second critical value)) * S 4[n-(this maximum index value)].
13. method as claimed in claim 11 is wherein as (this predetermined number+this maximum index value+this first critical value)<=n<(N 1-the second critical value) time, S 3[n] equals (N 1-n)/(N 1-(this predetermined number+this maximum index value)) * S 1[n]+(n-(this predetermined number+this maximum index value))/(N 1-(this predetermined number+this maximum index value)) * S 4[n-(this predetermined number+this maximum index value)].
14. method as claimed in claim 11, wherein S 1[n] and S 2[n] takes a sample from S respectively 1(t) and S 2(t).
15. method as claimed in claim 14, wherein S 1(t) and S 2(t) be to separate from an original signal.
16. method as claimed in claim 15, wherein this original signal is a sound signal.
17. method as claimed in claim 15, wherein this original signal is a vision signal.
18. method as claimed in claim 14, wherein S 1(t) be to equal S 2(t).
19. method as claimed in claim 14, wherein S 1(t) be to be not equal to S 2(t).
CN 03127827 2003-08-11 2003-08-11 Non-linear overlapping method for sequence switch Expired - Fee Related CN1244901C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 03127827 CN1244901C (en) 2003-08-11 2003-08-11 Non-linear overlapping method for sequence switch

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 03127827 CN1244901C (en) 2003-08-11 2003-08-11 Non-linear overlapping method for sequence switch

Publications (2)

Publication Number Publication Date
CN1581292A true CN1581292A (en) 2005-02-16
CN1244901C CN1244901C (en) 2006-03-08

Family

ID=34578871

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 03127827 Expired - Fee Related CN1244901C (en) 2003-08-11 2003-08-11 Non-linear overlapping method for sequence switch

Country Status (1)

Country Link
CN (1) CN1244901C (en)

Also Published As

Publication number Publication date
CN1244901C (en) 2006-03-08

Similar Documents

Publication Publication Date Title
US6073100A (en) Method and apparatus for synthesizing signals using transform-domain match-output extension
US20050025263A1 (en) Nonlinear overlap method for time scaling
CA2448178C (en) Method for time aligning audio signals using characterizations based on auditory events
CN1144369A (en) Autokeying for musical accompaniment playing apparatus
US20100145708A1 (en) System and method for identifying original music
CN113314140A (en) Sound source separation algorithm of end-to-end time domain multi-scale convolutional neural network
WO1997034289A1 (en) System for automatically morphing audio information
EP1303855A2 (en) Continuously variable time scale modification of digital audio signals
JPH0863197A (en) Method of decoding voice signal
GB2060321A (en) Speech synthesizer
JPH06266390A (en) Waveform editing type speech synthesizing device
CN113241082B (en) Sound changing method, device, equipment and medium
CN111192594B (en) Method for separating voice and accompaniment and related product
EP1074968B1 (en) Synthesized sound generating apparatus and method
Ferreira-Paiva et al. A survey of data augmentation for audio classification
Sudo et al. Multichannel environmental sound segmentation: with separately trained spectral and spatial features
Jensen The timbre model
CN1244901C (en) Non-linear overlapping method for sequence switch
CN112309425B (en) Sound tone changing method, electronic equipment and computer readable storage medium
CN118696375A (en) Method and system for real-time low-delay synthesis of audio using neural networks and differentiable digital signal processors
CN100343893C (en) Method of synthesis for a steady sound signal
US5647005A (en) Pitch and rate modifications of audio signals utilizing differential mean absolute error
US5832442A (en) High-effeciency algorithms using minimum mean absolute error splicing for pitch and rate modification of audio signals
WO2017164216A1 (en) Acoustic processing method and acoustic processing device
CN1246825C (en) Method for predicationg intonation estimated value of voice signal

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20060308

Termination date: 20140811

EXPY Termination of patent right or utility model