WO2003005342A1 - Procede et appareil de couplage de signaux - Google Patents
Procede et appareil de couplage de signaux Download PDFInfo
- Publication number
- WO2003005342A1 WO2003005342A1 PCT/JP2002/006479 JP0206479W WO03005342A1 WO 2003005342 A1 WO2003005342 A1 WO 2003005342A1 JP 0206479 W JP0206479 W JP 0206479W WO 03005342 A1 WO03005342 A1 WO 03005342A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- signal
- waveform
- upper limit
- waveform signals
- frequency
- Prior art date
Links
- 238000010168 coupling process Methods 0.000 title claims abstract description 33
- 238000001228 spectrum Methods 0.000 claims abstract description 51
- 238000001914 filtration Methods 0.000 claims abstract description 39
- 230000008878 coupling Effects 0.000 claims abstract description 29
- 238000005859 coupling reaction Methods 0.000 claims abstract description 29
- 238000000034 method Methods 0.000 claims description 48
- 239000002131 composite material Substances 0.000 claims description 17
- 230000003595 spectral effect Effects 0.000 claims description 6
- 238000010183 spectrum analysis Methods 0.000 claims description 2
- 238000010586 diagram Methods 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- 238000001308 synthesis method Methods 0.000 description 4
- 101100488882 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) YPL080C gene Proteins 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 230000001934 delay Effects 0.000 description 2
- 230000003111 delayed effect Effects 0.000 description 2
- 230000005236 sound signal Effects 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000006866 deterioration Effects 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 230000029058 respiratory gaseous exchange Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
- G10L13/07—Concatenation rules
Definitions
- the present invention relates to a signal combining method and a signal combining device for generating a composite waveform signal by combining signals representing waveforms, and more particularly to a method and a device suitable for combining a plurality of audio waveform signals. Things.
- speech synthesized by a speech synthesis technique has been widely used. Specifically, it is used in many situations, such as text-to-speech software, telephone number guidance, stock information, travel information, store information, and traffic information.
- Speech synthesis methods are roughly classified into a rule synthesis method and a shape editing method.
- the rule synthesis method is a method in which morphological analysis is performed on the text for which speech is to be synthesized, and speech is generated by performing phonological processing on the text based on the analysis result.
- this rule synthesizing method there are few restrictions on the content of text used for speech synthesis, and text having various contents can be used for speech synthesis.
- the quality of the output voice is inferior to the rule editing method compared to the waveform editing method.
- the waveform editing method is a method of recording a voice actually uttered by a human and connecting the components obtained by dividing the recorded voice to obtain a target voice.
- the waveform editing method is superior to the rule synthesis method in voice quality.
- this waveform editing method cannot synthesize speech that includes parts that cannot be extracted from recorded speech. Therefore, the larger the unit for dividing the recorded voice, the greater the restrictions on the voice to be synthesized. For this reason, in the waveform editing method, the recorded voice is subdivided into individual vowels and consonants.
- techniques have been proposed to enable synthesis of various voices.
- the waveform of the connecting portion that connects the components of the recorded voice becomes discontinuous, as shown in Fig. 6 (a), for example, and this becomes the source of noise. If the unit for segmenting the recorded voice is small, this noise caused by discontinuous connection parts becomes conspicuous, and the quality of the synthesized voice deteriorates.
- a method of improving the connection by connecting discontinuous connection parts with a straight line can be considered.
- the connected portion generates a harmonic component, and the harmonic component also becomes noise.
- MDS Minimum Distance Search
- Fig. 6 (C) the MDS method is used to join two waveforms together, the part near the rear end of the preceding waveform and the part near the front end as much as possible.
- the point where the instantaneous value and the slope of the tangent line are almost the same from each other is found one by one, and these points are connected.
- the connection between the waveforms is usually not the end of each connected waveform. For this reason, a part of the joined waveforms is usually truncated, and as a result, the synthesized speech becomes unnatural.
- the present invention has been made in view of the above circumstances, and has as its object to provide a signal coupling method and a signal coupling device which can generate a natural synthesized voice with less noise.
- a signal combining method includes a step of combining a plurality of waveform signals in a predetermined order to combine a plurality of waveform signals to generate a combined waveform signal; The combined waveforms for a predetermined time period including each combined portion of the signal. Filtering the signal.
- the predetermined time period is set to be equal to or less than 1/10 of a time length of each waveform signal.
- a signal combining method includes a step of combining a plurality of waveform signals with each other in a predetermined order, and a step of determining an upper limit frequency of a frequency spectrum of each of the plurality of waveform signals.
- the filtering is low-pass filtering
- the predetermined filtering characteristic is a cut-off frequency of the low-pass filtering.
- the cut-off frequency of the low-pass filtering is set to the higher upper limit frequency of the spectrum upper frequencies of the two waveform signals before and after the combined portion of the waveform signals.
- the upper limit frequency of the frequency spectrum of each waveform signal is typically determined by spectrum analysis using Fourier transform. The upper limit frequency is calculated by using a high-pass filter to obtain the average amplitude of the high frequency components. You may make it calculate
- a harmonic component generated by a discontinuous change in the coupling portion of the waveform signal is converted into a spectrum of the waveform signal before and after the coupling portion of the waveform signal. It can be effectively removed by a filter having an adapted filter characteristic. For this reason, the sense of noise in the synthesized waveform signal is significantly reduced.
- a method of the present invention comprises combining a plurality of input waveform signals with each other to generate a composite waveform signal, and calculating a spectrum of a pair of adjacent waveform signals in the composite waveform signal.
- a bandwidth for filtering a combined portion of the pair of waveform signals is determined based on the upper limit frequency, and a combined portion of the pair of waveform signals in the composite waveform signal is filtered with the determined bandwidth. It can be understood as a signal combining method including each signal processing step.
- the signal combining device of the present invention includes means for combining a plurality of waveform signals with each other in a predetermined order to generate a combined waveform signal by combining the plurality of waveform signals. And a filter for filtering the combined plurality of waveform signals for a predetermined time period including each combined portion of the combined plurality of waveform signals.
- the signal coupling device of the present invention comprises: a unit for coupling the plurality of waveform signals to each other in a predetermined order; and a unit for coupling frequency waveforms of the plurality of waveform signals to each other.
- the filter is a low-pass filter, and the predetermined filter characteristic is a cut-off frequency of the low-pass filter.
- the cut-off frequency of the low-pass filtering is set to the higher one of the upper limit frequencies of the spectra of the two waveform signals before and after the combined portion of the waveform signals. ing.
- the upper limit frequency determining means of the present invention includes a spectrum analyzer using a Fourier transformer or a high-pass filter.
- a signal combiner of the present invention includes: coupling means for coupling a plurality of input waveform signals to each other to generate a composite waveform signal; and a pair of adjacent waveforms in the composite waveform signal.
- Bandwidth determining means for determining a bandwidth for filtering a coupling portion of the pair of waveform signals based on a frequency of an upper limit of a spectrum of the signal; and the pair of waveform signals of the composite waveform signal And the bandwidth determined by the bandwidth determining means. It is grasped as a signal combining device including a filtering means for filtering at.
- the combined portion of the two input signals combined by such a signal combiner is filtered with a bandwidth determined by the upper limit frequency of the spectrum of these input waveform signals, so that the composite waveform signal is Thus, noise caused by higher harmonic components is reduced. Further, according to such a signal coupling device, since the end of the input signal is not truncated, when the input waveform signal represents a voice waveform, a natural synthesized voice is generated.
- the bandwidth determining unit includes, for example, a unit that performs Fourier transform on each of the pair of waveform signals, and specifies an upper limit frequency of a spectrum of the two input signals based on a result of the Fourier transform.
- the bandwidth determining means includes a table storing means for storing a table indicating, for each candidate, an upper limit frequency of a spectrum of a plurality of candidates that can be an input waveform signal; Obtains identification data for identifying a pair of waveform signals from the outside, reads out the upper limit frequency of the spectrum of each input waveform signal identified by the obtained identification data from the table, and reads the frequency.
- the maximum value of the obtained frequencies is specified as the upper limit frequency of the spectrum of the pair of waveform signals.
- FIG. 1 is a diagram showing a speech synthesizer according to an embodiment of the present invention.
- FIG. 2 is a block diagram showing an internal configuration of the speech synthesizer according to the embodiment of the present invention.
- FIG. 3 (a) shows the spectrum of the signal supplied to the input terminal IN-A.
- FIG. 3 (b) is a graph showing a spectrum of a signal supplied to the input terminal IN-B
- FIG. 3 (c) is a graph showing a frequency characteristic of the low-pass filter. It is a graph.
- Fig. 4 (a) is a graph showing the waveform signal supplied to the input terminal IN-A
- Fig. 4 (b) is a graph showing the waveform signal supplied to the input terminal IN-B
- FIG. 4 (c) is a graph showing a signal output from the adder
- FIG. 4 (d) is a graph showing a signal output from the one-pass filter.
- FIG. 5 is a block diagram showing an internal configuration of a modified example of the speech synthesizer of FIG.
- FIG. 6 (a) is a diagram showing a state in which signals to be connected are discontinuous
- FIG. 6 (b) is a diagram showing a conventional method of connecting discontinuous portions with straight lines
- FIG. 6 (c) is a diagram showing a state where signals are connected by the MDS method.
- the speech synthesizer 10 converts a waveform signal obtained by subdividing a pre-recorded speech into individual vowel and consonant levels at an input terminal IN-A. And a basic audio signal that is supplied from IN-B and synthesized from the supplied waveform signal is output from the output terminal OUT.
- a specific internal configuration of the speech synthesizer 10 is connected to a delay unit 1A and a Fourier transform unit 2A connected to an input terminal IN-A, and to an input terminal IN-B.
- the delay units 1A and 1B have substantially the same configuration as each other, and are each configured of a delay circuit such as a shift register. You.
- the delay unit 1A is connected to the input terminal IN-A, and the delay unit 1B is connected to the input terminal IN-B.
- the delay unit 1 A delays this signal for a predetermined time and supplies the signal to the addition unit 3.
- the delay unit 1B delays this signal for a predetermined time and supplies the signal to the addition unit 3.
- time lengths in which the delay units 1A and 1B delay the signals supplied thereto are substantially the same. This time length is selected so that the timing at which the filter characteristic determination unit 4 supplies a control signal described later to the LPF 5 is as described later.
- the Fourier transform units 2A and 2B have substantially the same configuration as each other, and are each composed of a digital signal processor (DSP), a CPU (Central Processing Unit), and the like. You.
- the Fourier transform unit 2A is connected to the input terminal IN-A, and the Fourier transform unit 2B is connected to the input terminal IN-B. Therefore, the same signal is supplied to the Fourier transform unit 2A and the delay unit 1A practically simultaneously from the input terminal INA.
- the same signal is supplied to the Fourier transform unit 2B and the delay unit 1B from the input terminal INB at substantially the same time.
- the Fourier transform unit 2A When supplied with a signal representing a waveform from the input terminal IN-A, the Fourier transform unit 2A uses a fast Fourier transform technique (or any other technique that generates data representing the result of Fourier transform of the signal). Then, spectrum data representing the spectrum of the waveform represented by this signal is generated and supplied to the filter characteristic determination unit 4. Similarly, when a signal representing a waveform is supplied from the input terminal IN-B to the Fourier transform unit 2B, the Fourier transform unit 2B performs substantially the same operation as the Fourier transform unit 2A, and the waveform represented by this signal is spread. The spectrum data representing the torque is generated and supplied to the filter characteristic determination unit 4.
- a fast Fourier transform technique or any other technique that generates data representing the result of Fourier transform of the signal.
- the addition unit 3 is configured by an addition circuit and the like.
- the adder 3 generates a signal representing the sum of the value of the signal supplied from the delay unit 1A and the value of the signal supplied from the delay unit 1B, and supplies the signal to the LPF 5.
- the filter characteristic determining unit 4 is composed of a DSP and a CPU.
- the filter characteristic determining unit 4 receives the spectral data from the Fourier transform units 2A and 2B, respectively. Based on these spectrum data, the cutoff frequency of LPF5 (specifically, for example, the frequency at which the gain of LPF5 drops 3 dB below the peak on the high frequency side) is determined, and the determined cutoff frequency is determined. Is generated and supplied to the LPF 5.
- the filter characteristic determining unit 4 determines the spectrum indicated by the spectral data supplied from the Fourier transform unit 2A.
- the frequency at which the intensity of Sa attenuates by 20 dB from the peak on the high frequency side is specified as the upper limit fa of this spectrum Sa.
- the filter characteristic determination unit 4 determines that the intensity of the spectrum Sb indicated by the spectrum data supplied from the Fourier transform unit 2B is higher on the high frequency side.
- the frequency that attenuates by 20 dB from the peak is specified as the upper limit fb of this vector Sb.
- FIG. 3 (c) is a graph showing the frequency characteristics of the LPF 5 when f a and f b (however, the frequency characteristics while the control signal is supplied to the LP F 5).
- the LPF 5 is composed of, for example, a FIR (Finite Inpulse Response) type digital filter or the like.
- the LPF 5 filters the signal supplied from the adder based on the presence or absence of the control signal from the filter characteristic determiner 4 and the frequency indicated by the control signal, and outputs the result.
- FIR Finite Inpulse Response
- the LPF 5 determines the frequency indicated by the control signal in the waveform represented by the signal supplied from the adding unit 3. A signal representing a component passing through a 5 1 2nd order one-pass filter so as to have a cut-off frequency is generated, and the generated signal is output from an output terminal OUT as a signal representing a result of filtering.
- the LPF 5 outputs the signal supplied from the adder 3 from the output terminal OUT without substantially filtering.
- waveform signals are alternately supplied to the input terminals IN-A and IN-B. That is, as shown in, for example, FIGS. 4 (a) and (b), if an nth (n is an arbitrary positive odd) waveform signal s (n) is supplied to the input terminal IN-A, the nth Substantially at the same time as the waveform signal reaches the end, the (n + 1) th waveform signal s (n + 1) is supplied to the input terminal IN-B, and so on. Are sequentially supplied.
- the n-th waveform signal When the n-th waveform signal is supplied to the input terminal IN—A and the (n + 1) th waveform signal is supplied to the input terminal IN—B, the n-th waveform signal is delayed by the delay unit 1A.
- the (n + 1) -th waveform signal is delayed by the delay unit 1B and supplied to the addition unit 3. Since the time lengths of delay of the signals by the delay units 1A and 1B (the time lengths denoted as “t 0” in FIG. 4 (c)) are substantially equal to each other, the adder unit 3 outputs the signals shown in FIG. As shown in), the nth waveform signal and the (n + 1) th waveform signal are supplied to the LPF 5 substantially continuously without any gap.
- the nth waveform signal is also supplied to the Fourier transform unit 2A
- the (n + 1) th waveform signal is also supplied to the Fourier transform unit 2B.
- the Fourier transform unit 2A generates spectrum data representing the spectrum of the waveform represented by the n-th waveform signal, and supplies the spectrum data to the filter characteristic determination unit 4.
- the Fourier transform unit 2B generates spectrum data representing the spectrum of the waveform represented by the (n + 1) th waveform signal, and supplies the spectrum data to the filter characteristic determination unit 4.
- the filter characteristic determining unit 4 When supplied with two spectral data representing the spectrum of the n-th and (n + 1) -th waveform signals, the filter characteristic determining unit 4 receives each of the spectral data indicated by these spectral data. Specify the frequency at which the intensity of the spectrum at the high frequency side attenuates by 20 dB from the average value. And two identified The higher value of the frequencies is determined as the cut-off frequency of the LPF 5, and a control signal indicating the determined cut-off frequency is supplied to the LPF 5.
- the control signal indicating the cutoff frequency determined based on the nth and (n + 1) th waveform signals has the signal output by the adder 3 as n
- the filter characteristic deciding section 4 sends the LPF 5 It is supplied to.
- the time length from the start of the supply of the control signal to the point at which the waveform signal is switched is determined by the time length of the n-th waveform signal (Fig. 4 ( It is desirable that it be less than one tenth of the length of time indicated by “L (n)” in a).
- the time length from the switching of the waveform signal to the end of the supply of the control signal is the time length of the (n + 1) th waveform signal (shown as “L (n + l)” in FIG. 4 (b)). It is desirable to set it to 1/10 or less of (time length).
- the nth and (n + 1) th waveform signals do not generate unnecessary harmonic components and also remove the frequency components originally contained in each waveform. Combined with one another without substantial loss. Therefore, the voice represented by the combined waveform signal has little noise and a natural synthesized voice is uttered.
- the configuration of the speech synthesizer is not limited to the above.
- the number of LPF 5 filter stages is arbitrary, and the upper limit frequency of the spectrum indicated by the spectrum data supplied by the Fourier transform units 2A and 2B is defined.
- the manner of defining the frequency is not limited to the above definition, but is arbitrary.
- the delay unit 1A, the delay unit 1B, the Fourier transform unit 2A, the Fourier transform unit 2B, the adder unit 3, the filter characteristic determination unit 4, and the LPF 5 are integrated into a single unit. DSP or CPU may do it.
- this speech synthesizer reads a waveform signal from a recording medium (for example, a flexible disk or a MO (Magneto-Optical Disk)) on which the waveform signal is recorded, instead of the input terminals IN-A and IN-B.
- a recording medium drive device for example, a flexible disk drive, a M ⁇ drive, etc. for supplying to the delay units 1A and 1B and the Fourier transform units 2A and 2B.
- the voice synthesizing device may include a recording medium drive device that writes the signal generated by the LPF 5 to a recording medium, instead of the output terminal OUT.
- the same recording medium drive device outputs the waveform signal from the recording medium. Both the reading function and the function of writing the signal generated by the LPF 5 to the recording medium may be performed.
- the waveform signal supplied to the input terminal I N-A or I N-B may be a signal representing a silent state.
- a portion including the end of the signal representing the speech state (specifically, for example, the beginning and end of a voice or a breathing portion) ) Can avoid noise, and this part can be heard as natural sound.
- the speech synthesizer of the present invention does not necessarily require the Fourier transform units 2A and 2B. Instead, for example, a candidate for a waveform signal supplied to the input terminals IN-A and IN-B is determined. A method of providing a table for storing the identification data to be identified and the frequency data indicating the upper limit frequency of the candidate spectrum in association with each other is considered.
- the identification data for identifying the waveform signals supplied to the input terminals IN-A and IN-B is separately obtained from the outside, and the frequency data associated with the obtained identification data is separately obtained. Is read from the table and supplied to the filter characteristic determining unit 4, and the filter characteristic determining unit 4 determines the higher value of the frequencies indicated by the frequency data as the cut-off frequency of the LPF 5.
- this speech synthesizer may include high-pass filters (HPF) 6A and 6B instead of Fourier transform sections 2A and 2B.
- HPF high-pass filters
- the HPFs 6A and 6B have substantially the same configuration as each other, and are each composed of, for example, an IIR (Infinite Impulse Response) type digital filter or the like.
- IIR Infinite Impulse Response
- HPF 6 A is connected to input IN-A
- HPF 6 B is connected to input IN-B
- HPF 6 A and delay unit 1 A have the same signal from input IN-A. Are supplied substantially simultaneously, and the same signal is supplied to the HPF 6 B and the delay unit 1 B from the input terminal IN-B substantially simultaneously. Supplied.
- the HPF 6A When the HPF 6A is supplied with a signal representing a waveform from the input terminal IN-A, the HPF 6A substantially cuts off components below a predetermined power cutoff frequency, and sends the signal to another component filter characteristic determination unit 4. And supply.
- the HP F 6 B substantially blocks a component having a frequency equal to or lower than a predetermined cut-off frequency in the signal supplied from the input terminal IN-B, and supplies the signal to another component filter characteristic determination unit 4. Note that the cutoff frequencies of the HPFs 6A and 6B are substantially equal to each other.
- the filter characteristic determination unit 4 uses the waveform signals supplied from HP F 6 A and 6 B, respectively. (Specifically, based on the larger of the average amplitude level of the component supplied by HP F 6A and the average amplitude level of the component supplied by HP F 6B), Shall be determined.
- this speech synthesizer is provided with HP F6A and 6B instead of Fourier transform units 2A and 2B, complicated Fourier transform processing is omitted, so that the processing of this speech synthesizer can be performed at higher speed. It becomes possible to.
- the embodiments of the present invention have been described.
- the signal coupling device according to the present invention can be realized using an ordinary computer system without using a dedicated system.
- the delay unit 1A (or HP F6A), the delay unit IB (or) HP F6B, the Fourier transform unit 2A, the Fourier transform unit 2B, the adder unit 3, and the filter characteristic determination
- a speech synthesizer that executes the above processing is configured.
- the program may be posted on a bulletin board (BBS) of a communication line and distributed via the communication line.
- the carrier wave is modulated by a signal representing the program, and the obtained modulated wave To transmit this
- the device that has received the modulated wave may demodulate the modulated wave and restore the program.
- the recording medium shall include the program excluding the part. May be stored. Also in this case, in the present invention, it is assumed that the recording medium stores a program for executing each function or step executed by the computer.
- the present invention employs the above-described configuration, the harmonic component generated by the discontinuous change of the coupling portion of the audio waveform signal is effectively removed. As a result, the sense of noise in the synthesized speech signal is significantly reduced, and a very natural synthesized speech can be generated.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Telephonic Communication Services (AREA)
- Measurement And Recording Of Electrical Phenomena And Electrical Characteristics Of The Living Body (AREA)
- Noise Elimination (AREA)
Description
Claims
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE0001403851T DE02738817T1 (de) | 2001-07-02 | 2002-06-27 | Signalkoppelverfahren und -vorrichtung |
US10/362,870 US7739112B2 (en) | 2001-07-02 | 2002-06-27 | Signal coupling method and apparatus |
DE60233658T DE60233658D1 (de) | 2001-07-02 | 2002-06-27 | Konkatenation von Sprachsignalen |
EP02738817A EP1403851B1 (en) | 2001-07-02 | 2002-06-27 | Concatenation of voice signals |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2001201408A JP3901475B2 (ja) | 2001-07-02 | 2001-07-02 | 信号結合装置、信号結合方法及びプログラム |
JP2001-201408 | 2001-07-02 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2003005342A1 true WO2003005342A1 (fr) | 2003-01-16 |
Family
ID=19038376
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2002/006479 WO2003005342A1 (fr) | 2001-07-02 | 2002-06-27 | Procede et appareil de couplage de signaux |
Country Status (5)
Country | Link |
---|---|
US (1) | US7739112B2 (ja) |
EP (1) | EP1403851B1 (ja) |
JP (1) | JP3901475B2 (ja) |
DE (2) | DE02738817T1 (ja) |
WO (1) | WO2003005342A1 (ja) |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7440902B2 (en) * | 2002-04-12 | 2008-10-21 | International Business Machines Corporation | Service development tool and capabilities for facilitating management of service elements |
US7533026B2 (en) * | 2002-04-12 | 2009-05-12 | International Business Machines Corporation | Facilitating management of service elements usable in providing information technology service offerings |
US7562022B2 (en) * | 2002-04-12 | 2009-07-14 | International Business Machines Corporation | Packaging and distributing service elements |
JP4396646B2 (ja) * | 2006-02-07 | 2010-01-13 | ヤマハ株式会社 | 応答波形合成方法、応答波形合成装置、音響設計支援装置および音響設計支援プログラム |
JP4973492B2 (ja) * | 2007-01-30 | 2012-07-11 | 株式会社Jvcケンウッド | 再生装置、再生方法及び再生プログラム |
JP4470122B2 (ja) * | 2007-06-18 | 2010-06-02 | 株式会社アクセル | 音声符号化装置、音声復号化装置、音声符号化プログラムおよび音声復号化プログラム |
US20090167947A1 (en) * | 2007-12-27 | 2009-07-02 | Naoko Satoh | Video data processor and data bus management method thereof |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS62139599A (ja) * | 1985-12-13 | 1987-06-23 | 松下電工株式会社 | 音声合成装置 |
JPH05273998A (ja) * | 1992-03-30 | 1993-10-22 | Toshiba Corp | 音声符号化装置 |
JPH0772897A (ja) * | 1993-09-01 | 1995-03-17 | Nippon Telegr & Teleph Corp <Ntt> | 音声合成方法および装置 |
JPH08335095A (ja) * | 1995-06-02 | 1996-12-17 | Matsushita Electric Ind Co Ltd | 音声波形接続方法 |
JPH10207455A (ja) * | 1996-11-20 | 1998-08-07 | Yamaha Corp | 音信号分析装置及び方法 |
JPH11352996A (ja) * | 1998-06-10 | 1999-12-24 | Nec Corp | 音声規則合成装置 |
JP2000172285A (ja) * | 1998-11-25 | 2000-06-23 | Matsushita Electric Ind Co Ltd | フィルタパラメ―タとソ―ス領域において独立にクロスフェ―ドを行う半音節結合型のフォルマントベ―スのスピ―チシンセサイザ |
JP2000310994A (ja) * | 1999-04-27 | 2000-11-07 | Ntt Data Corp | 音声素片作成装置、音声合成装置、音声素片作成方法、音声合成方法及び記録媒体 |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3678416A (en) * | 1971-07-26 | 1972-07-18 | Richard S Burwen | Dynamic noise filter having means for varying cutoff point |
FR2636163B1 (fr) * | 1988-09-02 | 1991-07-05 | Hamon Christian | Procede et dispositif de synthese de la parole par addition-recouvrement de formes d'onde |
DE69028072T2 (de) * | 1989-11-06 | 1997-01-09 | Canon Kk | Verfahren und Einrichtung zur Sprachsynthese |
US5765127A (en) * | 1992-03-18 | 1998-06-09 | Sony Corp | High efficiency encoding method |
GB2272615A (en) * | 1992-11-17 | 1994-05-18 | Rudolf Bisping | Controlling signal-to-noise ratio in noisy recordings |
US5463715A (en) * | 1992-12-30 | 1995-10-31 | Innovation Technologies | Method and apparatus for speech generation from phonetic codes |
JPH08254993A (ja) * | 1995-03-16 | 1996-10-01 | Toshiba Corp | 音声合成装置 |
US6240384B1 (en) * | 1995-12-04 | 2001-05-29 | Kabushiki Kaisha Toshiba | Speech synthesis method |
BE1010336A3 (fr) * | 1996-06-10 | 1998-06-02 | Faculte Polytechnique De Mons | Procede de synthese de son. |
JPH10187195A (ja) * | 1996-12-26 | 1998-07-14 | Canon Inc | 音声合成方法および装置 |
US6490562B1 (en) * | 1997-04-09 | 2002-12-03 | Matsushita Electric Industrial Co., Ltd. | Method and system for analyzing voices |
DE19861167A1 (de) * | 1998-08-19 | 2000-06-15 | Christoph Buskies | Verfahren und Vorrichtung zur koartikulationsgerechten Konkatenation von Audiosegmenten sowie Vorrichtungen zur Bereitstellung koartikulationsgerecht konkatenierter Audiodaten |
-
2001
- 2001-07-02 JP JP2001201408A patent/JP3901475B2/ja not_active Expired - Fee Related
-
2002
- 2002-06-27 EP EP02738817A patent/EP1403851B1/en not_active Expired - Lifetime
- 2002-06-27 US US10/362,870 patent/US7739112B2/en not_active Expired - Fee Related
- 2002-06-27 WO PCT/JP2002/006479 patent/WO2003005342A1/ja active Application Filing
- 2002-06-27 DE DE0001403851T patent/DE02738817T1/de active Pending
- 2002-06-27 DE DE60233658T patent/DE60233658D1/de not_active Expired - Lifetime
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS62139599A (ja) * | 1985-12-13 | 1987-06-23 | 松下電工株式会社 | 音声合成装置 |
JPH05273998A (ja) * | 1992-03-30 | 1993-10-22 | Toshiba Corp | 音声符号化装置 |
JPH0772897A (ja) * | 1993-09-01 | 1995-03-17 | Nippon Telegr & Teleph Corp <Ntt> | 音声合成方法および装置 |
JPH08335095A (ja) * | 1995-06-02 | 1996-12-17 | Matsushita Electric Ind Co Ltd | 音声波形接続方法 |
JPH10207455A (ja) * | 1996-11-20 | 1998-08-07 | Yamaha Corp | 音信号分析装置及び方法 |
JPH11352996A (ja) * | 1998-06-10 | 1999-12-24 | Nec Corp | 音声規則合成装置 |
JP2000172285A (ja) * | 1998-11-25 | 2000-06-23 | Matsushita Electric Ind Co Ltd | フィルタパラメ―タとソ―ス領域において独立にクロスフェ―ドを行う半音節結合型のフォルマントベ―スのスピ―チシンセサイザ |
JP2000310994A (ja) * | 1999-04-27 | 2000-11-07 | Ntt Data Corp | 音声素片作成装置、音声合成装置、音声素片作成方法、音声合成方法及び記録媒体 |
Non-Patent Citations (1)
Title |
---|
See also references of EP1403851A4 * |
Also Published As
Publication number | Publication date |
---|---|
DE02738817T1 (de) | 2004-08-26 |
EP1403851A4 (en) | 2005-10-26 |
EP1403851A1 (en) | 2004-03-31 |
US20040015359A1 (en) | 2004-01-22 |
EP1403851B1 (en) | 2009-09-09 |
JP2003015681A (ja) | 2003-01-17 |
JP3901475B2 (ja) | 2007-04-04 |
DE60233658D1 (de) | 2009-10-22 |
US7739112B2 (en) | 2010-06-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP0910065B1 (en) | Speaking speed changing method and device | |
US20060130637A1 (en) | Method for differentiated digital voice and music processing, noise filtering, creation of special effects and device for carrying out said method | |
JP2008191659A (ja) | 音声強調方法及び音声再生システム | |
JP4254479B2 (ja) | オーディオ帯域拡張再生装置 | |
JP3430985B2 (ja) | 合成音生成装置 | |
JP2005157363A (ja) | フォルマント帯域を利用したダイアログエンハンシング方法及び装置 | |
WO2003005342A1 (fr) | Procede et appareil de couplage de signaux | |
JP3379348B2 (ja) | ピッチ変換器 | |
JP2001255882A (ja) | 音声信号処理装置及びその信号処理方法 | |
JP2003157100A (ja) | 音声通信方法及び装置、並びに音声通信プログラム | |
JP2000081897A (ja) | 音声情報の記録方法、音声情報記録媒体、並びに音声情報の再生方法及び再生装置 | |
JP4433668B2 (ja) | 帯域拡張装置及び方法 | |
EP0421531B1 (en) | Device for sound synthesis | |
JP3515215B2 (ja) | 音声符号化装置 | |
JP2650355B2 (ja) | 音声分析合成装置 | |
JPH11109993A (ja) | 音声素片接続方法及び音声合成装置 | |
JPH10111699A (ja) | 音声再生装置 | |
JP2002311980A (ja) | 音声合成方法、音声合成装置、半導体装置及び音声合成プログラム | |
JP2005062442A (ja) | 波形接続装置、波形接続方法及びプログラム | |
JP2000099094A (ja) | 時系列信号処理装置 | |
JP3927617B2 (ja) | ゲーム用音源装置 | |
JP5899865B2 (ja) | 音響信号加工装置及びプログラム | |
JPH02247700A (ja) | 音声合成装置 | |
JP2000242287A (ja) | 発声補助装置およびプログラム記録媒体 | |
JPS59168494A (ja) | 音声合成方式 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AU CA CN KR US |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2002738817 Country of ref document: EP Ref document number: 10362870 Country of ref document: US |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWP | Wipo information: published in national office |
Ref document number: 2002738817 Country of ref document: EP |