CN101930747A - Method and device for converting voice into mouth shape image - Google Patents

Info

Publication number
CN101930747A
Authority
CN
China
Prior art keywords
mouth
voice
shape
mouth shape
speaks
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2010102408835A
Other languages
Chinese (zh)
Inventor
蒋一宁
付晓毅
蒋涛
张�成
蔺君刚
赵旭
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SICHUAN WEIDI DIGITAL TECHNOLOGY Co Ltd
Original Assignee
SICHUAN WEIDI DIGITAL TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2010-07-30
Filing date
2010-07-30
Publication date
2010-12-29
Application filed by SICHUAN WEIDI DIGITAL TECHNOLOGY Co Ltd
Priority to CN2010102408835A
Publication of CN101930747A

Abstract

The invention discloses a method and a device for converting voice into a mouth shape image. In the method, an acquisition unit first acquires the voice and a recognition unit performs spectrum analysis on it. Next, phonemes in the voice are recognized from the formants and the volume parameter obtained by the spectrum analysis, the recognized phonemes are assembled into a sequence, and a conversion unit converts the sequence, one phoneme at a time, into corresponding mouth shape models. The mouth opening parameters of the mouth shape models are then corrected according to the formants and the volume parameter. Finally, a display unit plays the corrected mouth shape models continuously in phoneme-sequence order to form the mouth shape image. In this way the phonemes in the voice are recognized, the parameters of the mouth shape models are determined from the phonemes, and a correct mouth shape model is obtained with the help of the formant and volume correction.

Description

Method and device for converting voice into a mouth shape image
Technical field
The present invention relates to technology in the communications field for converting between voice and mouth shape, and in particular to a method and device for converting voice into a mouth shape image.
Background
Existing schemes for converting between mouth shape and speech first capture the speech audio and the mouth shape video synchronously, then apply a specific recognition algorithm to the video to find certain syllables in the speech and their corresponding image sequences. In use, conversion in either direction is then performed from the recognized images or sound segments.
Chinese patent document publication No. CN101510256A, entitled "Mouth shape language conversion method and device", discloses a method in which the captured lip motion video is segmented into a set of mouth shape image sequences, and the set is recognized to obtain the corresponding speech syllables. The device comprises an acquisition module, a segmentation module and a recognition module. By segmenting the captured lip motion video into mouth shape image sequences and recognizing the speech syllables corresponding to those sequences, that invention converts mouth shape language into speech syllables, which addresses the conversation problem of people with speech impairments, meets their need to converse, and makes communication more convenient for them.
The method of that document identifies syllables in speech. (In Chinese, a syllable is the basic unit of speech that can be clearly distinguished by ear; one Chinese character is one syllable, and each syllable consists of three parts: an initial, a final and a tone.) In other words, what is identified is one or more of the initial, the final and the tone. However, the document does not explain how syllables are to be identified, nor how the corresponding mouth shape image is obtained once they have been identified, so the scheme appears impractical. Even if some method is used to identify syllables and convert them into mouth shape images, recognition errors and conversion errors remain, so the scheme cannot meet users' real needs or be convenient to use.
Summary of the invention
To overcome the above technical problems, the present invention provides a method and device for converting voice into a mouth shape image. Phonemes in the voice are recognized, the parameters of a mouth shape model are determined from the phonemes, and the model is then corrected using the formants and the volume to obtain a correct mouth shape model. The resulting mouth shape models can be assembled into a continuous mouth shape image for the user.
The technical solution of the present invention is as follows:
A method for converting voice into a mouth shape image, characterized by the following steps:
Acquire the voice, and recognize the acquired voice by spectrum analysis;
Form a phoneme sequence from the recognized phonemes;
Convert the phoneme sequence, one phoneme at a time, into the corresponding mouth shape models;
Correct the parameters of the mouth shape models according to formant frequency and volume, then play the corrected models continuously in phoneme-sequence order to form the mouth shape image.
Phoneme: the speech sounds of a language are divided, according to their physiological and physical properties, into a limited number of minimal speech units called phonemes. Phonemes are divided into vowels and consonants. The spectral envelope contains several relatively broad peaks, which are called formants. A speech signal can be described by its variation in time, frequency and intensity; a formant can be described as a signal with a certain intensity within a certain frequency range, lasting a certain time. A speech signal usually has three formants, and vowels and consonants can be identified from the pattern of change of the first and second formants. In addition, formant frequency and volume are related to how wide the lips are open: for example, the wider the mouth opens, the louder the sound.
A mouth shape model can be described by the lip shape formed by the upper and lower lips together with the size of the mouth opening; the lip shape is, for example, round or semicircular.
Formants are regions of the sound spectrum where energy is relatively concentrated. They are not only a determinant of sound quality but also reflect the physical characteristics of the vocal tract (the resonant cavity). As sound passes through the resonant cavity it is filtered by the cavity, so that the energy at different frequencies is redistributed in the frequency domain: part is reinforced by the resonance of the cavity, and the rest is attenuated. The reinforced frequencies appear as dense dark bands on the spectrogram produced by time-frequency analysis. Because the energy distribution is uneven and the strong parts resemble mountain peaks, they are called formants. The broad spectrum of the voice and of most musical instruments contains some fixed frequency peaks, and in the sound spectrum these peaks are the formants. In speech acoustics, formants determine the quality of vowels; in computer music, they are important parameters that determine timbre and sound quality.
The formants and the volume can be obtained by performing spectrum analysis on the voice, from which the vowel and consonant phonemes in the voice can be identified.
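The spectrum analysis step can be illustrated with a minimal sketch. Everything named here is an assumption for illustration and not part of the source: the function names are hypothetical, a naive discrete Fourier transform stands in for whatever transform the digital signal processor uses, root-mean-square amplitude stands in for "volume", and the strongest raw spectral peaks stand in for formants (practical formant estimators usually work on the spectral envelope instead).

```python
import cmath
import math


def rms_volume(samples):
    """Root-mean-square amplitude, used here as a stand-in for volume."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def magnitude_spectrum(samples, sample_rate):
    """Naive DFT: returns (frequency_hz, magnitude) pairs up to Nyquist."""
    n = len(samples)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(s * cmath.exp(-2j * math.pi * k * i / n)
                  for i, s in enumerate(samples))
        spectrum.append((k * sample_rate / n, abs(acc)))
    return spectrum


def spectral_peaks(spectrum, count=3):
    """Frequencies of the `count` strongest local maxima, ascending."""
    peaks = [
        spectrum[i]
        for i in range(1, len(spectrum) - 1)
        if spectrum[i][1] > spectrum[i - 1][1]
        and spectrum[i][1] > spectrum[i + 1][1]
    ]
    strongest = sorted(peaks, key=lambda p: p[1], reverse=True)[:count]
    return sorted(freq for freq, _ in strongest)
```

For a short frame of samples, `spectral_peaks(magnitude_spectrum(frame, rate))` gives candidate peak frequencies and `rms_volume(frame)` gives a volume figure that later steps could use.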
The parameters of the mouth shape model are corrected according to formant frequency and volume for the following reason: when the voice is analyzed in the time domain, the time-domain parameters are sometimes identical, but this does not show that the mouth shape model obtained by conversion matches the actual voice. A speech signal not only varies with time but also carries frequency, phase and other information, so the frequency structure of the signal must be analyzed further and the signal described in the frequency domain.
A device for converting voice into a mouth shape image comprises an acquisition unit for acquiring voice, a recognition unit for performing spectrum analysis on the voice to obtain phonemes, a conversion unit for converting the phonemes into mouth shape models, and a display unit for playing the mouth shape models continuously and dynamically.
While the acquisition unit acquires the voice, the recognition unit simultaneously performs spectrum analysis on it to obtain the formants and the volume, and recognizes the phoneme sequence. The conversion unit then converts the phoneme sequence into mouth shape models and corrects the model parameters according to formant frequency and volume. Finally, the display unit plays the mouth shape models continuously and dynamically in sequence order to produce the mouth shape image.
The acquisition unit is a microphone. The microphone converts the acquired voice signal into a level signal and feeds it to a digital signal processor, which converts the level signal into a frequency-domain signal for spectrum analysis. A voice recognition unit then recognizes the formant frequencies, the volume and the phonemes from the frequency-domain signal.
The digital signal processor also converts the level signal into a digital signal, which is output through a loudspeaker connected to the digital signal processor.
The mouth shape image produced by the display unit comprises the basic mouth shape and a lip opening size parameter.
The beneficial effects of the present invention are as follows:
The present invention uses spectrum analysis to obtain the volume and the formants, which determine vowel quality, and recognizes the phonemes of the voice. The parameters of the mouth shape model are determined from the phonemes, and the model is then corrected using the formants and the volume to obtain a correct mouth shape model. The corrected models yield a highly accurate mouth shape image, which makes it easier for people with speech impairments to communicate with others.
Brief description of the drawings
Fig. 1 is a schematic structural diagram of the device of the present invention.
Fig. 2 is a schematic structural diagram of one embodiment of the device of the present invention.
Detailed description
A method for converting voice into a mouth shape image, with the following conversion steps:
Acquire the voice, and recognize the acquired voice by spectrum analysis;
Form a phoneme sequence from the recognized phonemes;
Convert the phoneme sequence, one phoneme at a time, into the corresponding mouth shape models;
Correct the parameters of the mouth shape models according to formant frequency and volume, then play the corrected models continuously in phoneme-sequence order to form the mouth shape image.
The formants and the volume can be obtained by performing spectrum analysis on the voice, from which the vowel and consonant phonemes can be identified. The mouth shape model is then corrected using formant frequency and volume, which yields a highly accurate mouth shape image.
A mouth shape model is described by the lip shape formed by the upper and lower lips together with the size of the mouth opening; the lip shape is, for example, round or semicircular.
The mouth shape image comprises the basic mouth shape (for example semicircular or round) and a lip opening size parameter (for example, the greater the volume, the wider the lips open).
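The two-part description above (a basic lip shape plus an opening size that grows with volume) can be sketched as a small data structure. This is only an illustration under stated assumptions: the class, the table entries, the 0.0 to 1.0 opening scale and the proportional scaling rule are all hypothetical, and the formant-based part of the correction is omitted because the text gives no formula for it.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class MouthShape:
    lip_shape: str   # basic shape, e.g. "round" or "semicircle"
    opening: float   # 0.0 (closed) to 1.0 (fully open); assumed scale


# Hypothetical phoneme-to-shape table; the values are placeholders.
BASE_SHAPES = {
    "a": MouthShape("semicircle", 0.8),
    "u": MouthShape("round", 0.3),
    "b": MouthShape("closed", 0.0),
}


def correct_opening(shape, volume, reference_volume=0.5):
    """Scale the opening in proportion to volume, clamped to [0, 1].

    The proportional rule and the reference volume are assumptions;
    the source only says that louder speech means a wider opening.
    """
    scaled = shape.opening * (volume / reference_volume)
    return replace(shape, opening=max(0.0, min(1.0, scaled)))
```

Under these assumptions, `correct_opening(BASE_SHAPES["u"], volume=1.0)` keeps the round lip shape and widens the opening, while a quiet input narrows it.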
As shown in Figs. 1-2, the device for converting voice into a mouth shape image comprises an acquisition unit for acquiring voice, a recognition unit for performing spectrum analysis on the voice to obtain phonemes, a conversion unit for converting the phonemes into mouth shape models, and a display unit for playing the mouth shape models continuously and dynamically.
While the acquisition unit acquires the voice, the recognition unit simultaneously performs spectrum analysis on it to obtain the formants and the volume, and recognizes the phoneme sequence. The conversion unit then converts the phoneme sequence into mouth shape models and corrects the model parameters according to formant frequency and volume. Finally, the display unit plays the mouth shape models continuously and dynamically in sequence order to produce the mouth shape image.
The acquisition unit is a microphone. The microphone converts the acquired voice signal into a level signal and feeds it to a digital signal processor. The digital signal processor first converts the level signal into a time-domain digital signal and then converts the time-domain digital signal into a frequency-domain signal for spectrum analysis. The voice recognition unit then recognizes the formant frequencies, the volume, and the vowel and consonant phonemes. The recognized phonemes are assembled one by one into a phoneme sequence, which is converted into mouth shape models. Because the mouth shape models obtained at this point are not accurate enough, they are corrected using formant frequency and volume. After correction, the display duration of each mouth shape image is adjusted according to the duration of the corresponding phoneme, which produces a continuous mouth shape image.
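The final timing step, in which each mouth shape is shown for as long as its phoneme lasts, can be sketched as follows. The function name, the millisecond unit and the fixed display frame rate are assumptions made for this illustration; the source does not specify them.

```python
def schedule_frames(phonemes, frame_rate=25):
    """Expand (label, duration_ms) pairs into one label per display frame.

    Frame counts are derived from the cumulative elapsed time, so
    rounding errors do not accumulate across phonemes.
    """
    frames = []
    elapsed_ms = 0.0
    for label, duration_ms in phonemes:
        elapsed_ms += duration_ms
        target_count = round(elapsed_ms * frame_rate / 1000)
        frames.extend([label] * (target_count - len(frames)))
    return frames
```

For example, at 25 frames per second an 80 ms phoneme followed by a 160 ms phoneme would occupy two frames and four frames respectively. A display loop would then look up the corrected mouth shape model for each frame's label.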
Formants can be extracted from the frequency-domain signal with filters. By choosing suitable filter bandwidths, the frequencies of the first, second and third formants, denoted F1, F2 and F3, can be obtained. Combined with how long the formants persist, vowels can then be identified (for example, F1 at 300-400 Hz, F2 at about 1000 Hz and a duration under 200 ms is identified as the vowel u), as can consonants (for example, F1 = 200 Hz, F2 = 720 Hz and F3 = 2100 Hz is identified as the consonants /b, p/).
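The two worked examples above can be written as a small rule table. This sketch encodes only those two examples; the function name and the 100 Hz tolerance around the quoted frequencies are assumptions, and a usable recognizer would need rules for every phoneme.

```python
def classify_phoneme(f1, f2, f3, duration_ms, tolerance_hz=100):
    """Return a phoneme label from formant frequencies (Hz), or None.

    Only the two examples given in the text are encoded; the
    tolerance band is an illustrative assumption.
    """
    # Vowel u: F1 in 300-400 Hz, F2 near 1000 Hz, shorter than 200 ms.
    if (300 <= f1 <= 400
            and abs(f2 - 1000) <= tolerance_hz
            and duration_ms < 200):
        return "u"
    # Consonants b/p: F1 near 200 Hz, F2 near 720 Hz, F3 near 2100 Hz.
    if (abs(f1 - 200) <= tolerance_hz
            and abs(f2 - 720) <= tolerance_hz
            and abs(f3 - 2100) <= tolerance_hz):
        return "b/p"
    return None
```

Returning `None` for anything outside the two rules makes the gaps explicit instead of forcing a guess.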
Because the mouth shape image obtained with this method and device is highly accurate, it can effectively help people with speech impairments communicate with others.

Claims (6)

1. A method for converting voice into a mouth shape image, characterized by the following steps:
Acquire the voice, and recognize the acquired voice by spectrum analysis;
Form a phoneme sequence from the recognized phonemes;
Convert the phoneme sequence, one phoneme at a time, into the corresponding mouth shape models;
Correct the parameters of the mouth shape models according to formant frequency and volume, then play the corrected models continuously in phoneme-sequence order to form the mouth shape image.
2. The method for converting voice into a mouth shape image according to claim 1, characterized in that the spectrum analysis yields the formants and the volume, and the recognition yields the phonemes in the voice, namely vowels and consonants.
3. A device for carrying out the method for converting voice into a mouth shape image according to claim 1 or 2, characterized in that it comprises an acquisition unit for acquiring voice, a recognition unit for performing spectrum analysis on the voice to obtain phonemes, a conversion unit for converting the phonemes into mouth shape models, and a display unit for playing the mouth shape models continuously and dynamically.
4. The device for converting voice into a mouth shape image according to claim 3, characterized in that while the acquisition unit acquires the voice, the recognition unit simultaneously performs spectrum analysis on it to obtain the formants and the volume, and recognizes the phoneme sequence; the conversion unit then converts the phoneme sequence into mouth shape models and corrects the model parameters according to formant frequency and volume; and finally the display unit plays the mouth shape models continuously and dynamically in sequence order to produce the mouth shape image.
5. The device for converting voice into a mouth shape image according to claim 3, characterized in that the acquisition unit is a microphone; the microphone converts the acquired voice signal into a level signal and feeds it to a digital signal processor; the digital signal processor converts the level signal into a frequency-domain signal for spectrum analysis; and a voice recognition unit then recognizes the formant frequencies, the volume and the phonemes from the frequency-domain signal.
6. The device for converting voice into a mouth shape image according to claim 3, characterized in that the mouth shape image produced by the display unit comprises the basic mouth shape and a lip opening size parameter.
CN2010102408835A 2010-07-30 2010-07-30 Method and device for converting voice into mouth shape image Pending CN101930747A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2010102408835A CN101930747A (en) 2010-07-30 2010-07-30 Method and device for converting voice into mouth shape image

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2010102408835A CN101930747A (en) 2010-07-30 2010-07-30 Method and device for converting voice into mouth shape image

Publications (1)

Publication Number Publication Date
CN101930747A true CN101930747A (en) 2010-12-29

Family

ID=43369880

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2010102408835A Pending CN101930747A (en) 2010-07-30 2010-07-30 Method and device for converting voice into mouth shape image

Country Status (1)

Country Link
CN (1) CN101930747A (en)

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104574474A (en) * 2015-01-09 2015-04-29 何玉欣 Matching method for generating language mouth shapes of cartoon characters through subtitles
CN104574478A (en) * 2014-12-30 2015-04-29 北京像素软件科技股份有限公司 Method and device for editing mouth shapes of animation figures
CN106297792A (en) * 2016-09-14 2017-01-04 厦门幻世网络科技有限公司 The recognition methods of a kind of voice mouth shape cartoon and device
CN106446406A (en) * 2016-09-23 2017-02-22 天津大学 Simulation system and simulation method for converting Chinese sentences into human mouth shapes
CN106653050A (en) * 2017-02-08 2017-05-10 康梅 Method for matching animation mouth shapes with voice in real time
CN107831684A (en) * 2016-09-16 2018-03-23 天津思博科科技发展有限公司 Using the shape of the mouth as one speaks pronunciation transposition of realizing of Robot Vision
CN108962251A (en) * 2018-06-26 2018-12-07 珠海金山网络游戏科技有限公司 A kind of game role Chinese speech automatic identifying method
CN109087629A (en) * 2018-08-24 2018-12-25 苏州玩友时代科技股份有限公司 A kind of mouth shape cartoon implementation method and device based on speech recognition
CN109087651A (en) * 2018-09-05 2018-12-25 广州势必可赢网络科技有限公司 A kind of vocal print identification method, system and equipment based on video and sound spectrograph
CN109949390A (en) * 2017-12-21 2019-06-28 腾讯科技(深圳)有限公司 Image generating method, dynamic expression image generating method and device
CN110149548A (en) * 2018-09-26 2019-08-20 腾讯科技(深圳)有限公司 Video dubbing method, electronic device and readable storage medium storing program for executing
CN110867177A (en) * 2018-08-16 2020-03-06 林其禹 Voice playing system with selectable timbre, playing method thereof and readable recording medium
CN112700520A (en) * 2020-12-30 2021-04-23 上海幻维数码创意科技股份有限公司 Mouth shape expression animation generation method and device based on formants and storage medium
CN112750187A (en) * 2021-01-19 2021-05-04 腾讯科技(深圳)有限公司 Animation generation method, device and equipment and computer readable storage medium
CN113112575A (en) * 2021-04-08 2021-07-13 深圳市山水原创动漫文化有限公司 Mouth shape generation method and device, computer equipment and storage medium
CN113327483A (en) * 2021-04-30 2021-08-31 云南北飞科技有限公司 Language training method for simulating pronunciation and air flow changes based on 3D tongue position model
CN116580721A (en) * 2023-07-13 2023-08-11 中国电信股份有限公司 Expression animation generation method and device and digital human platform

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1379348A (en) * 2002-05-17 2002-11-13 清华大学 Method and system for computer conversion between Chinese audio and video parameters
US20030160791A1 (en) * 2000-07-13 2003-08-28 Gaspard Breton Facial animation method
CN101290720A (en) * 2008-06-17 2008-10-22 李伟 Visualized pronunciation teaching method and apparatus
US20090037179A1 (en) * 2007-07-30 2009-02-05 International Business Machines Corporation Method and Apparatus for Automatically Converting Voice
CN101482975A (en) * 2008-01-07 2009-07-15 丰达软件(苏州)有限公司 Method and apparatus for converting words into animation
CN101510256A (en) * 2009-03-20 2009-08-19 深圳华为通信技术有限公司 Mouth shape language conversion method and device

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030160791A1 (en) * 2000-07-13 2003-08-28 Gaspard Breton Facial animation method
CN1379348A (en) * 2002-05-17 2002-11-13 清华大学 Method and system for computer conversion between Chinese audio and video parameters
US20090037179A1 (en) * 2007-07-30 2009-02-05 International Business Machines Corporation Method and Apparatus for Automatically Converting Voice
CN101482975A (en) * 2008-01-07 2009-07-15 丰达软件(苏州)有限公司 Method and apparatus for converting words into animation
CN101290720A (en) * 2008-06-17 2008-10-22 李伟 Visualized pronunciation teaching method and apparatus
CN101510256A (en) * 2009-03-20 2009-08-19 深圳华为通信技术有限公司 Mouth shape language conversion method and device

Non-Patent Citations (1)

Title
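Computer Engineering and Design (《计算机工程与设计》) 20040229 Hou Yarong (侯亚荣) et al. Research on automatic recognition and verification of lip synchronization (唇同步的自动识别与验证研究) 166-169 1-6 Vol. 25, No. 2 2 *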

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104574478A (en) * 2014-12-30 2015-04-29 北京像素软件科技股份有限公司 Method and device for editing mouth shapes of animation figures
CN104574474A (en) * 2015-01-09 2015-04-29 何玉欣 Matching method for generating language mouth shapes of cartoon characters through subtitles
CN106297792A (en) * 2016-09-14 2017-01-04 厦门幻世网络科技有限公司 The recognition methods of a kind of voice mouth shape cartoon and device
CN107831684A (en) * 2016-09-16 2018-03-23 天津思博科科技发展有限公司 Using the shape of the mouth as one speaks pronunciation transposition of realizing of Robot Vision
CN106446406A (en) * 2016-09-23 2017-02-22 天津大学 Simulation system and simulation method for converting Chinese sentences into human mouth shapes
CN106653050A (en) * 2017-02-08 2017-05-10 康梅 Method for matching animation mouth shapes with voice in real time
CN109949390A (en) * 2017-12-21 2019-06-28 腾讯科技(深圳)有限公司 Image generating method, dynamic expression image generating method and device
CN108962251A (en) * 2018-06-26 2018-12-07 珠海金山网络游戏科技有限公司 A kind of game role Chinese speech automatic identifying method
CN110867177A (en) * 2018-08-16 2020-03-06 林其禹 Voice playing system with selectable timbre, playing method thereof and readable recording medium
CN109087629A (en) * 2018-08-24 2018-12-25 苏州玩友时代科技股份有限公司 A kind of mouth shape cartoon implementation method and device based on speech recognition
CN109087651A (en) * 2018-09-05 2018-12-25 广州势必可赢网络科技有限公司 A kind of vocal print identification method, system and equipment based on video and sound spectrograph
CN110149548A (en) * 2018-09-26 2019-08-20 腾讯科技(深圳)有限公司 Video dubbing method, electronic device and readable storage medium storing program for executing
CN112700520A (en) * 2020-12-30 2021-04-23 上海幻维数码创意科技股份有限公司 Mouth shape expression animation generation method and device based on formants and storage medium
CN112700520B (en) * 2020-12-30 2024-03-26 上海幻维数码创意科技股份有限公司 Formant-based mouth shape expression animation generation method, device and storage medium
CN112750187A (en) * 2021-01-19 2021-05-04 腾讯科技(深圳)有限公司 Animation generation method, device and equipment and computer readable storage medium
CN113112575A (en) * 2021-04-08 2021-07-13 深圳市山水原创动漫文化有限公司 Mouth shape generation method and device, computer equipment and storage medium
CN113112575B (en) * 2021-04-08 2024-04-30 深圳市山水原创动漫文化有限公司 Mouth shape generating method and device, computer equipment and storage medium
CN113327483A (en) * 2021-04-30 2021-08-31 云南北飞科技有限公司 Language training method for simulating pronunciation and air flow changes based on 3D tongue position model
CN116580721A (en) * 2023-07-13 2023-08-11 中国电信股份有限公司 Expression animation generation method and device and digital human platform
CN116580721B (en) * 2023-07-13 2023-09-22 中国电信股份有限公司 Expression animation generation method and device and digital human platform

Similar Documents

Publication Publication Date Title
CN101930747A (en) Method and device for converting voice into mouth shape image
CN104272382B (en) Personalized singing synthetic method based on template and system
CN106898340B (en) Song synthesis method and terminal
CN108847215B (en) Method and device for voice synthesis based on user timbre
CN109767778B (en) Bi-L STM and WaveNet fused voice conversion method
CN1815552B (en) Frequency spectrum modelling and voice reinforcing method based on line spectrum frequency and its interorder differential parameter
WO1997029482A1 (en) Speech coding, reconstruction and recognition using acoustics and electromagnetic waves
CN102543073A (en) Shanghai dialect phonetic recognition information processing method
Hansen et al. On the issues of intra-speaker variability and realism in speech, speaker, and language recognition tasks
CN109616131B (en) Digital real-time voice sound changing method
Wand et al. Deep Neural Network Frontend for Continuous EMG-Based Speech Recognition.
WO2015129465A1 (en) Voice clarification device and computer program therefor
CN113436606B (en) Original sound speech translation method
CN105825868B (en) A kind of extracting method of the effective range of singer
EP1280137A1 (en) Method for speaker identification
CN103035252B (en) Chinese speech signal processing method, Chinese speech signal processing device and hearing aid device
CN112992109A (en) Auxiliary singing system, auxiliary singing method and non-instantaneous computer readable recording medium
CN110570842B (en) Speech recognition method and system based on phoneme approximation degree and pronunciation standard degree
JP4381404B2 (en) Speech synthesis system, speech synthesis method, speech synthesis program
CN114550701A (en) Deep neural network-based Chinese electronic larynx voice conversion device and method
CN103035237B (en) Chinese speech signal processing method, device and hearing aid device
Millhouse et al. Perceptual characterisation of the singer’s formant region: a preliminary study
CN114283822A (en) Many-to-one voice conversion method based on gamma pass frequency cepstrum coefficient
Nasreen et al. Speech analysis for automatic speech recognition
CN109697985B (en) Voice signal processing method and device and terminal

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20101229