CN101930747A - Method and device for converting voice into mouth shape image - Google Patents
- Publication number
- CN101930747A (application number CN2010102408835A / CN201010240883A)
- Authority
- CN
- China
- Prior art keywords
- mouth
- voice
- shape
- mouth shape
- speaks
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Abstract
The invention discloses a method and a device for converting voice into a mouth shape image. The method comprises the following steps: first, voice is acquired by an acquisition unit and spectrum analysis is performed on it by a recognition unit; second, phonemes in the voice are recognized from the formants (resonance peaks) and the volume parameter obtained by the spectrum analysis, the recognized phonemes are formed into a sequence, and a conversion unit converts the sequence one by one into corresponding mouth shape models; third, the mouth opening parameters of the mouth shape models are corrected according to the formants and the volume parameter; finally, a display unit plays the corrected mouth shape models continuously in phoneme-sequence order to form the mouth shape image. The phonemes in the voice can thus be recognized, the parameters of the mouth shape models are determined from the phonemes, and a correct mouth shape model is obtained with the help of the formant and volume correction.
Description
Technical field
The present invention relates to conversion between voice and mouth shape in the communications field, and in particular to a method and device for converting voice into a mouth shape image.
Background art
Existing schemes for converting between mouth shape and speech first capture the speech audio and the mouth shape video synchronously, then apply a specific recognition algorithm to the video to find certain syllables in the speech and their corresponding image sequences. In use, conversion is performed in either direction according to the recognized images or sound segments.
Chinese patent document CN101510256A, titled "Mouth shape language conversion method and device", discloses the following method: the captured lip-motion video is segmented into a set of mouth shape image sequences, and this set is recognized to obtain the corresponding speech syllables. The device comprises an acquisition module, a segmentation module and a recognition module. By segmenting the captured lip-motion video into mouth shape image sequences and recognizing the speech syllables they correspond to, that invention converts mouth shape language into speech syllables. It addresses the conversation problem of people with speech impairments, meets their need to converse, and makes communication more convenient for them.
The method in that document for converting voice into images recognizes the syllables in the voice. (In Chinese, a syllable is the basic unit of speech that can be clearly distinguished by ear; one Chinese character is one syllable, and each syllable consists of three parts: initial, final and tone.) In other words, what is recognized is one or more of the initial, the final and the tone. However, the document does not explain how the syllables are recognized, nor how the corresponding mouth shape image is obtained after recognition, so the technical scheme appears impractical. Even if some method were available for recognizing syllables and converting them into mouth shape images, recognition errors and conversion errors would remain, and such a scheme could neither meet users' actual needs nor be convenient to use.
Summary of the invention
To overcome the above technical problems, the present invention provides a method and device for converting voice into a mouth shape image. The phonemes in the voice are recognized, the parameters of the mouth shape model are determined from the phonemes, and a correct mouth shape model is then obtained through correction based on the formants (resonance peaks) and the volume. The resulting mouth shape models can be formed into a continuous mouth shape image for the user.
The technical scheme of the present invention is as follows:
A method for converting voice into a mouth shape image, characterized by the following steps:
Acquire voice, and recognize the acquired voice by spectrum analysis;
Form the recognized phonemes into a phoneme sequence;
Convert the phoneme sequence one by one into corresponding mouth shape models;
Correct the parameters of the mouth shape models according to formant frequency and volume, and play the corrected models continuously in phoneme-sequence order to form the mouth shape image.
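The steps above can be sketched as a small pipeline. This is a minimal illustration under stated assumptions, not the patented implementation: the phoneme labels, the `BASE_SHAPES` table, the 0-to-1 opening scale and the correction rule (averaging the table opening with a normalized volume) are all hypothetical, and the formant part of the correction is left out for brevity.

```python
from dataclasses import dataclass

# Hypothetical lookup table: phoneme -> (basic lip shape, default opening in 0..1).
BASE_SHAPES = {
    "a": ("circle", 0.8),
    "u": ("circle", 0.3),
    "b": ("closed", 0.0),
}

@dataclass
class MouthShape:
    phoneme: str
    lip_shape: str
    opening: float  # 0.0 = closed, 1.0 = fully open (assumed scale)

def to_mouth_shapes(phonemes):
    """Convert the phoneme sequence to mouth shape models one by one."""
    return [MouthShape(p, *BASE_SHAPES[p]) for p in phonemes]

def correct(shape, volume):
    """Correct the opening using volume (assumed rule: average the table
    value with the volume after clamping the volume to 0..1)."""
    v = min(max(volume, 0.0), 1.0)
    return MouthShape(shape.phoneme, shape.lip_shape, round((shape.opening + v) / 2, 3))

def convert(phonemes, volumes):
    """Conversion and correction for an already recognized phoneme sequence."""
    shapes = to_mouth_shapes(phonemes)
    return [correct(s, v) for s, v in zip(shapes, volumes)]

frames = convert(["b", "a", "u"], [0.1, 1.0, 0.5])
for f in frames:
    print(f)
```

Keeping conversion (table lookup) separate from correction (signal-dependent adjustment) mirrors the order of the steps in the text: the phoneme fixes the basic shape, and the measured signal only refines the opening.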
Phoneme: the smallest unit of speech, limited in number, into which the speech sounds of a language are divided according to their physiological and physical properties. Phonemes are divided into vowels and consonants. The spectral envelope contains several relatively broad peaks, called formants (resonance peaks). A speech signal can be represented by its variation in time, frequency and intensity, and a formant can be expressed as a signal that has a certain energy within a certain frequency range and lasts a certain time. A speech signal usually has 3 formants; vowels and consonants can be identified from the pattern of change of the first and second formants. In addition, formant frequency and volume are related to how wide the lips are opened: for example, the wider the mouth is opened, the louder the sound.
A mouth shape model can be described by the lip shape formed by the upper and lower lips and by the size of the mouth opening; the lip shape may be, for example, a circle or a semicircle.
Formants are regions of the sound spectrum in which energy is relatively concentrated. They are not only a determinant of timbre but also reflect the physical characteristics of the vocal tract (the resonant cavity). When sound passes through the resonant cavity it is filtered by the cavity, so that energy at different frequencies is redistributed in the frequency domain: part is strengthened by the resonance of the cavity and the rest is attenuated. The strengthened frequencies appear as dense dark bands on the spectrogram produced by time-frequency analysis. Because the energy distribution is uneven and the strong parts resemble mountain peaks, they are called resonance peaks, or formants. Speech and most musical instruments have certain fixed frequency peaks within their broad spectral distribution, and these peaks are what the sound spectrum calls formants. In speech acoustics, formants determine the quality of vowels; in computer music, they are important parameters that determine timbre and tone quality.
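The filtering effect of a resonant cavity can be illustrated with a standard two-pole digital resonator, a common building block for modeling a single formant (resonance peak). The sketch below evaluates the resonator's gain at a few frequencies; the sample rate, the 500 Hz center frequency and the pole radius `r` are assumptions chosen only for this example.

```python
import cmath
import math

SAMPLE_RATE = 8000  # Hz, assumed

def resonator_gain(freq_hz, center_hz, r=0.95):
    """Magnitude response of the two-pole resonator
        y[n] = x[n] + 2*r*cos(theta)*y[n-1] - r*r*y[n-2]
    evaluated at freq_hz. The pole angle theta sets the resonance
    frequency and the pole radius r (0 < r < 1) sets its sharpness."""
    theta = 2 * math.pi * center_hz / SAMPLE_RATE
    w = 2 * math.pi * freq_hz / SAMPLE_RATE
    z1 = cmath.exp(-1j * w)  # z^-1 on the unit circle
    denom = 1 - 2 * r * math.cos(theta) * z1 + r * r * z1 * z1
    return 1 / abs(denom)

for f in (100, 500, 1000, 2000):
    print(f, round(resonator_gain(f, center_hz=500), 2))
```

Frequencies near the resonance are strengthened far more than frequencies away from it, which is the uneven energy distribution that shows up as a peak in the spectrum.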
The formants and the volume can be obtained by performing spectrum analysis on the voice, and from them the vowels and consonants that make up the phonemes in the voice can be recognized.
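As a rough illustration of obtaining the volume and the spectral peaks by spectrum analysis, the sketch below synthesizes a frame containing two sinusoids, computes its RMS volume and a plain DFT magnitude spectrum, and picks the strongest local peaks. The sample rate, frame length, test frequencies and peak-picking rule are assumptions made for this example. Practical formant estimation usually works on the spectral envelope (for instance via linear prediction) rather than on raw DFT peaks.

```python
import cmath
import math

SAMPLE_RATE = 8000   # Hz, assumed
FRAME_LEN = 400      # samples (50 ms), so each DFT bin is 20 Hz wide

def rms_volume(frame):
    """Root-mean-square amplitude, used here as the volume parameter."""
    return math.sqrt(sum(x * x for x in frame) / len(frame))

def magnitude_spectrum(frame):
    """Naive O(N^2) DFT magnitudes for bins 0..N/2 (fine for a short frame)."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * i / n) for i, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]

def peak_frequencies(spectrum, count):
    """Return the frequencies (Hz) of the `count` strongest local maxima."""
    bin_hz = SAMPLE_RATE / FRAME_LEN
    peaks = [
        (spectrum[k], k * bin_hz)
        for k in range(1, len(spectrum) - 1)
        if spectrum[k] > spectrum[k - 1] and spectrum[k] > spectrum[k + 1]
    ]
    strongest = sorted(peaks, reverse=True)[:count]
    return sorted(freq for _, freq in strongest)

# Synthetic frame with energy concentrated at 400 Hz and 1000 Hz.
frame = [
    math.sin(2 * math.pi * 400 * i / SAMPLE_RATE)
    + 0.5 * math.sin(2 * math.pi * 1000 * i / SAMPLE_RATE)
    for i in range(FRAME_LEN)
]

volume = rms_volume(frame)
peaks = peak_frequencies(magnitude_spectrum(frame), 2)
print(round(volume, 3), peaks)
```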
The parameters of the mouth shape model are corrected according to formant frequency and volume for the following reason: when voice is analyzed in the time domain, the time-domain parameters are sometimes identical, yet this does not guarantee that the mouth shape model obtained by conversion matches the actual voice. A speech signal not only varies with time but also depends on frequency, phase and other information, so the frequency structure of the signal must be analyzed further and the signal described in the frequency domain.
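The point that identical time-domain parameters can hide different sounds can be shown with two synthetic tones. In the sketch below, a 300 Hz tone and a 1000 Hz tone have the same RMS amplitude (a time-domain parameter), but their energy sits at different frequencies. The sample rate, frame length and tone frequencies are assumptions chosen so that each tone falls exactly on a DFT bin.

```python
import math

SAMPLE_RATE = 8000  # Hz, assumed
N = 400             # frame length in samples

def tone(freq_hz):
    """Unit-amplitude sine wave of the given frequency."""
    return [math.sin(2 * math.pi * freq_hz * i / SAMPLE_RATE) for i in range(N)]

def rms(frame):
    """Time-domain parameter: root-mean-square amplitude."""
    return math.sqrt(sum(x * x for x in frame) / len(frame))

def energy_at(frame, freq_hz):
    """Magnitude of the single DFT component at freq_hz."""
    re = sum(x * math.cos(2 * math.pi * freq_hz * i / SAMPLE_RATE) for i, x in enumerate(frame))
    im = sum(x * math.sin(2 * math.pi * freq_hz * i / SAMPLE_RATE) for i, x in enumerate(frame))
    return math.hypot(re, im)

low, high = tone(300), tone(1000)

# Same time-domain volume...
print(round(rms(low), 4), round(rms(high), 4))
# ...but the energy sits at different frequencies.
print(round(energy_at(low, 300)), round(energy_at(low, 1000)))
print(round(energy_at(high, 300)), round(energy_at(high, 1000)))
```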
A device for converting voice into a mouth shape image comprises an acquisition unit for acquiring voice, a recognition unit for performing spectrum analysis on the voice to obtain phonemes, a conversion unit for converting the phonemes into mouth shape models, and a display unit for continuous dynamic playback of the mouth shape models.
While the acquisition unit acquires voice, the recognition unit simultaneously performs spectrum analysis on the voice to obtain the formants and volume and recognizes the phoneme sequence. The conversion unit then converts the phoneme sequence into mouth shape models and corrects the parameters of the mouth shape models according to formant frequency and volume. Finally, the display unit plays the mouth shape models continuously and dynamically to produce the mouth shape image.
The acquisition unit is a microphone. The microphone converts the collected sound signal into a level signal and feeds it to a digital signal processor. The digital signal processor converts the level signal into the frequency-domain signal used for spectrum analysis, and the voice recognition unit then recognizes the formant frequencies, the volume and the phonemes from the frequency-domain signal.
The digital signal processor also converts the level signal into a digital signal, which is output through a loudspeaker connected to the digital signal processor.
The mouth shape image obtained through the display unit comprises the basic mouth shape and a parameter for the size of the lip opening.
Beneficial effect of the present invention is as follows:
Through spectrum analysis, the present invention obtains the formants and volume that determine vowel quality and recognizes the phonemes of the voice. The parameters of the mouth shape model are determined from the phonemes, and a correct mouth shape model is then obtained through formant and volume correction. The corrected mouth shape models yield a highly accurate mouth shape image, which makes it easier for people with speech impairments to communicate with others.
Description of drawings
Fig. 1 is a schematic structural diagram of the device of the present invention.
Fig. 2 is a schematic structural diagram of one embodiment of the device of the present invention.
Embodiment
A kind of speech conversion is become the method for mouth shape image, its switch process is as follows:
Gather voice, and the voice that collect are discerned by spectrum analysis;
The phoneme that identification obtains forms syllable sequence;
Syllable sequence is converted to corresponding shape of the mouth as one speaks model one by one;
Parameter according to formant frequency and volume correction shape of the mouth as one speaks model obtains playing the formation mouth shape image continuously according to syllable sequence.
The formants (resonance peaks) and the volume can be obtained by performing spectrum analysis on the voice, and from them the vowels and consonants of the voice phonemes can be recognized. The mouth shape models are then corrected using formant frequency and volume, which yields a highly accurate mouth shape image.
A mouth shape model is described by the lip shape formed by the upper and lower lips and by the size of the mouth opening; the lip shape may be, for example, a circle or a semicircle.
The mouth shape image comprises the basic mouth shape (for example, semicircle or circle) and a parameter for the size of the lip opening (for example, the greater the volume, the wider the lips open).
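One way to picture the two parameters named above, the basic mouth shape and the lip opening size, is to generate a simple lip outline from them. The sketch below is an assumption-laden illustration: the linear volume-to-opening rule, the 0-to-1 opening scale and the outline geometry are hypothetical and are not taken from the patent.

```python
import math

def opening_from_volume(volume, max_volume):
    """Assumed monotonic rule: louder speech gives a larger opening, capped at 1.0."""
    if max_volume <= 0:
        raise ValueError("max_volume must be positive")
    return min(max(volume / max_volume, 0.0), 1.0)

def lip_outline(lip_shape, opening, points=8):
    """Return (x, y) points of the lip outline, scaled vertically by the opening.
    A "circle" sweeps the full angle range; any other shape is treated as a
    semicircle and sweeps only the lower half."""
    span = 2 * math.pi if lip_shape == "circle" else math.pi
    outline = []
    for i in range(points):
        angle = span * i / (points - 1)
        x = round(math.cos(angle), 3)
        y = round(-math.sin(angle) * opening, 3)
        outline.append((x, y))
    return outline

quiet = lip_outline("semicircle", opening_from_volume(0.2, 1.0), points=5)
loud = lip_outline("semicircle", opening_from_volume(0.9, 1.0), points=5)
print(quiet)
print(loud)
```

The same basic shape drawn with a larger opening value reaches further down, which is the "greater volume, wider lips" relationship expressed as geometry.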
As shown in Figs. 1-2, the device for converting voice into a mouth shape image comprises an acquisition unit for acquiring voice, a recognition unit for performing spectrum analysis on the voice to obtain phonemes, a conversion unit for converting the phonemes into mouth shape models, and a display unit for continuous dynamic playback of the mouth shape models.
While the acquisition unit acquires voice, the recognition unit simultaneously performs spectrum analysis on the voice to obtain the formants and volume and recognizes the phoneme sequence. The conversion unit then converts the phoneme sequence into mouth shape models and corrects the parameters of the mouth shape models according to formant frequency and volume. Finally, the display unit plays the mouth shape models continuously and dynamically to produce the mouth shape image.
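The four units and the order in which they hand data to one another can be sketched as a small composition of callables. Everything below is a hypothetical stand-in: the class name, the tuple representations and the placeholder units (a fake microphone, a fake recognizer, a lookup-table converter and a text display) exist only to show the data flow, not how any real unit works.

```python
from typing import Callable, List, Sequence, Tuple

Phoneme = Tuple[str, float]      # (phoneme label, volume), assumed representation
MouthShape = Tuple[str, float]   # (basic lip shape, opening), assumed representation

class VoiceToMouthDevice:
    """Wires the four units together in the order the text describes."""

    def __init__(
        self,
        acquire: Callable[[], Sequence[float]],
        recognize: Callable[[Sequence[float]], List[Phoneme]],
        convert: Callable[[Phoneme], MouthShape],
        display: Callable[[List[MouthShape]], List[str]],
    ) -> None:
        self.acquire = acquire
        self.recognize = recognize
        self.convert = convert
        self.display = display

    def run(self) -> List[str]:
        samples = self.acquire()                       # acquisition unit
        phonemes = self.recognize(samples)             # recognition unit
        shapes = [self.convert(p) for p in phonemes]   # conversion unit
        return self.display(shapes)                    # display unit

# Placeholder units, for illustration only.
def fake_microphone():
    return [0.0, 0.5, -0.5, 0.25]

def fake_recognizer(samples):
    volume = max(abs(s) for s in samples)
    return [("a", volume), ("u", volume / 2)]

def table_converter(phoneme):
    label, volume = phoneme
    lip_shape = {"a": "circle", "u": "semicircle"}[label]  # hypothetical table
    return (lip_shape, volume)

def text_display(shapes):
    return [f"{shape} opening={opening:.2f}" for shape, opening in shapes]

device = VoiceToMouthDevice(fake_microphone, fake_recognizer, table_converter, text_display)
rendered = device.run()
print(rendered)
```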
The acquisition unit is a microphone. The microphone converts the collected sound signal into a level signal and feeds it to the digital signal processor. The digital signal processor first converts the level signal into a time-domain digital signal and then converts the time-domain digital signal into the frequency-domain signal used for spectrum analysis. The voice recognition unit then recognizes the formant frequencies, the volume, and the vowels and consonants of the phonemes. The phonemes recognized one by one form a phoneme sequence, and the phoneme sequence is converted into mouth shape models. Because the mouth shape models obtained at this point are not accurate enough, they are corrected using the formant frequencies and the volume. For the corrected mouth shape models, the display duration of each mouth shape image is adjusted according to the duration of the corresponding phoneme, which yields a continuous mouth shape image.
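The final step, holding each mouth shape on screen for as long as its phoneme lasted, can be sketched as building a frame timeline. The 40 ms frame interval, the `TimedShape` record and the sample durations are assumptions for illustration only.

```python
from dataclasses import dataclass

@dataclass
class TimedShape:
    phoneme: str
    opening: float
    duration_ms: int   # how long the phoneme lasted in the speech

def build_timeline(shapes, frame_ms=40):
    """Expand each mouth shape into display frames so that it stays on screen
    for roughly as long as its phoneme lasted (at least one frame).
    Returns a list of (start time in ms, phoneme, opening) tuples."""
    timeline = []
    start = 0
    for s in shapes:
        n_frames = max(1, round(s.duration_ms / frame_ms))
        for _ in range(n_frames):
            timeline.append((start, s.phoneme, s.opening))
            start += frame_ms
    return timeline

shapes = [
    TimedShape("b", 0.0, 80),
    TimedShape("a", 0.9, 200),
    TimedShape("u", 0.4, 120),
]
timeline = build_timeline(shapes)
for entry in timeline:
    print(entry)
```

Playing the entries in order at the chosen frame interval gives the continuous playback described in the text; a renderer could also interpolate the opening between neighboring entries for smoother motion.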
Formants can be extracted from the frequency-domain signal with filters. By choosing suitable filter bandwidths, the frequencies of the first, second and third formants, called F1, F2 and F3, can be obtained. Combined with the duration for which the formants persist, vowels can be recognized (for example, F1 at 300-400 Hz, F2 at about 1000 Hz and a duration of less than 200 ms is recognized as the vowel u), as can consonants (for example, F1 = 200, F2 = 720, F3 = 2100 is recognized as the consonant /b, p/).
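The two example rules above can be written as a small rule-based classifier. This is only a sketch of those two examples, not a full phoneme recognizer: the tolerance constants are assumptions (the text says only "about 1000 Hz" and gives single values for the consonant), and the consonant values are assumed to be in Hz.

```python
F2_TOLERANCE_HZ = 100     # assumed meaning of "about 1000 Hz"
MATCH_TOLERANCE_HZ = 50   # assumed tolerance around the consonant example values

def classify(f1, f2, f3=None, duration_ms=None):
    """Apply the two example rules from the text; return a label or None."""
    # Vowel example: F1 in 300-400 Hz, F2 about 1000 Hz, duration under 200 ms.
    is_vowel_u = (
        300 <= f1 <= 400
        and abs(f2 - 1000) <= F2_TOLERANCE_HZ
        and duration_ms is not None
        and duration_ms < 200
    )
    if is_vowel_u:
        return "vowel /u/"

    # Consonant example: F1 = 200, F2 = 720, F3 = 2100 (taken here as Hz).
    reference = (200, 720, 2100)
    if f3 is not None and all(
        abs(measured - ref) <= MATCH_TOLERANCE_HZ
        for measured, ref in zip((f1, f2, f3), reference)
    ):
        return "consonant /b, p/"

    return None  # no rule matched

print(classify(350, 1000, duration_ms=150))
print(classify(200, 720, 2100))
print(classify(350, 1000, duration_ms=250))
```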
Because the mouth shape image obtained with this method and device is highly accurate, it can effectively help people with speech impairments communicate with others.
Claims (6)
1. A method for converting voice into a mouth shape image, characterized by the following steps:
Acquire voice, and recognize the acquired voice by spectrum analysis;
Form the recognized phonemes into a phoneme sequence;
Convert the phoneme sequence one by one into corresponding mouth shape models;
Correct the parameters of the mouth shape models according to formant frequency and volume, and play the corrected models continuously in phoneme-sequence order to form the mouth shape image.
2. The method for converting voice into a mouth shape image according to claim 1, characterized in that the spectrum analysis yields the formants (resonance peaks) and the volume, and the recognition yields the phonemes in the voice, namely vowels and consonants.
3. A device for implementing the method for converting voice into a mouth shape image according to claim 1 or 2, characterized in that it comprises an acquisition unit for acquiring voice, a recognition unit for performing spectrum analysis on the voice to obtain phonemes, a conversion unit for converting the phonemes into mouth shape models, and a display unit for continuous dynamic playback of the mouth shape models.
4. The device for converting voice into a mouth shape image according to claim 3, characterized in that while the acquisition unit acquires voice, the recognition unit simultaneously performs spectrum analysis on the voice to obtain the formants and volume and recognizes the phoneme sequence; the conversion unit then converts the phoneme sequence into mouth shape models and corrects the parameters of the mouth shape models according to formant frequency and volume; and finally the display unit plays the mouth shape models continuously and dynamically to produce the mouth shape image.
5. The device for converting voice into a mouth shape image according to claim 3, characterized in that the acquisition unit is a microphone; the microphone converts the collected sound signal into a level signal and feeds it to a digital signal processor; the digital signal processor converts the level signal into the frequency-domain signal used for spectrum analysis; and the voice recognition unit then recognizes the formant frequencies, the volume and the phonemes from the frequency-domain signal.
6. The device for converting voice into a mouth shape image according to claim 3, characterized in that the mouth shape image obtained through the display unit comprises the basic mouth shape and a parameter for the size of the lip opening.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2010102408835A CN101930747A (en) | 2010-07-30 | 2010-07-30 | Method and device for converting voice into mouth shape image |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2010102408835A CN101930747A (en) | 2010-07-30 | 2010-07-30 | Method and device for converting voice into mouth shape image |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101930747A (en) | 2010-12-29 |
Family
ID=43369880
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2010102408835A Pending CN101930747A (en) | 2010-07-30 | 2010-07-30 | Method and device for converting voice into mouth shape image |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101930747A (en) |
Cited By (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104574474A (en) * | 2015-01-09 | 2015-04-29 | 何玉欣 | Matching method for generating language mouth shapes of cartoon characters through subtitles |
CN104574478A (en) * | 2014-12-30 | 2015-04-29 | 北京像素软件科技股份有限公司 | Method and device for editing mouth shapes of animation figures |
CN106297792A (en) * | 2016-09-14 | 2017-01-04 | 厦门幻世网络科技有限公司 | The recognition methods of a kind of voice mouth shape cartoon and device |
CN106446406A (en) * | 2016-09-23 | 2017-02-22 | 天津大学 | Simulation system and simulation method for converting Chinese sentences into human mouth shapes |
CN106653050A (en) * | 2017-02-08 | 2017-05-10 | 康梅 | Method for matching animation mouth shapes with voice in real time |
CN107831684A (en) * | 2016-09-16 | 2018-03-23 | 天津思博科科技发展有限公司 | Using the shape of the mouth as one speaks pronunciation transposition of realizing of Robot Vision |
CN108962251A (en) * | 2018-06-26 | 2018-12-07 | 珠海金山网络游戏科技有限公司 | A kind of game role Chinese speech automatic identifying method |
CN109087629A (en) * | 2018-08-24 | 2018-12-25 | 苏州玩友时代科技股份有限公司 | A kind of mouth shape cartoon implementation method and device based on speech recognition |
CN109087651A (en) * | 2018-09-05 | 2018-12-25 | 广州势必可赢网络科技有限公司 | A kind of vocal print identification method, system and equipment based on video and sound spectrograph |
CN109949390A (en) * | 2017-12-21 | 2019-06-28 | 腾讯科技(深圳)有限公司 | Image generating method, dynamic expression image generating method and device |
CN110149548A (en) * | 2018-09-26 | 2019-08-20 | 腾讯科技(深圳)有限公司 | Video dubbing method, electronic device and readable storage medium storing program for executing |
CN110867177A (en) * | 2018-08-16 | 2020-03-06 | 林其禹 | Voice playing system with selectable timbre, playing method thereof and readable recording medium |
CN112700520A (en) * | 2020-12-30 | 2021-04-23 | 上海幻维数码创意科技股份有限公司 | Mouth shape expression animation generation method and device based on formants and storage medium |
CN112750187A (en) * | 2021-01-19 | 2021-05-04 | 腾讯科技(深圳)有限公司 | Animation generation method, device and equipment and computer readable storage medium |
CN113112575A (en) * | 2021-04-08 | 2021-07-13 | 深圳市山水原创动漫文化有限公司 | Mouth shape generation method and device, computer equipment and storage medium |
CN113327483A (en) * | 2021-04-30 | 2021-08-31 | 云南北飞科技有限公司 | Language training method for simulating pronunciation and air flow changes based on 3D tongue position model |
CN116580721A (en) * | 2023-07-13 | 2023-08-11 | 中国电信股份有限公司 | Expression animation generation method and device and digital human platform |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1379348A (en) * | 2002-05-17 | 2002-11-13 | 清华大学 | Method and system for computer conversion between Chinese audio and video parameters |
US20030160791A1 (en) * | 2000-07-13 | 2003-08-28 | Gaspard Breton | Facial animation method |
CN101290720A (en) * | 2008-06-17 | 2008-10-22 | 李伟 | Visualized pronunciation teaching method and apparatus |
US20090037179A1 (en) * | 2007-07-30 | 2009-02-05 | International Business Machines Corporation | Method and Apparatus for Automatically Converting Voice |
CN101482975A (en) * | 2008-01-07 | 2009-07-15 | 丰达软件(苏州)有限公司 | Method and apparatus for converting words into animation |
CN101510256A (en) * | 2009-03-20 | 2009-08-19 | 深圳华为通信技术有限公司 | Mouth shape language conversion method and device |
- 2010-07-30: CN application CN2010102408835A filed (publication CN101930747A), status Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030160791A1 (en) * | 2000-07-13 | 2003-08-28 | Gaspard Breton | Facial animation method |
CN1379348A (en) * | 2002-05-17 | 2002-11-13 | 清华大学 | Method and system for computer conversion between Chinese audio and video parameters |
US20090037179A1 (en) * | 2007-07-30 | 2009-02-05 | International Business Machines Corporation | Method and Apparatus for Automatically Converting Voice |
CN101482975A (en) * | 2008-01-07 | 2009-07-15 | 丰达软件(苏州)有限公司 | Method and apparatus for converting words into animation |
CN101290720A (en) * | 2008-06-17 | 2008-10-22 | 李伟 | Visualized pronunciation teaching method and apparatus |
CN101510256A (en) * | 2009-03-20 | 2009-08-19 | 深圳华为通信技术有限公司 | Mouth shape language conversion method and device |
Non-Patent Citations (1)
Title |
---|
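Computer Engineering and Design (《计算机工程与设计》), 2004-02-29, Hou Yarong et al. (侯亚荣等), "Research on automatic recognition and verification of lip synchronization" (唇同步的自动识别与验证研究), pp. 166-169, 1-6, Vol. 25, No. 2, 2 * |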
Cited By (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104574478A (en) * | 2014-12-30 | 2015-04-29 | 北京像素软件科技股份有限公司 | Method and device for editing mouth shapes of animation figures |
CN104574474A (en) * | 2015-01-09 | 2015-04-29 | 何玉欣 | Matching method for generating language mouth shapes of cartoon characters through subtitles |
CN106297792A (en) * | 2016-09-14 | 2017-01-04 | 厦门幻世网络科技有限公司 | The recognition methods of a kind of voice mouth shape cartoon and device |
CN107831684A (en) * | 2016-09-16 | 2018-03-23 | 天津思博科科技发展有限公司 | Using the shape of the mouth as one speaks pronunciation transposition of realizing of Robot Vision |
CN106446406A (en) * | 2016-09-23 | 2017-02-22 | 天津大学 | Simulation system and simulation method for converting Chinese sentences into human mouth shapes |
CN106653050A (en) * | 2017-02-08 | 2017-05-10 | 康梅 | Method for matching animation mouth shapes with voice in real time |
CN109949390A (en) * | 2017-12-21 | 2019-06-28 | 腾讯科技(深圳)有限公司 | Image generating method, dynamic expression image generating method and device |
CN108962251A (en) * | 2018-06-26 | 2018-12-07 | 珠海金山网络游戏科技有限公司 | A kind of game role Chinese speech automatic identifying method |
CN110867177A (en) * | 2018-08-16 | 2020-03-06 | 林其禹 | Voice playing system with selectable timbre, playing method thereof and readable recording medium |
CN109087629A (en) * | 2018-08-24 | 2018-12-25 | 苏州玩友时代科技股份有限公司 | A kind of mouth shape cartoon implementation method and device based on speech recognition |
CN109087651A (en) * | 2018-09-05 | 2018-12-25 | 广州势必可赢网络科技有限公司 | A kind of vocal print identification method, system and equipment based on video and sound spectrograph |
CN110149548A (en) * | 2018-09-26 | 2019-08-20 | 腾讯科技(深圳)有限公司 | Video dubbing method, electronic device and readable storage medium storing program for executing |
CN112700520A (en) * | 2020-12-30 | 2021-04-23 | 上海幻维数码创意科技股份有限公司 | Mouth shape expression animation generation method and device based on formants and storage medium |
CN112700520B (en) * | 2020-12-30 | 2024-03-26 | 上海幻维数码创意科技股份有限公司 | Formant-based mouth shape expression animation generation method, device and storage medium |
CN112750187A (en) * | 2021-01-19 | 2021-05-04 | 腾讯科技(深圳)有限公司 | Animation generation method, device and equipment and computer readable storage medium |
CN113112575A (en) * | 2021-04-08 | 2021-07-13 | 深圳市山水原创动漫文化有限公司 | Mouth shape generation method and device, computer equipment and storage medium |
CN113112575B (en) * | 2021-04-08 | 2024-04-30 | 深圳市山水原创动漫文化有限公司 | Mouth shape generating method and device, computer equipment and storage medium |
CN113327483A (en) * | 2021-04-30 | 2021-08-31 | 云南北飞科技有限公司 | Language training method for simulating pronunciation and air flow changes based on 3D tongue position model |
CN116580721A (en) * | 2023-07-13 | 2023-08-11 | 中国电信股份有限公司 | Expression animation generation method and device and digital human platform |
CN116580721B (en) * | 2023-07-13 | 2023-09-22 | 中国电信股份有限公司 | Expression animation generation method and device and digital human platform |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101930747A (en) | Method and device for converting voice into mouth shape image | |
CN104272382B (en) | Personalized singing synthetic method based on template and system | |
CN106898340B (en) | Song synthesis method and terminal | |
CN108847215B (en) | Method and device for voice synthesis based on user timbre | |
CN109767778B (en) | Bi-LSTM and WaveNet fused voice conversion method | |
CN1815552B (en) | Frequency spectrum modelling and voice reinforcing method based on line spectrum frequency and its interorder differential parameter | |
WO1997029482A1 (en) | Speech coding, reconstruction and recognition using acoustics and electromagnetic waves | |
CN102543073A (en) | Shanghai dialect phonetic recognition information processing method | |
Hansen et al. | On the issues of intra-speaker variability and realism in speech, speaker, and language recognition tasks | |
CN109616131B (en) | Digital real-time voice sound changing method | |
Wand et al. | Deep Neural Network Frontend for Continuous EMG-Based Speech Recognition. | |
WO2015129465A1 (en) | Voice clarification device and computer program therefor | |
CN113436606B (en) | Original sound speech translation method | |
CN105825868B (en) | A kind of extracting method of the effective range of singer | |
EP1280137A1 (en) | Method for speaker identification | |
CN103035252B (en) | Chinese speech signal processing method, Chinese speech signal processing device and hearing aid device | |
CN112992109A (en) | Auxiliary singing system, auxiliary singing method and non-instantaneous computer readable recording medium | |
CN110570842B (en) | Speech recognition method and system based on phoneme approximation degree and pronunciation standard degree | |
JP4381404B2 (en) | Speech synthesis system, speech synthesis method, speech synthesis program | |
CN114550701A (en) | Deep neural network-based Chinese electronic larynx voice conversion device and method | |
CN103035237B (en) | Chinese speech signal processing method, device and hearing aid device | |
Millhouse et al. | Perceptual characterisation of the singer’s formant region: a preliminary study | |
CN114283822A (en) | Many-to-one voice conversion method based on gamma pass frequency cepstrum coefficient | |
Nasreen et al. | Speech analysis for automatic speech recognition | |
CN109697985B (en) | Voice signal processing method and device and terminal |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C12 | Rejection of a patent application after its publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20101229 |