JPH0427560B2 - Speech vowel recognition method - Google Patents
Info
- Publication number
- JPH0427560B2 (application JP19538285A)
- Authority
- JP
- Japan
- Prior art keywords
- recognition
- vowels
- vowel
- learning
- speaker
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired
Concepts (machine-extracted)
| Name | Domain | Query match | Sections | Count |
|---|---|---|---|---|
| method | Methods | 0.000 | claims, description | 51 |
| vector | Substances | 0.000 | claims, description | 25 |
| vocal | Effects | 0.000 | claims, description | 5 |
| experiment | Methods | 0.000 | description | 15 |
| diagram | Methods | 0.000 | description | 13 |
| spectrum | Methods | 0.000 | description | 13 |
| effects | Effects | 0.000 | description | 7 |
| evaluation | Methods | 0.000 | description | 7 |
| function | Effects | 0.000 | description | 7 |
| material | Substances | 0.000 | description | 7 |
| matrix material | Substances | 0.000 | description | 7 |
| simulation | Methods | 0.000 | description | 6 |
| extraction | Methods | 0.000 | description | 5 |
| spectral | Effects | 0.000 | description | 5 |
| change | Effects | 0.000 | description | 4 |
| decrease | Effects | 0.000 | description | 4 |
| processing | Methods | 0.000 | description | 4 |
| verification | Methods | 0.000 | description | 3 |
| distribution | Methods | 0.000 | description | 2 |
| sound signal | Effects | 0.000 | description | 2 |
| transformation | Effects | 0.000 | description | 2 |
| Collimonas arenae | Species | 0.000 | description | 1 |
| adaptation | Effects | 0.000 | description | 1 |
| comparative | Effects | 0.000 | description | 1 |
| contraction | Effects | 0.000 | description | 1 |
| conventional method | Methods | 0.000 | description | 1 |
| coupling | Effects | 0.000 | description | 1 |
| coupling process | Methods | 0.000 | description | 1 |
| coupling reaction | Methods | 0.000 | description | 1 |
| engineering process | Methods | 0.000 | description | 1 |
| extract | Substances | 0.000 | description | 1 |
| modification | Methods | 0.000 | description | 1 |
| modification | Effects | 0.000 | description | 1 |
| normalization | Methods | 0.000 | description | 1 |
| principal component analysis | Methods | 0.000 | description | 1 |
| process | Effects | 0.000 | description | 1 |
| quenching | Methods | 0.000 | description | 1 |
| research | Methods | 0.000 | description | 1 |
| testing | Methods | 0.000 | description | 1 |
| training | Methods | 0.000 | description | 1 |
| transfer | Methods | 0.000 | description | 1 |
| translation | Methods | 0.000 | description | 1 |
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP19538285A (JPS6255700A, ja) | 1985-09-04 | 1985-09-04 | Speech vowel recognition method |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP19538285A (JPS6255700A, ja) | 1985-09-04 | 1985-09-04 | Speech vowel recognition method |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| JPS6255700A (ja) | 1987-03-11 |
| JPH0427560B2 | 1992-05-12 |
Family
ID=16340235
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP19538285A (granted; JPS6255700A, ja) | Speech vowel recognition method | 1985-09-04 | 1985-09-04 |
Country Status (1)
| Country | Link |
|---|---|
| JP (1) | JPS6255700A |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH067358B2 (ja) * | 1988-08-20 | 1994-01-26 | 正行 木村 | Speech recognition method based on relative relationships |
| JP4906776B2 (ja) * | 2008-04-16 | 2012-03-28 | 株式会社アルカディア | Voice control device |
- 1985-09-04: JP application JP19538285A (JPS6255700A, ja), status: granted
Also Published As
| Publication number | Publication date |
|---|---|
| JPS6255700A (ja) | 1987-03-11 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Palo et al. | | Wavelet based feature combination for recognition of emotions |
| Qawaqneh et al. | | Deep neural network framework and transformed MFCCs for speaker's age and gender classification |
| CN110400579B (zh) | | Speech emotion recognition based on a directional self-attention mechanism and a bidirectional long short-term network |
| Yogesh et al. | | A new hybrid PSO assisted biogeography-based optimization for emotion and stress recognition from speech signal |
| Nwe et al. | | Speech based emotion classification |
| US7957959B2 (en) | | Method and apparatus for processing speech data with classification models |
| Bezoui et al. | | Feature extraction of some Quranic recitation using mel-frequency cepstral coeficients (MFCC) |
| Samantaray et al. | | A novel approach of speech emotion recognition with prosody, quality and derived features using SVM classifier for a class of North-Eastern Languages |
| CN103456302B (zh) | | An emotional speaker recognition method based on emotion GMM model weight synthesis |
| Al Anazi et al. | | A machine learning model for the identification of the holy quran reciter utilizing k-nearest neighbor and artificial neural networks |
| CN106531192A (zh) | | Speech emotion recognition method and system based on redundant features and multi-dictionary representation |
| Nanavare et al. | | Recognition of human emotions from speech processing |
| Daouad et al. | | An automatic speech recognition system for isolated Amazigh word using 1D & 2D CNN-LSTM architecture |
| Agrawal et al. | | Speech emotion recognition of Hindi speech using statistical and machine learning techniques |
| Sinha et al. | | Acoustic-phonetic feature based dialect identification in Hindi Speech |
| Xue et al. | | Learning speech emotion features by joint disentangling-discrimination |
| Saputri et al. | | Identifying Indonesian local languages on spontaneous speech data |
| Shafieian | | Hidden Markov model and Persian speech recognition |
| JPH0427560B2 | | |
| Dwijayanti et al. | | Speech-to-text conversion in indonesian language using a deep bidirectional long short-term memory algorithm |
| Hassine et al. | | Hybrid techniques for Arabic Letter recognition |
| Rashmi et al. | | Optimization of Convolutional Neural Network Architectures for High-Accuracy Spoken Digit Classification Using Mel-Frequency Cepstral Coefficients. |
| Hanifa et al. | | Comparative analysis on different cepstral features for speaker identification recognition |
| Cai et al. | | Deep speaker embeddings with convolutional neural network on supervector for text-independent speaker recognition |
| Ma et al. | | Statistical formant descriptors with linear predictive coefficients for accent classification |