JPS58132299A - Speaker-independent word speech recognition method - Google Patents

Speaker-independent word speech recognition method

Info

Publication number
JPS58132299A
Authority
JP
Japan
Prior art keywords
voice
spectral
bang
distance
series
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP57014685A
Other languages
English (en)
Japanese (ja)
Other versions
JPH0221598B2
Inventor
貞煕 古井
管村 昇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NTT Inc
Original Assignee
Nippon Telegraph and Telephone Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nippon Telegraph and Telephone Corp
Priority to JP57014685A
Publication of JPS58132299A
Publication of JPH0221598B2
Granted

JP57014685A 1982-02-01 1982-02-01 Speaker-independent word speech recognition method Granted JPS58132299A (ja)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP57014685A JPS58132299A (ja) 1982-02-01 1982-02-01 Speaker-independent word speech recognition method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP57014685A JPS58132299A (ja) 1982-02-01 1982-02-01 Speaker-independent word speech recognition method

Publications (2)

Publication Number Publication Date
JPS58132299A (ja) 1983-08-06
JPH0221598B2 1990-05-15

Family

ID=11868056

Family Applications (1)

Application Number Title Priority Date Filing Date
JP57014685A Granted JPS58132299A (ja) 1982-02-01 1982-02-01 Speaker-independent word speech recognition method

Country Status (1)

Country Link
JP (1) JPS58132299A

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS60123000A (ja) * 1983-11-08 1985-07-01 Texas Instruments Incorporated Speaker-independent speech recognition method

Also Published As

Publication number Publication date
JPH0221598B2 1990-05-15

Similar Documents

Publication Publication Date Title
Das et al. Urban sound classification using convolutional neural network and long short term memory based on multiple features
Davis et al. Environmental sound classification using deep convolutional neural networks and data augmentation
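CN111724770B (zh) An audio keyword recognition method based on deep convolutional generative adversarial networks
JPS59216284A (ja) Pattern recognition device
CN111400540B (zh) A singing voice detection method based on squeeze-and-excitation residual networks
CN110992988B (zh) A domain-adversarial speech emotion recognition method and device
CN111785262B (zh) A speaker age and gender classification method based on residual networks and fused features
CN116580706B (zh) A speech recognition method based on artificial intelligence
CN113691382A (zh) Meeting recording method, apparatus, computer device, and medium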
Phan et al. Multi-view audio and music classification
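CN114999508A (zh) A universal speech enhancement method and device using multi-source auxiliary information
CN114512134A (zh) Method and device for voiceprint information extraction, model training, and voiceprint recognition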
US6131089A (en) Pattern classifier with training system and methods of operation therefor
Sarkar et al. Music genre classification using EMD and pitch based feature
Renisha et al. Cascaded feedforward neural networks for speaker identification using perceptual wavelet based cepstral coefficients
CN114817622A (zh) Song clip search method and its apparatus, device, medium, and product
Cheng et al. Speech emotion recognition based on interactive convolutional neural network
Kareem et al. Multi-label bird species classification using sequential aggregation strategy from audio recordings
Zhang et al. Introducing self-supervised phonetic information for text-independent speaker verification
Sarkar et al. Raga identification from Hindustani classical music signal using compositional properties
JPS58132299A (ja) Speaker-independent word speech recognition method
Bhavya et al. Deep learning approach for sound signal processing
Gupta et al. Emotion recognition from speech using wavelet packet transform and prosodic features
Mehta et al. Cover song identification with pairwise cross-similarity matrix using deep learning
CN115910070A (zh) Speech recognition method, apparatus, device, and storage medium