KR100202424B1 - 실시간 음성인식방법 - Google Patents
실시간 음성인식방법 Download PDFInfo
- Publication number
- KR100202424B1 KR100202424B1 KR1019950047885A KR19950047885A KR100202424B1 KR 100202424 B1 KR100202424 B1 KR 100202424B1 KR 1019950047885 A KR1019950047885 A KR 1019950047885A KR 19950047885 A KR19950047885 A KR 19950047885A KR 100202424 B1 KR100202424 B1 KR 100202424B1
- Authority
- KR
- South Korea
- Prior art keywords
- speech
- signal
- neural network
- recognition
- speech recognition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/04—Segmentation; Word boundary detection
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Telephonic Communication Services (AREA)
Abstract
Description
Claims (3)
- 음성신호로부터 특징을 추출하고, 추출된 특징으로부터 음성을 분류하여 음성인식에 적용하기 위한 음성인식방법에 있어서, 다수의 샘플 음성신호로부터 구해진 신호차이를 저장하는 신호차이누적과정; 상기 저장된 신호차이의 최대값으로 된 일련의 열을 계산하고, 상기 열로부터 음성을 음절로 분리하고 자음과 모음을 구별하는 세그먼트화 과정; 상기 세그먼트화과정에서 분리된 자음과 모음에 의하여 음성신호를 시간영역에서 정규화하여 음성특징을 추출하는 정규화과정; 및 실수형 신경회로망에서와 같은 학습상수 및 활성함수를 가진 정수형 입력구동 다층 퍼셉트론 신경회로망에 상기 추출된 특징을 적용하여 특징으로부터 음성을 분류하는 과정을 포함함을 특징으로 하는 음성인식방법.
- 제1항에 있어서, 상기 신호차이누적과정은 소정 개수의 샘플을 한 프레임으로 선택하고, 순차적으로 각각 샘플간의 차이를 구하는 과정; 상기 계산된 차이값을 대응하는 복수의 멜스케일 저장장치에 저장하는 과정; 및 상기 각 멜스케일 저장장치에 누적된 데이터의 수를 카운트하는 과정; 및 시간측에서 일정시간만큼 이동시키고 나머지 프레임에 대하여 상기 제1프레임에서와 같은 과정을 반복하는 과정을 포함함을 특징으로 하는 음성인식방법.
- 정수형 입력구동 다층 퍼셉트론 신경회로망의 활성함수에서 최적의 오프셋값을 설정하기 위한 방법에 있어서, 초기 오프셋값으로 0을 선택하는 제1과정; 전체 에러가 더 이상의 반복에 의해서도 감소되지 않을 때 그 반복회수를 카운트하는 제2과정; 만일 상기 카운트된 반복회수가 소정의 상수값보다 더 크면 정미가(net value)의 평균을 계산하는 제3과정; 만일 상기 계산된 평균값이 음수이면 오프셋을 1포인트 감소시키는 제3과정; 상기 과정에서 증가 또는 감소된 새로운 오프셋값을 사용하여 가증치와 에러를 계산하는 제4과정; 및 전체 에러가 소정의 원하는 값으로 될 때까지 상기 제2과정부터 반복하는 제5과정을 포함함을 특징으로 하는 오프셋값 설정방법.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1019950047885A KR100202424B1 (ko) | 1995-12-08 | 1995-12-08 | 실시간 음성인식방법 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1019950047885A KR100202424B1 (ko) | 1995-12-08 | 1995-12-08 | 실시간 음성인식방법 |
Publications (2)
Publication Number | Publication Date |
---|---|
KR970050112A KR970050112A (ko) | 1997-07-29 |
KR100202424B1 true KR100202424B1 (ko) | 1999-06-15 |
Family
ID=19438634
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1019950047885A Expired - Fee Related KR100202424B1 (ko) | 1995-12-08 | 1995-12-08 | 실시간 음성인식방법 |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR100202424B1 (ko) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9972305B2 (en) | 2015-10-16 | 2018-05-15 | Samsung Electronics Co., Ltd. | Apparatus and method for normalizing input data of acoustic model and speech recognition apparatus |
US10714077B2 (en) | 2015-07-24 | 2020-07-14 | Samsung Electronics Co., Ltd. | Apparatus and method of acoustic score calculation and speech recognition using deep neural networks |
-
1995
- 1995-12-08 KR KR1019950047885A patent/KR100202424B1/ko not_active Expired - Fee Related
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10714077B2 (en) | 2015-07-24 | 2020-07-14 | Samsung Electronics Co., Ltd. | Apparatus and method of acoustic score calculation and speech recognition using deep neural networks |
US9972305B2 (en) | 2015-10-16 | 2018-05-15 | Samsung Electronics Co., Ltd. | Apparatus and method for normalizing input data of acoustic model and speech recognition apparatus |
Also Published As
Publication number | Publication date |
---|---|
KR970050112A (ko) | 1997-07-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Kohonen | The'neural'phonetic typewriter | |
Dewa | Suitable CNN weight initialization and activation function for Javanese vowels classification | |
CN111724770B (zh) | 一种基于深度卷积生成对抗网络的音频关键词识别方法 | |
Bose et al. | Deep learning for audio signal classification | |
Shome et al. | Speaker recognition through deep learning techniques: a comprehensive review and research challenges | |
Gaudani et al. | Comparative study of robust feature extraction techniques for ASR for limited resource Hindi language | |
Alkhatib et al. | Building an assistant mobile application for teaching arabic pronunciation using a new approach for arabic speech recognition | |
KR100202424B1 (ko) | 실시간 음성인식방법 | |
Watrous¹ et al. | Learned phonetic discrimination using connectionist networks | |
Jain et al. | Investigation using MLP-SVM-PCA classifiers on speech emotion recognition | |
Sunny et al. | Feature extraction methods based on linear predictive coding and wavelet packet decomposition for recognizing spoken words in malayalam | |
Yousfi et al. | Isolated Iqlab checking rules based on speech recognition system | |
Jegan et al. | MFCC and texture descriptors based stuttering dysfluencies classification using extreme learning machine | |
Tomar et al. | CNN-MFCC model for speaker recognition using emotive speech | |
Kamarudin et al. | Analysis on Mel frequency cepstral coefficients and linear predictive cepstral coefficients as feature extraction on automatic accents identification | |
Hmad et al. | Biologically inspired continuous Arabic speech recognition | |
Ameta et al. | Statistical and deep convolutional feature fusion for emotion detection from audio signal | |
Camarena-Ibarrola et al. | Speaker identification using entropygrams and convolutional neural networks | |
Bansod et al. | Speaker Recognition using Marathi (Varhadi) Language | |
Jain et al. | Speaker Recognition | |
Kherdekar et al. | Feature Fusion Extraction Method for Improvement of Recognition of Continuous Speech: A Feature Fusion Method for Recognition of Continuous Speech | |
Siyad et al. | Spoken Indian Language Identification using MFCC and Vowel Onset Points | |
Lee | Automatic recognition of isolated cantonese syllables using neural networks | |
Sunny et al. | A comparative study of parametric coding and wavelet coding based feature extraction techniques in recognizing spoken words | |
Cosi | On the use of auditory models in speech technology |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A201 | Request for examination | ||
PA0109 | Patent application |
Patent event code: PA01091R01D Comment text: Patent Application Patent event date: 19951208 |
|
PA0201 | Request for examination |
Patent event code: PA02012R01D Patent event date: 19951208 Comment text: Request for Examination of Application |
|
PG1501 | Laying open of application | ||
E902 | Notification of reason for refusal | ||
PE0902 | Notice of grounds for rejection |
Comment text: Notification of reason for refusal Patent event date: 19980929 Patent event code: PE09021S01D |
|
E701 | Decision to grant or registration of patent right | ||
PE0701 | Decision of registration |
Patent event code: PE07011S01D Comment text: Decision to Grant Registration Patent event date: 19981229 |
|
GRNT | Written decision to grant | ||
PR0701 | Registration of establishment |
Comment text: Registration of Establishment Patent event date: 19990319 Patent event code: PR07011E01D |
|
PR1002 | Payment of registration fee |
Payment date: 19990320 End annual number: 3 Start annual number: 1 |
|
PG1601 | Publication of registration | ||
LAPS | Lapse due to unpaid annual fee | ||
PC1903 | Unpaid annual fee |
Termination category: Default of registration fee Termination date: 20021210 |