KR970050112A

KR970050112A - Real time voice recognition

Info

Publication number: KR970050112A
Application number: KR1019950047885A
Authority: KR
Inventors: 정호선; 조영탁
Original assignee: 정호선
Priority date: 1995-12-08
Filing date: 1995-12-08
Publication date: 1997-07-29
Also published as: KR100202424B1

Abstract

본 발명은 음성인식방법에 관한 것으로, 다수의 심플음성신호로부터 구해진 신호차이를 저장하는 신호차이 누적과정; 저장된 신호차이의 최대값으로 된 일련의 열을 계산하고, 상기 열로부터 음성을 음절로 분리하고 자음과 모음을 구별하는 세그먼트화과정; 세그먼트화과정에서 분리된 자음과 모음에 의하여 음성신호를 시간 영역에서 정규화하여 음성특징으로 추출하고 정규화과정; 및 실수형 신경회로망에서와 같은 학습상수 및 활성함수를 가진 정수형 입력구도 다층 퍼셉트론 신경회로망에 상기 추출된 특징으로 적용하여 특징으로 부터 음성을 분류하는 과정을 포함함을 특징으로 한다.The present invention relates to a speech recognition method, comprising: a signal difference accumulation process for storing signal differences obtained from a plurality of simple voice signals; A segmentation process of calculating a series of maximum values of stored signal differences, separating speech from syllables into syllables, and distinguishing consonants from vowels; Normalizing the speech signal in the time domain by consonants and vowels separated in the segmentation process and extracting the speech signal into a speech feature; And an integer input sphere having a learning constant and an activation function as in a real neural network, is applied to the multilayer perceptron neural network as the extracted feature to classify a voice from the feature.

본 발명에 의하면, 음성인식처리에 잇어서, 저가형, 소형이면서도 실시간 처리가 가능하며 높은 인식률을 가지는 시스템을 구현할 수 있다.According to the present invention, in addition to the voice recognition processing, it is possible to implement a system having a low recognition rate, a small size and real-time processing and a high recognition rate.

Description

Real time voice recognition

본 내용은 요부공개 건이므로 전문내용을 수록하지 않았음Since this is an open matter, no full text was included.

제1도는 본 발명에 의한 음성인식장난감의 하드웨어적인 구성을 도시한 구성블럭도.1 is a block diagram showing a hardware configuration of a voice recognition toy according to the present invention.

제2도는 제1도에 도시된 음성인식장치의 상세 구성블럭도.2 is a detailed block diagram of the speech recognition apparatus shown in FIG.

제3도는 본 발명에 의한 음성인식 알고리즘을 설명하기 위한 블럭도.3 is a block diagram for explaining a speech recognition algorithm according to the present invention.

Claims

A speech recognition method for extracting a feature from a speech signal, classifying a speech from the extracted feature, and applying it to speech recognition, comprising: a signal difference accumulation process of storing signal differences obtained from a plurality of sample speech signals; A segmentation process of calculating a series of columns of maximum values of the stored signal differences, separating speech into syllables, and distinguishing consonants and vowels from the rows; Normalizing the voice call in the time domain by consonants and vowels separated in the segmentation process and extracting the voice feature into a voice feature; And classifying speech from the features by applying the extracted features to an integer input driving multilayer perceptron neural network having a learning constant and an activation function as in a real neural network.

The method of claim 1, wherein the signal difference accumulating process comprises: selecting a predetermined number of samples in one frame, and sequentially calculating differences between samples; Storing the calculated difference value in a plurality of melscale storage devices; Counting the number of data accumulated in each of the melscale storage devices; And repeating the same process as the first frame with respect to the remaining frames by a predetermined time.

CLAIMS 1. A method for setting an optimal offset value in an activation function of an integer input drive multilayer perceptron neural network, comprising: a first step of selecting zero as an initial offset value; Counting the number of iterations when the total error is not reduced by further iterations; A third step of calculating an average of net values if the counted number of repetitions is greater than a predetermined constant value; A third step of increasing the offset by one point if the calculated average value is positive, and decreasing the offset by one point if the calculated average value is negative; A fourth step of calculating a weight and an error by using the new offset value increased or decreased in the step; And a fifth step of repeating the second step until the total error reaches a predetermined desired value.

※ Note: The disclosure is based on the initial application.