KR940005044B1

KR940005044B1 - Voice recognizing apparatus and voice recording method

Info

Publication number: KR940005044B1
Application number: KR1019910024619A
Authority: KR
Inventors: 한석진
Original assignee: 주식회사 금성사; 이헌조
Priority date: 1991-12-27
Filing date: 1991-12-27
Publication date: 1994-06-10
Also published as: KR930014265A

Abstract

The device records a voice by one sound with a voice synthesizer (IC4) and memory (6), The device includes a voice record and recognition unit (1) which compares data of 64K SRAM (IC3) and recognizes a voice, an amplifier (5) which amplifies a voice signal of microphone (M), a memory (6) which stores voice data, a storage unit (7), a 1st micom (8) which inputs output data of a voice record and recognition unit (1), a displaying unit (3) which displays results, and a processor (9) which has a 1st micom (8) and displaying unit (3).

Description

Speech recognition device and voice registration method

제1도는 종래 음성인식 장치의 블록도.1 is a block diagram of a conventional speech recognition device.

제2도는 제1도에 따른 음성등록 모드시의 동작 흐름도.2 is a flowchart of operation in the voice registration mode according to FIG.

제3도는 제1도에 따른 음성인식 모드시의 동작 흐름도.3 is a flowchart of operation in the voice recognition mode according to FIG.

제4도는 본 발명 음성인식 장치의 블럭도.4 is a block diagram of the voice recognition device of the present invention.

제5도는 제4도에 따른 음성등록 모드시의 동작 흐름도.5 is a flowchart of operation in the voice registration mode according to FIG. 4;

제6도는 제4도에 따른 음성인식 모드시의 동작 흐름도.6 is an operation flowchart in the voice recognition mode according to FIG.

* 도면의 주요부분에 대한 부호의 설명* Explanation of symbols for main parts of the drawings

1 : IC 음성등록 및 인식부 3 : IC 표시기1: IC voice registration and recognition unit 3: IC indicator

5 : IC 증폭기 6 : IC 메모리5: IC amplifier 6: IC memory

7 : IC 저장부 8 : IC 제1마이컴7: IC storage unit 8: IC first microcomputer

9 : IC 처리부, IC₁IC 아날로그 음성인식기, IC₂IC 디지탈 음성인식기, IC₃IC64K SRAM, IC₄IC 음성합성기9: IC processing unit, IC ₁ IC analog voice recognizer, IC ₂ IC digital voice recognizer, IC ₃ IC64K SRAM, IC ₄ IC voice synthesizer

본 발명은 음성인식 장치 및 그의 음성등록 방법에 관한 것으로, 특히 한번의 발음으로 음성을 등록시키므로 등록과정을 간편하게 할 수 있도록 한 것이다.The present invention relates to a voice recognition device and a voice registration method thereof, and in particular, to register a voice by one pronunciation, thereby simplifying the registration process.

일반적으로 특정화자 음성인식 IC는 음성으로 대상기기를 제어하기 위한 시스템의 핵심 IC이나 현재의 기술수준에서는 모든 일반 사람의 음성을 인식하지는 못하고 시스템에 미리 설정한 제어용 단어는 음성으로 등록시킨 사람만의 음성을 인식할 수 있는데 이와 같은 것을 특정화자 음성인식이라고 한다.Generally speaking, the specific speaker voice recognition IC is the core IC of the system for controlling the target device by voice, but at the current technology level, it does not recognize the voice of all ordinary people, and the control word set in the system is registered only by the voice registered by the voice. Speech can be recognized, which is called speech recognition.

종래 음성인식 장치는 제1도와 같이 음성등록 및 인식부(1)와 처리부(4)로 이루어졌다.The conventional voice recognition device is composed of a voice registration and recognition unit 1 and a processing unit 4 as shown in FIG.

음성등록 및 인식부(1)는 마이크(M)로부터 아날로그(Analog)음성신호를 받아 디지탈(Digital)신호로 변환하여 상기 디지탈 신호중 음성의 특징있는 데이타를 추출해서 일정 데이타로 바꾸어 소정부분에 저장(등록모드경우)하거나 이미 저장되어 있는 데이타와 비교(인식 모드 경우)하여 일정신호를 발생하는 것으로서, 마이크(M)로부터 아날로그 음성신호를 받아 디지탈 신호로 변환하는 아날로그 음성인식기(IC₄; TC 8861), 디지탈 음성인식기(IC₂; TC 8864)와 등록모드시 디지탈신호중 음성의 특징 데이타를 소정부분에 저장하고, 인식모드시 저장되어 있던 특정데이타를 출력하는 64K SRAM(Static Random Access Memory)(IC₃)으로 구성되고, 처리부(4)는 상기 음성등록 및 인식부(1)와 일정신호를 주고 받음에 따라 음성등록 및 인식결과를 표시하는 마이컴(2)과 표시기(3)로 구성되었다.The voice registration and recognition unit 1 receives an analog voice signal from the microphone M, converts the analog voice signal into a digital signal, extracts characteristic data of the voice from the digital signal, converts the voice data into predetermined data, and stores the predetermined data ( Analog voice recognizer (IC ₄ ; TC 8861) that generates a certain signal by comparing to the data already stored (in case of registration mode) or in comparison with the already stored data (in case of recognition mode). 64K SRAM (IC ₃ ) that stores the characteristic data of voice among the digital signals in the registration mode and the digital voice recognizer (IC ₂ ; TC 8864) in the registration mode and outputs the specific data stored in the recognition mode. And a processing unit 4 as a microcomputer 2 and an indicator 3 for displaying a voice registration and recognition result as a signal is transmitted to and received from the voice registration and recognition unit 1. Castle was.

음성인식용 아날로그 음성인식기(IC₁)와 디지탈 음성인식기(IC₂)는 음성등록시 반드시 3번의 발음이 입력되어 비교된 후 그 발음이 유사하면 등록을 완료하는 스펙(Spec)을 갖는 IC이다.The analog voice recognizer IC ₁ and the digital voice recognizer IC _{2 for} voice recognition are ICs having a specification that completes registration if the pronunciation is similar when 3 pronunciations are inputted and compared when voice registration is performed.

이와 같이 구성된 종래의 음성인식기기의 음성등록 동작을 제2도를 참조하여 설명한다.The voice registration operation of the conventional voice recognition device configured as described above will be described with reference to FIG.

마이크(M)를 통해 어떤 단어를 두번 반복발음한 후 상기 두번 발음이 유사한지를 판단한다.(S₁-S₃).After repeating the word twice through the microphone (M) it is determined whether the two pronunciations are similar (S ₁ -S ₃ ).

또한, 세번째로서 한번더 발음한 후 상기 발음과 유사하면 이 음성은 아날로그음성인식기 및 디지털 음성인식기(IC₁, IC₂)를 통해 디지탈 신호로 변환되고, 64K SRAM(IC₃)은 음성신호중 특징있는 데이타를 저장하여 한 단어등록을 완료한다(S₄-S₇).In addition, if the sound is pronounced as a third time and is similar to the pronunciation, the voice is converted into a digital signal through an analog voice recognizer and a digital voice recognizer (IC ₁ , IC ₂ ), and 64K SRAM (IC ₃ ) is characterized by the characteristic of the voice signal. Save the data to complete one word registration (S ₄ -S ₇ ).

그리고 제3도를 참조하여 음성인식 동작을 설명한다.And voice recognition operation will be described with reference to FIG.

마이트(M)로 발음을 하면 음성등록 및 인식부(1)의 아날로그 음성인식기(IC₁)에서 아날로그 음성데이타의 특징을 추출해서 디지탈 데이타로 변환한 후 디지탈 음성인식기(IC₂)에서는 상기 추출된 특징 데이타를 64K SRAM(IC₃)에 등록되어 있는 단어들의 특징을 비교한다(S₈-S₁₀).The boehmite (M) When a pronounced voice registration and recognition unit 1, an analog speech recognizer (IC ₁₎ digital speech recognizer (IC ₂₎ and then to extract the characteristics of the analog audio data converted to digital data in the above extraction The feature data is compared with features of words registered in 64K SRAM (IC ₃ ) (S ₈ -S ₁₀ ).

따라서 64K SRAM(IC₃)은 가장 유사한 단어의 번호를 처리부(4)의 마이컴(2)으로 전송하며 마이컴(2)은 표시기(3)에 인식결과를 표시하므로 인식이 완료된다(S₁₁-S₁₃).Therefore, the 64K SRAM IC _{3 transmits} the number of the most similar word to the microcomputer 2 of the processing unit 4, and the microcomputer 2 displays the recognition result on the display 3 so that the recognition is completed (S ₁₁ -S). ₁₃ ).

그런데 이와 같은 종래기술은 아날로그 음성인식기(IC₁)와 디지탈 음성인식기(IC₄)의 스펙특성상 한 단어를 등록하는데 세번의 발음을 해야 하며, 세번 발음 중 한번이라도 에러가 발생하면 다시 세번 발음해야 하는 번거로움이 발생하는 결점이 있다.However, such a prior art has to pronounce three words in order to register a word due to the characteristics of the analog voice recognizer (IC ₁ ) and the digital voice recognizer (IC ₄ ). There is a drawback to the hassle.

본 발명은 이와 같은 종래의 결점을 해결하기 위한 것으로 종래의 아날로그 음성인식기와 디지탈 음성인식기를 이용하여 한번의 발음으로 단어를 등록할 수 있는 음성인식장치 및 그의 음성등록 및 인식방법을 제공하는데 그 목적이 있다.The present invention is to solve the above-mentioned shortcomings, and to provide a voice recognition device and a voice registration and recognition method thereof capable of registering a word with a single pronunciation using a conventional analog voice recognizer and a digital voice recognizer. There is this.

이하에서 이와 같은 목적을 달성하기 위한 본 발명의 실시예를 첨부된 도면에 의하여 상세히 설명하면 다음과 같다.Hereinafter, an embodiment of the present invention for achieving such an object will be described in detail with reference to the accompanying drawings.

제4도는 본 발명 음성인식장치의 블럭도로서, 본 발명의 음성인식 장치는 저장부(7), 음성등록 및 인식부(1) 및 처리부(9)로 이루어졌다. 마이크(M)의 음성인식을 증폭해서 디지탈 신호로 변환시켜 제어부에 따라 메모리에 저장 및 재생하여 음성신호를 발생한 증폭기(5), 음성합성기(IC₄; M 6258), 메모리(6)로 구성되고, 상기 저장부(7)의 신호를 받아 디지탈 신호로 변환하여 상기 디지탈 신호중 음성의 특징있는 데이타를 추출해서 일정데이타로 바꾸어 소정부분에 저장(등록모드경우)하거나 이미 저장되어 있는 데이타와 비교(인식모드 경우)하여 일정신호를 발생하는 음성등록 및 인식부(1)는 아날로그 음성인식기(IC₁), 디지탈 음성인식기(IC₂), 64K SRAM(IC₃)으로 구성되며, 상기 음성등록 및 인식부(1)와 일정신호를 주고 받음에 따라 음성등록 및 인식결과를 표시하며 일정제어 신호를 저장부(7)로 발생하는 처리부(9)는 제1마이컴(8), 표시기(3)로 구성된다.4 is a block diagram of the speech recognition apparatus of the present invention, wherein the speech recognition apparatus of the present invention comprises a storage unit 7, a voice registration and recognition unit 1, and a processing unit 9. Amplified voice recognition of the microphone (M) is converted into a digital signal, stored in the memory according to the control unit and reproduced by the amplifier (5), a voice synthesizer (IC ₄ ; M 6258), a memory 6 generating a voice signal, After receiving the signal from the storage unit 7, converting it into a digital signal, extracting characteristic data of the voice from the digital signal, converting it into a constant data, storing it in a predetermined portion (when registering mode) or comparing it with already stored data (recognition). In the case of a mode), the voice registration and recognition unit 1 that generates a predetermined signal includes an analog voice recognizer IC ₁ , a digital voice recognizer IC ₂ , and a 64K SRAM IC ₃ . The processing unit 9 which displays voice registration and recognition results as a result of sending and receiving a predetermined signal with (1) and generates a constant control signal to the storage unit 7 is composed of a first microcomputer 8 and an indicator 3. .

이와 같이 구성된 본 발명을 제5도를 참조해서 보면 마이크(M)로 일정 단어를 발음하면 저장부(7)의 중폭기(5)에서 증폭하여 제1처리부(9)의 제1마이컴(8)에 따라 음성합성기(IC₄)에서 디지탈 신호로 변환하여 메모리(6)에 저장함과 동시에 첫번째 발음으로 음성신호를 발생한다(S₂₀-S₂₂).Referring to FIG. 5 of the present invention configured as described above, when a certain word is pronounced by the microphone M, the first microcomputer 8 of the first processing unit 9 is amplified by the amplification unit 5 of the storage unit 7. The voice synthesizer IC ₄ converts the digital signal into a digital signal, stores the same in the memory 6, and generates a voice signal with the first pronunciation (S ₂₀ -S ₂₂ ).

또한, 계속해서 음성합성기(IC₄)에서 두번째 발음으로 음성신호를 발생하며 상기 두 발음이 유사하면 세번째 발음으로 음성신호를 발생하게 되어 결과적으로 세번 발음하게 되는 것이다.(S₂₃-S₂₇).In addition, the voice synthesizer IC ₄ continuously generates a voice signal with the second pronunciation, and if the two pronunciations are similar, the voice signal is generated with the third pronunciation, resulting in three pronunciations (S ₂₃ -S ₂₇ ).

본 발명에서는 음싱인식기(IC₁,IC₂)가 스펙특성상 3번의 발음을 입력하여 비교해야 하므로, 동일발음을 세번 발생하는 것이며, 음성합성기(IC₄)에 의해 재생되는 두번째, 세번째 발음이 노이즈 등에 의해 첫번째 입력된 발음과 달라질 수 있으므로 첫번째 발음과 두번째 및 세번째 발음을 비교하는 것이다.In the present invention, since the sounding recognizers IC ₁ and IC ₂ have to input three phonetic pronunciations for comparison, the same sound is generated three times, and the second and third pronunciations reproduced by the voice synthesizer IC ₄ are noise and the like. The first pronunciation is compared with the second and third pronunciation because it may differ from the first input pronunciation.

그리고 음성등록 및 인식부(1)의 아날로그 음성인식기(IC₁)는 상기 세개의 발음을 받아 유사하면 디지탈 음성인식기(IC₂)를 통해 64D SRAM(IC₃)에 음성데이타중 특징있는 부분의 데이타를 일정데이타로 바꾸어 저장하므로 한 단어의 등록이 완료된다(S₂₈-S₃₃).The analog voice recognizer IC ₁ of the voice register and recognizer ₁ receives the three pronunciations, and if similar, data of the characteristic part of the voice data in the 64D SRAM IC ₃ through the digital voice recognizer IC ₂ . Since the data is converted into schedule data and stored, registration of one word is completed (S ₂₈ -S ₃₃ ).

또한, 제6도를 참조해서 보면 마이크(M)로 일정단어를 발음하면 저장부(7)의 증폭기(5)에서 증폭되고 음성합성기(IC₂)에서 64K SRAM(IC₃)에 등록되어 있는 단어들과 특징을 비교한다(S₃₄)(S₃₅).In addition, referring to FIG. 6, when a certain word is pronounced by the microphone M, the word is amplified by the amplifier 5 of the storage unit 7 and registered in the 64K SRAM IC _{3 of the} voice synthesizer IC ₂ . Compare the features with the features (S ₃₄ ) (S ₃₅ ).

이어서 음성등록 및 인식부(1)의 아날로그 음성인식기(IC₁)에서 음성데이타의 특징을 추출해서 디지탈 음성인식기(IC₂)에서 64K SRAM(IC₃)에 등록되어 있는 단어들과 특징을 비교한다.(S₃₄)(S₃₅).Subsequently, features of voice data are extracted from the analog voice recognizer IC ₁ of the voice register and recognizer ₁ , and the features are compared with words registered in the 64K SRAM IC _{3 of} the digital voice recognizer IC ₂ . (S ₃₄ ) (S ₃₅ ).

그리고, 상기 비교후에 가장 유사한 단어의 번호를 처리부(9)의 제1마이컴(8)으로 전송하며 표시기(3)에서 인식결과를 표시하므로서 인식이 완료된다(S₃₆-S₃₈).After the comparison, the number of the most similar words is transmitted to the first microcomputer 8 of the processing unit 9, and the recognition is completed by displaying the recognition result on the display 3 (S _36- S ₃₈ ).

이상에서 설명한 바와 같이 본 발명은 음성합성기(IC₄) 및 메모리(6)를 사용함으로써 한번의 발음으로 음성등록을 할 수 있다.As described above, according to the present invention, voice registration can be performed by one pronunciation by using the voice synthesizer IC ₄ and the memory 6.

따라서 종래의 아날로그 음성인식기와 디지탈 음성인식기를 사용하면서도 음성등록 및 인식시 3번 발음해야 하는 번거로움을 피하여 간단히 음성등록을 할 수 있는 효과가 있다.Therefore, while using a conventional analog voice recognizer and a digital voice recognizer, there is an effect that can simply register the voice to avoid the hassle of having to pronounce three times during voice registration and recognition.

Claims

Amplifying the input voice signal, converting the digital signal into a digital signal, and storing the converted first voice signal and simultaneously converting the stored voice signal into the original voice signal and generating the second voice signal. Converting the signal into an original voice signal to generate a third pronunciation, and if all of the first to third pronunciations are similar, converting and storing the characteristic part of the voice signal into digital data and indicating that the voice is registered. A voice registration method of a voice recognition device.

Amplify the voice signal of the microphone (M) with the amplifier (5), convert the amplified signal into a digital signal through the voice synthesizer (IC ₄ ), and stores and reproduces the converted signal in the memory (6) in accordance with the control signal The storage unit 7 which generates a voice signal, and digitally converts the signals of the storage unit 7 through the analog voice recognizer IC ₁ and the digital voice recognizer IC ₂ , and extracts the feature data of the voice from the digital signal. and by storing the 64K SRAM (IC _3), and the voice registration, voice recognition as compared to through the digital speech recognizer (IC ₂₎ for data to be applied is stored in the 64K SRAM (IC ₃₎ data from an analog speech recognizer (IC ₁₎ The first microcomputer 8 inputs the control signal to the voice registration and recognition unit 1 and the storage unit 7 during voice registration, and outputs data of the voice registration and recognition unit 1 during voice recognition. And a processor 9 for displaying the recognition result through the indicator 3. A voice recognition device, characterized in that.