KR19980017116A - Driver's voice signal section detection device and method

Info

Publication number: KR19980017116A
Application number: KR1019960036868A
Authority: KR
Inventors: 심갑종
Original assignee: 박병재; 현대자동차주식회사
Priority date: 1996-08-30
Filing date: 1996-08-30
Publication date: 1998-06-05
Also published as: KR100262576B1

Abstract

The present invention relates to an apparatus and method for detecting a driver's voice signal. The apparatus comprises: a voice signal output unit that receives the driver's voice and outputs a voice signal; an amplifier that amplifies and outputs the driver's voice signal; an analog/digital converter that converts the amplified voice signal into a digital signal and outputs it; a learning-word voice signal storage unit that outputs learning-word voice signals, stored as data from words spoken repeatedly by several people; a control unit that separates the voice signal into voice energy and voice-signal zero-crossing rate, computes the voice signal section according to a set formula, compares it with a set minimum reference signal value to distinguish voice from noise, obtains the deviation between the voice signal and the learning-word voice stored in the storage unit, and outputs the control signal corresponding to the voice signal if the deviation is at or below a set reference value; and a driving unit through which the various devices of the vehicle are driven according to the control signal output from the control unit. When the driver issues a voice command, the driver's voice signal is detected accurately even in a noisy environment, so the driving unit operates correctly without malfunction.

Description

Driver's voice signal section detection device and method

FIG. 1 is a waveform diagram of conventional driver voice signal section detection;

FIG. 2 is a block diagram of a driver voice signal section detection device according to an embodiment of the invention;

FIG. 3 is a waveform diagram of the driver's voice signal detected in an idling vehicle according to an embodiment of the invention;

FIG. 4 is a waveform diagram of the driver's voice signal in a moving vehicle according to an embodiment of the invention; and

FIG. 5 is a flowchart of a driver voice signal section detection method according to an embodiment of the invention.

The present invention relates to a device and method for detecting the driver's voice signal section, and more particularly to a voice signal section detection device and method that, when the driver operates the vehicle's driving devices by voice, remove the noise mixed into the voice signal and detect the driver's voice accurately.

In general, a voice device used in a car is one that recognizes the driver's voice so that, while driving, the driver can operate the various convenience functions fitted to the car without using hands or eyes. This improves convenience and stability while driving.

At present, speech recognition technology in Korea mostly targets recognition in quiet environments, and research on speech recognition in the noise generated while a vehicle is being driven is lacking.

Hereinafter, a conventional voice signal detection method is described with reference to the accompanying drawings.

FIG. 1 is a waveform diagram of conventional driver voice signal section detection.

As shown in FIG. 1, the driver's voice is converted by a microphone or the like into an electrical signal, and this continuous waveform goes through a preprocessing stage that converts it into a form suitable and useful for the speech recognition device. This preprocessing includes voice section detection, which separates the voice signal from the silent sections.

The voice signal obtained by voice section detection passes through feature extraction, which computes feature parameters representing the characteristics of the speech for each section of about 10 ms. Such a section is called a frame.

Conventionally, when the driver's voice is input while the car is idling or being driven in order to operate the various convenience functions, the driver's voice section has been determined from two feature parameters computed within each frame of the voice signal, which is processed frame by frame: the voice signal energy, and the zero-crossing rate, which represents the number of times the voice signal crosses the reference point 0.
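The two conventional per-frame features can be sketched as follows. This is a minimal illustration, not the patent's implementation: the function names, the non-overlapping framing, and the use of a raw crossing count rather than a normalized rate are assumptions. A 10 ms frame would be 80 samples at a hypothetical 8 kHz sampling rate, which the text does not specify.

```python
def frame_signal(samples, frame_len):
    """Split samples into consecutive non-overlapping frames (incomplete tail dropped)."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, frame_len)]


def frame_energy(frame):
    """Short-time energy: sum of squared sample values in the frame."""
    return sum(x * x for x in frame)


def zero_crossing_count(frame):
    """Number of times the signal changes sign between neighbouring samples."""
    return sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
```

A frame of steady tone gives high energy and few crossings, while broadband noise tends to give many crossings, which is why the two features are used together.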

However, although the conventional technique detects the driver's voice signal section accurately in a quiet environment where the car is not running, it fails when the car is idling or being driven. Noise from the engine and the vibration of the window glass mix into the driver's voice signal, so that noise is heavily distributed within the driver's voice section and an accurate driver voice section is not detected.

In addition, when the distance between the driver's speaking position and the microphone that picks up the voice is large in a noisy environment, inaccurate detection of the driver's voice signal section sharply degrades performance, causing the car's various convenience functions to malfunction.

Accordingly, the object of the invention is to solve the conventional problems described above. When a car in its normal driving state is operated according to the driver's voice, the vehicle noise has a constant frequency distribution. Building on this, the invention places a fixed reference value on the difference in frequency distribution between frames and uses the spectral change in the frequency domain: for every frame of the voice signal it obtains the cepstrum and computes the ISD (interframe spectral distance), the distance between that cepstrum and the cepstrum of the adjacent frame. The aim is to provide a driver voice signal detection device and method that detect the driver's voice frames accurately even in a noisy environment.
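The ISD described above can be sketched as the distance between the cepstral vectors of neighbouring frames. The text does not state which distance measure is used, so the Euclidean distance here is an assumption, as are the function name and the representation of each cepstrum as a list of coefficients computed elsewhere.

```python
import math


def interframe_spectral_distance(cepstra):
    """Distance between each frame's cepstral vector and the previous frame's.

    cepstra: list of equal-length coefficient lists, one per frame.
    Returns len(cepstra) - 1 distances (Euclidean, by assumption).
    """
    return [math.dist(prev, cur) for prev, cur in zip(cepstra, cepstra[1:])]
```

Stationary noise with a constant frequency distribution should give small distances between adjacent frames, while the onset of speech changes the spectrum and raises the distance.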

As a means of achieving the above object, the invention comprises: a voice signal output unit that receives the driver's voice and outputs a voice signal; an amplifier that amplifies and outputs the driver's voice signal output from the voice signal output unit; an analog/digital converter that converts the amplified voice signal into a digital signal and outputs it; a learning-word voice signal storage unit that outputs learning-word voice signals, stored as data from words spoken repeatedly by several people; a control unit that separates the voice signal into voice energy and voice-signal zero-crossing rate, computes the voice signal section according to a set formula, compares it with a set minimum reference signal value to distinguish voice from noise, obtains the deviation between the voice signal and the learning-word voice stored in the storage unit, and outputs the control signal corresponding to the voice signal if the deviation is at or below a set reference value; and a driving unit through which the various devices of the vehicle are driven according to the control signal output from the control unit.

As a further means of achieving the above object, another configuration of the invention comprises the steps of: inputting the driver's voice when power is applied, and reading it separately as voice energy and voice-signal zero-crossing rate; computing a first voice signal deviation value from the voice energy and zero-crossing rate read in the preceding step according to a set formula, detecting the corresponding voice signal frames of the driver, and selecting an arbitrary voice signal frame; comparing the selected voice signal frame with a set minimum reference value; when the selected voice signal frame value is at or above the set minimum reference signal value, inputting a learning-word voice signal and computing it with the selected voice signal frame value according to a set arithmetic expression to obtain the deviation of a second voice signal; comparing whether the computed voice signal deviation value is at or below a set reference signal value; when the voice signal deviation value is at or below the set reference signal value, outputting the control signal corresponding to the detected voice signal frame; and, when the value of the selected voice signal frame stays at or below the set minimum reference signal value for a set time or longer, determining that the voice signal frame has ended and returning to the initial step.

With the above configuration, the most preferred embodiment, by which a person of ordinary skill in the art to which the invention pertains can readily practice the invention, is described in detail with reference to the accompanying drawings.

FIG. 2 is a block diagram of a driver voice signal frame detection device according to an embodiment of the invention, FIG. 3 shows the driver voice signal frame detection waveforms in an idling vehicle according to the embodiment, FIG. 4 shows the driver voice signal detection waveforms in a moving vehicle according to the embodiment, and FIG. 5 is a flowchart of a driver voice signal frame detection method according to the embodiment.

As shown in FIG. 2, the driver voice signal frame detection device according to the embodiment of the invention comprises: a voice signal output unit (10) that receives the driver's voice and outputs a voice signal; an amplifier (20) that amplifies and outputs that voice signal; an analog/digital converter (30) that converts the amplified voice signal into a digital signal and outputs it; a learning-word voice signal storage unit (50) that outputs to the control unit the stored learning-word voice signals, learned from the voices of several people repeated many times; a control unit (40) that separates the voice signal into voice energy and voice-signal zero-crossing rate, computes the voice signal section according to a set formula, compares it with a set minimum reference signal value to distinguish voice from noise, obtains the deviation between the voice signal and the learning-word voice stored in the storage unit, and outputs the control signal corresponding to the voice signal if the deviation is at or below a set reference value; and a driving unit (60) through which the various devices of the vehicle are driven according to the control signal output from the control unit (40).

The driver voice signal frame detection method according to the embodiment of the invention comprises: inputting the driver's voice when power is applied (S100, S110); reading the input driver's voice separately as voice energy and voice-signal zero-crossing rate (S120); computing a voice signal frame deviation value from the voice energy and zero-crossing rate read in step S120 according to a set arithmetic expression, and detecting the corresponding first voice signal frame of the driver (S130); selecting an arbitrary voice signal frame from the voice signal frames detected in step S130 (S140); comparing the value of the voice signal frame selected in step S140 with a set minimum reference signal frame value (S150); inputting a learning-word voice signal when the comparison in step S150 shows that the selected voice signal frame value is at or above the set minimum reference signal frame value (S160); computing the deviation of a second voice signal from the learning-word voice signal value input in step S160 and the voice signal frame value selected in step S140 according to a set arithmetic expression (S170); comparing whether the voice signal frame deviation value computed in step S170 is at or below a set reference signal frame value (S180); outputting the control signal corresponding to the detected voice signal frame when the comparison in step S180 shows that the deviation value is at or below the set reference signal frame value (S190); determining that the voice signal frame has ended when the value of the voice signal frame selected in step S140 stays at or below the set minimum reference signal value for a set time or longer (S200), and returning to the initial step (S210); when the comparison in step S150 shows that the selected voice signal frame value is at or below the set minimum reference signal value, checking whether it stays at or below the set minimum reference signal frame value for the set time (S200); when the value of the voice signal frame selected in step S140 stays at or below the set minimum reference signal value for less than the set time, determining that the detected voice signal frame is still being input and selecting an arbitrary voice frame (S140); and, when the comparison in step S180 shows that the voice signal deviation value is at or above the set reference signal value, determining that the signal is noise and returning to the initial step (S210).

With the above configuration, the driver voice signal detection device according to the embodiment of the invention operates as follows.

When power is applied for driving and the driver issues a voice command to operate the driving unit (60) of the car, the voice signal output unit (10) receives the driver's voice and outputs an electrical signal. This voice signal is input to the amplifier (20), amplified to a set level, and output to the analog/digital converter (30).

The voice signal output to the analog/digital converter (30) is converted into a digital signal and output to the control unit (40).

The control unit (40) receives the driver's voice converted into a digital signal (S110) and reads the driver's voice signal separately as voice energy frames and zero-crossing rate frames, the zero-crossing rate being the number of times the voice signal crosses the zero point (S120).

The arithmetic applied to the voice energy frames and zero-crossing rate frames read above is shown in FIG. 3: with the car idling, subtracting the voice-signal zero-crossing rate waveform frame (b) from the voice energy waveform frame (a) yields the first voice signal frame detection waveform (c) for the idling state.

The driver's voice signal frames while the car is being driven, shown in FIG. 4, are detected in the same way: subtracting the voice-signal zero-crossing rate frame (b) from the voice energy waveform frame (a) yields the driver's first voice signal frame detection waveform (c).
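The subtraction of waveform (b) from waveform (a) can be sketched per frame as below. The function name is hypothetical, and the sketch assumes both sequences have already been brought to comparable scales, since the text does not say how the two quantities are normalized before subtraction.

```python
def detection_waveform(energy, zcr):
    """Per-frame detection value (c) = voice energy (a) - zero-crossing rate (b)."""
    if len(energy) != len(zcr):
        raise ValueError("energy and zcr must cover the same frames")
    return [e - z for e, z in zip(energy, zcr)]
```

Frames dominated by speech (high energy, relatively low crossing rate) come out high in (c), and frames dominated by noise come out low.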

Once the voice signal frame deviation value has been computed by the set arithmetic expression as above and the corresponding first voice signal frame of the driver has been detected, the average ISD over about 5 frames is computed from the detected first voice signal frame and compared with the set minimum reference signal value (S150). If the average ISD is at or above the set minimum reference signal value, the learning-word voice signal is input (S160).
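The comparison in step S150 can be sketched as an average over a short window of ISD values followed by a threshold test. The function names, the `start` index, and the handling of a window shorter than 5 frames are assumptions made for illustration; only the 5-frame span and the "at or above" comparison come from the text.

```python
def mean_isd(isd, start, span=5):
    """Average ISD over up to `span` frames beginning at index `start`."""
    window = isd[start:start + span]
    if not window:
        raise ValueError("no ISD values available at this position")
    return sum(window) / len(window)


def is_voice(isd, start, min_reference, span=5):
    """True when the averaged ISD is at or above the minimum reference value."""
    return mean_isd(isd, start, span) >= min_reference
```

Averaging over several frames keeps a single noisy frame from triggering or suppressing the learning-word comparison on its own.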

The deviation of the second voice signal is computed from the learning-word voice signal value input above and the first voice signal frame value detected above, according to the set arithmetic expression (S170).

That is, learning-word voice signal value - first voice signal frame value = second voice frame value.

The deviation of the second voice signal frame value obtained above is then compared with the set reference value. If the second voice signal frame value is judged to be smaller than the set reference value, the control signal corresponding to the detected first voice signal section is output to the driving unit (60) (S190).
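Steps S170 to S190 can be sketched with scalar values as below. The function names are hypothetical. Two points are assumptions rather than statements from the text: the comparison uses the magnitude of the deviation, and it uses "at or below", following the wording of step S180.

```python
def second_deviation(learning_value, first_frame_value):
    """Second voice frame value = learning-word value - first voice frame value."""
    return learning_value - first_frame_value


def matches_learning_word(learning_value, first_frame_value, reference):
    """True when the deviation is small enough to output the control signal.

    Assumption: the magnitude of the deviation is compared, so a frame value
    far above the learning-word value does not count as a match.
    """
    return abs(second_deviation(learning_value, first_frame_value)) <= reference
```

When the test fails, the flow treats the input as noise and returns to the initial step instead of driving the unit.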

After the control signal corresponding to the driver's voice signal has been output, it must be checked whether a further voice signal from the driver is present.

Therefore, if the value of the selected voice signal section compared in step S150 stays at or below the set minimum reference signal value for the set time of 0.5 ms or longer, it is determined that the driver's voice is absent (S200), and the process returns to the initial step (S210).

However, if the value of the selected voice signal section compared in step S200 stays at or below the set minimum reference signal value for 0.5 ms or less, it is determined that the driver's voice signal is still being input, and the process returns to step S140, which selects an arbitrary voice signal frame from the voice signal frames detected in step S130.
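The end-of-voice decision in step S200 can be sketched as a running count of how long the frame value has stayed at or below the minimum reference. The function name and the parameters `frame_ms` and `hold_ms` are hypothetical; the text gives 0.5 ms as the set time, and the sketch leaves both durations as arguments rather than fixing them.

```python
def voice_ended(frame_values, min_reference, frame_ms, hold_ms):
    """True once values stay at or below min_reference for hold_ms or longer."""
    quiet_ms = 0.0
    for value in frame_values:
        if value <= min_reference:
            quiet_ms += frame_ms
            if quiet_ms >= hold_ms:
                return True  # S200: voice absent, return to the initial step
        else:
            quiet_ms = 0.0  # voice resumed before the set time elapsed
    return False  # voice still being input, go back to frame selection (S140)
```

A short dip below the reference resets as soon as the value rises again, so brief pauses inside a command are not mistaken for its end.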

As described above, in the embodiment of the invention, when the driver issues a voice command, the driver's voice signal is read over a given number of frames separately as voice energy and zero-crossing count and then computed. This provides a driver voice signal detection device and method capable of detecting the driver's voice signal frames accurately and robustly against vehicle driving noise.

Claims

Voice signal output means for receiving a driver's voice and outputting a voice signal;

Amplifying means for amplifying and outputting the driver's voice signal output from the voice signal output means;

Analog / digital conversion means for converting and outputting the driver's voice signal amplified and output as a digital signal;

Learning word speech signal storage means for outputting a learning word speech signal stored as data by repeating the word several times with voices of several persons;

Control means for separating the voice signal output above into voice energy and voice-signal zero-crossing rate, computing the voice signal section according to a set formula, comparing it with a set minimum reference signal value to distinguish voice from noise, obtaining the deviation between the voice signal and the learning-word voice stored in the storage means, and outputting the control signal corresponding to the voice signal when the deviation is at or below a set reference value;

And a driving means for driving various devices of the vehicle according to the control signal output from the control means.

The device of claim 1, wherein the control means,

when power is applied, receives the driver's voice, reads it separately as voice energy and zero-crossing count, computes according to the set arithmetic expression to detect the driver's voice signal accurately, and compares it with the set minimum reference value; when the signal value of the selected voice frame is at or above that value, inputs the learning-word voice signal, computes it with the detected voice signal according to the arithmetic expression, and outputs a control signal when the result is at or below the set value; the driver's voice signal detection device being so characterized.

Inputting a driver's voice and reading the driver's voice by dividing the voice energy and the voice signal zero crossing rate;

Calculating the first speech signal deviation value by calculating the speech energy and the zero crossing rate of the speech signal according to the set arithmetic expression, detecting the speech signal frame of the driver corresponding thereto, and selecting an arbitrary speech signal frame;

Comparing the voice signal frame selected in the step with a set lowest reference value;

Calculating a deviation of the second speech signal by inputting a learning word speech signal and calculating the speech signal frame value according to a set arithmetic expression if the selected speech signal frame value is equal to or greater than a set minimum reference signal value in the comparing step;

Comparing whether the voice signal deviation value calculated in the step is equal to or less than a set reference signal value;

Outputting a control signal corresponding to a detected voice signal frame when the voice signal deviation value is equal to or less than a set reference signal value in the comparing step;

And determining the end of the voice signal frame and returning to the initial step when the value of the voice signal frame selected in the step stays at or below the set minimum reference signal value for a set time or longer.

The method of claim 3,

And determining that there is no driver's voice signal when the detected signal is less than or equal to a predetermined reference value, and checking whether the driver's voice signal is input again.

The method of claim 4, wherein

And determining that there is no driver's voice signal when the detected signal is lower than or equal to a preset reference value and longer than a preset time, and returning to an initial step.

The method of claim 4, wherein

And detecting the driver's voice and returning to selecting an arbitrary voice signal frame when the detected signal is less than or equal to the preset reference value and the driver's voice is input within the preset time.

The method of claim 3,

And determining that the detected voice signal and the learning-word voice signal are different voice signals and returning to the initial step when the deviation between the calculated voice signal value and the learning-word voice signal value is greater than or equal to the set reference value.

The method of claim 3,

And detecting a driver's voice and returning to selecting an arbitrary voice signal frame when the driver's voice is input within a preset time when the detected signal is equal to or less than a preset reference value.