KR20120098383A

KR20120098383A - Apparatus and method diagnosing health using voice

Info

Publication number: KR20120098383A
Application number: KR1020110091316A
Authority: KR
Inventors: 오희진; 김영태; 심환
Original assignee: 삼성전자주식회사
Priority date: 2011-02-28
Filing date: 2011-09-08
Publication date: 2012-09-05
Also published as: KR101894390B1

Abstract

PURPOSE: A health status diagnosis apparatus and method thereof are provided to automatically diagnose health status using voice in a communication process. CONSTITUTION: A voice sensing unit(110) senses voice by monitoring the voice. A voice analysis unit(120) extracts voice characteristics according to a health status from the voice sensed by the voice sensing unit. A voice diagnosis unit(130) diagnoses the health status by comparing a diseased condition standard with the voice characteristic extracted from the voice analysis unit. The voice diagnosis unit monitors a health change status. A diagnosis output unit(140) outputs signals for the health status and the health change status information. [Reference numerals] (110) Voice sensing unit; (120) Voice analysis unit; (130) Voice diagnosis unit; (140) Diagnosis output unit

Description

Apparatus and method diagnosing health using voice}

본 발명은 신체 장기의 건강 상태 진단 장치 및 방법에 관한 것이며, 특히 음성을 이용하여 신체 장기의 건강 상태를 진단하는 장치 및 방법에 관한 것이다.The present invention relates to an apparatus and method for diagnosing the state of health of body organs, and more particularly, to an apparatus and method for diagnosing the state of health of body organs using voice.

최근 대기오염 및 제한된 공간에서의 활동, 이동 전화기의 사용 증가 등으로 인해 후두의 통증 및 목소리의 변화를 호소하는 사례가 증가하고 있다. 후두 질환은 조기에 치료하지 않아 악성으로 전이될 경우, 수술로 인한 절제 및 정상적인 음성 회복이 불가능 할 수 있어, 언어 발성을 통한 의사 전달에 큰 어려움을 겪을 수 있다. 따라서 후두 질환의 조기 진단은, 악성 종양으로의 전이를 예방하고 후두의 기능 보존하며, 치료의 성공율 및 환자의 생존율을 높이는데 있어 매우 중요한 요소라 할 수 있다. Recently, there have been increasing cases of complaints of laryngeal pain and voice changes due to air pollution, limited space activities, and increased use of mobile phones. Laryngeal disease can be difficult to communicate through speech utterance, if it is not treated early, and then metastasize to malignant, it is impossible to recover from surgery and restore normal voice. Therefore, early diagnosis of laryngeal disease is a very important factor in preventing metastasis to malignant tumors, preserving the function of the larynx, and increasing the success rate of treatment and the survival rate of patients.

음성으로 진단 가능한 질병의 범위는 점차 넓어지고 있으며, 또 다른 음성 진단의 예로 파킨슨병을 들 수 있다. 파킨슨병은 운동(movement)를 조절하는 뇌 부위에서 분비되는 신경 전달 물질 도파민 생산 세포가 파괴되어 나타나는 질환이기 때문에 근육 경직, 몸 떨림, 느린 동작, 균형 상실 등의 증세로 진단할 수 있으나, 이러한 증상으로 진단되었을 때는 이미 도파민 생산 세포가 많이 파괴된 이후이기 때문에, 도파민 생산 세포의 파괴를 막아 높은 치료 효과를 갖기 위해, 조기 진단이 매우 중요한 요소라 할 수 있다. 이러한 조기 진단을 위해 사람의 귀로는 구분하기 어려운 음성을 패턴을 분석함으로써, 중추 신경계 질환인 파킨슨병을 조기 진단할 수 있다는 연구 결과가 발표되고 있으며, 앞으로도 음성을 이용한 건강 상태의 진단 범위가 점차 넓어 질 수 있다. The range of negatively diagnosed diseases is widening, and another example of negative diagnosis is Parkinson's disease. Parkinson's disease is caused by the destruction of the neurotransmitter dopamine-producing cells secreted by the brain regions that control movement, which can be diagnosed with muscle stiffness, tremor, slow motion, and loss of balance. When it is diagnosed as a dopamine-producing cells are already destroyed a lot, to prevent the destruction of dopamine-producing cells to have a high therapeutic effect, early diagnosis is a very important factor. For this early diagnosis, research results suggest that Parkinson's disease, which is a central nervous system disease, can be diagnosed early by analyzing patterns that are difficult to distinguish with the human ear. Can lose.

따라서 음성을 이용하여 신체 장기의 건강 상태를 진단할 수 있는 방법 및 장치를 제공할 필요가 있다. Therefore, there is a need to provide a method and apparatus for diagnosing the state of health of body organs using voice.

본 발명이 해결하고자 하는 과제는 사용자의 음성을 이용하여 신체 장기의 건강 상태를 확인하고 진단할 수 있는 신체 장기의 건강 상태 진단 장치 및 방법을 제공하는 데 있다.
An object of the present invention is to provide an apparatus and method for diagnosing a state of health of body organs, which can identify and diagnose a state of health of body organs using a voice of a user.

상기의 과제를 해결하기 위하여, 상기의 과제를 해결하기 위하여, 본 발명의 일 실시예에 따른 건강 상태 진단 장치는, In order to solve the above problems, in order to solve the above problems, a medical condition diagnostic apparatus according to an embodiment of the present invention,

음성을 모니터링 하여 음성을 감지하는 음성 감지부;A voice detector for detecting a voice by monitoring the voice;

상기 음성 감지부에서 감지된 음성으로부터 진단할 건강 상태 종류별로 음성 특징을 추출하는 음성 분석부;A voice analyzer extracting a voice feature for each type of health condition to be diagnosed from the voice detected by the voice detector;

상기 음성 분석부로 부터 추출된 음성 특징과 병적 상태 기준을 비교하여 건강 상태를 진단하고 건강 변화 상태를 모니터링 하는 음성 진단부;A voice diagnosis unit for diagnosing a state of health and monitoring a state of health change by comparing a voice feature extracted from the voice analyzer with a pathological state criterion;

상기 음성 진단부에서 진단된 건강 상태 및 건강 변화 상태 정보를 정해진 형태의 신호로 출력하는 진단 출력부를 포함한다.And a diagnosis output unit configured to output the health state and health change state information diagnosed by the voice diagnosis unit as a predetermined type of signal.

상기 음성 감지부는 사용자의 음성 변화 상태를 모니터링 하는 것이 바람직한다.Preferably, the voice detector monitors a voice change state of the user.

상기 음성 감지부는 입력되는 아날로그 신호를 디지털 신호로 변환하는 아날로그 디지털 변환부; 및 The voice detector may include an analog to digital converter for converting an input analog signal into a digital signal; And

상기 아날로그 디지털 변환부에서 변환된 디지털 신호로부터 음성을 판별하는 음성 판별부를 구비하는 것이 바람직하다.It is preferable to include a voice discrimination unit for discriminating the voice from the digital signal converted by the analog-digital converter.

상기 음성 분석부는 The voice analysis unit

상기 음성 감지부에서 감지된 음성에 대해 건강 상태 타입을 결정하는 건강 상태 선택부;A health state selector configured to determine a health state type with respect to the voice detected by the voice detector;

상기 건강 상태 선택부에서 선택된 건강 상태 타입에 따라 진단을 위한 음성 구간을 선택하는 음성 구간 선택부;A voice section selector configured to select a voice section for diagnosis according to the health condition type selected by the health condition selector;

상기 건강 상태 선택부에서 선택된 건강 상태 타입에 해당하는 음성 특징을 선택하는 음성 특징 선택부; 및 A voice feature selector which selects a voice feature corresponding to the health state type selected by the health state selector; And

상기 특징 선택부에서 선택된 음성 특징에 따라 입력되는 사용자 음성으로부터 상기 음성 특징을 추출하는 음성 특징 추출부를 구비하는 것이 바람직하다.Preferably, a voice feature extracting unit extracts the voice feature from a user voice input according to the voice feature selected by the feature selector.

상기 음성 특징 추출부는 정상 음성과 병적 음성을 구분하기 위해 정상 음성 데이터 베이스와 병적 음성 데이터 베이스를 구비하는 것이 바람직하다.Preferably, the speech feature extractor includes a normal speech database and a pathological speech database to distinguish between normal speech and pathological speech.

상기 음성 진단부는 The voice diagnosis unit

상기 음성 분석부에서 추출된 음성 특징과 병적 진단 기준을 비교하는 비교부;A comparison unit comparing the voice feature extracted from the voice analysis unit with a pathological diagnosis criterion;

시간에 따른 사용자의 음성 상태 변화를 감지하는 모니터링부; 및A monitoring unit for detecting a change in the voice state of the user over time; And

상기 비교부의 비교 결과 및 상기 모니터링부의 음성 변화에 따라 신체 장기의 건강 상태를 진단하는 진단 처리부를 구비하는 것이 바람직하다.It is preferable to include a diagnostic processing unit for diagnosing the state of health of body organs according to the comparison result of the comparison unit and the voice change of the monitoring unit.

상기 진단 처리부에서 진단된 진단 결과 및 진단 시간 정보를 저장하고 갱신하는 저장부를 더 구비하는 것이 바람직하다.It is preferable to further include a storage unit for storing and updating the diagnostic result and the diagnostic time information diagnosed by the diagnostic processing unit.

상기 진단 출력부는 The diagnostic output unit

상기 음성 진단부에서 진단된 건강 상태 및 건강 변화 상태 정보를 오디오 및 비디오 신호로 변환하는 디지털 신호 처리부;A digital signal processor converting the health state and health change state information diagnosed by the voice diagnosis unit into audio and video signals;

상기 디지털 신호 처리부에서 처리된 비디오 신호를 출력하는 디스플레이부; 및 A display unit for outputting a video signal processed by the digital signal processor; And

상기 디지털 신호 처리부에서 처리된 오디오 신호를 출력하는 스피커부를 구비하는 것이 바람직하다.It is preferable to include a speaker unit for outputting the audio signal processed by the digital signal processing unit.

상기의 다른 과제를 해결하기 위하여, 본 발명의 일실시예에 따른 건강 상태 진단 방법은, 음성을 감지하여 수집하는 과정;In order to solve the above other problem, the medical condition diagnosis method according to an embodiment of the present invention, the process of detecting and collecting voice;

상기 수집된 음성으로부터 진단 건강 상태 종류별로 음성 특징을 추출하는 과정;Extracting voice features from the collected voices according to types of diagnostic health conditions;

상기 음성 특징과 병적 상태 기준을 비교하여 건강 상태를 진단하고 건강 변화 상태를 모니터링 하는 과정;Diagnosing a health condition and monitoring a health change state by comparing the voice characteristic with a pathological condition criterion;

상기 진단된 건강 상태 및 건강 변화 상태 정보를 정해진 형태의 알림 신호로 출력하는 과정을 포함한다.And outputting the diagnosed health state and health change state information as a notification signal of a predetermined form.

상기 음성 수집하는 과정은The voice collecting process

입력되는 음성을 디지털 음성 신호로 변환하는 과정;Converting the input voice into a digital voice signal;

상기 변환된 디지털 음성 신호로부터 음성을 판별하는 과정을 구비하는 것이바람직하다.It is desirable to have a process for discriminating speech from the converted digital speech signal.

상기 건강 상태 특징을 추출하는 과정은The process of extracting the state of health features

상기 수집된 음성에 대해 건강 상태 타입을 결정하는 과정;Determining a health state type for the collected voices;

상기 결정된 건강 상태 타입에 따라 진단을 위한 음성 구간을 선택하는 과정;Selecting a voice section for diagnosis according to the determined health state type;

상기 선택된 건강 상태 타입에 해당하는 음성 특징을 선택하는 과정;Selecting a voice characteristic corresponding to the selected health state type;

상기 선택된 음성 특징에 따라 상기 음성 구간으로부터 상기 음성 특징을 추출하는 과정을 구비하는 것이 바람직하다.And extracting the speech feature from the speech section according to the selected speech feature.

상기 건강 상태 진단 과정은 The medical condition diagnosis process

시간에 따른 사용자의 건강 상태 변화를 분석하는 과정;Analyzing a change in the health status of the user over time;

상기 시간에 따른 사용자의 건강 상태 변화에 따라 권고 메시지를 다르게 알리는 과정을 포함하는 것이 바람직하다.It is preferable to include a process of differently notifying the recommendation message according to the change in the health state of the user over time.

상기 건강 상태를 진단하고 건강 변화 상태를 모니터링 하는 과정은, The process of diagnosing the state of health and monitoring the state of health change,

상기 추출된 음성 특징과 병적 진단 기준을 비교하는 과정;Comparing the extracted voice features with pathological diagnostic criteria;

시간에 따른 사용자의 음성 변화를 감지하는 과정;Detecting a change in a user's voice over time;

상기 비교 결과 및 상기 음성 변화에 따라 신체 장기의 건강 상태를 진단하는 과정을 구비하는 것이 바람직하다.It is preferable to have a process for diagnosing the state of health of body organs according to the comparison result and the negative change.

상기 진단된 진단 결과 및 진단 시간 정보를 수집 및 저장하고 갱신하는 과정을 더 구비하는 것이 바람직하다.The method may further include collecting, storing, and updating the diagnosed diagnosis result and diagnosis time information.

상기 추출된 음성 특징과 병적 진단 기준을 비교하기 위해 병적 음성과 정상 음성을 위한 진단 기준을 도출하는 과정을 더 포함하는 것이 바람직하다.It is preferable to further include a step of deriving the diagnostic criteria for the pathological voice and the normal voice in order to compare the extracted voice features and the pathological diagnostic criteria.

상기 병적 음성과 정상 음성을 위한 진단 기준을 도출하는 과정은,Deriving the diagnostic criteria for the pathological voice and normal voice,

병적 음성 데이터 베이스 및 정상 음성 데이터 베이스를 구축하는 과정;Constructing a pathological speech database and a normal speech database;

상기 병적 음성 데이터 베이스 및 정상 음성 데이터 베이스에서 병적 음성 특징 및 정상 음성 특징을 추출하는 과정;Extracting pathological speech features and normal speech features from the pathological speech database and the normal speech database;

상기 병적 음성과 정상 음성 특징에 대한 비교 트레이닝을 수행하는 과정;Performing comparative training on the pathological and normal speech features;

상기 비교 트레이닝에 따라 병적 음성과 정상 음성을 위한 진단 기준을 도출하는 과정을 구비하는 것이 바람직하다.According to the comparative training, it is preferable to have a process for deriving diagnostic criteria for pathological voice and normal voice.

상기 출력 과정은 The output process

상기 진단된 건강 상태 및 건강 변화 상태 정보를 오디오 및 비디오 신호로 변환하는 과정;Converting the diagnosed health state and health change state information into audio and video signals;

상기 변환된 오디오 신호 및 비디오 신호를 출력하는 과정을 구비하는 것이 바람직하다.
It is preferable to have a process of outputting the converted audio signal and video signal.

도 1은 본 발명의 일 실시 예에 따른 건강 상태 진단 장치의 전체 블록도 이다.
도 2는 도 1의 음성 감지부의 일 실시예 이다.
도 3은 도 1의 음성 분석부의 일 실시예 이다.
도 4는 도 1의 음성 진단부의 일 실시예 이다.
도 5는 도 1의 진단 출력부의 일 실시예 이다.
도 6은 본 발명의 일 실시 예에 따른 건강 상태 진단 방법의 전체 흐름도 이다.
도 7은 본 발명의 일 실시 예에 따른 건강 상태 진단 방법의 상세 흐름도 이다.
도 8은 본 발명의 다른 실시 예에 따른 건강 상태 진단 방법의 상세 흐름도이다.
도 9a는 도 6의 병적 음성과 정상 음성을 위한 진단 기준을 도출하는 일 실시 예이다.
도 9b는 GMM을 이용하여 2개의 가우시안(gaussian)으로 모델링한 GMM 파라메터의 진단 기준을 도시한 것이다.
도 10은 본 발명에 따른 진단 기준을 이용한 자동 진단 방법의 흐름도 이다.
도 11은 본 발명의 다른 실시 예에 따른 핸드폰을 이용한 건강 상태 진단 방법의 흐름도 이다. 1 is a block diagram of an apparatus for diagnosing a state of health according to an embodiment of the present invention.
FIG. 2 is an embodiment of the voice detector of FIG. 1.
3 is an embodiment of the voice analyzer of FIG. 1.
4 is an embodiment of the negative diagnosis unit of FIG. 1.
5 is an embodiment of the diagnostic output unit of FIG. 1.
6 is a flowchart illustrating a method of diagnosing a state of health according to an embodiment of the present invention.
7 is a detailed flowchart of a medical condition diagnosis method according to an embodiment of the present invention.
8 is a detailed flowchart of a medical condition diagnosis method according to another embodiment of the present invention.
FIG. 9A illustrates an example of deriving diagnostic criteria for pathological and normal voices of FIG. 6.
FIG. 9B illustrates diagnostic criteria of GMM parameters modeled with two Gaussian using GMM.
10 is a flowchart of an automatic diagnostic method using diagnostic criteria according to the present invention.
11 is a flowchart illustrating a medical condition diagnosis method using a mobile phone according to another embodiment of the present invention.

이하 첨부된 도면을 참조로하여 본 발명의 바람직한 실시예를 설명하기로 한다. Hereinafter, exemplary embodiments of the present invention will be described with reference to the accompanying drawings.

도 1은 본 발명의 일 실시 예에 따른 건강 상태 진단 장치의 전체 블록도 이다. 1 is a block diagram of an apparatus for diagnosing a state of health according to an embodiment of the present invention.

본 발명의 건강 상태 진단 장치는 음성 입력이나 녹음 또는 통화가 가능한 전자기기에 해당한다. 일 실시 예로 음성 입력이나 녹음 또는 통화가 가능한 전자 기기는 TV, 핸드폰, 스마트폰, 모니터, VoIP(Voice over Internet Protocol )전화기, PMP(Portable Multimedia Player), PDA(Personal Digital Assistants), 노트북을 들 수 있으나, 이에 한정하지 않는다.The medical condition diagnosis apparatus of the present invention corresponds to an electronic device capable of voice input, recording, or making a call. For example, an electronic device capable of voice input, recording, or calling may be a TV, a mobile phone, a smartphone, a monitor, a Voice over Internet Protocol (VoIP) phone, a portable multimedia player (PMP), a personal digital assistant (PDA), or a notebook computer. However, it is not limited thereto.

도 1의 건강 상태 진단 장치는 음성 감지부(110), 음성 분석부(120), 음성 진단부(130), 진단 출력부(140)를 구비한다.The health state diagnosis apparatus of FIG. 1 includes a voice detector 110, a voice analyzer 120, a voice diagnosis unit 130, and a diagnosis output unit 140.

음성 감지부(110)는 사용자의 녹음 또는 통화를 통해 사용자의 음성을 모니터링하고 감지한다. 또한 음성 감지부(110)는 사용자가 말하는 순간(통화, 음성을 이용한 게임)을 감지할 수 있는 기능을 추가한다. The voice detector 110 monitors and detects the voice of the user through a recording or a call of the user. In addition, the voice detection unit 110 adds a function that can detect the moment the user speaks (call, game using voice).

이때 음성 감지부(110)는 사용자가 말하는 순간을 감지할 수 있는 기능을 포함하여, 사용자가 인식하지 못하는 상태에서도 사용자의 음성 변화 상태를 지속적으로 모니터링 해야 한다. In this case, the voice detector 110 may include a function of detecting a moment when the user speaks, and continuously monitors the voice change state of the user even when the user does not recognize the voice.

음성 분석부(120)는 음성 감지부(110)에서 감지된 음성으로부터 진단 건강 상태 종류별로 음성 특징을 추출한다. 이때 음성 분석부(120)는 진단의 효율성과 성능을 높일 수 있는 음성 특징을 선정하고, 의료 기기 대비 제한된 계산 량과 제한된 메모리를 가질 수 있는 전자 기기의 특성을 감안한 음성 특징을 추출한다.The voice analyzer 120 extracts a voice feature for each type of diagnosis health state from the voice detected by the voice detector 110. At this time, the voice analysis unit 120 selects a voice feature to increase the efficiency and performance of the diagnosis, and extracts the voice feature in consideration of the characteristics of the electronic device that can have a limited amount of calculation and limited memory compared to the medical device.

음성 진단부(130)는 음성 분석부(120)로 부터 추출된 건강 상태 특징과 병적 상태 기준을 비교하여 건강 상태를 진단하고 건강 변화 상태를 모니터링 한다. 또한 음성 진단부(130)는 진단 결과를 수집/갱신/저장하는 기능을 포함한다.The voice diagnosis unit 130 compares the health condition features extracted from the voice analyzer 120 with the pathological condition criteria to diagnose a health condition and monitor a health change state. In addition, the voice diagnosis unit 130 includes a function of collecting / update / save the diagnosis result.

진단 출력부(140)는 음성 진단부(130)에서 진단된 건강 상태 및 건강 변화 상태 정보를 정해진 신호로 변환하여 스피커나 모니터로 출력한다. 이때 정해진 신호는 일 실시 예로 예를 들면, 그래픽 또는 오디오 및/또는 비디오 신호를 들 수 있다.The diagnosis output unit 140 converts the health state and health change state information diagnosed by the voice diagnosis unit 130 into a predetermined signal and outputs the signal to a speaker or a monitor. In this case, the predetermined signal may be, for example, a graphic or audio and / or video signal.

도 2는 도 1의 음성 감지부(110)의 일 실시 예 이다.2 is a diagram illustrating an example of the voice detector 110 of FIG. 1.

도 2의 음성 감지부(110)는 아날로그-디지털 변환부(112) 및 음성 판별부(114)를 구비한다.The voice detector 110 of FIG. 2 includes an analog-digital converter 112 and a voice discriminator 114.

아날로그-디지털 변환부(112)는 입력되는 아날로그 신호를 디지털 신호로 변환한다.The analog-digital converter 112 converts an input analog signal into a digital signal.

음성 판별부(114)는 아날로그-디지털 변환부(112)에서 변환된 디지털 신호로부터 음성 신호를 판별한다. The voice discriminator 114 determines a voice signal from the digital signal converted by the analog-digital converter 112.

음성 신호를 판별하는 방법의 일 예를 들면, VAD(Voice Activity Detection)나 EPD(End Point Detection)를 들 수 있다. As an example of a method of determining a voice signal, voice activity detection (VAD) or end point detection (EPD) may be mentioned.

도 3은 도 1의 음성 분석부(120)의 일 실시예 이다. 3 is an embodiment of the voice analyzer 120 of FIG. 1.

도 3의 음성 분석부(120)는 건강 상태 선택부(122), 음성 구간 선택부(124), 음성 특징 선택부(126), 음성 특징 추출부(128)를 구비한다. 음성 분석부(120)는 기기에 적용하고자 하는 진단 기능에 따라 각 구성 블록들을 다르게 구성할 수 있고 각 구성 블록들을 선택적으로 적용할 수 있다. The voice analyzer 120 of FIG. 3 includes a health state selector 122, a voice section selector 124, a voice feature selector 126, and a voice feature extractor 128. The voice analyzer 120 may configure each component block differently according to a diagnostic function to be applied to the device, and selectively apply each component block.

건강 상태 선택부(122)는 음성으로부터 다양한 건강 상태를 진단하기 위해 음성과 관련된 진단 가능한 건강 상태 타입을 결정한다. 예를 들면 건강 상태 타입은 후두암 또는 파킨슨병이 될 수 있다.The health state selector 122 determines a diagnoseable health state type associated with the voice to diagnose various health conditions from the voice. For example, the medical condition type may be laryngeal cancer or Parkinson's disease.

음성 구간 선택부(124)는 건강 상태 선택부(122)에서 선택된 건강 상태 타입에 따라 사용자의 음성 구간을 선택한다. 음성 구간 선택부(124)는 선택된 건강 상태 타입에 따라 사용자의 음성 구간을 선별적으로 선택할 수 있다. 예를 들면, 후두암이 선택되었을 경우 사용자 음성의 모음 부분만이 선택되고, 파킨슨병이 선택되었을 경우 사용자 음성의 문장 전체가 선택된다.The voice section selector 124 selects a voice section of the user according to the health condition type selected by the health condition selector 122. The voice section selector 124 may selectively select a voice section of the user according to the selected health state type. For example, when laryngeal cancer is selected, only the vowel portion of the user's voice is selected, and when Parkinson's disease is selected, the entire sentence of the user's voice is selected.

음성 특징 선택부(126)는 건강 상태 선택부(122)에서 선택된 건강 상태 타입에 해당하는 음성 특징을 선택한다. The voice feature selector 126 selects a voice feature corresponding to the health state type selected by the health state selector 122.

예를 들면, 정상 음성과 병적 음성을 구분하기 위한 음성 특징의 일 실시 예로써, JITT(Jitter), SHIMM(Shimmer), HNR(Harmonic-to-Noise Ratio), MFCC (Mel-Frequency Cepstral Coefficient)를 들 수 있으나, 이에 한정하지 않는다. For example, as an example of speech features for distinguishing normal speech from pathological speech, JITT (Jitter), SHIMM (Shimmer), HNR (Harmonic-to-Noise Ratio), and MFCC (Mel-Frequency Cepstral Coefficient) But it is not limited thereto.

여기서, JITT 는 피치(pitch)의 평균 변화량을 나타내며, SHIMM는 앰플리튜드(amplitude)의 평균 변화량을 나타내며, HNR은 Harmonic 과 Noise의 비율을 나타내며, MFCC 는 사운드 모델링을 위한 수학적 계수를 나타낸다.Here, JITT represents an average change in pitch, SHIMM represents an average change in amplitude, HNR represents a ratio of Harmonic and Noise, and MFCC represents a mathematical coefficient for sound modeling.

통상적으로 병적 음성의 피치의 평균 변화량은 정상 음성 피치의 평균 변화량보다 커진다. Usually, the average change amount of pitch of pathological speech becomes larger than the average change amount of normal speech pitch.

통상적으로 병적 음성의 앰플리튜드의 평균 변화량은 정상 음성 앰플리튜드의 평균 변화량보다 커진다. Typically, the average change in the amplitude of pathological speech is greater than the average change in the normal speech amplifier.

통상적으로 병적 음성의 하모닉스 성분 대비 노이즈 성분의 비율은 정상 음성의 하모닉스 성분 대비 노이즈 성분의 비율 보다 커진다. Typically, the ratio of the noise component to the harmonic component of the pathological speech is greater than the ratio of the noise component to the harmonic component of the normal speech.

일 실시 예로써 후두암이 선택되었을 경우 병적 음성을 구분하기 위한 음성 특징은 JITT(jitter) 및 SHIM(Shimmer)가 선정될 수 있다. As an example, when laryngeal cancer is selected, JITT (jitter) and SHIM (Shimmer) may be selected as voice features for distinguishing pathological voice.

음성 특징 추출부(128)는 사용자의 음성으로부터 음성 특징 선택부(126)에서 선택된 JITT, SHIMM, HNR, MFCC와 같은 음성 특징을 추출한다. 음성 특징 추출부(128)는 일반적인 소프트웨어 또는 MFC(Mel-Frequency Cepstral)를 사용하여 음성 특징을 추출할 수 있다. The speech feature extractor 128 extracts speech features such as JITT, SHIMM, HNR, and MFCC selected by the speech feature selector 126 from the user's speech. The speech feature extractor 128 may extract speech features using general software or Mel-Frequency Cepstral (MFC).

이때 음성 특징 추출부(128)는 정상 음성과 병적 음성을 구분하기 위해 정상 음성 데이터 베이스와 병적 음성 데이터 베이스를 구비한다.In this case, the voice feature extractor 128 includes a normal voice database and a pathological voice database to distinguish between the normal voice and the pathological voice.

도 4는 도 1의 음성 진단부(130)의 일 실시예 이다.FIG. 4 is an embodiment of the voice diagnosis unit 130 of FIG. 1.

도 4의 음성 진단부(130)는 비교부(132), 진단 처리부(134), 저장부(136), 모니터링부(138)를 구비한다.The voice diagnosis unit 130 of FIG. 4 includes a comparison unit 132, a diagnosis processing unit 134, a storage unit 136, and a monitoring unit 138.

비교부(132)는 음성 분석부(120)에서 추출된 음성 특징과 병적 진단 기준을 비교한다. 이때 병적 진단 기준은 진단 전문가가 객관적 음성 특징을 통해 신체 장기의 건강 상태를 판단하는 기준을 말한다. 이때 정상 음성과 병적 음성을 구분할 수 있는 진단 기준은 GMM(Gaussian Mixture Model)/HMM(Hidden Markov Model), neural network 등을 통해 학습할 수 있다. The comparison unit 132 compares the voice feature extracted from the voice analyzer 120 with the pathological diagnosis criteria. At this time, the pathological diagnosis standard refers to a standard for determining a health state of body organs through an objective voice feature. In this case, a diagnostic criterion for distinguishing a normal voice from a pathological voice may be learned through a Gaussian Mixture Model (GMM) / Hidden Markov Model (HMM) or a neural network.

모니터링부(138)는 시간에 따른 사용자의 음성 변화를 감지한다. The monitoring unit 138 detects a user's voice change over time.

진단 처리부(134)는 비교부(132)의 비교 결과 및 모니터링부(138)의 사용자 음성 변화에 따른 신체 장기의 건강 상태를 진단한다. The diagnosis processor 134 diagnoses the health state of the body organs according to the comparison result of the comparator 132 and the user voice change of the monitoring unit 138.

저장부(136)는 진단 처리부(134)에서 진단된 진단 결과를 저장하고 진단 결과를 갱신한다. 저장부(136)는, 예를 들면 하드디스크 등의 자기 기록매체나 EEPROM, 플래시 메모리 등의 불휘발성 메모리를 들 수 있지만, 상기에 한정되지는 않는다. The storage unit 136 stores the diagnosis result diagnosed by the diagnosis processing unit 134 and updates the diagnosis result. The storage unit 136 may be, for example, a magnetic recording medium such as a hard disk, or a nonvolatile memory such as an EEPROM or a flash memory, but is not limited thereto.

도 5는 도 1의 진단 출력부(140)의 일 실시예 이다.5 is an embodiment of the diagnostic output unit 140 of FIG. 1.

도 5의 진단 출력부(140)는 디지털 신호 처리부(142), 디스플레이부(144), 스피커(146)를 구비한다.The diagnostic output unit 140 of FIG. 5 includes a digital signal processing unit 142, a display unit 144, and a speaker 146.

디지털 신호 처리부(142)는 음성 진단부(130)에서 진단된 건강 상태 및 건강 변화 상태 정보를 오디오 및 비디오 신호로 변환한다.The digital signal processor 142 converts the health state and health change state information diagnosed by the voice diagnosis unit 130 into audio and video signals.

디스플레이부(144)는 디지털 신호 처리부(142)에서 처리된 비디오 신호를 그래픽이나 비디오로 표시한다.The display unit 144 displays the video signal processed by the digital signal processor 142 as a graphic or video.

스피커부(146)는 디지털 신호 처리부(142)에서 처리된 오디오 신호를 소리로 출력한다.The speaker unit 146 outputs the audio signal processed by the digital signal processor 142 as sound.

도 6은 본 발명의 일 실시 예에 따른 건강 상태 진단 방법의 전체 흐름도 이다. 6 is a flowchart illustrating a method of diagnosing a state of health according to an embodiment of the present invention.

먼저, 녹음 또는 통화를 통해 입력되는 사용자의 음성을 감지하고 수집한다(610 과정).First, a voice of a user input through recording or a call is detected and collected (step 610).

이어서, 수집된 사용자 음성으로부터 건강 상태 종류별로 음성 특징을 추출한다(620 과정).Next, a voice feature is extracted for each type of health state from the collected user voices (620).

이어서, 추출된 음성 특징과 병적 상태 기준을 비교하여 건강 상태를 진단하고 건강 변화 상태를 모니터링 한다(630 과정).Subsequently, the health condition is diagnosed and the health change state is monitored by comparing the extracted negative features with the pathological criteria (step 630).

이어서, 진단된 건강 상태 및 건강 변화 상태 정보를 정해진 형태의 알림 신호로 출력한다(640 과정).In operation 640, the diagnosed health state and health change state information are output as a notification signal of a predetermined form.

결국, 본 발명의 일 실시예에 따르면 전자 기기에서 음성으로 도출할 수 있는 신체 장기의 건강 상태를 진단하고 사용자에게 알림으로써 신체 장기의 건강 상태에 대한 조기 진단을 수행할 수 있다. As a result, according to an exemplary embodiment of the present invention, an early diagnosis of the health state of the body organs may be performed by diagnosing and informing the user of the health state of the body organs that can be derived by voice from the electronic device.

먼저, 녹음이나 통화를 통해 사용자의 음성이 입력되는 가를 체크한다(710 과정).First, it is checked whether a user's voice is input through recording or a call (step 710).

이어서, 사용자의 음성이 입력되면 입력 음성으로부터 건강 상태를 확인하기 위한 음성 특징을 추출한다(720 과정).Subsequently, when a user's voice is input, a voice feature for checking a state of health is extracted from the input voice (S720).

이어서, 추출된 음성 특징과 병적 상태 기준을 비교하여 음성 진단을 수행한다(730 과정).Next, a voice diagnosis is performed by comparing the extracted voice features with the pathological condition criteria (step 730).

이어서, 지속적인 건강 상태의 관리를 위해 음성 진단 결과 및 진단 시간 정보를 저장한다(740 과정). 여기서, 진단 시간 정보는 일 실시 예로 진단 날짜, 진단 히스토리(history)를 들 수 있으나 이에 한정하지 않는다. In operation 740, the voice diagnosis result and the diagnosis time information are stored for the continuous management of the medical condition. Here, the diagnostic time information may include, but is not limited to, a diagnostic date and a diagnostic history.

이때, 사용자가 건강 상태 진단을 원해서 진단 기능이 수행된 경우를 1 회 성 진단으로 설정한다. At this time, the case where the user wants to diagnose the health state is set as a one-time diagnosis.

이어서, 건강 상태 진단이 1 회 성 진단인가를 인가를 체크한다(750 과정). Then, it is checked whether the medical condition diagnosis is a one-time diagnosis (step 750).

이어서, 건강 상태 진단이 1회 성 진단이면 사용자에게 진단 결과를 알리고 진단 결과에 적합한 치료를 권고한다(770 과정). Subsequently, if the medical condition diagnosis is a one-time diagnosis, the user is notified of the diagnosis result and recommended treatment appropriate for the diagnosis result (step 770).

그러나, 건강 상태 진단이 1회 성 진단이 아니면 지속적으로 사용자의 음성을 통한 건강 상태의 모니터링을 수행한다. 이때, 시간에 따른 사용자의 건강 상태에 변화가 발생하였는가를 지속적으로 체크한다(760 과정).However, if the medical condition diagnosis is not a one-time diagnosis, the health condition is continuously monitored through the voice of the user. In this case, the controller continuously checks whether a change has occurred in the state of health of the user over time (step 760).

이어서, 시간에 따른 사용자의 건강 상태에 변화가 발생하였으면 사용자에게 진단 결과를 알리고 진단 결과에 적합한 치료를 권고한다(770 과정).
Subsequently, if the user's health condition changes over time, the user is notified of the diagnosis result, and treatment recommended for the diagnosis result is recommended (step 770).

도 8은 본 발명의 다른 실시 예에 따른 건강 상태 진단 방법의 상세 흐름도이다.8 is a detailed flowchart of a medical condition diagnosis method according to another embodiment of the present invention.

먼저, 녹음이나 통화를 통해 사용자의 음성이 입력되는 가를 체크한다(810 과정). First, it is checked whether a voice of a user is input through recording or a call (step 810).

이어서, 사용자의 음성이 입력되면 입력 음성으로부터 건강 상태를 확인하기 위한 음성 특징을 추출한다(820 과정).Subsequently, when a user's voice is input, a voice feature for checking a state of health is extracted from the input voice (S820).

이어서, 추출된 음성 특징과 병적 상태 기준을 비교하여 음성 진단을 수행한다(830 과정).Subsequently, a voice diagnosis is performed by comparing the extracted voice features with the pathological condition criteria (step 830).

이어서, 지속적인 건강 상태의 관리를 위해 음성 진단 결과 및 진단 시간 정보를 저장한다(840 과정). 여기서, 진단 시간 정보는 일 실시 예로 진단 날짜, 진단 히스토리(history)를 들 수 있으나 이에 한정하지 않는다. In operation 840, the voice diagnosis result and the diagnosis time information are stored for the continuous management of the health condition. Here, the diagnostic time information may include, but is not limited to, a diagnostic date and a diagnostic history.

이어서, 건강 상태 진단이 1 회 성 진단인가를 인가를 체크한다(850 과정). Then, it is checked whether the medical condition diagnosis is a one-time diagnosis (step 850).

이어서, 건강 상태 진단이 1회 성 진단이면 사용자에게 진단 결과를 알리고 진단 결과에 적합한 치료를 권고한다(852 과정). Subsequently, if the medical condition diagnosis is a one-time diagnosis, the user is notified of the diagnosis result and recommended treatment appropriate for the diagnosis result (step 852).

그러나, 건강 상태 진단이 1회 성 진단이 아니면 지속적으로 사용자의 음성을 통한 건강 상태의 모니터링을 수행함으로써 시간에 따른 사용자의 건강 상태 변화를 분석한다(860 과정).However, if the medical condition diagnosis is not a one-time diagnosis, the health condition of the user is analyzed over time by continuously monitoring the health condition through the user's voice (step 860).

이어서, 시간에 따른 사용자의 건강 상태 변화에 따라 권고 메시지를 다르게 나타낸다. Subsequently, the recommendation message is displayed differently according to the change in the health state of the user over time.

예를 들면, 사용자의 건강 상태 변화가 1 - 2일 경과되면 사용자에게 "무리하지 마세요"란 메시지를 알리고(862 과정), 사용자의 건강 상태 변화가 3일 이상 경과되면 단기성 질병으로 판단하여 사용자에게 병원 진찰을 권고하는 메시지를 알리고(864 과정), 사용자의 건강 상태 변화가 2주 이상 경과되면 병의 악화를 판단하여 사용자에게 병원 진찰을 적극 권고하는 메시지를 알린다(866 과정).For example, if the change in the user's state of health passes 1 to 2 days, the user is notified of the message, "Do not oversee" (step 862). Notify the hospital recommendation message (step 864), and when the user's health condition changes more than two weeks, determine the worsening of the disease and notify the user actively recommending the hospital (step 866).

이어서, 진단 결과에 의해 사용자의 건강 상태가 호전되었나를 체크한다(870 과정). Then, it is checked whether the health state of the user is improved by the diagnosis result (step 870).

이어서, 사용자의 건강 상태가 호전되었으면 호전 상태를 사용자에게 알린다(880 과정).
Subsequently, if the user's health condition is improved, the user is notified of the improved state (S880).

도 9a는 도 6의 병적 음성과 정상 음성을 위한 진단 기준을 도출하는 일 실시 예이다.FIG. 9A illustrates an example of deriving diagnostic criteria for pathological and normal voices of FIG. 6.

먼저, 병적 음성 데이터 베이스 및 정상 음성 데이터 베이스를 구축한다.First, a pathological speech database and a normal speech database are constructed.

이어서, 병적 음성 데이터 베이스 및 정상 음성 데이터 베이스에서 병적 음성 특징 및 정상 음성 특징을 추출한다(910 과정 및 920 과정).Then, the pathological speech feature and the normal speech feature are extracted from the pathological speech database and the normal speech database (steps 910 and 920).

이어서, 널리 알려진 분석 모델링을 이용하여 병적 음성과 정상 음성 특징에 대한 비교 트레이닝을 수행한다(930 과정). Next, comparative training is performed on pathological speech and normal speech features using well-known analytical modeling (step 930).

이어서, 비교 트레이닝을 통해 병적 음성과 정상 음성을 위한 진단 기준을 도출한다(940 과정).Then, the comparison training derives the diagnostic criteria for the pathological voice and the normal voice (step 940).

도 9b는 GMM(Gaussian Mixture Model)을 이용하여 2개의 가우시안(gaussian)으로 모델링한 GMM 파라메터의 진단 기준을 도시한 것이다. FIG. 9B illustrates diagnostic criteria of GMM parameters modeled using two Gaussian models using a Gaussian Mixture Model (GMM).

도 9b를 참조하면, 제1, 제2 가우시안(gaussian)에 의해 모델링된 GMM 파라메터들(shimmer, MFCC2, MFCC3)은 정상 음성과 병적 음성에 대한 진단의 기준으로 나타낸다. Referring to FIG. 9B, the GMM parameters modeled by the first and second Gaussian (shimmer, MFCC2, MFCC3) are shown as a standard of diagnosis for normal voice and pathological voice.

도 10은 본 발명에 따른 진단 기준을 이용한 자동 진단 방법의 흐름도 이다. 10 is a flowchart of an automatic diagnostic method using diagnostic criteria according to the present invention.

먼저, 녹음이나 통화를 통해 사용자의 음성을 입력한다(1010 과정).First, a voice of a user is input through recording or a call (step 1010).

이어서, 입력된 음성으로부터 후두 상태에 대한 음성 특징을 추출한다(1020 과정).Subsequently, a voice feature for a laryngeal state is extracted from the input voice (step 1020).

이어서, 추출된 음성 특징과 진단 기준을 비교하여(1030 과정) 후두 상태에 대한 자동 진단을 수행한다(1040 과정).Subsequently, an automatic diagnosis of a laryngeal state is performed by comparing the extracted voice feature with a diagnosis criterion (step 1030).

이어서, 사용자에게 진단 결과를 안내한다(1050 과정). Subsequently, the diagnosis result is notified to the user (step 1050).

도 11은 본 발명의 다른 실시 예에 따른 핸드폰을 이용한 건강 상태 진단 방법의 흐름도 이다. 11 is a flowchart illustrating a medical condition diagnosis method using a mobile phone according to another embodiment of the present invention.

먼저, 핸드폰을 이용해 사용자의 통화를 감지한다(1110 과정).First, a user's call is detected using a mobile phone (step 1110).

이어서, 사용자의 통화 음성을 감지한다(1120 과정). In operation 1120, the user's voice is detected.

이어서, 사용자 통화 음성을 실시간으로 분석하여 사용자 음성으로부터 진단을 위한 음성 특징을 추출한다(1130 과정).Subsequently, the voice feature for diagnosis is extracted from the user voice by analyzing the user call voice in real time (step 1130).

이어서, 추출된 음성 특징을 이용하여 건강 상태를 추출한다(1140 과정).Subsequently, the health state is extracted using the extracted voice feature (step 1140).

이어서, 사용자에게 진단 결과를 안내한다(1150 과정).Subsequently, the diagnosis result is notified to the user (step 1150).

결국, 본 발명의 일 실시 예에 따르면 핸드폰 통화시 사용자는 통화 음성을 이용하여 자동으로 건강 상태를 진단할 수 있다. 또한 사용자는 통화 시마다 진단 결과를 저장하고 확인할 수 있다.
As a result, according to an exemplary embodiment of the present invention, the user may automatically diagnose a health state by using a voice call during a mobile phone call. In addition, the user can save and check the diagnosis result for each call.

한편, 상술한 본 발명의 실시 예들은 컴퓨터에서 실행될 수 있는 프로그램으로 작성 가능하고, 컴퓨터로 읽을 수 있는 기록매체를 이용하여 상기 프로그램을 동작시키는 범용 디지털 컴퓨터에서 구현될 수 있다. 상기 컴퓨터로 읽을 수 있는 기록매체는 마그네틱 저장매체(예를 들면, 롬, 플로피 디스크, 하드디스크 등), 광학적 판독 매체(예를 들면, 시디롬, 디브이디 등)와 같은 저장 매체를 포함한다.Meanwhile, the above-described embodiments of the present invention can be written as a program that can be executed in a computer, and can be implemented in a general-purpose digital computer that operates the program using a computer-readable recording medium. The computer-readable recording medium may include a storage medium such as a magnetic storage medium (eg, a ROM, a floppy disk, a hard disk, etc.), an optical reading medium (eg, a CD-ROM, a DVD, etc.).

이제까지 본 발명에 대하여 그 바람직한 실시 예들을 중심으로 살펴보았다. 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자는 본 발명이 본 발명의 본질적인 특성에서 벗어나지 않는 범위에서 변형된 형태로 구현될 수 있음을 이해할 수 있을 것이다. 그러므로 개시된 실시 예들은 한정적인 관점이 아니라 설명적인 관점에서 고려되어야 한다. 본 발명의 범위는 전술한 설명이 아니라 특허 청구 범위에 나타나 있으며, 그와 동등한 범위 내에 있는 모든 차이점은 본 발명에 포함된 것으로 해석되어야 할 것이다.So far I looked at the center of the preferred embodiment for the present invention. It will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims. Therefore, the disclosed embodiments should be considered in an illustrative rather than a restrictive sense. The scope of the present invention is shown in the claims rather than the foregoing description, and all differences within the scope will be construed as being included in the present invention.

Claims

A voice detector for detecting a voice by monitoring the voice;
A voice analyzer extracting a voice feature for each type of health condition to be diagnosed from the voice detected by the voice detector;
A voice diagnosis unit for diagnosing a state of health and monitoring a state of health change by comparing a voice feature extracted from the voice analyzer with a pathological state criterion;
And a diagnostic output unit configured to output the health state and health change state information diagnosed by the voice diagnosis unit as a predetermined signal.

The apparatus of claim 1, wherein the voice detector monitors a voice change state of the user.

The apparatus of claim 1, wherein the voice detector comprises: an analog to digital converter configured to convert an input analog signal into a digital signal; And
And a voice discriminating unit for discriminating a voice from the digital signal converted by the analog-digital converting unit.

The method of claim 1, wherein the voice analysis unit
A health state selector configured to determine a health state type with respect to the voice detected by the voice detector;
A voice section selector configured to select a voice section for diagnosis according to the health condition type selected by the health condition selector;
A voice feature selector which selects a voice feature corresponding to the health state type selected by the health state selector; And
And a voice feature extractor configured to extract the voice feature from a user voice input according to the voice feature selected by the feature selector.

The apparatus of claim 4, wherein the speech feature extractor comprises a normal speech database and a pathological speech database to distinguish between normal speech and pathological speech.

The method of claim 1, wherein the voice diagnostic unit
A comparison unit comparing the voice feature extracted from the voice analysis unit with a pathological diagnosis criterion;
A monitoring unit for detecting a change in the voice state of the user over time; And
And a diagnosis processing unit for diagnosing a state of health of body organs according to a comparison result of the comparing unit and a change in voice of the monitoring unit.

The apparatus of claim 6, further comprising a storage unit configured to store and update the diagnosis result and diagnosis time information diagnosed by the diagnosis processing unit.

The method of claim 1, wherein the diagnostic output unit
A digital signal processor converting the health state and health change state information diagnosed by the voice diagnosis unit into audio and video signals;
A display unit for outputting a video signal processed by the digital signal processor; And
And a speaker unit configured to output an audio signal processed by the digital signal processor.

Detecting and collecting voice;
Extracting voice features from the collected voices according to types of diagnostic health conditions;
Diagnosing a health condition and monitoring a health change state by comparing the voice characteristic with a pathological condition criterion;
And a method of outputting the diagnosed health state and health change state information as a notification signal of a predetermined form.

The method of claim 9, wherein the collecting of the voice is performed.
Converting the input voice into a digital voice signal;
And a step of discriminating the voice from the converted digital voice signal.

The method of claim 9, wherein the extracting the health state characteristic
Determining a health state type for the collected voices;
Selecting a voice section for diagnosis according to the determined health state type;
Selecting a voice characteristic corresponding to the selected health state type;
And extracting the speech feature from the speech section according to the selected speech feature.

The method of claim 9, wherein the medical condition diagnosis process
Analyzing a change in the health status of the user over time;
And a method of notifying the recommendation message differently according to the change of the health state of the user over time.

The method of claim 9, wherein the diagnosing the state of health and monitoring the state of health change,
Comparing the extracted voice features with pathological diagnostic criteria;
Detecting a change in a user's voice over time;
And a method of diagnosing a state of health of body organs according to the comparison result and the change of voice.

The method of claim 13, further comprising collecting, storing, and updating the diagnosed diagnosis result and diagnosis time information.

The method of claim 13, further comprising deriving diagnostic criteria for pathological voice and normal voice to compare the extracted voice features with pathological diagnosis criteria.

The method of claim 15, wherein the derivation of diagnostic criteria for pathological and normal voice comprises:
Constructing a pathological speech database and a normal speech database;
Extracting pathological speech features and normal speech features from the pathological speech database and the normal speech database;
Performing comparative training on the pathological and normal speech features;
A method of diagnosing a health condition using a voice, the method comprising deriving diagnostic criteria for pathological voices and normal voices according to the comparative training.

The method of claim 9, wherein the output process
Converting the diagnosed health state and health change state information into audio and video signals;
And a step of outputting the converted audio signal and the video signal.

A computer-readable recording medium having recorded thereon a program for implementing the method of claim 9.