TWI622980B - Disease detecting and classifying system of voice - Google Patents

Disease detecting and classifying system of voice Download PDF

Info

Publication number
TWI622980B
TWI622980B TW106130351A TW106130351A TWI622980B TW I622980 B TWI622980 B TW I622980B TW 106130351 A TW106130351 A TW 106130351A TW 106130351 A TW106130351 A TW 106130351A TW I622980 B TWI622980 B TW I622980B
Authority
TW
Taiwan
Prior art keywords
data
voice
medical history
module
comparison
Prior art date
Application number
TW106130351A
Other languages
Chinese (zh)
Other versions
TW201913649A (en
Inventor
王棨德
Original Assignee
醫療財團法人徐元智先生醫藥基金會亞東紀念醫院
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 醫療財團法人徐元智先生醫藥基金會亞東紀念醫院 filed Critical 醫療財團法人徐元智先生醫藥基金會亞東紀念醫院
Priority to TW106130351A priority Critical patent/TWI622980B/en
Application granted granted Critical
Publication of TWI622980B publication Critical patent/TWI622980B/en
Publication of TW201913649A publication Critical patent/TW201913649A/en

Links

Landscapes

  • Measuring And Recording Apparatus For Diagnosis (AREA)
  • Medical Treatment And Welfare Office Work (AREA)

Abstract

本發明嗓音疾病偵測分類系統具有應用程式、嗓音資料庫、病史資料庫以及伺服終端,該應用程式能將一受測病史資料以及一受測嗓音資料傳送至伺服終端,而嗓音資料庫與病史資料庫分別能提供複數個比對嗓音資料以及複數個比對病史資料至伺服終端,使得伺服終端依據受測病史資料與受測嗓音資料分別對複數個比對病史資料與複數個比對嗓音資料進行分類篩選以共同形成一診斷資料,藉此,任何人都能透過診斷資料了解自己的嗓音狀況,進而可以得知聲帶是否健康,並能再進一步判斷是否該就醫治療。 The voice disease detection and classification system of the invention has an application program, a voice database, a medical history database and a servo terminal, and the application can transmit a measured medical history data and a measured voice data to a servo terminal, and the voice database and medical history The database can provide a plurality of comparison voice data and a plurality of comparison medical history data to the servo terminal, so that the servo terminal compares the medical history data and the plurality of comparison voice data according to the measured medical history data and the measured voice data respectively. Sorting and screening to form a diagnostic data, so that anyone can understand the voice status through the diagnostic data, and then can know whether the vocal cord is healthy, and can further determine whether it should be treated.

Description

嗓音疾病偵測分類系統 Voice disease detection classification system

本發明有關於一種檢測嗓音狀態的嗓音疾病偵測分類系統,特別是一種任何人能自行檢測嗓音狀態的嗓音疾病偵測分類系統。 The invention relates to a voice disease detection and classification system for detecting a voice state, in particular to a voice disease detection and classification system capable of detecting a voice state by anyone.

隨著時代的演進,醫療技術也日漸進步,進而已有研究指出患者喉嚨的健康出現狀況確實會影響患者的聲音,其中,目前常見影響患者嗓音的病理包含:結構性病變(例如:聲帶結節、息肉和囊腫)、腫瘤和神經源性疾病(例如:聲帶麻痺和痙攣性發音障礙)。 With the evolution of the times, medical technology is also gradually improving. Further studies have pointed out that the health of the patient's throat does affect the patient's voice. Among them, the pathology that currently affects the patient's voice includes: structural lesions (eg, vocal cord nodules, Polyps and cysts), tumors and neurogenic diseases (eg vocal cord paralysis and spastic dysphonia).

當患者因為聲帶出現病理現象而影響聲音時,患者通常會去醫院找醫師尋求治療方法,並且,目前需要透過耳鼻喉專科醫師使用喉內視鏡來檢測患者的喉嚨狀況,進而加以判斷患者得到哪種病症,然而,使用內視鏡進行檢測時,需要將喉內視鏡的管體伸入患者的喉嚨裡,進而患者會因為喉嚨有異物而感到不舒服,並且,患者透過喉內視鏡進行檢測往往還必須花費一筆醫療費用。 When a patient influences the sound due to pathological phenomena in the vocal cords, the patient usually goes to the hospital to seek a doctor for treatment. At present, it is necessary to use the laryngoscope to detect the throat condition of the patient through the otolaryngology specialist, and then to determine which patient the patient receives. A condition, however, when using an endoscope to detect, the tube of the laryngoscope needs to be inserted into the patient's throat, and the patient may feel uncomfortable because of a foreign body in the throat, and the patient passes through the laryngoscope. Testing often also costs a medical bill.

此外,當患者的聲音因為疾病而產生變化時,並非所有的患者都會立即前往醫院進行就醫,而有些許患者會等待疾病自行痊癒,進而患者不但無法自行檢測疾病是否痊癒,甚至可能延誤就醫而導致病情嚴重,進而需要多時間與金錢來治療。 In addition, when the patient's voice changes due to illness, not all patients will go to the hospital immediately for medical treatment, and some patients will wait for the disease to heal itself, so that the patient can not only detect whether the disease has healed or even delay the medical treatment. The condition is serious and it takes more time and money to treat.

藉此,當今患者不但無法自行檢測喉嚨的健康狀態來達到預防的效果,甚至當喉嚨的健康出現問題時必須需要花費時間與金錢至醫院進行檢測,並且還必須忍受檢測時的不舒服,進而當今檢測喉嚨的健康狀態仍有許多不方便之處。 In this way, today's patients can not only detect the health of the throat to achieve the preventive effect, even when the health of the throat is problematic, it must take time and money to the hospital for testing, and must also endure the uncomfortable detection, and nowadays There are still many inconveniences in detecting the health of the throat.

本發明的主要目的在於任何人可以隨時檢測自己的嗓音狀況,進而能即時得知聲帶的健康狀況,若得知聲帶的健康狀況出現問題,則可以立即就醫治療,以提高治癒的機會。 The main purpose of the present invention is that anyone can detect their own voice condition at any time, and thus can immediately know the health status of the vocal cords. If the health condition of the vocal cords is known to be problematic, the medical treatment can be immediately sought to improve the chance of cure.

為達上述目的,本發明嗓音疾病偵測分類系統主要由一病史資料庫、一嗓音資料庫、一應用程式以及一伺服終端所構成。 To achieve the above objective, the voice disease detection and classification system of the present invention mainly comprises a medical history database, a voice database, an application program and a servo terminal.

上述病史資料庫用以提供複數個比對病史資料,而上述嗓音資料庫用以儲存複數個原始嗓音資料,並設有一第一提取模組,其中,上述第一提取模組用以將上述複數個原始嗓音資料提取形成複數個對應於上述比對病史資料的比對嗓音資料,並且,上述嗓音資料庫進一步具有一標籤模組,上述標籤模組能對每一個上述比對嗓音資料標註至少一種病理,而上述標籤模組能標註有結節、息肉、囊腫、腫瘤、癌症、萎縮、震顫、麻痺與聲帶溝其中一種聲帶病理。 The medical history database is configured to provide a plurality of comparison medical history data, and the sound data database is configured to store a plurality of original voice data, and a first extraction module is provided, wherein the first extraction module is configured to The original voice data is extracted to form a plurality of comparative voice data corresponding to the comparison medical history data, and the voice database further has a label module, and the label module can mark at least one of each of the compared voice data Pathology, and the above label module can be labeled with nodules, polyps, cysts, tumors, cancer, atrophy, tremor, paralysis and vocal cords.

上述應用程式可安裝於電子裝置,並電性連接於上述伺服終端,而上述應用程式用以形成一傳送至上述伺服終端的受測病史資料,並能透過使用者的聲音形成一傳送至上述伺服終端的受測嗓音資料,其中,上述應用程式具有一第二提取模組,上述第二提取模組能將一使用者輸入的輸入嗓音資料提取形成上述受測嗓音資料。 The application can be installed on the electronic device and electrically connected to the servo terminal, and the application is used to form a measured medical history data transmitted to the servo terminal, and can be transmitted to the servo through the user's voice. The measured voice data of the terminal, wherein the application has a second extraction module, and the second extraction module can extract the input voice data input by the user to form the measured voice data.

上述伺服終端具有一對應於上述受測病史資料的病史問卷、一比對分類模組以及一電性連接於上述比對分類模組的處理模組,上述比對分類模組用以同時連接於上述病史資料庫與嗓音資料庫,而上述處理模組電性連接於上述應用程式,其中,上述伺服終端接受上述複數個比對病史資料、複數個比對嗓音資料、受測病史資料以及受測嗓音資料,並能將複數個比對病史資料與複數個比對嗓音資料分別依據上述受測病史資料以及受測嗓音資料進行分類篩選以共同形成一診斷資料。 The servo terminal has a medical history questionnaire corresponding to the measured medical history data, a comparison classification module, and a processing module electrically connected to the comparison classification module, wherein the comparison classification module is used for simultaneously connecting to the processing module. The medical history database and the voice database, wherein the processing module is electrically connected to the application program, wherein the servo terminal receives the plurality of comparison medical history data, a plurality of comparative voice data, the measured medical history data, and the measured The voice data can be combined and screened according to the above-mentioned medical history data and the measured voice data to form a diagnostic data, respectively, according to the plurality of comparative medical history data and the plurality of comparative voice data.

於一較佳實施例中,上述處理模組將上述受測病史資料與受測嗓音資料,使得上述比對分類模組單獨透過上述複數個比對病史資料、上述複數個比對嗓音資料、上述受測病史資料以及上述受測嗓音資料來形成上述診斷資料。 In a preferred embodiment, the processing module compares the measured medical history data with the measured voice data, so that the comparison classification module separately transmits the plurality of comparative medical history data, the plurality of comparative voice data, and the foregoing The medical history data and the above-mentioned measured voice data are used to form the above diagnostic data.

於另一較佳實施例中,上述比對分類模組將上述複數個比對病史資料與上述複數個比對嗓音資料依照不同的病理分類形成複數個分類資料,而上述比對分類模組與處理模組兩者其之一將上述複數個分類資料依據上述受測病史資料以及上述受測嗓音資料來形成上述診斷資料,並且,當形成上述診斷資料時,上述比對分類模組或處理模組會停止針對其餘的分類資料進行篩選比對。 In another preferred embodiment, the comparison classification module forms a plurality of classification data according to different pathological classifications of the plurality of comparison medical history data and the plurality of comparison voice data, and the comparison classification module and One of the processing modules forms the diagnostic data according to the plurality of classified data according to the measured medical history data and the measured voice data, and, when the diagnostic data is formed, the comparison classification module or the processing module The group will stop screening for the remaining classification data.

然而,於前述兩實施例中,上述第一提取模組透過梅爾頻率倒譜係數將上述原始嗓音資料提取形成上述比對嗓音資料,而上述第二提取模組透過梅爾頻率倒譜係數將上述輸入嗓音資料提取形成上述受測嗓音資料,並且,上述比對分類模組選自於監督式機器學習中的其中一種系統模組進行分類,如 決策樹、自適應增強分類、最近鄰居法、支撐向量機、高斯混合模型或深度神經網路的其中一種或多種系統模組進行分類。 However, in the foregoing two embodiments, the first extraction module extracts the original voice data by using a Mel frequency cepstrum coefficient to form the comparison voice data, and the second extraction module transmits the Mel frequency cepstrum coefficient. The input voice data is extracted to form the measured voice data, and the comparison classification module is selected from one of the system modules in the supervised machine learning, such as One or more system modules of the decision tree, adaptive enhanced classification, nearest neighbor method, support vector machine, Gaussian mixture model or deep neural network are classified.

由前述說明可知,本發明的特點在於應用程式能將受測嗓音資料與受測病史資料傳送至伺服終端,使得伺服終端依據接受到的受測嗓音資料與受測病史資料分別對嗓音資料庫與病史資料庫內的資料進行篩選並加以判斷是否有疾病,藉此,只要使用者在電子裝置安裝應用程式,進而任何人可以隨時經由電子裝置輸入自己的受測嗓音資料與受測病史資料來檢測自己的嗓音狀況,進而可以即時得知聲帶的是否健康,若得知聲帶的健康狀況出現問題,則可以立即就醫治療,以提高治癒的機會。 It can be seen from the foregoing description that the application program can transmit the measured voice data and the measured medical history data to the servo terminal, so that the servo terminal respectively according to the received voice data and the measured medical history data to the voice database and The data in the medical history database is screened and judged whether there is a disease, so that as long as the user installs the application on the electronic device, any person can input his or her own measured voice data and the measured medical history data through the electronic device at any time to detect Your own voice status, you can immediately know if the vocal cords are healthy. If you know that there is a problem with the health of the vocal cords, you can seek medical treatment immediately to improve the chance of cure.

1‧‧‧嗓音疾病偵測分類系統 1‧‧‧Voice Disease Detection and Classification System

10‧‧‧伺服終端 10‧‧‧Servo terminal

11‧‧‧比對分類模組 11‧‧‧Comparative classification module

12‧‧‧處理模組 12‧‧‧Processing module

20‧‧‧病史資料庫 20‧‧‧ medical history database

30‧‧‧嗓音資料庫 30‧‧‧Audio database

31‧‧‧第一提取模組 31‧‧‧First extraction module

32‧‧‧標籤模組 32‧‧‧ label module

40‧‧‧應用程式 40‧‧‧Application

41‧‧‧第二提取模組 41‧‧‧Second extraction module

50‧‧‧電子裝置 50‧‧‧Electronic devices

51‧‧‧輸入模組 51‧‧‧Input module

52‧‧‧顯示螢幕 52‧‧‧ Display screen

圖1為本發明嗓音疾病偵測分類系統的模組圖;圖2為本發明嗓音疾病偵測分類系統於第一較佳實施例的使用示意圖;圖3為使用輸入模組的示意圖;以及圖4為本發明嗓音疾病偵測分類系統於第二較佳實施例的使用示意圖。 1 is a block diagram of a voice disease detection and classification system according to the present invention; FIG. 2 is a schematic diagram of the use of the voice disease detection and classification system of the present invention in a first preferred embodiment; FIG. 3 is a schematic diagram of using an input module; 4 is a schematic diagram of the use of the voice sickness detection and classification system of the present invention in the second preferred embodiment.

茲為便於更進一步對本發明之構造、使用及其特徵有更深一層明確、詳實的認識與瞭解,爰舉出較佳實施例,配合圖式詳細說明如下:請參閱圖1所示,於第一較佳實施例中,本發明嗓音疾病偵測分類系統1主要由一伺服終端10、一病史資料庫20、一嗓音資料庫30以及一應用程式40所構成。 In order to facilitate further understanding and understanding of the structure, use and features of the present invention, a preferred embodiment will be described in detail with reference to the drawings as follows: In the preferred embodiment, the voice disease detection and classification system 1 of the present invention is mainly composed of a server terminal 10, a medical history database 20, a voice database 30, and an application 40.

伺服終端10有一比對分類模組11、一處理模組12以及一病史問卷,比對分類模組11設為複數機器學習模組,如決策樹(decision tree)、隨機森林(Random Forest)、自適應增強分類(Adaboost)、最近鄰居法(K Nearest-neighbor)、支撐向量機(Support Vector Machine,簡稱SVM)、高斯混合模型(Gaussian Mixture Model,簡稱GMM)或深度神經網路(Deep Neural Networks,簡稱DNN)其中一種或多種併用,並電性連接於處理模組12,而病史資料庫20內部存有複數個比對病史資料,並電性連接於伺服終端10的比對分類模組11,其中,嗓音資料庫30電性連接於比對分類模組11,並具有一第一提取模組31以及一標籤模組32,而嗓音資料庫30內部存有複數個原始嗓音資料,於此實施例中,每一個比對病史資料有記載該患者因為怎樣的疾病而導致嗓音產生變化,而每一個嗓音資料是由聲波所構成,並且,每一個嗓音資料的聲波不相同於另一個嗓音資料的聲波,其中,一部分嗓音資料則記載該嗓音為正常,而另一部分則記載該嗓音因為患有疾病(例如:結節、息肉、囊腫、腫瘤、癌症、萎縮、震顫、麻痺或聲帶溝)而不正常,並且,應用程式40連接於伺服終端10的處理模組12,並具有一第二提取模組41。 The servo terminal 10 has a comparison classification module 11, a processing module 12, and a medical history questionnaire. The comparison classification module 11 is configured as a plurality of machine learning modules, such as a decision tree and a random forest. Adaptive enhanced classification (Adaboost), nearest neighbor method (K Nearest-neighbor), Support Vector Machine (SVM), Gaussian Mixture Model (GMM) or Deep Neural Networks (Deep Neural Networks) One or more of the DNNs are used in combination, and are electrically connected to the processing module 12, and the medical history database 20 stores a plurality of comparison medical history data, and is electrically connected to the comparison classification module 11 of the servo terminal 10. The voice database 30 is electrically connected to the comparison classification module 11 and has a first extraction module 31 and a label module 32, and the voice database 30 has a plurality of original voice data therein. In the embodiment, each of the medical history data records that the patient has a change in the voice due to the disease, and each of the voice data is composed of sound waves, and each of the voice data The wave is not the same as the sound wave of another voice data, in which part of the voice data records the voice as normal, while the other part records the voice because of the disease (eg, nodules, polyps, cysts, tumors, cancer, atrophy, tremors) The numbness or the vocal folds are not normal, and the application 40 is connected to the processing module 12 of the servo terminal 10 and has a second extraction module 41.

請參閱圖2至圖3所示,於具體應用時,將一電子裝置50安裝應用程式40,使得電子裝置50能提供資料給伺服終端10或是接收由伺服終端10傳來的資料,其中,電子裝置50具有一輸入模組51以及一顯示螢幕52,而輸入模組51與顯示螢幕52共同形成一觸碰螢幕,並且,電子裝置50可設為手機、電腦或平板。 Referring to FIG. 2 to FIG. 3 , an application device 40 is installed on an electronic device 50 in a specific application, so that the electronic device 50 can provide data to the servo terminal 10 or receive data transmitted by the server 10 . The electronic device 50 has an input module 51 and a display screen 52. The input module 51 and the display screen 52 together form a touch screen, and the electronic device 50 can be set as a mobile phone, a computer or a tablet.

接下來,伺服終端10將上述病史問卷傳送至應用程式40,使得電子裝置50的顯示螢幕52顯示上述病史問卷的內容,進而使用者能依據上述病史 問卷而透過輸入模組51作答來產生一受測病史資料,並且,使用者再透過輸入模組51輸入自己的嗓音以形成一輸入嗓音資料,於此實施例中,使用者先行錄製自己嗓音形成一錄音檔,並在將錄音檔輸入至輸入模組51以形成上述輸入嗓音資料,然而,使用者亦可透過內建於電子裝置50的麥克風直接對輸入模組51輸入嗓音來形成上述輸入嗓音資料,其中,受測病史資料包括使用者性別、年齡、個人醫療病史、嗓音相關症狀、生活習慣(如抽菸、喝酒)、嚴重度自我評估、職業類別與用聲狀況、症狀持續時間其中一種或多種相關內容。 Next, the server 10 transmits the medical history questionnaire to the application 40, so that the display screen 52 of the electronic device 50 displays the content of the medical history questionnaire, so that the user can follow the medical history. The questionnaire is used to generate a measured medical history data through the input module 51, and the user inputs his own voice through the input module 51 to form an input voice data. In this embodiment, the user first records the voice formation. a recording file, and inputting the recording file to the input module 51 to form the input voice data. However, the user can directly input the voice into the input module 51 through the microphone built in the electronic device 50 to form the input voice. Information, wherein the medical history data includes user gender, age, personal medical history, voice related symptoms, lifestyle habits (such as smoking, drinking), severity self-assessment, occupational category and voice status, and duration of symptoms Or a variety of related content.

接下來,電子裝置50的輸入模組51會將上述輸入嗓音資料以及上述受測病史資料傳送至應用程式40,應用程式40會將上述受測病史資料傳遞至伺服終端10的處理模組12,而應用程式40的第二提取模組41會透過梅爾頻率倒譜係數(Mel-scale Frequency Cepstral Coefficients,簡稱MFCC)對上述輸入嗓音資料進行提取以形成一傳送至處理模組12的受測嗓音資料,其中,當處理模組12接收到上述受測病史資料與受測嗓音資料傳送至伺服終端10的比對分類模組11。 Next, the input module 51 of the electronic device 50 transmits the input voice data and the measured medical history data to the application 40, and the application 40 transmits the measured medical history data to the processing module 12 of the servo terminal 10. The second extraction module 41 of the application 40 extracts the input voice data through a Mel-scale Frequency Cepstral Coefficients (MFCC) to form a measured voice transmitted to the processing module 12. The data is processed by the processing module 12 to the comparison classification module 11 that transmits the measured medical history data and the measured arpeggio data to the servo terminal 10.

此外,病史資料庫20將上述複數個比對病史資料傳送至伺服終端10的比對分類模組11,並且,嗓音資料庫30的第一提取模組31透過梅爾頻率倒譜係數將上述每一個原始嗓音資料進行提取以形成複數個傳送至比對分類模組11的比對嗓音資料,其中,當形成上述比對嗓音資料時,標籤模組32能對每一個上述比對嗓音資料標註出正常或異常嗓音,並對於異常嗓音可至少標示一種或多種病理,如結節、息肉、囊腫、腫瘤、癌症、萎縮、震顫、麻痺或聲帶溝各類疾病分類。 In addition, the medical history database 20 transmits the plurality of comparison medical history data to the comparison classification module 11 of the servo terminal 10, and the first extraction module 31 of the voice data library 30 transmits the above by the Mel frequency cepstrum coefficient. An original voice data is extracted to form a plurality of comparison voice data transmitted to the comparison classification module 11, wherein when the comparison voice data is formed, the label module 32 can mark each of the comparison voice data Normal or abnormal voice, and at least one or more pathologies such as nodules, polyps, cysts, tumors, cancer, atrophy, tremors, paralysis, or vocal folds can be classified for abnormal voices.

當伺服終端10的比對分類模組11接受到上述受測病史資料、受測嗓音資料複數個比對病史資料以及複數個比對嗓音資料時,比對分類模組11會單獨依據上述受測病史資料與輸入嗓音資料分別對上述複數個比對病史資料與複數個比對嗓音資料進行分類篩選以形成一傳遞至處理模組12的診斷資料,而處理模組12會偵測上述診斷資料的內容以形成一傳遞至顯示螢幕52的顯示資料,使得顯示螢幕52透過上述顯示資料來顯示使用者的嗓音狀態是否健康。 When the comparison classification module 11 of the servo terminal 10 receives the above-mentioned medical history data, the measured arpeggio data, the plurality of medical history data, and the plurality of comparison voice data, the comparison classification module 11 is separately determined according to the above-mentioned test. The medical history data and the input voice data respectively classify and filter the plurality of comparative medical history data and the plurality of comparative voice data to form a diagnostic data transmitted to the processing module 12, and the processing module 12 detects the diagnostic data. The content is formed to display a display material transmitted to the display screen 52, so that the display screen 52 displays whether the user's voice state is healthy through the display data.

於此實施例中,當電子裝置50的顯示螢幕52透過上述顯示資料顯示使用者的喉嚨狀態為健康時,使用者能透過顯示螢幕52而得知自己的嗓音狀態並沒有任何疾病,反之,當處理模組12偵測上述診斷資料而得知使用者的嗓音狀態因為腫瘤或是其他病理而不健康時,顯示螢幕52會透過上述顯示資料顯示使用者的嗓音狀態為不健康以及可能的疾病包括哪些,進而使用者可以盡快就醫治療。 In this embodiment, when the display screen 52 of the electronic device 50 displays that the throat state of the user is healthy through the display data, the user can know that the voice state of the user is clear by the display screen 52, and vice versa. When the processing module 12 detects the diagnostic data and knows that the user's voice state is unhealthy due to a tumor or other pathology, the display screen 52 displays the user's voice state as unhealthy and possible diseases through the display data. In turn, the user can seek medical treatment as soon as possible.

請參閱圖4所示,於第二較佳實施例中,與第一較佳實施例的差別在於上述診斷資料的形成方式,至於伺服終端10、病史資料庫20、嗓音資料庫30以及應用程式40之間的連接關係與第一較佳實施例相同,在此不加以贅述。 Referring to FIG. 4, in the second preferred embodiment, the difference from the first preferred embodiment lies in the manner in which the diagnostic data is formed. As for the servo terminal 10, the medical history database 20, the voice database 30, and the application program. The connection relationship between 40 is the same as that of the first preferred embodiment, and will not be described herein.

當伺服終端10的處理模組12接收到上述受測病史資料與受測嗓音資料時,此實施例的處理模組12並不會如同第一較佳實施例的處理模組12會將上述受測病史資料與受測嗓音資料傳送至伺服終端10的比對分類模組11。 When the processing module 12 of the servo terminal 10 receives the measured medical history data and the measured voice data, the processing module 12 of this embodiment does not receive the above-mentioned receiving module 12 as in the first preferred embodiment. The medical history data and the measured voice data are transmitted to the comparison sorting module 11 of the servo terminal 10.

當伺服終端10的比對分類模組11接收到複數個比對病史資料與複數個比對嗓音資料,比對分類模組11依據結節、息肉、囊腫、腫瘤、癌症、萎縮、震顫、麻痺與聲帶溝八種病理將複數個比對病史資料與複數個比對嗓音資料分類形成八個分類資料,隨後,比對分類模組11依序將八個分類資料傳送至 伺服終端10的處理模組12,使得處理模組12將上述分類資料依據上述受測病史資料以及受測嗓音資料進行分類篩選以共同形成上述診斷資料,其中,比對分類模組11可能會將單一比對病史資料與單一比對嗓音資料同時分類在不同的分類資料中,於較佳實施例中,當形成上述診斷資料時,處理模組12會停止進行針對其餘的分類資料進行篩選比對。 When the comparison classification module 11 of the servo terminal 10 receives a plurality of comparison medical history data and a plurality of comparison voice data, the comparison classification module 11 is based on nodules, polyps, cysts, tumors, cancer, atrophy, tremors, paralysis and paralysis. Eight pathological records of vocal folds are classified into eight categories of data by comparing multiple medical history data with a plurality of comparative voice data. Subsequently, the comparison classification module 11 sequentially transmits eight classified data to The processing module 12 of the servo terminal 10 causes the processing module 12 to classify and filter the classified data according to the measured medical history data and the measured voice data to jointly form the diagnostic data, wherein the comparison classification module 11 may The single alignment history data and the single comparison voice data are simultaneously classified in different classification data. In the preferred embodiment, when the diagnosis data is formed, the processing module 12 stops performing screening comparison for the remaining classification data. .

於此實施例中,處理模組12將上述分類資料依據上述受測病史資料以及受測嗓音資料進行分類篩選以共同形成上述診斷資料僅方便說明之用,亦可由比對分類模組11將上述分類資料依據上述受測病史資料以及受測嗓音資料進行分類篩選以共同形成上述診斷資料。 In this embodiment, the processing module 12 classifies and screens the classified data according to the measured medical history data and the measured voice data to form the diagnostic data for convenience of description, and may also be used by the comparison classification module 11 The classified data is classified and screened according to the above-mentioned measured medical history data and the measured voice data to jointly form the above-mentioned diagnostic data.

上述所舉實施例,僅用為方便說明本發明並非加以限制,在不離本發明精神範疇,熟悉此一行業技藝人士依本發明申請專利範圍及創作說明所作之各種簡易變形與修飾,均仍應含括於以下申請專利範圍中。 The above-mentioned embodiments are not intended to limit the scope of the present invention, and various simple modifications and modifications made by those skilled in the art in accordance with the scope of the invention and the description of the invention should still be made without departing from the spirit of the invention. It is included in the scope of the following patent application.

Claims (11)

一種嗓音疾病偵測分類系統,包含: 一病史資料庫,能提供複數個比對病史資料; 一嗓音資料庫,能提供複數個對應於上述比對病史資料的比對嗓音資料; 一伺服終端,電性連接於上述病史資料庫與嗓音資料庫,並具有一病史問卷;以及 一可安裝於電子裝置的應用程式,電性連接於上述伺服終端,上述應用程式透過上述病史問卷形成一傳送至上述伺服終端的受測病史資料,並再能透過使用者的聲音形成一傳送至上述伺服終端的受測嗓音資料; 其中,上述伺服終端接收上述複數個比對病史資料、上述複數個比對嗓音資料、上述受測病史資料以及上述受測嗓音資料,並能將上述複數個比對病史資料與複數個比對嗓音資料分別依據上述受測病史資料與受測嗓音資料進行篩選來形成一傳送至上述應用程式的診斷資料。A voice disease detection and classification system comprises: a medical history database, which can provide a plurality of comparison medical history data; a voice database can provide a plurality of comparative voice data corresponding to the comparison medical history data; a servo terminal, Electrically connected to the medical history database and the voice database, and having a medical history questionnaire; and an application that can be installed on the electronic device, electrically connected to the server terminal, and the application program transmits the above-mentioned medical history questionnaire to the above The measured medical history data of the servo terminal, and then the user's voice is used to form a measured arpeggio data transmitted to the servo terminal; wherein the servo terminal receives the plurality of comparative medical history data and the plurality of comparative audio data And the above-mentioned medical history data and the above-mentioned tested voice data, and can filter the above-mentioned plurality of medical history data and a plurality of comparative voice data according to the measured medical history data and the measured voice data respectively to form a transmission to the above The diagnostic data of the application. 如申請專利範圍第1項所述之嗓音疾病偵測分類系統,其中,上述嗓音資料庫設有一第一提取模組,上述第一提取模組用以將複數個儲存於嗓音資料庫的原始嗓音資料提取形成上述比對嗓音資料。The voice detection and classification system of claim 1, wherein the voice database is provided with a first extraction module, and the first extraction module is configured to store a plurality of original voices stored in the voice database. The data extraction forms the above-mentioned comparative voice data. 如申請專利範圍第2項所述之嗓音疾病偵測分類系統,其中,上述第一提取模組透過梅爾頻率倒譜係數將上述原始嗓音資料提取形成上述比對嗓音資料。The arpeggio disease detection and classification system according to claim 2, wherein the first extraction module extracts the original voice data by using a Mel frequency cepstrum coefficient to form the comparison voice data. 如申請專利範圍第1項所述之嗓音疾病偵測分類系統,其中,上述嗓音資料庫進一步具有一標籤模組,上述標籤模組能對每一個上述比對嗓音資料標註至少一種病理,而上述標籤模組能標註有結節、息肉、囊腫、腫瘤、癌症、萎縮、震顫、麻痺與聲帶溝其中一種病理。The arpeggio disease detection and classification system according to claim 1, wherein the voice database further has a tag module, and the tag module can mark at least one pathology for each of the compared voice data, and the foregoing The label module can be labeled with nodules, polyps, cysts, tumors, cancer, atrophy, tremors, paralysis, and vocal folds. 如申請專利範圍第1項所述之嗓音疾病偵測分類系統,其中,上述應用程式具有一第二提取模組,上述第二提取模組用以將一使用者輸入的輸入嗓音資料提取形成上述受測嗓音資料。The sounding disease detection and classification system of claim 1, wherein the application has a second extraction module, and the second extraction module is configured to extract input audio data input by a user to form the above Tested voice data. 如申請專利範圍第5項所述之嗓音疾病偵測分類系統,其中,上述第二提取模組透過梅爾頻率倒譜係數將上述輸入嗓音資料提取形成上述受測嗓音資料。The arpeggio disease detection and classification system according to claim 5, wherein the second extraction module extracts the input voice data by using a Mel frequency cepstrum coefficient to form the measured voice data. 如申請專利範圍第1項所述之嗓音疾病偵測分類系統,其中,上述伺服終端具有一比對分類模組以及一電性連接於上述比對分類模組的處理模組,上述比對分類模組同時電性連接於上述病史資料庫與嗓音資料庫,而上述處理模組電性連接於上述應用程式。The vocal cord disease detection and classification system according to claim 1, wherein the servo terminal has a comparison classification module and a processing module electrically connected to the comparison classification module, and the comparison classification The module is electrically connected to the medical history database and the voice database, and the processing module is electrically connected to the application. 如申請專利範圍第7項所述之嗓音疾病偵測分類系統,其中,上處理模組將上述受測病史資料與受測嗓音資料,使得上述比對分類模組單獨透過上述複數個比對病史資料、上述複數個比對嗓音資料、上述受測病史資料以及上述受測嗓音資料來形成上述診斷資料。The arpeggio disease detection and classification system according to claim 7, wherein the upper processing module compares the measured medical history data with the measured voice data, so that the comparison classification module separately passes the plurality of comparison medical records. The data, the plurality of comparative voice data, the above-mentioned measured medical history data, and the above-mentioned measured voice data are used to form the above-mentioned diagnostic data. 如申請專利範圍第7項所述之嗓音疾病偵測分類系統,其中,上述比對分類模組將上述複數個比對病史資料與上述複數個比對嗓音資料依照不同的病理分類形成複數個分類資料,而上述比對分類模組與處理模組兩者其之一將上述複數個分類資料依據上述受測病史資料以及上述受測嗓音資料來形成上述診斷資料。The arpeggio disease detection and classification system according to claim 7, wherein the comparison classification module forms a plurality of classifications according to different pathological classifications of the plurality of comparison medical history data and the plurality of comparative voice data. And the one of the comparison classification module and the processing module forms the diagnostic data according to the measured medical history data and the measured voice data. 如申請專利範圍第9項所述之嗓音疾病偵測分類系統,其中,當形成上述診斷資料時,上述比對分類模組或處理模組會停止針對其餘的分類資料進行篩選比對。The vocal cord disease detection and classification system according to claim 9, wherein when the diagnostic data is formed, the comparison classification module or the processing module stops screening and screening the remaining classification data. 如申請專利範圍第7項所述之嗓音疾病偵測分類系統,其中,上述比對分類模組選自於決策樹、隨機森林、自適應增強分類、最近鄰居法、支撐向量機、高斯混合模型或深度神經網路的其中一種或多種系統模組進行分類。The arpeggio disease detection and classification system according to claim 7, wherein the comparison classification module is selected from the group consisting of a decision tree, a random forest, an adaptive enhanced classification, a nearest neighbor method, a support vector machine, and a Gaussian mixture model. Or classify one or more system modules of a deep neural network.
TW106130351A 2017-09-05 2017-09-05 Disease detecting and classifying system of voice TWI622980B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW106130351A TWI622980B (en) 2017-09-05 2017-09-05 Disease detecting and classifying system of voice

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW106130351A TWI622980B (en) 2017-09-05 2017-09-05 Disease detecting and classifying system of voice

Publications (2)

Publication Number Publication Date
TWI622980B true TWI622980B (en) 2018-05-01
TW201913649A TW201913649A (en) 2019-04-01

Family

ID=62951627

Family Applications (1)

Application Number Title Priority Date Filing Date
TW106130351A TWI622980B (en) 2017-09-05 2017-09-05 Disease detecting and classifying system of voice

Country Status (1)

Country Link
TW (1) TWI622980B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111370131A (en) * 2018-12-26 2020-07-03 陈治平 Method and system for screening biomarkers via disease trajectories
TWI719380B (en) * 2018-12-26 2021-02-21 台灣基督長老教會馬偕醫療財團法人馬偕紀念醫院 Method and system for selecting biomarker via disease trajectories
CN117976141A (en) * 2024-04-01 2024-05-03 四川大学华西医院 Voice rehabilitation analysis method and system based on acoustic analysis algorithm

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI749663B (en) * 2019-07-26 2021-12-11 醫療財團法人徐元智先生醫藥基金會亞東紀念醫院 Method for monitoring phonation and system thereof

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW201643865A (en) * 2015-06-03 2016-12-16 創心醫電股份有限公司 Method and system for recognizing physiological sound
TWI572327B (en) * 2014-10-01 2017-03-01 國立成功大學醫學院附設醫院 Apparatus, computer program product and computer readable medium using audio signal for detection and determination of narrowing condition of fluid pipe
TW201713284A (en) * 2015-10-15 2017-04-16 昌泰科醫股份有限公司 Sensing device for measuring physiological condition of pets capable of capturing the sound of the pet and accordingly determining the current mood or health status of the pet

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI572327B (en) * 2014-10-01 2017-03-01 國立成功大學醫學院附設醫院 Apparatus, computer program product and computer readable medium using audio signal for detection and determination of narrowing condition of fluid pipe
TW201643865A (en) * 2015-06-03 2016-12-16 創心醫電股份有限公司 Method and system for recognizing physiological sound
TW201713284A (en) * 2015-10-15 2017-04-16 昌泰科醫股份有限公司 Sensing device for measuring physiological condition of pets capable of capturing the sound of the pet and accordingly determining the current mood or health status of the pet

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111370131A (en) * 2018-12-26 2020-07-03 陈治平 Method and system for screening biomarkers via disease trajectories
TWI719380B (en) * 2018-12-26 2021-02-21 台灣基督長老教會馬偕醫療財團法人馬偕紀念醫院 Method and system for selecting biomarker via disease trajectories
CN111370131B (en) * 2018-12-26 2023-06-09 陈治平 Method and system for screening biomarkers via disease trajectories
CN117976141A (en) * 2024-04-01 2024-05-03 四川大学华西医院 Voice rehabilitation analysis method and system based on acoustic analysis algorithm

Also Published As

Publication number Publication date
TW201913649A (en) 2019-04-01

Similar Documents

Publication Publication Date Title
Mouawad et al. Robust detection of COVID-19 in cough sounds: using recurrence dynamics and variable Markov model
Murton et al. Cepstral peak prominence values for clinical voice evaluation
KR102630580B1 (en) Cough sound analysis method using disease signature for respiratory disease diagnosis
TWI622980B (en) Disease detecting and classifying system of voice
Khalifa et al. Non-invasive identification of swallows via deep learning in high resolution cervical auscultation recordings
Uloza et al. Exploring the feasibility of smart phone microphone for measurement of acoustic voice parameters and voice pathology screening
Cesari et al. Voice disorder detection via an m‐health system: design and results of a clinical study to evaluate Vox4Health
Szkoła et al. Recurrent Neural Networks in Computer‐Based Clinical Decision Support for Laryngopathies: An Experimental Study
Syed et al. Meta-analysis of voice disorders databases and applied machine learning techniques
Chen et al. Exploring racial and gender disparities in voice biometrics
Aly et al. A novel deep learning model to detect COVID-19 based on wavelet features extracted from Mel-scale spectrogram of patients’ cough and breathing sounds
Niu et al. A time-frequency channel attention and vectorization network for automatic depression level prediction
Vatanparvar et al. CoughMatch–subject verification using cough for personal passive health monitoring
Mittal et al. Deep learning approach for voice pathology detection and classification
Verde et al. Exploring the impact of data poisoning attacks on machine learning model reliability
Huang et al. Fast diagnosis of bowel activities
Ankışhan et al. Voice pathology detection by using the deep network architecture
Hafke-Dys et al. Artificial intelligence approach to the monitoring of respiratory sounds in asthmatic patients
Sfayyih et al. A review on lung disease recognition by acoustic signal analysis with deep learning networks
Park et al. Post-stroke respiratory complications using machine learning with voice features from mobile devices
Xu et al. A Novel Smart Depression Recognition Method Using Human‐Computer Interaction System
Chen et al. Deep learning in automatic detection of dysphonia: Comparing acoustic features and developing a generalizable framework
Karan et al. An investigation about the relationship between dysarthria level of speech and the neurological state of Parkinson’s patients
Wang et al. Pathological voice classification based on multi-domain features and deep hierarchical extreme learning machine
Kim et al. Non-invasive way to diagnose dysphagia by training deep learning model with voice spectrograms