WO2018176919A1 - 爆裂音识别方法及系统 - Google Patents
爆裂音识别方法及系统 Download PDFInfo
- Publication number
- WO2018176919A1 WO2018176919A1 PCT/CN2017/116491 CN2017116491W WO2018176919A1 WO 2018176919 A1 WO2018176919 A1 WO 2018176919A1 CN 2017116491 W CN2017116491 W CN 2017116491W WO 2018176919 A1 WO2018176919 A1 WO 2018176919A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sound
- component
- wet
- preset frequency
- frequency bands
- Prior art date
Links
Images
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B7/00—Instruments for auscultation
- A61B7/003—Detecting lung or respiration noise
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/08—Detecting, measuring or recording devices for evaluating the respiratory organs
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/72—Signal processing specially adapted for physiological signals or for diagnostic purposes
- A61B5/7235—Details of waveform analysis
- A61B5/7264—Classification of physiological signals or data, e.g. using neural networks, statistical classifiers, expert systems or fuzzy systems
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B7/00—Instruments for auscultation
- A61B7/02—Stethoscopes
- A61B7/04—Electric stethoscopes
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B2562/00—Details of sensors; Constructional details of sensor housings or probes; Accessories for sensors
- A61B2562/02—Details of sensors specially adapted for in-vivo measurements
- A61B2562/0204—Acoustic sensors
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/72—Signal processing specially adapted for physiological signals or for diagnostic purposes
- A61B5/7225—Details of analog processing, e.g. isolation amplifier, gain or sensitivity adjustment, filtering, baseline or drift compensation
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/72—Signal processing specially adapted for physiological signals or for diagnostic purposes
- A61B5/7235—Details of waveform analysis
- A61B5/7253—Details of waveform analysis characterised by using transforms
- A61B5/726—Details of waveform analysis characterised by using transforms using Wavelet transforms
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H40/00—ICT specially adapted for the management or administration of healthcare resources or facilities; ICT specially adapted for the management or operation of medical equipment or devices
- G16H40/60—ICT specially adapted for the management or administration of healthcare resources or facilities; ICT specially adapted for the management or operation of medical equipment or devices for the operation of medical equipment or devices
- G16H40/63—ICT specially adapted for the management or administration of healthcare resources or facilities; ICT specially adapted for the management or operation of medical equipment or devices for the operation of medical equipment or devices for local operation
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/20—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
Definitions
- the present disclosure relates to a popping sound recognition method and system.
- Pulmonary auscultation sounds are closely related to the pathological condition of the lungs.
- the popping sound is used as a special wet rales as the primary clinical clue to suggest interstitial lung disease.
- the hearing sensitivities of different human ears in different frequency bands are different, and the intensity of wet rales is low, the duration is short, and the bandwidth is wide, which may result in inconsistent diagnosis between different doctors.
- digital auscultation is usually used for assisted diagnosis, which can make up for the subjectivity and limitations of doctors' auscultation.
- accuracy of existing digital auscultation techniques for identifying popping sound is not high.
- the present disclosure provides a pop sound recognition method comprising the following steps:
- Calculating a power spectrum of the wet rales component and performing at least one of calculating: calculating a ratio of power of each of the plurality of preset frequency bands to total power of all of the preset frequency bands based on the power spectrum And the total power of all the preset frequency bands, at least one is selected as the frequency domain parameter; and the ratio of the number of wet rales in the late inspiratory phase to the total number of occurrences of the entire inspiratory wet rales and the entire inhalation wet The maximum amplitude of the sound, at least one is selected as the time domain parameter;
- the feature is input into the classification model for classification recognition to identify the popping sound.
- the step of processing the collected lung sound signal to extract a wet rales component of a breathing cycle comprising:
- a breathing cycle is determined based on the breath sound component, and the wet rales of the breathing cycle are extracted.
- the wavelet decomposition comprises coif2 wavelet decomposition.
- the number of decomposition layers is N ⁇ 6.
- An represents the approximate component of the nth layer of wavelet decomposition, where n is an integer and 0 ⁇ n ⁇ 10.
- the step of determining a breathing cycle according to the breath sound component comprises:
- a smooth breathing cycle is determined based on the respiratory gas phase information.
- the ratio corresponding to the two frequency bands with the largest difference is selected as the frequency domain parameter.
- the preset frequency band includes 50 Hz to 200 Hz and 500 Hz to 1000 Hz.
- the inspiratory phase is 1/2 of the entire inspiratory phase.
- the step of processing the acquired lung sound signal includes:
- the lung tone signal is filtered using a bandpass filter.
- the method further comprises acquiring a lung sound signal using the collector.
- the present disclosure also provides a popping sound recognition system, including:
- a processor configured to process the acquired lung sound signal to extract a wet rales component of a breathing cycle
- a calculator configured to perform at least one of the following calculations: calculating the wet rales a power spectrum of the component, based on the power spectrum, calculating a ratio of power of each of the preset frequency bands to a total power of all the preset frequency bands and a total power of all the preset frequency bands, and selecting at least one of Frequency domain parameters; and the ratio of the number of wet-sounds in the late inspiratory phase of the inspiratory cycle to the total number of occurrences of the total inspiratory wet rales and the maximum amplitude of the entire inspiratory wet rales, and at least one is selected as the time domain parameter. ;
- the identifier is configured to input the obtained frequency domain parameter and/or the time domain parameter as a parameter feature into the classification model for classification and identification to identify a popping sound.
- the processor includes:
- the wavelet decomposition circuit is configured to perform wavelet decomposition processing on the collected lung sound signals to obtain wet rales and breath sound components
- An extraction circuit configured to determine a breathing cycle based on the breath sound component and extract a wet rake of the breathing cycle.
- the wavelet decomposition comprises coif2 wavelet decomposition.
- the extracting circuit is configured to obtain an average power curve of the breath sound component in a preset frequency range; the peak point of the average power curve is identified as an inspiratory gas apex, and the valley point is a respiratory gas phase switching point, Thereby obtaining respiratory gas phase information; determining a smooth breathing cycle based on the respiratory gas phase information.
- the calculating unit is configured to select, according to a ratio of the power of each preset frequency band to the total power of all the preset frequency bands, a ratio corresponding to the two frequency bands with the largest difference as the frequency domain parameter.
- the preset frequency band includes 50 Hz to 200 Hz and 500 Hz to 1000 Hz.
- the inspiratory phase is 1/2 of the entire inspiratory phase.
- the popping sound recognition system further comprises a band pass filter for filtering the acquired lung sound signal.
- the popping sound recognition system further includes a collector configured to acquire a lung sound signal.
- the collector comprises a digital stethoscope.
- the present disclosure also provides a pop sound recognition system comprising:
- a processor configured to perform the following steps:
- Calculating a power spectrum of the wet rales component and performing at least one of calculating: calculating a ratio of power of each of the plurality of preset frequency bands to total power of all of the preset frequency bands based on the power spectrum And the total power of all the preset frequency bands, at least one is selected as the frequency domain parameter; and the ratio of the number of wet rales in the late inspiratory phase to the total number of occurrences of the entire inspiratory wet rales and the entire inhalation wet The maximum amplitude of the sound, at least one is selected as the time domain parameter;
- At least one of the obtained frequency domain parameter and the time domain parameter is input as a parameter feature into the classification model for classification recognition to identify a popping sound.
- FIG. 1 is a flowchart of a popping sound recognition method according to Embodiment 1 of the present disclosure
- FIG. 2 is a schematic diagram showing the relationship between a lung sound signal, a wet rales component, and a breath sound component in the present disclosure
- FIG. 3 is a flow chart of step S2 of Figure 1;
- FIG. 4 is a flow chart of step S22 of Figure 3;
- Figure 5 is a graph showing the average power of the breath sound component
- FIG. 6 is a schematic block diagram of a popping sound recognition system according to Embodiment 2 of the present disclosure.
- Figure 7 is a schematic block diagram of the processing unit of Figure 6.
- FIG. 1 is a flowchart of a method for identifying a popping sound according to Embodiment 1 of the present disclosure; referring to FIG. 1, the method for identifying a popping sound provided by the embodiment includes:
- the present disclosure may use a power ratio (PR) of the power of each preset frequency band as a total frequency of all preset frequency bands as a frequency domain parameter; or, the total power PR total of all preset frequency bands is taken as The frequency domain parameter; or, the ratio of the power of each preset frequency band to the total power of all the preset frequency bands (Power Ratio, PR) and the total power PR total of all the preset frequency bands are used as the frequency domain parameters.
- PR power ratio
- the ratio of the number of wet rales in the inspiratory phase of the respiratory cycle to the total number of occurrences of the entire inspiratory wet rales is used as a time domain parameter; or, the maximum amplitude of the entire inspiratory wet rales is used as a time domain parameter.
- the ratio of the number of wet rhythms in the inspiratory phase of the respiratory cycle to the total number of occurrences of the wetness of the inspiratory phase and the maximum amplitude of the entire inspiratory wet rales are taken as time domain parameters.
- the classification model is pre-trained and has a function of identifying a popping sound according to frequency domain parameters and/or time domain parameters.
- the recognition result can be directly obtained.
- the training classification model can be a support vector machine model (SVM classification model).
- step S1 may be omitted, that is, the present invention is not limited to processing the directly collected lung sound signal, and the method of the present invention may process the lung sound signal obtained by other means.
- the present invention can also process lung tone signals that are acquired in advance and stored in a computer readable medium.
- an inspiratory phase includes an inspiratory phase and an inspiratory phase.
- the wet rales can be classified into an inspiratory pre-wet rales and an inspiratory late-wet rales.
- the present disclosure since the pitch of the cracking sound is higher, stronger, and more frequently in the late phase of the inspiratory phase according to the clinical experience, and the amplitude and frequency of the high, strong, and wet rales of the pitch are correlated, the present disclosure passes the wet rales. At least one of a power ratio in different frequency bands and a total power of all frequency bands as a frequency domain parameter; and/or, a maximum amplitude of wet rales and a late inspiratory phase wet At least one of the rhythm appearance ratios is used as a time domain parameter to improve the recognition accuracy of the popping sound.
- the acquired lung sound signal is filtered by using a band pass filter to remove noise such as heart sound signals and power frequency interference.
- the band pass filter includes, but is not limited to, an 8th order Butterworth band pass filter, and the pass band of the band pass filter may be 50 Hz to 2000 Hz.
- the acquisition time can be, but not limited to, 10s.
- the sampling frequency fs can be but not limited to 10000Hz. .
- step S2 can be implemented by processing the collected lung sound signals to extract a wet rales component of a breathing cycle by the following steps S21 and S22:
- the wavelet decomposition method is used in the present disclosure to process the collected lung sound signals, so that the processing process is simpler, and thus the burst sound recognition efficiency can be improved.
- the wavelet is decomposed into a Coiflet wavelet (coif N) decomposition.
- the coif2 wavelet decomposition method is adopted in the present disclosure because the coif2 wavelet has better symmetry, and can avoid waveform distortion of the reconstructed wet rales and breath sound components, thereby further improving the recognition of popping sound. Accuracy.
- Wavelet decomposition process the original signal S is first divided into two components, the high-frequency component is also called the detail component d1, the low-frequency component is also called the approximate component a1, and then the low-frequency component is decomposed for the second time, divided into two components, the detail component d2 and Approximate component a2, and so on, until it is decomposed to a predetermined number of layers N.
- the number of decomposition layers of the coif2 wavelet decomposition is N ⁇ 6, and the waveform distortion of the wet rales component can be well avoided.
- the number of decomposition layers is 9, the d1 to d6 wavelet components are selected to obtain the wet rales component, and the d7 to d9 and a9 wavelet components are cumulatively obtained to obtain the breath component, wherein dn represents the wavelet decomposition of the nth layer of the detail component. , an represents the approximate component of the nth layer of wavelet decomposition, n is an integer, and 0 ⁇ n ⁇ 10.
- the frequency range corresponding to the high frequency band of the nth layer component is fs/2 n+1 to fs/2 n , 1 ⁇ n ⁇ N, although the main frequency band of the burst sound distribution is 200 Hz. ⁇ 1000Hz, however, the burst sound tone is higher than other wet rales, and the wet rales overall distribution frequency band is 50Hz ⁇ 2500Hz. Therefore, the detail component d1 ⁇ d6 wavelet component is selected to obtain the wet rales component, the detail component d7 ⁇ d9 and the approximation. The component a9 wavelet component is accumulated to obtain the breath sound component, which can well avoid waveform distortion of the wet rales component and the breath component.
- FIG. 2 is a schematic diagram showing the relationship between the lung sound signal, the wet rales component and the breath sound component in the present disclosure. It can be directly seen from FIG. 2: the wet rales component in the lung sound signal. It can be clearly seen that, as indicated by the black arrow, waveform distortion of the wet rales and breath sound components can be well avoided.
- wavelet decomposition can also adopt other decomposition methods.
- the step of determining a breathing cycle according to the breath sound component in the step S22 may be implemented by the following steps S221 to S223:
- S221 Obtain an average power curve of the breath sound component in a preset frequency range.
- the preset frequency range may be, but not limited to, 150 to 450 Hz.
- the frequency range of 150 Hz to 450 Hz is selected because the energy difference of the respiratory gas phase in this frequency band is most significant.
- the breath sound component may be first divided into a window of 100 ms duration, the window is shifted by 75 ms, and the respiratory sound signal in each window is calculated by the following formula (1).
- x(m) is the mth window of the breath sound component
- t n and f k are time and frequency, respectively
- w(t n Dm) is a moving window function of duration T and window shift to D.
- the present disclosure uses a Hanning window, although other window functions can of course be used.
- f high is 450 Hz and f low is 150 Hz.
- the start time of the waveform at the apex of the inspiratory gas to the end of the waveform of the respiratory gas phase switching point is a breathing cycle, as shown in FIG. 5 T1-T3; the so-called stable breathing cycle refers to a relatively stable breathing cycle of the power curve.
- T1 is relatively flat compared to T2, so T2 is chosen as a smooth breathing cycle.
- the reason for determining a smooth breathing cycle in the present disclosure is that the patient's lung condition can be better reflected by a smooth breathing cycle, thereby facilitating the improvement of the accuracy of identifying the popping sound.
- the ratio corresponding to the two frequency bands with the largest difference is selected as the frequency domain parameter, which is equivalent to removing some similarities.
- the interference band factor can improve the accuracy of identifying the popping sound.
- the preset frequency band includes 50 Hz to 200 Hz and 500 Hz to 1000 Hz, because the ratio of the power of the two frequency bands to the total power of all the preset frequency bands is the largest.
- the preset frequency band may include, but is not limited to, four preset frequency bands of 50 Hz to 200 Hz, 200 Hz to 500 Hz, 500 Hz to 1000 Hz, and 1000 Hz to 1500 Hz.
- the inspiratory phase in the present disclosure is 1/2 of the entire inspiratory phase, that is, the inspiratory phase is equally divided into an inspiratory phase and an inspiratory phase, so that it can be ensured as much as possible.
- the popping sound is located in the late phase of the inspiratory phase, which improves the accuracy of identifying the popping sound.
- the power spectrum of one respiratory cycle wet rales component may be calculated, but not limited to, using a multi-order autoregressive model, for example, in this embodiment, The 14th-order autoregressive model calculates the power spectrum of a respiratory cycle wet rales.
- FIG. 6 is a schematic block diagram of a popping sound recognition system according to Embodiment 2 of the present disclosure.
- the popping sound recognition system includes an acquisition unit 10, a processing unit 11, a calculation unit 12, and an identification unit 13. among them
- the acquisition unit 10 is configured to acquire a lung sound signal.
- the processing unit 11 is configured to process the acquired lung sound signal to extract a wet rales component of a breathing cycle.
- the calculating unit 12 is configured to calculate a power spectrum of the wet rales component, and calculate, based on the power spectrum, a ratio of the power of each of the preset frequency bands to the total power of all the preset frequency bands and all the pre- Set the total power of the frequency band, select at least one as the frequency domain parameter, and/or calculate the proportion of the late inspiratory wet rales of the respiratory cycle to the total number of occurrences of the entire inspiratory wet rales and the entire inspiratory phase. For the maximum amplitude of the wet rales, select at least one as the time domain parameter.
- the identification unit 13 is configured to input the obtained frequency domain parameters and/or the time domain parameters as parameter features into the classification model for classification recognition to identify popping sounds.
- the acquisition unit 10 may be omitted, that is, the present invention is not limited to processing the directly collected lung sound signals, and the burst sound recognition system of the present invention may acquire lung sound signals obtained by other means. Process it.
- the present invention can also process lung tone signals that are acquired in advance and stored in a computer readable medium.
- acquisition unit 10 may include a collector (eg, an audio collector) for acquiring lung sound signals. More specifically, the collector may include a digital stethoscope, although the invention is not limited thereto.
- the processing unit can include a processor configured to process the lung sound signal.
- computing unit 12 may include a calculator configured to perform further calculations on the wet rales component. The processor and the calculator may be specifically configured by logic circuits, integrated circuits, dedicated processors, general purpose processors, etc., but the present invention is not limited thereto.
- the present disclosure is based on wetness due to the fact that the pitch of the popping sound is higher, stronger and more likely to occur in the late phase of the inspiratory phase, and the amplitude and frequency of the pitch of the high, strong and wet rales are related according to clinical experience.
- Power ratio of Luoyin in different frequency bands, total power of all bands At least one of the rates is used as the frequency domain parameter, and/or at least one of the maximum amplitude of the wet rales and the appearance ratio of the late inspiratory wet rales is used as the time domain parameter, and the recognition accuracy of the popping sound can be improved.
- the processing unit 11 includes: a wavelet decomposition module 111 and an extraction module 112. among them
- the wavelet decomposition module 111 is configured to perform wavelet decomposition processing on the collected lung sound signals to obtain wet rales and breath sound components.
- the extraction module 112 is configured to determine a breathing cycle based on the breath sound component and extract a wet rake of the breathing cycle.
- the above modules may be constituted by circuits that perform respective functions.
- the wavelet decomposition module 111 may be constructed of a wavelet decomposition circuit configured to perform wavelet decomposition on the signal.
- the extraction module 112 can be comprised of an extraction circuit configured to extract wet rales. More specifically, the wavelet decomposition circuit and the extraction circuit may be specifically configured by a logic circuit, an integrated circuit, a dedicated processor, a general purpose processor, etc., but the present invention is not limited thereto.
- the collected lung sound signal is processed by using wavelet decomposition method, so that the processing process is simpler, and thus the recognition efficiency of the pop sound can be improved.
- the wavelet decomposition comprises coif2 wavelet decomposition.
- the coif2 wavelet decomposition method is adopted in the present disclosure because the coif2 wavelet has better symmetry, and can avoid waveform distortion of the reconstructed wet rales and breath sound components, thereby further improving the recognition of popping sound. Accuracy.
- the number of decomposition layers is N ⁇ 6.
- the component, an, represents the approximate component of the nth layer of wavelet decomposition, where n is an integer and 0 ⁇ n ⁇ 10.
- the extraction module 112 is configured to obtain an average power curve of the breath sound component in a preset frequency range; the peak point of the average power curve is identified as an inspiratory gas apex, and the valley point is a respiratory gas phase switching point, Thereby obtaining respiratory gas phase information; determining a smooth breathing cycle based on the respiratory gas phase information. Determining a smooth breathing cycle in the present disclosure can better reflect the patient's lung condition, thereby facilitating the identification of popping sounds. Accuracy.
- the calculating unit 12 is configured to select, as a frequency domain parameter, a ratio corresponding to two frequency bands with the largest difference among the ratios of the power of each preset frequency band to the total power of all the preset frequency bands.
- the ratio corresponding to the two frequency bands with the largest difference is selected as the frequency domain parameter, and some factors of the adjacent interference frequency band can be removed, thereby improving the accuracy of identifying the popping sound.
- the preset frequency band includes 50 Hz to 200 Hz and 500 Hz to 1000 Hz, because the ratio of the power of the two frequency bands to the total power of all the preset frequency bands is the largest.
- the late phase of the inspiratory phase is 1/2 of the entire inspiratory phase, so that the burst sound can be ensured as far as possible in the late phase of the inspiratory phase, thereby improving the accuracy of identifying the popping sound.
- the popping sound recognition system further includes a band pass filter configured to filter the lung sound signal.
- burst sound recognition system provided in this embodiment is a product embodiment corresponding to the burst sound recognition method provided in the first embodiment, and the crack sound recognition method has been described in detail in the above-described first embodiment, For related features of the cracking sound recognition system and the identification method, please refer to the above embodiment 1, and details are not described herein again.
- a popping sound recognition system including:
- a processor configured to perform a popping sound recognition method according to the aforementioned embodiment of the present invention.
- the pitch of the popping sound is higher, stronger and more likely to occur in the late phase of the inspiratory phase, while the pitch, frequency and frequency of the high, strong and wet rales of the pitch are related. Therefore, the present disclosure is based on the wet rales in different frequency bands. At least one of the power ratio above, the total power of all the bands is used as the frequency domain parameter; and/or at least one of the maximum amplitude of the wet rales and the appearance ratio of the late inspiratory wet rales is used as the time domain parameter, which can be improved. The accuracy of the recognition of popping sounds.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Public Health (AREA)
- Medical Informatics (AREA)
- General Health & Medical Sciences (AREA)
- Surgery (AREA)
- Animal Behavior & Ethology (AREA)
- Molecular Biology (AREA)
- Heart & Thoracic Surgery (AREA)
- Veterinary Medicine (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Pulmonology (AREA)
- Pathology (AREA)
- Artificial Intelligence (AREA)
- Physiology (AREA)
- Biophysics (AREA)
- Data Mining & Analysis (AREA)
- Primary Health Care (AREA)
- Epidemiology (AREA)
- Databases & Information Systems (AREA)
- Psychiatry (AREA)
- Evolutionary Computation (AREA)
- Fuzzy Systems (AREA)
- Signal Processing (AREA)
- Mathematical Physics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)
Abstract
Description
Claims (23)
- 一种爆裂音识别方法,包括以下步骤:对获取到的肺音信号进行处理以提取一个呼吸周期的湿罗音成分;计算所述湿罗音成分的功率谱,基于所述功率谱执行以下计算中的至少一种:计算多个预设频带中每个预设频带的功率占所有的预设频带的总功率的比例和所有的预设频带的总功率,选取至少一种作为频域参数;以及计算吸气相后期湿罗音出现个数占整个吸气相湿罗音出现总数的比例和整个吸气相湿罗音的最大振幅,选取至少一个作为时域参数;将获得的所述频域参数和所述时域参数中的至少一个作为参数特征输入到分类模型中进行分类识别,以识别爆裂音。
- 根据权利要求1所述的爆裂音识别方法,其中,所述对采集到的肺音信号进行处理以提取一个呼吸周期的湿罗音成分的步骤,包括:对采集到的肺音信号进行小波分解处理,获得湿罗音成分和呼吸音成分;根据所述呼吸音成分确定一个呼吸周期,并提取该呼吸周期的湿罗音。
- 根据权利要求2所述的爆裂音识别方法,其中,所述小波分解包括coif2小波分解。
- 根据权利要求3所述的爆裂音识别方法,其中,分解层数N≥6。
- 根据权利要求4所述的爆裂音识别方法,其中,分解层数N=9;选取d1~d6小波分量累加获得湿罗音成分,d7~d9及a9小波分 量累加获得呼吸音成分;其中,dn表示小波分解第n层的细节分量,an表示小波分解第n层的近似分量,n为整数,且0<n<10。
- 根据权利要求2所述的爆裂音识别方法,其中,所述根据所述呼吸音成分确定一个呼吸周期的步骤,包括:获得呼吸音成分在预设频率范围内的平均功率曲线;识别所述平均功率曲线的峰值点为吸气相顶点,谷值点为呼吸气相切换点,从而获得呼吸气相信息;根据所述呼吸气相信息确定一个平稳的呼吸周期。
- 根据权利要求1-6任一项所述的爆裂音识别方法,其中,在每个预设频带的功率占所有的预设频带的总功率的比例中,选取差别最大的两个频带对应的比例作为频域参数。
- 根据权利要求1-7任一项所述的爆裂音识别方法,其中,所述预设频带包括50Hz~200Hz和500Hz~1000Hz。
- 根据权利要求1-8任一项所述的爆裂音识别方法,其中,所述吸气相后期为整个吸气相周期的1/2。
- 根据权利要求1-9任一项所述的爆裂音识别方法,其中,所述对获取到的肺音信号进行处理的步骤,包括:使用带通滤波器对肺音信号进行滤波。
- 根据权利要求1所述的爆裂音识别方法,其中,所述分类模型是通过预先训练得到的,所述分类模型被配置为根据频域参数和时域参数中的至少一个识别爆裂音。
- 根据权利要求1所述的爆裂音识别方法,还包括:利用采集器获取肺音信号。
- 一种爆裂音识别系统,包括:处理器,配置为对获取到的肺音信号进行处理以提取一个呼吸周期的湿罗音成分;计算器,配置为执行以下计算中的至少一种:计算所述湿罗音成分的功率谱,基于所述功率谱计算多个预设频带中每个预设频带的功率占所有的预设频带的总功率的比例和所有的预设频带的总功率,选取至少一种作为频域参数;以及计算呼吸周期的吸气相后期湿罗音出现个数占整个吸气相湿罗音出现总数的比例和整个吸气相湿罗音的最大振幅,选取至少一个作为时域参数;识别器,配置为将获得的所述频域参数和/或所述时域参数作为参数特征输入到分类模型中进行分类识别,以识别爆裂音。
- 根据权利要求13所述的爆裂音识别系统,其中,所述处理器包括:小波分解电路,配置为对采集到的肺音信号进行小波分解处理,获得湿罗音成分和呼吸音成分;提取电路,配置为根据所述呼吸音成分确定一个呼吸周期,并提取该呼吸周期的湿罗音。
- 根据权利要求14所述的爆裂音识别系统,其中,所述小波分解包括coif2小波分解。
- 根据权利要求14所述的爆裂音识别系统,其中,所述提取电路,配置为获得呼吸音成分在预设频率范围内的平均功率曲线;识别所述平均功率曲线的峰值点为吸气相顶点,谷值点为呼吸气相切换点,从而获得呼吸气相信息;根据所述呼吸气相信息确定一个平稳的呼吸周期。
- 根据权利要求13所述的爆裂音识别系统,其中,所述计算器,配置为在每个预设频带的功率占所有的预设频带的总功率的比例中,选取差别最大的两个频带对应的比例作为频域参数。
- 根据权利要求17所述的爆裂音识别系统,其中,所述预设频带包括50Hz~200Hz和500Hz~1000Hz。
- 根据权利要求13所述的爆裂音识别系统,其中,所述吸气相后期为整个吸气相周期的1/2。
- 根据权利要求13所述的爆裂音识别系统,还包括带通滤波器,对获取到的肺音信号进行滤波。
- 根据权利要求13所述的爆裂音识别系统,还包括:采集器,配置为获取肺音信号。
- 根据权利要求21所述的爆裂音识别系统,其中所述采集器包括数字听诊器。
- 一种爆裂音识别系统,包含:存储器,存储肺音信号;处理器,所述处理器配置为执行以下步骤:对所述肺音信号进行处理以提取一个呼吸周期的湿罗音成分;计算所述湿罗音成分的功率谱,基于所述功率谱执行以下计算中的至少一种:计算多个预设频带中每个预设频带的功率占所有的预设频带的总功率的比例和所有的预设频带的总功率,选取至少一种作为频域参数;以及计算吸气相后期湿罗音出现个数占整个吸气相湿罗音出现总数的比例和整个吸气相湿罗音的最大振幅,选取至少一个作为时域参数;将获得的所述频域参数和所述时域参数中的至少一个作为参数特征输入到分类模型中进行分类识别,以识别爆裂音。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US16/081,880 US11660062B2 (en) | 2017-03-31 | 2017-12-15 | Method and system for recognizing crackles |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710210176.3 | 2017-03-31 | ||
CN201710210176.3A CN108652658A (zh) | 2017-03-31 | 2017-03-31 | 爆裂音识别方法及系统 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2018176919A1 true WO2018176919A1 (zh) | 2018-10-04 |
Family
ID=63674157
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2017/116491 WO2018176919A1 (zh) | 2017-03-31 | 2017-12-15 | 爆裂音识别方法及系统 |
Country Status (3)
Country | Link |
---|---|
US (1) | US11660062B2 (zh) |
CN (1) | CN108652658A (zh) |
WO (1) | WO2018176919A1 (zh) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111931728B (zh) * | 2020-09-23 | 2021-01-12 | 杭州百世伽信息科技有限公司 | 自动提取湿罗音特征的方法 |
CN112668556B (zh) * | 2021-01-21 | 2024-06-07 | 广东白云学院 | 一种呼吸音识别方法及系统 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2011155048A1 (ja) * | 2010-06-10 | 2011-12-15 | 富士通株式会社 | 音声処理装置および呼吸検出方法 |
CN103932733A (zh) * | 2014-04-11 | 2014-07-23 | 中国人民解放军第三军医大学第三附属医院 | 一种基于肺音的肺间质纤维化的数字化检测分析方法 |
CN105662454A (zh) * | 2016-01-08 | 2016-06-15 | 中国科学院声学研究所 | 一种罗音检测的方法和装置 |
CN105913066A (zh) * | 2016-04-13 | 2016-08-31 | 刘国栋 | 一种基于相关向量机的数字化肺音特征降维方法 |
CN106251880A (zh) * | 2015-06-03 | 2016-12-21 | 创心医电股份有限公司 | 识别生理声音的方法以及系统 |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6171258B1 (en) * | 1998-10-08 | 2001-01-09 | Sleep Solutions, Inc. | Multi-channel self-contained apparatus and method for diagnosis of sleep disorders |
WO2002071947A1 (en) * | 2001-03-09 | 2002-09-19 | Biomedical Acoustic Research, Inc. | Acoustic detection of gastric motility dysfunction |
US20080058607A1 (en) * | 2006-08-08 | 2008-03-06 | Zargis Medical Corp | Categorizing automatically generated physiological data based on industry guidelines |
US9492096B2 (en) * | 2009-11-03 | 2016-11-15 | Vivaquant Llc | ECG sensing apparatuses, systems and methods |
US8758262B2 (en) * | 2009-11-25 | 2014-06-24 | University Of Rochester | Respiratory disease monitoring system |
US8702628B2 (en) * | 2010-03-18 | 2014-04-22 | Panasonic Corporation | Physiological sound examination device |
WO2012042611A1 (ja) * | 2010-09-29 | 2012-04-05 | 富士通株式会社 | 呼吸検出装置および呼吸検出方法 |
US10595813B2 (en) * | 2011-09-01 | 2020-03-24 | Medtronic, Inc. | Method and apparatus for monitoring cardiac and respiratory conditions using acoustic sounds |
US10786164B2 (en) * | 2015-12-12 | 2020-09-29 | Verily Life Sciences Llc | Method for improving heart rate estimates by combining multiple measurement modalities |
-
2017
- 2017-03-31 CN CN201710210176.3A patent/CN108652658A/zh active Pending
- 2017-12-15 WO PCT/CN2017/116491 patent/WO2018176919A1/zh active Application Filing
- 2017-12-15 US US16/081,880 patent/US11660062B2/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2011155048A1 (ja) * | 2010-06-10 | 2011-12-15 | 富士通株式会社 | 音声処理装置および呼吸検出方法 |
CN103932733A (zh) * | 2014-04-11 | 2014-07-23 | 中国人民解放军第三军医大学第三附属医院 | 一种基于肺音的肺间质纤维化的数字化检测分析方法 |
CN106251880A (zh) * | 2015-06-03 | 2016-12-21 | 创心医电股份有限公司 | 识别生理声音的方法以及系统 |
CN105662454A (zh) * | 2016-01-08 | 2016-06-15 | 中国科学院声学研究所 | 一种罗音检测的方法和装置 |
CN105913066A (zh) * | 2016-04-13 | 2016-08-31 | 刘国栋 | 一种基于相关向量机的数字化肺音特征降维方法 |
Also Published As
Publication number | Publication date |
---|---|
US20210007703A1 (en) | 2021-01-14 |
CN108652658A (zh) | 2018-10-16 |
US11660062B2 (en) | 2023-05-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110570880B (zh) | 一种鼾声信号识别方法 | |
CN106037655B (zh) | 一种睡眠周期信号的分离及周期计算方法及装置 | |
US11304662B2 (en) | Lung-sound signal processing method, processing device, and readable storage medium | |
CN110970042B (zh) | 一种电子听诊器的肺部啰音人工智能实时分类方法、系统、装置及可读存储介质 | |
CN102283670A (zh) | 一种心音信号量化分析方法及装置 | |
CN111640439A (zh) | 一种基于深度学习的呼吸音分类方法 | |
Zakaria et al. | Three resnet deep learning architectures applied in pulmonary pathologies classification | |
CN104473660B (zh) | 一种基于子带能量包络自相关特征的异常心音识别方法 | |
Lin et al. | Wheeze recognition based on 2D bilateral filtering of spectrogram | |
WO2020151169A1 (zh) | 一种电子听诊器的摩擦音干扰自动消除方法 | |
WO2018176919A1 (zh) | 爆裂音识别方法及系统 | |
CN110942784A (zh) | 基于支持向量机的鼾声分类系统 | |
Abdollahpur et al. | Cycle selection and neuro-voting system for classifying heart sound recordings | |
Rizal et al. | Comparison of discrete wavelet transform and wavelet packet decomposition for the lung sound classification | |
CN112949639A (zh) | 一种先天性心脏病心音智能分割分类算法、装置及存储介质 | |
Singh et al. | An Artificial Intelligence based mobile solution for early detection of valvular heart diseases | |
CN113476074A (zh) | 基于智能穿戴系统的肺音实时监测方法 | |
Huang et al. | ECG baseline wander correction based on ensemble empirical mode decomposition with complementary adaptive noise | |
KR20160147591A (ko) | 천식 진단을 위한 청진음 신호의 전처리 방법 및 장치 | |
CN109036472B (zh) | 一种改进的病理嗓音基音频率提取方法 | |
CN112908291A (zh) | 一种电子听诊器的毛刺噪声消除方法 | |
Ihza et al. | Study of Denoising Method to Detect Valvular Heart Disease Using Phonocardiogram (PCG) | |
CN115040135A (zh) | 一种基于心电心音的儿童心血管疾病诊断方法 | |
Shah et al. | Separation of cardiorespiratory sounds using time-frequency masking and sparsity | |
Torre-Cruz et al. | Wheezing sound separation based on constrained non-negative matrix factorization |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 17904230 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 17904230 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 16.03.2020) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 17904230 Country of ref document: EP Kind code of ref document: A1 |