WO2006095380A1 - Automatic method for measuring a baby's, particularly a newborn's, cry, and related apparatus - Google Patents


Info

Publication number
WO2006095380A1
Authority
WO
WIPO (PCT)
Prior art keywords
score
function
frequency
acoustic signal
cry
Prior art date
Application number
PCT/IT2006/000145
Other languages
French (fr)
Inventor
Renata Sisto
Carlo Valerio Bellieni
Giuseppe Buonocore
Original Assignee
Università Degli Studi Di Siena
Istituto Superiore Per La Prevenzione E La Sicurezza Del Lavoro (Ispel)
Azienda Ospedaliera Universitaria Senese
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Università Degli Studi Di Siena, Istituto Superiore Per La Prevenzione E La Sicurezza Del Lavoro (Ispel), Azienda Ospedaliera Universitaria Senese filed Critical Università Degli Studi Di Siena
Priority to EP06711448A priority Critical patent/EP1856687A1/en
Priority to US11/817,927 priority patent/US20080235030A1/en
Publication of WO2006095380A1 publication Critical patent/WO2006095380A1/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/26Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices

Definitions

  • the present invention relates to an automatic method for measuring a baby's, particularly a newborn's, cry, and the related apparatus, that allows in a simple, reliable, and inexpensive way to provide an indication of the pain level suffered by the baby starting from the analysis of his/her cry acoustic characteristics.
  • Pain, or DAN (Douleur Aiguë Nouveau-né), evaluates facial expressions, limb movements, and newborn's vocalizations for generating a score ranging from 0 (corresponding to lack of pain) to 10 (corresponding to maximum pain).
  • A. having N samples p(i), for i = 0, 1, ..., (N-1), of an acoustic signal p(t) representing the cry, sampled at a sampling frequency $f_s$ for a period of duration P; the method being characterised in that it assigns a score PainScore to the acoustic signal p(t) by means of a function AF of one or more acoustic parameters selected from the group comprising:
  • the automatic method according to the invention measures a baby's, in particular a newborn's, cry starting from its time and/or spectral acoustic analysis.
  • the method is based on recording and analysing newborn's cry.
  • the pain level is preferably assigned through the combined evaluation of a set of one or more measurable acoustic parameters, which are related to the pain level.
  • a quantitative estimate of the pain level is obtained on the basis of a validated pain scale, based on the cry acoustic characteristics.
  • the acoustic parameters used for the diagnosis comprise one or more of the following three ones: the fundamental or pitch frequency; the normalised amplitude, with respect to the maximum value, of the root-mean-square or rms value; and the presence of a specific characteristic of cry frequency and amplitude modulation, which characteristic is defined as "siren cry".
  • the method provides as output value a score, preferably ranging from 0 to 6, that is proposed as an adequate scale for describing the pain level.
  • an apparatus for measuring a baby's cry comprising processing means, characterised in that it is capable to perform the previously described automatic method for measuring a baby's cry, the apparatus preferably further comprising means for detecting acoustic signals, and sampling means, capable to sample said acoustic signals.
  • the apparatus performs the aforementioned automatic method for measuring a baby's cry, through an automatic acoustic analysis of the newborn's cry, in order to provide an objective estimate of the newborn's pain level.
  • Figure 1 shows a flow chart of a preferred embodiment of the method according to the invention
  • Figure 2 shows a detailed flow chart of step 2 of the method of Figure 1;
  • Figure 3 shows a graph of the rms values of normalised acoustic signals during cry sequences of 24 seconds as a function of the DAN scale;
  • Figure 4 shows a detailed flow chart of step 3 of the method of Figure 1 ;
  • Figure 5 shows a graph of the values of the fundamental frequency F 0 as a function of the DAN scale
  • FIG. 6 shows a detailed flow chart of step 4 of the method of Figure 1.
  • same references will be used to indicate alike elements in the Figures.
  • cry acoustic parameters which are measured by the method according to the invention for providing a measure of the cry, indicative of the pain level suffered by the baby, comprise: - the normalised amplitude, with respect to the maximum value, of the root-mean-square or rms value of the acoustic signal;
  • the rms value normalised to its maximum is not a measure of the absolute intensity of the cry, but rather a measure of the constancy of the emission: in other words, it measures the fraction of the observation time during which the signal is close to its maximum. This is related to the pain level, since a suffering newborn tends to cry for a long time close to its maximum reachable level.
  • a normalised rms value over 0,15-0,2 is associated with high pain levels.
  • the fundamental frequency or pitch is typically higher in cry caused by pain.
  • a pitch frequency over 350-450 Hz is typically correlated with high pain levels.
  • Another specific characteristic of cry due to a high pain is the regularity and reproducibility of the configurations of amplitude and frequency modulation on a short time scale, of the order of 1 second, which configurations define the so-called siren cry, with a persistent configuration lasting several periods.
  • the time-frequency intensity configuration of this siren cry shows a periodical modulation of the fundamental frequency F 0 and of its multiple frequencies, while the mean power spectrum has a quasi-periodical peak structure.
  • the method comprises a step 2 of processing a first score on the basis of the root-mean-square value in the period P of the N samples p(i) of the acoustic signal p(t).
  • the method still comprises a step 3 of processing a second score on the basis of the fundamental or pitch frequency F 0 of the acoustic signal p(t), that is on the basis of the minimum frequency at which a peak in the spectrum of the acoustic signal p(t) occurs. Furthermore, the method comprises a step 4 of processing a third score on the basis of the characteristic defined as "siren cry", preferably not null only in case of persistent cry, i.e. with value of the first score larger than a corresponding threshold value.
  • the method comprises a step 5 of adding up the three calculated scores, that is given as output in a step 6.
  • step 2 comprises:
  • Figure 3 shows the rms values of the normalised acoustic pressure during a cry sequence of 24 seconds, as a function of the DAN scale.
  • the first function is continuous, more preferably equal to:
  • the first function $g_1(p^{norm}_{rms})$ may be discrete, so that the possible values of $p^{norm}_{rms}$ are subdivided into at least two ranges to which a respective value of $score(p^{norm}_{rms})$ corresponds.
  • such discrete function may be the following:
  • Sub-step 35 determines the pitch F0 as the minimum frequency at which a peak of the mean power spectrum $\overline{S}_{Hk}(j)$ occurs.
  • sub-step 35 determines the frequency F0 as the one corresponding to the first peak of the mean spectrum (i.e. to the first relative maximum) the value of which is larger than a threshold T1, preferably equal to the mean level $S_{mean}$ of the mean spectrum added to an offset value Δ1, possibly even negative, preferably equal to 5 dB:
  • step 3 finally comprises a sub-step
  • the second function g 2 (F 0 ) may be discrete, so that the possible values of F 0 are subdivided into at least two ranges to which a respective value of score(F 0 ) corresponds.
  • such discrete function may be as follows:
  • the integral i.e, the sum of the digitised values
  • in sub-step 43 the deviation $\Delta E_{F3\_F4}(k)$ of the energy contribution $E_{F3\_F4}(k)$ in the second frequency range with respect to its mean value $\overline{E}_{F3\_F4}$ is calculated
  • in the next sub-step 45 the digitised power spectrum of the signal obtained from sub-step 44 is calculated, which is indicative of the frequency components of the variation dynamics of the energy contribution $E_{F3\_F4}(k)$ in the second frequency range:
  • the integral (i.e., the sum of the digitised values) of the spectrum $V_{F3-F4}(k)$ between F7 and F8:
  • step 4 evaluates the presence and, possibly, the level of the so-called siren cry on the basis of a comparison of the energy contribution $V_{F7\_F8}$ in the fourth frequency range with the energy contribution $V_{F5\_F6}$ in the third frequency range of the spectral dynamics $V_{F3-F4}(k)$, consequently assigning the third score in relation to such possible characteristic of the siren cry.
  • the third score score(siren cry) is advantageously assigned by means of a third, either continuous or discrete, preferably monotonic non-decreasing, function $g_3(V_{F5\_F6} - V_{F7\_F8})$ of the difference between the two mentioned energy contributions $(V_{F5\_F6} - V_{F7\_F8})$.
  • the third function $g_3(V_{F5\_F6} - V_{F7\_F8})$ is discrete, with two intervals of membership for the difference $(V_{F5\_F6} - V_{F7\_F8})$
  • step 4 of Figure 1 comprises a sub-step 48 in which it is verified whether the energy contribution $V_{F7\_F8}$ within the fourth frequency range is larger than 60% of the energy contribution $V_{F5\_F6}$ within the third frequency range.
  • Such a null score is preferably also assigned when there is no persistent cry, i.e. when the normalised rms value of the acoustic signal is low. As shown in Figure 6, this condition is implemented through a preliminary sub-step 40 of step 4, which verifies that the first score $score(p^{norm}_{rms})$, depending on the normalised rms value, is larger than a respective threshold T2, more preferably equal to 1,85.
  • step 4 of Figure 1 continues with the successive sub-steps 41-48 of Figure 6, illustrated above.
  • step 4 of Figure 1 directly continues with sub-step 50 of assigning a null value to the third score score(siren cry).
  • the third function $g_3(V_{F5\_F6} - V_{F7\_F8})$ is discrete, with more than two intervals of membership for the difference $(V_{F5\_F6} - V_{F7\_F8})$, to which a respective score value score(siren cry) corresponds.
  • the third function $g_3(V_{F5\_F6} - V_{F7\_F8})$ may be continuous.
  • the signal power spectrum has been calculated for each interval for providing a time sequence of 256 spectra for each newborn, with a frequency resolution of about 10,77 Hz.
  • a Hanning window has been applied to each interval.
  • Time evolution of these spectra has been displayed as time-frequency intensity graphs, which may be used for a preliminary heuristic analysis.
  • the acoustic pressure signal p(t) of each cry sequence has been normalised to its maximum amplitude p max .
  • the rms value of the normalised acoustic pressure has been calculated for each waveform.
  • a first score has been assigned to the normalised rms value by means of the continuous function [1] that is optimised as in [2].
  • a spectrogram i.e. the graph of the sound spectral composition as time varies
  • time resolution of about 0,093 s
  • spectrogram has been frequency integrated from 2 to 8 kHz, obtaining an integrated signal that is a time function with a time resolution equal to about 0,093 s;
  • the presence of the "siren cry" has been assigned to the cry signal if the energy within the frequency range of 0,6-1,7 Hz is larger than 60% of the total energy within the range of 0,4-5,3 Hz.
  • the pain score as illustrated in Figure 6 has been assigned to the presence of the "siren cry", i.e.:
  • the total score PainScore equal to the sum of the three (possibly weighed) scores which are calculated with respect to the three characteristics of the cry acoustic signal:
  • $PainScore = score(p^{norm}_{rms}) + score(F_0) + score(siren\ cry)$ has given a reliable indication of the level of pain suffered by the newborn by means of the following correspondence table, validated in literature:
  • the instrument has been successfully tested on the recordings of 57 crying newborns, whose pain level has been independently evaluated by using the DAN index, providing values in accordance with the ones of the prototype.

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Measuring Pulse, Heart Rate, Blood Pressure Or Blood Flow (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)

Abstract

The present invention concerns an automatic method for measuring a baby's cry, comprising the following step: A. having N samples ρ(i), for i = O, 1,..., (N-I), of an acoustic signal p(t) representing the cry, sampled at a sampling frequency^ for a period of duration P; the method being characterised in that it assigns a score PainScore to the acoustic signal p(t) by means of a function AF of one or more acoustic parameters selected from the group comprising: - a root-mean-square or rms value prms of the acoustic signal p(t) in the period P; - a fundamental or pitch frequency F0 of the acoustic signal p(t), i.e. the minimum frequency at which a peak in the spectrum of the acoustic signal p(t) occurs in the period P; and - a configuration of amplitude and frequency modulation of the acoustic signal p(t) in the period P. The invention further concerns the apparatus performing the method.

Description

AUTOMATIC METHOD FOR MEASURING A BABY'S, PARTICULARLY A NEWBORN'S, CRY, AND RELATED APPARATUS
The present invention relates to an automatic method for measuring a baby's, particularly a newborn's, cry, and the related apparatus, that allows in a simple, reliable, and inexpensive way to provide an indication of the pain level suffered by the baby starting from the analysis of his/her cry acoustic characteristics.
Pain has different levels, quantifiable from zero up to a maximum, and the behaviour of babies consequently varies. In the last years, pain scales have been developed for discriminating the level of pain suffered by a newborn.
By way of example, the score scale known as Newborn's Sharp Pain, or DAN (Douleur Aiguë Nouveau-né), evaluates facial expressions, limb movements, and the newborn's vocalizations to generate a score ranging from 0 (corresponding to lack of pain) to 10 (corresponding to maximum pain).
However, such scales are hardly usable, since they cannot be easily automated so as to provide objective and repeatable indications, because they require an active evaluation by an operator.
It is therefore an object of the present invention to provide in a simple, reliable, and inexpensive way an automatic, and hence objective and repeatable, indication of a baby's, in particular a newborn's, pain level.
It is specific subject matter of the present invention an automatic method for measuring a baby's cry, comprising the following step:
A. having N samples p(i), for i = 0, 1, ..., (N-1), of an acoustic signal p(t) representing the cry, sampled at a sampling frequency $f_s$ for a period of duration P; the method being characterised in that it assigns a score PainScore to the acoustic signal p(t) by means of a function AF of one or more acoustic parameters selected from the group comprising:
- a root-mean-square or rms value prms of the acoustic signal p(t) in the period P;
- a fundamental or pitch frequency F0 of the acoustic signal p(t), i.e. the minimum frequency at which a peak in the spectrum of the acoustic signal p(t) occurs in the period P; and
- a configuration of amplitude and frequency modulation of the acoustic signal p(t) in the period P.
In other words, the automatic method according to the invention measures a baby's, in particular a newborn's, cry starting from its time and/or spectral acoustic analysis.
In particular, the method is based on recording and analysing the newborn's cry. The pain level is preferably assigned through the combined evaluation of a set of one or more measurable acoustic parameters, which are related to the pain level. A quantitative estimate of the pain level is obtained on the basis of a validated pain scale, based on the cry acoustic characteristics. The acoustic parameters used for the diagnosis comprise one or more of the following three: the fundamental or pitch frequency; the normalised amplitude, with respect to the maximum value, of the root-mean-square or rms value; and the presence of a specific characteristic of cry frequency and amplitude modulation, which characteristic is defined as "siren cry". The method provides as output value a score, preferably ranging from 0 to 6, that is proposed as an adequate scale for describing the pain level.
Further characteristics of other embodiments of the method according to the invention are defined in the enclosed claims 2-29. It is still subject matter of the present invention an apparatus for measuring a baby's cry, comprising processing means, characterised in that it is capable to perform the previously described automatic method for measuring a baby's cry, the apparatus preferably further comprising means for detecting acoustic signals, and sampling means, capable to sample said acoustic signals.
In other words, the apparatus according to the invention performs the aforementioned automatic method for measuring a baby's cry, through an automatic acoustic analysis of the newborn's cry, in order to provide an objective estimate of the newborn's pain level. The present invention will now be described, by way of illustration and not by way of limitation, according to its preferred embodiments, by particularly referring to the Figures of the enclosed drawings, in which:
Figure 1 shows a flow chart of a preferred embodiment of the method according to the invention;
Figure 2 shows a detailed flow chart of step 2 of the method of Figure 1;
Figure 3 shows a graph of the rms values of normalised acoustic signals during cry sequences of 24 seconds as a function of the DAN scale;
Figure 4 shows a detailed flow chart of step 3 of the method of Figure 1;
Figure 5 shows a graph of the values of the fundamental frequency F0 as a function of the DAN scale; and
Figure 6 shows a detailed flow chart of step 4 of the method of Figure 1.
In the following description, the same references will be used to indicate like elements in the Figures.
As mentioned, the cry acoustic parameters which are measured by the method according to the invention for providing a measure of the cry, indicative of the pain level suffered by the baby, comprise:
- the normalised amplitude, with respect to the maximum value, of the root-mean-square or rms value of the acoustic signal;
- the fundamental frequency or pitch of the acoustic signal;
- the persistence of regular configurations of frequency and amplitude modulation (configurations defined as "siren cry").
The higher the values of such acoustic parameters are, the higher is the pain level of the baby.
The rms value normalised to its maximum is not a measure of the absolute intensity of the cry, but rather a measure of the constancy of the emission: in other words, it measures the fraction of the observation time during which the signal is close to its maximum. This is related to the pain level, since a suffering newborn tends to cry for a long time close to its maximum reachable level. Preferably, a normalised rms value over 0,15-0,2 is associated with high pain levels.
The fundamental frequency or pitch is typically higher in cry caused by pain. A pitch frequency over 350-450 Hz is typically correlated with high pain levels.
Another specific characteristic of cry due to a high pain is the regularity and reproducibility of the configurations of amplitude and frequency modulation on a short time scale, of the order of 1 second, which configurations define the so-called siren cry, with a persistent configuration lasting several periods. The time-frequency intensity configuration of this siren cry shows a periodical modulation of the fundamental frequency F0 and of its multiple frequencies, while the mean power spectrum has a quasi-periodical peak structure.
All the three cry acoustic parameters described above are correlated with the pain level, independently evaluated by using the DAN score scale.
With reference to Figure 1, it may be observed that a preferred embodiment of the method according to the invention comprises a step 1 of acquiring N samples p(i), for i = 0, 1, ..., (N-1), of the acoustic signal p(t), which is sampled at a suitable sampling frequency $f_s$ (taking into account that the Nyquist frequency is equal to $f_s/2$) for a period of duration P.
Preferably, P is not shorter than 20 seconds, and N is equal to a power of 2 ($N = 2^A$).
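The two preferences on P and N can be checked with a few lines of arithmetic. The sketch below is only an illustration: the function name is hypothetical, and the 44,1 kHz rate is used as an example value (it is the rate quoted for the prototype). It finds the smallest power of two N whose duration N/f_s is at least 20 seconds.

```python
import math

def smallest_power_of_two_samples(fs_hz, min_duration_s=20.0):
    """Return (A, N, P): N = 2**A is the smallest power of two with P = N / fs >= min_duration_s.

    Hypothetical helper, not part of the described method.
    """
    A = math.ceil(math.log2(fs_hz * min_duration_s))
    N = 2 ** A
    return A, N, N / fs_hz

# Example value: 44.1 kHz, the rate quoted for the prototype.
A, N, P = smallest_power_of_two_samples(44100.0)
print(A, N, round(P, 2))
```

With this rate the result is A = 20, i.e. a record of slightly less than 24 seconds.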
Afterwards, the method comprises a step 2 of processing a first score on the basis of the root-mean-square value in the period P of the N samples p(i) of the acoustic signal p(t).
The method still comprises a step 3 of processing a second score on the basis of the fundamental or pitch frequency F0 of the acoustic signal p(t), that is on the basis of the minimum frequency at which a peak in the spectrum of the acoustic signal p(t) occurs. Furthermore, the method comprises a step 4 of processing a third score on the basis of the characteristic defined as "siren cry", preferably not null only in case of persistent cry, i.e. with value of the first score larger than a corresponding threshold value.
Finally, the method comprises a step 5 of adding up the three calculated scores, that is given as output in a step 6.
With reference to Figure 2, it may be observed that step 2 comprises:
- a sub-step 21 of determining the maximum amplitude $p_{max}$ of the acoustic signal p(t) in the period P:
$$p_{max} = \max_{i=0,1,\ldots,(N-1)} |p(i)|$$
- a sub-step 22 of calculating the root-mean-square value of the acoustic signal p(t), normalised to its maximum amplitude $p_{max}$, in the period P:
$$p^{norm}_{rms} = \sqrt{\frac{1}{N}\sum_{i=0}^{N-1}\left(\frac{p(i)}{p_{max}}\right)^{2}}$$
- a sub-step 23 of assigning the first score to the normalised rms value $p^{norm}_{rms}$, by means of a first, either continuous or discrete, preferably monotonic non-decreasing, function $g_1(p^{norm}_{rms})$.
In particular, Figure 3 shows the rms values of the normalised acoustic pressure during a cry sequence of 24 seconds, as a function of the DAN scale.
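Sub-steps 21 and 22 can be sketched as follows. This is a minimal illustration assuming the usual definition of the rms value (square root of the mean of the squared, normalised samples); the function name and the two toy signals are hypothetical.

```python
import math

def normalised_rms(samples):
    """Rms value of the signal after normalising it to its maximum absolute amplitude."""
    p_max = max(abs(x) for x in samples)            # sub-step 21
    if p_max == 0:
        raise ValueError("silent signal: p_max is zero")
    mean_square = sum((x / p_max) ** 2 for x in samples) / len(samples)
    return math.sqrt(mean_square)                   # sub-step 22

# A signal that always sits at its maximum level.
square = [1.0, -1.0] * 8
# A single short burst in otherwise silent data: rarely near its maximum.
burst = [0.0] * 15 + [0.5]
print(normalised_rms(square), normalised_rms(burst))
```

The first signal gives a value of 1 and the second 0,25 regardless of the absolute level of the burst, which illustrates why the parameter measures the constancy of the emission rather than its loudness.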
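Preferably the first function $g_1(p^{norm}_{rms})$ is continuous, more preferably equal to:
$$score(p^{norm}_{rms}) = \frac{2}{\pi}\arctan\left(\alpha\,(p^{norm}_{rms} - \beta)\right) + 1 \qquad [1]$$
where coefficients α and β are preferably equal to the following values:
$$\alpha = 100, \qquad \beta = 0,14 \qquad [2]$$
so that the values of $score(p^{norm}_{rms})$ meet the following conditions:
for $p^{norm}_{rms} \ll 0,1$ it is $score(p^{norm}_{rms}) \approx 0$
for $p^{norm}_{rms} = 0,1$ it is $score(p^{norm}_{rms}) = 0,15$
for $p^{norm}_{rms} = 0,14$ it is $score(p^{norm}_{rms}) = 1$
for $p^{norm}_{rms} = 0,18$ it is $score(p^{norm}_{rms}) = 1,85$
for $p^{norm}_{rms} \gg 0,18$ it is $score(p^{norm}_{rms}) \approx 2$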
Alternatively, the first function $g_1(p^{norm}_{rms})$ may be discrete, so that the possible values of $p^{norm}_{rms}$ are subdivided into at least two ranges to which a respective value of $score(p^{norm}_{rms})$ corresponds. Preferably, such discrete function may be the following:
$$score(p^{norm}_{rms}) = \begin{cases} 0 & \text{for } 0 < p^{norm}_{rms} < 0,1 \\ 1 & \text{for } 0,1 \le p^{norm}_{rms} < 0,18 \\ 2 & \text{for } p^{norm}_{rms} \ge 0,18 \end{cases}$$
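The two variants of the first function can be compared numerically. The sketch below reads the continuous function [1] as (2/π)·arctan(α(p − β)) + 1 with α = 100 and β = 0,14, a reading that reproduces the listed conditions to within about 0,01; the function names are hypothetical.

```python
import math

ALPHA = 100.0   # coefficient alpha quoted in the text
BETA = 0.14     # coefficient beta quoted in the text

def score_rms_continuous(p_norm, alpha=ALPHA, beta=BETA):
    """Continuous first score: a smooth step from 0 to 2 centred on beta."""
    return (2.0 / math.pi) * math.atan(alpha * (p_norm - beta)) + 1.0

def score_rms_discrete(p_norm):
    """Discrete first score with the three ranges given in the text."""
    if p_norm < 0.1:
        return 0
    if p_norm < 0.18:
        return 1
    return 2

for p in (0.05, 0.1, 0.14, 0.18, 0.3):
    print(p, round(score_rms_continuous(p), 2), score_rms_discrete(p))
```

The continuous form avoids abrupt jumps for values close to the range limits, while the discrete form is easier to read off a table.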
With reference to Figure 4, it may be observed that step 3 of Figure 1 comprises a sub-step 31 of subdividing the N samples p(i) into M time intervals, of duration equal to D = P/M, wherein M is preferably equal to a power of 2 ($M = 2^B$, with B ≤ A), each one of which hence comprises $N_D$ samples, with
$$N_D = N/M = 2^{(A-B)}$$
In order to avoid, in the subsequent frequency analysis, the introduction of spurious spectral characteristics caused by cutting the waveform off, in sub-step 31 a Hanning window $W_H(j)$ (for j = 0, 1, ..., ($N_D$ - 1)) is applied to each interval, thus obtaining, for each one of the M intervals, $N_D$ samples $p_{Hk}(j)$ (where k is the interval index, i.e. k = 0, 1, ..., (M-1)):
$$p_{Hk}(j) = p(k N_D + j)\, W_H(j) \quad \text{for } j = 0, 1, \ldots, (N_D - 1) \text{ and } k = 0, 1, \ldots, (M-1)$$
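Sub-step 31 can be sketched as below. The symmetric form of the Hanning window used here is one common convention and is an assumption, since the text does not give the window formula; the function names are hypothetical.

```python
import math

def hann_window(n):
    """Symmetric Hann (Hanning) window of length n; the exact variant is an assumption."""
    if n == 1:
        return [1.0]
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * j / (n - 1)) for j in range(n)]

def windowed_intervals(samples, M):
    """Split N samples into M consecutive intervals of N_D = N / M samples and window each one."""
    N = len(samples)
    if N % M != 0:
        raise ValueError("N must be a multiple of M")
    ND = N // M
    w = hann_window(ND)
    return [[samples[k * ND + j] * w[j] for j in range(ND)] for k in range(M)]

# Toy input: 32 constant samples split into 4 intervals of 8 samples.
frames = windowed_intervals([1.0] * 32, M=4)
print(len(frames), len(frames[0]))
```

Each windowed interval starts and ends at zero, which is what suppresses the spurious components caused by the abrupt cut.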
In the subsequent sub-step 32, the power spectrum of the digitised signal is calculated for each interval:
$$S_{Hk}(j) = FT_{N_D}\{p_{Hk}(j)\} \quad \text{for } j = 0, 1, \ldots, (N_D - 1) \text{ and } k = 0, 1, \ldots, (M-1)$$
where $y(j) = FT_{N_D}\{x(j)\}$ indicates the operator $FT_{N_D}$ (preferably the Fourier transform of the autocorrelation function) that transforms $N_D$ samples x(j) from the time domain to $N_D$ samples y(j) in the frequency domain. As a consequence, sub-step 32 yields a time sequence of M spectra, each one with a frequency resolution $R_f$ equal to:
$$R_f = \frac{f_s}{N_D}$$
and a bandwidth B1 equal to the Nyquist frequency:
$$B1 = \frac{f_s}{2}$$
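A minimal sketch of sub-step 32 follows. It computes the power spectrum as the squared magnitude of a naive discrete Fourier transform, which is equivalent to transforming the (circular) autocorrelation; the 1/n normalisation and the function names are assumptions, and a real implementation would use an FFT.

```python
import cmath
import math

def power_spectrum(frame):
    """Power spectrum of one interval via a naive O(n^2) DFT (normalisation is an assumption)."""
    n = len(frame)
    spectrum = []
    for b in range(n):
        acc = sum(x * cmath.exp(-2j * math.pi * b * i / n) for i, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2 / n)
    return spectrum

def frequency_resolution(fs_hz, ND):
    """Frequency spacing between spectral samples: R_f = f_s / N_D."""
    return fs_hz / ND

# Toy interval: a tone falling exactly on bin 3 of a 16-sample interval.
frame = [math.cos(2.0 * math.pi * 3 * i / 16) for i in range(16)]
S = power_spectrum(frame)
peak_bin = max(range(8), key=lambda b: S[b])   # search up to the Nyquist bin
print(peak_bin, round(frequency_resolution(44100.0, 4096), 2))
```

With 44,1 kHz sampling and intervals of 4096 samples the resolution is about 10,77 Hz, the value quoted for the prototype.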
Afterwards, in sub-step 33 the mean spectrum $\overline{S}_{Hk}(j)$ of the M spectra is calculated:
$$\overline{S}_{Hk}(j) = \frac{1}{M}\sum_{k=0}^{M-1} S_{Hk}(j) \quad \text{for } j = 0, 1, \ldots, (N_D - 1)$$
Sub-step 34 determines the mean value $S_{mean}$ of the mean spectrum $\overline{S}_{Hk}(j)$ in a first frequency range included between two respective frequency limit values F1 and F2 (to which two indexes $j_1 = F_1/R_f$ and $j_2 = F_2/R_f$ correspond), preferably included within the low frequency part of the spectrum bandwidth B1:
$$S_{mean} = \frac{1}{j_2 - j_1 + 1}\sum_{j=j_1}^{j_2} \overline{S}_{Hk}(j)$$
Sub-step 35 determines the pitch F0 as the minimum frequency at which a peak of the mean power spectrum $\overline{S}_{Hk}(j)$ occurs. In particular, sub-step 35 determines the frequency F0 as the one corresponding to the first peak of the mean spectrum (i.e. to the first relative maximum) the value of which is larger than a threshold T1, preferably equal to the mean level $S_{mean}$ of the mean spectrum added to an offset value Δ1, possibly even negative, preferably equal to 5 dB:
$$F_0 = R_f \cdot \min\{\, j : \overline{S}_{Hk}(j) \text{ is a relative maximum and } \overline{S}_{Hk}(j) > T1 \,\}, \qquad T1 = S_{mean} + \Delta 1$$
This definition of the pitch F0 is independent of the absolute calibration. In particular, Figure 5 shows the values of the fundamental frequency F0, as a function of the DAN scale. The continuous line is an interpolation of all the data, while the two dotted lines are two different interpolations for the data related to cries of newborns with DAN < 8 and with DAN > 8.
Still with reference to Figure 4, step 3 finally comprises a sub-step 36 of assigning the second score to the value of fundamental frequency or pitch F0, by means of a second, either continuous or discrete, preferably monotonic non-decreasing, function $g_2(F_0)$.
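The pitch search of sub-steps 33 to 35 can be sketched as follows. The sketch assumes spectra expressed in dB (so that the 5 dB offset is additive), takes the mean over the index range j1..j2 inclusive, and treats a "peak" as a sample higher than its left neighbour and not lower than its right one; these choices, the function names and the toy spectrum are assumptions.

```python
def mean_spectrum(spectra):
    """Sub-step 33: average the M spectra sample by sample."""
    M = len(spectra)
    return [sum(s[j] for s in spectra) / M for j in range(len(spectra[0]))]

def pitch_from_mean_spectrum(S_bar, Rf, j1, j2, delta1=5.0):
    """Sub-steps 34-35: first relative maximum above T1 = S_mean + delta1, returned in Hz."""
    S_mean = sum(S_bar[j1:j2 + 1]) / (j2 - j1 + 1)
    T1 = S_mean + delta1
    for j in range(1, len(S_bar) - 1):
        is_peak = S_bar[j] > S_bar[j - 1] and S_bar[j] >= S_bar[j + 1]
        if is_peak and S_bar[j] > T1:
            return j * Rf
    return None  # no peak exceeds the threshold

# Toy mean spectrum in dB: a small bump at index 2, a strong peak at index 5.
S_bar = [0.0, 1.0, 3.0, 2.0, 0.0, 20.0, 0.0, 0.0, 12.0, 0.0]
F0 = pitch_from_mean_spectrum(S_bar, Rf=80.0, j1=0, j2=9)
print(F0)
```

In the toy data the bump at index 2 is a relative maximum but stays below the threshold, so the search continues to index 5. Because the threshold is defined relative to the mean level of the same spectrum, the result does not depend on the absolute calibration of the recording chain.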
Preferably the second function $g_2(F_0)$ is continuous, more preferably equal to:
$$score(F_0) = \frac{2}{\pi}\arctan\left(\gamma\,(F_0 - \delta)\right) + 1 \qquad [3]$$
where coefficients γ and δ are preferably equal to the following values (consistent with the conditions below when F0 is expressed in kHz):
$$\gamma = 100, \qquad \delta = 0,4$$
so that the values of score(F0) meet the following conditions:
for F0 ≪ 350 Hz it is score(F0) ≈ 0
for F0 = 350 Hz it is score(F0) = 0,13
for F0 = 400 Hz it is score(F0) = 1
for F0 = 450 Hz it is score(F0) = 1,87
for F0 ≫ 450 Hz it is score(F0) ≈ 2
Alternatively, the second function $g_2(F_0)$ may be discrete, so that the possible values of F0 are subdivided into at least two ranges to which a respective value of score(F0) corresponds. Preferably, such discrete function may be as follows:
[Equation image imgf000009_0001: discrete second function, not reproduced in the text]
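The continuous second function can be checked numerically. The sketch reads function [3] as (2/π)·arctan(γ(F0 − δ)) + 1 with γ = 100 and δ = 0,4, and assumes F0 is expressed in kHz inside the formula, since that is the unit for which the listed conditions are reproduced; the function name is hypothetical.

```python
import math

def score_pitch_continuous(f0_hz, gamma=100.0, delta=0.4):
    """Continuous second score; F0 is converted to kHz before applying gamma and delta."""
    f0_khz = f0_hz / 1000.0
    return (2.0 / math.pi) * math.atan(gamma * (f0_khz - delta)) + 1.0

for f0 in (250.0, 350.0, 400.0, 450.0, 600.0):
    print(f0, round(score_pitch_continuous(f0), 2))
```

The score rises from about 0 to about 2 across the 350-450 Hz band, which the text associates with high pain levels.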
With reference to Figure 6, it may be observed that step 4 of Figure 1 comprises a sub-step 41 in which, for each digitised power spectrum $S_{Hk}(j)$ of the signal, obtained in sub-step 32 of Figure 4, the energy contribution $E_{F3\_F4}(k)$ is calculated in a second frequency range included between two respective frequency limit values F3 and F4 (to which two indexes $j_3 = F_3/R_f$ and $j_4 = F_4/R_f$ correspond), preferably included within the low frequency part of the spectrum bandwidth B1. In other words, the integral (i.e., the sum of the digitised values) of the spectrum between F3 and F4 is calculated:
$$E_{F3\_F4}(k) = \sum_{j=j_3}^{j_4} S_{Hk}(j) \quad \text{for } k = 0, 1, \ldots, (M-1)$$
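In sub-step 42, the mean value $\overline{E}_{F3\_F4}$ along time of the energy contribution $E_{F3\_F4}(k)$ is calculated:
$$\overline{E}_{F3\_F4} = \frac{1}{M}\sum_{k=0}^{M-1} E_{F3\_F4}(k)$$
In sub-step 43, the deviation $\Delta E_{F3\_F4}(k)$ of the energy contribution $E_{F3\_F4}(k)$ in the second frequency range with respect to its mean value $\overline{E}_{F3\_F4}$ is calculated:
$$\Delta E_{F3\_F4}(k) = E_{F3\_F4}(k) - \overline{E}_{F3\_F4} \quad \text{for } k = 0, 1, \ldots, (M-1)$$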
In sub-step 44, a window $W_{flat\_top}(k)$ (for k = 0, 1, ..., (M-1)) having a spectrum with a flat-top main lobe, known as flat-top window, is applied to such deviation, thus obtaining M samples $\Delta E^{flat\_top}_{F3\_F4}(k)$:
$$\Delta E^{flat\_top}_{F3\_F4}(k) = \Delta E_{F3\_F4}(k)\, W_{flat\_top}(k) \quad \text{for } k = 0, 1, \ldots, (M-1)$$
In the next sub-step 45, the digitised power spectrum of the signal $\Delta E^{flat\_top}_{F3\_F4}(k)$ obtained from sub-step 44 is calculated, which is indicative of the frequency components of the variation dynamics of the energy contribution $E_{F3\_F4}(k)$ in the second frequency range:
$$V_{F3-F4}(k) = FT_M\left[\Delta E^{flat\_top}_{F3\_F4}(k)\right] \quad \text{for } k = 0, 1, \ldots, (M-1)$$
thus obtaining M samples $V_{F3-F4}(k)$ in the frequency domain, with frequency resolution $VR_f$ equal to:
$$VR_f = \frac{f_s}{N}$$
and a bandwidth equal to:
$$\frac{f_s}{2 N_D}$$
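Sub-steps 41 to 45 can be sketched end to end on toy data. The flat-top coefficients below are the widely used five-term set and are an assumption, since the text only names the window type; the naive DFT, its normalisation, the function names and the toy spectra are likewise illustrative.

```python
import cmath
import math

# Common five-term flat-top coefficients (assumption: not given in the text).
FLAT_TOP = (0.21557895, 0.41663158, 0.277263158, 0.083578947, 0.006947368)

def flat_top_window(m):
    """Symmetric flat-top window of length m."""
    if m == 1:
        return [1.0]
    return [
        sum((-1) ** t * a * math.cos(2.0 * math.pi * t * k / (m - 1))
            for t, a in enumerate(FLAT_TOP))
        for k in range(m)
    ]

def energy_dynamics_spectrum(spectra, j3, j4):
    """Band energy per interval -> deviation from mean -> flat-top window -> power spectrum."""
    E = [sum(s[j3:j4 + 1]) for s in spectra]          # sub-step 41
    E_mean = sum(E) / len(E)                          # sub-step 42
    dE = [e - E_mean for e in E]                      # sub-step 43
    w = flat_top_window(len(dE))
    dE_w = [d * wk for d, wk in zip(dE, w)]           # sub-step 44
    M = len(dE_w)
    V = []
    for b in range(M):                                # sub-step 45 (naive DFT)
        acc = sum(x * cmath.exp(-2j * math.pi * b * k / M) for k, x in enumerate(dE_w))
        V.append(abs(acc) ** 2 / M)
    return V

# Toy spectra: the energy in bin 1 oscillates 4 times over M = 32 intervals.
M = 32
spectra = [[0.0, 2.0 + math.cos(2.0 * math.pi * 4 * k / M), 0.0] for k in range(M)]
V = energy_dynamics_spectrum(spectra, j3=1, j4=1)
modulation_resolution_hz = 44100.0 / 2 ** 20   # f_s / N for the prototype figures
print(round(modulation_resolution_hz, 4))
```

In the toy example the energy of V concentrates in the few bins around index 4, the modulation rate of the band energy; the flat-top window spreads it over neighbouring bins in exchange for an amplitude that is insensitive to the exact modulation frequency.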
In the next sub-step 46, the energy contribution $V_{F5\_F6}$ is calculated in a third frequency range included between two respective frequency limit values F5 and F6 (to which two indexes $k_5 = F_5/VR_f$ and $k_6 = F_6/VR_f$ correspond), which preferably excludes only the lowest-frequency end of the spectrum $V_{F3-F4}(k)$. In other words, the integral (i.e., the sum of the digitised values) of the spectrum $V_{F3-F4}(k)$ between F5 and F6 is calculated:
$$V_{F5\_F6} = \sum_{k=k_5}^{k_6} V_{F3-F4}(k)$$
In the next sub-step 47, the energy contribution $V_{F7\_F8}$ is calculated in a fourth frequency range included between two respective frequency limit values F7 and F8 (to which two indexes $k_7 = F_7/VR_f$ and $k_8 = F_8/VR_f$ correspond), preferably included within the part of the spectrum $V_{F3-F4}(k)$ at frequencies around 1 Hz, more preferably included within the third frequency range. In other words, the integral (i.e., the sum of the digitised values) of the spectrum $V_{F3-F4}(k)$ between F7 and F8 is calculated:
$$V_{F7\_F8} = \sum_{k=k_7}^{k_8} V_{F3-F4}(k)$$
Afterwards, step 4 evaluates the presence and, possibly, the level of the so-called siren cry on the basis of a comparison of the energy contribution $V_{F7\_F8}$ in the fourth frequency range with the energy contribution $V_{F5\_F6}$ in the third frequency range of the spectral dynamics $V_{F3-F4}(k)$, consequently assigning the third score in relation to such possible characteristic of the siren cry. In particular, the third score score(siren cry) is advantageously assigned by means of a third, either continuous or discrete, preferably monotonic non-decreasing, function $g_3(V_{F5\_F6} - V_{F7\_F8})$ of the difference between the two mentioned energy contributions $(V_{F5\_F6} - V_{F7\_F8})$.
Preferably, the third function $g_3(V_{F5\_F6} - V_{F7\_F8})$ is discrete, with two intervals of membership for the difference $(V_{F5\_F6} - V_{F7\_F8})$, to which a respective score value score(siren cry) corresponds. In fact, as shown in Figure 6, step 4 of Figure 1 comprises a sub-step 48 in which it is verified whether the energy contribution $V_{F7\_F8}$ within the fourth frequency range is larger than 60% of the energy contribution $V_{F5\_F6}$ within the third frequency range. If so, the siren cry characteristic is considered as present, and sub-step 49 is performed, in which a value equal to 2 is assigned to the third score:
score(siren cry) = 2
Instead, in the case when the verification of sub-step 48 gives a negative outcome, the siren cry characteristic is considered as absent, and sub-step 50 is performed, in which a null value is assigned to the third score:
score(siren cry) = 0
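The decision of sub-steps 46 to 50 can be sketched as below. The band limits in the example (0,4-5,3 Hz and 0,6-1,7 Hz) are the values quoted for the prototype; the rounding of band limits to the nearest index, the 0,1 Hz resolution of the toy spectrum and the function names are assumptions.

```python
def band_sum(V, VRf, f_lo, f_hi):
    """Sum of the spectrum samples between two frequencies (limits rounded to the nearest index)."""
    k_lo = round(f_lo / VRf)
    k_hi = round(f_hi / VRf)
    return sum(V[k_lo:k_hi + 1])

def siren_cry_score(V, VRf, F5, F6, F7, F8, ratio=0.60):
    """Return 2 if the F7-F8 energy exceeds `ratio` of the F5-F6 energy, else 0."""
    V_56 = band_sum(V, VRf, F5, F6)     # sub-step 46: third frequency range
    V_78 = band_sum(V, VRf, F7, F8)     # sub-step 47: fourth frequency range
    return 2 if V_78 > ratio * V_56 else 0   # sub-steps 48-50

# Toy modulation spectra with 0.1 Hz resolution (hypothetical values).
siren_like = [0.0] * 60
siren_like[10] = 10.0    # strong component at 1 Hz
siren_like[30] = 2.0     # weak component at 3 Hz
irregular = [0.0] * 60
irregular[10] = 2.0
irregular[30] = 10.0

print(siren_cry_score(siren_like, 0.1, 0.4, 5.3, 0.6, 1.7),
      siren_cry_score(irregular, 0.1, 0.4, 5.3, 0.6, 1.7))
```

A cry whose band energy is modulated regularly at about 1 Hz concentrates its dynamics in the narrow range and receives the score 2; a cry whose dynamics are spread over other modulation rates receives 0.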
Such null score is preferably also assigned in the case when there is no persistent cry, i.e. in the case when the normalised rms value of the acoustic signal is low. As shown in Figure 6, such condition is achieved through a preliminary sub-step 40 of step 4 verifying that the first score score($p_{rms}^{norm}$) depending on the normalised rms value is larger than a respective threshold T2, more preferably equal to 1,85.
In the case when the verification of sub-step 40 has a positive outcome, i.e. a persistent cry has been recognised, then step 4 of Figure 1 continues with the successive sub-steps 41-48 of Figure 6, illustrated above.
Otherwise, i.e. in the case when the verification of sub-step 40 has a negative outcome, step 4 of Figure 1 directly continues with sub-step 50 of assigning a null value to the third score score(siren cry). Alternatively, the third function $g_3(V^{XTND}_{F3\_F4\_F5\_F6} - V^{SHRT}_{F3\_F4\_F7\_F8})$ is discrete, with more than two intervals of membership for the difference ($V^{XTND}_{F3\_F4\_F5\_F6} - V^{SHRT}_{F3\_F4\_F7\_F8}$), to each of which a respective score value score(siren cry) corresponds. Still alternatively, the third function $g_3(V^{XTND}_{F3\_F4\_F5\_F6} - V^{SHRT}_{F3\_F4\_F7\_F8})$ may be continuous.
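The decision logic of sub-steps 40 and 48 to 50 can be sketched as follows. This is only an illustrative sketch of the preferred discrete embodiment: the function and parameter names are hypothetical, and the two energy contributions are assumed to have been computed beforehand.

```python
def siren_cry_score(energy_fourth_range, energy_third_range, first_score,
                    percentage_threshold=0.60, persistence_threshold=1.85):
    """Discrete third score, following sub-steps 40 and 48-50 of Figure 6.

    All names are illustrative; the default thresholds are the preferred
    values quoted in the description (60% and 1,85).
    """
    # Sub-step 40: without a persistent cry the score is null (sub-step 50).
    if first_score <= persistence_threshold:
        return 0
    # Sub-step 48: is the fourth-range energy larger than 60% of the
    # third-range energy?
    if energy_fourth_range > percentage_threshold * energy_third_range:
        return 2  # sub-step 49: siren cry present
    return 0      # sub-step 50: siren cry absent


print(siren_cry_score(0.7, 1.0, first_score=1.9))  # siren cry, persistent cry
print(siren_cry_score(0.7, 1.0, first_score=1.5))  # cry not persistent
```

The gating on the first score comes first so that a weak, non-persistent cry never receives a siren-cry score, whatever its modulation spectrum looks like.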
In the following, a prototype made by the inventors is illustrated, which operates according to a preferred embodiment of the method according to the invention for discriminating different pain levels. In particular, the prototype has been tested by analysing the cry, during heel prick, of 57 newborns, the pain intensity of which has been independently evaluated according to the DAN index.
The acoustic signal coming from a 1/2 inch (i.e. 1,27 cm) microphone, with a 50 mV/Pa sensitivity, has been sampled at a frequency of 44,1 kHz, corresponding to a Nyquist frequency of 22,05 kHz. This sampling frequency corresponds to the standard sampling rate of commercial audio devices. A digitised electronic file of about 23,77 s of duration (thus comprising N = 2^20 samples) has been extracted from each recording, starting from a given time t0 established by the operator.
The digitised waveform has been divided into M = 256 (equal to 2^8) time intervals, each one of about 92,88 ms of duration. The signal power spectrum has been calculated for each interval for providing a time sequence of 256 spectra for each newborn, with a frequency resolution of about 10,77 Hz. As said, in order to avoid the introduction of spurious spectral characteristics caused by cutting the waveform off, a Hanning window has been applied to each interval. Time evolution of these spectra has been displayed as time-frequency intensity graphs, which may be used for a preliminary heuristic analysis. The acoustic pressure signal p(t) of each cry sequence has been normalised to its maximum amplitude pmax.
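The figures quoted for the prototype follow from the sampling frequency and the two powers of two. The short check below assumes N = 2^20 samples and M = 2^8 intervals, which is what the quoted duration of about 23,77 s and the interval length of about 92,88 ms imply; the variable names are illustrative.

```python
fs = 44_100                 # sampling frequency in Hz
n_samples = 2 ** 20         # N, samples per analysed recording
n_intervals = 2 ** 8        # M, time intervals per recording

nyquist_hz = fs / 2                               # 22050.0 Hz
duration_s = n_samples / fs                       # about 23.78 s
samples_per_interval = n_samples // n_intervals   # ND = 4096
interval_ms = 1000 * samples_per_interval / fs    # about 92.88 ms
resolution_hz = fs / samples_per_interval         # Rf, about 10.77 Hz

print(nyquist_hz, samples_per_interval)
print(round(duration_s, 2), round(interval_ms, 2), round(resolution_hz, 2))
```

The interval length also fixes the time resolution of the spectrogram used later for the siren cry analysis (about 0,093 s).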
The rms value of the normalised acoustic pressure has been calculated for each waveform. A first score has been assigned to the normalised rms value by means of the continuous function [1] that is optimised as in [2].
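A minimal sketch of the first score is given below. It assumes that function [1] has the form (2/π)·arctan(α(x − β)) + 1, a reading reconstructed from claims 10 and 11 (α = 100, β = 0,14) and consistent with scores ranging between 0 and 2; the function names are hypothetical and the sketch is not the inventors' LabVIEW implementation.

```python
import math


def normalised_rms(samples):
    """Rms value of the samples after normalising to their maximum amplitude."""
    p_max = max(abs(p) for p in samples)
    if p_max == 0:
        return 0.0  # silent recording: nothing to normalise
    return math.sqrt(sum((p / p_max) ** 2 for p in samples) / len(samples))


def arctan_score(x, steepness, midpoint):
    """Assumed form of the continuous score: a smooth step from 0 to 2."""
    return (2.0 / math.pi) * math.atan(steepness * (x - midpoint)) + 1.0


def rms_score(samples, alpha=100.0, beta=0.14):
    """First score: the arctan step applied to the normalised rms value."""
    return arctan_score(normalised_rms(samples), alpha, beta)


# A pure tone has a normalised rms of about 0.707, far above beta.
tone = [math.sin(2 * math.pi * i / 100) for i in range(1000)]
print(normalised_rms(tone), rms_score(tone))
```

With α = 100 the step is steep, so the score behaves almost like a threshold at a normalised rms of 0,14 while remaining continuous.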
The mean of the 256 spectra has then been calculated, in order to determine the pitch F0 as the minimum frequency at which a peak of the mean power spectrum occurs. In particular, a peak has been considered as such when the signal exceeds by at least 5 dB the mean level of the spectrum within the frequency range 3-7.5 kHz. A second score has been assigned to the pitch value F0 by means of the continuous function [3] that is optimised as in [4].
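The pitch search can be sketched with NumPy as follows. The sketch makes two assumptions that the text does not settle: the reference level is the mean of the linear power in the 3-7.5 kHz band, converted to dB, and a peak is a local maximum of the mean spectrum. Function names and the synthetic test tone are illustrative only.

```python
import numpy as np


def mean_power_spectrum(signal, n_intervals):
    """Mean of the power spectra of n_intervals Hanning-windowed intervals."""
    n_d = len(signal) // n_intervals
    frames = np.reshape(signal[: n_d * n_intervals], (n_intervals, n_d))
    spectra = np.abs(np.fft.rfft(frames * np.hanning(n_d), axis=1)) ** 2
    return spectra.mean(axis=0)


def estimate_pitch(signal, fs, n_intervals=256, band=(3000.0, 7500.0),
                   offset_db=5.0):
    """Lowest frequency of a peak exceeding the in-band mean level by offset_db."""
    signal = np.asarray(signal, dtype=float)
    spectrum = mean_power_spectrum(signal, n_intervals)
    freqs = np.fft.rfftfreq(len(signal) // n_intervals, d=1.0 / fs)
    in_band = (freqs >= band[0]) & (freqs <= band[1])
    if not in_band.any():
        return None
    tiny = 1e-30  # avoids log10(0) on silent input
    level_db = 10 * np.log10(spectrum + tiny)
    threshold_db = 10 * np.log10(spectrum[in_band].mean() + tiny) + offset_db
    for j in range(1, len(spectrum) - 1):
        is_local_max = spectrum[j] >= spectrum[j - 1] and spectrum[j] > spectrum[j + 1]
        if is_local_max and level_db[j] > threshold_db:
            return float(freqs[j])
    return None


# Synthetic example: 448 Hz fundamental, one harmonic, weak in-band content.
fs_demo = 16384
t_demo = np.arange(8192) / fs_demo
tone = (np.sin(2 * np.pi * 448 * t_demo) + 0.5 * np.sin(2 * np.pi * 896 * t_demo)
        + 0.1 * np.sin(2 * np.pi * 4480 * t_demo))
print(estimate_pitch(tone, fs_demo, n_intervals=8))
```

Scanning the bins from the lowest frequency upwards implements the "minimum frequency" requirement directly: the first qualifying peak is returned and higher harmonics are never examined.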
The automatic procedure for recognising the "siren cry" has then been performed; it is only applied in case of persistent cry, i.e. when the pain score due to the normalised rms value is larger than a threshold (equal to 1,85). In particular:
- a spectrogram (i.e. the graph of the sound spectral composition as time varies) has been calculated, with a time resolution of about 0,093 s;
- the spectrogram has been frequency integrated from 2 to 8 kHz, obtaining an integrated signal that is a time function with a time resolution equal to about 0,093 s;
- the mean value of the signal has been subtracted from the same;
- a flat-top window has been applied to the thus obtained zero mean signal;
- the power spectrum thereof has been calculated;
- the energy within the frequency range of 0,6-1,7 Hz has been calculated;
- the presence of the "siren cry" has been assigned to the cry signal if the energy within the frequency range of 0,6-1,7 Hz is larger than 60% of the total energy within the range of 0,4-5,3 Hz.
The pain score as illustrated in Figure 6 has been assigned to the presence of the "siren cry", i.e.:
- in the case when the siren cry is present, score(siren cry) = 2;
- in the case when the siren cry is absent, score(siren cry) = 0.
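The siren cry procedure listed above can be sketched with NumPy as follows. This is an illustration under stated assumptions, not the prototype's LabVIEW code: the spectrogram is built from non-overlapping Hanning-windowed intervals, the flat-top window uses the common five-term cosine coefficients, and all function names are hypothetical.

```python
import numpy as np

# Five-term flat-top coefficients (a commonly used set; an assumption here).
FLAT_TOP = (0.21557895, 0.41663158, 0.277263158, 0.083578947, 0.006947368)


def flat_top_window(m):
    """Flat-top window of m points built from the cosine series above."""
    x = 2 * np.pi * np.arange(m) / (m - 1)
    a0, a1, a2, a3, a4 = FLAT_TOP
    return (a0 - a1 * np.cos(x) + a2 * np.cos(2 * x)
            - a3 * np.cos(3 * x) + a4 * np.cos(4 * x))


def band_energy_envelope(signal, fs, n_intervals, band=(2000.0, 8000.0)):
    """Energy between 2 and 8 kHz in each interval, plus the time step."""
    n_d = len(signal) // n_intervals
    frames = np.reshape(np.asarray(signal, dtype=float)[: n_d * n_intervals],
                        (n_intervals, n_d))
    spectra = np.abs(np.fft.rfft(frames * np.hanning(n_d), axis=1)) ** 2
    freqs = np.fft.rfftfreq(n_d, d=1.0 / fs)
    in_band = (freqs >= band[0]) & (freqs <= band[1])
    return spectra[:, in_band].sum(axis=1), n_d / fs


def siren_cry_present(signal, fs, n_intervals=256, narrow=(0.6, 1.7),
                      wide=(0.4, 5.3), ratio=0.60):
    """True if the 0.6-1.7 Hz energy exceeds 60% of the 0.4-5.3 Hz energy."""
    envelope, dt = band_energy_envelope(signal, fs, n_intervals)
    deviation = envelope - envelope.mean()          # zero-mean signal
    windowed = deviation * flat_top_window(len(deviation))
    modulation = np.abs(np.fft.rfft(windowed)) ** 2
    mod_freqs = np.fft.rfftfreq(len(deviation), d=dt)

    def energy(lo, hi):
        return modulation[(mod_freqs >= lo) & (mod_freqs <= hi)].sum()

    wide_energy = energy(*wide)
    return bool(wide_energy > 0 and energy(*narrow) > ratio * wide_energy)


# Synthetic check: a 3 kHz tone whose amplitude is modulated at 1 Hz or 3 Hz.
fs_demo = 44100
t_demo = np.arange(2 ** 20) / fs_demo


def am_tone(mod_hz):
    return (1 + 0.8 * np.sin(2 * np.pi * mod_hz * t_demo)) * np.sin(2 * np.pi * 3000 * t_demo)


print(siren_cry_present(am_tone(1.0), fs_demo), siren_cry_present(am_tone(3.0), fs_demo))
```

With 256 intervals of about 0,093 s, the modulation spectrum reaches about 5,4 Hz with a resolution of about 0,042 Hz, which is why the comparison band can extend to 5,3 Hz.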
The total score PainScore, equal to the sum of the three (possibly weighted) scores which are calculated with respect to the three characteristics of the cry acoustic signal:
PainScore = score($p_{rms}^{norm}$) + score(F0) + score(siren cry)
has given a reliable indication of the level of pain suffered by the newborn by means of the following correspondence table, validated in literature:
Figure imgf000013_0001
Figure imgf000014_0001
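As a small illustration of the total score, the sum of the three (possibly weighted) scores can be written as below. The function name and the default unit weights are assumptions; the mapping from PainScore to a pain level is given by the correspondence table and is not reproduced in the sketch.

```python
def pain_score(rms_score, pitch_score, siren_score, weights=(1.0, 1.0, 1.0)):
    """Weighted sum of the three partial scores (unit weights by default)."""
    w_rms, w_pitch, w_siren = weights
    return w_rms * rms_score + w_pitch * pitch_score + w_siren * siren_score


# Example with hypothetical partial scores.
print(pain_score(1.9, 1.5, 2))
```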
The prototype implementation of the analysis procedure has been made by using the LabVIEW software from National Instruments.
The instrument has been successfully tested on the recordings of 57 crying newborns, whose pain level has been independently evaluated by using the DAN index, providing values in accordance with the ones of the prototype.
The preferred embodiments have been above described and some modifications of this invention have been suggested, but it should be understood that those skilled in the art can make other variations and changes, without so departing from the related scope of protection, as defined by the following claims.

Claims

1. Automatic method for measuring a baby's cry, comprising the following step:
A. having N samples p(i), for i = 0, 1,..., (N-1), of an acoustic signal p(t) representing the cry, sampled at a sampling frequency fs for a period of duration P; the method being characterised in that it assigns a score PainScore to the acoustic signal p(t) by means of a function AF of one or more acoustic parameters selected from the group comprising:
- a root-mean-square or rms value prms of the acoustic signal p(t) in the period P;
- a fundamental or pitch frequency F0 of the acoustic signal p(t), i.e. the minimum frequency at which a peak in the spectrum of the acoustic signal p(t) occurs in the period P; and
- a configuration of amplitude and frequency modulation of the acoustic signal p(t) in the period P.
2. Method according to claim 1, characterised in that the duration P is not shorter than 20 seconds.
3. Method according to claim 1 or 2, characterised in that the number N of samples p(i) is equal to a power of 2 (N = 2^A).
4. Method according to any one of the preceding claims, characterised in that the function AF depends on the rms value prms of the acoustic signal p(t) in the period P that is normalised to its maximum amplitude pmax.
5. Method according to any one of the preceding claims, characterised in that the function AF is a linear combination of one or more terms, each one of which is a function of assigning a score to a respective parameter of said one or more acoustic parameters.
6. Method according to claim 5, characterised in that the function AF is a sum of said one or more terms.
7. Method according to claim 5 or 6, characterised in that said function of score assignment is an either continuous or discrete function.
8. Method according to any one of claims 5 to 7, characterised in that said function of score assignment is a preferably monotonic not decreasing function of the respective acoustic parameter.
9. Method according to any one of claims 5 to 8, when depending on claim 4, characterised in that it comprises the following steps: B.1 determining the maximum amplitude pmax of the acoustic signal p(t) in the period P:
Figure imgf000016_0001
B.2 calculating the rms value of the acoustic signal p(t) in the period P, normalised to its maximum amplitude pmax:
Figure imgf000016_0002
B.3 assigning a first score score($p_{rms}^{norm}$) to the normalised rms value $p_{rms}^{norm}$ by means of a first function $g_1(p_{rms}^{norm})$: score($p_{rms}^{norm}$) = $g_1(p_{rms}^{norm})$ whereby the first score score($p_{rms}^{norm}$) is a term of the linear combination of the function AF giving the score PainScore to the acoustic signal p(t).
10. Method according to claim 9, characterised in that the first function $g_1(p_{rms}^{norm})$ is equal to ([1]):
$g_1(p_{rms}^{norm}) = \frac{2}{\pi}\arctan(\alpha(p_{rms}^{norm} - \beta)) + 1$
11. Method according to claim 10, characterised in that coefficients α and β are equal to ([2]): α = 100, β = 0,14
12. Method according to claim 9, characterised in that the first function $g_1(p_{rms}^{norm})$ is discrete, so that the possible values of $p_{rms}^{norm}$ are subdivided into at least two ranges, to each of which a respective value of score($p_{rms}^{norm}$) corresponds.
13. Method according to claim 12, characterised in that the first function $g_1(p_{rms}^{norm})$ is equal to:
0 for 0 ≤ $p_{rms}^{norm}$ < 0,1
$g_1(p_{rms}^{norm})$ = 1 for 0,1 ≤ $p_{rms}^{norm}$ < W
for $p_{rms}^{norm}$ ≥ W
14. Method according to any one of claims 5 to 13, when depending on claim 4, characterised in that it comprises the following steps:
C.1 subdividing the N samples p(i) into M time intervals, of duration equal to D = P/M, each one of which comprising ND samples $p_{Hk}(j)$, with ND = N/M
C.2 calculating for each interval the digitised power spectrum of the signal:
$S_{Hk}(j) = FT_{ND}\{p_{Hk}(j)\}$ for j = 0, 1,..., (ND-1) and k = 0, 1,..., (M-1)
where y(j) = FTQ{x(j)} indicates the operator FTQ transforming Q samples x(j) in the time domain to Q samples y(j) in the frequency domain;
C.3 calculating the mean spectrum $\overline{S_{Hk}}(j)$ of the M spectra:
$\overline{S_{Hk}}(j) = \frac{1}{M}\sum_{k=0}^{M-1} S_{Hk}(j)$ for j = 0, 1,..., (ND-1)
C.4 determining the mean value $S_{mean}$ of the mean spectrum $\overline{S_{Hk}}(j)$ in a first frequency range included between two respective frequency limit values F1 and F2:
$S_{mean}$
where Rf is the frequency resolution of each spectrum: Rf = fs/ND
C.5 determining the pitch F0 as the minimum frequency at which a peak of the mean power spectrum $\overline{S_{Hk}}(j)$ occurs, the peak being a relative maximum of the spectrum having value larger than a first threshold T1;
C.6 assigning a second score score(F0) to the pitch value F0 by means of a second function g2(F0): score(F0) = g2(F0) whereby the second score score(F0) is a term of the linear combination of the function AF giving the score PainScore to the acoustic signal p(t).
15. Method according to claim 14, characterised in that the first threshold T1 is equal to the sum of the mean value $S_{mean}$ of the mean spectrum $\overline{S_{Hk}}(j)$ with an offset value Δ1.
16. Method according to claim 14 or 15, characterised in that the second function g2(F0) is equal to ([3]):
$g_2(F_0) = \frac{2}{\pi}\arctan(\gamma(F_0 - \delta)) + 1$
17. Method according to claim 16, characterised in that coefficients γ and δ are equal to ([4]):
γ = 100
δ = 0,4
18. Method according to claim 14 or 15, characterised in that the second function g2(F0) is equal to ([3]):
Figure imgf000018_0001
19. Method according to claim 18, characterised in that FREF = 400 Hz.
20. Method according to any one of claims 5 to 19, when depending on claim 4, characterised in that it comprises the following steps:
C.1 subdividing the N samples p(i) into M time intervals, of duration equal to D = P/M, each one of which comprising ND samples $p_{Hk}(j)$, with ND = N/M
C.2 calculating for each interval the digitised power spectrum of the signal:
$S_{Hk}(j) = FT_{ND}\{p_{Hk}(j)\}$ for j = 0, 1,..., (ND-1) and k = 0, 1,..., (M-1)
where y(j) = FTQ{x(j)} indicates the operator FTQ transforming Q samples x(j) in the time domain to Q samples y(j) in the frequency domain;
D.1 for each digitised power spectrum $S_{Hk}(j)$, calculating the energy contribution $E_{F3\_F4}(k)$ in a second frequency range included between two respective frequency limit values F3 and F4:
Figure imgf000018_0002
for k = 0, 1,..., (M-1)
where Rf is the frequency resolution of each spectrum: Rf = fs/ND
D.2 calculating the mean value $\overline{E_{F3\_F4}}$ of the energy contribution $E_{F3\_F4}(k)$ in time:
$\overline{E_{F3\_F4}} = \frac{1}{M}\sum_{k=0}^{M-1} E_{F3\_F4}(k)$
D.3 calculating the deviation $\Delta E_{F3\_F4}(k)$ of the energy contribution $E_{F3\_F4}(k)$ in the second frequency range with respect to its mean value $\overline{E_{F3\_F4}}$:
Figure imgf000019_0001
for k = 0, 1,..., (M-1)
D.4 calculating the digitised power spectrum $V_{F3\_F4}(k)$ of the deviation $\Delta E_{F3\_F4}(k)$:
Figure imgf000019_0002
for k = 0, 1,..., (M-1)
D.5 calculating the energy contribution $V^{XTND}_{F3\_F4\_F5\_F6}$ of the spectrum $V_{F3\_F4}(k)$ in a third frequency range included between two respective frequency limit values F5 and F6:
Figure imgf000019_0003
D.6 calculating the energy contribution $V^{SHRT}_{F3\_F4\_F7\_F8}$ of the spectrum $V_{F3\_F4}(k)$ in a fourth frequency range included between two respective frequency limit values F7 and F8:
Figure imgf000019_0004
D.7 assigning a third score score(siren cry) to the difference between said two energy contributions ($V^{XTND}_{F3\_F4\_F5\_F6} - V^{SHRT}_{F3\_F4\_F7\_F8}$) by means of a third function $g_3(V^{XTND}_{F3\_F4\_F5\_F6} - V^{SHRT}_{F3\_F4\_F7\_F8})$: score(siren cry) = $g_3(V^{XTND}_{F3\_F4\_F5\_F6} - V^{SHRT}_{F3\_F4\_F7\_F8})$ whereby the third score score(siren cry) is a term of the linear combination of the function AF giving the score PainScore to the acoustic signal p(t).
21. Method according to claim 20, characterised in that the third function $g_3(V^{XTND}_{F3\_F4\_F5\_F6} - V^{SHRT}_{F3\_F4\_F7\_F8})$ is discrete, with two intervals of membership for the difference ($V^{XTND}_{F3\_F4\_F5\_F6} - V^{SHRT}_{F3\_F4\_F7\_F8}$), to each of which a respective value of score score(siren cry) corresponds, the method further comprising the following steps:
D.8 verifying if the energy contribution $V^{SHRT}_{F3\_F4\_F7\_F8}$ in the fourth frequency range is larger than a percentage threshold PT of the energy contribution $V^{XTND}_{F3\_F4\_F5\_F6}$ in the third frequency range;
D.9 in the case when the verification of step D.8 gives a positive outcome, assigning a value equal to 2 to the third score: score(siren cry) = 2
D.10 in the case when the verification of step D.8 gives a negative outcome, assigning a null value to the third score: score(siren cry) = 0
22. Method according to claim 21, characterised in that the percentage threshold PT is equal to 60%.
23. Method according to any one of claims 20 to 22, characterised in that the following step is performed between steps D.3 and D.4:
D.11 applying a window $W_{\text{flat-top}}(k)$ (for k = 0, 1,..., (M-1)) to the deviation $\Delta E_{F3\_F4}(k)$.
24. Method according to claim 23, characterised in that the window $W_{\text{flat-top}}(k)$ is a window having spectrum with flat top main lobe, or flat-top window.
25. Method according to any one of claims 20 to 24, characterised in that the third score score(siren cry) is null in the case when the rms value prms of the acoustic signal p(t) in the period P is lower than a second threshold T2.
26. Method according to any one of claims 14 to 25, characterised in that the number M of time intervals is equal to a power of 2: M = 2^B, with B ≤ A.
27. Method according to any one of claims 14 to 26, characterised in that step C.2 calculates for each interval the digitised power spectrum of the signal through a numerical Fourier transform.
28. Method according to any one of claims 14 to 27, characterised in that the following step is performed between steps C.1 and C.2:
C.7 applying a window $W_H(j)$, capable of eliminating spurious spectral characteristics caused by cutting the waveform off, to each of the M time intervals, whereby:
$p_{Hk}(j) = p(N_D \cdot k + j) \cdot W_H(j)$ for j = 0, 1,..., (ND-1) and k = 0, 1,..., (M-1)
29. Method according to claim 28, characterised in that said window is a Hanning window.
30. Apparatus for measuring a baby's cry, comprising processing means, characterised in that it is capable of performing the automatic method for measuring a baby's cry according to any one of claims 1-29.
31. Apparatus according to claim 30, characterised in that it further comprises means for detecting acoustic signals, and sampling means, capable of sampling said acoustic signals.
PCT/IT2006/000145 2005-03-11 2006-03-10 Automatic method for measuring a baby's, particularly a newborn's, cry, and related apparatus WO2006095380A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP06711448A EP1856687A1 (en) 2005-03-11 2006-03-10 Automatic method for measuring a baby's, particularly a newborn's, cry, and related apparatus
US11/817,927 US20080235030A1 (en) 2005-03-11 2006-03-10 Automatic Method For Measuring a Baby's, Particularly a Newborn's, Cry, and Related Apparatus

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
ITRM2005A000110 2005-03-11
IT000110A ITRM20050110A1 (en) 2005-03-11 2005-03-11 AUTOMATIC METHOD OF MEASURING THE CRY OF A CHILD, IN PARTICULAR OF A NEWBORN, AND ITS APPARATUS.

Publications (1)

Publication Number Publication Date
WO2006095380A1 true WO2006095380A1 (en) 2006-09-14

Family

ID=36609287

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IT2006/000145 WO2006095380A1 (en) 2005-03-11 2006-03-10 Automatic method for measuring a baby's, particularly a newborn's, cry, and related apparatus

Country Status (4)

Country Link
US (1) US20080235030A1 (en)
EP (1) EP1856687A1 (en)
IT (1) ITRM20050110A1 (en)
WO (1) WO2006095380A1 (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014036263A1 (en) * 2012-08-29 2014-03-06 Brown University An accurate analysis tool and method for the quantitative acoustic assessment of infant cry
US10827973B1 (en) * 2015-06-30 2020-11-10 University Of South Florida Machine-based infants pain assessment tool
US11631280B2 (en) * 2015-06-30 2023-04-18 University Of South Florida System and method for multimodal spatiotemporal pain assessment
GB2552067A (en) 2016-05-24 2018-01-10 Graco Children's Products Inc Systems and methods for autonomously soothing babies
US11202604B2 (en) 2018-04-19 2021-12-21 University Of South Florida Comprehensive and context-sensitive neonatal pain assessment system and methods using multiple modalities
WO2019204700A1 (en) 2018-04-19 2019-10-24 University Of South Florida Neonatal pain identification from neonatal facial expressions

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020135485A1 (en) * 2001-03-22 2002-09-26 Meiji University Legal Person System and method for analyzing baby cries

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
C. BELLIENI, R. SISTO, D. CORDELLI AND G. BUONOCORE: "Cry Features Reflect Pain Intensity in Term Newborns: An Alarm Threshold", PEDIATRIC RESEARCH, vol. 55, no. 1, 2004, U.S.A., pages 142 - 146, XP002388813 *

Also Published As

Publication number Publication date
ITRM20050110A1 (en) 2006-09-12
US20080235030A1 (en) 2008-09-25
EP1856687A1 (en) 2007-11-21

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2006711448

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE

NENP Non-entry into the national phase

Ref country code: RU

WWW Wipo information: withdrawn in national office

Country of ref document: RU

WWE Wipo information: entry into national phase

Ref document number: 11817927

Country of ref document: US

WWP Wipo information: published in national office

Ref document number: 2006711448

Country of ref document: EP