CN115153473A

CN115153473A - Non-contact heart rate detection method based on multivariate singular spectrum analysis

Info

Publication number: CN115153473A
Application number: CN202210655752.6A
Authority: CN
Inventors: 宋仁成; 孙晓雪; 成娟; 李畅; 刘羽; 陈勋
Original assignee: Hefei University of Technology
Current assignee: Hefei University of Technology
Priority date: 2022-06-10
Filing date: 2022-06-10
Publication date: 2022-10-11
Anticipated expiration: 2042-06-10
Also published as: CN115153473B

Abstract

The invention discloses a non-contact heart rate detection method based on multivariate singular spectrum analysis, which comprises the following steps: 1. firstly, acquiring a video image and determining a face region of interest; 2. screening four optimal sub-areas from the region of interest, extracting a chrominance signal of each sub-area, and taking a motion track of a nose tip part as a motion signal; 3. self-adaptive filtering removes the motion artifact in the chrominance signal, and the motion artifact is used as an input signal; 4. adopting multivariate singular spectrum analysis to process input signals, and screening out pulse signals from the input signals; 5. extracting the heart rate from the pulse signal by adopting a frequency spectrum analysis method; 6. and finally, finding out abnormal heart rate values according to the heart rate continuity and replacing the abnormal heart rate values with correct heart rate values. The method can simultaneously remove the influence of illumination change and the interference of head movement, thereby improving the accuracy of non-contact video heart rate detection.

Description

Non-contact heart rate detection method based on multivariate singular spectrum analysis

Technical Field

The invention belongs to the technical field of biomedical signal processing, and particularly relates to a non-contact heart rate detection method based on multivariate singular spectrum analysis.

Background

The current heart rate measuring methods are mainly divided into a contact type and a non-contact type. The contact detection method monitors the blood volume pulse by conventional measuring instruments such as electrocardiographs, pulse oximeters, etc. However, the skin of the subject needs to be directly contacted during measurement, so that the human body activity is limited to a certain extent, and the subject may be uncomfortable due to long-term measurement, and the method is not suitable for special people such as infants, burn patients and the like. The non-contact measurement method, also called remote photoplethysmography (rPPG), uses a camera to capture the change in skin color due to blood flow, thereby extracting the heart rate. The defect of traditional contact type heart rate monitoring can be overcome, and the method has the characteristics of no wound, portability, easiness in implementation and the like.

There are two major challenges in rPPG technology-illumination variation and motion noise. At present, most research scenes are under indoor light sources or natural light sources, and a subject is kept still under the condition of small illumination change. Rencheng Song et al adopts an ensemble empirical mode decomposition to decompose the green channel of the screened optimal region of interest, and then extracts a common information method (EEMD-MCCA), which can effectively reduce the influence of ambient light variation on video heart rate extraction. Whereas EEMD-MCCA is a two-step process, the use of EEMD is primarily to construct the MCCA's multi-channel input set. But the number of IMFs obtained in each ROI is different and needs to be selected and filled to keep the number of channels per set the same. The selection process of IMF is heuristic and may lose useful terms due to modal mixing. This will reduce the effectiveness of the MCCA algorithm. In an actual scene, a subject is difficult to avoid motion, so that it is also a challenge to eliminate the interference of motion noise under the condition of light variation, so as to obtain an accurate heart rate.

Disclosure of Invention

The invention provides a non-contact heart rate detection method based on multivariate singular spectrum analysis for solving the defects of the technology, so as to remove the influence of ambient light change and the interference of motion noise, thereby improving the accuracy of non-contact video heart rate detection.

The invention adopts the following technical scheme for solving the technical problems:

the invention relates to a non-contact video heart rate detection method based on multivariate singular spectrum decomposition, which is characterized by comprising the following steps of:

step 1: acquiring T frame video data of a subject, dividing the T frame video data into L data in an overlapped manner, wherein each data comprises N frame video images;

and 2, step: determining a face region of interest of the first N frames of video images by adopting a face detection and face tracking method, dividing the face region of interest in each frame of video image into Q sub-regions, wherein L is more than or equal to 1 and less than or equal to L;

and step 3: calculating the pixel mean value of each sub-region in the l N frame video image frame by frame, calculating the illumination intensity, illumination change and signal-to-noise ratio of each sub-region as the judged quality indexes according to the pixel mean value of each sub-region, selecting P optimal sub-regions from Q sub-regions according to the quality indexes of each sub-region, extracting RGB channel mean value signals from the P optimal sub-regions and converting the RGB channel mean value signals into HSV signals, extracting chrominance signals in the HSV signals and recording the chrominance signals as H _l ＝{h _l,1 ,h _l,2 ,...,h _l,p ,...,h _l,P }，h _l,p The chrominance signal of the P-th optimal subregion of the l N-frame video image is more than 1 and more than P and more than P;

and 4, step 4: detecting human face characteristic points of the first N frames of video images frame by adopting an Openface method, and recording motion track signals of the nose tip position characteristic points of the first N frames of video images as V _l ＝{v _l,1 ,v _l,2 ,...,v _l,j ,...,v _l,N }; wherein v is _l,j The position coordinates of the nose tip position feature points in the ith frame video image are obtained;

and 5: using the first motion track signal V _l Chrominance signal h for the ith optimum subregion of the l _l,p Performing adaptive filtering LMS processing to remove the first chrominance signal h _l,p To obtain the first p-th filtered chrominance signal

And is

Representing pixel values of the P chrominance signals filtered in the jth frame of video image to obtain the P chrominance signals filtered in the jth frame

And as the ith input signal data set;

step 6: using singular spectrum decomposition method to make said I input signal data set X _l Decomposition into several components:

step 6.1: setting the length of a window to be M, wherein M is less than N/2, defining a parameter K = N-M +1, and filtering the ith p-th chrominance signal according to the length M of the window and the parameter K

With formation dimension of M × KFirst p track matrix

Then splicing the first P track matrixes to form a Hankel track matrix with dimension of PM multiplied by K

Step 6.2: calculating the first trace matrix Y _l Y _l ^T Characteristic value of (D) is noted as lambda _l,1 ,...,λ _l,i ,...,λ _l,PM Wherein λ is _l,i Represents the l trace matrix Y _l Y _l ^T The ith feature value of (a); calculating the first trace matrix Y _l Y _l ^T Characteristic value λ of _l,1 ,...,λ _l,i ,...,λ _l,PM Is denoted as U _l,1 ,...,U _l,i ,...,U _l,PM Wherein, U _l,i Representing a characteristic value λ _l,i A corresponding orthonormal vector; calculate R = rank (Y) _l ) R represents a matrix Y _l Rank of (1), calculating principal component

The first hankel locus matrix Y _l Decomposition into Y _l ＝Y _l,1 +...+Y _l,i +...+Y _l,R Wherein, Y _l,i Represents the ith trace matrix Y _l Y _l ^T Ith characteristic value lambda _i A corresponding decomposition matrix, and

step 6.3: divide the first index set { 1. -, R } into R disjoint subsets I _l,i ＝{i},i＝1,2,...,R，I _l,i Represents the ith subset of the first batch; for the Hankel locus matrix Y according to the subset _l Grouping to obtain the first R group matrix { Y } _l,i |1≤i≤R}，Y _l,i Obtained by step 6.2;

step 6.4: to pairFirst R group matrix Y _l,i Carrying out diagonal line averaging to obtain the first reconstructed signal

Wherein the content of the first and second substances,

denotes the ith grouping matrix Y _l,i Carrying out diagonal line average on the processed one-dimensional signals;

and 7: the first reconstructed signal

Grouping, each group containing four signals, the first group of signals

As a candidate pulse set, calculating the ith candidate heart rate signal of the first group

The ratio of the energy of the main frequency and the second harmonic frequency, thereby selecting the candidate heart rate signal with the maximum energy ratio as the pulse signal, and converting the pulse signal into a frequency domain form by utilizing fast Fourier transform, thereby obtaining the main frequency f of the pulse signal _main Calculating the average heart rate value of the subject of the first N frames of video images to obtain a first heart rate value HR _l ＝f _main ×60；

And step 8: obtaining L heart rate value sets { HR ] of L N frame video images according to the processes from the step 2 to the step 7 _l |l＝1,2,…,L}；

And step 9: calculating heart rate value HR of the first N frames of video images _l Heart rate values HR of N frames of video images of k-th part of the same subject _k Whether or not the absolute error therebetween is smaller than the set threshold value Th ₁ If the value is less than the preset value, the counting value is S +1; otherwise, keeping the count value S unchanged, wherein the initial value of the count value S is 0;

step 10: judging whether the count value S is larger than the set threshold Th ₂ If, ifIf it is greater than the first heart rate value HR _l Taking the heart rate as an effective value and recording the effective value into a heart rate candidate set HR, otherwise, judging the next heart rate value; so as to obtain the final heart rate candidate set HR, and calculating the average value thereof as the reference value HR _ref ；

Step 11: according to the reference value HR _ref And L heart rate value sets { HR } _l } _{(l＝1,2,...,L)} Calculating the first heart rate value HR _l And reference HR _ref Absolute error between HR and HR _error,l If the absolute error HR _error,l Less than or equal to a predetermined threshold value Th ₁ The ith heart rate value HR _l As target HR _T Otherwise, the value is regarded as an abnormal value;

if the first heart rate value HR _l If the value is abnormal, the signal is decomposed from the rest

To calculate the closest reference value HR _ref Heart rate value HR of _n,new As target HR _T 。

Compared with the prior art, the invention has the beneficial effects that:

1. the invention selects the chrominance signal: most studies have chosen a green channel signal, which has the advantage of containing a greater heart rate signal intensity compared to the red and blue channels. However, in the case of varying light, the color space of the green channel depends not only on the color of the object, but also on the intensity of the reflected light from the surface. In contrast, the chrominance signal is not dependent on luminance, which means that the chrominance signal is more resistant to changes in ambient light than the green signal, thereby reducing the effect of ambient light changes on the extraction heart rate.

2. The invention adopts a self-adaptive filtering method to remove motion noise: the Openface method is adopted to detect the human face characteristic points frame by frame, the motion trail of the nose tip position is used as a motion signal, the self-adaptive filtering is utilized to effectively remove the motion artifact caused by head motion in the chrominance signal, and the accuracy of heart rate measurement is improved.

3. The invention divides the face interesting area into a plurality of sub-areas and carries out optimal screening: the information of the heart beats contained in different facial interesting regions is the same, compared with the method for extracting the heart rate signal source from a single interesting region, the method emphasizes common signal source components contained in a plurality of interesting regions, and the joint analysis of the plurality of interesting regions can extract the heart rate more accurately.

4. The invention adopts a multivariate singular spectrum decomposition method to extract the heart rate: the multivariate singular spectrum decomposition avoids the mode aliasing problem of the traditional empirical mode decomposition method, can directly decompose a plurality of signals, considers the internal correlation among the signals and can effectively reduce the heart rate signal distortion rate.

Drawings

FIG. 1 is a flow chart of the method of the present invention;

FIG. 2a is a schematic view of a facial region of interest in accordance with the present invention;

FIG. 2b is a schematic diagram of the present invention for screening the optimal region of interest of the face;

FIG. 3a is a candidate heart rate signal 1 obtained by screening multivariate singular spectrum analysis according to the present invention;

fig. 3b is a candidate heart rate signal 2 obtained by screening multivariate singular spectrum analysis according to the present invention;

FIG. 3c is a candidate heart rate signal 3 obtained by filtering the multivariate singular spectrum analysis according to the present invention;

FIG. 3d is a candidate heart rate signal 4 obtained by filtering the multivariate singular spectrum analysis according to the present invention;

FIG. 4a is a frequency spectrum diagram of a candidate heart rate signal 1 obtained by screening multivariate singular spectrum analysis according to the present invention;

FIG. 4b is a frequency spectrum diagram of a candidate heart rate signal 2 obtained by filtering the multivariate singular spectrum analysis according to the present invention;

fig. 4c is a frequency spectrum diagram of a candidate heart rate signal 3 obtained by screening multivariate singular spectrum analysis according to the present invention;

FIG. 4d is a graph of a spectrum of a candidate heart rate signal 4 obtained by screening a multivariate singular spectrum analysis according to the present invention;

FIG. 5a is a flow chart of heart rate calculation according to the present invention;

FIG. 5b is a flowchart of the method for screening outliers of the present invention.

Detailed Description

In the embodiment, a non-contact video heart rate detection method based on multivariate singular spectrum decomposition is disclosed, as shown in fig. 1, a face video image sequence is obtained first, and a face region of interest is determined; then dividing the region of interest of the face into a plurality of sub-regions, screening an optimal sub-region according to the illumination intensity, illumination change and signal-to-noise ratio, extracting RGB signals of the optimal sub-region, converting the RGB signals into HSV signals, and taking the motion trail of the nose tip part as motion signals; carrying out self-adaptive filtering on the chrominance signal and the motion signal, and taking the filtered chrominance signal as an input signal of each subregion; then, processing input signals of all sub-regions by adopting multivariate singular spectrum decomposition and obtaining decomposed signal data sets of all sub-regions; screening a first decomposition signal of each subarea, recording the first decomposition signal as a candidate heart rate signal, then calculating the energy ratio of main frequency and second harmonic frequency of all candidate heart rate signals, screening the candidate heart rate signal with the largest energy ratio as a pulse signal, calculating a heart rate value according to the main frequency value of the signal, and then selecting an abnormal value according to heart rate continuity and replacing the abnormal value with a correct heart rate value. Specifically, the method comprises the following steps:

step 1: the method comprises the steps of acquiring 60s T frames of video data of a subject, dividing the video data into L parts, wherein L =7, and each part of the data comprises N frames of video images.

Step 2: for the l N frame video image, a rectangular frame was obtained using the Viola-Jones face Detector scaled to 60% to remove non-skin areas such as background and hair. The entire ROI region is then divided into Q =4 × 4 block sub-regions, and the four vertices of each block sub-region are tracked along the video frame using the Kanade-Lucas-Tomasi algorithm as shown in fig. 2 a.

And 3, step 3: calculating the pixel mean value of each sub-region in the l N-th frame of video image frame by frame, calculating the illumination intensity, illumination change and signal-to-noise ratio of each sub-region as the quality index of judgment according to the pixel mean value of each sub-region, and selecting P =4 optimal sub-regions from Q sub-regions, as shown in FIG. 2 b. And extracting RGB channel mean value signals from the P optimal sub-regions and converting the RGB channel mean value signals into HSV signals, wherein the HSV signals are compared with the RGB image domain to the HSV image domainCompared with the conversion, the conversion speed between the signals is faster, and the real-time application can be realized. Extracting a chrominance signal in the HSV signal and recording the chrominance signal as H _l ＝{h _l,1 ,h _l,2 ,...,h _l,p ,...,h _l,P }，h _l,p The chroma signal of the P-th optimal subarea of the l N-frame video image is 1 < P < P. The chrominance signal is selected to replace the green signal because the chrominance signal is more resistant to variations in light. In the case of varying light, the color space of the green channel depends not only on the color of the object, but also on the intensity of the reflected light from the surface. In contrast, chrominance signals are not dependent on luminance. This means that the tint signal is more tolerant of changes in ambient light.

and 5: using the first motion track signal V _l Chrominance signal h for the ith best subregion _l,p Performing adaptive filtering LMS processing to remove the first chrominance signal h _l,p Due to motion artifacts of the head. Under the condition that the adaptive filtering does not have prior statistical knowledge about the information to be extracted, the processing parameters are continuously updated recursively in the observation process by directly utilizing observation data according to certain criteria, and the change of statistical properties is automatically tracked so as to gradually approach to a certain optimal processing result. Such a processing method is more suitable for the requirements of non-stationary situations and has also proven to be an effective noise cancellation method. Obtaining the first p-th chrominance signal after self-adaptive filtering

And is provided with

Representing pixel values of the P-th chrominance signal after being filtered in the ith frame video image to obtain the P-th chrominance signals after being filtered

And as the ith input signal data set;

Generating the ith trace matrix with dimension of M × K

Step 6.2: calculating the first trace matrix Y _l Y _l ^T Characteristic value of (D), noted as λ _l,1 ,...,λ _l,i ,...,λ _l,PM Wherein λ is _l,i Represents the l trace matrix Y _l Y _l ^T The ith eigenvalue of (a); calculating the first trace matrix Y _l Y _l ^T Characteristic value λ of _l,1 ,...,λ _l,i ,...,λ _l,PM Is denoted as U _l,1 ,...,U _l,i ,...,U _l,PM Wherein, U _l,i Representing the characteristic value lambda _l,i The corresponding orthonormal vector. Calculate R = rank (Y) _l ) R represents a matrix Y _l Rank of (2), calculating principal component

The first hankel locus matrix Y _l Decomposition into Y _l ＝Y _l,1 +...+Y _l,i +...+Y _l,R Wherein Y is _l,i Represents the ith trace matrix Y _l Y _l ^T Ith eigenvalue lambda _i A corresponding decomposition matrix, and

step 6.3: partition the I < th > index set { 1., R } into R disjoint subsets I _l,i ＝{i},i＝1,2,...,R，I _l,i The ith subset is indicated. According to the subset, for the Hankel locus matrix Y _l Grouping to obtain the first R group matrix { Y } _l,i |1≤i≤R}，Y _l,i Obtained by step 6.2;

step 6.4: to convert the grouped matrices into time series, the first R is grouped into matrix Y _l,i Carrying out diagonal line averaging to obtain the first reconstructed signal

Wherein the content of the first and second substances,

denotes the ith grouping matrix Y _l,i Carrying out diagonal line average on the processed one-dimensional signals; in this example, the input signal is decomposed into components using the multivariate singular spectral decomposition method described above. The method can fully utilize the relevance of input signals of different regions and overcome the influence of mode aliasing of the traditional empirical mode decomposition method.

And 7: the first reconstructed signal

Grouping, each group containing four signals, the first group of signals

As the candidate pulse sets, as shown in FIG. 3a, FIG. 3b, FIG. 3c, FIG. 3d, in this exampleFour decomposed signals are extracted to form a candidate pulse set. Calculating the ith candidate heart rate signal of the first group

And the energy ratio of the second harmonic frequency, thereby selecting the candidate heart rate signal with the largest energy ratio as the pulse signal. As shown in fig. 4a, 4b, 4c, and 4d, each graph corresponds to a single candidate heart rate signal frequency spectrogram, and the black circles correspond to peak frequency points, that is, main frequency points. Converting the pulse signal into a frequency domain form by using fast Fourier transform, thereby obtaining a main frequency f of the pulse signal _main Calculating the average heart rate value of the subject of the first N frames of video images to obtain a first heart rate value HR _l ＝f _main X 60. The first set of decomposed signals is chosen as candidate pulses and because most of the noise in the chrominance channels has been removed in the previous operation step. Thus, the heartbeat signal is the most significant and correlated oscillating signal in the chrominance channel.

And 8: as shown in FIG. 5a, according to the process from step 2 to step 7, L heart rate value sets { HR } of L video images of N frames are obtained _l |l＝1,2,…,L}；

And step 9: calculating heart rate value HR of the first N frames of video images _l Heart rate values HR of N frames of video images of k-th part of the same subject _k Absolute error therebetween, is less than the set threshold Th ₁ If the value is less than the preset value, the counting value is S +1; otherwise, keeping the count value S unchanged, wherein the initial value of the count value S is 0;

step 10: judging whether the value S is larger than the set threshold Th ₂ If yes, then the current HR is set _l Considered as valid value and counted in the heart rate candidate set HR, otherwise, the current HR _l Not counting the heart rate candidate set HR, and finally determining the reference value HR _ref Is the average value of the heart rate candidate set HR;

step 11: obtaining a reference value HR _ref And L heart rate value sets { HR _l } _{(l＝1,2,...,L)} Calculating the current HR _l And reference HR _ref Absolute error HR therebetween _error,l If the absolute error HR _error,l Less than or equal to a predetermined threshold value Th ₁ Then current HR _l Considered as the target HR _T Otherwise, the signal is regarded as an abnormal value, and if the signal is an abnormal value, the residual decomposed signal is used as a reference signal

Calculating the closest reference value HR _ref Heart rate value HR of _n,new As target HR _T . As shown in FIG. 5b, the target HR from which the abnormal value is removed is obtained according to the heart rate continuity _T 。

In order to verify the robustness of the video heart rate algorithm provided by the invention, the invention adopts a public data set COHFACE and a self-acquisition data set BSIPL to perform algorithm verification. In the embodiment, the experimental result is analyzed by comparing the error between the real heart rate of the two data set acquisition videos and the heart rate measured by the algorithm to be tested, and the robustness of the algorithm is evaluated by adopting four evaluation indexes, namely Root Mean Square Error (RMSE), mean Absolute Error (MAE), standard deviation (sd) and correlation coefficient. The method proposed by the present invention was compared with the EEMD-MCCA method, and the results are shown in tables 1 and 2.

TABLE 1 analysis of heart rate measurements by two methods of CoHFACE

Evaluation index	EEMD-MCCA	MSSA-H
			Root mean square error (bpm)	4.81	2.19
Mean absolute error (bpm)	2.08	1.25
			Standard deviation (bpm)	4.33	1.81
Correlation coefficient	0.91	0.98

TABLE 2 analysis of heart rate measurements by BSIPL two methods

Evaluation index	EEMD-MCCA	MSSA-H
			Root mean square error (bpm)	6.14	4.00
Mean absolute error (bpm)	2.95	1.81
			Standard deviation (bpm)	5.39	3.56
Correlation coefficient	0.86	0.94

As can be seen from Table 1, the MSSA-H method has significant improvements in the three indexes of root mean square error, mean absolute error and standard deviation, which are 2.62bpm,0.83bpm and 2.52bpm respectively, and the correlation coefficient is also improved from 0.91 to 0.98, and the results in Table 2 show that the four indexes of MSSA-H are all superior to EEMD-MCCA. The obtained results show that the heart rate value calculated by the method is closer to the heart rate true value, and has better robustness compared with EEMD-MCCA.

In conclusion, the non-contact heart rate detection method based on multivariate singular spectrum analysis, which is provided by the invention, can accurately extract the human heart rate from the video and acquire the video heart rate detection result, and has good robustness.

Claims

1. A non-contact video heart rate detection method based on multivariate singular spectrum decomposition is characterized by comprising the following steps:

step 2: determining a face interested area of the first N frames of video images by adopting a face detection and face tracking method, and dividing the face interested area in each frame of video image into Q sub-areas, wherein L is more than or equal to 1 and less than or equal to L;

and step 3: calculating the pixel mean value of each sub-region in the l N frame video image frame by frame, calculating the illumination intensity, illumination change and signal-to-noise ratio of each sub-region as the judged quality indexes according to the pixel mean value of each sub-region, selecting P optimal sub-regions from Q sub-regions according to the quality indexes of each sub-region, extracting RGB channel mean value signals from the P optimal sub-regions, converting the RGB channel mean value signals into HSV signals, and extracting the HSV signalsChrominance signal, denoted as H _l ＝{h _l,1 ,h _l,2 ,...,h _l,p ,...,h _l,P }，h _l,p The chrominance signal of the P-th optimal subregion of the l N-frame video image is more than 1 and more than P and more than P;

And is provided with

And as the ith input signal data set;

Generating the ith trace matrix with dimension of M × K

Step 6.2: calculating the first trace matrix Y _l Y _l ^T Characteristic value of (D), noted as λ _l,1 ,...,λ _l,i ,...,λ _l,PM Wherein λ is _l,i Represents the ith trace matrix Y _l Y _l ^T The ith feature value of (a); calculating the first trace matrix Y _l Y _l ^T Characteristic value λ of _l,1 ,...,λ _l,i ,...,λ _l,PM Is denoted as U _l,1 ,...,U _l,i ,...,U _l,PM Wherein, U _l,i Representing a characteristic value λ _l,i A corresponding orthonormal vector; calculate R = rank (Y) _l ) R represents a matrix Y _l Rank of (1), calculating principal component

The first hankel locus matrix Y _l Decomposition into Y _l ＝Y _l,1 +...+Y _l,i +...+Y _l,R Wherein Y is _l,i Represents the l trace matrix Y _l Y _l ^T Ith eigenvalue lambda _i A corresponding decomposition matrix, and

step 6.3: divide the first index set { 1. -, R } into R disjoint subsets I _l,i ＝{i},i＝1,2,...,R，I _l,i Represents the ith subset of the first batch; for the Hankel locus matrix Y according to the subset _l Is divided intoObtaining the first R group matrix { Y } _l,i |1≤i≤R}，Y _l,i Obtained by step 6.2;

step 6.4: for the first R set of matrix Y _l,i Carrying out diagonal line averaging to obtain the first reconstructed signal

Wherein the content of the first and second substances,

and 7: the first reconstructed signal

Grouping each group containing four signals, and dividing the first group into four groups

And 8: for the process according to the steps 2 to 7, obtaining L heart rate value sets { HR ] of L N frame video images _l |l＝1,2,…,L}；

And step 9: calculating the heart rate value HR of the first N frames of video images _l Heart rate values HR of N frames of video images of k-th part of the same subject _k Whether or not the absolute error therebetween is smaller than the set threshold value Th ₁ If the ratio is less than the above range,the counting value S +1 is set; otherwise, keeping the count value S unchanged, wherein the initial value of the count value S is 0;

step 10: judging whether the count value S is larger than the set threshold Th ₂ If yes, then the first heart rate value HR is set _l Taking the heart rate as an effective value and recording the effective value into a heart rate candidate set HR, otherwise, judging the next heart rate value; so as to obtain the final heart rate candidate set HR, and calculating the average value thereof as the reference value HR _ref ；

Step 11: according to the reference value HR _ref And L heart rate value sets { HR _l } _{(l＝1,2,...,L)} Calculating the first heart rate value HR _l And reference HR _ref Absolute error HR therebetween _error,l If the absolute error HR is large _error,l Less than or equal to a predetermined threshold value Th ₁ The ith heart rate value HR _l As target HR _T Otherwise, the value is regarded as an abnormal value;

To calculate the closest reference value HR _ref Heart rate value HR of _n,new As a target HR _T 。