CN104573659A - Driver call-making and call-answering monitoring method based on svm - Google Patents

Driver call-making and call-answering monitoring method based on svm Download PDF

Info

Publication number
CN104573659A
CN104573659A CN201510013139.4A CN201510013139A CN104573659A CN 104573659 A CN104573659 A CN 104573659A CN 201510013139 A CN201510013139 A CN 201510013139A CN 104573659 A CN104573659 A CN 104573659A
Authority
CN
China
Prior art keywords
hand
driver
surveyed area
image
rect
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510013139.4A
Other languages
Chinese (zh)
Other versions
CN104573659B (en
Inventor
张卡
何佳
曹昌龙
尼秀明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ANHUI QINGXIN INTERNET INFORMATION TECHNOLOGY Co Ltd
Original Assignee
ANHUI QINGXIN INTERNET INFORMATION TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ANHUI QINGXIN INTERNET INFORMATION TECHNOLOGY Co Ltd filed Critical ANHUI QINGXIN INTERNET INFORMATION TECHNOLOGY Co Ltd
Priority to CN201510013139.4A priority Critical patent/CN104573659B/en
Publication of CN104573659A publication Critical patent/CN104573659A/en
Application granted granted Critical
Publication of CN104573659B publication Critical patent/CN104573659B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/50Context or environment of the image
    • G06V20/59Context or environment of the image inside of a vehicle, e.g. relating to seat occupancy, driver state or inner lighting conditions
    • G06V20/597Recognising the driver's state or behaviour, e.g. attention or drowsiness
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/107Static hand or arm
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168Feature extraction; Face representation
    • G06V40/171Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships

Abstract

The invention relates to a driver call-making and call-answering monitoring method based on an svm. The monitoring method comprises the following sequential steps that a face detection classifier and a hand detection classifier are established, wherein hand images obtained in the calling making process and the call answering process are used as positive training samples; driving state images of a driver are collected in real time; proper image subregions are selected as effective detection regions for hand movement detection and call-making and call-answering detection; whether the driver prepares for making a call or answering a call is judged in real time according to the images in the effective detection regions; according to the images in the effective detection regions, the time when the hand of the driver stays beside the ear is monitored carefully, and whether the driver is making a call or answering a call is judged according to the duration of stay; a real-time video of the call-making or call-answering process of the driver is sent to a remote server, and a command sent by the remote server is received.

Description

A kind of driver based on svm plays phone-monitoring method
Technical field
The present invention relates to safe driving technical field, be specifically related to a kind of driver based on svm and play phone-monitoring method.
Background technology
Along with the quick growth of automobile pollution, people are enjoying the facility and simultaneously efficiently of traffic, and taking place frequently also along with all kinds of traffic hazard, causes huge personnel and economic loss.Cause the factor of traffic hazard a lot, it is one of important inducement that driver drives to play in way phone.Due to cannot the driving behavior video of Real-time Obtaining driver, the supervision department of some passenger and freight enterprises using deduction afterwards as the foundation dividing responsibility, can only cannot carry out monitoring in advance and prevention.Therefore, real-time monitoring driving person plays phone behavior, feeds back to supervision department of transport enterprise in time and prevents, and for avoiding major traffic accidents, plays a part can not be substituted.
Support vector machine (svm) is a kind of supervised learning algorithm, it shows many distinctive advantages in solution small sample, non-linear and high dimensional pattern identification, the method is that the VC being based upon Statistical Learning Theory ties up on theoretical and Structural risk minization basis, between the complicacy (namely to the study precision of specific training sample) and learning ability (namely identifying the ability of arbitrary sample error-free) of model, best compromise is sought, to obtaining best Generalization Ability (or claiming generalization ability) according to limited sample information.
At present, driver is played to the monitoring of phone behavior, conventional technical method has following several:
(1) monitor based on mobile phone signal, these class methods, by placing a mobile phone signal detecting device in pilothouse, according to signal fluctuation in various degree, judge whether that existence plays phone behavior.These class methods can reach the effect that monitoring plays phone on goods stock.But on passenger stock, there is more interference, as the mobile phone signal interference etc. of passenger on passenger vehicle, there is serious undetected and flase drop, comprehensively monitoring driving person cannot be realized in real time and play phone behavior.
(2) monitor based on video image, whether these class methods are put on the steering wheel by real-time monitoring driving person's both hands, once there is certain hand departure direction dish, are namely considered to be in and play phone.There is serious flase drop in these class methods, because a lot of driver exists the custom of a hand steered bearing circle, therefore, the method uses and has larger problem in actual environment.
Summary of the invention
A kind of driver based on svm is the object of the present invention is to provide to play phone-monitoring method, this method for supervising is based on video image processing technology, judge to play phone behavior by the state of real-time monitoring driving person's ear region hand, there is monitoring accuracy high, undetected flase drop is less, speed is fast, the features such as cost is low.
Technical scheme of the present invention is:
Driver based on svm plays a phone-monitoring method, and this method for supervising comprises the step of following order:
(1) set up Face datection sorter and to play phone time hand images detect sorter for training the hand of positive sample;
(2) the driving condition image of Real-time Collection driver;
(3) suitable image region is selected to detect as hand exercise and play effective surveyed area of phone detection;
(4) according to effective surveyed area image, whether real-time judge driver exists the warming-up exercise playing phone; If so, step (5) is then performed; If not, then execution step (2) is returned;
(5) according to effective surveyed area image, monitor that in detail the hand of driver rests on the time on ear side, and judge whether driver plays phone according to the time length that the hand of driver rests on ear side; If so, step (6) is then performed; If not, then execution step (2) is returned;
(6) real-time video driver being played phone is sent to remote server, and receives the order of remote server transmission.
In step (3), the suitable image region of described selection detects and plays as hand exercise effective surveyed area that phone detects, and specifically comprises the step of following order:
(31) adopt harr characteristic sum adaboost sorting algorithm, detect face location;
(32) according to five, the three front yard layout rule of face, the position that coarse positioning is left and right two;
(33) position of eyes is accurately oriented;
(34) suitable effective image subregion is selected.
In step (4), described according to effective surveyed area image, whether real-time judge driver exists the warming-up exercise playing phone, specifically comprises the step of following order:
(41) in left and right effective surveyed area, even point sampling is carried out respectively;
(42) accurate tracking of sampled point is carried out;
(43) movable information of correct tracking sampling point is obtained;
(44) obtain the statistical nature of effective surveyed area, the statistical nature of described surveyed area comprises the mean motion intensity avem of left effectively surveyed area l, right effectively surveyed area mean motion intensity avem r, left effectively surveyed area motion range R lwith the motion range R of the effective surveyed area in the right side r;
(45) judge whether to exist the action of raising one's hand near ear.
In step (5), described according to effective surveyed area image, monitor that the hand of driver rests on the time on ear side in detail, and judge whether driver plays phone according to the time length that the hand of driver rests on ear side; Specifically comprise the step of following order:
(51) using effective surveyed area image as present image, calculate the Gradient Features of present image, and gradient direction angle corrected, the scope of gradient direction is limited to [0 π];
(52) structure one is wide is W, the high hough transform window for H, and the value of W and H equals the wide high level of the training sample of hand trainer respectively, and carries out subwindow division to hough transform window;
(53) hough transform window is slided with a fixed step size in effective surveyed area, judge whether the region that hough transform window covers exists hand images;
(54) judge whether hough transform window slides into ending place of present image; If so, step (55) is then performed; If not, then hough transform window sliding is to next position, again performs step (53);
(55) the wide as W of present image is set image, height is H image, carry out change of scale to present image, carrying out the wide of the image after change of scale is σ W image, height is σ H image, the image after carrying out change of scale is set to present image; Wherein, 0< σ <1;
(56) judge whether the size of present image is less than minimum threshold; If the size of present image is less than minimum threshold, then perform step (57); If the size of present image is greater than minimum threshold, then returns and perform step (51);
(57) according to number and the position of the candidate target in the list of candidate region, the position of hand in comprehensive descision present frame;
(58) judge whether accumulation frame number reaches unit frame number; If so, step (59) is then performed; If not, then execution step (51) is returned;
(59), in statistical unit frame number, there is the frame number proportion of hand images; And judge whether driver is according to the frame number proportion of hand images and play telephone state.
In step (34), the effective image subregion that described selection is suitable, the following formula of concrete employing realizes:
rect left . x = point l . x - rect . width * 0.6 rect left . y = point l . y - rect . height * 0.16 rect left . width = rect . width * 0.6 rect left . height = rect . height * 1.16
rect right . x = point r . x rect right . y = point r . y - rect . height * 0.16 rect right . width = rect . width * 0.6 rect right . height = rect . height * 1.16
Wherein, rect left, rect rightsub-rectangular areas near the left and right ear representing selection respectively, rect represents the face location rectangular area detected, point l, point rrepresent the left hand edge point of left eye and the right hand edge point of right eye respectively.
In step (43), the movable information of the correct tracking sampling point of described acquisition, the following formula of concrete employing realizes:
M ( i ) = sqrt ( D x 2 + D y 2 ) &theta; ( i ) = arctan ( D y D x )
D x = xpoint i . x - ypoint i . x D y = xpoint i . y - ypoint i . y
Wherein, M (i) represents the motion amplitude of i-th sampled point, and θ (i) represents the direction of motion of i-th sampled point, xpoint irepresent the coordinate of i-th sampled point on present frame, ypoint irepresent the coordinate of i-th sampled point on previous frame.
In step (44), the statistical nature of the effective surveyed area of described acquisition, the following formula of concrete employing realizes:
avem l = sum l N ml R l = N l N ml avem r = sum r N mr R r N r N mr
sum l = sum l + M l ( i ) M l ( i ) &GreaterEqual; 2 N ml = N ml + 1 M l ( i ) &GreaterEqual; 2 N l = N l + 1 M l ( i ) &GreaterEqual; 2 and &pi; 4 < &theta; l ( i ) &le; &pi; 2 sum r = sum r + M r ( i ) M r ( i ) &GreaterEqual; 2 N mr = N mr + 1 M r ( i ) &GreaterEqual; 2 N r = N r + 1 M r ( i ) &GreaterEqual; 2 and &pi; 2 < &theta; r ( i ) &le; 3 * &pi; 4
Wherein, the equation indicated with subscript l represents the feature of the effective surveyed area in the left side, and the equation indicated with subscript r represents the feature of the effective surveyed area in the right, sum l, sum rrepresent in left and right surveyed area the amplitude of the sampled point with obviously motion respectively, N ml, N mrrepresent in left and right surveyed area the number of the sampled point with obviously motion respectively, N l, N rrepresent in left and right surveyed area to there is obviously motion and to the number of the close sampled point of ear respectively.
In step (45), described judges whether to exist the action of raising one's hand near ear, and the following formula of concrete employing realizes:
exist 1 s l + s r = 1 0 s l + s r &NotEqual; 1
s l = 1 avem l &GreaterEqual; T ml and R l &GreaterEqual; T &theta;l 0 avem l < T ml or R l < T &theta;l
s r = 1 avem r &GreaterEqual; T mr and R r &GreaterEqual; T &theta;r 0 avem r < T mr or R r < T &theta;r
T ml = T mlb * 0.7 T &theta;l = T &theta;lb * 0.7 T mr = T mrb * 0.7 T &theta;r = T &theta;rb * 0.7
Wherein, exist=1 represents the action existing and raise one's hand near ear, and exist=0 represents the action not existing and raise one's hand near ear, T mlb, T θ lb, T mrb, T θ rbbe illustrated respectively in left and right effective surveyed area, standard raise one's hand near ear time mean motion intensity and motion range.
In step (51), the Gradient Features of described calculating present image, and gradient direction angle is corrected, the following formula of concrete employing realizes:
M ( x , y ) = G x 2 + G y 2 &theta; ( x , y ) = arctan ( G y G x )
G x = f ( x - 1 , y ) + f ( x + 1 , y ) - 2 * f ( x , y ) G y = f ( x , y - 1 ) + f ( x , y + 1 ) - 2 * f ( x , y )
&theta; ( x , y ) = &theta; ( x , y ) + &pi; &theta; ( x , y ) < 0 &theta; ( x , y ) &theta; ( x , y ) &GreaterEqual; 0
Wherein, M (x, y), θ (x, y) represent gradient magnitude and the direction at pixel (x, y) place respectively, and f (x, y) represents the gray-scale value at pixel (x, y) place.
In step (53), described slides hough transform window with a fixed step size in effective surveyed area, judges whether the region that hough transform window covers exists hand images; Concrete employing below order step realize:
(531) histogram of gradients of each subwindow overlay area is added up;
(532) proper vector of hough transform window overlay area is obtained;
(533) proper vector of hough transform window overlay area is normalized;
(534) proper vector of hough transform window overlay area is sent into hand svm sorter, predict its generic; If hand svm sorter predicts that this proper vector belongs to hand images, then hough transform window position is added in the list of candidate region.
Compare other driver and play phone-monitoring method, the present invention adopts video image processing technology, by triggering the hand existence of the real-time monitoring driving person's ear region of monitoring mode, judge to play phone behavior, have monitoring accuracy high, undetected flase drop is less, speed is fast, the features such as cost is low.
Accompanying drawing explanation
Fig. 1 is method flow diagram of the present invention;
Fig. 2 is trigger module workflow diagram;
Fig. 3 is monitoring module workflow diagram;
Fig. 4 is the positive sample master drawing of training;
Fig. 5 is the effective surveyed area schematic diagram in location;
Fig. 6 is point sampling design sketch;
Fig. 7 is that the subwindow of hough transform window divides schematic diagram;
Fig. 8 is hough transform window sliding schematic diagram;
Fig. 9 is hand couple candidate detection position effect figure, and wherein, rectangle frame 1 is effective surveyed area, and rectangle frame 2 is couple candidate detection regions;
Figure 10 is that hand finally detects position design sketch, and wherein, rectangle frame 1 is effective surveyed area, and rectangle frame 2 finally detects position.
Embodiment
The present invention is further illustrated below in conjunction with accompanying drawing.
As shown in Figure 1, in embodiment, the driver based on svm of the present invention plays phone-monitoring system and comprises initialization module, acquisition module, locating module, trigger module, monitoring module and communication module.The concrete implementation step that this driver based on svm plays phone-monitoring system is as follows:
S1, execution initialization module.
The function of initial module is, load and the necessary sorter file of training system, concrete steps are as follows:
S11, load existing Face datection sorter file.
S12, as shown in Figure 4, collects hand images when playing phone and, as positive sample, detects sorter file based on svm theory training hand.
S2, execution acquisition module.
The function of acquisition module is, the driving condition image of Real-time Collection driver, the mainly head image of driver.
S3, execution locating module.
The function of locating module is, selects suitable image region detect as hand exercise and play the effective coverage of phone detection, mainly selects the regional area near left and right ear.This module can promote detection speed greatly, removes more interference region, as shown in Figure 5.The concrete steps of this module are as follows:
S31, employing harr characteristic sum adaboost sorting algorithm, detect face location.
S32, five, three front yard layout rule based on face, the position that coarse positioning is left and right two.
S33, accurately orient the position of eyes.
S34, employing formula (1) and formula (2) select suitable effective image subregion.
rect left . x = point l . x - rect . width * 0.6 rect left . y = point l . y - rect . height * 0.16 rect left . width = rect . width * 0.6 rect left . height = rect . height * 1.16 - - - ( 1 )
rect right . x = point r . x rect right . y = point r . y - rect . height * 0.16 rect right . width = rect . width * 0.6 rect right . height = rect . height * 1.16 - - - ( 2 )
Wherein, rect left, rect rightsub-rectangular areas near the left and right ear representing selection respectively, rect represents the face location rectangular area detected, point l, point rrepresent the left hand edge point of left eye and the right hand edge point of right eye respectively.
S4, judge trigger module open and-shut mode, if trigger module is in opening, then perform step S5, if trigger module is in closed condition, then perform step S7.
S5, execution trigger module.
The function of trigger module is, whether real-time judge driver exists the warming-up exercise playing phone, specifically refers to whether driver exists the action of raising one's hand near ear.If existed, illustrate that driver likely prepares to play phone, now exit trigger module, return and open monitoring module signal; If there is no this action, system can proceed real-time judge, waits for the appearance next time playing phone warming-up exercise.As shown in Figure 2, the concrete steps of this module are as follows:
S51, in left and right effective surveyed area, carry out even point sampling respectively, effect is as Fig. 6.
S52, carry out the accurate tracking of sampled point.Concrete track algorithm is see document: Forward-Backward Error:Automatic Detection of Tracking Failures, Zdenek Kalal, Krystian Mikolajczyk, Jiri Matas, Pattern Recognition (ICPR), 2010 20th International Conference on.
S53, employing formula (3) and formula (4) obtain the movable information of correct tracking sampling point;
M ( i ) = sqrt ( D x 2 + D y 2 ) &theta; ( i ) = arctan ( D y D x ) - - - ( 3 )
D x = xpoint i . x - ypoint i . x D y = xpoint i . y - ypoint i . y - - - ( 4 )
Wherein, M (i) represents the motion amplitude of i-th sampled point, and θ (i) represents the direction of motion of i-th sampled point, xpoint irepresent the coordinate of i-th sampled point on present frame, ypoint irepresent the coordinate of i-th sampled point on previous frame.
S54, employing formula (5) and formula (6) obtain the statistical nature of effective surveyed area.The statistical nature of described effective surveyed area comprises the mean motion intensity avem of the effective surveyed area in the left side l, the effective surveyed area in the right mean motion intensity avem r, the effective surveyed area in the left side motion range R lwith the motion range R of the effective surveyed area in the right r.
avem l = sum l N ml R l = N l N ml avem r = sum r N mr R r N r N mr - - - ( 5 )
sum l = sum l + M l ( i ) M l ( i ) &GreaterEqual; 2 N ml = N ml + 1 M l ( i ) &GreaterEqual; 2 N l = N l + 1 M l ( i ) &GreaterEqual; 2 and &pi; 4 < &theta; l ( i ) &le; &pi; 2 sum r = sum r + M r ( i ) M r ( i ) &GreaterEqual; 2 N mr = N mr + 1 M r ( i ) &GreaterEqual; 2 N r = N r + 1 M r ( i ) &GreaterEqual; 2 and &pi; 2 < &theta; r ( i ) &le; 3 * &pi; 4 - - - ( 6 )
Wherein, the equation indicated with subscript l represents the feature of the effective surveyed area in the left side, and the equation indicated with subscript r represents the feature of the effective surveyed area in the right, sum l, sum rrepresent in left and right effective surveyed area the amplitude of the sampled point with obviously motion respectively, N ml, N mrrepresent in left and right effective surveyed area the number of the sampled point with obviously motion respectively, N l, N rrepresent that left and right effectively has obviously motion in surveyed area and to the number of the close sampled point of ear respectively.
S55, employing formula (7)-(10), judge whether to exist the action of raising one's hand near ear.If exist, then exit trigger module; If not, then continue to perform trigger module.
exist 1 s l + s r = 1 0 s l + s r &NotEqual; 1 - - - ( 7 )
s l = 1 avem l &GreaterEqual; T ml and R l &GreaterEqual; T &theta;l 0 avem l < T ml or R l < T &theta;l - - - ( 8 )
s r = 1 avem r &GreaterEqual; T mr and R r &GreaterEqual; T &theta;r 0 avem r < T mr or R r < T &theta;r - - - ( 9 )
T ml = T mlb * 0.7 T &theta;l = T &theta;lb * 0.7 T mr = T mrb * 0.7 T &theta;r = T &theta;rb * 0.7 - - - ( 10 )
Wherein, exist=1 represents the action existing and raise one's hand near ear, and exist=0 represents the action not existing and raise one's hand near ear, T mlb, T θ lb, T mrb, T θ rbbe illustrated respectively in left and right surveyed area, standard raise one's hand near ear time mean motion intensity and motion range.
S6, judge whether trigger monitoring module.Play the warming-up exercise of phone if exist, then monitoring module will be triggered unlatching, carry out in-depth monitoring, close trigger module simultaneously.If there is not the warming-up exercise playing phone, then directly get back to acquisition module, the triggering carried out next time judges.
S7, execution monitoring module.
The function of monitoring module is, monitors that the hand of driver rests on the time on ear side in detail.If the hand of driver rests on the time long enough on ear side, illustrate that driver plays phone, then return open communication module by signal.If not, then illustrate that this triggers unlatching and belongs to erroneous judgement.As shown in Figure 3, the concrete steps of this module are as follows:
S71, use effective surveyed area image as present image, adopt formula (11) and formula (12), calculate the Gradient Features of present image, and adopt formula (13) to correct gradient direction angle, the scope of gradient direction is limited to [0 π].
M ( x , y ) = G x 2 + G y 2 &theta; ( x , y ) = arctan ( G y G x ) - - - ( 11 )
G x = f ( x - 1 , y ) + f ( x + 1 , y ) - 2 * f ( x , y ) G y = f ( x , y - 1 ) + f ( x , y + 1 ) - 2 * f ( x , y ) - - - ( 12 )
&theta; ( x , y ) = &theta; ( x , y ) + &pi; &theta; ( x , y ) < 0 &theta; ( x , y ) &theta; ( x , y ) &GreaterEqual; 0 - - - ( 13 )
Wherein, M (x, y), θ (x, y) represent gradient magnitude and the direction at pixel (x, y) place, and f (x, y) represents the gray-scale value at pixel (x, y) place.
S72, structure wide height are respectively the hough transform window of W and H, and the value of W and H equals the wide high level of the training sample of hand svm sorter.As shown in Figure 7, subwindow division is carried out to hough transform window.Partitioning standards is: when driver does not answer the call, and because ear side does not have the interference of hand, the contouring head now in effective surveyed area is vertical generally, therefore, selects longitudinally narrow rectangular area to be rational as subwindow.
S73, as shown in Figure 8, hough transform window to be slided with a fixed step size in effective surveyed area, and judges whether the region that hough transform window covers exists hand images, and concrete steps are as follows:
S731, formula (14) is adopted to add up the histogram of gradients hist [bin] of each subwindow overlay area.
hist [ bin ] = hist [ bin ] + M ( x , y ) bin = floor ( &theta; ( x , y ) * 10 &pi; ) + 1 - - - ( 14 )
Wherein, the maximum integer being not more than x is chosen in floor (x) expression, and the span of bin is 1 ~ 10.
The histogram of gradients of S732, all subwindow overlay areas of connecting, obtains the proper vector of detection window overlay area.
S733, by the proper vector normalization of hough transform window overlay area;
S734, judge whether detection window overlay area belongs to hand images, and effect is as Fig. 9.In Fig. 9, square frame 1 represents left and right effectively surveyed area, and square frame 2 represents that hough transform window overlay area belongs to hand images.Concrete grammar is that the proper vector of detection window overlay area is sent into hand svm sorter, predicts its generic.If belong to hand images, then candidate region list is put in hough transform window position.
S74, judge whether detection window slides into ending place of present image.If do not arrive image ending, then hough transform window sliding is to next position, and again performs step S73; If arrived image ending, then perform step S75.
S75, present image is established to be of a size of W image, H image, carry out change of scale to present image, then after change of scale, picture size becomes σ W image, σ H image, and the image after change of scale is designated as present image.Wherein 0< σ <1.
S76, judge whether present image size is less than minimum threshold.If be less than thresholding, then illustrate that present image is too small, can not exist hand, perform step S77; If be greater than thresholding, then illustrate in present image still likely there is hand, perform step S71 and continue testing process.
S77, according to the number of the candidate target in the list of candidate region and position, the position of hand in comprehensive descision present frame, effect is as Figure 10.In Figure 10, square frame 1 represents left and right effectively surveyed area, and square frame 2 represents that hough transform window overlay area belongs to hand images.
S78, judge whether accumulation frame number reaches unit frame number.If reach, then perform step S79; If not, then perform step S71 and continue testing process.
In S79, statistical unit frame number, there is the frame number proportion of hand images.If this ratio is greater than certain threshold value, then illustrates that current driver's is in and play telephone state.
S710, unlatching trigger module, close monitoring module.
S8, executive communication module.
The function of communication module is, when driver be in play telephone state time, this module sends driver to remote server and plays the real-time video of phone, and now supervision department of transport enterprise can be processed timely by this video; If needed and driver's call, remote command can also be received by this module.
The above embodiment is only be described the preferred embodiment of the present invention; not scope of the present invention is limited; under not departing from the present invention and designing the prerequisite of spirit; the various distortion that those of ordinary skill in the art make technical scheme of the present invention and improvement, all should fall in protection domain that claims of the present invention determine.

Claims (10)

1. the driver based on svm plays a phone-monitoring method, it is characterized in that: this method for supervising comprises the step of following order:
(1) set up Face datection sorter and to play phone time hand images detect sorter for training the hand of positive sample;
(2) the driving condition image of Real-time Collection driver;
(3) suitable image region is selected to detect as hand exercise and play effective surveyed area of phone detection;
(4) according to effective surveyed area image, whether real-time judge driver exists the warming-up exercise playing phone; If so, step (5) is then performed; If not, then execution step (2) is returned;
(5) according to effective surveyed area image, monitor that in detail the hand of driver rests on the time on ear side, and judge whether driver plays phone according to the time length that the hand of driver rests on ear side; If so, step (6) is then performed; If not, then execution step (2) is returned;
(6) real-time video driver being played phone is sent to remote server, and receives the order of remote server transmission.
2. a kind of driver based on svm according to claim 1 plays phone-monitoring method, it is characterized in that: in step (3), the suitable image region of described selection detects and plays as hand exercise effective surveyed area that phone detects, and specifically comprises the step of following order:
(31) adopt harr characteristic sum adaboost sorting algorithm, detect face location;
(32) according to five, the three front yard layout rule of face, the position that coarse positioning is left and right two;
(33) position of eyes is accurately oriented;
(34) suitable effective image subregion is selected.
3. a kind of driver based on svm according to claim 1 plays phone-monitoring method, it is characterized in that: in step (4), described according to effective surveyed area image, whether real-time judge driver exists the warming-up exercise playing phone, specifically comprises the step of following order:
(41) in left and right effective surveyed area, even point sampling is carried out respectively;
(42) accurate tracking of sampled point is carried out;
(43) movable information of correct tracking sampling point is obtained;
(44) obtain the statistical nature of effective surveyed area, the statistical nature of described surveyed area comprises the mean motion intensity avem of left effectively surveyed area l, right effectively surveyed area mean motion intensity avem r, left effectively surveyed area motion range R lwith the motion range R of the effective surveyed area in the right side r;
(45) judge whether to exist the action of raising one's hand near ear.
4. a kind of driver based on svm according to claim 1 plays phone-monitoring method, it is characterized in that: in step (5), described according to effective surveyed area image, the hand of detailed supervision driver rests on time on ear side, and judges whether driver plays phone according to the time length that the hand of driver rests on ear side; Specifically comprise the step of following order:
(51) using effective surveyed area image as present image, calculate the Gradient Features of present image, and gradient direction angle corrected, the scope of gradient direction is limited to [0 π];
(52) structure one is wide is W, the high hough transform window for H, and the value of W and H equals the wide high level of the training sample of hand trainer respectively, and carries out subwindow division to hough transform window;
(53) hough transform window is slided with a fixed step size in effective surveyed area, judge whether the region that hough transform window covers exists hand images;
(54) judge whether hough transform window slides into ending place of present image; If so, step (55) is then performed; If not, then hough transform window sliding is to next position, again performs step (53);
(55) the wide as W of present image is set image, height is H image, carry out change of scale to present image, carrying out the wide of the image after change of scale is σ W image, height is σ H image, the image after carrying out change of scale is set to present image; Wherein, 0< σ <1;
(56) judge whether the size of present image is less than minimum threshold; If the size of present image is less than minimum threshold, then perform step (57); If the size of present image is greater than minimum threshold, then returns and perform step (51);
(57) according to number and the position of the candidate target in the list of candidate region, the position of hand in comprehensive descision present frame;
(58) judge whether accumulation frame number reaches unit frame number; If so, step (59) is then performed; If not, then execution step (51) is returned;
(59), in statistical unit frame number, there is the frame number proportion of hand images; And judge whether driver is according to the frame number proportion of hand images and play telephone state.
5. a kind of driver based on svm according to claim 2 plays phone-monitoring method, it is characterized in that: in step (34), the effective image subregion that described selection is suitable, and the following formula of concrete employing realizes:
rect left . x = point l . x - rect . width * 0.6 rect left . y = point l . y - rect . height * 0.16 rect left . width = rect . width * 0.6 rect left . height = rect . height * 1.16
rect right . x = point r . x rect right . y = point r . y - rect . height * 0.16 rect right . width = rect . width * 0.6 rect right . height = rect . height * 1.16
Wherein, rect left, rect rightsub-rectangular areas near the left and right ear representing selection respectively, rect represents the face location rectangular area detected, point l, point rrepresent the left hand edge point of left eye and the right hand edge point of right eye respectively.
6. a kind of driver based on svm according to claim 3 plays phone-monitoring method, it is characterized in that: in step (43), the movable information of the correct tracking sampling point of described acquisition, and the following formula of concrete employing realizes:
M ( i ) = sqrt ( D x 2 + D y 2 ) &theta; ( i ) = arctan ( D y D x )
D x = xpoint i . x - ypoint i . x D y = xpoint i . y - ypoint i . y
Wherein, M (i) represents the motion amplitude of i-th sampled point, and θ (i) represents the direction of motion of i-th sampled point, xpoint irepresent the coordinate of i-th sampled point on present frame, ypoint irepresent the coordinate of i-th sampled point on previous frame.
7. a kind of driver based on svm according to claim 3 plays phone-monitoring method, it is characterized in that: in step (44), the statistical nature of the effective surveyed area of described acquisition, and the following formula of concrete employing realizes:
avem l = sum l N ml R l = N l N ml avem r = sum r N mr R r = N r N mr
sum l = sum l + M l ( i ) M l ( i ) &GreaterEqual; 2 N ml = N ml + 1 M l ( i ) &GreaterEqual; 2 N l = N l + 1 M l ( i ) &GreaterEqual; 2 and &pi; 4 &theta; l ( i ) &le; &pi; 2 sum r = sum r + M r ( i ) M r ( i ) &GreaterEqual; 2 N mr = N mr + 1 M r ( i ) &GreaterEqual; 2 N r = N r + 1 M r ( i ) &GreaterEqual; 2 and &pi; 2 < &theta; r ( i ) &le; 3 * &pi; 4
Wherein, the equation indicated with subscript l represents the feature of the effective surveyed area in the left side, and the equation indicated with subscript r represents the feature of the effective surveyed area in the right, sum l, sum rrepresent in left and right surveyed area the amplitude of the sampled point with obviously motion respectively, N ml, N mrrepresent in left and right surveyed area the number of the sampled point with obviously motion respectively, N l, N rrepresent in left and right surveyed area to there is obviously motion and to the number of the close sampled point of ear respectively.
8. a kind of driver based on svm according to claim 3 plays phone-monitoring method, it is characterized in that: in step (45), and described judges whether to exist the action of raising one's hand near ear, and the following formula of concrete employing realizes:
exist = 1 s l + s r = 1 0 s l + s r &NotEqual; 1
s l = 1 avem l &GreaterEqual; T ml and R l &GreaterEqual; T &theta;l 0 avem l < T ml or R l < T &theta;l
s r = 1 avem r &GreaterEqual; T mr and R r &GreaterEqual; T &theta;r 0 avem r < T mr or R r < T &theta;r
T ml = T mlb * 0.7 T &theta;l = T &theta;lb * 0.7 T mr = T mrb * 0.7 T &theta;r = T &theta;rb * 0.7
Wherein, exist=1 represents the action existing and raise one's hand near ear, and exist=0 represents the action not existing and raise one's hand near ear, T mlb, T θ lb, T mrb, T θ rbbe illustrated respectively in left and right effective surveyed area, standard raise one's hand near ear time mean motion intensity and motion range.
9. a kind of driver based on svm according to claim 4 plays phone-monitoring method, it is characterized in that: in step (51), the Gradient Features of described calculating present image, and gradient direction angle is corrected, the following formula of concrete employing realizes:
M ( x , y ) = G x 2 + G y 2 &theta; ( x , y ) = arctan ( G y G x )
G x = f ( x - 1 , y ) + f ( x + 1 , y ) - 2 * f ( x , y ) G y = f ( x , y - 1 ) + f ( x , y + 1 ) - 2 * f ( x , y )
&theta; ( x , y ) = &theta; ( x , y ) + &pi; &theta; ( x , y ) < 0 &theta; ( x , y ) &theta; ( x , y ) &GreaterEqual; 0
Wherein, M (x, y), θ (x, y) represent gradient magnitude and the direction at pixel (x, y) place respectively, and f (x, y) represents the gray-scale value at pixel (x, y) place.
10. a kind of driver based on svm according to claim 4 plays phone-monitoring method, it is characterized in that: in step (53), described slides hough transform window with a fixed step size in effective surveyed area, judges whether the region that hough transform window covers exists hand images; Concrete employing below order step realize:
(531) histogram of gradients of each subwindow overlay area is added up;
(532) proper vector of hough transform window overlay area is obtained;
(533) proper vector of hough transform window overlay area is normalized;
(534) proper vector of hough transform window overlay area is sent into hand svm sorter, predict its generic; If hand svm sorter predicts that this proper vector belongs to hand images, then hough transform window position is added in the list of candidate region.
CN201510013139.4A 2015-01-09 2015-01-09 A kind of driver based on svm takes phone-monitoring method Active CN104573659B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510013139.4A CN104573659B (en) 2015-01-09 2015-01-09 A kind of driver based on svm takes phone-monitoring method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510013139.4A CN104573659B (en) 2015-01-09 2015-01-09 A kind of driver based on svm takes phone-monitoring method

Publications (2)

Publication Number Publication Date
CN104573659A true CN104573659A (en) 2015-04-29
CN104573659B CN104573659B (en) 2018-01-09

Family

ID=53089681

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510013139.4A Active CN104573659B (en) 2015-01-09 2015-01-09 A kind of driver based on svm takes phone-monitoring method

Country Status (1)

Country Link
CN (1) CN104573659B (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104966059A (en) * 2015-06-15 2015-10-07 安徽创世科技有限公司 Method for detecting phoning behavior of driver during driving based on intelligent monitoring system
CN105469073A (en) * 2015-12-16 2016-04-06 安徽创世科技有限公司 Kinect-based call making and answering monitoring method of driver
CN105868690A (en) * 2016-03-11 2016-08-17 博康智能信息技术有限公司 Method and apparatus for identifying mobile phone use behavior of driver
CN106022242A (en) * 2016-05-13 2016-10-12 哈尔滨工业大学(威海) Driver call making identification method in intelligent transportation system
CN106530730A (en) * 2016-11-02 2017-03-22 重庆中科云丛科技有限公司 Traffic violation detection method and system
WO2018133874A1 (en) * 2017-01-23 2018-07-26 杭州海康威视数字技术股份有限公司 Method and device for sending warning message
CN108509902A (en) * 2018-03-30 2018-09-07 湖北文理学院 A kind of hand-held telephone relation behavioral value method during driver drives vehicle
CN110309764A (en) * 2019-06-27 2019-10-08 浙江工业大学 A kind of multistage driver based on deep learning makes a phone call behavioral value method
WO2020063753A1 (en) * 2018-09-27 2020-04-02 北京市商汤科技开发有限公司 Maneuver recognition and driving maneuver analysis method and apparatus, and electronic device
CN111523380A (en) * 2020-03-11 2020-08-11 浙江工业大学 Mask wearing condition monitoring method based on face and posture recognition
CN111553217A (en) * 2020-04-20 2020-08-18 哈尔滨工程大学 Driver call monitoring method and system
CN112487990A (en) * 2020-12-02 2021-03-12 重庆邮电大学 DSP-based driver call-making behavior detection method and system

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102567743A (en) * 2011-12-20 2012-07-11 东南大学 Automatic identification method of driver gestures based on video images
CN102592143A (en) * 2012-01-09 2012-07-18 清华大学 Method for detecting phone holding violation of driver in driving
US20120214467A1 (en) * 2009-10-31 2012-08-23 Saied Tadayon Method and System for Receiving Forwarded or Delegated Mobile Device Functions
CN103129492A (en) * 2013-03-01 2013-06-05 公安部第三研究所 Vehicular intelligent multimedia terminal device
CN103279750A (en) * 2013-06-14 2013-09-04 清华大学 Detecting method of mobile telephone holding behavior of driver based on skin color range
CN103366506A (en) * 2013-06-27 2013-10-23 北京理工大学 Device and method for automatically monitoring telephone call behavior of driver when driving

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120214467A1 (en) * 2009-10-31 2012-08-23 Saied Tadayon Method and System for Receiving Forwarded or Delegated Mobile Device Functions
CN102567743A (en) * 2011-12-20 2012-07-11 东南大学 Automatic identification method of driver gestures based on video images
CN102592143A (en) * 2012-01-09 2012-07-18 清华大学 Method for detecting phone holding violation of driver in driving
CN103129492A (en) * 2013-03-01 2013-06-05 公安部第三研究所 Vehicular intelligent multimedia terminal device
CN103279750A (en) * 2013-06-14 2013-09-04 清华大学 Detecting method of mobile telephone holding behavior of driver based on skin color range
CN103366506A (en) * 2013-06-27 2013-10-23 北京理工大学 Device and method for automatically monitoring telephone call behavior of driver when driving

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
陈雪梅 等: ""面向智能驾驶行为的机器学习"", 《道路交通与安全》 *

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104966059B (en) * 2015-06-15 2018-04-27 安徽创世科技股份有限公司 Driver based on intelligent monitor system drives detection method of making a phone call
CN104966059A (en) * 2015-06-15 2015-10-07 安徽创世科技有限公司 Method for detecting phoning behavior of driver during driving based on intelligent monitoring system
CN105469073A (en) * 2015-12-16 2016-04-06 安徽创世科技有限公司 Kinect-based call making and answering monitoring method of driver
CN105868690A (en) * 2016-03-11 2016-08-17 博康智能信息技术有限公司 Method and apparatus for identifying mobile phone use behavior of driver
CN106022242A (en) * 2016-05-13 2016-10-12 哈尔滨工业大学(威海) Driver call making identification method in intelligent transportation system
CN106022242B (en) * 2016-05-13 2019-05-03 哈尔滨工业大学(威海) Driver's making and receiving calls recognition methods in intelligent transportation system
CN106530730A (en) * 2016-11-02 2017-03-22 重庆中科云丛科技有限公司 Traffic violation detection method and system
CN108345819B (en) * 2017-01-23 2020-09-15 杭州海康威视数字技术股份有限公司 Method and device for sending alarm message
WO2018133874A1 (en) * 2017-01-23 2018-07-26 杭州海康威视数字技术股份有限公司 Method and device for sending warning message
CN108345819A (en) * 2017-01-23 2018-07-31 杭州海康威视数字技术股份有限公司 A kind of method and apparatus sending warning message
US11386698B2 (en) 2017-01-23 2022-07-12 Hangzhou Hikvision Digital Technology Co., Ltd. Method and device for sending alarm message
CN108509902A (en) * 2018-03-30 2018-09-07 湖北文理学院 A kind of hand-held telephone relation behavioral value method during driver drives vehicle
CN108509902B (en) * 2018-03-30 2020-07-03 湖北文理学院 Method for detecting call behavior of handheld phone in driving process of driver
CN110956060A (en) * 2018-09-27 2020-04-03 北京市商汤科技开发有限公司 Motion recognition method, driving motion analysis method, device and electronic equipment
WO2020063753A1 (en) * 2018-09-27 2020-04-02 北京市商汤科技开发有限公司 Maneuver recognition and driving maneuver analysis method and apparatus, and electronic device
CN110309764A (en) * 2019-06-27 2019-10-08 浙江工业大学 A kind of multistage driver based on deep learning makes a phone call behavioral value method
CN111523380A (en) * 2020-03-11 2020-08-11 浙江工业大学 Mask wearing condition monitoring method based on face and posture recognition
CN111553217A (en) * 2020-04-20 2020-08-18 哈尔滨工程大学 Driver call monitoring method and system
CN112487990A (en) * 2020-12-02 2021-03-12 重庆邮电大学 DSP-based driver call-making behavior detection method and system

Also Published As

Publication number Publication date
CN104573659B (en) 2018-01-09

Similar Documents

Publication Publication Date Title
CN104573659A (en) Driver call-making and call-answering monitoring method based on svm
CN102542289B (en) Pedestrian volume statistical method based on plurality of Gaussian counting models
CN105260744B (en) The automatic on-line diagnostic method and system of a kind of goods train coupler yoke key position failure
CN104637058B (en) A kind of volume of the flow of passengers identify statistical methods based on image information
CN104239867B (en) License plate locating method and system
CN103729854B (en) A kind of method for detecting infrared puniness target based on tensor model
CN102982313B (en) The method of Smoke Detection
CN103810717B (en) A kind of human body behavioral value method and device
CN100454355C (en) Video method for collecting information of vehicle flowrate on road in real time
CN105426905A (en) Robot barrier identification method based on gradient histogram and support vector machine
CN108229407A (en) A kind of behavioral value method and system in video analysis
CN105225523B (en) A kind of parking space state detection method and device
CN105303191A (en) Method and apparatus for counting pedestrians in foresight monitoring scene
CN104268528A (en) Method and device for detecting crowd gathered region
CN107292870A (en) Track plug pin fault detection method and system based on image alignment with detection network model
CN105551264A (en) Speed detection method based on license plate characteristic matching
CN108388871B (en) Vehicle detection method based on vehicle body regression
CN102855500A (en) Haar and HoG characteristic based preceding car detection method
CN107491752A (en) Ship board character recognition method, device in a kind of natural scene based on deep learning
CN103632427B (en) A kind of gate cracking protection method and gate control system
CN108038424A (en) A kind of vision automated detection method suitable for working at height
CN112329691A (en) Monitoring video analysis method and device, electronic equipment and storage medium
CN104573724A (en) Method for monitoring call making and receiving behaviors of driver
CN105809219B (en) A kind of the prefabricated pipe section quality testing statistical system and method for tunnel prefabricated pipe section production line
CN113947731A (en) Foreign matter identification method and system based on contact net safety inspection

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant