CN104318547B - The many binoculars accelerated based on GPU splice intelligent analysis system - Google Patents

The many binoculars accelerated based on GPU splice intelligent analysis system Download PDF

Info

Publication number
CN104318547B
CN104318547B CN201410528309.8A CN201410528309A CN104318547B CN 104318547 B CN104318547 B CN 104318547B CN 201410528309 A CN201410528309 A CN 201410528309A CN 104318547 B CN104318547 B CN 104318547B
Authority
CN
China
Prior art keywords
target
camera
gpu
imaging
binocular camera
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410528309.8A
Other languages
Chinese (zh)
Other versions
CN104318547A (en
Inventor
尚凌辉
高勇
王弘玥
刘家佳
余天明
施展
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ZHEJIANG ICARE VISION TECHNOLOGY Co Ltd
Original Assignee
ZHEJIANG ICARE VISION TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZHEJIANG ICARE VISION TECHNOLOGY Co Ltd filed Critical ZHEJIANG ICARE VISION TECHNOLOGY Co Ltd
Priority to CN201410528309.8A priority Critical patent/CN104318547B/en
Publication of CN104318547A publication Critical patent/CN104318547A/en
Application granted granted Critical
Publication of CN104318547B publication Critical patent/CN104318547B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/30Determination of transform parameters for the alignment of images, i.e. image registration
    • G06T7/35Determination of transform parameters for the alignment of images, i.e. image registration using statistical methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10016Video; Image sequence

Abstract

Splice intelligent analysis system the present invention relates to a kind of many binoculars accelerated based on GPU.The present invention is using binocular camera as video capture device, wherein binocular camera is ceiling mounted, vertical vertical view shoots ground, blocked mutually so as to ensure that the target for photographing does not exist, when scene domain cannot be covered with a binocular camera, many cameras are then installed, and form visual range by the way of the splicing;Based on the vertical depth map and cromogram for shooting, you can realize the intelligent video analysis function of various high accuracies.Two-way output video simultaneously for each binocular camera carries out parallel computation to accelerate analysis result using GPU.Present invention uses depth information, and RGB information is combined so that the stability of target detection is greatly improved, for follow-up intellectual analysis provide good basis.

Description

The many binoculars accelerated based on GPU splice intelligent analysis system
Technical field
The invention belongs to video brainpower watch and control technical field, it is related to a kind of many binoculars splicing intelligence point accelerated based on GPU Analysis system.
Background technology
At present, the Intellectual Analysis Technology based on video has been widely applied to all trades and professions, including bank, and traffic is public Peace etc..But actual effect is usually not fully up to expectations, to find out its cause, mainly have it is following some:
1) in order to cost-effective, many intelligent video analysis frequently with existing mounted camera, shooting angle and into As in effect, being often unfavorable for intellectual analysis.
2) common 2D cameras cannot judge the distance between target and camera, cause in same optical axis, different far and near mesh Mark is easy to overlap and blocks, so as to cause the erroneous judgement of intellectual analysis.
3) calculating platform of intellectual analysis is limited in one's ability, and algorithm is usually needed by after the lossy optimization of performance, ability Meet treatment in real time.
The content of the invention
The present invention is in view of the shortcomings of the prior art, there is provided a kind of to be shot based on vertical splicing and GPU speed-up computations Binocular video intelligent analysis system.
The technical solution adopted for solving the technical problem of the present invention is:
, using binocular camera as video capture device, wherein binocular camera is ceiling mounted, vertically bows for the present invention Depending on shooting ground, so that ensure the target for photographing in the absence of blocking mutually, when scene domain cannot be covered with a binocular camera Gai Shi, then install many cameras, and forms visual range by the way of the splicing;Based on the vertical depth map and colour for shooting Figure, you can realize the intelligent video analysis function of various high accuracies.
Described splicing is specifically:To the pixel in each magazine each target area, entered according to its depth information Then row is offset using set polyphaser and demarcated to the projection on ground, the association of target after being projected, so that World coordinates fastens the target detection realized under polyphaser.
Two-way output video simultaneously for each binocular camera carries out parallel computation to accelerate analysis result using GPU.
Furtherly, the association of target is specifically after described projection:
The target for being provided with certain altitude is in two public domains of camera, if camera heights are z0, two camera specifications Unanimously, then areas imaging is W*H, is highly z if target image coordinate is (x, y), then wherein first overall situation of camera subject Coordinate (Xw, Yw) is:
Relative to first skew of camera it is (dx, dy) in view of second camera, then second camera subject is complete Office coordinate (Xw, Yw) be:
Cluster the projected centre point that obtains and represent the target by the MeanShift that is a little to target, by than Compared with the distance of all projected centre points under adjacent cameras, and given threshold Td, target of the distance less than Td is considered same target, So as to reach the purpose that removal repeats target.
Furtherly, when camera lens are fixed, when the distance of vertically vertical view shooting, and target to camera lens is fixed, it is assumed that because The fat thin imaging size for causing of behaving meets Gaussian distribution model:
Wherein μ represents under this height that the average of target imaging size, σ represents that the imaging caused because human body is fat or thin is missed Poor standard deviation.Assuming that above-mentioned model is to count to set up when target is fixed as D to distance of camera lens, when camera heights and target When highly changing, according to pinhole imaging system principle, the size of target imaging is inversely proportional with it to the distance of camera lens, then we Obtain, when camera heights are H, and object height is h, its imaging size should meet Gaussian distribution model:
So, when known to camera setting height(from bottom), by depth map it is estimated that the big mini Mod of the imaging of target, so that For follow-up intellectual analysis provide effective priori.
Beneficial effects of the present invention:
1) depth information has been used, and has combined RGB information so that the stability of target detection has been greatly improved, be follow-up intelligence Can analyze and provide good basis.
2) installed using ceiling, the camera mounting means for vertically being shot towards ground efficiently solves the problems, such as target occlusion.Together Shi Liyong depth informations, floor projection is carried out to target, so as to realize the target association under polyphaser, is solved due to ceiling peace Fill the small problem of the visual range for causing.
3) calculating platform uses GPU, and the parallelization optimization of combination algorithm greatly improves arithmetic speed, so as to realize high score Resolution, the depth map intellectual analysis of polyphaser.
4) counted by the priori to different fat or thin targets, with reference to depth information, the size to target imaging sets up model, For follow-up intellectual analysis provide priori.
Brief description of the drawings
Fig. 1 is polyphaser imaging schematic diagram;
Fig. 2 is that polyphaser is imaged top view;
Fig. 3 is that binocular camera sets up schematic diagram;
Fig. 4 is binocular camera and ATM sphere of action schematic diagram.
Specific embodiment
The invention will be further described with accompanying drawing with reference to embodiments:
Present invention employs binocular camera as video capture device, there is every binocular camera two-way analog video to believe Number, left view and right view are represented respectively.Binocular camera requirement ceiling is ceiling mounted, and vertical vertical view shoots ground, from And ensure that the target for photographing does not exist and block mutually.When scene domain cannot be covered with a binocular camera, can install many Platform camera, and visual range more is formed by the way of splicing, cardinal principle is as follows:
To the pixel in each magazine each target area, the projection on ground is proceeded to according to its depth information, Then demarcated using set polyphaser skew (dx, dy), the association of target after being projected, so as in global coordinate system On realize under polyphaser target detection
As depicted in figs. 1 and 2, the target for having certain altitude is in two public domains of camera.If camera heights are z0, Assuming that two camera specifications are consistent, then areas imaging is W*H, is highly z if target image coordinate is (x, y), then A cameras mesh Target world coordinates (Xw, Yw) is:
In view of B cameras relative to A cameras skew be (dx, dy), then the world coordinates (Xw, Yw) of B camera subjects be:
Generally human region includes multiple pixels, because camera setting angle is different, the foreground point of same target Distribution is different, projects to that be distributed behind ground also can be different, therefore is clustered by the MeanShift that is a little to target To projected centre point represent the target.By comparing the distance of all projected centre points under adjacent cameras, and given threshold Td, target of the distance less than Td is considered same target, so as to reach the purpose that removal repeats target.
After all of camera is installed, it is uniformly accessed into the calculating platform based on GPU, the form of calculating platform is main Based on the x86 servers comprising GPU, or the embedded device comprising GPU.Calculating platform mainly completes following Business:
1:The access of binocular video.
The binocular camera of this programme uses analog signal output, and each camera has two-path video.
2:Calculate binocular depth figure.
The method comparison for calculating depth map using left view and right view is complicated, but is especially suitable for parallel computation, the present invention The algorithm of use has carried out the parallel optimization treatment of height, the speed of service very high can be obtained on GPU, so as to realize high score Resolution, the real-time deep figure of polyphaser is calculated.
3:Realize specific intelligent video analysis function.
Based on the vertical depth map and cromogram for shooting, you can to realize the intelligent video analysis work(of various high accuracies Can, such as passenger flow statisticses, behavioural analysis etc..
When camera lens are fixed, when the distance of vertically vertical view shooting, and target to camera lens is fixed, it is assumed that because human body is fat or thin The imaging size for causing meets Gaussian Profile:
Wherein μ represents under this height that the average of target imaging size, σ represents that the imaging caused because human body is fat or thin is missed Poor standard deviation.
Assuming that model above is to count to set up when target is fixed as D to distance of camera lens, when camera heights and target are high When degree changes, according to pinhole imaging system principle, the size of target imaging is inversely proportional with it to the distance of camera lens, then can obtain Arrive, when camera heights are H, and object height is h, its imaging size should meet Gauss model:
So, when known to camera setting height(from bottom), by depth map it is estimated that the big mini Mod of the imaging of target, so that For follow-up intellectual analysis provide effective priori.
As shown in figure 3, by taking the behavioural analysis application of the self-service business halls of ATM as an example:
Antenna height:3 meters or so;
Set up angle:Vertical 90 degree;
Decorating position:Near the ceiling of ATM;
When antenna height is 2.8m, it is assumed that human body is up to 1.8m, separate unit binocular camera coverage is 2m*2m left It is right.
And according to measuring and calculating, a width for ATM is about in 0.8-1m or so, therefore a binocular camera can cover substantially Scope before 2 ATMs of lid, 4 ATMs are possessed with one, as a example by area is for the medium-sized business hall of 3m*Sm, set up two pairs Mesh camera can just meet demand substantially, referring to Fig. 4.
Two binocular cameras (4 road video) are linked into the calculating platform based on GPU, the behavior point based on depth map is realized The intellectual analysis functions such as analysis algorithm, there is provided " breaking ATM ", " robbery ", " falling down to the ground ", " withdraw the money and trail ".
In sum, the depth camera in the present invention, in addition to obtaining common colour information, moreover it is possible to obtain the depth of scene Degree information, so as to judge the distance between target and camera, has more preferable effect for target detection.Camera mounting means is It is vertical to overlook, it is ensured that the target in scene does not exist eclipse phenomena, further, since mounting means and camera specification is controllable, The yardstick of target under different setting height(from bottom)s can be in advance calibrated, is known for subsequent algorithm analysis provides very valuable priori Know.Calculating platform is based on GPU architecture, and algorithm on GPU by that after parallel optimization, can obtain adding for decades of times higher than CPU Fast performance, so as to for algorithm provides more computing resources, increasingly complex algorithm can be run to obtain more preferable effect.
The above, only presently preferred embodiments of the present invention is not intended to limit the scope of the present invention, should band Understand, the present invention is not limited to implementation as described herein, the purpose of these implementations description is to help this area In technical staff practice the present invention.

Claims (3)

1. many binoculars splicing intelligent analysis system for being accelerated based on GPU, it is characterised in that:
The system is using binocular camera as video capture device, and wherein binocular camera is ceiling mounted, and vertical vertical view is clapped Ground is taken the photograph, so that ensure the target for photographing in the absence of blocking mutually, when scene domain cannot be covered with a binocular camera, Many cameras are then installed, and form visual range by the way of the splicing;Based on the vertical depth map and cromogram for shooting, i.e., It is capable of achieving the intelligent video analysis function of various high accuracies;
Described splicing is specifically:To the pixel in each magazine each target area, proceeded to according to its depth information Projection on ground, is then offset using set polyphaser and demarcated, the association of target after being projected, so as in the overall situation The target detection under polyphaser is realized on coordinate system;
Two-way output video simultaneously for each binocular camera carries out parallel computation to accelerate analysis result using GPU.
2. many binoculars accelerated based on GPU according to claim 1 splice intelligent analysis system, it is characterised in that:It is described Projection after the association of target be specifically:
The target for being provided with certain altitude is in two public domains of camera, if camera heights are z0, two camera specifications are consistent, Then areas imaging is W*H, is highly z if target image coordinate is (x, y), then wherein first world coordinates of camera subject (Xw, Yw) is:
{ ( W - 1 2 - x ) z z 0 + x , ( H - 1 2 - y ) z z 0 + y }
Relative to first skew of camera be (dx, dy) in view of second camera, then the global seat of second camera subject Marking (Xw, Yw) is:
{ ( W - 1 2 - x ) z z 0 + x + d x , ( H - 1 2 - y ) z z 0 + y + d y }
Cluster the projected centre point that obtains and represent the target by the MeanShift that is a little to target, by comparing phase The distance of all projected centre points under adjacent camera, and given threshold Td, target of the distance less than Td are considered same target, so that Reach the purpose that removal repeats target.
3. many binoculars accelerated based on GPU according to claim 1 splice intelligent analysis system, it is characterised in that:
When camera lens are fixed, when the distance of vertically vertical view shooting, and target to camera lens is fixed, it is assumed that cause because human body is fat or thin Imaging size meet Gaussian distribution model:
f ( x ) = 1 2 π σ exp [ - ( x - μ ) 2 2 xσ 2 ]
Wherein μ represented under this height, the average of target imaging size, and σ represents the image error that causes because human body is fat or thin Standard deviation;Assuming that above-mentioned model is to count to set up when target is fixed as D to distance of camera lens, when camera heights and object height When changing, according to pinhole imaging system principle, the size of target imaging is inversely proportional with it to the distance of camera lens, then obtain, when Camera heights are H, and when object height is h, its imaging size should meet Gaussian distribution model:
f ( x ) = 1 2 π σ exp [ - ( x - μ ) 2 2 xσ 2 ] * D H - h
So, when known to camera setting height(from bottom), by depth map it is estimated that the big mini Mod of the imaging of target, so that after being Continuous intellectual analysis provide effective priori.
CN201410528309.8A 2014-10-09 2014-10-09 The many binoculars accelerated based on GPU splice intelligent analysis system Active CN104318547B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410528309.8A CN104318547B (en) 2014-10-09 2014-10-09 The many binoculars accelerated based on GPU splice intelligent analysis system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410528309.8A CN104318547B (en) 2014-10-09 2014-10-09 The many binoculars accelerated based on GPU splice intelligent analysis system

Publications (2)

Publication Number Publication Date
CN104318547A CN104318547A (en) 2015-01-28
CN104318547B true CN104318547B (en) 2017-06-23

Family

ID=52373773

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410528309.8A Active CN104318547B (en) 2014-10-09 2014-10-09 The many binoculars accelerated based on GPU splice intelligent analysis system

Country Status (1)

Country Link
CN (1) CN104318547B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106683071B (en) 2015-11-06 2020-10-30 杭州海康威视数字技术股份有限公司 Image splicing method and device
CN105787469B (en) * 2016-03-25 2019-10-18 浩云科技股份有限公司 The method and system of pedestrian monitoring and Activity recognition
CN106910217A (en) * 2017-03-17 2017-06-30 驭势科技(北京)有限公司 Vision map method for building up, computing device, computer-readable storage medium and intelligent vehicle

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102521646A (en) * 2011-11-11 2012-06-27 浙江捷尚视觉科技有限公司 Complex scene people counting algorithm based on depth information cluster
CN102867175A (en) * 2012-08-31 2013-01-09 浙江捷尚视觉科技有限公司 Stereoscopic vision-based ATM (automatic teller machine) machine behavior analysis method
CN104063863A (en) * 2014-06-20 2014-09-24 交通运输部水运科学研究所 Pitch-down type binocular vision system for watercourse monitoring and image processing method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102521646A (en) * 2011-11-11 2012-06-27 浙江捷尚视觉科技有限公司 Complex scene people counting algorithm based on depth information cluster
CN102867175A (en) * 2012-08-31 2013-01-09 浙江捷尚视觉科技有限公司 Stereoscopic vision-based ATM (automatic teller machine) machine behavior analysis method
CN104063863A (en) * 2014-06-20 2014-09-24 交通运输部水运科学研究所 Pitch-down type binocular vision system for watercourse monitoring and image processing method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
ATM机异常行为识别系统的设计与实现;汤思远 等;《福建电脑》;20130925;28-29,101 *
双目俯视视觉导航相机标定及障碍物检测算法研究;曹煜涛;《万方数据 天津工业大学学位论文》;20140715;第2.2-2.3节,图2-4 *

Also Published As

Publication number Publication date
CN104318547A (en) 2015-01-28

Similar Documents

Publication Publication Date Title
KR102003152B1 (en) Information processing method, device, and terminal
US9741170B2 (en) Method for displaying augmented reality content based on 3D point cloud recognition, and apparatus and system for executing the method
CN106204595B (en) A kind of airdrome scene three-dimensional panorama monitoring method based on binocular camera
EP4016457A1 (en) Positioning method and apparatus
CN106780620A (en) A kind of table tennis track identification positioning and tracking system and method
RU2016146354A (en) DETECTION OF THE STATE USING IMAGE PROCESSING
US20230016896A1 (en) System and method for free space estimation
EP3070430A1 (en) Moving body position estimation device and moving body position estimation method
CN103247045A (en) Method of obtaining artificial scene main directions and image edges from multiple views
CN104318547B (en) The many binoculars accelerated based on GPU splice intelligent analysis system
US10984263B2 (en) Detection and validation of objects from sequential images of a camera by using homographies
CN112991401B (en) Vehicle running track tracking method and device, electronic equipment and storage medium
CN107240120A (en) The tracking and device of moving target in video
EP3806039A1 (en) Spatial positioning method and device, system thereof and computer-readable medium
CN108521554A (en) Large scene multi-target cooperative tracking method, intelligent monitor system, traffic system
De Bruin et al. Drone-based traffic flow estimation and tracking using computer vision: transportation engineering
KR102072796B1 (en) Method, system and non-transitory computer-readable recording medium for calculating spatial coordinates of a region of interest
CN108416798A (en) A kind of vehicle distances method of estimation based on light stream
CN103500454A (en) Method for extracting moving target of shaking video
CN108924627A (en) Position distribution display methods, device, equipment and the storage medium of Moving Objects
CN108960012A (en) Feature point detecting method, device and electronic equipment
CN104809720B (en) The two camera target association methods based on small intersection visual field
CN105138979A (en) Method for detecting the head of moving human body based on stereo visual sense
CN104539926B (en) Distance determines method and apparatus
WO2023061356A1 (en) Advertisement serving method and apparatus, device, storage medium and computer program product

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
PE01 Entry into force of the registration of the contract for pledge of patent right
PE01 Entry into force of the registration of the contract for pledge of patent right

Denomination of invention: GPU acceleration-based multi-binocular splicing intelligent analysis system

Effective date of registration: 20190821

Granted publication date: 20170623

Pledgee: Hangzhou Yuhang Small and Medium-sized Enterprise Transfer Service Co., Ltd.

Pledgor: ZHEJIANG ICARE VISION TECHNOLOGY CO., LTD.

Registration number: Y2019330000020