US20190108384A1 - System and method for aerial video traffic analysis - Google Patents


Publication number
US20190108384A1
Authority
US
United States
Prior art keywords
vehicle
video image
image sequence
image
video
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
US15/725,747
Other versions
US10410055B2 (en)
Inventor
Yijie Wang
Panqu Wang
Pengfei Chen
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tusimple Inc
Original Assignee
Tusimple Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tusimple Inc
Priority to US15/725,747 (US10410055B2)
Assigned to TuSimple: assignment of assignors' interest (see document for details). Assignors: WANG, Yijie; CHEN, Pengfei; WANG, Panqu
Priority to EP18864500.6A (EP3692428A4)
Priority to CN201880065098.5A (CN111201496B)
Priority to AU2018345330A (AU2018345330B2)
Priority to CN202310794571.6A (CN116844072A)
Priority to PCT/US2018/053795 (WO2019070604A1)
Publication of US20190108384A1
Publication of US10410055B2
Application granted
Assigned to TUSIMPLE, INC.: change of name (see document for details). Assignor: TuSimple
Priority to AU2023278047A (AU2023278047A1)
Legal status: Active
Adjusted expiration

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/11Region-based segmentation
    • G06K9/0063
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B64AIRCRAFT; AVIATION; COSMONAUTICS
    • B64DEQUIPMENT FOR FITTING IN OR TO AIRCRAFT; FLIGHT SUITS; PARACHUTES; ARRANGEMENT OR MOUNTING OF POWER PLANTS OR PROPULSION TRANSMISSIONS IN AIRCRAFT
    • B64D47/00Equipment not otherwise provided for
    • B64D47/08Arrangements of cameras
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G06K9/00765
    • G06K9/209
    • G06K9/6256
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/194Segmentation; Edge detection involving foreground-background segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • G06T7/254Analysis of motion involving subtraction of images
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/255Detecting or recognising potential candidate objects based on visual cues, e.g. shapes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/26Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/56Extraction of image or video features relating to colour
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/10Terrestrial scenes
    • G06V20/13Satellite images
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/10Terrestrial scenes
    • G06V20/17Terrestrial scenes taken from planes or by drones
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/49Segmenting video sequences, i.e. computational techniques such as parsing or cutting the sequence, low-level clustering or determining units such as shots or scenes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/50Context or environment of the image
    • G06V20/52Surveillance or monitoring of activities, e.g. for recognising suspicious objects
    • G06V20/54Surveillance or monitoring of activities, e.g. for recognising suspicious objects of traffic, e.g. cars on the road, trains or boats
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0108Measuring and analyzing of parameters relative to traffic conditions based on the source of data
    • G08G1/012Measuring and analyzing of parameters relative to traffic conditions based on the source of data from other sources than vehicle or roadside beacons, e.g. mobile networks
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0125Traffic data processing
    • G08G1/0129Traffic data processing for creating historical data or processing based on historical data
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/017Detecting movement of traffic to be counted or controlled identifying vehicles
    • G08G1/0175Detecting movement of traffic to be counted or controlled identifying vehicles by photographing vehicles, e.g. when violating traffic rules
    • B64C2201/123
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B64AIRCRAFT; AVIATION; COSMONAUTICS
    • B64CAEROPLANES; HELICOPTERS
    • B64C39/00Aircraft not otherwise provided for
    • B64C39/02Aircraft not otherwise provided for characterised by special use
    • B64C39/024Aircraft not otherwise provided for characterised by special use of the remote controlled vehicle type, i.e. RPV
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B64AIRCRAFT; AVIATION; COSMONAUTICS
    • B64UUNMANNED AERIAL VEHICLES [UAV]; EQUIPMENT THEREFOR
    • B64U2101/00UAVs specially adapted for particular uses or applications
    • B64U2101/30UAVs specially adapted for particular uses or applications for imaging, photography or videography
    • G06K2209/21
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10032Satellite or aerial image; Remote sensing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20212Image combination
    • G06T2207/20224Image subtraction
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30236Traffic on road, railway or crossing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/42Global feature extraction by analysis of the whole pattern, e.g. using frequency domain transformations or autocorrelation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/62Extraction of image or video features relating to a temporal dimension, e.g. time-based feature extraction; Pattern tracking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/74Image or video pattern matching; Proximity measures in feature spaces
    • G06V10/75Organisation of the matching processes, e.g. simultaneous or sequential comparisons of image or video features; Coarse-fine approaches, e.g. multi-scale approaches; using context analysis; Selection of dictionaries
    • G06V10/757Matching configurations of points or features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/07Target detection

Definitions

  • This patent document pertains generally to tools (systems, apparatuses, methodologies, computer program products, etc.) for human driver modeling, trajectory generation and motion planning, vehicle control systems, autonomous driving systems, and autonomous driving simulation systems, and more particularly, but not by way of limitation, to a system and method for aerial video traffic analysis.
  • The control system of autonomous vehicles can sometimes be configured using a simulated human driver environment.
  • The simulated human driver environment attempts to model the typical driving behavior of human drivers in various driving environments.
  • The simulated human driver environment may be built from information gathered by the sensors and cameras on the autonomous vehicle or related probe vehicles. Because this information, including images from the vehicle cameras, can be subject to image occlusion and unreliable image reconstruction accuracy, the utility and effectiveness of the simulated human driver environment are degraded. The image occlusion problem is further complicated by shadows cast by both the vehicles themselves and overhead objects such as trees, buildings, construction equipment, and the like. Classic color-based methods for shadow detection or removal do not work in this case because of the diversity of vehicle types and colors. Because of these problems with the image data gathered for configuring the simulated human driver environment, the effectiveness of autonomous vehicle control systems based on the degraded simulated human driver environment can be compromised.
  • A system and method for aerial video traffic analysis includes the task of extracting traffic information, including the shape, heading, and trajectories of ground vehicles, from aerial videos captured by aerial vehicles (e.g., UAVs) positioned directly above a road surface at a desired geographical location and altitude.
  • Aerial video is an inexpensive way to collect traffic information.
  • Aerial video traffic analysis as disclosed herein can provide important insights into human driving behaviors in real-world traffic environments and conditions. These human driving behavior insights can be used to train a human driving behavior model, which can be used with a simulation environment for configuring autonomous vehicle control systems.
  • Solutions are presented for accomplishing aerial video traffic analysis by combining classic and deep computer vision methods with a specially tailored deep learning model.
  • The example embodiments disclosed herein can achieve pixel-level accuracy in most conditions.
  • The example embodiments also solve another challenging problem caused by the diversity of vehicles in typical traffic environments.
  • The example embodiments disclosed herein can recognize all types of vehicles, from tiny ones like motorcycles to huge ones like car carrier trailers.
  • The disclosed example embodiments are insensitive to the size of vehicles, making the various embodiments suitable for all types of vehicles.
  • FIG. 1 is an operational flow diagram illustrating an example embodiment of a system and method for traffic data collection using unmanned aerial vehicles (UAVs);
  • FIGS. 2 through 5 illustrate an example scenario wherein a UAV is configured with a camera and positioned at a certain location to be monitored at an elevated position to record video of the traffic activity at the location within the UAV's field of vision;
  • FIG. 6 is an operational flow diagram illustrating an example embodiment of a system and method for training the vehicle segmentation module of the example embodiment;
  • FIG. 7 illustrates the components of a human driver model system of an example embodiment;
  • FIG. 8 is a process flow diagram illustrating an example embodiment of a system and method for traffic data collection using UAVs; and
  • FIG. 9 shows a diagrammatic representation of a machine in the example form of a computer system within which a set of instructions, when executed, may cause the machine to perform any one or more of the methodologies discussed herein.
  • FIG. 1 is an operational flow diagram illustrating an example embodiment of a system and method for traffic data collection using unmanned aerial vehicles (UAVs).
  • FIGS. 2 through 5 illustrate an example scenario of the operations shown in FIG. 1 , wherein a UAV is configured with a camera and positioned at a certain location to be monitored at an elevated position to record video of the traffic activity at the location within the UAV's field of vision.
  • a system and process for aerial video traffic analysis in an example embodiment starts with collecting aerial video image data taken by UAVs 202 that fly directly above a certain location to be monitored (e.g., expressways).
  • an example scenario shows a sample image captured by UAV 202 with a camera positioned at a certain location to be monitored at an elevated position to record video image data of the traffic activity at the location 204 within the UAV's field of vision.
  • the system and method of an example embodiment provides traffic data collection using modern UAVs, which create a bird's-eye (elevated) view and provide accurate data related to traffic activity in view of the UAV.
  • Modern UAVs 202 can hover or move at an elevated position and collect data about a location with a high degree of stability, even in weather conditions that would otherwise be unsuitable for data collection. With a high-definition, stabilized camera mounted on a UAV 202 , data of unprecedented quality can be collected.
  • The data collected reflects realistic, real-world traffic information related to the location being monitored. Additionally, the UAV's presence does not interfere with the traffic activity the UAV is viewing, which contrasts with data collection methods currently in practice. Further, data collection using UAVs 202 eliminates occlusion problems caused by obstructions in the camera's field of view. The lack of occlusion is crucial for the efficient and high-fidelity image data processing performed after the data is collected. Finally, inexpensive consumer-grade UAVs 202 are sufficient to fulfill most image data collection tasks.
  • the UAV 202 can collect unobstructed video image data from the monitored location 204 .
  • the collected video image data can include images of roadways, traffic flows, and vehicles or other objects in the field of view over a pre-determined time period.
  • the activity and behavior of the vehicles and objects at the location 204 can thereby be recorded and later analyzed and processed for inclusion into a human driver model.
  • the video captured by the UAV 202 is unobstructed and thus provides a consistently clear aerial view of the monitored location 204 , which provides more accurate and useful data for the human driver model.
  • the elevated position of the UAV 202 enables better video capture, which results in better modeling and simulation. Additionally, the use of UAVs can be done with less expense and without interference with the environment as compared with the traditional systems where cameras are mounted on a probe vehicle or mounted at a fixed ground-based location.
  • the UAVs 202 should ideally remain stationary when recording the video image data, but a small amount of drift is tolerable. Nevertheless, the example embodiment provides a clipping and stabilization operation (operation block 110 , shown in FIG. 1 ) to correct for errant image data.
  • the clipping and stabilization operation is performed on the video image data to completely remove any drift in the field of view. Clipping removes any part of the video image data in which the UAV 202 moves erratically. Stabilization aligns the background surface of all video frames to that of a chosen reference frame. In a particular embodiment, a Harris corner detector can be used to select keypoints on the reference frame.
  • Harris corner detection is a well-known process used within computer vision systems to extract certain kinds of features and to infer the contents of the image.
  • the example embodiment can apply a pyramidal Lucas-Kanade sparse optical flow process to find keypoints corresponding to points in each video frame.
  • the Lucas-Kanade method is a widely used differential method for optical flow estimation developed by Bruce D. Lucas and Takeo Kanade. By combining information from several nearby pixels, the Lucas-Kanade method can often resolve the inherent ambiguity of the optical flow equation.
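The ambiguity mentioned above can be made concrete with the standard textbook formulation (the notation here is conventional and is not taken from this document). A single pixel yields one equation in two unknowns; pooling a window of n nearby pixels, under the assumption that they share the same flow, yields an overdetermined system that can be solved by least squares:

```latex
% Optical flow (brightness constancy) equation at one pixel:
% one equation, two unknowns (u, v)
I_x u + I_y v + I_t = 0

% Lucas-Kanade: assume (u, v) is constant over a window of pixels q_1, ..., q_n
A \begin{bmatrix} u \\ v \end{bmatrix} = b, \qquad
A = \begin{bmatrix} I_x(q_1) & I_y(q_1) \\ \vdots & \vdots \\ I_x(q_n) & I_y(q_n) \end{bmatrix}, \qquad
b = -\begin{bmatrix} I_t(q_1) \\ \vdots \\ I_t(q_n) \end{bmatrix}

% Least-squares solution
\begin{bmatrix} u \\ v \end{bmatrix} = (A^\top A)^{-1} A^\top b
```

The matrix A^T A is invertible only where the window contains gradients in more than one direction, which is one reason corner-like keypoints are a natural choice for tracking.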
  • the example embodiment can use a random sample consensus (RANSAC) method to solve for a perspective transformation matrix that embodies the alignment of each video frame with the reference frame.
  • RANSAC is an iterative method to estimate parameters of a mathematical model from a set of observed data that contains outliers, when outliers are to be accorded no influence on the values of the estimates.
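As a rough illustration of how RANSAC can recover a perspective transformation from keypoint correspondences that include outliers, the following NumPy sketch fits a homography with the direct linear transform on random four-point samples and keeps the hypothesis with the most inliers. The function names, the iteration count, and the pixel tolerance are assumptions made for this example; the document does not specify an implementation.

```python
import numpy as np

def fit_homography(src, dst):
    """Direct linear transform: 3x3 H mapping src (N, 2) to dst (N, 2), N >= 4."""
    rows = []
    for (x, y), (u, v) in zip(src, dst):
        rows.append([-x, -y, -1, 0, 0, 0, u * x, u * y, u])
        rows.append([0, 0, 0, -x, -y, -1, v * x, v * y, v])
    # The solution is the right singular vector with the smallest singular value.
    _, _, vt = np.linalg.svd(np.asarray(rows, dtype=float))
    h = vt[-1].reshape(3, 3)
    if abs(h[2, 2]) < 1e-12:
        return None  # degenerate sample; the caller skips it
    return h / h[2, 2]

def project(h, pts):
    """Apply homography h to an (N, 2) array of points."""
    homog = np.hstack([pts, np.ones((len(pts), 1))]) @ h.T
    return homog[:, :2] / homog[:, 2:3]

def ransac_homography(src, dst, n_iters=200, tol=1.0, seed=0):
    """Return (H, inlier_mask). n_iters and tol (pixels) are illustrative defaults."""
    rng = np.random.default_rng(seed)
    best_inliers = np.zeros(len(src), dtype=bool)
    for _ in range(n_iters):
        idx = rng.choice(len(src), size=4, replace=False)
        h = fit_homography(src[idx], dst[idx])
        if h is None:
            continue
        with np.errstate(divide="ignore", invalid="ignore"):
            err = np.linalg.norm(project(h, src) - dst, axis=1)
        inliers = err < tol
        if inliers.sum() > best_inliers.sum():
            best_inliers = inliers
    # Refit on the inliers of the best hypothesis; outliers have no influence.
    return fit_homography(src[best_inliers], dst[best_inliers]), best_inliers
```

In this setting the outliers would typically be keypoints that landed on moving vehicles rather than on the static background, which is why they should not influence the alignment.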
  • the example embodiment can align the background surface of all video frames to that of a chosen reference frame. Stabilization of each frame is performed using the perspective transformation matrix. Segments of the video image data can be removed, if the matrix indicates that the UAV motion is larger than desired. The removal of unsuitable video segments is called clipping.
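One possible way to decide which frames to clip is to measure how far the per-frame perspective transformation matrix moves the corners of the image. The sketch below assumes that criterion; the helper names and the 50-pixel threshold are hypothetical and are not values given in this document.

```python
import numpy as np

def corner_drift(h, width, height):
    """Largest displacement (pixels) of the four frame corners under homography h."""
    corners = np.array([[0, 0], [width, 0], [width, height], [0, height]], dtype=float)
    homog = np.hstack([corners, np.ones((4, 1))]) @ np.asarray(h, dtype=float).T
    moved = homog[:, :2] / homog[:, 2:3]
    return float(np.linalg.norm(moved - corners, axis=1).max())

def frames_to_keep(homographies, width, height, max_drift_px=50.0):
    """True where a frame's drift from the reference frame is acceptable.

    max_drift_px is an assumed threshold for illustration only.
    """
    return [corner_drift(h, width, height) <= max_drift_px for h in homographies]
```

Frames flagged False would be removed (clipped); the remaining frames would be warped with their matrices so that their backgrounds line up with the reference frame.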
  • background extraction can be performed on the video image data to generate a realistic image without any moving vehicles (operation block 115 , shown in FIG. 1 ).
  • Background extraction can be based on a RANSAC-like process in which, for each pixel in the field of view, the dominant color value is inferred from a collection of frames sampled over time from the video. This process tends to filter moving objects (e.g., vehicles) out of the background image because the pixels of the moving objects are not static over the collection of frames. In practice, this background extraction process works very well, generating background images 205 that are almost indistinguishable from real ones, as shown in the example of FIG. 3 .
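The per-pixel idea can be illustrated with a simple stand-in. The sketch below uses a per-pixel temporal median over randomly sampled, already stabilized frames; this is a swapped-in simplification, not the RANSAC-like process the document describes, and the function name and sample count are assumptions.

```python
import numpy as np

def extract_background(frames, n_samples=50, seed=0):
    """Estimate a vehicle-free background from stabilized frames.

    frames: array of shape (T, H, W, 3), dtype uint8.
    Returns an (H, W, 3) uint8 image. A per-pixel median stands in for the
    'dominant color' inference; n_samples is an illustrative default.
    """
    frames = np.asarray(frames)
    rng = np.random.default_rng(seed)
    k = min(n_samples, len(frames))
    idx = np.sort(rng.choice(len(frames), size=k, replace=False))
    # A pixel covered by a moving vehicle in only a minority of the sampled
    # frames keeps its background value under the median.
    return np.median(frames[idx], axis=0).astype(frames.dtype)
```

The median only works when each pixel shows background in more than half of the sampled frames, so slow or stationary traffic would require sampling over a longer time span.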
  • the example embodiment can segment each video frame to identify the locations and shapes of the moving objects captured in the video image frames.
  • This part of the process of an example embodiment is denoted ground vehicle segmentation (operation block 120 , shown in FIG. 1 ).
  • the vehicle segmentation module 183 (shown in FIG. 7 ) can be used for this process.
  • the vehicle segmentation module 183 of an example embodiment can take two inputs: 1) each frame in the video image data, and 2) the corresponding background images extracted in the manner described above.
  • the video image frame can be concatenated with the corresponding background image.
  • the concatenated image data can be processed by a neural network of the vehicle segmentation module 183 .
  • a U-net architecture can be used for the neural network processing.
  • the U-net is a convolutional network architecture for fast and precise segmentation of images.
  • the neural network can output a binary classification of each pixel in the field of view, the binary classification representing whether the pixel is part of a vehicle or not.
  • the training of this neural network is detailed below.
  • the collection of binary classifications of each pixel in the field of view can be used to generate a vehicle segmentation mask, which defines the location and general or rough shape of each vehicle object identified in the video image frames within the field of view.
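The data flow around the neural network can be sketched as follows, leaving the network itself out. The sketch assumes the frame and background are concatenated along the channel axis and that the network emits a per-pixel vehicle probability that is thresholded at 0.5; both details, and the function names, are assumptions made for illustration.

```python
import numpy as np

def build_network_input(frame, background):
    """Stack a frame and its background along the channel axis.

    (H, W, 3) + (H, W, 3) -> (H, W, 6), scaled to [0, 1] as float32.
    """
    if frame.shape != background.shape:
        raise ValueError("frame and background must have the same shape")
    stacked = np.concatenate([frame, background], axis=-1)
    return stacked.astype(np.float32) / 255.0

def to_vehicle_mask(vehicle_prob, threshold=0.5):
    """Per-pixel vehicle probabilities (H, W) -> boolean segmentation mask."""
    return np.asarray(vehicle_prob) >= threshold
```

Giving the network the background alongside the frame lets it compare the two at every pixel, which is presumably what makes the approach robust to shadows and to the wide range of vehicle colors.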
  • the example embodiment can use the vehicle segmentation mask to infer the centroid, heading, and rectangular shape of each vehicle identified by the neural network (operation block 125 , shown in FIG. 1 ).
  • a visual representation 206 of this data is shown in the example of FIG. 4 .
  • This representation is typically more useful than a general vehicle mask, because most vehicles in images captured by a UAV appear rectangular when viewed from directly above.
  • the example embodiment first removes noisy points in the segmentation results produced by the vehicle segmentation module 183 . Then, the remaining connected pixel components corresponding to each vehicle can be used to represent the shape of the vehicle identified in the image data. The center-of-mass of the connected components corresponding to the vehicle can be used as the centroid of the vehicle.
  • the heading of the vehicle can be determined by solving for the eigenvectors of a centered covariance matrix corresponding to the connected components of the vehicle. As a result, the example embodiment can generate the direction along which the variance of the shape as a distribution is maximized. This direction corresponds to the heading of the vehicle associated with the shape distribution.
  • the rectangular shape of the vehicle is inferred by taking percentiles of the shape projected along and perpendicular to the heading direction. In this manner, geometric information of each vehicle in each video frame can be extracted. Similarly, the centroid, heading, and rectangular shape of each identified vehicle can be determined as described above.
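The centroid, heading, and rectangle steps described above can be sketched for a single connected component as follows. The function name and the 2nd/98th percentile defaults are assumptions; the document does not state which percentiles are used.

```python
import numpy as np

def vehicle_geometry(component_mask, lo=2.0, hi=98.0):
    """Centroid, heading, and oriented rectangle for one connected component.

    component_mask: boolean (H, W) array, True on the pixels of one vehicle.
    Returns (centroid_xy, heading_unit_vector, length, width) in pixels.
    lo/hi percentiles are illustrative defaults.
    """
    ys, xs = np.nonzero(component_mask)
    pts = np.stack([xs, ys], axis=1).astype(float)
    centroid = pts.mean(axis=0)                 # center of mass
    centered = pts - centroid
    cov = centered.T @ centered / len(pts)      # centered covariance matrix
    _, eigvecs = np.linalg.eigh(cov)            # eigenvalues in ascending order
    heading = eigvecs[:, -1]                    # direction of maximum variance
    normal = np.array([-heading[1], heading[0]])
    along = centered @ heading                  # projection along the heading
    across = centered @ normal                  # projection perpendicular to it
    length = np.percentile(along, hi) - np.percentile(along, lo)
    width = np.percentile(across, hi) - np.percentile(across, lo)
    return centroid, heading, length, width
```

An eigenvector defines an axis rather than a direction, so the sign of the heading is ambiguous by 180 degrees; one plausible way to resolve it, not stated in the document, is to use the direction of motion obtained from tracking. Using percentiles instead of the minimum and maximum makes the rectangle less sensitive to stray mask pixels.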
  • vehicle tracking through a collection of image frames over time can be performed (operation block 130 , shown in FIG. 1 ).
  • the vehicle tracking module 185 (shown in FIG. 7 ) can be used for this process.
  • The vehicle tracking module 185 in the example embodiment can be applied to associate detections of the same vehicle across multiple image frames.
  • A tracking method can be used in which each vehicle detection in a single image frame is associated with at most one vehicle detection in a previous or subsequent image frame. If the image data corresponding to a vehicle detection overlaps in two sequential image frames, the vehicle tracking module 185 can infer that the detections belong to the same vehicle.
  • The vehicle tracking module 185 can follow the same vehicle through multiple image frames and determine a velocity of the vehicle.
  • a visible velocity vector corresponding to the velocity of each vehicle can be generated and added to the video image data.
  • the vehicle tracking module 185 works very well, even for tiny vehicles like motorcycles, as long as the vehicle segmentation is accurate.
  • each instance of the vehicles identified in the input image data 210 can be tagged with a unique identifier to differentiate between the different vehicles and to enable tracking of the same vehicle in different image frames with the same identifier. This tagging process can be used, if needed, to facilitate the identification and tracking of multiple vehicles across multiple image frames.
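The overlap-based association, identifier tagging, and velocity estimate can be sketched in plain Python. This example simplifies each detection to an axis-aligned box (x0, y0, x1, y1), measures overlap with intersection over union, and matches greedily from the best overlap down; these choices, the threshold, and the frame rate and ground-scale parameters are all assumptions for illustration.

```python
from itertools import count

def iou(a, b):
    """Intersection over union of two axis-aligned boxes (x0, y0, x1, y1)."""
    iw = min(a[2], b[2]) - max(a[0], b[0])
    ih = min(a[3], b[3]) - max(a[1], b[1])
    if iw <= 0 or ih <= 0:
        return 0.0
    inter = iw * ih
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)

def track(frames, min_iou=0.1):
    """frames: one list of boxes per frame -> one {vehicle_id: box} dict per frame."""
    next_id = count(1)
    tracks, prev = [], {}
    for boxes in frames:
        # Candidate (overlap, previous id, current index) triples, best first.
        pairs = sorted(
            ((iou(pb, b), pid, j) for pid, pb in prev.items() for j, b in enumerate(boxes)),
            reverse=True,
        )
        current, used_ids, used_boxes = {}, set(), set()
        for score, pid, j in pairs:
            if score < min_iou:
                break
            if pid in used_ids or j in used_boxes:
                continue  # at most one association per detection
            current[pid] = boxes[j]
            used_ids.add(pid)
            used_boxes.add(j)
        for j, b in enumerate(boxes):
            if j not in used_boxes:
                current[next(next_id)] = b  # unmatched detection starts a new track
        tracks.append(current)
        prev = current
    return tracks

def velocity(box_prev, box_curr, fps, meters_per_pixel):
    """Velocity (m/s) from box-center displacement between consecutive frames."""
    cx0, cy0 = (box_prev[0] + box_prev[2]) / 2, (box_prev[1] + box_prev[3]) / 2
    cx1, cy1 = (box_curr[0] + box_curr[2]) / 2, (box_curr[1] + box_curr[3]) / 2
    scale = fps * meters_per_pixel
    return ((cx1 - cx0) * scale, (cy1 - cy0) * scale)
```

Overlap-based matching relies on vehicles moving less than their own length between frames, which is plausible for high frame rate aerial video but would break down if frames were heavily subsampled.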
  • an output and visualization representation 207 of the vehicle data for the identified vehicles can be generated as shown (operation block 135 , shown in FIG. 1 ).
  • the output and visualization representation 207 can include a combination of the background image and the images of each identified vehicle with visual bounding boxes and velocity vectors, if desired.
  • the data corresponding to the output and visualization representation 207 can be used by the human driver model system 201 , as described in more detail below, to build a model for representing typical driving behaviors in the environment imaged by the UAV.
  • FIG. 6 is an operational flow diagram illustrating an example embodiment of a system and method for training the vehicle segmentation module 183 of the example embodiment.
  • the only module in the aerial video analysis and processing pipeline that requires training is the vehicle segmentation module 183 .
  • the training of the vehicle segmentation module 183 can be performed in an offline training process as described in detail below in connection with FIG. 6 .
  • the offline training process includes collecting and labeling a training image dataset.
  • a UAV is configured with a camera and positioned at a certain location to be monitored at an elevated position to record video of the traffic activity at the location within the UAV's field of vision.
  • the method for training the vehicle segmentation module 183 of the example embodiment starts with collecting aerial video image data taken by UAVs 202 that fly directly above a certain location to be monitored (e.g., expressways). The data collected by UAVs 202 reflects truly realistic real-world traffic information related to the location being monitored.
  • the UAV 202 can collect unobstructed video image data from the monitored location 204 .
  • the collected video image data can include images of roadways, traffic flows, and vehicles or other objects in the field of view over a pre-determined time period.
  • the activity and behavior of the vehicles and objects at the location 204 can thereby be recorded and later used to train the vehicle segmentation module 183 to accurately recognize vehicle objects in the image data.
  • the UAV 202 should ideally remain stationary when recording the video image data, but a small amount of drift is tolerable. Nevertheless, the example embodiment provides an offline clipping and stabilization operation (operation block 610 , shown in FIG. 6 ) to correct for errant image data.
  • the offline clipping and stabilization operation 610 is performed on the video image data to completely remove any drift in the field of view. Clipping removes any part of the video image data in which the UAV 202 moves erratically. Stabilization aligns the background surface of all video frames to that of a chosen reference frame. As described above for a particular embodiment, a Harris corner detector can be used to select keypoints on the reference frame.
  • the example embodiment can apply a pyramidal Lucas-Kanade sparse optical flow process to find keypoints corresponding to points in each video frame. Additionally, the example embodiment can use a random sample consensus (RANSAC) method to solve for a perspective transformation matrix that embodies the alignment of each video frame with the reference frame. As a result, the example embodiment can align the background surface of all video frames to that of a chosen reference frame. Stabilization of each frame is performed using the perspective transformation matrix. Segments of the video image data can be removed, if the matrix indicates that the UAV motion is larger than desired. The removal of unsuitable video segments is called clipping.
  • an offline background extraction operation 615 can be performed on the video image data to generate a realistic image without any moving vehicles.
  • background extraction can be based on a RANSAC-like process, in which, for each pixel in the field of view, the dominant color value is inferred from a collection of frames sampled over time from the video.
  • the example embodiment can store the generated data in a segmentation training dataset 630 retained in a data storage device and used for training the neural network of the vehicle segmentation module 183 .
  • frames of the clipped and stabilized aerial video image data can be randomly sampled in operation 620 and passed to a manual image labeling process 625 .
  • the manual image labeling process 625 can include presenting the sampled image frames to human labelers or offline automated processes for manual segmentation labeling of the sampled image frames. During the manual segmentation labeling process, human labelers can draw the shapes of all vehicles in the frames.
  • the purpose of the manual image labeling process 625 is to provide a ground truth dataset with which the vehicle segmentation module 183 can be trained.
  • the manual segmentation labeling data generated by the manual image labeling process 625 can be stored in the segmentation training dataset 630 retained in the data storage device.
  • the sampled image frames, their corresponding background image frames, and the segmentation labelling are collected as segmentation training dataset 630 and retained for neural network training.
  • the neural network of the vehicle segmentation module 183 can be a common neural network architecture, such as the U-net architecture described above.
  • the neural network of the vehicle segmentation module 183 can be trained using the video image frames, the corresponding background image frames, and the manual segmentation labelling as input from the segmentation training dataset 630.
  • the segmentation training dataset 630 can be used to configure parameters in the vehicle segmentation module 183 to cause the vehicle segmentation module 183 to accurately identify vehicle objects in one or more video image frames provided by UAVs 202.
  • the vehicle segmentation module 183 can be trained to output accurate vehicle segmentation labelling and serve as an effective vehicle segmentation model 640, which is highly useful to support the aerial video traffic analysis system described herein.
  • a system of an example embodiment can provide aerial video traffic analysis.
  • the example embodiment can include a corresponding method, which can be configured to:
  • the human driver model system 201 can receive high definition image data and other sensor data (e.g., traffic or vehicle image data 210 ) from a UAV positioned above a particular roadway (e.g., monitored location) being monitored.
  • the image data collected by the UAV reflects truly realistic, real-world traffic information related to the location being monitored.
  • the traffic or vehicle image data 210 can be wirelessly (or otherwise) transferred to a data processor 171 of a standard computing system, upon which a human driver model module 175 and/or an image processing module 173 can be executed.
  • the traffic or vehicle image data 210 can be stored in a memory device on the UAV and transferred later to the data processor 171.
  • the processing performed by the human driver model module 175 of an example embodiment is described in more detail below.
  • the traffic or vehicle image data 210 provided by the deployed UAV can be received and processed by the image processing module 173, which can also be executed by the data processor 171.
  • the image processing module 173 can perform clipping, stabilization, background extraction, object/vehicle segmentation, vehicle centroid, heading, and shape inference processing, vehicle tracking, output and visualization generation, and other image processing functions to isolate vehicle or object presence and activity in the received images.
  • the human driver model module 175 can use the information related to these real-world vehicles or objects to create corresponding simulations of vehicles or objects in the human driver model.
  • Parameter values retained in a vehicle segmentation and human driver model parameter dataset 174 stored in a memory 172 can be used to configure the operation of the human driver model module 175 .
  • the elevated position of the UAV above the location being monitored and the stabilized high definition camera on the UAV provide a highly valuable and useful image and data feed for use by the human driver model module 175.
  • data corresponding to predicted or simulated driver behaviors 220 can be produced and provided to a user or other system components.
  • the predicted or simulated driver behavior data 220 can be provided to a system component used to create a virtual world where a control system for an autonomous vehicle can be trained and improved.
  • the virtual world is configured to be identical (as possible) to the real world where vehicles are operated by human drivers.
  • the simulated driver behavior data is indirectly useful for configuring the control system for the autonomous vehicle.
  • the human driver model system 201 and the traffic or vehicle image data 210 described and claimed herein can be implemented, configured, processed, and used in a variety of other applications and systems as well.
  • a basic human driver model may be used to simulate or predict the behavior of an autonomous vehicle with a simulated driver in a simulation scenario.
  • the basic human driver model represents a virtual world configured to be identical (as possible) to the real world where vehicles are operated by human drivers.
  • the virtual world can be used to train and improve a control system for an autonomous vehicle.
  • the simulation can be indirectly useful for configuring the control systems in autonomous vehicles.
  • Such human driver models can be parameterized models, which may be configured using either real-world input or randomized variables.
  • the basic human driver model may simulate the typical and atypical driver behaviors, such as steering or heading control, speed or throttle control, and stopping or brake control.
  • the basic human driver model may use, for example, sensory-motor transport delay, dynamic capabilities, and preferred driving behaviors.
  • the human driver model may include modeling of the transport time delay between a stimulus and the simulated driver's control response. In some implementations, this delay may represent the time necessary for the driver to sense a stimulus, process it, determine the best corrective action, and respond.
  • the human driver model may also include a speed control model with an absolute maximum vehicle speed (e.g., the maximum speed of the vehicle, the speed a driver is not comfortable exceeding, etc.) and a cornering aggressiveness measure to reduce the speed based on the turning radius. In the example, this may replicate the tendency of drivers to slow down through a turn. In the example, once the turning radius drops below the cornering threshold in the scenario, the speed may be reduced in proportion to the tightness of the turn.
  • the human driver model can be configured to simulate more than the typical driving behaviors.
  • the human driver model needs data concerning typical driving behaviors, which represent average people, while atypical driving behaviors are equally needed.
  • the simulation system of the various example embodiments includes data related to the driving behaviors of impolite and impatient drivers in the virtual world.
  • the human driver model can be configured with data representing driving behaviors as varied as possible.
  • the dynamics of how a human may respond to stimuli may be included in the human driver model, which may include, for example, a metric of how aggressively the driver brakes and accelerates.
  • an aggressive driver may be modeled as one who applies very high control inputs to achieve the desired vehicle speeds, while a conservative driver may use more gradual control inputs. In some implementations, this may be modelled using parameterized values, with the input being controlled to the desired value. In some implementations, by adjusting the parameterized values, the aggressiveness of the simulated driver may be increased or decreased.
  • a flow diagram illustrates an example embodiment of a system and method 1000 for aerial video traffic analysis.
  • the example embodiment can be configured to: receive a captured video image sequence from an unmanned aerial vehicle (UAV) (processing block 1010); clip the video image sequence by removing unnecessary images (processing block 1020); stabilize the video image sequence by choosing a reference image and adjusting other images to the reference image (processing block 1030); extract a background image of the video image sequence for vehicle segmentation (processing block 1040); perform vehicle segmentation to identify vehicles in the video image sequence on a pixel by pixel basis (processing block 1050); determine a centroid, heading, and rectangular shape of each identified vehicle (processing block 1060); perform vehicle tracking to detect a same identified vehicle in multiple image frames of the video image sequence (processing block 1070); and produce output and visualization of the video image sequence including a combination of the background image and the images of each identified vehicle (processing block 1080).
  • UAV unmanned aerial vehicle
  • FIG. 9 shows a diagrammatic representation of a machine in the example form of a computing system 700 within which a set of instructions when executed and/or processing logic when activated may cause the machine to perform any one or more of the methodologies described and/or claimed herein.
  • the machine operates as a standalone device or may be connected (e.g., networked) to other machines.
  • the machine may operate in the capacity of a server or a client machine in server-client network environment, or as a peer machine in a peer-to-peer (or distributed) network environment.
  • the machine may be a personal computer (PC), a laptop computer, a tablet computing system, a Personal Digital Assistant (PDA), a cellular telephone, a smartphone, a web appliance, a set-top box (STB), a network router, switch or bridge, or any machine capable of executing a set of instructions (sequential or otherwise) or activating processing logic that specify actions to be taken by that machine.
  • PC personal computer
  • PDA Personal Digital Assistant
  • STB set-top box
  • network router switch or bridge
  • the example computing system 700 can include a data processor 702 (e.g., a System-on-a-Chip (SoC), general processing core, graphics core, and optionally other processing logic) and a memory 704, which can communicate with each other via a bus or other data transfer system 706.
  • the mobile computing and/or communication system 700 may further include various input/output (I/O) devices and/or interfaces 710, such as a touchscreen display, an audio jack, a voice interface, and optionally a network interface 712.
  • I/O input/output
  • the network interface 712 can include one or more radio transceivers configured for compatibility with any one or more standard wireless and/or cellular protocols or access technologies (e.g., 2nd (2G), 2.5, 3rd (3G), 4th (4G) generation, and future generation radio access for cellular systems, Global System for Mobile communication (GSM), General Packet Radio Services (GPRS), Enhanced Data GSM Environment (EDGE), Wideband Code Division Multiple Access (WCDMA), LTE, CDMA2000, WLAN, Wireless Router (WR) mesh, and the like).
  • GSM Global System for Mobile communication
  • GPRS General Packet Radio Services
  • EDGE Enhanced Data GSM Environment
  • WCDMA Wideband Code Division Multiple Access
  • LTE Long Term Evolution
  • CDMA2000 Code Division Multiple Access 2000
  • WLAN Wireless Local Area Network
  • Network interface 712 may also be configured for use with various other wired and/or wireless communication protocols, including TCP/IP, UDP, SIP, SMS, RTP, WAP, CDMA, TDMA, UMTS, UWB, WiFi, WiMax, BluetoothTM, IEEE 802.11x, and the like.
  • network interface 712 may include or support virtually any wired and/or wireless communication and data processing mechanisms by which information/data may travel between a computing system 700 and another computing or communication system via network 714.
  • the memory 704 can represent a machine-readable medium on which is stored one or more sets of instructions, software, firmware, or other processing logic (e.g., logic 708) embodying any one or more of the methodologies or functions described and/or claimed herein.
  • the logic 708 may also reside, completely or at least partially, within the processor 702 during execution thereof by the mobile computing and/or communication system 700.
  • the memory 704 and the processor 702 may also constitute machine-readable media.
  • the logic 708, or a portion thereof, may also be configured as processing logic or logic, at least a portion of which is partially implemented in hardware.
  • the logic 708, or a portion thereof, may further be transmitted or received over a network 714 via the network interface 712.
  • machine-readable medium of an example embodiment can be a single medium
  • the term “machine-readable medium” should be taken to include a single non-transitory medium or multiple non-transitory media (e.g., a centralized or distributed database, and/or associated caches and computing systems) that store the one or more sets of instructions.
  • the term “machine-readable medium” can also be taken to include any non-transitory medium that is capable of storing, encoding or carrying a set of instructions for execution by the machine and that cause the machine to perform any one or more of the methodologies of the various embodiments, or that is capable of storing, encoding or carrying data structures utilized by or associated with such a set of instructions.
  • the term “machine-readable medium” can accordingly be taken to include, but not be limited to, solid-state memories, optical media, and magnetic media.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Artificial Intelligence (AREA)
  • Remote Sensing (AREA)
  • Medical Informatics (AREA)
  • Data Mining & Analysis (AREA)
  • Analytical Chemistry (AREA)
  • Software Systems (AREA)
  • Chemical & Material Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Aviation & Aerospace Engineering (AREA)
  • Astronomy & Astrophysics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Traffic Control Systems (AREA)
  • Image Analysis (AREA)

Abstract

A system and method for aerial video traffic analysis are disclosed. A particular embodiment is configured to: receive a captured video image sequence from an unmanned aerial vehicle (UAV); clip the video image sequence by removing unnecessary images; stabilize the video image sequence by choosing a reference image and adjusting other images to the reference image; extract a background image of the video image sequence for vehicle segmentation; perform vehicle segmentation to identify vehicles in the video image sequence on a pixel by pixel basis; determine a centroid, heading, and rectangular shape of each identified vehicle; perform vehicle tracking to detect a same identified vehicle in multiple image frames of the video image sequence; and produce output and visualization of the video image sequence including a combination of the background image and the images of each identified vehicle.

Description

    COPYRIGHT NOTICE
  • A portion of the disclosure of this patent document contains material that is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the U.S. Patent and Trademark Office patent files or records, but otherwise reserves all copyright rights whatsoever. The following notice applies to the disclosure herein and to the drawings that form a part of this document: Copyright 2016-2017, TuSimple, All Rights Reserved.
  • TECHNICAL FIELD
  • This patent document pertains generally to tools (systems, apparatuses, methodologies, computer program products, etc.) for human driver modeling, trajectory generation and motion planning, vehicle control systems, autonomous driving systems, and autonomous driving simulation systems, and more particularly, but not by way of limitation, to a system and method for aerial video traffic analysis.
  • BACKGROUND
  • The control system of autonomous vehicles can sometimes be configured using a simulated human driver environment. The simulated human driver environment attempts to model the typical driving behavior of human drivers in various driving environments. However, the simulated human driver environment may be built based on the information gathered from the sensors and cameras on the autonomous vehicle or related probe vehicles. Because this information, including images from the vehicle cameras, can be subject to image occlusion and unreliable image reconstruction accuracy, the utility and effectiveness of the simulated human driver environment is degraded. Additionally, the image occlusion problem is further complicated by shadows cast by both the vehicles themselves and overhead objects such as trees, buildings, construction equipment, and the like. Classic color-based methods for shadow detection or removal do not work in this case because of the diversity of vehicle types and colors. Because of these problems with the image data gathered for configuring the simulated human driver environment, the effectiveness of the control systems of autonomous vehicles based on the degraded simulated human driver environment can be compromised.
  • SUMMARY
  • A system and method for aerial video traffic analysis is disclosed herein. Aerial video traffic analysis includes the task of extracting traffic information, including the shape, heading, and trajectories of ground vehicles, from aerial videos captured by aerial vehicles (e.g., UAVs) positioned directly above a road surface at a desired geographical location and altitude. Aerial video is an inexpensive way to collect traffic information. Aerial video traffic analysis as disclosed herein can provide important insights into human driving behaviors in real-world traffic environments and conditions. These human driving behavior insights can be used to train a human driving behavior model, which can be used with a simulation environment for configuring autonomous vehicle control systems. In the various example embodiments disclosed herein, solutions are presented for accomplishing aerial video traffic analysis by combining classic and deep computer vision methods with a specially tailored deep learning model. The example embodiments disclosed herein can achieve pixel-level accuracy in most conditions. The example embodiments also solve another challenging problem caused by the diversity of vehicles in typical traffic environments. The example embodiments disclosed herein can recognize all types of vehicles from tiny ones like motorcycles to huge ones like car carrier trailers. The disclosed example embodiments are insensitive to the size of vehicles, making the various embodiments suitable for all types of vehicles.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • The various embodiments are illustrated by way of example, and not by way of limitation, in the figures of the accompanying drawings in which:
  • FIG. 1 is an operational flow diagram illustrating an example embodiment of a system and method for traffic data collection using unmanned aerial vehicles (UAVs);
  • FIGS. 2 through 5 illustrate an example scenario wherein a UAV is configured with a camera and positioned at a certain location to be monitored at an elevated position to record video of the traffic activity at the location within the UAV's field of vision;
  • FIG. 6 is an operational flow diagram illustrating an example embodiment of a system and method for training the vehicle segmentation module of the example embodiment;
  • FIG. 7 illustrates the components of a human driver model system of an example embodiment;
  • FIG. 8 is a process flow diagram illustrating an example embodiment of a system and method for traffic data collection using UAVs; and
  • FIG. 9 shows a diagrammatic representation of a machine in the example form of a computer system within which a set of instructions when executed may cause the machine to perform any one or more of the methodologies discussed herein.
  • DETAILED DESCRIPTION
  • In the following description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the various embodiments. It will be evident, however, to one of ordinary skill in the art that the various embodiments may be practiced without these specific details.
  • FIG. 1 is an operational flow diagram illustrating an example embodiment of a system and method for traffic data collection using unmanned aerial vehicles (UAVs). FIGS. 2 through 5 illustrate an example scenario of the operations shown in FIG. 1, wherein a UAV is configured with a camera and positioned at a certain location to be monitored at an elevated position to record video of the traffic activity at the location within the UAV's field of vision. Referring now to FIG. 1, a system and process for aerial video traffic analysis in an example embodiment starts with collecting aerial video image data taken by UAVs 202 that fly directly above a certain location to be monitored (e.g., expressways).
  • Referring now to FIG. 2, an example scenario shows a sample image captured by UAV 202 with a camera positioned at a certain location to be monitored at an elevated position to record video image data of the traffic activity at the location 204 within the UAV's field of vision. The system and method of an example embodiment provides traffic data collection using modern UAVs, which create a bird's-eye (elevated) view and provide accurate data related to traffic activity in view of the UAV. Modern UAVs 202 are able to hover or move in the sky at an elevated position to collect data related to a location with a high degree of stability regardless of weather conditions that may be inappropriate for data collection. With a high definition and stabilized camera configured on a UAV 202, data with unprecedented high quality can be collected. The data collected reflects truly realistic real-world traffic information related to the location being monitored. Additionally, the UAVs' presence does not interfere with the traffic activity the UAV is viewing, which is in contrast to any data collection method currently in practice. Further, data collection using UAVs 202 eliminates occlusion problems caused by obstructions in the camera's field of view. The lack of occlusion is crucial for the efficient and high fidelity image data processing performed after the data is collected. Finally, average inexpensive consumer UAVs 202 are sufficient to fulfill most image data collection tasks.
  • Referring still to FIG. 2, the UAV 202 can collect unobstructed video image data from the monitored location 204. As a result, the collected video image data can include images of roadways, traffic flows, and vehicles or other objects in the field of view over a pre-determined time period. The activity and behavior of the vehicles and objects at the location 204 can thereby be recorded and later analyzed and processed for inclusion into a human driver model. The video captured by the UAV 202 is unobstructed and thus provides a consistently clear aerial view of the monitored location 204, which provides more accurate and useful data for the human driver model. The elevated position of the UAV 202 enables better video capture, which results in better modeling and simulation. Additionally, the use of UAVs can be done with less expense and without interference with the environment as compared with the traditional systems where cameras are mounted on a probe vehicle or mounted at a fixed ground-based location.
  • Referring still to FIGS. 1 and 2, the UAVs 202 should ideally remain stationary when recording the video image data, but a small amount of drift is tolerable. Nevertheless, the example embodiment provides a clipping and stabilization operation (operation block 110, shown in FIG. 1) to correct for errant image data. The clipping and stabilization operation is performed on the video image data to completely remove any drift in the field of view. Clipping removes any part of the video image data in which the UAV 202 moves erratically. Stabilization aligns the background surface of all video frames to that of a chosen reference frame. In a particular embodiment, a Harris corner detector can be used to select keypoints on the reference frame. Harris corner detection is a well-known process used within computer vision systems to extract certain kinds of features and to infer the contents of the image. Next, the example embodiment can apply a pyramidal Lucas-Kanade sparse optical flow process to find keypoints corresponding to points in each video frame. In computer vision, the Lucas-Kanade method is a widely used differential method for optical flow estimation developed by Bruce D. Lucas and Takeo Kanade. By combining information from several nearby pixels, the Lucas-Kanade method can often resolve the inherent ambiguity of the optical flow equation. Additionally, the example embodiment can use a random sample consensus (RANSAC) method to solve for a perspective transformation matrix that embodies the alignment of each video frame with the reference frame. RANSAC is an iterative method to estimate parameters of a mathematical model from a set of observed data that contains outliers, when outliers are to be accorded no influence on the values of the estimates. As a result, the example embodiment can align the background surface of all video frames to that of a chosen reference frame. Stabilization of each frame is performed using the perspective transformation matrix. Segments of the video image data can be removed, if the matrix indicates that the UAV motion is larger than desired. The removal of unsuitable video segments is called clipping.
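  • The clipping decision described above can be illustrated with a minimal Python sketch. It assumes that a 3x3 perspective transformation matrix has already been estimated for each frame (for example with the Harris, Lucas-Kanade, and RANSAC steps named above, which a computer vision library would normally supply). The function names, the corner-displacement measure, and the `max_shift_px` threshold are hypothetical choices made for illustration; the source does not specify how "larger than desired" is measured.

```python
import math

def apply_homography(H, x, y):
    """Map point (x, y) through a 3x3 perspective matrix H (nested lists)."""
    w = H[2][0] * x + H[2][1] * y + H[2][2]
    return ((H[0][0] * x + H[0][1] * y + H[0][2]) / w,
            (H[1][0] * x + H[1][1] * y + H[1][2]) / w)

def max_corner_shift(H, width, height):
    """Largest displacement (in pixels) of the four image corners under H.

    Assumption: corner displacement is used here as a simple proxy for
    how far the UAV moved relative to the reference frame.
    """
    corners = [(0, 0), (width - 1, 0), (0, height - 1), (width - 1, height - 1)]
    return max(math.dist((x, y), apply_homography(H, x, y)) for x, y in corners)

def frames_to_keep(homographies, width, height, max_shift_px=50.0):
    """Indices of frames whose motion stays within the (hypothetical) limit."""
    return [i for i, H in enumerate(homographies)
            if max_corner_shift(H, width, height) <= max_shift_px]
```

  • With an identity matrix for a steady frame and a 100-pixel translation for an erratic one, `frames_to_keep` would retain the first and drop the second at the default threshold. Frames that are kept would then be warped with their matrices to align them with the reference frame.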
  • Referring now to FIG. 3, before the video image data is sent to the vehicle segmentation module 183 (shown in FIG. 7), background extraction can be performed on the video image data to generate a realistic image without any moving vehicles (operation block 115, shown in FIG. 1). In an example embodiment, background extraction can be based on a RANSAC-like process, in which, for each pixel in the field of view, the dominant color value is inferred from a collection of frames sampled over time from the video. This process tends to filter out moving objects (e.g., vehicles) from the background image because the pixels of the moving objects are not static over the collection of frames. In practice, this background extraction process works very well, generating background images 205 that are almost indistinguishable from real ones as shown in the example of FIG. 3.
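  • The per-pixel idea can be sketched as follows. This is not the RANSAC-like process itself, whose details the source does not give; it substitutes a plain majority vote over the sampled frames, which captures the same intuition that the static background dominates each pixel over time. Frames are assumed to be nested lists of RGB tuples that are already stabilized, and the function name is hypothetical.

```python
from collections import Counter

def extract_background(frames):
    """Return an image whose pixels hold the most frequent color over time.

    frames: list of images, each a list of rows of (r, g, b) tuples.
    Assumption: colors are compared exactly; real footage has sensor
    noise, so colors would likely need quantizing (or a median) first.
    """
    height, width = len(frames[0]), len(frames[0][0])
    background = []
    for y in range(height):
        row = []
        for x in range(width):
            votes = Counter(frame[y][x] for frame in frames)
            row.append(votes.most_common(1)[0][0])  # dominant color
        background.append(row)
    return background
```

  • A vehicle that covers a given pixel in only a minority of the sampled frames is outvoted by the road surface at that pixel, which is why moving objects tend to vanish from the result.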
  • Referring now to FIG. 4, after the background is extracted from each video image as described above, the example embodiment can segment each video frame to identify the locations and shapes of the moving objects captured in the video image frames. This part of the process of an example embodiment is denoted ground vehicle segmentation (operation block 120, shown in FIG. 1). The vehicle segmentation module 183 (shown in FIG. 7) can be used for this process. The vehicle segmentation module 183 of an example embodiment can take two inputs: 1) each frame in the video image data, and 2) the corresponding background images extracted in the manner described above. For each frame in the video image data, the video image frame can be concatenated with the corresponding background image. The concatenated image data can be processed by a neural network of the vehicle segmentation module 183. In one example embodiment, a U-net architecture can be used for the neural network processing. The U-net is a convolutional network architecture for fast and precise segmentation of images. The neural network can output a binary classification of each pixel in the field of view, the binary classification representing whether the pixel is part of a vehicle or not. The training of this neural network is detailed below. The collection of binary classifications of each pixel in the field of view can be used to generate a vehicle segmentation mask, which defines the location and general or rough shape of each vehicle object identified in the video image frames within the field of view.
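  • The input and output handling around the neural network can be sketched as below. The network itself is omitted. The sketch assumes that concatenation means stacking the three color channels of the frame with the three channels of the background image at each pixel, and that the network emits a per-pixel vehicle probability that is thresholded to obtain the binary classification; both the function names and the 0.5 threshold are assumptions, not details taken from the source.

```python
def concatenate_with_background(frame, background):
    """Stack frame and background channels: (r, g, b) + (r, g, b) per pixel."""
    return [[fp + bp for fp, bp in zip(frow, brow)]
            for frow, brow in zip(frame, background)]

def to_mask(probabilities, threshold=0.5):
    """Turn per-pixel vehicle probabilities into a binary segmentation mask.

    Assumption: 1 marks a vehicle pixel, 0 marks background.
    """
    return [[1 if p >= threshold else 0 for p in row] for row in probabilities]
```

  • Feeding the background alongside the frame gives the network a direct reference for what the empty road looks like at each pixel, so a pixel whose six channels differ between the two halves is a natural vehicle candidate.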
  • Referring still to FIG. 4, after the vehicle segmentation mask is generated as described above, the example embodiment can use the vehicle segmentation mask to infer the centroid, heading, and rectangular shape of each vehicle identified by the neural network (operation block 125, shown in FIG. 1). A visual representation 206 of this data is shown in the example of FIG. 4. This representation is typically a better and more useful representation as compared with a general vehicle mask, as most vehicles identified in images captured by a UAV are rectangular when viewed top-down.
  • As part of the process for determining the centroid, heading, and rectangular shape of each identified vehicle, the example embodiment first removes noisy points in the segmentation results produced by the vehicle segmentation module 183. Then, the remaining connected pixel components corresponding to each vehicle can be used to represent the shape of the vehicle identified in the image data. The center-of-mass of the connected components corresponding to the vehicle can be used as the centroid of the vehicle. The heading of the vehicle can be determined by solving for the eigenvectors of a centered covariance matrix corresponding to the connected components of the vehicle. As a result, the example embodiment can generate the direction along which the variance of the shape as a distribution is maximized. This direction corresponds to the heading of the vehicle associated with the shape distribution. The rectangular shape of the vehicle is inferred by taking percentiles of the shape projected along and perpendicular to the heading direction. In this manner, geometric information of each vehicle in each video frame can be extracted. Similarly, the centroid, heading, and rectangular shape of each identified vehicle can be determined as described above.
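  • The geometry inference described above can be sketched in plain Python for a single vehicle. The sketch assumes `pixels` already holds the (x, y) coordinates of one denoised connected component; the function names and the 2nd/98th percentile cut-offs are hypothetical, since the source does not state which percentiles are used. For a 2x2 covariance matrix the principal eigenvector direction has a closed form, which is used here instead of a general eigen-solver.

```python
import math

def percentile(sorted_values, q):
    """Nearest-rank style percentile (q in [0, 100]) of a sorted list."""
    idx = round(q / 100 * (len(sorted_values) - 1))
    return sorted_values[idx]

def vehicle_geometry(pixels, lo=2.0, hi=98.0):
    """Return (centroid, heading_angle, length, width) for one component."""
    n = len(pixels)
    cx = sum(x for x, _ in pixels) / n            # center of mass
    cy = sum(y for _, y in pixels) / n
    sxx = sum((x - cx) ** 2 for x, _ in pixels) / n
    syy = sum((y - cy) ** 2 for _, y in pixels) / n
    sxy = sum((x - cx) * (y - cy) for x, y in pixels) / n
    # Angle of the principal eigenvector of [[sxx, sxy], [sxy, syy]],
    # i.e. the direction along which the variance is maximized.
    theta = 0.5 * math.atan2(2 * sxy, sxx - syy)
    ux, uy = math.cos(theta), math.sin(theta)
    # Project the shape along and perpendicular to the heading.
    along = sorted((x - cx) * ux + (y - cy) * uy for x, y in pixels)
    across = sorted(-(x - cx) * uy + (y - cy) * ux for x, y in pixels)
    length = percentile(along, hi) - percentile(along, lo)
    width = percentile(across, hi) - percentile(across, lo)
    return (cx, cy), theta, length, width
```

  • Taking percentiles rather than the minimum and maximum makes the rectangle less sensitive to stray pixels left in the mask. Note that the covariance only fixes the axis of the vehicle: the angle is ambiguous by 180 degrees, so the direction of travel would have to come from another source such as tracking.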
  • Once the geometric information of each vehicle in each video frame is extracted as described above, vehicle tracking through a collection of image frames over time can be performed (operation block 130, shown in FIG. 1). The vehicle tracking module 185 (shown in FIG. 7) can be used for this process. The vehicle tracking module 185 in the example embodiment can be applied to associate same vehicle detections in multiple image frames. In the example embodiment, a tracking method can be used, in which each vehicle detection in a single image frame can be associated with at most one vehicle detection in a previous or subsequent image frame. If image data corresponding to a vehicle detection overlaps in two sequential image frames, the vehicle tracking module 185 can infer the same vehicle detection in the multiple image frames. In this manner, the vehicle tracking module 185 can follow a same vehicle through multiple image frames and determine a velocity of the vehicle. A visible velocity vector corresponding to the velocity of each vehicle can be generated and added to the video image data. The vehicle tracking module 185, in the example embodiment, works very well, even for tiny vehicles like motorcycles, as long as the vehicle segmentation is accurate. In an alternative embodiment, each instance of the vehicles identified in the input image data 210 (shown in FIG. 7) can be tagged with a unique identifier to differentiate between the different vehicles and to enable tracking of the same vehicle in different image frames with the same identifier. This tagging process can be used, if needed, to facilitate the identification and tracking of multiple vehicles across multiple image frames.
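  • The overlap-based association can be sketched as follows. Each detection is assumed to be a set of (x, y) pixel coordinates, and matches are chosen greedily by largest overlap so that every detection is linked to at most one detection in the neighboring frame. The greedy strategy, the function names, and the units of `dt` are assumptions for illustration; the source only states that overlapping detections in sequential frames are taken to be the same vehicle.

```python
def associate(prev_masks, curr_masks):
    """Return {current_index: previous_index} for overlapping detections.

    prev_masks, curr_masks: lists of sets of (x, y) pixels.
    """
    candidates = []
    for i, prev in enumerate(prev_masks):
        for j, curr in enumerate(curr_masks):
            overlap = len(prev & curr)
            if overlap > 0:
                candidates.append((overlap, i, j))
    candidates.sort(reverse=True)  # largest overlaps are matched first
    matches, used_prev = {}, set()
    for _, i, j in candidates:
        if i not in used_prev and j not in matches:
            matches[j] = i
            used_prev.add(i)
    return matches

def velocity(prev_centroid, curr_centroid, dt):
    """Velocity vector (pixels per unit time) between two matched centroids."""
    return ((curr_centroid[0] - prev_centroid[0]) / dt,
            (curr_centroid[1] - prev_centroid[1]) / dt)
```

  • A detection in the current frame with no overlapping predecessor is left out of the result, which a caller could treat as a vehicle newly entering the field of view. Overlap-based matching relies on the frame rate being high enough that a vehicle moves less than its own length between frames.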
  • Referring now to FIG. 5, after the centroid, heading, rectangular shape, tracking data, and velocity vector for each identified vehicle have been determined or generated as described above, an output and visualization representation 207 of the vehicle data for the identified vehicles can be generated as shown (operation block 135, shown in FIG. 1). The output and visualization representation 207 can include a combination of the background image and the images of each identified vehicle with visual bounding boxes and velocity vectors, if desired. The data corresponding to the output and visualization representation 207 can be used by the human driver model system 201, as described in more detail below, to build a model for representing typical driving behaviors in the environment imaged by the UAV.
  • Training of the Vehicle Segmentation Module in an Example Embodiment
  • FIG. 6 is an operational flow diagram illustrating an example embodiment of a system and method for training the vehicle segmentation module 183 of the example embodiment. In the example embodiment described herein, the only module in the aerial video analysis and processing pipeline that requires training is the vehicle segmentation module 183. The training of the vehicle segmentation module 183 can be performed in an offline training process as described in detail below in connection with FIG. 6.
  • In the offline training process of an example embodiment, in order to train the neural network of vehicle segmentation module 183 that separates vehicle objects from the background image, the offline training process includes collecting and labeling a training image dataset. In an example embodiment, a UAV is configured with a camera and positioned at a certain location to be monitored at an elevated position to record video of the traffic activity at the location within the UAV's field of vision. Referring to FIG. 6, the method for training the vehicle segmentation module 183 of the example embodiment starts with collecting aerial video image data taken by UAVs 202 that fly directly above a certain location to be monitored (e.g., expressways). The data collected by UAVs 202 reflects truly realistic real-world traffic information related to the location being monitored. The UAV 202 can collect unobstructed video image data from the monitored location 204. As a result, the collected video image data can include images of roadways, traffic flows, and vehicles or other objects in the field of view over a pre-determined time period. The activity and behavior of the vehicles and objects at the location 204 can thereby be recorded and later used to train the vehicle segmentation module 183 to accurately recognize vehicle objects in the image data.
  • Referring still to FIG. 6, the UAV 202 should ideally remain stationary when recording the video image data, but a small amount of drift is tolerable. Nevertheless, the example embodiment provides an offline clipping and stabilization operation (operation block 610, shown in FIG. 6) to correct for errant image data. The offline clipping and stabilization operation 610 is performed on the video image data to completely remove any drift in the field of view. Clipping removes any part of the video image data in which the UAV 202 moves erratically. Stabilization aligns the background surface of all video frames to that of a chosen reference frame. As described above for a particular embodiment, a Harris corner detector can be used to select keypoints on the reference frame. The example embodiment can apply a pyramidal Lucas-Kanade sparse optical flow process to find the points in each video frame that correspond to these keypoints. Additionally, the example embodiment can use a random sample consensus (RANSAC) method to solve for a perspective transformation matrix that embodies the alignment of each video frame with the reference frame. Stabilization of each frame is then performed using the perspective transformation matrix. Segments of the video image data can be removed if the matrix indicates that the UAV motion is larger than desired; this removal of unsuitable video segments is the clipping referred to above.
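The clipping decision described above can be sketched as follows, assuming a 3x3 perspective transformation matrix has already been estimated for each frame. The sketch measures how far each matrix moves the frame corners and keeps only runs of frames whose displacement stays under a threshold. The corner-displacement metric, the threshold, and the function names are assumptions for illustration; the disclosure does not state how motion magnitude is measured. Estimating the matrix itself is not shown. One common route, not confirmed by the source, would be OpenCV's `goodFeaturesToTrack`, `calcOpticalFlowPyrLK`, and `findHomography` with the RANSAC flag.

```python
import math


def apply_perspective(H, x, y):
    """Map point (x, y) through a 3x3 perspective matrix H (nested lists)."""
    w = H[2][0] * x + H[2][1] * y + H[2][2]
    return ((H[0][0] * x + H[0][1] * y + H[0][2]) / w,
            (H[1][0] * x + H[1][1] * y + H[1][2]) / w)


def corner_drift(H, width, height):
    """Largest displacement, in pixels, of the four frame corners under H."""
    corners = [(0, 0), (width - 1, 0), (0, height - 1), (width - 1, height - 1)]
    drift = 0.0
    for x, y in corners:
        px, py = apply_perspective(H, x, y)
        drift = max(drift, math.hypot(px - x, py - y))
    return drift


def keep_segments(matrices, width, height, max_drift_px):
    """Return inclusive (start, end) frame ranges whose drift is acceptable.

    matrices: one perspective matrix per frame, relative to the
    reference frame. Frames exceeding max_drift_px are clipped out.
    """
    segments = []
    start = None
    for i, H in enumerate(matrices):
        ok = corner_drift(H, width, height) <= max_drift_px
        if ok and start is None:
            start = i
        elif not ok and start is not None:
            segments.append((start, i - 1))
            start = None
    if start is not None:
        segments.append((start, len(matrices) - 1))
    return segments
```

Measuring displacement at the frame corners captures rotation and perspective change as well as pure translation, since those effects are largest far from the image center.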
  • Referring still to FIG. 6, an offline background extraction operation 615 can be performed on the video image data to generate a realistic image without any moving vehicles. In an example embodiment as described above, background extraction can be based on a RANSAC-like process, in which, for each pixel in the field of view, the dominant color value is inferred from a collection of frames sampled over time from the video.
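The idea of inferring a dominant color per pixel can be illustrated with a simplified sketch. Note the substitution: instead of the RANSAC-like process mentioned above, whose details are not given here, this sketch uses quantized voting. Each sampled color votes for a coarse color bin, and the background value is the mean of the samples that fall in the winning bin. Frames are assumed to be stabilized and represented as nested lists of RGB tuples; `extract_background` and `bin_size` are hypothetical names.

```python
from collections import Counter


def extract_background(frames, bin_size=8):
    """Per-pixel dominant color over frames sampled across time.

    frames: list of images, each a list of rows of (r, g, b) tuples,
    all the same size and already stabilized.
    bin_size: quantization step so similar colors vote together.
    """
    height, width = len(frames[0]), len(frames[0][0])

    def color_bin(color):
        return tuple(channel // bin_size for channel in color)

    background = []
    for y in range(height):
        row = []
        for x in range(width):
            samples = [frame[y][x] for frame in frames]
            votes = Counter(color_bin(s) for s in samples)
            winner = votes.most_common(1)[0][0]
            inliers = [s for s in samples if color_bin(s) == winner]
            # Average the inliers channel by channel.
            row.append(tuple(round(sum(ch) / len(inliers))
                             for ch in zip(*inliers)))
        background.append(row)
    return background
```

Because a moving vehicle covers any given road pixel in only a minority of the sampled frames, the road color wins the vote and the vehicle colors are discarded as outliers.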
  • Referring still to FIG. 6, after the background is extracted from each video image in operation 615 as described above, the example embodiment can store the generated data in a segmentation training dataset 630 retained in a data storage device and used for training the neural network of the vehicle segmentation module 183. Additionally, frames of the clipped and stabilized aerial video image data can be randomly sampled in operation 620 and passed to a manual image labeling process 625. The manual image labeling process 625 can include presenting the sampled image frames to human labelers or offline automated processes for manual segmentation labeling of the sampled image frames. During the manual segmentation labeling process, human labelers can draw the shapes of all vehicles in the frames. The purpose of the manual image labeling process 625 is to provide a ground truth dataset with which the vehicle segmentation module 183 can be trained. The manual segmentation labeling data generated by the manual image labeling process 625 can be stored in the segmentation training dataset 630 retained in the data storage device. Upon completion of the background extraction process 615 and the manual image labeling process 625, the sampled image frames, their corresponding background image frames, and the segmentation labeling are collected as segmentation training dataset 630 and retained for neural network training. In the example embodiment, the neural network of the vehicle segmentation module 183 can be a common neural network architecture, such as the U-net architecture described above. The neural network of the vehicle segmentation module 183 can be trained using the video image frames, the corresponding background image frames, and the manual segmentation labeling as input from the segmentation training dataset 630. 
Using standard neural network training procedures, the segmentation training dataset 630 can be used to configure parameters in the vehicle segmentation module 183 to cause the vehicle segmentation module 183 to accurately identify vehicle objects in one or more video image frames provided by UAVs 202. As a result, the vehicle segmentation module 183 can be trained to output accurate vehicle segmentation labelling and serve as an effective vehicle segmentation model 640, which is highly useful to support the aerial video traffic analysis system described herein.
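How a single training example might be assembled from the segmentation training dataset can be sketched as follows. The sketch assumes the network input is formed by concatenating each frame pixel with the corresponding background pixel along the channel axis (consistent with the concatenation recited in the claims), paired with the manually labeled mask as the target. The function names, the nested-list image layout, and the seeded sampling are illustrative assumptions; the network and training loop are not shown.

```python
import random


def sample_frame_indices(num_frames, num_samples, seed=None):
    """Randomly choose distinct frame indices to send for manual labeling."""
    rng = random.Random(seed)
    return sorted(rng.sample(range(num_frames), num_samples))


def make_training_sample(frame, background, label_mask):
    """Build one (input, target) pair for segmentation training.

    frame, background: images as lists of rows of (r, g, b) tuples.
    label_mask: list of rows of 0/1 values drawn by the labelers.
    The input has six channels per pixel: frame RGB then background RGB.
    """
    if not (len(frame) == len(background) == len(label_mask)):
        raise ValueError("frame, background, and mask heights differ")
    inputs = []
    for f_row, b_row, m_row in zip(frame, background, label_mask):
        if not (len(f_row) == len(b_row) == len(m_row)):
            raise ValueError("frame, background, and mask widths differ")
        inputs.append([tuple(f) + tuple(b) for f, b in zip(f_row, b_row)])
    return inputs, label_mask
```

Supplying the background alongside the frame lets the network compare the two at each pixel, so a vehicle appears as a local difference rather than having to be recognized from appearance alone.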
  • As described above, a system of an example embodiment can provide aerial video traffic analysis. The example embodiment can include a corresponding method, which can be configured to:
      • 1. Receive a sequence of images (e.g., video);
      • 2. Clip the image sequence by removing unnecessary images (e.g., remove the images captured when the UAV takes off, lands, or only captures a part of a target location);
      • 3. Stabilize the image sequence by choosing a reference image and adjusting/calibrating other images to the reference image;
      • 4. Extract a vehicle-free background image of the image sequence, on a pixel by pixel basis, for use in vehicle segmentation;
      • 5. Perform object/vehicle segmentation to identify objects/vehicles in the image sequence on a pixel by pixel basis;
      • 6. Determine the centroid, heading, and rectangular shape of each identified vehicle;
      • 7. Perform vehicle tracking to detect the same identified vehicle in multiple image frames of the image sequence; and
      • 8. Produce output and visualization of the image sequence including a combination of the background image and the images of each identified vehicle with visual bounding boxes and velocity vectors, if desired.
  • Referring now to FIG. 7, an example embodiment disclosed herein can be used in the context of a human driver model system 201 for autonomous vehicles. In one example embodiment, the human driver model system 201 can receive high definition image data and other sensor data (e.g., traffic or vehicle image data 210) from a UAV positioned above a particular roadway (e.g., monitored location) being monitored. The image data collected by the UAV reflects truly realistic, real-world traffic information related to the location being monitored. Using the standard capabilities of well-known UAV's, the traffic or vehicle image data 210 can be wirelessly (or otherwise) transferred to a data processor 171 of a standard computing system, upon which a human driver model module 175 and/or an image processing module 173 can be executed. Alternatively, the traffic or vehicle image data 210 can be stored in a memory device on the UAV and transferred later to the data processor 171. The processing performed by the human driver model module 175 of an example embodiment is described in more detail below. The traffic or vehicle image data 210 provided by the deployed UAV can be received and processed by the image processing module 173, which can also be executed by the data processor 171. As described above, the image processing module 173 can perform clipping, stabilization, background extraction, object/vehicle segmentation, vehicle centroid, heading, and shape inference processing, vehicle tracking, output and visualization generation, and other image processing functions to isolate vehicle or object presence and activity in the received images. The human driver model module 175 can use the information related to these real-world vehicle or objects to create corresponding simulations of vehicles or objects in the human driver model. 
Parameter values retained in a vehicle segmentation and human driver model parameter dataset 174 stored in a memory 172 can be used to configure the operation of the human driver model module 175. As described in more detail above, the elevated position of the UAV above the location being monitored and the stabilized high definition camera on the UAV provide a highly valuable and useful image and data feed for use by the human driver model module 175. As a result of the processing performed by the human driver model system 201, data corresponding to predicted or simulated driver behaviors 220 can be produced and provided to a user or other system components. In particular, the predicted or simulated driver behavior data 220 can be provided to a system component used to create a virtual world where a control system for an autonomous vehicle can be trained and improved. The virtual world is configured to be as close as possible to the real world where vehicles are operated by human drivers. In other words, the simulated driver behavior data is indirectly useful for configuring the control system for the autonomous vehicle. It will be apparent to those of ordinary skill in the art that the human driver model system 201 and the traffic or vehicle image data 210 described and claimed herein can be implemented, configured, processed, and used in a variety of other applications and systems as well.
  • A basic human driver model may be used to simulate or predict the behavior of an autonomous vehicle with a simulated driver in a simulation scenario. The basic human driver model represents a virtual world configured to be as close as possible to the real world where vehicles are operated by human drivers. The virtual world can be used to train and improve a control system for an autonomous vehicle. Thus, the simulation can be indirectly useful for configuring the control systems in autonomous vehicles. Such human driver models can be parameterized models, which may be configured using either real-world input or randomized variables. In one example, the basic human driver model may simulate typical and atypical driver behaviors, such as steering or heading control, speed or throttle control, and stopping or brake control. In one example, the basic human driver model may use, for example, sensory-motor transport delay, dynamic capabilities, and preferred driving behaviors. In some implementations, the human driver model may include modeling of the transport time delay between a stimulus and the simulated driver's control response. In some implementations, this delay may represent the time necessary for the driver to sense a stimulus, process it, determine the best corrective action, and respond. The human driver model may also include a speed control model with an absolute maximum vehicle speed (e.g., the maximum speed of the vehicle, the speed a driver is not comfortable exceeding, etc.) and a cornering aggressiveness measure to reduce the speed based on the turning radius. In the example, this may replicate the tendency of drivers to slow down through a turn. In the example, once the turning radius drops below the cornering threshold in the scenario, the speed may be reduced in proportion to the tightness of the turn.
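The speed control and transport delay behaviors described above can be sketched in a few lines. The sketch makes two assumptions that go beyond the text: "reduced in proportion to the tightness of the turn" is interpreted as scaling the maximum speed by the ratio of turning radius to cornering threshold, and the transport delay is modeled as a fixed number of simulation steps. The names `target_speed` and `DelayedPerception` are hypothetical.

```python
from collections import deque


def target_speed(max_speed, turning_radius, cornering_threshold):
    """Speed the simulated driver aims for on the current road segment.

    Above the cornering threshold the driver holds max_speed. Below it,
    speed scales linearly with the turning radius (assumed relation).
    """
    if turning_radius >= cornering_threshold:
        return max_speed
    return max_speed * max(turning_radius, 0.0) / cornering_threshold


class DelayedPerception:
    """Fixed transport delay between a stimulus and the driver's response."""

    def __init__(self, delay_steps, initial=0.0):
        # Pre-fill so the driver perceives `initial` until real
        # stimuli have had time to pass through the delay.
        self._buffer = deque([initial] * delay_steps)

    def perceive(self, stimulus):
        """Push the newest stimulus and return the one the driver sees now."""
        self._buffer.append(stimulus)
        return self._buffer.popleft()
```

With a maximum speed of 30 and a cornering threshold of 50, `target_speed(30.0, 25.0, 50.0)` yields 15.0, so a turn twice as tight as the threshold halves the target speed under this interpretation.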
  • In various example embodiments, the human driver model can be configured to simulate more than the typical driving behaviors. To simulate an environment that matches the real world as closely as possible, the human driver model needs data concerning both typical driving behaviors, which represent average drivers, and atypical driving behaviors. In other words, in reality, most human drivers drive in a courteous and patient manner, while other drivers drive aggressively and impatiently. Accordingly, the simulation system of the various example embodiments includes data related to the driving behaviors of impolite and impatient drivers in the virtual world. In all, the human driver model can be configured with data representing driving behaviors as varied as possible.
  • In some implementations, the dynamics of how a human may respond to stimuli may be included in the human driver model, which may include, for example, a metric of how aggressively the driver brakes and accelerates. In some implementations, an aggressive driver may be modeled as one who applies very high control inputs to achieve the desired vehicle speeds, while a conservative driver may use more gradual control inputs. In some implementations, this may be modelled using parameterized values, with the input being controlled to the desired value. In some implementations, by adjusting the parameterized values, the aggressiveness of the simulated driver may be increased or decreased.
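One way to picture a parameterized aggressiveness value is a simple proportional speed controller, sketched below. This is an assumed formulation, not the model of the disclosure: the control input (acceleration) is a gain times the gap between desired and current speed, limited by a maximum acceleration, so a larger gain corresponds to a more aggressive simulated driver. The function name, the parameter names, and the default values are hypothetical.

```python
def simulate_speed(desired, gain, steps, dt=0.1, max_accel=5.0, initial=0.0):
    """Simulate a driver steering vehicle speed toward a desired value.

    gain: aggressiveness parameter; higher values give stronger inputs.
    max_accel: physical limit on acceleration and braking magnitude.
    Returns the speed after each time step.
    """
    speed = initial
    history = []
    for _ in range(steps):
        # Control input proportional to the speed error, then clamped.
        accel = gain * (desired - speed)
        accel = max(-max_accel, min(max_accel, accel))
        speed += accel * dt
        history.append(speed)
    return history
```

Running the same scenario with `gain=2.0` and `gain=0.2` shows the aggressive driver closing the gap to the desired speed noticeably faster, while the conservative driver applies more gradual inputs.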
  • Referring now to FIG. 8, a flow diagram illustrates an example embodiment of a system and method 1000 for aerial video traffic analysis. The example embodiment can be configured to: receive a captured video image sequence from an unmanned aerial vehicle (UAV) (processing block 1010); clip the video image sequence by removing unnecessary images (processing block 1020); stabilize the video image sequence by choosing a reference image and adjusting other images to the reference image (processing block 1030); extract a background image of the video image sequence for vehicle segmentation (processing block 1040); perform vehicle segmentation to identify vehicles in the video image sequence on a pixel by pixel basis (processing block 1050); determine a centroid, heading, and rectangular shape of each identified vehicle (processing block 1060); perform vehicle tracking to detect a same identified vehicle in multiple image frames of the video image sequence (processing block 1070); and produce output and visualization of the video image sequence including a combination of the background image and the images of each identified vehicle (processing block 1080).
  • FIG. 9 shows a diagrammatic representation of a machine in the example form of a computing system 700 within which a set of instructions when executed and/or processing logic when activated may cause the machine to perform any one or more of the methodologies described and/or claimed herein. In alternative embodiments, the machine operates as a standalone device or may be connected (e.g., networked) to other machines. In a networked deployment, the machine may operate in the capacity of a server or a client machine in server-client network environment, or as a peer machine in a peer-to-peer (or distributed) network environment. The machine may be a personal computer (PC), a laptop computer, a tablet computing system, a Personal Digital Assistant (PDA), a cellular telephone, a smartphone, a web appliance, a set-top box (STB), a network router, switch or bridge, or any machine capable of executing a set of instructions (sequential or otherwise) or activating processing logic that specify actions to be taken by that machine. Further, while only a single machine is illustrated, the term “machine” can also be taken to include any collection of machines that individually or jointly execute a set (or multiple sets) of instructions or processing logic to perform any one or more of the methodologies described and/or claimed herein.
  • The example computing system 700 can include a data processor 702 (e.g., a System-on-a-Chip (SoC), general processing core, graphics core, and optionally other processing logic) and a memory 704, which can communicate with each other via a bus or other data transfer system 706. The computing system 700 may further include various input/output (I/O) devices and/or interfaces 710, such as a touchscreen display, an audio jack, a voice interface, and optionally a network interface 712. In an example embodiment, the network interface 712 can include one or more radio transceivers configured for compatibility with any one or more standard wireless and/or cellular protocols or access technologies (e.g., 2nd (2G), 2.5, 3rd (3G), 4th (4G) generation, and future generation radio access for cellular systems, Global System for Mobile communication (GSM), General Packet Radio Services (GPRS), Enhanced Data GSM Environment (EDGE), Wideband Code Division Multiple Access (WCDMA), LTE, CDMA2000, WLAN, Wireless Router (WR) mesh, and the like). Network interface 712 may also be configured for use with various other wired and/or wireless communication protocols, including TCP/IP, UDP, SIP, SMS, RTP, WAP, CDMA, TDMA, UMTS, UWB, WiFi, WiMax, Bluetooth™, IEEE 802.11x, and the like. In essence, network interface 712 may include or support virtually any wired and/or wireless communication and data processing mechanisms by which information/data may travel between a computing system 700 and another computing or communication system via network 714.
  • The memory 704 can represent a machine-readable medium on which is stored one or more sets of instructions, software, firmware, or other processing logic (e.g., logic 708) embodying any one or more of the methodologies or functions described and/or claimed herein. The logic 708, or a portion thereof, may also reside, completely or at least partially, within the processor 702 during execution thereof by the computing system 700. As such, the memory 704 and the processor 702 may also constitute machine-readable media. The logic 708, or a portion thereof, may also be configured as processing logic or logic, at least a portion of which is partially implemented in hardware. The logic 708, or a portion thereof, may further be transmitted or received over a network 714 via the network interface 712. While the machine-readable medium of an example embodiment can be a single medium, the term “machine-readable medium” should be taken to include a single non-transitory medium or multiple non-transitory media (e.g., a centralized or distributed database, and/or associated caches and computing systems) that store the one or more sets of instructions. The term “machine-readable medium” can also be taken to include any non-transitory medium that is capable of storing, encoding or carrying a set of instructions for execution by the machine and that cause the machine to perform any one or more of the methodologies of the various embodiments, or that is capable of storing, encoding or carrying data structures utilized by or associated with such a set of instructions. The term “machine-readable medium” can accordingly be taken to include, but not be limited to, solid-state memories, optical media, and magnetic media.
  • The Abstract of the Disclosure is provided to allow the reader to quickly ascertain the nature of the technical disclosure. It is submitted with the understanding that it will not be used to interpret or limit the scope or meaning of the claims. In addition, in the foregoing Detailed Description, it can be seen that various features are grouped together in a single embodiment for the purpose of streamlining the disclosure. This method of disclosure is not to be interpreted as reflecting an intention that the claimed embodiments require more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive subject matter lies in less than all features of a single disclosed embodiment. Thus, the following claims are hereby incorporated into the Detailed Description, with each claim standing on its own as a separate embodiment.

Claims (20)

What is claimed is:
1. A system comprising:
an unmanned aerial vehicle (UAV), equipped with a camera, deployed at an elevated position at a monitored location, the UAV configured to capture a video image sequence of the monitored location for a pre-determined period of time using the UAV camera;
a data processor; and
an image processing module, executable by the data processor, the image processing module being configured to:
receive the captured video image sequence from the UAV;
clip the video image sequence by removing unnecessary images;
stabilize the video image sequence by choosing a reference image and adjusting other images to the reference image;
extract a background image of the video image sequence for vehicle segmentation;
perform vehicle segmentation to identify vehicles in the video image sequence on a pixel by pixel basis;
determine a centroid, heading, and rectangular shape of each identified vehicle;
perform vehicle tracking to detect a same identified vehicle in multiple image frames of the video image sequence; and
produce output and visualization of the video image sequence including a combination of the background image and the images of each identified vehicle.
2. The system of claim 1 wherein the image processing module being configured to extract a background image of the video image sequence by inferring a dominant color value from a collection of frames sampled over time from the video image sequence.
3. The system of claim 1 wherein the image processing module being configured to perform vehicle segmentation by concatenating a video image frame with the corresponding background image.
4. The system of claim 1 wherein the image processing module being configured to perform vehicle segmentation by use of a trained neural network.
5. The system of claim 1 wherein the image processing module includes machine learnable components.
6. The system of claim 1 wherein the image processing module being configured to generate a vehicle segmentation mask.
7. The system of claim 1 wherein the image processing module being configured to generate a direction along which a variance of the shape of a vehicle as a distribution is maximized.
8. The system of claim 1 wherein the image processing module being configured to determine if a vehicle detection overlaps in two sequential image frames.
9. The system of claim 1 wherein the output and visualization includes visual bounding boxes and velocity vectors for each identified vehicle.
10. The system of claim 1 further including a human driver model configured to predict or simulate human driver behaviors.
11. A method comprising:
receiving a captured video image sequence from an unmanned aerial vehicle (UAV);
clipping the video image sequence by removing unnecessary images;
stabilizing the video image sequence by choosing a reference image and adjusting other images to the reference image;
extracting a background image of the video image sequence for vehicle segmentation;
performing vehicle segmentation to identify vehicles in the video image sequence on a pixel by pixel basis;
determining a centroid, heading, and rectangular shape of each identified vehicle;
performing vehicle tracking to detect a same identified vehicle in multiple image frames of the video image sequence; and
producing output and visualization of the video image sequence including a combination of the background image and the images of each identified vehicle.
12. The method of claim 11 including extracting the background image of the video image sequence by inferring a dominant color value from a collection of frames sampled over time from the video image sequence.
13. The method of claim 11 including performing vehicle segmentation by concatenating a video image frame with the corresponding background image.
14. The method of claim 11 including performing vehicle segmentation by use of a trained neural network.
15. The method of claim 11 including using machine learnable components.
16. The method of claim 11 including generating a vehicle segmentation mask.
17. The method of claim 11 including generating a direction along which a variance of the shape of a vehicle as a distribution is maximized.
18. The method of claim 11 including determining if a vehicle detection overlaps in two sequential image frames.
19. The method of claim 11 wherein the output and visualization includes visual bounding boxes and velocity vectors for each identified vehicle.
20. The method of claim 11 further including providing a human driver model configured to predict or simulate human driver behaviors.
US15/725,747 2017-10-05 2017-10-05 System and method for aerial video traffic analysis Active 2037-10-07 US10410055B2 (en)

Priority Applications (7)

Application Number Priority Date Filing Date Title
US15/725,747 US10410055B2 (en) 2017-10-05 2017-10-05 System and method for aerial video traffic analysis
CN202310794571.6A CN116844072A (en) 2017-10-05 2018-10-01 System and method for aerial video traffic analysis
CN201880065098.5A CN111201496B (en) 2017-10-05 2018-10-01 System and method for aerial video traffic analysis
AU2018345330A AU2018345330B2 (en) 2017-10-05 2018-10-01 System and method for aerial video traffic analysis
EP18864500.6A EP3692428A4 (en) 2017-10-05 2018-10-01 System and method for aerial video traffic analysis
PCT/US2018/053795 WO2019070604A1 (en) 2017-10-05 2018-10-01 System and method for aerial video traffic analysis
AU2023278047A AU2023278047A1 (en) 2017-10-05 2023-12-06 System and method for aerial video traffic analysis

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US15/725,747 US10410055B2 (en) 2017-10-05 2017-10-05 System and method for aerial video traffic analysis

Publications (2)

Publication Number Publication Date
US20190108384A1 (en) 2019-04-11
US10410055B2 (en) 2019-09-10

Family

ID=65993218

Family Applications (1)

Application Number Title Priority Date Filing Date
US15/725,747 Active 2037-10-07 US10410055B2 (en) 2017-10-05 2017-10-05 System and method for aerial video traffic analysis

Country Status (5)

Country Link
US (1) US10410055B2 (en)
EP (1) EP3692428A4 (en)
CN (2) CN111201496B (en)
AU (2) AU2018345330B2 (en)
WO (1) WO2019070604A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10679362B1 (en) * 2018-05-14 2020-06-09 Vulcan Inc. Multi-camera homogeneous object trajectory alignment
US11790773B2 (en) * 2019-02-25 2023-10-17 Quantifly Llc Vehicle parking data collection system and method
US11733424B2 (en) 2020-07-31 2023-08-22 Chevron U.S.A. Inc. Systems and methods for identifying subsurface features as functions of feature positions in a subsurface volume of interest
US11841479B2 (en) 2020-07-31 2023-12-12 Chevron U.S.A. Inc. Systems and methods for identifying subsurface features as a function of position in a subsurface volume of interest

Family Cites Families (118)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5877897A (en) 1993-02-26 1999-03-02 Donnelly Corporation Automatic rearview mirror, vehicle lighting control and vehicle interior monitoring system using a photosensor array
US6822563B2 (en) 1997-09-22 2004-11-23 Donnelly Corporation Vehicle imaging system with accessory control
US7103460B1 (en) 1994-05-09 2006-09-05 Automotive Technologies International, Inc. System and method for vehicle diagnostics
US7783403B2 (en) 1994-05-23 2010-08-24 Automotive Technologies International, Inc. System and method for preventing vehicular accidents
US7655894B2 (en) 1996-03-25 2010-02-02 Donnelly Corporation Vehicular image sensing system
US6263088B1 (en) * 1997-06-19 2001-07-17 Ncr Corporation System and method for tracking movement of objects in a scene
US8711217B2 (en) * 2000-10-24 2014-04-29 Objectvideo, Inc. Video surveillance system employing video primitives
US7167519B2 (en) * 2001-12-20 2007-01-23 Siemens Corporate Research, Inc. Real-time video object generation for smart cameras
ES2391556T3 (en) 2002-05-03 2012-11-27 Donnelly Corporation Object detection system for vehicles
US9007197B2 (en) * 2002-05-20 2015-04-14 Intelligent Technologies International, Inc. Vehicular anticipatory sensor system
US6777904B1 (en) 2003-02-25 2004-08-17 Ford Global Technologies, Llc Method and system for controlling a motor
WO2005098739A1 (en) 2004-04-08 2005-10-20 Mobileye Technologies Limited Pedestrian detection
WO2005098751A1 (en) 2004-04-08 2005-10-20 Mobileye Technologies Limited Crowd detection
EP1741079B1 (en) 2004-04-08 2008-05-21 Mobileye Technologies Limited Collision warning system
US7526103B2 (en) 2004-04-15 2009-04-28 Donnelly Corporation Imaging system for vehicle
US8553088B2 (en) 2005-11-23 2013-10-08 Mobileye Technologies Limited Systems and methods for detecting obstructions in a camera field of view
US8164628B2 (en) 2006-01-04 2012-04-24 Mobileye Technologies Ltd. Estimating distance to an object using a sequence of images recorded by a monocular camera
US8150155B2 (en) * 2006-02-07 2012-04-03 Qualcomm Incorporated Multi-mode region-of-interest video object segmentation
US8265392B2 (en) * 2006-02-07 2012-09-11 Qualcomm Incorporated Inter-mode region-of-interest video object segmentation
US7689559B2 (en) 2006-02-08 2010-03-30 Telenor Asa Document similarity scoring and ranking method, device and computer program product
US8417060B2 (en) * 2006-03-20 2013-04-09 Arizona Board Of Regents For And On Behalf Of Arizona State University Methods for multi-point descriptors for image registrations
US7786898B2 (en) 2006-05-31 2010-08-31 Mobileye Technologies Ltd. Fusion of far infrared and visible images in enhanced obstacle detection in automotive applications
EP1930863B1 (en) 2006-12-06 2011-08-03 Mobileye Technologies Limited Detecting and recognizing traffic signs
US20080249667A1 (en) 2007-04-09 2008-10-09 Microsoft Corporation Learning and reasoning to enhance energy efficiency in transportation systems
US7839292B2 (en) 2007-04-11 2010-11-23 Nec Laboratories America, Inc. Real-time driving danger level prediction
US8229163B2 (en) * 2007-08-22 2012-07-24 American Gnc Corporation 4D GIS based virtual reality for moving target prediction
US8041111B1 (en) 2007-10-15 2011-10-18 Adobe Systems Incorporated Subjective and locatable color theme extraction for images
US9176006B2 (en) 2008-01-15 2015-11-03 Mobileye Vision Technologies Ltd. Detection and classification of light sources using a diffraction grating
US9117133B2 (en) 2008-06-18 2015-08-25 Spectral Image, Inc. Systems and methods for hyperspectral imaging
US20100049397A1 (en) 2008-08-22 2010-02-25 Garmin Ltd. Fuel efficient routing
US8126642B2 (en) 2008-10-24 2012-02-28 Gray & Company, Inc. Control and systems for autonomously driven vehicles
US8345956B2 (en) * 2008-11-03 2013-01-01 Microsoft Corporation Converting 2D video into stereo video
US9459515B2 (en) 2008-12-05 2016-10-04 Mobileye Vision Technologies Ltd. Adjustable camera mount for a vehicle windshield
US8175376B2 (en) 2009-03-09 2012-05-08 Xerox Corporation Framework for image thumbnailing based on visual similarity
EP2411961B1 (en) 2009-03-26 2012-10-31 TP Vision Holding B.V. Method and apparatus for modifying an image by using a saliency map based on color frequency
US8271871B2 (en) 2009-04-30 2012-09-18 Xerox Corporation Automated method for alignment of document objects
US8392117B2 (en) 2009-05-22 2013-03-05 Toyota Motor Engineering & Manufacturing North America, Inc. Using topological structure for path planning in semi-structured environments
US8645480B1 (en) 2009-07-19 2014-02-04 Aaron T. Emigh Trust representation by similarity
TWI393074B (en) * 2009-12-10 2013-04-11 Ind Tech Res Inst Apparatus and method for moving object detection
JP2011176748A (en) 2010-02-25 2011-09-08 Sony Corp Image processing apparatus and method, and program
US9280711B2 (en) 2010-09-21 2016-03-08 Mobileye Vision Technologies Ltd. Barrier and guardrail detection using a single camera
US9118816B2 (en) 2011-12-06 2015-08-25 Mobileye Vision Technologies Ltd. Road vertical contour detection
US8509982B2 (en) 2010-10-05 2013-08-13 Google Inc. Zone driving
US9179072B2 (en) 2010-10-31 2015-11-03 Mobileye Vision Technologies Ltd. Bundling night vision and other driver assistance systems (DAS) using near infra red (NIR) illumination and a rolling shutter
US9355635B2 (en) 2010-11-15 2016-05-31 Futurewei Technologies, Inc. Method and system for video summarization
US9251708B2 (en) 2010-12-07 2016-02-02 Mobileye Vision Technologies Ltd. Forward collision warning trap and pedestrian advanced warning system
RU2596246C2 (en) * 2011-02-21 2016-09-10 Стратек Системс Лимитед Observation system and method of detecting contamination or damage of aerodrome with foreign objects
US8401292B2 (en) 2011-04-26 2013-03-19 Eastman Kodak Company Identifying high saliency regions in digital images
US9233659B2 (en) 2011-04-27 2016-01-12 Mobileye Vision Technologies Ltd. Pedestrian collision warning system
KR101777875B1 (en) 2011-04-28 2017-09-13 엘지디스플레이 주식회사 Stereoscopic image display and method of adjusting stereoscopic image thereof
US9183447B1 (en) 2011-06-09 2015-11-10 Mobileye Vision Technologies Ltd. Object detection using candidate object alignment
WO2013015416A1 (en) 2011-07-28 2013-01-31 本田技研工業株式会社 Wireless power transmission method
DE102011083749B4 (en) 2011-09-29 2015-06-11 Aktiebolaget Skf Rotor blade of a wind turbine with a device for detecting a distance value and method for detecting a distance value
US8891820B2 (en) * 2011-09-29 2014-11-18 The Boeing Company Multi-modal sensor fusion
US9297641B2 (en) 2011-12-12 2016-03-29 Mobileye Vision Technologies Ltd. Detection of obstacles at night by analysis of shadows
FR2984254B1 (en) 2011-12-16 2016-07-01 Renault Sa CONTROL OF AUTONOMOUS VEHICLES
US8810666B2 (en) * 2012-01-16 2014-08-19 Google Inc. Methods and systems for processing a video for stabilization using dynamic crop
US9317776B1 (en) 2013-03-13 2016-04-19 Hrl Laboratories, Llc Robust static and moving object detection system via attentional mechanisms
JP5605381B2 (en) 2012-02-13 2014-10-15 株式会社デンソー Cruise control equipment
US9042648B2 (en) 2012-02-23 2015-05-26 Microsoft Technology Licensing, Llc Salient object segmentation
US9476970B1 (en) 2012-03-19 2016-10-25 Google Inc. Camera based localization
US8737690B2 (en) * 2012-04-06 2014-05-27 Xerox Corporation Video-based method for parking angle violation detection
US9134402B2 (en) 2012-08-13 2015-09-15 Digital Signal Corporation System and method for calibrating video and lidar subsystems
US9025880B2 (en) 2012-08-29 2015-05-05 Disney Enterprises, Inc. Visual saliency estimation for images and video
US9165190B2 (en) * 2012-09-12 2015-10-20 Avigilon Fortress Corporation 3D human pose and shape modeling
US9120485B1 (en) 2012-09-14 2015-09-01 Google Inc. Methods and systems for smooth trajectory generation for a self-driving vehicle
US9111444B2 (en) 2012-10-31 2015-08-18 Raytheon Company Video and lidar target detection and tracking system and method for segmenting moving targets
US9092430B2 (en) 2013-01-02 2015-07-28 International Business Machines Corporation Assigning shared catalogs to cache structures in a cluster computing system
US8788134B1 (en) 2013-01-04 2014-07-22 GM Global Technology Operations LLC Autonomous driving merge management system
US8908041B2 (en) 2013-01-15 2014-12-09 Mobileye Vision Technologies Ltd. Stereo assist with rolling shutters
US9277132B2 (en) 2013-02-21 2016-03-01 Mobileye Vision Technologies Ltd. Image distortion correction of a camera with a rolling shutter
US9147255B1 (en) 2013-03-14 2015-09-29 Hrl Laboratories, Llc Rapid object detection by combining structural information from image segmentation with bio-inspired attentional mechanisms
US9652860B1 (en) * 2013-03-15 2017-05-16 Puretech Systems, Inc. System and method for autonomous PTZ tracking of aerial targets
US9342074B2 (en) 2013-04-05 2016-05-17 Google Inc. Systems and methods for transitioning control of an autonomous vehicle to a driver
AU2013205548A1 (en) * 2013-04-30 2014-11-13 Canon Kabushiki Kaisha Method, system and apparatus for tracking objects of a scene
US9438878B2 (en) 2013-05-01 2016-09-06 Legend3D, Inc. Method of converting 2D video to 3D video using 3D object models
US9025825B2 (en) * 2013-05-10 2015-05-05 Palo Alto Research Center Incorporated System and method for visual motion based object segmentation and tracking
US9070289B2 (en) * 2013-05-10 2015-06-30 Palo Alto Research Incorporated System and method for detecting, tracking and estimating the speed of vehicles from a mobile platform
US9671243B2 (en) 2013-06-13 2017-06-06 Mobileye Vision Technologies Ltd. Vision augmented navigation
CN103413444B (en) * 2013-08-26 2015-08-19 深圳市川大智胜科技发展有限公司 Traffic flow survey method based on unmanned aerial vehicle HD video
US9315192B1 (en) 2013-09-30 2016-04-19 Google Inc. Methods and systems for pedestrian avoidance using LIDAR
US9122954B2 (en) 2013-10-01 2015-09-01 Mobileye Vision Technologies Ltd. Performing a histogram using an array of addressable registers
US9738280B2 (en) 2013-10-03 2017-08-22 Robert Bosch Gmbh Adaptive cruise control with on-ramp detection
US9330334B2 (en) 2013-10-24 2016-05-03 Adobe Systems Incorporated Iterative saliency map estimation
US9299004B2 (en) 2013-10-24 2016-03-29 Adobe Systems Incorporated Image foreground detection
US9656673B2 (en) 2013-12-04 2017-05-23 Mobileye Vision Technologies Ltd. Systems and methods for navigating a vehicle to a default lane
CA2935617C (en) 2013-12-30 2023-09-12 Craig Arnold Tieman Connected vehicle system with infotainment interface for mobile devices
EP3100206B1 (en) 2014-01-30 2020-09-09 Mobileye Vision Technologies Ltd. Systems and methods for lane end recognition
US9664789B2 (en) 2014-02-20 2017-05-30 Mobileye Vision Technologies Ltd. Navigation based on radar-cued visual imaging
CN103793925B (en) 2014-02-24 2016-05-18 北京工业大学 Video image visual saliency detection method fusing spatio-temporal features
DE102014205170A1 (en) 2014-03-20 2015-11-26 Bayerische Motoren Werke Aktiengesellschaft Method and device for determining a trajectory for a vehicle
US9471889B2 (en) * 2014-04-24 2016-10-18 Xerox Corporation Video tracking based method for automatic sequencing of vehicles in drive-thru applications
US9390329B2 (en) * 2014-04-25 2016-07-12 Xerox Corporation Method and system for automatically locating static occlusions
CN105100134A (en) 2014-04-28 2015-11-25 思科技术公司 Screen shared cache management
EP4187523A1 (en) 2014-05-14 2023-05-31 Mobileye Vision Technologies Ltd. Systems and methods for curb detection and pedestrian hazard assessment
US9720418B2 (en) 2014-05-27 2017-08-01 Here Global B.V. Autonomous vehicle monitoring and control
EP3152704A2 (en) 2014-06-03 2017-04-12 Mobileye Vision Technologies Ltd. Systems and methods for detecting an object
US9457807B2 (en) 2014-06-05 2016-10-04 GM Global Technology Operations LLC Unified motion planning algorithm for autonomous driving vehicle in obstacle avoidance maneuver
US9554030B2 (en) 2014-09-29 2017-01-24 Yahoo! Inc. Mobile device image acquisition using objects of interest recognition
US9746550B2 (en) 2014-10-08 2017-08-29 Ford Global Technologies, Llc Detecting low-speed close-range vehicle cut-in
US9959903B2 (en) * 2014-10-23 2018-05-01 Qnap Systems, Inc. Video playback method
KR101664582B1 (en) 2014-11-12 2016-10-10 현대자동차주식회사 Path Planning Apparatus and Method for Autonomous Vehicle
US10115024B2 (en) 2015-02-26 2018-10-30 Mobileye Vision Technologies Ltd. Road vertical contour detection using a stabilized coordinate frame
JP6421684B2 (en) 2015-04-17 2018-11-14 井関農機株式会社 Riding mower
US10635761B2 (en) 2015-04-29 2020-04-28 Energid Technologies Corporation System and method for evaluation of object autonomy
US9483839B1 (en) * 2015-05-06 2016-11-01 The Boeing Company Occlusion-robust visual object fingerprinting using fusion of multiple sub-region signatures
DE102015211926A1 (en) 2015-06-26 2016-12-29 Robert Bosch Gmbh Method and device for determining or evaluating a desired trajectory of a motor vehicle
JP6436237B2 (en) 2015-07-23 2018-12-12 日本電気株式会社 Route switching device, route switching system, and route switching method
US9989965B2 (en) * 2015-08-20 2018-06-05 Motionloft, Inc. Object detection and analysis via unmanned aerial vehicle
US9587952B1 (en) 2015-09-09 2017-03-07 Allstate Insurance Company Altering autonomous or semi-autonomous vehicle operation based on route traversal values
US9734455B2 (en) * 2015-11-04 2017-08-15 Zoox, Inc. Automated extraction of semantic information to enhance incremental mapping modifications for robotic vehicles
US9568915B1 (en) 2016-02-11 2017-02-14 Mitsubishi Electric Research Laboratories, Inc. System and method for controlling autonomous or semi-autonomous vehicle
US9535423B1 (en) 2016-03-29 2017-01-03 Adasworks Kft. Autonomous vehicle with improved visual detection ability
US10261574B2 (en) * 2016-11-30 2019-04-16 University Of Macau Real-time detection system for parked vehicles
US11295458B2 (en) * 2016-12-01 2022-04-05 Skydio, Inc. Object tracking by an unmanned aerial vehicle using visual sensors
CN106683119B (en) * 2017-01-09 2020-03-13 河北工业大学 Moving vehicle detection method based on aerial video image
US9953236B1 (en) 2017-03-10 2018-04-24 TuSimple System and method for semantic segmentation using dense upsampling convolution (DUC)
US10147193B2 (en) 2017-03-10 2018-12-04 TuSimple System and method for semantic segmentation using hybrid dilated convolution (HDC)

Cited By (52)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11874130B2 (en) 2017-08-22 2024-01-16 Tusimple, Inc. Verification module system and method for motion-based lane detection with multiple sensors
US10816354B2 (en) 2017-08-22 2020-10-27 Tusimple, Inc. Verification module system and method for motion-based lane detection with multiple sensors
US11573095B2 (en) 2017-08-22 2023-02-07 Tusimple, Inc. Verification module system and method for motion-based lane detection with multiple sensors
US11151393B2 (en) 2017-08-23 2021-10-19 Tusimple, Inc. Feature matching and corresponding refinement and 3D submap position refinement system and method for centimeter precision localization using camera-based submap and LiDAR-based global map
US11846510B2 (en) 2017-08-23 2023-12-19 Tusimple, Inc. Feature matching and correspondence refinement and 3D submap position refinement system and method for centimeter precision localization using camera-based submap and LiDAR-based global map
US10762673B2 (en) 2017-08-23 2020-09-01 Tusimple, Inc. 3D submap reconstruction system and method for centimeter precision localization using camera-based submap and LiDAR-based global map
US10953881B2 (en) 2017-09-07 2021-03-23 Tusimple, Inc. System and method for automated lane change control for autonomous vehicles
US10953880B2 (en) 2017-09-07 2021-03-23 Tusimple, Inc. System and method for automated lane change control for autonomous vehicles
US11853071B2 (en) 2017-09-07 2023-12-26 Tusimple, Inc. Data-driven prediction-based system and method for trajectory planning of autonomous vehicles
US20190206117A1 (en) * 2017-12-29 2019-07-04 UBTECH Robotics Corp. Image processing method, intelligent terminal, and storage device
US11312334B2 (en) 2018-01-09 2022-04-26 Tusimple, Inc. Real-time remote control of vehicles with high redundancy
US11305782B2 (en) 2018-01-11 2022-04-19 Tusimple, Inc. Monitoring system for autonomous vehicle operation
US11852498B2 (en) 2018-02-14 2023-12-26 Tusimple, Inc. Lane marking localization
US11009356B2 (en) 2018-02-14 2021-05-18 Tusimple, Inc. Lane marking localization and fusion
US11009365B2 (en) 2018-02-14 2021-05-18 Tusimple, Inc. Lane marking localization
US11740093B2 (en) 2018-02-14 2023-08-29 Tusimple, Inc. Lane marking localization and fusion
US11830205B2 (en) 2018-02-27 2023-11-28 Tusimple, Inc. System and method for online real-time multi-object tracking
US11295146B2 (en) 2018-02-27 2022-04-05 Tusimple, Inc. System and method for online real-time multi-object tracking
US11010874B2 (en) 2018-04-12 2021-05-18 Tusimple, Inc. Images for perception modules of autonomous vehicles
US11694308B2 (en) 2018-04-12 2023-07-04 Tusimple, Inc. Images for perception modules of autonomous vehicles
US11500101B2 (en) 2018-05-02 2022-11-15 Tusimple, Inc. Curb detection by analysis of reflection images
US11176715B2 (en) * 2018-05-18 2021-11-16 The Governing Council Of The University Of Toronto Method and system for color representation generation
US20210082133A1 (en) * 2018-06-22 2021-03-18 X Development Llc Detection and replacement of transient obstructions from high elevation digital images
US11710219B2 (en) * 2018-06-22 2023-07-25 Mineral Earth Sciences Llc Detection and replacement of transient obstructions from high elevation digital images
US11292480B2 (en) 2018-09-13 2022-04-05 Tusimple, Inc. Remote safe driving methods and systems
US10942271B2 (en) 2018-10-30 2021-03-09 Tusimple, Inc. Determining an angle between a tow vehicle and a trailer
US11714192B2 (en) 2018-10-30 2023-08-01 Tusimple, Inc. Determining an angle between a tow vehicle and a trailer
US11972690B2 (en) 2018-12-14 2024-04-30 Beijing Tusen Zhitu Technology Co., Ltd. Platooning method, apparatus and system of autonomous driving platoon
US11037440B2 (en) * 2018-12-19 2021-06-15 Sony Group Corporation Vehicle identification for smart patrolling
US11321821B2 (en) * 2019-02-12 2022-05-03 Ordnance Survey Limited Method and system for generating composite geospatial images
US11263791B2 (en) 2019-05-03 2022-03-01 The Governing Council Of The University Of Toronto System and method for generation of an interactive color workspace
US11554785B2 (en) * 2019-05-07 2023-01-17 Foresight Ai Inc. Driving scenario machine learning network and driving environment simulation
US11610142B2 (en) * 2019-05-28 2023-03-21 Ati Technologies Ulc Safety monitor for image misclassification
US11971803B2 (en) 2019-05-31 2024-04-30 Ati Technologies Ulc Safety monitor for invalid image transform
US11823460B2 (en) 2019-06-14 2023-11-21 Tusimple, Inc. Image fusion for autonomous vehicle operation
CN110619282A (en) * 2019-08-26 2019-12-27 海南撰云空间信息技术有限公司 Automatic extraction method for unmanned aerial vehicle orthoscopic image building
US20210097382A1 (en) * 2019-09-27 2021-04-01 Mcafee, Llc Methods and apparatus to improve deepfake detection with explainability
CN110889394A (en) * 2019-12-11 2020-03-17 安徽大学 Rice lodging recognition method based on deep learning UNet network
US20210270630A1 (en) * 2020-02-28 2021-09-02 International Business Machines Corporation Probe data generating system for simulator
US11644331B2 (en) * 2020-02-28 2023-05-09 International Business Machines Corporation Probe data generating system for simulator
US11702101B2 (en) 2020-02-28 2023-07-18 International Business Machines Corporation Automatic scenario generator using a computer for autonomous driving
US11814080B2 (en) 2020-02-28 2023-11-14 International Business Machines Corporation Autonomous driving evaluation using data analysis
US11810322B2 (en) 2020-04-09 2023-11-07 Tusimple, Inc. Camera pose estimation techniques
US20210382490A1 (en) * 2020-06-04 2021-12-09 Firefly Automatix, Inc. Performing low profile object detection on a mower
US11971719B2 (en) * 2020-06-04 2024-04-30 Firefly Automatix, Inc. Performing low profile object detection on a mower
US11701931B2 (en) 2020-06-18 2023-07-18 Tusimple, Inc. Angle and orientation measurements for vehicles with multiple drivable sections
CN112700654A (en) * 2020-12-21 2021-04-23 上海眼控科技股份有限公司 Video processing method and device, electronic equipment and storage medium
US11702011B1 (en) * 2022-06-30 2023-07-18 Plusai, Inc. Data augmentation for driver monitoring
US11699282B1 (en) 2022-06-30 2023-07-11 Plusai, Inc. Data augmentation for vehicle control
US11574462B1 (en) 2022-06-30 2023-02-07 Plus AI, Inc. Data augmentation for detour path configuring
CN116630832A (en) * 2023-07-21 2023-08-22 江西现代职业技术学院 Unmanned aerial vehicle target recognition method, unmanned aerial vehicle target recognition system, computer and readable storage medium
CN117218858A (en) * 2023-10-25 2023-12-12 河北高速公路集团有限公司承德分公司 Traffic safety early warning system and method for expressway

Also Published As

Publication number Publication date
AU2018345330B2 (en) 2023-09-07
AU2023278047A1 (en) 2024-01-04
WO2019070604A1 (en) 2019-04-11
CN116844072A (en) 2023-10-03
EP3692428A4 (en) 2021-07-14
EP3692428A1 (en) 2020-08-12
CN111201496B (en) 2023-06-30
US10410055B2 (en) 2019-09-10
AU2018345330A1 (en) 2020-05-07
CN111201496A (en) 2020-05-26

Similar Documents

Publication Publication Date Title
US10410055B2 (en) System and method for aerial video traffic analysis
US9952594B1 (en) System and method for traffic data collection using unmanned aerial vehicles (UAVs)
US10019652B2 (en) Generating a virtual world to assess real-world video analysis performance
US10830669B2 (en) Perception simulation for improved autonomous vehicle control
US11042723B2 (en) Systems and methods for depth map sampling
Mueller et al. A benchmark and simulator for uav tracking
Richter et al. Playing for benchmarks
US9990546B2 (en) Method and apparatus for determining target region in video frame for target acquisition
EP3639241A1 (en) Voxel based ground plane estimation and object segmentation
US10909392B1 (en) Systems and methods for computer-based labeling of sensor data captured by a vehicle
US11302065B2 (en) Systems and methods for filtering sensor data to remove data points associated with ephemeral objects
CN109543634B (en) Data processing method and device in positioning process, electronic equipment and storage medium
Dias et al. Autonomous detection of mosquito-breeding habitats using an unmanned aerial vehicle
Maher et al. Realtime human-UAV interaction using deep learning
JP2022512165A (en) Intersection detection, neural network training and intelligent driving methods, equipment and devices
Kruber et al. Vehicle position estimation with aerial imagery from unmanned aerial vehicles
CN110288629A (en) Target detection automatic marking method and device based on moving Object Detection
US20210357763A1 (en) Method and device for performing behavior prediction by using explainable self-focused attention
Bokovoy et al. Original loop-closure detection algorithm for monocular vslam
Salcedo et al. On-board target virtualization using image features for UAV autonomous tracking
CN111126170A (en) Video dynamic object detection method based on target detection and tracking
Muniruzzaman et al. Deterministic algorithm for traffic detection in free-flow and congestion using video sensor
Rochan et al. Efficient object localization and segmentation in weakly labeled videos
CN112651351B (en) Data processing method and device
Barrozo et al. Simulation of an Autonomous Vehicle Control System Based on Image Processing

Legal Events

Date Code Title Description
FEPP Fee payment procedure

Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY

FEPP Fee payment procedure

Free format text: ENTITY STATUS SET TO SMALL (ORIGINAL EVENT CODE: SMAL); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Free format text: ENTITY STATUS SET TO SMALL (ORIGINAL EVENT CODE: SMAL); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY

AS Assignment

Owner name: TUSIMPLE, CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:WANG, YIJIE;WANG, PANQU;CHEN, PENGFEI;SIGNING DATES FROM 20171002 TO 20171003;REEL/FRAME:047002/0592

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS

STPP Information on status: patent application and granting procedure in general

Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT RECEIVED

STPP Information on status: patent application and granting procedure in general

Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED

STCF Information on status: patent grant

Free format text: PATENTED CASE

AS Assignment

Owner name: TUSIMPLE, INC., CALIFORNIA

Free format text: CHANGE OF NAME;ASSIGNOR:TUSIMPLE;REEL/FRAME:051757/0470

Effective date: 20190412

FEPP Fee payment procedure

Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment: 4