WO2023102126A1 - Vehicle perception system with temporal tracker - Google Patents

Vehicle perception system with temporal tracker Download PDF

Info

Publication number
WO2023102126A1
WO2023102126A1 PCT/US2022/051550 US2022051550W WO2023102126A1 WO 2023102126 A1 WO2023102126 A1 WO 2023102126A1 US 2022051550 W US2022051550 W US 2022051550W WO 2023102126 A1 WO2023102126 A1 WO 2023102126A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
machine
learned model
vehicle
location
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/US2022/051550
Other languages
English (en)
French (fr)
Inventor
Cheng-Hsin WUU
Subhasis Das
Po-Jen Lai
Qian SONG
Benjamin Isaac ZWIEBEL
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zoox Inc
Original Assignee
Zoox Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zoox Inc filed Critical Zoox Inc
Priority to CN202280079871.XA priority Critical patent/CN118339067A/zh
Priority to JP2024532785A priority patent/JP2024544196A/ja
Priority to EP22902186.0A priority patent/EP4440897A4/en
Publication of WO2023102126A1 publication Critical patent/WO2023102126A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W30/00Purposes of road vehicle drive control systems not related to the control of a particular sub-unit, e.g. of systems using conjoint control of vehicle sub-units
    • B60W30/18Propelling the vehicle
    • B60W30/18009Propelling the vehicle related to particular drive situations
    • B60W30/18145Cornering
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W60/00Drive control systems specially adapted for autonomous road vehicles
    • B60W60/001Planning or execution of driving tasks
    • B60W60/0027Planning or execution of driving tasks using trajectory prediction for other traffic participants
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W30/00Purposes of road vehicle drive control systems not related to the control of a particular sub-unit, e.g. of systems using conjoint control of vehicle sub-units
    • B60W30/18Propelling the vehicle
    • B60W30/18009Propelling the vehicle related to particular drive situations
    • B60W30/18159Traversing an intersection
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/09Supervised learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/50Context or environment of the image
    • G06V20/56Context or environment of the image exterior to a vehicle by using sensors mounted on the vehicle
    • G06V20/58Recognition of moving objects or obstacles, e.g. vehicles or pedestrians; Recognition of traffic objects, e.g. traffic signs, traffic lights or roads
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W2554/00Input parameters relating to objects
    • B60W2554/40Dynamic objects, e.g. animals, windblown objects
    • B60W2554/402Type
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W2554/00Input parameters relating to objects
    • B60W2554/40Dynamic objects, e.g. animals, windblown objects
    • B60W2554/404Characteristics
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W2554/00Input parameters relating to objects
    • B60W2554/80Spatial relation or speed relative to objects
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W2556/00Input parameters relating to data
    • B60W2556/10Historical data
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W2556/00Input parameters relating to data
    • B60W2556/25Data precision
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W2556/00Input parameters relating to data
    • B60W2556/45External transmission of data to or from the vehicle
    • B60W2556/50External transmission of data to or from the vehicle of positioning data, e.g. GPS [Global Positioning System] data
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • G06N3/0442Recurrent networks, e.g. Hopfield networks characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]

Definitions

  • FIG. 3 illustrates example detail of a detection component and a tracking component of a machine-learned model according to some of the technologies disclosed herein.
  • a bounding box may indicate, among other things, a location of an object in the environment at to, an orientation of the object at to, a size of the object, or the like.
  • the object data may indicate a classification and/or type of the object, such as whether the object is a dynamic object that is capable of movement (e.g., a vehicle, motorcycle, bicycle, pedestrian, animal, etc.) and/or a static object (e.g., abuilding, road surface, tree, sign, barrier, curb, parked vehicle, etc.).
  • the object data may indicate other information associated with the object, such as a velocity of the object at to, a confidence associated with the object, or the like.
  • the machine-learned model may include an obj ect tracking portion.
  • the object tracking portion may be configured to perform multiple subprocesses to determine trajectories traversed by the detected objects during a period of time from to-N-to, where “N” represents any integer greater than or equal to one.
  • the object tracking portion may include, among other things, a location estimation subprocess, a track-location association subprocess, a trajectory estimation subprocess, and/or a track storage subprocess.
  • the vehicle may be controlled based at least in part on the tracked object data. For instance, at least a portion of the tracked object data may be used as in input to another system associated with the vehicle, such as a prediction system, planning system, or the like.
  • this disclosure is also directed to techniques for end-to-end training of a machine-learned model that includes multiple portions (e.g., stages, subprocesses, components, etc.).
  • a machine-learned model is trained end-to-end, outputs of the machine-learned model are used to train individual portions of the machine-learned model, which may otherwise exist as independent machine-learned models.
  • the intermediary outputs of one portion of the machine-learned model may be specifically tailored for use as inputs by other portions of the machine-learned model.
  • a method associated with end-to-end training of a machine-learned model may include techniques of receiving sensor data representing a vehicle traversing an environment.
  • the sensor data may comprise stored log data associated with the vehicle.
  • the sensor data/log data may be image data, lidar data, radar data, time of flight data, or the like.
  • the sensor data may be a time-ordered collection of image frames representing the sensor data associated with the environment, such that a first frame represents the environment at a first time, a second frame represents the environment at a second time, and so forth.
  • the techniques may include receiving an output from the machine-learned model.
  • the output may include predicted tracked object data that includes, among other things, the predicted bounding box and the predicted trajectory.
  • the ground truth data and the predicted tracked object data may be compared to determine whether differences exist between the ground truth data and predicted tracked object data. In some instances, if a difference is determined to meet or exceed a threshold difference, a parameter of the machine-learned model may be altered to minimize the difference. In some instances, a parameter of one or more portions of the machine-learned model may be altered.
  • the objects 108(1) and 108(2) may be referred to herein collectively as “objects 108.”
  • the sensor component s) may include lidar sensors, radar sensors, ultrasonic transducers, sonar sensors, location sensors (e.g., global positioning component (GPS), compass, etc.), inertial sensors (e.g., inertial measurement units, accelerometers, magnetometers, gyroscopes, etc.), cameras (e.g., RGB, IR, intensity, depth, etc.), wheel encoders, microphones, environment sensors (e.g., temperature sensors, humidity sensors, light sensors, pressure sensors, etc.), time of flight (ToF) sensors, etc.
  • GPS global positioning component
  • inertial sensors e.g., inertial measurement units, accelerometers, magnetometers, gyroscopes, etc.
  • cameras e.g., RGB, IR, intensity, depth, etc.
  • wheel encoders e.g., microphones, environment sensors (e.g
  • the obj ect data 116 may, in some instances, be used in whole or in part as an input to a tracking component 122 of the machine-learned model 112.
  • the tracking component 122 may also receive, as an input, stored tracking data associated with the objects 108(1) and 108(2), which is discussed in further detail below.
  • the tracking component 122 may determine, and the machine-learned model 112 may output, tracked object data 124.
  • the tracked object data 124 may include the top-down data 118 representing the view of the environment 104 from the top-down perspective, with movement data indicative of object movement.
  • the tracked object data 124 may also include trajectories traversed by the objects in the environment 104, such as the trajectory 126(1) traversed by the object 108(1) and the trajectory 126(2) traversed by the object 108(2).
  • the tracked object data 124 may, in some examples, be forwarded to a prediction component 128 executing on the computing device(s) 110.
  • the prediction component 128 includes one or more machine-learned models that are trained to make predictions about the objects in the environment 104.
  • the prediction component 128 may determine prediction data 130 associated with the objects based at least in part on some or all of the tracked object data 124.
  • the prediction data 130 may include the top-down data 118 representing the view of the environment 104 from the top-down perspective. Additionally, or alternatively, the prediction data 130 may indicate one or more prediction(s) associated with the objects. For instance, the prediction(s) 132(1) and 132(2) associated with the objects 108(1) and 108(2), respectively, include trajectories that the objects 108(1) and 108(2) are predicted to traverse in the environment 104 during a future period of time.
  • the prediction data 130 may be forwarded as an input to a planner component 134 of the vehicle 102 that is executing on the computing device(s) 110.
  • the planner component 134 includes one or more machine-learned models and/or other algorithms that are configured to determine a planned trajectory for the vehicle 102 to follow through the environment 104. As such, the planner component 134 may determine the planned trajectory of the vehicle 102 based at least in part on the tracked object data 124 and/or the prediction data 130.
  • any one of the sensor data 106, the object data 116, the tracked object data 124, or the prediction data 130 may be formatted as a multi-channel image, where individual channels of the multi-channel image may represent a selection of information.
  • a first channel of a multi-channel image may include bounding box(es) associated with object(s)
  • a second channel of the multi-channel image may include velocity information associated with the object(s)
  • a third channel of the multi-channel image may include environmental data (e.g., surface or lane markings, traffic light information, etc.), and so forth.
  • multiple different channels of the multichannel image may include a portion of the same information.
  • a first channel and a second channel of the multi-channel image may each include bounding boxes associated with the same objects in an environment.
  • a channel can be represented as a color and/or a layer in a three-dimensional image stack, for example.
  • the LSTM component 316 may receive data and other information associated with the objects/features, such as a feature tensor, finite differences, and an output from the MLP component 312, and determine one or more trajectory -lev el representation(s) 318.
  • the MLP refinement component 320 may utilize the trajectory level representation(s) 318 to determine score(s), position offset(s), and other data 328.
  • the methods 400 and 500 are illustrated as collections of blocks in logical flow graphs, which represent sequences of operations that can be implemented in hardware, software, or a combination thereof.
  • the blocks represent computer-executable instructions stored on one or more computer-readable storage media that, when executed by one or more processors, perform the recited operations.
  • computer-executable instructions include routines, programs, objects, components, data structures, and the like that perform particular functions or implement particular abstract data types.
  • the order in which the operations are described is not intended to be construed as a limitation, and any number of the described blocks can be combined in any order and/or in parallel to implement the processes. In some embodiments, one or more blocks of the process may be omitted entirely.
  • the methods 400 and 500 may be combined in whole or in part with each other or with other methods.
  • the method 400 includes performing a first action.
  • the first action can include determining that the machine-learned model is a trained machine-learned model. Additionally, or alternatively, the first action can include sending the machine-learned model to the vehicle for use by the vehicle to traverse an environment. In some examples, the first action can include validating the performance of the machine-learned model in a simulation or with additional sensor data.
  • the method 400 includes altering a parameter of the first portion of the machine-learned model. For instance, a parameter of the detection component 114 or the tracking component 122 may be altered, as well as, or in the alternative, subprocesses performed by those components.
  • altering the parameter may comprise altering a software component of the first portion of the model or retraining the first portion of the model based at least in part on the difference.
  • the first data determined by the first portion of the model may be adjusted such that, in future predictions, the first data is tailored for use as an input for the second portion of the machine- learned model.
  • the vehicle computing device(s) 604 can include processor(s) 616 and memory 618 communicatively coupled with the processor(s) 616.
  • the memory 618 of the vehicle computing device(s) 604 stores a localization component 620, a perception component 622, a prediction component 130, a planner component 134, and one or more system controller(s) 624.
  • the perception component 622 can include the machine-learned model 112, as well as the tracking data 206.
  • the machine-learned model 112, or at least a tracking component of the machine-learned model 112, may utilize the tracking data 206 to estimate locations of objects in the environment, as described herein.
  • the tracking data 206 is stored in a memory, such as a long short-term memory.
  • the vehicle 602 can also include communication connect! on(s) 610 that enable communication between the vehicle 602 and other local or remote computing device(s), such as the computing device(s) 628, as well as other remote or local computing device(s).
  • the communication connection(s) 610 can facilitate communication with other local computing device(s) on the vehicle 602 and/or the drive system(s) 614.
  • the communication connection(s) 610 can allow the vehicle to communicate with other nearby computing device(s) (e.g., other nearby vehicles, traffic signals, etc.).
  • the communications connect! on(s) 610 also enable the vehicle 602 to communicate with a remote teleoperations computing device or other remote services.
  • the communications connect! on(s) 610 can include physical and/or logical interfaces for connecting the vehicle computing device(s) 604 to another computing device or a network, such as network(s) 626.
  • the communications connect! on(s) 610 can enable Wi-Fi-based communication such as via frequencies defined by the IEEE 602.11 standards, short range wireless frequencies such as BLUETOOTH®, or any suitable wired or wireless communications protocol that enables the respective computing device to interface with the other computing device(s).
  • the training data 638 may also include ground truth data 642.
  • ground truth data 642 for every instance of log data 640 used to train the machine-learned model(s) 636, a corresponding instance of ground truth data 642 may be stored.
  • the ground truth data 642 may include tracked object data associated with an object.
  • the ground truth data 642 may include a bounding box associated with an object, a trajectory traversed by the object, a classification of the object, or the like.
  • the ground truth data 642 may be determined by a human labeler, a trained machine- learned model, or the like.
  • the processor(s) 616 of the vehicle 602 and the processor(s) 630 of the computing device(s) 628 can be any suitable processor capable of executing instructions to process data and perform operations as described herein.
  • the processor(s) 616 and 632 can comprise one or more Central Processing Units (CPUs), Graphics Processing Units (GPUs), or any other device or portion of a device that processes electronic data to transform that electronic data into other electronic data that can be stored in registers and/or memory.
  • integrated circuits e.g., ASICs, etc.
  • gate arrays e.g., FPGAs, etc.
  • other hardware devices can also be considered processors in so far as they are configured to implement encoded instructions.
  • components of the vehicle 602 can be associated with the computing device(s) 628 and/or the components of the computing device(s) 628 can be associated with the vehicle 602. That is, the vehicle 602 can perform one or more of the functions associated with the computing device(s) 628, and vice versa.
  • vehicle computing device(s) 604 and the computing device(s) 628 are shown to include multiple components, in some examples, such components can be associated with more or fewer individual components.
  • the localization component 620, the perception component 622, the prediction component 130, and/or the planner component 134 can be combined into a single component. That is, while depicted as separate components, any one or more of the components can be combined.
  • a method comprising: receiving, at a first time, sensor data associated with a vehicle operating in an environment; inputting the sensor data into a machine-learned model that is configured to: determine a location of an object in the environment based at least in part on the sensor data; determine an estimated location of a tracked object in the environment based at least in part on a trajectory traversed by the tracked object, the trajectory determined based at least in part on additional sensor data received by the machine-learned model prior to the first time; and associate the object with the tracked object based at least in part on the location and the estimated location; receiving, as an output from the machine-learned model, an indication of the location and the trajectory; and controlling the vehicle based at least in part on the output.
  • N The method as recited in any one of paragraphs F-M, wherein the output from the machine- learned model further includes a bounding box associated with the object, the bounding box indicative of at least one of: the location of the object in the environment relative to the vehicle; a size associated with the object; or an orientation associated with the object.
  • the sensor data comprises at least one of: image data; lidar data; radar data; or time of flight data.
  • the machine-learned model comprises: a first portion that is configured to determine the predicted bounding box, the predicted bounding box indicating a location of the object in the environment; and a second portion that is configured to: determine, based at least in part on tracking data associated with the object, an estimated location of the object in the environment; associate the predicted bounding box with the tracking data based at least in part on the location and the estimated location; and determine a predicted trajectory traversed by the object based at least in part on the location and the tracking data.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • Multimedia (AREA)
  • Transportation (AREA)
  • Automation & Control Theory (AREA)
  • Mechanical Engineering (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Mathematical Physics (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Human Computer Interaction (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Molecular Biology (AREA)
  • Image Analysis (AREA)
  • Traffic Control Systems (AREA)
  • Control Of Driving Devices And Active Controlling Of Vehicle (AREA)
PCT/US2022/051550 2021-12-03 2022-12-01 Vehicle perception system with temporal tracker Ceased WO2023102126A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
CN202280079871.XA CN118339067A (zh) 2021-12-03 2022-12-01 具有时间跟踪器的车辆感知系统
JP2024532785A JP2024544196A (ja) 2021-12-03 2022-12-01 時間トラッカー付き車両知覚システム
EP22902186.0A EP4440897A4 (en) 2021-12-03 2022-12-01 VEHICLE PERCEPTION SYSTEM WITH TIME TRACKER

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US17/542,352 2021-12-03
US17/542,352 US12030528B2 (en) 2021-12-03 2021-12-03 Vehicle perception system with temporal tracker

Publications (1)

Publication Number Publication Date
WO2023102126A1 true WO2023102126A1 (en) 2023-06-08

Family

ID=86608987

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2022/051550 Ceased WO2023102126A1 (en) 2021-12-03 2022-12-01 Vehicle perception system with temporal tracker

Country Status (5)

Country Link
US (1) US12030528B2 (https=)
EP (1) EP4440897A4 (https=)
JP (1) JP2024544196A (https=)
CN (1) CN118339067A (https=)
WO (1) WO2023102126A1 (https=)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12488595B2 (en) * 2022-12-05 2025-12-02 Nvidia Corporation Object track management for autonomous systems and applications
US12436531B2 (en) * 2023-05-04 2025-10-07 AIM Intelligent Machines, Inc. Augmented learning model for autonomous earth-moving vehicles

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180322650A1 (en) * 2017-03-30 2018-11-08 Uber Technologies, Inc. Systems and Methods to Track Vehicles Proximate Perceived By an Autonomous Vehicle
JP2019109691A (ja) * 2017-12-18 2019-07-04 日立オートモティブシステムズ株式会社 移動体挙動予測装置および移動体挙動予測方法
US10776941B2 (en) * 2018-07-02 2020-09-15 Chiral Software, Inc. Optimized neural network structure
US10936902B1 (en) * 2018-11-27 2021-03-02 Zoox, Inc. Training bounding box selection
KR102313940B1 (ko) * 2021-05-14 2021-10-18 주식회사 인피닉 연속된 3d 데이터에서 객체 추적 방법 및 이를 실행하기 위하여 기록매체에 기록된 컴퓨터 프로그램

Family Cites Families (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11205120B2 (en) * 2016-12-22 2021-12-21 Samsung Electronics Co., Ltd System and method for training deep learning classification networks
US10809361B2 (en) * 2017-05-31 2020-10-20 Uatc, Llc Hybrid-view LIDAR-based object detection
US20190079526A1 (en) * 2017-09-08 2019-03-14 Uber Technologies, Inc. Orientation Determination in Object Detection and Tracking for Autonomous Vehicles
WO2019094843A1 (en) * 2017-11-10 2019-05-16 Nvidia Corporation Systems and methods for safe and reliable autonomous vehicles
US11157527B2 (en) 2018-02-20 2021-10-26 Zoox, Inc. Creating clean maps including semantic information
US11768292B2 (en) * 2018-03-14 2023-09-26 Uatc, Llc Three-dimensional object detection
CN119078812A (zh) * 2018-08-14 2024-12-06 御眼视觉技术有限公司 具有安全距离的导航的系统和方法
US11195418B1 (en) 2018-10-04 2021-12-07 Zoox, Inc. Trajectory prediction on top-down scenes and associated model
US20200133272A1 (en) 2018-10-29 2020-04-30 Aptiv Technologies Limited Automatic generation of dimensionally reduced maps and spatiotemporal localization for navigation of a vehicle
US11170299B2 (en) * 2018-12-28 2021-11-09 Nvidia Corporation Distance estimation to objects and free-space boundaries in autonomous machine applications
US11354913B1 (en) * 2019-11-27 2022-06-07 Woven Planet North America, Inc. Systems and methods for improving vehicle predictions using point representations of scene
US11462041B2 (en) * 2019-12-23 2022-10-04 Zoox, Inc. Pedestrians with objects
US11789155B2 (en) * 2019-12-23 2023-10-17 Zoox, Inc. Pedestrian object detection training
US11719821B2 (en) * 2020-02-21 2023-08-08 BlueSpace.ai, Inc. Method for object avoidance during autonomous navigation
KR102462733B1 (ko) * 2020-03-03 2022-11-04 한국과학기술연구원 샴 네트워크를 활용하는 강인한 다중 객체 검출 장치 및 방법
JP7359735B2 (ja) * 2020-04-06 2023-10-11 トヨタ自動車株式会社 物体状態識別装置、物体状態識別方法及び物体状態識別用コンピュータプログラムならびに制御装置
US11708093B2 (en) * 2020-05-08 2023-07-25 Zoox, Inc. Trajectories with intent
EP3935413B1 (en) * 2020-05-15 2024-07-24 Baidu.com Times Technology (Beijing) Co., Ltd. Partial point cloud-based pedestrians' velocity estimation method
CA3139421A1 (en) * 2020-11-17 2022-05-17 Uatc, Llc Automatic annotation of object trajectories in multiple dimensions
US12214801B2 (en) * 2020-11-17 2025-02-04 Aurora Operations, Inc. Generating autonomous vehicle testing data through perturbations and adversarial loss functions
US12060076B2 (en) * 2020-12-09 2024-08-13 Zoox, Inc. Determining inputs for perception system
JP2023552105A (ja) * 2020-12-11 2023-12-14 ヒューマニシング オートノミー リミテッド 人間行動のオクルージョン対応予測
US11538185B2 (en) 2020-12-11 2022-12-27 Zoox, Inc. Localization based on semantic objects
US11810365B1 (en) * 2020-12-15 2023-11-07 Zoox, Inc. Perception error modeling
US11603116B2 (en) * 2020-12-16 2023-03-14 Zoox, Inc. Determining safety area based on bounding box
EP4264181A4 (en) * 2020-12-17 2024-10-30 May Mobility, Inc. METHOD AND SYSTEM FOR DYNAMIC UPDATING AN ENVIRONMENTAL REPRESENTATION OF AN AUTONOMOUS AGENT
US12030522B2 (en) * 2020-12-30 2024-07-09 Zoox, Inc. Collision avoidance using an object contour
US11829449B2 (en) * 2020-12-30 2023-11-28 Zoox, Inc. Intermediate input for machine learned model
CN113158763B (zh) * 2021-02-23 2021-12-07 清华大学 4d毫米波和激光点云多视角特征融合的三维目标检测方法
US11611707B2 (en) * 2021-05-24 2023-03-21 Anduril Industries, Inc. Auto-focus tracking for remote flying targets
US20230003872A1 (en) * 2021-06-30 2023-01-05 Zoox, Inc. Tracking objects with radar data
US12246748B2 (en) * 2022-01-31 2025-03-11 Zoox, Inc. Radar object classification based on radar cross-section data
US20230260266A1 (en) * 2022-02-15 2023-08-17 Waymo Llc Camera-radar data fusion for efficient object detection

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180322650A1 (en) * 2017-03-30 2018-11-08 Uber Technologies, Inc. Systems and Methods to Track Vehicles Proximate Perceived By an Autonomous Vehicle
JP2019109691A (ja) * 2017-12-18 2019-07-04 日立オートモティブシステムズ株式会社 移動体挙動予測装置および移動体挙動予測方法
US10776941B2 (en) * 2018-07-02 2020-09-15 Chiral Software, Inc. Optimized neural network structure
US10936902B1 (en) * 2018-11-27 2021-03-02 Zoox, Inc. Training bounding box selection
KR102313940B1 (ko) * 2021-05-14 2021-10-18 주식회사 인피닉 연속된 3d 데이터에서 객체 추적 방법 및 이를 실행하기 위하여 기록매체에 기록된 컴퓨터 프로그램

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP4440897A4 *

Also Published As

Publication number Publication date
CN118339067A (zh) 2024-07-12
EP4440897A1 (en) 2024-10-09
US20230174110A1 (en) 2023-06-08
JP2024544196A (ja) 2024-11-28
US12030528B2 (en) 2024-07-09
EP4440897A4 (en) 2025-12-03

Similar Documents

Publication Publication Date Title
JP7851244B2 (ja) 緊急車両検出
US11584389B2 (en) Teleoperations for collaborative vehicle guidance
US12055935B2 (en) Perpendicular cut-in training
US20210269065A1 (en) Perpendicular cut-in detection
US11919526B1 (en) Model-based vehicle drive system monitors
US12269462B1 (en) Spatial prediction
US12071157B1 (en) Correlating perception errors with vehicle behavior
JP2024513666A (ja) ログデータに基づいたシミュレートされた環境でのオブジェクトのインスタンス化
US11710296B2 (en) End-to-end vehicle perception system training
US12030528B2 (en) Vehicle perception system with temporal tracker
WO2021097070A1 (en) Collision monitoring using statistic models
EP4630304A1 (en) Systems and methods for disengaging or engaging autonomy remotely
US12240445B1 (en) Object tracking based on temporal data attribute inferences
JP2025522708A (ja) 通行権の決定
US11938966B1 (en) Vehicle perception system validation
US20250259454A1 (en) Multi-attention machine learning for object detection and classification
EP4136004B1 (en) Teleoperations for collaborative vehicle guidance
US12333823B1 (en) Machine-learned model training for inferring object attributes from distortions in temporal data
US11780471B1 (en) System for determining a state of an object using thermal data
JP7662656B2 (ja) 直角割り込みトレーニング
JP2024509086A (ja) 運転シミュレーションにおけるエージェント変換
US12585272B1 (en) Error prioritization for improving machine learned models
US11018974B1 (en) Context based bandwidth switching
US12436284B1 (en) Inferring object attributes from track associations
US12287405B1 (en) Inferring object attributes from distortions in temporal data

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22902186

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2024532785

Country of ref document: JP

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 202280079871.X

Country of ref document: CN

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2022902186

Country of ref document: EP

Effective date: 20240703