WO2023102126A1 - Vehicle perception system with temporal tracker - Google Patents
Vehicle perception system with temporal tracker Download PDFInfo
- Publication number
- WO2023102126A1 WO2023102126A1 PCT/US2022/051550 US2022051550W WO2023102126A1 WO 2023102126 A1 WO2023102126 A1 WO 2023102126A1 US 2022051550 W US2022051550 W US 2022051550W WO 2023102126 A1 WO2023102126 A1 WO 2023102126A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- data
- machine
- learned model
- vehicle
- location
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W30/00—Purposes of road vehicle drive control systems not related to the control of a particular sub-unit, e.g. of systems using conjoint control of vehicle sub-units
- B60W30/18—Propelling the vehicle
- B60W30/18009—Propelling the vehicle related to particular drive situations
- B60W30/18145—Cornering
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W60/00—Drive control systems specially adapted for autonomous road vehicles
- B60W60/001—Planning or execution of driving tasks
- B60W60/0027—Planning or execution of driving tasks using trajectory prediction for other traffic participants
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W30/00—Purposes of road vehicle drive control systems not related to the control of a particular sub-unit, e.g. of systems using conjoint control of vehicle sub-units
- B60W30/18—Propelling the vehicle
- B60W30/18009—Propelling the vehicle related to particular drive situations
- B60W30/18159—Traversing an intersection
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/09—Supervised learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/764—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/50—Context or environment of the image
- G06V20/56—Context or environment of the image exterior to a vehicle by using sensors mounted on the vehicle
- G06V20/58—Recognition of moving objects or obstacles, e.g. vehicles or pedestrians; Recognition of traffic objects, e.g. traffic signs, traffic lights or roads
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W2554/00—Input parameters relating to objects
- B60W2554/40—Dynamic objects, e.g. animals, windblown objects
- B60W2554/402—Type
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W2554/00—Input parameters relating to objects
- B60W2554/40—Dynamic objects, e.g. animals, windblown objects
- B60W2554/404—Characteristics
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W2554/00—Input parameters relating to objects
- B60W2554/80—Spatial relation or speed relative to objects
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W2556/00—Input parameters relating to data
- B60W2556/10—Historical data
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W2556/00—Input parameters relating to data
- B60W2556/25—Data precision
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W2556/00—Input parameters relating to data
- B60W2556/45—External transmission of data to or from the vehicle
- B60W2556/50—External transmission of data to or from the vehicle of positioning data, e.g. GPS [Global Positioning System] data
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
- G06N3/0442—Recurrent networks, e.g. Hopfield networks characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0464—Convolutional networks [CNN, ConvNet]
Definitions
- FIG. 3 illustrates example detail of a detection component and a tracking component of a machine-learned model according to some of the technologies disclosed herein.
- a bounding box may indicate, among other things, a location of an object in the environment at to, an orientation of the object at to, a size of the object, or the like.
- the object data may indicate a classification and/or type of the object, such as whether the object is a dynamic object that is capable of movement (e.g., a vehicle, motorcycle, bicycle, pedestrian, animal, etc.) and/or a static object (e.g., abuilding, road surface, tree, sign, barrier, curb, parked vehicle, etc.).
- the object data may indicate other information associated with the object, such as a velocity of the object at to, a confidence associated with the object, or the like.
- the machine-learned model may include an obj ect tracking portion.
- the object tracking portion may be configured to perform multiple subprocesses to determine trajectories traversed by the detected objects during a period of time from to-N-to, where “N” represents any integer greater than or equal to one.
- the object tracking portion may include, among other things, a location estimation subprocess, a track-location association subprocess, a trajectory estimation subprocess, and/or a track storage subprocess.
- the vehicle may be controlled based at least in part on the tracked object data. For instance, at least a portion of the tracked object data may be used as in input to another system associated with the vehicle, such as a prediction system, planning system, or the like.
- this disclosure is also directed to techniques for end-to-end training of a machine-learned model that includes multiple portions (e.g., stages, subprocesses, components, etc.).
- a machine-learned model is trained end-to-end, outputs of the machine-learned model are used to train individual portions of the machine-learned model, which may otherwise exist as independent machine-learned models.
- the intermediary outputs of one portion of the machine-learned model may be specifically tailored for use as inputs by other portions of the machine-learned model.
- a method associated with end-to-end training of a machine-learned model may include techniques of receiving sensor data representing a vehicle traversing an environment.
- the sensor data may comprise stored log data associated with the vehicle.
- the sensor data/log data may be image data, lidar data, radar data, time of flight data, or the like.
- the sensor data may be a time-ordered collection of image frames representing the sensor data associated with the environment, such that a first frame represents the environment at a first time, a second frame represents the environment at a second time, and so forth.
- the techniques may include receiving an output from the machine-learned model.
- the output may include predicted tracked object data that includes, among other things, the predicted bounding box and the predicted trajectory.
- the ground truth data and the predicted tracked object data may be compared to determine whether differences exist between the ground truth data and predicted tracked object data. In some instances, if a difference is determined to meet or exceed a threshold difference, a parameter of the machine-learned model may be altered to minimize the difference. In some instances, a parameter of one or more portions of the machine-learned model may be altered.
- the objects 108(1) and 108(2) may be referred to herein collectively as “objects 108.”
- the sensor component s) may include lidar sensors, radar sensors, ultrasonic transducers, sonar sensors, location sensors (e.g., global positioning component (GPS), compass, etc.), inertial sensors (e.g., inertial measurement units, accelerometers, magnetometers, gyroscopes, etc.), cameras (e.g., RGB, IR, intensity, depth, etc.), wheel encoders, microphones, environment sensors (e.g., temperature sensors, humidity sensors, light sensors, pressure sensors, etc.), time of flight (ToF) sensors, etc.
- GPS global positioning component
- inertial sensors e.g., inertial measurement units, accelerometers, magnetometers, gyroscopes, etc.
- cameras e.g., RGB, IR, intensity, depth, etc.
- wheel encoders e.g., microphones, environment sensors (e.g
- the obj ect data 116 may, in some instances, be used in whole or in part as an input to a tracking component 122 of the machine-learned model 112.
- the tracking component 122 may also receive, as an input, stored tracking data associated with the objects 108(1) and 108(2), which is discussed in further detail below.
- the tracking component 122 may determine, and the machine-learned model 112 may output, tracked object data 124.
- the tracked object data 124 may include the top-down data 118 representing the view of the environment 104 from the top-down perspective, with movement data indicative of object movement.
- the tracked object data 124 may also include trajectories traversed by the objects in the environment 104, such as the trajectory 126(1) traversed by the object 108(1) and the trajectory 126(2) traversed by the object 108(2).
- the tracked object data 124 may, in some examples, be forwarded to a prediction component 128 executing on the computing device(s) 110.
- the prediction component 128 includes one or more machine-learned models that are trained to make predictions about the objects in the environment 104.
- the prediction component 128 may determine prediction data 130 associated with the objects based at least in part on some or all of the tracked object data 124.
- the prediction data 130 may include the top-down data 118 representing the view of the environment 104 from the top-down perspective. Additionally, or alternatively, the prediction data 130 may indicate one or more prediction(s) associated with the objects. For instance, the prediction(s) 132(1) and 132(2) associated with the objects 108(1) and 108(2), respectively, include trajectories that the objects 108(1) and 108(2) are predicted to traverse in the environment 104 during a future period of time.
- the prediction data 130 may be forwarded as an input to a planner component 134 of the vehicle 102 that is executing on the computing device(s) 110.
- the planner component 134 includes one or more machine-learned models and/or other algorithms that are configured to determine a planned trajectory for the vehicle 102 to follow through the environment 104. As such, the planner component 134 may determine the planned trajectory of the vehicle 102 based at least in part on the tracked object data 124 and/or the prediction data 130.
- any one of the sensor data 106, the object data 116, the tracked object data 124, or the prediction data 130 may be formatted as a multi-channel image, where individual channels of the multi-channel image may represent a selection of information.
- a first channel of a multi-channel image may include bounding box(es) associated with object(s)
- a second channel of the multi-channel image may include velocity information associated with the object(s)
- a third channel of the multi-channel image may include environmental data (e.g., surface or lane markings, traffic light information, etc.), and so forth.
- multiple different channels of the multichannel image may include a portion of the same information.
- a first channel and a second channel of the multi-channel image may each include bounding boxes associated with the same objects in an environment.
- a channel can be represented as a color and/or a layer in a three-dimensional image stack, for example.
- the LSTM component 316 may receive data and other information associated with the objects/features, such as a feature tensor, finite differences, and an output from the MLP component 312, and determine one or more trajectory -lev el representation(s) 318.
- the MLP refinement component 320 may utilize the trajectory level representation(s) 318 to determine score(s), position offset(s), and other data 328.
- the methods 400 and 500 are illustrated as collections of blocks in logical flow graphs, which represent sequences of operations that can be implemented in hardware, software, or a combination thereof.
- the blocks represent computer-executable instructions stored on one or more computer-readable storage media that, when executed by one or more processors, perform the recited operations.
- computer-executable instructions include routines, programs, objects, components, data structures, and the like that perform particular functions or implement particular abstract data types.
- the order in which the operations are described is not intended to be construed as a limitation, and any number of the described blocks can be combined in any order and/or in parallel to implement the processes. In some embodiments, one or more blocks of the process may be omitted entirely.
- the methods 400 and 500 may be combined in whole or in part with each other or with other methods.
- the method 400 includes performing a first action.
- the first action can include determining that the machine-learned model is a trained machine-learned model. Additionally, or alternatively, the first action can include sending the machine-learned model to the vehicle for use by the vehicle to traverse an environment. In some examples, the first action can include validating the performance of the machine-learned model in a simulation or with additional sensor data.
- the method 400 includes altering a parameter of the first portion of the machine-learned model. For instance, a parameter of the detection component 114 or the tracking component 122 may be altered, as well as, or in the alternative, subprocesses performed by those components.
- altering the parameter may comprise altering a software component of the first portion of the model or retraining the first portion of the model based at least in part on the difference.
- the first data determined by the first portion of the model may be adjusted such that, in future predictions, the first data is tailored for use as an input for the second portion of the machine- learned model.
- the vehicle computing device(s) 604 can include processor(s) 616 and memory 618 communicatively coupled with the processor(s) 616.
- the memory 618 of the vehicle computing device(s) 604 stores a localization component 620, a perception component 622, a prediction component 130, a planner component 134, and one or more system controller(s) 624.
- the perception component 622 can include the machine-learned model 112, as well as the tracking data 206.
- the machine-learned model 112, or at least a tracking component of the machine-learned model 112, may utilize the tracking data 206 to estimate locations of objects in the environment, as described herein.
- the tracking data 206 is stored in a memory, such as a long short-term memory.
- the vehicle 602 can also include communication connect! on(s) 610 that enable communication between the vehicle 602 and other local or remote computing device(s), such as the computing device(s) 628, as well as other remote or local computing device(s).
- the communication connection(s) 610 can facilitate communication with other local computing device(s) on the vehicle 602 and/or the drive system(s) 614.
- the communication connection(s) 610 can allow the vehicle to communicate with other nearby computing device(s) (e.g., other nearby vehicles, traffic signals, etc.).
- the communications connect! on(s) 610 also enable the vehicle 602 to communicate with a remote teleoperations computing device or other remote services.
- the communications connect! on(s) 610 can include physical and/or logical interfaces for connecting the vehicle computing device(s) 604 to another computing device or a network, such as network(s) 626.
- the communications connect! on(s) 610 can enable Wi-Fi-based communication such as via frequencies defined by the IEEE 602.11 standards, short range wireless frequencies such as BLUETOOTH®, or any suitable wired or wireless communications protocol that enables the respective computing device to interface with the other computing device(s).
- the training data 638 may also include ground truth data 642.
- ground truth data 642 for every instance of log data 640 used to train the machine-learned model(s) 636, a corresponding instance of ground truth data 642 may be stored.
- the ground truth data 642 may include tracked object data associated with an object.
- the ground truth data 642 may include a bounding box associated with an object, a trajectory traversed by the object, a classification of the object, or the like.
- the ground truth data 642 may be determined by a human labeler, a trained machine- learned model, or the like.
- the processor(s) 616 of the vehicle 602 and the processor(s) 630 of the computing device(s) 628 can be any suitable processor capable of executing instructions to process data and perform operations as described herein.
- the processor(s) 616 and 632 can comprise one or more Central Processing Units (CPUs), Graphics Processing Units (GPUs), or any other device or portion of a device that processes electronic data to transform that electronic data into other electronic data that can be stored in registers and/or memory.
- integrated circuits e.g., ASICs, etc.
- gate arrays e.g., FPGAs, etc.
- other hardware devices can also be considered processors in so far as they are configured to implement encoded instructions.
- components of the vehicle 602 can be associated with the computing device(s) 628 and/or the components of the computing device(s) 628 can be associated with the vehicle 602. That is, the vehicle 602 can perform one or more of the functions associated with the computing device(s) 628, and vice versa.
- vehicle computing device(s) 604 and the computing device(s) 628 are shown to include multiple components, in some examples, such components can be associated with more or fewer individual components.
- the localization component 620, the perception component 622, the prediction component 130, and/or the planner component 134 can be combined into a single component. That is, while depicted as separate components, any one or more of the components can be combined.
- a method comprising: receiving, at a first time, sensor data associated with a vehicle operating in an environment; inputting the sensor data into a machine-learned model that is configured to: determine a location of an object in the environment based at least in part on the sensor data; determine an estimated location of a tracked object in the environment based at least in part on a trajectory traversed by the tracked object, the trajectory determined based at least in part on additional sensor data received by the machine-learned model prior to the first time; and associate the object with the tracked object based at least in part on the location and the estimated location; receiving, as an output from the machine-learned model, an indication of the location and the trajectory; and controlling the vehicle based at least in part on the output.
- N The method as recited in any one of paragraphs F-M, wherein the output from the machine- learned model further includes a bounding box associated with the object, the bounding box indicative of at least one of: the location of the object in the environment relative to the vehicle; a size associated with the object; or an orientation associated with the object.
- the sensor data comprises at least one of: image data; lidar data; radar data; or time of flight data.
- the machine-learned model comprises: a first portion that is configured to determine the predicted bounding box, the predicted bounding box indicating a location of the object in the environment; and a second portion that is configured to: determine, based at least in part on tracking data associated with the object, an estimated location of the object in the environment; associate the predicted bounding box with the tracking data based at least in part on the location and the estimated location; and determine a predicted trajectory traversed by the object based at least in part on the location and the tracking data.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Computing Systems (AREA)
- Multimedia (AREA)
- Transportation (AREA)
- Automation & Control Theory (AREA)
- Mechanical Engineering (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Medical Informatics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Data Mining & Analysis (AREA)
- Mathematical Physics (AREA)
- General Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Human Computer Interaction (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Molecular Biology (AREA)
- Image Analysis (AREA)
- Traffic Control Systems (AREA)
- Control Of Driving Devices And Active Controlling Of Vehicle (AREA)
Priority Applications (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202280079871.XA CN118339067A (zh) | 2021-12-03 | 2022-12-01 | 具有时间跟踪器的车辆感知系统 |
| JP2024532785A JP2024544196A (ja) | 2021-12-03 | 2022-12-01 | 時間トラッカー付き車両知覚システム |
| EP22902186.0A EP4440897A4 (en) | 2021-12-03 | 2022-12-01 | VEHICLE PERCEPTION SYSTEM WITH TIME TRACKER |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US17/542,352 | 2021-12-03 | ||
| US17/542,352 US12030528B2 (en) | 2021-12-03 | 2021-12-03 | Vehicle perception system with temporal tracker |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2023102126A1 true WO2023102126A1 (en) | 2023-06-08 |
Family
ID=86608987
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/US2022/051550 Ceased WO2023102126A1 (en) | 2021-12-03 | 2022-12-01 | Vehicle perception system with temporal tracker |
Country Status (5)
| Country | Link |
|---|---|
| US (1) | US12030528B2 (https=) |
| EP (1) | EP4440897A4 (https=) |
| JP (1) | JP2024544196A (https=) |
| CN (1) | CN118339067A (https=) |
| WO (1) | WO2023102126A1 (https=) |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US12488595B2 (en) * | 2022-12-05 | 2025-12-02 | Nvidia Corporation | Object track management for autonomous systems and applications |
| US12436531B2 (en) * | 2023-05-04 | 2025-10-07 | AIM Intelligent Machines, Inc. | Augmented learning model for autonomous earth-moving vehicles |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20180322650A1 (en) * | 2017-03-30 | 2018-11-08 | Uber Technologies, Inc. | Systems and Methods to Track Vehicles Proximate Perceived By an Autonomous Vehicle |
| JP2019109691A (ja) * | 2017-12-18 | 2019-07-04 | 日立オートモティブシステムズ株式会社 | 移動体挙動予測装置および移動体挙動予測方法 |
| US10776941B2 (en) * | 2018-07-02 | 2020-09-15 | Chiral Software, Inc. | Optimized neural network structure |
| US10936902B1 (en) * | 2018-11-27 | 2021-03-02 | Zoox, Inc. | Training bounding box selection |
| KR102313940B1 (ko) * | 2021-05-14 | 2021-10-18 | 주식회사 인피닉 | 연속된 3d 데이터에서 객체 추적 방법 및 이를 실행하기 위하여 기록매체에 기록된 컴퓨터 프로그램 |
Family Cites Families (33)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US11205120B2 (en) * | 2016-12-22 | 2021-12-21 | Samsung Electronics Co., Ltd | System and method for training deep learning classification networks |
| US10809361B2 (en) * | 2017-05-31 | 2020-10-20 | Uatc, Llc | Hybrid-view LIDAR-based object detection |
| US20190079526A1 (en) * | 2017-09-08 | 2019-03-14 | Uber Technologies, Inc. | Orientation Determination in Object Detection and Tracking for Autonomous Vehicles |
| WO2019094843A1 (en) * | 2017-11-10 | 2019-05-16 | Nvidia Corporation | Systems and methods for safe and reliable autonomous vehicles |
| US11157527B2 (en) | 2018-02-20 | 2021-10-26 | Zoox, Inc. | Creating clean maps including semantic information |
| US11768292B2 (en) * | 2018-03-14 | 2023-09-26 | Uatc, Llc | Three-dimensional object detection |
| CN119078812A (zh) * | 2018-08-14 | 2024-12-06 | 御眼视觉技术有限公司 | 具有安全距离的导航的系统和方法 |
| US11195418B1 (en) | 2018-10-04 | 2021-12-07 | Zoox, Inc. | Trajectory prediction on top-down scenes and associated model |
| US20200133272A1 (en) | 2018-10-29 | 2020-04-30 | Aptiv Technologies Limited | Automatic generation of dimensionally reduced maps and spatiotemporal localization for navigation of a vehicle |
| US11170299B2 (en) * | 2018-12-28 | 2021-11-09 | Nvidia Corporation | Distance estimation to objects and free-space boundaries in autonomous machine applications |
| US11354913B1 (en) * | 2019-11-27 | 2022-06-07 | Woven Planet North America, Inc. | Systems and methods for improving vehicle predictions using point representations of scene |
| US11462041B2 (en) * | 2019-12-23 | 2022-10-04 | Zoox, Inc. | Pedestrians with objects |
| US11789155B2 (en) * | 2019-12-23 | 2023-10-17 | Zoox, Inc. | Pedestrian object detection training |
| US11719821B2 (en) * | 2020-02-21 | 2023-08-08 | BlueSpace.ai, Inc. | Method for object avoidance during autonomous navigation |
| KR102462733B1 (ko) * | 2020-03-03 | 2022-11-04 | 한국과학기술연구원 | 샴 네트워크를 활용하는 강인한 다중 객체 검출 장치 및 방법 |
| JP7359735B2 (ja) * | 2020-04-06 | 2023-10-11 | トヨタ自動車株式会社 | 物体状態識別装置、物体状態識別方法及び物体状態識別用コンピュータプログラムならびに制御装置 |
| US11708093B2 (en) * | 2020-05-08 | 2023-07-25 | Zoox, Inc. | Trajectories with intent |
| EP3935413B1 (en) * | 2020-05-15 | 2024-07-24 | Baidu.com Times Technology (Beijing) Co., Ltd. | Partial point cloud-based pedestrians' velocity estimation method |
| CA3139421A1 (en) * | 2020-11-17 | 2022-05-17 | Uatc, Llc | Automatic annotation of object trajectories in multiple dimensions |
| US12214801B2 (en) * | 2020-11-17 | 2025-02-04 | Aurora Operations, Inc. | Generating autonomous vehicle testing data through perturbations and adversarial loss functions |
| US12060076B2 (en) * | 2020-12-09 | 2024-08-13 | Zoox, Inc. | Determining inputs for perception system |
| JP2023552105A (ja) * | 2020-12-11 | 2023-12-14 | ヒューマニシング オートノミー リミテッド | 人間行動のオクルージョン対応予測 |
| US11538185B2 (en) | 2020-12-11 | 2022-12-27 | Zoox, Inc. | Localization based on semantic objects |
| US11810365B1 (en) * | 2020-12-15 | 2023-11-07 | Zoox, Inc. | Perception error modeling |
| US11603116B2 (en) * | 2020-12-16 | 2023-03-14 | Zoox, Inc. | Determining safety area based on bounding box |
| EP4264181A4 (en) * | 2020-12-17 | 2024-10-30 | May Mobility, Inc. | METHOD AND SYSTEM FOR DYNAMIC UPDATING AN ENVIRONMENTAL REPRESENTATION OF AN AUTONOMOUS AGENT |
| US12030522B2 (en) * | 2020-12-30 | 2024-07-09 | Zoox, Inc. | Collision avoidance using an object contour |
| US11829449B2 (en) * | 2020-12-30 | 2023-11-28 | Zoox, Inc. | Intermediate input for machine learned model |
| CN113158763B (zh) * | 2021-02-23 | 2021-12-07 | 清华大学 | 4d毫米波和激光点云多视角特征融合的三维目标检测方法 |
| US11611707B2 (en) * | 2021-05-24 | 2023-03-21 | Anduril Industries, Inc. | Auto-focus tracking for remote flying targets |
| US20230003872A1 (en) * | 2021-06-30 | 2023-01-05 | Zoox, Inc. | Tracking objects with radar data |
| US12246748B2 (en) * | 2022-01-31 | 2025-03-11 | Zoox, Inc. | Radar object classification based on radar cross-section data |
| US20230260266A1 (en) * | 2022-02-15 | 2023-08-17 | Waymo Llc | Camera-radar data fusion for efficient object detection |
-
2021
- 2021-12-03 US US17/542,352 patent/US12030528B2/en active Active
-
2022
- 2022-12-01 CN CN202280079871.XA patent/CN118339067A/zh active Pending
- 2022-12-01 WO PCT/US2022/051550 patent/WO2023102126A1/en not_active Ceased
- 2022-12-01 JP JP2024532785A patent/JP2024544196A/ja active Pending
- 2022-12-01 EP EP22902186.0A patent/EP4440897A4/en active Pending
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20180322650A1 (en) * | 2017-03-30 | 2018-11-08 | Uber Technologies, Inc. | Systems and Methods to Track Vehicles Proximate Perceived By an Autonomous Vehicle |
| JP2019109691A (ja) * | 2017-12-18 | 2019-07-04 | 日立オートモティブシステムズ株式会社 | 移動体挙動予測装置および移動体挙動予測方法 |
| US10776941B2 (en) * | 2018-07-02 | 2020-09-15 | Chiral Software, Inc. | Optimized neural network structure |
| US10936902B1 (en) * | 2018-11-27 | 2021-03-02 | Zoox, Inc. | Training bounding box selection |
| KR102313940B1 (ko) * | 2021-05-14 | 2021-10-18 | 주식회사 인피닉 | 연속된 3d 데이터에서 객체 추적 방법 및 이를 실행하기 위하여 기록매체에 기록된 컴퓨터 프로그램 |
Non-Patent Citations (1)
| Title |
|---|
| See also references of EP4440897A4 * |
Also Published As
| Publication number | Publication date |
|---|---|
| CN118339067A (zh) | 2024-07-12 |
| EP4440897A1 (en) | 2024-10-09 |
| US20230174110A1 (en) | 2023-06-08 |
| JP2024544196A (ja) | 2024-11-28 |
| US12030528B2 (en) | 2024-07-09 |
| EP4440897A4 (en) | 2025-12-03 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP7851244B2 (ja) | 緊急車両検出 | |
| US11584389B2 (en) | Teleoperations for collaborative vehicle guidance | |
| US12055935B2 (en) | Perpendicular cut-in training | |
| US20210269065A1 (en) | Perpendicular cut-in detection | |
| US11919526B1 (en) | Model-based vehicle drive system monitors | |
| US12269462B1 (en) | Spatial prediction | |
| US12071157B1 (en) | Correlating perception errors with vehicle behavior | |
| JP2024513666A (ja) | ログデータに基づいたシミュレートされた環境でのオブジェクトのインスタンス化 | |
| US11710296B2 (en) | End-to-end vehicle perception system training | |
| US12030528B2 (en) | Vehicle perception system with temporal tracker | |
| WO2021097070A1 (en) | Collision monitoring using statistic models | |
| EP4630304A1 (en) | Systems and methods for disengaging or engaging autonomy remotely | |
| US12240445B1 (en) | Object tracking based on temporal data attribute inferences | |
| JP2025522708A (ja) | 通行権の決定 | |
| US11938966B1 (en) | Vehicle perception system validation | |
| US20250259454A1 (en) | Multi-attention machine learning for object detection and classification | |
| EP4136004B1 (en) | Teleoperations for collaborative vehicle guidance | |
| US12333823B1 (en) | Machine-learned model training for inferring object attributes from distortions in temporal data | |
| US11780471B1 (en) | System for determining a state of an object using thermal data | |
| JP7662656B2 (ja) | 直角割り込みトレーニング | |
| JP2024509086A (ja) | 運転シミュレーションにおけるエージェント変換 | |
| US12585272B1 (en) | Error prioritization for improving machine learned models | |
| US11018974B1 (en) | Context based bandwidth switching | |
| US12436284B1 (en) | Inferring object attributes from track associations | |
| US12287405B1 (en) | Inferring object attributes from distortions in temporal data |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 22902186 Country of ref document: EP Kind code of ref document: A1 |
|
| ENP | Entry into the national phase |
Ref document number: 2024532785 Country of ref document: JP Kind code of ref document: A |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 202280079871.X Country of ref document: CN |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| ENP | Entry into the national phase |
Ref document number: 2022902186 Country of ref document: EP Effective date: 20240703 |