CN115984658A - Multi-sensor fusion vehicle window identification method and system and readable storage medium - Google Patents

Multi-sensor fusion vehicle window identification method and system and readable storage medium

Info

Publication number
CN115984658A
CN115984658A (application CN202310066606.4A)
Authority
CN
China
Prior art keywords
point cloud
data
features
illumination
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202310066606.4A
Other languages
Chinese (zh)
Other versions
CN115984658B (en)
Inventor
秦广军
孙锦涛
肖利民
杨钰杰
张铭芳
刘晶晶
韩萌
林浩田
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Changzhou Weishi Intelligent Iot Innovation Center Co ltd
Original Assignee
Changzhou Weishi Intelligent Iot Innovation Center Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Changzhou Weishi Intelligent Iot Innovation Center Co ltd filed Critical Changzhou Weishi Intelligent Iot Innovation Center Co ltd
Priority to CN202310066606.4A priority Critical patent/CN115984658B/en
Publication of CN115984658A publication Critical patent/CN115984658A/en
Application granted granted Critical
Publication of CN115984658B publication Critical patent/CN115984658B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00Road transport of goods or passengers
    • Y02T10/10Internal combustion engine [ICE] based vehicles
    • Y02T10/40Engine management systems

Landscapes

  • Image Processing (AREA)
  • Image Analysis (AREA)

Abstract

The invention provides a multi-sensor fusion vehicle window identification method, a multi-sensor fusion vehicle window identification system and a readable storage medium. Illumination compensation is carried out on rgb image data according to illumination data, and image features are extracted; temperature sensor data and a laser radar point cloud array are collected; the laser radar point cloud array is compensated according to the temperature sensor data, and point cloud features are extracted point by point; the image features and the point cloud features are fused at the pixel level; and the vehicle window is identified according to the fused features. A dynamic SSR algorithm is used to compensate and fuse the illumination data with the original rgb image, enhancing the image and reducing the influence of adverse illumination on vehicle window identification; temperature sensor data are acquired, and temperature compensation and fusion are applied to the radar point cloud array based on a tested laser radar distance-temperature compensation system, reducing the influence of temperature-induced point cloud inaccuracy on vehicle window identification.

Description

Multi-sensor fusion vehicle window identification method and system and readable storage medium
Technical Field
The invention relates to the field of vehicle window identification, in particular to a multi-sensor fusion vehicle window identification method and system and a readable storage medium.
Background
In prior-art vehicle window identification, over-bright and over-dark regions easily appear in the rgb image under adverse illumination, so directly using the original rgb image for feature extraction and subsequent vehicle window detection tends to give poor results. Meanwhile, under low-temperature conditions the depth data of positions away from the central area of the point cloud are often invalid or inaccurate, causing measured volumes to come out smaller than they are; under high-temperature conditions the noise in edge regions also increases, and a phenomenon of being higher inside and spreading outward can occasionally appear. The acquired raw data are therefore inaccurate, which affects vehicle window identification.
The above problems are currently in need of solution.
Disclosure of Invention
The invention aims to provide a multi-sensor fusion vehicle window identification method, a multi-sensor fusion vehicle window identification system and a readable storage medium.
In order to solve the technical problem, the invention provides a multi-sensor fusion vehicle window identification method, which comprises the following steps:
acquiring illumination data and rgb image data;
performing illumination compensation on the rgb image data according to the illumination data, and extracting image features;
collecting temperature sensor data and a laser radar point cloud array;
compensating the laser radar point cloud array according to the temperature sensor data, and extracting point cloud characteristics point by point;
performing pixel level fusion on the image features and the point cloud features;
and identifying the vehicle window according to the fused features.
Further, the method for performing illumination compensation on the rgb image data according to the illumination data is that the illumination compensation is dynamically performed on the rgb image by using a Single Scale Retinex algorithm based on the illumination data, and the expression is as follows:
Figure SMS_1
in the formula, I(x, y) is the original image, R(x, y) is the reflection component, L(x, y) is the illumination component, i represents the ith color channel, * represents convolution, and G(x, y) is the Gaussian surround function;
the formula for G (x, y) is as follows:
Figure SMS_2
and λ satisfies:
Figure SMS_3
where
Figure SMS_4
is the Gaussian surround scale parameter, dynamically adjusted according to the illumination variable Li, and
Figure SMS_5
is the adjustment factor; lx is the unit of illuminance:
Figure SMS_6
After obtaining
Figure SMS_7
the result is converted from the logarithmic domain to the real domain to obtain R(x, y), and linear stretching is then performed to obtain the output image; the final linear stretching formula is:
Figure SMS_8
in the formula,
Figure SMS_9
is the image data obtained by dynamically performing illumination compensation on the rgb image with the Single Scale Retinex algorithm.
Further, the step of performing image feature extraction includes:
inputting the compensated image data into a trained ResNeXt CNN network to obtain a feature map;
and setting candidate ROIs on the feature map, sending the candidate ROIs into an RPN (region proposal network) to classify and filter part of the ROIs, then performing an ROI Align operation to put the rgb image data into pixel correspondence with the feature map, and obtaining a feature map for each candidate ROI.
Further, the step of compensating the laser radar point cloud array according to the temperature sensor data and extracting point cloud characteristics point by point comprises:
collecting distance data measured by the laser radar at different temperatures;
constructing a distance-temperature variation relation from the distance data;
generating the relation between compensation time and temperature according to the speed-of-light distance relation;
finally, according to the current temperature of the laser radar, searching a corresponding compensation distance in the spline interpolation table, and calculating the compensation distance of each depth data in the point cloud array, thereby completing the compensation of the point cloud array;
and inputting the compensated point cloud array into a point cloud feature extraction network for feature extraction to obtain point cloud features.
Further, the step of identifying the vehicle window according to the fused features includes:
feeding pixel-level fusion features into FC layerTo obtain a feature vector
Figure SMS_10
For the feature vector->
Figure SMS_11
Carrying out Sigmoid operation to carry out confidence judgment;
using the bounding-box regression algorithm of RCNN, processing the input
Figure SMS_12
to obtain an offset function, and then performing offset adjustment on the original frame;
carrying out bilinear interpolation on the pixel-level fusion features meeting the threshold requirement, scaling them to the size of the ROI Align feature map, and sending the feature map into an FCN (fully convolutional network) to generate a vehicle window mask, wherein the size of a single mask is as follows:
Figure SMS_13
and performing a resize operation and background filling on the vehicle window mask according to the original Proposal and the image size to obtain a mask matching the original image size, drawing the mask and the regressed bounding box on the original image, and displaying the confidence judgment result of the Sigmoid branch in the corresponding area to complete vehicle window identification.
Further, the offset function is:
Figure SMS_14
wherein P represents the original Proposal,
Figure SMS_15
represents the input feature vector,
Figure SMS_16
represents the predicted offset value, and w represents the learned parameters;
the offset adjustment of the original frame comprises translation and size scaling:
translation (
Figure SMS_17
) is expressed as:
Figure SMS_18
size scaling (
Figure SMS_19
) is expressed as:
Figure SMS_20
further, the step of performing pixel-level fusion on the image features and the point cloud features includes:
and splicing the color features of each pixel point and the depth features of the corresponding point cloud points on the channel to obtain a group of fused features, wherein the process is as follows:
Figure SMS_21
Figure SMS_22
Figure SMS_23
Figure SMS_24
Figure SMS_25
where
Figure SMS_26
and
Figure SMS_27
are the extracted point cloud features and image features respectively,
Figure SMS_28
denotes a splicing operation,
Figure SMS_29
is the relu activation function,
Figure SMS_30
denotes a 1*1 convolution operation, and
Figure SMS_31
denotes a splicing operation on the channel;
feeding the fused feature into an mlp and obtaining the global feature by average pooling:
Figure SMS_32
where
Figure SMS_33
is the global average pooling operation and
Figure SMS_34
is the mlp processing operation;
and splicing the global feature and the fused feature on the channel to obtain the pixel-level fusion feature:
Figure SMS_35
Figure SMS_36
is the dense fused feature.
The invention also provides a multi-sensor fusion vehicle window identification system, which comprises:
a first acquisition module adapted to acquire illumination data and rgb image data;
The first processing module is suitable for performing illumination compensation on rgb image data according to the illumination data and performing image feature extraction;
the second acquisition module is suitable for acquiring temperature sensor data and a laser radar point cloud array;
the second processing module is suitable for compensating the laser radar point cloud array according to the temperature sensor data and extracting point cloud characteristics point by point;
the fusion module is suitable for carrying out pixel level fusion on the image characteristics and the point cloud characteristics;
and the identification module is suitable for identifying the vehicle window according to the fused features.
The invention also provides a computer-readable storage medium, wherein at least one instruction is stored in the computer-readable storage medium, and the instruction is executed by a processor to realize the multi-sensor fusion vehicle window identification method.
The invention also provides an electronic device, comprising a memory and a processor; at least one instruction is stored in the memory; the processor is used for realizing the multi-sensor fusion vehicle window identification method by loading and executing the at least one instruction.
The invention has the beneficial effects that it provides a multi-sensor fusion vehicle window identification method, system and readable storage medium, wherein the multi-sensor fusion vehicle window identification method comprises acquiring illumination data and rgb image data; performing illumination compensation on the rgb image data according to the illumination data and extracting image features; collecting temperature sensor data and a laser radar point cloud array; compensating the laser radar point cloud array according to the temperature sensor data and extracting point cloud features point by point; performing pixel-level fusion on the image features and the point cloud features; and identifying the vehicle window according to the fused features. A dynamic SSR algorithm is used to compensate and fuse the illumination data with the original rgb image, enhancing the image and reducing the influence of adverse illumination on vehicle window identification; temperature sensor data are acquired, and temperature compensation and fusion are applied to the radar point cloud array based on a tested laser radar distance-temperature compensation system, reducing the influence of temperature-induced point cloud inaccuracy on vehicle window identification.
Drawings
The invention is further illustrated by the following examples in conjunction with the drawings.
Fig. 1 is a flowchart of a multi-sensor fusion vehicle window identification method according to an embodiment of the present invention.
Fig. 2 is a schematic block diagram of a multi-sensor fusion vehicle window identification system provided by an embodiment of the invention.
Fig. 3 is a partial functional block diagram of an electronic device provided by an embodiment of the invention.
Detailed Description
The present invention will now be described in further detail with reference to the accompanying drawings. These drawings are simplified schematic views that illustrate only the basic structure of the present invention, and thus show only the components related to the present invention.
Example 1
Referring to fig. 1-3, an embodiment of the invention provides a multi-sensor fusion vehicle window identification method, which compensates and fuses illumination data with the original rgb image using a dynamic SSR algorithm, enhancing the image and reducing the influence of adverse illumination on vehicle window identification; it also collects temperature sensor data and performs temperature compensation and fusion on the radar point cloud array based on a tested laser radar distance-temperature compensation system, reducing the influence of temperature-induced point cloud inaccuracy on vehicle window identification.
Specifically, the multi-sensor fusion vehicle window identification method comprises the following steps:
s110: illumination data is acquired as well as rgb image data.
Specifically, the illumination data are acquired by an illumination sensor, and the rgb image data are acquired by a 2D camera.
S120: and performing illumination compensation on the rgb image data according to the illumination data, and extracting image features.
Specifically, the method for performing illumination compensation on the rgb image data according to the illumination data is to dynamically perform illumination compensation on the rgb image by using a Single Scale Retinex algorithm based on the illumination data, and the expression is as follows:
Figure SMS_37
in the formula, I(x, y) is the original image, R(x, y) is the reflection component, L(x, y) is the illumination component, i represents the ith color channel, * represents convolution, and G(x, y) is the Gaussian surround function;
the formula for G (x, y) is as follows:
Figure SMS_38
and λ satisfies:
Figure SMS_39
where
Figure SMS_40
is the Gaussian surround scale parameter, dynamically adjusted according to the illumination variable Li, and
Figure SMS_41
is the adjustment factor; lx is the unit of illuminance.
After obtaining
Figure SMS_42
the result is converted from the logarithmic domain to the real domain to obtain R(x, y), and linear stretching is then performed to obtain the output image; the final linear stretching formula is:
Figure SMS_43
in the formula,
Figure SMS_44
is the image data obtained by dynamically performing illumination compensation on the rgb image with the Single Scale Retinex algorithm.
The step of performing image feature extraction includes:
inputting the compensated image data into a trained ResNeXt CNN network to obtain a feature map;
Candidate ROIs are set on the feature map and sent into an RPN (region proposal network) to classify and filter part of the ROIs; an ROI Align operation is then performed to put the rgb image data into pixel correspondence with the feature map, obtaining a feature map for each candidate ROI.
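A minimal sketch of this stage, using torchvision's ResNeXt-50 backbone and its roi_align operator as stand-ins; the two hand-written boxes merely take the place of the RPN proposals, and the input size, box coordinates and output size are assumptions made for illustration.

import torch
import torchvision
from torchvision.ops import roi_align

# Assumed backbone: torchvision's ResNeXt-50 with the classification head removed.
backbone = torch.nn.Sequential(
    *list(torchvision.models.resnext50_32x4d(weights=None).children())[:-2]).eval()

img = torch.rand(1, 3, 512, 512)             # compensated rgb image
with torch.no_grad():
    feat = backbone(img)                     # feature map, stride 32 -> (1, 2048, 16, 16)

# Candidate ROIs would come from the RPN; two hand-written boxes
# (batch_index, x1, y1, x2, y2) in image coordinates stand in for its output.
rois = torch.tensor([[0, 40., 60., 200., 180.],
                     [0, 250., 100., 400., 300.]])
roi_feats = roi_align(feat, rois, output_size=(7, 7), spatial_scale=1 / 32)
print(roi_feats.shape)                       # (2, 2048, 7, 7) per-ROI feature maps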
Specifically, in order to obtain a feature map of a fixed size, ROI Align uses bilinear interpolation to determine pixel values of a virtual point in the original image, using four actually existing pixel values around the virtual point:
Figure SMS_45
Figure SMS_46
Figure SMS_47
Figure SMS_48
represent the pixel values corresponding to the coordinate points; (
Figure SMS_49
), (
Figure SMS_50
), (
Figure SMS_51
), (
Figure SMS_52
) are respectively the upper-left, lower-left, upper-right and lower-right coordinates, and (x, y) are the coordinates of the obtained virtual point.
After ROI Align, the characteristics of the original image are scaled after bilinear interpolation, and the structure is as follows:
Figure SMS_53
where dim is the feature dimension, and H and W are the size scalars of the ROI area in the original image.
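The sampling rule applied at each virtual point can be written as a small helper; this is a generic sketch of the four-neighbour bilinear interpolation used inside ROI Align (boundary handling omitted for brevity).

import numpy as np

def bilinear_sample(feat, x, y):
    """Sample feature map `feat` (H x W x C) at the fractional point (x, y)
    from its four surrounding integer-coordinate pixels, as ROI Align does."""
    x0, y0 = int(np.floor(x)), int(np.floor(y))
    x1, y1 = x0 + 1, y0 + 1
    dx, dy = x - x0, y - y0
    return (feat[y0, x0] * (1 - dx) * (1 - dy) +   # upper-left
            feat[y0, x1] * dx * (1 - dy) +         # upper-right
            feat[y1, x0] * (1 - dx) * dy +         # lower-left
            feat[y1, x1] * dx * dy)                # lower-right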
S130: and collecting temperature sensor data and a laser radar point cloud array.
S140: and compensating the laser radar point cloud array according to the temperature sensor data, and extracting point cloud characteristics point by point.
In the present embodiment, S140 includes the following steps:
s141: and collecting distance data measured by the laser radar at different temperatures.
S142: and constructing a distance temperature change relation by using the distance data.
Specifically, the constructed distance temperature change curve equation is:
Figure SMS_54
The compensation time corresponding to the current temperature of the radar is calculated, and time compensation is performed on the shutter signal to complete the time-domain compensation of the radar.
S143: and generating the relation between the compensation time and the temperature according to the light speed distance relation.
Specifically, the initial-point temperature at which the laser radar is calibrated is set to
Figure SMS_55
which is substituted into the distance-temperature variation curve equation to obtain the corresponding value
Figure SMS_56
The curve equation of the compensation distance
Figure SMS_57
as a function of the temperature f is:
Figure SMS_58
According to the speed-of-light distance formula S = 0.5 × c ×
Figure SMS_59
where Δt is the sum of the laser emission time and the laser return time of the laser radar, the functional relation equation between the compensation time Δt and the temperature f is obtained as:
Figure SMS_60
on the basis of time domain compensation, collecting the distance measured by the laser radar at each temperature to form a second group of data of which the distance changes along with the temperature; and carrying out cubic spline interpolation on the second group of data to generate a spline interpolation table, and constructing the relationship between the temperature and the compensation distance.
The interpolation function is:
Figure SMS_61
where f is the acquired temperature, s is the distance at the corresponding temperature point, and xq is the interpolation interval. The corresponding compensation distance is looked up in the spline interpolation table according to the current temperature of the laser radar, and the compensation distance of the depth data is calculated. The compensation distance at the calibration initial temperature
Figure SMS_62
is looked up first, and the final compensation distance is the compensation distance minus this initial value.
S144: and finally, searching the corresponding compensation distance in the spline interpolation table according to the current temperature of the laser radar, and calculating the compensation distance of each depth data in the point cloud array, thereby completing the compensation of the point cloud array.
The final compensated distance vector is represented as:
Figure SMS_63
where F is the current temperature, query () represents a table lookup operation,
Figure SMS_64
is a compensated distance vector.
Compensating the input point cloud data:
Figure SMS_65
Figure SMS_66
denotes the point cloud array, i represents the ith point of the point cloud array, and a, b and c are the coordinate values of the point along the x, y and z axes in the point cloud three-dimensional coordinate system.
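The look-up flow of S142 to S144 can be sketched in Python as follows; the cubic-spline table is built with SciPy, the compensation at the calibration temperature is subtracted as the initial value, and each point is corrected radially along its ray. The radial-correction form and all names here are illustrative assumptions, not details taken from the patent's figures.

import numpy as np
from scipy.interpolate import CubicSpline

def build_compensation_table(temps, distances, t0):
    """Cubic-spline table of compensation distance versus temperature.
    temps/distances: measurements of a fixed target at different temperatures;
    t0: calibration (initial-point) temperature."""
    spline = CubicSpline(temps, distances)
    base = spline(t0)                       # value at the calibration temperature
    return lambda f: spline(f) - base       # final compensation = value - initial value

def compensate_point_cloud(points, comp_fn, current_temp):
    """Shift every point along its ray by the compensation distance.
    points: (N, 3) xyz array from the laser radar."""
    delta = comp_fn(current_temp)
    r = np.linalg.norm(points, axis=1, keepdims=True)    # current range of each point
    scale = (r + delta) / np.maximum(r, 1e-9)             # assumed radial correction
    return points * scale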
S145: and inputting the compensated point cloud array into a point cloud feature extraction network for feature extraction to obtain point cloud features, wherein the network takes a PCTR (point cloud Transformer) unit as a core.
Specifically, step S145 includes the steps of:
(1) FPS (farthest point sampling) is carried out on the point cloud, local coordinates and local features of the point cloud are extracted by combining a K-nearest neighbor method, and the point cloud is respectively sent to a local feature extraction unit, a local PCTR unit and a local jump connection unit to carry out local high-dimensional feature extraction in different feature subspaces.
(2) Feature joining and fusion
The three different local features are combined by matrix addition, spliced with the global feature along the feature dimension, and fused through one layer of nonlinear convolution; the formula is as follows:
Figure SMS_67
where
Figure SMS_68
and
Figure SMS_69
are the outputs of the local feature extraction unit and the local jump connection unit,
Figure SMS_70
and
Figure SMS_71
are the outputs of the local PCTR unit and the global PCTR unit,
Figure SMS_72
is a single-layer nonlinear convolution, and
Figure SMS_73
expands the dimensions proportionally and splices along the feature channel dimension.
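A hedged PyTorch sketch of this joining step: the three local feature streams are summed, the global feature is expanded and spliced along the feature channel, and one nonlinear convolution fuses the result; the channel widths are placeholders rather than values from the patent.

import torch
import torch.nn as nn

class LocalGlobalJoin(nn.Module):
    """Sum of three local feature streams, channel-wise splice with the expanded
    global feature, and a single-layer nonlinear convolution for fusion."""
    def __init__(self, c_local=128, c_global=256, c_out=256):
        super().__init__()
        self.fuse = nn.Sequential(nn.Conv1d(c_local + c_global, c_out, 1), nn.ReLU())

    def forward(self, f_loc1, f_loc2, f_loc3, f_glob):
        # f_loc*: (B, c_local, N) per-point features; f_glob: (B, c_global, 1)
        local_sum = f_loc1 + f_loc2 + f_loc3               # matrix addition
        g = f_glob.expand(-1, -1, local_sum.shape[2])      # broadcast to every point
        return self.fuse(torch.cat([local_sum, g], dim=1)) # splice + nonlinear convolution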
(5) In the decoding stage, point-by-point cloud characteristics are finally obtained through reverse interpolation up-sampling and jump connection, and the specific operation of the reverse interpolation is as follows:
Figure SMS_74
where X represents a point in the up-sampled point cloud feature set,
Figure SMS_75
represents a point in the existing point cloud feature set
Figure SMS_76
Figure SMS_77
represents the weighting operation, and C represents the number of point cloud points.
After reverse interpolation, if the returned features lack local information, local jump connection is carried out to finally obtain point-by-point cloud features, and the structure is expressed as
Figure SMS_78
where C represents the number of point cloud points, d represents the spatial coordinate dimension, and F represents the feature dimension.
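A sketch of the reverse-interpolation step in the PointNet++ style that the description suggests: features of the existing (down-sampled) set are propagated to each up-sampled point by inverse-distance weighting over its k nearest existing points; k = 3 and the epsilon guard are assumptions.

import numpy as np

def reverse_interpolate(xyz_up, xyz_known, feat_known, k=3):
    """Propagate features from the existing point set to the up-sampled points
    by inverse-distance weighting over the k nearest existing points."""
    up_feats = np.zeros((xyz_up.shape[0], feat_known.shape[1]))
    for i, p in enumerate(xyz_up):
        d = np.linalg.norm(xyz_known - p, axis=1)   # distances to existing points
        idx = np.argsort(d)[:k]                     # k nearest neighbours
        w = 1.0 / (d[idx] + 1e-8)                   # inverse-distance weights
        w /= w.sum()
        up_feats[i] = (feat_known[idx] * w[:, None]).sum(axis=0)
    return up_feats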
S150: performing pixel level fusion on the image features and the point cloud features;
specifically, the color feature of each pixel point and the depth feature of the corresponding point cloud point are spliced on the channel to obtain a group of fused features, and the process is as follows:
Figure SMS_79
Figure SMS_80
Figure SMS_81
Figure SMS_82
Figure SMS_83
where
Figure SMS_84
and
Figure SMS_85
are the extracted point cloud features and image features respectively,
Figure SMS_86
denotes a splicing operation,
Figure SMS_87
is the relu activation function,
Figure SMS_88
denotes a 1*1 convolution operation, and
Figure SMS_89
denotes a splicing operation on the channel;
feeding the fused feature into an mlp and obtaining the global feature by average pooling:
Figure SMS_90
where
Figure SMS_91
is the global average pooling operation and
Figure SMS_92
is the mlp processing operation;
The global feature and the fused feature are spliced on the channel to obtain the pixel-level fusion feature:
Figure SMS_93
Figure SMS_94
is the dense fused feature.
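To make the fusion concrete, the following PyTorch sketch chains the operations described above: channel concatenation of the pixel-aligned image and point cloud features, a 1*1 convolution with relu, an mlp followed by global average pooling, and a final channel-wise splice of the global feature back onto the dense feature. All channel widths and the two-layer mlp are placeholder choices, not values from the patent.

import torch
import torch.nn as nn

class PixelLevelFusion(nn.Module):
    """Pixel-level fusion of image and point cloud features as described above."""
    def __init__(self, c_img=256, c_pc=128, c_fused=256, c_global=128):
        super().__init__()
        self.conv1x1 = nn.Conv2d(c_img + c_pc, c_fused, kernel_size=1)
        self.mlp = nn.Sequential(nn.Conv2d(c_fused, c_global, 1), nn.ReLU(),
                                 nn.Conv2d(c_global, c_global, 1))

    def forward(self, img_feat, pc_feat):    # both (B, C, H, W), pixel-aligned
        fused = torch.cat([img_feat, pc_feat], dim=1)        # splice on the channel
        fused = torch.relu(self.conv1x1(fused))              # 1*1 convolution + relu
        g = self.mlp(fused).mean(dim=(2, 3), keepdim=True)   # mlp + global average pooling
        g = g.expand(-1, -1, fused.shape[2], fused.shape[3])
        return torch.cat([fused, g], dim=1)                  # dense fused feature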
S160: identifying the vehicle window according to the fused features;
step S160 includes the steps of:
sending the pixel-level fusion features into an FC layer to obtain the feature vector
Figure SMS_95
Figure SMS_96
A Sigmoid operation is performed on the feature vector
Figure SMS_97
for confidence judgment;
Figure SMS_98
Using the bounding-box regression algorithm of RCNN, the input
Figure SMS_99
is processed to obtain an offset function, and the original frame is then offset-adjusted.
Figure SMS_100
where (M, N, w, h) respectively represent the center-point coordinates and the width and height of the window, based on which the original frame is offset-adjusted.
Wherein the offset function is:
Figure SMS_101
wherein P represents the original Proposal,
Figure SMS_102
represents the input feature vector,
Figure SMS_103
represents the predicted offset value, and w represents the learned parameters;
the offset adjustment of the original frame comprises translation and size scaling:
translation (
Figure SMS_104
) is expressed as:
Figure SMS_105
size scaling (
Figure SMS_106
) is expressed as:
Figure SMS_107
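The translation and size scaling above follow the usual R-CNN box-regression form; the sketch below applies predicted offsets (dx, dy, dw, dh) to a proposal given as (centre x, centre y, width, height). The exponential size scaling is the standard R-CNN convention and is assumed here.

import math

def apply_box_deltas(proposal, deltas):
    """Apply R-CNN style regression offsets to a proposal box."""
    px, py, pw, ph = proposal          # original Proposal: centre and size
    dx, dy, dw, dh = deltas            # predicted offset values
    gx = pw * dx + px                  # translation of the centre
    gy = ph * dy + py
    gw = pw * math.exp(dw)             # size scaling
    gh = ph * math.exp(dh)
    return gx, gy, gw, gh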
carrying out bilinear interpolation on the pixel-level fusion features meeting the threshold requirement, scaling them to the size of the ROI Align feature map, and sending the feature map into an FCN (fully convolutional network) to generate a vehicle window mask, wherein the size of a single mask is as follows:
Figure SMS_108
A resize operation and background filling are performed on the vehicle window mask according to the original Proposal and the image size to obtain a mask matching the original image size, the mask and the regressed bounding box are drawn on the original image, and the confidence judgment result of the Sigmoid branch is displayed in the corresponding area to complete vehicle window identification.
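The final resize-and-fill step can be sketched as pasting the predicted mask back into a full-size canvas; the box format, threshold and use of OpenCV resizing are assumptions for illustration.

import numpy as np
import cv2

def paste_window_mask(mask_small, proposal, image_shape, thresh=0.5):
    """Resize the predicted vehicle window mask to its Proposal box and fill the
    rest of a full-image canvas with background."""
    x1, y1, x2, y2 = [int(v) for v in proposal]     # Proposal in image coordinates
    h, w = image_shape[:2]
    canvas = np.zeros((h, w), dtype=np.uint8)       # background filling
    resized = cv2.resize(mask_small.astype(np.float32), (x2 - x1, y2 - y1))
    canvas[y1:y2, x1:x2] = (resized > thresh).astype(np.uint8)   # binarised window mask
    return canvas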
Example 2
The embodiment provides a multi-sensor fusion vehicle window identification system. The multi-sensor fusion vehicle window identification system comprises:
a first acquisition module adapted to acquire illumination data and rgb image data. In this embodiment, the first acquisition module is adapted to implement step S110 in embodiment 1.
And the first processing module is suitable for performing illumination compensation on the rgb image data according to the illumination data and extracting image characteristics. In the present embodiment, the first processing module is adapted to implement step S120 in embodiment 1.
And the second acquisition module is suitable for acquiring temperature sensor data and the laser radar point cloud array. In this embodiment, the second acquisition module is adapted to implement step S130 in embodiment 1.
And the second processing module is suitable for compensating the laser radar point cloud array according to the temperature sensor data and extracting point cloud characteristics point by point. In the present embodiment, the second processing module is adapted to implement step S140 in embodiment 1.
And the fusion module is suitable for carrying out pixel level fusion on the image characteristics and the point cloud characteristics. In the present embodiment, the fusion module is adapted to implement step S150 in embodiment 1.
And the identification module is suitable for identifying the vehicle window according to the fused features. In the present embodiment, the identification module is adapted to implement step S160 in embodiment 1.
Example 3
The present embodiment provides a computer-readable storage medium, wherein at least one instruction is stored in the computer-readable storage medium, and the instruction is executed by a processor, so that the multi-sensor fusion vehicle window identification method provided in embodiment 1 is implemented.
The multi-sensor fusion vehicle window identification method performs illumination compensation on rgb image data according to illumination data and extracts image features; collects temperature sensor data and a laser radar point cloud array; compensates the laser radar point cloud array according to the temperature sensor data and extracts point cloud features point by point; performs pixel-level fusion on the image features and the point cloud features; and identifies the vehicle window according to the fused features. A dynamic SSR algorithm is used to compensate and fuse the illumination data with the original rgb image, enhancing the image and reducing the influence of adverse illumination on vehicle window identification; temperature sensor data are acquired, and temperature compensation and fusion are applied to the radar point cloud array based on a tested laser radar distance-temperature compensation system, reducing the influence of temperature-induced point cloud inaccuracy on vehicle window identification.
Example 4
Referring to fig. 3, the present embodiment provides an electronic device, including: a memory 502 and a processor 501; the memory 502 has at least one program instruction stored therein; the processor 501 loads and executes the at least one program instruction to implement the multi-sensor fusion vehicle window identification method provided in embodiment 1.
The memory 502 and the processor 501 are coupled in a bus that may include any number of interconnected buses and bridges that couple one or more of the various circuits of the processor 501 and the memory 502 together. The bus may also connect various other circuits such as peripherals, voltage regulators, power management circuits, etc., which are well known in the art, and therefore, will not be described any further herein. A bus interface provides an interface between the bus and the transceiver. The transceiver may be one element or a plurality of elements, such as a plurality of receivers and transmitters, providing a means for communicating with various other apparatus over a transmission medium. The data processed by the processor 501 is transmitted over a wireless medium through an antenna, which further receives the data and transmits the data to the processor 501.
The processor 501 is responsible for managing the bus and general processing and may also provide various functions including timing, peripheral interfaces, voltage regulation, power management, and other control functions. And memory 502 may be used to store data used by processor 501 in performing operations.
In summary, the present invention provides a multi-sensor fusion vehicle window identification method, system and readable storage medium. The method includes acquiring illumination data and rgb image data; performing illumination compensation on the rgb image data according to the illumination data and extracting image features; collecting temperature sensor data and a laser radar point cloud array; compensating the laser radar point cloud array according to the temperature sensor data and extracting point cloud features point by point; performing pixel-level fusion on the image features and the point cloud features; and identifying the vehicle window according to the fused features. A dynamic SSR algorithm is used to compensate and fuse the illumination data with the original rgb image, enhancing the image and reducing the influence of adverse illumination on vehicle window identification; temperature sensor data are acquired, and temperature compensation and fusion are applied to the radar point cloud array based on a tested laser radar distance-temperature compensation system, reducing the influence of temperature-induced point cloud inaccuracy on vehicle window identification.
The components selected for use in the present application (components not illustrated for specific structures) are all common standard components or components known to those skilled in the art, and the structure and principle thereof can be known to those skilled in the art through technical manuals or through routine experimentation. Moreover, the software programs referred to in the present application are all prior art, and the present application does not involve any improvement in the software programs.
In the description of the embodiments of the present invention, unless otherwise explicitly specified or limited, the terms "mounted," "connected," and "connected" are to be construed broadly, e.g., as meaning either a fixed connection, a removable connection, or an integral connection; can be mechanically or electrically connected; they may be connected directly or indirectly through intervening media, or they may be interconnected between two elements. The specific meanings of the above terms in the present invention can be understood in specific cases to those skilled in the art.
In the description of the present invention, it should be noted that the terms "center", "upper", "lower", "left", "right", "vertical", "horizontal", "inner", "outer", etc., indicate orientations or positional relationships based on the orientations or positional relationships shown in the drawings, and are only for convenience of description and simplicity of description, but do not indicate or imply that the device or element being referred to must have a particular orientation, be constructed and operated in a particular orientation, and thus, should not be construed as limiting the present invention. Furthermore, the terms "first," "second," and "third" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance.
In the several embodiments provided in the present application, it should be understood that the disclosed system, apparatus and method may be implemented in other ways. The above-described embodiments of the apparatus are merely illustrative, and for example, the division of the units is only one logical division, and there may be other divisions when actually implemented, and for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed coupling or direct coupling or communication connection between each other may be through some communication interfaces, indirect coupling or communication connection between devices or units, and may be in an electrical, mechanical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit.
In light of the foregoing description of the preferred embodiment of the present invention, many modifications and variations will be apparent to those skilled in the art without departing from the spirit and scope of the invention. The technical scope of the present invention is not limited to the content of the specification, and must be determined according to the scope of the claims.

Claims (10)

1. A multi-sensor fusion vehicle window identification method is characterized by comprising the following steps:
acquiring illumination data and rgb image data;
performing illumination compensation on the rgb image data according to the illumination data, and extracting image features;
collecting temperature sensor data and a laser radar point cloud array;
compensating the laser radar point cloud array according to the temperature sensor data, and extracting point cloud characteristics point by point;
performing pixel level fusion on the image features and the point cloud features;
and identifying the vehicle window according to the fused features.
2. The multi-sensor fusion window identification method of claim 1,
the method for performing illumination compensation on the rgb image data according to the illumination data is characterized in that illumination compensation is dynamically performed on the rgb image by using a Single Scale Retinex algorithm based on the illumination data, and the expression is as follows:
Figure QLYQS_1
in the formula, I(x, y) is the original image, R(x, y) is the reflection component, L(x, y) is the illumination component, i represents the ith color channel, * represents convolution, and G(x, y) is the Gaussian surround function;
the formula for G (x, y) is as follows:
Figure QLYQS_2
and λ satisfies:
Figure QLYQS_3
where
Figure QLYQS_4
is the Gaussian surround scale parameter, dynamically adjusted according to the illumination variable Li, and
Figure QLYQS_5
is the adjustment factor; lx is the unit of illuminance:
Figure QLYQS_6
After obtaining
Figure QLYQS_7
the result is converted from the logarithmic domain to the real domain to obtain R(x, y), and linear stretching is then performed to obtain the output image; the final linear stretching formula is:
Figure QLYQS_8
in the formula,
Figure QLYQS_9
is the image data obtained by dynamically performing illumination compensation on the rgb image with the Single Scale Retinex algorithm.
3. The multi-sensor fusion window identification method of claim 2,
the step of performing image feature extraction includes:
inputting the compensated image data into a trained ResNeXt CNN network to obtain a feature map;
and setting candidate ROIs on the feature map, sending the candidate ROIs into an RPN (region proposal network) to classify and filter part of the ROIs, then performing an ROI Align operation to put the rgb image data into pixel correspondence with the feature map, and obtaining a feature map for each candidate ROI.
4. The multi-sensor fusion window identification method of claim 1,
the method for compensating the laser radar point cloud array according to the temperature sensor data and extracting the point cloud characteristics point by point comprises the following steps:
collecting distance data measured by a laser radar at different temperatures;
constructing a distance temperature change relation by using the distance data;
generating a relation between the compensation time and the temperature according to the light speed distance relation;
finally, according to the current temperature of the laser radar, searching a corresponding compensation distance in the spline interpolation table, and calculating the compensation distance of each depth data in the point cloud array, thereby completing the compensation of the point cloud array;
and inputting the compensated point cloud array into a point cloud feature extraction network for feature extraction to obtain point cloud features.
5. The multi-sensor fusion window identification method of claim 1,
the step of identifying the vehicle window according to the fused features comprises the following steps of:
sending the pixel-level fusion features into an FC layer to obtain a feature vector
Figure QLYQS_10
performing a Sigmoid operation on the feature vector
Figure QLYQS_11
for confidence judgment;
using the bounding-box regression algorithm of RCNN, processing the input
Figure QLYQS_12
to obtain an offset function, and then performing offset adjustment on the original frame;
carrying out bilinear interpolation on the pixel-level fusion features meeting the threshold requirement, scaling them to the size of the ROI Align feature map, and sending the feature map into an FCN (fully convolutional network) to generate a vehicle window mask, wherein the size of a single mask is as follows:
Figure QLYQS_13
and performing a resize operation and background filling on the vehicle window mask according to the original Proposal and the image size to obtain a mask matching the original image size, drawing the mask and the regressed bounding box on the original image, and displaying the confidence judgment result of the Sigmoid branch in the corresponding area to complete vehicle window identification.
6. The multi-sensor fusion window identification method of claim 5,
the offset function is:
Figure QLYQS_14
wherein P represents the original Proposal,
Figure QLYQS_15
represents the input feature vector,
Figure QLYQS_16
represents the predicted offset value, and w represents the learned parameters;
the offset adjustment of the original frame comprises translation and size scaling:
translation (
Figure QLYQS_17
) is expressed as:
Figure QLYQS_18
size scaling (
Figure QLYQS_19
) is expressed as:
Figure QLYQS_20
7. the multi-sensor fusion window identification method of claim 1,
the step of performing pixel level fusion on the image features and the point cloud features comprises the following steps:
and splicing the color features of each pixel point and the depth features of the corresponding point cloud points on the channel to obtain a group of fused features, wherein the process is as follows:
Figure QLYQS_21
Figure QLYQS_22
Figure QLYQS_23
Figure QLYQS_24
Figure QLYQS_25
where
Figure QLYQS_26
and
Figure QLYQS_27
are the extracted point cloud features and image features respectively,
Figure QLYQS_28
denotes a splicing operation,
Figure QLYQS_29
is the relu activation function,
Figure QLYQS_30
denotes a 1*1 convolution operation, and
Figure QLYQS_31
denotes a splicing operation on the channel;
feeding the fused feature into an mlp and obtaining the global feature by average pooling:
Figure QLYQS_32
where
Figure QLYQS_33
is the global average pooling operation and
Figure QLYQS_34
is the mlp processing operation;
and splicing the global feature and the fused feature on the channel to obtain the pixel-level fusion feature:
Figure QLYQS_35
Figure QLYQS_36
is the dense fused feature.
8. A multi-sensor fusion window identification system, comprising:
a first acquisition module adapted to acquire illumination data and rgb image data;
The first processing module is suitable for performing illumination compensation on rgb image data according to the illumination data and performing image feature extraction;
the second acquisition module is suitable for acquiring temperature sensor data and a laser radar point cloud array;
the second processing module is suitable for compensating the laser radar point cloud array according to the temperature sensor data and extracting point cloud characteristics point by point;
the fusion module is suitable for carrying out pixel level fusion on the image characteristics and the point cloud characteristics;
and the identification module is suitable for identifying the vehicle window according to the fused features.
9. A computer readable storage medium having stored therein at least one instruction, wherein the instruction when executed by a processor implements the multi-sensor fusion window identification method of any of claims 1 to 7.
10. An electronic device comprising a memory and a processor; at least one instruction is stored in the memory; the processor, by loading and executing the at least one instruction, implements the multi-sensor fusion window identification method of any one of claims 1-7.
CN202310066606.4A 2023-02-06 2023-02-06 Multi-sensor fusion vehicle window recognition method, system and readable storage medium Active CN115984658B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310066606.4A CN115984658B (en) 2023-02-06 2023-02-06 Multi-sensor fusion vehicle window recognition method, system and readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310066606.4A CN115984658B (en) 2023-02-06 2023-02-06 Multi-sensor fusion vehicle window recognition method, system and readable storage medium

Publications (2)

Publication Number Publication Date
CN115984658A true CN115984658A (en) 2023-04-18
CN115984658B CN115984658B (en) 2023-10-20

Family

ID=85959698

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310066606.4A Active CN115984658B (en) 2023-02-06 2023-02-06 Multi-sensor fusion vehicle window recognition method, system and readable storage medium

Country Status (1)

Country Link
CN (1) CN115984658B (en)

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111860695A (en) * 2020-08-03 2020-10-30 上海高德威智能交通系统有限公司 Data fusion and target detection method, device and equipment
CN112363145A (en) * 2020-11-09 2021-02-12 浙江光珀智能科技有限公司 Vehicle-mounted laser radar temperature compensation system and method
CN112836734A (en) * 2021-01-27 2021-05-25 深圳市华汉伟业科技有限公司 Heterogeneous data fusion method and device and storage medium
CN113538569A (en) * 2021-08-11 2021-10-22 广东工业大学 Weak texture object pose estimation method and system
CN113850744A (en) * 2021-08-26 2021-12-28 辽宁工程技术大学 Image enhancement algorithm based on self-adaptive Retinex and wavelet fusion
CN115063329A (en) * 2022-06-10 2022-09-16 中国人民解放军国防科技大学 Visible light and infrared image fusion enhancement method and system under low-illumination environment
CN115656984A (en) * 2022-09-19 2023-01-31 深圳市欢创科技有限公司 TOF point cloud processing method, point cloud optimization method, laser radar and robot

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
JONATHAN SCHIERL ET.AL: "Multi-modal Data Analysis and Fusion for Robust Object Detection in 2D/3D Sensing", 《2020 IEEE APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP (AIPR)》, pages 1 - 7 *
ZHANG SHI: "Research on Image and Video Enhancement Algorithms Based on Retinex Theory", 《China Master's Theses Full-text Database, Information Science and Technology》, no. 02, pages 13 - 14 *
CAI HENGQUAN: "Workpiece Recognition and Pose Estimation for Disordered Grasping", 《China Master's Theses Full-text Database, Information Science and Technology》, no. 01, pages 38 - 42 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118224993A (en) * 2024-05-24 2024-06-21 杭州鲁尔物联科技有限公司 Building structure deformation monitoring method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN115984658B (en) 2023-10-20

Similar Documents

Publication Publication Date Title
JP2022514974A (en) Object detection methods, devices, electronic devices, and computer programs
WO2021184254A1 (en) Infrared thermal imaging temperature measurement method, electronic device, unmanned aerial vehicle and storage medium
CN109829849A (en) A kind of generation method of training data, device and terminal
CN111753649B (en) Parking space detection method, device, computer equipment and storage medium
CN113358231B (en) Infrared temperature measurement method, device and equipment
CN113267258B (en) Infrared temperature measurement method, device, equipment, intelligent inspection robot and storage medium
CN111757097B (en) Detection method, detection device and computer readable storage medium
CN115984658A (en) Multi-sensor fusion vehicle window identification method and system and readable storage medium
CN112946609B (en) Calibration method, device and equipment for laser radar and camera and readable storage medium
US20220067514A1 (en) Inference apparatus, method, non-transitory computer readable medium and learning apparatus
CN112562093A (en) Object detection method, electronic medium, and computer storage medium
CA3185292A1 (en) Neural network analysis of lfa test strips
CN112771568A (en) Infrared image processing method, device, movable platform and computer readable medium
CN113340405B (en) Bridge vibration mode measuring method, device and system
CN114002704A (en) Laser radar monitoring method and device for bridge tunnel and medium
US20240212101A1 (en) Image fusion method, electronic device, unmanned aerial vehicle and storage medium
CN108229271B (en) Method and device for interpreting remote sensing image and electronic equipment
CN113688900A (en) Radar and visual data fusion processing method, road side equipment and intelligent traffic system
CN116704125B (en) Mapping method, device, chip and module equipment based on three-dimensional point cloud
CN113222968A (en) Detection method, system, equipment and storage medium fusing millimeter waves and images
CN113256493A (en) Thermal infrared remote sensing image reconstruction method and device
CN109495694B (en) RGB-D-based environment sensing method and device
KR20210001555A (en) Method of processing infrared image
CN115601275A (en) Point cloud augmentation method and device, computer readable storage medium and terminal equipment
CN114708395A (en) Ammeter identification, positioning and three-dimensional mapping method for transformer substation inspection robot

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant