WO2024032856A1

WO2024032856A1 - Method for determining a parking space and a target position for a vehicle in the parking space

Info

Publication number: WO2024032856A1
Application number: PCT/DE2023/200155
Authority: WO
Inventors: Mohanad Youssef
Original assignee: Continental Autonomous Mobility Germany GmbH
Priority date: 2022-08-12
Filing date: 2023-07-27
Publication date: 2024-02-15
Also published as: DE102022208405A1

Abstract

The invention relates to a method for determining a parking space and a target position for a vehicle (1) in the parking space by means of a neural network (4), the method comprising the following steps: - capturing a scene in the area surrounding a vehicle (1) by means of a sensor system (2) and providing surroundings information (S10); - generating a grid map (3) of the scene containing a multiplicity of cells (3.1) on the basis of the surroundings information, the cells (3.1) of the grid map (3) each comprising information regarding whether or not the area of the surroundings that corresponds to the cell (3.1) is occupied by an object (O1, O2) (S11); - transmitting the grid map (3) to a neural network (4), the neural network (4) being trained to take the grid map (3) as a basis for providing information relating to a parking space boundary frame (P) and information relating to a target position (Z) for the vehicle (1) in the parking space boundary frame (P) (S12).

Description

Method for determining a parking space and a target position of a vehicle in the parking space

The invention relates to a method and a system for determining a parking space and a target position of a vehicle in the parking space using a neural network.

Methods for determining a parking space and a target position of a vehicle in the parking space are known from the prior art. Based on the environmental information determined by sensors, an environmental model is first formed. The scene is then interpreted based on this environmental model, a parking space is detected and the target position of the vehicle in the parking space is determined. Complex algorithms are used that use a variety of if-then queries and look-up tables.

The problem with the known methods is that the software code that implements the method for determining the parking space or the target position of the vehicle is very complex due to the large number and often different customer requirements and is therefore difficult to maintain or expand. In addition, the known methods cannot handle all parking situations in a satisfactory manner.

Based on this, it is the object of the invention to provide a method for determining a parking space and a target position of a vehicle in the parking space, which avoids complex program structures and enables reliable and satisfactory parking space detection or determination of the target position of the vehicle. The task is solved by a method with the features of independent patent claim 1. Preferred embodiments are the subject of the subclaims. A system for determining a parking space and a target position of a vehicle is the subject of independent patent claim 12.

According to a first aspect, the invention relates to a method for determining a parking space and a target position of a vehicle using a neural network. The procedure has the following steps:

First, a scene in the area surrounding a vehicle is recorded using a sensor system and surrounding information is provided. The sensor system can have any type of sensor that can be used to detect the environment, for example ultrasonic sensors, radar sensors, LIDAR sensors, cameras, etc. The environmental information is, for example, two-dimensional data in which the detections recorded by the sensor system are mapped.

A raster map of the scene is then generated with a variety of cells based on the environmental information. The cells of the grid map each contain information as to whether the surrounding area corresponding to the cell is occupied by an object or not. The raster map thus forms a virtual digital image and the cells form the pixels of the image, with the pixels having occupancy information, for example “0” for not occupied and “1” for occupied. The cells can preferably also contain further information, for example height information about the detected objects, the density and/or intensity of the reflections received at the sensor, etc. The density of the reflections received at the sensor can indicate how many feature points are in one certain cells fall. The intensity of the reflections received at the sensor can indicate the signal strength of the received reflection, which in turn depends on the material and/or surface condition of the reflecting object.

The raster map is then transmitted to a neural network. The neural network is trained to provide information about a parking space bounding frame and information about a target position of the vehicle in the parking space bounding frame based on the raster map.

The technical advantage of the method according to the invention is that the neural network achieves reliable and satisfactory parking space detection or determination of the target position of the vehicle. In addition, by using the neural network, several prior art software components can be replaced at the same time, namely the software component for interpreting the scene and the software component for calculating the target position.

According to one embodiment, the cells of the raster map have information that indicates whether the respective cell is occupied by an object that is higher or lower than a specified threshold value. This means that the neural network can be provided with additional information as to what height the object has or into which height class (high or low in relation to a predetermined threshold value) the object should be grouped, which significantly improves parking space detection, since, for example, tall objects how vehicles etc. can be distinguished from low objects such as curbs.

According to one embodiment, the cells of the raster map

Information about the density and/or intensity of the received at the sensor reflections. This information can be used to weight the cells in such a way that those cells that have a higher density and/or intensity are weighted more heavily by the neural network than those that have a lower density and/or intensity.

According to one exemplary embodiment, the neural network has several sections, namely a first section, by means of which structures of the objects contained in the raster map are recognized, a second section, by means of which one or more areas relevant for parking space detection are determined, and a third section, which receives the at least one area relevant for the parking space detection and which determines information about a parking space delimitation frame and information about a target position of the vehicle in the parking space delimitation frame within the at least one area relevant for the parking space detection. The neural network therefore has a Fast R-CNN architecture or Faster R-CNN architecture, which offers high efficiency and detection accuracy in parking situations.

According to an exemplary embodiment, the first section has a convolutional neural network (also called convolutional neural network, CNN) with several layers, the layers each having a convolutional layer (ie a convolutional layer) and a pooling layer (ie a pooling layer or Bundling layer) and each layer provides a feature map (also called activation map or feature map) as output information. The layers of the convolutional neural network are designed to recognize differently complex features of the information contained in the raster map and output them in the feature map. This means that the different feature maps provide initial information with different levels of complexity. According to one exemplary embodiment, the output information of the respective convolutional layer is modified by an activation function. The activation function can in particular be a ReLU activation function. This can increase the computational efficiency and convergence ability of the neural network.

According to one embodiment, the second section includes a selective search algorithm or a convolutional network designed to recognize and select one or more excerpts in the feature maps provided by the respective layers of the convolutional neural network of the first section and one or more relevant ones Parking space detection areas included. This means that the second section of the neural network recognizes those areas in the feature maps that are relevant for parking space detection. Such an area can, for example, be a parking space with the objects delimiting this parking space.

According to one embodiment, the selective search algorithm or convolutional network receives multiple feature maps from different layers of the convolutional neural network of the first section. Based on the overall information contained in the feature maps, one or more relevant areas for parking space detection are determined. In other words, information from several or all feature maps is used to determine relevant areas for the detection of parking spaces.

According to an exemplary embodiment, the third section has at least a first fully connected layer. The third section creates at least a section of the feature maps created by the first Section of the neural network are provided based on the at least one relevant area provided by the second section. This at least one section of the feature maps is further processed by the at least one first fully connected layer. In the third section, the information contained in the feature maps is reduced to the areas relevant for parking space detection and further processed.

According to an exemplary embodiment, the third section comprises a pooling layer that generates at least one section of the feature maps. The at least one section has a predefined size. Preferably, the pooling layer provides several sections that have the same size regardless of the size of the relevant areas that led to the creation of the sections. Because the sections are the same size, it is possible to simplify further processing of the information. In particular, all sections can be further processed at the same time.

According to an exemplary embodiment, the third section has at least a second fully-connected layer for calculating the parking space boundary frame and at least a third fully-connected layer for calculating the target position of the vehicle, wherein the second and the third fully-connected layer are connected to the at least one first fully -Connected Layer are connected and output information is received from it.

According to one exemplary embodiment, several different pieces of information are provided by the first and/or second section. This different partial information is processed in parallel by several third sections of the neural network. In particular, information from different feature maps, which are generated by the first section, are further processed in different third sections of the neural network. The output information generated in this way can then be combined in a further information processing step in order to provide the information about a parking space bounding frame and information about a target position of the vehicle in the parking space bounding frame.

According to a further aspect, the invention relates to a system for determining a parking space and a target position of a vehicle using a neural network. The system is coupled to a sensor system that is designed to capture a scene in the area surrounding the vehicle and to provide surrounding information. The system has a computing unit that is designed to generate a raster map of the scene with a plurality of cells based on the environmental information. The cells of the grid map each contain information as to whether the surrounding area corresponding to the cell is occupied by an object or not. The neural network is trained to determine information about a parking space bounding frame and information about a target position of the vehicle in the parking space bounding frame based on the grid map.

The technical advantage of the system according to the invention is that the neural network achieves reliable and satisfactory parking space detection or determination of the target position of the vehicle. In addition, the neural network can be used to determine the parking space boundary frame or the target position of the vehicle in a data-driven manner instead of using complex geometric algorithms. According to an exemplary embodiment of the system, the neural network has several sections, namely a first section which is designed to detect structures of the environmental objects contained in the raster map, and a second section which is designed to determine one or more areas relevant to parking space detection and a third section which is designed to receive the at least one area relevant for the parking space detection and to determine information about a parking space boundary frame and information about a target position of the vehicle in the parking space boundary frame within the at least one area relevant to the parking space detection. The neural network therefore has a Fast R-CNN architecture or Faster R-CNN architecture, which offers high efficiency and detection accuracy in parking situations.

According to an embodiment of the system, the second section comprises a selective search algorithm or a convolutional network designed to recognize and select one or more excerpts in the feature maps provided by the respective layers of a convolutional neural network of the first section and one or contain several relevant areas for detecting parking spaces. This means that the second section of the neural network recognizes those areas in the feature maps that are relevant for parking space detection. Such an area can, for example, be a parking space with the objects delimiting this parking space.

According to an embodiment of the system, the selective search algorithm or the convolutional network is to receive multiple feature maps from different layers of the convolutional neural network of the first section and to determine one or more relevant areas for the detection of parking spaces based on the Overall information contained in the feature maps is formed. In other words, information from several or all feature maps is used to determine relevant areas for the detection of parking spaces.

According to an exemplary embodiment of the system, the third section has at least a first fully connected layer. At least a section of the feature maps provided by the first section of the neural network is generated based on the at least one relevant area provided by the second section. This at least one section of the feature maps is further processed by the at least one first fully connected layer.

According to an exemplary embodiment of the system, the third section has at least a second fully connected layer for calculating the parking space boundary frame and at least a third fully connected layer for calculating the target position of the vehicle. The second and third fully-connected layers are connected to the at least one first fully-connected layer and receive output information from this.

In the sense of the invention, the expressions “approximately”, “essentially” or “approximately” mean deviations from the exact value by +/- 10%, preferably by +/- 5% and/or deviations in the form of changes that are insignificant for the function .

Further developments, advantages and possible applications of the invention also emerge from the following description of exemplary embodiments and from the figures. All features described and/or illustrated are fundamentally the subject of the invention, either individually or in any combination. regardless of their summary in the claims or their relationship. The content of the claims is also made part of the description.

The invention is explained in more detail below using the figures and exemplary embodiments. Show it:

1 shows an example of a schematic top view of a vehicle with a sensor system and a computing unit connected to this sensor system;

2 shows an example of a schematic representation of a grid map which illustrates a parking situation and in which an area relevant for parking space detection, a parking space boundary frame and a target position of the vehicle in the parking space boundary frame are sketched;

3 shows an exemplary schematic representation of a convolutional neural network with several layers;

4 shows an exemplary schematic representation of a neural network which is designed to determine a parking space boundary frame and a target position of the vehicle in the parking space boundary frame; and

5 is an example of a block diagram that illustrates the processes of a method for determining a parking space bounding frame and a target position of the vehicle in the parking space bounding frame. Figure 1 shows a vehicle 1 as an example and roughly schematically. The vehicle 1 has a sensor system comprising a large number of individual sensors distributed around the vehicle, by means of which environmental detection is possible. The sensors can include, for example, ultrasonic sensors, at least one camera, at least one radar sensor and/or at least one LIDAR sensor.

The sensor system 2 is coupled to at least one computing unit R, by means of which the method described below for detecting a parking space and a target position of a vehicle in the parking space is carried out. In particular, a neural network 4 is implemented in the computing unit R, by means of which information about a parking space boundary frame P and information about a target position Z of the vehicle 1 in the parking space boundary frame P can be determined.

The sensor system 2 of the vehicle 1 provides environmental information about a scene in the surrounding area of the vehicle 1. So that the neural network 4 can process the environmental information quickly and efficiently (in relation to the computing resources), a raster map 3 is generated that depicts the scene. 2 shows, by way of example and schematically, a grid map 3 with a scene forming a parking situation.

The raster map 3 is, for example, a two-dimensional, discretized representation of the scene. It has a large number of cells 3.1, each of which is assigned to an environmental area. Each cell is assigned digital information that indicates whether cell 3.1 is occupied by an object or not. In the event that the sensor system provides environmental information that has a height classification of the objects, the cell can also have information as to which height class the section of the object that is in the respective cell falls. For example, the cell may contain information as to whether the object is a high or low object (high/low with respect to a height threshold).

Two lines are drawn in the raster map 3 according to FIG. 2, which outline contours of objects 01, 02 recognized by the sensor system 2.

The neural network 4 determines an area B1 relevant for parking space detection, which at least partially contains the two objects 01, 02. 2 also shows a parking space boundary frame P and a target position Z of the vehicle 1 in the parking space boundary frame P, the parking space boundary frame P and the target position Z of the vehicle 1 being output at the output interface of the neural network 4. The neural network 4 preferably also determines an angle to the parking space boundary frame P and/or the target position Z, which indicates the orientation of the parking space boundary frame P or the target position Z in the scene.

As shown in Fig. 4, the neural network 4 has several sections that contribute to determining the position and size of the parking space boundary frame P and the position of the target position Z of the vehicle 1 in this parking space boundary frame P. The neural network 4 forms in particular a so-called Fast R-CNN or Faster R-CNN, where R-CNN is a so-called Region Based Convolutional Neural Network.

In particular, the neural network 4 has a first section 4.1, which includes a convolutional neural network 5 (CNN). A second section 4.2, connected to the first section 4.1, has means for determining areas that are relevant for parking space detection. In Fig. 2 there is such an area with B1 marked. Such an area is characterized in particular by the fact that it has at least one parking space that is laterally delimited by one or more objects. A third section 4.3 of the neural network 4, connected to the first and second sections 4.1, 4.2, has means for determining the position and size of the parking space boundary frame P and for the position of the target position Z of the vehicle 1 in this parking space boundary frame P.

Fig. 3 shows an example and schematically of a folding neural network 5, which can be used in the first section of the neural network 4.

The folding neural network 5 receives the raster map 3 at its input. The folding neural network 5 has several layers 5.1, 5.2. In the exemplary embodiment shown, only two layers are provided. It should be noted that the convolutional neural network 5 can also have more than two layers 5.1, 5.2, depending on which level of structures are to be recognized.

Each layer 5.1 has at least one convolutional layer CL and one max-pooling layer MPL. The convolutional layer CL has a filter (so-called Kemels) to detect features in the information of the raster map 3. By layering several convolutional layers CL, increasingly complex structures can be recognized in the scene. For example, a first convolutional layer CL can recognize basic structures such as horizontal, vertical or oblique edges. Based on this, a second convolutional layer CL, which follows the first convolutional layer CL in the information flow direction, can recognize patterns such as curves, rectangles or circles. A possibly third convolutional layer CL, which follows the second convolutional layer CL in the information flow direction, can then, for example, be more complex based on this information Detect structures such as vehicles, gaps between objects, etc.

For example, in each layer 5.1, 5.2 a convolutional layer CL is followed by a max-pooling layer MPL. The Max-Pooling Layer MPL serves to reduce the information processed by the subsequent layer to enable the kernel of the subsequent layer to give a zoom-like perspective on the scene that has already been reduced to detected features. For example, the respective max-pooling layer MPL reduces the information by the scaling factor ß, ie the convolutional layer of the first layer 5.1, for example, has the dimension I * w, which is equal to the dimension of the raster map 3 and provides m feature maps, whereas the convolutional layer of the second layer 5.2 has the dimension

and n2 feature maps provides.

The convolutional layer CL preferably use an activation function, for example a ReLU activation function of the form:

Fig. 4 shows the overall structure of the neural network 4. The flow of information in Fig. 4 is from bottom to top, as indicated by the arrows.

A raster map 3, which contains the discretized environmental information, is transmitted to the convolutional neural network 5, which, as previously described, has several layers 5.1, 5.2 and generates several feature maps as output information based on these layers 5.1, 5.2. The sections 4.1 to 4.3 of the neural network 4 are also sketched in FIG. 4, with the convolutional neural network 5 located in the first section 4.1. The folding neural network 5 transmits the feature maps to the second section 4.2 and to the third section 4.3 of the neural network 4.

The second section 4.2 of the neural network 4 implements a selective search algorithm or has another convolutional neural network, also referred to as a regional proposal network (RPN). The selective search algorithm or the regional proposal network are designed to define areas to be examined in the respective feature maps that are generated by the convolutional neural network 5 (so-called regions of interest ROI). Areas to be examined are those areas in the feature maps in which areas to be parked are expected. The regional proposal network, for example, is a convolutional network pre-trained using labeled training data that is adapted to detect parking areas. The training data includes, for example, parking lot scenes in which areas containing one or more objects with adjacent or enclosed open space, which could be a parking space, are marked as labels. By training with this data, the regional proposal network can be trained to recognize areas to be examined for parking.

The areas to be examined and the feature maps are then transferred to the third section 4.3 of the neural network 4. Preferably, the components of the neural network 4 shown in FIG. 4 in the third section 4.3 are provided multiple times, namely once for each feature map that is generated by the convolutional neural network 5. As a result, a parking space boundary frame P and a target position Z of the vehicle 1 in the parking space boundary frame P can initially be determined separately based on each feature map are determined, whereby this information is then linked together and thereby a final parking space boundary frame P and a final target position Z of the vehicle 1 are determined in this final parking space boundary frame P.

The information processing is described below using a single feature map. The information processing of the other feature maps takes place in the same way.

A feature map generated by the convolutional neural network 5 is linked to the at least one area to be examined, which is provided by the regional proposal network or the selective search algorithm. This means that one or more relevant areas B1, B2, B3 are defined within the feature map, which fall into at least one area to be examined. In other words, by linking the feature map and the at least one area to be examined, at least a section A1, A2, A3 of the feature map is determined, as indicated in FIG. 4 in the lower area of section 4.3.

A so-called Region of Interest (Rol) pooling then takes place through a pooling layer 7. For example, a max pooling operation is used to generate a partial feature map (sections A1, A2, A3) for each area to be examined . The sub-feature maps are each the same size, i.e. even if sections of the original feature map are not the same size, the pooling layer 7 provides sub-feature maps that are all the same size.

These sub-feature maps are then forwarded to a first fully connected layer 6, which further processes the information available in the sub-feature maps. The information provided by the fully connected layer 6 is then transmitted in parallel to a second fully connected layer 8 and a third fully connected layer 9. The second fully connected layer 8 is trained, for example, to determine the parking space boundary frame P for a detected parking space. The third fully connected layer 8 is trained, for example, to determine the target position of the vehicle 1 in the parking space boundary frame P or the parking space. This information is then output by the second and third fully connected layers 8, 9.

The neural network 4, for example, provides the information about the parking space boundary frame P and the target position Z of the vehicle 1 as follows. The target position Z of vehicle 1 in a parking space is output, for example, by a vector with the following values:

Z = [tx ty t©]; whereby, for example, a corner _of the vehicle is defined by the coordinates _t

The parking space bounding box P is, for example, output by a vector with the following values:

P = [b _x b _y bi bw b©]; where, for example, a corner of the parking space bounding frame P is defined by the coordinates b _x and b _y , and the length and width of the parking space bounding frame P are defined by the values bi and bw and b© indicates the orientation of the parking space boundary frame P in the grid map 3.

It is understood that this information is provided separately for each detected parking space boundary frame P or for each target position Z of the vehicle 1.

The training of the neural network 4 is carried out using training data that has labeled parking situation scenes, the labels each indicating the aforementioned output information of the neural network 4, i.e. the target position Z and orientation of the vehicle 1 in the parking space and the position, size and orientation of the parking space boundary frame P .

During the training of the neural network 4, an attempt is made to select the parameters or weights of the neural network 4 in such a way that the information provided by the neural network 4 corresponds as best as possible to the labels specified in the training data.

As stated above, the neural network 4 provides two output pieces of information, namely information about the parking space boundary frame P and the target position Z of the vehicle 1.

The training of the neural network 4 is carried out in such a way that the overall error is minimized from the information about the parking space boundary frame P and the target position Z determined by the neural network 4 in comparison to the training data. In addition, there may be validation data that is not used as training data but for validating the training. The validation data can be used to determine when the training of the neural network 4 has resulted in sufficient quality. This can reduce training time and prevent overfitting of the neural network 4 can be prevented, so that the neural network 4 leads to more general, ie less restricted, solutions.

For this purpose, a method based on minimizing the multi-task loss L is used, which takes into account both the loss due to the error in determining the parking space bounding box P and the loss due to the error in determining the target position Z . Since this is a regression problem, the loss is also called regression lost.

The total loss L is defined as follows:

L (t ^k , ^uk , b ^k , v ^k ) = Lz (t ^k , ^uk ) + LP (b ^k , v ^k ); where t ^k is the position and orientation of the target _position Z of the vehicle 1 for the kth _parking _space t via the vector _[ ^t Target position Z of vehicle 1 for the kth parking space in the training data, b ^k via the vector [b _x b _y bi bw b©] the position, size and orientation of the parking space bounding box P for the kth parking space and v ^k via the Vector [v _x v _y vi v _w v©] define the location, size and orientation of the parking space bounding box P for the kth parking space in the training data. Lz (t ^k , ^uk ) is therefore the loss that results from the difference in information about the target position Z of vehicle 1 compared to the training data and Lp (b ^k , v ^k ) is the loss that results from the difference in information to the parking space bounding box P relative to the training data.

The calculation of the losses Lz (t ^k , ^uk ) and Lp (b ^k , v ^k ) can be done as follows:

The smooth _L1 function is a modified loss function that enables a mixture of absolute distance and squared distance. It is defined as follows: if Ixl < 1

0.5 otherwise

5 shows a diagram that illustrates the steps of the method for determining a parking space and a target position of a vehicle in the parking space using a neural network.

First, a scene in the surrounding area of a vehicle is recorded using a sensor system and environmental information is provided (S10).

A raster map of the scene is then generated with a plurality of cells based on the environmental information, the cells of the raster map each having information as to whether the environmental area corresponding to the cell is occupied by an object or not (S11).

Finally, the raster map is transmitted to a neural network, the neural network being trained to provide information about a parking space bounding frame and information about a target position of the vehicle in the parking space bounding frame based on the raster map (S12). The invention has been described above using exemplary embodiments. It is understood that numerous changes and modifications are possible without departing from the scope of protection defined by the patent claims.

Reference number list 1 vehicle

2 Sensors

3 grid map

3.1 Cells

4 neural network 4.1 first section

4.2 second section

4.3 third section

5 convolutional neural network

5.1, 5.2 Layer 6 first fully-connected layer

7 pooling layers

8 second fully-connected layer

9 third fully-connected layer A1, A2, A3 cutout

B1, B2, B3 relevant area

CL convolutional layer

MPL Max pooling layer

01 , 02 Object P parking space boundary frame

R computing unit

Z Target position of the vehicle

Claims

Patent claims

1) Method for determining a parking space and a target position of a vehicle (1) in the parking space using a neural network (4), the method comprising the following steps:

- Detecting a scene in the surrounding area of a vehicle (1) using a sensor system (2) and providing environmental information (S10);

- Generating a raster map (3) of the scene with a plurality of cells (3.1) based on the environmental information, the cells (3.1) of the raster map (3) each having information as to whether the environmental area corresponding to the cell (3.1) is of an object (01, 02) is occupied or not (S11);

- Transmitting the grid map (3) to a neural network (4), the neural network (4) being trained to provide information about a parking space boundary frame (P) and information about a target position (Z) of the vehicle based on the grid map (3). (1) in the parking space boundary frame (P) to provide (S12).

2) Method according to claim 1, characterized in that the cells (3.1) of the grid map (3) have information as to whether the respective cell (3.1) is occupied by an object (01, 02) that is higher or lower than a specified one Threshold value and / or that the cells (3.1) of the raster map (3) have information about the density and / or intensity of the reflections received by the sensor system (2).

3) Method according to claim 1 or 2, characterized in that the neural network (4) has several sections (4.1, 4.2, 4.3), namely a first section (4.1), by means of which a recognition of structures from those in the raster map (3) included objects (01, 02), a second section (3.2), by means of which one or more areas relevant for the parking space detection (B1, B2, B3) are determined and a third section (4.3), which contains the at least one area relevant for the parking space detection (B1, B2, B3) and which, within the at least one area relevant for parking space detection (B1, B2, B3), receives information about a parking space boundary frame (P) and information about a target position (Z) of the vehicle (1) in the parking space boundary frame ( P) determined. ) Method according to claim 3, characterized in that the first section (4.1) has a convolutional neural network (5) with several layers (5.1, 5.2), the layers each having a convolutional layer and a pooling layer and each layer ( 5.1, 5.2) provides a feature map as initial information. ) Method according to claim 4, characterized in that the output information of the respective convolutional layer is modified by an activation function. ) Method according to one of claims 3 to 5, characterized in that the second section (4.2) comprises a selective search algorithm or a convolution network, which are designed to recognize and select one or more sections in the feature maps from the respective layers of the convolutional neural network of the first section and contain one or more relevant areas (B1, B2) for detecting parking spaces. ) Method according to claim 6, characterized in that the selective search algorithm or the convolution network comprises several Receives feature maps from different layers (5.1, 5.2) of the convolutional neural network (5) of the first section (4.1) and, based on the overall information contained in the feature maps, one or more relevant areas (B1, B2) for the Detecting parking spaces. ) Method according to one of claims 4 to 7, characterized in that the third section (4.3) has at least a first fully-connected layer (6) that at least a section (A1, A2, A3) of the feature maps, which through the first section (4.1) of the neural network (4) are provided, based on the at least one relevant area (B1, B2, B3) provided by the second section (4.2) and that the at least one section (A1, A2, A3) the feature maps are further processed by at least a first fully connected layer (6). ) Method according to claim 8, characterized in that the third section (4.3) comprises a pooling layer (7) which generates the at least one section (A1, A2, A3) of the feature maps, wherein the section (A1, A2, A3 ) has a predefined size. 0) Method according to claim 8 or 9, characterized in that the third section (4.3) has at least a second fully connected layer (8) for calculating the parking space boundary frame and at least a third fully connected layer (9) for calculating the target position (Z ) of the vehicle (1), wherein the second and third fully connected layers (8, 9) are connected to the at least one first fully connected layer (6) and receive output information from it. 1) Method according to one of claims 3 to 10, characterized in that through the first and / or second section (4.1, 4.2) several different pieces of information are provided and that the different pieces of information are processed in parallel by several third sections (4.3) of the neural network (4). ) System for determining a parking space and a target position of a vehicle (1) by means of a neural network (4), the system being coupled to a sensor system (2) which is used to detect a scene in the area surrounding the vehicle (1) and to provide Environmental information is formed, wherein the system has a computing unit (R) which is designed to generate a raster map (3) of the scene with a plurality of cells (3.1) based on the environmental information, the cells (3.1) of the raster map (3) each have information as to whether the surrounding area corresponding to the cell (3) is occupied by an object (01, 02) or not and the neural network (4) is trained to provide information about a parking space boundary frame ( P) and information about a target position (Z) of the vehicle (1) in the parking space boundary frame (P). ) System according to claim 12, characterized in that the neural network (4) has several sections (4.1, 4.2, 4.3), namely a first section (4.1) which is used to recognize structures from those contained in the raster map (3). Surrounding objects is formed, a second section (4.2), which is designed to determine one or more areas relevant for the parking space detection (B1, B2, B3) and a third section (4.3), which is designed to receive the at least one area relevant to the parking space detection (B1, B2, B3) and for determining information about a parking space boundary frame (P) and information about a target position (Z) of the vehicle (1) in the Parking space boundary frame (P) is formed within the at least one area (B1, B2, B3) relevant for parking space detection.

14) System according to claim 13, characterized in that the second section (4.2) comprises a selective search algorithm or a convolution network, which are designed to recognize and select one or more sections (A1, A2, A3) in the feature maps are provided by the respective layers (5.1, 5.2) of a folding neural network (5) of the first section (4.1) and contain one or more relevant areas (B1, B2, B3) for detecting parking spaces.

15) System according to claim 14, characterized in that the selective search algorithm or the convolution network is used to receive multiple feature maps from different layers (5.1, 5.2) of the convolutional neural network (5) of the first section (4.1) and to determine one or more relevant ones Areas (B1, B2, B3) are designed for the detection of parking spaces based on the overall information contained in the feature maps.

16) System according to claim 14 or 15, characterized in that the third section (4.3) has at least a first fully connected layer (6) that at least a section of the feature maps, which through the first section (4.1) of the neural network (4) are provided, based on the at least one relevant area (B1, B2, B3) provided by the second section (4.2) and that the at least one section (A1, A2, A3) of the feature maps is generated by the at least one first Fully-connected layer (6) is further processed. ) System according to claim 16, characterized in that the third section (4.3) has at least a second fully connected layer (8) for calculating the parking space boundary frame (P) and at least a third fully connected layer (9) for calculating the target position (Z ) of the vehicle (1), the second and third fully

Connected layer (8, 9) are connected to the at least one first fully connected layer (6) and receive output information from it.