WO2022117165A1

WO2022117165A1 - Training an artificial neural network

Info

Publication number: WO2022117165A1
Application number: PCT/DE2021/200208
Authority: WO
Inventors: Christian Scharfenberger; Michelle Karg
Original assignee: Continental Automotive Gmbh
Priority date: 2020-12-01
Filing date: 2021-11-26
Publication date: 2022-06-09
Also published as: DE102020215122A1

Abstract

The invention relates to a method for training an artificial neural network (3) for converting an input image into an output image, the input image being formed as a night photograph of at least one vehicle occupant, the method comprising the steps of: - providing an artificial neural network (3), - training (S6, S16) the artificial neural network (3) based on input images, which are formed as night photographs of at least one vehicle occupant, on the basis of output images, which comprise at least one predefined region of the at least one vehicle occupant in full light or daylight, so that an extraction of predefined vehicle occupant features is made possible by the region brightened in full light or daylight, by the following steps: - determining the predefined regions and individually determining a brightness level for the regions determined, - applying the artificial neural network (3) to the input images using the brightness level determined for thed regions determine; - the artificial neural network (3) outputting output images. The invention also relates to an image processing system (2) comprising such a method.

Description

description

Training an artificial neural network

The invention relates to a method for training an artificial neural network for converting an input image into an output image, the input image being designed as a night photograph of at least one vehicle occupant. Furthermore, the invention relates to an image processing system with such a method.

Today's vehicles are equipped with interior cameras that are intended to monitor the driver in particular. The recognition of the head pose or body posture and face recognition plays an important role, since characteristics such as alertness, tiredness, the direction of view and other properties of the driver's condition can be derived from them. This information is fed to a system in the vehicle, which either generates a warning to the driver or takes a certain action itself if there is a need, such as a lack of attention.

These systems for driver monitoring by recognizing head pose or body posture with the help of interior cameras show very good performance in daylight. These systems are supported by additional lighting in the vehicle interior, which illuminates the driver's head area, for example. This particularly supports detection at night.

DE 10 2005 023 697 A1 shows a device for controlling the interior lighting of a motor vehicle, with at least one sensor being arranged in the motor vehicle which detects the line of sight of vehicle occupants, with a control unit using output variables from the at least one sensor to generate control signals for lighting elements located in the motor vehicle generated. It is therefore an object of the invention to specify means which lead to improved and simplified vehicle occupant monitoring at night.

The object is achieved by a method with the features of claim 1. The object is also achieved by such an image system with the features of claim 14 and a use with the features of claim 18.

The object is achieved by a method for training an artificial neural network for converting an input image into an output image, the input image being designed as a night photograph of at least one vehicle occupant, comprising the following steps:

- providing an artificial neural network,

- Training of the artificial neural network based on input images, which are designed as night shots of at least one vehicle occupant, and based on output images, which include at least one predefined area of the at least one vehicle occupant in full illumination or daylight, so that the full illumination or daylight brightened Area an extraction of predetermined vehicle occupant characteristics is made possible, with the steps:

- Determination of the predefined areas and individual determination of a degree of lightening for the specific areas,

- Application of the artificial neural network to the input images using the determined degree of brightening for the specific areas;

Output of initial images by the artificial neural network, which include at least one predefined area of the at least one vehicle occupant in full illumination or daylight, so that extraction of predetermined vehicle occupant features is made possible by the area brightened in full illumination or daylight. Furthermore, the object is achieved by an image processing system for converting an input image into an output image, the input image being designed as a night photograph of at least one vehicle occupant, comprising an artificial neural network trained according to a method as described above, the image processing system being designed to convert the input image into an output image, which has at least one predefined area of the at least one vehicle occupant in full illumination or daylight, using the trained artificial neural network, so that the area brightened in full illumination or daylight allows extraction of predetermined vehicle occupant features.

According to the invention, it was recognized that complete illumination of a vehicle interior at night requires a large number of light sources, which, in addition to critical design restrictions, can lead to significant additional costs in the vehicle.

It has been recognized that today's camera systems enable broad monitoring of a large area in the vehicle interior through wide-angle optics, which can lead to considerable additional effort when it comes to equipping it with suitable lighting. Furthermore, the synchronization of cameras and lighting with increasing resolution can lead to a significant increase in costs. If the lighting is switched on for a long time, there can also be high heat and power loss when using a large number of lighting elements. It was also recognized that when using IR-based lighting, either special IR cameras or color cameras that are sensitive in the infrared range are required. This can significantly limit the scope of the camera.

This is where the invention comes in and, as a solution, specifies a method for training an artificial neural network and an image processing system with such a trained artificial neural network. By means of the method according to the invention and the image processing system according to the invention, a trained artificial neural network is available, which creates a day image from a night photograph of a driver or other vehicle occupants or at least brightens those areas or displays them in daylight that are necessary for an extraction of the desired vehicle occupant features.

In this case, such areas can be the face, for example, if, for example, a gaze detection is to be determined to determine the alertness/tiredness of the vehicle occupant.

An image in daylight can be understood to mean a recording which corresponds to a recording taken in daylight.

The areas to be brightened or displayed in daylight can, for example, be specified in advance or dynamically during system operation.

Images or recordings mean corresponding image data which are generated with at least one sensor.

The method according to the invention and the image processing system according to the invention make it possible to achieve good upgrading of weakly or insufficiently illuminated areas in a simplified manner without additional illumination.

For this purpose, image pairs with different exposure times are preferably recorded during the training. These pairs of images are used to train the artificial neural network in such a way that it can reconstruct output images with longer exposures based on input images with shorter exposures. These baseline images are then similar to daylight shots, and further algorithms can be applied to detailed detection of vehicle occupant features on the faces or poses on these baseline images. The invention makes it possible to convert areas of interest into a display that corresponds to a recording with full illumination or daylight, even without additional lighting, despite darkness and a lack of color information.

The method according to the invention and the image processing system according to the invention specify an efficient method for improving the image quality in the event of insufficient lighting. The method according to the invention and the image processing system according to the invention achieve a significant improvement in the image quality when displaying night shots without increasing the interior lighting of a vehicle. Therefore, no additional lighting is required to brighten up the interior areas. This is particularly advantageous when using wide-angle cameras, which are usually used in the vehicle interior of a vehicle.

The output image thus brightened or generated in daylight can be forwarded to a processing unit for extraction of the desired vehicle occupant characteristics. Various applications can be executed with the aid of the vehicle occupant characteristics obtained in this way, for example a warning tone can be output if, for example, increased tiredness/reduced alertness has been determined. Alternatively or additionally, the initial image can be displayed to the vehicle occupant on a display unit in the vehicle, for example via a head-up display.

The invention makes it possible to convert a very dark input image with little contrast and color information into a representation that is, for example, daylight or at least sufficiently bright, or to convert at least areas of interest of the image to daylight or at least sufficiently bright. The image processing system preferably precedes a detection or display unit for displaying the processed initial image or for Further processing of the original image. However, the detection or display unit can also be integrated in the image processing system.

Preferably, existing layers of the artificial neural network are shared with layers for extraction functions so that vehicle occupant features are automatically available. Furthermore, the training for this can preferably take place together.

Preferably, the areas are extracted using semantic segmentation of the interior space. In addition, different areas in the input image can be brightened to a different degree, e.g. with additional illumination of individual areas in the interior, for example by a reading lamp or light from outside, or in particularly dark areas in the interior, e.g. by shadows.

In a preferred embodiment, the at least one predefined area includes the face of the at least one vehicle occupant. As a result, vehicle occupant features such as the direction of view/movement of the eyelids can be extracted particularly well from the facial image that has been brightened or converted to daylight. If tiredness is detected, for example, warning tones can be emitted or other measures can be taken.

This can increase safety enormously, especially when driving at night, which is associated with increased fatigue. The vehicle occupant features can be extracted and evaluated by a connected evaluation unit, for example, without illuminating the interior of the vehicle too much and causing the driver to record the input image, for example, in a disruptive manner. This also improves the trained artificial neural network image processing system.

Furthermore, the at least one predefined area preferably includes at least the head pose of the vehicle occupant. This is also particularly important for detecting tiredness/alertness during night driving. This also improves the trained artificial neural network image processing system.

The at least one predefined area preferably includes at least the posture of the vehicle occupant. For example, the driving attention level of the vehicle occupant can be estimated from a posture. A warning tone can also be emitted in the event of an unbalanced posture or even a dangerous posture. This also improves the trained artificial neural network image processing system.

Furthermore, the artificial neural network is preferably designed as a CNN (Convolutional Neural Network). This convolutional neural network is particularly suitable for image processing. This also improves the trained artificial neural network image processing system.

Such an artificial neural network can automatically learn the parameters for complex scenes by locally and adaptively applying the enhancements to different image areas (people in the interior). Furthermore, such an artificial neural network can reduce the computing time since the CNN can be easily combined with CNNs for a subsequent extraction. With this combination, the vehicle occupant features are enhanced in the artificial neural network so that the extraction functions operate on features that can compensate for the lower illumination at night.

Furthermore, the artificial neural network is preferably trained to use information from better illuminated image areas of the input image for the conversion in order to generate the output image. This means that when there is lighting in the passenger compartment, information from the better lit areas is used to further improve the conversion for the unlit areas. This improves the original image. This also improves the trained artificial neural network image processing system. A plurality of input images are preferably provided for conversion into at least one output image, with the artificial neural network being trained in such a way that information from better illuminated image areas of a second input image is used to convert a first input image in order to convert the at least one predefined area of the at least one vehicle occupant into to generate full illumination or daylight as the initial image. Here, the network is trained less with individual images for each camera, but as an overall system consisting of several camera systems. As a result, an artificial neural network can be adapted to the conditions of the individual interior spaces and an improved result can be achieved. This also improves the trained artificial neural network image processing system.

Information is preferably provided to compensate for missing color and/or contrast and/or brightness information, the artificial neural network being trained to generate the conversion using the color and/or contrast information provided. This means that brightness values or luminance values and/or color information and/or contrast information are provided, with which the artificial neural network achieves improved conversion.

The degree of lightening is preferably learned in stages.

The method can thus brighten the areas with people in the image by a factor d, with this factor d being dynamically adaptable to the prevailing lighting conditions. In particular, the factor d can be adjusted separately for the individual image areas, e.g. driver or occupants in the rear area, so that different lighting conditions in the interior can be taken into account locally.

In a further embodiment, the artificial neural network is trained to simulate a gamma correction and/or a white balance and/or a histogram equalization. For this purpose, the artificial neural network is given a data set consisting of "dark input images (night shots)" and the associated "bright as day" or "illuminated pictures" are made available. Depending on the type of training, the artificial neural network is configured to optimally emulate methods such as white balance, gamma correction and histogram equalization. White balance is essentially the adjustment to the color temperature of the light.

Gamma correction is a correction function that is often used in image processing and changes the brightness information of pixels, for example. Histogram equalization is a method for improving contrast in gray-scale images that goes beyond mere contrast enhancement.

In this way, very dark input images can be converted into corresponding output images, which is advantageous for feature-based recognition or viewing.

Image quality information is preferably provided, and the artificial neural network is trained to generate the conversion using the image quality information provided. In this way, the network can be trained to generate output images which calculate image data optimized for computer vision and human vision, for example. Computer vision / human vision is understood as the attempt to process and analyze the images recorded by cameras in a wide variety of ways in order to understand their content or extract geometric information.

An improved output image can be generated in the image processing system by such an improved artificial neural network. The artificial neural network is preferably trained to convert the input image into an output image which is fully illuminated or displayed in daylight. One or more image sensors are preferably provided for recording the at least one vehicle occupant. The image sensors can be designed as cameras. This achieves good coverage of the vehicle interior.

Furthermore, the one or more image sensors are preferably embodied as a wide-angle camera. This allows good coverage to be achieved with just a few cameras.

Furthermore, the object is achieved by using the image processing system as described above in a vehicle interior of a vehicle for monitoring at least one vehicle occupant.

Further properties and advantages of the present invention emerge from the following description with reference to the enclosed figures.

It shows schematically:

1 shows a method according to the invention schematically, and

2: an image processing system according to the invention schematically in a vehicle interior, and

3: an input image (left) and converted output image (right) recorded by a driver using an image processing system according to the invention, and

4 shows a further embodiment of a method according to the invention schematically.

1 shows a training of a neural network according to the invention schematically. In a first step S1, this receives as input images 6 (FIG. 3) Night shots from the vehicle interior of a vehicle 1 (FIG 2), which shows at least one vehicle occupant, for example the driver.

The input image 6 (FIG. 3) is preferably generated by image sensors such as wide-angle cameras.

The artificial neural network is preferably in the form of a CNN convolutional neural network. This convolutional neural network is particularly suitable for machine image processing. Such a network has, for example, several levels.

The artificial neural network is then trained in a step S6 to convert the night shot into a brightened initial image or day image (night shot in daylight). For this purpose, several night recordings with different contrast levels/color information and associated desired initial images 7 (FIG. 3) are used during the training.

The entire input image 6 (FIG. 3) is preferably converted.

However, the artificial neural network can also be trained to merely brighten different areas from the input image 6 (FIG. 3) or to convert them into a day image. This can be especially the face, head pose and posture. A physical condition (tiredness, lack of concentration, etc.) can be inferred from these vehicle occupant characteristics, for example by extracting the direction of view, the movement of the eyelids, etc. and, if necessary, suitable measures can be taken in the event of a poor physical condition. This can guarantee a safer ride. These areas can be extracted, for example, using semantic segmentation of the interior. In addition, different areas in the input image can be brightened to a different degree, e.g. with additional illumination of individual areas in the interior (e.g. by a reading lamp or light from outside) or in particularly dark areas in the interior, e.g. by shadows. Furthermore, as an additional step S2, the artificial neural network can be trained to use information from better illuminated image areas of the input image 6 (FIG. 3) for conversion when there is lighting in the vehicle interior in order to generate the output image. This allows the conversion for the unlit areas to be further improved and a better output image can be achieved.

For this purpose, image pairs with different exposure times are preferably recorded during the training. These pairs of images are used to train the artificial neural network in such a way that it can reconstruct output images with longer exposures based on input images with shorter exposures. These baseline images are then similar to daylight shots, and further algorithms can be applied to detailed detection of vehicle occupant features on the faces or poses on these baseline images.

Furthermore, as an additional step S3, the artificial neural network can be trained to generate the conversion using provided color and/or contrast information. Information stored in the network structure is used to automatically supplement missing color or contrast information in the original image. In this way, for example, methods such as gamma correction and/or white balance and/or histogram equalization could be simulated in an optimized manner. In this way, very dark images can be converted into a representation that is advantageous for feature-based recognition or viewing.

In an additional step S4, the artificial neural network is trained to simulate a gamma correction and/or white balance and/or histogram equalization. For this purpose, the artificial neural network is trained using a data set consisting of "dark input images (night shots)" and the associated "bright as day" or "illuminated images". Depending on the type of training, the artificial neural network is configured to emulate methods such as gamma correction and histogram equalization, etc. In this way, very dark input images 6 (FIG. 3) can be converted into output images 7 (FIG. 3), which are advantageous for feature-based recognition or viewing.

Furthermore, in an additional further step S5, the artificial neural network can be trained to generate the conversion using information on the image quality. For this purpose, information stored in the network structure regarding image quality is used in order to achieve a better initial image. As a result, the output image is optimized, for example, in that it calculates image data optimized for computer vision and human vision.

Steps S2-S5 can each be included in the method individually or in any combination.

2 shows a vehicle 1 with the image processing system 2 according to the invention, which has an artificial neural network 3 trained with the method according to the invention. The vehicle 1 has a vehicle interior 4 which has interior cameras 5 for recording the vehicle occupants. The interior cameras 5 can in particular be wide-angle cameras. In such a case, the artificial neural network 3 is trained less with individual images for each interior camera 5, but as an overall system consisting of the multiple interior cameras 5.

The image processing system 2 can be integrated as a hardware-based image pre-processing stage in an ISP (Image Signal Processor) of the ISP. The image processing system 2 can carry out the corresponding conversion in the ISP and, for example, make the processed information available with the original data for possible detection or display functions. The image processing system 2 according to the invention specifies a system for improving the image quality in the event of insufficient lighting. Furthermore, the image processing system 2 according to the invention improves the image quality when displaying or processing night shots without additional lighting is required, which brightens the vehicle interior 4. This is a particular advantage when using wide-angle cameras. Image data streams for applications in the vehicle interior 4 can thus be generated by means of the image processing system 2 according to the invention, which has the artificial neural network 3 trained according to the invention. Based on the at least clearly brightened areas of interest, such as the face of the vehicle occupant, features can be extracted and fed to a further processing unit. This can then, for example, analyze these characteristics and carry out measures if there are deviations from the target values.

The image processing system 2 according to the invention enables the nighttime recordings of the underlying interior cameras 5 to be converted into a display that corresponds to a recording with full illumination or daylight without additional lighting, despite darkness and a lack of color information, quickly, inexpensively and without disruptive additional interior lighting.

The image processing system 2 according to the invention enables poorly or insufficiently illuminated areas to be well illuminated by means of the trained neural network 3 without additional illumination.

3 shows an input image 6 which was converted by means of the image processing system 2 according to the invention and the artificial neural network 3 trained according to the invention. The trained artificial neural network 3 is designed here as a CNN. Using the image processing system 2 and the artificial neural network 3 trained according to the invention, a significantly improved output image 7 can be generated from a dark input image 6, for example for recognizing the head pose or body posture.

4 shows a further embodiment of a method according to the invention.

Steps S11 to S15 here correspond to steps S1 to S5 of FIG In step S16, the artificial neural network is trained, as in step S6 in FIG. 1, to convert night shots or input images into brightened output images or day images (night shot in daylight). For this purpose, several night recordings with different contrast levels/color information and associated desired initial images 7 (FIG. 3) are used during the training.

In addition, the artificial neural network is trained in step S16 to identify and determine predefined areas in captured nighttime images that are to be brightened. The predefined area can also include the entire night shot. Predefined areas in the recorded night shots can differ in the desired brightening. For this purpose, different areas can be determined in the image, e.g., using statistical calculations, using semantic segmentation and/or using information about different areas from recordings made at previous times. The types of range determination mentioned above are only examples and should not be regarded as conclusive. In detail, there can be a variant, for example, in which the network outputs a semantic segmentation in addition to the brightened image, which is used as a prior in a subsequent journal. A further variant can be, for example, that the network uses segmentation into lighter and darker areas. This segmentation can be obtained, for example, from a previous journal t-1 , from a separate network or from a multitasking network which first outputs a map for image regions and then performs a brightening enhancement based on this map. The latter is a two-step approach, where network calculations from the first step can be reused for the second step, e.g., the calculations of the first network layers.

In addition, the artificial neural network is trained to individually determine a degree of brightening for each specific area. The artificial neural network is also trained for this purpose, following the determination of the predefined areas and the individual determination of the degree of brightening for the specific areas to use the determined lightening level to lighten the specific areas around the lightening level. The artificial neural network is given a factor d, which corresponds to the parameterization of the illumination of individual image areas and which also determines the degree of brightening. In other words, the factor d represents the ratio between the exposure ratio of the input image and the exposure ratio of the brightened output image of the neural network. The specific areas are dynamically adapted to the prevailing light conditions in a vehicle interior by the factor d. In particular, the artificial neural network can be trained such that the factor d is adjusted separately for individual image areas, eg, driver or occupants in the rear area, so that locally different lighting conditions in the interior can be addressed. If several areas have been determined by the neural network that are to be brightened, a different factor d can be included in the brightening for each area. For example, a first factor d can be used for a first specific area and a second factor d, which differs from the first factor d, can be used for a second specific area in order to be included in the brightening of the corresponding areas.

The factor d can optionally be learned as follows and/or have the following: a) Images with different exposure times are available during the training. As a result, the artificial neural network can gradually learn the degree of brightening. For the training, an image pair is selected from a short exposure and a longer exposure image and the ratio of the exposure times is calculated. This corresponds to the factor d during training. The shorter exposed image and thus the darker image is made available as input to the network. The image with the longer exposure time is used as ground truth for calculating the loss. When calculating the loss, the output of the network is compared with the ground truth. The aim here is that the network learns to reconstruct a brighter image for shorter exposed images and to keep the factor d variable in order to enable a reconstruction of different degrees of brightening. The factor d thus represents an artificial exposure time and the network learns to reconstruct an image with a different exposure time. This is particularly relevant for dynamic environments in which the actual exposure time to be used is limited to short times in order to enable sufficient image sharpness. b) Once the network has been trained, the factor d can be set variably at runtime, i.e. during operation of an image processing system that uses a suitably trained neural network. For example, a night shot is taken at runtime with an exposure time that is appropriate to the environment in order to enable sufficient image sharpness in a dynamic environment. This usually leads to dark recordings. The trained network is applied to these images and the factor d is set in such a way that the image is sufficiently brightened, for example, similar to a daytime image. The factor d can be determined here in various ways, such as, for example, from a determination of the brightness of the recording, from the brightening of the previous image for magazine t-1, from statistical calculations of the quality of the brightness in the image and the necessary brightening, from a network learning the estimation of the factor d, etc. c) The factor d can be applied locally. Here, the training is extended by the adjustment of the factor d to image regions. Image regions are determined which require different brightening, eg poorly and well-illuminated image areas. The training image pairs can differ for the regions in the exposure time of the ground truth and the network input, so that a factor d can be learned for areas in the network and individual areas can be brightened more. In this way, the loss is calculated on the selected areas in the image and recordings of the same scene with different exposure times can be used for the different areas. An example of this is that for a well-lit driver, a medium exposure time ground truth is sufficient, while for example, the occupants in the back seats a ground truth with a longer exposure time is used and therefore the factor d for the driver is correspondingly lower than for the area of the occupants on the back seat. In order to learn the factor d robustly for brightenings of different strengths in image regions, images with different exposure times can also be used for the image regions of the image input of the neural network and these can be combined region-wise with different ground truths with different exposure times. Here, the factor d is calculated during the training for individual image regions from the ratio of the ground truth to the image recording at the network input. The network input is thus composed region by region from different images and the loss is also calculated region by region based on the same image regions by assembling the reference image for the loss calculation region by region from the ground truth recordings corresponding to the image regions in the input image. d) The factor d is an input parameter of the neural network, based on which the network learns and can reconstruct different degrees of illumination improvement. The factor d can be added to one or more layers of the network as an input parameter. The factor d can also be combined with an additional network, which calculates the mapping of the factor d onto the network for illumination improvement, e.g. scaling to the size of the input or feature layers of the neural network for illumination improvement, relevance of factor d to individual image regions in the input or feature layers of the neural network for illumination enhancement. e) To learn the factor or factors d, the artificial neural network can be trained, for example, with a number of input images that have different exposure times but are otherwise identical, and an associated desired output image. Alternatively or additionally, the factor d can be calculated from a downstream application, which is based, for example, on the quality of a possible detection provided and lightened images determines the best factor for image brightening. This can be implemented both when training a neural network and online in the application at runtime. This allows the artificial neural network to gradually learn the degree of whitening. In addition, the input images can already be illuminated differently in order to thereby simulate dynamic illumination in the input images and thus train the artificial neural network. The training of the artificial neural network also includes the output of initial images 7 by the artificial neural network, which include at least the one predefined area of the at least one vehicle occupant in full illumination or daylight, so that the areas brightened in full illumination or daylight allow an extraction predetermined vehicle occupant characteristics is made possible. These output initial images can be used in training, for example, to compare them with desired initial images. In a further application, these images can also be made available to a downstream application which, for example, uses a detection quality to evaluate the quality of the brightened images. Thus, for example, training can be continued until an output image meets certain requirements and matches a desired output image as precisely as possible.

The output of the network can be used here for a loss calculation of the learning process of the neural network. The output of the neural network can be compared to ground truth and a loss can be calculated. Based on this, the weights of the neural network are updated. The loss can optionally be calculated globally for a constant factor d. Alternatively, the loss can be calculated locally or by region. As a result, a different factor d can be used locally for individual image regions. Reference list:

1 vehicle

2 image processing system 3 trained artificial neural network

4 vehicle interior

5 interior camera

6 input image

7 initial image

Claims

patent claims

1. A method for training an artificial neural network for converting an input image (6) into an output image (7), the input image (6) being designed as a night photograph of at least one vehicle occupant, characterized by:

- providing an artificial neural network,

- Training of the artificial neural network based on input images (6), which are designed as night shots of at least one vehicle occupant, and based on output images (7), which include at least one predefined area of the at least one vehicle occupant in full illumination or daylight, so that through the In full illumination or in a brightened area, extraction of predetermined vehicle occupant characteristics is made possible by the following steps: o Application of the artificial neural network to the

input images (6); o Determination of the predefined areas and individual determination of a degree of brightening for the specific areas by the artificial neural network, o Use of the determined degree of brightening for the specific areas by the artificial neural network, with the degree of brightening used having a factor d that corresponds to the parameterization of the illumination of individual corresponds to image areas and with which the specific areas are dynamically adapted to the prevailing lighting conditions in a vehicle interior, o Output of initial images (7) by the artificial neural network, which include at least the one predefined area of the at least one vehicle occupant in full illumination or daylight, so that through the areas brightened in full illumination or daylight an extraction of predetermined vehicle occupant characteristics is made possible.

2. The method according to claim 1, characterized in that the at least one predefined area comprises a face of the at least one vehicle occupant.

3. The method according to any one of the preceding claims, d characterized in that the at least one predefined area comprises at least one head pose of the vehicle occupant.

4. The method according to any one of the preceding claims, characterized in that the at least one predefined area comprises at least one posture of the vehicle occupant.

5. Method according to one of the preceding claims, characterized in that existing layers of the artificial neural network are shared with layers for extraction functions, so that vehicle occupant characteristics are automatically available.

6. The method according to any one of the preceding claims, characterized in that the artificial neural network is trained to use information from better illuminated image areas of the input image (6) for the conversion in order to generate the output image (7).

7. Method according to one of the preceding claims, characterized in that the artificial neural network is trained to simulate a gamma correction and/or a white balance and/or a histogram equalization.

8. The method according to any one of the preceding claims, characterized in that a plurality of input images (6) are provided for conversion into at least one output image (7), wherein the artificial neural network is trained in such a way that information from better illuminated image areas of a second input image (6) is used to convert a first input image (6) in order to generate the at least one predefined area of the at least one vehicle occupant in full illumination or daylight as the output image (7). .

9. The method as claimed in one of the preceding claims, characterized in that information is provided to compensate for missing color and/or contrast information, and the artificial neural network is trained to generate the conversion using the color and/or contrast information provided .

10 . Method according to one of the preceding claims, characterized in that the degree of lightening is learned in stages.

11 . Method according to one of the preceding claims, characterized in that the regions are extracted using such as semantic segmentation of the interior space.

12. The method according to any one of the preceding claims, characterized in that image quality information is provided, and the artificial neural network is trained to generate the conversion using the image quality information provided.

13. The method as claimed in one of the preceding claims, characterized in that the artificial neural network is trained to convert the input image (6) into an output image (7) that is fully illuminated or displayed in daylight.

14. Image processing system (2) for converting an input image (6) into an output image (7), the input image (6) being designed as a night photograph of at least one vehicle occupant, comprising an artificial neural network (3 ), wherein the image processing system (2) is designed to perform a conversion of the input image (6) into an output image (7), which has at least one predefined area of the at least one vehicle occupant in full illumination or daylight, using the trained artificial neural network, so that an extraction can be carried out through the area brightened in full illumination or daylight predetermined vehicle occupant characteristics is enabled.

15. Image processing system (2) according to claim 14, characterized in that one or more image sensors are provided for recording the at least one vehicle occupant.

16. Image processing system (2) according to claim 15, characterized in that the one or more image sensors is designed as a wide-angle camera.

17. Image processing system (2) according to one of the preceding claims 14 to 16, characterized in that the image processing system (2) is designed to extract the predetermined vehicle occupant features from the at least one illuminated area or area shown in daylight.

18. Use of the image processing system (2) according to any one of the preceding claims 14 to 17 in a vehicle interior (4) of a vehicle (1) for monitoring at least one vehicle occupant.