CN113344878A - Image processing method and system - Google Patents
- Publication number
- CN113344878A (application CN202110641215.1A)
- Authority
- CN
- China
- Prior art keywords
- scene
- image
- sample
- pixel point
- pixel
- Prior art date
- Legal status (an assumption, not a legal conclusion)
- Granted
Classifications
- G06T7/0002—Inspection of images, e.g. flaw detection
- G06N20/00—Machine learning
- G06T5/70—Denoising; Smoothing
- G06T7/13—Edge detection
- G06T2207/10028—Range image; Depth image; 3D point clouds
Abstract
The invention discloses an image processing method and system. The method comprises: obtaining sample scene information and a sample scene image shot in a sample scene, and performing model training on them to obtain a sample cv model; acquiring scene information of a scene to be identified and shooting an actual scene image of the scene to be identified; comparing the scene information of the scene to be identified with the sample scene information to obtain a comparison result, and adjusting the pixel identification range of the sample cv model according to the comparison result to obtain a corrected cv model; and inputting the actual scene image into the corrected cv model and outputting an identification result of the actual scene image. Beneficial effects: the maximum identification pixel and the minimum identification pixel of the sample cv model are adjusted according to the scene information of the scene to be identified and the sample scene information, which increases the accuracy of the final identification result.
Description
Technical Field
The invention belongs to the technical field of visual inspection, and particularly relates to an image processing method and system.
Background
A model algorithm can generate an image recognition model from a model training data set. Because the training data set cannot cover infinitely many scenes, and because of the physics of imaging, an object whose size in the picture falls outside a certain range becomes too small or too large to distinguish well. An algorithm model therefore has a maximum recognition pixel and a minimum recognition pixel bounding the object sizes it can recognize, and recognition accuracy drops sharply for targets larger than the maximum or smaller than the minimum. At present, the recognition parameters of a CV model are internalized and fixed and cannot be flexibly adjusted: if the target size in a picture exceeds the maximum recognition pixel or does not reach the minimum recognition pixel, the target is not recognized and is not reflected in the algorithm's recognition result.
Disclosure of Invention
The present invention is directed to solving, at least to some extent, one of the technical problems in the art described above. Therefore, a first objective of the present invention is to provide an image processing method that adjusts the maximum recognition pixel and the minimum recognition pixel of the sample cv model according to the scene information of the scene to be recognized, so as to increase the accuracy of the final identification result.
A second object of the present invention is to provide an image processing system.
In order to achieve the above object, an embodiment of a first aspect of the present invention provides an image processing method, including:
obtaining sample scene information and a sample scene image shot in a sample scene, and performing model training according to the sample scene information and the sample scene image to obtain a sample cv model; the configuration parameters of the sample cv model include an identification range of pixels; the identification range comprises a maximum identification pixel and a minimum identification pixel;
acquiring scene information of a scene to be identified, and shooting an actual scene image of the scene to be identified;
comparing the scene information of the scene to be identified with the sample scene information to obtain a comparison result, and adjusting the identification range of the pixels of the sample cv model according to the comparison result to obtain a corrected cv model;
and inputting the actual scene image into the corrected cv model, and outputting an identification result of the actual scene image.
Further, the sample scene information includes the light intensity of the sample scene and the angle and height of the camera when the sample scene image is shot.
Further, after outputting the recognition result of the actual scene image, the method further includes:
and analyzing the recognition result, and sending out an alarm prompt when determining that the corresponding target area in the actual scene image is abnormal.
Further, the comparing the scene information of the scene to be identified with the sample scene information to obtain a comparison result, and adjusting the identification range of the pixels of the sample cv model according to the comparison result includes:
according to the comparison result, when the light intensity of the scene to be identified is greater than that of the sample scene, and the angle and height of the camera shooting the scene to be identified are respectively smaller than those of the camera shooting the sample scene, determining that the scene to be identified is, relative to the sample scene, a well-lit close-range scene, and enlarging the maximum identification pixel and the minimum identification pixel of the sample cv model;
according to the comparison result, when the light intensity of the scene to be identified is smaller than that of the sample scene, and the angle and height of the camera shooting the scene to be identified are respectively larger than those of the camera shooting the sample scene, determining that the scene to be identified is, relative to the sample scene, a poorly-lit long-range scene, and reducing the maximum identification pixel and the minimum identification pixel of the sample cv model.
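A minimal sketch of this adjustment rule, assuming dict-based scene information and an illustrative 1.5x scale factor (neither the data layout nor the factor is specified in the patent):

```python
def adjust_recognition_range(min_px, max_px, sample, scene, scale=1.5):
    """Adjust the sample cv model's pixel identification range by comparing
    scene information (light intensity, camera angle, camera height)."""
    if (scene["light"] > sample["light"]
            and scene["angle"] < sample["angle"]
            and scene["height"] < sample["height"]):
        # well-lit close-range scene: targets image larger, enlarge both bounds
        return int(min_px * scale), int(max_px * scale)
    if (scene["light"] < sample["light"]
            and scene["angle"] > sample["angle"]
            and scene["height"] > sample["height"]):
        # poorly-lit long-range scene: targets image smaller, reduce both bounds
        return int(min_px / scale), int(max_px / scale)
    return min_px, max_px  # mixed cases: leave the range unchanged
```

The mixed cases (e.g. brighter but farther) are left unchanged here; the patent only specifies the two clear-cut branches.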
Further, after outputting the recognition result of the actual scene image, the method further includes:
the recognition result comprises sub-recognition results for a plurality of regions to be recognized in the actual scene image; the areas of the plurality of regions to be identified are obtained and compared with a preset area, the regions to be identified whose area is smaller than or equal to the preset area are determined as a first target region, and the regions to be identified whose area is larger than the preset area are determined as a second target region;
determining a first number of recognition errors according to the sub-recognition result of the first target area, and performing enlargement processing on the minimum recognition range of the corrected cv model when the first number is determined to be larger than a preset number;
and determining a second number of recognition errors according to the sub-recognition result of the second target area, and performing reduction processing on the maximum recognition range of the corrected cv model when the second number is determined to be larger than a preset number.
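The feedback correction above can be sketched as follows; the error limit and step factor are assumed illustrative values:

```python
def refine_range(min_px, max_px, small_errors, large_errors,
                 err_limit=5, step=1.2):
    """Feedback correction of the corrected cv model's recognition range.
    small_errors / large_errors are the recognition-error counts for the
    first (small-area) and second (large-area) target regions."""
    if small_errors > err_limit:
        # too many misrecognized small targets: enlarge the minimum range
        min_px = int(min_px * step)
    if large_errors > err_limit:
        # too many misrecognized large targets: reduce the maximum range
        max_px = int(max_px / step)
    return min_px, max_px
```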
Further, the configuration parameters of the sample cv model further include at least one of a start time, an end time, a detection period, an alarm period, an algorithm threshold, and a detection region setting.
Further, before the actual scene image is input into the modified cv model, the method further includes:
acquiring the size of the actual scene image, judging whether the size is the same as a preset size or not, and carrying out normalization processing on the size of the actual scene image when the size is determined to be different from the preset size;
acquiring the absolute value of the pixel difference between each pixel point and its adjacent pixel points in the normalized actual scene image to obtain a plurality of absolute pixel differences; screening out the minimum absolute pixel difference and multiplying it by a preset smoothing coefficient to obtain a smooth pixel value; and adding the smooth pixel value to the pixel value of each pixel point in the actual scene image to obtain the smoothed actual scene image; an adjacent pixel point is any pixel point within a preset distance range;
carrying out image graying on the actual scene image after the smoothing processing to obtain a grayscale image;
acquiring a first gradient value of each pixel point in the gray level image in the horizontal direction;
acquiring a second gradient value of each pixel point in the gray level image in the vertical direction;
calculating the gradient amplitude of each pixel point from its first gradient value and second gradient value, calculating an average amplitude from the gradient amplitudes of all pixel points, and screening out the pixel points whose gradient amplitude is larger than the average amplitude to generate a first pixel point set; the pixel points in the first pixel point set are the edge pixel points of the grayscale image;
acquiring the gray value of each pixel point in the gray image, screening out the pixel points with the gray values larger than a preset gray value, and generating a second pixel point set;
acquiring the intersection of the first pixel point set and the second pixel point set;
acquiring a union set of the first pixel point set and the second pixel point set;
calculating the definition of the grayscale image from the intersection and the union, and judging whether the definition is smaller than a preset definition; when it is, inputting the grayscale image into a pre-trained depth information acquisition model and outputting the depth value of each pixel point in the grayscale image, calculating the circle-of-confusion diameter of each pixel point from its depth value, and deblurring the corresponding pixel points in the grayscale image according to those diameters.
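The smoothing, edge-screening and definition steps above can be sketched as follows. The per-pixel reading of the minimum-difference smoothing, the central-difference gradients, and the Jaccard-style ratio for the definition are interpretive assumptions, since the text does not fix these details:

```python
import numpy as np

def smooth(img, coeff=0.1):
    """Add coeff times each pixel's minimum absolute difference to its
    immediate neighbours (coeff is an assumed smoothing coefficient)."""
    h, w = img.shape
    out = img.astype(np.float64).copy()
    for i in range(h):
        for j in range(w):
            diffs = [abs(float(img[i, j]) - float(img[i + di, j + dj]))
                     for di in (-1, 0, 1) for dj in (-1, 0, 1)
                     if (di or dj) and 0 <= i + di < h and 0 <= j + dj < w]
            out[i, j] += coeff * min(diffs)
    return out

def edge_pixels(gray):
    """Horizontal and vertical gradient values via central differences;
    keep pixels whose gradient amplitude exceeds the average amplitude
    (the 'first pixel point set')."""
    gy, gx = np.gradient(gray.astype(np.float64))
    mag = np.hypot(gx, gy)
    return mag > mag.mean()

def definition(edge_set, bright_set):
    """Definition from the intersection and union of the edge pixel set
    and the bright (above-threshold gray value) pixel set."""
    union = np.logical_or(edge_set, bright_set).sum()
    return np.logical_and(edge_set, bright_set).sum() / union if union else 0.0
```

A sharp image concentrates its bright pixels on edges, so the intersection-over-union ratio rises with sharpness, which is consistent with the "smaller than a preset definition" test triggering deblurring.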
Further, before the actual scene image is input into the modified cv model, the method further includes:
calculating the signal-to-noise ratio of the actual scene image, judging whether the signal-to-noise ratio is smaller than a preset signal-to-noise ratio or not, and carrying out filtering processing on the actual scene image when the signal-to-noise ratio is smaller than the preset signal-to-noise ratio;
the signal-to-noise ratio C of the actual scene image is calculated as shown in formula (1):
where g is the maximum gray value of a pixel point in the actual scene image; M is the length of the actual scene image; N is the width of the actual scene image; and h(i, j) is the gray value of pixel point (i, j) in the actual scene image.
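Formula (1) itself is not reproduced in this text, so the following is only a plausible stand-in built from the same quantities (a peak-SNR-style measure of g against the root-mean-square gray value over the M x N image h); it should not be taken as the patent's actual formula:

```python
import math

def snr(img):
    """Assumed peak-SNR-style stand-in for the signal-to-noise ratio C:
    20*log10 of the maximum gray value g over the RMS gray value."""
    M, N = len(img), len(img[0])
    g = max(max(row) for row in img)          # maximum gray value
    mean_sq = sum(h * h for row in img for h in row) / (M * N)
    return 20 * math.log10(g / math.sqrt(mean_sq))
```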
Further, after the filtering process is performed on the actual scene image, the method further includes:
calculating a filter coefficient K for the actual scene image, as shown in formula (2):
where W is the size of the filtering window; λ is the Laplace operator; and L(i, j) is the weight of pixel point (i, j) in the actual scene image, calculated from the gradient information of the pixel points;
calculating the gray value f (i, j) of the pixel point (i, j) in the actual scene image after filtering processing according to the filtering coefficient K of the actual scene image, as shown in formula (3):
calculating the gray value of each pixel point in the actual scene image after filtering;
screening out the pixel points whose gray value is larger than a preset gray value to generate a third pixel point set, and the pixel points whose gray value is smaller than the preset gray value to generate a fourth pixel point set;
reducing the gray value of each pixel point in the third pixel point set;
and increasing the gray value of each pixel point in the fourth pixel point set.
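The adjustment of the third and fourth pixel point sets amounts to pulling gray values toward the preset gray value; a sketch with assumed threshold and step values:

```python
def compress_contrast(img, threshold=128, delta=10):
    """Reduce gray values above the preset threshold (third pixel point
    set) and raise those below it (fourth set); threshold and delta are
    assumed illustrative values."""
    return [[max(0, v - delta) if v > threshold else
             min(255, v + delta) if v < threshold else v
             for v in row] for row in img]
```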
In order to achieve the above object, an embodiment of a second aspect of the present invention provides an image processing system, including:
the model training module is used for acquiring sample scene information and a sample scene image shot in a sample scene, and performing model training according to the sample scene information and the sample scene image to obtain a sample cv model; the configuration parameters of the sample cv model include an identification range of pixels; the identification range comprises a maximum identification pixel and a minimum identification pixel;
the system comprises an image acquisition module, a scene recognition module and a scene recognition module, wherein the image acquisition module is used for acquiring scene information of a scene to be recognized and shooting an actual scene image of the scene to be recognized;
the model correction module is used for comparing the scene information of the scene to be identified with the sample scene information to obtain a comparison result, and adjusting the identification range of the pixels of the sample cv model according to the comparison result to obtain a corrected cv model;
and the image identification module is used for inputting the actual scene image into the corrected cv model and outputting an identification result of the actual scene image.
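A minimal sketch of how the four modules might be wired together; the constructor layout and callable signatures are assumptions for illustration:

```python
class ImageProcessingSystem:
    """Four cooperating modules: model training, image acquisition,
    model correction, and image identification."""
    def __init__(self, train, acquire, correct, recognize):
        self.train, self.acquire = train, acquire
        self.correct, self.recognize = correct, recognize

    def run(self, sample_info, sample_images):
        model = self.train(sample_info, sample_images)            # model training module
        scene_info, scene_image = self.acquire()                  # image acquisition module
        corrected = self.correct(model, scene_info, sample_info)  # model correction module
        return self.recognize(corrected, scene_image)             # image identification module
```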
Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and drawings.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention and not to limit the invention. In the drawings:
FIG. 1 is a flow chart of an image processing method of the present invention;
fig. 2 is a diagram of adjustment of configuration parameters of the sample cv model;
FIG. 3 is a block diagram of an image processing system according to the present invention.
Reference numerals:
the device comprises a training module 1, an acquisition module 2, a correction module 3 and an identification module 4.
Detailed Description
The preferred embodiments of the present invention will be described in conjunction with the accompanying drawings; it should be understood that they are presented for illustration and explanation, not limitation.
An image processing method and system according to an embodiment of the present invention are described with reference to fig. 1 and fig. 3.
As shown in fig. 1, an image processing method includes steps S1-S4:
s1, obtaining sample scene information and a sample scene image shot in a sample scene, and performing model training according to the sample scene information and the sample scene image to obtain a sample cv model; the configuration parameters of the sample cv model include an identification range of pixels; the identification range comprises a maximum identification pixel and a minimum identification pixel;
s2, acquiring scene information of a scene to be identified and shooting an actual scene image of the scene to be identified;
s3, comparing the scene information of the scene to be identified with the sample scene information to obtain a comparison result, and adjusting the identification range of the pixels of the sample cv model according to the comparison result to obtain a corrected cv model;
and S4, inputting the actual scene image into the corrected cv model and outputting the identification result of the actual scene image.
The working principle of the scheme is as described in steps S1-S4 above. Performing model training according to the sample scene information and the sample scene images to obtain the sample cv model includes: performing image analysis on a plurality of sample scene images to obtain the foreground image of each sample scene image, segmenting the foreground image out of its sample scene image, and leaving a blank frame in the sample scene image; splicing the foreground images into the blank frames of different sample scene images to obtain a plurality of spliced images; and performing model training according to the sample scene images, the spliced images and the sample scene information to obtain the cv model.
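The foreground-splicing augmentation described above can be sketched as follows, assuming boolean foreground masks are available for each sample scene image; the round-robin pairing of images is an illustrative choice:

```python
import numpy as np

def splice_foregrounds(images, masks):
    """For each sample scene image, blank out its own foreground (leaving
    a 'blank frame') and splice in the foreground of another image."""
    spliced = []
    for k, (img, mask) in enumerate(zip(images, masks)):
        donor_img = images[(k + 1) % len(images)]
        donor_mask = masks[(k + 1) % len(masks)]
        frame = img.copy()
        frame[mask] = 0                            # leave a blank frame
        frame[donor_mask] = donor_img[donor_mask]  # splice in another foreground
        spliced.append(frame)
    return spliced
```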
The beneficial effects of the above scheme: comparing the scene information of the scene to be identified with the sample scene information and adjusting the pixel identification range of the sample cv model according to the comparison result makes the configuration parameters of the cv model adjustable, improves the accuracy of its identification result, and improves its practicability.
According to some embodiments of the invention, the sample scene information comprises a light intensity of a sample scene, an angle and a height of a camera when the sample scene image is taken.
The working principle and beneficial effects of the scheme are as follows: the light intensity of the sample scene and the angle and height of the camera when the sample scene image is shot are important factors affecting the quality and type of the actual scene image, and they ultimately affect the accuracy of the cv model's identification result.
According to some embodiments of the present invention, after outputting the recognition result of the actual scene image, the method further includes:
and analyzing the recognition result, and sending out an alarm prompt when determining that the corresponding target area in the actual scene image is abnormal.
The working principle and beneficial effects of the scheme are as follows: the recognition result is analyzed, and an alarm prompt is sent out when the corresponding target area in the actual scene image is determined to be abnormal, reminding a worker to adjust the configuration parameters of the cv model in time so as to increase the accuracy of the final recognition result.
According to some embodiments of the present invention, the comparing the scene information of the scene to be identified with the sample scene information to obtain a comparison result, and adjusting the identification range of the pixel of the sample cv model according to the comparison result includes:
according to the comparison result, when the light intensity of the scene to be identified is greater than that of the sample scene, and the angle and height of the camera shooting the scene to be identified are respectively smaller than those of the camera shooting the sample scene, determining that the scene to be identified is, relative to the sample scene, a well-lit close-range scene, and enlarging the maximum identification pixel and the minimum identification pixel of the sample cv model;
according to the comparison result, when the light intensity of the scene to be identified is smaller than that of the sample scene, and the angle and height of the camera shooting the scene to be identified are respectively larger than those of the camera shooting the sample scene, determining that the scene to be identified is, relative to the sample scene, a poorly-lit long-range scene, and reducing the maximum identification pixel and the minimum identification pixel of the sample cv model.
The working principle of the scheme is as set out above: relative to the sample scene, a well-lit close-range scene leads to enlargement of the maximum and minimum identification pixels of the sample cv model, while a poorly-lit long-range scene leads to their reduction.
The beneficial effects of the above scheme: the scene type of the scene to be identified is accurately determined from its light intensity and the angle and height of the camera shooting it, and the maximum and minimum identification pixels of the sample cv model are adjusted according to that scene type, which greatly improves the accuracy of the cv model's identification result and reduces missed identifications and false alarms.
According to some embodiments of the present invention, after outputting the recognition result of the actual scene image, the method further includes:
the recognition result comprises sub-recognition results for a plurality of regions to be recognized in the actual scene image; the areas of the plurality of regions to be identified are obtained and compared with a preset area, the regions to be identified whose area is smaller than or equal to the preset area are determined as a first target region, and the regions to be identified whose area is larger than the preset area are determined as a second target region;
determining a first number of recognition errors according to the sub-recognition result of the first target area, and performing enlargement processing on the minimum recognition range of the corrected cv model when the first number is determined to be larger than a preset number;
and determining a second number of recognition errors according to the sub-recognition result of the second target area, and performing reduction processing on the maximum recognition range of the corrected cv model when the second number is determined to be larger than a preset number.
The working principle of the scheme follows the steps above: recognition errors are counted separately for the small (first) and large (second) target regions, and the minimum or maximum recognition range of the corrected cv model is enlarged or reduced when the corresponding error count exceeds the preset number.
The beneficial effects of the above scheme: when many small targets in the picture are falsely reported, the minimum recognition range is enlarged appropriately; when many large targets are falsely reported, the maximum recognition range is reduced appropriately. This controls the false and missed reports of the cv model and improves the accuracy of the final identification result.
As shown in fig. 2, according to some embodiments of the present invention, the configuration parameters of the sample cv model further include at least one of a start time, an end time, a detection period, an alarm period, an algorithm threshold, and a detection region setting.
The working principle and beneficial effects of the scheme are as follows: the configuration parameters of the sample cv model further include at least one of a start time, an end time, a detection period, an alarm period, an algorithm threshold and a detection region setting, wherein the start time is the time at which the cv model starts identification, the end time is the time at which it ends identification, and the algorithm threshold is the identification range of the cv model.
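Collected as a configuration object, the parameters named above might look like the following; the field types and default values are illustrative assumptions, not values from the patent:

```python
from dataclasses import dataclass

@dataclass
class CvModelConfig:
    """Configuration parameters named in the text (assumed types/defaults)."""
    min_px: int = 20                 # minimum identification pixel
    max_px: int = 200                # maximum identification pixel
    start_time: str = "08:00"        # when the cv model starts identifying
    end_time: str = "20:00"          # when it stops
    detection_period_s: int = 60     # seconds between detection runs
    alarm_period_s: int = 300        # minimum seconds between alarms
    algorithm_threshold: float = 0.5
    detection_region: tuple = (0, 0, 1920, 1080)  # x, y, width, height
```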
According to some embodiments of the invention, before inputting the actual scene image into the modified cv model, the method further comprises:
acquiring the size of the actual scene image, judging whether the size is the same as a preset size or not, and carrying out normalization processing on the size of the actual scene image when the size is determined to be different from the preset size;
acquiring the absolute value of the pixel difference between each pixel point and its adjacent pixel points in the normalized actual scene image to obtain a plurality of absolute pixel differences; screening out the minimum absolute pixel difference and multiplying it by a preset smoothing coefficient to obtain a smooth pixel value; and adding the smooth pixel value to the pixel value of each pixel point in the actual scene image to obtain the smoothed actual scene image; an adjacent pixel point is any pixel point within a preset distance range;
carrying out image graying on the actual scene image after the smoothing processing to obtain a grayscale image;
acquiring a first gradient value of each pixel point in the gray level image in the horizontal direction;
acquiring a second gradient value of each pixel point in the gray level image in the vertical direction;
calculating the gradient amplitude of each pixel point from its first gradient value and second gradient value, calculating an average amplitude from the gradient amplitudes of all pixel points, and screening out the pixel points whose gradient amplitude is larger than the average amplitude to generate a first pixel point set; the pixel points in the first pixel point set are the edge pixel points of the grayscale image;
acquiring the gray value of each pixel point in the gray image, screening out the pixel points with the gray values larger than a preset gray value, and generating a second pixel point set;
acquiring the intersection of the first pixel point set and the second pixel point set;
acquiring a union set of the first pixel point set and the second pixel point set;
calculating the definition of the grayscale image from the intersection and the union, and judging whether the definition is smaller than a preset definition; when it is, inputting the grayscale image into a pre-trained depth information acquisition model and outputting the depth value of each pixel point in the grayscale image, calculating the circle-of-confusion diameter of each pixel point from its depth value, and deblurring the corresponding pixel points in the grayscale image according to those diameters.
The working principle of the scheme is as follows: acquiring the size of the actual scene image, judging whether the size is the same as a preset size, and normalizing the size of the actual scene image when the size is determined to be different from the preset size; acquiring the absolute value of the pixel difference between each pixel point and an adjacent pixel point in the normalized actual scene image to obtain a plurality of absolute pixel difference values, screening out the minimum absolute pixel difference value, multiplying it by a preset smoothing coefficient to obtain a smoothing pixel value, and adding the smoothing pixel value to the pixel value of each pixel point in the actual scene image to obtain the smoothed actual scene image; an adjacent pixel point is any pixel point within a preset distance range; graying the smoothed actual scene image to obtain a grayscale image; acquiring a first gradient value of each pixel point in the grayscale image in the horizontal direction and a second gradient value in the vertical direction; calculating the gradient amplitude of each pixel point from its first and second gradient values, calculating the average amplitude from the gradient amplitudes of all pixel points, screening out the pixel points whose gradient amplitudes are larger than the average amplitude, and generating a first pixel point set, the pixel points in the first pixel point set being the edge pixel points in the grayscale image; acquiring the gray value of each pixel point in the grayscale image, screening out the pixel points whose gray values are larger than a preset gray value, and generating a second pixel point set; acquiring the intersection of the first pixel point set and the second pixel point set; acquiring the union of the first pixel point set and the second pixel point set; calculating the sharpness of the grayscale image according to the intersection and the union, judging whether the sharpness is smaller than a preset sharpness, inputting the grayscale image into a pre-trained depth information acquisition model when the sharpness is determined to be smaller than the preset sharpness, outputting the depth value of each pixel point in the grayscale image, calculating the diameter of the circle of confusion of each pixel point according to its depth value, and performing deblurring processing on the corresponding pixel point in the grayscale image according to the diameter of the circle of confusion of each pixel point.
The beneficial effects of the above scheme are as follows: the sharpness of the actual scene image also determines the accuracy of the final recognition result, and if the sharpness of the actual scene image is low, the recognition result is inaccurate, so the scheme provides a method for detecting and improving the sharpness of the actual scene image. The size of the actual scene image is acquired and compared with a preset size, and the size of the actual scene image is normalized when it is determined to be different from the preset size; the normalization processing crops the actual scene image to the preset size. The absolute value of the pixel difference between each pixel point and an adjacent pixel point in the normalized actual scene image is acquired to obtain a plurality of absolute pixel difference values, the minimum absolute pixel difference value is screened out and multiplied by a preset smoothing coefficient to obtain a smoothing pixel value, and the smoothing pixel value is added to the pixel value of each pixel point in the actual scene image to obtain the smoothed actual scene image; an adjacent pixel point is any pixel point within a preset distance range. The smoothed actual scene image is grayed to obtain a grayscale image; the graying processing avoids banding distortion. A first gradient value in the horizontal direction and a second gradient value in the vertical direction are acquired for each pixel point in the grayscale image, and the gradient amplitude of each pixel point is calculated from its first and second gradient values; the average amplitude is calculated from the gradient amplitudes of all pixel points, the pixel points whose gradient amplitudes are larger than the average amplitude are screened out, and a first pixel point set is generated, the pixel points in the first pixel point set being the edge pixel points in the grayscale image; calculating the gradient amplitude from both the horizontal first gradient value and the vertical second gradient value of each pixel point makes it more accurate. The gray value of each pixel point in the grayscale image is acquired, the pixel points whose gray values are larger than a preset gray value are screened out, and a second pixel point set is generated; the intersection and the union of the first pixel point set and the second pixel point set are acquired; the sharpness of the grayscale image is calculated according to the intersection and the union, the sharpness being the intersection-over-union ratio, namely the ratio of the intersection to the union. Whether the sharpness is smaller than a preset sharpness is judged; when the sharpness is determined to be smaller than the preset sharpness, the grayscale image is input into a pre-trained depth information acquisition model, which outputs the depth value of each pixel point in the grayscale image, namely the distance between the photographed scene point and the camera; the diameter of the circle of confusion of each pixel point is calculated according to its depth value, and deblurring processing is performed on the corresponding pixel point in the grayscale image according to the diameter of the circle of confusion of each pixel point, so that the deblurred actual scene image is clearer and the accuracy of the final recognition result is improved.
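The text does not give the formula by which the circle-of-confusion diameter is derived from the depth value. A common thin-lens form, shown here purely as an assumption and not as the patent's actual method, relates the diameter to the aperture, focal length, focus distance, and the point's depth:

```python
def circle_of_confusion_diameter(depth, focus_dist, focal_len, aperture_diam):
    """Thin-lens circle-of-confusion diameter for a point at `depth`.

    All distances are in the same unit (e.g. metres). This specific formula
    is an assumed stand-in; the source only states that the diameter is
    computed from the depth value.
    """
    return abs(aperture_diam * focal_len * (depth - focus_dist)
               / (depth * (focus_dist - focal_len)))
```

Points at the focus distance map to a zero-diameter circle (perfectly sharp); the diameter grows as the point moves away from the focal plane, which is what the per-pixel deblurring step then compensates for.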
According to some embodiments of the invention, before inputting the actual scene image into the modified cv model, the method further comprises:
calculating the signal-to-noise ratio of the actual scene image, judging whether the signal-to-noise ratio is smaller than a preset signal-to-noise ratio or not, and carrying out filtering processing on the actual scene image when the signal-to-noise ratio is smaller than the preset signal-to-noise ratio;
the signal-to-noise ratio C of the actual scene image is calculated as shown in formula (1):
g is the maximum gray value of a pixel point in the actual scene image; m is the length of the actual scene image; n is the width of the actual scene image; h (i, j) is the gray value of the pixel point (i, j) in the actual scene image.
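Formula (1) itself is not reproduced in this text. Given the quantities listed (maximum gray value G, image length M, width N, and gray values H(i, j)), a peak-to-mean-square ratio in decibels is one plausible reading; the sketch below uses that form strictly as an assumption, not as the patent's actual formula (1).

```python
import math

def snr_estimate(image):
    """Assumed reading of formula (1): peak gray value squared over the
    mean squared gray level, in dB. The exact formula is not reproduced
    in the source text, so this specific form is a guess."""
    M = len(image)                      # length of the actual scene image
    N = len(image[0])                   # width of the actual scene image
    G = max(max(row) for row in image)  # maximum gray value of a pixel point
    mean_sq = sum(h * h for row in image for h in row) / (M * N)
    return 10.0 * math.log10(G * G / mean_sq)
```

With this form, a uniform image scores 0 dB, and images whose energy is concentrated in a few bright pixels score higher.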
The working principle of the scheme is as follows: and calculating the signal-to-noise ratio of the actual scene image, judging whether the signal-to-noise ratio is smaller than a preset signal-to-noise ratio, and carrying out filtering processing on the actual scene image when the signal-to-noise ratio is determined to be smaller than the preset signal-to-noise ratio.
The beneficial effects of the above scheme are as follows: excessive noise in the actual scene image also affects the accuracy of the final detection result, so it is necessary to calculate the signal-to-noise ratio of the actual scene image. The calculation takes into account the maximum gray value of a pixel point in the actual scene image, the length of the actual scene image, and the width of the actual scene image, so the calculated signal-to-noise ratio is more accurate; this improves the accuracy of the comparison between the signal-to-noise ratio and the preset signal-to-noise ratio, and facilitates filtering the actual scene image when the signal-to-noise ratio is smaller than the preset signal-to-noise ratio.
According to some embodiments of the present invention, after the filtering process is performed on the actual scene image, the method further includes:
calculating a filter coefficient K for the actual scene image, as shown in formula (2):
wherein W is the size of the filtering window; λ is the Laplacian operator; L (i, j) is the weight of the pixel point (i, j) in the actual scene image; the weight is calculated according to the gradient information of the pixel point;
calculating the gray value f (i, j) of the pixel point (i, j) in the actual scene image after filtering processing according to the filtering coefficient K of the actual scene image, as shown in formula (3):
calculating the gray value of each pixel point in the actual scene image after filtering;
screening out the pixel points with the gray value larger than a preset gray value to generate a third pixel point set, screening out the pixel points with the gray value smaller than the preset gray value to generate a fourth pixel point set;
carrying out reduction processing on the gray value of each pixel point in the third pixel point set;
and carrying out increasing processing on the gray value of each pixel point in the fourth pixel point set.
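The gray-value adjustment of the third and fourth pixel point sets can be sketched as follows; the preset gray value of 128 and the adjustment step of 10 are assumptions, as the scheme leaves both unspecified.

```python
def adjust_gray_levels(image, threshold=128, step=10):
    """Reduce gray values above the preset value (third pixel point set)
    and increase gray values below it (fourth pixel point set).
    `threshold` and `step` are assumed parameters."""
    out = []
    for row in image:
        out.append([max(0, g - step) if g > threshold
                    else (min(255, g + step) if g < threshold else g)
                    for g in row])
    return out
```

Pixels exactly at the preset gray value belong to neither set and are left unchanged in this sketch.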
The working principle and beneficial effects of the scheme are as follows: the gray value of each pixel point in the filtered actual scene image is calculated; the pixel points whose gray values are larger than a preset gray value are screened out to generate a third pixel point set, and the pixel points whose gray values are smaller than the preset gray value are screened out to generate a fourth pixel point set; the gray value of each pixel point in the third pixel point set is reduced, and the gray value of each pixel point in the fourth pixel point set is increased. Adjusting the gray values of the pixel points in the filtered actual scene image in this way can increase the contrast of the actual scene image and ensure the accuracy of the final detection.
As shown in fig. 3, an image processing system includes:
the training module 1 is used for acquiring sample scene information and a sample scene image shot in a sample scene, and performing model training according to the sample scene information and the sample scene image to obtain a sample cv model; the configuration parameters of the sample cv model include an identification range of pixels; the identification range comprises a maximum identification pixel and a minimum identification pixel;
the acquisition module 2 is used for acquiring scene information of a scene to be identified and shooting an actual scene image of the scene to be identified;
the correction module 3 is configured to compare scene information of the scene to be identified with sample scene information to obtain a comparison result, and adjust an identification range of pixels of the sample cv model according to the comparison result to obtain a corrected cv model;
and the identification module 4 is used for inputting the actual scene image into the corrected cv model and outputting an identification result of the actual scene image.
The working principle of the scheme is as follows: the training module obtains sample scene information and a sample scene image shot in a sample scene, and performs model training according to the sample scene information and the sample scene image to obtain a sample cv model; the configuration parameters of the sample cv model include an identification range of pixels; the identification range comprises a maximum identification pixel and a minimum identification pixel; the acquisition module acquires scene information of a scene to be identified and shoots an actual scene image of the scene to be identified; the correction module compares the scene information of the scene to be identified with the sample scene information to obtain a comparison result, and adjusts the identification range of the pixels of the sample cv model according to the comparison result to obtain a corrected cv model; and the identification module inputs the actual scene image into the corrected cv model and outputs an identification result of the actual scene image. Performing model training according to the sample scene information and the sample scene images to obtain the sample cv model comprises: performing image analysis on a plurality of sample scene images to obtain the foreground image of each sample scene image, segmenting each foreground image from its sample scene image, and leaving a blank frame in the sample scene image; splicing the foreground images into the blank frames of different sample scene images to obtain a plurality of spliced images; and performing model training according to the sample scene images, the spliced images and the sample scene information to obtain the cv model.
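The foreground-splicing augmentation described in the working principle (each segmented foreground pasted into the blank frame of a different sample scene image) amounts to pairing every foreground with every other background. A minimal sketch, with the pasting itself abstracted away:

```python
def splice_foregrounds(samples):
    """`samples` is a list of (foreground, background) pairs segmented from
    the sample scene images. Each foreground is spliced into the blank
    frame of every *other* background, yielding the spliced images."""
    spliced = []
    for i, (fg, _) in enumerate(samples):
        for j, (_, bg) in enumerate(samples):
            if i != j:
                spliced.append((fg, bg))
    return spliced
```

For n sample scene images this yields n·(n−1) spliced images, which are then used alongside the originals for model training.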
The beneficial effects of the above scheme are as follows: the scene information of the scene to be identified is compared with the sample scene information to obtain a comparison result, and the identification range of the pixels of the sample cv model is adjusted according to the comparison result; this increases the adjustability of the configuration parameters of the cv model, improves the accuracy of the identification results of the cv model, and improves its practicability.
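The correction module's range adjustment (spelled out in claim 4) can be sketched as follows; the scale factor applied when enlarging or reducing the identification pixels is an assumption, since the scheme only says the bounds are enlarged or reduced.

```python
from dataclasses import dataclass

@dataclass
class SceneInfo:
    light: float   # light intensity of the scene
    angle: float   # camera angle when shooting
    height: float  # camera height when shooting

def adjust_recognition_range(sample, scene, min_px, max_px, scale=1.2):
    """Claim-4 style adjustment of the identification pixel range.
    `scale` is an assumed factor; the patent does not fix its value."""
    # Close scene with good light relative to the sample scene:
    # enlarge both the maximum and minimum identification pixels.
    if (scene.light > sample.light and scene.angle < sample.angle
            and scene.height < sample.height):
        return min_px * scale, max_px * scale
    # Long-range scene with poor light relative to the sample scene:
    # reduce both the maximum and minimum identification pixels.
    if (scene.light < sample.light and scene.angle > sample.angle
            and scene.height > sample.height):
        return min_px / scale, max_px / scale
    return min_px, max_px
```

Scenes matching neither condition keep the sample cv model's original range in this sketch.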
It will be apparent to those skilled in the art that various changes and modifications may be made in the present invention without departing from the spirit and scope of the invention. Thus, if such modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalents, the present invention is also intended to include such modifications and variations.
Claims (10)
1. An image processing method, comprising:
obtaining sample scene information and a sample scene image shot in a sample scene, and performing model training according to the sample scene information and the sample scene image to obtain a sample cv model; the configuration parameters of the sample cv model include an identification range of pixels; the identification range comprises a maximum identification pixel and a minimum identification pixel;
acquiring scene information of a scene to be identified, and shooting an actual scene image of the scene to be identified;
comparing the scene information of the scene to be identified with the sample scene information to obtain a comparison result, and adjusting the identification range of the pixels of the sample cv model according to the comparison result to obtain a corrected cv model;
and inputting the actual scene image into the corrected cv model, and outputting an identification result of the actual scene image.
2. The image processing method of claim 1, wherein the sample scene information comprises light intensity of a sample scene, an angle and a height of a camera when the sample scene image is captured.
3. The image processing method according to claim 1, further comprising, after outputting the recognition result of the actual scene image:
and analyzing the recognition result, and sending out an alarm prompt when determining that the corresponding target area in the actual scene image is abnormal.
4. The image processing method according to claim 1, wherein the comparing scene information of the scene to be recognized with sample scene information to obtain a comparison result, and the adjusting the recognition range of the pixels of the sample cv model according to the comparison result comprises:
according to the comparison result, when the light intensity of the scene to be identified is determined to be greater than that of a sample scene, and the angle and the height of a camera shooting the scene to be identified are respectively smaller than those of the camera shooting the sample scene, determining that the scene to be identified is a close scene with good light relative to the sample scene, and carrying out enlargement processing on the maximum identification pixel and the minimum identification pixel of the sample cv model;
according to the comparison result, when the light intensity of the scene to be identified is determined to be smaller than that of a sample scene, and the angle and the height of the camera shooting the scene to be identified are respectively larger than those of the camera shooting the sample scene, determining that the scene to be identified is a long-range scene with poor light relative to the sample scene, and carrying out reduction processing on the maximum identification pixel and the minimum identification pixel of the sample cv model.
5. The image processing method according to claim 1, further comprising, after outputting the recognition result of the actual scene image:
the recognition result comprises sub-recognition results of a plurality of regions to be recognized which are included in the actual scene image; the method comprises the steps of obtaining the areas of a plurality of regions to be identified, comparing the areas with a preset area, determining the regions to be identified which are smaller than or equal to the preset area as a first target region, and determining the regions to be identified which are larger than the preset area as a second target region;
determining a first number of recognition errors according to the sub-recognition result of the first target area, and performing enlargement processing on the minimum recognition range of the corrected cv model when the first number is determined to be larger than a preset number;
and determining a second number of recognition errors according to the sub-recognition result of the second target area, and performing reduction processing on the maximum recognition range of the corrected cv model when the second number is determined to be larger than a preset number.
6. The image processing method according to claim 1, wherein the configuration parameters of the sample cv model further include at least one of a start time, an end time, a detection period, an alarm period, an algorithm threshold, a detection region setting.
7. The image processing method according to claim 1, further comprising, before inputting the actual scene image into the modified cv model:
acquiring the size of the actual scene image, judging whether the size is the same as a preset size or not, and carrying out normalization processing on the size of the actual scene image when the size is determined to be different from the preset size;
acquiring the absolute value of the pixel difference value of each pixel point and the adjacent pixel point in the actual scene image after normalization processing to obtain the absolute values of a plurality of pixel difference values, screening out the absolute value of the minimum pixel difference value, multiplying the absolute value of the minimum pixel difference value by a preset smoothing coefficient to obtain a smooth pixel value, and adding the pixel value of each pixel point in the actual scene image and the smooth pixel value to obtain the actual scene image after smoothing processing; the adjacent pixel point is any pixel point within a preset distance range;
carrying out image graying on the actual scene image after the smoothing processing to obtain a grayscale image;
acquiring a first gradient value of each pixel point in the gray level image in the horizontal direction;
acquiring a second gradient value of each pixel point in the gray level image in the vertical direction;
calculating according to a first gradient value and a second gradient value of each pixel point to obtain a gradient amplitude of each pixel point, calculating according to the gradient amplitude of each pixel point to obtain an average amplitude, screening out the pixel points of which the gradient amplitudes are larger than the average amplitude, and generating a first pixel point set; the pixel points in the first pixel point set are edge pixel points in the gray level image;
acquiring the gray value of each pixel point in the gray image, screening out the pixel points with the gray values larger than a preset gray value, and generating a second pixel point set;
acquiring the intersection of the first pixel point set and the second pixel point set;
acquiring a union set of the first pixel point set and the second pixel point set;
calculating the sharpness of the grayscale image according to the intersection and the union, judging whether the sharpness is smaller than a preset sharpness, inputting the grayscale image into a pre-trained depth information acquisition model when the sharpness is determined to be smaller than the preset sharpness, outputting the depth value of each pixel point in the grayscale image, calculating the diameter of the circle of confusion of each pixel point according to the depth value of each pixel point, and performing deblurring processing on the corresponding pixel point in the grayscale image according to the diameter of the circle of confusion of each pixel point.
8. The image processing method according to claim 1, further comprising, before inputting the actual scene image into the modified cv model:
calculating the signal-to-noise ratio of the actual scene image, judging whether the signal-to-noise ratio is smaller than a preset signal-to-noise ratio or not, and carrying out filtering processing on the actual scene image when the signal-to-noise ratio is smaller than the preset signal-to-noise ratio;
the signal-to-noise ratio C of the actual scene image is calculated as shown in formula (1):
g is the maximum gray value of a pixel point in the actual scene image; m is the length of the actual scene image; n is the width of the actual scene image; h (i, j) is the gray value of the pixel point (i, j) in the actual scene image.
9. The image processing method according to claim 8, further comprising, after the filtering process is performed on the actual scene image:
calculating a filter coefficient K for the actual scene image, as shown in formula (2):
wherein, W is the size of the filtering window; λ is laplace operator; l (i, j) is the weight of the pixel point (i, j) in the actual scene image; the weight is calculated according to the gradient information of the pixel points;
calculating the gray value f (i, j) of the pixel point (i, j) in the actual scene image after filtering processing according to the filtering coefficient K of the actual scene image, as shown in formula (3):
calculating the gray value of each pixel point in the actual scene image after filtering;
screening out the pixel points with the gray value larger than a preset gray value to generate a third pixel point set, screening out the pixel points with the gray value smaller than the preset gray value to generate a fourth pixel point set;
carrying out reduction processing on the gray value of each pixel point in the third pixel point set;
and carrying out increasing processing on the gray value of each pixel point in the fourth pixel point set.
10. An image processing system, comprising:
the training module is used for acquiring sample scene information and a sample scene image shot in a sample scene, and performing model training according to the sample scene information and the sample scene image to obtain a sample cv model; the configuration parameters of the sample cv model include an identification range of pixels; the identification range comprises a maximum identification pixel and a minimum identification pixel;
the system comprises an acquisition module, a recognition module and a recognition module, wherein the acquisition module is used for acquiring scene information of a scene to be recognized and shooting an actual scene image of the scene to be recognized;
the correction module is used for comparing the scene information of the scene to be identified with the sample scene information to obtain a comparison result, and adjusting the identification range of the pixels of the sample cv model according to the comparison result to obtain a corrected cv model;
and the identification module is used for inputting the actual scene image into the corrected cv model and outputting an identification result of the actual scene image.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110641215.1A CN113344878B (en) | 2021-06-09 | 2021-06-09 | Image processing method and system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113344878A true CN113344878A (en) | 2021-09-03 |
CN113344878B CN113344878B (en) | 2022-03-18 |
Family
ID=77475650
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110641215.1A Active CN113344878B (en) | 2021-06-09 | 2021-06-09 | Image processing method and system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113344878B (en) |
Citations (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070153308A1 (en) * | 2005-11-14 | 2007-07-05 | Reuven Zemach | Apparatus and Method for Reducing Ink/ Toner Consumption of Color Printers |
CN102289812A (en) * | 2011-08-26 | 2011-12-21 | 上海交通大学 | Object segmentation method based on priori shape and CV (Computer Vision) model |
CN103425986A (en) * | 2013-08-31 | 2013-12-04 | 西安电子科技大学 | Breast lump image feature extraction method based on edge neighborhood weighing |
CN104599238A (en) * | 2013-10-30 | 2015-05-06 | 腾讯科技(北京)有限公司 | Image processing method and device |
US20160375830A1 (en) * | 2002-08-21 | 2016-12-29 | Magna Electronics Inc. | Multi-camera vision system for a vehicle |
CN107016391A (en) * | 2017-04-14 | 2017-08-04 | 中国科学院合肥物质科学研究院 | A kind of complex scene workpiece identification method |
CN107103312A (en) * | 2017-06-07 | 2017-08-29 | 深圳天珑无线科技有限公司 | A kind of image processing method and device |
CN107403431A (en) * | 2016-05-19 | 2017-11-28 | 深圳市华因康高通量生物技术研究院 | Automatic identification sample shot region method and system |
CN107578039A (en) * | 2017-10-08 | 2018-01-12 | 王奕博 | Writing profile comparison method based on digital image processing techniques |
US20190156571A1 (en) * | 2013-03-14 | 2019-05-23 | Imagination Technologies Limited | Rendering in computer graphics systems |
CN109934766A (en) * | 2019-03-06 | 2019-06-25 | 北京市商汤科技开发有限公司 | A kind of image processing method and device |
CN110222629A (en) * | 2019-06-03 | 2019-09-10 | 中冶赛迪重庆信息技术有限公司 | Bale No. recognition methods and Bale No. identifying system under a kind of steel scene |
CN110390033A (en) * | 2019-07-25 | 2019-10-29 | 腾讯科技(深圳)有限公司 | Training method, device, electronic equipment and the storage medium of image classification model |
CN111256596A (en) * | 2020-02-21 | 2020-06-09 | 北京容联易通信息技术有限公司 | Size measuring method and device based on CV technology, computer equipment and medium |
CN111935509A (en) * | 2020-10-09 | 2020-11-13 | 腾讯科技(深圳)有限公司 | Multimedia data playing method, related device, equipment and storage medium |
CN112434642A (en) * | 2020-12-07 | 2021-03-02 | 北京航空航天大学 | Sea-land segmentation method suitable for processing large-scene optical remote sensing data |
Non-Patent Citations (3)
Title |
---|
Liu Jianlei et al.: "Magnetic Resonance Image Segmentation Combining Probability Density Function and Active Contour Model", Optics and Precision Engineering *
Li Xuanping et al.: "Medical Image Segmentation with a Fuzzy-Clustering Cooperative Region Active Contour Model", Chinese Journal of Scientific Instrument *
Xie Zhinan et al.: "Tumor Segmentation in Liver-Cancer Ablation CT Images Based on an Improved Chan-Vese Model", Laser & Optoelectronics Progress *
Also Published As
Publication number | Publication date |
---|---|
CN113344878B (en) | 2022-03-18 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||