CN109101646B

CN109101646B - Data processing method, device, system and computer readable medium

Info

Publication number: CN109101646B
Application number: CN201810957602.4A
Authority: CN
Inventors: 王建辉; 于志闯; 徐延迟; 陈瑞军; 肖可伟
Original assignee: Beijing Seemmo Technology Co ltd
Current assignee: Beijing Seemmo Technology Co ltd
Priority date: 2018-08-21
Filing date: 2018-08-21
Publication date: 2020-12-18
Anticipated expiration: 2038-08-21
Also published as: CN109101646A

Abstract

The invention provides a data processing method, a device, a system and a computer readable medium, wherein the method is applied to an edge terminal comprising image acquisition equipment and comprises the following steps: acquiring a video stream in a region acquired by image acquisition equipment; extracting object features in each single-frame image of the video stream to obtain a feature map; for each feature map, determining a rectangular object block diagram of each object in a single-frame image corresponding to the feature map by using a preset classification neural network; tracking objects in the plurality of single-frame images, and determining an optimal image of each object according to a rectangular object block diagram of each object in the plurality of single-frame images; the optimal image of each object is sent to the server, the technical problem that the quality of the image received by the server is low in the prior art is solved, and the technical effect of improving the quality of the image received by the server is achieved.

Description

Data processing method, device, system and computer readable medium

Technical Field

The present invention relates to the field of data processing technologies, and in particular, to a data processing method, apparatus, system, and computer readable medium.

Background

Image capturing devices are ubiquitous today, and the amount of video data generated by these image capturing devices per second is very large. These video data are to be sent to the server. However, due to limitations in storage space and transmission bandwidth, video data needs to be heavily compressed before being sent to the server. After the video data is compressed in a large amount, a problem of image blurring in the video data may be caused. Therefore, a problem of low quality of the image received by the server is caused.

Disclosure of Invention

In view of the above, the present invention provides a data processing method, apparatus, system and computer readable medium to alleviate the technical problem of low quality of images received by a server in the prior art.

In a first aspect, an embodiment of the present invention provides a data processing method, where the method is applied to an edge terminal including an image acquisition device, and the method includes:

acquiring a video stream in the area acquired by the image acquisition equipment;

extracting object features in each single-frame image of the video stream to obtain a feature map;

for each feature map, determining a rectangular object block diagram of each object in a single-frame image corresponding to the feature map by using a preset classification neural network;

tracking objects in the single-frame images, and determining an optimal image of each object according to the rectangular object block diagram of each object in the single-frame images;

and sending the optimal image of each object to a server.

With reference to the first aspect, an embodiment of the present invention provides a first possible implementation manner of the first aspect, where the determining, by using a preset classification neural network, a rectangular object block diagram of each object in a single frame image corresponding to the feature map includes:

generating a feature expression function corresponding to the preset classification neural network;

acquiring a pixel value on each pixel point of the characteristic diagram;

substituting the pixel value into the characteristic expression function for each pixel point to obtain the coordinate positions of four points of the object in the single-frame image;

and generating the rectangular object block diagram according to the four-point coordinate positions.

With reference to the first aspect, an embodiment of the present invention provides a second possible implementation manner of the first aspect, where the four-point coordinate positions include: the method comprises the following steps of generating a rectangular object block diagram according to four-point coordinate positions, wherein the four-point coordinate positions comprise a first sub coordinate, a second sub coordinate, a third sub coordinate and a fourth sub coordinate, the first sub coordinate and the second sub coordinate are on the same horizontal line, the third sub coordinate and the fourth sub coordinate are on the same horizontal line, the first sub coordinate and the third sub coordinate are on the same vertical line, the second sub coordinate and the fourth sub coordinate are on the same vertical line, and the method comprises the following steps:

connecting the point on the first sub-coordinate with the point on the second sub-coordinate and the point on the third sub-coordinate respectively;

connecting the point of the fourth sub-coordinate with the point on the second sub-coordinate and the point on the third sub-coordinate respectively;

and obtaining a block diagram of the rectangular object.

With reference to the first aspect, an embodiment of the present invention provides a third possible implementation manner of the first aspect, where the tracking objects in the multiple single-frame images, and determining an optimal image of each object according to the rectangular object block diagram of each object in the multiple single-frame images, includes:

in any two adjacent single-frame images, setting the same serial number for the same object in different single-frame images;

respectively extracting the rectangular object block diagrams of the objects with the same number from the single-frame images to obtain a rectangular object block diagram set corresponding to the number;

and for each rectangular object block diagram set, performing quality evaluation on the rectangular object block diagrams in the rectangular object block diagram set to obtain the optimal image meeting preset quality requirements.

With reference to the first aspect, an embodiment of the present invention provides a fourth possible implementation manner of the first aspect, where in any two adjacent single-frame images, setting the same serial number for the same object in different single-frame images includes:

in any two adjacent single-frame images, acquiring the serial number of an object and the rectangular object frame diagram of each object in the previous single-frame image, determining the acquired serial number as a first serial number, and determining the acquired rectangular object frame diagram as a first rectangular object frame diagram;

for each object in the next single-frame image, acquiring the rectangular object frame diagram of the object, and determining the acquired rectangular object frame diagram as a second rectangular object frame diagram;

respectively determining the overlapping spaces of the first rectangular object frame diagram and the second rectangular object frame diagrams;

judging whether the overlap space with the largest value exceeds a preset overlap threshold value or not;

if the overlapping space with the largest value exceeds the preset overlapping threshold, determining that a second rectangular object frame diagram corresponding to the overlapping space with the largest value is a target rectangular object frame diagram;

and setting the first number as the number of the object corresponding to the target rectangular object frame.

With reference to the first aspect, an embodiment of the present invention provides a fifth possible implementation manner of the first aspect, where, for each rectangular object block diagram set, performing quality evaluation on a rectangular object block diagram in the rectangular object block diagram set to obtain the optimal image meeting a preset quality requirement, includes:

acquiring the image size, the image definition and the image imaging angle of each rectangular object frame diagram;

for each rectangular object block diagram, multiplying the image size by a preset image size weight, and calculating to obtain an image size component;

multiplying the image definition by a preset image definition weight, and calculating to obtain an image definition component;

multiplying the image imaging angle by a preset image imaging angle weight, and calculating to obtain an image imaging angle component;

adding the image size component, the image definition component and the image imaging angle component to calculate an image quality value;

and determining the rectangular object block diagram corresponding to the image quality value with the maximum value as the optimal image.

With reference to the first aspect, an embodiment of the present invention provides a sixth possible implementation manner of the first aspect, where, for each rectangular object block diagram set, performing quality evaluation on a rectangular object block diagram in the rectangular object block diagram set to obtain the optimal image meeting a preset quality requirement, includes:

and performing quality evaluation on the rectangular object block diagram, and selecting the rectangular object block diagram with the image size larger than a preset image size threshold, the image definition larger than a preset image definition threshold and the image imaging angle larger than a preset image imaging angle threshold as an optimal image to be output, so as to obtain the optimal image meeting preset quality requirements.

In a second aspect, an embodiment of the present invention further provides a data processing apparatus, including: the device comprises an acquisition module, an extraction module, a determination module, a tracking module and a sending module;

the acquisition module is used for acquiring the video stream in the area acquired by the image acquisition equipment;

the extraction module is used for extracting object features in each single-frame image of the video stream to obtain a feature map;

the determining module is used for determining a rectangular object block diagram of each object in the single-frame image corresponding to each feature map by using a preset classification neural network for each feature map;

the tracking module is used for tracking objects in the single-frame images and determining the optimal image of each object according to the rectangular object block diagram of each object in the single-frame images;

the sending module is used for sending the optimal image of each object to a server.

In a third aspect, an embodiment of the present invention further provides a data processing system, including: a server and a plurality of edge terminals to which the method according to the first aspect is applied, the server communicating with the plurality of edge terminals, respectively.

In a fourth aspect, the present invention also provides a computer-readable medium having non-volatile program code executable by a processor, where the program code causes the processor to execute the method according to any one of the first aspect.

The embodiment of the invention has the following beneficial effects: the data processing method provided by the embodiment of the invention is applied to an edge terminal comprising image acquisition equipment, and comprises the following steps: acquiring a video stream in the area acquired by the image acquisition equipment; extracting object features in each single-frame image of the video stream to obtain a feature map; for each feature map, determining a rectangular object block diagram of each object in a single-frame image corresponding to the feature map by using a preset classification neural network; tracking objects in the single-frame images, and determining an optimal image of each object according to the rectangular object block diagram of each object in the single-frame images; and sending the optimal image of each object to a server.

Therefore, the edge end does not need to compress all the acquired video data and then send the video data to the server, and only needs to send the optimal image of each object to the server, so that the problem of image blurring caused by the fact that a large amount of video data are compressed can be avoided, and the problem of low quality of the image received by the server is further avoided, therefore, the technical problem of low quality of the image received by the server in the prior art is solved, and the technical effect of improving the quality of the image received by the server is achieved.

Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.

In order to make the aforementioned and other objects, features and advantages of the present invention comprehensible, preferred embodiments accompanied with figures are described in detail below.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and other drawings can be obtained by those skilled in the art without creative efforts.

FIG. 1 is a flow chart of a data processing method according to an embodiment of the present invention;

FIG. 2 is a flowchart of step S104 in FIG. 1;

FIG. 3 is a schematic block diagram of a data processing apparatus provided by an embodiment of the present invention;

fig. 4 is a schematic structural diagram of a data processing system according to an embodiment of the present invention.

Detailed Description

To make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions of the present invention will be clearly and completely described below with reference to the accompanying drawings, and it is apparent that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

At present, image acquisition devices are ubiquitous, and the amount of video data generated by these image acquisition devices per second is very large. These video data are to be sent to the server. However, due to limitations in storage space and transmission bandwidth, video data needs to be heavily compressed before being sent to the server. After the video data is compressed in a large amount, a problem of image blurring in the video data may be caused. Therefore, the problem of low quality of the image received by the server is caused, and based on the data processing method, the data processing device, the data processing system and the computer readable medium provided by the embodiments of the present invention, the technical problem of low quality of the image received by the server in the prior art can be alleviated, so as to achieve the technical effect of improving the quality of the image received by the server.

To facilitate understanding of the present embodiment, a detailed description is first given of a data processing method disclosed in the present embodiment, where the method is applied to an edge terminal including an image capturing device, as shown in fig. 1, and the data processing method may include the following steps.

And step S101, acquiring the video stream in the area acquired by the image acquisition equipment.

For example, the image capturing device may be a high definition camera, a smart camera, or the like.

And S102, extracting the object characteristics in each single-frame image of the video stream to obtain a characteristic diagram.

In the embodiment of the invention, the object characteristics of the single-frame image can be extracted by adopting a neural network algorithm of a plurality of convolutional layers to obtain the characteristic diagram.

And S103, determining a rectangular object block diagram of each object in the single-frame image corresponding to the feature map by using a preset classification neural network for each feature map.

The determining, by using a preset classification neural network, a rectangular object block diagram of each object in a single-frame image corresponding to the feature map may include the following steps:

(1) and generating a feature expression function corresponding to the preset classification neural network.

(2) And acquiring a pixel value on each pixel point of the characteristic diagram.

For example, a sliding window manner may be adopted to obtain a pixel value on each pixel point of the feature map.

(3) And substituting the pixel value into the characteristic expression function for each pixel point to obtain the coordinate positions of four points of the object in the single-frame image.

For example, the four-point coordinate positions may include: the system comprises a first sub-coordinate, a second sub-coordinate, a third sub-coordinate and a fourth sub-coordinate, wherein the first sub-coordinate and the second sub-coordinate are on the same horizontal line, the third sub-coordinate and the fourth sub-coordinate are on the same horizontal line, the first sub-coordinate and the third sub-coordinate are on the same vertical line, and the second sub-coordinate and the fourth sub-coordinate are on the same vertical line.

(4) And generating the rectangular object block diagram according to the four-point coordinate positions.

The generating the rectangular object block diagram according to the four-point coordinate positions may include: connecting the point on the first sub-coordinate with the point on the second sub-coordinate and the point on the third sub-coordinate respectively; connecting the point of the fourth sub-coordinate with the point on the second sub-coordinate and the point on the third sub-coordinate respectively; and obtaining a block diagram of the rectangular object.

In the embodiment of the invention, for each feature map, the object class and the class confidence of each object in the single-frame image corresponding to the feature map can be determined by using a preset classification neural network.

For example, the object categories may be: pedestrians, bicycles, motorcycles, tricycles, cars, vans, trucks, buses or buses etc. The predetermined classification neural network may be a fast R-CNN neural network model, a YOLO (You Only Look one: Unifield, Real-Time Object Detection), a R-FCN neural network model, or an SSD neural network model.

Step S104, tracking the objects in the single-frame images, and determining the optimal image of each object according to the rectangular object block diagram of each object in the single-frame images.

Wherein the optical flow tracking algorithm or SORT (SIMPLE ONLINE ANDREALTIME TRACKING) algorithm can be used to track the objects in the plurality of single frame images.

Further, as shown in fig. 2, step S104 may include the following steps.

Step S201, in any two adjacent single-frame images, setting the same serial number for the same object in different single-frame images.

Wherein, the step S201 may include the following steps:

(1) in any two adjacent single-frame images, for each object in the previous single-frame image, acquiring the number of the object and the rectangular object frame diagram, determining the acquired number as a first number, and determining the acquired rectangular object frame diagram as a first rectangular object frame diagram.

(2) And for each object in the subsequent single-frame image, acquiring the rectangular object frame diagram of the object, and determining the acquired rectangular object frame diagram as a second rectangular object frame diagram.

(3) And respectively determining the overlapping spaces of the first rectangular object frame diagram and the second rectangular object frame diagrams.

(4) And judging whether the overlapping space with the largest value exceeds a preset overlapping threshold value.

The preset overlap threshold may be determined according to actual service requirements.

(5) And if the overlapping space with the maximum value exceeds the preset overlapping threshold value, determining that the second rectangular object frame corresponding to the overlapping space with the maximum value is the target rectangular object frame.

(6) And setting the first number as the number of the object corresponding to the target rectangular object frame.

Step S202, the rectangular object block diagrams of the objects with the same number are respectively extracted from the single-frame images, and a rectangular object block diagram set corresponding to the number is obtained.

Step S203, for each rectangular object block diagram set, performing quality evaluation on the rectangular object block diagrams in the rectangular object block diagram set to obtain the optimal image meeting preset quality requirements.

One specific implementation of step S203 may be: acquiring the image size, the image definition and the image imaging angle of each rectangular object frame diagram; for each rectangular object block diagram, multiplying the image size by a preset image size weight, and calculating to obtain an image size component; multiplying the image definition by a preset image definition weight, and calculating to obtain an image definition component; multiplying the image imaging angle by a preset image imaging angle weight, and calculating to obtain an image imaging angle component; adding the image size component, the image definition component and the image imaging angle component to calculate an image quality value; and determining the rectangular object block diagram corresponding to the image quality value with the maximum value as the optimal image.

For example, the formula for calculating the image quality value may be: x is₁*ω₁+x₂*ω₂+x₃*ω₃Where Z may represent an image quality value, x₁Can represent the image size, ω₁May represent a preset image size weight, x₂Can represent image sharpness, ω₂May represent a preset image sharpness weight, x₃Can represent the imaging angle of the image, omega₃A preset image imaging angle weight may be represented. And each rectangular object block diagram in each rectangular object block diagram set can be calculated to obtain an image quality value.

It is to be noted that the preset image size weight, the preset image sharpness weight, and the preset image imaging angle weight are not fixed. In different scenes, the preset image size weights are different, the preset image definition weights are different, and the preset image imaging angle weights are also different.

Further, another specific implementation manner of step S203 may be: acquiring the image size, the image definition and the image imaging angle of each rectangular object frame diagram; and performing quality evaluation on the rectangular object block diagram, and selecting the rectangular object block diagram with the image size larger than a preset image size threshold, the image definition larger than a preset image definition threshold and the image imaging angle larger than a preset image imaging angle threshold as an optimal image to be output, so as to obtain the optimal image meeting preset quality requirements.

Illustratively, the image definition of each rectangular object block diagram can be obtained by using a Brenner gradient method, a Tenegrad gradient method, a laplace gradient method, a variance method or an energy gradient method. The image imaging angle of each rectangular object block diagram can be obtained by adopting a VGG classification network model, ResNet (Residual Networks), GoogleNet classification network model, MobileNet classification network model or DenseNet classification network model.

The preset image size threshold, the preset image definition threshold and the preset image imaging angle threshold can be determined according to actual service requirements.

In the embodiment of the invention, the data acquisition process and the video structuring process are fused in the edge end, so that the compressed fuzzy image is prevented from being used in the video structuring process, and the quality of the optimal image can be improved. When the server performs object recognition using the optimal image, the error rate of object recognition can be reduced, and the probability of object recognition can be improved.

Step S105, sending the optimal image of each object to a server.

In the embodiment of the invention, because the edge end only sends the optimal image to the server, the interference of redundant data can be reduced, the occupancy rate of bandwidth is reduced, the cost of data transmission is reduced, and the quality of the transmitted image is improved, so that the server utilizes the high-quality optimal image to perform the object identification process, and the object identification probability is improved.

In the embodiment of the invention, the edge terminal can also send the object type, the object number, the image quality value of the optimal image and the image frame number of the single-frame image in which the optimal image is located of each object to the server.

For example, the edge may send the optimal image, the object type, the object number, the image quality value of the optimal image, and the image frame number of the single-frame image where the optimal image is located of each object to the server by using a User Datagram Protocol (UDP).

Unless specifically stated otherwise, the relative steps, numerical expressions, and values of the components and steps set forth in these embodiments do not limit the scope of the present invention.

In an embodiment of the present invention, a data processing method provided in the embodiment of the present invention is applied to an edge terminal including an image capture device, and the method includes: acquiring a video stream in the area acquired by the image acquisition equipment; extracting object features in each single-frame image of the video stream to obtain a feature map; for each feature map, determining a rectangular object block diagram of each object in a single-frame image corresponding to the feature map by using a preset classification neural network; tracking objects in the single-frame images, and determining an optimal image of each object according to the rectangular object block diagram of each object in the single-frame images; and sending the optimal image of each object to a server.

Therefore, the edge end does not need to compress all the collected video data and then send the video data to the server, and only needs to send the optimal image of each object to the server, so that the requirement of subsequent analysis can be met, the problem of image blurring caused by the fact that the video data are compressed in large quantities can be avoided, and the problem of low image quality received by the server is further avoided, therefore, the technical problem of low image quality received by the server in the prior art is solved, and the technical effect of improving the quality of the image received by the server is achieved.

In another embodiment of the present invention, a data processing apparatus disclosed in the embodiment of the present invention is described in detail, and as shown in fig. 3, the data processing apparatus may include: an acquisition module 31, an extraction module 32, a determination module 33, a tracking module 34 and a sending module 35.

The acquiring module 31 is configured to acquire a video stream in the area acquired by the image acquisition device.

The extracting module 32 is configured to extract object features in each single frame image of the video stream to obtain a feature map.

The determining module 33 is configured to determine, for each feature map, a rectangular object block diagram of each object in the single frame image corresponding to the feature map by using a preset classification neural network.

The tracking module 34 is configured to track the objects in the plurality of single-frame images, and determine an optimal image of each object according to the rectangular object block diagram of each object in the plurality of single-frame images.

The sending module 35 is configured to send the optimal image of each object to a server.

The device provided by the embodiment of the present invention has the same implementation principle and technical effect as the method embodiments, and for the sake of brief description, reference may be made to the corresponding contents in the method embodiments without reference to the device embodiments.

In another embodiment of the present invention, a data processing system disclosed in the embodiment of the present invention is described in detail, where the data processing system may include: a server and a plurality of edge terminals applying the method according to any of the above embodiments, the server communicating with the plurality of edge terminals respectively.

For example, the data processing system includes a server and four edge terminals. As shown in fig. 4, the data processing system includes a server 41 and four edge terminals, where the four edge terminals are respectively: edge end 42, edge end 43, edge end 44, and edge end 45. Server 41 communicates with edge terminal 42, edge terminal 43, edge terminal 44, and edge terminal 45, respectively.

Wherein each edge end may comprise an image capture device.

In the embodiment of the invention, each edge end only sends the optimal image to the server, so that the interference of redundant data is reduced, and the problem of image blurring caused by the massive compression of video data is avoided, thereby not only reducing the occupancy rate of bandwidth, but also improving the quality of the image. Meanwhile, the overall performance of the data processing system can be improved, the use cost of the data processing system is reduced, and the flexibility and the expandability of the system are ensured.

It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the system and the apparatus described above may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.

In a further embodiment of the present invention, a computer-readable medium having a non-volatile program code executable by a processor and causing the processor to perform any one of the methods of the above embodiments is disclosed.

In all examples shown and described herein, any particular value should be construed as merely exemplary, and not as a limitation, and thus other examples of example embodiments may have different values.

It should be noted that: like reference numbers and letters refer to like items in the following figures, and thus, once an item is defined in one figure, it need not be further defined and explained in subsequent figures.

The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.

In addition, in the description of the embodiments of the present invention, unless otherwise explicitly specified or limited, the terms "mounted," "connected," and "connected" are to be construed broadly, e.g., as meaning either a fixed connection, a removable connection, or an integral connection; can be mechanically or electrically connected; they may be connected directly or indirectly through intervening media, or they may be interconnected between two elements. The specific meanings of the above terms in the present invention can be understood in specific cases to those skilled in the art.

In the description of the present invention, it should be noted that the terms "center", "upper", "lower", "left", "right", "vertical", "horizontal", "inner", "outer", etc., indicate orientations or positional relationships based on the orientations or positional relationships shown in the drawings, and are only for convenience of description and simplicity of description, but do not indicate or imply that the device or element being referred to must have a particular orientation, be constructed and operated in a particular orientation, and thus, should not be construed as limiting the present invention. Furthermore, the terms "first," "second," and "third" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance.

The computer program product for performing the data processing method provided in the embodiment of the present invention includes a computer-readable storage medium storing a nonvolatile program code executable by a processor, where instructions included in the program code may be used to execute the method described in the foregoing method embodiment, and specific implementation may refer to the method embodiment, and is not described herein again.

It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the above-described systems, apparatuses and units may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.

In the several embodiments provided in the present application, it should be understood that the disclosed system, apparatus and method may be implemented in other ways. The above-described embodiments of the apparatus are merely illustrative, and for example, the division of the units is only one logical division, and there may be other divisions when actually implemented, and for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection of devices or units through some communication interfaces, and may be in an electrical, mechanical or other form.

The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.

In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit.

The functions, if implemented in the form of software functional units and sold or used as a stand-alone product, may be stored in a non-volatile computer-readable storage medium executable by a processor. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.

Finally, it should be noted that: the above-mentioned embodiments are only specific embodiments of the present invention, which are used for illustrating the technical solutions of the present invention and not for limiting the same, and the protection scope of the present invention is not limited thereto, although the present invention is described in detail with reference to the foregoing embodiments, those skilled in the art should understand that: any person skilled in the art can modify or easily conceive the technical solutions described in the foregoing embodiments or equivalent substitutes for some technical features within the technical scope of the present disclosure; such modifications, changes or substitutions do not depart from the spirit and scope of the embodiments of the present invention, and they should be construed as being included therein. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims

1. A data processing method is applied to an edge terminal comprising an image acquisition device, and comprises the following steps:

for each feature map, determining a rectangular object block diagram, an object class and a class confidence coefficient of each object in a single-frame image corresponding to the feature map by using a preset classification neural network;

sending the optimal image of each object to a server;

the tracking objects in the plurality of single-frame images, and determining an optimal image of each object according to the rectangular object block diagram of each object in the plurality of single-frame images, includes:

for each rectangular object block diagram set, performing quality evaluation on the rectangular object block diagrams in the rectangular object block diagram set to obtain the optimal image meeting preset quality requirements;

the method for determining the rectangular object block diagram of each object in the single-frame image corresponding to the feature diagram by using the preset classification neural network comprises the following steps:

acquiring a pixel value on each pixel point of the characteristic diagram in a sliding window mode;

2. The data processing method of claim 1, wherein the four-point coordinate positions comprise: the method comprises the following steps of generating a rectangular object block diagram according to four-point coordinate positions, wherein the four-point coordinate positions comprise a first sub coordinate, a second sub coordinate, a third sub coordinate and a fourth sub coordinate, the first sub coordinate and the second sub coordinate are on the same horizontal line, the third sub coordinate and the fourth sub coordinate are on the same horizontal line, the first sub coordinate and the third sub coordinate are on the same vertical line, the second sub coordinate and the fourth sub coordinate are on the same vertical line, and the method comprises the following steps:

and obtaining a block diagram of the rectangular object.

3. The data processing method according to claim 1, wherein in any two adjacent single-frame images, setting the same number for the same object in different single-frame images comprises:

4. The data processing method according to claim 1, wherein the performing quality evaluation on the rectangular object block diagrams in the rectangular object block diagram set to obtain the optimal image meeting a preset quality requirement for each rectangular object block diagram set comprises:

5. The data processing method according to claim 1, wherein the performing quality evaluation on the rectangular object block diagrams in the rectangular object block diagram set to obtain the optimal image meeting a preset quality requirement for each rectangular object block diagram set comprises:

6. A data processing apparatus, comprising: the device comprises an acquisition module, an extraction module, a determination module, a tracking module and a sending module;

the determining module is used for determining a rectangular object block diagram, an object category and a category confidence coefficient of each object in the single-frame image corresponding to each feature map by utilizing a preset classification neural network for each feature map;

the sending module is used for sending the optimal image of each object to a server;

the tracking module is further configured to:

the determining module is further configured to:

7. A data processing system, comprising: a server and a plurality of edge terminals to which the method according to any of claims 1-5 is applied, said server communicating with a plurality of said edge terminals respectively.

8. A computer-readable medium having non-volatile program code executable by a processor, wherein the program code causes the processor to perform the method of any of claims 1-5.