WO2021164251A1

WO2021164251A1 - Image annotation task pre-verification method and apparatus, device, and storage medium

Info

Publication number: WO2021164251A1
Application number: PCT/CN2020/117379
Authority: WO
Inventors: 王健宗; 李佳琳
Original assignee: 平安科技（深圳）有限公司
Priority date: 2020-02-21
Filing date: 2020-09-24
Publication date: 2021-08-26
Also published as: CN111428749A

Abstract

The embodiments of the present application belong to the field of artificial intelligence technology, and relate to an image annotation task pre-verification method. Said method comprises: receiving a pre-verification request sent by a user terminal, the pre-verification request at least carrying original image information and data annotation information; performing divided recognition operation on the original image information on the basis of a divided recognition algorithm, and acquiring anchor point data corresponding to the original image information; determining whether the data annotation information satisfies preset anchor point requirements; and if so, outputting a pre-verification success signal to the user terminal. The present application further provides an image annotation task pre-verification apparatus, a computer device and a storage medium. According to the present application, whether data annotation information of a user satisfies the anchor point requirements is determined by acquiring the anchor point data, so that the accuracy of data annotation information submitted by the user can be improved, thereby reducing the workload of a data processing platform, and improving the working efficiency of the data processing platform.

Description

Pre-check method, device, equipment and storage medium for image labeling task

This application is based on, and claims priority to, the Chinese invention patent application No. 202010106306.0, filed on February 21, 2020 and titled "A pre-verification method, device, equipment and storage medium for image labeling tasks".

Technical field

This application relates to the field of artificial intelligence technology, and in particular to a pre-check method, device, computer equipment, and storage medium for image labeling tasks.

Background technique

In recent years, with the continuous development of data processing platforms, the types of image annotation tasks have become more and more diverse. In order to meet the accuracy requirements of different types of image annotation tasks, a targeted task flow mechanism and verification mechanism are often required.

One existing verification method for image labeling tasks works as follows: after the data labeling information submitted by users is received, it is cut into pieces, and the verification system compares the cut labeling information and screens out the data annotations whose content is consistent with each other.

However, the inventor realizes that traditional verification methods are generally not intelligent. When a user receives an image labeling task, the user can label it arbitrarily, so the accuracy of the data labeling information submitted by users is generally low. This increases the workload of the data processing platform and thereby reduces its work efficiency.

Summary of the invention

The purpose of the embodiments of the present application is to propose a pre-verification method for image labeling tasks, which aims to solve the problems of generally low accuracy and low work efficiency of existing image labeling task verification methods.

In order to solve the above technical problems, an embodiment of the present application provides a pre-verification method for image labeling tasks, which adopts the following technical solutions:

Receiving a pre-verification request sent by a user terminal, where the pre-verification request at least carries original image information and data annotation information;

Performing a shunt recognition operation on the original image information based on a shunt recognition algorithm, and obtaining anchor point data corresponding to the original image information;

Judging whether the data labeling information meets the preset anchor point requirements;

If the preset anchor requirement is met, output a pre-check success signal to the user terminal.

In order to solve the above technical problems, an embodiment of the present application also provides a pre-verification device for image labeling tasks, which adopts the following technical solutions:

The request receiving module is configured to receive a pre-verification request sent by the user terminal, where the pre-verification request at least carries original image information and data annotation information;

An anchor point acquisition module, configured to perform a shunt recognition operation on the original image information based on a shunt recognition algorithm, and obtain anchor point data corresponding to the original image information;

The labeling judgment module is used to judge whether the data labeling information meets the preset anchor point requirements;

The result output module is configured to output a pre-check success signal to the user terminal if the preset anchor point requirement is met.

In order to solve the above technical problems, the embodiments of the present application also provide a computer device, which adopts the following technical solutions:

It includes a memory and a processor. The memory stores a computer process, and when the computer process is executed by the processor, the following steps of the pre-verification method for the image labeling task are realized:

Receiving a pre-verification request sent by the user terminal, where the pre-verification request at least carries original image information and data labeling information;

In order to solve the above technical problems, the embodiments of the present application also provide a computer-readable storage medium, which adopts the following technical solutions:

The computer-readable storage medium stores a computer process, and when the computer process is executed by a processor, the steps of the pre-verification method for the image labeling task described below are realized:

The details of one or more embodiments of the present application are presented in the following drawings and description, and other features and advantages of the present application will become apparent from the description, drawings and claims.

This application provides a pre-verification method for image labeling tasks. A pre-verification request sent by a user terminal is received, where the pre-verification request carries at least original image information and data labeling information; a shunt recognition operation is performed on the original image information based on a shunt recognition algorithm to obtain anchor point data corresponding to the original image information; whether the data labeling information meets the preset anchor point requirement is determined; and if the preset anchor point requirement is met, a pre-check success signal is output to the user terminal. By obtaining the anchor point data and judging whether the user's data labeling information meets the anchor point requirement, the accuracy of the data labeling information submitted by the user can be improved, thereby reducing the workload of the data processing platform and improving the work efficiency of the data processing platform.

Description of the drawings

In order to explain the solution in this application more clearly, the following will briefly introduce the drawings used in the description of the embodiments of the application. Obviously, the drawings in the following description are some embodiments of the application. Ordinary technicians can obtain other drawings based on these drawings without creative work.

FIG. 1 is an implementation flowchart of a pre-verification method for an image labeling task provided in Embodiment 1 of the present application;

FIG. 2 is a schematic diagram of data labeling information provided in Embodiment 1 of the present application;

FIG. 3 is a schematic diagram of another data labeling information provided in Embodiment 1 of the present application;

FIG. 4 is a schematic diagram of anchor point requirements provided in Embodiment 1 of the present application;

Fig. 5 is a flowchart of the realization of step S102 in Fig. 1;

Fig. 6 is a flowchart of the realization of step S103 in Fig. 1;

FIG. 7 is a flowchart of the realization of step S303 in FIG. 6;

FIG. 8 is a flowchart of the realization of step S402 in FIG. 7;

FIG. 9 is a schematic structural diagram of a pre-verification device for image labeling tasks provided in Embodiment 2 of the present application;

FIG. 10 is a schematic structural diagram of an anchor point acquisition module provided in Embodiment 2 of the present application;

Fig. 11 is a schematic structural diagram of an embodiment of a computer device according to the present application.

Detailed description

Unless otherwise defined, all technical and scientific terms used herein have the same meanings as commonly understood by those skilled in the technical field of the application. The terms used in the specification of the application are only for the purpose of describing specific embodiments and are not intended to limit the application. The terms "including" and "having", and any variations thereof, in the specification and claims of the application and in the above description of the drawings are intended to cover non-exclusive inclusions. The terms "first" and "second" in the description and claims of this application or in the above-mentioned drawings are used to distinguish different objects, rather than to describe a specific sequence.

The reference to "embodiments" herein means that a specific feature, structure, or characteristic described in conjunction with the embodiments may be included in at least one embodiment of the present application. The appearance of the phrase in various places in the specification does not necessarily refer to the same embodiment, nor does it refer to an independent or alternative embodiment that is mutually exclusive with other embodiments. Those skilled in the art understand, both explicitly and implicitly, that the embodiments described herein can be combined with other embodiments.

In order to enable those skilled in the art to better understand the solutions of the present application, the technical solutions in the embodiments of the present application will be described clearly and completely in conjunction with the accompanying drawings.

The pre-verification method for image labeling tasks provided by the embodiments of the present application obtains anchor point data and determines whether the user's data labeling information meets the anchor point requirement, thereby improving the accuracy of the data labeling information submitted by the user, reducing the workload of the data processing platform, and improving the work efficiency of the data processing platform.

Embodiment one

Fig. 1 shows a flow chart of the implementation of the pre-verification method of the image labeling task provided in the first embodiment of the present application. For ease of description, only the parts related to the present application are shown.

In step S101, a pre-verification request sent by a user terminal is received, where the pre-verification request at least carries original image information and data annotation information.

In the embodiments of this application, the user terminal may be, for example, a mobile phone, a smart phone, a notebook computer, a digital broadcast receiver, a PDA (personal digital assistant), a PAD (tablet computer), a PMP (portable multimedia player), a navigation device, etc. It should be understood that the examples of user terminals here are only for ease of understanding, and are not used to limit this application.

In the embodiments of the present application, the original image information refers to image information in which the user needs to mark some designated positions, such as the position of designated text in the picture, by drawing lines in the picture. The user is generally restricted to drawing only straight lines or shapes composed of line segments.

In the embodiments of the present application, the data labeling information refers to the endpoint coordinates of each line segment in the aforementioned straight lines or shapes composed of line segments. As an example, as shown in Figure 2, the data labeling information is [(2,6),(5,5)]; as shown in Figure 3, the data labeling information is [(2,6),(5,5)], [(5,5),(7,7)], [(7,7),(4,8)] and [(4,8),(2,6)]. It should be understood that the examples of data labeling information here are only for ease of understanding, and are not used to limit this application.
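The endpoint-pair notation above can be sketched in code as follows. This is only an illustrative representation: the type aliases `Point` and `Segment` and the helper `vertices_to_segments` are hypothetical names that do not appear in the application.

```python
from typing import List, Tuple

# Hypothetical type aliases, introduced only for this illustration.
Point = Tuple[float, float]
Segment = Tuple[Point, Point]


def vertices_to_segments(vertices: List[Point], closed: bool = True) -> List[Segment]:
    """Turn an ordered list of vertices into the line segments joining them."""
    segments = [(vertices[i], vertices[i + 1]) for i in range(len(vertices) - 1)]
    # A closed shape also needs the segment from the last vertex back to the first.
    if closed and len(vertices) > 2:
        segments.append((vertices[-1], vertices[0]))
    return segments


# Figure 2: a single straight line.
figure_2 = vertices_to_segments([(2, 6), (5, 5)], closed=False)
# Figure 3: a closed shape made of four line segments.
figure_3 = vertices_to_segments([(2, 6), (5, 5), (7, 7), (4, 8)])
```

With this representation, `figure_3` holds the same four endpoint pairs listed for Figure 3.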

In step S102, a shunt recognition operation is performed on the original image information based on a shunt recognition algorithm, and anchor point data corresponding to the original image information is obtained.

In the embodiments of the present application, the shunt recognition algorithm refers to performing row cutting, full-text recognition, and column cutting operations on the original image information, so that the original image is cut in units of the characters in the original image information, and the recognized and output text is used as anchor point data.

In the embodiment of the present application, the anchor point refers to a mark set on the original image information, and is used to set a certain threshold range for standard data annotation to determine whether the data annotation information submitted by the user meets the threshold range.

In step S103, it is determined whether the data labeling information meets a preset anchor point requirement.

In the embodiment of this application, the anchor point requirement is used to detect whether the data annotation information submitted by the user passes through a set of anchor points. As an example, as shown in Figure 4, the anchor point requirement is given by the anchor points (2,5) and (5,7). The data labeling information [(2,6),(5,5)] passes between the anchor points (2,5) and (5,7), that is, the data labeling information [(2,6),(5,5)] meets the anchor point requirement. It should be understood that the examples of anchor point requirements here are only for ease of understanding and are not used to limit the application.

In step S104, if the preset anchor point requirement is met, a pre-check success signal is output to the user.

In the embodiment of the present application, the pre-check signal may be output to the user by displaying it on the display device of the user terminal or by sending it as a short message; these examples are not used to limit the present application.

In the embodiments of the present application, a pre-verification method for image labeling tasks is provided. A pre-verification request sent by a user terminal is received, where the pre-verification request carries at least original image information and data labeling information; a shunt recognition operation is performed on the original image information based on a shunt recognition algorithm to obtain anchor point data corresponding to the original image information; whether the data annotation information meets the preset anchor point requirement is determined; and if the preset anchor point requirement is met, a pre-check success signal is output to the user terminal. By obtaining anchor point data and determining whether the user's data annotation information meets the anchor point requirement, the accuracy of the data annotation information submitted by the user can be improved, thereby reducing the workload of the data processing platform and improving the work efficiency of the data processing platform.

Continuing to refer to FIG. 5, a flowchart of the implementation of step S102 in FIG. 1 is shown. For ease of description, only the parts related to the present application are shown.

As some optional implementation manners of the first embodiment, the above step S102 specifically includes: step S201, step S202, step S203, step S204, and step S205.

In step S201, a line cutting operation is performed on the original image information to obtain line image information.

In the embodiment of the present application, the row cutting operation refers to segmenting the original image in the form of rows, where the height of the row is not specifically limited, and the user can adjust it according to the actual situation.

In the embodiment of the present application, the line image information refers to the image information after the original image is divided in the form of lines.

In step S202, a recognition operation is performed on the line image information to obtain a text box.

In the embodiment of the present application, the recognition operation refers to recognizing the text content in the above-mentioned line image information, and when the text content is recognized, the text content position is identified in the form of a text box.

In step S203, a column cutting operation is performed on the original image information according to the text box to obtain multiple columns of column image information carrying the text box.

In the embodiment of the present application, the column cutting operation refers to segmenting the original image in the form of columns. Since the recognized text boxes have been obtained in step S202, the text boxes in the original image information are used as the reference for segmentation when the column cutting operation is performed, so that the text content in the original image information is segmented.

In step S204, one text box is selected from the column image information of every two adjacent columns to form a text box group.

In the embodiment of this application, when there is only one text box in the column image information, that text box is selected to form the text box group; when there are multiple text boxes in the column image information, the user can choose a selection method according to the actual situation to obtain the text boxes that make up the text box group, for example random selection or selection by relative distance. It should be understood that the examples of selection methods here are only for ease of understanding and are not used to limit the embodiments of this application.

In step S205, the center point of the text box in the text box group is used as the anchor point data.

In the embodiment of this application, the line image information is first obtained by cutting the original image information, which facilitates recognition of the text content in the image information; the text box is then obtained by recognizing the line image information, which determines the specific location of the text content in the original image information; finally, the anchor point data is determined based on the text box, which limits the effective range of the user's annotation data and effectively standardizes the user's annotation operation.

As some optional implementation manners of the first embodiment of the present application, the foregoing step S203 specifically includes the following steps:

Use the vertical straight line that crosses the fewest text boxes as the column cutting line for cutting.

In the embodiment of the present application, the straight line that passes through the fewest text boxes is selected as the column cutting line, so that as many effective text boxes as possible remain after cutting, thereby ensuring the accuracy of the anchor point setting.
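The column cutting rule can be sketched as a small search over candidate vertical lines. This is a minimal illustration under stated assumptions: text boxes are represented as `(x_min, y_min, x_max, y_max)` tuples, the candidate x positions are supplied by the caller, and the names `boxes_crossed` and `choose_cut_line` are hypothetical. The application does not specify how candidate lines are generated or how ties are broken.

```python
from typing import List, Sequence, Tuple

# Assumed text box layout: (x_min, y_min, x_max, y_max).
Box = Tuple[float, float, float, float]


def boxes_crossed(boxes: Sequence[Box], x: float) -> int:
    """Count the text boxes whose interior is crossed by the vertical line at x."""
    return sum(1 for (x_min, _, x_max, _) in boxes if x_min < x < x_max)


def choose_cut_line(boxes: Sequence[Box], candidates: Sequence[float]) -> float:
    """Return the candidate x that crosses the fewest text boxes (first one on ties)."""
    return min(candidates, key=lambda x: boxes_crossed(boxes, x))


# Four boxes arranged in two columns with a gap between x = 4 and x = 5.
boxes: List[Box] = [(0, 0, 4, 2), (5, 0, 9, 2), (0, 3, 4, 5), (5, 3, 9, 5)]
cut_x = choose_cut_line(boxes, candidates=[2, 4.5, 7])
```

Here the line at x = 4.5 falls in the gap between the two columns and crosses no box, so it is chosen over the lines at x = 2 and x = 7, which would each split two boxes.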

As some optional implementation manners of the first embodiment of the present application, the foregoing step S204 specifically includes:

In the column image information of the two adjacent columns, two text boxes that are farthest apart in the vertical axis direction are used as the text box group.
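Steps S204 and S205 with this selection rule can be sketched as follows. The box layout `(x_min, y_min, x_max, y_max)` and the function names are assumptions made for illustration, and the vertical distance is measured here between box centers, which the application does not state explicitly.

```python
from itertools import product
from typing import List, Tuple

# Assumed text box layout: (x_min, y_min, x_max, y_max).
Box = Tuple[float, float, float, float]
Point = Tuple[float, float]


def center(box: Box) -> Point:
    """Center point of a text box, used as anchor point data in step S205."""
    x_min, y_min, x_max, y_max = box
    return ((x_min + x_max) / 2, (y_min + y_max) / 2)


def farthest_vertical_pair(left: List[Box], right: List[Box]) -> Tuple[Box, Box]:
    """Pick one box per column so that the vertical distance between centers is largest."""
    return max(
        product(left, right),
        key=lambda pair: abs(center(pair[0])[1] - center(pair[1])[1]),
    )


left_column = [(0, 0, 4, 2), (0, 3, 4, 5)]
right_column = [(5, 0, 9, 2), (5, 6, 9, 8)]
group = farthest_vertical_pair(left_column, right_column)
anchor_points = [center(box) for box in group]
```

In this example the top box of the left column and the bottom box of the right column are selected, giving the anchor points (2.0, 1.0) and (7.0, 7.0).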

Fig. 6 is a flowchart of the realization of step S103 in Fig. 1. For ease of description, only the parts related to the present application are shown.

As some optional implementation manners of the first embodiment of the present application, the above step S103 specifically includes: step S301, step S302, step S303, and step S304.

In step S301, the first equation and the second equation corresponding to the data annotation information and anchor point data are acquired in the same coordinate system.

In the embodiment of this application, it is assumed that the algorithm finds a set of anchor points, point A and point B, in the original image information. The annotation data submitted by the user must contain a line passing between point A and point B, that is, the line segment AB must intersect some line segment in the user's data annotation.

Create the first equation and the second equation:

The first equation: Assuming that the coordinates of point A are $(x_A, y_A)$, the coordinates of point B are $(x_B, y_B)$, and $x_A \le x_B$, the equation of the line segment can be obtained from the coordinates of its two endpoints: $(y_B - y_A)x + (x_A - x_B)y + x_B y_A - x_A y_B = 0$ $(x_A \le x \le x_B)$.

The second equation: From the endpoint coordinates of each line segment in the data annotation submitted by the user, the equation of each line segment can be found. For example, if the coordinates of the endpoint M of the line segment MN are $(x_M, y_M)$, the coordinates of the endpoint N are $(x_N, y_N)$, and $x_N \le x_M$, the equation of the line segment MN is: $(y_N - y_M)x + (x_M - x_N)y + x_N y_M - x_M y_N = 0$ $(x_N \le x \le x_M)$.
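As a worked instance of the two equations, take the values of the Figure 4 example: anchor points A = (2,5) and B = (5,7), and the user's line segment [(2,6),(5,5)]. Labeling its endpoints N = (2,6) and M = (5,5), so that $x_N \le x_M$, is an assumption made here for illustration. Substituting into the two formulas gives:

$$
\begin{aligned}
\text{AB:}\quad & (7-5)x + (2-5)y + 5\cdot 5 - 2\cdot 7 = 2x - 3y + 11 = 0 \quad (2 \le x \le 5)\\
\text{MN:}\quad & (6-5)x + (5-2)y + 2\cdot 5 - 5\cdot 6 = x + 3y - 20 = 0 \quad (2 \le x \le 5)
\end{aligned}
$$

Each endpoint satisfies its own equation, for example $2\cdot 2 - 3\cdot 5 + 11 = 0$ for A and $5 + 3\cdot 5 - 20 = 0$ for M.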

In step S302, it is determined whether there is an intersection point on the line corresponding to the second equation and the first equation.

The equation of line segment AB is solved simultaneously with the equation of each line segment drawn by the user. If the system of equations has one and only one solution within the domain, the line segment passes between anchor points A and B. Taking the line segment MN as an example, the method to determine whether the line segment MN passes between anchor points A and B is as follows:

① Solve the first equation and the second equation simultaneously:

$$
\begin{cases}
(y_B - y_A)x + (x_A - x_B)y + x_B y_A - x_A y_B = 0\\
(y_N - y_M)x + (x_M - x_N)y + x_N y_M - x_M y_N = 0
\end{cases}
$$

② Ignoring the domain, the abscissa of the unique solution of the system is:

$$
x^* = \frac{(x_A - x_B)(x_N y_M - x_M y_N) - (x_M - x_N)(x_B y_A - x_A y_B)}{(y_B - y_A)(x_M - x_N) - (y_N - y_M)(x_A - x_B)}
$$

③ If the abscissa of the unique solution meets the domain requirements $x_A \le x^* \le x_B$ and $x_N \le x^* \le x_M$, the line segment MN and the line segment AB are considered to have an intersection point within the domain, that is, a line passes between anchor point A and anchor point B.

In the embodiment of the present application, $x^*$ meeting the domain requirements $x_A \le x^* \le x_B$ and $x_N \le x^* \le x_M$ means that the second equation has an intersection point with the first equation, indicating that the data annotation submitted by the user contains a line passing between point A and point B.
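The intersection test of steps S301 and S302 can be sketched in code as follows. This is a minimal sketch, not the application's implementation: the function names are hypothetical, and the linear system is solved with Cramer's rule, which the text does not name. Like the description, the sketch checks only the abscissa against both domains, so vertical line segments would need an additional check on the ordinate that is not covered here.

```python
from typing import Optional, Tuple

Point = Tuple[float, float]
Segment = Tuple[Point, Point]


def line_coefficients(p: Point, q: Point) -> Tuple[float, float, float]:
    """Coefficients (a, b, c) of the line a*x + b*y + c = 0 through p and q."""
    (x1, y1), (x2, y2) = sorted((p, q))  # endpoint with the smaller x first
    return (y2 - y1, x1 - x2, x2 * y1 - x1 * y2)


def intersection_x(first: Segment, second: Segment) -> Optional[float]:
    """Abscissa x* of the unique solution, ignoring the domain; None if there is none."""
    a1, b1, c1 = line_coefficients(*first)
    a2, b2, c2 = line_coefficients(*second)
    det = a1 * b2 - a2 * b1
    if det == 0:  # parallel or coincident lines have no unique solution
        return None
    return (b1 * c2 - b2 * c1) / det


def passes_between(anchor: Segment, user: Segment) -> bool:
    """True if x* lies inside the x-domain of both line segments."""
    x_star = intersection_x(anchor, user)
    if x_star is None:
        return False
    return all(min(p[0], q[0]) <= x_star <= max(p[0], q[0]) for p, q in (anchor, user))


anchor_ab = ((2, 5), (5, 7))  # anchor points from the Figure 4 example
crossing = passes_between(anchor_ab, ((2, 6), (5, 5)))
missing = passes_between(anchor_ab, ((6, 6), (8, 5)))
```

For the Figure 4 values, the two lines meet at x* = 3, which lies in both domains, so `crossing` is true. The second segment lies entirely to the right of the anchor points, so `missing` is false.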

In step S303, if there is an intersection point on the line corresponding to the second equation and the first equation, the pre-check success signal is output.

In step S304, if there is no intersection point on the line corresponding to the second equation and the first equation, a pre-check failure signal is output.

In the embodiment of this application, the judgment formula is built from coordinates and equations using the logic of judging whether two line segments intersect. This standardizes the answers submitted by users well, reduces the probability of crowdsourcing users obtaining points fraudulently, enables the crowdsourcing platform to get more accurate answers in a shorter time, and improves the reliability of the answers to crowdsourced tasks such as image annotation.

Fig. 7 is a flow chart of the implementation of step S303 in Fig. 6. For ease of description, only the parts related to the present application are shown.

As some optional implementation manners of the first embodiment of the present application, the above step S303 specifically includes: step S401, step S402, and step S403.

In step S401, if the judgment result is yes, it is judged whether the lines corresponding to the first equation and the second equation are parallel or coincide with each other.

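In the embodiment of this application, taking the above-mentioned simultaneous equations as an example, whether the lines corresponding to the first equation and the second equation are parallel or coincident with each other can be determined by checking whether $(y_B - y_A)(x_M - x_N)$ is equal to $(y_N - y_M)(x_A - x_B)$. If the two values are equal, the lines are parallel or coincide with each other.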

In step S402, if the lines corresponding to the first equation and the second equation are not parallel or coincide with each other, the pre-check success signal is output.

In step S403, if the lines corresponding to the first equation and the second equation are parallel or coincide with each other, the pre-check failure signal is output.
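Steps S401 to S403 can be sketched as a comparison of the x and y coefficients of the two line equations. The cross-multiplied form used below is an assumption chosen to avoid division by zero for vertical lines, and the function names and signal strings are hypothetical. The exact equality test is reliable for integer coordinates; floating-point coordinates would need a tolerance.

```python
from typing import Tuple

Point = Tuple[float, float]
Segment = Tuple[Point, Point]


def direction_coefficients(segment: Segment) -> Tuple[float, float]:
    """Return (a, b) of the line a*x + b*y + c = 0 through the segment's endpoints."""
    (x1, y1), (x2, y2) = segment
    return (y2 - y1, x1 - x2)


def parallel_or_coincident(first: Segment, second: Segment) -> bool:
    """Two lines are parallel or coincident when a1*b2 equals a2*b1."""
    a1, b1 = direction_coefficients(first)
    a2, b2 = direction_coefficients(second)
    return a1 * b2 == a2 * b1


def check_parallel(anchor: Segment, user: Segment) -> str:
    """Steps S402 and S403: fail when the lines are parallel or coincident."""
    if parallel_or_coincident(anchor, user):
        return "pre-check failure"
    return "pre-check success"


anchor_ab = ((2, 5), (5, 7))
result_crossing = check_parallel(anchor_ab, ((2, 6), (5, 5)))
result_parallel = check_parallel(anchor_ab, ((2, 6), (5, 8)))
```

The segment [(2,6),(5,8)] has the same slope as AB, so it produces the failure signal, while the Figure 4 annotation [(2,6),(5,5)] does not.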

Fig. 8 is a flow chart of the implementation of step S402 in Fig. 7. For ease of description, only the parts related to the present application are shown.

As some optional implementation manners of the first embodiment of the present application, the foregoing step S402 specifically includes: step S501, step S502, step S503, and step S504. The original image information labeling task also carries invalid anchor point data.

In step S501, the third equation and the fourth equation, corresponding to the invalid anchor point data and the data labeling information respectively, are acquired in the same coordinate system.

In the embodiments of this application, the invalid anchor point data refers to an area, preset by the original image information labeling task, that the annotation must not pass through. For example, the file signature and file header of the original image information are only used to identify the file and have nothing to do with the original image information labeling task. When the user labels at such a location, the labeling obviously does not meet the requirements of the task. Therefore, the position of the invalid anchor point needs to be preset in the original image information labeling task.

In the embodiments of this application, the third equation refers to the equation of the invalid anchor point data. Assume that the coordinates of point C in the invalid anchor point are $(x_C, y_C)$, the coordinates of point D are $(x_D, y_D)$, and $x_C \le x_D$. The third equation, found from the coordinates of the two endpoints of the line segment, is: $(y_D - y_C)x + (x_C - x_D)y + x_D y_C - x_C y_D = 0$ $(x_C \le x \le x_D)$.

In the embodiment of the present application, the fourth equation refers to the equation of the line segment defined by the endpoints of the data annotation information submitted by the user. Assume that the two endpoints of the data annotation information submitted by the user are $Q(x_Q, y_Q)$ and $P(x_P, y_P)$, with $x_Q \le x_P$.

Then the fourth equation is: $(y_P - y_Q)x + (x_Q - x_P)y + x_P y_Q - x_Q y_P = 0$ $(x_Q \le x \le x_P)$.

In step S502, it is determined whether there is an intersection point between the lines corresponding to the fourth equation and the third equation.

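① Solve the third equation and the fourth equation simultaneously:

$$
\begin{cases}
(y_D - y_C)x + (x_C - x_D)y + x_D y_C - x_C y_D = 0\\
(y_P - y_Q)x + (x_Q - x_P)y + x_P y_Q - x_Q y_P = 0
\end{cases}
$$

② Ignoring the domain, the abscissa of the unique solution of the system is:

$$
x^* = \frac{(x_C - x_D)(x_P y_Q - x_Q y_P) - (x_Q - x_P)(x_D y_C - x_C y_D)}{(y_D - y_C)(x_Q - x_P) - (y_P - y_Q)(x_C - x_D)}
$$

③ If the abscissa of the unique solution meets the domain requirements $x_C \le x^* \le x_D$ and $x_Q \le x^* \le x_P$, the line segment QP and the line segment CD are considered to have an intersection point within the domain, that is, a line passes between invalid anchor point C and invalid anchor point D.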

In step S503, if there is an intersection point between the lines corresponding to the fourth equation and the third equation, the pre-check failure signal is output.

In step S504, if there is no intersection point between the lines corresponding to the fourth equation and the third equation, the pre-check success signal is output.
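The anchor point check and the invalid anchor point check can be combined in one sketch. Folding steps S302 and S502 into a single function is a simplification made here for illustration, and the invalid anchor coordinates C = (6,1), D = (8,3), the function names, and the signal strings are all hypothetical. As in the description, only the abscissa of the solution is checked against the domains.

```python
from typing import List, Optional, Tuple

Point = Tuple[float, float]
Segment = Tuple[Point, Point]


def crossing_x(s1: Segment, s2: Segment) -> Optional[float]:
    """Abscissa where the lines through s1 and s2 meet, or None if they are parallel."""
    (x1, y1), (x2, y2) = s1
    (x3, y3), (x4, y4) = s2
    a1, b1, c1 = y2 - y1, x1 - x2, x2 * y1 - x1 * y2
    a2, b2, c2 = y4 - y3, x3 - x4, x4 * y3 - x3 * y4
    det = a1 * b2 - a2 * b1
    return None if det == 0 else (b1 * c2 - b2 * c1) / det


def segments_cross(s1: Segment, s2: Segment) -> bool:
    """True if the solution's abscissa lies in the x-domain of both segments."""
    x_star = crossing_x(s1, s2)
    if x_star is None:
        return False
    return all(min(p[0], q[0]) <= x_star <= max(p[0], q[0]) for p, q in (s1, s2))


def pre_check(annotation: List[Segment], anchor: Segment, invalid_anchor: Segment) -> str:
    """Succeed only if some segment crosses the anchor and none crosses the invalid anchor."""
    crosses_anchor = any(segments_cross(seg, anchor) for seg in annotation)
    crosses_invalid = any(segments_cross(seg, invalid_anchor) for seg in annotation)
    if crosses_anchor and not crosses_invalid:
        return "pre-check success"
    return "pre-check failure"


anchor_ab = ((2, 5), (5, 7))
invalid_cd = ((6, 1), (8, 3))  # hypothetical invalid anchor points C and D
figure_3 = [((2, 6), (5, 5)), ((5, 5), (7, 7)), ((7, 7), (4, 8)), ((4, 8), (2, 6))]
ok = pre_check(figure_3, anchor_ab, invalid_cd)
rejected = pre_check(figure_3 + [((6, 3), (8, 1))], anchor_ab, invalid_cd)
```

The Figure 3 shape crosses AB and stays clear of CD, so it passes. Adding the segment [(6,3),(8,1)], which crosses CD at x* = 7, turns the result into a failure.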

In summary, this application provides a pre-verification method for image labeling tasks. A pre-verification request sent by a user terminal is received, where the pre-verification request carries at least original image information and data labeling information; a shunt recognition operation is performed on the original image information based on a shunt recognition algorithm to obtain anchor point data corresponding to the original image information; whether the data annotation information meets the preset anchor point requirement is determined; and if the preset anchor point requirement is met, a pre-check success signal is output to the user terminal. By acquiring anchor point data and determining whether the user's data annotation information meets the anchor point requirement, the accuracy of the data annotation information submitted by the user can be improved, thereby reducing the workload of the data processing platform and improving the work efficiency of the data processing platform. At the same time, the line image information is first obtained by cutting the original image information, which facilitates recognition of the text content in the image information; the text box is then obtained by recognizing the line image information, which determines the specific location of the text content in the original image information; finally, the anchor point data is determined based on the text box, which limits the effective range of the user's labeling data and effectively standardizes the user's labeling operation. The judgment formula is built from coordinates and equations using the logic of judging whether two line segments intersect, which standardizes the answers submitted by users well, reduces the probability of crowdsourcing users obtaining points fraudulently, enables the crowdsourcing platform to get more accurate answers in a shorter time, and improves the reliability of the answers for image annotation tasks.

A person of ordinary skill in the art can understand that all or part of the processes in the above-mentioned embodiment methods can be implemented by instructing relevant hardware through a computer process. The computer process can be stored in a computer-readable storage medium, and when the computer process is executed, it may include the procedures of the above-mentioned method embodiments. The aforementioned storage medium may be a non-volatile storage medium such as a magnetic disk, an optical disc, a read-only memory (Read-Only Memory, ROM), or a random access memory (Random Access Memory, RAM), etc.

It should be understood that although the various steps in the flowcharts of the drawings are displayed in sequence as indicated by the arrows, these steps are not necessarily performed in the order indicated by the arrows. Unless explicitly stated herein, the execution of these steps is not strictly limited in order, and they can be executed in other orders. Moreover, at least part of the steps in the flowcharts of the drawings may include multiple sub-steps or multiple stages. These sub-steps or stages are not necessarily executed at the same time, but can be executed at different times, and they are not necessarily performed sequentially, but may be performed in turn or alternately with at least a part of other steps or of the sub-steps or stages of other steps.

Embodiment two

With further reference to FIG. 9, as an implementation of the method shown in FIG. 1, this application provides a device for pre-checking an image labeling task. The device embodiment corresponds to the method embodiment shown in FIG. 1, and the device can be specifically applied to various electronic devices.

As shown in Fig. 9, the image labeling task pre-verification apparatus 100 provided in the second embodiment of the present application includes: a request receiving module 101, an anchor point obtaining module 102, a labeling judgment module 103, and a result output module 104, wherein:

The request receiving module 101 is configured to receive a pre-verification request sent by a user terminal, where the pre-verification request at least carries original image information and data annotation information;

The anchor point obtaining module 102 is configured to perform a shunt recognition operation on the original image information based on a shunt recognition algorithm, and obtain anchor point data corresponding to the original image information;

The labeling judgment module 103 is used to judge whether the data labeling information meets the preset anchor point requirements;

The result output module 104 is configured to output a pre-check success signal to the user terminal if the preset anchor requirement is met.
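The cooperation of the four modules listed above can be illustrated with a minimal sketch. The class names, the signal strings, and the two pluggable callables are hypothetical and serve only to show the order in which the modules act; they are not taken from this application.

```python
from dataclasses import dataclass
from typing import Any, Callable, List, Tuple

Point = Tuple[float, float]


@dataclass
class PreCheckRequest:
    """Pre-verification request carrying at least the image and the annotation."""
    original_image: Any
    annotation: List[Point]


@dataclass
class PreCheckDevice:
    """Hypothetical wiring of modules 101-104; both callables are placeholders."""
    get_anchors: Callable[[Any], List[Point]]                      # module 102
    meets_requirement: Callable[[List[Point], List[Point]], bool]  # module 103

    def handle(self, request: PreCheckRequest) -> str:
        # Module 101: the received request is unpacked here.
        anchors = self.get_anchors(request.original_image)
        ok = self.meets_requirement(request.annotation, anchors)
        # Module 104: report the result back to the caller.
        return "PRE_CHECK_SUCCESS" if ok else "PRE_CHECK_FAILURE"


# Stub usage: fixed anchors and a trivial placeholder check (assumptions).
device = PreCheckDevice(
    get_anchors=lambda image: [(2, 5), (5, 7)],
    meets_requirement=lambda annotation, anchors: len(annotation) == len(anchors),
)
print(device.handle(PreCheckRequest(original_image=None, annotation=[(2, 6), (5, 5)])))
```

In a real deployment the two callables would be replaced by the shunt recognition algorithm and the anchor point judgment; the stubs here only make the sketch runnable.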

In the embodiments of this application, the anchor point requirement is used to detect whether the data labeling information submitted by the user passes through a set of anchor points. As an example, as shown in FIG. 4, if the anchor points are (2,5) and (5,7), then the data labeling information [(2,6),(5,5)] passes between the anchor points (2,5) and (5,7); that is, the data labeling information [(2,6),(5,5)] meets the anchor point requirement. It should be understood that this example of an anchor point requirement is given only for ease of understanding and is not intended to limit this application.
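The numeric example can be checked with a short sketch. It assumes that "passing between" two anchor points means that the labeled line segment strictly crosses the segment joining the anchors; this interpretation and the function names are illustrative assumptions.

```python
from typing import Tuple

Point = Tuple[float, float]
Segment = Tuple[Point, Point]


def _orient(p: Point, q: Point, r: Point) -> float:
    """Signed area test: > 0 if r is left of pq, < 0 if right, 0 if collinear."""
    return (q[0] - p[0]) * (r[1] - p[1]) - (q[1] - p[1]) * (r[0] - p[0])


def passes_between(label: Segment, anchors: Segment) -> bool:
    """True if the labeled segment strictly crosses the segment joining the anchors."""
    a, b = label
    c, d = anchors
    return (_orient(a, b, c) * _orient(a, b, d) < 0
            and _orient(c, d, a) * _orient(c, d, b) < 0)


# Values from the example: label [(2,6),(5,5)], anchors (2,5) and (5,7).
print(passes_between(((2, 6), (5, 5)), ((2, 5), (5, 7))))  # True
# A label lying entirely above both anchors does not pass between them.
print(passes_between(((2, 8), (5, 9)), ((2, 5), (5, 7))))  # False
```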

In an embodiment of the present application, a pre-verification device for image labeling tasks is provided, including: a request receiving module, configured to receive a pre-verification request sent by a user terminal, where the pre-verification request carries at least original image information and data annotation information; an anchor point acquisition module, configured to perform a shunt recognition operation on the original image information based on a shunt recognition algorithm to obtain anchor point data corresponding to the original image information; a labeling judgment module, configured to judge whether the data labeling information meets the preset anchor point requirement; and a result output module, configured to output the pre-verification success signal to the user terminal if the preset anchor point requirement is met. By obtaining the anchor point data and determining whether the user's data annotation information meets the anchor point requirement, the accuracy of the data annotation information submitted by the user can be improved, thereby reducing the workload of the data processing platform and improving its work efficiency.

In some optional implementations of the second embodiment of the present application, as shown in FIG. 10, the anchor point acquisition module 102 includes: a line image acquisition sub-module 1021, a text box acquisition sub-module 1022, a column image acquisition sub-module 1023, a text box group acquisition sub-module 1024, and an anchor point determination sub-module 1025, wherein:

The line image acquisition sub-module 1021 is configured to perform a line cutting operation on the original image information to obtain line image information;

The text box acquisition sub-module 1022 is used to perform a recognition operation on the line image information to obtain a text box;

The column image acquisition submodule 1023 is configured to perform a column cutting operation on the original image information according to the text box, and obtain multiple columns of column image information carrying the text box;

The text box group acquisition sub-module 1024 is configured to select one text box from the column image information of each of every two adjacent columns to form a text box group;

The anchor point determination sub-module 1025 is configured to use the center point of the text box in the text box group as the anchor point data.

In some optional implementation manners of the second embodiment of the present application, the aforementioned text box group obtaining submodule 1024 includes:

The text box group determining sub-module is used to take, in the column image information of the two adjacent columns, the two text boxes that are farthest apart in the longitudinal axis direction as the text box group.
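The selection of a text box group and the derivation of anchor points from it might be sketched as follows. The box layout (x_min, y_min, x_max, y_max), the use of box centers to measure the longitudinal distance, and the function names are assumptions made for illustration.

```python
from itertools import product
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max), assumed layout
Point = Tuple[float, float]


def center(box: Box) -> Point:
    """Center point of a text box."""
    x0, y0, x1, y1 = box
    return ((x0 + x1) / 2, (y0 + y1) / 2)


def anchor_pair(left_col: List[Box], right_col: List[Box]) -> Tuple[Point, Point]:
    """Pick one box per adjacent column so that the two centers are
    farthest apart along the vertical axis, then return both centers."""
    left, right = max(
        product(left_col, right_col),
        key=lambda pair: abs(center(pair[0])[1] - center(pair[1])[1]),
    )
    return center(left), center(right)


left_column = [(0, 0, 2, 2), (0, 8, 2, 10)]
right_column = [(4, 0, 6, 2), (4, 4, 6, 6)]
print(anchor_pair(left_column, right_column))  # ((1.0, 9.0), (5.0, 1.0))
```

Enumerating all pairs is quadratic in the number of boxes per column, which is usually acceptable for the handful of text boxes found in one column.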

In some optional implementation manners of the second embodiment of the present application, the column image acquisition submodule 1023 includes:

The column cutting line determination sub-module is used to cut along the longitudinal straight line that passes through the fewest text boxes, taking that line as the column cutting line.
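One way to pick such a column cutting line is to count, for each candidate vertical line, how many text boxes it passes through and keep the candidate with the smallest count. The candidate positions and the box layout (x_min, y_min, x_max, y_max) are assumptions for illustration; the application does not specify how candidates are generated.

```python
from typing import Iterable, List, Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max), assumed layout


def boxes_crossed(x: float, boxes: List[Box]) -> int:
    """Number of text boxes that the vertical line at position x passes through."""
    return sum(1 for (x0, _, x1, _) in boxes if x0 < x < x1)


def column_cut(boxes: List[Box], candidates: Iterable[float]) -> float:
    """Return the candidate x whose vertical line crosses the fewest boxes
    (the first such candidate wins when several are tied)."""
    return min(candidates, key=lambda x: boxes_crossed(x, boxes))


boxes = [(0, 0, 4, 2), (0, 3, 5, 5), (6, 0, 10, 2), (6, 3, 10, 5), (3, 6, 8, 8)]
print(column_cut(boxes, [2, 4.5, 5.5, 8]))  # 5.5
```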

In some optional implementation manners of the second embodiment of the present application, the above-mentioned labeling judgment module 103 includes:

The first equation obtaining sub-module is used to obtain the first equation and the second equation corresponding to the data annotation information and anchor point data;

The first equation judging sub-module is used to judge whether the line corresponding to the second equation has an intersection point with the first equation;

The first signal output sub-module is configured to output the pre-check success signal if the line corresponding to the second equation has an intersection point with the first equation;

The second signal output sub-module is configured to output a pre-check failure signal if the line corresponding to the second equation has no intersection point with the first equation.
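As a sketch of this judgment, the first equation can be taken as the straight line through the two labeled points and the second equation as the straight line through the two anchor points, both written as a*x + b*y = c in the same coordinate system. Reading "an intersection point on the line corresponding to the second equation" as "the intersection lies on the segment between the two anchors" is an assumption, as are all function names.

```python
from typing import Optional, Tuple

Point = Tuple[float, float]
Line = Tuple[float, float, float]  # coefficients (a, b, c) of a*x + b*y = c


def line_through(p: Point, q: Point) -> Line:
    """Equation of the straight line through p and q."""
    a = q[1] - p[1]
    b = p[0] - q[0]
    return (a, b, a * p[0] + b * p[1])


def intersection(l1: Line, l2: Line) -> Optional[Point]:
    """Intersection by Cramer's rule; None if the lines are parallel or coincident."""
    a1, b1, c1 = l1
    a2, b2, c2 = l2
    det = a1 * b2 - a2 * b1
    if det == 0:
        return None
    return ((c1 * b2 - c2 * b1) / det, (a1 * c2 - a2 * c1) / det)


def on_segment(pt: Point, p: Point, q: Point, eps: float = 1e-9) -> bool:
    """Bounding-box test; valid when pt is already known to lie on line pq."""
    return (min(p[0], q[0]) - eps <= pt[0] <= max(p[0], q[0]) + eps
            and min(p[1], q[1]) - eps <= pt[1] <= max(p[1], q[1]) + eps)


label = ((2, 6), (5, 5))    # data annotation information
anchors = ((2, 5), (5, 7))  # anchor point data
first = line_through(*label)
second = line_through(*anchors)
hit = intersection(first, second)
passed = hit is not None and on_segment(hit, *anchors)
print(hit, passed)
```

For the values above, the two lines are x + 3y = 20 and 2x - 3y = -11, which meet at x = 3, y = 17/3, a point between the two anchors.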

In some optional implementation manners of the second embodiment of the present application, the above-mentioned first signal output submodule specifically includes:

The second equation judging sub-module is used for judging whether the first equation and the second equation are parallel or coincide with each other if the judgment result is yes;

The third signal output sub-module is configured to output the pre-check success signal if the first equation and the second equation are not parallel or coincide with each other;

The fourth signal output sub-module is configured to output the pre-check failure signal if the first equation and the second equation are parallel or coincide with each other.

In some optional implementation manners of the second embodiment of the present application, the third signal output submodule specifically includes:

The second equation obtaining sub-module is used to obtain the third equation and the fourth equation corresponding to the data annotation information and the invalid anchor point data;

The third equation judging sub-module is used to judge whether the line corresponding to the fourth equation has an intersection point with the third equation;

The fifth signal output sub-module is configured to output the pre-check failure signal if the line corresponding to the fourth equation has an intersection point with the third equation;

The sixth signal output sub-module is configured to output the pre-check success signal if the line corresponding to the fourth equation has no intersection point with the third equation.
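The three judgments performed by the labeling judgment module 103 and its sub-modules can be chained as in the following sketch. It assumes that the annotation, the anchor point data, and the invalid anchor point data are each a pair of points treated as a closed line segment; the function names and the Boolean return value standing in for the success and failure signals are hypothetical.

```python
from typing import Optional, Tuple

Point = Tuple[float, float]
Segment = Tuple[Point, Point]


def _orient(p: Point, q: Point, r: Point) -> float:
    """Signed area test: 0 means p, q, r are collinear."""
    return (q[0] - p[0]) * (r[1] - p[1]) - (q[1] - p[1]) * (r[0] - p[0])


def _within(p: Point, q: Point, r: Point) -> bool:
    """r lies in the bounding box of pq (used only when p, q, r are collinear)."""
    return (min(p[0], q[0]) <= r[0] <= max(p[0], q[0])
            and min(p[1], q[1]) <= r[1] <= max(p[1], q[1]))


def intersects(s: Segment, t: Segment) -> bool:
    """True if the two closed segments share at least one point."""
    a, b = s
    c, d = t
    o1, o2 = _orient(a, b, c), _orient(a, b, d)
    o3, o4 = _orient(c, d, a), _orient(c, d, b)
    if o1 * o2 < 0 and o3 * o4 < 0:
        return True
    return ((o1 == 0 and _within(a, b, c)) or (o2 == 0 and _within(a, b, d))
            or (o3 == 0 and _within(c, d, a)) or (o4 == 0 and _within(c, d, b)))


def parallel_or_coincident(s: Segment, t: Segment) -> bool:
    """True if the direction vectors of the two segments are linearly dependent."""
    (x1, y1), (x2, y2) = s
    (x3, y3), (x4, y4) = t
    return (x2 - x1) * (y4 - y3) - (y2 - y1) * (x4 - x3) == 0


def pre_check(label: Segment, anchors: Segment,
              invalid_anchors: Optional[Segment] = None) -> bool:
    """Return True for a pre-check success signal, False for a failure signal."""
    if not intersects(label, anchors):
        return False  # no intersection with the anchor line
    if parallel_or_coincident(label, anchors):
        return False  # overlapping along the anchor line is rejected
    if invalid_anchors is not None and intersects(label, invalid_anchors):
        return False  # the annotation crosses the invalid anchor line
    return True


label = ((2, 6), (5, 5))
anchors = ((2, 5), (5, 7))
print(pre_check(label, anchors))                    # True
print(pre_check(anchors, anchors))                  # False
print(pre_check(label, anchors, ((4, 4), (4, 8))))  # False
print(pre_check(label, anchors, ((6, 0), (6, 9))))  # True
```

Exact comparison with zero is adequate for integer pixel coordinates; with floating-point coordinates a small tolerance would be a safer choice.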

Example three

In order to solve the above technical problems, the embodiments of the present application also provide computer equipment. Please refer to FIG. 11 for details. FIG. 11 is a block diagram of the basic structure of the computer device in this embodiment.

The computer device 11 includes a memory 111, a processor 112, and a network interface 113 that are communicatively connected to each other through a system bus. It should be pointed out that the figure only shows the computer device 11 with components 111-113, but it should be understood that it is not required to implement all of the illustrated components, and more or fewer components may be implemented instead. Those skilled in the art can understand that the computer device here is a device that can automatically perform numerical calculation and/or information processing in accordance with preset or stored instructions. Its hardware includes, but is not limited to, a microprocessor, an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA), a digital signal processor (DSP), an embedded device, etc.

The computer device may be a computing device such as a desktop computer, a notebook, a palmtop computer, and a cloud server. The computer device can interact with the user through a keyboard, a mouse, a remote control, a touch panel, or a voice control device.

The memory 111 includes at least one type of readable storage medium, and the readable storage medium includes flash memory, hard disk, multimedia card, card-type memory (for example, SD or DX memory, etc.), random access memory (RAM), static random access memory (SRAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), programmable read-only memory (PROM), magnetic memory, magnetic disk, optical disk, etc. The computer-readable storage medium can be non-volatile or volatile. In some embodiments, the memory 111 may be an internal storage unit of the computer device 11, such as a hard disk or a memory of the computer device 11. In other embodiments, the memory 111 may also be an external storage device of the computer device 11, such as a plug-in hard disk, a smart media card (SMC), a Secure Digital (SD) card, or a flash card equipped on the computer device 11. Of course, the memory 111 may also include both an internal storage unit of the computer device 11 and an external storage device thereof. In this embodiment, the memory 111 is generally used to store an operating system and various application software installed in the computer device 11, such as computer-readable instructions for a pre-verification method for image labeling tasks. In addition, the memory 111 can also be used to temporarily store various types of data that have been output or will be output.

In some embodiments, the processor 112 may be a central processing unit (CPU), a controller, a microcontroller, a microprocessor, or another data processing chip. The processor 112 is generally used to control the overall operation of the computer device 11. In this embodiment, the processor 112 is configured to run the computer-readable instructions stored in the memory 111 or to process data, for example, to run the computer-readable instructions of the pre-verification method for the image labeling task.

The network interface 113 may include a wireless network interface or a wired network interface. The network interface 113 is generally used to establish a communication connection between the computer device 11 and other electronic devices.

This application also provides another implementation, namely a computer-readable storage medium that stores a pre-verification program for an image labeling task, where the pre-verification program can be executed by at least one processor, so that the at least one processor executes the steps of the pre-verification method for the image labeling task as described above.

Through the description of the above implementations, those skilled in the art can clearly understand that the methods of the above embodiments can be implemented by means of software plus a necessary general-purpose hardware platform; they can of course also be implemented by hardware, but in many cases the former is the better implementation. Based on this understanding, the technical solution of this application, in essence or in the part that contributes to the existing technology, can be embodied in the form of a software product. The computer software product is stored in a storage medium (such as ROM/RAM, a magnetic disk, or an optical disc) and includes several instructions to make a terminal device (which can be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) execute the methods described in the various embodiments of the present application.

Obviously, the above-described embodiments are only a part of the embodiments of the present application, rather than all of the embodiments. The drawings show preferred embodiments of the present application, but do not limit the patent scope of the present application. The present application can be implemented in many different forms; these embodiments are provided so that the disclosure of the present application can be understood more thoroughly and comprehensively. Although this application has been described in detail with reference to the foregoing embodiments, those skilled in the art can still modify the technical solutions described in each of the foregoing specific embodiments, or equivalently replace some of the technical features. All equivalent structures made using the contents of the description and drawings of this application, directly or indirectly used in other related technical fields, are similarly within the scope of patent protection of this application.

Claims

1. A pre-verification method for image labeling tasks, wherein the method includes the following steps:

Receiving a pre-verification request sent by a user terminal, where the pre-verification request at least carries original image information and data annotation information;

Performing a shunt recognition operation on the original image information based on a shunt recognition algorithm to obtain anchor point data corresponding to the original image information;

Judging whether the data labeling information meets the preset anchor point requirements;

If the preset anchor requirement is met, output a pre-check success signal to the user terminal.
2. The pre-verification method for image labeling tasks according to claim 1, wherein the step of performing a shunt recognition operation on the original image information based on a shunt recognition algorithm to obtain anchor point data corresponding to the original image information specifically includes the following steps:

Performing a line cutting operation on the original image information to obtain line image information;

Performing a recognition operation on the line image information to obtain a text box;

Performing a column cutting operation on the original image information according to the text box to obtain multiple columns of column image information carrying the text box;

Select one text box from the column image information of each of every two adjacent columns to form a text box group;

Use the center point of the text box in the text box group as the anchor point data.
3. The pre-verification method for image labeling tasks according to claim 2, wherein the step of selecting one text box from the column image information of each of every two adjacent columns to form a text box group specifically includes the following steps:

In the column image information of the two adjacent columns, two text boxes that are farthest apart in the vertical axis direction are used as the text box group.
4. The pre-verification method for image labeling tasks according to claim 2, wherein the step of performing a column cutting operation on the original image information according to the text box to obtain multiple columns of column image information carrying the text box specifically includes the following steps:

Cut along the vertical straight line that passes through the fewest text boxes, taking that line as the column cutting line.
5. The pre-verification method for image labeling tasks according to claim 1, wherein the step of judging whether the data labeling information meets a preset anchor point requirement specifically includes the following steps:

Acquiring the first equation and the second equation corresponding to the data annotation information and anchor point data in the same coordinate system;

Judging whether the line corresponding to the second equation has an intersection point with the first equation;

If the line corresponding to the second equation has an intersection point with the first equation, output the pre-check success signal;

If the line corresponding to the second equation has no intersection point with the first equation, output a pre-check failure signal.
6. The pre-verification method for image labeling tasks according to claim 5, wherein, if the judgment result is yes, the step of outputting the pre-verification success signal specifically includes the following steps:

If the judgment result is yes, judge whether the lines corresponding to the first equation and the second equation are parallel or coincide with each other;

If the lines corresponding to the first equation and the second equation are not parallel or coincident with each other, output the pre-check success signal;

If the lines corresponding to the first equation and the second equation are parallel or coincide with each other, the pre-check failure signal is output.
7. The pre-verification method for image labeling tasks according to claim 6, wherein the image labeling task also carries invalid anchor point data, and, if the judgment result is no, the step of outputting the pre-verification success signal specifically includes the following steps:

Obtaining the third equation and the fourth equation corresponding to the data labeling information and the invalid anchor point data in the same coordinate system;

Judging whether the line corresponding to the fourth equation has an intersection point with the third equation;

If the line corresponding to the fourth equation has an intersection point with the third equation, output the pre-check failure signal;

If the line corresponding to the fourth equation has no intersection point with the third equation, output the pre-check success signal.
8. A pre-verification device for image labeling tasks, wherein the device includes:

The request receiving module is configured to receive a pre-verification request sent by the user terminal, where the pre-verification request at least carries original image information and data annotation information;

An anchor point acquisition module, configured to perform a shunt recognition operation on the original image information based on a shunt recognition algorithm, and obtain anchor point data corresponding to the original image information;

The labeling judgment module is used to judge whether the data labeling information meets the preset anchor point requirements;

The result output module is configured to output a pre-check success signal to the user terminal if the preset anchor point requirement is met.
9. A computer device including a memory, a processor, and computer-readable instructions stored in the memory and capable of running on the processor, wherein the processor, when executing the computer-readable instructions, implements the following steps of the pre-verification method for the image labeling task:

Receiving a pre-verification request sent by a user terminal, where the pre-verification request at least carries original image information and data annotation information;

Performing a shunt recognition operation on the original image information based on a shunt recognition algorithm to obtain anchor point data corresponding to the original image information;

Judging whether the data labeling information meets the preset anchor point requirements;

If the preset anchor requirement is met, output a pre-check success signal to the user terminal.
10. The computer device according to claim 9, wherein the step of performing a shunt recognition operation on the original image information based on a shunt recognition algorithm to obtain anchor point data corresponding to the original image information specifically includes the following steps:

Performing a line cutting operation on the original image information to obtain line image information;

Performing a recognition operation on the line image information to obtain a text box;

Performing a column cutting operation on the original image information according to the text box to obtain multiple columns of column image information carrying the text box;

Select one text box from the column image information of each of every two adjacent columns to form a text box group;

Use the center point of the text box in the text box group as the anchor point data.
11. The computer device according to claim 10, wherein the step of selecting one text box from the column image information of each of every two adjacent columns to form a text box group specifically includes the following steps:

In the column image information of the two adjacent columns, two text boxes that are farthest apart in the vertical axis direction are used as the text box group.
12. The computer device according to claim 10, wherein the step of performing a column cutting operation on the original image information according to the text box to obtain multiple columns of column image information carrying the text box specifically includes the following steps:

Cut along the vertical straight line that passes through the fewest text boxes, taking that line as the column cutting line.
13. The computer device according to claim 9, wherein the step of judging whether the data labeling information meets a preset anchor point requirement specifically includes the following steps:

Acquiring the first equation and the second equation corresponding to the data annotation information and anchor point data in the same coordinate system;

Judging whether the line corresponding to the second equation has an intersection point with the first equation;

If the line corresponding to the second equation has an intersection point with the first equation, output the pre-check success signal;

If the line corresponding to the second equation has no intersection point with the first equation, output a pre-check failure signal.
14. The computer device according to claim 13, wherein, if the judgment result is yes, the step of outputting the pre-check success signal specifically includes the following steps:

If the judgment result is yes, judge whether the lines corresponding to the first equation and the second equation are parallel or coincide with each other;

If the lines corresponding to the first equation and the second equation are not parallel or coincident with each other, output the pre-check success signal;

If the lines corresponding to the first equation and the second equation are parallel or coincide with each other, the pre-check failure signal is output.
15. A computer-readable storage medium storing computer-readable instructions, wherein the computer-readable instructions, when executed by a processor, cause the processor to execute the following steps of the pre-verification method for the image labeling task:

Receiving a pre-verification request sent by a user terminal, where the pre-verification request at least carries original image information and data annotation information;

Performing a shunt recognition operation on the original image information based on a shunt recognition algorithm to obtain anchor point data corresponding to the original image information;

Judging whether the data labeling information meets the preset anchor point requirements;

If the preset anchor requirement is met, output a pre-check success signal to the user terminal.
16. The computer-readable storage medium according to claim 15, wherein the step of performing a shunt recognition operation on the original image information based on a shunt recognition algorithm to obtain anchor point data corresponding to the original image information specifically includes the following steps:

Performing a line cutting operation on the original image information to obtain line image information;

Performing a recognition operation on the line image information to obtain a text box;

Performing a column cutting operation on the original image information according to the text box to obtain multiple columns of column image information carrying the text box;

Select one text box from the column image information of each of every two adjacent columns to form a text box group;

Use the center point of the text box in the text box group as the anchor point data.
17. The computer-readable storage medium according to claim 16, wherein the step of selecting one text box from the column image information of each of every two adjacent columns to form a text box group specifically includes the following steps:

In the column image information of the two adjacent columns, two text boxes that are farthest apart in the vertical axis direction are used as the text box group.
18. The computer-readable storage medium according to claim 16, wherein the step of performing a column cutting operation on the original image information according to the text box to obtain multiple columns of column image information carrying the text box specifically includes the following steps:

Cut along the vertical straight line that passes through the fewest text boxes, taking that line as the column cutting line.
19. The computer-readable storage medium according to claim 15, wherein the step of judging whether the data labeling information meets a preset anchor point requirement specifically includes the following steps:

Acquiring the first equation and the second equation corresponding to the data annotation information and anchor point data in the same coordinate system;

Judging whether the line corresponding to the second equation has an intersection point with the first equation;

If the line corresponding to the second equation has an intersection point with the first equation, output the pre-check success signal;

If the line corresponding to the second equation has no intersection point with the first equation, output a pre-check failure signal.
20. The computer-readable storage medium according to claim 19, wherein, if the judgment result is yes, the step of outputting the pre-check success signal specifically includes the following steps:

If the judgment result is yes, judge whether the lines corresponding to the first equation and the second equation are parallel or coincide with each other;

If the lines corresponding to the first equation and the second equation are not parallel or coincident with each other, output the pre-check success signal;

If the lines corresponding to the first equation and the second equation are parallel or coincide with each other, the pre-check failure signal is output.