WO2021164251A1 - Image annotation task pre-verification method and apparatus, device, and storage medium - Google Patents

Image annotation task pre-verification method and apparatus, device, and storage medium

Info

Publication number
WO2021164251A1
Authority
WO
WIPO (PCT)
Prior art keywords
image information
equation
text box
original image
anchor point
Prior art date
Application number
PCT/CN2020/117379
Other languages
French (fr)
Chinese (zh)
Inventor
王健宗
李佳琳
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司
Publication of WO2021164251A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/22 Matching criteria, e.g. proximity measures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/24 Classification techniques
    • G06F18/241 Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches

Definitions

  • This application relates to the field of artificial intelligence technology, and in particular to a pre-check method, device, computer equipment, and storage medium for image labeling tasks.
  • the purpose of the embodiments of the present application is to propose a pre-verification method for image labeling tasks, which aims to solve the problems of generally low accuracy and low work efficiency of existing image labeling task verification methods.
  • an embodiment of the present application provides a pre-verification method for image labeling tasks, which adopts the following technical solutions:
  • an embodiment of the present application also provides a pre-verification device for image labeling tasks, which adopts the following technical solutions:
  • The request receiving module is configured to receive a pre-verification request sent by the user terminal, where the pre-verification request carries at least original image information and data annotation information;
  • The anchor point acquisition module is configured to perform a shunt recognition operation on the original image information based on a shunt recognition algorithm, and obtain anchor point data corresponding to the original image information;
  • The labeling judgment module is configured to judge whether the data labeling information meets the preset anchor point requirement; and
  • The result output module is configured to output a pre-check success signal to the user terminal if the preset anchor point requirement is met.
  • the embodiments of the present application also provide a computer device, which adopts the following technical solutions:
  • The computer device includes a memory and a processor. The memory stores a computer process, and when the computer process is executed by the processor, the steps of the pre-verification method for the image labeling task described below are realized:
  • the embodiments of the present application also provide a computer-readable storage medium, which adopts the following technical solutions:
  • the computer-readable storage medium stores a computer process, and when the computer process is executed by a processor, the steps of the pre-verification method for the image labeling task described below are realized:
  • This application provides a pre-verification method for image labeling tasks. The method receives a pre-verification request sent by a user terminal, where the pre-verification request carries at least original image information and data labeling information; performs a shunt recognition operation on the original image information based on a shunt recognition algorithm to obtain anchor point data corresponding to the original image information; judges whether the data labeling information meets the preset anchor point requirement; and, if the preset anchor point requirement is met, outputs a pre-check success signal to the user terminal.
  • By obtaining the anchor point data and judging whether the user's data labeling information meets the anchor point requirement, the accuracy of the data labeling information submitted by the user can be improved, which reduces the workload of the data processing platform and improves its work efficiency.
  • FIG. 1 is an implementation flowchart of a pre-verification method for an image labeling task provided in Embodiment 1 of the present application;
  • FIG. 2 is a schematic diagram of data labeling information provided in Embodiment 1 of the present application.
  • FIG. 3 is a schematic diagram of another data labeling information provided in Embodiment 1 of the present application.
  • FIG. 4 is a schematic diagram of anchor point requirements provided in Embodiment 1 of the present application.
  • Fig. 5 is a flowchart of the realization of step S102 in Fig. 1;
  • Fig. 6 is a flowchart of the realization of step S103 in Fig. 1;
  • FIG. 7 is a flowchart of the realization of step S303 in FIG. 6;
  • FIG. 8 is a flowchart of the realization of step S402 in FIG. 7;
  • FIG. 9 is a schematic structural diagram of a pre-verification device for image labeling tasks provided in Embodiment 2 of the present application.
  • FIG. 10 is a schematic structural diagram of an anchor point acquisition module provided in Embodiment 2 of the present application.
  • Fig. 11 is a schematic structural diagram of an embodiment of a computer device according to the present application.
  • The pre-verification method for image labeling tasks provided by the embodiments of the present application obtains anchor point data and determines whether the user's data labeling information meets the anchor point requirement, thereby improving the accuracy of the data labeling information submitted by the user, reducing the workload of the data processing platform, and improving the efficiency of the data processing platform.
  • Fig. 1 shows a flow chart of the implementation of the pre-verification method of the image labeling task provided in the first embodiment of the present application. For ease of description, only the parts related to the present application are shown.
  • In step S101, a pre-verification request sent by a user terminal is received, where the pre-verification request carries at least original image information and data annotation information.
  • the user terminal may be, for example, a mobile phone, a smart phone, a notebook computer, a digital broadcast receiver, a PDA (personal digital assistant), a PAD (tablet computer), a PMP (portable multimedia player), a navigation device, etc. It should be understood that the examples of user terminals here are only for ease of understanding, and are not used to limit this application.
  • The original image information refers to image information in which the user needs to mark designated positions, such as the position of designated text, by drawing lines in the picture. The user is generally restricted to drawing only straight lines or shapes composed of line segments.
  • The data labeling information refers to the endpoint coordinates of each line segment of the aforementioned straight lines or shapes.
  • As shown in FIG. 2, the data labeling information is [(2,6),(5,5)]; as shown in FIG. 3, the data labeling information is [(2,6),(5,5)], [(5,5),(7,7)], [(7,7),(4,8)] and [(4,8),(2,6)]. It should be understood that the examples of data labeling information are only for ease of understanding, and are not used to limit this application.
  • In step S102, a shunt recognition operation is performed on the original image information based on a shunt recognition algorithm, and anchor point data corresponding to the original image information is obtained.
  • The shunt recognition algorithm refers to performing row cutting, full text recognition, and column cutting operations on the original image information, so that the original image is cut in units of the characters in the original image information, and the recognized output text is used as anchor point data.
  • The anchor point refers to a mark set on the original image information. It is used to set a threshold range for a standard data annotation, so as to determine whether the data annotation information submitted by the user falls within that threshold range.
  • In step S103, it is determined whether the data labeling information meets a preset anchor point requirement.
  • The anchor point requirement is used to detect whether the data annotation information submitted by the user passes through a set of anchor points.
  • As shown in FIG. 4, the anchor point requirements are (2,5) and (5,7). The data labeling information [(2,6),(5,5)] passes between the anchor points (2,5) and (5,7); that is, the data labeling information [(2,6),(5,5)] meets the anchor point requirement.
  • the examples of anchor point requirements here are only for ease of understanding and are not used to limit the application.
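  • The example above can be checked by hand. The following is a worked sketch only, assuming that "passes between the anchor points" is tested as an intersection between the segment joining the anchors (2,5) and (5,7) and the annotation segment [(2,6),(5,5)]; the slope-intercept parametrization is an illustrative choice and is not taken from the application.

```latex
\begin{aligned}
\text{annotation segment:}\quad & y = 6 - \tfrac{1}{3}(x - 2), && 2 \le x \le 5 \\
\text{anchor segment:}\quad     & y = 5 + \tfrac{2}{3}(x - 2), && 2 \le x \le 5 \\
6 - \tfrac{1}{3}(x^* - 2) = 5 + \tfrac{2}{3}(x^* - 2)
  \;&\Rightarrow\; x^* = 3,\quad y^* = \tfrac{17}{3}
\end{aligned}
```

  • Because $x^* = 3$ lies within $[2, 5]$ for both segments, the two segments intersect under this assumption, which is consistent with the example's conclusion.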
  • In step S104, if the preset anchor point requirement is met, a pre-check success signal is output to the user.
  • The pre-check signal output to the user may be displayed on the display device of the user terminal, or may be delivered in the form of a short message; this does not limit the present application.
  • The present application provides a pre-verification method for image labeling tasks, which receives a pre-verification request sent by a user terminal, where the pre-verification request carries at least original image information and data labeling information; performs a shunt recognition operation on the original image information based on the shunt recognition algorithm to obtain anchor point data corresponding to the original image information; judges whether the data annotation information meets the preset anchor point requirement; and, if the preset anchor point requirement is met, outputs a pre-check success signal to the user terminal.
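  • The overall flow of steps S101 to S104 can be sketched as follows. This is a minimal illustration, not the application's implementation: the names `pre_verify`, `find_anchors`, `meets_requirement` and the signal strings are hypothetical, and requiring every anchor pair to be satisfied is an assumption.

```python
from typing import Callable, List, Tuple

Point = Tuple[float, float]
Segment = Tuple[Point, Point]


def pre_verify(
    image: object,
    annotation: List[Segment],
    find_anchors: Callable[[object], List[Segment]],
    meets_requirement: Callable[[List[Segment], Segment], bool],
) -> str:
    """Hypothetical driver for steps S101-S104.

    `image` and `annotation` stand for the original image information and
    the data labeling information carried by the request (S101).
    """
    # S102: obtain anchor point data (here: pairs of anchor points).
    anchor_pairs = find_anchors(image)
    # S103: assumption: the annotation must satisfy every anchor pair.
    ok = all(meets_requirement(annotation, pair) for pair in anchor_pairs)
    # S104: report the outcome to the caller.
    return "PRE_CHECK_SUCCESS" if ok else "PRE_CHECK_FAILURE"
```

  • Passing the anchor finder and the requirement check in as callables keeps the sketch independent of any particular recognition algorithm. Note that with no anchor pairs the sketch reports success, which may or may not be the desired behaviour.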
  • FIG. 5 shows a flowchart of the implementation of step S102 in FIG. 1. For ease of description, only the parts related to the present application are shown.
  • step S102 specifically includes: step S201, step S202, step S203, step S204, and step S205.
  • In step S201, a line cutting operation is performed on the original image information to obtain line image information.
  • the row cutting operation refers to segmenting the original image in the form of rows, where the height of the row is not specifically limited, and the user can adjust it according to the actual situation.
  • the line image information refers to the image information after the original image is divided in the form of lines.
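  • As a minimal sketch of the row cutting operation, assume the image is held as a list of pixel rows and the row height is a caller-chosen parameter; both the representation and the function name `cut_rows` are hypothetical.

```python
from typing import List

Image = List[List[int]]  # assumed layout: a list of pixel rows


def cut_rows(image: Image, row_height: int) -> List[Image]:
    """Split the image into horizontal bands of at most `row_height` rows."""
    if row_height <= 0:
        raise ValueError("row_height must be positive")
    # The last band may be shorter when the height is not a multiple
    # of row_height.
    return [image[top:top + row_height]
            for top in range(0, len(image), row_height)]
```

  • Each returned band corresponds to one piece of line image information in the sense described above.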
  • In step S202, a recognition operation is performed on the line image information to obtain a text box.
  • the recognition operation refers to recognizing the text content in the above-mentioned line image information, and when the text content is recognized, the text content position is identified in the form of a text box.
  • In step S203, a column cutting operation is performed on the original image information according to the text box to obtain multiple columns of column image information carrying the text box.
  • The column cutting operation refers to segmenting the original image in the form of columns. Since the recognized text boxes have already been obtained in step S202, the text boxes in the original image information are used as the reference for segmentation when the column cutting operation is performed, so that the text content in the original image information is segmented.
  • In step S204, one text box is selected from the column image information of every two adjacent columns to form a text box group.
  • When the column image information contains only one text box, that text box is selected to form the text box group; when the column image information contains multiple text boxes, the user can select one according to the actual situation.
  • In step S205, the center point of each text box in the text box group is used as the anchor point data.
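  • Step S205 can be illustrated with a short sketch. The box layout `(x_min, y_min, x_max, y_max)` and the function names are assumptions made for illustration only.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # assumed: (x_min, y_min, x_max, y_max)
Point = Tuple[float, float]


def box_center(box: Box) -> Point:
    """Return the geometric center of an axis-aligned text box."""
    x_min, y_min, x_max, y_max = box
    return ((x_min + x_max) / 2.0, (y_min + y_max) / 2.0)


def anchors_from_group(group: List[Box]) -> List[Point]:
    """Use the center of each text box in the group as an anchor point."""
    return [box_center(box) for box in group]
```

  • For example, the boxes `(1, 4, 3, 6)` and `(4, 6, 6, 8)` yield the anchor points (2,5) and (5,7).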
  • The line image information is first obtained by cutting the original image information, which facilitates recognition of the text content in the image information; the text boxes are then obtained by recognizing the line image information, which determines the specific location of the text content in the original image information; finally, the anchor point data is determined based on the text boxes, which limits the effective range of the user's annotation data and effectively standardizes the user's annotation operation.
  • step S203 specifically includes the following steps:
  • Cutting is performed with the straight line that passes longitudinally through the fewest text boxes as the column cutting line, so that as many effective text boxes as possible remain after cutting, which helps ensure the accuracy of the anchor point setting.
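  • Choosing the column cutting line can be sketched as a minimization over candidate vertical lines. The candidate list, the box layout `(x_min, y_min, x_max, y_max)`, and the function names are hypothetical; the application does not state how candidate lines are generated.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # assumed: (x_min, y_min, x_max, y_max)


def boxes_crossed(x: float, boxes: List[Box]) -> int:
    """Count the text boxes that the vertical line at `x` passes through."""
    return sum(1 for (x_min, _, x_max, _) in boxes if x_min < x < x_max)


def best_cut_line(candidates: List[float], boxes: List[Box]) -> float:
    """Pick the candidate x that crosses the fewest text boxes.

    Ties are resolved in favour of the earliest candidate.
    """
    return min(candidates, key=lambda x: boxes_crossed(x, boxes))
```

  • A line that merely touches a box edge is not counted as passing through it in this sketch, which is a design choice rather than something the text specifies.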
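  • Step S204 specifically includes: in the column image information of the two adjacent columns, taking the two text boxes farthest apart in the longitudinal axis direction as the text box group.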
  • Fig. 6 is a flowchart of the realization of step S103 in Fig. 1. For ease of description, only the parts related to the present application are shown.
  • step S103 specifically includes: step S301, step S302, step S303, and step S304.
  • In step S301, the first equation and the second equation corresponding to the data annotation information and the anchor point data are acquired in the same coordinate system.
  • The algorithm finds a set of anchor points in the original image information, point A and point B.
  • The annotation data submitted by the user must contain a line passing between point A and point B; that is, the line segment AB must intersect some line segment in the user's data annotation.
  • In step S302, it is determined whether the line corresponding to the second equation has an intersection point with the line corresponding to the first equation.
  • If the solution $x^*$ satisfies the domain constraints $x_A \le x^* \le x_B$ and $x_N \le x^* \le x_M$, then the second equation has an intersection point with the first equation, indicating that the data annotation submitted by the user contains a line passing between point A and point B.
  • In step S303, if the line corresponding to the second equation has an intersection point with the line corresponding to the first equation, the pre-check success signal is output.
  • In step S304, if the line corresponding to the second equation has no intersection point with the line corresponding to the first equation, a pre-check failure signal is output.
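  • The intersection test of steps S301 to S304 can be sketched as below. The sketch assumes that the first equation is the line through the anchor points A and B and the second equation is the line through the annotation endpoints N and M; it writes each line in the general form a·x + b·y = c (an illustrative choice that also handles vertical segments) and checks the domain on both coordinates. All function names are hypothetical.

```python
from typing import Optional, Tuple

Point = Tuple[float, float]


def line_coeffs(p: Point, q: Point) -> Tuple[float, float, float]:
    """Coefficients (a, b, c) of the line a*x + b*y = c through p and q."""
    a = q[1] - p[1]
    b = p[0] - q[0]
    return a, b, a * p[0] + b * p[1]


def intersection(a_pt: Point, b_pt: Point, n_pt: Point, m_pt: Point,
                 eps: float = 1e-9) -> Optional[Point]:
    """Return the point where segment AB meets segment NM, or None."""
    a1, b1, c1 = line_coeffs(a_pt, b_pt)  # first equation (anchors A, B)
    a2, b2, c2 = line_coeffs(n_pt, m_pt)  # second equation (annotation N, M)
    det = a1 * b2 - a2 * b1
    if abs(det) < eps:
        return None  # parallel or coincident lines: no single solution
    # Cramer's rule for the 2x2 linear system.
    x = (c1 * b2 - c2 * b1) / det
    y = (a1 * c2 - a2 * c1) / det

    def within(v: float, u: float, w: float) -> bool:
        return min(u, w) - eps <= v <= max(u, w) + eps

    on_both = (within(x, a_pt[0], b_pt[0]) and within(y, a_pt[1], b_pt[1])
               and within(x, n_pt[0], m_pt[0]) and within(y, n_pt[1], m_pt[1]))
    return (x, y) if on_both else None
```

  • A non-`None` result corresponds to the pre-check success branch (S303) and `None` to the failure branch (S304).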
  • The judgment formula is formed from coordinates and equations, which can effectively standardize the answers submitted by users, reduce the probability of crowdsourcing users fraudulently earning points, enable the crowdsourcing platform to get more accurate answers in a shorter time, and improve the reliability of the answers to crowdsourced tasks such as image annotation.
  • Fig. 7 is a flow chart of the implementation of step S303 in Fig. 6. For ease of description, only the parts related to the present application are shown.
  • step S303 specifically includes: step S401, step S402, and step S403.
  • In step S401, if the judgment result is yes, it is judged whether the lines corresponding to the first equation and the second equation are parallel or coincident.
  • In step S402, if the lines corresponding to the first equation and the second equation are neither parallel nor coincident, the pre-check success signal is output.
  • In step S403, if the lines corresponding to the first equation and the second equation are parallel or coincident, the pre-check failure signal is output.
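  • The parallel-or-coincident judgment of steps S401 to S403 can be sketched with a cross product of the two direction vectors. This is one common way to perform the check and is not stated in the application; the function name, the return labels, and the tolerance `eps` are hypothetical, and both segments are assumed to have non-zero length.

```python
from typing import Tuple

Point = Tuple[float, float]


def classify(a: Point, b: Point, n: Point, m: Point, eps: float = 1e-9) -> str:
    """Classify line AB against line NM.

    Returns "intersecting", "parallel", or "coincident".
    """
    d1 = (b[0] - a[0], b[1] - a[1])
    d2 = (m[0] - n[0], m[1] - n[1])
    cross = d1[0] * d2[1] - d1[1] * d2[0]
    if abs(cross) > eps:
        return "intersecting"
    # Directions are parallel; the lines coincide if N also lies on line AB.
    offset = d1[0] * (n[1] - a[1]) - d1[1] * (n[0] - a[0])
    return "coincident" if abs(offset) <= eps else "parallel"
```

  • Under this sketch, only the "intersecting" result would lead to the pre-check success signal of step S402.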
  • Fig. 8 is a flow chart of the implementation of step S402 in Fig. 7. For ease of description, only the parts related to the present application are shown.
  • step S402 specifically includes: step S501, step S502, step S503, and step S504.
  • the original image information labeling task also carries invalid anchor point data.
  • In step S501, the third equation and the fourth equation corresponding to the data labeling information and the invalid anchor point data are acquired in the same coordinate system.
  • The invalid anchor point data refers to the area, preset by the original image information labeling task, that annotations must not pass through.
  • The file signature and file header of the original image information are only used to identify the file and have nothing to do with the original image information labeling task.
  • The position of the invalid anchor point needs to be preset in the original image information labeling task.
  • The fourth equation refers to the equation of the line through the endpoints of the data annotation information submitted by the user. Assume that the two endpoints of the data annotation information submitted by the user are $Q(x_Q, y_Q)$ and $P(x_P, y_P)$:
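  • The equation itself does not appear in this text. One form it could plausibly take, offered purely as an assumption, is the two-point form of the line through Q and P restricted to the span of the segment:

```latex
(y - y_Q)\,(x_P - x_Q) = (y_P - y_Q)\,(x - x_Q),
\qquad \min(x_Q, x_P) \le x \le \max(x_Q, x_P)
```

  • Writing the equation without dividing by $x_P - x_Q$ keeps it valid when the segment is vertical.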
  • In step S502, it is determined whether the line corresponding to the fourth equation has an intersection point with the line corresponding to the third equation.
  • In step S503, if the line corresponding to the fourth equation has an intersection point with the line corresponding to the third equation, the pre-check failure signal is output.
  • In step S504, if the line corresponding to the fourth equation has no intersection point with the line corresponding to the third equation, the pre-check success signal is output.
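  • Steps S501 to S504 can be sketched as follows. Two assumptions are made: the invalid anchor point data is represented as a list of forbidden line segments, and the intersection check uses the standard orientation test instead of solving the third and fourth equations explicitly. The function names and signal strings are hypothetical.

```python
from typing import List, Tuple

Point = Tuple[float, float]
Segment = Tuple[Point, Point]


def orient(p: Point, q: Point, r: Point) -> float:
    """Signed area test: > 0 if r is left of pq, < 0 if right, 0 if collinear."""
    return (q[0] - p[0]) * (r[1] - p[1]) - (q[1] - p[1]) * (r[0] - p[0])


def segments_cross(p1: Point, p2: Point, q1: Point, q2: Point) -> bool:
    """True if the two segments properly cross (touching cases are ignored)."""
    d1, d2 = orient(q1, q2, p1), orient(q1, q2, p2)
    d3, d4 = orient(p1, p2, q1), orient(p1, p2, q2)
    return d1 * d2 < 0 and d3 * d4 < 0


def invalid_anchor_check(annotation: List[Segment],
                         invalid_segments: List[Segment]) -> str:
    """Fail if any annotation segment crosses any forbidden segment."""
    for q, p in annotation:
        for s, t in invalid_segments:
            if segments_cross(q, p, s, t):
                return "PRE_CHECK_FAILURE"  # corresponds to S503
    return "PRE_CHECK_SUCCESS"              # corresponds to S504
```

  • Whether merely touching a forbidden segment should count as passing through it is not specified in the text; this sketch treats only proper crossings as failures.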
  • This application provides a pre-verification method for image labeling tasks, which receives a pre-verification request sent by a user terminal, where the pre-verification request carries at least original image information and data labeling information; performs a shunt recognition operation on the original image information based on the shunt recognition algorithm to obtain anchor point data corresponding to the original image information; judges whether the data annotation information meets the preset anchor point requirement; and, if the preset anchor point requirement is met, outputs a pre-check success signal to the user terminal.
  • The anchor point data is determined based on the text boxes, which limits the effective range of the user's labeling data and effectively standardizes the user's labeling operation. Through the logic of judging whether two line segments intersect, the judgment formula is formed from coordinates and equations, which can effectively standardize the answers submitted by users, reduce the probability of crowdsourcing users fraudulently earning points, enable the crowdsourcing platform to get more accurate answers in a shorter time, and improve the reliability of the answers for image annotation tasks.
  • the computer process can be stored in a computer readable storage medium. When executed, it may include the procedures of the above-mentioned method embodiments.
  • The aforementioned storage medium may be a non-volatile storage medium such as a magnetic disk, an optical disc, or a read-only memory (ROM), or it may be a random access memory (RAM), etc.
  • this application provides a device for pre-checking an image labeling task.
  • The device embodiment corresponds to the method embodiment shown in FIG. 1, and the device can be specifically applied to various electronic devices.
  • The image labeling task pre-verification apparatus 100 includes: a request receiving module 101, an anchor point obtaining module 102, a labeling judgment module 103, and a result output module 104, wherein:
  • the request receiving module 101 is configured to receive a pre-verification request sent by a user terminal, where the pre-verification request at least carries original image information and data annotation information;
  • the anchor point obtaining module 102 is configured to perform a shunt recognition operation on the original image information based on a shunt recognition algorithm, and obtain anchor point data corresponding to the original image information;
  • the labeling judgment module 103 is used to judge whether the data labeling information meets the preset anchor point requirement; and
  • the result output module 104 is configured to output a pre-check success signal to the user terminal if the preset anchor requirement is met.
  • The present application provides a pre-verification device for image labeling tasks, including: a request receiving module, configured to receive a pre-verification request sent by a user terminal, where the pre-verification request carries at least original image information and data annotation information; an anchor point acquisition module, configured to perform a shunt recognition operation on the original image information based on a shunt recognition algorithm to obtain anchor point data corresponding to the original image information; an annotation judgment module, configured to judge whether the data labeling information meets the preset anchor point requirement; and a result output module, configured to output the pre-verification success signal to the user terminal if the preset anchor point requirement is met.
  • The anchor point acquisition module 102 includes: a line image acquisition sub-module 1021, a text box acquisition sub-module 1022, a column image acquisition sub-module 1023, a text box group acquisition sub-module 1024, and an anchor point determination sub-module 1025, wherein:
  • the line image acquisition sub-module 1021 is configured to perform a line cutting operation on the original image information to obtain line image information;
  • the text box acquisition sub-module 1022 is configured to perform a recognition operation on the line image information to obtain a text box;
  • the column image acquisition sub-module 1023 is configured to perform a column cutting operation on the original image information according to the text box, and obtain multiple columns of column image information carrying the text box;
  • the text box group acquisition sub-module 1024 is configured to select one text box from the column image information of every two adjacent columns to form a text box group;
  • the anchor point determination sub-module 1025 is configured to use the center point of the text box in the text box group as the anchor point data.
  • the aforementioned text box group obtaining submodule 1024 includes:
  • the text box group determining sub-module is used for taking the two text boxes farthest apart in the longitudinal axis direction as the text box group in the column image information of the two adjacent columns.
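  • The selection rule of the text box group determining sub-module can be sketched as below, assuming that the longitudinal axis is the vertical (y) axis, that one box is taken from each of the two adjacent columns, and that boxes are given as `(x_min, y_min, x_max, y_max)`. The function name `pick_group` is hypothetical.

```python
from itertools import product
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # assumed: (x_min, y_min, x_max, y_max)


def pick_group(col_a: List[Box], col_b: List[Box]) -> Tuple[Box, Box]:
    """Pick one box per column so the pair is farthest apart vertically.

    Raises ValueError if either column contains no text box.
    """
    def center_y(box: Box) -> float:
        return (box[1] + box[3]) / 2.0

    # Compare every cross-column pair by the vertical distance of centers.
    return max(product(col_a, col_b),
               key=lambda pair: abs(center_y(pair[0]) - center_y(pair[1])))
```

  • Measuring the distance between box centers, and not between box edges, is a design choice of the sketch.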
  • the column image acquisition submodule 1023 includes:
  • the column cutting line determination sub-module is used to cut with the straight line that passes longitudinally through the fewest text boxes as the column cutting line.
  • the above-mentioned labeling judgment module 103 includes:
  • the first equation obtaining sub-module is used to obtain the first equation and the second equation corresponding to the data annotation information and anchor point data;
  • the first equation judging sub-module is used to judge whether the second equation has an intersection point with the first equation;
  • the first signal output sub-module is configured to output the pre-check success signal if the second equation has an intersection point with the first equation;
  • the second signal output sub-module is configured to output a pre-check failure signal if the second equation has no intersection point with the first equation.
  • the above-mentioned first signal output submodule specifically includes:
  • the second equation judging sub-module is used for judging, if the judgment result is yes, whether the lines corresponding to the first equation and the second equation are parallel or coincident;
  • the third signal output sub-module is configured to output the pre-check success signal if the lines corresponding to the first equation and the second equation are neither parallel nor coincident;
  • the fourth signal output sub-module is configured to output the pre-check failure signal if the lines corresponding to the first equation and the second equation are parallel or coincident.
  • the third signal output submodule specifically includes:
  • the second equation obtaining sub-module is used to obtain the third equation and the fourth equation corresponding to the data annotation information and invalid anchor point data;
  • the third equation judging sub-module is used to determine whether the fourth equation has an intersection point with the third equation;
  • the fifth signal output sub-module is configured to output the pre-check failure signal if the fourth equation has an intersection point with the third equation;
  • the sixth signal output sub-module is configured to output the pre-check success signal if the fourth equation has no intersection point with the third equation.
  • FIG. 11 is a block diagram of the basic structure of the computer device in this embodiment.
  • the computer device 11 includes a memory 111, a processor 112, and a network interface 113 that are connected to each other in communication through a system bus. It should be pointed out that the figure only shows the computer device 11 with components 111-113, but it should be understood that it is not required to implement all of the illustrated components, and more or fewer components may be implemented instead. Among them, those skilled in the art can understand that the computer device here is a device that can automatically perform numerical calculation and/or information processing in accordance with pre-set or stored instructions.
  • Its hardware includes, but is not limited to, a microprocessor, an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA), a digital signal processor (DSP), embedded equipment, etc.
  • the computer device may be a computing device such as a desktop computer, a notebook, a palmtop computer, and a cloud server.
  • the computer device can interact with the user through a keyboard, a mouse, a remote control, a touch panel, or a voice control device.
  • The memory 111 includes at least one type of readable storage medium, and the readable storage medium includes flash memory, hard disk, multimedia card, card-type memory (for example, SD or DX memory, etc.), random access memory (RAM), static random access memory (SRAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), programmable read-only memory (PROM), magnetic memory, magnetic disk, optical disk, etc. The computer-readable storage medium can be non-volatile or volatile.
  • the memory 111 may be an internal storage unit of the computer device 11, such as a hard disk or a memory of the computer device 11.
  • the memory 111 may also be an external storage device of the computer device 11, such as a plug-in hard disk, a smart media card (SMC), a Secure Digital (SD) card, or a flash card equipped on the computer device 11.
  • the memory 111 may also include both an internal storage unit of the computer device 11 and an external storage device thereof.
  • the memory 111 is generally used to store an operating system and various application software installed in the computer device 11, such as computer-readable instructions for a pre-verification method for image labeling tasks.
  • the memory 111 can also be used to temporarily store various types of data that have been output or will be output.
  • the processor 112 may be a central processing unit (Central Processing Unit, CPU), a controller, a microcontroller, a microprocessor, or other data processing chips in some embodiments.
  • the processor 112 is generally used to control the overall operation of the computer device 11.
  • the processor 112 is configured to run computer-readable instructions or process data stored in the memory 111, for example, to run the computer-readable instructions of the pre-verification method for the image labeling task.
  • the network interface 113 may include a wireless network interface or a wired network interface.
  • the network interface 113 is generally used to establish a communication connection between the computer device 11 and other electronic devices.
  • This application also provides another implementation, namely a computer-readable storage medium that stores a pre-verification process of an image labeling task. The pre-verification process can be executed by at least one processor, so that the at least one processor executes the steps of the pre-verification method for the image labeling task as described above.
  • the technical solution of this application, in essence or in the part that contributes to the existing technology, can be embodied in the form of a software product. The computer software product is stored in a storage medium (such as ROM/RAM, a magnetic disk, or an optical disc) and includes several instructions that cause a terminal device (which can be a mobile phone, a computer, a server, an air conditioner, a network device, etc.) to execute the methods described in the various embodiments of the present application.

Abstract

The embodiments of the present application belong to the field of artificial intelligence technology, and relate to an image annotation task pre-verification method. Said method comprises: receiving a pre-verification request sent by a user terminal, the pre-verification request at least carrying original image information and data annotation information; performing divided recognition operation on the original image information on the basis of a divided recognition algorithm, and acquiring anchor point data corresponding to the original image information; determining whether the data annotation information satisfies preset anchor point requirements; and if so, outputting a pre-verification success signal to the user terminal. The present application further provides an image annotation task pre-verification apparatus, a computer device and a storage medium. According to the present application, whether data annotation information of a user satisfies the anchor point requirements is determined by acquiring the anchor point data, so that the accuracy of data annotation information submitted by the user can be improved, thereby reducing the workload of a data processing platform, and improving the working efficiency of the data processing platform.

Description

Pre-verification method, apparatus, device, and storage medium for image labeling tasks
This application is based on, and claims priority to, the Chinese invention patent application No. 202010106306.0, filed on February 21, 2020 and titled "A pre-verification method, device, equipment and storage medium for image labeling tasks".
Technical field
This application relates to the field of artificial intelligence technology, and in particular to a pre-verification method, apparatus, computer device, and storage medium for image labeling tasks.
Background
In recent years, with the continuous development of data processing platforms, the types of image labeling tasks have become increasingly diverse. To meet the accuracy requirements of different types of image labeling tasks, targeted task flow and verification mechanisms are often required.
One existing verification method for image labeling tasks works as follows: after the data labeling information submitted by users is received, it is cut, and a verification system compares the cut labeling information and selects the data labels whose content is mutually consistent.
However, the inventors realized that traditional verification methods are generally not intelligent. When a user receives an image labeling task, the user can label it arbitrarily, so the data labeling information submitted by users is generally of low accuracy, which increases the workload of the data processing platform and in turn reduces its work efficiency.
Summary of the invention
The purpose of the embodiments of the present application is to propose a pre-verification method for image labeling tasks, which aims to solve the problems of generally low accuracy and low work efficiency of existing verification methods for image labeling tasks.
To solve the above technical problem, an embodiment of the present application provides a pre-verification method for image labeling tasks, which adopts the following technical solution:
receiving a pre-verification request sent by a user terminal, where the pre-verification request carries at least original image information and data labeling information;
performing a shunt recognition operation on the original image information based on a shunt recognition algorithm, and obtaining anchor point data corresponding to the original image information;
determining whether the data labeling information meets a preset anchor point requirement;
if the preset anchor point requirement is met, outputting a pre-verification success signal to the user terminal.
To solve the above technical problem, an embodiment of the present application also provides a pre-verification apparatus for image labeling tasks, which adopts the following technical solution:
a request receiving module, configured to receive a pre-verification request sent by a user terminal, where the pre-verification request carries at least original image information and data labeling information;
an anchor point acquisition module, configured to perform a shunt recognition operation on the original image information based on a shunt recognition algorithm, and obtain anchor point data corresponding to the original image information;
a labeling judgment module, configured to determine whether the data labeling information meets a preset anchor point requirement;
a result output module, configured to output a pre-verification success signal to the user terminal if the preset anchor point requirement is met.
To solve the above technical problem, an embodiment of the present application also provides a computer device, which adopts the following technical solution:
The computer device includes a memory and a processor. A computer process is stored on the computer-readable storage medium and, when executed by the processor, implements the following steps of the pre-verification method for image labeling tasks:
receiving a pre-verification request sent by a user terminal, where the pre-verification request carries at least original image information and data labeling information;
performing a shunt recognition operation on the original image information based on a shunt recognition algorithm, and obtaining anchor point data corresponding to the original image information;
determining whether the data labeling information meets a preset anchor point requirement;
if the preset anchor point requirement is met, outputting a pre-verification success signal to the user terminal.
To solve the above technical problem, an embodiment of the present application also provides a computer-readable storage medium, which adopts the following technical solution:
A computer process is stored on the computer-readable storage medium and, when executed by a processor, implements the following steps of the pre-verification method for image labeling tasks:
receiving a pre-verification request sent by a user terminal, where the pre-verification request carries at least original image information and data labeling information;
performing a shunt recognition operation on the original image information based on a shunt recognition algorithm, and obtaining anchor point data corresponding to the original image information;
determining whether the data labeling information meets a preset anchor point requirement;
if the preset anchor point requirement is met, outputting a pre-verification success signal to the user terminal.
The details of one or more embodiments of the present application are set out in the drawings and description below; other features and advantages of the present application will become apparent from the description, the drawings, and the claims.
This application provides a pre-verification method for image labeling tasks: receiving a pre-verification request sent by a user terminal, where the pre-verification request carries at least original image information and data labeling information; performing a shunt recognition operation on the original image information based on a shunt recognition algorithm, and obtaining anchor point data corresponding to the original image information; determining whether the data labeling information meets a preset anchor point requirement; and, if the preset anchor point requirement is met, outputting a pre-verification success signal to the user terminal. By obtaining the anchor point data and determining whether the user's data labeling information meets the anchor point requirement, the accuracy of the data labeling information submitted by users can be improved, which reduces the workload of the data processing platform and improves its work efficiency.
Description of the drawings
To explain the solutions in this application more clearly, the drawings used in the description of the embodiments are briefly introduced below. The drawings described below show some embodiments of the application; those of ordinary skill in the art can obtain other drawings from them without creative work.
FIG. 1 is an implementation flowchart of the pre-verification method for image labeling tasks provided in Embodiment 1 of the present application;
FIG. 2 is a schematic diagram of one kind of data labeling information provided in Embodiment 1 of the present application;
FIG. 3 is a schematic diagram of another kind of data labeling information provided in Embodiment 1 of the present application;
FIG. 4 is a schematic diagram of the anchor point requirement provided in Embodiment 1 of the present application;
FIG. 5 is an implementation flowchart of step S102 in FIG. 1;
FIG. 6 is an implementation flowchart of step S103 in FIG. 1;
FIG. 7 is an implementation flowchart of step S303 in FIG. 6;
FIG. 8 is an implementation flowchart of step S402 in FIG. 7;
FIG. 9 is a schematic structural diagram of the pre-verification apparatus for image labeling tasks provided in Embodiment 2 of the present application;
FIG. 10 is a schematic structural diagram of the anchor point acquisition module provided in Embodiment 2 of the present application;
FIG. 11 is a schematic structural diagram of an embodiment of a computer device according to the present application.
Detailed description
Unless otherwise defined, all technical and scientific terms used herein have the same meanings as commonly understood by those skilled in the technical field of this application. The terms used in the specification are only for describing specific embodiments and are not intended to limit the application. The terms "including" and "having" and any variations of them, as used in the specification, the claims, and the above description of the drawings, are intended to cover non-exclusive inclusion. The terms "first", "second", and the like in the specification, the claims, or the above drawings are used to distinguish different objects rather than to describe a specific order.
A reference herein to an "embodiment" means that a specific feature, structure, or characteristic described in conjunction with the embodiment may be included in at least one embodiment of the present application. Occurrences of the phrase in various places in the specification do not necessarily all refer to the same embodiment, nor to an independent or alternative embodiment that is mutually exclusive with other embodiments. Those skilled in the art understand, explicitly and implicitly, that the embodiments described herein can be combined with other embodiments.
To help those skilled in the art better understand the solutions of the present application, the technical solutions in the embodiments are described clearly and completely below with reference to the drawings.
The pre-verification method for image labeling tasks provided by the embodiments of the present application obtains anchor point data and determines whether the user's data labeling information meets the anchor point requirement. This improves the accuracy of the data labeling information submitted by users, which reduces the workload of the data processing platform and improves its work efficiency.
Embodiment 1
FIG. 1 shows the implementation flowchart of the pre-verification method for image labeling tasks provided in Embodiment 1 of the present application. For ease of description, only the parts related to the present application are shown.
In step S101, a pre-verification request sent by a user terminal is received, where the pre-verification request carries at least original image information and data labeling information.
In the embodiments of this application, the user terminal may be a mobile terminal such as a mobile phone, a smart phone, a notebook computer, a digital broadcast receiver, a PDA (personal digital assistant), a PAD (tablet computer), a PMP (portable multimedia player), or a navigation device, or a fixed terminal such as a digital TV or a desktop computer. It should be understood that these examples of user terminals are only for ease of understanding and are not intended to limit this application.
In the embodiments of this application, the original image information refers to image information in which the user is required to mark designated positions by drawing lines, for example the position of designated text in the picture. The user is generally restricted to drawing straight lines or shapes composed of line segments.
In the embodiments of this application, the data labeling information refers to the endpoint coordinates of the above straight lines, or of the line segments that make up the different shapes. As an example, in FIG. 2 the data labeling information is [(2,6),(5,5)]; in FIG. 3 the data labeling information is [(2,6),(5,5)], [(5,5),(7,7)], [(7,7),(4,8)], and [(4,8),(2,6)]. It should be understood that these examples of data labeling information are only for ease of understanding and are not intended to limit this application.
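As an illustration of this representation, the following minimal Python sketch builds the segment list of a closed shape from its vertices. The function name `closed_shape` and the type aliases are assumptions made for this example and do not come from the application.

```python
from typing import List, Tuple

Point = Tuple[float, float]
Segment = Tuple[Point, Point]


def closed_shape(vertices: List[Point]) -> List[Segment]:
    """Join consecutive vertices, and the last back to the first, into segments."""
    return [
        (vertices[i], vertices[(i + 1) % len(vertices)])
        for i in range(len(vertices))
    ]


# The four vertices of the FIG. 3 example yield its four labeled segments.
fig3 = closed_shape([(2, 6), (5, 5), (7, 7), (4, 8)])
```

A single straight line, as in FIG. 2, would simply be a list holding one segment.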
In step S102, a shunt recognition operation is performed on the original image information based on a shunt recognition algorithm, and anchor point data corresponding to the original image information is obtained.
In the embodiments of this application, the shunt recognition algorithm performs row cutting, full-text recognition, and column cutting on the original image information in turn, so that the original image is cut in units of the text it contains and the recognized text serves as the anchor point data.
In the embodiments of this application, an anchor point is a mark set on the original image information that defines a threshold range for the standard data labeling, so that it can be determined whether the data labeling information submitted by the user falls within that range.
In step S103, it is determined whether the data labeling information meets a preset anchor point requirement.
In the embodiments of this application, the anchor point requirement is used to check whether the data labeling information submitted by the user passes between a group of anchor points. As an example, in FIG. 4 the anchor point requirement is (2,5) and (5,7); the data labeling information [(2,6),(5,5)] passes between anchor points (2,5) and (5,7), so it meets the anchor point requirement. It should be understood that this example is only for ease of understanding and is not intended to limit this application.
In step S104, if the preset anchor point requirement is met, a pre-verification success signal is output to the user.
In the embodiments of this application, the pre-verification signal may be shown on the display device of the user terminal or delivered as a short message; this is not intended to limit the application.
In summary, the embodiment provides a pre-verification method for image labeling tasks: receiving a pre-verification request sent by a user terminal, where the pre-verification request carries at least original image information and data labeling information; performing a shunt recognition operation on the original image information based on a shunt recognition algorithm, and obtaining anchor point data corresponding to the original image information; determining whether the data labeling information meets a preset anchor point requirement; and, if the preset anchor point requirement is met, outputting a pre-verification success signal to the user terminal. By obtaining anchor point data and determining whether the user's data labeling information meets the anchor point requirement, the accuracy of the data labeling information submitted by users can be improved, which reduces the workload of the data processing platform and improves its work efficiency.
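The flow of steps S101 to S104 can be sketched as follows. This is only an outline under stated assumptions: `pre_verify`, `find_anchor_pairs`, `crosses`, and the two signal strings are hypothetical names, the anchor extraction and the crossing test are passed in as callables, and requiring every anchor pair to be crossed is an assumption about how several anchor groups would be combined.

```python
from typing import Callable, List, Tuple

Point = Tuple[float, float]
Segment = Tuple[Point, Point]
AnchorPair = Tuple[Point, Point]


def pre_verify(
    image: bytes,
    annotation: List[Segment],
    find_anchor_pairs: Callable[[bytes], List[AnchorPair]],
    crosses: Callable[[Segment, AnchorPair], bool],
) -> str:
    """Run steps S102 to S104 for one received request (S101)."""
    # S102: derive anchor pairs from the original image (hypothetical callable).
    anchor_pairs = find_anchor_pairs(image)
    # S103: assumed rule, each anchor pair must be crossed by some labeled segment.
    ok = all(
        any(crosses(segment, pair) for segment in annotation)
        for pair in anchor_pairs
    )
    # S104: report the outcome; the signal names are placeholders.
    return "PRE_CHECK_SUCCESS" if ok else "PRE_CHECK_FAILURE"
```

Passing the two operations in as parameters keeps the outline independent of any particular recognition or geometry routine.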
With continued reference to FIG. 5, the implementation flowchart of step S102 in FIG. 1 is shown. For ease of description, only the parts related to the present application are shown.
In some optional implementations of Embodiment 1, step S102 specifically includes steps S201, S202, S203, S204, and S205.
In step S201, a row cutting operation is performed on the original image information to obtain row image information.
In the embodiments of this application, the row cutting operation divides the original image into rows. The row height is not specifically limited, and the user can adjust it according to the actual situation.
In the embodiments of this application, the row image information refers to the image information obtained after the original image is divided into rows.
In step S202, a recognition operation is performed on the row image information to obtain text boxes.
In the embodiments of this application, the recognition operation recognizes the text content in the row image information; when text content is recognized, its position is marked with a text box.
In step S203, a column cutting operation is performed on the original image information according to the text boxes to obtain multiple columns of column image information carrying the text boxes.
In the embodiments of this application, the column cutting operation divides the original image into columns. Because the recognized text boxes were already obtained in step S202, the column cutting of the original image information is performed with reference to those text boxes, so that the text content in the original image information is separated.
In step S204, one text box is selected from the column image information of each of every two adjacent columns to form a text box group.
In the embodiments of this application, when the column image information contains only one text box, that text box is selected to form the text box group. When it contains multiple text boxes, the user may define the selection method according to the actual situation, for example random selection or selection by relative distance. It should be understood that these examples of selection methods are only for ease of understanding and are not intended to limit the embodiments of this application.
In step S205, the center points of the text boxes in the text box group are used as the anchor point data.
In the embodiments of this application, row image information is first obtained by row cutting the original image information, which makes the text content in the image easier to recognize. Text boxes are then obtained by recognizing the row image information, which determines the specific positions of the text content in the original image information. Finally, the anchor point data is determined from the text boxes, which defines the valid range of the user's labeling data and effectively standardizes the user's labeling operation.
In some optional implementations of Embodiment 1 of the present application, step S203 specifically includes the following step:
Cutting is performed with the vertical straight line that passes through the fewest text boxes as the column cutting line.
In the embodiments of this application, choosing the straight line that passes through the fewest text boxes as the column cutting line keeps as many valid text boxes as possible after cutting, which in turn helps ensure the accuracy of the anchor point setting.
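A minimal sketch of this selection rule is given below. It assumes that text boxes are axis-aligned rectangles given as `(left, top, right, bottom)` and that a list of candidate x positions is already available; how the candidates are generated is not specified in the text, and the names `boxes_crossed` and `best_cut` are illustrative only.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (left, top, right, bottom), assumed layout


def boxes_crossed(x: float, boxes: List[Box]) -> int:
    """Count the text boxes that a vertical line at x passes strictly through."""
    return sum(1 for left, _, right, _ in boxes if left < x < right)


def best_cut(candidates: List[float], boxes: List[Box]) -> float:
    """Return the candidate x position whose vertical line crosses the fewest boxes."""
    return min(candidates, key=lambda x: boxes_crossed(x, boxes))


boxes = [(0, 0, 4, 1), (5, 0, 9, 1), (0, 2, 4, 3)]
cut_x = best_cut([2, 4.5, 7], boxes)  # the line at 4.5 crosses no box here
```

When several candidates tie, `min` returns the first of them in list order; a different tie-breaking rule could be substituted.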
In some optional implementations of Embodiment 1 of the present application, step S204 specifically includes:
In the column image information of two adjacent columns, the two text boxes that are farthest apart in the vertical-axis direction are taken as the text box group.
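The selection in step S204, together with the center points of step S205, could look like the sketch below. It assumes axis-aligned boxes given as `(left, top, right, bottom)`, takes one box from each of the two adjacent columns, and measures vertical distance between box centers; the text does not say which points of the boxes are compared, so that choice and the function names are assumptions.

```python
from itertools import product
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (left, top, right, bottom), assumed layout
Point = Tuple[float, float]


def center(box: Box) -> Point:
    """Center point of a text box, used as an anchor point (S205)."""
    left, top, right, bottom = box
    return ((left + right) / 2, (top + bottom) / 2)


def anchor_pair(column_a: List[Box], column_b: List[Box]) -> Tuple[Point, Point]:
    """Pick one box per column so that their centers differ most in y (S204)."""
    box_a, box_b = max(
        product(column_a, column_b),
        key=lambda pair: abs(center(pair[0])[1] - center(pair[1])[1]),
    )
    return center(box_a), center(box_b)


left_column = [(0, 0, 2, 2), (0, 8, 2, 10)]
right_column = [(4, 0, 6, 2), (4, 4, 6, 6)]
anchors = anchor_pair(left_column, right_column)
```

Both columns must contain at least one text box; `max` raises `ValueError` on an empty input.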
FIG. 6 is the implementation flowchart of step S103 in FIG. 1. For ease of description, only the parts related to the present application are shown.
In some optional implementations of Embodiment 1 of the present application, step S103 specifically includes steps S301, S302, S303, and S304.
In step S301, a first equation corresponding to the anchor point data and a second equation corresponding to the data labeling information are obtained in the same coordinate system.
In the embodiments of this application, suppose the algorithm finds a pair of anchor points, point A and point B, in the original image information. The labeling submitted by the user must contain a line passing between point A and point B; that is, line segment AB must intersect some line segment in the user's data labeling.
The first equation and the second equation are established as follows:
First equation: let point A have coordinates $(x_A, y_A)$ and point B have coordinates $(x_B, y_B)$, with $x_A \le x_B$. The equation of the line segment follows from its two endpoints: $(y_B - y_A)x + (x_A - x_B)y + x_B y_A - x_A y_B = 0$, where $x_A \le x \le x_B$.
Second equation: the equation of each line segment in the user's data labeling follows from its endpoint coordinates. For example, if endpoint M of line segment MN has coordinates $(x_M, y_M)$ and endpoint N has coordinates $(x_N, y_N)$, with $x_N \le x_M$, the equation of line segment MN is $(y_N - y_M)x + (x_M - x_N)y + x_N y_M - x_M y_N = 0$, where $x_N \le x \le x_M$.
In step S302, it is determined whether the lines corresponding to the second equation and the first equation have an intersection point.
Form a system of equations from line segment AB and each line segment drawn by the user. If the system has exactly one solution within the domain, that line segment passes between anchor points A and B. Taking line segment MN as an example, whether MN passes between anchor points A and B is determined as follows:
① Form the system of equations:
$$\begin{cases} (y_B - y_A)x + (x_A - x_B)y + x_B y_A - x_A y_B = 0, & x_A \le x \le x_B \\ (y_N - y_M)x + (x_M - x_N)y + x_N y_M - x_M y_N = 0, & x_N \le x \le x_M \end{cases}$$
② Ignoring the domain, the abscissa $x^{*}$ of the unique solution of the system is:
Figure PCTCN2020117379-appb-000002
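For reference, the abscissa in step ② can be derived from the two line equations by Cramer's rule. The shorthand coefficients $a_i$, $b_i$, $c_i$ below are introduced here for illustration and are not notation from the application; the result may be arranged differently from the formula in the original equation image.

```latex
\[
\begin{aligned}
a_1 &= y_B - y_A, \quad b_1 = x_A - x_B, \quad c_1 = x_B y_A - x_A y_B,\\
a_2 &= y_N - y_M, \quad b_2 = x_M - x_N, \quad c_2 = x_N y_M - x_M y_N,\\
x^{*} &= \frac{b_1 c_2 - b_2 c_1}{a_1 b_2 - a_2 b_1}
\qquad (a_1 b_2 - a_2 b_1 \neq 0).
\end{aligned}
\]
```

With the FIG. 4 example, A = (2, 5), B = (5, 7), N = (2, 6), and M = (5, 5), the coefficients are $a_1 = 2$, $b_1 = -3$, $c_1 = 11$, $a_2 = 1$, $b_2 = 3$, $c_2 = -20$, so $x^{*} = (60 - 33)/9 = 3$, which lies in $[2, 5]$ for both segments.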
③ If the abscissa of the unique solution satisfies the domain requirements $x_A \le x^{*} \le x_B$ and $x_N \le x^{*} \le x_M$, line segment MN and line segment AB are considered to intersect within the domain; that is, a line passes between anchor point A and anchor point B.
In the embodiments of this application, when $x^{*}$ satisfies the domain requirements $x_A \le x^{*} \le x_B$ and $x_N \le x^{*} \le x_M$, the line of the second equation has an intersection point with the line of the first equation, which means the data labeling submitted by the user contains a line passing between point A and point B.
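Steps ① to ③ can be sketched in Python as follows. The function names are assumptions for this example, and the abscissa is obtained by Cramer's rule, which is mathematically equivalent to solving the system but may be written differently from the application's own formula. As in the text, only the abscissa is checked against the two domains.

```python
from typing import Tuple

Point = Tuple[float, float]
Segment = Tuple[Point, Point]


def line_coefficients(p: Point, q: Point) -> Tuple[float, float, float]:
    """Return (a, b, c) of the line a*x + b*y + c = 0 through p and q."""
    (xp, yp), (xq, yq) = p, q
    return (yq - yp, xp - xq, xq * yp - xp * yq)


def passes_between(anchors: Segment, segment: Segment) -> bool:
    """Check whether a labeled segment crosses the anchor segment AB."""
    a1, b1, c1 = line_coefficients(*anchors)
    a2, b2, c2 = line_coefficients(*segment)
    det = a1 * b2 - a2 * b1
    if det == 0:
        # Parallel or coincident lines have no unique solution.
        return False
    x_star = (b1 * c2 - b2 * c1) / det
    # Domain check on the abscissa only, following step 3 of the text.
    (xa, _), (xb, _) = anchors
    (xm, _), (xn, _) = segment
    return (
        min(xa, xb) <= x_star <= max(xa, xb)
        and min(xm, xn) <= x_star <= max(xm, xn)
    )


# FIG. 4 example: anchors (2,5) and (5,7), labeled segment [(2,6),(5,5)].
fig4_ok = passes_between(((2, 5), (5, 7)), ((2, 6), (5, 5)))
```

Because the check uses x coordinates alone, a vertical anchor or labeled segment (equal x at both endpoints) would need an additional check on the y coordinate; that case is not covered by this sketch.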
In step S303, if the lines corresponding to the second equation and the first equation have an intersection point, the pre-verification success signal is output.
In step S304, if the lines corresponding to the second equation and the first equation have no intersection point, a pre-verification failure signal is output.
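Steps S301 to S304 can be sketched in code. The Python below is a minimal illustration, not the applicant's implementation: the function names, the tuple-based segment representation, and the "success"/"failure" return strings are assumptions made for this example. It follows the abscissa-only domain check described above, so vertical segments receive no special treatment.

```python
from typing import Iterable, Optional, Tuple

Point = Tuple[float, float]
Segment = Tuple[Point, Point]


def line_coefficients(p: Point, q: Point) -> Tuple[float, float, float]:
    """Return (a, b, c) such that a*x + b*y + c = 0 passes through p and q."""
    (x1, y1), (x2, y2) = p, q
    return (y2 - y1, x1 - x2, x2 * y1 - x1 * y2)


def unique_solution_x(seg1: Segment, seg2: Segment) -> Optional[float]:
    """Abscissa of the unique solution, ignoring the domain (None if there is none)."""
    a1, b1, c1 = line_coefficients(*seg1)
    a2, b2, c2 = line_coefficients(*seg2)
    det = a1 * b2 - a2 * b1
    if det == 0:  # parallel or coincident lines: no unique solution
        return None
    return (b1 * c2 - b2 * c1) / det


def passes_between(anchor_seg: Segment, user_seg: Segment) -> bool:
    """True if x* lies inside the x-range of both segments."""
    x_star = unique_solution_x(anchor_seg, user_seg)
    if x_star is None:
        return False
    for (xa, _), (xb, _) in (anchor_seg, user_seg):
        if not min(xa, xb) <= x_star <= max(xa, xb):
            return False
    return True


def pre_check(anchor_seg: Segment, user_segments: Iterable[Segment]) -> str:
    """Hypothetical signal: 'success' if any user segment passes between the anchors."""
    ok = any(passes_between(anchor_seg, s) for s in user_segments)
    return "success" if ok else "failure"
```

For instance, `pre_check(((2, 5), (5, 7)), [((2, 6), (5, 5))])` checks a single user-drawn segment against one anchor pair.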
In the embodiment of the present application, the logic for judging whether two line segments intersect is expressed as a judgment formula built from coordinates and simultaneous equations. This standardizes the answers submitted by users, reduces the probability that crowdsourcing users fraudulently obtain points, lets the crowdsourcing platform obtain more accurate answers in less time, and improves the reliability of answers to image annotation crowdsourcing tasks.
FIG. 7 is a flowchart of an implementation of step S303 in FIG. 6. For ease of description, only the parts related to the present application are shown.
In some optional implementations of Embodiment 1 of the present application, the above step S303 specifically includes step S401, step S402, and step S403.
In step S401, if the judgment result is yes, it is determined whether the lines corresponding to the first equation and the second equation are parallel or coincident.
In the embodiment of the present application, taking the above simultaneous equations as an example, whether the lines of the first equation and the second equation are parallel or coincident can be determined by judging whether
Figure PCTCN2020117379-appb-000003
is equal to
Figure PCTCN2020117379-appb-000004
In step S402, if the lines corresponding to the first equation and the second equation are neither parallel nor coincident, the pre-verification success signal is output.
In step S403, if the lines corresponding to the first equation and the second equation are parallel or coincident, the pre-verification failure signal is output.
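The parallel-or-coincident test of steps S401 to S403 can be sketched as follows. The two quantities the source compares are available only as figures, so this example swaps in a cross-multiplication of the segments' direction vectors, which avoids dividing by zero for vertical segments. The function names and return strings are assumptions for illustration, and exact equality is used, which suits integer pixel coordinates better than floating-point ones.

```python
from typing import Tuple

Point = Tuple[float, float]
Segment = Tuple[Point, Point]


def direction(seg: Segment) -> Tuple[float, float]:
    """Direction vector (dx, dy) from the first endpoint to the second."""
    (x1, y1), (x2, y2) = seg
    return (x2 - x1, y2 - y1)


def parallel_or_coincident(seg1: Segment, seg2: Segment) -> bool:
    """True when the direction vectors are proportional (cross product is zero)."""
    dx1, dy1 = direction(seg1)
    dx2, dy2 = direction(seg2)
    return dx1 * dy2 == dx2 * dy1


def parallel_check_signal(anchor_seg: Segment, user_seg: Segment) -> str:
    """Hypothetical signal for S402/S403: fail if parallel or coincident."""
    return "failure" if parallel_or_coincident(anchor_seg, user_seg) else "success"
```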
FIG. 8 is a flowchart of an implementation of step S402 in FIG. 7. For ease of description, only the parts related to the present application are shown.
In some optional implementations of Embodiment 1 of the present application, the above step S402 specifically includes step S501, step S502, step S503, and step S504. The original image information annotation task also carries invalid anchor point data.
In step S501, the third equation, corresponding to the invalid anchor point data, and the fourth equation, corresponding to the data annotation information, are acquired in the same coordinate system.
In the embodiment of the present application, invalid anchor point data refers to regions, preset by the original image information annotation task, that annotation lines must not pass through. For example, positions such as the file signature and the file header in the original image information only identify the file and are unrelated to the annotation task; an annotation drawn at such a position clearly does not meet the requirements of the task. The positions of invalid anchor points therefore need to be preset in the original image information annotation task.
In the embodiment of the present application, the third equation refers to the equation formed from the invalid anchor point data. Assuming that point C of the invalid anchor points has coordinates (x_C, y_C), point D has coordinates (x_D, y_D), and x_C ≤ x_D, the third equation of the line segment can be obtained from the coordinates of these two points: (y_D - y_C)x + (x_C - x_D)y + x_D*y_C - x_C*y_D = 0 (x_C ≤ x ≤ x_D).
In the embodiment of the present application, the fourth equation refers to the equation formed from the endpoints of the data annotation information submitted by the user. Assume that the two endpoints of the data annotation information submitted by the user are Q(x_Q, y_Q) and P(x_P, y_P), with x_Q ≤ x_P.
The fourth equation is then: (y_P - y_Q)x + (x_Q - x_P)y + x_P*y_Q - x_Q*y_P = 0 (x_Q ≤ x ≤ x_P).
In step S502, it is determined whether the lines corresponding to the fourth equation and the third equation have an intersection point.
① Form the simultaneous equations:
Figure PCTCN2020117379-appb-000005
② Ignoring the domain, the abscissa of the unique solution of the system is:
Figure PCTCN2020117379-appb-000006
③ If the abscissa x* of the unique solution satisfies the domain requirements x_C ≤ x* ≤ x_D and x_Q ≤ x* ≤ x_P, line segment QP and line segment CD are considered to intersect within the domain, that is, a line passes between anchor point C and anchor point D.
In step S503, if the lines corresponding to the fourth equation and the third equation have an intersection point, the pre-verification failure signal is output.
In step S504, if the lines corresponding to the fourth equation and the third equation have no intersection point, the pre-verification success signal is output.
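Steps S501 to S504 invert the earlier logic: an intersection with the invalid anchor segment CD now causes failure. The sketch below illustrates this under the same abscissa-only domain check. The function names, segment representation, and return strings are assumptions for this example and do not come from the source.

```python
from typing import Iterable, Tuple

Point = Tuple[float, float]
Segment = Tuple[Point, Point]


def crosses(seg1: Segment, seg2: Segment) -> bool:
    """True if the unique solution's abscissa lies in the x-range of both segments."""
    (x1, y1), (x2, y2) = seg1
    (x3, y3), (x4, y4) = seg2
    # Coefficients of a*x + b*y + c = 0 for each segment's supporting line.
    a1, b1, c1 = y2 - y1, x1 - x2, x2 * y1 - x1 * y2
    a2, b2, c2 = y4 - y3, x3 - x4, x4 * y3 - x3 * y4
    det = a1 * b2 - a2 * b1
    if det == 0:  # parallel or coincident: no unique solution
        return False
    x_star = (b1 * c2 - b2 * c1) / det
    return (min(x1, x2) <= x_star <= max(x1, x2)
            and min(x3, x4) <= x_star <= max(x3, x4))


def invalid_anchor_signal(user_segments: Iterable[Segment],
                          invalid_seg: Segment) -> str:
    """Hypothetical signal: fail as soon as any user segment crosses segment CD."""
    hit = any(crosses(invalid_seg, s) for s in user_segments)
    return "failure" if hit else "success"
```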
In summary, the present application provides a pre-verification method for image annotation tasks: receiving a pre-verification request sent by a user terminal, the pre-verification request carrying at least original image information and data annotation information; performing a shunt recognition operation on the original image information based on a shunt recognition algorithm to obtain anchor point data corresponding to the original image information; judging whether the data annotation information meets a preset anchor point requirement; and, if the preset anchor point requirement is met, outputting a pre-verification success signal to the user terminal. By acquiring anchor point data and judging whether the user's data annotation information meets the anchor point requirement, the accuracy of the data annotation information submitted by users can be improved, which reduces the workload of the data processing platform and improves its work efficiency. In addition, line image information is first obtained by performing a row cutting operation on the original image information, which makes the text content in the image easier to recognize; text boxes are then obtained by performing a recognition operation on the line image information, which determines the specific position of the text content in the original image information; finally, the anchor point data is determined from the text boxes, which bounds the valid range of the user's annotation data and effectively standardizes the user's annotation operation. Expressing the logic for judging whether two line segments intersect as a judgment formula built from coordinates and simultaneous equations standardizes the answers submitted by users, reduces the probability that crowdsourcing users fraudulently obtain points, lets the crowdsourcing platform obtain more accurate answers in less time, and improves the reliability of answers to image annotation crowdsourcing tasks.
A person of ordinary skill in the art will understand that all or part of the processes in the methods of the above embodiments can be implemented by a computer process instructing the relevant hardware. The computer process can be stored in a computer-readable storage medium and, when executed, may include the processes of the above method embodiments. The storage medium may be a non-volatile storage medium such as a magnetic disk, an optical disc, or a read-only memory (ROM), or a random access memory (RAM), etc.
It should be understood that although the steps in the flowcharts of the drawings are displayed in the sequence indicated by the arrows, they are not necessarily executed in that sequence. Unless explicitly stated herein, the execution of these steps is not strictly limited in order, and they can be executed in other orders. Moreover, at least some of the steps in the flowcharts may include multiple sub-steps or stages. These sub-steps or stages are not necessarily completed at the same moment and can be executed at different times; nor are they necessarily executed sequentially, and they may be executed in turn or alternately with other steps or with at least some of the sub-steps or stages of other steps.
Embodiment 2
With further reference to FIG. 9, as an implementation of the method shown in FIG. 1, the present application provides a pre-verification apparatus for image annotation tasks. The apparatus embodiment corresponds to the method embodiment shown in FIG. 1, and the apparatus can be applied to various electronic devices.
As shown in FIG. 9, the pre-verification apparatus 100 for image annotation tasks provided in Embodiment 2 of the present application includes a request receiving module 101, an anchor point acquisition module 102, an annotation judgment module 103, and a result output module 104, where:
the request receiving module 101 is configured to receive a pre-verification request sent by a user terminal, the pre-verification request carrying at least original image information and data annotation information;
the anchor point acquisition module 102 is configured to perform a shunt recognition operation on the original image information based on a shunt recognition algorithm and obtain anchor point data corresponding to the original image information;
the annotation judgment module 103 is configured to judge whether the data annotation information meets a preset anchor point requirement;
the result output module 104 is configured to output a pre-verification success signal to the user terminal if the preset anchor point requirement is met.
In the embodiment of the present application, the user terminal may be a mobile terminal such as a mobile phone, a smartphone, a notebook computer, a digital broadcast receiver, a PDA (personal digital assistant), a PAD (tablet computer), a PMP (portable multimedia player), or a navigation device, or a fixed terminal such as a digital TV or a desktop computer. It should be understood that these examples of user terminals are given only for ease of understanding and do not limit the present application.
In the embodiment of the present application, original image information refers to image information in which the user needs to mark certain designated positions by drawing lines in the picture, such as the position of designated text. The user is generally restricted to drawing straight lines or shapes composed of line segments.
In the embodiment of the present application, data annotation information refers to the different shapes formed by combining the endpoint coordinates of the above straight lines and line segments. As an example, as shown in FIG. 2, the data annotation information is [(2,6),(5,5)]; as shown in FIG. 3, the data annotation information is [(2,6),(5,5)], [(5,5),(7,7)], [(7,7),(4,8)], and [(4,8),(2,6)]. It should be understood that these examples of data annotation information are given only for ease of understanding and do not limit the present application.
In the embodiment of the present application, the shunt recognition algorithm refers to performing row cutting, full-text recognition, and column cutting operations on the original image information, so that the original image is cut in units of the text it contains and the recognized text is used as anchor point data.
In the embodiment of the present application, an anchor point refers to a mark set on the original image information. It sets a threshold range for the standard data annotation, against which it is determined whether the data annotation information submitted by the user falls within that range.
In the embodiment of the present application, the anchor point requirement is used to check whether the data annotation information submitted by the user contains a line segment passing between a pair of anchor points. As an example, as shown in FIG. 4, if the anchor point requirement is (2,5) and (5,7), the data annotation information [(2,6),(5,5)] passes between anchor points (2,5) and (5,7), that is, the data annotation information [(2,6),(5,5)] meets the anchor point requirement. It should be understood that this example of an anchor point requirement is given only for ease of understanding and does not limit the present application.
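The numbers in this example can be checked by hand. The worked calculation below is an illustration, not part of the original text. It takes A = (2,5) and B = (5,7) as the anchor pair and N = (2,6), M = (5,5) as the annotation endpoints (so that x_N ≤ x_M), and it assumes the anchor segment AB uses the same general line form as the second equation of the method embodiment.

```latex
\begin{aligned}
AB:&\quad (7-5)x + (2-5)y + 5\cdot 5 - 2\cdot 7 = 0
   \;\Longrightarrow\; 2x - 3y + 11 = 0, \qquad 2 \le x \le 5,\\
MN:&\quad (6-5)x + (5-2)y + 2\cdot 5 - 5\cdot 6 = 0
   \;\Longrightarrow\; x + 3y - 20 = 0, \qquad 2 \le x \le 5,\\[4pt]
&\text{adding the two equations: } 3x - 9 = 0
   \;\Longrightarrow\; x^{*} = 3,\quad y^{*} = \tfrac{17}{3}.
\end{aligned}
```

Since 2 ≤ 3 ≤ 5 holds for both segments, the annotation line crosses segment AB, which agrees with the conclusion that [(2,6),(5,5)] meets the anchor point requirement.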
In the embodiment of the present application, the pre-verification signal may be output to the user by displaying it on the display device of the user terminal or by sending a short message; this does not limit the present application.
In the embodiment of the present application, a pre-verification apparatus for image annotation tasks is provided, including: a request receiving module configured to receive a pre-verification request sent by a user terminal, the pre-verification request carrying at least original image information and data annotation information; an anchor point acquisition module configured to perform a shunt recognition operation on the original image information based on a shunt recognition algorithm and obtain anchor point data corresponding to the original image information; an annotation judgment module configured to judge whether the data annotation information meets a preset anchor point requirement; and a result output module configured to output the pre-verification success signal to the user terminal if the preset anchor point requirement is met. By acquiring anchor point data and judging whether the user's data annotation information meets the anchor point requirement, the accuracy of the data annotation information submitted by users can be improved, which reduces the workload of the data processing platform and improves its work efficiency.
In some optional implementations of Embodiment 2 of the present application, as shown in FIG. 10, the above anchor point acquisition module 102 includes a line image acquisition sub-module 1021, a text box acquisition sub-module 1022, a column image acquisition sub-module 1023, a text box group acquisition sub-module 1024, and an anchor point determination sub-module 1025, where:
the line image acquisition sub-module 1021 is configured to perform a row cutting operation on the original image information to obtain line image information;
the text box acquisition sub-module 1022 is configured to perform a recognition operation on the line image information to obtain text boxes;
the column image acquisition sub-module 1023 is configured to perform a column cutting operation on the original image information according to the text boxes and obtain multiple columns of column image information carrying the text boxes;
the text box group acquisition sub-module 1024 is configured to select one text box from each of every two adjacent columns of the column image information to form a text box group;
the anchor point determination sub-module 1025 is configured to use the center points of the text boxes in the text box group as the anchor point data.
In the embodiment of the present application, the row cutting operation refers to dividing the original image into rows. The row height is not specifically limited, and the user can adjust it according to the actual situation.
In the embodiment of the present application, line image information refers to the image information obtained after the original image is divided into rows.
In the embodiment of the present application, the recognition operation refers to recognizing the text content in the above line image information. When text content is recognized, its position is marked with a text box.
In the embodiment of the present application, the column cutting operation refers to dividing the original image into columns. Because the recognized text boxes have already been obtained in step S202, the column cutting operation on the original image information uses those text boxes as the reference for division, so that the text content in the original image information is separated.
In the embodiment of the present application, when the column image information contains only one text box, that text box is selected for the text box group. When the column image information contains multiple text boxes, the user can define the selection method according to the actual situation, for example random selection or selection by relative distance. It should be understood that these examples of selection methods are given only for ease of understanding and do not limit the embodiments of the present application.
In the embodiment of the present application, line image information is first obtained by performing a row cutting operation on the original image information, which makes the text content in the image easier to recognize; text boxes are then obtained by performing a recognition operation on the line image information, which determines the specific position of the text content in the original image information; finally, the anchor point data is determined from the text boxes, which bounds the valid range of the user's annotation data and effectively standardizes the user's annotation operation.
In some optional implementations of Embodiment 2 of the present application, the above text box group acquisition sub-module 1024 includes:
a text box group determination sub-module, configured to take, from the column image information of two adjacent columns, the two text boxes that are farthest apart in the vertical-axis direction as the text box group.
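The selection rule above, combined with the center-point rule of sub-module 1025, can be sketched as follows. This is an illustrative reading rather than the applicant's code: it assumes each text box is an axis-aligned rectangle given as (x_min, y_min, x_max, y_max), measures vertical distance between box centers, and uses hypothetical function names.

```python
from itertools import product
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max), assumed layout
Point = Tuple[float, float]


def center(box: Box) -> Point:
    """Center point of an axis-aligned text box."""
    x0, y0, x1, y1 = box
    return ((x0 + x1) / 2, (y0 + y1) / 2)


def anchor_pair(col_a: List[Box], col_b: List[Box]) -> Tuple[Point, Point]:
    """Pick one box per adjacent column so the centers are farthest apart vertically,
    then return the two centers as the anchor points."""
    box_a, box_b = max(
        product(col_a, col_b),
        key=lambda pair: abs(center(pair[0])[1] - center(pair[1])[1]),
    )
    return center(box_a), center(box_b)
```

When a column holds a single text box, `product` yields only pairs containing that box, which matches the single-box case of the selection rule.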
In some optional implementations of Embodiment 2 of the present application, the above column image acquisition sub-module 1023 includes:
a column cutting line determination sub-module, configured to cut along the vertical straight line that passes through the fewest text boxes, using it as the column cutting line.
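One way to read this rule is as a search over candidate vertical lines. The sketch below assumes text boxes are given as (x_min, y_min, x_max, y_max) and that a list of candidate x positions is supplied by the caller; both the representation and the function names are assumptions for illustration, and the source does not say how candidates are generated.

```python
from typing import Iterable, List, Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max), assumed layout


def boxes_crossed(x: float, boxes: Iterable[Box]) -> int:
    """Number of text boxes that the vertical line at abscissa x passes through."""
    return sum(1 for (x0, _y0, x1, _y1) in boxes if x0 < x < x1)


def best_cut(candidates: List[float], boxes: List[Box]) -> float:
    """Candidate abscissa whose vertical line crosses the fewest text boxes.
    Ties resolve to the first such candidate."""
    return min(candidates, key=lambda x: boxes_crossed(x, boxes))
```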
In some optional implementations of Embodiment 2 of the present application, the above annotation judgment module 103 includes:
a first equation acquisition sub-module, configured to acquire the first equation and the second equation corresponding to the data annotation information and the anchor point data;
a first equation judgment sub-module, configured to judge whether the line of the second equation has an intersection point with the line of the first equation;
a first signal output sub-module, configured to output the pre-verification success signal if the line of the second equation has an intersection point with the line of the first equation;
a second signal output sub-module, configured to output a pre-verification failure signal if the line of the second equation has no intersection point with the line of the first equation.
In some optional implementations of Embodiment 2 of the present application, the above first signal output sub-module specifically includes:
a second equation judgment sub-module, configured to judge, if the judgment result is yes, whether the lines of the first equation and the second equation are parallel or coincident;
a third signal output sub-module, configured to output the pre-verification success signal if the lines of the first equation and the second equation are neither parallel nor coincident;
a fourth signal output sub-module, configured to output the pre-verification failure signal if the lines of the first equation and the second equation are parallel or coincident.
In some optional implementations of Embodiment 2 of the present application, the above third signal output sub-module specifically includes:
a second equation acquisition sub-module, configured to acquire the third equation and the fourth equation corresponding to the data annotation information and the invalid anchor point data;
a third equation judgment sub-module, configured to judge whether the line of the fourth equation has an intersection point with the line of the third equation;
a fifth signal output sub-module, configured to output the pre-verification failure signal if the line of the fourth equation has an intersection point with the line of the third equation;
a sixth signal output sub-module, configured to output the pre-verification success signal if the line of the fourth equation has no intersection point with the line of the third equation.
实施例三Example three
为解决上述技术问题,本申请实施例还提供计算机设备。具体请参阅图11,图11为本实施例计算机设备基本结构框图。In order to solve the above technical problems, the embodiments of the present application also provide computer equipment. Please refer to FIG. 11 for details. FIG. 11 is a block diagram of the basic structure of the computer device in this embodiment.
所述计算机设备11包括通过系统总线相互通信连接存储器111、处理器112、网络接口113。需要指出的是,图中仅示出了具有组件111-113的计算机设备11,但是应理解的是,并不要求实施所有示出的组件,可以替代的实施更多或者更少的组件。其中,本技术领域技术人员可以理解,这里的计算机设备是一种能够按照事先设定或存储的指令,自动进行数值计算和/或信息处理的设备,其硬件包括但不限于微处理器、专用集成电路(Application Specific Integrated Circuit,ASIC)、可编程门阵列(Field-Programmable Gate Array,FPGA)、数字处理器(Digital Signal Processor,DSP)、嵌入式设备等。The computer device 11 includes a memory 111, a processor 112, and a network interface 113 that are connected to each other in communication through a system bus. It should be pointed out that the figure only shows the computer device 11 with components 111-113, but it should be understood that it is not required to implement all of the illustrated components, and more or fewer components may be implemented instead. Among them, those skilled in the art can understand that the computer device here is a device that can automatically perform numerical calculation and/or information processing in accordance with pre-set or stored instructions. Its hardware includes, but is not limited to, a microprocessor, a dedicated Integrated Circuit (Application Specific Integrated Circuit, ASIC), Programmable Gate Array (Field-Programmable Gate Array, FPGA), Digital Processor (Digital Signal Processor, DSP), embedded equipment, etc.
所述计算机设备可以是桌上型计算机、笔记本、掌上电脑及云端服务器等计算设备。所述计算机设备可以与用户通过键盘、鼠标、遥控器、触摸板或声控设备等方式进行人机交互。The computer device may be a computing device such as a desktop computer, a notebook, a palmtop computer, and a cloud server. The computer device can interact with the user through a keyboard, a mouse, a remote control, a touch panel, or a voice control device.
所述存储器111至少包括一种类型的可读存储介质,所述可读存储介质包括闪存、硬盘、多媒体卡、卡型存储器(例如,SD或DX存储器等)、随机访问存储器(RAM)、静态随机访问存储器(SRAM)、只读存储器(ROM)、电可擦除可编程只读存储器(EEPROM)、可编程只读存储器(PROM)、磁性存储器、磁盘、光盘等,所述计算机可读存储介质可以是非易失性,也可以是易失性。在一些实施例中,所述存储器111可以是所述计算机设备11的内部存储单元,例如该计算机设备11的硬盘或内存。在另一些实施例中,所述存储器111也可以是所述计算机设备11的外部存储设备,例如该计算机设备11上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。当然,所述存储器111还可以既包括所述计算机设备11的内部存储单元也包括其外部存储设备。本实施例中,所述存储器111通常用于存储安装于所述计算机设备11的操作系统和各类应用软件,例如图像标注任务的预校验方法的计算机可读指令等。此外,所述存储器111还可以用于暂时地存储已经输出或者将要输出的各类数据。The memory 111 includes at least one type of readable storage medium, and the readable storage medium includes flash memory, hard disk, multimedia card, card-type memory (for example, SD or DX memory, etc.), random access memory (RAM), static memory Random access memory (SRAM), read only memory (ROM), electrically erasable programmable read only memory (EEPROM), programmable read only memory (PROM), magnetic memory, magnetic disk, optical disk, etc., the computer readable storage The medium can be non-volatile or volatile. In some embodiments, the memory 111 may be an internal storage unit of the computer device 11, such as a hard disk or a memory of the computer device 11. In other embodiments, the memory 111 may also be an external storage device of the computer device 11, such as a plug-in hard disk, a smart media card (SMC), and a secure digital device equipped on the computer device 11. (Secure Digital, SD) card, Flash Card, etc. Of course, the memory 111 may also include both an internal storage unit of the computer device 11 and an external storage device thereof. In this embodiment, the memory 111 is generally used to store an operating system and various application software installed in the computer device 11, such as computer-readable instructions for a pre-verification method for image labeling tasks. In addition, the memory 111 can also be used to temporarily store various types of data that have been output or will be output.
所述处理器112在一些实施例中可以是中央处理器(Central Processing Unit,CPU)、控制器、微控制器、微处理器、或其他数据处理芯片。该处理器112通常用于控制所述计算机设备11的总体操作。本实施例中,所述处理器112用于运行所述存储器111中存储的计算机可读指令或者处理数据,例如运行所述图像标注任务的预校验方法的计算机可读指令。The processor 112 may be a central processing unit (Central Processing Unit, CPU), a controller, a microcontroller, a microprocessor, or other data processing chips in some embodiments. The processor 112 is generally used to control the overall operation of the computer device 11. In this embodiment, the processor 112 is configured to run computer-readable instructions or processed data stored in the memory 111, for example, computer-readable instructions for running the pre-verification method of the image labeling task.
所述网络接口113可包括无线网络接口或有线网络接口,该网络接口113通常用于在 所述计算机设备11与其他电子设备之间建立通信连接。The network interface 113 may include a wireless network interface or a wired network interface. The network interface 113 is generally used to establish a communication connection between the computer device 11 and other electronic devices.
本申请还提供了另一种实施方式,即提供一种计算机可读存储介质,所述计算机可读存储介质存储有图像标注任务的预校验流程,所述图像标注任务的预校验流程可被至少一个处理器执行,以使所述至少一个处理器执行如上述的图像标注任务的预校验方法的步骤。This application also provides another implementation manner, that is, to provide a computer-readable storage medium that stores a pre-verification process of an image labeling task, and the pre-verification process of the image labeling task can be Is executed by at least one processor, so that the at least one processor executes the steps of the pre-verification method for the image labeling task as described above.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,空调器,或者网络设备等)执行本申请各个实施例所述的方法。Through the description of the above implementation manners, those skilled in the art can clearly understand that the above-mentioned embodiment method can be implemented by means of software plus the necessary general hardware platform, of course, it can also be implemented by hardware, but in many cases the former is better.的实施方式。 Based on this understanding, the technical solution of this application essentially or the part that contributes to the existing technology can be embodied in the form of a software product, and the computer software product is stored in a storage medium (such as ROM/RAM, magnetic disk, The optical disc) includes several instructions to make a terminal device (which can be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) execute the methods described in the various embodiments of the present application.
Obviously, the embodiments described above are only some of the embodiments of this application, not all of them. The drawings show preferred embodiments of this application but do not limit its patent scope. This application can be implemented in many different forms; these embodiments are provided so that the disclosure of this application can be understood more thoroughly and comprehensively. Although this application has been described in detail with reference to the foregoing embodiments, those skilled in the art can still modify the technical solutions described in the foregoing specific embodiments or make equivalent replacements for some of their technical features. Any equivalent structure made using the contents of the description and drawings of this application, and applied directly or indirectly in other related technical fields, likewise falls within the scope of patent protection of this application.

Claims (20)

  1. A pre-verification method for an image labeling task, the method comprising the following steps:
    receiving a pre-verification request sent by a user terminal, the pre-verification request carrying at least original image information and data annotation information;
    performing a shunt recognition operation on the original image information based on a shunt recognition algorithm to obtain anchor point data corresponding to the original image information;
    determining whether the data annotation information meets a preset anchor point requirement; and
    if the preset anchor point requirement is met, outputting a pre-verification success signal to the user terminal.
  2. The pre-verification method for an image labeling task according to claim 1, wherein the step of performing a shunt recognition operation on the original image information based on a shunt recognition algorithm to obtain anchor point data corresponding to the original image information specifically comprises the following steps:
    performing a row cutting operation on the original image information to obtain row image information;
    performing a recognition operation on the row image information to obtain text boxes;
    performing a column cutting operation on the original image information according to the text boxes to obtain multiple columns of column image information carrying the text boxes;
    selecting one text box from the column image information of each of every two adjacent columns to form a text box group; and
    using the center points of the text boxes in the text box group as the anchor point data.
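For illustration only, the last step of claim 2 (taking text box center points as anchor point data) can be sketched as follows. The `TextBox` type, its corner-coordinate layout, and the function names are assumptions made for this sketch; the application does not specify a data structure.

```python
from typing import List, NamedTuple, Tuple


class TextBox(NamedTuple):
    """Axis-aligned text box in image pixel coordinates (hypothetical layout)."""
    x0: float  # left edge
    y0: float  # top edge
    x1: float  # right edge
    y1: float  # bottom edge


def center(box: TextBox) -> Tuple[float, float]:
    """Return the geometric center of a text box."""
    return ((box.x0 + box.x1) / 2, (box.y0 + box.y1) / 2)


def anchor_points(group: List[TextBox]) -> List[Tuple[float, float]]:
    """Map each text box in a text box group to its center point."""
    return [center(box) for box in group]
```

A text box group of two boxes therefore yields two anchor points, which can later be joined into a line segment.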
  3. The pre-verification method for an image labeling task according to claim 2, wherein the step of selecting one text box from the column image information of each of every two adjacent columns to form a text box group specifically comprises the following step:
    using, as the text box group, the two text boxes in the column image information of the two adjacent columns that are farthest apart in the vertical-axis direction.
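A minimal sketch of the selection in claim 3, under stated assumptions: boxes are `(x0, y0, x1, y1)` tuples, vertical distance is measured between box centers, and the function name is hypothetical. The application does not say how ties are broken; here the first maximal pair wins.

```python
from itertools import product
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x0, y0, x1, y1), assumed layout


def farthest_vertical_pair(left_column: List[Box],
                           right_column: List[Box]) -> Tuple[Box, Box]:
    """Pick one box per adjacent column so the pair is farthest apart vertically."""
    def center_y(box: Box) -> float:
        return (box[1] + box[3]) / 2

    # Examine every (left, right) combination and keep the largest vertical gap.
    return max(product(left_column, right_column),
               key=lambda pair: abs(center_y(pair[0]) - center_y(pair[1])))
```

Choosing a widely separated pair makes the segment joining the two centers long and slanted, so a correctly drawn column separator is likely to cross it.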
  4. The pre-verification method for an image labeling task according to claim 2, wherein the step of performing a column cutting operation on the original image information according to the text boxes to obtain multiple columns of column image information carrying the text boxes specifically comprises the following step:
    cutting along a column cutting line, the column cutting line being the vertical straight line that passes through the fewest text boxes.
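The column cutting rule of claim 4 can be sketched as a search over candidate vertical lines. This is only an illustration: the candidate x positions, the `(x0, y0, x1, y1)` box layout, and the function names are assumptions, and how candidates are generated is left to the caller.

```python
from typing import Iterable, List, Tuple

Box = Tuple[float, float, float, float]  # (x0, y0, x1, y1), assumed layout


def crossings(x: float, boxes: List[Box]) -> int:
    """Count text boxes whose interior is crossed by the vertical line at x."""
    return sum(1 for (x0, _y0, x1, _y1) in boxes if x0 < x < x1)


def best_cut_line(candidates: Iterable[float], boxes: List[Box]) -> float:
    """Return the candidate x that passes through the fewest text boxes.

    Ties are resolved in favor of the earliest candidate.
    """
    return min(candidates, key=lambda x: crossings(x, boxes))
```

For example, with two boxes ending near x = 45 and two starting near x = 55, candidates spaced every 10 pixels give a cut at x = 50, which crosses no box.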
  5. The pre-verification method for an image labeling task according to claim 1, wherein the step of determining whether the data annotation information meets the preset anchor point requirement specifically comprises the following steps:
    obtaining, in the same coordinate system, a first equation and a second equation corresponding respectively to the data annotation information and the anchor point data;
    determining whether the lines corresponding to the second equation and the first equation have an intersection point;
    if the lines corresponding to the second equation and the first equation have an intersection point, outputting the pre-verification success signal; and
    if the lines corresponding to the second equation and the first equation have no intersection point, outputting a pre-verification failure signal.
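One possible reading of claim 5, offered as a sketch rather than as the applicant's implementation: the annotation is a line segment drawn by the labeler, the anchor points define a second segment, and the check passes when the two segments share a point. Treating both "equations" as segments with two endpoints, and all names below, are assumptions of this sketch. The test uses the standard orientation (cross product) method.

```python
from typing import Tuple

Point = Tuple[float, float]


def _orient(p: Point, q: Point, r: Point) -> float:
    """Cross product of (q - p) and (r - p); the sign gives the turn direction."""
    return (q[0] - p[0]) * (r[1] - p[1]) - (q[1] - p[1]) * (r[0] - p[0])


def _within_box(p: Point, q: Point, r: Point) -> bool:
    """True if r lies in the bounding box of segment pq (r assumed collinear)."""
    return (min(p[0], q[0]) <= r[0] <= max(p[0], q[0])
            and min(p[1], q[1]) <= r[1] <= max(p[1], q[1]))


def segments_intersect(a: Point, b: Point, c: Point, d: Point) -> bool:
    """True if segment ab and segment cd share at least one point."""
    d1, d2 = _orient(c, d, a), _orient(c, d, b)
    d3, d4 = _orient(a, b, c), _orient(a, b, d)
    if d1 * d2 < 0 and d3 * d4 < 0:  # proper crossing
        return True
    # Touching or collinear overlap cases.
    return ((d1 == 0 and _within_box(c, d, a)) or (d2 == 0 and _within_box(c, d, b))
            or (d3 == 0 and _within_box(a, b, c)) or (d4 == 0 and _within_box(a, b, d)))


def pre_check(annotation: Tuple[Point, Point], anchors: Tuple[Point, Point]) -> str:
    """Return 'success' if the annotation segment meets the anchor segment."""
    return "success" if segments_intersect(*annotation, *anchors) else "failure"
```

Under this reading, an annotation line drawn between two columns crosses the segment joining anchors taken from those columns, while a line drawn elsewhere does not.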
  6. The pre-verification method for an image labeling task according to claim 5, wherein the step of outputting the pre-verification success signal if the determination result is yes specifically comprises the following steps:
    if the determination result is yes, determining whether the lines corresponding to the first equation and the second equation are parallel to or coincide with each other;
    if the lines corresponding to the first equation and the second equation are neither parallel nor coincident, outputting the pre-verification success signal; and
    if the lines corresponding to the first equation and the second equation are parallel to or coincide with each other, outputting the pre-verification failure signal.
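The additional check in claim 6 can be sketched with a cross product of direction vectors, which is zero exactly when two lines are parallel or coincident. As before, representing each line by two points, the tolerance `eps`, and the function names are assumptions of this sketch.

```python
from typing import Tuple

Point = Tuple[float, float]


def parallel_or_coincident(a: Point, b: Point, c: Point, d: Point,
                           eps: float = 1e-9) -> bool:
    """True if line ab and line cd have the same direction (zero cross product)."""
    cross = (b[0] - a[0]) * (d[1] - c[1]) - (b[1] - a[1]) * (d[0] - c[0])
    return abs(cross) <= eps


def refined_pre_check(has_intersection: bool,
                      a: Point, b: Point, c: Point, d: Point) -> str:
    """Combine the intersection result with the parallel/coincident test."""
    if has_intersection and not parallel_or_coincident(a, b, c, d):
        return "success"
    return "failure"
```

The `has_intersection` flag is supplied by whatever intersection test precedes this step; an annotation lying on top of the anchor line is rejected even though it technically shares points with it.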
  7. The pre-verification method for an image labeling task according to claim 6, wherein the image labeling task further carries invalid anchor point data, and the step of outputting the pre-verification success signal if the determination result is no specifically comprises the following steps:
    obtaining, in the same coordinate system, a third equation and a fourth equation corresponding respectively to the data annotation information and the invalid anchor point data;
    determining whether the lines corresponding to the fourth equation and the third equation have an intersection point;
    if the lines corresponding to the fourth equation and the third equation have an intersection point, outputting the pre-verification failure signal; and
    if the lines corresponding to the fourth equation and the third equation have no intersection point, outputting the pre-verification success signal.
  8. A pre-verification apparatus for an image labeling task, wherein the apparatus comprises:
    a request receiving module, configured to receive a pre-verification request sent by a user terminal, the pre-verification request carrying at least original image information and data annotation information;
    an anchor point acquisition module, configured to perform a shunt recognition operation on the original image information based on a shunt recognition algorithm to obtain anchor point data corresponding to the original image information;
    an annotation determination module, configured to determine whether the data annotation information meets a preset anchor point requirement; and
    a result output module, configured to output a pre-verification success signal to the user terminal if the preset anchor point requirement is met.
  9. A computer device, comprising a memory, a processor, and computer-readable instructions stored in the memory and executable on the processor, wherein the processor, when executing the computer-readable instructions, implements the following steps of a pre-verification method for an image labeling task:
    receiving a pre-verification request sent by a user terminal, the pre-verification request carrying at least original image information and data annotation information;
    performing a shunt recognition operation on the original image information based on a shunt recognition algorithm to obtain anchor point data corresponding to the original image information;
    determining whether the data annotation information meets a preset anchor point requirement; and
    if the preset anchor point requirement is met, outputting a pre-verification success signal to the user terminal.
  10. The computer device according to claim 9, wherein the step of performing a shunt recognition operation on the original image information based on a shunt recognition algorithm to obtain anchor point data corresponding to the original image information specifically comprises the following steps:
    performing a row cutting operation on the original image information to obtain row image information;
    performing a recognition operation on the row image information to obtain text boxes;
    performing a column cutting operation on the original image information according to the text boxes to obtain multiple columns of column image information carrying the text boxes;
    selecting one text box from the column image information of each of every two adjacent columns to form a text box group; and
    using the center points of the text boxes in the text box group as the anchor point data.
  11. The computer device according to claim 10, wherein the step of selecting one text box from the column image information of each of every two adjacent columns to form a text box group specifically comprises the following step:
    using, as the text box group, the two text boxes in the column image information of the two adjacent columns that are farthest apart in the vertical-axis direction.
  12. The computer device according to claim 10, wherein the step of performing a column cutting operation on the original image information according to the text boxes to obtain multiple columns of column image information carrying the text boxes specifically comprises the following step:
    cutting along a column cutting line, the column cutting line being the vertical straight line that passes through the fewest text boxes.
  13. The computer device according to claim 9, wherein the step of determining whether the data annotation information meets the preset anchor point requirement specifically comprises the following steps:
    obtaining, in the same coordinate system, a first equation and a second equation corresponding respectively to the data annotation information and the anchor point data;
    determining whether the lines corresponding to the second equation and the first equation have an intersection point;
    if the lines corresponding to the second equation and the first equation have an intersection point, outputting the pre-verification success signal; and
    if the lines corresponding to the second equation and the first equation have no intersection point, outputting a pre-verification failure signal.
  14. The computer device according to claim 13, wherein the step of outputting the pre-verification success signal if the determination result is yes specifically comprises the following steps:
    if the determination result is yes, determining whether the lines corresponding to the first equation and the second equation are parallel to or coincide with each other;
    if the lines corresponding to the first equation and the second equation are neither parallel nor coincident, outputting the pre-verification success signal; and
    if the lines corresponding to the first equation and the second equation are parallel to or coincide with each other, outputting the pre-verification failure signal.
  15. A computer-readable storage medium, wherein the computer-readable instructions, when executed by a processor, cause the processor to perform the following steps of the pre-verification method for the image labeling task:
    receiving a pre-verification request sent by a user terminal, the pre-verification request carrying at least original image information and data annotation information;
    performing a shunt recognition operation on the original image information based on a shunt recognition algorithm to obtain anchor point data corresponding to the original image information;
    determining whether the data annotation information meets a preset anchor point requirement; and
    if the preset anchor point requirement is met, outputting a pre-verification success signal to the user terminal.
  16. The computer-readable storage medium according to claim 15, wherein the step of performing a shunt recognition operation on the original image information based on a shunt recognition algorithm to obtain anchor point data corresponding to the original image information specifically comprises the following steps:
    performing a row cutting operation on the original image information to obtain row image information;
    performing a recognition operation on the row image information to obtain text boxes;
    performing a column cutting operation on the original image information according to the text boxes to obtain multiple columns of column image information carrying the text boxes;
    selecting one text box from the column image information of each of every two adjacent columns to form a text box group; and
    using the center points of the text boxes in the text box group as the anchor point data.
  17. The computer-readable storage medium according to claim 16, wherein the step of selecting one text box from the column image information of each of every two adjacent columns to form a text box group specifically comprises the following step:
    using, as the text box group, the two text boxes in the column image information of the two adjacent columns that are farthest apart in the vertical-axis direction.
  18. The computer-readable storage medium according to claim 16, wherein the step of performing a column cutting operation on the original image information according to the text boxes to obtain multiple columns of column image information carrying the text boxes specifically comprises the following step:
    cutting along a column cutting line, the column cutting line being the vertical straight line that passes through the fewest text boxes.
  19. The computer-readable storage medium according to claim 15, wherein the step of determining whether the data annotation information meets the preset anchor point requirement specifically comprises the following steps:
    obtaining, in the same coordinate system, a first equation and a second equation corresponding respectively to the data annotation information and the anchor point data;
    determining whether the lines corresponding to the second equation and the first equation have an intersection point;
    if the lines corresponding to the second equation and the first equation have an intersection point, outputting the pre-verification success signal; and
    if the lines corresponding to the second equation and the first equation have no intersection point, outputting a pre-verification failure signal.
  20. The computer-readable storage medium according to claim 19, wherein the step of outputting the pre-verification success signal if the determination result is yes specifically comprises the following steps:
    if the determination result is yes, determining whether the lines corresponding to the first equation and the second equation are parallel to or coincide with each other;
    if the lines corresponding to the first equation and the second equation are neither parallel nor coincident, outputting the pre-verification success signal; and
    if the lines corresponding to the first equation and the second equation are parallel to or coincide with each other, outputting the pre-verification failure signal.
PCT/CN2020/117379 2020-02-21 2020-09-24 Image annotation task pre-verification method and apparatus, device, and storage medium WO2021164251A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010106306.0 2020-02-21
CN202010106306.0A CN111428749A (en) 2020-02-21 2020-02-21 Image annotation task pre-verification method, device, equipment and storage medium

Publications (1)

Publication Number Publication Date
WO2021164251A1 (en) 2021-08-26

Family

ID=71547037

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/117379 WO2021164251A1 (en) 2020-02-21 2020-09-24 Image annotation task pre-verification method and apparatus, device, and storage medium

Country Status (2)

Country Link
CN (1) CN111428749A (en)
WO (1) WO2021164251A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111259980B (en) * 2020-02-10 2023-10-03 北京小马慧行科技有限公司 Method and device for processing annotation data
CN111428749A (en) * 2020-02-21 2020-07-17 平安科技(深圳)有限公司 Image annotation task pre-verification method, device, equipment and storage medium
CN112766254B (en) * 2021-01-19 2024-02-02 中国工商银行股份有限公司 Automatic test method and device for inputting credential information
CN112836732B (en) * 2021-01-25 2024-04-19 深圳市声扬科技有限公司 Verification method and device for data annotation, electronic equipment and storage medium
CN115577379B (en) * 2022-11-09 2023-05-09 中孚安全技术有限公司 Hierarchical protection security analysis method, system and equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108073914A (en) * 2018-01-10 2018-05-25 成都品果科技有限公司 A kind of animal face key point mask method
CN108549893A (en) * 2018-04-04 2018-09-18 华中科技大学 A kind of end-to-end recognition methods of the scene text of arbitrary shape
CN110245716A (en) * 2019-06-20 2019-09-17 杭州睿琪软件有限公司 Sample labeling auditing method and device
CN111428749A (en) * 2020-02-21 2020-07-17 平安科技(深圳)有限公司 Image annotation task pre-verification method, device, equipment and storage medium

Also Published As

Publication number Publication date
CN111428749A (en) 2020-07-17

Similar Documents

Publication Publication Date Title
WO2021164251A1 (en) Image annotation task pre-verification method and apparatus, device, and storage medium
US9235758B1 (en) Robust method to find layout similarity between two documents
WO2018099125A1 (en) Method and system for processing displayed content in overlapping windows
WO2021143059A1 (en) Method, apparatus, and device for determining map area, and storage medium
US11861919B2 (en) Text recognition method and device, and electronic device
WO2021217543A1 (en) Image annotation method, apparatus, device and medium
CN112016510A (en) Signal lamp identification method and device based on deep learning, equipment and storage medium
CN116844177A (en) Table identification method, apparatus, device and storage medium
CN115357155A (en) Window identification method, device, equipment and computer readable storage medium
CN111492407B (en) System and method for map beautification
CN113837194A (en) Image processing method, image processing apparatus, electronic device, and storage medium
CN110989880B (en) Interface element processing method and device and readable storage medium
CN107146098B (en) Advertisement operation configuration method and equipment
WO2021174688A1 (en) Facial detection method and system
CN115589786A (en) Method, device and system for recognizing hand-drawn figure and computer readable storage medium
CN111368915A (en) Drawing verification method, device, equipment and storage medium
WO2022105120A1 (en) Text detection method and apparatus from image, computer device and storage medium
CN115840550A (en) Angle-adaptive display screen display method, device and medium
CN111783180B (en) Drawing splitting method and related device
CN112784175B (en) Method, device, equipment and storage medium for processing interest point data
CN114328783A (en) Map data output method, map data processing method and device and electronic equipment
CN114996542A (en) Data distribution-based horizontal histogram display method and related equipment
CN114140813A (en) High-precision map marking method, device, equipment and storage medium
CN113435289A (en) Space division method, device, equipment and storage medium
CN112740151A (en) Data input method of terminal equipment, terminal equipment and storage medium

Legal Events

Date Code Title Description
121 Ep: the EPO has been informed by WIPO that EP was designated in this application (Ref document number: 20919774; Country of ref document: EP; Kind code of ref document: A1)
NENP Non-entry into the national phase (Ref country code: DE)
122 Ep: PCT application non-entry in European phase (Ref document number: 20919774; Country of ref document: EP; Kind code of ref document: A1)