CN114359367B - Data labeling method and device, computer equipment and storage medium - Google Patents


Info

Publication number
CN114359367B
Authority
CN
China
Prior art keywords
picture
processed
data set
marking
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210251029.1A
Other languages
Chinese (zh)
Other versions
CN114359367A (en
Inventor
李爱林
吴海艋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Huafu Technology Co ltd
Original Assignee
Shenzhen Huafu Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Huafu Information Technology Co ltd filed Critical Shenzhen Huafu Information Technology Co ltd
Priority to CN202210251029.1A priority Critical patent/CN114359367B/en
Publication of CN114359367A publication Critical patent/CN114359367A/en
Application granted granted Critical
Publication of CN114359367B publication Critical patent/CN114359367B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Landscapes

  • Processing Or Creating Images (AREA)

Abstract

The embodiment of the invention discloses a data labeling method, a data labeling device, computer equipment and a storage medium. The method comprises the following steps: acquiring login account information and password information; judging whether an existing data set exists according to the account information and the password information; if not, creating a data set and determining the labeling information of the data set; determining a picture to be processed according to the data set, and determining the labeling information of the picture to be processed; processing the picture to be processed with the corresponding labeling tool according to the labeling information of the picture to be processed to obtain a processing result; judging whether the picture to be processed has been labeled according to the processing result; if yes, saving the processing result; if not, marking the picture to be processed as requiring no label to obtain a new picture, and saving the new picture. By implementing the method provided by the embodiment of the invention, no desktop application software needs to be installed, the labeled data is highly secure, the labeling types and forms are rich, labeling is performed on a canvas drawing board, and the labeling result files have a uniform format.

Description

Data labeling method and device, computer equipment and storage medium
Technical Field
The present invention relates to data processing methods, and more particularly, to a data labeling method, apparatus, computer device, and storage medium.
Background
The image recognition technology is an important research subject in the field of machine learning, and for an image to be recognized, a target object in image data is recognized through a neural network model, wherein the target object can be a motor vehicle, a person, an animal, an object and the like. And image labeling, namely adding text information related to the content of the target object in the image to be recognized, and providing a good basis for image recognition.
Image data must be labeled before image recognition. In conventional image data labeling, the annotator installs a labeling tool locally to do the labeling work, and both the imported data files to be labeled and the exported labeling result files must be stored on a local disk. This wastes storage space unnecessarily and can cause data security problems such as data loss and data leakage. In addition, the labeling result files produced by different labeling tools do not share a format: for example, the default storage format of LabelImg is XML while that of LabelMe is JSON, so format conversion is needed before use. Current labeling forms are also limited, supporting only basic operations such as drawing boxes and plotting points.
Therefore, a new method is needed that requires no desktop application software to be installed, keeps the labeled data highly secure, offers rich labeling types and forms, and produces labeling result files in a uniform format.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provides a data annotation method, a data annotation device, computer equipment and a storage medium.
In order to achieve the purpose, the invention adopts the following technical scheme: the data annotation method comprises the following steps:
acquiring login account information and password information;
judging whether an existing data set exists or not according to the account information and the password information;
if the existing data set does not exist, creating the data set, and determining the marking information of the data set;
determining a picture to be processed according to the data set, and determining the labeling information of the picture to be processed;
processing the picture to be processed by adopting a corresponding marking tool according to the marking information of the picture to be processed to obtain a processing result;
judging whether the picture to be processed is marked according to the processing result;
if the to-be-processed picture is marked, storing the processing result;
and if the picture to be processed is not marked, marking the picture to be processed as requiring no label to obtain a new picture, and storing the new picture.
The further technical scheme is as follows: the creating a data set and determining the annotation information of the data set include:
Creating a data set, and setting a default picture of the data set to be null;
obtaining screening conditions;
uploading, according to the screening conditions, the corresponding pictures to the data set in slices by using js;
and setting the type label of each picture in the data set to form the labeling information of the data set.
The further technical scheme is as follows: the annotation information of the data set comprises the name of the picture, whether the picture is in an intelligent annotation form, whether the picture is in a team annotation form and an annotation tool adopted when the picture enters an annotation page.
The further technical scheme is as follows: after judging whether an existing data set exists according to the account information and the password information, the method further comprises the following steps:
and if the existing data set exists, executing the step of determining the picture to be processed according to the data set, and determining the marking information of the picture to be processed.
The further technical scheme is as follows: the processing the picture to be processed by adopting a corresponding marking tool according to the marking information of the picture to be processed to obtain a processing result, comprising:
drawing the picture to be processed on a canvas drawing board;
acquiring the height and width of the picture to be processed, and adjusting the picture to be processed according to the height and width so as to display the picture to be processed in a visible area of a canvas drawing board;
Determining a corresponding marking tool according to the marking information of the picture to be processed, and adding a corresponding figure to the canvas drawing board by using the marking tool;
associating the graph with the picture to be processed to obtain a processing result;
and saving the processing result to an array variable.
The further technical scheme is as follows: the associating the graph with the picture to be processed to obtain a processing result, comprising:
determining a corresponding label according to the graph;
combining the graph and the label to form a marked graph group;
and associating the graph in the marked graph group with the label, keeping their colors uniform, and combining the group with the picture to be processed to obtain a processing result.
The further technical scheme is as follows: the judging whether the picture to be processed is marked according to the processing result comprises the following steps:
judging whether the array length in the array variable changes or not;
if the array length in the array variable changes, determining that the picture to be processed is marked;
and if the array length in the array variable does not change, determining that the picture to be processed is not marked.
The invention also provides a data labeling device, comprising:
the acquisition unit is used for acquiring login account information and password information;
a data set judging unit, configured to judge whether an existing data set exists according to the account information and the password information;
the creating unit is used for creating the data set and determining the labeling information of the data set if the existing data set does not exist;
the picture labeling unit is used for determining a picture to be processed according to the data set and determining labeling information of the picture to be processed;
the processing unit is used for processing the picture to be processed by adopting a corresponding marking tool according to the marking information of the picture to be processed so as to obtain a processing result;
the label judging unit is used for judging whether the picture to be processed has a label according to the processing result;
the storage unit is used for storing the processing result if the picture to be processed has the label;
and the marking unit is used for marking the picture to be processed as requiring no label if the picture to be processed is not labeled, obtaining a new picture and storing the new picture.
The invention also provides a computer device, which comprises a memory and a processor, wherein the memory is stored with a computer program, and the processor executes the computer program to realize the method.
The invention also provides a storage medium storing a computer program which, when executed by a processor, implements the method described above.
Compared with the prior art, the invention has the following beneficial effects: whether a data set needs to be newly created is determined according to the login account information and password information; the picture to be processed and its labeling information are determined according to the data set, and the labeling information determines details such as the labeling form and the labeling type; the picture to be processed is processed with the corresponding labeling tool; a picture that has been labeled, and labeled correctly, is saved, encrypted and uploaded; and a picture that needs no label is marked as such and then saved. As a result, no desktop application software needs to be installed, the labeled data is highly secure, the labeling types and forms are rich, labeling is performed on a canvas drawing board, and the labeling result files have a uniform format.
The invention is further described below with reference to the figures and the specific embodiments.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings required to be used in the description of the embodiments are briefly introduced below, and it is obvious that the drawings in the description below are some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on the drawings without creative efforts.
Fig. 1 is a schematic view of an application scenario of a data annotation method according to an embodiment of the present invention;
fig. 2 is a schematic flowchart of a data annotation method according to an embodiment of the present invention;
FIG. 3 is a sub-flow diagram of a data annotation method according to an embodiment of the present invention;
FIG. 4 is a sub-flow diagram of a data annotation method according to an embodiment of the present invention;
FIG. 5 is a sub-flow diagram of a data annotation method according to an embodiment of the present invention;
FIG. 6 is a sub-flow diagram of a data annotation method according to an embodiment of the present invention;
fig. 7 is a schematic diagram illustrating sliced uploading of the corresponding pictures according to an embodiment of the present invention;
FIG. 8 is a schematic diagram of polygon segmentation provided in accordance with an embodiment of the present invention;
FIG. 9 is a schematic diagram of point plotting provided by an embodiment of the present invention;
FIG. 10 is a schematic diagram of box drawing provided by an embodiment of the present invention;
FIG. 11 is a schematic diagram of circle drawing provided by an embodiment of the present invention;
FIG. 12 is a schematic illustration of a line drawing provided by an embodiment of the present invention;
FIG. 13 is a schematic block diagram of a data annotation device provided in an embodiment of the present invention;
FIG. 14 is a schematic block diagram of a creating unit of a data annotation device provided by an embodiment of the invention;
FIG. 15 is a schematic block diagram of a processing unit of a data annotation device provided in an embodiment of the present invention;
FIG. 16 is a schematic block diagram of an association subunit of a data annotation device provided in an embodiment of the present invention;
fig. 17 is a schematic block diagram of a labeling judgment unit of the data labeling apparatus according to the embodiment of the present invention;
FIG. 18 is a schematic block diagram of a computer apparatus provided by an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
It will be understood that the terms "comprises" and/or "comprising," when used in this specification and the appended claims, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
It is also to be understood that the terminology used in the description of the invention herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. As used in the specification of the present invention and the appended claims, the singular forms "a," "an," and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise.
It should be further understood that the term "and/or" as used in this specification and the appended claims refers to any and all possible combinations of one or more of the associated listed items and includes such combinations.
Referring to fig. 1 and fig. 2, fig. 1 is a schematic view of an application scenario of a data annotation method according to an embodiment of the present invention, and fig. 2 is a schematic flowchart of the data annotation method. The data annotation method is applied to a server, which exchanges data with a terminal. Login account information and password information are obtained through the terminal. When the account corresponding to the account information and the password information has already created a data set, it is only necessary to label the pictures to be processed in the existing data set and to mark the pictures that need no label. When the account has not created a data set, a data set is created, the corresponding pictures are imported into it, the labeling information of each picture is determined, and the corresponding labeling tool is selected according to the labeling information for processing. In this way no desktop application software needs to be installed, the labeled data is highly secure, the labeling types and forms are rich, and the labeling result files have a uniform format. In this embodiment, the server is a cloud platform; data annotation is performed on the cloud platform, and annotators can work without installing desktop application software.
As shown in fig. 2, the method includes the following steps S110 to S180.
S110, obtaining login account information and password information.
In this embodiment, the account information and the password information refer to the account and corresponding password used to log in for data labeling. What is available after login differs between accounts: for an account that has already created a data set, labeling can start directly after logging in again; for an account that has not created one, a data set must be created after a successful login so that the pictures to be labeled can be imported.
S120, judging whether an existing data set exists according to the account information and the password information.
In this embodiment, after a successful login with the account information and the password information, whether a data set exists is determined from the content of the account: if the account contains a data set, an existing data set exists; otherwise, it does not.
S130, if the existing data set does not exist, the data set is created, and the marking information of the data set is determined.
In this embodiment, the data set is a set formed by a plurality of pictures to be processed, and the labeling information includes the name of the picture, whether intelligent labeling is used, whether team labeling is used, and the labeling tool used when entering the labeling page. The team labeling form provides multiple roles such as administrator, annotator and auditor; data permissions and the division of labor among the roles are clearer, so labeling efficiency and data security are higher. The AI intelligent labeling form completes data labeling directly through an intelligent labeling algorithm, reducing manual labor.
The labeling tools cover a richer set of labeling types, extended to classification, detection, key points, segmentation and the like, with different labeling methods and strategies for different labeling types.
When the data set is created, the type label of the data set can be selected, including the name, whether intelligent labeling is used, and whether team labeling is used; the tools available on the labeling page and the parameters to be set may differ between labeling types. The labeling information of the pictures in the data set therefore needs to be set first, so that subsequent processing can proceed directly according to it.
In an embodiment, referring to fig. 3, the step S130 may include steps S131 to S134.
S131, creating a data set, and setting default pictures of the data set to be empty.
In this embodiment, an empty data set is created, then default pictures of the data set are set to be empty, and then corresponding pictures are sequentially uploaded into the data set to form a non-empty data set.
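As an illustration of S131 and the labeling information listed for S130, the following is a minimal sketch of an empty data set record. The function name `createDataSet`, the field names and the list of annotation types are assumptions made for this example; the patent does not specify a data layout.

```javascript
// Assumed set of annotation types, taken from the types named in the description.
const ANNOTATION_TYPES = ["classification", "detection", "keypoint", "segmentation"];

// Create an empty data set together with its labeling information.
// All field names here are hypothetical.
function createDataSet({ name, annotationType, smart = false, team = false, tool = null }) {
  if (!name) {
    throw new Error("name is required");
  }
  if (!ANNOTATION_TYPES.includes(annotationType)) {
    throw new Error(`unknown annotation type: ${annotationType}`);
  }
  return {
    name,
    pictures: [], // default pictures are empty until something is uploaded
    labelingInfo: { annotationType, smart, team, tool },
  };
}
```

Pictures would then be appended to `pictures` as they are uploaded, turning the empty data set into a non-empty one.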
S132, obtaining screening conditions.
In this embodiment, the filtering condition refers to a condition for filtering a plurality of pictures to form elements in the data set, and may be a filtering condition for the size of the picture, or may be a filtering condition for an application scene, and the like, depending on the actual situation.
S133, according to the screening conditions, uploading the corresponding pictures to the data set in slices by using js.
In this embodiment, all elements in the data set are managed with paging, and screening conditions can be entered to select the corresponding pictures of the data set. The default picture of the data set is empty, and multiple pictures or picture folders can be added at one time. js is used for sliced uploading, as shown in fig. 7, and the upload progress and upload results are monitored. Pictures are matched by a regular expression and filtered automatically, so files do not need to be sorted or renamed beforehand, which makes the operation convenient.
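The regular-expression filtering and sliced uploading described above could look roughly like the sketch below. The extension list, the function names and the `sendChunk` callback are assumptions for illustration; the patent only states that js uploads in slices, that progress is monitored, and that pictures are matched by a regular expression.

```javascript
// Assumed list of picture extensions; the patent does not give the actual pattern.
const IMAGE_PATTERN = /\.(jpe?g|png|bmp|gif|webp)$/i;

// Keep only the file names that look like pictures.
function filterPictures(fileNames) {
  return fileNames.filter((name) => IMAGE_PATTERN.test(name));
}

// Split `size` bytes into [start, end) ranges of at most `chunkSize` bytes.
function planChunks(size, chunkSize) {
  if (!(chunkSize > 0)) {
    throw new RangeError("chunkSize must be positive");
  }
  const chunks = [];
  for (let start = 0; start < size; start += chunkSize) {
    chunks.push({ index: chunks.length, start, end: Math.min(start + chunkSize, size) });
  }
  return chunks;
}

// Send the slices one after another and report progress after each one.
// `sendChunk` is supplied by the caller (hypothetical); in a browser it might
// POST file.slice(chunk.start, chunk.end) to the server.
async function uploadInChunks(size, chunkSize, sendChunk, onProgress) {
  const chunks = planChunks(size, chunkSize);
  for (const chunk of chunks) {
    await sendChunk(chunk);
    onProgress((chunk.index + 1) / chunks.length);
  }
  return chunks.length;
}
```

Sending the slices sequentially keeps the progress value monotonic; a real uploader might send several slices in parallel and retry failed ones.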
S134, setting the type label of each picture in the data set to form the labeling information of the data set.
In this embodiment, after the picture is uploaded, the type tag needs to be set, that is, the label information needs to be set, so as to ensure that the setting is executed according to the label information of the picture when the picture is subsequently processed.
S140, determining a picture to be processed according to the data set, and determining the labeling information of the picture to be processed.
In this embodiment, a part of pictures is extracted from the data set to determine the pictures to be processed, and the labeling information corresponding to the pictures to be processed is sequentially determined.
S150, processing the picture to be processed by adopting a corresponding labeling tool according to the labeling information of the picture to be processed to obtain a processing result.
In this embodiment, the processing result refers to the result of operating on the picture to be processed, whether or not labels were added.
The annotator selects an assigned labeling task to enter the labeling page and labels the assigned pictures. The labeling page is divided into a title operation bar, a toolbar and a drawing area. The toolbar uses a configuration JSON to provide different labeling tools for different labeling types, which prevents misoperation. Keyboard events are monitored through keydown, so that tools and pictures can be switched by pressing fixed keys.
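A configuration-driven toolbar with keyboard shortcuts might be sketched as follows. The configuration contents, the key bindings and the function names are all assumptions; the patent only says that a JSON configuration assigns tools to labeling types and that keydown events switch tools and pictures.

```javascript
// Assumed configuration: which tools each annotation type exposes in the toolbar.
const TOOLBAR_CONFIG = {
  classification: [],
  detection: ["rectangle"],
  keypoint: ["point"],
  segmentation: ["polygon", "circle", "line"],
};

// Assumed shortcut keys for the tools.
const KEY_BINDINGS = { r: "rectangle", p: "point", g: "polygon", c: "circle", l: "line" };

function toolsFor(annotationType) {
  const known = Object.prototype.hasOwnProperty.call(TOOLBAR_CONFIG, annotationType);
  return known ? TOOLBAR_CONFIG[annotationType] : [];
}

// Pure handler for a keydown event: returns the next { tool, pictureIndex } state.
function handleKeydown(state, key, annotationType, pictureCount) {
  const lastIndex = Math.max(0, pictureCount - 1);
  if (key === "ArrowRight") {
    return { ...state, pictureIndex: Math.min(state.pictureIndex + 1, lastIndex) };
  }
  if (key === "ArrowLeft") {
    return { ...state, pictureIndex: Math.max(state.pictureIndex - 1, 0) };
  }
  const tool = KEY_BINDINGS[key.toLowerCase()];
  // Shortcuts for tools that this annotation type does not expose are ignored,
  // which is one way to prevent misoperation.
  if (typeof tool === "string" && toolsFor(annotationType).includes(tool)) {
    return { ...state, tool };
  }
  return state;
}
```

In a browser the handler would be wired up with something like `document.addEventListener("keydown", (e) => { state = handleKeydown(state, e.key, type, pictures.length); })`, where `state`, `type` and `pictures` are application variables.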
In an embodiment, referring to fig. 4, the step S150 may include steps S151 to S155.
S151, drawing the picture to be processed on a canvas drawing board.
Specifically, the picture to be marked is drawn on the canvas drawing board by a drawImage method.
S152, obtaining the height and the width of the picture to be processed, and adjusting the picture to be processed according to the height and the width so as to display the picture to be processed in a visible area of the canvas drawing board.
In this embodiment, the height and width of the original picture to be processed are obtained so that the picture keeps its aspect ratio and can be displayed completely in the visible area of the drawing board; the same ratio is used uniformly when the labels are saved.
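One way to realize S151 and S152 is to compute a single scale factor that fits the picture into the drawing board and to reuse that factor when converting label coordinates back to the original picture. The sketch below assumes the picture is centered and never enlarged beyond its original size; these choices and the function names are illustrative, not taken from the patent. `drawImage(image, dx, dy, dWidth, dHeight)` is the standard 2D canvas call.

```javascript
// Compute how to fit a picture into the canvas while keeping its aspect ratio.
// Assumption: the picture is centered and never scaled above 1.
function fitToCanvas(imgWidth, imgHeight, canvasWidth, canvasHeight) {
  const scale = Math.min(canvasWidth / imgWidth, canvasHeight / imgHeight, 1);
  const width = imgWidth * scale;
  const height = imgHeight * scale;
  return {
    scale,
    width,
    height,
    offsetX: (canvasWidth - width) / 2,
    offsetY: (canvasHeight - height) / 2,
  };
}

// Convert a point on the canvas back to original-picture coordinates,
// so that saved labels all use the same ratio.
function toImageCoords(point, fit) {
  return {
    x: (point.x - fit.offsetX) / fit.scale,
    y: (point.y - fit.offsetY) / fit.scale,
  };
}

// Draw the picture at the fitted position and size.
function drawFitted(ctx, image, fit) {
  ctx.drawImage(image, fit.offsetX, fit.offsetY, fit.width, fit.height);
}
```

For a 1600 x 800 picture on an 800 x 600 canvas this gives a scale of 0.5 and a vertical offset of 100, so the canvas point (400, 300) maps back to (800, 400) in the original picture.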
S153, determining a corresponding labeling tool according to the labeling information of the picture to be processed, and adding a corresponding graph to the canvas drawing board by using the labeling tool.
In this embodiment, the labeling tools include rectangle, polygon, point, straight line, dashed line, circle, and other related auxiliary tools.
Specifically, the labeling forms of the labeling tools include polygon segmentation, point plotting, box drawing, circle drawing and line drawing, as shown in fig. 8 to 12.
S154, associating the graph with the picture to be processed to obtain a processing result.
In an embodiment, referring to fig. 5, the step S154 may include steps S1541 to S1543.
S1541, determining a corresponding label according to the graph.
In this embodiment, the label refers to the mark that corresponds to a labeling graph; for example, the label of a graph drawn around a target object identifies that target object.
S1542, combining the graph and the label to form a labeled graph group.
In this embodiment, the labeled graphic group refers to a graphic group formed by combining a labeled graphic and a label.
S1543, associating the graphics in the marked graphic group with the labels, keeping uniform colors, and combining the images with the to-be-processed pictures to obtain a processing result.
In this embodiment, when a labeling graph is added to the canvas drawing board, the corresponding label is selected, and the label and the graph are combined into a labeled graph group. The label and the graph are then created, deleted, modified and queried as one unit, their positions are associated with each other, and they share the same color.
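The labeled graph group of S1541 to S1543 can be pictured as one object that holds both the graph and its label text, so that moving or deleting one always affects the other. The sketch below is an assumed data model: the color table, the field names and the choice to anchor the label text at the graph's first point are illustrative only.

```javascript
// Assumed label-to-color table and fallback color.
const LABEL_COLORS = { car: "#e6194b", person: "#3cb44b" };
const DEFAULT_COLOR = "#4363d8";
let nextGroupId = 1;

// Combine a graph and its label into one group with a shared color.
function createGroup(shape, label) {
  if (!shape.points || shape.points.length === 0) {
    throw new Error("shape needs at least one point");
  }
  const color = Object.prototype.hasOwnProperty.call(LABEL_COLORS, label)
    ? LABEL_COLORS[label]
    : DEFAULT_COLOR;
  const anchor = shape.points[0];
  return {
    id: nextGroupId++,
    label,
    shape: { ...shape, stroke: color },
    text: { content: label, x: anchor.x, y: anchor.y, fill: color },
  };
}

// Move graph and label text together; returns a new group.
function moveGroup(group, dx, dy) {
  const points = group.shape.points.map((p) => ({ x: p.x + dx, y: p.y + dy }));
  return {
    ...group,
    shape: { ...group.shape, points },
    text: { ...group.text, x: points[0].x, y: points[0].y },
  };
}

// Delete graph and label text together.
function removeGroup(groups, id) {
  return groups.filter((g) => g.id !== id);
}
```

Because every operation works on the whole group, the graph and its label cannot drift apart in position or color.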
S155, storing the processing result into an array variable.
In this embodiment, all processing results are stored in an array variable and added to the cache.
S160, judging whether the picture to be processed is marked according to the processing result.
In this embodiment, a picture to be processed that has not been labeled needs to be marked as requiring no label, so that labeled pictures, unlabeled pictures and pictures that need no label can be distinguished.
In an embodiment, referring to fig. 6, the step S160 may include steps S161 to S163.
S161, judging whether the array length in the array variable changes;
S162, if the array length in the array variable changes, determining that the picture to be processed is marked;
S163, if the array length in the array variable does not change, determining that the picture to be processed is not marked.
After the picture to be processed is labeled, the array is longer than it was before labeling, so the picture can be judged to be labeled whenever the array length has changed.
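The array-length check of S161 to S163 can be sketched as a small store that remembers the array length at the moment a picture is opened and compares against it later. The names `createAnnotationStore`, `openPicture` and `isLabeled` are hypothetical.

```javascript
// Holds the "array variable" of processing results and detects length changes.
function createAnnotationStore() {
  const results = [];
  let lengthAtOpen = 0;
  return {
    // Call when a picture is opened: remember the current array length.
    openPicture() {
      lengthAtOpen = results.length;
    },
    // Call when a processing result (for example a labeled graph group) is saved.
    add(result) {
      results.push(result);
    },
    // The picture counts as labeled if the array length has changed since it was opened.
    isLabeled() {
      return results.length !== lengthAtOpen;
    },
    size() {
      return results.length;
    },
  };
}
```

A usage pattern would be to call `openPicture()` when a picture is shown and `isLabeled()` when the annotator leaves it.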
S170, if the picture to be processed is marked, storing the processing result.
In this embodiment, after labeling is completed, the labeled picture is reviewed to check whether the labeling is correct. If the labeling is incorrect, the picture must be labeled again; if it is correct, the processing result is saved, encrypted, and uploaded to the subsequent operation flow to ensure the security of the labeled data.
S180, if the picture to be processed is not marked, marking the picture to be processed as requiring no label to obtain a new picture, and storing the new picture.
When the picture to be processed is handled according to its labeling information and no labeling tool is bound in that information, the picture is processed in the no-tool mode; that is, it is passed through without processing, and therefore remains unlabeled.
If there is an existing data set, the step S140 is executed.
When pictures are switched, whether labels exist is judged from the length of the array. If no label exists and the no-label-needed option has not been selected, the picture cannot be saved and its state remains unlabeled; a picture that needs no label is saved only after it has been marked as such; and if labels exist, the picture is saved to the background data.
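Read this way, the decision made when switching pictures has three outcomes. The sketch below encodes that reading; the status strings and the `noLabelSelected` flag are assumptions, since the patent does not name them.

```javascript
// Decide what happens to the current picture when the annotator switches away.
// `annotationCount` is the number of labels on the picture and `noLabelSelected`
// says whether the annotator chose the "needs no label" option (hypothetical flag).
function resolveOnSwitch(annotationCount, noLabelSelected) {
  if (annotationCount > 0) {
    return { status: "labeled", save: true };
  }
  if (noLabelSelected) {
    return { status: "no_label_needed", save: true };
  }
  return { status: "unlabeled", save: false };
}
```

Only the first two outcomes lead to a save; an untouched picture simply stays in the unlabeled state.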
In this embodiment, the whole method is developed with vue + es6 + css, an open-source technical scheme. Canvas and componentization are used where appropriate; lazy loading and preloading speed up picture presentation; and localStorage is used to cache pictures in batches, which avoids fetching the picture from the interface again each time the picture is switched and allows the annotator to return promptly to the previous picture to view and modify its labels. Multiple labeling scenarios are supported, such as individual labeling, team labeling and AI intelligent labeling, as are multiple labeling types, such as classification, detection, key point and segmentation labeling. The labeling data is encrypted during labeling and the labeling state is recorded in real time, which ensures the flexibility, adaptability, extensibility and security of data labeling and effectively saves labor cost.
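Batch caching in localStorage could be sketched as below. The helpers take any object with `getItem` and `setItem`, so the browser's `localStorage` can be passed in directly; the key format, the record shape and the in-memory stand-in are assumptions for illustration.

```javascript
// Assumed key format for cached picture records.
const keyFor = (index) => `annotation:picture:${index}`;

// Store one batch of picture records, starting at `startIndex`.
function storeBatch(storage, startIndex, pictures) {
  pictures.forEach((picture, i) => {
    storage.setItem(keyFor(startIndex + i), JSON.stringify(picture));
  });
}

// Read one picture record, or null when it has not been cached yet.
function readPicture(storage, index) {
  const raw = storage.getItem(keyFor(index));
  return raw === null ? null : JSON.parse(raw);
}

// First index of the batch that contains `index`.
function batchStartFor(index, batchSize) {
  return Math.floor(index / batchSize) * batchSize;
}

// In-memory stand-in with the same getItem/setItem behavior, for use outside a browser.
function createMemoryStorage() {
  const items = new Map();
  return {
    getItem: (key) => (items.has(key) ? items.get(key) : null),
    setItem: (key, value) => {
      items.set(key, String(value));
    },
  };
}
```

When `readPicture` returns null, the application would fetch the batch starting at `batchStartFor(index, batchSize)` from the interface and pass it to `storeBatch`; switching back to an earlier picture then needs no new request.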
Optimization such as data encryption and marking process record can be added in the data marking process, and the data security of the marked data is ensured.
According to the data labeling method, whether a data set needs to be newly created is determined according to the login account information and password information; the picture to be processed and its labeling information are determined according to the data set, and the labeling information determines details such as the labeling form and the labeling type; the picture to be processed is processed with the corresponding labeling tool; a picture that has been labeled, and labeled correctly, is saved, encrypted and uploaded; and a picture that needs no label is marked as such and then saved. As a result, no desktop application software needs to be installed, the labeled data is highly secure, the labeling types and forms are rich, labeling is performed on a canvas drawing board, and the labeling result files have a uniform format.
Fig. 13 is a schematic block diagram of a data annotation device 300 according to an embodiment of the present invention. As shown in fig. 13, the present invention further provides a data annotation device 300 corresponding to the above data annotation method. The data annotation device 300 includes a unit for performing the data annotation method described above, and the device may be configured in a server. Specifically, referring to fig. 13, the data labeling apparatus 300 includes an obtaining unit 301, a data set determining unit 302, a creating unit 303, a picture labeling unit 304, a processing unit 305, a label determining unit 306, a saving unit 307, and a labeling unit 308.
An obtaining unit 301, configured to obtain login account information and password information; a data set determining unit 302, configured to determine whether an existing data set exists according to the account information and the password information, wherein if the existing data set exists, the picture to be processed is determined according to the data set and the labeling information of the picture to be processed is determined; a creating unit 303, configured to create a data set if an existing data set does not exist, and determine labeling information of the data set; a picture labeling unit 304, configured to determine, according to the data set, a picture to be processed, and determine labeling information of the picture to be processed; a processing unit 305, configured to process the to-be-processed picture by using a corresponding labeling tool according to the labeling information of the to-be-processed picture, so as to obtain a processing result; a label judging unit 306, configured to judge whether the to-be-processed picture has a label according to the processing result; a saving unit 307, configured to save the processing result if the to-be-processed picture has a label; and a marking unit 308, configured to mark the to-be-processed picture as requiring no label if the to-be-processed picture is not labeled, obtain a new picture, and store the new picture.
In one embodiment, as shown in fig. 14, the creating unit 303 includes a data set processing sub-unit 3031, a condition obtaining sub-unit 3032, an uploading sub-unit 3033, and a type setting sub-unit 3034.
The data set processing subunit 3031 is configured to create a data set and set the default picture of the data set to null; the condition obtaining subunit 3032 is configured to obtain a screening condition; the uploading subunit 3033 is configured to upload the corresponding pictures to the data set in chunks, using js, according to the screening condition; and the type setting subunit 3034 is configured to set a type label for each picture in the data set, so as to form the labeling information of the data set.
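Uploading in chunks means cutting each file into byte ranges that are sent one at a time. The patent only states that js is used, so the sketch below is an assumption about one plain way to compute those ranges; `chunkRanges` and `ChunkRange` are hypothetical names, and the network transfer itself is left out.

```typescript
// One contiguous byte range of a file; `end` is exclusive.
interface ChunkRange {
  index: number;
  start: number;
  end: number;
}

// Split a file of `totalBytes` into ranges of at most `chunkBytes` each.
function chunkRanges(totalBytes: number, chunkBytes: number): ChunkRange[] {
  if (totalBytes < 0 || Math.floor(totalBytes) !== totalBytes) {
    throw new RangeError("totalBytes must be a non-negative integer");
  }
  if (chunkBytes <= 0 || Math.floor(chunkBytes) !== chunkBytes) {
    throw new RangeError("chunkBytes must be a positive integer");
  }
  const ranges: ChunkRange[] = [];
  let index = 0;
  for (let start = 0; start < totalBytes; start += chunkBytes) {
    // The last chunk is shorter when the size is not a multiple of chunkBytes.
    ranges.push({ index, start, end: Math.min(start + chunkBytes, totalBytes) });
    index += 1;
  }
  return ranges;
}
```

In a browser, each range could be passed to `Blob.slice(start, end)` to obtain the piece to send. The chunk size is a tuning choice that the patent does not specify.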
In one embodiment, as shown in fig. 15, the processing unit 305 includes a drawing sub-unit 3051, a height and width obtaining sub-unit 3052, a tool determination sub-unit 3053, an association sub-unit 3054, and a result saving sub-unit 3055.
The drawing subunit 3051 is configured to draw the picture to be processed on a canvas drawing board; the height and width acquisition subunit 3052 is configured to acquire the height and width of the picture to be processed, and adjust the picture to be processed according to the height and width, so that the picture is displayed within the visible area of the canvas drawing board; the tool determination subunit 3053 is configured to determine a corresponding labeling tool according to the labeling information of the picture to be processed, and add a corresponding graphic to the canvas drawing board by using the labeling tool; the association subunit 3054 is configured to associate the graphic with the picture to be processed to obtain a processing result; and the result saving subunit 3055 is configured to save the processing result to an array variable.
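Adjusting the picture by its height and width so that it fits the visible area amounts to choosing one scale factor for both axes. The patent does not give a formula, so the following is a sketch under assumptions: `fitToCanvas`, `Size`, and `Placement` are hypothetical names, the picture is centred, and small pictures are not enlarged.

```typescript
interface Size {
  width: number;
  height: number;
}

interface Placement extends Size {
  scale: number; // factor applied to both axes
  x: number;     // left offset inside the visible area
  y: number;     // top offset inside the visible area
}

// Scale the picture uniformly so it fits inside the visible area, then centre it.
function fitToCanvas(picture: Size, visible: Size): Placement {
  if (picture.width <= 0 || picture.height <= 0) {
    throw new RangeError("picture must have positive width and height");
  }
  // The smaller ratio guarantees that both dimensions fit; the cap at 1
  // (an assumption) avoids enlarging pictures that already fit.
  const scale = Math.min(
    visible.width / picture.width,
    visible.height / picture.height,
    1
  );
  const width = picture.width * scale;
  const height = picture.height * scale;
  return {
    scale,
    width,
    height,
    x: (visible.width - width) / 2,
    y: (visible.height - height) / 2,
  };
}
```

In a browser, the result could be handed to `CanvasRenderingContext2D.drawImage(image, x, y, width, height)`. Keeping `scale` around is useful because annotation coordinates drawn on the canvas have to be divided by it to map back to the original picture.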
In an embodiment, as shown in fig. 16, the association subunit 3054 comprises a tag determination module 30541, a combination module 30542 and a position association module 30543.
The tag determination module 30541 is configured to determine a corresponding label according to the graphic; the combination module 30542 is configured to combine the graphic and the label to form a marked graphic group; and the position association module 30543 is configured to associate the positions of the graphic and the label in the marked graphic group, keep their colors uniform, and combine the group with the picture to be processed to obtain a processing result.
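One way to read the marked graphic group is as a record that keeps a graphic and its label together, so that they share a color and move as one. The sketch below illustrates that reading only; `Shape`, `Tag`, `MarkedGroup`, `makeGroup`, and `moveGroup` are hypothetical names, and anchoring the label at the first vertex is an assumption.

```typescript
type Point = [number, number];

interface Shape {
  kind: "rect" | "polygon" | "point"; // illustrative tool kinds
  points: Point[];
  color: string;
}

interface Tag {
  text: string;
  x: number;
  y: number;
  color: string;
}

interface MarkedGroup {
  picture: string; // name of the picture the group belongs to
  shape: Shape;
  tag: Tag;
}

// Combine a graphic and a label into one group with a single shared color.
function makeGroup(picture: string, shape: Shape, text: string, color: string): MarkedGroup {
  const anchor = shape.points[0];
  if (anchor === undefined) {
    throw new RangeError("shape needs at least one point");
  }
  return {
    picture,
    shape: { kind: shape.kind, points: shape.points, color },
    tag: { text, x: anchor[0], y: anchor[1], color },
  };
}

// Moving the group shifts the graphic and the label by the same offset,
// which keeps their positions associated.
function moveGroup(group: MarkedGroup, dx: number, dy: number): MarkedGroup {
  const points = group.shape.points.map((p): Point => [p[0] + dx, p[1] + dy]);
  return {
    picture: group.picture,
    shape: { kind: group.shape.kind, points, color: group.shape.color },
    tag: { text: group.tag.text, x: group.tag.x + dx, y: group.tag.y + dy, color: group.tag.color },
  };
}
```

Both functions return new objects instead of mutating their inputs, which makes it straightforward to push each result into an array of processing results.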
In one embodiment, as shown in fig. 17, the label determining unit 306 includes a length determination subunit 3061, a first determination subunit 3062, and a second determination subunit 3063.
The length determination subunit 3061 is configured to determine whether the array length of the array variable changes; the first determination subunit 3062 is configured to determine that the picture to be processed has a label if the array length changes; and the second determination subunit 3063 is configured to determine that the picture to be processed has no label if the array length does not change.
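The length check can be pictured as comparing the array length recorded when a picture is opened with the length at the time of the check. The class below is a minimal sketch of that idea; `ResultTracker` and its method names are hypothetical, and the patent does not say when the reference length is recorded.

```typescript
// Holds the array variable of processing results and remembers its length
// at the moment a picture is opened (an assumed reference point).
class ResultTracker<T> {
  private results: T[] = [];
  private lengthAtOpen = 0;

  // Call when a picture is opened for annotation.
  openPicture(): void {
    this.lengthAtOpen = this.results.length;
  }

  // Call each time a processing result is saved.
  save(result: T): void {
    this.results.push(result);
  }

  // The picture counts as labelled if the array length has changed.
  hasLabel(): boolean {
    return this.results.length !== this.lengthAtOpen;
  }
}
```

A length comparison is cheap, but it only sees net changes: if this class were extended with deletion, adding one result and then deleting it would leave the length unchanged and the picture would be reported as unlabelled.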
It should be noted that, as will be clear to those skilled in the art, for the specific implementation of the data annotation device 300 and each of its units, reference may be made to the corresponding descriptions in the foregoing method embodiments; for convenience and brevity, the details are not repeated here.
The data annotation device 300 can be implemented in the form of a computer program that can be run on a computer apparatus as shown in fig. 18.
Referring to fig. 18, fig. 18 is a schematic block diagram of a computer device according to an embodiment of the present application. The computer device 500 may be a server, where the server may be an independent server or a server cluster composed of a plurality of servers.
Referring to fig. 18, the computer device 500 includes a processor 502, memory, and a network interface 505 connected by a system bus 501, where the memory may include a non-volatile storage medium 503 and an internal memory 504.
The non-volatile storage medium 503 may store an operating system 5031 and computer programs 5032. The computer programs 5032 include program instructions that, when executed, cause the processor 502 to perform a data annotation process.
The processor 502 is used to provide computing and control capabilities to support the operation of the overall computer device 500.
The internal memory 504 provides an environment for the operation of the computer program 5032 in the non-volatile storage medium 503, and when the computer program 5032 is executed by the processor 502, the processor 502 can be caused to perform a data annotation method.
The network interface 505 is used for network communication with other devices. Those skilled in the art will appreciate that the configuration shown in fig. 18 is a block diagram of only the part of the configuration relevant to the present application and does not limit the computer device 500 to which the present application is applied; a particular computer device 500 may include more or fewer components than shown, combine certain components, or have a different arrangement of components.
Wherein the processor 502 is configured to run the computer program 5032 stored in the memory to perform the steps of:
acquiring login account information and password information; judging whether an existing data set exists or not according to the account information and the password information; if the existing data set does not exist, creating the data set, and determining the labeling information of the data set; determining a picture to be processed according to the data set, and determining the labeling information of the picture to be processed; processing the picture to be processed by adopting a corresponding marking tool according to the marking information of the picture to be processed to obtain a processing result; judging whether the picture to be processed has a label according to the processing result; if the picture to be processed has a label, storing the processing result; and if the picture to be processed has no label, marking the picture to be processed as having no label to obtain a new picture, and storing the new picture.
In an embodiment, when the processor 502 implements the steps of creating a data set and determining the label information of the data set, the following steps are specifically implemented:
creating a data set, and setting a default picture of the data set to be null; obtaining screening conditions; uploading the corresponding pictures to the data set in chunks, using js, according to the screening conditions; and setting the type label of each picture in the data set to form the labeling information of the data set.
The annotation information of the data set comprises the name of the picture, whether intelligent annotation is used, whether team annotation is used, and the annotation tool adopted when entering the annotation page.
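The four items listed for the annotation information map naturally onto a small record type. The following sketch is illustrative only: the field names, the `AnnotationTool` values, and the `summarize` helper are hypothetical, since the patent lists the items but not their representation.

```typescript
// Tool names are illustrative; the patent does not enumerate them here.
type AnnotationTool = "rectangle" | "polygon" | "point";

interface DataSetAnnotationInfo {
  pictureName: string;       // name of the picture
  smartAnnotation: boolean;  // whether intelligent annotation is used
  teamAnnotation: boolean;   // whether team annotation is used
  entryTool: AnnotationTool; // tool selected when the annotation page opens
}

// Produce a one-line, human-readable summary of the settings.
function summarize(info: DataSetAnnotationInfo): string {
  const mode = info.teamAnnotation ? "team" : "individual";
  const smart = info.smartAnnotation ? "smart" : "manual";
  return info.pictureName + ": " + mode + ", " + smart + ", opens with " + info.entryTool;
}
```

Storing the entry tool alongside the picture lets the annotation page pick the matching tool without a further lookup.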
In an embodiment, after the step of determining whether there is an existing data set according to the account information and the password information is implemented, the processor 502 further implements the following steps:
and if the existing data set exists, executing the step of determining the picture to be processed according to the data set, and determining the marking information of the picture to be processed.
In an embodiment, when the processor 502 implements the step of processing the to-be-processed picture by using the corresponding labeling tool according to the labeling information of the to-be-processed picture to obtain the processing result, the following steps are specifically implemented:
Drawing the picture to be processed on a canvas drawing board; acquiring the height and width of the picture to be processed, and adjusting the picture to be processed according to the height and width so as to display the picture to be processed in a visible area of a canvas drawing board; determining a corresponding marking tool according to the marking information of the picture to be processed, and adding a corresponding figure to the canvas drawing board by using the marking tool; associating the graph with the picture to be processed to obtain a processing result; and saving the processing result to an array variable.
In an embodiment, when the processor 502 implements the step of associating the graph with the to-be-processed picture to obtain the processing result, the following steps are specifically implemented:
determining a corresponding label according to the graphic; combining the graphic and the label to form a marked graphic group; and associating the positions of the graphic and the label in the marked graphic group, keeping their colors uniform, and combining the group with the picture to be processed to obtain a processing result.
In an embodiment, when the processor 502 implements the step of judging whether the picture to be processed has a label according to the processing result, the following steps are specifically implemented:
judging whether the array length of the array variable changes; if the array length changes, determining that the picture to be processed has a label; and if the array length does not change, determining that the picture to be processed has no label.
It should be understood that, in the embodiments of the present application, the processor 502 may be a Central Processing Unit (CPU), and the processor 502 may also be another general-purpose processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field-Programmable Gate Array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, a discrete hardware component, and the like. The general-purpose processor may be a microprocessor, or the processor may be any conventional processor or the like.
It will be understood by those skilled in the art that all or part of the flow of the method implementing the above embodiments may be implemented by a computer program instructing associated hardware. The computer program includes program instructions, and the computer program may be stored in a storage medium, which is a computer-readable storage medium. The program instructions are executed by at least one processor in the computer system to implement the flow steps of the embodiments of the method described above.
Accordingly, the present invention also provides a storage medium. The storage medium may be a computer-readable storage medium. The storage medium stores a computer program, wherein the computer program, when executed by a processor, causes the processor to perform the steps of:
acquiring login account information and password information; judging whether an existing data set exists or not according to the account information and the password information; if the existing data set does not exist, creating the data set, and determining the labeling information of the data set; determining a picture to be processed according to the data set, and determining the labeling information of the picture to be processed; processing the picture to be processed by adopting a corresponding marking tool according to the marking information of the picture to be processed to obtain a processing result; judging whether the picture to be processed has a label according to the processing result; if the picture to be processed has a label, storing the processing result; and if the picture to be processed has no label, marking the picture to be processed as having no label to obtain a new picture, and storing the new picture.
In an embodiment, when the step of creating the data set and determining the label information of the data set is implemented by the processor executing the computer program, the following steps are specifically implemented:
creating a data set, and setting a default picture of the data set to be null; obtaining screening conditions; uploading the corresponding pictures to the data set in chunks, using js, according to the screening conditions; and setting the type label of each picture in the data set to form the labeling information of the data set.
The annotation information of the data set comprises the name of the picture, whether intelligent annotation is used, whether team annotation is used, and the annotation tool adopted when entering the annotation page.
In an embodiment, after the step of determining whether the existing data set exists according to the account information and the password information is implemented by executing the computer program, the processor further implements the following steps:
and if the existing data set exists, executing the step of determining the picture to be processed according to the data set, and determining the marking information of the picture to be processed.
In an embodiment, when the processor executes the computer program to implement the step of processing the to-be-processed picture by using a corresponding labeling tool according to the labeling information of the to-be-processed picture to obtain a processing result, the following steps are specifically implemented:
Drawing the picture to be processed on a canvas drawing board; acquiring the height and width of the picture to be processed, and adjusting the picture to be processed according to the height and width so as to display the picture to be processed in a visible area of a canvas drawing board; determining a corresponding marking tool according to the marking information of the picture to be processed, and adding a corresponding figure to the canvas drawing board by using the marking tool; associating the graph with the picture to be processed to obtain a processing result; and saving the processing result to an array variable.
In an embodiment, when the processor executes the computer program to implement the step of associating the graph with the picture to be processed to obtain the processing result, the following steps are specifically implemented:
determining a corresponding label according to the graphic; combining the graphic and the label to form a marked graphic group; and associating the positions of the graphic and the label in the marked graphic group, keeping their colors uniform, and combining the group with the picture to be processed to obtain a processing result.
In an embodiment, when the processor executes the computer program to implement the step of determining whether the to-be-processed picture has a label according to the processing result, the following steps are specifically implemented:
judging whether the array length of the array variable changes; if the array length changes, determining that the picture to be processed has a label; and if the array length does not change, determining that the picture to be processed has no label.
The storage medium may be a USB flash drive, a removable hard disk, a Read-Only Memory (ROM), a magnetic disk, an optical disk, or any other computer-readable storage medium capable of storing program code.
Those of ordinary skill in the art will appreciate that the various illustrative components and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both, and that the components and steps of the various examples have been described above generally in terms of their functionality in order to clearly illustrate this interchangeability of hardware and software. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the technical solution. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present invention.
In the several embodiments provided in the present invention, it should be understood that the disclosed apparatus and method may be implemented in other manners. For example, the above-described apparatus embodiments are merely illustrative. For example, the division of each unit is only one logic function division, and there may be another division manner in actual implementation. For example, various elements or components may be combined or may be integrated in another system or some features may be omitted, or not implemented.
The steps in the method of the embodiments of the invention can be reordered, combined and deleted according to actual needs. The units in the device of the embodiments of the invention can be merged, divided and deleted according to actual needs. In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a storage medium. Based on such understanding, the technical solution of the present invention in essence, or the part of it that contributes to the prior art, or all or part of the technical solution, can be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a terminal, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention.
While the invention has been described with reference to specific embodiments, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (4)

1. A data labeling method, characterized by comprising the following steps:
acquiring login account information and password information;
judging whether an existing data set exists or not according to the account information and the password information;
if the existing data set does not exist, creating the data set, and determining the marking information of the data set;
if the existing data set exists, determining a picture to be processed according to the data set, and determining the marking information of the picture to be processed;
processing the picture to be processed by adopting a corresponding marking tool according to the marking information of the picture to be processed to obtain a processing result;
judging whether the picture to be processed is marked according to the processing result;
if the to-be-processed picture is marked, storing the processing result;
if the to-be-processed picture is not marked, marking the to-be-processed picture without marking a label to obtain a new picture, and storing the new picture;
the creating a data set and determining the annotation information of the data set comprise:
creating a data set, and setting a default picture of the data set to be null;
obtaining screening conditions;
uploading the corresponding pictures to the data set in chunks, using js, according to the screening conditions;
setting a type label of each picture in the data set to form marking information of the data set;
the marking information of the data set comprises the name of the picture, whether the picture is in an intelligent marking form or not, whether the picture is in a team marking form or not and a marking tool adopted when the picture enters a marking page;
the processing the picture to be processed by adopting a corresponding marking tool according to the marking information of the picture to be processed to obtain a processing result, comprising:
drawing the picture to be processed on a canvas drawing board;
acquiring the height and width of the picture to be processed, and adjusting the picture to be processed according to the height and width so as to display the picture to be processed in a visible area of a canvas drawing board;
determining a corresponding marking tool according to the marking information of the picture to be processed, and adding a corresponding marked graph to the canvas drawing board by using the marking tool;
associating the graph with the picture to be processed to obtain a processing result;
storing the processing result into an array variable;
the associating the graph with the picture to be processed to obtain a processing result, including:
determining a corresponding label according to the graph;
combining the graph and the label to form a marked graph group;
associating the graphics in the marked graphic group with the label, keeping uniform colors, and combining the images with the pictures to be processed to obtain a processing result;
the judging whether the picture to be processed is marked according to the processing result comprises the following steps:
judging whether the array length in the array variable changes or not;
if the array length in the array variable changes, determining that the picture to be processed is marked;
and if the array length in the array variable does not change, determining that the picture to be processed is not marked.
2. A data annotation device, comprising:
the acquisition unit is used for acquiring login account information and password information;
the data set judging unit is used for judging whether an existing data set exists according to the account information and the password information;
the creating unit is used for creating the data set and determining the labeling information of the data set if the existing data set does not exist;
the picture labeling unit is used for determining a picture to be processed according to the data set and determining labeling information of the picture to be processed if the existing data set exists;
the processing unit is used for processing the picture to be processed by adopting a corresponding marking tool according to the marking information of the picture to be processed so as to obtain a processing result;
the marking judging unit is used for judging whether the picture to be processed has marks according to the processing result;
the storage unit is used for storing the processing result if the picture to be processed has the label;
the marking unit is used for marking the picture to be processed without a label if the picture to be processed is not marked, obtaining a new picture and storing the new picture;
the creating unit comprises a data set processing subunit, a condition acquiring subunit, an uploading subunit and a type setting subunit;
the data set processing subunit is used for creating a data set and setting a default picture of the data set to be null;
the condition obtaining subunit is used for obtaining a screening condition;
the uploading subunit is used for uploading the corresponding pictures to the data set in chunks by adopting js according to the screening conditions;
the type setting subunit is configured to set a type tag of each picture in the data set to form tagging information of the data set;
the marking information of the data set comprises the name of the picture, whether the picture is in an intelligent marking form or not, whether the picture is in a team marking form or not and a marking tool adopted when the picture enters a marking page;
the processing unit comprises a drawing subunit, a height and width acquisition subunit, a tool determination subunit, an association subunit and a result storage subunit;
the drawing subunit is configured to draw the picture to be processed on a canvas drawing board;
the height and width acquisition subunit is configured to acquire the height and width of the picture to be processed, and adjust the picture to be processed according to the height and width, so as to display the picture to be processed in a visible area of the canvas drawing board;
the tool determining subunit is configured to determine a corresponding marking tool according to the marking information of the picture to be processed, and add a corresponding graphic to the canvas drawing board by using the marking tool;
the association subunit is configured to associate the graph with the to-be-processed picture to obtain a processing result;
the result storage subunit is used for storing the processing result into an array variable;
the association subunit comprises a label determining module, a combination module and a position association module;
the label determining module is used for determining a corresponding label according to the graph;
the combination module is used for combining the graph and the label to form a marked graph group;
the position association module is used for associating positions of the graphics and the labels in the marked graphic group, keeping uniform colors, and combining the images with the pictures to be processed to obtain a processing result;
the label judging unit comprises a length judging subunit, a first determining subunit and a second determining subunit;
the length judging subunit is configured to judge whether an array length in the array variable changes;
the first determining subunit is configured to determine that the to-be-processed picture has a label if the array length in the array variable changes;
and the second determining subunit is configured to determine that the to-be-processed picture is not marked if the array length in the array variable does not change.
3. A computer device, characterized in that the computer device comprises a memory and a processor, the memory storing a computer program, and the processor implementing the method of claim 1 when executing the computer program.
4. A storage medium, characterized in that the storage medium stores a computer program which, when executed by a processor, implements the method of claim 1.
CN202210251029.1A 2022-03-15 2022-03-15 Data labeling method and device, computer equipment and storage medium Active CN114359367B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210251029.1A CN114359367B (en) 2022-03-15 2022-03-15 Data labeling method and device, computer equipment and storage medium

Publications (2)

Publication Number Publication Date
CN114359367A CN114359367A (en) 2022-04-15
CN114359367B true CN114359367B (en) 2022-06-28

Family

ID=81094422

Country Status (1)

Country Link
CN (1) CN114359367B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114925337B (en) * 2022-05-18 2024-03-22 东软睿驰汽车技术(大连)有限公司 Data labeling method and device and electronic equipment

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP7098604B2 (en) * 2016-08-19 2022-07-11 ファイユージョン,インコーポレイテッド Automatic tagging of objects in a multi-view interactive digital media representation of a dynamic entity
CN110908784B (en) * 2019-11-12 2022-09-20 苏州智加科技有限公司 Image labeling method, device, equipment and storage medium
CN110888582B (en) * 2019-11-25 2022-01-25 Oppo(重庆)智能科技有限公司 Tag information processing method, device, storage medium and terminal
CN111367445B (en) * 2020-03-31 2021-07-09 中国建设银行股份有限公司 Image annotation method and device
CN112163424A (en) * 2020-09-17 2021-01-01 中国建设银行股份有限公司 Data labeling method, device, equipment and medium
CN112346809A (en) * 2020-11-09 2021-02-09 深圳市安软科技股份有限公司 Webpage image labeling method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN114359367A (en) 2022-04-15

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
CP03 Change of name, title or address

Address after: 518000 Room 201, building A, No. 1, Qian Wan Road, Qianhai Shenzhen Hong Kong cooperation zone, Shenzhen, Guangdong (Shenzhen Qianhai business secretary Co., Ltd.)

Patentee after: Shenzhen Huafu Technology Co.,Ltd.

Address before: 518000 Room 201, building A, 1 front Bay Road, Shenzhen Qianhai cooperation zone, Shenzhen, Guangdong

Patentee before: SHENZHEN HUAFU INFORMATION TECHNOLOGY Co.,Ltd.
