CN113792203A - Method and system for multiplexing label data, electronic device and readable storage medium - Google Patents

Info

Publication number
CN113792203A
Authority
CN
China
Prior art keywords
data
target data
preset
target
data set
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202111110072.8A
Other languages
Chinese (zh)
Other versions
CN113792203B (en)
Inventor
张玉玺
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chongqing Unisinsight Technology Co Ltd
Original Assignee
Chongqing Unisinsight Technology Co Ltd
Application filed by Chongqing Unisinsight Technology Co Ltd
Priority to CN202111110072.8A
Publication of CN113792203A
Application granted
Publication of CN113792203B
Legal status: Active

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90: Details of database functions independent of the retrieved data types
    • G06F16/906: Clustering; Classification
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20: Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25: Integrating or interfacing systems involving database management systems
    • G06F16/258: Data format conversion from or to a database
    • Y: GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02: TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D: CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00: Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Stereo-Broadcasting Methods (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention relates to the technical field of data annotation and discloses an annotation data multiplexing method and system, an electronic device and a readable storage medium. In the method, a browser end obtains a data set to be annotated, which includes unannotated data, and a preset annotation type. The browser end sends an export instruction to a server end; the export instruction includes the preset annotation type of target data and target data identification information, and the target data has been annotated in advance according to the preset annotation type. The browser end obtains the target data and merges it into the data set to be annotated to obtain a merged data set. The browser end then annotates the unannotated data in the merged data set according to the preset annotation type to generate a multiplexed data set for subsequent multiplexing. Because the already-annotated target data is merged into the data set to be annotated before the unannotated data in that data set is annotated, the target data is multiplexed, and operation cost and time cost are saved.

Description

Method and system for multiplexing label data, electronic device and readable storage medium
Technical Field
The invention relates to the technical field of data annotation, in particular to a method and a system for multiplexing annotated data, electronic equipment and a readable storage medium.
Background
With the development of information technology, the volume of image information is growing explosively, and such a large amount of image data cannot be annotated by manual processing alone. Image annotation is an image data processing method for machine vision: image materials to be annotated are imported into an image annotation platform, the platform annotates the images by methods such as target framing and target classification, and the image materials and annotation information are then packaged and managed. This greatly reduces labor cost and is of great significance to industries such as artificial intelligence and life safety. At present, however, data must be imported into an image annotation platform and annotated again before the annotation data can be used; annotation data that has already been produced cannot be multiplexed, which wastes a large amount of operation cost and time cost.
Disclosure of Invention
The following presents a simplified summary in order to provide a basic understanding of some aspects of the disclosed embodiments. This summary is not an extensive overview nor is intended to identify key/critical elements or to delineate the scope of such embodiments but rather as a prelude to the more detailed description that is presented later.
In view of the above-mentioned shortcomings of the prior art, the present invention provides a method and system for multiplexing annotation data, an electronic device, and a readable storage medium, so as to solve the problem that operation cost and time cost are wasted because already-annotated data cannot be multiplexed.
The method for multiplexing annotation data provided by the invention comprises the following steps: a browser end obtains a data set to be annotated and a preset annotation type; the browser end sends an export instruction to a server end, wherein the export instruction comprises the preset annotation type of target data and target data identification information, and annotation of the target data has been completed in advance according to the preset annotation type; the browser end obtains the target data and merges the target data into the data set to be annotated to obtain a merged data set; and the browser end annotates the unannotated data in the merged data set according to the preset annotation type to generate a multiplexed data set for subsequent multiplexing.
Optionally, the method for multiplexing annotation data further includes: after obtaining a data acquisition request sent by a third-party platform, the browser end obtains a data conversion rule according to the data acquisition request, wherein the data acquisition request comprises data type information required by the third-party platform; the browser end performs data conversion on data to be converted according to the data conversion rule to obtain a converted data set, wherein the data to be converted comprises at least a part of the multiplexed data set; and the browser end sends the converted data set to the third-party platform.
Optionally, the target data is obtained by: acquiring first original data and generating target identification information corresponding to the first original data; labeling the first original data according to the preset labeling type to obtain target data; and establishing and storing the corresponding relation among the target identification information, the preset labeling type and the target data.
Optionally, the merging, by the browser end, of the target data into the data set to be annotated includes: the browser end compares the target data with set data in the data set to be annotated to obtain a comparison result corresponding to the set data; and the browser end deletes from the data set to be annotated the set data whose comparison result is "identical", and adds the target data to the data set to be annotated to obtain the merged data set.
The method for multiplexing annotation data provided by the invention comprises the following steps: a server end receives an export instruction sent by a browser end, wherein the export instruction comprises a preset annotation type of target data and target data identification information, and annotation of the target data has been completed in advance according to the preset annotation type; the server end obtains the target data according to the preset annotation type and the target data identification information; and the server end sends the target data to the browser end, so that the browser end merges the target data into a data set to be annotated to obtain a merged data set, and annotates the unannotated data in the merged data set to generate a multiplexed data set for subsequent multiplexing.
Optionally, if the export instruction further includes export annotation information, the target data that the server end obtains according to the preset annotation type and the target data identification information includes annotation information.
Optionally, the obtaining, by the server, the target data according to the preset annotation type and the target data identification information includes: and the server side matches corresponding target data from a preset labeling data database according to the preset labeling type and the target data identification information, wherein the preset labeling data database comprises the target data, a preset labeling type corresponding to the target data and target data identification information corresponding to the target data.
The system for multiplexing annotation data provided by the invention comprises at least one of the following: a browser end, configured to obtain a data set to be annotated and a preset annotation type, send an export instruction to a server end, merge target data into the data set to be annotated after obtaining the target data to obtain a merged data set, and annotate the unannotated data in the merged data set to generate a multiplexed data set for subsequent multiplexing, wherein the export instruction comprises the preset annotation type of the target data and target data identification information, and the target data has been annotated in advance according to the preset annotation type; and a server end, configured to receive the export instruction sent by the browser end, obtain the target data according to the preset annotation type and the target data identification information in the export instruction, and send the target data to the browser end.
The electronic device provided by the invention comprises: a processor and a memory; the memory is used for storing computer programs, and the processor is used for executing the computer programs stored by the memory so as to make the electronic equipment execute the method.
The present invention provides a computer-readable storage medium having stored thereon a computer program; which when executed by a processor implements the method described above.
The invention has the following beneficial effects: the browser end obtains a data set to be annotated and a preset annotation type, and sends to the server end an export instruction containing the preset annotation type of the target data and target data identification information, so as to obtain target data that has been annotated in advance according to the preset annotation type; the target data is merged into the data set to be annotated to obtain a merged data set, and the unannotated data in the merged data set is annotated to generate a multiplexed data set for subsequent multiplexing. Compared with the prior art, the already-annotated target data is merged into the data set to be annotated before the unannotated data in that data set is annotated, so that the target data is multiplexed and operation cost and time cost are saved.
Drawings
FIG. 1 is a flow chart illustrating a method for multiplexing annotation data according to an embodiment of the present invention;
FIG. 2 is a flow chart illustrating another method for multiplexing annotation data according to an embodiment of the present invention;
FIG. 3 is a schematic diagram of a method of target data acquisition in an embodiment of the invention;
FIG. 4 is a timing diagram illustrating a method for multiplexing label data according to an embodiment of the present invention;
FIG. 5 is a schematic structural diagram of a system for multiplexing annotation data according to an embodiment of the present invention;
FIG. 6 is a schematic diagram of an electronic device in an embodiment of the invention.
Detailed Description
The embodiments of the present invention are described below with reference to specific embodiments, and other advantages and effects of the present invention will be easily understood by those skilled in the art from the disclosure of the present specification. The invention is capable of other and different embodiments and of being practiced or of being carried out in various ways, and its several details are capable of modification in various respects, all without departing from the spirit and scope of the present invention. It is to be noted that the features in the following embodiments and examples may be combined with each other without conflict.
It should be noted that the drawings provided in the following embodiments are only for illustrating the basic idea of the present invention, and the components related to the present invention are only shown in the drawings rather than drawn according to the number, shape and size of the components in actual implementation, and the type, quantity and proportion of the components in actual implementation may be changed freely, and the layout of the components may be more complicated.
In the following description, numerous details are set forth to provide a more thorough explanation of embodiments of the present invention, however, it will be apparent to one skilled in the art that embodiments of the present invention may be practiced without these specific details, and in other embodiments, well-known structures and devices are shown in block diagram form, rather than in detail, in order to avoid obscuring embodiments of the present invention.
The term "plurality" means two or more unless otherwise specified.
In the embodiments of the present disclosure, the character "/" indicates that the preceding and following objects are in an "or" relationship. For example, A/B represents: A or B.
The term "and/or" describes an association between objects and means that three relationships may exist. For example, A and/or B represents: A alone, B alone, or both A and B.
Referring to FIG. 1, the present invention provides a method for multiplexing annotation data, including:
step S101, a browser end obtains a data set to be annotated and a preset annotation type;
step S102, the browser end sends an export instruction to a server end;
step S103, the browser end obtains target data and merges the target data into the data set to be annotated to obtain a merged data set;
step S104, the browser end annotates the unannotated data in the merged data set according to the preset annotation type to generate a multiplexed data set for subsequent multiplexing.
The export instruction comprises the preset annotation type of the target data and target data identification information, and the target data has been annotated in advance according to the preset annotation type.
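As a non-limiting illustration, steps S101 to S104 can be sketched in Python as follows. This is only a minimal sketch under stated assumptions: the names `Item`, `ExportInstruction`, `build_multiplexed_set`, `fetch_targets` and `annotate` are hypothetical and do not appear in the disclosure, and the server end and the annotation step are passed in as plain functions.

```python
from dataclasses import dataclass, replace
from typing import Callable, List, Optional


@dataclass
class Item:
    item_id: str                      # hypothetical identifier of one piece of data
    annotation: Optional[str] = None  # None means the item is not yet annotated


@dataclass
class ExportInstruction:
    preset_annotation_type: str       # preset annotation type of the target data
    target_ids: List[str]             # target data identification information


def build_multiplexed_set(
    to_annotate: List[Item],
    preset_type: str,
    target_ids: List[str],
    fetch_targets: Callable[[ExportInstruction], List[Item]],
    annotate: Callable[[Item, str], str],
) -> List[Item]:
    # S102: send an export instruction to the server end (modelled as a callable).
    instruction = ExportInstruction(preset_type, list(target_ids))
    # S103: obtain the pre-annotated target data and merge it into the data set.
    targets = fetch_targets(instruction)
    merged = [replace(item) for item in to_annotate] + list(targets)
    # S104: annotate only the items that carry no annotation yet.
    for item in merged:
        if item.annotation is None:
            item.annotation = annotate(item, preset_type)
    return merged
```

In this sketch the target data keeps the annotation it already had, which is the multiplexing effect described above; only the remaining items go through `annotate`.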
With the annotation data multiplexing method provided by the invention, the browser end obtains the data set to be annotated and the preset annotation type, and sends to the server end an export instruction containing the preset annotation type of the target data and the target data identification information, so as to obtain target data that has been annotated in advance according to the preset annotation type; the target data is merged into the data set to be annotated to obtain a merged data set, and the unannotated data in the merged data set is annotated to generate a multiplexed data set for subsequent multiplexing. Compared with the prior art, the already-annotated target data is merged into the data set to be annotated before the unannotated data in that data set is annotated, so that the target data is multiplexed and operation cost and time cost are saved.
Optionally, the data set to be annotated includes annotated data and label-free data.
Optionally, the data set to be annotated is obtained by: generating an empty data set on the current conversion platform, wherein the empty data set has a corresponding relation with the preset annotation type; acquiring label-free data, adding the label-free data to the empty data set, and adding the resulting data set to the data set to be annotated; and acquiring annotated data, and adding the annotated data to the data set to be annotated.
Optionally, the annotated data is obtained by: acquiring second original data, second annotation information corresponding to the second original data and second tag information corresponding to the second original data; compressing the second original data, the second annotation information and the second tag information together, and determining the resulting compressed packet as the annotated data; the second annotation information and the second tag information are files in JSON (JavaScript Object Notation) format.
Optionally, the target data is obtained by: acquiring first original data and generating target identification information corresponding to the first original data; annotating the first original data according to the preset annotation type to obtain the target data; and establishing and storing the corresponding relation among the target identification information, the preset annotation type and the target data. Because this corresponding relation is established and stored, the browser end can conveniently send to the server end an export instruction comprising the preset annotation type of the target data and the target data identification information in order to obtain the target data. After obtaining the target data, the browser end merges the target data into the data set to be annotated to obtain a merged data set, and annotates the unannotated data in the merged data set to generate a multiplexed data set for subsequent multiplexing, so that the target data is multiplexed and operation cost and time cost are saved.
Optionally, the target data includes first original data, first annotation information corresponding to the first original data, and first tag information corresponding to the first original data.
Optionally, the corresponding relation among the target identification information, the preset annotation type and the target data is saved in a preset annotation data database.
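By way of example only, the stored corresponding relation can be pictured as a mapping keyed by the preset annotation type and the target identification information. The sketch below is an assumption for illustration: the class `AnnotationDatabase`, its in-memory dictionary, and the use of a random UUID as target identification information are hypothetical choices that the disclosure does not specify.

```python
import uuid
from typing import Callable, Dict, Optional, Tuple


class AnnotationDatabase:
    """In-memory stand-in for the preset annotation data database (hypothetical)."""

    def __init__(self) -> None:
        # (preset annotation type, target identification information) -> target data
        self._records: Dict[Tuple[str, str], dict] = {}

    def register(
        self,
        original: bytes,
        preset_type: str,
        annotate: Callable[[bytes, str], dict],
    ) -> str:
        # Generate target identification information for the first original data.
        target_id = uuid.uuid4().hex
        # Annotate the first original data according to the preset annotation type.
        target_data = {"original": original, "annotation": annotate(original, preset_type)}
        # Establish and store the corresponding relation.
        self._records[(preset_type, target_id)] = target_data
        return target_id

    def lookup(self, preset_type: str, target_id: str) -> Optional[dict]:
        # Returns None when no target data matches both keys.
        return self._records.get((preset_type, target_id))
```

Keying on both values means that the same identification information combined with a different preset annotation type does not match, which mirrors the two fields carried by the export instruction.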
Optionally, the merging, by the browser end, of the target data into the data set to be annotated includes: the browser end compares the target data with the set data in the data set to be annotated to obtain a comparison result corresponding to the set data; and the browser end deletes from the data set to be annotated the set data whose comparison result is "identical", and adds the target data to the data set to be annotated to obtain the merged data set. In this way, set data that duplicates the target data is deleted, the target data is merged into the data set to be annotated to obtain a merged data set, and the unannotated data in the merged data set is annotated, so that the target data is multiplexed and operation cost and time cost are saved.
Optionally, the set data includes annotated data and label-free data.
Optionally, the comparing, by the browser end, of the target data with the set data in the data set to be annotated to obtain a comparison result corresponding to the set data includes: if the set data is label-free data, the browser end compares the image feature value corresponding to the first original data in the target data with the image feature value corresponding to the label-free data to obtain the comparison result corresponding to the set data.
Optionally, the comparing, by the browser end, of the target data with the set data in the data set to be annotated to obtain a comparison result corresponding to the set data includes: if the set data is annotated data, the browser end compares the image feature value corresponding to the first original data in the target data with the image feature value corresponding to the second original data in the annotated data to obtain the comparison result corresponding to the set data.
Optionally, the image feature value is one of an MD5 (Message-Digest Algorithm 5) value, an image ID, or the like.
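The comparison and merge described above can be illustrated with the following minimal Python sketch, which assumes that the image feature value is the MD5 value of the raw image bytes. The function names and the dictionary layout with an `"original"` key are hypothetical and are used only for illustration.

```python
import hashlib
from typing import Dict, List


def md5_of(image_bytes: bytes) -> str:
    # Image feature value: MD5 digest of the raw image bytes, as a hex string.
    return hashlib.md5(image_bytes).hexdigest()


def merge_target_data(data_set: List[Dict], targets: List[Dict]) -> List[Dict]:
    # Feature values of the original data carried by the target data.
    target_hashes = {md5_of(t["original"]) for t in targets}
    # Delete set data whose feature value equals that of some target data ...
    kept = [d for d in data_set if md5_of(d["original"]) not in target_hashes]
    # ... and add the target data to obtain the merged data set.
    return kept + list(targets)
```

Collecting the target feature values in a set keeps each comparison at constant time, so the merge is linear in the size of the two data sets. An image ID could be substituted for the digest by replacing `md5_of` with a lookup of that ID.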
Optionally, the annotating, by the browser end, of the unannotated data in the merged data set according to the preset annotation type to generate a multiplexed data set for subsequent multiplexing includes: the browser end annotates the unannotated data in the merged data set according to the preset annotation type to obtain an annotation result corresponding to the unannotated data; and the annotation result is added to the merged data set to generate the multiplexed data set for subsequent multiplexing.
Optionally, the unannotated data includes one or more of label-free data, first original data, and second original data.
Optionally, the annotation result includes annotation information corresponding to the unannotated data and tag information corresponding to the unannotated data.
Optionally, if the label-free data, the first original data and the second original data are image data, the preset annotation type is one of image annotation types such as single-classification annotation (classification), multi-classification annotation (multi-label classification), object detection annotation (object detection), image segmentation annotation (semantic segmentation), and mixed annotation of object detection and classification.
In some embodiments, if the preset annotation type is single-classification annotation, the browser end annotates the unannotated data, and the annotation information obtained for the unannotated data is one of a set of animal tags; the animal tags are, for example, cat, dog, bird, etc.
In some embodiments, if the preset annotation type is single-classification annotation, the browser end annotates the unannotated data, and the annotation information obtained for the unannotated data is one of a set of vehicle tags; the vehicle tags are, for example, bus, car, off-road vehicle, etc.
In some embodiments, if the preset annotation type is multi-classification annotation, the browser end annotates the unannotated data, and the annotation information obtained for the unannotated data is one of a set of breed tags together with one of a set of color tags; the breed tags are, for example, cat breeds such as the short-haired cat, and the color tags are, for example, black, white, tiger stripe, orange, etc.
In some embodiments, if the preset annotation type is multi-classification annotation, the browser end annotates the unannotated data, and the annotation information obtained for the unannotated data is one of a set of vehicle brand tags together with one of a set of color tags; the brand tags are, for example, Volkswagen, BMW, Geely, etc., and the color tags are, for example, black, white, red, etc.
In some embodiments, if the preset annotation type is object detection annotation, the browser end annotates the unannotated data, and the annotation information obtained for the unannotated data is a target object and the coordinates of the rectangular box that frames the target object; the target object is, for example, a cat, a dog, a bird, etc.
In some embodiments, if the preset annotation type is image segmentation annotation, the browser end annotates the unannotated data, and the annotation information obtained for the unannotated data is the pixel coordinates of each region box.
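For illustration only, the annotation information produced under each preset annotation type can be modelled with simple data structures. The class and field names below are hypothetical; the disclosure states only what information each annotation type yields, not how it is represented.

```python
from dataclasses import asdict, dataclass
from typing import Dict, List, Tuple


@dataclass
class SingleClassAnnotation:
    tag: str                       # exactly one tag, e.g. "cat"


@dataclass
class MultiClassAnnotation:
    tags: Dict[str, str]           # one tag per tag group, e.g. {"color": "black"}


@dataclass
class BoundingBox:
    x_min: int
    y_min: int
    x_max: int
    y_max: int


@dataclass
class DetectionAnnotation:
    target: str                    # the target object, e.g. "dog"
    box: BoundingBox               # rectangular box that frames the target object


@dataclass
class SegmentationAnnotation:
    # Pixel coordinates of each region, one list of (x, y) points per region.
    regions: List[List[Tuple[int, int]]]


# Example: an object detection annotation serialised to a plain dictionary.
detection = DetectionAnnotation("cat", BoundingBox(10, 20, 110, 220))
detection_dict = asdict(detection)
```

Converting with `asdict` yields nested dictionaries that can be written out as JSON, which matches the JSON-format annotation files mentioned elsewhere in this description.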
Optionally, the annotating, by the browser end, of the unannotated data in the merged data set according to the preset annotation type to obtain an annotation result corresponding to the unannotated data includes: the browser end sends an annotation instruction to the current annotation platform, wherein the annotation instruction comprises the preset annotation type and the merged data set, and is used for triggering the current annotation platform to annotate the unannotated data in the merged data set according to the preset annotation type; and the browser end obtains the annotation result corresponding to the unannotated data fed back by the current annotation platform.
Optionally, the current annotation platform is one of traffic snapshot image annotation platforms, life picture annotation platforms, photo processing annotation platforms, and other annotation platforms.
Optionally, before the current annotation platform annotates the unannotated data in the merged data set according to the preset annotation type, the method further includes: the current annotation platform performs data verification on the merged data set to obtain a verification result; and if the verification result is valid, the current annotation platform annotates the unannotated data in the merged data set according to the preset annotation type.
Optionally, the current annotation platform determines that the verification result is invalid when the unannotated data of the merged data set does not meet a preset format condition, and determines that the verification result is valid when the unannotated data of the merged data set meets the preset format condition. In some embodiments, the preset format condition is that the data is an image file in JPEG (Joint Photographic Experts Group) format.
Optionally, the current annotation platform determines that the verification result is invalid when the unannotated data of the merged data set does not meet a preset data size condition, and determines that the verification result is valid when the unannotated data of the merged data set meets the preset data size condition. In some embodiments, the preset data size condition is that the data is smaller than 8 MB.
Optionally, the current annotation platform determines that the verification result is invalid when the target data of the merged data set does not meet a preset type condition, and determines that the verification result is valid when the unannotated data of the merged data set meets the preset type condition. In some embodiments, the preset type condition is single-classification annotation.
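The three verification conditions above can be sketched as follows. This is a sketch under stated assumptions rather than the platform's actual check: it treats "8M" as 8 MiB, recognises a JPEG file by its leading bytes `FF D8 FF`, applies the type condition to the annotation type of the target data, and combines the three optional conditions into a single check; all function names are hypothetical.

```python
from typing import Iterable

JPEG_MAGIC = b"\xff\xd8\xff"          # leading bytes of a JPEG file
MAX_SIZE_BYTES = 8 * 1024 * 1024      # assumption: "8M" is read as 8 MiB


def is_valid_item(data: bytes) -> bool:
    # Format condition: JPEG image file. Size condition: smaller than the limit.
    return data.startswith(JPEG_MAGIC) and len(data) < MAX_SIZE_BYTES


def verify_merged_set(
    unannotated: Iterable[bytes],
    target_types: Iterable[str],
    preset_type: str,
) -> bool:
    # Type condition: every piece of target data was annotated with the preset type.
    if any(t != preset_type for t in target_types):
        return False
    # Format and size conditions on every piece of unannotated data.
    return all(is_valid_item(d) for d in unannotated)
```

A result of `True` corresponds to a valid verification result, after which annotation may proceed; `False` corresponds to an invalid one.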
Optionally, the method for multiplexing annotation data further includes: the method comprises the steps that after a browser end obtains a data obtaining request sent by a third-party platform, a data conversion rule is obtained according to the data obtaining request, and the data obtaining request comprises data type information required by the third-party platform; the browser end carries out data conversion on data to be converted according to a data conversion rule to obtain a conversion data set, wherein the data to be converted comprises at least one part of multiplexing data set; and the browser end sends the conversion data set to a third-party platform. Therefore, the annotation data of the current annotation platform is converted into the annotation data of the third-party platform through the data conversion rule, so that the third-party platform can read the annotation data, the annotation data can be conveniently multiplexed and shared, and the utilization rate of the annotation data is improved.
Optionally, the third-party platform and the current annotation platform are different annotation platforms; the third-party platform is one of traffic snapshot image labeling platforms, life picture labeling platforms, photo processing labeling platforms and other labeling platforms.
In some embodiments, the data acquisition request further includes a data conversion rule, and the obtaining, by the browser end, of the data conversion rule includes: the browser end obtains the data conversion rule contained in the data acquisition request.
In some embodiments, the data acquisition request further includes third-party platform information, and the obtaining, by the browser end, of the data conversion rule includes: the browser end matches a data conversion rule from a preset data conversion rule database according to the third-party platform information and the data type information required by the third-party platform; the preset data conversion rule database stores the corresponding relation between the third-party platform information and the data conversion rules.
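As a purely illustrative sketch, both ways of obtaining the data conversion rule can be expressed in a few lines of Python. Everything concrete here is an assumption: the rule is modelled as a function, the preset data conversion rule database as a dictionary, and the example rule (corner coordinates converted to x, y, width, height) and the platform name `"platform-b"` are invented for the example.

```python
from typing import Callable, Dict, List, Tuple

Rule = Callable[[dict], dict]


def corners_to_xywh(record: dict) -> dict:
    # Hypothetical rule: (x_min, y_min, x_max, y_max) -> [x, y, width, height].
    x_min, y_min, x_max, y_max = record["box"]
    return {"label": record["label"], "bbox": [x_min, y_min, x_max - x_min, y_max - y_min]}


# Stand-in for the preset data conversion rule database:
# (third-party platform information, required data type) -> conversion rule.
RULE_DATABASE: Dict[Tuple[str, str], Rule] = {
    ("platform-b", "object-detection"): corners_to_xywh,
}


def convert_for_third_party(request: dict, multiplexed: List[dict]) -> List[dict]:
    # Case 1: the data acquisition request carries the conversion rule itself.
    rule = request.get("conversion_rule")
    if rule is None:
        # Case 2: match the rule by platform information and required data type.
        rule = RULE_DATABASE[(request["platform"], request["data_type"])]
    # Apply the rule to the data to be converted to obtain the converted data set.
    return [rule(record) for record in multiplexed]
```

A request whose platform and data type have no matching rule raises `KeyError` in this sketch; how such a case should be handled is not addressed in the description.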
Referring to FIG. 2, the present invention provides a method for multiplexing annotation data, including:
step S201, a server end receives an export instruction sent by a browser end;
step S202, the server end obtains target data according to a preset annotation type and target data identification information;
step S203, the server end sends the target data to the browser end, so that the browser end merges the target data into the data set to be annotated to obtain a merged data set, and annotates the unannotated data in the merged data set to generate a multiplexed data set for subsequent multiplexing.
The export instruction comprises the preset annotation type of the target data and the target data identification information, and the target data has been annotated in advance according to the preset annotation type.
In this method for multiplexing annotation data, the server end receives an export instruction sent by the browser end, the export instruction comprising the preset annotation type of the target data and the target data identification information; the server end obtains, according to the preset annotation type and the target data identification information, the target data that has been annotated in advance, and sends the target data to the browser end, so that the browser end can merge the target data into the data set to be annotated to obtain a merged data set and annotate the unannotated data in the merged data set to generate a multiplexed data set for subsequent multiplexing. Compared with the prior art, the already-annotated target data is merged into the data set to be annotated before the unannotated data in that data set is annotated, so that the target data is multiplexed and operation cost and time cost are saved.
Optionally, if the export instruction further includes export tagging information, the server side obtains target data according to a preset tagging type and the target data identification information, where the target data includes tagging information.
Optionally, if the export instruction includes information indicating that labeling information is not to be exported and does not include export labeling information, the server side obtains the target data according to the preset labeling type and the target data identification information, and the target data does not include labeling information.
Optionally, the obtaining, by the server, the target data according to the preset annotation type and the target data identification information includes: the server side matches corresponding target data from a preset labeling data database according to the preset labeling type and the target data identification information; the preset labeling data database stores the corresponding relation among the preset labeling type, the target data identification information and the target data.
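The server-side lookup and the two optional export modes above can be sketched as follows. This is an illustration under assumptions, not the patented implementation: the in-memory dictionary `ANNOTATION_DB` stands in for the preset labeling data database, and the class names, field names, and the `include_annotations` flag are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass
class TargetData:
    data_id: str
    raw: bytes
    annotations: List[dict] = field(default_factory=list)
    tags: List[str] = field(default_factory=list)


@dataclass
class ExportInstruction:
    annotation_type: str              # preset labeling type
    data_id: str                      # target data identification information
    include_annotations: bool = True  # hypothetical flag for "export labeling information"


# Stand-in for the preset labeling data database:
# (preset labeling type, target data identification) -> target data.
ANNOTATION_DB: Dict[Tuple[str, str], TargetData] = {}


def handle_export(instruction: ExportInstruction) -> Optional[TargetData]:
    """Match the target data and strip labeling information when it is not requested."""
    found = ANNOTATION_DB.get((instruction.annotation_type, instruction.data_id))
    if found is None:
        return None
    if instruction.include_annotations:
        return found
    # Export without labeling information: return the original data only.
    return TargetData(data_id=found.data_id, raw=found.raw)
```

Returning a fresh `TargetData` in the second mode keeps the stored record untouched, so a later export with labeling information still sees the full entry.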
Optionally, after the server side matches corresponding target data from the preset labeling data database according to the preset labeling type and the target data identification information, the method further includes: the server side obtains a preset conversion protocol, which is determined according to the preset labeling type; the server side then performs data conversion on the target data according to the preset conversion protocol, packages the converted target data and sends the packaged target data to the browser side. The target data is thereby exported to the browser side according to a specific conversion protocol, which further supports multiplexing and sharing of the target data.
In some embodiments, the target data includes first original data, first label information corresponding to the first original data, and first tag information corresponding to the first original data. The first label information and the first tag information in the target data are each converted into a JSON format file according to the preset conversion protocol, and the first original data, the JSON format file of the first label information, and the JSON format file of the first tag information are compressed together to obtain a ZIP format compressed file containing the target data.
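A minimal sketch of this packaging step, using only the Python standard library, might look as follows. The archive member names `annotations.json` and `tags.json` and the function name are assumptions for illustration; the specification does not name the files or fix their internal layout.

```python
import io
import json
import zipfile
from typing import List


def package_target_data(name: str, raw: bytes, annotations: List[dict], tags: List[str]) -> bytes:
    """Bundle original data plus two JSON files into one in-memory ZIP archive."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        # First original data, stored under its own file name.
        archive.writestr(name, raw)
        # Label information and tag information, each as a separate JSON file
        # (member names are hypothetical).
        archive.writestr("annotations.json", json.dumps(annotations, ensure_ascii=False))
        archive.writestr("tags.json", json.dumps(tags, ensure_ascii=False))
    return buffer.getvalue()
```

Building the archive in memory lets the server side send the bytes directly in a response without touching its own disk; the browser side can then save the file as described for step S309.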
Referring to fig. 3, the present invention provides a target data acquiring method, including:
step S301, a browser side acquires first original data and a preset marking type;
the first original data is image data, and the preset labeling type is one of image labeling types such as single classification labeling, multi-classification labeling, object detection labeling, image cutting labeling, object detection and classification mixed labeling and the like;
step S302, the browser end sends the first original data and a preset annotation type to a current annotation platform;
the current annotation platform is one of traffic snapshot image annotation platforms, life picture annotation platforms, photo processing annotation platforms and other annotation platforms;
step S303, the current labeling platform performs data verification on the first original data and determines whether the verification result is valid; if yes, go to step S304; if not, go to step S305;
step S304, the current labeling platform labels the first original data according to a preset labeling type to obtain target data, and step S306 is executed;
the target data comprises first original data, first label information corresponding to the first original data and first tag information corresponding to the first original data;
step S305, the current labeling platform sends labeling failure information to the browser end;
step S306, the current labeling platform sends the target data to a server side;
step S307, the server side obtains a preset conversion protocol;
wherein, the preset conversion protocol is obtained according to a preset marking type;
step S308, the server side performs data conversion on the target data according to a preset conversion protocol, packages the converted target data and sends the packaged target data to the browser side;
step S309, the browser end stores the received target data on the client disk.
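The verify-then-label branch of steps S303 to S305 can be sketched as below. The specification does not say what the data verification checks, so the file-signature test here is purely an assumption, as are the function names, the result dictionary, and the `annotate` callback that stands in for the labeling platform.

```python
from typing import Callable, List, Tuple

# Leading bytes of JPEG and PNG files, used as a hypothetical validity check.
IMAGE_SIGNATURES = (b"\xff\xd8\xff", b"\x89PNG\r\n\x1a\n")

Annotator = Callable[[bytes, str], Tuple[List[dict], List[str]]]


def validate_image(raw: bytes) -> bool:
    """Assumed form of step S303: accept data that starts with a known image signature."""
    return any(raw.startswith(signature) for signature in IMAGE_SIGNATURES)


def acquire_target_data(raw: bytes, annotation_type: str, annotate: Annotator) -> dict:
    """Verify the original data, then label it or report a labeling failure."""
    if not validate_image(raw):
        # Step S305: labeling failure information goes back to the browser end.
        return {"ok": False, "error": "labeling failed: original data did not pass verification"}
    # Step S304: label according to the preset labeling type.
    annotations, tags = annotate(raw, annotation_type)
    target = {
        "raw": raw,
        "annotation_type": annotation_type,
        "annotations": annotations,
        "tags": tags,
    }
    return {"ok": True, "target": target}
```

Passing the annotator in as a callback keeps the sketch independent of any particular labeling platform.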
Referring to fig. 4, the present invention provides a method for multiplexing label data, including:
step S401, a browser end acquires a data set to be marked and a preset marking type;
the data set to be labeled comprises labeled data and label-free data, the labeled data comprises second original data, second label information corresponding to the second original data and second tag information corresponding to the second original data, and the preset labeling type is one of image labeling types such as single-classification labeling, multi-classification labeling, object detection labeling, image cutting labeling, object detection and classification mixed labeling and the like;
step S402, the browser end sends an export instruction to the server end;
the export instruction comprises a preset labeling type of the target data and target data identification information, and the target data is labeled in advance according to the preset labeling type;
step S403, the server side obtains target data according to the export instruction;
the target data comprises first original data, first label information corresponding to the first original data and first tag information corresponding to the first original data;
step S404, the server side sends the target data to the browser side;
step S405, the browser side obtains target data, and merges the target data into a data set to be annotated to obtain a merged data set;
step S406, the browser end labels the unlabeled data in the merged data set according to the preset labeling type to generate a multiplexing data set for subsequent multiplexing;
wherein the unlabeled data comprises one or more of the label-free data, the first original data and the second original data.
With the method for multiplexing labeled data provided by the invention, the browser end acquires the data set to be labeled and the preset labeling type and sends to the server end an export instruction containing the preset labeling type of the target data and the target data identification information; the server end acquires the target data according to the export instruction and sends the target data to the browser end; the browser end merges the target data into the data set to be labeled to obtain a merged data set, and then labels the unlabeled data in the merged data set to generate a multiplexing data set for subsequent multiplexing. Compared with the prior art, the already labeled target data is merged into the data set to be labeled and only the unlabeled data in that data set is then labeled, so that the target data is multiplexed and operation cost and time cost are saved.
Referring to fig. 5, the present invention provides a system for multiplexing annotation data, which includes at least one of a browser end 501 and a server end 502. The browser end 501 is configured to acquire a data set to be labeled and a preset labeling type, send an export instruction to the server end, where the export instruction includes a preset labeling type of target data and target data identification information and the target data has been labeled in advance according to the preset labeling type, merge the target data into the data set to be labeled after acquiring the target data to obtain a merged data set, and label the unlabeled data in the merged data set to generate a multiplexing data set for subsequent multiplexing. The server end 502 is configured to receive the export instruction sent by the browser end, obtain the target data according to the preset labeling type and the target data identification information in the export instruction, and send the target data to the browser end.
With the system for multiplexing labeled data provided by the invention, the browser end acquires the data set to be labeled and the preset labeling type and sends to the server end an export instruction containing the preset labeling type of the target data and the target data identification information; the server end acquires the target data according to the export instruction and sends the target data to the browser end; the browser end merges the target data into the data set to be labeled to obtain a merged data set, and then labels the unlabeled data in the merged data set to generate a multiplexing data set for subsequent multiplexing. Compared with the prior art, the already labeled target data is merged into the data set to be labeled and only the unlabeled data in that data set is then labeled, so that the target data is multiplexed and operation cost and time cost are saved.
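The merge and labeling steps on the browser end can be sketched as follows. This is a simplified illustration under assumptions: items are plain dictionaries with a `raw` field and an optional `annotations` field, two items are treated as the same when their raw bytes hash to the same SHA-256 digest, and only items without any label information are labeled. None of these choices is prescribed by the specification.

```python
import hashlib
from typing import Callable, List


def fingerprint(raw: bytes) -> str:
    """Content hash used here as the comparison between target data and existing data."""
    return hashlib.sha256(raw).hexdigest()


def merge_target_data(dataset: List[dict], target_items: List[dict]) -> List[dict]:
    """Drop existing items that duplicate a target item, then append the target items."""
    target_keys = {fingerprint(item["raw"]) for item in target_items}
    kept = [item for item in dataset if fingerprint(item["raw"]) not in target_keys]
    return kept + list(target_items)


def label_unlabeled(merged: List[dict], annotate: Callable[[bytes], List[dict]]) -> List[dict]:
    """Label only the items that carry no label information yet."""
    result = []
    for item in merged:
        if item.get("annotations"):
            result.append(item)  # already labeled: reuse as is
        else:
            result.append({**item, "annotations": annotate(item["raw"])})
    return result
```

In this sketch the saving comes from the second function: every item that arrived with label information skips the `annotate` call entirely.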
Optionally, the browser side and the server side are connected via a wireless network.
The foregoing embodiments are merely illustrative of the principles of the present invention and its efficacy, and are not to be construed as limiting the invention. Any person skilled in the art can modify or change the above-mentioned embodiments without departing from the spirit and scope of the present invention. Accordingly, it is intended that all equivalent modifications or changes which can be made by those skilled in the art without departing from the spirit and technical spirit of the present invention be covered by the claims of the present invention.
As shown in fig. 6, the present embodiment provides an electronic device, including: a processor 600 and a memory 601; the memory is used for storing computer programs, and the processor is used for executing the computer programs stored in the memory so as to enable the electronic device to execute the method in the embodiment. Optionally, the electronic device may further include a communication interface 602 and a bus 603. The processor 600, the communication interface 602, and the memory 601 may communicate with each other via the bus 603. The communication interface 602 may be used for information transfer. The processor 600 may call logic instructions in the memory 601 to perform the method of annotation data multiplexing of the above-described embodiment.
In addition, the logic instructions in the memory 601 may be implemented in the form of software functional units and stored in a computer readable storage medium when the logic instructions are sold or used as independent products.
The memory 601 is a computer-readable storage medium, and can be used for storing software programs, computer-executable programs, such as program instructions/modules corresponding to the methods in the embodiments of the present disclosure. The processor 600 executes the functional application and data processing by executing the program instructions/modules stored in the memory 601, i.e. implements the method of multiplexing the label data in the above embodiments.
The memory 601 may include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required for at least one function; the storage data area may store data created according to the use of the terminal device, and the like. In addition, the memory 601 may include a high speed random access memory, and may also include a non-volatile memory.
Optionally, if the method is applied to a browser end, the electronic device is one of a computer, a notebook computer, a tablet computer, a smart phone, a server, and the like. The electronic device acquires a data set to be labeled, which contains unlabeled data, and a preset labeling type, sends to a server end an export instruction containing the preset labeling type of target data and target data identification information so as to acquire the target data that has been labeled in advance according to the preset labeling type, merges the target data into the data set to be labeled to obtain a merged data set, and labels the unlabeled data in the merged data set to generate a multiplexing data set for subsequent multiplexing. Compared with the prior art, the already labeled target data is merged into the data set to be labeled and only the unlabeled data in that data set is then labeled, so that the target data is multiplexed and operation cost and time cost are saved.
Optionally, if the method is applied to a server end, the electronic device is a server. The electronic device receives an export instruction sent by a browser end, the export instruction including a preset labeling type of target data and target data identification information, acquires the target data that has been labeled in advance according to the preset labeling type and the target data identification information, and sends the target data to the browser end, so that the browser end merges the target data into a data set to be labeled to obtain a merged data set and labels the unlabeled data in the merged data set to generate a multiplexing data set for subsequent multiplexing. Compared with the prior art, the already labeled target data is merged into the data set to be labeled and only the unlabeled data in that data set is then labeled, so that the target data is multiplexed and operation cost and time cost are saved.
The present embodiment also provides a computer-readable storage medium on which a computer program is stored, which when executed by a processor implements any of the methods in the present embodiments.
The computer-readable storage medium in the present embodiment can be understood by those skilled in the art as follows: all or part of the steps for implementing the above method embodiments may be performed by hardware associated with a computer program. The aforementioned computer program may be stored in a computer readable storage medium. When executed, the program performs steps comprising the method embodiments described above; and the aforementioned storage medium includes: various media that can store program codes, such as ROM, RAM, magnetic or optical disks.
The electronic device provided by the embodiment comprises a processor, a memory, a transceiver and a communication interface, wherein the memory and the communication interface are connected with the processor and the transceiver and are used for realizing mutual communication, the memory is used for storing a computer program, the communication interface is used for carrying out communication, and the processor and the transceiver are used for operating the computer program to enable the electronic device to execute the steps of the method.
In this embodiment, the Memory may include a Random Access Memory (RAM), and may also include a non-volatile Memory (non-volatile Memory), such as at least one disk Memory.
The processor may be a general-purpose processor, including a Central Processing Unit (CPU), a Network Processor (NP), and the like; the processor may also be a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, or a discrete hardware component.
The above description and drawings sufficiently illustrate embodiments of the disclosure to enable those skilled in the art to practice them. Other embodiments may incorporate structural, logical, electrical, process, and other changes. The examples merely typify possible variations. Individual components and functions are optional unless explicitly required, and the sequence of operations may vary. Portions and features of some embodiments may be included in or substituted for those of others. Furthermore, the words used in the specification are words of description only and are not intended to limit the claims. As used in the description of the embodiments and the claims, the singular forms "a", "an" and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. Similarly, the term "and/or" as used in this application is meant to encompass any and all possible combinations of one or more of the associated listed items. Furthermore, the terms "comprises" and/or "comprising," when used in this application, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof. Without further limitation, an element defined by the phrase "comprising an …" does not exclude the presence of other like elements in a process, method or apparatus that comprises the element. In this document, each embodiment may be described with emphasis on differences from other embodiments, and the same and similar parts between the respective embodiments may be referred to each other. For the methods, products, etc. disclosed in the embodiments, if they correspond to the method section disclosed in the embodiments, reference may be made to the description of the method section for the relevant details.
Those of skill in the art would appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software may depend upon the particular application and design constraints imposed on the solution. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the disclosed embodiments. It can be clearly understood by the skilled person that, for convenience and brevity of description, the specific working processes of the system, the apparatus and the unit described above may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
In the embodiments disclosed herein, the disclosed methods, products (including but not limited to devices, apparatuses, etc.) may be implemented in other ways. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the units may be merely a logical division, and in actual implementation, there may be another division, for example, multiple units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form. The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to implement the present embodiment. In addition, functional units in the embodiments of the present disclosure may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit.
The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). In some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. In the description corresponding to the flowcharts and block diagrams in the figures, operations or steps corresponding to different blocks may also occur in different orders than disclosed in the description, and sometimes there is no specific order between the different operations or steps. For example, two sequential operations or steps may in fact be executed substantially concurrently, or they may sometimes be executed in the reverse order, depending upon the functionality involved. Each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can be implemented by special purpose hardware-based systems that perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.

Claims (10)

1. A method for multiplexing annotation data, comprising:
a browser end obtains a data set to be labeled and a preset labeling type;
the browser end sends an export instruction to a server end, wherein the export instruction comprises a preset labeling type of target data and target data identification information, and the target data has been labeled in advance according to the preset labeling type;
the browser end obtains the target data and merges the target data into the data set to be labeled to obtain a merged data set;
and the browser end labels the unlabeled data in the merged data set according to the preset labeling type to generate a multiplexing data set for subsequent multiplexing.
2. The method for multiplexing annotation data of claim 1, further comprising:
the method comprises the steps that after a data acquisition request sent by a third-party platform is acquired by a browser, a data conversion rule is acquired according to the data acquisition request, wherein the data acquisition request comprises data type information required by the third-party platform;
the browser end carries out data conversion on data to be converted according to the data conversion rule to obtain a conversion data set, wherein the data to be converted comprises at least one part of multiplexing data set;
and the browser end sends the conversion data set to the third-party platform.
3. The method for multiplexing annotation data according to claim 1, wherein the target data is obtained by:
acquiring first original data and generating target identification information corresponding to the first original data;
labeling the first original data according to the preset labeling type to obtain target data;
and establishing and storing the corresponding relation among the target identification information, the preset labeling type and the target data.
4. The method for multiplexing annotation data of claim 1, wherein the merging the target data into the data set to be annotated by the browser-side comprises:
the browser end compares the target data with set data in the data set to be marked to obtain a comparison result corresponding to the set data;
and the browser deletes the set data with the same comparison result from the data set to be marked, and adds the target data into the data set to be marked to obtain a combined data set.
5. A method for multiplexing annotation data, comprising:
a server side receives an export instruction sent by a browser side, wherein the export instruction comprises a preset labeling type of target data and target data identification information, and the target data has been labeled in advance according to the preset labeling type;
the server side acquires the target data according to the preset labeling type and the target data identification information;
and the server side sends the target data to the browser side so as to realize that the browser side merges the target data into a data set to be annotated to obtain a merged data set, and annotates the data which are not annotated in the merged data set to generate a multiplexing data set for subsequent multiplexing.
6. The method for multiplexing of labeled data according to claim 5, wherein if the export instruction further includes export labeled information, the server obtains the target data according to the preset label type and the target data identification information, and the target data includes labeled information.
7. The method for multiplexing annotation data according to claim 5 or 6, wherein the step of acquiring the target data by the server according to the preset annotation type and the target data identification information comprises:
and the server side matches corresponding target data from a preset labeling data database according to the preset labeling type and the target data identification information, wherein the preset labeling data database comprises the target data, a preset labeling type corresponding to the target data and target data identification information corresponding to the target data.
8. A system for multiplexing annotation data, comprising at least one of:
a browser end, configured to acquire a data set to be labeled and a preset labeling type, send an export instruction to a server end, wherein the export instruction comprises a preset labeling type of target data and target data identification information and the target data has been labeled in advance according to the preset labeling type, merge the target data into the data set to be labeled after the target data is acquired to obtain a merged data set, and label the unlabeled data in the merged data set to generate a multiplexing data set for subsequent multiplexing;
the server end, configured to receive the export instruction sent by the browser end, acquire the target data according to the preset labeling type and the target data identification information in the export instruction, and send the target data to the browser end.
9. An electronic device, comprising: a processor and a memory;
the memory is for storing a computer program, and the processor is for executing the computer program stored by the memory to cause the electronic device to perform the method of any of claims 1-7.
10. A computer-readable storage medium having stored thereon a computer program, characterized in that: the computer program, when executed by a processor, implements the method of any one of claims 1 to 7.
CN202111110072.8A 2021-09-18 2021-09-18 Method and system for multiplexing annotation data, electronic equipment and readable storage medium Active CN113792203B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111110072.8A CN113792203B (en) 2021-09-18 2021-09-18 Method and system for multiplexing annotation data, electronic equipment and readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111110072.8A CN113792203B (en) 2021-09-18 2021-09-18 Method and system for multiplexing annotation data, electronic equipment and readable storage medium

Publications (2)

Publication Number Publication Date
CN113792203A (en) 2021-12-14
CN113792203B (en) 2023-05-16

Family

ID=79184167

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111110072.8A Active CN113792203B (en) 2021-09-18 2021-09-18 Method and system for multiplexing annotation data, electronic equipment and readable storage medium

Country Status (1)

Country Link
CN (1) CN113792203B (en)

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020136772A1 (en) * 2001-03-26 2002-09-26 Tai-Nang Huang Polymer synthesis
CN105094760A (en) * 2014-04-28 2015-11-25 小米科技有限责任公司 Picture marking method and device
CN110389978A (en) * 2019-07-17 2019-10-29 北京智能工场科技有限公司 A kind of racing data collection automated programming system and method based on multiplexing
CN110502964A (en) * 2019-05-21 2019-11-26 杭州电子科技大学 A kind of pedestrian's recognition methods again of unsupervised data-driven
CN111639284A (en) * 2020-05-29 2020-09-08 深圳壹账通智能科技有限公司 Webpage labeling method and device, electronic equipment and medium
CN112434794A (en) * 2020-11-30 2021-03-02 国电南瑞科技股份有限公司 Computer vision data set semi-automatic labeling method and system based on deep learning
CN112632284A (en) * 2020-12-30 2021-04-09 上海明略人工智能(集团)有限公司 Information extraction method and system for unlabeled text data set
CN112784905A (en) * 2021-01-26 2021-05-11 北京嘀嘀无限科技发展有限公司 Data sample expansion method and device and electronic equipment
CN112861842A (en) * 2021-03-22 2021-05-28 天津汇智星源信息技术有限公司 Case text recognition method based on OCR and electronic equipment

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
李媛媛 等: "结合本体与社会化标签的用户动态兴趣建模研究", 《情报学报》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114564253A (en) * 2022-03-02 2022-05-31 重庆紫光华山智安科技有限公司 Task creation method, system, electronic device and readable storage medium
CN114564253B (en) * 2022-03-02 2023-06-09 重庆紫光华山智安科技有限公司 Task creation method, system, electronic device and readable storage medium

Also Published As

Publication number Publication date
CN113792203B (en) 2023-05-16


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant