CN105404896B - Labeled data processing method and labeled data processing system - Google Patents

Labeled data processing method and labeled data processing system Download PDF

Info

Publication number
CN105404896B
CN105404896B CN201510744484.5A CN201510744484A CN105404896B CN 105404896 B CN105404896 B CN 105404896B CN 201510744484 A CN201510744484 A CN 201510744484A CN 105404896 B CN105404896 B CN 105404896B
Authority
CN
China
Prior art keywords
mark
annotation results
data processing
labeled data
task
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510744484.5A
Other languages
Chinese (zh)
Other versions
CN105404896A (en
Inventor
许欣然
高宇
印奇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Megvii Technology Co Ltd
Beijing Maigewei Technology Co Ltd
Original Assignee
Beijing Megvii Technology Co Ltd
Beijing Maigewei Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Megvii Technology Co Ltd, Beijing Maigewei Technology Co Ltd filed Critical Beijing Megvii Technology Co Ltd
Priority to CN201510744484.5A priority Critical patent/CN105404896B/en
Publication of CN105404896A publication Critical patent/CN105404896A/en
Application granted granted Critical
Publication of CN105404896B publication Critical patent/CN105404896B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a kind of labeled data processing method and labeled data processing systems.The labeled data processing method includes: step S110: calculating the similarity of multiple annotation results relevant to mark task;Step S120: similarity is compared with similarity threshold, if similarity is greater than or equal to similarity threshold, goes to step S130, if similarity is less than the similarity threshold, goes to step S140;Step S130: determine that multiple annotation results pass through quality testing;And step S140: determine that multiple annotation results do not pass through quality testing.The labeled data processing method and labeled data processing system provided according to the present invention, due to detecting the quality of annotation results automatically using similarity, so that mark personnel are possible to the quality of timely learning annotation results, and then it is possible to timely correction marking error, it can effectively improve mark accuracy.

Description

Labeled data processing method and labeled data processing system
Technical field
The present invention relates to data processing fields, and in particular to a kind of labeled data processing method and labeled data processing system System.
Background technique
Machine is trained (or saying study) and usually requires a large amount of labeled data as training set, the data of labeled data It measures bigger more helpful to trained effect, therefore how efficiently and accurately to carry out data mark and have become one and urgently solve Certainly the problem of.The data mark process of existing data labeling system is usually: publication includes one or more mark units Mark task is manually marked, carries out manual quality's inspection etc..Existing data labeling system fully relies on manual quality's inspection It looks into control mark accuracy, therefore may be grown very much from the time interval being accomplished between quality examination is manually marked, it is difficult to The mistake of timely correction mark personnel.
Summary of the invention
In view of the above problems, the present invention is proposed in order to provide a kind of labeled data at least being partially solved the above problem Processing method and labeled data processing system.
According to an aspect of the invention, there is provided a kind of labeled data processing method, comprising: step S110: calculate with The similarity of the relevant multiple annotation results of mark task;Step S120: similarity is compared with similarity threshold, if phase It is greater than or equal to similarity threshold like degree, then goes to step S130, if similarity is less than similarity threshold, goes to step S140;Step S130: determine that multiple annotation results pass through quality testing;And step S140: determine that multiple annotation results are not led to Cross quality testing.
According to another aspect of the present invention, a kind of labeled data processing system, including computing device, similarity-rough set are provided Device, the first executive device and the second executive device.Computing device is for calculating multiple annotation results relevant to the task that marks Similarity.Similarity-rough set device is for comparing similarity with similarity threshold, if similarity is greater than or equal to phase Like degree threshold value, then start the first executive device, if similarity is less than similarity threshold, starts the second executive device.First Executive device is for determining that multiple annotation results pass through quality testing.Second executive device is for determining that multiple annotation results are not led to Cross quality testing.
The labeled data processing method and labeled data processing system provided according to the present invention, due to automatic using similarity Detect the quality of annotation results so that mark personnel are possible to the quality of timely learning annotation results, and then be possible to and When correct marking error, can effectively improve mark accuracy.
The above description is only an overview of the technical scheme of the present invention, in order to better understand the technical means of the present invention, And it can be implemented in accordance with the contents of the specification, and in order to allow above and other objects of the present invention, feature and advantage can It is clearer and more comprehensible, the followings are specific embodiments of the present invention.
Detailed description of the invention
By reading the following detailed description of the preferred embodiment, various other advantages and benefits are common for this field Technical staff will become clear.The drawings are only for the purpose of illustrating a preferred embodiment, and is not considered as to the present invention Limitation.And throughout the drawings, the same reference numbers will be used to refer to the same parts.In the accompanying drawings:
Fig. 1 shows the flow chart of labeled data processing method according to an embodiment of the invention;
Fig. 2 shows the flow charts of labeled data processing method in accordance with another embodiment of the present invention;
Fig. 3 shows the flow chart of labeled data processing method in accordance with another embodiment of the present invention;And
Fig. 4 shows the schematic block diagram of labeled data processing system according to an embodiment of the invention.
Specific embodiment
Exemplary embodiments of the present disclosure are described in more detail below with reference to accompanying drawings.Although showing the disclosure in attached drawing Exemplary embodiment, it being understood, however, that may be realized in various forms the disclosure without should be by embodiments set forth here It is limited.On the contrary, these embodiments are provided to facilitate a more thoroughly understanding of the present invention, and can be by the scope of the present disclosure It is fully disclosed to those skilled in the art.
According to an aspect of the invention, there is provided a kind of labeled data processing method.Fig. 1 shows according to the present invention one The flow chart of the labeled data processing method 100 of a embodiment.
As shown in Figure 1, labeled data processing method 100 includes the following steps.
Step S110: the similarity of multiple annotation results relevant to mark task is calculated.Mark task as described herein Refer to requiring including object to be marked and mark for task.Object to be marked is referred to as " mark unit ", can be One or a set of image, video or audio etc..Mark requirement is the letter how indicateing arm note personnel are labeled mark unit Breath.For example, mark unit can be the image comprising several faces, mark, which requires can be, to be indicated to outline the institute in image The information for having face or marking out several key points on face etc..Mark personnel can scheme after receiving mark task Face or mark face key point etc. are outlined as in.Image comprising the face through marking can be used in and answer with all kinds of recognitions of face With in the training of relevant machine.It certainly, is not pair the above is only the example of mark unit and mark requirement in mark task Limitation of the invention.Other several examples that mark unit and mark require are described below.Mark unit can also be comprising it The image of his content e.g. includes the image of the contents such as text (trademark or license plate number etc.), animal, article.Accordingly Ground, mark require to can be the information of all texts indicated mark out in image, animal or article.Mark unit can also be Image comprising personage, mark require to be to indicate to determine the sex, race of personage or the information at age etc. in image.Mark unit Can also be one group of image, including one include object of reference image and multiple images to be selected, mark require can be instruction from The information of specific image to be selected is selected in all images to be selected, which includes the object same or similar with object of reference Body.Mark unit can also be a segment of audio, and mark requires to be the information for indicating the text occurred in input audio.Mark unit It can also be a problem and set of option, mark requires to be that instruction selects appropriate choosing corresponding with problem from set of option The information of item.
Annotation results refer to that mark personnel require the result data obtained after being labeled to mark unit based on mark. For example, annotation results may include the information of several face key points marked out about mark personnel, such as each face The position etc. of key point in the picture.According to an embodiment of the invention, a mark task can be received by multiple mark personnel And it participates in.Each mark personnel can provide an annotation results for a mark task, thus, it is possible to obtain appointing with mark It is engaged in relevant multiple annotation results.Then, the similarity between these annotation results can be calculated.For different types of mark As a result for, the calculation method of similarity may be different.Below by taking two annotation results as an example description similarity calculation method. A numerical value be can use to describe the similarity between two annotation results, it includes but be not limited to following several method:
If can be calculated between the corresponding points in two annotation results if mark task is that mark is done on the image Similarity of the summation of Euclidean distance as two annotation results;
If mark task is to mark several polygons on the image, the polygon phase in two annotation results can be calculated Cross surface product and phase and similarity of the area ratio (IOU) as two annotation results;
If mark task be in multiple options select an option, the consistent similarity of two annotation results be 1, two The inconsistent similarity of annotation results is 0;
If mark task is to select more than one option in multiple options, can calculate in two annotation results The quantity of public option (i.e. the intersection of option) (has selected choosing with all options in two annotation results Union) similarity of the ratio of number as two annotation results.
With reference to the description of the similarity above with respect to two annotation results, it is possible to understand that the phase of more than two annotation results Like the calculation of degree, it can realize that details are not described herein using routine techniques.
Step S120: similarity is compared with similarity threshold, if similarity is greater than or equal to similarity threshold, Step S130 is gone to, if similarity is less than similarity threshold, goes to step S140.Similarity threshold can be any suitable Value, for example, similarity threshold can be greater than or equal to 80%, e.g. 85%, 90% or 95% etc. can according to need Depending on, the present invention limits not to this.Similarity threshold can be initially set to a default value, then can be according to practical need Automatically it to be adjusted.For different mark tasks, similarity threshold can be identical or different.For example, for opposite Simple mark task, similarity threshold can be larger, on the contrary, for relative complex mark task, similarity threshold can be with It is smaller.Multiple annotation results of the same mark task can be compared, calculate the similarity between them, then, sentence Whether the similarity between multiple annotation results of breaking is greater than or equal to above-mentioned similarity threshold, as shown in Figure 1.According to similarity with Size relation selection between similarity threshold executes step S130 and still executes step S140.
Step S130: determine that multiple annotation results pass through quality testing.If the similarity of above-mentioned multiple annotation results is big In or be equal to similarity threshold, then illustrate multiple mark personnel to it is same mark unit carry out mark be all it is more similar, Therefore this multiple annotation results is possible to be the higher annotation results of accuracy.Therefore, such case can be considered as these Annotation results are all the correct marks to mark unit, are the correct processing results of mark task.In this way it can be considered that this is multiple The quality of annotation results can guarantee, determine that they pass through quality testing.
Step S140: determine that multiple annotation results do not pass through quality testing.If the phase between above-mentioned multiple annotation results It is less than similarity threshold like degree, then the mark difference for illustrating that multiple mark personnel carry out same mark unit is larger, therefore, right For this multiple annotation results, wherein a possibility that there are error labels is very big.Therefore, such case can be considered as to these marks Note result is not the correct mark to mark unit, that is, thinks that the quality of this multiple annotation results not can guarantee, and determines it Do not pass through quality testing.It is understood that quality testing as described herein is to detect annotation results by similarity Quality can reflect the quality of annotation results to a certain extent, can also be further by artificial after quality testing Mode verifies the quality of annotation results.Efficiency is higher by way of quality of the similarity to detect annotation results automatically, and And mark accuracy can be improved.
The labeled data processing method provided according to the present invention, due to detecting the matter of annotation results automatically using similarity Amount so that mark personnel are possible to the quality of timely learning annotation results, and then is possible to timely correction marking error, can To effectively improve mark accuracy.
Fig. 2 shows the flow charts of labeled data processing method 200 in accordance with another embodiment of the present invention.It is shown in Fig. 2 Step S110, step S120, step S130 and step S140 are similar to Fig. 1, repeat no more.In the present embodiment, in step Before S110, labeled data processing method 200 may further include following steps.
Step S102: certain number of annotation results relevant to mark task are obtained.Mark personnel can be by some Interactive device initiates to request to labeled data processing system, and labeled data processing system can be from the queue to be marked safeguarded Selection is suitable for the mark task of mark personnel and the mark task is sent to mark personnel.Then, available mark The annotation results that note personnel obtain after marking.
Step S104: judging whether given number is equal to quantity threshold relevant to mark task, if given number etc. In quantity threshold, then step S106 is gone to, otherwise goes to step S102.Different mark tasks can correspond to different number thresholds Value is that is, related to different quantity thresholds.For relatively simple mark task, mark personnel mark mark unit It is less susceptible to mistake occur when note, therefore less mark personnel can be required to be labeled.Therefore, simply mark task can With the lesser quantity threshold of correspondence.On the contrary, for complicated mark task, when mark personnel are labeled mark unit The probability for mistake occur may be larger, therefore more mark personnel can be required to be labeled.Therefore, complicated mark task Biggish quantity threshold can be corresponded to.For new mark task, initial quantity threshold can be smaller, and for example, 2, then may be used To be adjusted according to actual needs to quantity threshold.For a mark task, whenever mark personnel provide a mark Note is as a result, can store the annotation results.It may determine that whether the number of stored annotation results reaches number Threshold value, the annotation results for being not up to quantity threshold do not execute step S110, but re-execute the steps S102, for reaching The annotation results of quantity threshold start to execute step S110.
Step S106: certain number of annotation results are determined as multiple annotation results and go to step S110.Institute as above It states, when the number of annotation results reaches quantity threshold, i.e., reaches institute when receiving and participate in the mark personnel of same mark task When needing number, it can start to execute step S110.Furthermore it is also possible to maintenance quality detects queue, it can be by the number of annotation results The mark task that mesh is equal to quantity threshold removes out queue to be marked and quality testing queue is added, and temporarily no longer marks to other Personnel send the mark task.
Receiving is controlled above by quantity threshold and the method for participating in the mark personnel of same mark task can be reasonable Ground avoids extra, meaningless mark, as far as possible using mark manpower so as to effectively improve manpower utilization.
Fig. 3 shows the flow chart of labeled data processing method 300 in accordance with another embodiment of the present invention.It is shown in Fig. 3 Step S102, step S104, step S106, step S110, step S120 and step S130 are similar to Fig. 1, repeat no more.At this In embodiment, step S140 may include step S1402 and step S1404.Step S1402 is the multiple marks of above-mentioned determination As a result do not pass through quality inspection steps.In addition to step S1402, step S140 may further include step S1404, that is, increase Quantity threshold simultaneously goes to step S102.
When determining that multiple annotation results do not pass through quality testing, quantity threshold can be increased, and obtain more marks As a result for carrying out quality testing.The number of annotation results increases, so that the similarity of all annotation results is possible to increase, it is whole The quality of body is possible to improve.Depending on the amount that quantity threshold increases every time can according to need, the present invention limits not to this. It is appreciated that mark task can be removed to mass detection when increasing quantity threshold and obtaining new annotation results Queue simultaneously rejoins queue to be marked.Meanwhile new mark personnel can be introduced, wherein each mark personnel only mark one It is secondary.Further, it is also possible to retain last any one or more of the multiple annotation results for carrying out quality testing, or can be with All using new annotation results.For new mark task, quantity threshold can initially be set to compared with It is small, for example, 2.For relatively simple mark task, it is possible to no need to increase quantity threshold or only increase fewer number mesh threshold Value can pass through quality testing.Therefore it for relatively simple mark task, may be carried out eventually by when quality testing Labeled times it is less, so as to reduce the repeat mark for being directed to simple mark task to the greatest extent, save mark manpower.And it is right In relative complex mark task, quantity threshold can be increased always, increase the number of the annotation results of acquisition until passing through matter Amount detection improves mark accuracy in this way, the final mass of the annotation results of complicated mark task can be improved.Therefore, according to The method of this adjust automatically quantity threshold provided by the invention, can be as far as possible efficiently using mark manpower, while can lead to Reasonable more people's repeat marks are crossed, achieve the purpose that promote mark quality.This is unlike the prior art.It is marked in existing data In method, in order to promote mark accuracy, it usually needs the labeled times being multiplied to the same mark task.However, every A mark task, which is appropriate for how many times mark, can not determine and can not also be reasonably adjusted, can only be according to theoretical or experience Estimated.Therefore, this is likely to result in for the inappropriate labeled times of certain mark task choosings, to waste people Power reduces mark accuracy.
Optionally, step S140 may further include: multiple annotation results be sent, to be checked by inspection personnel; Check feedback information;And send and check feedback information, to inform mark personnel.Multiple annotation results similarity compared with It is small, so that multiple annotation results can be sent to inspection personnel in the case that annotation results do not pass through quality testing and examined It looks into.Inspection personnel can provide after checking annotation results and check feedback information (i.e. inspection opinion).For example, In the mark task being labeled to face, it is possible to which the mark personnel having mark face, some marks in the larger context Personnel mark face in lesser range, and inspection personnel may indicate that the mark of which mark personnel more meets the requirements and points out Suitably mark range should be much, and such information is exactly to check feedback information.It is appreciated that inspection personnel ties mark The process that fruit is checked can also be considered as an annotation process, wherein mark unit is annotation results, and mark requirement can be with For, such as, it is indicated that the unreasonable place of annotation results.Therefore, inspection personnel can also be considered as mark personnel.Interaction can be passed through Device will check that feedback information feeds back to labeled data processing system, and labeled data processing system will check that feedback information is fed back to All mark personnel for participating in the mark task, to be referred to by mark personnel, so that it is more acurrate to instruct mark personnel to provide Annotation results.
Optionally, step S130 may further include: be averaged to multiple annotation results, to obtain and mark task Relevant average annotation results.Average annotation results are for spot-check.When multiple annotation results pass through quality testing, can determine Average annotation results, and average annotation results are stored be used for after spot-check by selective examination personnel.It is understood that Labeled data processing system can equally safeguard selective examination queue, can will remove out above-mentioned matter by the mark task of quality testing Amount detection queue simultaneously stores, and selected section mark task can be spot-check from the mark task stored later.
The method of determination of average annotation results is including but not limited to following several:
If the centre of the corresponding points in multiple annotation results can be calculated if mark task is that mark is done on the image Point is as average annotation results;
If mark task can calculate separately multiple marks to mark several points on every frame image in one section of video As a result the intermediate point of the corresponding points in correspondence image in is as average annotation results;
If mark task is to mark several polygons on the image, the correspondence that can be calculated in multiple annotation results is polygon The intermediate point of the intersection of shape or the corresponding points on corresponding polygon is as average annotation results, it should be noted that for passing through If doing on labelling polygons is come for the mark task of labelling polygons, the mark sequence of different labeled personnel may be different It causes (such as the mark personnel having mark clockwise, and some mark personnel mark counterclockwise), in such a case, it is possible to press first The point on polygon that each mark personnel mark out is mapped according to the mark sequence of each mark personnel, then calculates average mark Infuse result;
If the task of mark is to select an option in multiple options, it is all only that multiple annotation results should be all consistent One the correct option selects the option as average annotation results;
If mark task is to select more than one option in multiple options, can choose in multiple annotation results All options in public option or multiple annotation results are as average annotation results, it is to be understood that It selects public option the accuracy of final annotation results can be made higher as average annotation results, selects multiple marks As a result all options in keep the accuracy of final annotation results lower as average annotation results, can according to need choosing Select suitable scheme;
If the task of mark is one section of specific text of input, all texts for including in multiple annotation results can be made For average annotation results;
If mark task is the range of age for marking the personage in image, the age in multiple annotation results can be calculated The common range (i.e. the intersection of the range of age) or total range (i.e. the union of the range of age) of range are as average annotation results;
If mark task is facial orientation, number of people orientation or the angle etc. marked in image, multiple marks can be calculated As a result the average value of facial orientation, number of people orientation or angle etc. in is as average annotation results.
Optionally, labeled data processing method may further include: selection mark task from mark set of tasks Set;Average annotation results relevant to each mark task in mark task subclass are sent, to be carried out by selective examination personnel Selective examination;Receive selective examination feedback information;And determine whether mark set of tasks passes through selective examination based on selective examination feedback information.When multiple When annotation results pass through quality testing, the average annotation results of multiple annotation results and corresponding mark task can be deposited Storage is got up.Then, a collection of mark task can be selected to combine from all mark tasks stored, mark is formed and appoint Business set.It can will mark all mark tasks in set of tasks and selective examination queue is added.Then selected from mark set of tasks Mark task subclass is selected for spot-check.Depending on the selection mode of mark task subclass can according to need, the present invention is not right This is limited.For example, can be sub as mark task from a certain proportion of mark task of random sampling in set of tasks that marks Set.The ratio can be preset, such as be set as 10%~50% etc., due to spot-check need to expend time of selective examination personnel at This, therefore the ratio that can be sampled determine according to actual needs.Then, by with mark task subclass in each mark task Relevant average annotation results are sent to selective examination personnel and carry out manual examination and verification.By selective examination, it may further determine that and appoint with mark Whether relevant average annotation results of being engaged in are qualified, to further increase mark accuracy.
Optionally, the mark task marked in set of tasks is that marking types are identical and label time is in preset period of time Mark task.The identical mark requirement referred in mark task of marking types be it is identical, it is different only to mark unit.Example Such as, for face mark, if the facial image in different labeled task is different, but mark requires identical, example Such as it is required to mark 20 key points on face, it may be considered that the mark that these mark tasks belong to same marking types is appointed Business.Further for example, mark requires to be the personage marked out in image if mark unit is the different image comprising personage The range of age, then such mark task also belongs to the mark task of same marking types.Label time, which refers to, receives mark As a result time, that is, mark personnel provide the time of its annotation results.Since practical mark situation may be with the time Variation, may be unstable, therefore label time can relatively should compare with reference value.
Optionally it is determined that it includes: to obtain selective examination based on selective examination feedback information to pass through that whether mark set of tasks, which passes through selective examination, Rate;And selective examination percent of pass compares with percent of pass threshold value, if selective examination percent of pass is greater than or equal to percent of pass threshold value, really Calibration note set of tasks is by selective examination, if selective examination percent of pass is less than percent of pass threshold value, it is determined that mark set of tasks does not pass through Selective examination.After determining whether mark set of tasks passes through selective examination, labeled data processing method be may further include: if mark Set of tasks is infused by selective examination, then is determined based on selective examination feedback information relevant to each mark task in mark set of tasks Final annotation results;And if mark set of tasks does not pass through selective examination, increases similarity threshold and go to step S120.
Selective examination feedback information is the information that selective examination personnel provide, and may include the letter of the value of direct instruction selective examination percent of pass Breath, such as point out the selective examination percent of pass of certain selective examination is how many.Selective examination feedback information can also include pointing out to receive the every of selective examination The information how a average annotation results correct with the presence or absence of mistake and/or mistake.It later, can be all according to receiving to spot-check The correct and error situations of average annotation results calculates selective examination percent of pass.Percent of pass threshold value can be any suitable value, example As percent of pass threshold value can be greater than or equal to 90% and be less than or equal to 99%.If spot-check percent of pass is less than percent of pass threshold Value, then the entire mark set of tasks of explanation does not pass through selective examination, can be by all mark tasks in the mark set of tasks from pumping It looks into queue and removes and rejoin quality testing queue, and increase the similarity threshold of these mark tasks, so as to this The requirement of the mark accuracy of a little mark tasks becomes higher.If spot-check percent of pass is greater than or equal to percent of pass threshold value, say Bright entire mark set of tasks passes through selective examination, it is believed that all mark tasks in the mark set of tasks are completed.Then, Final annotation results can be determined based on selective examination feedback information.For example, if not finding mistake in sampling procedure, it can be direct Using average annotation results relevant to each mark task in mark set of tasks as final annotation results, if spot-check Mistake is found in journey, then obtains final annotation results after the error label that can be found in filtering out selective examination.
Optionally, before step S110, labeled data processing method be may further include: be received and mark personnel's phase The identification information of pass;Mark task is selected from queue to be marked based on identification information, mark task is corresponding with mark personnel; And mark task is sent, to provide annotation results relevant to mark task by mark personnel;And it receives mark personnel and mentions The annotation results of confession are as one of multiple annotation results.As described above, queue to be marked can be safeguarded comprising several marks Task.Mark task can be sent to mark people by interactive device, such as user's interactive interface by labeled data processing system Member.The interaction of labeled data processing system and mark personnel can be realized further using such as application program (APP).Mark people Member can open the APP, input its identification information.Identification information can be any identity that can be used in identifying mark personnel Information, such as the account name and the password that mark personnel etc..Labeled data processing system can be based on identification information identification mark The identity of personnel, and then it is sent to it the mark task for being suitble to it.For example, the mark task that mark personnel have received and completed Mark personnel will be never sent to.Further, it is to be appreciated that mark personnel can also actively initiate to request, such as request Wish that the type of the mark task participated in, labeled data processing system can be suitble to mark people according to its request selecting and transmission The mark task of member.
As described above, the process that inspection personnel checks annotation results can also be considered as an annotation process, Inspection personnel can also be considered as mark personnel.Therefore, optionally, above-mentioned mark personnel and inspection personnel can be same lineup. That is, same people either mark personnel be also possible to inspection personnel, can according to need flexible conversion.In this way may be used To prevent mark personnel and the unmatched problem of inspection personnel's ratio, examined so as to avoid mark personnel very busy It looks into the very idle state of personnel or marks that personnel are very idle and state that inspection personnel is very busy.In the implementation Each step and the step in labeled data processing method described above are almost the same, but when carrying out quality testing, need to infuse Meaning is labeled for the different people of same mark task choosing and quality testing, that is, prevents the mark personnel of same mark task It is same people with inspection personnel.
According to a further aspect of the invention, a kind of labeled data processing system is provided.Fig. 4 shows an implementation according to the present invention The schematic block diagram of the labeled data processing system 400 of example.As shown in figure 4, labeled data processing system 400 includes computing device 410, similarity-rough set device 420, the first executive device 430 and the second executive device 440.
Computing device 410 is used to calculate the similarity of multiple annotation results relevant to the task that marks.Similarity-rough set dress 420 are set for similarity to compare with similarity threshold, if similarity is greater than or equal to similarity threshold, starts first Executive device 430 starts the second executive device 440 if similarity is less than similarity threshold.First executive device 430 is used In determining that multiple annotation results pass through quality testing.Second executive device 440 is for determining that multiple annotation results do not pass through quality Detection.
Appointing in computing device 410, similarity-rough set device 420, the first executive device 430 and the second executive device 440 What one or more can be realized using any suitable hardware, software and/or firmware, such as pass through specific integrated circuit (ASIC), field programmable gate array (FPGA), Digital Signal Processing (DSP) etc. are realized.Computing device 410, similarity-rough set Any one or more of device 420, the first executive device 430 and second executive device 440 can be handled with labeled data Other devices in system 400 are integrated or are realized using individual device.Computing device 410 and similarity-rough set dress Setting can be connected between 420 by the way of direct or indirect, such as be connected by wired or wireless way.Similarity-rough set dress Setting 420 and first can also be connected by the way of direct or indirect between executive device 430 or the second executive device 440, example Such as connected by wired or wireless way.
The labeled data processing system provided according to the present invention, due to detecting the matter of annotation results automatically using similarity Amount so that mark personnel are possible to the quality of timely learning annotation results, and then is possible to timely correction marking error, can To effectively improve mark accuracy.
Optionally, it is true to may further include acquisition device, judgment means and annotation results for labeled data processing system 400 Determine device (not shown).Acquisition device is for obtaining certain number of annotation results relevant to the task that marks.Judgment means are used In judging whether given number is equal to quantity threshold relevant to mark task, if given number is equal to quantity threshold, open Dynamic annotation results determining device, otherwise starts acquisition device.Annotation results determining device is used for certain number of annotation results It is determined as multiple annotation results and starts computing device.It is executed with above-mentioned computing device 410, similarity-rough set device 420, first Device 430 and the second executive device 440 similarly, any one in acquisition device, judgment means and annotation results determining device It is a or multiple can be realized using any suitable hardware, software and/or firmware.
Optionally, the second executive device 440 can be further used for increasing quantity threshold and start acquisition device.Work as determination When multiple annotation results do not pass through quality testing, quantity threshold can be increased, and obtain more annotation results for carrying out matter Amount detection.It is understood that annotation results are being manually checked and are being provided by inspection personnel due to as described above During checking feedback information, the requirement to the inspection precision of inspection personnel is not too much high, and quasi- to the mark of mark personnel The requirement of true property is some higher, therefore can be omitted manual inspection, by way of increasing quantity threshold, is directly examined using quality It surveys to guarantee to mark quality.Differing biggish mark task for multiple annotation results can require to increase quantity threshold, that is, increase It marks number.The more similar mark task of consistent or most of annotation results obvious for multiple annotation results, can To be directly averaged to obtain average annotation results, as described above to multiple annotation results.In this way, can be efficient as far as possible Using mark manpower, while can achieve the purpose that promote mark quality by reasonable more people's repeat marks.
Optionally, the second executive device 440 may further include annotation results sending module, check feedback reception module Sending module is fed back with checking.Annotation results sending module is for sending multiple annotation results, to be checked by inspection personnel. Check feedback reception module for checking feedback information.Check that feedback sending module checks feedback information for sending, with Inform mark personnel.Multiple annotation results similarity it is smaller so that annotation results do not pass through quality testing in the case where, can It is checked so that multiple annotation results are sent to inspection personnel.Inspection personnel, can be with after checking annotation results It provides and checks feedback information.Being manually checked by inspection personnel can be with the side of the quality using similarity detection annotation results Formula combines, to further increase the accuracy of annotation results.
Optionally, the first executive device 430 can be further used for being averaged to multiple annotation results, to obtain and mark The relevant average annotation results of note task.Average annotation results can be used for spot-check.When multiple annotation results pass through quality testing When, can determine average annotation results, and average annotation results are stored be used for after spot-check by selective examination personnel.On Text has illustrated the method for determination of average annotation results, and details are not described herein.
Optionally, labeled data processing system 400 may further include subclass selection device, the first sending device, First receiving device and selective examination pass through determining device (not shown).Subclass selection device is used to from mark set of tasks select Mark task subclass.First sending device is relevant average to each mark task marked in task subclass for sending Annotation results, to be spot-check by selective examination personnel.First receiving device, for receiving selective examination feedback information.Selective examination passes through determination Device, for determining whether mark set of tasks passes through selective examination based on selective examination feedback information.It is appreciated that labeled data processing system System 400 may include storage device (not shown), for storing the mark task for passing through quality testing.It can be from the institute stored Have and a collection of mark task is selected to combine in mark task, forms mark set of tasks.It then can be from mark task-set It is selected in conjunction, such as random sampling a part mark task is used to spot-check as mark task subclass.Pass through selective examination, Ke Yijin One step determines whether average annotation results relevant to mark task are qualified, to further increase mark accuracy.
Optionally, the mark task marked in set of tasks is that marking types are identical and label time is in preset period of time Mark task.Marking types are identical and the relatively more similar mark task of label time between can compare to have and refer to mutually Value and significance.
Optionally, selective examination may include that percent of pass obtains module and percent of pass comparison module by determining device.Percent of pass Module is obtained to be used to obtain selective examination percent of pass based on selective examination feedback information.Percent of pass comparison module will be for that will spot-check percent of pass and lead to The rate threshold value of mistake compares, if selective examination percent of pass is greater than or equal to percent of pass threshold value, it is determined that and mark set of tasks passes through selective examination, If spot-check percent of pass is less than percent of pass threshold value, it is determined that mark set of tasks does not pass through selective examination.Labeled data processing system 400 It may further include final annotation results determining device and similarity aggrandizement apparatus (not shown).Final annotation results determine dress It sets, if being determined based on selective examination feedback information by selective examination for mark set of tasks and marking each of set of tasks The relevant final annotation results of mark task.Similarity aggrandizement apparatus increases if not passing through selective examination for marking set of tasks Big similarity threshold simultaneously starts similarity-rough set device.
Selective examination percent of pass can directly be provided by selective examination personnel or the correct and mistake according to selective examination personnel to annotation results To determine, the present invention limits not to this for evaluation.It can determine whether mark set of tasks passes through pumping by spot-check percent of pass It looks into, i.e., whether average annotation results relevant to each mark task in mark set of tasks are qualified or say and meet the requirements, and And then it can choose and determine that final annotation results or selection increase similarity threshold and re-start quality testing.
Optionally, labeled data processing system 400 may further include the second reception device, mark task choosing dress It sets, the second sending device and third reception device (not shown).Second reception device is for receiving mark relevant to the personnel that mark Know information.Mark task choosing device is used to select mark task from queue to be marked based on identification information, mark task and Mark personnel are corresponding.Second sending device is relevant to mark task to be provided by mark personnel for sending mark task Annotation results.Third reception device is used to receive the annotation results of mark personnel offer as one of multiple annotation results.As above Described in text, labeled data processing system 400 can be suitble to the mark of mark personnel according to the identification information selection of mark personnel Mark task is simultaneously sent to mark personnel by task, is no longer repeated herein it.
The implementation that Fig. 1 to Fig. 3 describes each step of labeled data processing method provided by the invention is had been combined above Mode and its advantage etc., those of ordinary skill in the art pass through the detailed description read above for labeled data processing method, It will be appreciated that structure, the method for operation and the advantage of above-mentioned labeled data processing system 400, therefore which is not described herein again.
Method and apparatus are not inherently related to any particular computer, virtual system, or other device provided herein. Various general-purpose systems can also be used together with teachings based herein.As described above, it constructs required by this kind of system Structure be obvious.In addition, the present invention is also not directed to any particular programming language.It should be understood that can use various Programming language realizes summary of the invention described herein, and the description done above to language-specific is to disclose this hair Bright preferred forms.
In the instructions provided here, numerous specific details are set forth.It is to be appreciated, however, that implementation of the invention Example can be practiced without these specific details.In some instances, well known method, structure is not been shown in detail And technology, so as not to obscure the understanding of this specification.
Similarly, it should be understood that in order to simplify the disclosure and help to understand one or more of the various inventive aspects, Above in the description of exemplary embodiment of the present invention, each feature of the invention is grouped together into single implementation sometimes In example, figure or descriptions thereof.However, the disclosed method should not be interpreted as reflecting the following intention: i.e. required to protect Shield the present invention claims features more more than feature expressly recited in each claim.More precisely, as following Claims reflect as, inventive aspect is all features less than single embodiment disclosed above.Therefore, Thus the claims for following specific embodiment are expressly incorporated in the specific embodiment, wherein each claim itself All as a separate embodiment of the present invention.
Those skilled in the art will understand that in addition at least one in such feature and/or process or unit It, can be using any combination in this specification (including the accompanying claims, abstract and drawings) except excluding each other Disclosed all features and so disclosed any method or all process or units of device are combined.Unless in addition It is expressly recited, each feature disclosed in this specification (including adjoint claim, abstract and attached drawing) can be by offer phase The alternative features of same, equivalent or similar purpose replace.
In addition, it will be appreciated by those of skill in the art that although some embodiments described herein include other embodiments In included certain features rather than other feature, but the combination of the feature of different embodiments mean it is of the invention Within the scope of and form different embodiments.For example, in the following claims, embodiment claimed is appointed Meaning one of can in any combination mode come using.
Various component embodiments of the invention can be implemented in hardware, or to run on one or more processors Software module realize, or be implemented in a combination thereof.It will be understood by those of skill in the art that can be used in practice Microprocessor or digital signal processor (DSP) realize one in labeled data processing system according to an embodiment of the present invention The some or all functions of a little modules.The present invention be also implemented as a part for executing method as described herein or The program of device (for example, computer program and computer program product) of person's whole.It is such to realize that program of the invention be with It may be stored on the computer-readable medium, or may be in the form of one or more signals.Such signal can from because It downloads and obtains on spy's net website, be perhaps provided on the carrier signal or be provided in any other form.
It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and ability Field technique personnel can be designed alternative embodiment without departing from the scope of the appended claims.In the claims, Any reference symbol between parentheses should not be configured to limitations on claims.Word "comprising" does not exclude the presence of not Element or step listed in the claims.Word "a" or "an" located in front of the element does not exclude the presence of multiple such Element.The present invention can be by means of including the hardware of several different elements and being come by means of properly programmed computer real It is existing.In the unit claims listing several devices, several in these devices can be through the same hardware branch To embody.The use of word first, second, and third does not indicate any sequence.These words can be explained and be run after fame Claim.

Claims (16)

1. a kind of labeled data processing method, comprising:
Step S110: the similarity of multiple annotation results relevant to mark task is calculated;
Step S120: the similarity is compared with similarity threshold, if the similarity is more than or equal to described similar Threshold value is spent, then goes to step S130, if the similarity is less than the similarity threshold, goes to step S140;
Step S130: determine that the multiple annotation results pass through quality testing;And
Step S140: determine that the multiple annotation results do not pass through quality testing;
Wherein, before the step S110, the labeled data processing method further comprises:
Step S102: certain number of annotation results relevant to the mark task are obtained;
Step S104: judging whether the given number is equal to quantity threshold relevant to the mark task, if the spy Fixed number mesh is equal to the quantity threshold, then goes to step S106, otherwise go to step S102;And
Step S106: the certain number of annotation results are determined as the multiple annotation results and go to the step S110。
2. labeled data processing method as described in claim 1, wherein the step S140 further comprises: described in increase Quantity threshold simultaneously goes to the step S102.
3. labeled data processing method as claimed in claim 1 or 2, wherein
The step S140 further comprises:
The multiple annotation results are sent, to be checked by inspection personnel;
Check feedback information;And
The inspection feedback information is sent, to inform mark personnel.
4. labeled data processing method as described in claim 1, wherein the step S130 further comprises: to described more A annotation results are averaged, to obtain average annotation results relevant to the mark task;
Wherein, the average annotation results are for spot-check.
5. labeled data processing method as claimed in claim 4, wherein the labeled data processing method further comprises:
Mark task subclass is selected from mark set of tasks;
Send average annotation results relevant to each mark task in the mark task subclass, with by spot-check personnel into Row selective examination;
Receive selective examination feedback information;And
Determine whether the mark set of tasks passes through selective examination based on the selective examination feedback information.
6. labeled data processing method as claimed in claim 5, wherein
Whether the determination mark set of tasks, which passes through selective examination, includes:
Selective examination percent of pass is obtained based on the selective examination feedback information;And
The selective examination percent of pass is compared with percent of pass threshold value, if the selective examination percent of pass is greater than or equal to the percent of pass Threshold value, it is determined that the mark set of tasks is by selective examination, if the selective examination percent of pass is less than the percent of pass threshold value, really The fixed mark set of tasks does not pass through selective examination;
After whether the determination mark set of tasks passes through selective examination, the labeled data processing method is further wrapped It includes:
If the mark set of tasks is by selective examination, based on selective examination feedback information determination and the mark set of tasks In the relevant final annotation results of each mark task;And
If the mark set of tasks does not pass through selective examination, increases the similarity threshold and go to the step S120.
7. such as labeled data processing method described in claim 5 or 6, wherein the mark task in the mark set of tasks It is the mark task that marking types are identical and label time is in preset period of time.
8. labeled data processing method as described in claim 1, wherein before the step S110, the labeled data Processing method further comprises:
Receive identification information relevant to mark personnel;
The mark task, the mark task and the mark personnel are selected from queue to be marked based on the identification information It is corresponding;
The mark task is sent, to provide annotation results relevant to the mark task by the mark personnel;And
The annotation results of the mark personnel offer are received as one of the multiple annotation results.
9. a kind of labeled data processing system, including computing device, similarity-rough set device, the first executive device and second execute Device,
The computing device is used to calculate the similarity of multiple annotation results relevant to the task that marks;
The similarity-rough set device for the similarity to be compared with similarity threshold, if the similarity be greater than or Equal to the similarity threshold, then start the first executive device, if the similarity is less than the similarity threshold, starts Second executive device;
First executive device is for determining that the multiple annotation results pass through quality testing;And
Second executive device is for determining that the multiple annotation results do not pass through quality testing;
Wherein, the labeled data processing system further comprises acquisition device, judgment means and annotation results determining device:
The acquisition device is for obtaining certain number of annotation results relevant to the mark task;
The judgment means for judging whether the given number is equal to quantity threshold relevant to the mark task, if The given number is equal to the quantity threshold, then starts the annotation results determining device, otherwise start the acquisition device; And
The annotation results determining device is used to the certain number of annotation results being determined as the multiple annotation results simultaneously Start the computing device.
10. labeled data processing system as claimed in claim 9, wherein second executive device is further used for increasing The quantity threshold simultaneously starts the acquisition device.
11. the labeled data processing system as described in claim 9 or 10, wherein second executive device further comprises:
Annotation results sending module, for sending the multiple annotation results, to be checked by inspection personnel;
Feedback reception module is checked, for checking feedback information;And
Feedback sending module is checked, for sending the inspection feedback information, to inform mark personnel.
12. labeled data processing system as claimed in claim 9, wherein first executive device is further used for institute It states multiple annotation results to be averaged, to obtain average annotation results relevant to the mark task;
Wherein, the average annotation results are for spot-check.
13. labeled data processing system as claimed in claim 12, wherein the labeled data processing system is further wrapped It includes:
Subclass selection device, for selecting mark task subclass from mark set of tasks;
First sending device, for sending average mark knot relevant to each mark task in the mark task subclass Fruit, to be spot-check by selective examination personnel;
First receiving device, for receiving selective examination feedback information;And
Selective examination is by determining device, for determining whether the mark set of tasks passes through pumping based on the selective examination feedback information It looks into.
14. labeled data processing system as claimed in claim 13, wherein
The selective examination by determining device includes:
Percent of pass obtains module, for obtaining selective examination percent of pass based on the selective examination feedback information;And
Percent of pass comparison module, for the selective examination percent of pass to compare with percent of pass threshold value, if the selective examination percent of pass More than or equal to the percent of pass threshold value, it is determined that the mark set of tasks is by selective examination, if the selective examination percent of pass is small In the percent of pass threshold value, it is determined that the mark set of tasks does not pass through selective examination;
The labeled data processing system further comprises:
Final annotation results determining device, if fed back by selective examination based on the selective examination for the mark set of tasks The determining final annotation results relevant to each mark task in the mark set of tasks of information;And
Similarity aggrandizement apparatus increases the similarity threshold simultaneously if not passing through selective examination for the mark set of tasks Start the similarity-rough set device.
15. labeled data processing system according to claim 13 or 14, wherein the mark in the mark set of tasks is appointed Business is the mark task that marking types are identical and label time is in preset period of time.
16. labeled data processing system as claimed in claim 9, wherein the labeled data processing system further comprises:
Second reception device, for receiving identification information relevant to the personnel that mark;
Task choosing device is marked, it is described for selecting the mark task from queue to be marked based on the identification information Mark task is corresponding with the mark personnel;
Second sending device is related to the mark task to be provided by the mark personnel for sending the mark task Annotation results;And
Third reception device, for receiving the annotation results of the mark personnel offer as one of the multiple annotation results.
CN201510744484.5A 2015-11-03 2015-11-03 Labeled data processing method and labeled data processing system Active CN105404896B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510744484.5A CN105404896B (en) 2015-11-03 2015-11-03 Labeled data processing method and labeled data processing system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510744484.5A CN105404896B (en) 2015-11-03 2015-11-03 Labeled data processing method and labeled data processing system

Publications (2)

Publication Number Publication Date
CN105404896A CN105404896A (en) 2016-03-16
CN105404896B true CN105404896B (en) 2019-04-19

Family

ID=55470371

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510744484.5A Active CN105404896B (en) 2015-11-03 2015-11-03 Labeled data processing method and labeled data processing system

Country Status (1)

Country Link
CN (1) CN105404896B (en)

Families Citing this family (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105975980B (en) * 2016-04-27 2019-04-05 百度在线网络技术(北京)有限公司 The method and apparatus of monitoring image mark quality
WO2018000269A1 (en) * 2016-06-29 2018-01-04 深圳狗尾草智能科技有限公司 Data annotation method and system based on data mining and crowdsourcing
CN108229772A (en) * 2016-12-14 2018-06-29 北京国双科技有限公司 Mark processing method and processing device
CN108537240A (en) * 2017-03-01 2018-09-14 华东师范大学 Commodity image semanteme marking method based on domain body
CN108197658B (en) * 2018-01-11 2020-08-14 阿里巴巴集团控股有限公司 Image annotation information processing method, device, server and system
CN108154197B (en) * 2018-01-22 2022-03-15 腾讯科技(深圳)有限公司 Method and device for realizing image annotation verification in virtual scene
CN108364018A (en) * 2018-01-25 2018-08-03 北京墨丘科技有限公司 A kind of guard method of labeled data, terminal device and system
CN108446695B (en) * 2018-02-06 2022-02-11 创新先进技术有限公司 Method and device for data annotation and electronic equipment
CN110400029A (en) * 2018-04-24 2019-11-01 北京京东尚科信息技术有限公司 A kind of method and system of mark management
CN109034188B (en) * 2018-06-15 2021-11-05 北京金山云网络技术有限公司 Method and device for acquiring machine learning model, equipment and storage medium
CN108960297B (en) * 2018-06-15 2021-07-30 北京金山云网络技术有限公司 Picture labeling method, labeling device, equipment and storage medium
CN109214343B (en) * 2018-09-14 2021-03-09 北京字节跳动网络技术有限公司 Method and device for generating face key point detection model
CN109784381A (en) * 2018-12-27 2019-05-21 广州华多网络科技有限公司 Markup information processing method, device and electronic equipment
CN111461154A (en) * 2019-01-22 2020-07-28 北京京东尚科信息技术有限公司 Method and device for labeling data
CN110188769B (en) * 2019-05-14 2023-09-05 广州虎牙信息科技有限公司 Method, device, equipment and storage medium for auditing key point labels
CN110210526A (en) * 2019-05-14 2019-09-06 广州虎牙信息科技有限公司 Predict method, apparatus, equipment and the storage medium of the key point of measurand
CN110263934B (en) * 2019-05-31 2021-08-06 中国信息通信研究院 Artificial intelligence data labeling method and device
CN112528609A (en) * 2019-08-29 2021-03-19 北京声智科技有限公司 Method, system, equipment and medium for quality inspection of labeled data
CN110750523A (en) * 2019-09-12 2020-02-04 苏宁云计算有限公司 Data annotation method, system, computer equipment and storage medium
CN110796185B (en) * 2019-10-17 2022-08-26 北京爱数智慧科技有限公司 Method and device for detecting image annotation result
CN110880021B (en) * 2019-11-06 2021-03-16 创新奇智(北京)科技有限公司 Model-assisted data annotation system and annotation method
CN114036036A (en) * 2020-07-21 2022-02-11 南京行者易智能交通科技有限公司 Automatic test analysis system and method based on automatic labeling and terminal equipment
CN111966674B (en) * 2020-08-25 2024-03-15 北京金山云网络技术有限公司 Method and device for judging eligibility of annotation data and electronic equipment
CN111986194B (en) * 2020-09-03 2024-07-05 深圳平安智慧医健科技有限公司 Medical annotation image detection method and device, electronic equipment and storage medium
CN112084241B (en) * 2020-09-23 2024-10-01 北京金山云网络技术有限公司 Labeling data screening method and device and electronic equipment
CN111932536B (en) * 2020-09-29 2021-03-05 平安国际智慧城市科技股份有限公司 Method and device for verifying lesion marking, computer equipment and storage medium
CN112347990B (en) * 2020-11-30 2024-02-02 重庆空间视创科技有限公司 Multi-mode-based intelligent manuscript examining system and method
WO2023126280A1 (en) 2021-12-30 2023-07-06 Robert Bosch Gmbh A system and method for quality check of labelled images

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104795077A (en) * 2015-03-17 2015-07-22 北京航空航天大学 Voice annotation quality consistency detection method

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103324620B (en) * 2012-03-20 2016-04-27 北京百度网讯科技有限公司 A kind of method and apparatus that annotation results is rectified a deviation
CN102867025A (en) * 2012-08-23 2013-01-09 百度在线网络技术(北京)有限公司 Method and device for acquiring picture marking data
CN103678685A (en) * 2013-12-26 2014-03-26 华为技术有限公司 Webpage labeling method and device
CN104573359B (en) * 2014-12-31 2017-08-08 浙江大学 A kind of mass-rent labeled data integration method of task based access control difficulty and mark person's ability
CN104615705B (en) * 2015-01-30 2018-09-18 百度在线网络技术(北京)有限公司 Method and device

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104795077A (en) * 2015-03-17 2015-07-22 北京航空航天大学 Voice annotation quality consistency detection method

Also Published As

Publication number Publication date
CN105404896A (en) 2016-03-16

Similar Documents

Publication Publication Date Title
CN105404896B (en) Labeled data processing method and labeled data processing system
US20200356818A1 (en) Logo detection
CN110046546B (en) Adaptive sight tracking method, device and system and storage medium
CN109800320B (en) Image processing method, device and computer readable storage medium
EP3843036B1 (en) Sample labeling method and device, and damage category identification method and device
EP4372698A3 (en) Systems and methods for image processing
CN108875734A (en) Liver canceration localization method, device and storage medium
CN111709421B (en) Bird identification method, bird identification device, computer equipment and storage medium
EP3057303A3 (en) Information processing system, information processing apparatus, and information processing method
EP2551796A3 (en) Image processing device identifying attribute of region included in image
CN112101124B (en) Sitting posture detection method and device
CN106327546B (en) Method and device for testing face detection algorithm
EP2497545A3 (en) Information processing program, information processing system, and information processing method
CN112801976B (en) Detection system and method for correcting binding mispasting of books and periodicals and computer device
EP2557539A3 (en) Image processing apparatus, image processing method, and image processing program
US12014829B2 (en) Image processing and presentation techniques for enhanced proctoring sessions
EP2433685A3 (en) Information processing apparatus capable of associating data with each other, information processing system, and information processing program
WO2020034672A1 (en) Method and apparatus for determining region of interest of user in image
CN110210314B (en) Face detection method, device, computer equipment and storage medium
EP2085930A3 (en) Image processing apparatus and method thereof
CN109598270A (en) Distort recognition methods and the device, storage medium and processor of text
CN112328822A (en) Picture pre-labeling method and device and terminal equipment
CN103593643B (en) A kind of method and system of image recognition
CN110956656A (en) Spindle positioning method based on depth target detection
CN110971895B (en) Video jitter detection method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information
CB02 Change of applicant information

Address after: 100190 Beijing, Haidian District Academy of Sciences, South Road, No. 2, block A, No. 313

Applicant after: MEGVII INC.

Applicant after: Beijing maigewei Technology Co., Ltd.

Address before: 100190 Beijing, Haidian District Academy of Sciences, South Road, No. 2, block A, No. 313

Applicant before: MEGVII INC.

Applicant before: Beijing aperture Science and Technology Ltd.

GR01 Patent grant
GR01 Patent grant