CN110471597A - A kind of data mask method and device, computer readable storage medium - Google Patents

A kind of data mask method and device, computer readable storage medium Download PDF

Info

Publication number
CN110471597A
CN110471597A CN201910678287.6A CN201910678287A CN110471597A CN 110471597 A CN110471597 A CN 110471597A CN 201910678287 A CN201910678287 A CN 201910678287A CN 110471597 A CN110471597 A CN 110471597A
Authority
CN
China
Prior art keywords
data
label
mark
chosen
mouse
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910678287.6A
Other languages
Chinese (zh)
Inventor
徐安华
马瑞璇
路德龙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Mininglamp Software System Co ltd
Original Assignee
Beijing Mininglamp Software System Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Mininglamp Software System Co ltd filed Critical Beijing Mininglamp Software System Co ltd
Priority to CN201910678287.6A priority Critical patent/CN110471597A/en
Publication of CN110471597A publication Critical patent/CN110471597A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0481Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • G06F3/04842Selection of displayed objects or displayed text elements

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

This application discloses a kind of data mask method and devices, computer readable storage medium, which comprises monitors and receive the mouse action of user;Detect whether received mouse action is the operation of predefined label for labelling and current mouse is chosen data are labeled data;If the data that received mouse action is the operation of predefined label for labelling and current mouse is chosen are labeled data, overlapping mark, and the label marked according to mark sequence in the side Layering manifestation of the data are carried out to the data that current mouse is chosen.The application is by, in the label of the side Layering manifestation overlapping mark of labeled data, realizing the overlapping mark of data, and have preferable label display effect according to mark sequence.

Description

A kind of data mask method and device, computer readable storage medium
Technical field
This application involves but be not limited to natural language processing (Natural Language Processing, NLP) technology Field more particularly to a kind of data mask method and device, computer readable storage medium.
Background technique
With the research and development of big data and artificial intelligence (Artificial Intelligence, AI), increasingly More enterprises handles Enterprise Data problem using the relevant technology of NLP.Data are the key that NLP, the types of data in addition to Comprising being stored in outside the structural data of database, there are also being greatly non-structured data, such as: text class number According to.Currently, many major companies can provide all kinds of service models such as Entity recognition, relation recognition, to avoid data annotation process To obtain the value of text class data.These service models are obtained by internet data training mostly, internet data Distinguishing feature is that word content is abundant and text is from a wealth of sources, still, due to the word habit and writing style of internet data There are larger differences with enterprise-level text data, for enterprise's application, it is desirable to the value of internet data is obtained, it is just necessary Establish the NLP model for being suitable for respective field.
And NLP model is established, the only way which must be passed: data mark cannot be avoided.It is being marked by a large amount of data Afterwards, the data marked have many purposes.Data mark in simple terms, exactly label to data.It is right for NLP It is very common that entity in data, relationship, which carry out data mark, for example, as shown in Figure 1, in one section of text, the word of appearance Symbol string " March 25 " can be labeled as date (Date), and character string " Gao Nana " can be labeled as name (Name) etc..
In data annotation process, there may come a time when meeting for same character string, there are many different labels, for example, for word For symbol string " Gao Nana ", " Gao Nana " is a name as a whole, still, if " Gao Nana " split into: "high", " Na Na ", at this point, "high" can be labeled as surname, " Na Na " can be marked and be run after fame.Therefore, for same character string " Gao Na For Na ", a part of "high" as " Gao Nana " can not only be labeled as name, but also can be labeled as surname;" Na Na " conduct The a part of " Gao Nana " can not only be labeled as name, but also can mark and run after fame.Therefore, in this case, how real research is The overlapping mark of existing data is it is necessary to and have certain practical significance.
Summary of the invention
In order to solve the above-mentioned technical problem, this application provides a kind of data mask method and device, computer-readable deposit Storage media can be realized the mark of the overlapping to same data.
In order to solve the above-mentioned technical problem, the technical solution of the embodiment of the present application is achieved in that
The embodiment of the invention provides a kind of data mask methods, comprising:
Monitor and receive the mouse action of user;
Whether detect whether received mouse action is the operation of predefined label for labelling and current mouse is chosen data For labeled data;
If received mouse action is that the data that predefined label for labelling operates and current mouse is chosen are to have marked Data then carry out overlapping mark to the data that current mouse is chosen, and according to mark sequence the data side Layering manifestation The label of mark.
In a kind of exemplary embodiment, the mark marked according to mark sequence in the side Layering manifestation of the data Label, comprising:
Which weight label that the label currently marked is the data chosen to the current mouse detected;
If the label currently marked is the n-th heavy label of the data chosen to the current mouse, will currently mark Label be shown in the n-th layer position above the vertical direction for the data that the current mouse is chosen or below vertical direction N-layer position, wherein n is the natural number greater than 1.
In a kind of exemplary embodiment, when showing the label, the different labels uses different highlighted back Scape color is shown, and the length of the label is identical as the length of the data of the label for labelling.
In a kind of exemplary embodiment, when the data chosen to current mouse carry out overlapping mark, the side Method further include: using the highlighted background color of the first weight label of the labeled data, be highlighted described marked Data.
In a kind of exemplary embodiment, when showing the label, each label is shown in a geometric figure In block, the geometric figure block is polygon mat, round rectangle block or elliptical blocks.
In a kind of exemplary embodiment, when the data chosen to current mouse carry out overlapping mark, the side Method further include:
Increase the line space of the labeled data display label side of the row, described in being used to show by layer The label of mark.
The embodiment of the invention also provides a kind of computer readable storage medium, the computer-readable recording medium storage Have one or more program, one or more of programs can be executed by one or more processor, with realize such as with The step of upper described in any item data mask methods.
The embodiment of the invention also provides a kind of data annotation equipments, including processor and memory, in which:
The processor is for executing the data marking program stored in memory, to realize as described in any of the above item The step of data mask method.
The embodiment of the invention also provides a kind of data annotation equipments, including detection module and labeling module, in which:
Detection module detects whether received mouse action is predefined for monitoring and receiving the mouse action of user Label for labelling operation and the data chosen of current mouse whether be labeled data, when received mouse action is predefined When the data that label for labelling operation and current mouse are chosen are unlabeled data, the first notice is sent to labeling module;Work as reception Mouse action be the operation of predefined label for labelling and data that current mouse is chosen be labeled data when, it is logical to send second Know to labeling module;
Labeling module carries out the first weight to the data that current mouse is chosen for receiving the first notice of detection module Label for labelling, in the label of the side of data display mark;The second notice for receiving detection module, chooses current mouse The data label that carries out overlapping mark, and marked according to mark sequence in the side Layering manifestation of the data.
In a kind of exemplary embodiment, the labeling module according to mark sequence the data side Layering manifestation The label of mark, comprising:
Which weight label that the label currently marked is the data chosen to the current mouse detected;
If the label currently marked is the n-th heavy label of the data chosen to the current mouse, will currently mark Label be shown in the n-th layer position above the vertical direction for the data that the current mouse is chosen or below vertical direction N-layer position, wherein n is the natural number greater than 1.
The technical solution of the application, has the following beneficial effects:
Data mask method and device, computer readable storage medium provided by the present application, by existing according to mark sequence The label of the side Layering manifestation overlapping mark of labeled data, realizes the overlapping mark of data, and has preferable label Bandwagon effect.
Other features and advantage will illustrate in the following description, also, partly become from specification It obtains it is clear that being understood and implementing the application.Other advantages of the application can be by specification, claims And scheme described in attached drawing is achieved and obtained.
Detailed description of the invention
Attached drawing is used to provide the understanding to technical scheme, and constitutes part of specification, with the application's Embodiment is used to explain the technical solution of the application together, does not constitute the limitation to technical scheme.
Fig. 1 is text structure schematic diagram of one of the relevant technologies by label for labelling;
Fig. 2 is a kind of flow diagram of data mask method of the embodiment of the present application;
Fig. 3 is a kind of text structure schematic diagram by label for labelling of the embodiment of the present application;
Fig. 4 is a kind of structural schematic diagram of data annotation equipment of the embodiment of the present application.
Specific embodiment
This application describes multiple embodiments, but the description is exemplary, rather than restrictive, and for this It is readily apparent that can have more in the range of embodiments described herein includes for the those of ordinary skill in field More embodiments and implementation.Although many possible feature combinations are shown in the attached drawings, and in a specific embodiment It is discussed, but many other combinations of disclosed feature are also possible.Unless the feelings specially limited Other than condition, any feature or element of any embodiment can be with any other features or element knot in any other embodiment It closes and uses, or any other feature or the element in any other embodiment can be substituted.
The application includes and contemplates the combination with feature known to persons of ordinary skill in the art and element.The application is It can also combine with any general characteristics or element through disclosed embodiment, feature and element, be defined by the claims with being formed Unique scheme of the invention.Any feature or element of any embodiment can also be with features or member from other scheme of the invention Part combination, to form the unique scheme of the invention that another is defined by the claims.It will thus be appreciated that showing in this application Out and/or any feature of discussion can be realized individually or in any suitable combination.Therefore, in addition to according to appended right It is required that and its other than the limitation done of equivalent replacement, embodiment is not limited.Furthermore, it is possible in the guarantor of appended claims It carry out various modifications and changes in shield range.
In addition, method and/or process may be rendered as spy by specification when describing representative embodiment Fixed step sequence.However, in the degree of this method or process independent of the particular order of step described herein, this method Or process should not necessarily be limited by the step of particular order.As one of ordinary skill in the art will appreciate, other steps is suitable Sequence is also possible.Therefore, the particular order of step described in specification is not necessarily to be construed as limitations on claims.This Outside, the claim for this method and/or process should not necessarily be limited by the step of executing them in the order written, this field skill Art personnel are it can be readily appreciated that these can sequentially change, and still remain in the spirit and scope of the embodiment of the present application It is interior.
Natural language processing, be the data such as voice, text are handled, are converted, a major class problem of Extracting Information General name.Entity, emphasis refers to name Entity recognition (the Named Entity in natural language processing field here Recognition, NER), but it is not limited to name entity.Relationship, here emphasis refer to entity in natural language processing field with Relationship between entity.Entity recognition, from input text in extract the entity with certain semantic information, as name, the date, Place, organization etc..Relation recognition, from the pass extracted in input text between the entity and entity with certain semantic information System, such as parent and child, employ, hold a post, geographical relationship.Training, refer in machine learning field, machine according to training data with And loss function updates the process of model parameter.Chinese word segmentation (Chinese Word Segmentation, CWS) refers to One chinese character sequence is cut into individual word one by one.Participle be exactly by continuous word sequence according to certain specification again It is combined into the process of word sequence.
One data mask method of embodiment
As shown in Fig. 2, being included the following steps: according to a kind of data mask method of the embodiment of the present application
Step 201: monitoring and receive the mouse action of user;
In a kind of exemplary embodiment, the mouse action of the user include left mouse button click, left mouse button double-click, Mouse drag and drop, right mouse button click etc. mouse action.
Step 202: detecting the whether predefined label for labelling operation of received mouse action and current mouse is chosen Whether data are labeled data;
If received mouse action is not predefined label for labelling operation, return step 201;
If received mouse action is that the data that predefined label for labelling operates and current mouse is chosen are to have marked Data then go to step 203;
If received mouse action is that the data that predefined label for labelling operates and current mouse is chosen are not mark Data then go to step 204;
In a kind of exemplary embodiment, the predefined label for labelling operation includes: double click or mouse drag and drop Data, left mouse button are chosen to click selection tag types.
Step 203: overlapping mark being carried out to the data that current mouse is chosen, and according to mark sequence in the side of the data The label of Layering manifestation mark, return step 201;
It should be noted that overlapping mark described herein is referred to institute in one or more character strings in text All labels contained are all labeled, and annotation results can be superimposed and can be explicitly shown.For example, it is assumed that going out in one section of text Existing date information " on October 20th, 2014 ", then " on October 20th, 2014 " integrally can be labeled as the first heavy label: date, Meanwhile in " on October 20th, 2014 ", " 2014 " can be regarded as the time again, can be " 2014 " marks the therefore Double label: year similarly can mark the second heavy label: the moon for " October ", be " 20 days " the second heavy labels of mark: day.Due to It is not overlapped between " 2014 ", " October ", " 20 days ", so the year, month, day label of each mark is second to mark again Label.
In a kind of exemplary embodiment, the mark marked according to mark sequence in the side Layering manifestation of the data Label, comprising:
Which weight label that the label currently marked is the data chosen to the current mouse detected;
If the label currently marked is the n-th heavy label of the data chosen to the current mouse, will currently mark Label be shown in the n-th layer position above the vertical direction for the data that the current mouse is chosen or below vertical direction N-layer position, wherein n is the natural number greater than 1.
Data mask method provided by the present application, can carry out multiple mark to the character string in text, and existing mark Label mask method can only carry out one to the character string in text and mark again.As shown in figure 3, the character string " Hong-Kong " in text With triple labels, the first heavy label are as follows: national label is marked to " China ", city label is marked to " Hong Kong ";Second marks again Label are as follows: birthplace label is integrally marked to " Hong-Kong ";Third weight label are as follows: " Hong-Kong " is integrally marked BornLocation label.
Data mask method provided by the present application is supported to mark multiple labels to a character string.When a character string is only marked When one label of note, can by be highlighted or the forms such as underscore display mark label;When a character string needs When marking multiple labels, on the basis of the label of existing mark, mark text vertical direction on, by different colors, The label of label (underscore, upper scribing line etc.) hierarchical display mark.
In a kind of exemplary embodiment, when showing the label, the different labels uses different highlighted back Scape color is shown, and the length of the label is identical as the length of the data of the label for labelling.
In a kind of exemplary embodiment, when the data chosen to current mouse carry out overlapping mark, the side Method further include: using the highlighted background color of the first weight label of the labeled data, be highlighted described marked Data.
For example, when carrying out label for labelling to the character string " Hong-Kong " in Fig. 3 text, it is possible, firstly, to by character string " Hong-Kong " regards an entirety as, carries out first to " Hong-Kong " and marks again, at this point it is possible to for the " Chinese Fragrant in text Port " and the first weight label add blue highlight display background, and the font color of " Hong-Kong " corresponding first weight label can be with Using from its color that be highlighted background color contrast different.Then, second is carried out to character string " China " and " Hong Kong " It marks again, since " China " and " Hong Kong " does not have data overlap, the two belongs to second and marks again, at this point it is possible to be The second weight label of " China " and " Hong Kong " adds the different backgrounds that is highlighted, such as purple and green respectively, " China " and The font color of " Hong Kong " corresponding second weight label can use that be highlighted background color contrast from it different respectively Color.Similarly, third weight or even quadruple can be carried out to character string " China ", character string " Hong Kong " etc. in a similar way Label for labelling.
In a kind of exemplary embodiment, when showing the label, each label is shown in a geometric figure block, The geometric figure block can be the graph block of polygon mat, round rectangle block, elliptical blocks or other arbitrary shapes.
As shown in figure 3, each label is displayed in a rectangular block.
During actual label for labelling, the color of the corresponding geometric figure block of each label can be according to bookmark name Difference and have respective color.By using the mode of the geometric figure block Layering manifestation of different colours, Ke Yi When multiple label for labelling, the label for labelling information of current labeled data is clearly showed that, and will not make to mark page of text In a jumble.
In a kind of exemplary embodiment, when the data chosen to current mouse carry out overlapping mark, the side Method further include:
Increase the line space of the labeled data display label side of the row, described in being used to show by layer The label of mark.
In front end is shown, other than the label for needing Layering manifestation to mark, it is also necessary to the mark marked according to current text The tuple of label dynamically adjusts line space, space is saved, to reach best mark bandwagon effect.
Step 204: the first heavy label for labelling being carried out to the data that current mouse is chosen, shows mark in the side of the data Label, return step 201.
Specifically, first can be carried out to the data that current mouse is chosen by existing label for labelling method herein to mark again Label mark.
Embodiment two: computer readable storage medium
The embodiment of the present application also provides a kind of computer readable storage medium, the computer-readable recording medium storage Have one or more program, one or more of programs can be executed by one or more processor, with realize such as with The step of upper described in any item data mask methods.
Embodiment three: data annotation equipment
The embodiment of the present application also provides a kind of data annotation equipments, including processor and memory, in which: the processing Device is for executing the program stored in memory, the step of to realize the data mask method as described in any of the above item.
Example IV: data annotation equipment
As shown in figure 4, according to a kind of data annotation equipment of the embodiment of the present application, including detection module 401 and mark mould Block 402, in which:
Detection module 401 detects whether received mouse action is predetermined for monitoring and receiving the mouse action of user Whether the label for labelling operation of justice and the data chosen of current mouse are labeled data, when received mouse action is predefined Label for labelling operation and the data chosen of current mouse when being unlabeled data, send the first notice to labeling module 402;When The data that received mouse action is the operation of predefined label for labelling and current mouse is chosen be labeled data when, send the Two notify to labeling module 402;
Labeling module 402 carries out the data that current mouse is chosen for receiving the first notice of detection module 401 First heavy label for labelling, in the label of the side of data display mark;The second notice for receiving detection module 401, to working as The data that preceding mouse is chosen carry out overlapping mark, and the label marked according to mark sequence in the side Layering manifestation of the data.
In a kind of exemplary embodiment, the predefined label for labelling operation includes: double click or mouse drag and drop Data, left mouse button are chosen to click selection tag types.
In a kind of exemplary embodiment, the labeling module 402 is layered according to mark sequence in the side of the data Show the label of mark, comprising:
Which weight label that the label currently marked is the data chosen to the current mouse detected;
If the label currently marked is the n-th heavy label of the data chosen to the current mouse, will currently mark Label be shown in the n-th layer position above the vertical direction for the data that the current mouse is chosen or below vertical direction N-layer position, wherein n is the natural number greater than 1.
Data annotation equipment provided by the present application, can carry out multiple mark to the character string in text, and existing mark Label annotation equipment can only carry out one to the character string in text and mark again.As shown in figure 3, the character string " Hong-Kong " in text With triple labels, the first heavy label are as follows: national label is marked to " China ", city label is marked to " Hong Kong ";Second marks again Label are as follows: birthplace label is integrally marked to " Hong-Kong ";Third weight label are as follows: " Hong-Kong " is integrally marked BornLocation label.
Data annotation equipment provided by the present application is supported to mark multiple labels to a character string.When a character string is only marked When one label of note, can by be highlighted or the forms such as underscore display mark label;When a character string needs When marking multiple labels, on the basis of the label of existing mark, mark text vertical direction on, by different colors, The label of label (underscore, upper scribing line etc.) hierarchical display mark.
In a kind of exemplary embodiment, for the labeling module 402 when showing the label, different labels is not using Same highlighted background color is shown, and the length of the label is identical as the length of the data of the label for labelling.
In a kind of exemplary embodiment, the labeling module 402 carries out overlapping mark in the data chosen to current mouse When note, using the highlighted background color of the first weight label of the labeled data, it is highlighted the labeled data.
For example, when carrying out label for labelling to the character string " Hong-Kong " in Fig. 3 text, it is possible, firstly, to by character string " Hong-Kong " regards an entirety as, carries out first to " Hong-Kong " and marks again, at this point it is possible to for the " Chinese Fragrant in text Port " and the first weight label add blue highlight display background, and the font color of " Hong-Kong " corresponding first weight label can be with Using from its color that be highlighted background color contrast different.Then, second is carried out to character string " China " and " Hong Kong " It marks again, since " China " and " Hong Kong " does not have data overlap, the two belongs to second and marks again, at this point it is possible to be The second weight label of " China " and " Hong Kong " adds the different backgrounds that is highlighted, such as purple and green respectively, " China " and The font color of " Hong Kong " corresponding second weight label can use that be highlighted background color contrast from it different respectively Color.Similarly, third weight or even quadruple can be carried out to character string " China ", character string " Hong Kong " etc. in a similar way Label for labelling.
In a kind of exemplary embodiment, for the labeling module 402 when showing the label, each label is shown in one In a geometric figure block, the geometric figure block can be polygon mat, round rectangle block, elliptical blocks or other arbitrary shapes Graph block.
As shown in figure 3, each label is displayed in a rectangular block.
During actual label for labelling, the color of the corresponding geometric figure block of each label can be according to bookmark name Difference and have respective color.By using the mode of the geometric figure block Layering manifestation of different colours, Ke Yi When multiple label for labelling, the label for labelling information of current labeled data is clearly showed that, and will not make to mark page of text In a jumble.
In a kind of exemplary embodiment, when the data chosen to current mouse carry out overlapping mark, the mark mould Block 402 is also used to: increasing the line space of labeled data display label side of the row, for showing the mark by layer Label.
In front end is shown, other than the label for needing Layering manifestation to mark, it is also necessary to the mark marked according to current text The tuple of label dynamically adjusts line space, space is saved, to reach best mark bandwagon effect.
In data annotation process, overlapping mark is a kind of special and common situations of label for labelling.It is provided by the present application Data mask method and device, computer readable storage medium, by being layered according to mark sequence in the side of labeled data The label of display overlapping mark, realizes the overlapping mark of data, and has preferable bandwagon effect.
It will appreciated by the skilled person that whole or certain steps, system, dress in method disclosed hereinabove Functional module/unit in setting may be implemented as software, firmware, hardware and its combination appropriate.In hardware embodiment, Division between the functional module/unit referred in the above description not necessarily corresponds to the division of physical assemblies;For example, one Physical assemblies can have multiple functions or a function or step and can be executed by several physical assemblies cooperations.Certain groups Part or all components may be implemented as by processor, such as the software that digital signal processor or microprocessor execute, or by It is embodied as hardware, or is implemented as integrated circuit, such as specific integrated circuit.Such software can be distributed in computer-readable On medium, computer-readable medium may include computer storage medium (or non-transitory medium) and communication media (or temporarily Property medium).As known to a person of ordinary skill in the art, term computer storage medium is included in for storing information (such as Computer readable instructions, data structure, program module or other data) any method or technique in the volatibility implemented and non- Volatibility, removable and nonremovable medium.Computer storage medium include but is not limited to RAM, ROM, EEPROM, flash memory or its His memory technology, CD-ROM, digital versatile disc (DVD) or other optical disc storages, magnetic holder, tape, disk storage or other Magnetic memory apparatus or any other medium that can be used for storing desired information and can be accessed by a computer.This Outside, known to a person of ordinary skill in the art to be, communication media generally comprises computer readable instructions, data structure, program mould Other data in the modulated data signal of block or such as carrier wave or other transmission mechanisms etc, and may include any information Delivery media.

Claims (10)

1. a kind of data mask method characterized by comprising
Monitor and receive the mouse action of user;
Detect whether received mouse action is whether the data that predefined label for labelling operates and current mouse is chosen are Labeled data;
If the data that received mouse action is the operation of predefined label for labelling and current mouse is chosen are labeled data, Overlapping mark then is carried out to the data that current mouse is chosen, and marked according to mark sequence in the side Layering manifestation of the data Label.
2. the method according to claim 1, wherein described be layered according to mark sequence in the side of the data shows The label of indicating note, comprising:
Which weight label that the label currently marked is the data chosen to the current mouse detected;
If the label currently marked is the n-th heavy label of the data chosen to the current mouse, the mark that will currently mark Label are shown in the n-th layer position above the vertical direction for the data that the current mouse is chosen or the n-th layer below vertical direction Position, wherein n is the natural number greater than 1.
3. according to the method described in claim 2, it is characterized in that, the different labels uses when showing the label Different highlighted background colors are shown, and the length of the label is identical as the length of the data of the label for labelling.
4. according to the method described in claim 3, it is characterized in that, carrying out overlapping mark in the data chosen to current mouse When note, the method also includes: using the highlighted background color of the first weight label of the labeled data, it is highlighted The labeled data.
5. according to the method described in claim 2, it is characterized in that, each label is shown in when showing the label In one geometric figure block, the geometric figure block is polygon mat, round rectangle block or elliptical blocks.
6. the method according to claim 1, wherein carrying out overlapping mark in the data chosen to current mouse When note, the method also includes:
The line space for increasing the labeled data display label side of the row, for showing the mark by layer Label.
7. a kind of computer readable storage medium, which is characterized in that the computer-readable recording medium storage have one or Multiple programs, one or more of programs can be executed by one or more processor, to realize such as claim 1 to 6 Any one of described in data mask method the step of.
8. a kind of data annotation equipment, which is characterized in that including processor and memory, in which:
The processor is for executing the data marking program stored in memory, to realize such as any one of claims 1 to 6 The step of described data mask method.
9. a kind of data annotation equipment, which is characterized in that including detection module and labeling module, in which:
Detection module detects whether received mouse action is predefined mark for monitoring and receiving the mouse action of user Whether label labeling operation and the data chosen of current mouse are labeled data, when received mouse action is predefined label When the data that labeling operation and current mouse are chosen are unlabeled data, the first notice is sent to labeling module;When received mouse Mark operation be the operation of predefined label for labelling and the data chosen of current mouse be labeled data when, send second and notify extremely Labeling module;
Labeling module carries out the first heavy label to the data that current mouse is chosen for receiving the first notice of detection module Mark, in the label of the side of data display mark;Receive the second notice of detection module, the number chosen to current mouse According to the label for carrying out overlapping mark, and being marked according to mark sequence in the side Layering manifestation of the data.
10. device according to claim 9, which is characterized in that the labeling module according to mark sequence in the data Side Layering manifestation mark label, comprising:
Which weight label that the label currently marked is the data chosen to the current mouse detected;
If the label currently marked is the n-th heavy label of the data chosen to the current mouse, the mark that will currently mark Label are shown in the n-th layer position above the vertical direction for the data that the current mouse is chosen or the n-th layer below vertical direction Position, wherein n is the natural number greater than 1.
CN201910678287.6A 2019-07-25 2019-07-25 A kind of data mask method and device, computer readable storage medium Pending CN110471597A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910678287.6A CN110471597A (en) 2019-07-25 2019-07-25 A kind of data mask method and device, computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910678287.6A CN110471597A (en) 2019-07-25 2019-07-25 A kind of data mask method and device, computer readable storage medium

Publications (1)

Publication Number Publication Date
CN110471597A true CN110471597A (en) 2019-11-19

Family

ID=68508273

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910678287.6A Pending CN110471597A (en) 2019-07-25 2019-07-25 A kind of data mask method and device, computer readable storage medium

Country Status (1)

Country Link
CN (1) CN110471597A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111046190A (en) * 2019-11-28 2020-04-21 佰聆数据股份有限公司 Semantic graph-based big data label conflict detection method and system, storage medium and computer equipment
CN111460765A (en) * 2020-03-30 2020-07-28 掌阅科技股份有限公司 Electronic book labeling processing method, electronic equipment and storage medium

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070255551A1 (en) * 2006-04-26 2007-11-01 Edward Ma Language reinforcement system
CN101375278A (en) * 2006-01-26 2009-02-25 微软公司 Strategies for processing annotations
US20130155115A1 (en) * 2011-12-16 2013-06-20 National Chiao Tung University Method for visualizing a complicated metro map in a limited displaying area
CN106649288A (en) * 2016-12-12 2017-05-10 北京百度网讯科技有限公司 Translation method and device based on artificial intelligence
CN107341171A (en) * 2017-05-03 2017-11-10 刘洪利 Extract the method and system of data (gene) feature templates method and application template
CN107885737A (en) * 2017-12-27 2018-04-06 传神语联网网络科技股份有限公司 A kind of human-computer interaction interpretation method and system
CN109062890A (en) * 2018-06-27 2018-12-21 北京明略软件系统有限公司 A kind of label switching method and apparatus, computer readable storage medium
CN109255128A (en) * 2018-10-11 2019-01-22 北京小米移动软件有限公司 Generation method, device and the storage medium of multi-layer label

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101375278A (en) * 2006-01-26 2009-02-25 微软公司 Strategies for processing annotations
US20070255551A1 (en) * 2006-04-26 2007-11-01 Edward Ma Language reinforcement system
US20130155115A1 (en) * 2011-12-16 2013-06-20 National Chiao Tung University Method for visualizing a complicated metro map in a limited displaying area
CN106649288A (en) * 2016-12-12 2017-05-10 北京百度网讯科技有限公司 Translation method and device based on artificial intelligence
CN107341171A (en) * 2017-05-03 2017-11-10 刘洪利 Extract the method and system of data (gene) feature templates method and application template
CN107885737A (en) * 2017-12-27 2018-04-06 传神语联网网络科技股份有限公司 A kind of human-computer interaction interpretation method and system
CN109062890A (en) * 2018-06-27 2018-12-21 北京明略软件系统有限公司 A kind of label switching method and apparatus, computer readable storage medium
CN109255128A (en) * 2018-10-11 2019-01-22 北京小米移动软件有限公司 Generation method, device and the storage medium of multi-layer label

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111046190A (en) * 2019-11-28 2020-04-21 佰聆数据股份有限公司 Semantic graph-based big data label conflict detection method and system, storage medium and computer equipment
CN111046190B (en) * 2019-11-28 2021-03-26 佰聆数据股份有限公司 Semantic graph-based big data label conflict detection method and system, storage medium and computer equipment
CN111460765A (en) * 2020-03-30 2020-07-28 掌阅科技股份有限公司 Electronic book labeling processing method, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
Leydesdorff et al. Mapping the geography of science: Distribution patterns and networks of relations among cities and institutes
EP2570974B1 (en) Automatic crowd sourcing for machine learning in information extraction
CN107392143A (en) A kind of resume accurate Analysis method based on SVM text classifications
CN106874256A (en) Name the method and device of entity in identification field
CN107729309A (en) A kind of method and device of the Chinese semantic analysis based on deep learning
CN104808903B (en) Text selection method and device
US20070277101A1 (en) System and method for dynamic organization of information sets
CN108664239A (en) A kind of across technology stack web front-end development system and method based on micro services
CN105094775B (en) Webpage generation method and device
CN109325233A (en) Global semantic understanding method, apparatus, computer equipment and storage medium
CN108399072A (en) Five application page update method and device
CN110471597A (en) A kind of data mask method and device, computer readable storage medium
CN106462933B (en) User is connected socially using content structure
JP2021103552A (en) Method for labelling structured document information, device for labelling structured document information, electronic apparatus, computer readable storage medium, and computer program
CN115408399A (en) Blood relationship analysis method, device, equipment and storage medium based on SQL script
CN109299074A (en) A kind of data verification method and system based on templating data base view
CN105740355B (en) Webpage context extraction method and device based on aggregation text density
CN109933803A (en) A kind of Chinese idiom information displaying method shows device, electronic equipment and storage medium
CN111858905A (en) Model training method, information identification method, device, electronic equipment and storage medium
US10261987B1 (en) Pre-processing E-book in scanned format
CN116401407A (en) Node attribute configuration method, device, equipment and storage medium of mind map
CN107515866A (en) A kind of data manipulation method, device and system
CN115422066A (en) Test case management method and device
US10963491B2 (en) Structures maintenance mapper
CN112559718B (en) Method, device, electronic equipment and storage medium for dialogue processing

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20191119