CN110286895A - A kind of adding method of data sample, device and terminal device, can storage medium - Google Patents

A kind of adding method of data sample, device and terminal device, can storage medium Download PDF

Info

Publication number
CN110286895A
CN110286895A CN201910397823.5A CN201910397823A CN110286895A CN 110286895 A CN110286895 A CN 110286895A CN 201910397823 A CN201910397823 A CN 201910397823A CN 110286895 A CN110286895 A CN 110286895A
Authority
CN
China
Prior art keywords
data sample
format
template
information
identifier
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910397823.5A
Other languages
Chinese (zh)
Inventor
曾稷
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Rajax Network Technology Co Ltd
Lazhasi Network Technology Shanghai Co Ltd
Original Assignee
Lazhasi Network Technology Shanghai Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lazhasi Network Technology Shanghai Co Ltd filed Critical Lazhasi Network Technology Shanghai Co Ltd
Priority to CN201910397823.5A priority Critical patent/CN110286895A/en
Publication of CN110286895A publication Critical patent/CN110286895A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/30Creation or generation of source code
    • G06F8/33Intelligent editors

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Document Processing Apparatus (AREA)

Abstract

The present embodiments relate to machine learning fields, disclose adding method, determining device and the electronic equipment of a kind of data sample.The adding method of data sample in the present invention is applied to terminal device, comprising: obtains user and inputs information;Whether the format that detection user inputs information meets preset data sample template;If meeting, data sample is extracted from the input information, and be added to scheduled database, it is possible to reduce character inputs when data sample typing reduce human cost.

Description

A kind of adding method of data sample, device and terminal device, can storage medium
Technical field
The present invention relates to machine learning field, in particular to a kind of adding method of data sample, device and terminal device, It can storage medium.
Background technique
Machine learning is the science of an artificial intelligence, and the main study subject in the field is artificial intelligence, especially such as Where the performance of specific algorithm is improved in empirical learning.During machine learning, need to provide great amount of samples for computer, Available data sample generally passes through programming language typing by assistant director.
The data sample amount as needed for machine learning is very huge, so when assistant director inputs, it may be desirable to be manually entered A large amount of code characters, it is time-consuming and laborious, error-prone.
Summary of the invention
The adding method for being designed to provide a kind of data sample, device and the terminal device of embodiment of the present invention, can Storage medium, it is possible to reduce character inputs when data sample typing reduce human cost.
In order to solve the above technical problems, embodiments of the present invention provide a kind of adding method of data sample, application In terminal device, comprising: obtain user and input information;Whether the format that detection user inputs information meets preset data sample The format of template;If meeting, data sample is extracted from the input information, and be added to scheduled database.
Embodiments of the present invention additionally provide a kind of adding set of data sample, comprising: module are obtained, for obtaining User inputs information;Whether detection module, the format that information is inputted for detecting user meet preset data sample template Format;Processing module then extracts data sample from the input information for meeting in response to the format for inputting information;Add Add module, for sample data obtained to be added to scheduled database.
Embodiments of the present invention additionally provide a kind of electronic equipment, including memory and processor, memory storage meter Calculation machine program, processor execute when running program: obtaining user and input information;Whether the format that detection user inputs information meets The format of preset data sample template;If meeting, data sample is extracted from the input information, and be added to scheduled Database.
Embodiments of the present invention additionally provide a kind of non-volatile memory medium, for storing computer-readable program, The computer-readable program is used to execute the adding method of data sample as described above for computer.
In terms of existing technologies, the main distinction and its effect are embodiment of the present invention: utilizing prefab-form, make Directly to input information identify, meet template style think user input be data sample, directly carry out sample This addition, it is incongruent to think that user inputs other content, without addition, the life of program staff's input is effectively reduced in this way Character quantity is enabled, and since in the adding procedure of data sample, sample size is often quite big, so even if each data sample The character amount of reduction is few, and for the amount of work of data sample addition, the character total amount of reduction is larger, effectively reduces The workload of program staff reduces human cost.
As a further improvement, user inputs information in the following manner: input includes the document of input information;Alternatively, It is inputted by scheduled interactive interface.The mode that user inputs information is specified, unnecessary identification is avoided.
As a further improvement, the template include: the aspect indexing of data sample position and corresponding characteristic value Position, the separator between adjacent aspect indexing, the identifier of aspect indexing specify the necessary character in the input format of sample.
As a further improvement, the template further include: the knowledge of separator, first data sample between adjacent data sample It Fu not be with the identifier of end data sample.Specify the separation between the identifier of first and last data sample and adjacent data sample User's multiple data samples of typing simultaneously may be implemented in symbol, and guarantee the identification accuracy of computer.
Detailed description of the invention
Fig. 1 is the adding method flow chart of the data sample in first embodiment according to the present invention;
Fig. 2 is the adding method flow chart of the data sample in second embodiment according to the present invention;
Fig. 3 is the adding set schematic diagram of the data sample in third embodiment according to the present invention;
Fig. 4 is the terminal device structural schematic diagram that the 4th embodiment provides according to the present invention.
Specific embodiment
In order to make the object, technical scheme and advantages of the embodiment of the invention clearer, below in conjunction with attached drawing to the present invention Each embodiment be explained in detail.However, it will be understood by those skilled in the art that in each embodiment party of the present invention In formula, in order to make the reader understand this application better, many technical details are proposed.But even if without these technical details And various changes and modifications based on the following respective embodiments, the application technical solution claimed also may be implemented.With Under the division of each embodiment be for convenience, any restriction should not to be constituted to specific implementation of the invention, it is each Embodiment can be combined with each other mutual reference under the premise of reconcilable.
The first embodiment of the present invention is related to a kind of adding methods of data sample, are applied to a kind of terminal device, can To be PC PC, server or tablet computer etc., as shown in Figure 1, this method comprises:
Step 101, the template of preset data sample.
Specifically, the template in present embodiment include: the aspect indexing of data sample position and corresponding feature The position of value, the separator between adjacent aspect indexing, the identifier of aspect indexing.More specifically, the feature rope of data sample The position of the position and corresponding characteristic value drawn can be aspect indexing preceding, and for characteristic value rear, the identifier of aspect indexing can To be " () ", aspect indexing and characteristic value are separated with " -> " in " () ", between two neighboring aspect indexing with half-angle comma every It opens.
For example, the format of a data sample can be such that in template
(aspect indexing -> characteristic value), (aspect indexing -> characteristic value), (aspect indexing -> characteristic value), (aspect indexing -> Characteristic value)
Each data sample can have multiple aspect indexings, and the title and quantity of aspect indexing can also be preset, herein It repeats no more.
It should be noted that this step is default step, it can first carry out once, not be carried out every time in advance.
Step 102, it obtains user and inputs information.
Specifically, user can input letter by way of inputting the document including input information in present embodiment It ceasing, in practical application, user completely inputs the data sample for needing typing, is made as document, when detecting, the document is called, It parses input information therein and detection is compared.
In addition, user can also pass through scheduled interactive interface (such as " sample typing " boundary of certain application in practical application Face) input information, it is default whether the input information that specifically detection user passes through predetermined interactive interface input when detecting meets Template.Two ways can select an application, can also be with connected applications, and details are not described herein.
Step 103, whether the format that detection user inputs information meets the format of preset data sample template;If symbol It closes, thens follow the steps 104;If not meeting, 106 are thened follow the steps.
Specifically, the content specifically compared may include: the characteristic symbol that user inputs in information, such as aspect indexing Identifier and separator etc.;It can also include: the position etc. of aspect indexing and characteristic value.If being preset with the name of aspect indexing Claim, whether the title that can also compare aspect indexing belongs to preset title.In practical application, entry, which compares to combine, to be made With so that accuracy is higher.
Step 104, data sample is extracted from input information.
Specifically, when the input information that step 103 testing result is user's typing meets default template, illustrate user Typing is data sample, can be according to input information acquisition data sample.
Step 105, it is added to scheduled database.
Specifically, scheduled database is sample database, can be used for the analytic process of subsequent machine learning.
Step 106, error information is issued.
Specifically, when the input information that step 103 testing result is user's typing does not meet default template, illustrate to use It is not data sample that family, which inputs information, or may be error when user inputs, and can issue error information at this time and remind user, Data sample is not accurately identified.User can check input information after seeing error information, or re-type.
It is noted that can also be not provided with the step of issuing error information in practical application, details are not described herein.
For example, present embodiment can be applied to during giving a mark for dining room, typing dining room to be given a mark, specifically Each parameter for needing these dining rooms of typing, such as position, the style of cooking, category, average price, day order numbers, pre-capita consumption volume, day combined turnover Deng.Accordingly, a data sample is regarded in each dining room as, and each data sample includes such as position, the style of cooking, category, average price, day The aspect indexing of order numbers, pre-capita consumption volume, day combined turnover etc., these parameter values reported according to dining room, determines corresponding Characteristic value, to be added into database respectively as data sample.
In terms of existing technologies, the main distinction and its effect are present embodiment: prefab-form are utilized, so that directly Connect to input information identify, meet template think user input be data sample, directly carry out sample addition, no What is met thinks that user inputs other content, without addition, effectively reduces the command character number of program staff's input in this way Amount, and since in the adding procedure of data sample, sample size is often quite big, so even if the word of each data sample reduction Symbol amount is few, and for the amount of work of data sample addition, the character total amount of reduction is larger, effectively reduces program staff Workload, reduce human cost.
Second embodiment of the present invention is related to a kind of adding method of data sample, and present embodiment is implemented first It is further improved on the basis of mode, mainly thes improvement is that:, can be with the multiple numbers of batch input in second embodiment According to sample, efficiency of inputting is improved.
The process of the adding method of data sample is as shown in Fig. 2, specific as follows in present embodiment:
Step 101 of the step 201 into step 202 and first embodiment is to 102 similar in present embodiment, herein It repeats no more.
Step 203, whether the format that detection user inputs information meets the format of preset data sample template;If symbol It closes, thens follow the steps 204;If not meeting, 206 are thened follow the steps.
Specifically, the template compared with first embodiment, in present embodiment further include: between adjacent data sample The identifier of separator, the identifier of first data sample and end data sample.
It can be found that present embodiment is when detecting, not only detect whether a certain data sample meets template, also needs to detect Whether all data samples of data have the identifier of first data sample and the identifier of end data sample, so that it is determined that It informs against data sample, end data sample, while each data sample between determining.
Since the data sample amount in practical application is quite huge, sometimes, possibly the number for needing typing can not be determined According to the actual quantity of sample, if distinguishing typing detection one by one, more manpower will be consumed, present embodiment may be implemented to count According to the batch input of sample, it is further reduced manpower input quantity, reduces human cost.
It also needing to further illustrate, the corresponding input information of all data samples all meets default template in present embodiment, Just it is determined as meeting default template, does not meet default template at one once wherein having, be determined as not meeting, reports an error to send Information.
In this way, present embodiment can make detection determine whether that all data samples of typing are correctly validated, keep away Exempt from only to identify part, and can not determine it is specific which by success typing, which error, obtained convenient for user's complete and accurate compared with The data sample of big quantity.
In present embodiment step 203 to 206 with the step 103 in present embodiment to 106 similar, it is no longer superfluous herein It states.
As it can be seen that the separator between the identifier and adjacent data sample of the clear first and last data sample of present embodiment can To realize user's multiple data samples of typing simultaneously, and guarantee the identification accuracy of computer.
Third embodiment of the present invention is related to a kind of adding set of data sample.As shown in figure 3, specifically including:
Module is obtained, inputs information for obtaining user;
Whether detection module, the format that information is inputted for detecting user meet the format of preset data sample template.
Processing module then extracts data sample from input information for meeting in response to the format for inputting information.
Adding module, for sample data obtained to be added to scheduled database.
In one example, family inputs information in the following manner: input includes the document of input information;Alternatively, passing through Scheduled interactive interface input.
In one example, further includes: template include: the aspect indexing of data sample position and corresponding characteristic value Position, the separator between adjacent aspect indexing, the identifier of aspect indexing.
In one example, template further include: the identifier of separator, first data sample between adjacent data sample and The identifier of last data sample.
In terms of existing technologies, the main distinction and its effect are present embodiment: prefab-form are utilized, so that directly The input information to input is connect to identify, meet template think user input be data sample, directly progress sample Addition, it is incongruent to think that user inputs other content, without addition, the order of program staff's input is effectively reduced in this way Character quantity, and since in the adding procedure of data sample, sample size is often quite big, so even if each data sample subtracts Few character amount is few, and for the amount of work of data sample addition, the character total amount of reduction is larger, effectively reduces journey The workload of sequence personnel reduces human cost.
It is not difficult to find that present embodiment is Installation practice corresponding with first embodiment, present embodiment can be with First embodiment is worked in coordination implementation.The relevant technical details mentioned in first embodiment still have in the present embodiment Effect, in order to reduce repetition, which is not described herein again.Correspondingly, the relevant technical details mentioned in present embodiment are also applicable in In first embodiment.
It is noted that each module involved in present embodiment is logic module, and in practical applications, one A logic unit can be a physical unit, be also possible to a part of a physical unit, can also be with multiple physics lists The combination of member is realized.In addition, in order to protrude innovative part of the invention, it will not be with solution institute of the present invention in present embodiment The technical issues of proposition, the less close unit of relationship introduced, but this does not indicate that there is no other single in present embodiment Member.
4th embodiment of the invention is related to a kind of terminal device, as shown in figure 4, the electronic equipment includes: at least one A processor 401;And the memory 402 with the communication connection of at least one processor 401;And with scanning means communication link The communication component 403 connect, communication component 403 send and receive data under the control of processor 401;Wherein, memory 402 is deposited The instruction that can be executed by least one processor 401 is contained, instruction is executed by least one processor 401 to realize:
It obtains user and inputs information;
Whether the format that detection user inputs information meets the format of preset data sample template;
If meeting, data sample is extracted from input information, and be added to scheduled database.
Specifically, which includes: one or more processors 401 and memory 402, with a processor For 401.Processor 401, memory 402 can be connected by bus or other modes, for being connected by bus.It deposits Reservoir 402 is used as a kind of non-volatile computer readable storage medium storing program for executing, can be used for storing non-volatile software program, non-volatile Computer executable program and module.The non-volatile software journey that processor 401 is stored in memory 402 by operation Sequence, instruction and module realize adding for above-mentioned data sample thereby executing the various function application and data processing of equipment Adding method.
Memory 402 may include storing program area and storage data area, wherein storing program area can store operation system Application program required for system, at least one function;It storage data area can the Save option list etc..In addition, memory 402 can be with It can also include nonvolatile memory, for example, at least disk memory, a flash memory including high-speed random access memory Device or other non-volatile solid state memory parts.In some embodiments, it includes relative to processing that memory 402 is optional The remotely located memory 402 of device 401, these remote memories 402 can pass through network connection to external equipment.Above-mentioned network Example include but is not limited to internet, intranet, local area network, mobile radio communication and combinations thereof.
One or more module is stored in memory 402, when being executed by one or more processor 401, is held The adding method of data sample in the above-mentioned any means embodiment of row.
The said goods can be performed the application embodiment provided by method, have the corresponding functional module of execution method and Beneficial effect, the not technical detail of detailed description in the present embodiment, reference can be made to method provided by the application embodiment.
5th embodiment of the invention is related to a kind of non-volatile memory medium, for storing computer-readable program, Computer-readable program is used to execute above-mentioned all or part of embodiment of the method for computer.
That is, it will be understood by those skilled in the art that implement the method for the above embodiments be can be with Relevant hardware is instructed to complete by program, which is stored in a storage medium, including some instructions are to make It obtains an equipment (can be single-chip microcontroller, chip etc.) or processor (processor) executes each embodiment method of the application All or part of the steps.And storage medium above-mentioned includes: USB flash disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic or disk etc. are various can store journey The medium of sequence code.
It will be understood by those skilled in the art that the respective embodiments described above are to realize specific embodiments of the present invention, And in practical applications, can to it, various changes can be made in the form and details, without departing from the spirit and scope of the present invention.

Claims (10)

1. a kind of adding method of data sample, which is characterized in that be applied to terminal device, comprising:
It obtains user and inputs information;
Whether the format that detection user inputs information meets the format of preset data sample template;
If meeting, data sample is extracted from input information, and be added to scheduled database.
2. the adding method of data sample according to claim 1, which is characterized in that user inputs letter in the following manner Breath:
Input includes the document of input information;Alternatively,
It is inputted by scheduled interactive interface.
3. the adding method of data sample according to claim 1, which is characterized in that the template includes: data sample Aspect indexing position and corresponding characteristic value position, the separator between adjacent aspect indexing, the identifier of aspect indexing.
4. the adding method of data sample according to claim 3, which is characterized in that the template further include:
The identifier of separator, first data sample between adjacent data sample and the identifier of end data sample.
5. a kind of adding set of data sample characterized by comprising
Module is obtained, inputs information for obtaining user;
Whether detection module, the format that information is inputted for detecting user meet the format of preset data sample template;
Processing module then extracts data sample from the input information for meeting in response to the format for inputting information;
Adding module, for sample data obtained to be added to scheduled database.
6. the adding set of data sample according to claim 5, which is characterized in that user inputs letter in the following manner Breath:
Input includes the document of input information;Alternatively,
It is inputted by scheduled interactive interface.
7. the adding set of data sample according to claim 5, which is characterized in that the template includes: data sample Aspect indexing position and corresponding characteristic value position, the separator between adjacent aspect indexing, the identifier of aspect indexing.
8. the adding set of data sample according to claim 7, which is characterized in that the template further include:
The identifier of separator, first data sample between adjacent data sample and the identifier of end data sample.
9. a kind of terminal device characterized by comprising
At least one processor;And
The memory being connect at least one described processor communication;Wherein,
The memory is stored with the instruction that can be executed by least one described processor, and described instruction is by described at least one Device is managed to execute, so that at least one described processor is able to carry out:
It obtains user and inputs information;
Whether the format that detection user inputs information meets the format of preset data sample template;
If meeting, data sample is extracted from the input information, and be added to scheduled database.
10. a kind of computer readable storage medium, is stored with computer program, which is characterized in that the computer program is located Manage the adding method that any data sample in Claims 1-4 is realized when device executes.
CN201910397823.5A 2019-05-14 2019-05-14 A kind of adding method of data sample, device and terminal device, can storage medium Pending CN110286895A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910397823.5A CN110286895A (en) 2019-05-14 2019-05-14 A kind of adding method of data sample, device and terminal device, can storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910397823.5A CN110286895A (en) 2019-05-14 2019-05-14 A kind of adding method of data sample, device and terminal device, can storage medium

Publications (1)

Publication Number Publication Date
CN110286895A true CN110286895A (en) 2019-09-27

Family

ID=68001996

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910397823.5A Pending CN110286895A (en) 2019-05-14 2019-05-14 A kind of adding method of data sample, device and terminal device, can storage medium

Country Status (1)

Country Link
CN (1) CN110286895A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040162831A1 (en) * 2003-02-06 2004-08-19 Patterson John Douglas Document handling system and method
CN105095240A (en) * 2014-05-04 2015-11-25 中国银联股份有限公司 Database data sample acquisition
CN109271871A (en) * 2018-08-22 2019-01-25 平安科技(深圳)有限公司 Sample store path generation method, device, computer equipment and storage medium
CN109727646A (en) * 2018-12-29 2019-05-07 北京优迅医学检验实验室有限公司 The processing method and processing device of cdna sample, mobile terminal

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040162831A1 (en) * 2003-02-06 2004-08-19 Patterson John Douglas Document handling system and method
CN105095240A (en) * 2014-05-04 2015-11-25 中国银联股份有限公司 Database data sample acquisition
CN109271871A (en) * 2018-08-22 2019-01-25 平安科技(深圳)有限公司 Sample store path generation method, device, computer equipment and storage medium
CN109727646A (en) * 2018-12-29 2019-05-07 北京优迅医学检验实验室有限公司 The processing method and processing device of cdna sample, mobile terminal

Similar Documents

Publication Publication Date Title
CN110610240B (en) Virtual automation assistance based on artificial intelligence
CN110348441B (en) Value-added tax invoice identification method and device, computer equipment and storage medium
CN111125343B (en) Text analysis method and device suitable for person post matching recommendation system
CN111343161B (en) Abnormal information processing node analysis method, abnormal information processing node analysis device, abnormal information processing node analysis medium and electronic equipment
CN109492222A (en) Intension recognizing method, device and computer equipment based on conceptional tree
AU2019204444B2 (en) System and method for enrichment of ocr-extracted data
CN109669933B (en) Transaction data intelligent processing method and device and computer readable storage medium
CN109918437A (en) Distributed data processing method, apparatus and data assets management system
CN109783611A (en) A kind of method, apparatus of fields match, computer storage medium and terminal
CN107391383B (en) Test method and test system for application program interface
CN112541332B (en) Form information extraction method and device, electronic equipment and storage medium
CN109189888A (en) The method and storage medium that electronic device, infringement are analyzed
CN109298882A (en) Management method, computer readable storage medium and the terminal device of interface
CN112328489A (en) Test case generation method and device, terminal equipment and storage medium
CN113989549A (en) Semi-supervised learning image classification optimization method and system based on pseudo labels
CN112506757A (en) Automatic test method, system, computer device and medium thereof
CN109308564A (en) The recognition methods of crowd's performance ratings, device, storage medium and computer equipment
CN106649210A (en) Data conversion method and device
CN110532359A (en) Legal provision query method, apparatus, computer equipment and storage medium
CN113792146A (en) Text classification method and device based on artificial intelligence, electronic equipment and medium
CN110532448B (en) Document classification method, device, equipment and storage medium based on neural network
CN110286895A (en) A kind of adding method of data sample, device and terminal device, can storage medium
CN111078870A (en) Evaluation data processing method, evaluation data processing device, evaluation data processing medium, and computer device
CN109284392A (en) Text classification method, device, terminal and storage medium
CN115690821A (en) Intelligent electronic file cataloging method and computer equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190927