CN110532273A - The processing method and processing device of tables of data, storage medium, electronic device - Google Patents
The processing method and processing device of tables of data, storage medium, electronic device Download PDFInfo
- Publication number
- CN110532273A CN110532273A CN201910818833.1A CN201910818833A CN110532273A CN 110532273 A CN110532273 A CN 110532273A CN 201910818833 A CN201910818833 A CN 201910818833A CN 110532273 A CN110532273 A CN 110532273A
- Authority
- CN
- China
- Prior art keywords
- data
- tables
- field
- data table
- pending
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
- G06F16/2282—Tablespace storage structures; Management thereof
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2458—Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
- G06F16/2462—Approximate or statistical queries
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Probability & Statistics with Applications (AREA)
- Mathematical Physics (AREA)
- Fuzzy Systems (AREA)
- Computational Linguistics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The present invention provides a kind of processing method and processing device of tables of data, storage medium, electronic devices, this method comprises: selecting M tables of data for being greater than the first preset threshold with the data item similarity in pending data table from determining N number of tables of data;Each of pending data table field is compared with each of M tables of data field respectively, obtains comparison result;In the case that the matching degree of each field is less than the second preset threshold in each of pending data table field and M tables of data, field less than the second preset threshold is mapped in target matrix, with the mapping relations established between target matrix and pending data table.Through the invention, it solves the problems, such as not high to the treatment effeciency of data table present in the relevant technologies, achievees the effect that efficient process tables of data.
Description
Technical field
The present invention relates to data processing fields, are situated between in particular to a kind of processing method and processing device of tables of data, storage
Matter, electronic device.
Background technique
Current big data application flourishes, but the presentation of every profession and trade data is many kinds of, data volume is big, structure is complicated, builds
The features such as being marked with quasi- disunity, data code conflict.And according to the requirement of Ministry of Public Security's data item Standard compilation, it is desirable that the number after improvement
It is there is certain meaning, according to ad hoc rules name and mark, the information data unit for describing policing Work according to item, is
This combines this standard to establish java standard library in governance process.During data are administered, a large amount of multi-source is had, it is different
How based on these data the tables of data of structure is accessed, the field of table, carrys out the data item in efficient criterion of identification library, to nowadays
Big data administer and data sharing for be vital.Present data item identification is more based on artificial means intervention
And configuration, it is unable to reach the data item for quickly and effectively carrying out criterion of identification library.
In view of the above technical problems, it not yet puts forward effective solutions in the related technology.
Summary of the invention
The embodiment of the invention provides a kind of processing method and processing device of tables of data, storage medium, electronic devices, at least
It solves the problems, such as low to the treatment effeciency of tables of data in the related technology.
According to one embodiment of present invention, a kind of processing method of tables of data is provided, comprising: from determining N number of number
According to the M tables of data selected in table with the data item similarity in pending data table greater than the first preset threshold, wherein on
It is natural number that M, which is stated, less than or equal to above-mentioned N, above-mentioned M and above-mentioned N;By each of above-mentioned pending data table field point
It is not compared with each of above-mentioned M tables of data field, obtains comparison result, wherein include in above-mentioned comparison result
The matching degree of each field in each of above-mentioned pending data table field and above-mentioned M tables of data;Above-mentioned wait locate
Manage feelings of the matching degree less than the second preset threshold of each field in each of tables of data field and above-mentioned M tables of data
Under condition, by be less than above-mentioned second preset threshold field map in target matrix, with establish above-mentioned target matrix with it is upper
State the mapping relations between pending data table.
Optionally, each field in each of above-mentioned pending data table field and above-mentioned M tables of data
In the case that matching degree is less than the second preset threshold, the field for being less than above-mentioned second preset threshold is mapped into target matrix
In, with the mapping relations established between above-mentioned target matrix and above-mentioned pending data table, comprising: in above-mentioned pending data
In the case that the matching degree of each field is less than the second preset threshold in each of table field and above-mentioned M tables of data,
It will be less than in the corresponding data item of field and the above-mentioned target matrix of the equal mapping value of attribute information of above-mentioned second preset threshold, with
Establish the mapping relations between above-mentioned target matrix and above-mentioned pending data table.
Optionally, it is selected from determining N number of tables of data and is greater than the with the data item similarity in pending data table
M tables of data of one preset threshold, comprising: by the field of the first object language in above-mentioned pending data table with it is above-mentioned N number of
The field of first object language in tables of data is matched, and the first similarity is obtained;Above-mentioned first similarity is greater than above-mentioned
The tables of data of first preset threshold is determined as the tables of data in above-mentioned M tables of data.
Optionally, by each of above-mentioned pending data table field respectively each of with above-mentioned M tables of data
Field is compared, after obtaining comparison result, the above method further include: field in above-mentioned pending data table with it is above-mentioned
It, will be upper in the case that the fields match degree in a tables of data in M tables of data is greater than or equal to above-mentioned second preset threshold
It states in the tables of data that a field maps in above-mentioned M tables of data, to establish above-mentioned pending data table and above-mentioned one
The mapping relations of a tables of data.
Optionally, each field in each of above-mentioned pending data table field and above-mentioned M tables of data
In the case that matching degree is less than the second preset threshold, the field for being less than above-mentioned second preset threshold is mapped into target matrix
In, with after the mapping relations established between above-mentioned target matrix and above-mentioned pending data table, the above method further include: will
Mapping relations between above-mentioned target matrix and above-mentioned target matrix and above-mentioned pending data table are stored to target data
In library, wherein above-mentioned target database is also used to store above-mentioned M tables of data.
According to another embodiment of the invention, a kind of processing unit of tables of data is provided, comprising: first choice mould
Block is greater than the first default threshold with the data item similarity in pending data table for selecting from determining N number of tables of data
M tables of data of value, wherein it is natural number that above-mentioned M, which is less than or equal to above-mentioned N, above-mentioned M and above-mentioned N,;First determining module,
For each of above-mentioned pending data table field to be compared with each of above-mentioned M tables of data field respectively
Compared with obtaining comparison result, wherein include each of above-mentioned pending data table field and above-mentioned M in above-mentioned comparison result
The matching degree of each field in a tables of data;Second determining module, in each of above-mentioned pending data table word
It, will be less than above-mentioned second in the case that section and the matching degree of each field in above-mentioned M tables of data are less than the second preset threshold
The field of preset threshold maps in target matrix, to establish between above-mentioned target matrix and above-mentioned pending data table
Mapping relations.
According to still another embodiment of the invention, a kind of storage medium is additionally provided, meter is stored in the storage medium
Calculation machine program, wherein the computer program is arranged to execute the step in any of the above-described embodiment of the method when operation.
According to still another embodiment of the invention, a kind of electronic device, including memory and processor are additionally provided, it is described
Computer program is stored in memory, the processor is arranged to run the computer program to execute any of the above-described
Step in embodiment of the method.
Through the invention, similar to the data item in pending data table due to being selected from determining N number of tables of data
Degree is greater than M tables of data of the first preset threshold, wherein it is natural number that M, which is less than or equal to N, M and N,;By pending data table
Each of field be compared respectively with each of M tables of data field, obtain comparison result, wherein compare knot
It include the matching degree of each of pending data table field with each field in M tables of data in fruit;In number to be processed
According to the matching degree of each of table field and each field in M tables of data less than the second preset threshold in the case where, general
Field less than the second preset threshold maps in target matrix, to establish between target matrix and pending data table
Mapping relations.Tables of data is determined by the comparison of field, therefore, can solve present in the relevant technologies to the place of data table
Inefficient problem is managed, achievees the effect that efficient process tables of data.
Detailed description of the invention
The drawings described herein are used to provide a further understanding of the present invention, constitutes part of this application, this hair
Bright illustrative embodiments and their description are used to explain the present invention, and are not constituted improper limitations of the present invention.In the accompanying drawings:
Fig. 1 is a kind of hardware block diagram of the mobile terminal of the processing method of tables of data of the embodiment of the present invention;
Fig. 2 is the flow chart of the processing method of tables of data according to an embodiment of the present invention;
Fig. 3 is the flow chart of the alternative embodiment in the present embodiment;
Fig. 4 is the structural block diagram of the processing unit of tables of data according to an embodiment of the present invention.
Specific embodiment
Hereinafter, the present invention will be described in detail with reference to the accompanying drawings and in combination with Examples.It should be noted that not conflicting
In the case of, the features in the embodiments and the embodiments of the present application can be combined with each other.
It should be noted that description and claims of this specification and term " first " in above-mentioned attached drawing, "
Two " etc. be to be used to distinguish similar objects, without being used to describe a particular order or precedence order.
Embodiment of the method provided by the embodiment of the present application one can be in mobile terminal, terminal or similar fortune
It calculates and is executed in device.For running on mobile terminals, Fig. 1 is a kind of processing method of tables of data of the embodiment of the present invention
The hardware block diagram of mobile terminal.As shown in Figure 1, mobile terminal 10 may include that one or more (only shows one in Fig. 1
It is a) (processor 102 can include but is not limited to the processing of Micro-processor MCV or programmable logic device FPGA etc. to processor 102
Device) and memory 104 for storing data, optionally, above-mentioned mobile terminal can also include the biography for communication function
Transfer device 106 and input-output equipment 108.It will appreciated by the skilled person that structure shown in FIG. 1 is only to show
Meaning, does not cause to limit to the structure of above-mentioned mobile terminal.For example, mobile terminal 10 may also include it is more than shown in Fig. 1
Perhaps less component or with the configuration different from shown in Fig. 1.
Memory 104 can be used for storing computer program, for example, the software program and module of application software, such as this hair
The corresponding computer program of the processing method of tables of data in bright embodiment, processor 102 are stored in memory 104 by operation
Interior computer program realizes above-mentioned method thereby executing various function application and data processing.Memory 104 can
Including high speed random access memory, may also include nonvolatile memory, as one or more magnetic storage device, flash memory or
Other non-volatile solid state memories of person.In some instances, memory 104 can further comprise remote relative to processor 102
The memory of journey setting, these remote memories can pass through network connection to mobile terminal 10.The example of above-mentioned network includes
But be not limited to internet, intranet, local area network, mobile radio communication and combinations thereof.
Transmitting device 106 is used to that data to be received or sent via a network.Above-mentioned network specific example may include
The wireless network that the communication providers of mobile terminal 10 provide.In an example, transmitting device 106 includes a Network adaptation
Device (Network Interface Controller, referred to as NIC), can be connected by base station with other network equipments to
It can be communicated with internet.In an example, transmitting device 106 can for radio frequency (Radio Frequency, referred to as
RF) module is used to wirelessly be communicated with internet.
A kind of processing method of tables of data is provided in the present embodiment, and Fig. 2 is tables of data according to an embodiment of the present invention
Processing method flow chart, as shown in Fig. 2, the process includes the following steps:
Step S202 is selected from determining N number of tables of data and is greater than with the data item similarity in pending data table
M tables of data of the first preset threshold, wherein it is natural number that M, which is less than or equal to N, M and N,;
Step S204, by each of pending data table field respectively with each of M tables of data field into
Row compares, and obtains comparison result, wherein includes each of pending data table field and M tables of data in comparison result
In each field matching degree;
Step S206, in the matching degree of each of pending data table field and each field in M tables of data
In the case where less than the second preset threshold, the field less than the second preset threshold is mapped in target matrix, to establish mesh
Mark the mapping relations between tables of data and pending data table.
Through the above steps, due to being selected from determining N number of tables of data and the data item phase in pending data table
It is greater than M tables of data of the first preset threshold like degree, wherein it is natural number that M, which is less than or equal to N, M and N,;By pending data
Each of table field is compared with each of M tables of data field respectively, obtains comparison result, wherein is compared
It as a result include the matching degree of each of pending data table field with each field in M tables of data in;To be processed
In the case that the matching degree of each field is less than the second preset threshold in each of tables of data field and M tables of data,
Field less than the second preset threshold is mapped in target matrix, to establish between target matrix and pending data table
Mapping relations.Tables of data is determined by the comparison of field, therefore, can solve present in the relevant technologies to data table
The not high problem for the treatment of effeciency, achievees the effect that efficient process tables of data.
Optionally, the executing subject of above-mentioned steps can be terminal etc., but not limited to this.
Optionally, the present embodiment includes but is not limited in the scene for being applied to construct tables of data, for example, tables of data is added
In scene into java standard library.
Optionally, the data item in pending data table includes but is not limited to the field and data content in tables of data.Example
Such as, the field in pending data table is " citizen's name ", " gender ", " nationality ", " date of birth " etc..
Optionally, it is false to can be the normal data table in standard database for N number of tables of data, such as has 10 java standard library tables
B1-B10.It is selected from determining N number of tables of data and is greater than the first default threshold with the data item similarity in pending data table
M tables of data of value, for example, select similarity for 50% tables of data B2 (similarity 75%), B8 (similarity 60%).
Optionally, by each of pending data table field respectively each of with the M tables of data
Field is compared, and obtains comparison result, for example, the tables of data B2 of selection, the inside includes 3 data item: " name ", " property
Not ", " birthday ".Then it carries out specific field level and mark process is recommended for " the citizen's name " in pending data table
3 similarity recommending data items, " name " (similarity 80%), " gender " (similarity 0%), " birthday " (similarity 0%), choosing
" name " is selected as field to target mapping result.For the field " gender " in pending data table, 3 similarities are recommended
Recommending data item: " gender " (similarity 100%), " name " (similarity 0%), " birthday " (similarity 0%) select " gender "
As field to target mapping result.
Optionally, for the field " nationality " in pending data table, 3 similarity recommending data items: " property are recommended
Not " (similarity 0%), " name " (similarity 0%), " birthday " (similarity 0%), the data item of recommendation all mismatch, intelligence
" nationality " is added in java standard library table as field to target mapping result.Similarly, " date of birth " mapping result is " birth
Date ".I.e. all fields are mismatched with " nationality " in pending data table, thus generate target matrix B11.
In an alternative embodiment, in each of pending data table field and each in M tables of data
In the case that the matching degree of field is less than the second preset threshold, the field less than the second preset threshold is mapped into target matrix
In, with the mapping relations established between target matrix and pending data table, comprising:
S1, each of pending data table field and each field in M tables of data matching degree less than the
In the case where two preset thresholds, by the corresponding data item of field and the equal mapping value target of attribute information less than the second preset threshold
In tables of data, with the mapping relations established between target matrix and pending data table.
Optionally, in the present embodiment, when the data item of recommendation all mismatches, by the base of the data item in new data table
This information and attribute information are automatically added to, finally to all data item have been marked, generate one newly in target java standard library table
Java standard library table M, automatic synchronization is into java standard library.
In an alternative embodiment, it is selected from determining N number of tables of data and the data in pending data table
Item similarity is greater than M tables of data of the first preset threshold, comprising:
S1, by the first object language in the field of the first object language in pending data table and N number of tables of data
Field is matched, and the first similarity is obtained;
The tables of data that first similarity is greater than the first preset threshold is determined as the tables of data in M tables of data by S2.
Optionally, in the present embodiment, the first target langua0 can be Chinese, be also possible to English.By the Chinese name of field
It is matched with the field Chinese name in java standard library table, the similarity based on each field acquires the similarity between table, final system
It can recommend out higher than all java standard library tables of the similarity of selection, and in conjunction with the similarity of all java standard library tables with similarity
It is ranked up from high to low.
In an alternative embodiment, by each of pending data table field respectively and in M tables of data
Each field is compared, after obtaining comparison result, method further include:
S1 is greater than in the field in pending data table with the fields match degree in a tables of data in M tables of data
Or be equal to the second preset threshold in the case where, a field is mapped in M tables of data tables of data, with foundation to
Handle the mapping relations of tables of data and a tables of data.
Optionally, in the present embodiment, for similarity recommend java standard library table, may be selected one of java standard library table into
Row field level to mark, establish pending data table and the mapping relations with java standard library table: for each pending data table
Field, all 3 high data item of similarity can be selected for selection in the java standard library table.
In an alternative embodiment, in each of pending data table field and each in M tables of data
In the case that the matching degree of field is less than the second preset threshold, the field less than the second preset threshold is mapped into target matrix
In, with after the mapping relations established between target matrix and pending data table, method further include:
S1 stores the mapping relations between target matrix and target matrix and pending data table to target data
In library, wherein target database is also used to store M tables of data.
Optionally, in the present embodiment, it by the mapping relations between the field and data item in target matrix, is synchronized to
In java standard library.
In an alternative embodiment, by each of pending data table field respectively and in M tables of data
Each field is compared, before obtaining comparison result, method further include:
S1 determines the field for including in pending data table;
S2 selects the field of preset quantity to be compared with the field in pending data table from M tables of data.
Optionally, preset quantity can be 3 etc..Can be more accurate determine target matrix.
In an alternative embodiment, it is selected from determining N number of tables of data and the data in pending data table
Item similarity is greater than M tables of data of the first preset threshold, comprising:
S1 determines the data item for including in pending data table;
S2 selects the data item of preset quantity to be compared with the data item in pending data table from N number of tables of data;
S3, in the data item and in the matched situation of data item in pending data table of preset quantity, by present count
Tables of data where the data item of amount is determined as the tables of data in M tables of data.
Optionally, in the present embodiment, preset quantity can be 3 etc..Can be more accurate determine number of targets
According to table.
The present invention will be described combined with specific embodiments below:
Fig. 3 is the flow chart of the alternative embodiment in the present embodiment, as shown in Figure 3, comprising the following steps:
S301: new tables of data is accessed;
S302: selection similarity, waiting system provide all java standard library tables for being higher than this similarity in java standard library;
S303: wherein 1 java standard library table is selected, as field to target object table, system can be directed to new data table
Every 1 field selects 3 higher data item of similarity as with reference to selection from the table of target criteria library;
S304: successively for the data item of field and corresponding 3 recommendations in new tables of data, 1 number is selected
Establishing mapping relations according to item (can also abandon all data item recommended, directly select the field in new data table as number
According to item);
S305: it is saved in java standard library, and will newly count being synchronized in S304 to the new java standard library table generated after the completion of mark
It is saved in java standard library according to the mapping relations that table and new standard library table are established;
If there is new tables of data, S301-S304 is repeated, all new data tables is instructed to identify in java standard library
Data item.
Through the above description of the embodiments, those skilled in the art can be understood that according to above-mentioned implementation
The method of example can be realized by means of software and necessary general hardware platform, naturally it is also possible to by hardware, but it is very much
In the case of the former be more preferably embodiment.Based on this understanding, technical solution of the present invention is substantially in other words to existing
The part that technology contributes can be embodied in the form of software products, which is stored in a storage
In medium (such as ROM/RAM, magnetic disk, CD), including some instructions are used so that a terminal device (can be mobile phone, calculate
Machine, server or network equipment etc.) execute method described in each embodiment of the present invention.
A kind of processing unit of tables of data is additionally provided in the present embodiment, and the device is for realizing above-described embodiment and excellent
Embodiment is selected, the descriptions that have already been made will not be repeated.As used below, predetermined function may be implemented in term " module "
Software and/or hardware combination.Although device described in following embodiment is preferably realized with software, hardware,
Or the realization of the combination of software and hardware is also that may and be contemplated.
Fig. 4 is the structural block diagram of the processing unit of tables of data according to an embodiment of the present invention, as shown in figure 4, the device packet
It includes:
First choice module 42, for being selected and the data item in pending data table from determining N number of tables of data
Similarity is greater than M tables of data of the first preset threshold, wherein it is natural number that M, which is less than or equal to N, M and N,;
First determining module 44, for by each of pending data table field respectively with it is every in M tables of data
One field is compared, and obtains comparison result, wherein include each of pending data table field in comparison result with
The matching degree of each field in M tables of data;
Second determining module 46, in each of pending data table field and each word in M tables of data
In the case that the matching degree of section is less than the second preset threshold, the field less than the second preset threshold is mapped into target matrix
In, with the mapping relations established between target matrix and pending data table.
Optionally, above-mentioned second determining module, comprising:
First processing units, in each of pending data table field and each field in M tables of data
Matching degree less than the second preset threshold in the case where, by less than the second preset threshold the corresponding data item of field and attribute believe
It ceases in equal mapping value target matrix, with the mapping relations established between target matrix and pending data table.
Optionally, first choice module, comprising:
First determination unit, for will be in the field of the first object language in pending data table and N number of tables of data
The field of first object language is matched, and the first similarity is obtained;
Second determination unit, the tables of data for the first similarity to be greater than the first preset threshold are determined as M tables of data
In tables of data.
Optionally, above-mentioned apparatus further include:
Third determining module, for by each of pending data table field respectively with it is each in M tables of data
A field is compared, after obtaining comparison result, in the field in pending data table and a data in M tables of data
In the case that fields match degree in table is greater than or equal to the second preset threshold, a field is mapped in M tables of data
In one tables of data, to establish the mapping relations of pending data table and a tables of data.
Optionally, above-mentioned apparatus further include:
Memory module, in each of pending data table field and each field in M tables of data
In the case where spending less than the second preset threshold, the field less than the second preset threshold is mapped in target matrix, to build
After mapping relations between vertical target matrix and pending data table, by target matrix and target matrix with it is to be processed
Mapping relations between tables of data are stored into target database, wherein target database is also used to store M tables of data.
Optionally, above-mentioned apparatus further include:
4th determining module, for by each of pending data table field respectively with it is each in M tables of data
A field is compared, and before obtaining comparison result, determines the field for including in pending data table;
Comparison module, for from M tables of data select preset quantity field and pending data table in field into
Row compares.
Optionally, first choice module, comprising:
Third determination unit, the data item for determining to include in pending data table;
Comparing unit, for from N number of tables of data select preset quantity data item and pending data table in data
Item is compared;
4th determination unit, for the data item of preset quantity with the matched feelings of data item in pending data table
Under condition, the tables of data where the data item of preset quantity is determined as the tables of data in M tables of data.
It should be noted that above-mentioned modules can be realized by software or hardware, for the latter, Ke Yitong
Following manner realization is crossed, but not limited to this: above-mentioned module is respectively positioned in same processor;Alternatively, above-mentioned modules are with any
Combined form is located in different processors.
The embodiments of the present invention also provide a kind of storage medium, computer program is stored in the storage medium, wherein
The computer program is arranged to execute the step in any of the above-described embodiment of the method when operation.
Optionally, in the present embodiment, above-mentioned storage medium can be set to store by executing based on following steps
Calculation machine program:
S1 is selected from determining N number of tables of data and is greater than first in advance with the data item similarity in pending data table
If M tables of data of threshold value, wherein it is natural number that M, which is less than or equal to N, M and N,;
Each of pending data table field is compared with each of M tables of data field by S2 respectively,
Obtain comparison result, wherein include each of pending data table field and each in M tables of data in comparison result
The matching degree of field;
S3, each of pending data table field and each field in M tables of data matching degree less than the
In the case where two preset thresholds, the field less than the second preset threshold is mapped in target matrix, to establish target data
Mapping relations between table and pending data table.
Optionally, in the present embodiment, above-mentioned storage medium can include but is not limited to: USB flash disk, read-only memory (Read-
Only Memory, referred to as ROM), it is random access memory (Random Access Memory, referred to as RAM), mobile hard
The various media that can store computer program such as disk, magnetic or disk.
The embodiments of the present invention also provide a kind of electronic device, including memory and processor, stored in the memory
There is computer program, which is arranged to run computer program to execute the step in any of the above-described embodiment of the method
Suddenly.
Optionally, above-mentioned electronic device can also include transmission device and input-output equipment, wherein the transmission device
It is connected with above-mentioned processor, which connects with above-mentioned processor.
Optionally, in the present embodiment, above-mentioned processor can be set to execute following steps by computer program:
S1 is selected from determining N number of tables of data and is greater than first in advance with the data item similarity in pending data table
If M tables of data of threshold value, wherein it is natural number that M, which is less than or equal to N, M and N,;
Each of pending data table field is compared with each of M tables of data field by S2 respectively,
Obtain comparison result, wherein include each of pending data table field and each in M tables of data in comparison result
The matching degree of field;
S3, each of pending data table field and each field in M tables of data matching degree less than the
In the case where two preset thresholds, the field less than the second preset threshold is mapped in target matrix, to establish target data
Mapping relations between table and pending data table.
Optionally, the specific example in the present embodiment can be with reference to described in above-described embodiment and optional embodiment
Example, details are not described herein for the present embodiment.
Obviously, those skilled in the art should be understood that each module of the above invention or each step can be with general
Computing device realize that they can be concentrated on a single computing device, or be distributed in multiple computing devices and formed
Network on, optionally, they can be realized with the program code that computing device can perform, it is thus possible to which they are stored
It is performed by computing device in the storage device, and in some cases, it can be to be different from shown in sequence execution herein
Out or description the step of, perhaps they are fabricated to each integrated circuit modules or by them multiple modules or
Step is fabricated to single integrated circuit module to realize.In this way, the present invention is not limited to any specific hardware and softwares to combine.
The foregoing is only a preferred embodiment of the present invention, is not intended to restrict the invention, for the skill of this field
For art personnel, the invention may be variously modified and varied.It is all within principle of the invention, it is made it is any modification, etc.
With replacement, improvement etc., should all be included in the protection scope of the present invention.
Claims (8)
1. a kind of processing method of tables of data characterized by comprising
It is selected from determining N number of tables of data and is greater than the first preset threshold with the data item similarity in pending data table
M tables of data, wherein it is natural number that the M, which is less than or equal to the N, the M and the N,;
Each of pending data table field is compared with each of M tables of data field respectively
Compared with obtaining comparison result, wherein include each of pending data table field and the M in the comparison result
The matching degree of each field in a tables of data;
The matching degree of each field is less than the in each of pending data table field and the M tables of data
In the case where two preset thresholds, the field for being less than second preset threshold is mapped in target matrix, described in establishing
Mapping relations between target matrix and the pending data table.
2. the method according to claim 1, wherein in each of pending data table field and institute
In the case where the matching degree of each field in M tables of data is stated less than the second preset threshold, the described second default threshold will be less than
The field of value maps in target matrix, is closed with the mapping established between the target matrix and the pending data table
System, comprising:
The matching degree of each field is less than the in each of pending data table field and the M tables of data
In the case where two preset thresholds, the corresponding data item of field and the equal mapping value of attribute information of second preset threshold will be less than
In the target matrix, with the mapping relations established between the target matrix and the pending data table.
3. the method according to claim 1, wherein being selected from determining N number of tables of data and number to be processed
It is greater than M tables of data of the first preset threshold according to the data item similarity in table, comprising:
By the first object language in the field of the first object language in the pending data table and N number of tables of data
Field is matched, and the first similarity is obtained;
The tables of data that first similarity is greater than first preset threshold is determined as the data in the M tables of data
Table.
4. the method according to claim 1, wherein each of pending data table field is distinguished
It is compared with each of M tables of data field, after obtaining comparison result, the method also includes:
It is greater than in the field in the pending data table with the fields match degree in a tables of data in the M tables of data
Or in the case where being equal to second preset threshold, one field is mapped to a data in the M tables of data
In table, to establish the mapping relations of the pending data table and one tables of data.
5. the method according to claim 1, wherein in each of pending data table field and institute
In the case where the matching degree of each field in M tables of data is stated less than the second preset threshold, the described second default threshold will be less than
The field of value maps in target matrix, is closed with the mapping established between the target matrix and the pending data table
After system, the method also includes:
Mapping relations between the target matrix and the target matrix and the pending data table are stored to mesh
It marks in database, wherein the target database is also used to store the M tables of data.
6. a kind of processing unit of tables of data characterized by comprising
First choice module, for being selected from determining N number of tables of data and the data item similarity in pending data table
Greater than M tables of data of the first preset threshold, wherein it is natural number that the M, which is less than or equal to the N, the M and the N,;
First determining module, for by each of pending data table field respectively and in the M tables of data
Each field is compared, and obtains comparison result, wherein includes every in the pending data table in the comparison result
The matching degree of each field in one field and the M tables of data;
Second determining module, in each of pending data table field and each in the M tables of data
In the case that the matching degree of field is less than the second preset threshold, the field for being less than second preset threshold is mapped into number of targets
According in table, with the mapping relations established between the target matrix and the pending data table.
7. a kind of computer-readable storage medium, which is characterized in that it is stored with computer program in the storage medium,
In, the computer program is arranged to execute method described in any one of claim 1 to 5 when operation.
8. a kind of electronic device, including memory and processor, which is characterized in that be stored with computer journey in the memory
Sequence, the processor are arranged to run the computer program to execute side described in any one of claim 1 to 5
Method.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910818833.1A CN110532273A (en) | 2019-08-30 | 2019-08-30 | The processing method and processing device of tables of data, storage medium, electronic device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910818833.1A CN110532273A (en) | 2019-08-30 | 2019-08-30 | The processing method and processing device of tables of data, storage medium, electronic device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110532273A true CN110532273A (en) | 2019-12-03 |
Family
ID=68665805
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910818833.1A Pending CN110532273A (en) | 2019-08-30 | 2019-08-30 | The processing method and processing device of tables of data, storage medium, electronic device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110532273A (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111723210A (en) * | 2020-06-29 | 2020-09-29 | 深圳壹账通智能科技有限公司 | Method and device for storing data table, computer equipment and readable storage medium |
CN112115138A (en) * | 2020-08-19 | 2020-12-22 | 第四范式(北京)技术有限公司 | Method, device and equipment for determining association relation between data tables |
CN113010517A (en) * | 2021-03-08 | 2021-06-22 | 中国工商银行股份有限公司 | Data table management method and device |
CN113688126A (en) * | 2021-08-25 | 2021-11-23 | 云从科技集团股份有限公司 | Method, system, and medium for determining mapping relationship between source data and standard data |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104657396A (en) * | 2013-11-25 | 2015-05-27 | 腾讯科技(深圳)有限公司 | Data migration method and device |
CN107784058A (en) * | 2017-04-11 | 2018-03-09 | 平安医疗健康管理股份有限公司 | Drug data processing method and processing device |
CN108389621A (en) * | 2018-02-08 | 2018-08-10 | 山东康网网络科技有限公司 | Medical record database quality determining method and system |
CN108595614A (en) * | 2018-04-20 | 2018-09-28 | 成都智信电子技术有限公司 | Tables of data mapping method applied to HIS systems |
CN108763341A (en) * | 2018-05-14 | 2018-11-06 | 中国平安人寿保险股份有限公司 | Electronic device, automatic Building table method and storage medium |
CN109344831A (en) * | 2018-08-22 | 2019-02-15 | 中国平安人寿保险股份有限公司 | A kind of tables of data recognition methods, device and terminal device |
CN109871382A (en) * | 2019-02-13 | 2019-06-11 | 北京明略软件系统有限公司 | A kind of implementation method and device of tables of data access java standard library |
CN109902083A (en) * | 2019-02-26 | 2019-06-18 | 北京明略软件系统有限公司 | Method, apparatus, computer storage medium and the terminal of a kind of pair of mark processing |
CN110019111A (en) * | 2017-08-15 | 2019-07-16 | 北京国双科技有限公司 | Data processing method, device, storage medium and processor |
-
2019
- 2019-08-30 CN CN201910818833.1A patent/CN110532273A/en active Pending
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104657396A (en) * | 2013-11-25 | 2015-05-27 | 腾讯科技(深圳)有限公司 | Data migration method and device |
CN107784058A (en) * | 2017-04-11 | 2018-03-09 | 平安医疗健康管理股份有限公司 | Drug data processing method and processing device |
CN110019111A (en) * | 2017-08-15 | 2019-07-16 | 北京国双科技有限公司 | Data processing method, device, storage medium and processor |
CN108389621A (en) * | 2018-02-08 | 2018-08-10 | 山东康网网络科技有限公司 | Medical record database quality determining method and system |
CN108595614A (en) * | 2018-04-20 | 2018-09-28 | 成都智信电子技术有限公司 | Tables of data mapping method applied to HIS systems |
CN108763341A (en) * | 2018-05-14 | 2018-11-06 | 中国平安人寿保险股份有限公司 | Electronic device, automatic Building table method and storage medium |
CN109344831A (en) * | 2018-08-22 | 2019-02-15 | 中国平安人寿保险股份有限公司 | A kind of tables of data recognition methods, device and terminal device |
CN109871382A (en) * | 2019-02-13 | 2019-06-11 | 北京明略软件系统有限公司 | A kind of implementation method and device of tables of data access java standard library |
CN109902083A (en) * | 2019-02-26 | 2019-06-18 | 北京明略软件系统有限公司 | Method, apparatus, computer storage medium and the terminal of a kind of pair of mark processing |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111723210A (en) * | 2020-06-29 | 2020-09-29 | 深圳壹账通智能科技有限公司 | Method and device for storing data table, computer equipment and readable storage medium |
CN112115138A (en) * | 2020-08-19 | 2020-12-22 | 第四范式(北京)技术有限公司 | Method, device and equipment for determining association relation between data tables |
CN113010517A (en) * | 2021-03-08 | 2021-06-22 | 中国工商银行股份有限公司 | Data table management method and device |
CN113010517B (en) * | 2021-03-08 | 2024-02-09 | 中国工商银行股份有限公司 | Data table management method and device |
CN113688126A (en) * | 2021-08-25 | 2021-11-23 | 云从科技集团股份有限公司 | Method, system, and medium for determining mapping relationship between source data and standard data |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110532273A (en) | The processing method and processing device of tables of data, storage medium, electronic device | |
CN112800095B (en) | Data processing method, device, equipment and storage medium | |
CN107196900A (en) | A kind of method and device for verification of knowing together | |
CN109871382A (en) | A kind of implementation method and device of tables of data access java standard library | |
CN107980241B (en) | Gateway multi-connection method and device | |
CN106528289B (en) | Resource operation processing method and device | |
US20150379158A1 (en) | Systems and methods for pattern matching and relationship discovery | |
CN110457704A (en) | Determination method, apparatus, storage medium and the electronic device of aiming field | |
CN109885651A (en) | A kind of question pushing method and device | |
CN110532267A (en) | Determination method, apparatus, storage medium and the electronic device of field | |
CN108154024A (en) | A kind of data retrieval method, device and electronic equipment | |
CN110472216A (en) | Determination method, apparatus, storage medium and the electronic device of field | |
CN110263226A (en) | For the database update method, apparatus and electronic device of drug | |
CN105589873B (en) | Data searching method, terminal and server | |
CN110347683B (en) | Data table merging processing method and device | |
CN110399360A (en) | The setting method and device of dictionary table, storage medium, electronic device | |
CN117093619A (en) | Rule engine processing method and device, electronic equipment and storage medium | |
CN110472205A (en) | Comparison method and device, the storage medium and electronic device of file difference | |
CN111768136A (en) | Inventory scheduling method and device | |
CN109002446B (en) | Intelligent sorting method, terminal and computer readable storage medium | |
CN112861004B (en) | Method and device for determining rich media | |
CN106844377B (en) | Processing method and device of multidimensional database | |
CN113934776A (en) | Food material pushing method, device, medium and equipment | |
CN104951550B (en) | Date storage method and device | |
CN114266291A (en) | Method and device for determining cluster set, storage medium and electronic device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20191203 |
|
RJ01 | Rejection of invention patent application after publication |