CN109101634A - Data record processing method, device, electronic equipment and storage medium - Google Patents

Data record processing method, device, electronic equipment and storage medium Download PDF

Info

Publication number
CN109101634A
CN109101634A CN201810931008.8A CN201810931008A CN109101634A CN 109101634 A CN109101634 A CN 109101634A CN 201810931008 A CN201810931008 A CN 201810931008A CN 109101634 A CN109101634 A CN 109101634A
Authority
CN
China
Prior art keywords
data record
database
matching
matching rule
value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810931008.8A
Other languages
Chinese (zh)
Other versions
CN109101634B (en
Inventor
孙大禹
刘强
魏建钟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Sankuai Online Technology Co Ltd
Original Assignee
Beijing Sankuai Online Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Sankuai Online Technology Co Ltd filed Critical Beijing Sankuai Online Technology Co Ltd
Priority to CN201810931008.8A priority Critical patent/CN109101634B/en
Publication of CN109101634A publication Critical patent/CN109101634A/en
Application granted granted Critical
Publication of CN109101634B publication Critical patent/CN109101634B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

This disclosure relates to a kind of data record processing method, device, electronic equipment and storage medium, to improve the matching efficiency of data record.The described method includes: obtaining the first database and the second database for describing same object collection;According to preset matching rule, the matching value of each data record to be matched in the first data record and second database in the first database is determined, first data record is used to describe the first object in the object set;According to the corresponding matching value of data record to be matched each in second database, the second data record for describing first object is determined from second database.

Description

Data record processing method, device, electronic equipment and storage medium
Technical field
This disclosure relates to technical field of data processing, and in particular, to a kind of data record processing method, device, electronics Equipment and storage medium.
Background technique
In the business process of each enterprise, a large amount of data, such as user data, business datum etc. can be generally generated. Over time, these data gradually roll up the data resource of enterprise.The data resource that different enterprises possess it can It can be handled in different ways, then by the data record obtained after processing storage into database, for enterprise's warp Battalion person provides reference when doing business decision.
However, as process demand of the enterprise to data resource is increasingly complicated, it is understood that there may be will be based on different from processing side Data record in the database of formula carries out matched demand.In the related technology, to based on difference by way of artificial treatment Data record in the database of processing mode is matched, and this mode matching efficiency is lower.
Summary of the invention
Purpose of this disclosure is to provide a kind of data record processing method, device, electronic equipment and storage mediums, to improve The matching efficiency of data record.
To achieve the goals above, embodiment of the present disclosure first aspect provides a kind of data record processing method, the side Method includes:
Obtain the first database and the second database for describing same object collection;
According to preset matching rule, the first data record in the first database and second database are determined In each data record to be matched matching value, first data record is used to describe first pair in the object set As;
According to the corresponding matching value of data record to be matched each in second database, from second database Middle determination is used to describe the second data record of first object.
Optionally, the second data record for describing first object is determined from second database, comprising:
The corresponding matching value of data record to be matched each in second database is ranked up;
Determine the difference between highest matching value and secondary high matching value;
In the case where the difference is greater than preset threshold, the highest data record of corresponding matching value is determined as described Second data record.
Optionally, the method also includes:
In the case where the difference is not more than the preset threshold, prompt information is exported, the prompt information is for mentioning Show that user selects a data from the high data record of the highest data record of corresponding matching value and corresponding matching value time Record;
The second data record for describing first object is determined from second database, comprising:
The data record that the user selects is determined as second data record.
Optionally, the preset matching rule includes multiple sub- matching rules;According to preset matching rule, institute is determined State the matching value of any data record to be matched in the first data record and second database in first database, packet It includes:
According to each sub- matching rule, any number to be matched in first data record and second database is determined According to the matching initial value of record;
According to the corresponding weighted value for matching initial value and each sub- matching rule of each sub- matching rule, second data are determined The corresponding matching value of the data record to be matched of this in library.
Optionally, from second database determine for describe first object the second data record it Afterwards, the method also includes:
By first data record, second data record and first data record and second data Matching relationship between record, storage to the third database for describing the object set.
Optionally, the method also includes:
When detecting the data record acquisition request for first object, institute is obtained from the third database State the first data record and/or second data record.
Optionally, the preset matching rule includes:
General matching rule, or, the dedicated matching rule based on the characteristic parameter configuration of object in the object set, or, The combination of the general matching rule and the dedicated matching rule, wherein the general matching rule includes: fuzzy matching rule Then or the combination of equivalent matching rule, or both.
Optionally, the characteristic parameter of object is geographical location in the object set;The dedicated matching rule includes longitude and latitude Spend matching rule;And/or administrative region ratings match rule.
Embodiment of the present disclosure second aspect provides a kind of data recording and processing device, and described device includes:
Module is obtained, for obtaining first database and the second database for describing same object collection;
Matching value determining module, for determining the first data in the first database according to preset matching rule The matching value of record and each data record to be matched in second database, first data record is for describing institute State the first object in object set;
Data record determining module, for according to data record to be matched corresponding each in second database With value, the second data record for describing first object is determined from second database.
Optionally, the data record determining module includes:
Sorting sub-module, for being carried out to the corresponding matching value of data record to be matched each in second database Sequence;
First determines submodule, for determining the difference between highest matching value and secondary high matching value;
Second determines submodule, is used in the case where the difference is greater than preset threshold, by corresponding matching value highest Data record be determined as second data record.
Optionally, described device further include:
Output module, it is described to mention for exporting prompt information in the case where the difference is not more than the preset threshold Show information for prompting user from the high data record of the highest data record of corresponding matching value and corresponding matching value time Select a data record;
The data record determining module includes:
Third determines submodule, and the data record for selecting the user is determined as second data record.
Optionally, the matching rule includes multiple sub- matching rules;The matching value determining module includes:
Matching initial value determines submodule, for according to each sub- matching rule, determining first data record and described the The matching initial value of any data record to be matched in two databases;
Matching value determines submodule, for according to the corresponding power for matching initial value and each sub- matching rule of each sub- matching rule Weight values determine the corresponding matching value of data record to be matched in second database.
Optionally, described device further include:
Memory module is used for first data record, second data record and first data record With the matching relationship between second data record, storage to the third database for describing the object set.
Optionally, described device further include:
Module is obtained, for when detecting the data record acquisition request for first object, from the third First data record and/or second data record are obtained in database.
Optionally, the preset matching rule includes:
General matching rule, or, the dedicated matching rule based on the characteristic parameter configuration of object in the object set, or, The combination of the general matching rule and the dedicated matching rule, wherein the general matching rule includes: fuzzy matching rule Then or the combination of equivalent matching rule, or both.
Optionally, the characteristic parameter of object is geographical location in the object set;The dedicated matching rule includes: longitude and latitude Spend matching rule and/or administrative region ratings match rule.
The embodiment of the present disclosure third aspect provides a kind of electronic equipment, including processor;It is executable for storage processor The memory of instruction;Wherein, the processor is for the step of executing above-mentioned data record processing method.
Embodiment of the present disclosure fourth aspect provides a kind of computer readable storage medium, is stored thereon with computer program and refers to It enables, described program instructs the step of realizing above-mentioned data record method when being executed by processor.
Through the above technical solutions, after obtaining multiple databases for describing same object collection, according to preset Matching rule determines the data record in a database in multiple databases and other databases in addition to the database In data record matching value, it is last according to the matching value determined, determine in multiple databases and concentrated for description object The data record of same target.In this way, realizing the data record in the multiple databases of automation matching, matched without artificial, Improve matching efficiency.
Other feature and advantage of the disclosure will the following detailed description will be given in the detailed implementation section.
Detailed description of the invention
Attached drawing is and to constitute part of specification for providing further understanding of the disclosure, with following tool Body embodiment is used to explain the disclosure together, but does not constitute the limitation to the disclosure.In the accompanying drawings:
Fig. 1 is a kind of flow chart for data record processing method that the embodiment of the present disclosure provides.
Fig. 2 is a kind of another flow chart for data record processing method that the embodiment of the present disclosure provides.
Fig. 3 is a kind of schematic diagram for data recording and processing device that the embodiment of the present disclosure provides.
Fig. 4 is a kind of another schematic diagram for data recording and processing device that the embodiment of the present disclosure provides.
Fig. 5 is the block diagram for a kind of electronic equipment that the embodiment of the present disclosure provides.
Specific embodiment
It is described in detail below in conjunction with specific embodiment of the attached drawing to the disclosure.It should be understood that this place is retouched The specific embodiment stated is only used for describing and explaining the disclosure, is not limited to the disclosure.
The embodiment of the present disclosure provides a kind of data record processing method, and this method is being obtained for describing same object collection After multiple databases, according to preset matching rule, determine data record in a database in multiple databases with The matching value of the data record in other databases in addition to the database, last according to the matching value determined, determination is more The data record of same target is concentrated in a database for description object.In this way, realizing automation matches multiple databases In data record improve matching efficiency without artificial matching.
It is described in detail below with reference to the data record processing method that specific embodiment provides the embodiment of the present disclosure.
With reference to Fig. 1, Fig. 1 is a kind of flow chart for data record processing method that the embodiment of the present disclosure provides, such as Fig. 1 institute Show, method includes the following steps:
Step S11: the first database and the second database for describing same object collection are obtained;
Step S12: according to preset matching rule, the first data record in the first database and described the are determined The matching value of each data record to be matched in two databases, first data record is for describing in the object set First object;
Step S13: according to the corresponding matching value of data record to be matched each in second database, from described The second data record for describing first object is determined in two databases.
Wherein, object set is the set of multiple objects, and multiple objects in object set belong to same type, having the same The value of characteristic parameter, the characteristic parameter that the different objects in object set have is different.Illustratively, object set is the set in city, It include: each city such as Beijing, Shanghai, Chengdu, each city characteristic parameter having the same in object set, comprising: title, Definition, postcode, longitude and latitude, administrative region grade etc..The title of different cities, definition, postcode, longitude and latitude, administration in object set Area grade is different.Illustratively, the title in this city of Beijing is: Beijing, definition are: the Chinese capital, postcode are: 100000, The value of longitude and latitude is: N39 ° 54 ' 11.97 of north latitude " E116 ° 24 ' 3.52 of east longitude ", administrative region grade is: municipality directly under the Central Government;Chengdu this The title in one city is: Chengdu, definition are: the maximum provincial capital in southwest, postcode is: 610000, the value of longitude and latitude is: north latitude N30 ° 34 ' 21.63 " E104 ° 03 ' 44.20 of east longitude ", administrative region grade is: saving (belonging to Sichuan Province).
Database is the set of data record, and a data record is used for the object that description object is concentrated.
Illustratively, a data record in first database is as follows:
Title: Chengdu, definition: the maximum provincial capital in southwest, postcode: 610000, the value of longitude and latitude: N30 ° 34 ' of north latitude 21.63 " E104 ° 03 ' 44.20 of east longitudes " administrative region grade: save (Chengdu belongs to Sichuan Province).
Illustratively, a data record in the second database is as follows:
Title: Chengdu (CD), definition: a southwestern provincial capital, postcode well-known with the slow rhythm that lies fallow: 610000, warp The value of latitude: it N30 ° 34 ' 21.63 of north latitude " E104 ° 03 ' 44.20 of east longitude ", administrative region grade: saves (belonging to Sichuan Province).
In practical application scene, there may be multiple databases and describe identical object set.Illustratively, object set It is the set in city, the database of multiple electric business classes enterprise is used to describe the object set;For another example, object set is the collection of shops It closes, multiple databases for taking out class enterprise are used to describe the object set.
It is understood that the quantity for the database that describes same object collection may be it is multiple, it is multiple in order to match Data record in database, can be using a database in multiple databases as first database, by multiple databases In other any databases in addition to the database (i.e. first database) as the second database, be based on this, execute this public affairs The data record processing method of embodiment offer is provided, to realize the matching of the data record in two databases, executes sheet repeatedly The data record processing method that open embodiment provides, and then the matching of the data record in multiple databases may be implemented.
For, for describing the inconsistent situation of the data record of same target, the embodiment of the present disclosure mentions in multiple databases Out according to preset matching rule, the data record in disparate databases is matched.The preset matching rule includes: General matching rule, or, the dedicated matching rule based on the characteristic parameter configuration of object in the object set, or, described general The combination of matching rule and the dedicated matching rule, wherein the general matching rule includes: fuzzy matching rule or waits It is worth the combination of matching rule, or both.Optionally, the characteristic parameter of object is geographical location in the object set;It is described dedicated Matching rule includes: longitude and latitude matching rule and/or administrative region ratings match rule.
In the embodiment of the present disclosure, general matching rule is suitable for the data record the database for describing any object set It is matched.
Illustratively, object set is the set in city, and it is right that the database of enterprise A and the database of enterprise B are used to describe this As collection can in the database from enterprise B during the determining data record with a data record matching in enterprise A Using by general matching rule as preset matching rule;Object set is the set of shops, the database of enterprise C and enterprise D's Database is used to describe the object set, a determining data record matching with enterprise C in the database from enterprise D It, equally can be using general matching rule as preset matching rule during data record.
General matching rule includes: the combination of fuzzy matching rule or equivalent matching rule, or both.Fuzzy matching rule Then suitable for data record text type data item, the data of equivalent matching rule value type suitable for data record ?.If data record only includes the data item of text type, fuzzy matching rule can only be selected to advise as general matching Then;Similarly, if data record only includes the data item of value type, it can only select equivalent matching rule as general With rule;It similarly, can be with if data record had not only included the data item of text type but also included the data item of value type It regard fuzzy matching rule and equivalent matching rule as general matching rule.
Illustratively, a data record in first database is as follows: title: Chengdu, definition: the maximum provincial capital city in southwest City, postcode: 610000.A data record in second database is as follows: title: Chengdu (CD), definition: a southwestern province Meeting city, postcode: 610000 well-known with the slow rhythm that lies fallow.Due to Name and Description, the two data item are to belong to text type Data item, this data item of postcode is to belong to the data item of value type, so by fuzzy matching rule and equivalent matching rule Then it is used as general matching rule.
In the embodiment of the present disclosure, database of the dedicated matching rule (or persona rules) for description special object collection, root According to the characteristic parameter of the object in object set described in database, (characteristic parameter of object can refer to explanation above, herein Repeat no more), the matching rule specially established.For the database for describing different object sets, due to object in different object sets Characteristic parameter it is different, so the dedicated matching rule of each self application of the database for describing different object sets is different.
Illustratively, object set is the set in city, each city characteristic parameter having the same in object set, comprising: Title, description, postcode, longitude and latitude, administrative region grade etc., establishing and being directed to the dedicated matching rule of the object set includes: longitude and latitude Matching rule and administrative region ratings match rule are spent, the database of enterprise A and the database of enterprise B are used to describe the object Collection determines that the matching rule between the database of enterprise A and the database of enterprise B is longitude and latitude matching rule and administrative region etc. Grade matching rule.
Illustratively, object set is the set of shops, each city characteristic parameter having the same in object set, comprising: POI (Point of Interest, point of interest, each POI include four aspect information, title, classification, coordinate, classification), businessman Telephone number, contact person, brand etc., establishing and being directed to the dedicated matching rule of the object set includes: POI matching rule, Shang Jialian It is phone matching rule, contact person's matching rule and brand matching rule, the database of enterprise C and the database of enterprise D are equal For describing the object set, the matching rule between the database of enterprise C and the database of enterprise D is determined are as follows: POI matching rule Then, business contact phone matching rule, contact person's matching rule and brand matching rule.
As it can be seen that since the database of enterprise A and the database of enterprise B are used to the set in description city, and the number of enterprise C According to the database in library and enterprise D be used to description shops set, due to city characteristic parameter and shops characteristic parameter not Together, lead to the dedicated matching rule being applicable between the database of enterprise A and the database of enterprise B, different from the database of enterprise C The dedicated matching rule being applicable between the database of enterprise D.
In one embodiment, general matching rule and dedicated matching rule can be combined, i.e., preset matching rule It then include general matching rule and dedicated matching rule, on the one hand increasing due to matching rule quantity, matching accuracy is substantially It improves;On the other hand, with the difference of object described in data record, different dedicated matching rules can be set, improve Data record matched flexibility.
Illustratively, the database of enterprise A and the database of enterprise B are used to the set in description city, in the number from enterprise B During according to the data record with a data record matching in enterprise A determining in library, by following rule as preset Matching rule:
1) general matching rule, it is regular (definition for title and city to city matches) including fuzzy matching It (is matched for the postcode to city) with equivalent matching rule;
2) dedicated matching rule, including longitude and latitude matching rule (value for the longitude and latitude to city matches) and row Administrative division domain ratings match is regular (matching for the administrative region grade to city).
In the embodiment of the present disclosure, although first database and the second database are used to describe identical object set, Object described in a data record (such as: the first data record) in first database (such as: the first object), It is that the object is described in which data record in two databases, is unknown, thus needs using above-mentioned preset With rule, the first data record in first database is compared one by one with each data record to be matched in the second database Compared with, determine the matching value of each data record to be matched in the first data record and the second database, it is then several from second According to the data record (i.e. the second data record) determined in library for describing the first object.
Wherein, data record to be matched refers to the data record of non-successful match.Illustratively, it is counted in first time from second According to the process with a data record (such as first data record) matched data record in first database determining in library In, data record to be matched is data record all in the second database, is being determined and first from the second database After the matched data record of data record is the second data record, the second data record is the data record of successful match, For the second time from another data record (such as third data record) with first database determining in the second database During the data record matched, data record to be matched is remaining in addition to the second data record in the second database Data record.
Optionally, the second data record for describing first object is determined from second database, comprising:
The corresponding matching value of data record to be matched each in second database is ranked up;
Determine the difference between highest matching value and secondary high matching value;
In the case where the difference is greater than preset threshold, the highest data record of corresponding matching value is determined as described Second data record;
In the case where the difference is not more than the preset threshold, prompt information is exported, the prompt information is for mentioning Show that user selects a data from the high data record of the highest data record of corresponding matching value and corresponding matching value time Record;The data record that the user selects is determined as second data record.
It, can be using the highest data record of matching value corresponding in the second database as with first in the embodiment of the present disclosure The data record (i.e. the second data record) of record matching, the data record are in the second database for describing the first object Data record.
Alternatively, can be ranked up to the corresponding matching value of each data record to be matched in the second database, really Determine highest matching value and time high matching value, then determine the difference of the two, if the difference of the two is greater than preset threshold, illustrates The corresponding highest data record of matching value and the secondary high data record difference of corresponding matching value are obvious, so will directly correspond to The highest data record of matching value as the second data record;If the difference of the two is not more than preset threshold, illustrate pair The highest data record of the matching value answered and the secondary high data record difference of corresponding matching value are faint, and two data records have It may be to avoid judging by accident to improve matching accuracy with the matched data record of the first data record, it in the case can be with User is prompted manually to select a data record from the two data records, the data record for then selecting user is as the Two data records.
It is understood that the quantity of the data record in first database may be it is multiple, can be by first database In a data record as the first data record, by all data record ratios in the first data record and the second database Compared with execution step S12-S13, until the determining and matched data record of the first data record (i.e. second from the second database Data record), in this way, completing of the first data record in first database and the second data record in the second database Match.It similarly, will using a data record in first database in addition to the first data record as the first new data record The first new data record executes step S12- compared with the remaining data record in the second database in addition to the second data record S13, until the determining and new matched data record of the first data record from the second database.
In one embodiment, the preset matching rule includes multiple sub- matching rules;Correspondingly, step S12 Include:
According to each sub- matching rule, of the data record in first data record and second database is determined With initial value;
According to the corresponding weighted value for matching initial value and each sub- matching rule of each sub- matching rule, the matching value is obtained.
In the embodiment of the present disclosure, preset matching rule may be general matching rule, it is also possible to dedicated matching rule, It or may be the combination of general matching rule and dedicated matching rule.The quantity of general matching rule may be it is multiple, it is dedicated The quantity of matching rule be also likely to be it is multiple, in this way, the quantity of preset matching rule be it is multiple, each matching rule be one Sub- matching rule.
Illustratively, preset matching rule includes four sub- matching rules: 1. fuzzy matching is regular, 2. equivalent matching is advised Then, 3. longitude and latitude matching rule and 4. administrative region ratings match rule.
Different sub- matching rules is to be assessed between two data records from disparate databases from different angles Matching degree, thus when determining the matching value between two data records from disparate databases, it is necessary to it is different Sub- matching rule assigns different weighted values, and the size of the weighted value of each sub- matching rule can be default, be also possible to It is determined according to the confidence level of every sub- matching rule, the confidence level of a sub- matching rule can pass through Neural Network Science acquistion It arrives.
For every sub- matching rule, the first record in first database and some record in the second database are determined Matching initial value, in this way, obtaining multiple matching initial values using multiple sub- matching rules.Then, by each matching initial value and this Weighted value multiplication with the sub- matching rule that initial value is based on, obtains product, and corresponding multiple sub- matching rules obtain multiple multiply The first record in first database and some record in the second database can be obtained finally by multiple product additions in product Matching value.
Illustratively, preset matching rule includes four sub- matching rules: 1. fuzzy matching is regular, 2. equivalent matching is advised Then, 3. longitude and latitude matching rule and 4. administrative region ratings match rule, weighted value is respectively a1, a2, a3 and a4.For Data record b in data record a in the database of enterprise A and the database of enterprise B is determined using fuzzy matching rule Matching initial value be Score1;Using equivalent matching rule, determining matching initial value is Score2;It matches and advises using longitude and latitude Then, determining matching initial value is Score3;Using administrative region ratings match rule, determining matching initial value is Score4.Then The matching value Score=between data record b in data record a in the database of enterprise A and the database of enterprise B Score1*a1+Score2*a2+Score3*a3+Score4*a4。
It is completely illustrated and how to be counted from first with first database determining in the second database with one below According to the data record of record matching.
The database of enterprise A and the database of enterprise B are used to the set in description city, and preset matching rule includes four A sub- matching rule: 1. fuzzy matching is regular, 2. equivalent matching rule, 3. longitude and latitude matching rule and 4. administrative region grade Matching rule, weighted value are respectively a1, a2, a3 and a4.
Data record a in the database of enterprise A is as follows:
Title: Chengdu, definition: the maximum provincial capital in southwest, postcode: 610000, the value of longitude and latitude: N30 ° 34 ' of north latitude 21.63 " E104 ° 03 ' 44.20 of east longitudes " administrative region grade: save (belonging to Sichuan Province).
Data record b in the database of enterprise B is as follows:
Title: Chengdu (CD), definition: a southwestern provincial capital, postcode well-known with the slow rhythm that lies fallow: 610000, warp The value of latitude: it N30 ° 34 ' 21.63 of north latitude " E104 ° 03 ' 44.20 of east longitude ", administrative region grade: saves (belonging to Sichuan Province).
Data record b1 in the database of enterprise B is as follows:
Title: Beijing, definition: the Chinese capital, postcode: 100000, the value of longitude and latitude: N39 ° 54 ' 11.97 " east longitudes of north latitude E116 ° 24 ' 3.52 ", administrative region grade: municipality directly under the Central Government.
For the data record b and data note in the database of data record a and enterprise B in the database of enterprise A B1 is recorded, using fuzzy matching rule, determining matching initial value is Score1 and Score1 respectively ';Using equivalent matching rule, really Fixed matching initial value is Score2 and Score2 respectively ' (Score2 ' is zero, because the postcode in Chengdu and Pekinese's postcode are not Together);Using longitude and latitude matching rule, determining matching initial value is Score3 and Score3 ';It is advised using administrative region ratings match Then, determining matching initial value is Score4 and Score4 ' (Score4 ' is zero, because administrative region grade mismatches, is saved and straight Linchpin city is different two administrative region grades).Then in the data record a in the database of enterprise A and the database of enterprise B Matching value Score=Score1*a1+Score2*a2+Score3*a3+Score4*a4 between data record b;The number of enterprise A According to the matching value Score ' between the data record b1 in the data record a in library and the database of enterprise B=Score1 ' * a1+ Score2’*a2+Score3’*a3+Score4’*a4。
Compare Score and Score ', since Score is greater than Score ', then the data record b phase in the database of enterprise B It is matched compared with for data record b1 with the data record a in the database of enterprise A.
As shown in Fig. 2, Fig. 2 is a kind of another flow chart for data record processing method that the embodiment of the present disclosure provides.Ginseng Fig. 2 is examined, in one embodiment, the data record processing method that the embodiment of the present disclosure provides is further comprising the steps of:
Step S14: by first data record, second data record and first data record with it is described Matching relationship between second data record is stored to third database.
Optionally, as shown in Fig. 2, the data record processing method of embodiment of the present disclosure offer is further comprising the steps of:
Step S15: when detecting the data record acquisition request for first object, from the third database Middle acquisition first data record and/or second data record.
In the embodiment of the present disclosure, it is contemplated that be integrated into the data record in the database based on Different treatments And the demand of new database is obtained, it proposes after matching the data record in disparate databases one by one, by what is matched Matching relationship storage between two or more data records and each data record to match is (different to third database In another database of first database and the second database), the data in fusion and unified multiple databases are realized with this The purpose of record, hereafter if detecting that the data record acquisition for the object (such as: the first object) in the object set is asked It asks, third database can be called, the data record for describing the object is read from third database, it can be only from third The first data record is read in database, the second data record can also be only read from third database, alternatively, from third number According to reading the first data record and the second data record in library.
Illustratively, data record a in the database of the data record b in the database for determining enterprise B and enterprise A With later, a data record is created, which includes: data record a and data record b and between the two Matching relationship.Then newly-built data record is stored into another database.
It hereafter, can be from this if it is how to describe this city of Chengdu that user, which wants to know in the database of enterprise B, Newly-built data record is obtained in another database, and then extracts data record b, similarly, if user wants to know enterprise It is how to describe this city of Chengdu in the database of industry A, newly-built data note can be obtained from another database Record, and then extract data record a.
Alternatively, if user has known with data record b in the database of enterprise B, in the database for wondering enterprise A With the matched data record of data record b which is, newly-built data record can be obtained from another above-mentioned database, And then extract the matching relationship between data record b and data record a.
Alternatively, when need and meanwhile will be described as each data record in all this cities in real time with other data fusions when, Newly-built data record can be obtained from another above-mentioned database, and then disposably extracts data record a and data note Record b, and in real time with other data fusions.
Based on the same inventive concept, the embodiment of the present disclosure also provides a kind of data recording and processing device.It is with reference to Fig. 3, Fig. 3 The schematic diagram for the data recording and processing device that the embodiment of the present disclosure provides.As shown in figure 3, the data note that the embodiment of the present disclosure provides Recording processing unit 300 includes:
Module 301 is obtained, for obtaining first database and the second database for describing same object collection;
Matching value determining module 302, for according to preset matching rule, determining the first number in the first database According to the matching value of record and each data record to be matched in second database, first data record is for describing The first object in the object set;
Data record determining module 303, for corresponding according to data record to be matched each in second database Matching value, the second data record for describing first object is determined from second database.
Optionally, the data record determining module includes:
Sorting sub-module, for being carried out to the corresponding matching value of data record to be matched each in second database Sequence;
First determines submodule, for determining the difference between highest matching value and secondary high matching value;
Second determines submodule, is used in the case where the difference is greater than preset threshold, by corresponding matching value highest Data record be determined as second data record.
Optionally, described device further include:
Output module, it is described to mention for exporting prompt information in the case where the difference is not more than the preset threshold Show information for prompting user from the high data record of the highest data record of corresponding matching value and corresponding matching value time Select a data record;
The data record determining module includes:
Third determines submodule, and the data record for selecting the user is determined as second data record.
Optionally, the matching rule includes multiple sub- matching rules;The matching value determining module includes:
Matching initial value determines submodule, for according to each sub- matching rule, determining first data record and described the The matching initial value of any data record to be matched in two databases;
Matching value determines submodule, for according to the corresponding power for matching initial value and each sub- matching rule of each sub- matching rule Weight values determine the corresponding matching value of data record to be matched in second database.
Optionally, Fig. 4 is the schematic diagram for the data recording and processing device that the embodiment of the present disclosure provides.As shown in figure 4, this public affairs The data recording and processing device 300 of embodiment offer is provided further include:
Memory module 304, for remembering first data record, second data record and first data Matching relationship between record and second data record, storage to the third database for describing the object set.
Optionally, as shown in figure 4, the data recording and processing device 300 that the embodiment of the present disclosure provides further include:
Module 305 is obtained, for when detecting the data record acquisition request for first object, from described the First data record and/or second data record are obtained in three databases.
Optionally, the preset matching rule includes:
General matching rule, or, the dedicated matching rule based on the characteristic parameter configuration of object in the object set, or, The combination of the general matching rule and the dedicated matching rule, wherein the general matching rule includes: fuzzy matching rule Then or the combination of equivalent matching rule, or both.
Optionally, the characteristic parameter of object is geographical location in the object set;The dedicated matching rule includes: longitude and latitude Spend matching rule and/or administrative region ratings match rule.
It should be noted that wherein modules have executed the concrete mode of operation about the device in above-described embodiment It is described in detail in the embodiment of the method, no detailed explanation will be given here.
Fig. 5 is the block diagram for a kind of electronic equipment that the embodiment of the present disclosure provides.For example, electronic equipment 100 can be provided For a data processing server.Referring to Fig. 5, electronic equipment 100 includes processor 1122, and quantity can be one or more, And memory 1132, for storing the computer program that can be executed by processor 1122.The calculating stored in memory 1132 Machine program may include it is one or more each correspond to one group of instruction module.In addition, processor 1122 can be with It is configured as executing the computer program, to execute above-mentioned data record processing method.
In addition, electronic equipment 100 can also include power supply module 1126 and communication component 1150, which can To be configured as executing the power management of electronic equipment 100, which, which can be configured as, realizes electronic equipment 100 Communication, for example, wired or wireless communication.In addition, the electronic equipment 100 can also include input/output (I/O) interface 1158.Electronic equipment 100 can be operated based on the operating system for being stored in memory 1132, such as WindowsServerTM, MacOSXTM, UnixTM, LinuxTM etc..
In a further exemplary embodiment, a kind of computer readable storage medium including program instruction is additionally provided, it should The step of above-mentioned data record processing method is realized when program instruction is executed by processor.For example, the computer-readable storage Medium can be the above-mentioned memory 1132 including program instruction, and above procedure instruction can be by the processor of electronic equipment 100 1122 execute to complete above-mentioned data record processing method.
The preferred embodiment of the disclosure is described in detail in conjunction with attached drawing above, still, the disclosure is not limited to above-mentioned reality The detail in mode is applied, in the range of the technology design of the disclosure, a variety of letters can be carried out to the technical solution of the disclosure Monotropic type, these simple variants belong to the protection scope of the disclosure.In addition, it is necessary to explanation, in above-mentioned specific embodiment party Each particular technique feature described in formula can be combined in any appropriate way in the case of no contradiction, In order to avoid unnecessary repetition, no further explanation will be given to various combinations of possible ways for the disclosure.

Claims (11)

1. a kind of data record processing method, which is characterized in that the described method includes:
Obtain the first database and the second database for describing same object collection;
According to preset matching rule, determine each in the first data record and second database in the first database The matching value of a data record to be matched, first data record are used to describe the first object in the object set;
According to the corresponding matching value of data record to be matched each in second database, from second database really Determine the second data record for describing first object.
2. the method according to claim 1, wherein determining from second database for describing described the Second data record of an object, comprising:
The corresponding matching value of data record to be matched each in second database is ranked up;
Determine the difference between highest matching value and secondary high matching value;
In the case where the difference is greater than preset threshold, the highest data record of corresponding matching value is determined as described second Data record.
3. according to the method described in claim 2, it is characterized by further comprising:
In the case where the difference is not more than the preset threshold, prompt information is exported, the prompt information is used for prompting Family selects a data record from the high data record of the highest data record of corresponding matching value and corresponding matching value time;
The second data record for describing first object is determined from second database, comprising:
The data record that the user selects is determined as second data record.
4. the method according to claim 1, wherein the preset matching rule includes multiple sub- matching rule Then;According to preset matching rule, determines and appoint in the first data record and second database in the first database The matching value of one data record to be matched, comprising:
According to each sub- matching rule, determine that any data to be matched are remembered in first data record and second database The matching initial value of record;
According to the corresponding weighted value for matching initial value and each sub- matching rule of each sub- matching rule, determine in second database The corresponding matching value of data record to be matched.
5. the method according to claim 1, wherein described for describing being determined from second database After second data record of the first object, the method also includes:
By first data record, second data record and first data record and second data record Between matching relationship, storage is to the third database for describing the object set.
6. according to the method described in claim 5, it is characterized in that, the method also includes:
When detecting the data record acquisition request for first object, described the is obtained from the third database One data record and/or second data record.
7. the method according to claim 1, wherein the preset matching rule includes:
General matching rule, or, the dedicated matching rule based on the characteristic parameter configuration of object in the object set, or, described The combination of general matching rule and the dedicated matching rule, wherein the general matching rule include: fuzzy matching rule, Or the combination of equivalent matching rule, or both.
8. the method according to the description of claim 7 is characterized in that the characteristic parameter of object is geographical position in the object set It sets;The dedicated matching rule includes: longitude and latitude matching rule and/or administrative region ratings match rule.
9. a kind of data recording and processing device, which is characterized in that described device includes:
Module is obtained, for obtaining first database and the second database for describing same object collection;
Matching value determining module, for determining the first data record in the first database according to preset matching rule With the matching value of data record to be matched each in second database, it is described right that first data record is used to describe As the first object of concentration;
Data record determining module, for according to the corresponding matching of data record to be matched each in second database Value, determines the second data record for describing first object from second database.
10. a kind of electronic equipment characterized by comprising
Processor;
Memory for storage processor executable instruction;
Wherein, the processor is for executing such as the step of method described in any item of the claim 1 to 8.
11. a kind of computer readable storage medium, is stored thereon with computer program instructions, which is characterized in that described program refers to The step of any one of claims 1 to 8 the method is realized when order is executed by processor.
CN201810931008.8A 2018-08-15 2018-08-15 Data recording processing method, device, electronic equipment and storage medium Active CN109101634B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810931008.8A CN109101634B (en) 2018-08-15 2018-08-15 Data recording processing method, device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810931008.8A CN109101634B (en) 2018-08-15 2018-08-15 Data recording processing method, device, electronic equipment and storage medium

Publications (2)

Publication Number Publication Date
CN109101634A true CN109101634A (en) 2018-12-28
CN109101634B CN109101634B (en) 2021-06-11

Family

ID=64849999

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810931008.8A Active CN109101634B (en) 2018-08-15 2018-08-15 Data recording processing method, device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN109101634B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5991758A (en) * 1997-06-06 1999-11-23 Madison Information Technologies, Inc. System and method for indexing information about entities from different information sources
CN107145574A (en) * 2017-05-05 2017-09-08 恒生电子股份有限公司 database data processing method, device and storage medium and electronic equipment
CN107291951A (en) * 2017-07-24 2017-10-24 北京都在哪智慧城市科技有限公司 Data processing method, device, storage medium and processor

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5991758A (en) * 1997-06-06 1999-11-23 Madison Information Technologies, Inc. System and method for indexing information about entities from different information sources
CN107145574A (en) * 2017-05-05 2017-09-08 恒生电子股份有限公司 database data processing method, device and storage medium and electronic equipment
CN107291951A (en) * 2017-07-24 2017-10-24 北京都在哪智慧城市科技有限公司 Data processing method, device, storage medium and processor

Also Published As

Publication number Publication date
CN109101634B (en) 2021-06-11

Similar Documents

Publication Publication Date Title
US11740102B2 (en) Method, apparatus, device and storage medium for determining point of interest area
US8976266B2 (en) Picture locating method and system based on navigation function of mobile terminal
CN107341220B (en) Multi-source data fusion method and device
CN110245980B (en) Method and equipment for determining target user excitation form based on neural network model
CN109084795B (en) Method and device for searching service facilities based on map service
US10783874B2 (en) Method and apparatus for providing voice feedback information to user in call
CN104537106B (en) Searching method and device based on electronic map
CN105160173B (en) Safety evaluation method and device
CN109062914A (en) User's recommended method and device, storage medium and server
CN108712712A (en) Wireless Fidelity WiFi network related information display methods and device
CN107426693A (en) Localization method and device
CN107038589B (en) A kind of entity information verification method and device
CN110245145A (en) Structure synchronization method and apparatus of the relevant database to Hadoop database
CN110263022A (en) Hotel's data matching method and device
CN105678129A (en) Method and device for determining user identity information
CN111625638B (en) Question processing method, device, equipment and readable storage medium
CN106658666A (en) Method and device for building wireless connection
WO2017128685A1 (en) Transaction processing method and transaction system
CN104573132B (en) Song lookup method and device
CN112069416B (en) Cross-social network user identity recognition method based on community discovery
CN109345081A (en) A kind of collecting method, device and electronic equipment
CN109101634A (en) Data record processing method, device, electronic equipment and storage medium
CN111815467A (en) Auditing method and device
Abe et al. A life log collecting system supported by smartphone to model higher-level human behaviors
CN109815121A (en) Interface automatic test cases generation method and relevant device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant