CN112328633A - Measurement data attribute intelligent matching and identifying module for database construction - Google Patents

Info

Publication number
CN112328633A
Authority
CN
China
Prior art keywords
data
matched
database
matching
attribute
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011107677.7A
Other languages
Chinese (zh)
Inventor
万少华
姚路明
王强
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Anhui Jingyi Surveying And Mapping Co ltd
Original Assignee
Anhui Jingyi Surveying And Mapping Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2020-10-16
Filing date
2020-10-16
Publication date
2021-02-05
Application filed by Anhui Jingyi Surveying And Mapping Co ltd
Priority to CN202011107677.7A
Publication of CN112328633A
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2455 Query execution
    • G06F16/24553 Query execution of query operations
    • G06F16/2457 Query processing with adaptation to user needs
    • G06F16/24578 Query processing with adaptation to user needs using ranking
    • G06F16/27 Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor

Abstract

The invention relates to the technical field of database construction, and in particular to a measurement data attribute intelligent matching and identifying module for database construction. The module selects attribute data to be matched as a matching reference object and retrieves the corresponding database attribute data from a local database, which avoids comparing the attribute data to be matched against a large amount of database data one by one and thereby improves matching and identification efficiency. It also calculates a weighted value of the similarity of the attribute data to be matched and judges it against a threshold value, which facilitates the accurate expansion of the local database.

Description

Measurement data attribute intelligent matching and identifying module for database construction
Technical Field
The invention relates to the technical field of database construction, and in particular to a measurement data attribute intelligent matching and identifying module for database construction.
Background
A database is a warehouse that organizes, stores, and manages data according to a data structure. The concept emerged more than sixty years ago. With the development of information technology and markets, particularly since the 1990s, data management is no longer merely the storage and management of data but has turned into the various forms of data management that users require. Databases come in many types, ranging from the simplest tables that store various kinds of data to large database systems capable of mass data storage.
When an existing database is used to analyze network data, data attribute matching and attribute identification often need to be carried out quickly. In the matching process, the data to be matched are usually compared one by one against a large amount of data in the database, so matching efficiency is low; in addition, the accuracy of the matching and identification process is low, which hinders the accurate expansion of the local database.
Disclosure of Invention
The invention mainly aims to provide a measurement data attribute intelligent matching and identifying module for database construction, which can effectively solve the problems in the background technology.
In order to achieve the purpose, the invention adopts the technical scheme that:
A measurement data attribute intelligent matching and identifying module for database construction comprises data to be matched, a local database and a matching database. The data to be matched comprise attribute data to be matched, which consist of a plurality of attribute data, and the local database comprises a plurality of database attribute data. Similarity calculation and weighted value calculation are performed on the data to be matched, the weighted value is compared with a threshold value, the data to be matched are imported into the matching database, and the matching database is imported into the local database.
Preferably, a plurality of database attribute data are acquired from the local database; the attribute data to be matched are added to an object queue and arranged in the object queue according to the attribute data; attribute data to be matched are selected from the object queue as a matching reference object; and the database attribute data corresponding to the attribute data to be matched are acquired from the local database according to the attribute data of the matching reference object.
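The queue-based selection described above might be sketched as follows. This is a minimal illustration, not the patented implementation: the record layout (dictionaries with hypothetical `attribute` and `value` keys), the sort key, and the function names are all assumptions, since the source does not say how the queue is ordered or how the local database is queried.

```python
from collections import deque


def build_object_queue(attrs_to_match):
    """Queue the attribute records, ordered by attribute name (assumed ordering)."""
    return deque(sorted(attrs_to_match, key=lambda rec: rec["attribute"]))


def fetch_candidates(local_db, reference):
    """Return only the local-database records that share the reference's attribute."""
    return [rec for rec in local_db if rec["attribute"] == reference["attribute"]]


# Hypothetical sample data.
to_match = [
    {"attribute": "elevation", "value": 12.4},
    {"attribute": "area", "value": 530.0},
]
local_db = [
    {"attribute": "area", "value": 528.7},
    {"attribute": "elevation", "value": 12.5},
    {"attribute": "slope", "value": 3.1},
]

queue = build_object_queue(to_match)
reference = queue.popleft()          # matching reference object
candidates = fetch_candidates(local_db, reference)
```

Filtering by the reference object first means later similarity calculations only touch the candidate records rather than the whole local database, which is the efficiency gain the text claims.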
Preferably, the similarity of the data to be matched is weighted, and the weighted value is compared with a threshold value; if the weighted value is smaller than the threshold value, no match is made; if the weighted value is larger than the threshold value, a match is made. The weighted average value is obtained by multiplying each attribute similarity by its corresponding weight, summing the products to obtain an overall value, and then dividing the overall value by the total number of units.
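The weighted value could be computed along the following lines. This sketch assumes that "the total number of units" means the sum of the weights, as in an ordinary weighted average; the source does not define the term, and the function names are hypothetical. The source also does not say what happens when the weighted value equals the threshold value, so the sketch treats equality as no match.

```python
def weighted_similarity(similarities, weights):
    """Weighted average of per-attribute similarities.

    Assumption: the divisor ("total number of units") is the sum of the weights.
    """
    if len(similarities) != len(weights):
        raise ValueError("one weight is required per similarity")
    total_weight = sum(weights)
    if total_weight == 0:
        raise ValueError("weights must not sum to zero")
    overall = sum(s * w for s, w in zip(similarities, weights))
    return overall / total_weight


def is_match(similarities, weights, threshold):
    """A match requires the weighted value to be strictly larger than the threshold."""
    return weighted_similarity(similarities, weights) > threshold
```

For example, similarities of 1.0 and 0.5 with weights 3 and 1 give (1.0 × 3 + 0.5 × 1) / 4 = 0.875, which is a match for a threshold of 0.8.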
Preferably, the workflow for determining a match is as follows:
S1, first importing the data to be matched and determining a plurality of attributes of the data to be matched, so as to obtain the attribute data to be matched;
S2, determining a plurality of attributes of the local database to obtain the database attribute data, and extracting the database attribute data;
S3, judging whether the set of database attribute data and the set of attribute data to be matched are empty sets; if they are not empty sets, calculating an attribute similarity weighted value; if the attribute data set is an empty set, proceeding to the next step and judging whether the set of attribute data to be matched and the matching data set are empty sets;
S4, calculating a reference threshold value;
S5, judging whether the attribute similarity weighted value is larger than the threshold value; if so, a match is made and the data to be matched are imported into the matching database; otherwise no match is made and the data to be matched are stored in the doubt database.
Preferably, in step S3, if the matching data set and the set of attribute data to be matched are empty sets, the matching ends; if they are not empty sets, the attribute data to be matched are imported into the local database.
Preferably, a measurement data attribute intelligent matching and identifying device for database construction comprises:
a data acquisition module, used for collecting the attribute data to be matched and the database attribute data;
a first processing module, used for judging whether the attribute data set is an empty set;
a second processing module, used for calculating a weighted value of the attribute similarity according to the attribute data to be matched and the database attribute data;
a third processing module, used for judging whether the matching data set is an empty set;
a controller, used for judging the relationship between the weighted value of the attribute similarity and the threshold value and determining a matching result;
a memory, used for storing data to be matched that were not matched;
and an importing module, used for importing the successfully matched attribute data to be matched into the matching database and finally importing the matching database into the local database.
Compared with the prior art, the invention has the following beneficial effects:
The data to be matched are imported, attribute data to be matched are selected as the matching reference object, and the database attribute data corresponding to the attribute data to be matched are acquired from the local database according to the attribute data of the matching reference object. This avoids comparing the attribute data to be matched against a large amount of database data one by one, so matching and identification efficiency is improved. In addition, the weighted value of the similarity of the attribute data to be matched is calculated and judged against a threshold value, so the accuracy of the matching process can be controlled, which facilitates the accurate expansion of the local database.
Drawings
FIG. 1 is a flowchart of the steps of the measurement data attribute intelligent matching and identifying module for database construction according to the present invention;
FIG. 2 is a flow chart of the measurement data attribute intelligent matching and identifying module for database construction according to the present invention;
FIG. 3 is a schematic diagram of the measurement data attribute intelligent matching and identifying module for database construction according to the present invention.
Detailed Description
In order to make the technical means, the creation characteristics, the achievement purposes and the effects of the invention easy to understand, the invention is further described with the specific embodiments.
In the description of the present invention, it should be noted that the terms "upper", "lower", "inner", "outer", "front", "rear", "both ends", "one end", "the other end", and the like indicate orientations or positional relationships based on those shown in the drawings, and are only for convenience of description and simplicity of description, but do not indicate or imply that the referred device or element must have a specific orientation, be constructed in a specific orientation, and be operated, and thus, should not be construed as limiting the present invention. Furthermore, the terms "first" and "second" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance.
In the description of the present invention, it is to be noted that, unless otherwise explicitly specified or limited, the terms "mounted," "disposed," "connected," and the like are to be construed broadly, such as "connected," which may be fixedly connected, detachably connected, or integrally connected; can be mechanically or electrically connected; they may be connected directly or indirectly through intervening media, or they may be interconnected between two elements. The specific meanings of the above terms in the present invention can be understood in specific cases to those skilled in the art.
As shown in FIGS. 1 to 3, a measurement data attribute intelligent matching and identifying module for database construction comprises data to be matched, a local database and a matching database. The data to be matched comprise attribute data to be matched, which consist of a plurality of attribute data, and the local database comprises a plurality of database attribute data. Similarity calculation and weighted value calculation are performed on the data to be matched, the weighted value is compared with a threshold value, the data to be matched are imported into the matching database, and the matching database is imported into the local database.
A plurality of database attribute data are acquired from the local database; the attribute data to be matched are added to an object queue and arranged in the object queue according to the attribute data; attribute data to be matched are selected from the object queue as a matching reference object; and the database attribute data corresponding to the attribute data to be matched are acquired from the local database according to the attribute data of the matching reference object.
The similarity of the data to be matched is weighted, and the weighted value is compared with a threshold value; if the weighted value is smaller than the threshold value, no match is made; if the weighted value is larger than the threshold value, a match is made. The weighted average value is obtained by multiplying each attribute similarity by its corresponding weight, summing the products to obtain an overall value, and then dividing the overall value by the total number of units.
The workflow for determining a match is as follows:
S1, first importing the data to be matched and determining a plurality of attributes of the data to be matched, so as to obtain the attribute data to be matched;
S2, determining a plurality of attributes of the local database to obtain the database attribute data, and extracting the database attribute data;
S3, judging whether the set of database attribute data and the set of attribute data to be matched are empty sets; if they are not empty sets, calculating an attribute similarity weighted value; if the attribute data set is an empty set, proceeding to the next step and judging whether the set of attribute data to be matched and the matching data set are empty sets;
S4, calculating a reference threshold value;
S5, judging whether the attribute similarity weighted value is larger than the threshold value; if so, a match is made and the data to be matched are imported into the matching database; otherwise no match is made and the data to be matched are stored in the doubt database.
In step S3, if the matching data set and the set of attribute data to be matched are empty sets, the matching ends; if they are not empty sets, the attribute data to be matched are imported into the local database.
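Steps S3 to S5 for a single record might be sketched as below. Several points are assumptions rather than statements from the source: records are modeled as dictionaries mapping attribute names to values, the similarity measure `exact_similarity` is a hypothetical placeholder, the threshold of step S4 is passed in because the source does not say how it is calculated, and the empty-set branch is simplified to "import the record directly if there is one, otherwise end".

```python
def exact_similarity(a, b):
    """Hypothetical similarity measure: 1.0 for equal values, otherwise 0.0."""
    return 1.0 if a == b else 0.0


def match_record(record, db_attrs, weights, threshold,
                 matching_db, local_db, doubt_db,
                 similarity=exact_similarity):
    """Run steps S3 to S5 for one record and return the outcome as a string.

    record and db_attrs map attribute names to values; steps S1 and S2 are
    assumed to have produced them already.
    """
    # S3: similarity is only computed when both attribute sets are non-empty.
    if record and db_attrs:
        shared = [name for name in record if name in db_attrs]
        total_weight = sum(weights.get(name, 1.0) for name in shared)
        if total_weight == 0:
            score = 0.0  # assumption: nothing comparable counts as dissimilar
        else:
            score = sum(
                similarity(record[name], db_attrs[name]) * weights.get(name, 1.0)
                for name in shared
            ) / total_weight
        # S5: compare with the threshold value supplied by S4.
        if score > threshold:
            matching_db.append(record)
            return "matched"
        doubt_db.append(record)
        return "doubt"
    # Empty-set branch of S3 (simplified reading of the text).
    if record:
        local_db.append(record)
        return "imported"
    return "ended"
```

With a record `{"name": "BM-01", "type": "benchmark"}`, database attributes `{"name": "BM-01", "type": "control point"}` and weights `{"name": 3.0, "type": 1.0}`, the weighted value is 3.0 / 4.0 = 0.75, so the record is matched at a threshold of 0.7 and sent to the doubt database at a threshold of 0.8.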
A measurement data attribute intelligent matching and identifying device for database construction comprises:
a data acquisition module, used for collecting the attribute data to be matched and the database attribute data;
a first processing module, used for judging whether the attribute data set is an empty set;
a second processing module, used for calculating a weighted value of the attribute similarity according to the attribute data to be matched and the database attribute data;
a third processing module, used for judging whether the matching data set is an empty set;
a controller, used for judging the relationship between the weighted value of the attribute similarity and the threshold value and determining a matching result;
a memory, used for storing data to be matched that were not matched;
and an importing module, used for importing the successfully matched attribute data to be matched into the matching database and finally importing the matching database into the local database.
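The final stage of the importing module, moving the contents of the matching database into the local database, could look like the following sketch using Python's standard `sqlite3` module. The table names, column names and sample rows are hypothetical. Skipping rows whose key already exists and clearing the matching table afterwards are assumptions; the source only states that the matching database is imported into the local database.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE local_db (name TEXT PRIMARY KEY, value TEXT)")
conn.execute("CREATE TABLE matching_db (name TEXT PRIMARY KEY, value TEXT)")
conn.execute("INSERT INTO local_db VALUES ('BM-01', 'benchmark')")
conn.executemany(
    "INSERT INTO matching_db VALUES (?, ?)",
    [("BM-01", "benchmark"), ("BM-02", "benchmark")],
)


def flush_matching_db(conn):
    """Copy matched rows into the local table, then empty the matching table."""
    with conn:  # commits on success, rolls back on error
        # Assumption: rows whose key already exists locally are skipped.
        conn.execute(
            "INSERT OR IGNORE INTO local_db SELECT name, value FROM matching_db"
        )
        conn.execute("DELETE FROM matching_db")


flush_matching_db(conn)
rows = conn.execute("SELECT name FROM local_db ORDER BY name").fetchall()
remaining = conn.execute("SELECT COUNT(*) FROM matching_db").fetchone()[0]
```

Running both statements in one transaction keeps the two tables consistent if the copy fails partway through.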
The measurement data attribute intelligent matching and identifying module for database construction works as follows. First, the data to be matched are imported and a plurality of their attributes are determined; the attribute data to be matched are added to an object queue and arranged in the queue according to the attribute data. Attribute data to be matched are then selected from the object queue as a matching reference object, and the database attribute data corresponding to the attribute data to be matched are acquired from the local database according to the attribute data of the matching reference object. Next, it is judged whether the set of database attribute data and the set of attribute data to be matched are empty sets. If the attribute data set is an empty set, the module proceeds to judge whether the set of attribute data to be matched and the matching data set are empty sets: if they are empty sets, the matching ends; if they are not empty sets, the attribute data to be matched are imported into the local database. If neither the set of database attribute data nor the set of attribute data to be matched is empty, the similarity of the data to be matched is calculated: each attribute similarity is multiplied by its corresponding weight, the products are summed to obtain an overall value, and the overall value is divided by the total number of units to obtain the weighted value. The weighted value is compared with the calculated threshold value. When the weighted value of the similarity is larger than the threshold value, a match is made and the data to be matched are imported into the matching database; otherwise no match is made and the data to be matched are stored in the doubt database as doubt data. The module thus achieves fast matching and identification between data, and the matching database and the local database are continuously expanded during matching and identification, which helps enrich the database.
The foregoing shows and describes the general principles and broad features of the present invention and advantages thereof. It will be understood by those skilled in the art that the present invention is not limited to the embodiments described above, which are described in the specification and illustrated only to illustrate the principle of the present invention, but that various changes and modifications may be made therein without departing from the spirit and scope of the present invention, which fall within the scope of the invention as claimed. The scope of the invention is defined by the appended claims and equivalents thereof.

Claims (6)

1. A measurement data attribute intelligent matching and identifying module for database construction, comprising data to be matched, a local database and a matching database, characterized in that: the data to be matched comprise attribute data to be matched, the attribute data to be matched are a plurality of attribute data, the local database comprises a plurality of database attribute data, similarity calculation and weighted value calculation are performed on the data to be matched, the weighted value is compared with a threshold value, the data to be matched are imported into the matching database, and the matching database is imported into the local database.
2. The measurement data attribute intelligent matching and identifying module for database construction according to claim 1, wherein: a plurality of database attribute data are acquired from the local database; the attribute data to be matched are added to an object queue and arranged in the object queue according to the attribute data; attribute data to be matched are selected from the object queue as a matching reference object; and the database attribute data corresponding to the attribute data to be matched are acquired from the local database according to the attribute data of the matching reference object.
3. The measurement data attribute intelligent matching and identifying module for database construction according to claim 1, wherein: the similarity of the data to be matched is weighted, and the weighted value is compared with a threshold value; if the weighted value is smaller than the threshold value, no match is made; if the weighted value is larger than the threshold value, a match is made; the weighted average value is obtained by multiplying each attribute similarity by its corresponding weight, summing the products to obtain an overall value, and dividing the overall value by the total number of units.
4. The measurement data attribute intelligent matching and identifying module for database construction according to claim 1, wherein the workflow for determining a match is as follows:
S1, first importing the data to be matched and determining a plurality of attributes of the data to be matched, so as to obtain the attribute data to be matched;
S2, determining a plurality of attributes of the local database to obtain the database attribute data, and extracting the database attribute data;
S3, judging whether the set of database attribute data and the set of attribute data to be matched are empty sets; if they are not empty sets, calculating an attribute similarity weighted value; if the attribute data set is an empty set, proceeding to the next step and judging whether the set of attribute data to be matched and the matching data set are empty sets;
S4, calculating a reference threshold value;
S5, judging whether the attribute similarity weighted value is larger than the threshold value; if so, a match is made and the data to be matched are imported into the matching database; otherwise no match is made and the data to be matched are stored in the doubt database.
5. The measurement data attribute intelligent matching and identifying module for database construction according to claim 4, wherein: in step S3, if the matching data set and the set of attribute data to be matched are empty sets, the matching ends; if they are not empty sets, the attribute data to be matched are imported into the local database.
6. The measurement data attribute intelligent matching and identifying device for database construction according to claim 1, comprising:
a data acquisition module, used for collecting the attribute data to be matched and the database attribute data;
a first processing module, used for judging whether the attribute data set is an empty set;
a second processing module, used for calculating a weighted value of the attribute similarity according to the attribute data to be matched and the database attribute data;
a third processing module, used for judging whether the matching data set is an empty set;
a controller, used for judging the relationship between the weighted value of the attribute similarity and the threshold value and determining a matching result;
a memory, used for storing data to be matched that were not matched;
and an importing module, used for importing the successfully matched attribute data to be matched into the matching database and finally importing the matching database into the local database.
Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011107677.7A CN112328633A (en) 2020-10-16 2020-10-16 Measurement data attribute intelligent matching and identifying module for database construction

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011107677.7A CN112328633A (en) 2020-10-16 2020-10-16 Measurement data attribute intelligent matching and identifying module for database construction

Publications (1)

Publication Number Publication Date
CN112328633A (en) 2021-02-05

Family

ID=74313902

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011107677.7A Pending CN112328633A (en) 2020-10-16 2020-10-16 Measurement data attribute intelligent matching and identifying module for database construction

Country Status (1)

Country Link
CN (1) CN112328633A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103279542A (en) * 2013-06-05 2013-09-04 中国电子科技集团公司第十五研究所 Data importing processing method and data processing device
CN104866625A (en) * 2015-06-15 2015-08-26 苏州大学张家港工业技术研究院 Method and system for entities matching
CN107766396A (en) * 2017-03-03 2018-03-06 平安医疗健康管理股份有限公司 resource data management method and device
CN109002528A (en) * 2018-07-12 2018-12-14 北京猫眼文化传媒有限公司 A kind of method, apparatus and storage medium of data importing
CN111651531A (en) * 2020-06-05 2020-09-11 深圳前海微众银行股份有限公司 Data import method, device, equipment and computer storage medium

Similar Documents

Publication Publication Date Title
CN109918532B (en) Image retrieval method, device, equipment and computer readable storage medium
US11043808B2 (en) Method for identifying pattern of load cycle
CN102201001B (en) Fast retrieval method based on inverted technology
CN111784093A (en) Enterprise rework auxiliary judgment method based on electric power big data analysis
CN112819299A (en) Differential K-means load clustering method based on center optimization
CN111709668A (en) Power grid equipment parameter risk identification method and device based on data mining technology
CN102169491A (en) Dynamic detection method for multi-data concentrated and repeated records
CN105868352A (en) High-dimensional data dimension ordering method based on dimension correlation analysis
CN103440292B (en) Multimedia information retrieval method and system based on bit vectors
CN115910271A (en) CNN and RF-based medicine storage management method
CN108549696B (en) Time series data similarity query method based on memory calculation
CN112328633A (en) Measurement data attribute intelligent matching and identifying module for database construction
CN108562867A (en) A kind of fingerprint positioning method and device based on cluster
CN116029613B (en) Novel power system index data processing method and platform
CN110377805B (en) Sensor resource recommendation method based on rapid branch allocation and sorting algorithm
CN108062376A (en) A kind of Time Series Compression storage method and system based on similar operating condition
CN112182026A (en) Power grid section data retrieval method considering manifold sorting algorithm
CN110083732A (en) Picture retrieval method, device and computer storage medium
CN115186138A (en) Comparison method and terminal for power distribution network data
CN109446489A (en) Legal information repetitive rate detection system and detection method
CN107389071A (en) A kind of improved indoor positioning KNN methods
Li et al. A novel approach to remote sensing image retrieval with multi-feature VP-tree indexing and online feature selection
CN108848138B (en) Environment monitoring system with good monitoring effect
CN115858895B (en) Multi-source heterogeneous data processing method and system for smart city
CN117131102B (en) Method for high-speed retrieval of similar time sequence data in mixed index database

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination