CN109766436A - A kind of matched method and apparatus of data element of the field and knowledge base of tables of data - Google Patents

A kind of matched method and apparatus of data element of the field and knowledge base of tables of data Download PDF

Info

Publication number
CN109766436A
CN109766436A CN201811472910.4A CN201811472910A CN109766436A CN 109766436 A CN109766436 A CN 109766436A CN 201811472910 A CN201811472910 A CN 201811472910A CN 109766436 A CN109766436 A CN 109766436A
Authority
CN
China
Prior art keywords
feature vector
field
data
data element
tables
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811472910.4A
Other languages
Chinese (zh)
Inventor
张毅然
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Mininglamp Software System Co ltd
Original Assignee
Beijing Mininglamp Software System Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Mininglamp Software System Co ltd filed Critical Beijing Mininglamp Software System Co ltd
Priority to CN201811472910.4A priority Critical patent/CN109766436A/en
Publication of CN109766436A publication Critical patent/CN109766436A/en
Pending legal-status Critical Current

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses the matched method and apparatus of the data element of a kind of field of tables of data and knowledge base, wherein the described method includes: carrying out word segmentation processing to the field in tables of data, constructs the feature vector of field;The feature vector library that knowledge base is searched according to the feature vector of the field carries out similarity mode with the feature vector of data element in described eigenvector library and determines matched data element when fitting through.The embodiment of the present invention can be applied to preprocessing process when tables of data access, improve governance efficiency and accuracy rate.

Description

A kind of matched method and apparatus of data element of the field and knowledge base of tables of data
Technical field
The present invention relates to database field, the matched method of data element of the field and knowledge base of espespecially a kind of tables of data and Device.
Background technique
Can be to the name of the data of industry using data element, type, value is standardized and is classified, and data element itself is also Data, during data are administered, if be able to achieve standardization and administer the efficiency and quality for directly determining that data are administered.It is right Tables of data in various sources, it is understood that there may be business meaning is consistent, but the situation that its field information is inconsistent.When taking an original When the tables of data of beginning, how precisely rapidly found from existing knowledge base with data element corresponding to the data sheet field, For realizing that fast and efficiently standardization is very crucial a part for administering.
Currently, becoming in the data integration of data element in various industries in the case where Data element standard is gradually established Increasingly important can be used for normative database, the data item in tables of data.At present the standard most cases of data element be with What document form occurred, the raising with current industry operation system to data dependence relation increasingly, the quality of data is related to industry Whether business system can normally run.The tables of data of data source is verified according to data element, can ensure the quality of data.Mark It is the premise verified that the data item of tables of data, which establishes mapping, in quasi- data element and data source.In the process for carrying out data processing In, the tables of data in all kinds of sources is had, each tables of data has different fields, needs to know based on the field in tables of data Not corresponding data element, the mode generallyd use at present manually is identified, although accuracy rate is relatively high, efficiency is but It is extremely inefficient.When data volume is smaller, still by the way of artificial, if data volume is big, artificial mode will become not Reality.
Summary of the invention
In order to solve the above-mentioned technical problems, the present invention provides a kind of fields of tables of data to match with the data element of knowledge base Method and apparatus, to improve the matched efficiency of data element of the field and knowledge base of tables of data.
In order to reach the object of the invention, the present invention provides the data element of a kind of field of tables of data and knowledge base is matched Method, comprising:
Word segmentation processing is carried out to the field in tables of data, constructs the feature vector of field;
The feature vector library that knowledge base is searched according to the feature vector of the field, with data element in described eigenvector library Feature vector carry out similarity mode determine matched data element when fitting through.
Optionally, the method also includes:
The information for obtaining data element in standard scale, according to the corresponding feature vector of the acquisition of information of the data element, by institute State the feature vector library of the feature vector deposit knowledge base of data element.
Optionally, the field in tables of data carries out word segmentation processing, constructs the feature vector of field, comprising:
Obtain the field in tables of data;
The field is segmented, term vector is generated;
The feature vector of each word is generated according to the term vector;
The feature vector of each word is synthesized, the feature vector of the field is generated.
Optionally, it is described according to the feature vector of the field search knowledge base feature vector library, with the feature to The feature vector for measuring data element in library carries out similarity mode, comprising:
Cosine is successively carried out according to the feature vector of each data element in the feature vector of the field and feature vector library Similarity calculation;
When the similarity score being calculated is greater than preset threshold, determination is fitted through.
Optionally, it is described according to the feature vector of the field search knowledge base feature vector library, with the feature to The feature vector for measuring data element in library carries out similarity mode, when fitting through, after determining matched data element, and the side Method further include:
Data item comprising matched data element is sorted from large to small according to similarity score, selects similarity score most Field in big data item and the tables of data is carried out to mark.
The present invention also provides the matched devices of the data element of a kind of field of tables of data and knowledge base, comprising:
Field processing module constructs the feature vector of field for carrying out word segmentation processing to the field in tables of data;
Matching module, for searching the feature vector library of knowledge base according to the feature vector of the field, with the feature The feature vector of data element carries out similarity mode and determines matched data element when fitting through in vector library.
Optionally, described device further include:
Feature vector library generation module, for obtaining the information of data element in standard scale, according to the information of the data element Corresponding feature vector is obtained, by the feature vector library of the feature vector deposit knowledge base of the data element.
Optionally, the field processing module, is used for:
Obtain the field in tables of data;
The field is segmented, term vector is generated;
The feature vector of each word is generated according to the term vector;
The feature vector of each word is synthesized, the feature vector of the field is generated.
Optionally, the matching module, is used for:
Cosine is successively carried out according to the feature vector of each data element in the feature vector of the field and feature vector library Similarity calculation;
When the similarity score being calculated is greater than preset threshold, determination is fitted through.
Optionally, the matching module, is also used to:
Data item comprising matched data element is sorted from large to small according to similarity score, selects similarity score most Field in big data item and the tables of data is carried out to mark.
The embodiment of the present invention includes: word segmentation processing is carried out to the field in tables of data, constructs the feature vector of field;According to The feature vector of the field searches the feature vector library of knowledge base, with the feature vector of data element in described eigenvector library into Row similarity mode determines matched data element when fitting through.When the embodiment of the present invention can be applied to tables of data access Preprocessing process, improve governance efficiency and accuracy rate.
Other features and advantages of the present invention will be illustrated in the following description, also, partly becomes from specification It obtains it is clear that understand through the implementation of the invention.The objectives and other advantages of the invention can be by specification, right Specifically noted structure is achieved and obtained in claim and attached drawing.
Detailed description of the invention
Attached drawing is used to provide to further understand technical solution of the present invention, and constitutes part of specification, with this The embodiment of application technical solution for explaining the present invention together, does not constitute the limitation to technical solution of the present invention.
Fig. 1 is the flow chart of the matched method of data element of the field and knowledge base of the tables of data of the embodiment of the present invention;
Fig. 2 is the flow chart of the step 101 of the embodiment of the present invention;
Fig. 3 is the schematic diagram of the feature vector of the building field of the embodiment of the present invention;
Fig. 4 is the flow chart for establishing feature vector library of the embodiment of the present invention;
Fig. 5 is the flow chart of the matched method of data element of the field and knowledge base of the tables of data of application example of the present invention;
Fig. 6 is the schematic diagram of the matched device of data element of the field and knowledge base of the tables of data of the embodiment of the present invention.
Specific embodiment
To make the objectives, technical solutions, and advantages of the present invention clearer, below in conjunction with attached drawing to the present invention Embodiment be described in detail.It should be noted that in the absence of conflict, in the embodiment and embodiment in the application Feature can mutual any combination.
Step shown in the flowchart of the accompanying drawings can be in a computer system such as a set of computer executable instructions It executes.Also, although logical order is shown in flow charts, and it in some cases, can be to be different from herein suitable Sequence executes shown or described step.
The embodiment of the present invention can be identified in knowledge base based on the accumulation of existing knowledge base based on the field in tables of data Data element.
As shown in Figure 1, the field of the tables of data of the embodiment of the present invention and the matched method of the data element of knowledge base, comprising:
Step 101, word segmentation processing is carried out to the field in tables of data, constructs the feature vector of field.
Tables of data herein refers to the tables of data that needs are standardized.
As shown in Fig. 2, step 101 may include:
Step 201, the field in tables of data is obtained.
Step 202, the field is segmented, generates term vector.
Wherein, each word m ∈ [1, M] generates term vector, constructs dictionary.M is the classification number of word.
Step 203, the feature vector of each word is generated according to the term vector.
For each word in field, feature vector is obtainedWherein, L is of word in field Number.
Step 204, the feature vector of each word is synthesized, generates the feature vector of the field.
For each field, feature vector V={ v is obtained1v2,...,vM}.It is shown in Figure 3.
In one embodiment, the method also includes: feature vector library is established, as shown in figure 4, including the following steps:
Step 301, the information of data element in standard scale is obtained.
Step 302, according to the corresponding feature vector of acquisition of information of the data element.
Wherein, the generating mode of the feature vector of data element is referred to generate the mode of the feature vector of field.
Step 303, by the feature vector library of the feature vector deposit knowledge base of the data element.
Step 102, the feature vector library that knowledge base is searched according to the feature vector of the field, with described eigenvector library The feature vector of middle data element carries out similarity mode and determines matched data element when fitting through.
In this step, successively according to the feature vector of each data element in the feature vector of the field and feature vector library Carry out cosine similarity calculating;When the similarity score being calculated is greater than preset threshold, determination is fitted through.
Wherein, similarity score score can use following formula:
Wherein, V={ v1v2,...,vMBe field feature vector,For data element feature to Amount.
In the embodiment of the present invention, data element is split in standardisation process, is clustered, the data sheet field of data source It can be achieved to compare the data element of field and knowledge base using cosine similarity, and then identify accurate data element, phase Than in traditional way, more efficiently, intelligence.
In one embodiment, after step 102, may also include that
Data item comprising matched data element is sorted from large to small according to similarity score, selects similarity score most Field in big data item and the tables of data is carried out to mark.
Wherein, data item includes data element, can also include determiner, for example, data item are as follows: sender _ name, In, name is data element, and sender is determiner.
When selecting the field in the maximum data item of similarity score and the tables of data to carry out to mark, phase can choose Recommended like the maximum one or more data item of degree score value, in the maximum one or more data item of similarity score again The field in suitable data item and the tables of data is selected to carry out to mark.
It can be known in continuous data management task by the field of tables of data in data source through the embodiment of the present invention Normal data member in other knowledge base is realized quick to mark in standardized data improvement.The embodiment of the present invention can be applied to Preprocessing process when tables of data accesses improves governance efficiency and accuracy rate.
It is illustrated below with an application example.
As shown in figure 5, including the following steps:
Step 401, the field of a tables of data is obtained.
Step 402, the feature vector of field is generated.
Wherein, the feature vector for generating field can refer to the description of Fig. 2.
Step 403, feature vector is obtained from feature vector library;
Step 404, judge whether it is matched complete, if not provided, execute step 405, if matching finish, execute step 408。
Wherein, after successively being matched feature vector all in feature vector library with the feature vector of field, then Think matched complete.
Step 405, the similarity of two feature vectors is calculated.
Step 406, judge whether similarity is greater than preset threshold, if so, executing step 407, executed if not, returning Step 403;
Step 407, the corresponding data item of the data element is recorded, returns to step 403;
Step 408, matching result is exported, wherein if it is small to obtain all similarity scores according to similarity calculation In being equal to preset threshold, then matching result is that it fails to match, which is classified as not match classification.If according to similarity meter Calculation, which obtains similarity score, to be existed greater than preset threshold, then matching result is successful match, which is classified as matching classification, And export the maximum one or more data item of similarity score.
As shown in fig. 6, the embodiment of the present invention also provides the matched dress of data element of the field and knowledge base of a kind of tables of data It sets, comprising:
Field processing module 51 constructs the feature vector of field for carrying out word segmentation processing to the field in tables of data;
Matching module 52, for searching the feature vector library of knowledge base according to the feature vector of the field, with the spy The feature vector of data element carries out similarity mode and determines matched data element when fitting through in sign vector library.
In one embodiment, described device further include:
Feature vector library generation module, for obtaining the information of data element in standard scale, according to the information of the data element Corresponding feature vector is obtained, by the feature vector library of the feature vector deposit knowledge base of the data element.
In one embodiment, the field processing module 51, is used for:
Obtain the field in tables of data;
The field is segmented, term vector is generated;
The feature vector of each word is generated according to the term vector;
The feature vector of each word is synthesized, the feature vector of the field is generated.
In one embodiment, the matching module 52, is used for:
Cosine is successively carried out according to the feature vector of each data element in the feature vector of the field and feature vector library Similarity calculation;
When the similarity score being calculated is greater than preset threshold, determination is fitted through.
In one embodiment, the matching module 52, is also used to:
Data item comprising matched data element is sorted from large to small according to similarity score, selects similarity score most Field in big data item and the tables of data is carried out to mark.
The embodiment of the present invention can be applied to preprocessing process when tables of data access, improve governance efficiency and accuracy rate.
The embodiment of the present invention also proposes the matched equipment of data element of the field and knowledge base of a kind of tables of data, including storage Device, processor and storage on a memory and the computer program that can run on a processor, the processor execution journey The matched method of the data element of field and knowledge base that above-mentioned tables of data is realized when sequence.
The embodiment of the present invention also proposes a kind of computer readable storage medium, is stored with computer executable instructions, described The matched method of the data element of field and knowledge base that above-mentioned tables of data is realized when computer executable instructions are executed by processor.
It will appreciated by the skilled person that whole or certain steps, system, dress in method disclosed hereinabove Functional module/unit in setting may be implemented as software, firmware, hardware and its combination appropriate.In hardware embodiment, Division between the functional module/unit referred in the above description not necessarily corresponds to the division of physical assemblies;For example, one Physical assemblies can have multiple functions or a function or step and can be executed by several physical assemblies cooperations.Certain groups Part or all components may be implemented as by processor, such as the software that digital signal processor or microprocessor execute, or by It is embodied as hardware, or is implemented as integrated circuit, such as specific integrated circuit.Such software can be distributed in computer-readable On medium, computer-readable medium may include computer storage medium (or non-transitory medium) and communication media (or temporarily Property medium).As known to a person of ordinary skill in the art, term computer storage medium is included in for storing information (such as Computer readable instructions, data structure, program module or other data) any method or technique in the volatibility implemented and non- Volatibility, removable and nonremovable medium.Computer storage medium include but is not limited to RAM, ROM, EEPROM, flash memory or its His memory technology, CD-ROM, digital versatile disc (DVD) or other optical disc storages, magnetic holder, tape, disk storage or other Magnetic memory apparatus or any other medium that can be used for storing desired information and can be accessed by a computer.This Outside, known to a person of ordinary skill in the art to be, communication media generally comprises computer readable instructions, data structure, program mould Other data in the modulated data signal of block or such as carrier wave or other transmission mechanisms etc, and may include any information Delivery media.

Claims (10)

1. a kind of field of tables of data and the matched method of the data element of knowledge base, comprising:
Word segmentation processing is carried out to the field in tables of data, constructs the feature vector of field;
The feature vector library that knowledge base is searched according to the feature vector of the field, the spy with data element in described eigenvector library Sign vector carries out similarity mode and determines matched data element when fitting through.
2. the method according to claim 1, wherein the method also includes:
The information for obtaining data element in standard scale, according to the corresponding feature vector of the acquisition of information of the data element, by the number According to the feature vector library of the feature vector deposit knowledge base of member.
3. the method according to claim 1, wherein the field in tables of data carries out word segmentation processing, structure Build the feature vector of field, comprising:
Obtain the field in tables of data;
The field is segmented, term vector is generated;
The feature vector of each word is generated according to the term vector;
The feature vector of each word is synthesized, the feature vector of the field is generated.
4. the method according to claim 1, wherein described search knowledge base according to the feature vector of the field Feature vector library, in described eigenvector library data element feature vector carry out similarity mode, comprising:
It is similar that the feature vector of each data element in feature vector library cosine is successively carried out according to the feature vector of the field Degree calculates;
When the similarity score being calculated is greater than preset threshold, determination is fitted through.
5. according to the method described in claim 4, it is characterized in that, described search knowledge base according to the feature vector of the field Feature vector library, in described eigenvector library data element feature vector carry out similarity mode, when fitting through, really After fixed matched data element, the method also includes:
Data item comprising matched data element is sorted from large to small according to similarity score, selects similarity score maximum Field in data item and the tables of data is carried out to mark.
6. a kind of field of tables of data and the matched device of the data element of knowledge base characterized by comprising
Field processing module constructs the feature vector of field for carrying out word segmentation processing to the field in tables of data;
Matching module, for searching the feature vector library of knowledge base according to the feature vector of the field, with described eigenvector The feature vector of data element carries out similarity mode and determines matched data element when fitting through in library.
7. device according to claim 6, which is characterized in that described device further include:
Feature vector library generation module, for obtaining the information of data element in standard scale, according to the acquisition of information of the data element Corresponding feature vector, by the feature vector library of the feature vector deposit knowledge base of the data element.
8. device according to claim 6, which is characterized in that the field processing module is used for:
Obtain the field in tables of data;
The field is segmented, term vector is generated;
The feature vector of each word is generated according to the term vector;
The feature vector of each word is synthesized, the feature vector of the field is generated.
9. device according to claim 6, which is characterized in that the matching module is used for:
It is similar that the feature vector of each data element in feature vector library cosine is successively carried out according to the feature vector of the field Degree calculates;
When the similarity score being calculated is greater than preset threshold, determination is fitted through.
10. device according to claim 9, which is characterized in that the matching module is also used to:
Data item comprising matched data element is sorted from large to small according to similarity score, selects similarity score maximum Field in data item and the tables of data is carried out to mark.
CN201811472910.4A 2018-12-04 2018-12-04 A kind of matched method and apparatus of data element of the field and knowledge base of tables of data Pending CN109766436A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811472910.4A CN109766436A (en) 2018-12-04 2018-12-04 A kind of matched method and apparatus of data element of the field and knowledge base of tables of data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811472910.4A CN109766436A (en) 2018-12-04 2018-12-04 A kind of matched method and apparatus of data element of the field and knowledge base of tables of data

Publications (1)

Publication Number Publication Date
CN109766436A true CN109766436A (en) 2019-05-17

Family

ID=66450485

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811472910.4A Pending CN109766436A (en) 2018-12-04 2018-12-04 A kind of matched method and apparatus of data element of the field and knowledge base of tables of data

Country Status (1)

Country Link
CN (1) CN109766436A (en)

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110196834A (en) * 2019-05-21 2019-09-03 厦门市美亚柏科信息股份有限公司 It is a kind of for data item, file, database to mark method and system
CN110287191A (en) * 2019-06-25 2019-09-27 北京明略软件系统有限公司 Data alignment method and device, storage medium, electronic device
CN110399403A (en) * 2019-07-24 2019-11-01 北京明略软件系统有限公司 Data processing method and device, storage medium, electronic device
CN110473067A (en) * 2019-08-14 2019-11-19 杭州品茗安控信息技术股份有限公司 The cost normative document of component determines method, apparatus, equipment and storage medium
CN110728142A (en) * 2019-09-09 2020-01-24 上海凯京信达科技集团有限公司 Method and device for identifying running files, computer storage medium and electronic equipment
CN110795482A (en) * 2019-10-16 2020-02-14 浙江大华技术股份有限公司 Data benchmarking method, device and storage device
CN110895533A (en) * 2019-11-29 2020-03-20 北京锐安科技有限公司 Form mapping method and device, computer equipment and storage medium
CN111639077A (en) * 2020-05-15 2020-09-08 杭州数梦工场科技有限公司 Data management method and device, electronic equipment and storage medium
CN112233746A (en) * 2020-11-05 2021-01-15 克拉玛依市中心医院 Method for automatically standardizing medical data
CN112287005A (en) * 2020-10-22 2021-01-29 北京锐安科技有限公司 Data processing method, device, server and medium
CN112597149A (en) * 2020-11-25 2021-04-02 贝壳技术有限公司 Data table similarity determination method and device
CN113836144A (en) * 2021-09-28 2021-12-24 厦门市美亚柏科信息股份有限公司 Method and device for recommending database standard table based on field
CN114385623A (en) * 2021-11-30 2022-04-22 北京达佳互联信息技术有限公司 Data table acquisition method, device, apparatus, storage medium, and program product
CN114461679A (en) * 2021-12-31 2022-05-10 浙江大华技术股份有限公司 Data benchmarking method, graph neural network model training method and computer equipment
CN114969001A (en) * 2022-05-24 2022-08-30 浪潮卓数大数据产业发展有限公司 Database metadata field matching method, device, equipment and medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090019171A1 (en) * 2007-07-09 2009-01-15 Jing Liu Method, device and system for determining mail class
CN107704625A (en) * 2017-10-30 2018-02-16 锐捷网络股份有限公司 Fields match method and apparatus
CN108256074A (en) * 2018-01-17 2018-07-06 链家网(北京)科技有限公司 Method, apparatus, electronic equipment and the storage medium of checking treatment
CN108595614A (en) * 2018-04-20 2018-09-28 成都智信电子技术有限公司 Tables of data mapping method applied to HIS systems
CN108595657A (en) * 2018-04-28 2018-09-28 成都智信电子技术有限公司 The tables of data classification map method and apparatus of HIS systems

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090019171A1 (en) * 2007-07-09 2009-01-15 Jing Liu Method, device and system for determining mail class
CN107704625A (en) * 2017-10-30 2018-02-16 锐捷网络股份有限公司 Fields match method and apparatus
CN108256074A (en) * 2018-01-17 2018-07-06 链家网(北京)科技有限公司 Method, apparatus, electronic equipment and the storage medium of checking treatment
CN108595614A (en) * 2018-04-20 2018-09-28 成都智信电子技术有限公司 Tables of data mapping method applied to HIS systems
CN108595657A (en) * 2018-04-28 2018-09-28 成都智信电子技术有限公司 The tables of data classification map method and apparatus of HIS systems

Cited By (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110196834B (en) * 2019-05-21 2022-04-29 厦门市美亚柏科信息股份有限公司 Benchmarking method and system for data items, files and databases
CN110196834A (en) * 2019-05-21 2019-09-03 厦门市美亚柏科信息股份有限公司 It is a kind of for data item, file, database to mark method and system
CN110287191B (en) * 2019-06-25 2021-07-27 北京明略软件系统有限公司 Data alignment method and device, storage medium and electronic device
CN110287191A (en) * 2019-06-25 2019-09-27 北京明略软件系统有限公司 Data alignment method and device, storage medium, electronic device
CN110399403A (en) * 2019-07-24 2019-11-01 北京明略软件系统有限公司 Data processing method and device, storage medium, electronic device
CN110473067A (en) * 2019-08-14 2019-11-19 杭州品茗安控信息技术股份有限公司 The cost normative document of component determines method, apparatus, equipment and storage medium
CN110728142A (en) * 2019-09-09 2020-01-24 上海凯京信达科技集团有限公司 Method and device for identifying running files, computer storage medium and electronic equipment
CN110728142B (en) * 2019-09-09 2023-12-22 上海斑马来拉物流科技有限公司 Method and device for identifying stream file, computer storage medium and electronic equipment
CN110795482A (en) * 2019-10-16 2020-02-14 浙江大华技术股份有限公司 Data benchmarking method, device and storage device
CN110795482B (en) * 2019-10-16 2022-11-22 浙江大华技术股份有限公司 Data benchmarking method, device and storage device
CN110895533A (en) * 2019-11-29 2020-03-20 北京锐安科技有限公司 Form mapping method and device, computer equipment and storage medium
CN110895533B (en) * 2019-11-29 2023-01-17 北京锐安科技有限公司 Form mapping method and device, computer equipment and storage medium
CN111639077B (en) * 2020-05-15 2024-03-22 杭州数梦工场科技有限公司 Data management method, device, electronic equipment and storage medium
CN111639077A (en) * 2020-05-15 2020-09-08 杭州数梦工场科技有限公司 Data management method and device, electronic equipment and storage medium
CN112287005B (en) * 2020-10-22 2024-03-22 北京锐安科技有限公司 Data processing method, device, server and medium
CN112287005A (en) * 2020-10-22 2021-01-29 北京锐安科技有限公司 Data processing method, device, server and medium
CN112233746B (en) * 2020-11-05 2023-09-01 克拉玛依市中心医院 Automatic medical data standardization method
CN112233746A (en) * 2020-11-05 2021-01-15 克拉玛依市中心医院 Method for automatically standardizing medical data
CN112597149B (en) * 2020-11-25 2022-11-22 贝壳技术有限公司 Data table similarity determination method and device
CN112597149A (en) * 2020-11-25 2021-04-02 贝壳技术有限公司 Data table similarity determination method and device
CN113836144A (en) * 2021-09-28 2021-12-24 厦门市美亚柏科信息股份有限公司 Method and device for recommending database standard table based on field
CN114385623A (en) * 2021-11-30 2022-04-22 北京达佳互联信息技术有限公司 Data table acquisition method, device, apparatus, storage medium, and program product
CN114461679A (en) * 2021-12-31 2022-05-10 浙江大华技术股份有限公司 Data benchmarking method, graph neural network model training method and computer equipment
CN114969001A (en) * 2022-05-24 2022-08-30 浪潮卓数大数据产业发展有限公司 Database metadata field matching method, device, equipment and medium
CN114969001B (en) * 2022-05-24 2024-05-10 浪潮卓数大数据产业发展有限公司 Database metadata field matching method, device, equipment and medium

Similar Documents

Publication Publication Date Title
CN109766436A (en) A kind of matched method and apparatus of data element of the field and knowledge base of tables of data
US20210182333A1 (en) Correlating image annotations with foreground features
US10657325B2 (en) Method for parsing query based on artificial intelligence and computer device
US11003896B2 (en) Entity recognition from an image
US9697233B2 (en) Image processing and matching
WO2019080411A1 (en) Electrical apparatus, facial image clustering search method, and computer readable storage medium
TW201931169A (en) Sample set processing method and apparatus, and sample querying method and apparatus
CN105493078B (en) Colored sketches picture search
CN110765882B (en) Video tag determination method, device, server and storage medium
US20210201090A1 (en) Method and apparatus for image processing and image classification
CN107832338B (en) Method and system for recognizing core product words
CN111291571A (en) Semantic error correction method, electronic device and storage medium
CN110147455A (en) A kind of face matching retrieval device and method
CN108268510B (en) Image annotation method and device
CN110807472B (en) Image recognition method and device, electronic equipment and storage medium
EP4209959A1 (en) Target identification method and apparatus, and electronic device
CN110209858B (en) Display picture determination, object search and display methods, devices, equipment and media
CN114238329A (en) Vector similarity calculation method, device, equipment and storage medium
CN109740674A (en) A kind of image processing method, device, equipment and storage medium
CN109635004B (en) Object description providing method, device and equipment of database
CN111144109A (en) Text similarity determination method and device
CN112784102B (en) Video retrieval method and device and electronic equipment
JP5520353B2 (en) BoF expression generation device and BoF expression generation method
CN112580620A (en) Sign picture processing method, device, equipment and medium
CN104850600B (en) A kind of method and apparatus for searching for the picture comprising face

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20190517