CN108256074A - Method, apparatus, electronic equipment and the storage medium of checking treatment - Google Patents

Method, apparatus, electronic equipment and the storage medium of checking treatment Download PDF

Info

Publication number
CN108256074A
CN108256074A CN201810045917.1A CN201810045917A CN108256074A CN 108256074 A CN108256074 A CN 108256074A CN 201810045917 A CN201810045917 A CN 201810045917A CN 108256074 A CN108256074 A CN 108256074A
Authority
CN
China
Prior art keywords
field
morpheme
standard
type
definition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810045917.1A
Other languages
Chinese (zh)
Other versions
CN108256074B (en
Inventor
崔金辉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Lianjia Beijing Technology Co Ltd
Original Assignee
Lianjia Beijing Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lianjia Beijing Technology Co Ltd filed Critical Lianjia Beijing Technology Co Ltd
Priority to CN201810045917.1A priority Critical patent/CN108256074B/en
Publication of CN108256074A publication Critical patent/CN108256074A/en
Application granted granted Critical
Publication of CN108256074B publication Critical patent/CN108256074B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/254Extract, transform and load [ETL] procedures, e.g. ETL data flows in data warehouses

Abstract

The embodiment of the present invention provides a kind of method, apparatus of checking treatment, electronic equipment and storage medium.The method includes obtaining the model of data warehouse to be verified, each model includes multiple field informations, and the field information includes field definition and field type;According to pre-stored data dictionary, the field information is verified, the data dictionary includes multiple standard terms, and each standard term includes standard definition and type;If the field definition is matched with standard definition and the field type is mismatched with type, the field type is revised as consistent with type.The method verifies the model of data warehouse according to standard term, when field definition is matched with standard definition and field type is mismatched with type, targetedly field type is revised as it is consistent with type, so as to obtain the model of the unification of standard.

Description

Method, apparatus, electronic equipment and the storage medium of checking treatment
Technical field
The present embodiments relate to database technical field, particularly a kind of method, apparatus of checking treatment, electronic equipment And storage medium.
Background technology
In order to preferably make a policy, data warehouse need to be created, providing data for decision-making by data warehouse supports.
Data warehouse includes a large amount of data, and data therein are that the data of multiple databases of original dispersion are taken out It takes, clear up, and process, summarize and arrange by system on this basis.
Since the data of data warehouse have multiple data sources (database), and for an identical field, each number It is likely to be different according to the name in source, if arranging into a data warehouse, there are a variety of inconsistent for an identical field Name, lead to the of low quality of data warehouse, subsequently be stored in data and read data when, cause using confusion.
It is main using desk checking by the way of in the prior art, make the Naming conventions, unanimously of each data.
Since everyone experience, ability are different, it may appear that omit, differentiate happening for mistake, lead to not realize Data Warehouse name is consistent.
Invention content
In view of the drawbacks of the prior art, the embodiment of the present invention provide a kind of method, apparatus of checking treatment, electronic equipment and Storage medium.
On the one hand, the embodiment of the present invention provides a kind of method of checking treatment, the method includes:
The model of data warehouse to be verified is obtained, each model includes multiple field informations, and the field information includes Field definition and field type;
According to pre-stored data dictionary, the field information is verified, the data dictionary includes multiple marks Mutatis mutandis language, each standard term include standard definition and type;
If the field definition is matched with standard definition and the field type is mismatched with type, by the word Segment type is revised as consistent with type.
On the other hand, the embodiment of the present invention provides a kind of device of checking treatment, and described device includes:
Acquisition module, for obtaining the model of data warehouse to be verified, each model includes multiple field informations, described Field information includes field definition and field type;
Correction verification module, for according to pre-stored data dictionary, being verified to the field information, the data word Allusion quotation includes multiple standard terms, and each standard term includes standard definition and type;
Modified module, if match for the definition of the field definition and standard and the field type and type not Match, be then revised as the field type consistent with type.
On the other hand, the embodiment of the present invention also provides a kind of electronic equipment, including memory, processor, bus and deposits The computer program that can be run on a memory and on a processor is stored up, the processor is realized when performing described program with top The step of method.
On the other hand, the embodiment of the present invention also provides a kind of storage medium, is stored thereon with computer program, described program The step of as above method is realized when being executed by processor.
As shown from the above technical solution, it the method, apparatus of checking treatment provided in an embodiment of the present invention, electronic equipment and deposits Storage media, the method verify the model of data warehouse according to standard term, are matched in field definition with standard definition And field type and type be when mismatching, targetedly field type is revised as it is consistent with type, so as to To the model of the unification of standard.
Description of the drawings
Fig. 1 is a kind of flow diagram of the method for checking treatment provided in an embodiment of the present invention;
Fig. 2 is the overall structure diagram of the device of checking treatment that further embodiment of this invention provides;
Fig. 3 is the flow diagram of the method for checking treatment that further embodiment of this invention provides;
Fig. 4 is the initial phase operational flowchart that further embodiment of this invention provides;
Fig. 5 is the certain embodiments figure of verification operation that further embodiment of this invention provides;
Fig. 6 is the certain embodiments figure of verification operation that further embodiment of this invention provides;
Fig. 7 is the flow diagram of verification operation that further embodiment of this invention provides;
Fig. 8 is the structure diagram of the device of a kind of checking treatment that further embodiment of this invention provides;
Fig. 9 is the structure diagram of a kind of electronic equipment that further embodiment of this invention provides.
Specific embodiment
Purpose, technical scheme and advantage to make the embodiment of the present invention are clearer, below in conjunction with the embodiment of the present invention In attached drawing, the technical solution in the embodiment of the present invention is explicitly described, it is clear that described embodiment be the present invention Embodiment part of the embodiment, instead of all the embodiments.
Fig. 1 shows a kind of flow diagram of the method for checking treatment provided in an embodiment of the present invention.
As shown in Figure 1, method provided in an embodiment of the present invention specifically includes following steps:
Step 11, the model for obtaining data warehouse to be verified, each model include multiple field informations, the field letter Breath includes field definition and field type;
Optionally, the structure of a data warehouse can be divided into two steps:First, the model of design data storage, secondly by number According to the corresponding model (tables of data) of write-in.
After the completion of modelling, using method provided in an embodiment of the present invention, which is verified.
Optionally, the model that at least one design is completed is uploaded to the device of checking treatment, a model can be regarded as One tables of data, tables of data include multirow data, and corresponding field information is included per data line.
Optionally, the field information includes field definition and field type, and field definition is retouching to the meaning of field It states, it may include field name and field description.Field type is the description to the type of field, such as field is double or int, Wherein, double is double-precision floating points, that is, field can be the number for having decimal point, and int represents integer, that is, field It is integer.
Step 12, according to pre-stored data dictionary, the field information is verified, the data dictionary includes Multiple standard terms, each standard term include standard definition and type;
Optionally, data dictionary is pre-created, data dictionary includes multiple standard terms, and each standard term is to obtain one Accreditation is caused, it can be as the standard works of unified standard.
Optionally, standard term is from industry dialect dictionary, the data of the data warehouse of history, wiki (Wikis hundred Section), various professional books collect what is obtained in data.
Optionally, standard term includes standard and defines and type, and it is standard to a field that the standard, which defines, Description, type is to represent the type that the field can use.
Such as standard is defined as the amount of money, the type double for the amount of money being pre-created determines that type is After double, the amount of money is then without using int as type.
Optionally, it for the field definition of model, inquires in the standard term of data dictionary with the presence or absence of the word with model The matched standard definition of Duan Dingyi.
If the standard of field definition and standard term defines successful match, for the field type of model, inquiry mark Corresponding type is defined in mutatis mutandis language with the matched standard of the field definition of model.
If the standard definition of field definition and standard term match unsuccessful, output verification result is fails.
If step 13, the field definition are matched with standard definition and the field type and type mismatch, The field type is revised as consistent with type.
If the field definition of model is consistent with standard definition, and field type and type are inconsistent, then to model Remarks are carried out, the content of remarks is:Field type is inconsistent with type, output verification as a result, check results include it is described Remarks.
The embodiment of the present invention adds remarks to provide amending advice during being verified, for subsequently being tied according to verification Fruit performs modification, field type is revised as consistent with type.
If the field definition of model is consistent with standard definition, and field type is consistent with type, then illustrates the mould Type has met specification, and the check results of the field information are successfully.
If it is understood that the method that each data warehouse when modeling, is carried out the embodiment of the present invention, root It is verified according to data dictionary, obtains consistent, standard tables of data, then, then can be straight subsequently when data are filled It connects in filling to the tables of data of standard.
The method of checking treatment provided in this embodiment verifies the model of data warehouse according to standard term, When field definition is matched with standard definition and field type is mismatched with type, targetedly field type is revised as It is consistent with type, so as to obtain the model of the unification of standard.
On the basis of above-described embodiment, the method for the checking treatment that further embodiment of this invention provides, the field is determined Justice includes field name and field description, and the standard definition includes standard name and standard description, correspondingly, according to pre-stored Data dictionary, the step of being verified to field information be specially:
If the field name is matched with standard name, verify whether the field description describes unanimously, and verify with standard Whether field type is consistent with type;
Or;
If the field description and standard profile matching, whether with standard name consistent, and verify if verifying the field name Whether field type is consistent with type.
Optionally, the content of a model includes as shown in table 1:
Table 1
Field name Field description Field type
Paidup_perf_amount Paid achievement Double
…… …… ……
Optionally, if the field name and standard name successful match, for other fields of the field information, (field is retouched State and field type) verified, if with corresponding to the standard name of successful match standard description and type it is consistent.
It is if consistent, then it represents that the field information and standard term are completely the same, and check results are successfully.
If inconsistent, the content of remarks is:The field description and the field type and standard term are inconsistent, with The field description and the field type are revised as subsequently consistent with standard term.
Similarly, if the field description and standard profile matching, for other field (field names of the field information And field type) verified, if it is consistent with the standard name corresponding to the criteria field of successful match and type.
Other steps of the present embodiment are similar to previous embodiment step, and the present embodiment repeats no more.
The method of checking treatment provided in this embodiment, can by being verified respectively for field name and field description It is accurately obtained check results.
On the basis of above-described embodiment, the method for the checking treatment that further embodiment of this invention provides, if field definition It is matched with standard definition and field type is mismatched with type, then field type is revised as to the step consistent with type After rapid, the method includes:
If field definition is mismatched with standard definition, data prediction is carried out to each field information, is obtained multiple Morpheme;
Pre-stored regulation management library is obtained, the regulation management library includes multiple Substitution Rules, each Substitution Rules Including qualifier and classificating word;
If morpheme is matched with qualifier, the classificating word of the morpheme is judged whether;
If it does not exist, then the morpheme is replaced with into the morpheme and corresponding classificating word.
Optionally, if the standard definition of field definition and standard term match it is unsuccessful, represent to use data dictionary into Row verification failure, performs the embodiment of the present invention, continues to verify using regulation management library.
Optionally, morpheme is the minimum word for having specific meanings, can not be split again, such as:Day, the moon, income, city Deng.
For example, sequence " traveller's end achievement " is split into " traveller ", " end " and " achievement " these three morphemes.
Optionally, word segmentation processing can be carried out mode to field information according to prior art, obtains morpheme, start using rule Library is then managed to verify each morpheme.
Optionally, the regulation management library includes multiple Substitution Rules, and each Substitution Rules include qualifier and classificating word, Between qualifier and classificating word it is the relationship of attribute and head, that is, the relationship modified and be modified, qualifier is conduct The morpheme for being used to describe classificating word of attribute, classificating word is the morpheme of the head as qualifier.
For example, " the achievement amount of money " the two morphemes, " amount of money " is head, and expression " the achievement amount of money " belongs to money, and this is a kind of Not, it is the numerical value of a money, and " achievement " represents that this numerical value is the numerical value of achievement rather than other numerical value.
Optionally, the effect of Substitution Rules is to include qualifier in the morpheme for determining a field information fractionation and do not wrap When including classificating word, qualifier is replaced with into qualifier and classificating word, is equivalent to and only has qualifier not classify in field information During word, increase classificating word for qualifier.
For each morpheme of field information, the qualifier of Substitution Rules is searched, if in a morpheme and Substitution Rules Qualifier matching is consistent, judges that the morpheme whether there is the classificating word of the morpheme in this field information.
If the classificating word of the qualifier is not present in this field information, remarks are added, by the morpheme in model Two morphemes are replaced with, that is, qualifier is replaced with into qualifier and the corresponding classificating word of qualifier.
Such as when model includes " achievement " this morpheme and does not include " amount of money ", " achievement " is turned according to Substitution Rules It is changed to " the achievement amount of money ".
It is write a Chinese character in simplified form it is understood that may have been used when designing a model, only qualifier, classificating word is omitted, for this The nonstandard literary style of kind adds pre-set classificating word by Substitution Rules for qualifier.
If there are the classificating words of the qualifier in this field information, then it represents that this field information specification, verification knot Fruit is successfully.
Other steps of the present embodiment are similar to previous embodiment step, and the present embodiment repeats no more.
The method of checking treatment provided in this embodiment, if field definition is mismatched with standard definition, using rule pipe Li Ku is verified, if morpheme matches with qualifier and there is no during classificating word, the morpheme is replaced with the morpheme and right The classificating word answered so that nonstandard write a Chinese character in simplified form is revised as specification.
On the basis of above-described embodiment, the method for the checking treatment that further embodiment of this invention provides, if field is determined Justice and standard define the step of mismatching, then carrying out data prediction to each field information, obtain multiple morphemes:
Each field information is parsed, generates corresponding json character strings;
For every json character strings, word segmentation processing is carried out, obtains multiple morphemes.
Optionally, using resolver to being parsed to each field information.
Optionally, a json character string is considered as a sequence, participle component is called to carry out word fractionation to the sequence, Obtain morpheme.
Optionally, participle component is a power function, and effect is that a sequence is cut into individual word, i.e. word Element after obtaining morpheme, is verified using regulation management library.
Other steps of the present embodiment are similar to previous embodiment step, and the present embodiment repeats no more.
The method of checking treatment provided in this embodiment, parses field information, generates corresponding json character strings, And for every json character strings, word segmentation processing is carried out, morpheme is obtained, for subsequently realizing the verification of morpheme rank.
On the basis of above-described embodiment, the method for the checking treatment that further embodiment of this invention provides, the morpheme packet Chinese morpheme and/or English morpheme are included, correspondingly, if morpheme is matched with qualifier, judges whether point of the morpheme After the step of class word, the method includes:
If morpheme and modification word mismatch, obtain pre-stored business dictionary, the business dictionary includes multiple Business term, each business term include Chinese term and English term;
If Chinese morpheme English term corresponding with being not present in Chinese term matching and the morpheme, described in remarks Chinese morpheme, for increasing the English term of the Chinese morpheme;
If English morpheme Chinese term corresponding with English term is not present in English term matching and the morpheme, English morpheme described in remarks, for increasing the Chinese term of the English morpheme.
If morpheme and modification word mismatch, are represented the verification failure of regulation management library, then continue to be carried out using business dictionary Verification.
Optionally, each business term includes two morphemes, Chinese term and English term.Such as (achievement, perf), (broker, agent) etc..
For example, the morpheme includes achievement this morpheme, with the Chinese term in business term (achievement, perf) Match, and do not include English term perf, then add remarks:The English term perf of addition.Similarly, if model includes perf, and Do not include achievement, then add remarks:Add achievement.
It is write a Chinese character in simplified form it is understood that may have been used when designing a model, only Chinese is without English or only English does not have Have Chinese, for this nonstandard literary style, pass through business term and carry out completion so that subsequent data either Chinese or English can correctly identify filling.
Optionally, if the morpheme only includes Chinese morpheme, the matching of Chinese morpheme is carried out, if morpheme only includes English morpheme then carries out the matching of English morpheme, if the morpheme includes Chinese morpheme and English morpheme, matched sequence It is not limited, can first carry out the matching of Chinese word element, can also first carry out the matching of English words element.
If Chinese morpheme is matched with Chinese term, but English morpheme is mismatched with English term, then adds remarks explanation Such case, output verification result.
Similarly, if English morpheme is matched with English term, but Chinese morpheme is mismatched with Chinese term, need to be added standby Note explanation.
Other steps of the present embodiment are similar to previous embodiment step, and the present embodiment repeats no more.
The method of checking treatment provided in this embodiment, if morpheme and modification word mismatch, continue using Chinese term It is verified with English term, it can be with completion Chinese morpheme and English morpheme.
On the basis of above-described embodiment, the method for the checking treatment that further embodiment of this invention provides, the morpheme packet Include Chinese morpheme and/or English morpheme, correspondingly, the step of morpheme is replaced with into the morpheme and corresponding classificating word it Afterwards, the method includes:
Pre-stored business dictionary is obtained, the business dictionary includes multiple business terms, and each business term includes Chinese term and English term;
If Chinese morpheme English term corresponding with being not present in Chinese term matching and the morpheme, described in remarks Chinese morpheme, for increasing the English term of the Chinese morpheme;
If English morpheme Chinese term corresponding with English term is not present in English term matching and the morpheme, English morpheme described in remarks, for increasing the Chinese term of the English morpheme.
Regulation management library is being used to verify, after being replaced the morpheme write a Chinese character in simplified form, is continuing to be verified using business dictionary.
Optionally, each business term includes two morphemes, Chinese term and English term.Such as (achievement, perf), (broker, agent) etc..
For example, original morpheme includes this morpheme of achievement, after being verified using regulation management library, the achievement amount of money is replaced with, In embodiments of the present invention, achievement is matched with the Chinese term in business term (achievement, perf), and does not include English term Perf then adds remarks:The English term perf of addition, while the amount of money and the Chinese term in business term (amount of money, amount) Matching, and do not include English term amount, then add remarks:The English term amount of addition.
If after traversing business term, without matched morpheme, corresponding remarks, output verification result are added.
It is write a Chinese character in simplified form it is understood that may have been used when designing a model, only Chinese is without English or only English does not have Have Chinese, for this nonstandard literary style, pass through business term and carry out completion so that subsequent data either Chinese or English can correctly identify filling.
Optionally, if Chinese morpheme is matched with Chinese term, but English morpheme is mismatched with English term, then is added standby Note illustrates such case, output verification result.Similarly, if English morpheme is matched with English term, but Chinese morpheme is in Literary term mismatches, then adds remarks.
Other steps of the present embodiment are similar to previous embodiment step, and the present embodiment repeats no more.
The method of checking treatment provided in this embodiment after regulation management library is used to verify, continues using Chinese term It is verified with English term, it can be with completion Chinese morpheme and English morpheme.
On the basis of above-described embodiment, the method for the checking treatment that further embodiment of this invention provides, if field definition It is matched with standard definition and field type is mismatched with type, then field type is revised as to the step consistent with type After rapid, the method includes:
If field definition is mismatched with standard definition, the field definition is trained;
If meeting preset condition, defined the field definition as standard.
Optionally, if field definition is mismatched with standard definition, it is understood that there may be two kinds of situations:One kind is the field definition It is nonstandard, another situation be unified standard standard works, but data dictionary not by the field definition write-in standard determine Justice.
Optionally, the field definition is trained and refers to targetedly carry out again the field definition Match, it is determined whether standard can be used as to define.
Optionally, it according to newest data, is matched with the field definition, if being matched in the presence of with the field definition Unified standard standard works, then defined the field definition as standard, and by the field type of the field definition As type, new standard term is obtained.
It is understood that newly-increased standard term classification is put into data dictionary, a closed loop verification is ultimately formed Management gradually to extend the range of verification, reduces erroneous judgement.
Other steps of the present embodiment are similar to previous embodiment step, and the present embodiment repeats no more.
The method of checking treatment provided in this embodiment, if field definition is mismatched with standard definition, to the word Duan Dingyi is trained, and after determining that the field definition can be used as standard definition, is stored in data dictionary, to extend verification Range.
In order to more fully understand the technology contents of the present invention, on the basis of above-described embodiment, the present embodiment is described in detail The method of the checking treatment of offer.
The method of the embodiment of the present invention is difficult to land implementation primarily directed to the data standard that current data warehouse faces Problem, it is proposed that following solution:
By using the matched mode of participle technique+experience, learning materials are obtained from historical summary and data model, Empirical learning is carried out, common standard vocabulary and business terms are extracted and corrected, according to different classes of standard term Or specification is deposited into respectively in corresponding dictionary manager.
In modeling developing later, gone using the standard term (specification) for having summarized next as " experience and standard " Detection and correction model below solve the problems, such as model data naming standard, and pass through model checking+data word with this The closed loop management mode of allusion quotation accumulation+model checking carrys out continuous adjusting and optimizing, makes constantly improve, reaches automated maintenance data standard The purpose of management improves the quality of data of data warehouse, reduces operation cost.
The embodiment of the present invention is named available for uniform data, data definition, the normalization constraints of data type, for solving Term is chaotic in modeling process or does not know situation about how to name.
Normalized objects described in the embodiment of the present invention refer to the data used in the range of engineering project, it can be understood as need Carry out the target object of data normalization.
Fig. 2 is the overall structure diagram of the device of checking treatment that further embodiment of this invention provides.
As shown in Fig. 2, the overall structure of the device of checking treatment is divided into three parts:
Data storage layer for storing data standard specification dictionary, includes regulation management library, data dictionary, business dictionary; Criteria check layer:Standard dictionary is verified and generated for performing standard object, by vocabulary training system, segments component, Three components of standard calibrator are formed;Data interface tier:For receiving and parsing through standard object model document, verification is externally provided Report, respectively by model solution parser, verification Report Builder composition.
Mainly verification is standardized from the following aspects:
1. morpheme:Least unit word with certain specific meanings may be generally understood to using participle component to mark Standardization object carries out the word after participle fractionation, and in normalized work, the first step is exactly to need to decompose existing term Into least unit meaning, standard word confirmation is then carried out, such as:Day, the moon, income, city etc., belong to business dictionary scope.
2. standard word:Standard word is the least unit word having in lexical meaning, is basic group of business term Into element.Standard word is write a Chinese character in simplified form mark by Chinese and English and is formed together, each standard word can there are one English letters Matching, such as (achievement, perf), (broker, agent) etc. is write, belongs to business dictionary scope.
3. classificating word:Classificating word identifies the standard word of entity or entity attribute type, can therefrom deduce internal number According to the standard word of Value Types.Such as the amount of money, quantity, PV, UV etc., belong to business dictionary scope.
4. canonical domain:Encoding domain, number domain, group domain etc. are splitted data into, defines data type (character string, the number of standard According to date etc.) and length, with explicit data range.Such as (amount of money, amount, double), (quantity, num, int) etc., belong to In business dictionary scope.
5. standard term:Refer to all normal terms generated using standard word according to naming rule (qualifier+classificating word) Mesh name, including physical name, entity attributes name, table name, row name, domain name etc., such as (pt, time subregion, string), (house_id, the source of houses ID, int) etc., belongs to data dictionary scope.
6. rule conversion:Refer to standard word, classificating word, some of canonical domain merge conversion operations, using qualifier+point The mode of the class word vocabulary term higher to some frequency of use splices, and is write a Chinese character in simplified form when there is title in standard target object When, full name conversion, and the information such as subsidiary corresponding English mark can be carried out according to transformation rule, such as:(the achievement amount of money, Perf_amount, double), when occurring " achievement " word in standard object, rule management can be incited somebody to action according to transformation rule " achievement " is converted to " the achievement amount of money ", belongs to regulation management library scope.
The present embodiments relate to two large divisions:
First part:Initial phase.Correct in order to ensure data normalization judges verification accuracy rate, needs into line number It is main comprising data source is collected according to standard initialization, determine the work such as data dictionary, regulation management library and business dictionary.This portion The division of labor is made to implement to collectively constitute with manual intervention automatically by software.
Second part:Data normalization checking stage.After data standard initial work is carried out, data are proceeded by The verification of standard, and examining report is generated, for being standardized modification before model is reached the standard grade;It is right and after model is reached the standard grade The model progress vocabulary parsing newly increased, additional new standard term, the verification of formation standard object->Increase standard term->Mark The closed loop management of quasi- object verification.
Fig. 3 is the flow diagram of the method for checking treatment that further embodiment of this invention provides.
As shown in figure 3, the embodiment of the present invention specifically includes multiple steps:It is model analyzing, word segmentation processing, model checking, defeated Go out verification report, model modification, submission and analytic modell analytical model, model training and model to reach the standard grade.
It can be regarded as including 3 steps:Initialization, verification and subsequent step.
Fig. 4 is the initial phase operational flowchart that further embodiment of this invention provides.
As shown in figure 4, step 1:Selected normalized objects range, usually from industry dialect dictionary, existing number According to depot data, wiki, various professional books are collected in data.
It is collected into after material, can be handled by two ways:It directly carries out participle to text class data to check the mark, word Frequency sort method, then according to word frequency sequence from high to low into filtering;For existing database data information, according to field English Literary fame and Chinese name split recurrence combination, and sequence, then according to pairing frequency of occurrence, existing Naming conventions are filtered successively Modification.Different data dictionaries, regulation management library and business dictionary are finally put into different words according to artificial filter later.
Wherein, in experience matching process is carried out to existing model metadata information, as shown in figure 3,
It is the field attribute information for obtaining all models, carries out word and be split as morpheme one by one;Then recurrence is to each Three parts of field type of English code and Chinese name and present field are spliced into a character string, and be spliced into all Character string make word frequency statistics, count every group of more data of correspondence occurrence number and empirically match standard;Finally In three storage information banks (data dictionary, regulation management library and business dictionary) of typing after hand inspection.
Step 2:It, can be with on-line running criteria check program after completing initialization step.
Fig. 5 is the certain embodiments figure of verification operation that further embodiment of this invention provides.
Fig. 6 is the certain embodiments figure of verification operation that further embodiment of this invention provides.
As shown in Figure 5 and Figure 6, when carrying out data standard verification to a new design model, its model is uploaded first Information includes the information such as table name, field name, field type, field description.
Then【Model solution parser】The model of upload can be parsed, each row of data is parsed into { origincode (words Section name), name (field description), type (field type) } structure json string structures;
After the completion of model all parsing, json information and primary information can be sent to【Standard calibrator】Carry out school It tests;
When【Standard calibrator】After receiving json strings, it can call line by line【Segment component】Word morpheme fractionation is carried out, is torn open / after, start to carry out verifying work to each row of data.
Fig. 7 is the flow diagram of verification operation that further embodiment of this invention provides.
As shown in fig. 7, entire checking procedure meeting lease making is crossed and is judged three times:
1. it calls first【Data dictionary】In data primary information is field code/name matching, inquiry whether have Field name or field description can match, if one party successful match, obtain information to other current to be verified Data do criteria check, if not meeting specification, use【Data dictionary】In field do remarks mark;If all no With success, then into next judgement.
2. passing through【Data dictionary】Afterwards, second be judged as inquire json string in morpheme whether【Regulation management library】 In have element that can correspond to, then take out this rule and judgement filling carried out to json data to be verified, if not finding element It is corresponding, then into next judgement.
3. passing through【Regulation management library】After judgement, it can enter【Business dictionary】In to j son go here and there in morpheme carry out Matching is searched, if there is matching word, then matching word is obtained and specification validation is carried out to element in json data, to being unsatisfactory for Place do standard filling remarks, if it is not, terminate next round verification, one's own profession data check result is kept in.When verified All fields of model all verify when finishing, and obtain all temporary verification data results and are sent to【Verify Report Builder】It generates standby Note verification report and suggestion for revision, return to submitter.Epicycle model checking terminates.
Step 3:It completes on line after model checking, model is named according to verification report, after structure is made an amendment, Ke Yijin Row is submitted again, is clicked " model submission ", and "current" model can enter vocabulary training, word extraction, and final repository safeguards people Member can be put into the standard term classified vocabulary of newly-increased extraction in three different dictionary libraries, ultimately form a closed loop verification Management, to be gradually completing coverage area.
There are following several innovative points in present patent application:
1. the data standard checking process and side of higher the degree of automation are realized by less manual operation+software program Method.The identification of standard object range is carried out using multi-data source, a variety of initialization modes at the beginning of startup;And after online implementing The Closed loop operation mode for forming model checking+standard term vocabulary addition+model checking is constantly extended normal dictionary library, So that the matching cover degree and verification degree of the method for calibration and check system are continuously improved.
2. the present invention data standard verification mode have it is comprehensive, cover physical name in data warehouse, entity comprehensively The numerical nomenclatures such as attribute-name, table name, row name, index name, data definition, the verification of data type.Respectively to normalized objects From morpheme, standard word, standard term, classificating word, canonical domain, transformation rule is many-sided, uses the standard of many levels granularity Condition is verified, to meet maximum matching degree and verification accuracy rate.
3. the data standard verification mode of the present invention has stronger flexibility, according to the different situations of data normalization, It is abstracted as 3 kinds of classifications:Data dictionary, regulation management library, business dictionary library.Data dictionary is used to verify the standard of fixed field name Domain verifies, the title of strict difinition data field, type, description and data area;Regulation management library is used for standard term The conversion of vocabulary to there are successful match, and describes nonstandard vocabulary and is directly changed into standard term;Business dictionary is used for Standardization name advisory opinion is provided the morpheme split by participle.User can be building up to according to different standard criterions In different dictionaries, there is larger independence and flexibility.
The method of checking treatment provided in this embodiment, the Standardization Practice of the construction of the data warehouse for enterprise provide Method, flow and the embodiment of science;Improve the construction quality of data warehouse, it is ensured that the correctness of data maintains enterprise The consistency of model;And data warehouse development and production and the efficiency of management are improved, reduce the resource that repetition ineffective labor is brought, The waste of manpower;Reduce the operation cost that Data Warehouse for Enterprises is safeguarded.
Fig. 8 is the structure diagram of the device of a kind of checking treatment that further embodiment of this invention provides.
Reference Fig. 8, on the basis of above-described embodiment, the device of checking treatment provided in this embodiment, described device packet Acquisition module 81, correction verification module 82 and modified module 83 are included, wherein:
Acquisition module 81 is used to obtain the model of data warehouse to be verified, and each model includes multiple field informations, institute It states field information and includes field definition and field type;Correction verification module 82 is used for according to pre-stored data dictionary, to described Field information is verified, and the data dictionary includes multiple standard terms, and each standard term includes standard definition and standard Type;If modified module 83 matched for the definition of the field definition and standard and the field type and type not Match, be then revised as the field type consistent with type.
Optionally, the structure of a data warehouse can be divided into two steps:First, the model of design data storage, secondly by number According to the corresponding model (tables of data) of write-in.
After the completion of modelling, using device provided in an embodiment of the present invention, which is verified.
The device of checking treatment is the computer for carrying iBATIS frameworks.IBATIS is the open source code item based on Java Mesh can automate realization Object Relation Mapping.
Optionally, the model that at least one design is completed is uploaded to the device of checking treatment, Mei Yimo by acquisition module 81 Type includes multirow data.
Optionally, a model can be regarded as the tables of data of a carrying gauge outfit, and tables of data includes multirow data, per a line Data include corresponding field information.
Optionally, the field information includes field definition and field type, and field definition is retouching to the meaning of field It states, it may include field name and field description.Field type is the description to the type of field, such as field is double or int, Wherein, double is double-precision floating points, that is, field can be the number for having decimal point, and int represents integer, that is, field It is integer.
Correction verification module 82 verifies the field information according to pre-stored data dictionary.
Optionally, data dictionary is pre-created, data dictionary includes multiple standard terms, and each standard term is to obtain one Accreditation is caused, it can be as the standard works of unified standard.
Optionally, standard term is from industry dialect dictionary, the data of the data warehouse of history, wiki (Wikis hundred Section), various professional books collect what is obtained in data.
Optionally, standard term includes standard and defines and type, and it is standard to a field that the standard, which defines, Description, type is to represent the type that the field can use.
Such as standard is defined as the amount of money, the type double for the amount of money being pre-created determines that type is After double, the amount of money is then without using int as type.
Optionally, it for the field definition of model, inquires in the standard term of data dictionary with the presence or absence of the word with model The matched standard definition of Duan Dingyi.
If the standard of field definition and standard term defines successful match, for the field type of model, inquiry mark Corresponding type is defined in mutatis mutandis language with the matched standard of the field definition of model.
If the field definition of model is consistent with standard definition, and field type is inconsistent with type, then changes mould Block 83 carries out remarks to model, and the content of remarks is:Field type is inconsistent with type, and output verification is as a result, verification knot Fruit includes the remarks.
The embodiment of the present invention adds remarks to provide amending advice during being verified, for subsequently being tied according to verification Fruit performs modification, field type is revised as consistent with type.
If the field definition of model is consistent with standard definition, and field type is consistent with type, then illustrates the mould Type has met specification, and the check results for directly exporting the field information are successfully.
If the standard definition of field definition and standard term match unsuccessful, output verification result is fails.
It is understood that by being pre-created data dictionary, if each data warehouse is all applied when modeling The device of the embodiment of the present invention, is verified according to data dictionary, obtains consistent, standard tables of data, then subsequently filling out Make up the number according to when, then can be filled directly into the tables of data of standard.
The device of checking treatment provided in this embodiment, the method available for performing above method embodiment, this implementation is not It repeats again.
The device of checking treatment provided in this embodiment, correction verification module carry out the model of data warehouse according to standard term Verification, when field definition is matched with standard definition and field type is mismatched with type, modified module is targetedly Field type is revised as it is consistent with type, so as to obtain the model of the unification of standard.
Fig. 9 shows the structure diagram for a kind of electronic equipment that further embodiment of this invention provides.
Refering to Fig. 9, electronic equipment provided in an embodiment of the present invention, the electronic equipment include memory (memory) 91, Processor (processor) 92, bus 93 and it is stored in the computer program that can be run on memory 91 and on a processor. Wherein, the memory 91, processor 92 complete mutual communication by the bus 93.
The processor 92 is used to call program instruction in the memory 91, and to perform described program when is realized as schemed 1 method.
In another embodiment, following method is realized when the processor performs described program:
The field definition includes field name and field description, and the standard definition includes standard name and standard description, phase Ying Di, according to pre-stored data dictionary, the step of being verified to field information, is specially:
If the field name is matched with standard name, verify whether the field description describes unanimously, and verify with standard Whether field type is consistent with type;
Or;
If the field description and standard profile matching, whether with standard name consistent, and verify if verifying the field name Whether field type is consistent with type.
In another embodiment, following method is realized when the processor performs described program:
If field definition is matched with standard definition and field type is mismatched with type, field type is revised as After the step consistent with type, the method includes:
If field definition is mismatched with standard definition, data prediction is carried out to each field information, is obtained multiple Morpheme;
Pre-stored regulation management library is obtained, the regulation management library includes multiple Substitution Rules, each Substitution Rules Including qualifier and classificating word;
If morpheme is matched with qualifier, the classificating word of the morpheme is judged whether;
If it does not exist, then the morpheme is replaced with into the morpheme and corresponding classificating word.
In another embodiment, following method is realized when the processor performs described program:
If field definition is mismatched with standard definition, data prediction is carried out to each field information, is obtained multiple The step of morpheme is specially:
Each field information is parsed, generates corresponding json character strings;
For every json character strings, word segmentation processing is carried out, obtains multiple morphemes.
In another embodiment, following method is realized when the processor performs described program:
The morpheme includes Chinese morpheme and/or English morpheme, and correspondingly, if morpheme is matched with qualifier, judgement is After the step of no classificating word there are the morpheme, the method includes:
If morpheme and modification word mismatch, obtain pre-stored business dictionary, the business dictionary includes multiple Business term, each business term include Chinese term and English term;
If Chinese morpheme English term corresponding with being not present in Chinese term matching and the morpheme, described in remarks Chinese morpheme, for increasing the English term of the Chinese morpheme;
If English morpheme Chinese term corresponding with English term is not present in English term matching and the morpheme, English morpheme described in remarks, for increasing the Chinese term of the English morpheme.
In another embodiment, following method is realized when the processor performs described program:
The morpheme includes Chinese morpheme and/or English morpheme, correspondingly, the morpheme is replaced with the morpheme and right After the step of classificating word answered, the method includes:
Pre-stored business dictionary is obtained, the business dictionary includes multiple business terms, and each business term includes Chinese term and English term;
If Chinese morpheme English term corresponding with being not present in Chinese term matching and the morpheme, described in remarks Chinese morpheme, for increasing the English term of the Chinese morpheme;
If English morpheme Chinese term corresponding with English term is not present in English term matching and the morpheme, English morpheme described in remarks, for increasing the Chinese term of the English morpheme.
In another embodiment, following method is realized when the processor performs described program:
If field definition is matched with standard definition and field type is mismatched with type, field type is revised as After the step consistent with type, the method includes:
If field definition is mismatched with standard definition, the field definition is trained;
If meeting preset condition, defined the field definition as standard.
Electronic equipment provided in this embodiment, available for performing the corresponding program of method of above method embodiment, this reality It applies and repeats no more.
Electronic equipment provided in this embodiment is realized when performing described program by the processor according to standard term pair The model of data warehouse is verified, when field definition is matched with standard definition and field type is mismatched with type, Targetedly field type is revised as it is consistent with type, so as to obtain the model of the unification of standard.
A kind of storage medium that further embodiment of this invention provides is stored with computer program on the storage medium, institute It states and is realized when program is executed by processor such as the step of Fig. 1.
In another embodiment, following method is realized when described program is executed by processor:
The field definition includes field name and field description, and the standard definition includes standard name and standard description, phase Ying Di, according to pre-stored data dictionary, the step of being verified to field information, is specially:
If the field name is matched with standard name, verify whether the field description describes unanimously, and verify with standard Whether field type is consistent with type;
Or;
If the field description and standard profile matching, whether with standard name consistent, and verify if verifying the field name Whether field type is consistent with type.
In another embodiment, following method is realized when described program is executed by processor:
If field definition is matched with standard definition and field type is mismatched with type, field type is revised as After the step consistent with type, the method includes:
If field definition is mismatched with standard definition, data prediction is carried out to each field information, is obtained multiple Morpheme;
Pre-stored regulation management library is obtained, the regulation management library includes multiple Substitution Rules, each Substitution Rules Including qualifier and classificating word;
If morpheme is matched with qualifier, the classificating word of the morpheme is judged whether;
If it does not exist, then the morpheme is replaced with into the morpheme and corresponding classificating word.
In another embodiment, following method is realized when described program is executed by processor:
If field definition is mismatched with standard definition, data prediction is carried out to each field information, is obtained multiple The step of morpheme is specially:
Each field information is parsed, generates corresponding json character strings;
For every json character strings, word segmentation processing is carried out, obtains multiple morphemes.
In another embodiment, following method is realized when described program is executed by processor:
The morpheme includes Chinese morpheme and/or English morpheme, and correspondingly, if morpheme is matched with qualifier, judgement is After the step of no classificating word there are the morpheme, the method includes:
If morpheme and modification word mismatch, obtain pre-stored business dictionary, the business dictionary includes multiple Business term, each business term include Chinese term and English term;
If Chinese morpheme English term corresponding with being not present in Chinese term matching and the morpheme, described in remarks Chinese morpheme, for increasing the English term of the Chinese morpheme;
If English morpheme Chinese term corresponding with English term is not present in English term matching and the morpheme, English morpheme described in remarks, for increasing the Chinese term of the English morpheme.
In another embodiment, following method is realized when described program is executed by processor:
The morpheme includes Chinese morpheme and/or English morpheme, correspondingly, the morpheme is replaced with the morpheme and right After the step of classificating word answered, the method includes:
Pre-stored business dictionary is obtained, the business dictionary includes multiple business terms, and each business term includes Chinese term and English term;
If Chinese morpheme English term corresponding with being not present in Chinese term matching and the morpheme, described in remarks Chinese morpheme, for increasing the English term of the Chinese morpheme;
If English morpheme Chinese term corresponding with English term is not present in English term matching and the morpheme, English morpheme described in remarks, for increasing the Chinese term of the English morpheme.
In another embodiment, following method is realized when described program is executed by processor:
If field definition is matched with standard definition and field type is mismatched with type, field type is revised as After the step consistent with type, the method includes:
If field definition is mismatched with standard definition, the field definition is trained;
If meeting preset condition, defined the field definition as standard.
Storage medium provided in this embodiment realizes the side of the above method embodiment when described program is executed by processor Method, this implementation repeat no more.
Storage medium provided in this embodiment verifies the model of data warehouse according to standard term, determines in field Justice match with standard definition and when field type and type mismatch, is targetedly revised as field type and standard Type is consistent, so as to obtain the model of the unification of standard.
Further embodiment of this invention discloses a kind of computer program product, and the computer program product includes being stored in non- Computer program in transitory computer readable storage medium, the computer program includes program instruction, when described program refers to When order is computer-executed, computer is able to carry out the method that above-mentioned each method embodiment is provided, such as including:
The model of data warehouse to be verified is obtained, each model includes multiple field informations, and the field information includes Field definition and field type;
According to pre-stored data dictionary, the field information is verified, the data dictionary includes multiple marks Mutatis mutandis language, each standard term include standard definition and type;
If the field definition is matched with standard definition and the field type is mismatched with type, by the word Segment type is revised as consistent with type.
It will be appreciated by those of skill in the art that although some embodiments described herein include being wrapped in other embodiments The certain features rather than other feature included, but the combination of the feature of different embodiment mean in the scope of the present invention it It is interior and form different embodiments.
It will be understood by those skilled in the art that each step in embodiment can with hardware realization or at one or The software module run on the multiple processors of person is realized or is realized with combination thereof.Those skilled in the art should manage Solution, can realize according to embodiments of the present invention one using microprocessor or digital signal processor (DSP) in practice The some or all functions of a little or whole components.The present invention is also implemented as performing method as described herein Some or all equipment or program of device (for example, computer program and computer program product).
Although being described in conjunction with the accompanying embodiments of the present invention, those skilled in the art can not depart from this hair Various modifications and variations are made in the case of bright spirit and scope, such modifications and variations are each fallen within by appended claims Within limited range.

Claims (10)

  1. A kind of 1. method of checking treatment, which is characterized in that the method includes:
    The model of data warehouse to be verified is obtained, each model includes multiple field informations, and the field information includes field Definition and field type;
    According to pre-stored data dictionary, the field information is verified, the data dictionary is used including multiple standards Language, each standard term include standard definition and type;
    If the field definition is matched with standard definition and the field type is mismatched with type, by the field class Type is revised as consistent with type.
  2. 2. according to the method described in claim 1, it is characterized in that:The field definition includes field name and field description, institute It states standard definition and includes standard name and standard description, correspondingly, according to pre-stored data dictionary, school is carried out to field information The step of testing be specially:
    If the field name is matched with standard name, verify whether the field description describes unanimously, and check field with standard Whether type is consistent with type;
    Or;
    If the field description and standard profile matching, whether with standard name consistent, and check field if verifying the field name Whether type is consistent with type.
  3. 3. according to the method described in claim 1, it is characterized in that:If the definition of field definition and standard match and field type and Type mismatches, then is revised as field type after the step consistent with type, the method includes:
    If field definition is mismatched with standard definition, data prediction is carried out to each field information, obtains multiple morphemes;
    Pre-stored regulation management library is obtained, the regulation management library includes multiple Substitution Rules, and each Substitution Rules include Qualifier and classificating word;
    If morpheme is matched with qualifier, the classificating word of the morpheme is judged whether;
    If it does not exist, then the morpheme is replaced with into the morpheme and corresponding classificating word.
  4. 4. according to the method described in claim 3, it is characterized in that:If field definition is mismatched with standard definition, to every One field information carries out data prediction, and the step of obtaining multiple morphemes is specially:
    Each field information is parsed, generates corresponding json character strings;
    For every json character strings, word segmentation processing is carried out, obtains multiple morphemes.
  5. 5. according to the method described in claim 3, it is characterized in that:The morpheme includes Chinese morpheme and/or English morpheme, phase Ying Di, if morpheme is matched with qualifier, after the step of judging whether the classificating word of the morpheme, the method packet It includes:
    If morpheme and modification word mismatch, obtain pre-stored business dictionary, the business dictionary includes multiple business Term, each business term include Chinese term and English term;
    If Chinese morpheme English term corresponding with being not present in Chinese term matching and the morpheme, Chinese described in remarks Morpheme, for increasing the English term of the Chinese morpheme;
    If English morpheme Chinese term corresponding with English term is not present in English term matching and the morpheme, remarks The English morpheme, for increasing the Chinese term of the English morpheme.
  6. 6. according to the method described in claim 3, it is characterized in that:The morpheme includes Chinese morpheme and/or English morpheme, phase Ying Di, after the step of morpheme is replaced with the morpheme and corresponding classificating word, the method includes:
    Pre-stored business dictionary is obtained, the business dictionary includes multiple business terms, and each business term includes Chinese Term and English term;
    If Chinese morpheme English term corresponding with being not present in Chinese term matching and the morpheme, Chinese described in remarks Morpheme, for increasing the English term of the Chinese morpheme;
    If English morpheme Chinese term corresponding with English term is not present in English term matching and the morpheme, remarks The English morpheme, for increasing the Chinese term of the English morpheme.
  7. 7. according to the method described in claim 1, it is characterized in that:If the definition of field definition and standard match and field type and Type mismatches, then is revised as field type after the step consistent with type, the method includes:
    If field definition is mismatched with standard definition, the field definition is trained;
    If meeting preset condition, defined the field definition as standard.
  8. 8. a kind of device of checking treatment, which is characterized in that described device includes:
    Acquisition module, for obtaining the model of data warehouse to be verified, each model includes multiple field informations, the field Information includes field definition and field type;
    Correction verification module, for according to pre-stored data dictionary, being verified to the field information, the data dictionary packet Multiple standard terms are included, each standard term includes standard definition and type;
    Modified module, if match for the definition of the field definition and standard and the field type and type mismatch, Then the field type is revised as consistent with type.
  9. 9. a kind of electronic equipment, which is characterized in that including memory, processor, bus and storage on a memory and can be The computer program run on processor, which is characterized in that the processor realizes such as claim 1-7 when performing described program The step of any one.
  10. 10. a kind of storage medium, is stored thereon with computer program, it is characterised in that:It is real when described program is executed by processor Now such as the step of claim 1-7 any one.
CN201810045917.1A 2018-01-17 2018-01-17 Verification processing method and device, electronic equipment and storage medium Active CN108256074B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810045917.1A CN108256074B (en) 2018-01-17 2018-01-17 Verification processing method and device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810045917.1A CN108256074B (en) 2018-01-17 2018-01-17 Verification processing method and device, electronic equipment and storage medium

Publications (2)

Publication Number Publication Date
CN108256074A true CN108256074A (en) 2018-07-06
CN108256074B CN108256074B (en) 2020-06-23

Family

ID=62741174

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810045917.1A Active CN108256074B (en) 2018-01-17 2018-01-17 Verification processing method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN108256074B (en)

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109165209A (en) * 2018-08-14 2019-01-08 上海达梦数据库有限公司 The data verification method, device of object type, equipment and medium in database
CN109388685A (en) * 2018-10-23 2019-02-26 泰华智慧产业集团股份有限公司 The method and apparatus that will plan that the spatial data that industry uses is put in storage
CN109408510A (en) * 2018-10-19 2019-03-01 中国银行股份有限公司 A kind of method for normalizing and device of data model
CN109597763A (en) * 2018-12-04 2019-04-09 北京广利核系统工程有限公司 A kind of consistency verification method and device that multinomial data are normalized
CN109656912A (en) * 2018-12-13 2019-04-19 成都四方伟业软件股份有限公司 Data model management-control method, device and server
CN109766436A (en) * 2018-12-04 2019-05-17 北京明略软件系统有限公司 A kind of matched method and apparatus of data element of the field and knowledge base of tables of data
CN110532561A (en) * 2019-08-30 2019-12-03 北京明略软件系统有限公司 Data detection method and device, storage medium, electronic device
CN110673888A (en) * 2019-08-27 2020-01-10 贝壳技术有限公司 Verification method and device for configuration file
CN110795482A (en) * 2019-10-16 2020-02-14 浙江大华技术股份有限公司 Data benchmarking method, device and storage device
CN110795464A (en) * 2019-08-28 2020-02-14 腾讯科技(深圳)有限公司 Method, device, terminal and storage medium for checking field of object marker data
CN110909003A (en) * 2019-11-25 2020-03-24 车智互联(北京)科技有限公司 Method for creating data table and computing equipment
CN111488327A (en) * 2019-01-29 2020-08-04 卓望数码技术(深圳)有限公司 Data standard management method and system
CN112035451A (en) * 2020-08-25 2020-12-04 上海灵长软件科技有限公司 Data verification optimization processing method and device, electronic equipment and storage medium
CN112164481A (en) * 2020-08-17 2021-01-01 北京广利核系统工程有限公司 Intelligent verification method and system for nuclear power safety control display equipment database
CN112733199A (en) * 2020-12-28 2021-04-30 北京极豪科技有限公司 Data processing method and device, electronic equipment and readable storage medium
CN113642327A (en) * 2021-10-14 2021-11-12 中国光大银行股份有限公司 Method and device for constructing standard knowledge base
CN114416832A (en) * 2022-01-26 2022-04-29 重庆允丰科技有限公司 Method for configuring formula field and report and computer storage medium
CN115186650A (en) * 2022-09-07 2022-10-14 中国中金财富证券有限公司 Data detection method and related device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102346785A (en) * 2011-11-15 2012-02-08 北京创腾科技有限公司 Method and device for directly self-defining field of database
CN104598598A (en) * 2015-01-23 2015-05-06 浙江协同数据系统有限公司 Method for evaluating relational data standard
US9135309B2 (en) * 2006-02-01 2015-09-15 Oracle International Corporation System and method for building decision trees in a database
CN107193681A (en) * 2016-03-15 2017-09-22 阿里巴巴集团控股有限公司 Data verification method and device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9135309B2 (en) * 2006-02-01 2015-09-15 Oracle International Corporation System and method for building decision trees in a database
CN102346785A (en) * 2011-11-15 2012-02-08 北京创腾科技有限公司 Method and device for directly self-defining field of database
CN104598598A (en) * 2015-01-23 2015-05-06 浙江协同数据系统有限公司 Method for evaluating relational data standard
CN107193681A (en) * 2016-03-15 2017-09-22 阿里巴巴集团控股有限公司 Data verification method and device

Cited By (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109165209B (en) * 2018-08-14 2021-06-08 上海达梦数据库有限公司 Data verification method, device, equipment and medium for object types in database
CN109165209A (en) * 2018-08-14 2019-01-08 上海达梦数据库有限公司 The data verification method, device of object type, equipment and medium in database
CN109408510A (en) * 2018-10-19 2019-03-01 中国银行股份有限公司 A kind of method for normalizing and device of data model
CN109388685A (en) * 2018-10-23 2019-02-26 泰华智慧产业集团股份有限公司 The method and apparatus that will plan that the spatial data that industry uses is put in storage
CN109597763B (en) * 2018-12-04 2022-02-25 北京广利核系统工程有限公司 Consistency verification method and device for normalizing multiple items of data
CN109597763A (en) * 2018-12-04 2019-04-09 北京广利核系统工程有限公司 A kind of consistency verification method and device that multinomial data are normalized
CN109766436A (en) * 2018-12-04 2019-05-17 北京明略软件系统有限公司 A kind of matched method and apparatus of data element of the field and knowledge base of tables of data
CN109656912B (en) * 2018-12-13 2020-08-07 成都四方伟业软件股份有限公司 Data model control method and device and server
CN109656912A (en) * 2018-12-13 2019-04-19 成都四方伟业软件股份有限公司 Data model management-control method, device and server
CN111488327B (en) * 2019-01-29 2023-08-22 卓望数码技术(深圳)有限公司 Data standard management method and system
CN111488327A (en) * 2019-01-29 2020-08-04 卓望数码技术(深圳)有限公司 Data standard management method and system
CN110673888A (en) * 2019-08-27 2020-01-10 贝壳技术有限公司 Verification method and device for configuration file
CN110673888B (en) * 2019-08-27 2023-04-07 贝壳技术有限公司 Verification method and device for configuration file
CN110795464A (en) * 2019-08-28 2020-02-14 腾讯科技(深圳)有限公司 Method, device, terminal and storage medium for checking field of object marker data
CN110795464B (en) * 2019-08-28 2022-03-04 腾讯科技(深圳)有限公司 Method, device, terminal and storage medium for checking field of object marker data
CN110532561B (en) * 2019-08-30 2022-12-09 北京明略软件系统有限公司 Data detection method and device, storage medium and electronic device
CN110532561A (en) * 2019-08-30 2019-12-03 北京明略软件系统有限公司 Data detection method and device, storage medium, electronic device
CN110795482A (en) * 2019-10-16 2020-02-14 浙江大华技术股份有限公司 Data benchmarking method, device and storage device
CN110909003B (en) * 2019-11-25 2022-06-10 车智互联(北京)科技有限公司 Method for creating data table and computing equipment
CN110909003A (en) * 2019-11-25 2020-03-24 车智互联(北京)科技有限公司 Method for creating data table and computing equipment
CN112164481A (en) * 2020-08-17 2021-01-01 北京广利核系统工程有限公司 Intelligent verification method and system for nuclear power safety control display equipment database
CN112164481B (en) * 2020-08-17 2023-09-29 北京广利核系统工程有限公司 Intelligent verification method and system for nuclear power safety control display equipment database
CN112035451A (en) * 2020-08-25 2020-12-04 上海灵长软件科技有限公司 Data verification optimization processing method and device, electronic equipment and storage medium
CN112733199A (en) * 2020-12-28 2021-04-30 北京极豪科技有限公司 Data processing method and device, electronic equipment and readable storage medium
CN113642327A (en) * 2021-10-14 2021-11-12 中国光大银行股份有限公司 Method and device for constructing standard knowledge base
CN114416832A (en) * 2022-01-26 2022-04-29 重庆允丰科技有限公司 Method for configuring formula field and report and computer storage medium
CN114416832B (en) * 2022-01-26 2022-11-15 重庆允丰科技有限公司 Method for configuring formula field and report and computer storage medium
CN115186650A (en) * 2022-09-07 2022-10-14 中国中金财富证券有限公司 Data detection method and related device
CN115186650B (en) * 2022-09-07 2022-12-09 中国中金财富证券有限公司 Data detection method and related device

Also Published As

Publication number Publication date
CN108256074B (en) 2020-06-23

Similar Documents

Publication Publication Date Title
CN108256074A (en) Method, apparatus, electronic equipment and the storage medium of checking treatment
US10558629B2 (en) Intelligent data quality
US9864741B2 (en) Automated collective term and phrase index
US8762180B2 (en) Claims analytics engine
CN109635298B (en) Group state identification method and device, computer equipment and storage medium
US20160306808A1 (en) Computer-implemented method for determining roof age of a structure
CN110990529B (en) Industry detail dividing method and system for enterprises
CN112035595A (en) Construction method and device of audit rule engine in medical field and computer equipment
CN112036842B (en) Intelligent matching device for scientific and technological service
CN112182246A (en) Method, system, medium, and application for creating an enterprise representation through big data analysis
CN110674131A (en) Financial statement data processing method and device, computer equipment and storage medium
US7822621B1 (en) Method of and system for populating knowledge bases using rule based systems and object-oriented software
CN110968664A (en) Document retrieval method, device, equipment and medium
CN113724057A (en) Financial budget filling method, system, equipment and medium based on big data
CN112036841A (en) Policy analysis system and method based on intelligent semantic recognition
CN114693011A (en) Policy matching method, device, equipment and medium
CN114817526B (en) Text classification method and device, storage medium and terminal
US11830081B2 (en) Automated return evaluation with anomoly detection
US20230410018A1 (en) Systems and method for determining hygiene in enterprise documents with respect to regulatory obligations
CN112416983B (en) Data processing method and device and computer readable storage medium
CN117391643B (en) Knowledge graph-based medical insurance document auditing method and system
CN115953136A (en) Contract auditing method and device, computer equipment and storage medium
CN117788163A (en) Verification method and device for trade background, computer equipment and storage medium
CN116775639A (en) Data processing method, storage medium and electronic device
CN112016268A (en) Online document processing method and device, computer equipment and readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information
CB02 Change of applicant information

Address after: 100085 Floor 102-1, Building No. 35, West Second Banner Road, Haidian District, Beijing

Applicant after: Seashell Housing (Beijing) Technology Co.,Ltd.

Address before: 100085 Floor 102-1, Building No. 35, West Second Banner Road, Haidian District, Beijing

Applicant before: LIANJIA(BEIJING) TECHNOLOGY Co.,Ltd.

GR01 Patent grant
GR01 Patent grant