CN108256074A - Verification processing method, apparatus, electronic device, and storage medium - Google Patents
Verification processing method, apparatus, electronic device, and storage medium
- Publication number
- CN108256074A CN201810045917.1A
- Authority
- CN
- China
- Prior art keywords
- field
- morpheme
- standard
- type
- definition
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/25—Integrating or interfacing systems involving database management systems
- G06F16/254—Extract, transform and load [ETL] procedures, e.g. ETL data flows in data warehouses
Abstract
Embodiments of the present invention provide a verification processing method, apparatus, electronic device, and storage medium. The method includes: obtaining the models of a data warehouse to be verified, where each model includes multiple pieces of field information and each piece of field information includes a field definition and a field type; verifying the field information against a pre-stored data dictionary, where the data dictionary includes multiple standard terms and each standard term includes a standard definition and a standard type; and, if the field definition matches the standard definition but the field type does not match the standard type, modifying the field type to be consistent with the standard type. Because the method verifies data warehouse models against standard terms and corrects the field type specifically when the definition matches but the type does not, it yields models that conform to a unified standard.
Description
Technical field
Embodiments of the present invention relate to the field of database technology, and in particular to a verification processing method, apparatus, electronic device, and storage medium.
Background technology
To support better decision-making, a data warehouse needs to be created so that it can provide data support for decisions.
A data warehouse holds a large amount of data, which is extracted and cleaned from multiple originally scattered databases and then systematically processed, summarized, and organized.
Because the data in a data warehouse comes from multiple data sources (databases), the same field may be named differently in each source. When the sources are consolidated into one data warehouse, that single field ends up with several inconsistent names, which lowers the quality of the data warehouse and causes confusion when data is later stored and read.
The prior art mainly relies on manual verification to make the naming of the data standardized and consistent.
Because people differ in experience and ability, omissions and misjudgments occur, so consistent naming across the data warehouse cannot be achieved.
Summary of the invention
In view of the defects of the prior art, embodiments of the present invention provide a verification processing method, apparatus, electronic device, and storage medium.
In one aspect, an embodiment of the present invention provides a verification processing method, the method including:
obtaining the models of a data warehouse to be verified, where each model includes multiple pieces of field information, and the field information includes a field definition and a field type;
verifying the field information against a pre-stored data dictionary, where the data dictionary includes multiple standard terms, and each standard term includes a standard definition and a standard type;
if the field definition matches the standard definition and the field type does not match the standard type, modifying the field type to be consistent with the standard type.
In another aspect, an embodiment of the present invention provides a verification processing apparatus, the apparatus including:
an acquisition module, configured to obtain the models of a data warehouse to be verified, where each model includes multiple pieces of field information, and the field information includes a field definition and a field type;
a verification module, configured to verify the field information against a pre-stored data dictionary, where the data dictionary includes multiple standard terms, and each standard term includes a standard definition and a standard type;
a modification module, configured to modify the field type to be consistent with the standard type if the field definition matches the standard definition and the field type does not match the standard type.
In another aspect, an embodiment of the present invention also provides an electronic device, including a memory, a processor, a bus, and a computer program stored in the memory and executable on the processor, where the processor implements the steps of the above method when executing the program.
In another aspect, an embodiment of the present invention also provides a storage medium on which a computer program is stored, where the program implements the steps of the above method when executed by a processor.
As can be seen from the above technical solutions, in the verification processing method, apparatus, electronic device, and storage medium provided by the embodiments of the present invention, the method verifies the models of a data warehouse against standard terms and, when the field definition matches the standard definition but the field type does not match the standard type, specifically modifies the field type to be consistent with the standard type, thereby obtaining models that conform to a unified standard.
Description of the drawings
Fig. 1 is a schematic flowchart of a verification processing method provided by an embodiment of the present invention;
Fig. 2 is a schematic diagram of the overall structure of a verification processing apparatus provided by another embodiment of the present invention;
Fig. 3 is a schematic flowchart of a verification processing method provided by another embodiment of the present invention;
Fig. 4 is an operational flowchart of the initialization phase provided by another embodiment of the present invention;
Fig. 5 is a diagram of part of an embodiment of the verification operation provided by another embodiment of the present invention;
Fig. 6 is a diagram of part of an embodiment of the verification operation provided by another embodiment of the present invention;
Fig. 7 is a schematic flowchart of the verification operation provided by another embodiment of the present invention;
Fig. 8 is a schematic structural diagram of a verification processing apparatus provided by another embodiment of the present invention;
Fig. 9 is a schematic structural diagram of an electronic device provided by another embodiment of the present invention.
Detailed description
To make the purposes, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly below with reference to the accompanying drawings. Obviously, the described embodiments are some, rather than all, of the embodiments of the present invention.
Fig. 1 shows a schematic flowchart of a verification processing method provided by an embodiment of the present invention.
As shown in Fig. 1, the method provided by this embodiment specifically includes the following steps:
Step 11: obtain the models of the data warehouse to be verified, where each model includes multiple pieces of field information, and the field information includes a field definition and a field type.
Optionally, building a data warehouse can be divided into two steps: first, design the models in which data will be stored; second, write the data into the corresponding models (data tables).
After a model has been designed, it is verified using the method provided by this embodiment.
Optionally, at least one completed model is uploaded to the verification processing apparatus. A model can be regarded as a data table; the data table includes multiple rows, and each row contains the corresponding field information.
Optionally, the field information includes a field definition and a field type. The field definition describes the meaning of the field and may include a field name and a field description. The field type describes the type of the field, for example double or int: double is a double-precision floating-point number, meaning the field may hold a number with a decimal point, while int means the field holds an integer.
Step 12: verify the field information against the pre-stored data dictionary, where the data dictionary includes multiple standard terms, and each standard term includes a standard definition and a standard type.
Optionally, the data dictionary is created in advance. It includes multiple standard terms, each of which has gained general acceptance and can serve as the canonical wording of a unified standard.
Optionally, standard terms are collected from sources such as industry terminology dictionaries, data from historical data warehouses, wikis (such as Wikipedia), and various professional books.
Optionally, a standard term includes a standard definition and a standard type. The standard definition is the standard description of a field, and the standard type is the type that the field is allowed to use.
For example, if the standard definition is "amount" and the standard type created in advance for the amount is double, then once the standard type is fixed as double, an amount must not use int as its type.
Optionally, for the field definition of a model, the standard terms of the data dictionary are searched for a standard definition that matches it.
If the field definition matches the standard definition of a standard term, then for the field type of the model, the standard type corresponding to the matched standard definition is looked up in that standard term.
If the field definition matches no standard definition of any standard term, the output verification result is failure.
Step 13: if the field definition matches the standard definition and the field type does not match the standard type, modify the field type to be consistent with the standard type.
If the field definition of the model is consistent with the standard definition but the field type is inconsistent with the standard type, a remark is added to the model stating that the field type is inconsistent with the standard type, and the verification result, which includes the remark, is output.
In this embodiment, remarks are added during verification to provide modification suggestions, so that the modification can later be performed according to the verification result and the field type changed to be consistent with the standard type.
If the field definition of the model is consistent with the standard definition and the field type is consistent with the standard type, the model already conforms to the specification, and the verification result of that field information is success.
It can be understood that if every data warehouse runs the method of this embodiment at modeling time and is verified against the data dictionary, consistent and standardized data tables are obtained, and data can subsequently be filled directly into those standardized tables.
The verification processing method provided by this embodiment verifies the models of a data warehouse against standard terms and, when the field definition matches the standard definition but the field type does not match the standard type, specifically modifies the field type to be consistent with the standard type, thereby obtaining models that conform to a unified standard.
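Steps 11 to 13 can be illustrated with a minimal Python sketch. The names `StandardTerm`, `FieldInfo`, and `verify_field`, the exact-match lookup, and the status strings are assumptions made for illustration; the patent does not specify an implementation or how matching is performed.

```python
from dataclasses import dataclass


@dataclass
class StandardTerm:
    definition: str     # standard definition, e.g. "amount"
    standard_type: str  # standard type, e.g. "double"


@dataclass
class FieldInfo:
    definition: str  # field definition taken from the model
    field_type: str  # field type taken from the model


def verify_field(field, dictionary):
    """Check one field against the data dictionary (hypothetical helper).

    Returns (status, remark, field). The returned field carries the
    corrected type when the definition matched but the type did not.
    """
    term = dictionary.get(field.definition)
    if term is None:
        # No standard definition matches: verification with the dictionary fails.
        return "failed", "no matching standard definition", field
    if field.field_type != term.standard_type:
        remark = (f"field type {field.field_type} is inconsistent with "
                  f"standard type {term.standard_type}")
        return "modified", remark, FieldInfo(field.definition, term.standard_type)
    return "success", "", field


# The "amount must be double" example from the text.
dictionary = {"amount": StandardTerm("amount", "double")}
status, remark, fixed = verify_field(FieldInfo("amount", "int"), dictionary)
print(status, fixed.field_type)  # modified double
```

The sketch returns the remark alongside the corrected field, mirroring the text's description of a verification result that carries a modification suggestion.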
On the basis of the above embodiment, in the verification processing method provided by another embodiment of the present invention, the field definition includes a field name and a field description, and the standard definition includes a standard name and a standard description. Correspondingly, the step of verifying the field information against the pre-stored data dictionary is specifically:
if the field name matches a standard name, verifying whether the field description is consistent with the standard description and whether the field type is consistent with the standard type;
or
if the field description matches a standard description, verifying whether the field name is consistent with the standard name and whether the field type is consistent with the standard type.
Optionally, the content of a model is as shown in Table 1:
Table 1
Field name | Field description | Field type |
Paidup_perf_amount | Paid achievement | Double |
…… | …… | …… |
Optionally, if the field name matches a standard name, the other parts of the field information (the field description and the field type) are verified to determine whether they are consistent with the standard description and standard type that correspond to the matched standard name.
If they are consistent, the field information is fully consistent with the standard term, and the verification result is success.
If they are inconsistent, a remark is added stating that the field description and the field type are inconsistent with the standard term, so that they can subsequently be modified to be consistent with the standard term.
Similarly, if the field description matches a standard description, the other parts of the field information (the field name and the field type) are verified to determine whether they are consistent with the standard name and standard type that correspond to the matched standard description.
The other steps of this embodiment are similar to those of the previous embodiment and are not repeated here.
The verification processing method provided by this embodiment verifies the field name and the field description separately, so accurate verification results can be obtained.
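The two matching paths (by name, or by description) can be sketched as follows. This is only an illustration: the function name, the dictionary keys, and the choice to try names before descriptions are assumptions, and the row of Table 1 is reused here as a hypothetical standard term.

```python
def verify_by_name_or_description(field, terms):
    """Match on name first, then on description; report the parts that differ."""
    for key, others in (("name", ("description", "type")),
                        ("description", ("name", "type"))):
        for term in terms:
            if field[key] == term[key]:
                # Compare the remaining parts against the matched standard term.
                diffs = [k for k in others if field[k] != term[k]]
                if not diffs:
                    return "success", []
                return "remark", diffs
    return "failed", []


# Hypothetical standard term, borrowing the values shown in Table 1.
terms = [{"name": "Paidup_perf_amount",
          "description": "Paid achievement",
          "type": "Double"}]

# Same name, but a different description and type than the standard term.
field = {"name": "Paidup_perf_amount", "description": "Paid", "type": "Int"}
print(verify_by_name_or_description(field, terms))
# ('remark', ['description', 'type'])
```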
On the basis of the above embodiment, in the verification processing method provided by another embodiment of the present invention, after the step of modifying the field type to be consistent with the standard type if the field definition matches the standard definition and the field type does not match the standard type, the method includes:
if the field definition does not match any standard definition, performing data preprocessing on each piece of field information to obtain multiple morphemes;
obtaining a pre-stored rule management library, where the rule management library includes multiple substitution rules, and each substitution rule includes a qualifier and a classification word;
if a morpheme matches a qualifier, determining whether the classification word for that morpheme is present;
if it is not present, replacing the morpheme with the morpheme followed by the corresponding classification word.
Optionally, if the field definition fails to match the standard definition of any standard term, verification using the data dictionary has failed; this embodiment is then performed, and verification continues using the rule management library.
Optionally, a morpheme is the smallest word that carries a specific meaning and cannot be split further, for example: day, month, income, city.
For example, the sequence "traveller's end achievement" is split into the three morphemes "traveller", "end", and "achievement".
Optionally, the field information can be segmented into words using prior-art methods to obtain the morphemes, after which each morpheme is verified using the rule management library.
Optionally, the rule management library includes multiple substitution rules, each of which includes a qualifier and a classification word. The qualifier and the classification word stand in the relationship of modifier and head, that is, modifying and being modified: the qualifier is the modifier morpheme that describes the classification word, and the classification word is the head morpheme that the qualifier modifies.
For example, in "achievement amount", "amount" is the head: it indicates that "achievement amount" belongs to the category of money and is a monetary value, while "achievement" indicates that this value is the value of an achievement rather than some other value.
Optionally, the role of a substitution rule is as follows: when the morphemes split from a piece of field information include a qualifier but not its classification word, the qualifier is replaced with the qualifier plus the classification word. This is equivalent to adding the classification word to a qualifier that appears in the field information without one.
For each morpheme of the field information, the qualifiers of the substitution rules are searched. If a morpheme matches the qualifier of a substitution rule, it is determined whether the classification word for that morpheme is present in the field information.
If the classification word of the qualifier is not present in the field information, a remark is added and the morpheme in the model is replaced with two morphemes, that is, the qualifier is replaced with the qualifier and its corresponding classification word.
For example, when the model includes the morpheme "achievement" but not "amount", "achievement" is converted to "achievement amount" according to the substitution rule.
It can be understood that abbreviations may have been used when the model was designed, so that only the qualifier is present and the classification word is omitted. For this non-standard form, the substitution rule adds the preset classification word to the qualifier.
If the classification word of the qualifier is present in the field information, the field information conforms to the specification, and the verification result is success.
The other steps of this embodiment are similar to those of the previous embodiment and are not repeated here.
The verification processing method provided by this embodiment verifies using the rule management library when the field definition does not match a standard definition; if a morpheme matches a qualifier and the classification word is absent, the morpheme is replaced with the morpheme and the corresponding classification word, so that non-standard abbreviations are corrected to the standard form.
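The substitution rule described above can be sketched in a few lines of Python. The representation of the rule management library as a mapping from qualifier to classification word, and the function name, are assumptions for illustration only.

```python
def apply_substitution_rules(morphemes, rules):
    """Append the classification word after any qualifier that lacks it.

    rules maps a qualifier to its classification word (assumed layout).
    Returns the new morpheme list and the remarks that were added.
    """
    result, remarks = [], []
    for morpheme in morphemes:
        result.append(morpheme)
        classifier = rules.get(morpheme)
        # Only expand when the classification word is absent from the field.
        if classifier is not None and classifier not in morphemes:
            result.append(classifier)
            remarks.append(f"replace {morpheme} with {morpheme} {classifier}")
    return result, remarks


rules = {"achievement": "amount"}
new_morphemes, remarks = apply_substitution_rules(["paid", "achievement"], rules)
print(new_morphemes)  # ['paid', 'achievement', 'amount']
```

When the classification word is already present, for example `["achievement", "amount"]`, the list is returned unchanged and no remark is produced, which corresponds to the "success" case in the text.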
On the basis of the above embodiment, in the verification processing method provided by another embodiment of the present invention, the step of performing data preprocessing on each piece of field information to obtain multiple morphemes if the field definition does not match the standard definition is specifically:
parsing each piece of field information to generate a corresponding JSON string;
performing word segmentation on each JSON string to obtain multiple morphemes.
Optionally, a parser is used to parse each piece of field information.
Optionally, a JSON string is treated as a sequence, and a word segmentation component is called to split the sequence into words, yielding the morphemes.
Optionally, the word segmentation component is a function whose role is to cut a sequence into individual words, that is, morphemes. After the morphemes are obtained, they are verified using the rule management library.
The other steps of this embodiment are similar to those of the previous embodiment and are not repeated here.
The verification processing method provided by this embodiment parses the field information to generate the corresponding JSON strings and performs word segmentation on each JSON string to obtain morphemes, which enables the subsequent verification at the morpheme level.
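As a rough sketch of this preprocessing step, the following Python serializes a field record to a JSON string and then segments it. The record layout and both function names are hypothetical, and the splitter below only breaks on underscores and whitespace; it is a stand-in for a real word segmentation component, which would be needed for Chinese text.

```python
import json
import re


def to_json_string(field):
    """Serialize one piece of field information to a JSON string."""
    return json.dumps(field, ensure_ascii=False, sort_keys=True)


def segment(json_string):
    """Split the name and description of a field into lowercase tokens.

    Placeholder segmentation: splits on underscores and whitespace only.
    """
    record = json.loads(json_string)
    text = f"{record['name']} {record['description']}"
    return [token.lower() for token in re.split(r"[_\s]+", text) if token]


field = {"name": "Paidup_perf_amount",
         "description": "Paid achievement",
         "type": "Double"}
print(segment(to_json_string(field)))
# ['paidup', 'perf', 'amount', 'paid', 'achievement']
```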
On the basis of the above embodiment, in the verification processing method provided by another embodiment of the present invention, the morphemes include Chinese morphemes and/or English morphemes. Correspondingly, after the step of determining whether the classification word for the morpheme is present if the morpheme matches a qualifier, the method includes:
if the morpheme does not match any qualifier, obtaining a pre-stored business dictionary, where the business dictionary includes multiple business terms, and each business term includes a Chinese term and an English term;
if a Chinese morpheme matches a Chinese term and the English term corresponding to that Chinese term is not present among the morphemes, adding a remark for the Chinese morpheme so that its English term can be added;
if an English morpheme matches an English term and the Chinese term corresponding to that English term is not present among the morphemes, adding a remark for the English morpheme so that its Chinese term can be added.
If the morpheme does not match any qualifier, verification using the rule management library has failed, and verification continues using the business dictionary.
Optionally, each business term consists of two morphemes, a Chinese term and an English term, for example (achievement, perf) and (broker, agent).
For example, if the morphemes include "achievement", which matches the Chinese term in the business term (achievement, perf), but do not include the English term perf, a remark is added: add the English term perf. Similarly, if the model includes perf but not "achievement", a remark is added: add "achievement".
It can be understood that abbreviations may have been used when the model was designed, with only Chinese and no English, or only English and no Chinese. For this non-standard form, the business terms are used for completion, so that subsequent data can be correctly identified and filled in whether it is in Chinese or in English.
Optionally, if the morphemes include only Chinese morphemes, Chinese morpheme matching is performed; if they include only English morphemes, English morpheme matching is performed; if they include both, the matching order is not limited, and either Chinese or English matching may be performed first.
If a Chinese morpheme matches a Chinese term but the English morpheme does not match the English term, a remark describing the situation is added and the verification result is output.
Similarly, if an English morpheme matches an English term but the Chinese morpheme does not match the Chinese term, a remark must be added.
The other steps of this embodiment are similar to those of the previous embodiment and are not repeated here.
The verification processing method provided by this embodiment continues verification using Chinese terms and English terms when a morpheme does not match any qualifier, which makes it possible to complete missing Chinese and English morphemes.
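The business dictionary check can be sketched as below. The pair layout and the function name are assumptions; in this translated text the Chinese terms appear as their English glosses ("achievement", "broker"), so the sketch uses those glosses in place of the original Chinese words.

```python
# Each business term is a (Chinese term, English term) pair. The Chinese
# terms are shown as the English glosses used in this translation.
BUSINESS_TERMS = [("achievement", "perf"), ("broker", "agent")]


def complete_bilingual(morphemes, terms):
    """Return remarks for morphemes whose counterpart term is missing."""
    present = set(morphemes)
    remarks = []
    for chinese, english in terms:
        if chinese in present and english not in present:
            remarks.append(f"add English term {english} for {chinese}")
        elif english in present and chinese not in present:
            remarks.append(f"add Chinese term {chinese} for {english}")
    return remarks


print(complete_bilingual(["achievement"], BUSINESS_TERMS))
# ['add English term perf for achievement']
```

If both halves of a pair are present, or neither is, no remark is produced for that business term.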
On the basis of the above embodiment, in the verification processing method provided by another embodiment of the present invention, the morphemes include Chinese morphemes and/or English morphemes. Correspondingly, after the step of replacing the morpheme with the morpheme and the corresponding classification word, the method includes:
obtaining a pre-stored business dictionary, where the business dictionary includes multiple business terms, and each business term includes a Chinese term and an English term;
if a Chinese morpheme matches a Chinese term and the English term corresponding to that Chinese term is not present among the morphemes, adding a remark for the Chinese morpheme so that its English term can be added;
if an English morpheme matches an English term and the Chinese term corresponding to that English term is not present among the morphemes, adding a remark for the English morpheme so that its Chinese term can be added.
After verification with the rule management library, in which abbreviated morphemes are replaced, verification continues using the business dictionary.
Optionally, each business term consists of two morphemes, a Chinese term and an English term, for example (achievement, perf) and (broker, agent).
For example, suppose the original morphemes include "achievement", which is replaced with "achievement amount" after verification with the rule management library. In this embodiment, "achievement" matches the Chinese term in the business term (achievement, perf) and the English term perf is not included, so a remark is added: add the English term perf. Likewise, "amount" matches the Chinese term in the business term (amount, amount) and the English term amount is not included, so a remark is added: add the English term amount.
If no morpheme matches after the business terms have been traversed, a corresponding remark is added and the verification result is output.
It can be understood that abbreviations may have been used when the model was designed, with only Chinese and no English, or only English and no Chinese. For this non-standard form, the business terms are used for completion, so that subsequent data can be correctly identified and filled in whether it is in Chinese or in English.
Optionally, if a Chinese morpheme matches a Chinese term but the English morpheme does not match the English term, a remark describing the situation is added and the verification result is output. Similarly, if an English morpheme matches an English term but the Chinese morpheme does not match the Chinese term, a remark is added.
The other steps of this embodiment are similar to those of the previous embodiment and are not repeated here.
The verification processing method provided by this embodiment continues verification using Chinese terms and English terms after verification with the rule management library, which makes it possible to complete missing Chinese and English morphemes.
On the basis of the above embodiment, in the verification processing method provided by another embodiment of the present invention, after the step of modifying the field type to be consistent with the standard type if the field definition matches the standard definition and the field type does not match the standard type, the method includes:
if the field definition does not match any standard definition, training on the field definition;
if a preset condition is met, taking the field definition as a standard definition.
Optionally, if the field definition does not match any standard definition, there are two possible situations: either the field definition is non-standard, or it is in fact the canonical wording of a unified standard but the data dictionary has not yet recorded it as a standard definition.
Optionally, training on the field definition means matching the field definition again in a targeted way to determine whether it can serve as a standard definition.
Optionally, the field definition is matched against the latest data. If canonical wording of a unified standard that matches the field definition is found, the field definition is taken as a standard definition and its field type as the standard type, yielding a new standard term.
It can be understood that the newly added standard terms are classified and put into the data dictionary, ultimately forming closed-loop verification management that gradually extends the scope of verification and reduces misjudgments.
The other steps of this embodiment are similar to those of the previous embodiment and are not repeated here.
The verification processing method provided by this embodiment trains on the field definition when it does not match any standard definition and, once it is determined that the field definition can serve as a standard definition, stores it in the data dictionary to extend the scope of verification.
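The closed loop described here might look like the following sketch. The text does not define the preset condition or the training procedure, so this example assumes the simplest possible condition: the unmatched field definition appears in a set of currently recognized wordings. The function name and both data structures are hypothetical.

```python
def train_field_definition(field, dictionary, recognized_definitions):
    """Promote an unmatched field definition to a standard term if recognized.

    dictionary maps standard definition -> standard type.
    recognized_definitions is a hypothetical stand-in for the "latest data".
    Returns True when a new standard term was added.
    """
    definition = field["definition"]
    if definition in dictionary:
        return False  # already a standard term, nothing to learn
    if definition in recognized_definitions:  # the assumed preset condition
        # The field type of the promoted definition becomes the standard type.
        dictionary[definition] = field["type"]
        return True
    return False


dictionary = {"amount": "double"}
added = train_field_definition({"definition": "house_id", "type": "int"},
                               dictionary, {"house_id", "pt"})
print(added, dictionary["house_id"])  # True int
```

After promotion, later models that use the same definition are checked against the new entry, which is how the dictionary gradually extends the scope of verification.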
To aid understanding of the technical content of the present invention, the verification processing method provided by this embodiment is described in detail on the basis of the above embodiments.
The method of this embodiment mainly addresses the problem that data standards are currently difficult to put into practice in data warehouses, and proposes the following solution:
Using word segmentation combined with experience-based matching, learning material is obtained from historical documents and data models, and experience-based learning is performed; common standard vocabulary and business terms are extracted and corrected, and standard terms or specifications of different categories are stored in the corresponding dictionary managers.
In later modeling and development, the standard terms (specifications) that have been accumulated serve as "experience and standards" for checking and correcting subsequent models, which solves the problem of standardizing data naming in models. A closed-loop management mode of model verification, data dictionary accumulation, and further model verification is used to adjust and optimize continuously, so that the system keeps improving. This achieves automated maintenance of data standards management, improves the data quality of the data warehouse, and reduces operating costs.
This embodiment can be used to impose normalization constraints on data naming, data definitions, and data types, addressing situations during modeling in which terminology is inconsistent or the modeler does not know how to name something.
The standardization object referred to in this embodiment is the data used within the scope of an engineering project; it can be understood as the target object on which data standardization is to be performed.
Fig. 2 is a schematic diagram of the overall structure of a verification processing apparatus provided by another embodiment of the present invention.
As shown in Fig. 2, the overall structure of the verification processing apparatus is divided into three parts:
Data storage layer: stores the data standard specification dictionaries, including the rule management library, the data dictionary, and the business dictionary;
Standard verification layer: performs verification of standardization objects and generates the standard dictionaries; it consists of three components: the vocabulary training system, the word segmentation component, and the standard checker;
Data interface layer: receives and parses the model documents of standardization objects and provides verification reports externally; it consists of the model parser and the verification report generator.
Standardized verification is performed mainly with respect to the following items:
1. Morpheme: the smallest unit word that carries a specific meaning. It can generally be understood as a word obtained by splitting a standardization object with the word segmentation component. In standardization work, the first step is to decompose existing terms into their smallest units of meaning and then confirm the standard words, for example: day, month, income, city. Morphemes belong to the business dictionary.
2. Standard word: the smallest unit word with lexical meaning, and the basic building block of business terms. A standard word consists of a Chinese word together with an English abbreviation; each standard word can have one matching English abbreviation, for example (achievement, perf) and (broker, agent). Standard words belong to the business dictionary.
3. Classification word: a standard word that identifies the type of an entity or entity attribute, from which the type of the underlying data value can be inferred, for example amount, quantity, PV, UV. Classification words belong to the business dictionary.
4. Standard domain: data is divided into code domains, number domains, group domains, and so on, and a standard data type (character string, number, date, and so on) and length are defined to make the data range explicit, for example (amount, amount, double) and (quantity, num, int). Standard domains belong to the business dictionary.
5. Standard term: any standard item name generated from standard words according to the naming rule (qualifier + classification word), including entity names, entity attribute names, table names, column names, and domain names, for example (pt, time partition, string) and (house_id, house source ID, int). Standard terms belong to the data dictionary.
6. Rule conversion: merge-and-convert operations on standard words, classification words, and standard domains. Frequently used vocabulary terms are spliced together in the form qualifier + classification word; when an abbreviated name appears in a standardization object, it can be converted to its full name according to the conversion rule, with the corresponding English identifier and other information attached. For example, given (achievement amount, perf_amount, double), when the word "achievement" appears in a standardization object, the rule manager converts "achievement" to "achievement amount" according to the conversion rule. Rule conversion belongs to the rule management library.
The embodiments of the present invention involve two major parts:
Part one: initialization phase. To ensure the accuracy of data normalization judgments and verification, data standard initialization is required. It mainly comprises collecting data sources and determining the data dictionary, the regulation management library, and the business dictionary. This part of the work is carried out by a combination of automatic software processing and manual intervention.
Part two: data standard verification phase. After data standard initialization, data standard verification begins and a detection report is generated, which is used to make standardization modifications before a model goes online. After the model goes online, vocabulary parsing is performed on the newly added model and new standard terms are appended, forming a closed-loop management cycle: standard object verification -> add standard terms -> standard object verification.
Fig. 3 is a flow diagram of the checking treatment method provided by a further embodiment of the present invention.
As shown in Fig. 3, the embodiment of the present invention specifically includes multiple steps: model parsing, word segmentation, model verification, outputting the verification report, model modification, submitting and parsing the model, model training, and bringing the model online.
These can be regarded as three stages: initialization, verification, and subsequent steps.
Fig. 4 is an operational flowchart of the initialization phase provided by a further embodiment of the present invention.
As shown in Fig. 4, step 1: select the range of normalized objects. Material is usually collected from industry term dictionaries, existing data warehouse data, wikis, and various professional books.
After the material is collected, it can be processed in two ways. Text material is segmented directly and the words are scored and sorted by word frequency, then filtered in order of word frequency from high to low. For existing database information, the English names and Chinese names of fields are split, recursively recombined, and sorted, then filtered and modified in turn according to pairing frequency and existing naming conventions. Finally, after manual filtering, the entries are placed into the appropriate store: the data dictionary, the regulation management library, or the business dictionary.
When experience-based matching is performed on the metadata of existing models, as shown in Fig. 3, the field attribute information of all models is first obtained and the words are split into individual morphemes. Then, recursively, the English code, the Chinese name, and the field type of each field are spliced into one character string, and word frequency statistics are computed over all spliced strings; the groups with the highest occurrence counts are taken as the experience-based matching standard. Finally, after manual inspection, they are entered into the three information stores (data dictionary, regulation management library, and business dictionary).
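The word frequency statistics over spliced field strings can be illustrated with a minimal sketch. The function name `rank_field_triples`, the `|` separator, and the sample rows are assumptions made for illustration; the source does not specify how the strings are spliced or stored.

```python
from collections import Counter

def rank_field_triples(fields):
    """Splice (English code, Chinese name, field type) into one string per
    field and count how often each spliced string occurs.

    The "|" separator is an assumption; the source only says the three
    parts are spliced into a character string.
    """
    spliced = ["|".join((code, name, field_type)) for code, name, field_type in fields]
    # most_common() sorts by occurrence count, highest first
    return Counter(spliced).most_common()

# Hypothetical metadata gathered from existing models
fields = [
    ("house_id", "housing listing ID", "int"),
    ("house_id", "housing listing ID", "int"),
    ("house_id", "housing listing ID", "string"),
    ("pt", "time partition", "string"),
]
ranked = rank_field_triples(fields)
```

The highest-ranked strings would then be candidates for manual inspection before being entered into one of the three stores.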
Step 2: after the initialization step is completed, the criteria check program can be run online.
Fig. 5 is a diagram of part of the verification operation provided by a further embodiment of the present invention.
Fig. 6 is a diagram of part of the verification operation provided by a further embodiment of the present invention.
As shown in Fig. 5 and Fig. 6, when data standard verification is carried out on a newly designed model, the model information is uploaded first, including the table name, field names, field types, and field descriptions.
Then the 【Model parser】 parses the uploaded model, converting each row of data into a JSON string of the form {origincode (field name), name (field description), type (field type)};
After the whole model has been parsed, the JSON information and the original information are sent to the 【Standard calibrator】 for verification;
When the 【Standard calibrator】 receives the JSON strings, it calls the 【Segment component】 row by row to split the words into morphemes; after splitting, it begins verifying each row of data.
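As a rough illustration of the parsing and splitting steps just described, the sketch below builds the {origincode, name, type} JSON string for one row and splits it into morphemes. The helper names `parse_row` and `split_morphemes` are hypothetical, and the splitting rule (underscores for the code, whitespace for the description) is a simplifying assumption; real Chinese descriptions would need a proper word segmentation component.

```python
import json

def parse_row(field_name, description, field_type):
    """Serialize one model row into the JSON structure named in the text."""
    row = {"origincode": field_name, "name": description, "type": field_type}
    return json.dumps(row, ensure_ascii=False)

def split_morphemes(row_json):
    """Split the field code and description of one JSON row into morphemes.

    Assumption: codes are underscore-separated and descriptions are
    whitespace-separated; this stands in for the segment component.
    """
    row = json.loads(row_json)
    return {
        "code": row["origincode"].lower().split("_"),
        "name": row["name"].split(),
    }

row_json = parse_row("perf_amount", "achievement amount", "double")
morphemes = split_morphemes(row_json)
```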
Fig. 7 is a flow diagram of the verification operation provided by a further embodiment of the present invention.
As shown in Fig. 7, the whole verification procedure generally passes through three judgments:
1. First, the data in the 【Data dictionary】 is matched against the field code/name of the original information, querying whether the field name or the field description can be matched. If either one matches, the matched entry is retrieved and used to check the remaining data of the current row against the standard; where the row does not meet the specification, a remark is added using the field in the 【Data dictionary】. If neither matches, the procedure proceeds to the next judgment.
2. After the 【Data dictionary】 judgment, the second judgment queries whether a morpheme in the JSON string has a corresponding element in the 【Regulation management library】. If so, the rule is retrieved and used to judge and fill the JSON data under verification; if no corresponding element is found, the procedure proceeds to the next judgment.
3. After the 【Regulation management library】 judgment, the morphemes in the JSON string are looked up in the 【Business dictionary】. If there is a matching word, it is retrieved and used to check the elements in the JSON data against the specification, and standard fill-in remarks are added where the specification is not met. If there is none, this round of verification ends and the verification result for the row is held temporarily. When all fields of the model under verification have been checked, all temporarily held results are sent to the 【Verification report generator】, which generates an annotated verification report with modification suggestions and returns them to the submitter. This round of model verification then ends.
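The three judgments above can be sketched as a simple cascade. This is a minimal illustration under stated assumptions, not the patented implementation: the function name `check_row`, the shapes of the three stores, and the wording of the remarks are all hypothetical.

```python
def check_row(row, morphemes, data_dict, rule_lib, business_dict):
    """Run one row through the three judgments in order.

    Assumed shapes: data_dict is a list of {origincode, name, type} dicts,
    rule_lib maps a short name to its full name, and business_dict maps a
    word to its English abbreviation.
    """
    # Judgment 1: match on field name or field description in the data dictionary
    for term in data_dict:
        if row["origincode"] == term["origincode"] or row["name"] == term["name"]:
            remarks = [
                f"{key} should be {term[key]!r}"
                for key in ("origincode", "name", "type")
                if row[key] != term[key]
            ]
            return {"stage": "data dictionary", "remarks": remarks}
    # Judgment 2: look for a morpheme with a rule in the regulation management library
    for morpheme in morphemes:
        if morpheme in rule_lib:
            remark = f"expand {morpheme!r} to {rule_lib[morpheme]!r}"
            return {"stage": "regulation management library", "remarks": [remark]}
    # Judgment 3: look the morphemes up in the business dictionary
    hits = [m for m in morphemes if m in business_dict]
    if hits:
        remarks = [f"{m!r} -> {business_dict[m]!r}" for m in hits]
        return {"stage": "business dictionary", "remarks": remarks}
    # Nothing matched: the row result is simply held for the report
    return {"stage": "unmatched", "remarks": []}
```

Collecting the returned dictionaries for every row would give the temporarily held results that a report generator could format.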
Step 3: after online model verification is completed and the naming and structure of the model have been modified according to the verification report, the model can be submitted again by clicking "model submission". The current model then enters vocabulary training and word extraction. Finally, repository maintainers classify the newly extracted standard terms and vocabulary into the three dictionary libraries, forming closed-loop verification management that gradually extends coverage.
The present patent application has the following innovations:
1. A data standard verification process and method with a high degree of automation is realized through a small amount of manual operation plus software. At startup, multiple data sources and multiple initialization modes are used to identify the range of standard objects. After online deployment, a closed loop of model verification + standard term vocabulary addition + model verification continuously extends the standard dictionary libraries, so that the matching coverage and verification capability of the verification method and system continuously improve.
2. The data standard verification of the present invention is comprehensive. It covers the verification of data naming (entity names, entity attribute names, table names, column names, index names, and so on), data definitions, and data types in the data warehouse. Normalized objects are verified from several aspects (morphemes, standard words, standard terms, classification words, canonical domains, and conversion rules) using standard conditions at multiple levels of granularity, so as to maximize the matching degree and verification accuracy.
3. The data standard verification of the present invention is flexible. According to the different situations of data normalization, three categories are abstracted: the data dictionary, the regulation management library, and the business dictionary. The data dictionary is used for canonical domain verification of fixed field names and strictly defines the name, type, description, and data range of a data field. The regulation management library is used for converting standard term vocabulary: vocabulary that matches but is described in a nonstandard way is converted directly into the standard term. The business dictionary is used to provide standardized naming suggestions for the morphemes produced by word segmentation. Users can build different standard specifications into different dictionaries, which gives considerable independence and flexibility.
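To make the three categories concrete, the following sketch models one entry type per store, populated with the example values given earlier in the description. The class names, field names, and the lookup helper `suggest_abbreviation` are assumptions for illustration; the source does not prescribe a storage layout.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class StandardTerm:
    """Data dictionary entry: a strictly defined field."""
    code: str
    description: str
    data_type: str

@dataclass(frozen=True)
class ConversionRule:
    """Regulation management library entry: short name to full standard term."""
    short_name: str
    full_name: str
    code: str
    data_type: str

@dataclass(frozen=True)
class StandardWord:
    """Business dictionary entry: a word and its English abbreviation."""
    word: str
    abbreviation: str

DATA_DICTIONARY = {
    t.code: t
    for t in (
        StandardTerm("pt", "time partition", "string"),
        StandardTerm("house_id", "housing listing ID", "int"),
    )
}
RULE_LIBRARY = {
    "achievement": ConversionRule("achievement", "achievement amount", "perf_amount", "double"),
}
BUSINESS_DICTIONARY = {
    w.word: w for w in (StandardWord("achievement", "perf"), StandardWord("broker", "agent"))
}

def suggest_abbreviation(morpheme):
    """Return a naming suggestion for a morpheme, or None if it is unknown."""
    word = BUSINESS_DICTIONARY.get(morpheme)
    return word.abbreviation if word else None
```

Keeping the stores separate mirrors the flexibility claim: each one can be extended independently as new standards are agreed.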
The checking treatment method provided in this embodiment offers a sound method, flow, and implementation for standardization practice in enterprise data warehouse construction; it improves the construction quality of the data warehouse, ensures the correctness of data, and maintains the consistency of enterprise models; it improves the efficiency of data warehouse development, production, and management, and reduces the waste of resources and manpower caused by repeated, ineffective work; and it reduces the operating cost of maintaining the enterprise data warehouse.
Fig. 8 is a structural diagram of a checking treatment device provided by a further embodiment of the present invention.
Referring to Fig. 8, on the basis of the above embodiments, the checking treatment device provided in this embodiment includes an acquisition module 81, a verification module 82, and a modification module 83, wherein:
The acquisition module 81 is used to obtain the model of the data warehouse to be verified; each model includes multiple pieces of field information, and the field information includes a field definition and a field type. The verification module 82 is used to verify the field information according to a pre-stored data dictionary; the data dictionary includes multiple standard terms, and each standard term includes a standard definition and a standard type. The modification module 83 is used to modify the field type to be consistent with the standard type if the field definition matches the standard definition and the field type does not match the standard type.
Optionally, the construction of a data warehouse can be divided into two steps: first, designing the model for data storage; second, writing data into the corresponding model (data table).
After the model design is completed, the model is verified using the device provided in the embodiment of the present invention.
The checking treatment device is a computer equipped with the iBATIS framework. iBATIS is a Java-based open-source project that can automate object-relational mapping.
Optionally, the acquisition module 81 uploads at least one completed model design to the checking treatment device; each model includes multiple rows of data.
Optionally, a model can be regarded as a data table with a header; the data table includes multiple rows of data, and each row includes the corresponding field information.
Optionally, the field information includes a field definition and a field type. The field definition describes the meaning of the field and may include a field name and a field description. The field type describes the type of the field, for example double or int: double is a double-precision floating-point number, meaning the field can hold a number with a decimal point, and int denotes an integer, meaning the field holds a whole number.
The verification module 82 verifies the field information according to the pre-stored data dictionary.
Optionally, the data dictionary is created in advance. It includes multiple standard terms, each of which is a term that has gained general acceptance and can serve as a unified standard.
Optionally, the standard terms are collected from industry term dictionaries, the data of historical data warehouses, wikis (such as Wikipedia), and various professional books.
Optionally, a standard term includes a standard definition and a standard type. The standard definition is the standard description of a field, and the standard type is the type that the field may use.
For example, if the standard definition is "amount of money" and the standard type created in advance for it is double, then once the standard type has been fixed as double, int is no longer used as the type for amounts of money.
Optionally, for a field definition of the model, the standard terms of the data dictionary are queried for a standard definition that matches the field definition.
If the field definition matches the standard definition of a standard term, then for the field type of the model, the standard type corresponding to the matched standard definition is looked up in that standard term.
If the field definition of the model is consistent with the standard definition but the field type is inconsistent with the standard type, the modification module 83 adds a remark to the model stating that the field type is inconsistent with the standard type, and outputs the verification result, which includes the remark.
In the embodiment of the present invention, remarks are added during verification to provide modification suggestions, so that the modification can subsequently be carried out according to the verification result and the field type changed to be consistent with the standard type.
If the field definition of the model is consistent with the standard definition and the field type is consistent with the standard type, the model already meets the specification, and the verification result output for that field information is success.
If the field definition does not match the standard definition of any standard term, the verification result output is failure.
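The three outcomes described above (success, remark with a suggested type, failure) can be summarized in a short sketch. The function name `verify_field`, the result labels, and the dictionary layout (standard definition mapped to standard type) are assumptions for illustration.

```python
def verify_field(field_definition, field_type, data_dictionary):
    """Check one field against a data dictionary.

    Assumption: data_dictionary maps each standard definition to its
    standard type, e.g. {"amount of money": "double"}.
    """
    if field_definition not in data_dictionary:
        # No standard definition matches: verification fails
        return {"result": "failure", "remark": None}
    standard_type = data_dictionary[field_definition]
    if field_type == standard_type:
        # Definition and type both agree with the standard
        return {"result": "success", "remark": None}
    # Definition matches but the type differs: add a remark as a modification suggestion
    remark = f"field type {field_type} is inconsistent with standard type {standard_type}"
    return {"result": "needs modification", "remark": remark}

data_dictionary = {"amount of money": "double"}
outcome = verify_field("amount of money", "int", data_dictionary)
```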
It is understood that by being pre-created data dictionary, if each data warehouse is all applied when modeling
The device of the embodiment of the present invention, is verified according to data dictionary, obtains consistent, standard tables of data, then subsequently filling out
Make up the number according to when, then can be filled directly into the tables of data of standard.
The device of checking treatment provided in this embodiment, the method available for performing above method embodiment, this implementation is not
It repeats again.
The device of checking treatment provided in this embodiment, correction verification module carry out the model of data warehouse according to standard term
Verification, when field definition is matched with standard definition and field type is mismatched with type, modified module is targetedly
Field type is revised as it is consistent with type, so as to obtain the model of the unification of standard.
Fig. 9 shows a structural diagram of an electronic device provided by a further embodiment of the present invention.
Referring to Fig. 9, the electronic device provided in the embodiment of the present invention includes a memory 91, a processor 92, a bus 93, and a computer program stored in the memory 91 and executable on the processor.
The memory 91 and the processor 92 communicate with each other through the bus 93.
The processor 92 is used to call the program instructions in the memory 91 and, when executing the program, implements the method shown in Fig. 1.
In another embodiment, the processor implements the following method when executing the program:
The field definition includes a field name and a field description, and the standard definition includes a standard name and a standard description. Correspondingly, the step of verifying the field information according to the pre-stored data dictionary is specifically:
If the field name matches the standard name, verify whether the field description is consistent with the standard description, and verify whether the field type is consistent with the standard type;
or
If the field description matches the standard description, verify whether the field name is consistent with the standard name, and verify whether the field type is consistent with the standard type.
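The either-name-or-description matching described above can be sketched as follows. The function name `match_standard_term` and the dictionary keys are hypothetical, and exact string equality stands in for whatever matching the embodiment actually uses.

```python
def match_standard_term(field, standard_terms):
    """Find the first standard term whose name or description matches the
    field, then report which of the remaining attributes are consistent.

    Assumption: field and each standard term are dicts with the keys
    "name", "description", and "type", compared by exact equality.
    """
    for term in standard_terms:
        name_hit = field["name"] == term["name"]
        description_hit = field["description"] == term["description"]
        if name_hit or description_hit:
            return {
                "matched_term": term["name"],
                "name_ok": name_hit,
                "description_ok": description_hit,
                "type_ok": field["type"] == term["type"],
            }
    return None  # neither the name nor the description matched any term

standard_terms = [{"name": "house_id", "description": "housing listing ID", "type": "int"}]
report = match_standard_term(
    {"name": "houseid", "description": "housing listing ID", "type": "string"},
    standard_terms,
)
```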
In another embodiment, the processor implements the following method when executing the program:
After the step of modifying the field type to be consistent with the standard type if the field definition matches the standard definition and the field type does not match the standard type, the method includes:
If the field definition does not match the standard definition, performing data preprocessing on each piece of field information to obtain multiple morphemes;
Obtaining the pre-stored regulation management library, which includes multiple substitution rules, each substitution rule including a qualifier and a classification word;
If a morpheme matches a qualifier, judging whether the classification word for the morpheme is present;
If it is not present, replacing the morpheme with the morpheme plus the corresponding classification word.
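A minimal sketch of the substitution step follows. It assumes that a substitution rule maps a qualifier to its classification word and that "present" means the classification word already appears among the field's morphemes; both are interpretations, and the function name `apply_substitution_rules` is hypothetical.

```python
def apply_substitution_rules(morphemes, rules):
    """Append the classification word after any qualifier that lacks one.

    rules maps qualifier -> classification word (assumed layout).
    """
    result = []
    for morpheme in morphemes:
        result.append(morpheme)
        classification_word = rules.get(morpheme)
        # Only extend when the morpheme is a known qualifier and its
        # classification word is not already among the morphemes
        if classification_word is not None and classification_word not in morphemes:
            result.append(classification_word)
    return result

rules = {"achievement": "amount"}
expanded = apply_substitution_rules(["achievement"], rules)
```

With the example rule from the description, a lone "achievement" becomes "achievement" followed by "amount", while a field that already contains both is left unchanged.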
In another embodiment, the processor implements the following method when executing the program:
The step of performing data preprocessing on each piece of field information to obtain multiple morphemes if the field definition does not match the standard definition is specifically:
Parsing each piece of field information to generate a corresponding JSON string;
Performing word segmentation on each JSON string to obtain multiple morphemes.
In another embodiment, the processor implements the following method when executing the program:
The morphemes include Chinese morphemes and/or English morphemes. Correspondingly, after the step of judging whether the classification word for the morpheme is present if the morpheme matches a qualifier, the method includes:
If the morpheme does not match any qualifier, obtaining the pre-stored business dictionary, which includes multiple business terms, each business term including a Chinese term and an English term;
If a Chinese morpheme matches a Chinese term and the corresponding English term is not present among the morphemes, adding a remark on the Chinese morpheme so that the English term for the Chinese morpheme can be added;
If an English morpheme matches an English term and the corresponding Chinese term is not present among the morphemes, adding a remark on the English morpheme so that the Chinese term for the English morpheme can be added.
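The business dictionary check can be illustrated as below. The function name `business_dictionary_remarks`, the pair layout of business terms, and the remark wording are assumptions; the Chinese strings are assumed back-translations of the description's "achievement" and "broker" examples and are used only as sample data.

```python
def business_dictionary_remarks(chinese_morphemes, english_morphemes, business_terms):
    """Remark on morphemes whose counterpart in the other language is missing.

    business_terms is assumed to be a list of (Chinese term, English term) pairs.
    """
    remarks = []
    for chinese_term, english_term in business_terms:
        has_chinese = chinese_term in chinese_morphemes
        has_english = english_term in english_morphemes
        if has_chinese and not has_english:
            remarks.append(f"add English term '{english_term}' for Chinese morpheme '{chinese_term}'")
        if has_english and not has_chinese:
            remarks.append(f"add Chinese term '{chinese_term}' for English morpheme '{english_term}'")
    return remarks

# Sample pairs; the Chinese words are assumed back-translations
business_terms = [("业绩", "perf"), ("经纪人", "agent")]
remarks = business_dictionary_remarks(["业绩"], [], business_terms)
```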
In another embodiment, the processor implements the following method when executing the program:
The morphemes include Chinese morphemes and/or English morphemes. Correspondingly, after the step of replacing the morpheme with the morpheme plus the corresponding classification word, the method includes:
Obtaining the pre-stored business dictionary, which includes multiple business terms, each business term including a Chinese term and an English term;
If a Chinese morpheme matches a Chinese term and the corresponding English term is not present among the morphemes, adding a remark on the Chinese morpheme so that the English term for the Chinese morpheme can be added;
If an English morpheme matches an English term and the corresponding Chinese term is not present among the morphemes, adding a remark on the English morpheme so that the Chinese term for the English morpheme can be added.
In another embodiment, the processor implements the following method when executing the program:
After the step of modifying the field type to be consistent with the standard type if the field definition matches the standard definition and the field type does not match the standard type, the method includes:
If the field definition does not match the standard definition, training on the field definition;
If a preset condition is met, taking the field definition as a standard definition.
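The source does not say what the training or the preset condition consists of. Purely as an illustration, the sketch below assumes that "training" means counting how often an unmatched field definition is submitted and that the preset condition is an occurrence threshold; the class name `DefinitionTrainer` and the threshold value are hypothetical.

```python
from collections import Counter

class DefinitionTrainer:
    """Promote an unmatched field definition once it has been seen often enough.

    Assumption: the preset condition is a simple occurrence threshold.
    """

    def __init__(self, threshold=3):
        self.threshold = threshold
        self.counts = Counter()
        self.standard_definitions = set()

    def observe(self, field_definition):
        """Record one unmatched definition; return True once it counts as a standard."""
        self.counts[field_definition] += 1
        if self.counts[field_definition] >= self.threshold:
            self.standard_definitions.add(field_definition)
            return True
        return False

trainer = DefinitionTrainer(threshold=2)
first = trainer.observe("listing price")
second = trainer.observe("listing price")
```

In the closed loop described earlier, a promoted definition would still pass through manual review before entering the dictionaries.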
The electronic device provided in this embodiment can be used to execute the program corresponding to the method of the above method embodiments, which is not repeated here.
With the electronic device provided in this embodiment, when the processor executes the program, the model of the data warehouse is verified according to the standard terms; when the field definition matches the standard definition and the field type does not match the standard type, the field type is specifically modified to be consistent with the standard type, thereby obtaining a model that conforms to a unified standard.
A further embodiment of the present invention provides a storage medium on which a computer program is stored; when the program is executed by a processor, the steps shown in Fig. 1 are implemented.
In another embodiment, the following method is implemented when the program is executed by the processor:
The field definition includes a field name and a field description, and the standard definition includes a standard name and a standard description. Correspondingly, the step of verifying the field information according to the pre-stored data dictionary is specifically:
If the field name matches the standard name, verify whether the field description is consistent with the standard description, and verify whether the field type is consistent with the standard type;
or
If the field description matches the standard description, verify whether the field name is consistent with the standard name, and verify whether the field type is consistent with the standard type.
In another embodiment, the following method is implemented when the program is executed by the processor:
After the step of modifying the field type to be consistent with the standard type if the field definition matches the standard definition and the field type does not match the standard type, the method includes:
If the field definition does not match the standard definition, performing data preprocessing on each piece of field information to obtain multiple morphemes;
Obtaining the pre-stored regulation management library, which includes multiple substitution rules, each substitution rule including a qualifier and a classification word;
If a morpheme matches a qualifier, judging whether the classification word for the morpheme is present;
If it is not present, replacing the morpheme with the morpheme plus the corresponding classification word.
In another embodiment, the following method is implemented when the program is executed by the processor:
The step of performing data preprocessing on each piece of field information to obtain multiple morphemes if the field definition does not match the standard definition is specifically:
Parsing each piece of field information to generate a corresponding JSON string;
Performing word segmentation on each JSON string to obtain multiple morphemes.
In another embodiment, the following method is implemented when the program is executed by the processor:
The morphemes include Chinese morphemes and/or English morphemes. Correspondingly, after the step of judging whether the classification word for the morpheme is present if the morpheme matches a qualifier, the method includes:
If the morpheme does not match any qualifier, obtaining the pre-stored business dictionary, which includes multiple business terms, each business term including a Chinese term and an English term;
If a Chinese morpheme matches a Chinese term and the corresponding English term is not present among the morphemes, adding a remark on the Chinese morpheme so that the English term for the Chinese morpheme can be added;
If an English morpheme matches an English term and the corresponding Chinese term is not present among the morphemes, adding a remark on the English morpheme so that the Chinese term for the English morpheme can be added.
In another embodiment, the following method is implemented when the program is executed by the processor:
The morphemes include Chinese morphemes and/or English morphemes. Correspondingly, after the step of replacing the morpheme with the morpheme plus the corresponding classification word, the method includes:
Obtaining the pre-stored business dictionary, which includes multiple business terms, each business term including a Chinese term and an English term;
If a Chinese morpheme matches a Chinese term and the corresponding English term is not present among the morphemes, adding a remark on the Chinese morpheme so that the English term for the Chinese morpheme can be added;
If an English morpheme matches an English term and the corresponding Chinese term is not present among the morphemes, adding a remark on the English morpheme so that the Chinese term for the English morpheme can be added.
In another embodiment, the following method is implemented when the program is executed by the processor:
After the step of modifying the field type to be consistent with the standard type if the field definition matches the standard definition and the field type does not match the standard type, the method includes:
If the field definition does not match the standard definition, training on the field definition;
If a preset condition is met, taking the field definition as a standard definition.
With the storage medium provided in this embodiment, the method of the above method embodiments is implemented when the program is executed by a processor, which is not repeated here.
The storage medium provided in this embodiment verifies the model of the data warehouse according to the standard terms; when the field definition matches the standard definition and the field type does not match the standard type, the field type is specifically modified to be consistent with the standard type, thereby obtaining a model that conforms to a unified standard.
A further embodiment of the present invention discloses a computer program product. The computer program product includes a computer program stored on a non-transitory computer-readable storage medium, and the computer program includes program instructions. When the program instructions are executed by a computer, the computer is able to perform the methods provided by the above method embodiments, for example including:
Obtaining the model of the data warehouse to be verified, each model including multiple pieces of field information, and the field information including a field definition and a field type;
Verifying the field information according to a pre-stored data dictionary, the data dictionary including multiple standard terms, and each standard term including a standard definition and a standard type;
If the field definition matches the standard definition and the field type does not match the standard type, modifying the field type to be consistent with the standard type.
Those skilled in the art will appreciate that, although some embodiments described herein include certain features that are included in other embodiments rather than other features, combinations of features of different embodiments are meant to be within the scope of the present invention and to form different embodiments.
Those skilled in the art will understand that each step in the embodiments can be implemented in hardware, in software modules running on one or more processors, or in a combination thereof. Those skilled in the art should understand that, in practice, a microprocessor or a digital signal processor (DSP) may be used to implement some or all of the functions of some or all of the components according to the embodiments of the present invention. The present invention may also be implemented as a device or apparatus program (for example, a computer program and a computer program product) for performing part or all of the method described herein.
Although embodiments of the present invention have been described with reference to the accompanying drawings, those skilled in the art can make various modifications and variations without departing from the spirit and scope of the present invention, and all such modifications and variations fall within the scope defined by the appended claims.
Claims (10)
- 1. A checking treatment method, characterized in that the method comprises: obtaining the model of a data warehouse to be verified, each model including multiple pieces of field information, and the field information including a field definition and a field type; verifying the field information according to a pre-stored data dictionary, the data dictionary including multiple standard terms, and each standard term including a standard definition and a standard type; and, if the field definition matches the standard definition and the field type does not match the standard type, modifying the field type to be consistent with the standard type.
- 2. The method according to claim 1, characterized in that: the field definition includes a field name and a field description, and the standard definition includes a standard name and a standard description; correspondingly, the step of verifying the field information according to the pre-stored data dictionary is specifically: if the field name matches the standard name, verifying whether the field description is consistent with the standard description, and verifying whether the field type is consistent with the standard type; or, if the field description matches the standard description, verifying whether the field name is consistent with the standard name, and verifying whether the field type is consistent with the standard type.
- 3. The method according to claim 1, characterized in that: after the step of modifying the field type to be consistent with the standard type if the field definition matches the standard definition and the field type does not match the standard type, the method comprises: if the field definition does not match the standard definition, performing data preprocessing on each piece of field information to obtain multiple morphemes; obtaining a pre-stored regulation management library, the regulation management library including multiple substitution rules, and each substitution rule including a qualifier and a classification word; if a morpheme matches a qualifier, judging whether the classification word for the morpheme is present; and, if it is not present, replacing the morpheme with the morpheme plus the corresponding classification word.
- 4. The method according to claim 3, characterized in that: the step of performing data preprocessing on each piece of field information to obtain multiple morphemes if the field definition does not match the standard definition is specifically: parsing each piece of field information to generate a corresponding JSON string; and performing word segmentation on each JSON string to obtain multiple morphemes.
- 5. The method according to claim 3, characterized in that: the morphemes include Chinese morphemes and/or English morphemes; correspondingly, after the step of judging whether a classification word exists for the morpheme if the morpheme matches a qualifier, the method comprises: if the morpheme does not match the qualifier, obtaining a pre-stored business dictionary, the business dictionary including multiple business terms, each business term including a Chinese term and an English term; if a Chinese morpheme matches a Chinese term and no corresponding English term exists for the morpheme, flagging the Chinese morpheme so that an English term can be added for it; and, if an English morpheme matches an English term and no corresponding Chinese term exists for the morpheme, flagging the English morpheme so that a Chinese term can be added for it.
- 6. The method according to claim 3, characterized in that: the morphemes include Chinese morphemes and/or English morphemes; correspondingly, after the step of replacing the morpheme with the morpheme and the corresponding classification word, the method comprises: obtaining a pre-stored business dictionary, the business dictionary including multiple business terms, each business term including a Chinese term and an English term; if a Chinese morpheme matches a Chinese term and no corresponding English term exists for the morpheme, flagging the Chinese morpheme so that an English term can be added for it; and, if an English morpheme matches an English term and no corresponding Chinese term exists for the morpheme, flagging the English morpheme so that a Chinese term can be added for it.
- 7. The method according to claim 1, characterized in that: after the step of modifying the field type to be consistent with the type if the field definition matches the standard definition but the field type does not match the type, the method comprises: if the field definition does not match the standard definition, performing training on the field definition; and, if a preset condition is met, taking the field definition as a standard definition.
- 8. A verification processing device, characterized in that the device comprises: an acquisition module, for obtaining models of a data warehouse to be verified, each model including multiple pieces of field information, the field information including a field definition and a field type; a verification module, for verifying the field information against a pre-stored data dictionary, the data dictionary including multiple standard terms, each standard term including a standard definition and a type; and a modification module, for modifying the field type to be consistent with the type if the field definition matches the standard definition but the field type does not match the type.
- 9. An electronic device, comprising a memory, a processor, a bus, and a computer program stored in the memory and executable on the processor, characterized in that the processor, when executing the program, implements the steps of any one of claims 1-7.
- 10. A storage medium on which a computer program is stored, characterized in that the program, when executed by a processor, implements the steps of any one of claims 1-7.
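The core steps of claims 1 to 3 can be illustrated with a minimal Python sketch. It is not the patented implementation: the class names `Field` and `StandardTerm`, the functions `verify_field` and `apply_substitution`, the status strings, and the sample values are all hypothetical. Two simplifying assumptions are made: a field "matches" a standard term when the name or the description is exactly equal, and a classification word "exists" for a morpheme when it appears anywhere in the morpheme list.

```python
from dataclasses import dataclass, replace
from typing import Optional


@dataclass(frozen=True)
class StandardTerm:
    """One data-dictionary entry: standard name, description, and type."""
    name: str
    description: str
    type: str


@dataclass(frozen=True)
class Field:
    """One field of a warehouse model: name, description, and type."""
    name: str
    description: str
    type: str


def find_standard(field: Field, dictionary: list[StandardTerm]) -> Optional[StandardTerm]:
    # Assumption: exact equality on name OR description counts as a match.
    for term in dictionary:
        if field.name == term.name or field.description == term.description:
            return term
    return None


def verify_field(field: Field, dictionary: list[StandardTerm]) -> tuple[Field, str]:
    """Return the (possibly corrected) field and a hypothetical status label."""
    term = find_standard(field, dictionary)
    if term is None:
        # Definition does not match: left for the morpheme-based handling.
        return field, "unmatched"
    if field.type != term.type:
        # Definition matches but type differs: align the type with the standard.
        return replace(field, type=term.type), "type_corrected"
    return field, "ok"


def apply_substitution(morphemes: list[str], rules: dict[str, str]) -> list[str]:
    """Append the classification word after a qualifier when it is missing.

    `rules` maps each qualifier to its classification word.
    """
    result = []
    for morpheme in morphemes:
        result.append(morpheme)
        classifier = rules.get(morpheme)
        # Assumption: the classification word "exists" if it occurs anywhere
        # in the original morpheme list.
        if classifier is not None and classifier not in morphemes:
            result.append(classifier)
    return result


dictionary = [StandardTerm("user_id", "user identifier", "bigint")]
fixed, status = verify_field(Field("user_id", "user identifier", "varchar"), dictionary)
expanded = apply_substitution(["order", "create"], {"create": "time"})
```

In this sketch the field `user_id` matches the dictionary entry by name, so its type is changed from `varchar` to `bigint`, and the qualifier `create` gains the classification word `time` because none was present. A real system would likely use fuzzier matching and a persistent rule library.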
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810045917.1A CN108256074B (en) | 2018-01-17 | 2018-01-17 | Verification processing method and device, electronic equipment and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810045917.1A CN108256074B (en) | 2018-01-17 | 2018-01-17 | Verification processing method and device, electronic equipment and storage medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108256074A (en) | 2018-07-06 |
CN108256074B (en) | 2020-06-23 |
Family
ID=62741174
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810045917.1A Active CN108256074B (en) | 2018-01-17 | 2018-01-17 | Verification processing method and device, electronic equipment and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108256074B (en) |
Cited By (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109165209A (en) * | 2018-08-14 | 2019-01-08 | 上海达梦数据库有限公司 | The data verification method, device of object type, equipment and medium in database |
CN109388685A (en) * | 2018-10-23 | 2019-02-26 | 泰华智慧产业集团股份有限公司 | The method and apparatus that will plan that the spatial data that industry uses is put in storage |
CN109408510A (en) * | 2018-10-19 | 2019-03-01 | 中国银行股份有限公司 | A kind of method for normalizing and device of data model |
CN109597763A (en) * | 2018-12-04 | 2019-04-09 | 北京广利核系统工程有限公司 | A kind of consistency verification method and device that multinomial data are normalized |
CN109656912A (en) * | 2018-12-13 | 2019-04-19 | 成都四方伟业软件股份有限公司 | Data model management-control method, device and server |
CN109766436A (en) * | 2018-12-04 | 2019-05-17 | 北京明略软件系统有限公司 | A kind of matched method and apparatus of data element of the field and knowledge base of tables of data |
CN110532561A (en) * | 2019-08-30 | 2019-12-03 | 北京明略软件系统有限公司 | Data detection method and device, storage medium, electronic device |
CN110673888A (en) * | 2019-08-27 | 2020-01-10 | 贝壳技术有限公司 | Verification method and device for configuration file |
CN110795482A (en) * | 2019-10-16 | 2020-02-14 | 浙江大华技术股份有限公司 | Data benchmarking method, device and storage device |
CN110795464A (en) * | 2019-08-28 | 2020-02-14 | 腾讯科技(深圳)有限公司 | Method, device, terminal and storage medium for checking field of object marker data |
CN110909003A (en) * | 2019-11-25 | 2020-03-24 | 车智互联(北京)科技有限公司 | Method for creating data table and computing equipment |
CN111488327A (en) * | 2019-01-29 | 2020-08-04 | 卓望数码技术(深圳)有限公司 | Data standard management method and system |
CN112035451A (en) * | 2020-08-25 | 2020-12-04 | 上海灵长软件科技有限公司 | Data verification optimization processing method and device, electronic equipment and storage medium |
CN112164481A (en) * | 2020-08-17 | 2021-01-01 | 北京广利核系统工程有限公司 | Intelligent verification method and system for nuclear power safety control display equipment database |
CN112733199A (en) * | 2020-12-28 | 2021-04-30 | 北京极豪科技有限公司 | Data processing method and device, electronic equipment and readable storage medium |
CN113642327A (en) * | 2021-10-14 | 2021-11-12 | 中国光大银行股份有限公司 | Method and device for constructing standard knowledge base |
CN114416832A (en) * | 2022-01-26 | 2022-04-29 | 重庆允丰科技有限公司 | Method for configuring formula field and report and computer storage medium |
CN115186650A (en) * | 2022-09-07 | 2022-10-14 | 中国中金财富证券有限公司 | Data detection method and related device |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102346785A (en) * | 2011-11-15 | 2012-02-08 | 北京创腾科技有限公司 | Method and device for directly self-defining field of database |
CN104598598A (en) * | 2015-01-23 | 2015-05-06 | 浙江协同数据系统有限公司 | Method for evaluating relational data standard |
US9135309B2 (en) * | 2006-02-01 | 2015-09-15 | Oracle International Corporation | System and method for building decision trees in a database |
CN107193681A (en) * | 2016-03-15 | 2017-09-22 | 阿里巴巴集团控股有限公司 | Data verification method and device |
Cited By (29)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109165209B (en) * | 2018-08-14 | 2021-06-08 | 上海达梦数据库有限公司 | Data verification method, device, equipment and medium for object types in database |
CN109165209A (en) * | 2018-08-14 | 2019-01-08 | 上海达梦数据库有限公司 | The data verification method, device of object type, equipment and medium in database |
CN109408510A (en) * | 2018-10-19 | 2019-03-01 | 中国银行股份有限公司 | A kind of method for normalizing and device of data model |
CN109388685A (en) * | 2018-10-23 | 2019-02-26 | 泰华智慧产业集团股份有限公司 | The method and apparatus that will plan that the spatial data that industry uses is put in storage |
CN109597763B (en) * | 2018-12-04 | 2022-02-25 | 北京广利核系统工程有限公司 | Consistency verification method and device for normalizing multiple items of data |
CN109597763A (en) * | 2018-12-04 | 2019-04-09 | 北京广利核系统工程有限公司 | A kind of consistency verification method and device that multinomial data are normalized |
CN109766436A (en) * | 2018-12-04 | 2019-05-17 | 北京明略软件系统有限公司 | A kind of matched method and apparatus of data element of the field and knowledge base of tables of data |
CN109656912B (en) * | 2018-12-13 | 2020-08-07 | 成都四方伟业软件股份有限公司 | Data model control method and device and server |
CN109656912A (en) * | 2018-12-13 | 2019-04-19 | 成都四方伟业软件股份有限公司 | Data model management-control method, device and server |
CN111488327B (en) * | 2019-01-29 | 2023-08-22 | 卓望数码技术(深圳)有限公司 | Data standard management method and system |
CN111488327A (en) * | 2019-01-29 | 2020-08-04 | 卓望数码技术(深圳)有限公司 | Data standard management method and system |
CN110673888A (en) * | 2019-08-27 | 2020-01-10 | 贝壳技术有限公司 | Verification method and device for configuration file |
CN110673888B (en) * | 2019-08-27 | 2023-04-07 | 贝壳技术有限公司 | Verification method and device for configuration file |
CN110795464A (en) * | 2019-08-28 | 2020-02-14 | 腾讯科技(深圳)有限公司 | Method, device, terminal and storage medium for checking field of object marker data |
CN110795464B (en) * | 2019-08-28 | 2022-03-04 | 腾讯科技(深圳)有限公司 | Method, device, terminal and storage medium for checking field of object marker data |
CN110532561B (en) * | 2019-08-30 | 2022-12-09 | 北京明略软件系统有限公司 | Data detection method and device, storage medium and electronic device |
CN110532561A (en) * | 2019-08-30 | 2019-12-03 | 北京明略软件系统有限公司 | Data detection method and device, storage medium, electronic device |
CN110795482A (en) * | 2019-10-16 | 2020-02-14 | 浙江大华技术股份有限公司 | Data benchmarking method, device and storage device |
CN110909003B (en) * | 2019-11-25 | 2022-06-10 | 车智互联(北京)科技有限公司 | Method for creating data table and computing equipment |
CN110909003A (en) * | 2019-11-25 | 2020-03-24 | 车智互联(北京)科技有限公司 | Method for creating data table and computing equipment |
CN112164481A (en) * | 2020-08-17 | 2021-01-01 | 北京广利核系统工程有限公司 | Intelligent verification method and system for nuclear power safety control display equipment database |
CN112164481B (en) * | 2020-08-17 | 2023-09-29 | 北京广利核系统工程有限公司 | Intelligent verification method and system for nuclear power safety control display equipment database |
CN112035451A (en) * | 2020-08-25 | 2020-12-04 | 上海灵长软件科技有限公司 | Data verification optimization processing method and device, electronic equipment and storage medium |
CN112733199A (en) * | 2020-12-28 | 2021-04-30 | 北京极豪科技有限公司 | Data processing method and device, electronic equipment and readable storage medium |
CN113642327A (en) * | 2021-10-14 | 2021-11-12 | 中国光大银行股份有限公司 | Method and device for constructing standard knowledge base |
CN114416832A (en) * | 2022-01-26 | 2022-04-29 | 重庆允丰科技有限公司 | Method for configuring formula field and report and computer storage medium |
CN114416832B (en) * | 2022-01-26 | 2022-11-15 | 重庆允丰科技有限公司 | Method for configuring formula field and report and computer storage medium |
CN115186650A (en) * | 2022-09-07 | 2022-10-14 | 中国中金财富证券有限公司 | Data detection method and related device |
CN115186650B (en) * | 2022-09-07 | 2022-12-09 | 中国中金财富证券有限公司 | Data detection method and related device |
Also Published As
Publication number | Publication date |
---|---|
CN108256074B (en) | 2020-06-23 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CN108256074A (en) | Method, apparatus, electronic equipment and the storage medium of checking treatment | |
US10558629B2 (en) | Intelligent data quality | |
US9864741B2 (en) | Automated collective term and phrase index | |
US8762180B2 (en) | Claims analytics engine | |
CN109635298B (en) | Group state identification method and device, computer equipment and storage medium | |
US20160306808A1 (en) | Computer-implemented method for determining roof age of a structure | |
CN110990529B (en) | Industry detail dividing method and system for enterprises | |
CN112035595A (en) | Construction method and device of audit rule engine in medical field and computer equipment | |
CN112036842B (en) | Intelligent matching device for scientific and technological service | |
CN112182246A (en) | Method, system, medium, and application for creating an enterprise representation through big data analysis | |
CN110674131A (en) | Financial statement data processing method and device, computer equipment and storage medium | |
US7822621B1 (en) | Method of and system for populating knowledge bases using rule based systems and object-oriented software | |
CN110968664A (en) | Document retrieval method, device, equipment and medium | |
CN113724057A (en) | Financial budget filling method, system, equipment and medium based on big data | |
CN112036841A (en) | Policy analysis system and method based on intelligent semantic recognition | |
CN114693011A (en) | Policy matching method, device, equipment and medium | |
CN114817526B (en) | Text classification method and device, storage medium and terminal | |
US11830081B2 (en) | Automated return evaluation with anomoly detection | |
US20230410018A1 (en) | Systems and method for determining hygiene in enterprise documents with respect to regulatory obligations | |
CN112416983B (en) | Data processing method and device and computer readable storage medium | |
CN117391643B (en) | Knowledge graph-based medical insurance document auditing method and system | |
CN115953136A (en) | Contract auditing method and device, computer equipment and storage medium | |
CN117788163A (en) | Verification method and device for trade background, computer equipment and storage medium | |
CN116775639A (en) | Data processing method, storage medium and electronic device | |
CN112016268A (en) | Online document processing method and device, computer equipment and readable storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | PB01 | Publication | |
 | SE01 | Entry into force of request for substantive examination | |
 | CB02 | Change of applicant information | Address after: 100085 Floor 102-1, Building No. 35, West Second Banner Road, Haidian District, Beijing. Applicant after: Seashell Housing (Beijing) Technology Co.,Ltd. Address before: 100085 Floor 102-1, Building No. 35, West Second Banner Road, Haidian District, Beijing. Applicant before: LIANJIA(BEIJING) TECHNOLOGY Co.,Ltd. |
 | GR01 | Patent grant | |