CN103455471A - Method and device for analyzing text to key value pairs - Google Patents

Method and device for analyzing text to key value pairs Download PDF

Info

Publication number
CN103455471A
CN103455471A CN2012101804966A CN201210180496A CN103455471A CN 103455471 A CN103455471 A CN 103455471A CN 2012101804966 A CN2012101804966 A CN 2012101804966A CN 201210180496 A CN201210180496 A CN 201210180496A CN 103455471 A CN103455471 A CN 103455471A
Authority
CN
China
Prior art keywords
text
variable
source variable
syntactic structure
value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2012101804966A
Other languages
Chinese (zh)
Other versions
CN103455471B (en
Inventor
王尹
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Advanced New Technologies Co Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN201210180496.6A priority Critical patent/CN103455471B/en
Publication of CN103455471A publication Critical patent/CN103455471A/en
Application granted granted Critical
Publication of CN103455471B publication Critical patent/CN103455471B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention provides a method and a device for analyzing text to key value pairs. The methods mainly includes presetting a text analyzing template of text formats; using the text analyzing template for describing formats of text to be processed and formats of target key value pairs; compiling the text analyzing template of the text formats to acquire compiled results recognizable to computer programs; matching in the text to be processed according to source variable and separators to acquire value of the source variable; reserving the source variable the same with target variable and corresponding value as key value pairs corresponding to the text to be processed. The compiled results include the source variable and the separators corresponding to the text formats to be processed. The method and the device can improve efficiency of analyzing the text to the key value pairs.

Description

A kind of method and apparatus that text resolution is become to key-value pair
Technical field
The application relates to technical field of data processing, particularly relates to a kind of method and apparatus that text resolution is become to key-value pair.
Background technology
Key assignments (Key-value) is to being a kind of data store organisation of computer program, it is by a major key (key) unique key assignments (value) that navigates to fast, at the needs fast finding be widely applied in the scene such as save data one to one.
In actual applications, often there is such demand: valuable information in one section text is extracted, and stored by the form of key-value pair.For example, pending text is " icbc001,123456 ", has wherein hidden channel numbering (api) and the such information of serial number (no), and needs general contents extraction wherein out, finally forms following key-value pair { api:icbc001, no:123456}.
Form to pending text is analyzed, and can learn that text is the character string of separating by a set symbol, comprises irregular separator in text.Therefore with reference to existing string operation method, become key-value pair need to use the mode of obtaining substring text resolution, wherein obtain reference position and the final position of substring in text that substring need to be known extraction in advance; In addition, also need to know the position of separators such as ", "; At two substrings of separator place intercepting, and assignment can obtain key-value pair to corresponding variable.
According to above-mentioned parsing thought, the existing mode that becomes the method for key-value pair mainly to adopt character of a character to process text resolution, detailed process is as follows:
At first, for pending text, from first character, start to search backward, the neither one character is arranged is ", " in judgement, finds after this symbol the position that obtains this symbol place;
Secondly, according to the position of ", " that obtain, know reference position and the final position of substring in whole character string of ", " front, and, reference position and the final position of the substring of back in whole character string;
Again, the function of the extraction substring by programming language extracts respectively this two substrings;
Finally, two substrings are distinguished to assignment to api and no.
To sum up, existing text resolution is become to the method for key-value pair need to carry out a character of a character to text search to determine text formatting, and after searching, need to determine the variable form according to complicated text formatting, then after definite variable form, character string that need to these variablees are corresponding extracts, then by its assignment, form key-value pair, also have now and become the method for key-value pair to need full dose traversal many times text resolution.For example, yet for example, text formatting very complicated (having multiple separator), and, in the situation of text size very long (substring length is very long), repeatedly the full dose traversal can cause declining to a great extent of analyzing efficiency.
In a word, need the urgent technical matters solved of those skilled in the art to be exactly: how can improve the efficiency that text resolution is become to key-value pair.
Summary of the invention
The application's technical matters to be solved is to provide a kind of method and apparatus that text resolution is become to key-value pair, can improve the efficiency that text resolution is become to key-value pair.
In order to address the above problem, the application discloses a kind of method that text resolution is become to key-value pair, comprising:
The text resolution template of preset text formatting; Described text resolution template is for describing the form of pending text and the form of target key-value pair;
Text resolution template to described text formatting is compiled, obtain the discernible compiling result of computer program, described compiling result comprises: with the corresponding source variable of the form of pending text and separator, and with the corresponding target variable of the form of target key-value pair;
Mated in pending text according to described source variable and separator, obtained the value of described source variable;
Retain the source variable identical with described target variable and corresponding value, as with the corresponding key-value pair of described pending text.
Preferably, described text resolution template comprises the several rows text; The described template of the text resolution to described text formatting is compiled, and obtains the step of the discernible compiling result of computer program, comprising:
Read the style of writing content originally of described text resolution template;
Determine the set syntactic structure that described style of writing content originally meets; Described set syntactic structure comprises the definition syntactic structure and retains syntactic structure;
From the described style of writing content originally that meets described definition syntactic structure, identify and the corresponding source variable of the form of pending text and separator;
From the described style of writing content originally that meets described reservation syntactic structure, identify and the corresponding target variable of the form of target key-value pair.
Preferably, the described source variable of described foundation and separator are mated in pending text, obtain the step of the value of described source variable, comprising:
From the first character of pending text, order according to described source variable and separator is mated in pending text, in the matching result obtained, the value that first character be first source variable to the text between first separator, the value that the text between two separators be correspondence source variable sequentially.
Preferably, the data structure of described compiling result is class object.
Preferably, described text resolution template is also for describing the data check rule of parsing; Described compiling result also comprises and the corresponding source variable of described data check rule and corresponding verification function;
Described method also comprises:
Utilize the verification function to carry out verification to the value of described source variable, if verification is passed through, carry out the described reservation source variable identical with described target variable and corresponding value, as the step with the corresponding key-value pair of described pending text, if verification is not passed through, abandon described pending text.
Preferably, described text resolution template is also for describing the secondary data processing demands of parsing; Described compiling result also comprises that processing source variable and corresponding new variables and secondary data with the corresponding secondary data of described secondary data processing demands processes function, and described secondary data is processed function for described secondary data being processed to the value assignment of source variable major key to described new variables;
Described method also comprises:
Process source variable and corresponding new variables and secondary data according to the corresponding secondary data of described and described secondary data processing demands and process function, described source variable and corresponding value are carried out to the secondary data processing, obtain corresponding new variables and corresponding value;
Retain the source variable identical with described target variable and new variables and corresponding value, as with the corresponding key-value pair of described pending text.
Preferably, described text resolution template comprises the several rows text; The described template of the text resolution to described text formatting is compiled, and obtains the step of the discernible compiling result of computer program, comprising:
Read the style of writing content originally of described text resolution template;
Determine the set syntactic structure that described style of writing content originally meets; Described set syntactic structure comprises the definition syntactic structure, retains syntactic structure and verification syntactic structure;
From the described style of writing content originally that meets described definition syntactic structure, identify and the corresponding source variable of the form of pending text and separator;
From the described style of writing content originally that meets described reservation syntactic structure, identify and the corresponding target variable of the form of target key-value pair;
From the described style of writing content originally that meets described verification syntactic structure, identify and the corresponding source variable of described data check rule and corresponding verification function.
Preferably, described text resolution template comprises the several rows text; The described template of the text resolution to described text formatting is compiled, and obtains the step of the discernible compiling result of computer program, comprising:
Read the style of writing content originally of described text resolution template;
Determine the set syntactic structure that described style of writing content originally meets; Described set syntactic structure comprises the definition syntactic structure, retains syntactic structure and secondary data processing syntactic structure;
From the described style of writing content originally that meets described definition syntactic structure, identify and the corresponding source variable of the form of pending text and separator;
From the described style of writing content originally that meets described reservation syntactic structure, identify and the corresponding target variable of the form of target key-value pair;
From the described style of writing content originally that meets described secondary data processing syntactic structure, identify and the corresponding secondary data processing source variable of described secondary data processing demands and corresponding new variables and secondary data processing function.
Preferably, the corresponding regular expression of described verification function.
On the other hand, disclosed herein as well is a kind of device that text resolution is become to key-value pair, comprising:
The template preset module, for the text resolution template of preset text formatting; Described text resolution template is for describing the form of pending text and the form of target key-value pair;
The template collector, for the text resolution template to described text formatting, compiled, obtain the discernible compiling result of computer program, described compiling result comprises: with the corresponding source variable of the form of pending text and separator, and with the corresponding target variable of the form of target key-value pair;
The text matches module, for being mated at pending text according to described source variable and separator, obtain the value of described source variable; And
Variable retains module, for retaining the source variable identical with described target variable and being worth accordingly, as with the corresponding key-value pair of described pending text.
Preferably, described text resolution template comprises the several rows text; Described template collector comprises:
Reading submodule, for the style of writing content originally that reads described text resolution template;
Syntactic structure is determined submodule, for the set syntactic structure of determining that described style of writing content originally meets; Described set syntactic structure comprises the definition syntactic structure and retains syntactic structure;
The first recognin module, identify and the corresponding source variable of the form of pending text and separator for the described style of writing content originally from meeting described definition syntactic structure;
The second recognin module, identify and the corresponding target variable of the form of target key-value pair for the described style of writing content originally from meeting described reservation syntactic structure.
Preferably, described text matches module, specifically for the first character from pending text, order according to described source variable and separator is mated in pending text, in the matching result obtained, the value that first character be first source variable to the text between first separator, the value that the text between two separators be correspondence source variable sequentially.
Preferably, the data structure of described compiling result is class object.
Preferably, described text resolution template is also for describing the data check rule of parsing; Described compiling result also comprises and the corresponding source variable of described data check rule and corresponding verification function;
Described device also comprises:
Correction verification module, for utilizing the verification function to carry out verification to the value of described source variable, if verification is passed through, carry out the described reservation source variable identical with described target variable and corresponding value, as the step with the corresponding key-value pair of described pending text, if verification is not passed through, abandon described pending text.
Preferably, described text resolution template is also for describing the secondary data processing demands of parsing; Described compiling result also comprises that processing source variable and corresponding new variables and secondary data with the corresponding secondary data of described secondary data processing demands processes function, and described secondary data is processed function for described secondary data being processed to the value assignment of source variable major key to described new variables;
Described device also comprises:
The secondary treating module, process function for according to the corresponding secondary data of described and described secondary data processing demands, processing source variable and corresponding new variables and secondary data, described source variable and corresponding value are carried out to the secondary data processing, obtain corresponding new variables and corresponding value;
The second variable retains module, for retaining the source variable identical with described target variable and new variables and being worth accordingly, as with the corresponding key-value pair of described pending text.
Compared with prior art, the application has the following advantages:
The application adopts preset text resolution template and the mode of pending text matches, extracts valuable information to realize text resolution from pending text; With respect to full dose traversal repeatedly in prior art, the coupling that the application's Chinese version is resolved template and pending text only needs the full dose traversal one time, therefore can effectively improve analyzing efficiency, especially in the situation that text formatting is very complicated or text size is very long, the advantage that the application improves analyzing efficiency is more remarkable.
In addition, if there is the constantly parsing demand of change such as text formatting change, the processing of data secondary data, data check in resolving, needing frequently, revise in large quantities code in prior art, is also the shortcoming that prior art has very flexible; And the application separates text resolution template and computer program code (the resolving Chinese version is resolved the computer program code of template and the operational correspondences such as coupling of pending text) by pre-configured mode, when having the parsing demand of constantly change, computer program code is without change, only need change text resolution template to get final product, therefore, with respect to prior art, the application can improve the dirigibility that text resolution is become to key-value pair greatly.
The accompanying drawing explanation
Fig. 1 is a kind of process flow diagram that text resolution is become to the embodiment of the method 1 of key-value pair of the application;
Fig. 2 is a kind of process flow diagram that text resolution is become to the embodiment of the method 2 of key-value pair of the application;
Fig. 3 is a kind of process flow diagram that text resolution is become to the embodiment of the method 3 of key-value pair of the application;
Fig. 4 is a kind of structural drawing that text resolution is become to the device embodiment of key-value pair of the application.
Embodiment
For above-mentioned purpose, the feature and advantage that make the application can become apparent more, below in conjunction with the drawings and specific embodiments, the application is described in further detail.
The embodiment of the present application adopts the text resolution template to describe the form of pending text and the form of target key-value pair, and adopts the mode of text resolution template and pending text matches therefrom to extract valuable information to realize text resolution.With respect to full dose traversal repeatedly in prior art, the coupling that the application's Chinese version is resolved template and pending text only needs the full dose traversal one time, therefore can effectively improve analyzing efficiency, especially in the situation that text formatting is very complicated or text size is very long, the advantage that the application improves analyzing efficiency is more remarkable.
With reference to Fig. 1, show a kind of process flow diagram that text resolution is become to the embodiment of the method 1 of key-value pair of the application, specifically can comprise:
The text resolution template of step 101, preset text formatting; Described text resolution template is for describing the form of pending text and the form of target key-value pair;
In the embodiment of the present application, text resolution template itself is also a kind of self-defining programming language, and this language is form for describing pending text and the form of target key-value pair.
In a kind of application example of the application, described text resolution template specifically can comprise definition statement and retain output statement two parts, wherein, definition statement can be used for describing the form of pending text, retains the form that output statement can be used for describing the target key-value pair.
In actual applications, can be by the programmer of the art according to the form situation of pending text and the form situation of target key-value pair, preset described text resolution template.
With reference to table 1, show definition statement and the example that retains output statement in a kind of text resolution template of the application.
Table 1
Figure BDA00001718363600071
Figure BDA00001718363600081
Step 102, the text resolution template of described text formatting is compiled, obtain the discernible compiling result of computer program, described compiling result specifically can comprise: with the corresponding source variable of the form of pending text and separator, and with the corresponding target variable of the form of target key-value pair;
The text resolution template is comprised of the several rows text, computer program can not Direct Recognition these styles of writing this, computer program is discernible is the data structure (as class object etc.) in internal memory, therefore at first the embodiment of the present application is compiled into described text resolution template the discernible compiling result of computer program.Described compiling result specifically can comprise following content: with the corresponding source variable of the form of pending text and separator, and with the corresponding target variable of the form of target key-value pair, particularly, which source variable is arranged in pending text, what separator between source variable is, the position at separator place, the target variable that final target key-value pair need to retain have which etc.
In a preferred embodiment of the present application, the data structure of described compiling result is class object.Wherein, an attribute of class object can comprise the Map data structure, the Map data structure here is a kind of synonym that calls the turn the key-value pair data structure at program design language, the character string expression method of this data structure is the { form that key=value} is such, for example " 123=abc, 456=def} ".Like this, just the content of compiling result can be saved in the Map data structure.
In order to realize above-described compiling content, the compilation operations that need to carry out is for resolving every a line of text resolution template, reads every a line and meets any in set syntactic structure.Find meet any after, the content of preserving this row in the text resolution template according to the internal storage structure of this grammer body.Such as if which identification variables is the such definition statement of #define () will comprise out in this definition statement, separator is which type of identifies, and the information identified is saved in the Map data structure.
According to the principle of above-mentioned compilation operations, in a preferred embodiment of the present application, described text resolution template specifically can comprise the several rows text; The described template of the text resolution to described text formatting is compiled, and obtains the step of the discernible compiling result of computer program, may further include:
Steps A 1, read this content of style of writing of described text resolution template;
Steps A 2, determine the set syntactic structure that this content of described style of writing meets; Described set syntactic structure specifically can comprise the definition syntactic structure and retain syntactic structure;
Steps A 3, from this content of the described style of writing that meets described definition syntactic structure, identify and the corresponding source variable of the form of pending text and separator;
Steps A 4, from this content of the described style of writing that meets described reservation syntactic structure, identify and the corresponding target variable of the form of target key-value pair.
The template of text resolution shown in table 1 of take is example, the process of corresponding compilation operations can comprise: the text that reads every a line in the text resolution template, the text formatting of every a line is defined as defining syntactic structure and retains the wherein a kind of of syntactic structure Four types, wherein
Definition syntactic structure #define (), determine that the source variable of appointment in this section text definition specifically can comprise: api, action and state, the separator of text definition is ", ", only has this kind of;
Retain syntactic structure #keep (), determine that the target variable that need to finally be retained specifically can comprise :@api@and@action@.
Described above is exactly the process of compilation operations, and after carrying out above-mentioned compilation operations, the application can be saved to the internal storage data structure by compiling accordingly result, at this, provides the example of an internal storage data structure:
{“vars”:[“api”,“action”,“state”],“keepVars”:[“api”,“action”],“replaceStrArray”:[“,”,“,”,]}
Wherein, " vars " for storing source variable orderly, " keepVars ", for storing target variable orderly, " replaceStrArray " is for storing separator orderly; In the situation that specified otherwise not, the order in the embodiment of the present application and order are for meaning the character such as source variable, target variable or the character string sequence of positions at text.
Step 103, the described source variable of foundation and separator are mated in pending text, obtain the value of described source variable;
Provide an application scenarios that text resolution is become to key-value pair at this: system for settling account sends to governor's system by the business flowing water information generated, need governor's system to be monitored for these business flowing water information, the pending text wherein sended over is " icbc001, CREATED, S ".Governor's system can become key-value pair by this section pending text resolution, then is saved in pending message queue.System for settling account will be taken out the message body of these key-value pairs from pending message queue, then carries out.Because governor's core processing engine need to use the message of key-value pair form, so all pending flowing water texts all will be extracted into key-value pair.
In a preferred embodiment of the present application, the described source variable of described foundation and separator are mated in pending text, obtain the step of the value of described source variable, may further include:
From the first character of pending text, order according to described source variable and separator is mated in pending text, in the matching result obtained, the value that first character be first source variable to the text between first separator, the value that the text between two separators be correspondence source variable sequentially.
Example on foundation, " vars " stored source variable orderly, " replaceStrArray " stored separator orderly, suppose that pending text is the " icbc001 in above-mentioned application scenarios, CREATED, S ", so, can in pending text, mate, text " icbc001 " between first character and first separator is exactly the value of first source variable " api ", text " CREATED " between first separator and second separator is exactly the value of second source variable " action ", text " S " between second separator and the 3rd separator is exactly the value of the 3rd source variable " state ".
The source variable that step 104, reservation are identical with described target variable and corresponding value, as with the corresponding key-value pair of described pending text.
Owing to describing the span that target key assignments centering major key is arranged in the text resolution template, and resolve and obtained corresponding target variable according to the text resolution template, finally we need " api " and " action " these two variablees.Therefore step 104 can be screened the value of the source variable that gets at present, removes unwanted source variable, what finally retain is exactly { api:icbc001, the key-value pair result that action:CREATED} is such.
With reference to Fig. 2, show a kind of process flow diagram that text resolution is become to the embodiment of the method 2 of key-value pair of the application, specifically can comprise:
The text resolution template of step 201, preset text formatting; Described text resolution template is for describing the form of pending text, the form of target key-value pair and the data check rule of parsing;
Step 202, the text resolution template of described text formatting is compiled, obtain the discernible compiling result of computer program, described compiling result specifically can comprise: with the corresponding source variable of the form of pending text and separator, with the corresponding target variable of the form of target key-value pair, and with the corresponding source variable of described data check rule and corresponding verification function;
Step 203, the described source variable of foundation and separator are mated in pending text, obtain the value of described source variable;
Step 204, utilize the verification function to carry out verification to the value of described source variable, if verification is passed through, perform step 205, if verification is not passed through, perform step 206;
The source variable that step 205, reservation are identical with described target variable and corresponding value, as with the corresponding key-value pair of described pending text.
Step 206, abandon described pending text.
In some cases, in resolving, may exist the expansion of data check to resolve demand, these data checks are the certain data check rule of correspondence usually, and for example, the value of a data verification rule predetermining source variable can not be character "-" etc.
With respect to embodiment of the method 1, the present embodiment adopts the text resolution template to describe the data check rule of resolving, compiling obtains and the corresponding source variable of described data check rule and corresponding verification function, and utilizes the verification function to carry out verification to the value of described source variable.
In a kind of application example of the application, described text resolution template specifically can comprise definition statement, retain output statement and verification statement three parts, wherein, definition statement can be used for describing the form of pending text, retain the form that output statement can be used for describing the target key-value pair, the verification statement can be used for describing the data check rule of resolving.
In a preferred embodiment of the present application, the verification statement can corresponding regular expression.Regular expression is for carrying out the instrument of text matches, usually some common characters and some metacharacters (metacharacters), consists of.Common character comprises the letter and number of capital and small letter, and metacharacter has special implication.The coupling of regular expression can be understood as, and in given character string, finds the part be complementary with given regular expression.Can judge very easily by regular expression whether pending text meets the form of text resolution template regulation.For example, an example of verification statement is, state==regex (S|F), its implication need to meet regular expression " S|F " for " state " variable, if do not meet, verification is not passed through, pending text does not meet form; Also, while only having value corresponding to " state " variable to meet regular expression " S|F ", the corresponding pending text resolution of " state " variable value out just can be retained.
In a preferred embodiment of the present application, described text resolution template specifically can comprise the several rows text; The described template of the text resolution to described text formatting is compiled, and obtains the step of the discernible compiling result of computer program, may further include:
Step B1, read this content of style of writing of described text resolution template;
Step B2, determine the set syntactic structure that this content of described style of writing meets; Described set syntactic structure comprises the definition syntactic structure, retains syntactic structure and verification syntactic structure;
Step B3, from this content of the described style of writing that meets described definition syntactic structure, identify and the corresponding source variable of the form of pending text and separator;
Step B4, from this content of the described style of writing that meets described reservation syntactic structure, identify and the corresponding target variable of the form of target key-value pair;
Step B5, from this content of the described style of writing that meets described verification syntactic structure, identify and the corresponding source variable of described data check rule and corresponding verification function.
Suppose that step B5 determines the described style of writing content originally that meets described verification syntactic structure :@state@==regex (S|F), can therefrom identify source variable to be verified and be "==" left side " state ", need to use the mode of regular expression to be verified, the verification function corresponding with regular expression is regex (S|F).In actual applications, recognition result can be stored to corresponding internal storage data structure, being exemplified as of an internal storage data structure { " equal ": { " state ": [" regex (S|F) "] } }.
Step 204 is used the verification statement of equal the inside to be processed, and variable to be verified is " state ", according to verification statement regex (), means to need by the regular expression verification.Then, after corresponding value is carried out verification by " state ", if meet the text is extracted successfully, if do not meet the text is abandoned, do not do other processing.Suppose that it is S that step 203 is mated the value that obtains " state " variable, meet the content that regular expression " S|F " is corresponding, can determine that current pending text resolution result out needs, and is retained.
With reference to Fig. 3, show a kind of process flow diagram that text resolution is become to the embodiment of the method 3 of key-value pair of the application, specifically can comprise:
The text resolution template of step 301, preset text formatting; Described text resolution template is for describing the form of pending text, the form of target key-value pair and the secondary data processing demands of parsing;
Step 302, the text resolution template of described text formatting is compiled, obtain the discernible compiling result of computer program, described compiling result specifically can comprise: with the corresponding source variable of the form of pending text and separator, with the corresponding target variable of the form of target key-value pair, and process source variable and corresponding new variables and secondary data with the corresponding secondary data of described secondary data processing demands and process function, described secondary data is processed function for described secondary data being processed to the value assignment of source variable major key to described new variables;
Step 303, the described source variable of foundation and separator are mated in pending text, obtain the value of described source variable;
Step 304, the corresponding secondary data of the described and described secondary data processing demands of foundation are processed source variable and corresponding new variables and secondary data and are processed function, described source variable and corresponding value are carried out to the secondary data processing, obtain corresponding new variables and corresponding value;
The source variable that step 305, reservation are identical with described target variable and new variables and corresponding value, as with the corresponding key-value pair of described pending text.
In some cases, in resolving, may exist the expansion that secondary data is processed to resolve demand.For example, if while with the corresponding source variable of the form of pending text, thering is the Map data structure, need to incite somebody to action by secondary treating that { the Map data structure of key=value} is converted into the key-value pair of new variables and corresponding value.
With respect to embodiment of the method 1, the present embodiment adopts the text resolution template to describe the secondary data processing demands of resolving, compiling obtains processing source variable and corresponding new variables and secondary data with the corresponding secondary data of described secondary data processing demands and processes statement, and process source variable and corresponding new variables and secondary data according to the corresponding secondary data of described and described secondary data processing demands and process statement, described source variable and corresponding value are carried out to the secondary data processing, obtain corresponding new variables and corresponding value.
In a kind of application example of the application, described text resolution template specifically can comprise definition statement, retain output statement and secondary data processing statement three parts, wherein, definition statement can be used for describing the form of pending text, retain the form that output statement can be used for describing the target key-value pair, secondary data is processed statement and be can be used for describing the secondary data processing demands of resolving.
With reference to table 2, show the example of definition statement in the another kind of text resolution template of the application, reservation output statement and secondary data processing statement.
Table 2
Figure BDA00001718363600141
In a preferred embodiment of the present application, described text resolution template specifically can comprise the several rows text; The described template of the text resolution to described text formatting is compiled, and obtains the step of the discernible compiling result of computer program, may further include:
Step C1, read this content of style of writing of described text resolution template;
Step C2, determine the set syntactic structure that this content of described style of writing meets; Described set syntactic structure comprises the definition syntactic structure, retains syntactic structure and secondary data processing syntactic structure;
Step C3, from this content of the described style of writing that meets described definition syntactic structure, identify and the corresponding source variable of the form of pending text and separator;
Step C4, from this content of the described style of writing that meets described reservation syntactic structure, identify and the corresponding target variable of the form of target key-value pair;
Step C5, process from meeting described secondary data this content of described style of writing of syntactic structure and identify and the corresponding secondary data of described secondary data processing demands is processed source variable and corresponding new variables and secondary data and processed function.
Suppose that step B5 determines the described style of writing content originally that meets the verification syntactic structure :@no@=getMap (Map, no), can therefrom identify secondary data and process source variable " Map ", and corresponding new variables " no ", and secondary data is processed function getMap ().Suppose that pending text is " icbc001, { no=123456}, CREATED, S ", in actual applications, can be by all compiling result stores to corresponding internal storage data structure, be exemplified as { " vars ": [" api " with table 2 and the corresponding internal storage data structure of described pending text, " Map ", " action ", " state "], " keepVars ": [" api ", " no ", " action "], " subStrings ": [" Map "], " mines ": { " no ": [" getMap (Map, no) "] }, " equal ": { " state ": [" regex (S|F) "] }, " replaceStrArray ": [", ", ", ", ", "], wherein, " subStrings " processes source variable for storing secondary data, " mines " processes the corresponding new variables of source variable and secondary data processing function for storing secondary data.
All source variables that step 304 is used the value of preservation in " mines " to obtain coupling carry out the secondary data processing, and the content of preserving in " mines " is { " no ": [" getMap (Map, no) "] }.Syntactic definition according to getMap (), it is " Map " that secondary data is processed source variable, and the character string that " Map " is the Map data type of a kind of java is expressed form: { no=123456}, can there is a major key that variable is " no " in this data type, therefore secondary treating function getMap (Map, no) value corresponding to this " no " major key can be extracted, assignment is given a new variable " no ".
After the secondary treating of step 304, the variable got and respective value specifically can comprise: { api:icbc001, Map:{no=123456}, no:123456, action:CREATED, state:S}.Further, the value of the variable got through step 305 pair is screened, and removes unwanted variable " Map ", and what finally retain is exactly { api:icbc001, no:123456, action:CREATED, the key-value pair result that state:S} is such.
It should be noted that, except the Map data structure, the data structure that described secondary data is processed source variable can also comprise the properties data structure, and also, the text of properties data structure also needs to carry out secondary treating.Similar to the Map data structure, the properties data structure is also a kind of synonym that calls the turn the key-value pair data structure at program design language, but different from the Map data structure is, separation in the properties data structure between each key-value pair is newline, and there is no outmost " { } ", a kind of example of properties data structure is as follows:
123=abc
456=def
Embodiment of the method 2 and embodiment of the method 3 are introduced the parsing scheme that has the expansion parsing demands such as the processing of data secondary data, data check in resolving respectively; In actual applications, except expansion parsing demand, also likely occurring the parsing demand of the continuous changes such as text formatting change in resolving, need frequently, revise in large quantities code in prior art, is also the shortcoming that prior art has very flexible.
And the application separates text resolution template and computer program code (the resolving Chinese version is resolved the computer program code of template and the operational correspondences such as coupling of pending text) by pre-configured mode, when having the parsing demand of constantly change, only need to revise the content of template file, do not need the update routine code fully, configuration mode can accomplish that the residing operating system of program code do not restart so flexibly, and can play the effect of dynamic-configuration; Therefore, therefore, with respect to prior art, the application can improve the dirigibility that text resolution is become to key-value pair greatly.
In said method embodiment, the description of each embodiment of the method is all emphasized particularly on different fields, there is no the part described in detail in certain embodiment of the method, can get final product referring to the associated description of additive method embodiment.
And, those skilled in the art are easy to expect: the 3 combination in any application of said method embodiment 1-embodiment of the method are all feasible, therefore the combination in any between said method embodiment 1-embodiment of the method 3 is all embodiment of the present invention, but this instructions has not just described in detail one by one at this as space is limited.
With preceding method, embodiment is corresponding, and the application also provides a kind of device that text resolution is become to key-value pair, with reference to the structural drawing shown in Fig. 4, specifically can comprise:
Template preset module 401, for the text resolution template of preset text formatting; Described text resolution template is for describing the form of pending text and the form of target key-value pair;
Template collector 402, for the text resolution template to described text formatting, compiled, obtain the discernible compiling result of computer program, described compiling result comprises: with the corresponding source variable of the form of pending text and separator, and with the corresponding target variable of the form of target key-value pair;
Text matches module 403, for being mated at pending text according to described source variable and separator, obtain the value of described source variable; And
Variable retains module 404, for retaining the source variable identical with described target variable and being worth accordingly, as with the corresponding key-value pair of described pending text.
In a preferred embodiment of the present application, described text resolution template specifically can comprise the several rows text; Correspondingly, described template collector 402 may further include:
Reading submodule, for the style of writing content originally that reads described text resolution template;
Syntactic structure is determined submodule, for the set syntactic structure of determining that described style of writing content originally meets; Described set syntactic structure comprises the definition syntactic structure and retains syntactic structure;
The first recognin module, identify and the corresponding source variable of the form of pending text and separator for the described style of writing content originally from meeting described definition syntactic structure;
The second recognin module, identify and the corresponding target variable of the form of target key-value pair for the described style of writing content originally from meeting described reservation syntactic structure.
In another preferred embodiment of the present application, described text matches module 403, can be specifically for the first character from pending text, order according to described source variable and separator is mated in pending text, in the matching result obtained, the value that first character be first source variable to the text between first separator, the value that the text between two separators be correspondence source variable sequentially.
In the embodiment of the present application, preferably, the data structure of described compiling result is class object.
In another preferred embodiment of the application, described text resolution template is also for describing the data check rule of parsing; Described compiling result can also comprise and the corresponding source variable of described data check rule and corresponding verification function;
Described device can also comprise:
Correction verification module, for utilizing the verification function to carry out verification to the value of described source variable, if verification is passed through, carry out the described reservation source variable identical with described target variable and corresponding value, as the step with the corresponding key-value pair of described pending text, if verification is not passed through, abandon described pending text.
In a preferred embodiment of the present application, described text resolution template is also for describing the secondary data processing demands of parsing; Described compiling result can also comprise that processing source variable and corresponding new variables and secondary data with the corresponding secondary data of described secondary data processing demands processes function, and described secondary data is processed function for described secondary data being processed to the value assignment of source variable major key to described new variables;
Described device can also comprise:
The secondary treating module, process function for according to the corresponding secondary data of described and described secondary data processing demands, processing source variable and corresponding new variables and secondary data, described source variable and corresponding value are carried out to the secondary data processing, obtain corresponding new variables and corresponding value;
The second variable retains module, for retaining the source variable identical with described target variable and new variables and being worth accordingly, as with the corresponding key-value pair of described pending text.
In a preferred embodiment of the present application, described text resolution template specifically can comprise the several rows text; The described template of the text resolution to described text formatting is compiled, and obtains the step of the discernible compiling result of computer program, may further include:
Read the style of writing content originally of described text resolution template;
Determine the set syntactic structure that described style of writing content originally meets; Described set syntactic structure comprises the definition syntactic structure, retains syntactic structure and verification syntactic structure;
From the described style of writing content originally that meets described definition syntactic structure, identify and the corresponding source variable of the form of pending text and separator;
From the described style of writing content originally that meets described reservation syntactic structure, identify and the corresponding target variable of the form of target key-value pair;
From the described style of writing content originally that meets described verification syntactic structure, identify and the corresponding source variable of described data check rule and corresponding verification function.
In the embodiment of the present application, preferably, described verification function can corresponding regular expression.
In another preferred embodiment of the present application, described text resolution template specifically can comprise the several rows text; The described template of the text resolution to described text formatting is compiled, and obtains the step of the discernible compiling result of computer program, may further include:
Read the style of writing content originally of described text resolution template;
Determine the set syntactic structure that described style of writing content originally meets; Described set syntactic structure comprises the definition syntactic structure, retains syntactic structure and secondary data processing syntactic structure;
From the described style of writing content originally that meets described definition syntactic structure, identify and the corresponding source variable of the form of pending text and separator;
From the described style of writing content originally that meets described reservation syntactic structure, identify and the corresponding target variable of the form of target key-value pair;
From the described style of writing content originally that meets described secondary data processing syntactic structure, identify and the corresponding secondary data processing source variable of described secondary data processing demands and corresponding new variables and secondary data processing function.
Each embodiment in this instructions all adopts the mode of going forward one by one to describe, and what each embodiment stressed is and the difference of other embodiment that between each embodiment, identical similar part is mutually referring to getting final product.For device embodiment, because it is substantially similar to embodiment of the method, so description is fairly simple, relevant part gets final product referring to the part explanation of embodiment of the method.
Those skilled in the art should understand, the application's embodiment can be provided as method, system or computer program.Therefore, the application can adopt complete hardware implementation example, implement software example or in conjunction with the form of the embodiment of software and hardware aspect fully.And the application can adopt the form that wherein includes the upper computer program of implementing of computer-usable storage medium (including but not limited to magnetic disk memory, CD-ROM, optical memory etc.) of computer usable program code one or more.
The application describes with reference to process flow diagram and/or the block scheme of method, equipment (system) and computer program according to the embodiment of the present application.Should understand can be in computer program instructions realization flow figure and/or block scheme each flow process and/or the flow process in square frame and process flow diagram and/or block scheme and/or the combination of square frame.Can provide these computer program instructions to the processor of multi-purpose computer, special purpose computer, Embedded Processor or other programmable data processing device to produce a machine, make the instruction of carrying out by the processor of computing machine or other programmable data processing device produce for realizing the device in the function of flow process of process flow diagram or a plurality of flow process and/or square frame of block scheme or a plurality of square frame appointments.
These computer program instructions also can be stored in energy vectoring computer or the computer-readable memory of other programmable data processing device with ad hoc fashion work, make the instruction be stored in this computer-readable memory produce the manufacture that comprises command device, this command device is realized the function of appointment in flow process of process flow diagram or a plurality of flow process and/or square frame of block scheme or a plurality of square frame.
These computer program instructions also can be loaded on computing machine or other programmable data processing device, make and carry out the sequence of operations step to produce computer implemented processing on computing machine or other programmable devices, thereby the instruction of carrying out on computing machine or other programmable devices is provided for realizing the step of the function of appointment in flow process of process flow diagram or a plurality of flow process and/or square frame of block scheme or a plurality of square frame.
Although described the application's preferred embodiment, once those skilled in the art obtain the basic creative concept of cicada, can make other change and modification to these embodiment.So claims are intended to all changes and the modification that are interpreted as comprising preferred embodiment and fall into the application's scope.
Above a kind of method and apparatus that text resolution is become to key-value pair that the application is provided, be described in detail, applied specific case herein the application's principle and embodiment are set forth, the explanation of above embodiment is just for helping to understand the application's method and core concept thereof; Simultaneously, for one of ordinary skill in the art, the thought according to the application, all will change in specific embodiments and applications, and in sum, this description should not be construed as the restriction to the application.

Claims (15)

1. a method that text resolution is become to key-value pair, is characterized in that, comprising:
The text resolution template of preset text formatting; Described text resolution template is for describing the form of pending text and the form of target key-value pair;
Text resolution template to described text formatting is compiled, obtain the discernible compiling result of computer program, described compiling result comprises: with the corresponding source variable of the form of pending text and separator, and with the corresponding target variable of the form of target key-value pair;
Mated in pending text according to described source variable and separator, obtained the value of described source variable;
Retain the source variable identical with described target variable and corresponding value, as with the corresponding key-value pair of described pending text.
2. the method for claim 1, is characterized in that, described text resolution template comprises the several rows text; The described template of the text resolution to described text formatting is compiled, and obtains the step of the discernible compiling result of computer program, comprising:
Read the style of writing content originally of described text resolution template;
Determine the set syntactic structure that described style of writing content originally meets; Described set syntactic structure comprises the definition syntactic structure and retains syntactic structure;
From the described style of writing content originally that meets described definition syntactic structure, identify and the corresponding source variable of the form of pending text and separator;
From the described style of writing content originally that meets described reservation syntactic structure, identify and the corresponding target variable of the form of target key-value pair.
3. the method for claim 1, is characterized in that, the described source variable of described foundation and separator are mated in pending text, obtain the step of the value of described source variable, comprising:
From the first character of pending text, order according to described source variable and separator is mated in pending text, in the matching result obtained, the value that first character be first source variable to the text between first separator, the value that the text between two separators be correspondence source variable sequentially.
4. as claim 1 or 2 or 3 described methods, it is characterized in that, the data structure of described compiling result is class object.
5. as claim 1 or 2 or 3 described methods, it is characterized in that, described text resolution template is also for describing the data check rule of parsing; Described compiling result also comprises and the corresponding source variable of described data check rule and corresponding verification function;
Described method also comprises:
Utilize the verification function to carry out verification to the value of described source variable, if verification is passed through, carry out the described reservation source variable identical with described target variable and corresponding value, as the step with the corresponding key-value pair of described pending text, if verification is not passed through, abandon described pending text.
6. as claim 1 or 2 or 3 described methods, it is characterized in that, described text resolution template is also for describing the secondary data processing demands of parsing; Described compiling result also comprises that processing source variable and corresponding new variables and secondary data with the corresponding secondary data of described secondary data processing demands processes function, and described secondary data is processed function for described secondary data being processed to the value assignment of source variable major key to described new variables;
Described method also comprises:
Process source variable and corresponding new variables and secondary data according to the corresponding secondary data of described and described secondary data processing demands and process function, described source variable and corresponding value are carried out to the secondary data processing, obtain corresponding new variables and corresponding value;
Retain the source variable identical with described target variable and new variables and corresponding value, as with the corresponding key-value pair of described pending text.
7. method as claimed in claim 5, is characterized in that, described text resolution template comprises the several rows text; The described template of the text resolution to described text formatting is compiled, and obtains the step of the discernible compiling result of computer program, comprising:
Read the style of writing content originally of described text resolution template;
Determine the set syntactic structure that described style of writing content originally meets; Described set syntactic structure comprises the definition syntactic structure, retains syntactic structure and verification syntactic structure;
From the described style of writing content originally that meets described definition syntactic structure, identify and the corresponding source variable of the form of pending text and separator;
From the described style of writing content originally that meets described reservation syntactic structure, identify and the corresponding target variable of the form of target key-value pair;
From the described style of writing content originally that meets described verification syntactic structure, identify and the corresponding source variable of described data check rule and corresponding verification function.
8. method as claimed in claim 6, is characterized in that, described text resolution template comprises the several rows text; The described template of the text resolution to described text formatting is compiled, and obtains the step of the discernible compiling result of computer program, comprising:
Read the style of writing content originally of described text resolution template;
Determine the set syntactic structure that described style of writing content originally meets; Described set syntactic structure comprises the definition syntactic structure, retains syntactic structure and secondary data processing syntactic structure;
From the described style of writing content originally that meets described definition syntactic structure, identify and the corresponding source variable of the form of pending text and separator;
From the described style of writing content originally that meets described reservation syntactic structure, identify and the corresponding target variable of the form of target key-value pair;
From the described style of writing content originally that meets described secondary data processing syntactic structure, identify and the corresponding secondary data processing source variable of described secondary data processing demands and corresponding new variables and secondary data processing function.
9. method as claimed in claim 5, is characterized in that, the corresponding regular expression of described verification function.
10. a device that text resolution is become to key-value pair, is characterized in that, comprising:
The template preset module, for the text resolution template of preset text formatting; Described text resolution template is for describing the form of pending text and the form of target key-value pair;
The template collector, for the text resolution template to described text formatting, compiled, obtain the discernible compiling result of computer program, described compiling result comprises: with the corresponding source variable of the form of pending text and separator, and with the corresponding target variable of the form of target key-value pair;
The text matches module, for being mated at pending text according to described source variable and separator, obtain the value of described source variable; And
Variable retains module, for retaining the source variable identical with described target variable and being worth accordingly, as with the corresponding key-value pair of described pending text.
11. device as claimed in claim 10, is characterized in that, described text resolution template comprises the several rows text; Described template collector comprises:
Reading submodule, for the style of writing content originally that reads described text resolution template;
Syntactic structure is determined submodule, for the set syntactic structure of determining that described style of writing content originally meets; Described set syntactic structure comprises the definition syntactic structure and retains syntactic structure;
The first recognin module, identify and the corresponding source variable of the form of pending text and separator for the described style of writing content originally from meeting described definition syntactic structure;
The second recognin module, identify and the corresponding target variable of the form of target key-value pair for the described style of writing content originally from meeting described reservation syntactic structure.
12. device as claimed in claim 10, it is characterized in that, described text matches module, specifically for the first character from pending text, order according to described source variable and separator is mated in pending text, in the matching result obtained, the value that first character be first source variable to the text between first separator, the value that the text between two separators be correspondence source variable sequentially.
13. as claim 10 or 11 or 12 described devices, it is characterized in that, the data structure of described compiling result is class object.
14. as claim 10 or 11 or 12 described devices, it is characterized in that, described text resolution template is also for describing the data check rule of parsing; Described compiling result also comprises and the corresponding source variable of described data check rule and corresponding verification function;
Described device also comprises:
Correction verification module, for utilizing the verification function to carry out verification to the value of described source variable, if verification is passed through, carry out the described reservation source variable identical with described target variable and corresponding value, as the step with the corresponding key-value pair of described pending text, if verification is not passed through, abandon described pending text.
15. as claim 10 or 11 or 12 described devices, it is characterized in that, described text resolution template is also for describing the secondary data processing demands of parsing; Described compiling result also comprises that processing source variable and corresponding new variables and secondary data with the corresponding secondary data of described secondary data processing demands processes function, and described secondary data is processed function for described secondary data being processed to the value assignment of source variable major key to described new variables;
Described device also comprises:
The secondary treating module, process function for according to the corresponding secondary data of described and described secondary data processing demands, processing source variable and corresponding new variables and secondary data, described source variable and corresponding value are carried out to the secondary data processing, obtain corresponding new variables and corresponding value;
The second variable retains module, for retaining the source variable identical with described target variable and new variables and being worth accordingly, as with the corresponding key-value pair of described pending text.
CN201210180496.6A 2012-06-01 2012-06-01 A kind of method and apparatus that text resolution is become key-value pair Active CN103455471B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210180496.6A CN103455471B (en) 2012-06-01 2012-06-01 A kind of method and apparatus that text resolution is become key-value pair

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210180496.6A CN103455471B (en) 2012-06-01 2012-06-01 A kind of method and apparatus that text resolution is become key-value pair

Publications (2)

Publication Number Publication Date
CN103455471A true CN103455471A (en) 2013-12-18
CN103455471B CN103455471B (en) 2016-12-14

Family

ID=49737858

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210180496.6A Active CN103455471B (en) 2012-06-01 2012-06-01 A kind of method and apparatus that text resolution is become key-value pair

Country Status (1)

Country Link
CN (1) CN103455471B (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017107738A1 (en) * 2015-12-24 2017-06-29 北京国双科技有限公司 Rule matching method and device
CN107977440A (en) * 2017-12-07 2018-05-01 网宿科技股份有限公司 A kind of methods, devices and systems for parsing data file
CN108170432A (en) * 2017-12-12 2018-06-15 上海天旦网络科技发展有限公司 Configurable decoded packet data system and method
CN108345511A (en) * 2017-01-24 2018-07-31 阿里巴巴集团控股有限公司 A kind of application data verification method, device and electronic equipment
CN108491364A (en) * 2018-01-25 2018-09-04 苏州麦迪斯顿医疗科技股份有限公司 Medical treatment and nursing paperwork management system
CN108511077A (en) * 2018-01-25 2018-09-07 苏州麦迪斯顿医疗科技股份有限公司 Medical treatment and nursing document creation method, electronic equipment, storage medium
CN108989902A (en) * 2018-07-03 2018-12-11 武汉斗鱼网络科技有限公司 A kind of processing method, device, terminal and the storage medium of barrage message
CN109299887A (en) * 2018-11-05 2019-02-01 阿里巴巴集团控股有限公司 A kind of data processing method, device and electronic equipment
CN109726284A (en) * 2018-12-07 2019-05-07 成都品果科技有限公司 A kind of versatile data analysing method
CN111950257A (en) * 2020-07-31 2020-11-17 新华三半导体技术有限公司 File analysis method and device
CN113553297A (en) * 2021-06-08 2021-10-26 优刻得科技股份有限公司 Management method and system for switch configuration information

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101164039A (en) * 2005-03-02 2008-04-16 谷歌公司 Generating structured information
US20080275693A1 (en) * 2004-09-02 2008-11-06 Yen-Fu Chen Method, system and computer program product for national language support using a multi-language property file
CN101799808A (en) * 2009-02-10 2010-08-11 中国移动通信集团公司 Data processing method and system thereof

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080275693A1 (en) * 2004-09-02 2008-11-06 Yen-Fu Chen Method, system and computer program product for national language support using a multi-language property file
CN101164039A (en) * 2005-03-02 2008-04-16 谷歌公司 Generating structured information
CN101799808A (en) * 2009-02-10 2010-08-11 中国移动通信集团公司 Data processing method and system thereof

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
JEFFREY DEAN等: "MapReduce: Simplified Data Processing on Large Clusters", 《COMMUNICATIONS OF THE ACM》 *

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017107738A1 (en) * 2015-12-24 2017-06-29 北京国双科技有限公司 Rule matching method and device
US10796077B2 (en) 2015-12-24 2020-10-06 Beijing Gridsum Technology Co., Ltd. Rule matching method and device
CN108345511A (en) * 2017-01-24 2018-07-31 阿里巴巴集团控股有限公司 A kind of application data verification method, device and electronic equipment
CN108345511B (en) * 2017-01-24 2022-02-08 阿里巴巴集团控股有限公司 Application data checking method and device and electronic equipment
CN107977440A (en) * 2017-12-07 2018-05-01 网宿科技股份有限公司 A kind of methods, devices and systems for parsing data file
CN108170432A (en) * 2017-12-12 2018-06-15 上海天旦网络科技发展有限公司 Configurable decoded packet data system and method
CN108170432B (en) * 2017-12-12 2020-10-16 上海天旦网络科技发展有限公司 Configurable data packet decoding system and method
CN108491364A (en) * 2018-01-25 2018-09-04 苏州麦迪斯顿医疗科技股份有限公司 Medical treatment and nursing paperwork management system
CN108511077A (en) * 2018-01-25 2018-09-07 苏州麦迪斯顿医疗科技股份有限公司 Medical treatment and nursing document creation method, electronic equipment, storage medium
CN108989902A (en) * 2018-07-03 2018-12-11 武汉斗鱼网络科技有限公司 A kind of processing method, device, terminal and the storage medium of barrage message
CN108989902B (en) * 2018-07-03 2020-10-16 武汉斗鱼网络科技有限公司 Barrage message processing method and device, terminal and storage medium
CN109299887A (en) * 2018-11-05 2019-02-01 阿里巴巴集团控股有限公司 A kind of data processing method, device and electronic equipment
CN109299887B (en) * 2018-11-05 2022-04-19 创新先进技术有限公司 Data processing method and device and electronic equipment
CN109726284A (en) * 2018-12-07 2019-05-07 成都品果科技有限公司 A kind of versatile data analysing method
CN111950257A (en) * 2020-07-31 2020-11-17 新华三半导体技术有限公司 File analysis method and device
CN113553297A (en) * 2021-06-08 2021-10-26 优刻得科技股份有限公司 Management method and system for switch configuration information
CN113553297B (en) * 2021-06-08 2023-01-06 优刻得科技股份有限公司 Management method and system for switch configuration information

Also Published As

Publication number Publication date
CN103455471B (en) 2016-12-14

Similar Documents

Publication Publication Date Title
CN103455471A (en) Method and device for analyzing text to key value pairs
US8977626B2 (en) Indexing and searching a data collection
CN106843840B (en) Source code version evolution annotation multiplexing method based on similarity analysis
CN110770735B (en) Transcoding of documents with embedded mathematical expressions
CN104462085A (en) Method and device for correcting search keywords
EP3869511A1 (en) Method and system for annotating scope of claims of gene sequence, method and system for searching gene sequence, and method and system for annotating information of gene sequence
CN107203468A (en) A kind of software version evolution comparative analysis method based on AST
CN103927164B (en) A kind of script obscures method and system
US8484229B2 (en) Method and system for identifying traditional arabic poems
CN103440252A (en) Method and device for extracting parallel information in Chinese sentence
CN106598828A (en) Method and device for determining invalid class in source code
CN111159497A (en) Regular expression generation method and regular expression-based data extraction method
CN111079408A (en) Language identification method, device, equipment and storage medium
CN112287071A (en) Text relation extraction method and device and electronic equipment
CN110825840B (en) Word bank expansion method, device, equipment and storage medium
KR20130093889A (en) Apparatus and method for interpreting korean keyword search phrase
CN105094941A (en) Method and device for achieving multiple languages
US9208134B2 (en) Methods and systems for tokenizing multilingual textual documents
CN104424399A (en) Knowledge navigation method, device and system based on virus protein body
US20200387815A1 (en) Building training data and similarity relations for semantic space
CN109558580B (en) Text analysis method and device
WO2019148797A1 (en) Natural language processing method, device, computer apparatus, and storage medium
JP5273884B1 (en) Structure analysis apparatus and program
CN109522921A (en) Statement similarity method of discrimination and equipment
CN111581162B (en) Ontology-based clustering method for mass literature data

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20191210

Address after: P.O. Box 31119, grand exhibition hall, hibiscus street, 802 West Bay Road, Grand Cayman, Cayman Islands

Patentee after: Innovative advanced technology Co., Ltd

Address before: A four-storey 847 mailbox in Grand Cayman Capital Building, British Cayman Islands

Patentee before: Alibaba Group Holding Co., Ltd.