CN107766466A - Recognition methods, system, computer-readable recording medium and the equipment of data type - Google Patents
Recognition methods, system, computer-readable recording medium and the equipment of data type Download PDFInfo
- Publication number
- CN107766466A CN107766466A CN201710910740.2A CN201710910740A CN107766466A CN 107766466 A CN107766466 A CN 107766466A CN 201710910740 A CN201710910740 A CN 201710910740A CN 107766466 A CN107766466 A CN 107766466A
- Authority
- CN
- China
- Prior art keywords
- characteristic value
- data
- regular expression
- line
- storehouse
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2455—Query execution
- G06F16/24553—Query execution of query operations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2455—Query execution
- G06F16/24568—Data stream processing; Continuous queries
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The present invention provides a kind of recognition methods of data type, system, computer-readable recording medium and equipment, and recognition methods includes:Edit characteristic value data storehouse;Characteristic value data storehouse includes being used to judge the regular expression of the data type of business data flow and the characteristic value associated with the regular expression;Different business data flows corresponds to different regular expressions;Received business data flow is read line by line, and itself and the regular expression in the characteristic value data storehouse edited are subjected to string matching, if matching, then add up statistical match result corresponding with a characteristic value, continue to read next line data, and the regular expression in next line data and characteristic value data storehouse is subjected to string matching, matching is until statistical result corresponding with this feature value reaches the accumulative upper limit, interrupt match line by line.The present invention reduces the error rate of manual identification, lifts the efficiency of identification;The automatic business processing of data in manufacturing process is realized, is advantageous to promote the intelligent Process of electronics industry.
Description
Technical field
The invention belongs to technical field of data recognition, is related to a kind of recognition methods and system, more particularly to a kind of data
Recognition methods, system, computer-readable recording medium and the equipment of type.
Background technology
The development of society make it that electronic product is more and more inseparable with the production and living of the mankind, and electronic product quality
Quality is limited by the development level of whole electronics industry.With the proposition of the national strategy of made in China 2025, intelligence manufacture is pushed away
To unprecedented height, how to be reduced in electronics industry using intellectualized technology and artificial dependence is had become in it
Requiring.
Data format species is various in electronics industry at present, and general rule of doing is according to oneself by engineer in industry
Experience carries out manual Put on file to design data.Such as:The design data of electronics industry can be related to tens kinds of CAD texts
Part, such as Accel (* .pcb), Cadence (* .cad), CadenceBRD (* .brd), CR3000 (* .BSF*.CCF...),
CR5000 (* .ftf and * pcf), Docica (* .docica), Fatf (* .asc), Gencad (* .cad*.gen), Gencam (*
.gcm), IPC (* .IPC), Mentor (Neutral), ODB++ (* .tgz), ODBxml (* .xml), OrCAD (* .min), Pcad
(* .pdf), PowerPCB (* .asc), Protel (* .pcbdoc*.pcb), TopCAD (* .txf), Unidat (* .uni),
Viscadif (* .paf), Vutrax (* .art) etc., engineer can only make preliminary judgement by extension name at present, so as to enter
Row is sorted out, but this method is running into such a predicament:When file extension is identical or does not have extension name, engineer can not make
Judge.
Therefore it provides a kind of recognition methods of data type, system, computer-readable recording medium and equipment, to solve
File extension is identical or when not having extension name running into for prior art, can not fast and accurately identification data type bottleneck,
This has turned into those skilled in the art's technical problem urgently to be resolved hurrily.
The content of the invention
In view of the above the shortcomings that prior art, it is an object of the invention to provide a kind of identification side of data type
Method, system, computer-readable recording medium and equipment, for solving in the prior art to run into file extension identical or do not have
During extension name, can not fast and accurately identification data type the problem of.
In order to achieve the above objects and other related objects, one aspect of the present invention provides a kind of recognition methods of data type,
For identifying the data type of received business data flow;The recognition methods of the data type includes:Edit characteristic value number
According to storehouse;The characteristic value data storehouse include be used for judge business data flow data type regular expression and with the canonical table
The characteristic value being associated up to formula;Different business data flows corresponds to different regular expressions;Received business is read line by line
Data flow, and itself and the regular expression in the characteristic value data storehouse edited are subjected to string matching, if matching, adds up
Statistical match result corresponding with a characteristic value, continue to read next line data, and by next line data and the characteristic value number
String matching is carried out according to the regular expression in storehouse, matching is until statistical result corresponding with this feature value reaches accumulative line by line
The upper limit, interrupt match.
In one embodiment of the invention, described the step of editing characteristic value data storehouse, includes:Store characteristic value and canonical
Expression formula, to establish the characteristic value data storehouse;In the characteristic value data storehouse, the characteristic value is occurred according to characteristic value
Number carries out descending sort, and the characteristic value after descending sort is associated with regular expression.
In one embodiment of the invention, the characteristic value data storehouse also includes data corresponding with each regular expression
Type marks, adds up matching result and statistical match result.
In one embodiment of the invention, if the business data flow read and the regular expressions in the characteristic value data storehouse
Formula string matching, while the statistical result is added up, the statistical result is also stored in the characteristic value data storehouse.
In one embodiment of the invention, after received business data flow is read line by line, by itself and the feature
Before regular expression in Value Data storehouse carries out string matching, the recognition methods of the data type is also included described in identification
Whether business data flow comes from file data, if it is not, the business data flow read is separated by the carriage return character, and is transferred to described incite somebody to action
The step of it carries out string matching with the regular expression in the characteristic value data storehouse;If so, directly it is transferred to described by it
The step of string matching being carried out with the regular expression in the characteristic value data storehouse.
In one embodiment of the invention, the business data flow and the regular expressions in the characteristic value data storehouse that read
The string matching of formula includes business data flow and the complete matching of regular expression and business data flow and regular expression
Similarity matching.
Another aspect of the present invention provides a kind of identifying system of data type, for identifying received business data flow
Data type;The identifying system of the data type includes:Editor module, for editing characteristic value data storehouse;The characteristic value
Database includes being used to judge the regular expression of the data type of business data flow and the spy associated with the regular expression
Value indicative;Different business data flows corresponds to different regular expressions;Processing module, for reading received business number line by line
Carry out string matching according to stream, and by itself and the regular expression in the characteristic value data storehouse edited, if matching, add up with
Statistical match result corresponding to one characteristic value, continue to read next line data, and by next line data and the characteristic value data
Regular expression in storehouse carries out string matching, line by line matching until statistical result corresponding with this feature value reaches accumulative on
Limit, interrupt match.
In one embodiment of the invention, the processing module, will after received business data flow is read line by line
Before it carries out string matching with the regular expression in the characteristic value data storehouse, the processing module is additionally operable to identify institute
State whether business data flow comes from file data, if it is not, the business data flow read is separated by the carriage return character, and will read
Business data flow and the characteristic value data storehouse in regular expression carry out string matching;If so, it will directly read
Business data flow and the characteristic value data storehouse in regular expression carry out string matching.
Another aspect of the invention provides a kind of computer-readable recording medium, is stored thereon with computer program, the program
The recognition methods of the data type is realized when being executed by processor.
Last aspect of the present invention provides a kind of equipment, including:Processor and memory;The memory is based on storing
Calculation machine program, the processor is used for the computer program for performing the memory storage, so that the equipment performs the number
According to the recognition methods of type.
As described above, recognition methods, system, computer-readable recording medium and the equipment of the data type of the present invention, tool
There is following beneficial effect:
Stored in the recognition methods of data type provided by the invention, system, computer-readable recording medium and equipment
The method that the recognition methods of data type instead of the manual identification cad file type of original dependence engineer's experience, from
And reduce because of the error rate of engineer's manual identification, improve the efficiency of identification;Really realize data in manufacturing process from
Dynamicization processing, be advantageous to promote the intelligent Process of electronics industry.Pass through the reality of the recognition methods of data type of the present invention
Apply and reduced the step of making manual intervention during electronic manufacture, contribute to the cycle for shortening product manufacturing and putting goods on the market, it is real
The agile manufactruing of existing electronics industry, the competitive strength of enterprise.
Brief description of the drawings
Figure 1A is shown as schematic flow sheet of the recognition methods of the data type of the present invention in an embodiment.
Figure 1B is shown as the schematic flow sheet of S11 in the recognition methods of the data type of the present invention.
Fig. 2 is shown as the business data flow example of the reception of the present invention.
Fig. 3 is shown as theory structure schematic diagram of the identifying system of the data type of the present invention in an embodiment.
Component label instructions
The identifying system of 3 data types
31 editor modules
32 processing modules
S11~S16 steps
S111~S112 steps
Embodiment
Illustrate embodiments of the present invention below by way of specific instantiation, those skilled in the art can be by this specification
Disclosed content understands other advantages and effect of the present invention easily.The present invention can also pass through specific realities different in addition
The mode of applying is embodied or practiced, the various details in this specification can also be based on different viewpoints with application, without departing from
Various modifications or alterations are carried out under the spirit of the present invention.It should be noted that in the case where not conflicting, following examples and implementation
Feature in example can be mutually combined.
It should be noted that the diagram provided in following examples only illustrates the basic structure of the present invention in a schematic way
Think, only show the component relevant with the present invention in schema then rather than according to component count, shape and the size during actual implement
Draw, kenel, quantity and the ratio of each component can be a kind of random change during its actual implementation, and its assembly layout kenel
It is likely more complexity.
Embodiment one
The present embodiment provides a kind of recognition methods of data type, for identifying the data class of received business data flow
Type;The recognition methods of the data type includes:
Characteristic value data storehouse is edited in a predefined manner;The characteristic value data storehouse includes being used for the number for judging business data flow
Regular expression and the characteristic value associated with the regular expression according to type;Different business data flows correspond to it is different just
Then expression formula;
Received business data flow is read line by line, and itself and the regular expression in the characteristic value data storehouse are carried out
String matching, if matching, add up statistical match result corresponding with a characteristic value, continue to read next line data, and will
Next line data carry out string matching with the regular expression in the characteristic value data storehouse, and matching is until and this feature line by line
Statistical result corresponding to value reaches the accumulative upper limit, interrupt match.
The recognition methods of the data type provided below with reference to diagram the present embodiment is described in detail.Refer to
Figure 1A, it is shown as schematic flow sheet of the recognition methods of data type in an embodiment.As shown in Figure 1A, the data type
Recognition methods specifically include following steps:
S11, editor's characteristic value data storehouse.The characteristic value data storehouse includes being used for the data type for judging business data flow
Regular expression (RE), (in the present embodiment, characteristic value is data type to the characteristic value associated with the regular expression
Unique identifier ID), the mark (remark) of data type corresponding with each regular expression, accumulative matching result
(RESULT_COUNT, in the present embodiment, the accumulative matching result are consistent with the accumulative upper limit) and statistical match result
(Result).In the present embodiment, with forms mode editor's characteristic value data storehouse.Characteristic value data storehouse example is as shown in table 1.
Table 1:Characteristic value data storehouse example
In the present embodiment, different business data flows corresponds to different regular expressions.The business data flow includes C
++, C, JAVA, the programming language such as Perl.
Figure 1B is referred to, shows S11 schematic flow sheet.As shown in Figure 1B, the S11 includes:
S111, storage regular expression, characteristic value, the marking of data type, accumulative matching result and statistical match result,
To establish the characteristic value data storehouse;
S112, in the characteristic value data storehouse, the characteristic value is subjected to descending sort according to characteristic value occurrence number,
It is and the characteristic value after descending sort is associated with regular expression.In the present embodiment, first more than occurrence number need to be followed
Match somebody with somebody, to prevent the few first identification of characteristic value quantity from completing, erroneous judgement occurs.
For example, it is as shown in table 2 that the characteristic value data storehouse after descending sort is carried out according to characteristic value occurrence number.
Table 2:The characteristic value data storehouse after descending sort is carried out according to characteristic value occurrence number
S12, received business data flow is read line by line, identifies whether the business data flow comes from file data, if
It is no, then S13 is performed, the business data flow read is separated by the carriage return character (" n ", " r "), and be transferred to S14;If so, directly
It is transferred to S14.In the present embodiment, if the business data flow received is separated from file data by the carriage return character.
S14, the regular expression in the business data flow read and the characteristic value data storehouse is subjected to character string
Match somebody with somebody, if matching, performs S15.In the present embodiment, by the business data flow read with being carried out according to characteristic value occurrence number
The regular expression in characteristic value data storehouse after descending sort carries out string matching one by one.If mismatching, S16 is performed,
Current Datarow is skipped, continues to read next line data, is transferred to S14.In the present embodiment, the business data flow read and institute
Stating the string matching of the regular expression in characteristic value data storehouse includes the complete matching of business data flow and regular expression
With business data flow and the Similarity matching of regular expression.
S15, add up statistical match result corresponding with a characteristic value, and continue to read next line data, and return to step
S14, matching is until statistical result corresponding with this feature value reaches the accumulative upper limit, interrupt match, and returns to and matched line by line
Characteristic value.
For example, the business data flow received is as shown in Fig. 2 by the business data flow read with occurring according to characteristic value
The process that the regular expression that number is carried out in the characteristic value data storehouse after descending sort carries out string matching one by one is specific such as
Under:
A) Fig. 2 the 1st row " $ HEADER " and table 2 " ^!PADS | ^ * PADS " matching, matching it is unsuccessful;
" $ HEADER " match with " ^MAXIMUMLAYER " of table 2, and matching is unsuccessful;
" ^ s* $ HEADER " and table 2 ^ s* $ HEADER the match is successful, now Result (statistical match result) word
Segment mark is designated as 1 (being shown in Table 2).Continue to match other regular expressions, the match is successful.
B) Fig. 2 the 2nd row " GENCAD 1.4 " and table 2 " ^!PADS | ^ * PADS " matching, matching it is unsuccessful;
" GENCAD 1.4 " matches with " ^MAXIMUMLAYER " of table 2, and matching is unsuccessful.
Until regular expression " ^A!NET_NAME\!REFDES " is whole to have matched what the match is successful yet.
C) Fig. 2 the 3rd row does not all match any one regular expression to 16 rows.
D) Fig. 2 the 17th row " $ ENDHEADER " and table 2 " ^!PADS | ^ * PADS " matching, matching it is unsuccessful;“$
HEADER " matches with " ^MAXIMUMLAYER " of table 2, and matching is unsuccessful;
Until " ^ s* $ ENDHEADER " the match is successful, now Result (statistical match result) field mark 1 (is shown in Table
2).Continue to match other regular expressions, the match is successful.
E) Fig. 2 the 18th row NULI character is skipped.
F) Fig. 2 the 19th row " $ BOARD " and table 2 " ^!PADS | ^ * PADS " matching, matching it is unsuccessful;“$
HEADER " matches with " ^MAXIMUMLAYER " of table 2, and matching is unsuccessful.
Until " ^ s* $ BOARD " the match is successful, now Result (statistical match result) field mark 1 (being shown in Table 2).
G) now, it is found that ID=2 number of matches has reached the accumulative upper limit 3, interrupt match, return to ID=2 statistics
With result.In the present embodiment,
H) data identification is completed.Return value is available for other programs or method call.
The present embodiment also provides a kind of computer-readable recording medium, is stored thereon with computer program, and the program is located
Reason device realizes the recognition methods of the data type when performing.One of ordinary skill in the art will appreciate that:Realize above-mentioned each side
The all or part of step of method embodiment can be completed by the related hardware of computer program.Foregoing computer program can
To be stored in a computer-readable recording medium.The program upon execution, execution the step of including above-mentioned each method embodiment;
And foregoing storage medium includes:ROM, RAM, magnetic disc or CD etc. are various can be with the medium of store program codes.
The data class stored in a kind of recognition methods of data type of the present embodiment offer and computer-readable recording medium
The method that the recognition methods of type instead of the manual identification cad file type of original dependence engineer's experience, so as to reduce
Because of the error rate of engineer's manual identification, the efficiency of identification is improved;Really realize in manufacturing process at the automation of data
Reason, be advantageous to promote the intelligent Process of electronics industry.Will by the implementation of the recognition methods of data type described in the present embodiment
The step of making manual intervention during electronic manufacture, is reduced, and is contributed to the cycle for shortening product manufacturing and putting goods on the market, is realized electricity
The agile manufactruing of sub-industry, the competitive strength of enterprise.
Embodiment two
The present embodiment provides a kind of identifying system of data type, for identifying the data class of received business data flow
Type;The identifying system of the data type includes:
Editor module, for editing characteristic value data storehouse in a predefined manner;The characteristic value data storehouse includes being used to judge
The regular expression of the data type of business data flow and the characteristic value associated with the regular expression;Different business datums
The corresponding different regular expression of stream;
Processing module, for reading received business data flow line by line, and by itself and the characteristic value data storehouse edited
In regular expression carry out string matching, if matching, add up statistical match result corresponding with a characteristic value, continue to read
Data line is removed, and the regular expression in next line data and the characteristic value data storehouse is subjected to string matching, by
Row matching is until statistical result corresponding with this feature value reaches the accumulative upper limit, interrupt match.
The identifying system of the data type provided below with reference to diagram the present embodiment is described in detail.Need
Bright is, it should be understood that the division of the modules of system identified above is only a kind of division of logic function, when actually realizing
Can completely or partially it be integrated on a physical entity, can also be physically separate.And these modules can be all with software
The form called by treatment element is realized;All it can also realize in the form of hardware;Processing can be passed through with part of module
The form of element calling software realizes that part of module is realized by the form of hardware.For example, x modules can individually be set up
Treatment element, it can also be integrated in some chip of said apparatus and realize, in addition it is also possible to be deposited in the form of program code
It is stored in the memory of said apparatus, is called by some treatment element of said apparatus and perform the function of above x modules.Its
The realization of its module is similar therewith.In addition these modules can completely or partially integrate, and can also independently realize.Here
Described treatment element can be a kind of integrated circuit, have the disposal ability of signal.In implementation process, the above method it is each
Step or more modules can pass through the integrated logic circuit of the hardware in processor elements or the instruction of software form
Complete.
For example, the above module can be arranged to implement one or more integrated circuits of above method, such as:
One or more specific integrated circuits (ApplicationSpecificIntegratedCircuit, abbreviation ASIC), or, one
Or multi-microprocessor (digitalsingnalprocessor, abbreviation DSP), or, one or more field-programmable gate array
Arrange (FieldProgrammableGateArray, abbreviation FPGA) etc..For another example, some module is dispatched by treatment element more than
When the form of program code is realized, the treatment element can be general processor, such as central processing unit
(CentralProcessingUnit, abbreviation CPU) or it is other can be with the processor of caller code.For another example, these modules can
To integrate, realized in the form of on-chip system (system-on-a-chip, abbreviation SOC).
Referring to Fig. 3, it is shown as theory structure schematic diagram of the identifying system of data type in an embodiment.Such as Fig. 3
Shown, the identifying system 3 of the data type includes editor module 31 and processing module 32.
The editor module 31 is used to edit characteristic value data storehouse in a predefined manner.The characteristic value data storehouse includes being used for
Judge the regular expression (RE) of the data type of business data flow, the characteristic value associated with the regular expression (in this reality
Apply in example, characteristic value be data type unique identifier ID), the mark of data type corresponding with each regular expression
(remark), add up matching result (RESULT_COUNT, in the present embodiment, the accumulative matching result and the accumulative upper limit one
Cause) and statistical match result (Result).In the present embodiment, the predetermined way is editor's forms mode.In the present embodiment
In, different business data flows corresponds to different regular expressions.The business data flow includes the volume such as C++, C, JAVA, Perl
Cheng Yuyan.
The editor module 31 is specifically used for storage regular expression, characteristic value, the marking of data type, accumulative matching knot
Fruit and statistical match result, to establish the characteristic value data storehouse;In the characteristic value data storehouse, by the characteristic value according to
Characteristic value occurrence number carries out descending sort, and the characteristic value after descending sort is associated with regular expression.In this implementation
In example, the first matching more than occurrence number need to be followed, to prevent the few first identification of characteristic value quantity from completing, erroneous judgement occurs.
The processing module 32 coupled with the editor module 31 is used to read received business data flow line by line, identifies institute
State whether data flow comes from file data, if not come from file data, by the business data flow read by the carriage return character ("
N ", " r ") separate, and the regular expression in the business data flow read and the characteristic value data storehouse is subjected to character string
Matching, if matching, add up statistical match result corresponding with a characteristic value, and continue to read next line data, and return and will read
The business data flow got carries out string matching with the regular expression in the characteristic value data storehouse, line by line matching until with
Statistical result corresponding to this feature value reaches the accumulative upper limit, interrupt match, and returns to the characteristic value matched.If mismatching,
Current Datarow is then skipped, continues to read next line data, performs the business data flow that will be read and the characteristic value data
Regular expression in storehouse carries out string matching, line by line matching until statistical result corresponding with this feature value reaches accumulative on
Limit, interrupt match, and return to the characteristic value matched.
In the present embodiment, after carrying out descending sort by the business data flow read and according to characteristic value occurrence number
Regular expression in characteristic value data storehouse carries out string matching one by one.
If coming from file data, the processing module 32 is used for the business data flow that will be read and the characteristic value data
Regular expression in storehouse carries out string matching, if matching, adds up statistical match result corresponding with a characteristic value, and continue
Next line data are read, and returns to the business data flow that will be read and is carried out with the regular expression in the characteristic value data storehouse
String matching, matching is until statistical result corresponding with this feature value reaches the accumulative upper limit, interrupt match, and returns to line by line
The characteristic value matched.If mismatching, Current Datarow is skipped, continues to read next line data, performs the industry that will be read
The regular expression being engaged in data flow and the characteristic value data storehouse carries out string matching, line by line matching until with this feature value
Corresponding statistical result reaches the accumulative upper limit, interrupt match, and returns to the characteristic value matched.
Embodiment three
The present embodiment provides a kind of equipment, including:Processor, memory, transceiver, communication interface and system bus;Deposit
Reservoir and communication interface are connected with processor and transceiver by system bus and complete mutual communication, and memory is used to deposit
Computer program is stored up, communication interface is used for and other equipment is communicated, and processor and transceiver are used to run computer program,
Equipment is set to perform each step of the recognition methods of data type as described above.
System bus mentioned above can be Peripheral Component Interconnect standard
(PeripheralPomponentInterconnect, abbreviation PCI) bus or EISA
(ExtendedIndustryStandardArchitecture, abbreviation EISA) bus etc..The system bus can be divided into address
Bus, data/address bus, controlling bus etc..For ease of representing, only represented in figure with a thick line, it is not intended that only one total
Line or a type of bus.Communication interface is used for accessing data base device and other equipment (such as client, read-write storehouse
And read-only storehouse) between communication.Memory may include random access memory (RandomAccessMemory, abbreviation RAM),
Nonvolatile memory (non-volatilememory), for example, at least a magnetic disk storage may also also be included.
Above-mentioned processor can be general processor, including central processing unit (CentralProcessingUnit, letter
Claim CPU), network processing unit (NetworkProcessor, abbreviation NP) etc.;It can also be digital signal processor
(DigitalSignalProcessing, abbreviation DSP), application specific integrated circuit
(ApplicationSpecificIntegratedCircuit, abbreviation ASIC), field programmable gate array (Field-
ProgrammableGateArray, abbreviation FPGA) either other PLDs, discrete gate or transistor logic device
Part, discrete hardware components.
In summary, the recognition methods of data type provided by the invention, system, computer-readable recording medium and equipment
The recognition methods of the data type of middle storage instead of the manual identification cad file type of original dependence engineer's experience
Method, so as to reduce the efficiency for because of the error rate of engineer's manual identification, improving identification;Really realize number in manufacturing process
According to automatic business processing, be advantageous to promote electronics industry intelligent Process.Pass through the identification side of data type of the present invention
The implementation of method will reduce the step of manual intervention during electronic manufacture, contribute to the week for shortening product manufacturing and putting goods on the market
Phase, realize the agile manufactruing of electronics industry, the competitive strength of enterprise.So the present invention effectively overcomes in the prior art
Various shortcoming and have high industrial utilization.
The above-described embodiments merely illustrate the principles and effects of the present invention, not for the limitation present invention.It is any ripe
Know the personage of this technology all can carry out modifications and changes under the spirit and scope without prejudice to the present invention to above-described embodiment.Cause
This, those of ordinary skill in the art is complete without departing from disclosed spirit and institute under technological thought such as
Into all equivalent modifications or change, should by the present invention claim be covered.
Claims (10)
1. a kind of recognition methods of data type, it is characterised in that for identifying the data type of received business data flow;
The recognition methods of the data type includes:
Edit characteristic value data storehouse;The characteristic value data storehouse includes being used to judge the canonical table of the data type of business data flow
Up to formula and the characteristic value associated with the regular expression;Different business data flows corresponds to different regular expressions;
Read received business data flow line by line, and itself and the regular expression in the characteristic value data storehouse edited are carried out
String matching, if matching, add up statistical match result corresponding with a characteristic value, continue to read next line data, and will
Next line data carry out string matching with the regular expression in the characteristic value data storehouse, and matching is until and this feature line by line
Statistical result corresponding to value reaches the accumulative upper limit, interrupt match.
2. the recognition methods of data type according to claim 1, it is characterised in that editor's characteristic value data storehouse
Step includes:
Characteristic value and regular expression are stored, to establish the characteristic value data storehouse;
In the characteristic value data storehouse, the characteristic value is subjected to descending sort according to characteristic value occurrence number, and by descending
Characteristic value after sequence is associated with regular expression.
3. the recognition methods of data type according to claim 1, it is characterised in that the characteristic value data storehouse also includes
Data type corresponding with each regular expression marks, adds up matching result and statistical match result.
4. the recognition methods of data type according to claim 3, it is characterised in that if the business data flow read and institute
The regular expression string matching in characteristic value data storehouse is stated, while the statistical result is added up, is also tied the statistics
Fruit is stored in the characteristic value data storehouse.
5. the recognition methods of data type according to claim 3, it is characterised in that reading received business line by line
After data flow, before it is carried out into string matching with the regular expression in the characteristic value data storehouse, the data class
The recognition methods of type also includes identifying whether the business data flow comes from file data, if it is not, the business datum that will be read
Stream is separated by the carriage return character, and is transferred to and described itself and regular expression in the characteristic value data storehouse is carried out into string matching
Step;If so, directly it is transferred to the step that it is carried out to string matching with the regular expression in the characteristic value data storehouse
Suddenly.
6. the recognition methods of data type according to claim 1, it is characterised in that the business data flow read and institute
Stating the string matching of the regular expression in characteristic value data storehouse includes the complete matching of business data flow and regular expression
With business data flow and the Similarity matching of regular expression.
7. a kind of identifying system of data type, it is characterised in that for identifying the data type of received business data flow;
The identifying system of the data type includes:
Editor module, for editing characteristic value data storehouse;The characteristic value data storehouse includes being used for the number for judging business data flow
Regular expression and the characteristic value associated with the regular expression according to type;Different business data flows correspond to it is different just
Then expression formula;
Processing module, for reading received business data flow line by line, and by its with the characteristic value data storehouse edited
Regular expression carries out string matching, if matching, adds up statistical match result corresponding with a characteristic value, continues under reading
Data line, and the regular expression in next line data and the characteristic value data storehouse is subjected to string matching, line by line
With until statistical result corresponding with this feature value reaches the accumulative upper limit, interrupt match.
8. the identifying system of data type according to claim 7, it is characterised in that
The processing module after received business data flow is read line by line, by its with the characteristic value data storehouse just
Before then expression formula carries out string matching, the processing module is additionally operable to identify whether the business data flow comes from number of files
According to if it is not, the business data flow read is separated by the carriage return character, and by the business data flow read and the characteristic value number
String matching is carried out according to the regular expression in storehouse;If so, directly by the business data flow read and the characteristic value number
String matching is carried out according to the regular expression in storehouse.
9. a kind of computer-readable recording medium, is stored thereon with computer program, it is characterised in that the program is held by processor
The recognition methods of data type any one of claim 1 to 6 is realized during row.
A kind of 10. equipment, it is characterised in that including:Processor and memory;
The memory is used to store computer program, and the processor is used for the computer journey for performing the memory storage
Sequence, so that the equipment performs the recognition methods of the data type as any one of claim 1 to 6.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710910740.2A CN107766466A (en) | 2017-09-29 | 2017-09-29 | Recognition methods, system, computer-readable recording medium and the equipment of data type |
PCT/CN2017/119345 WO2019061913A1 (en) | 2017-09-29 | 2017-12-28 | Data type identification method and system, computer readable storage medium and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710910740.2A CN107766466A (en) | 2017-09-29 | 2017-09-29 | Recognition methods, system, computer-readable recording medium and the equipment of data type |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107766466A true CN107766466A (en) | 2018-03-06 |
Family
ID=61267010
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710910740.2A Pending CN107766466A (en) | 2017-09-29 | 2017-09-29 | Recognition methods, system, computer-readable recording medium and the equipment of data type |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN107766466A (en) |
WO (1) | WO2019061913A1 (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109615309A (en) * | 2018-09-25 | 2019-04-12 | 阿里巴巴集团控股有限公司 | A kind of data recording method and device, a kind of calculating equipment and storage medium |
CN111061777A (en) * | 2019-12-10 | 2020-04-24 | 广州电力工程监理有限公司 | Project data statistical analysis method and system |
WO2020232880A1 (en) * | 2019-05-21 | 2020-11-26 | 平安科技(深圳)有限公司 | Data processing method and apparatus, storage medium and terminal device |
CN112087486A (en) * | 2020-07-30 | 2020-12-15 | 山东浪潮通软信息科技有限公司 | Data integration method, equipment and medium for Internet of things equipment |
CN113515680A (en) * | 2021-04-20 | 2021-10-19 | 建信金融科技有限责任公司 | Financial monitoring data processing method and device |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103778210A (en) * | 2014-01-15 | 2014-05-07 | 北京京东尚科信息技术有限公司 | Method and device for judging specific file type of file to be analyzed |
CN104881496A (en) * | 2015-06-15 | 2015-09-02 | 北京金山安全软件有限公司 | File name identification and file cleaning method and device |
CN105653531A (en) * | 2014-11-12 | 2016-06-08 | 中兴通讯股份有限公司 | Method and device for data extraction |
CN105975575A (en) * | 2016-05-04 | 2016-09-28 | 电子科技大学 | Automatic data type recognition method |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102929596B (en) * | 2012-09-21 | 2016-01-06 | 华为技术有限公司 | Code arrange distinguish method and relevant apparatus |
CN104346366B (en) * | 2013-07-30 | 2017-11-24 | 国际商业机器公司 | Extend the method and apparatus of test data |
CN107038161B (en) * | 2015-07-13 | 2021-03-26 | 阿里巴巴集团控股有限公司 | Equipment and method for filtering data |
CN106855842B (en) * | 2015-12-08 | 2020-12-29 | 中国航空工业第六一八研究所 | Program static analysis method based on regular expression |
CN106445795B (en) * | 2016-09-26 | 2019-03-22 | 中国工商银行股份有限公司 | A kind of database SQL Efficiency testing method and device |
-
2017
- 2017-09-29 CN CN201710910740.2A patent/CN107766466A/en active Pending
- 2017-12-28 WO PCT/CN2017/119345 patent/WO2019061913A1/en active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103778210A (en) * | 2014-01-15 | 2014-05-07 | 北京京东尚科信息技术有限公司 | Method and device for judging specific file type of file to be analyzed |
CN105653531A (en) * | 2014-11-12 | 2016-06-08 | 中兴通讯股份有限公司 | Method and device for data extraction |
CN104881496A (en) * | 2015-06-15 | 2015-09-02 | 北京金山安全软件有限公司 | File name identification and file cleaning method and device |
CN105975575A (en) * | 2016-05-04 | 2016-09-28 | 电子科技大学 | Automatic data type recognition method |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109615309A (en) * | 2018-09-25 | 2019-04-12 | 阿里巴巴集团控股有限公司 | A kind of data recording method and device, a kind of calculating equipment and storage medium |
WO2020232880A1 (en) * | 2019-05-21 | 2020-11-26 | 平安科技(深圳)有限公司 | Data processing method and apparatus, storage medium and terminal device |
CN111061777A (en) * | 2019-12-10 | 2020-04-24 | 广州电力工程监理有限公司 | Project data statistical analysis method and system |
CN112087486A (en) * | 2020-07-30 | 2020-12-15 | 山东浪潮通软信息科技有限公司 | Data integration method, equipment and medium for Internet of things equipment |
CN112087486B (en) * | 2020-07-30 | 2022-07-12 | 山东浪潮通软信息科技有限公司 | Data integration method, equipment and medium for Internet of things equipment |
CN113515680A (en) * | 2021-04-20 | 2021-10-19 | 建信金融科技有限责任公司 | Financial monitoring data processing method and device |
Also Published As
Publication number | Publication date |
---|---|
WO2019061913A1 (en) | 2019-04-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107766466A (en) | Recognition methods, system, computer-readable recording medium and the equipment of data type | |
US8230370B2 (en) | Circuit design assisting apparatus, computer-readable medium storing circuit design assisting program, and circuit design assisting method | |
CN114818553B (en) | Chip integrated design method | |
CN114861581B (en) | Auxiliary programming design method of programmable logic device based on image recognition | |
CN109977518A (en) | Design method, system, computer readable storage medium and the equipment of web plate ladder | |
CN104021002B (en) | A kind of PDM system standards part storage method | |
CN107766313A (en) | The introduction method and its terminal of a kind of data list | |
CN106649210A (en) | Data conversion method and device | |
US20070234241A1 (en) | Data processing system and method | |
CN109376546A (en) | Data packet auditing method, system, device and storage medium based on global rule | |
CN111506362B (en) | Processing method, device, storage medium and system for configuration form of game | |
CN113407565A (en) | Cross-database data query method, device and equipment | |
CN107895064A (en) | Component polarity detection method, system, computer-readable recording medium and equipment | |
CN109815635B (en) | Boiler MFT automatic design system and method | |
CN116663479A (en) | PCB-Package collaborative design method | |
CN114140232A (en) | Accounting data conversion method and device and electronic equipment | |
CN111739162B (en) | Automatic PCBA accurate three-dimensional model generation method based on ECAD interface | |
CN114328486A (en) | Data quality checking method and device based on model | |
CN108572948A (en) | The processing method and processing device of doorplate information | |
CN115268846A (en) | Method and device for adding attribute information and computer readable storage medium | |
CN102831531A (en) | Shop decorating method based on electronic commerce platform | |
CN112631920A (en) | Test method, test device, electronic equipment and readable storage medium | |
CN114492282A (en) | Through signal line layout processing method and device, chip and storage medium | |
CN111400991A (en) | Circuit diagram rapid design method based on hardware electrical interface relation | |
US8255856B1 (en) | DC path checking in a hierarchical circuit design |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20180306 |