CN107766466A - Recognition methods, system, computer-readable recording medium and the equipment of data type - Google Patents

Recognition methods, system, computer-readable recording medium and the equipment of data type Download PDF

Info

Publication number
CN107766466A
CN107766466A CN201710910740.2A CN201710910740A CN107766466A CN 107766466 A CN107766466 A CN 107766466A CN 201710910740 A CN201710910740 A CN 201710910740A CN 107766466 A CN107766466 A CN 107766466A
Authority
CN
China
Prior art keywords
characteristic value
data
regular expression
line
storehouse
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710910740.2A
Other languages
Chinese (zh)
Inventor
钱胜杰
瞿永建
刘继硕
刘丰收
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Vayo Shanghai Technology Co Ltd
Original Assignee
Vayo Shanghai Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Vayo Shanghai Technology Co Ltd filed Critical Vayo Shanghai Technology Co Ltd
Priority to CN201710910740.2A priority Critical patent/CN107766466A/en
Priority to PCT/CN2017/119345 priority patent/WO2019061913A1/en
Publication of CN107766466A publication Critical patent/CN107766466A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • G06F16/24553Query execution of query operations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • G06F16/24568Data stream processing; Continuous queries

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The present invention provides a kind of recognition methods of data type, system, computer-readable recording medium and equipment, and recognition methods includes:Edit characteristic value data storehouse;Characteristic value data storehouse includes being used to judge the regular expression of the data type of business data flow and the characteristic value associated with the regular expression;Different business data flows corresponds to different regular expressions;Received business data flow is read line by line, and itself and the regular expression in the characteristic value data storehouse edited are subjected to string matching, if matching, then add up statistical match result corresponding with a characteristic value, continue to read next line data, and the regular expression in next line data and characteristic value data storehouse is subjected to string matching, matching is until statistical result corresponding with this feature value reaches the accumulative upper limit, interrupt match line by line.The present invention reduces the error rate of manual identification, lifts the efficiency of identification;The automatic business processing of data in manufacturing process is realized, is advantageous to promote the intelligent Process of electronics industry.

Description

Recognition methods, system, computer-readable recording medium and the equipment of data type
Technical field
The invention belongs to technical field of data recognition, is related to a kind of recognition methods and system, more particularly to a kind of data Recognition methods, system, computer-readable recording medium and the equipment of type.
Background technology
The development of society make it that electronic product is more and more inseparable with the production and living of the mankind, and electronic product quality Quality is limited by the development level of whole electronics industry.With the proposition of the national strategy of made in China 2025, intelligence manufacture is pushed away To unprecedented height, how to be reduced in electronics industry using intellectualized technology and artificial dependence is had become in it Requiring.
Data format species is various in electronics industry at present, and general rule of doing is according to oneself by engineer in industry Experience carries out manual Put on file to design data.Such as:The design data of electronics industry can be related to tens kinds of CAD texts Part, such as Accel (* .pcb), Cadence (* .cad), CadenceBRD (* .brd), CR3000 (* .BSF*.CCF...), CR5000 (* .ftf and * pcf), Docica (* .docica), Fatf (* .asc), Gencad (* .cad*.gen), Gencam (* .gcm), IPC (* .IPC), Mentor (Neutral), ODB++ (* .tgz), ODBxml (* .xml), OrCAD (* .min), Pcad (* .pdf), PowerPCB (* .asc), Protel (* .pcbdoc*.pcb), TopCAD (* .txf), Unidat (* .uni), Viscadif (* .paf), Vutrax (* .art) etc., engineer can only make preliminary judgement by extension name at present, so as to enter Row is sorted out, but this method is running into such a predicament:When file extension is identical or does not have extension name, engineer can not make Judge.
Therefore it provides a kind of recognition methods of data type, system, computer-readable recording medium and equipment, to solve File extension is identical or when not having extension name running into for prior art, can not fast and accurately identification data type bottleneck, This has turned into those skilled in the art's technical problem urgently to be resolved hurrily.
The content of the invention
In view of the above the shortcomings that prior art, it is an object of the invention to provide a kind of identification side of data type Method, system, computer-readable recording medium and equipment, for solving in the prior art to run into file extension identical or do not have During extension name, can not fast and accurately identification data type the problem of.
In order to achieve the above objects and other related objects, one aspect of the present invention provides a kind of recognition methods of data type, For identifying the data type of received business data flow;The recognition methods of the data type includes:Edit characteristic value number According to storehouse;The characteristic value data storehouse include be used for judge business data flow data type regular expression and with the canonical table The characteristic value being associated up to formula;Different business data flows corresponds to different regular expressions;Received business is read line by line Data flow, and itself and the regular expression in the characteristic value data storehouse edited are subjected to string matching, if matching, adds up Statistical match result corresponding with a characteristic value, continue to read next line data, and by next line data and the characteristic value number String matching is carried out according to the regular expression in storehouse, matching is until statistical result corresponding with this feature value reaches accumulative line by line The upper limit, interrupt match.
In one embodiment of the invention, described the step of editing characteristic value data storehouse, includes:Store characteristic value and canonical Expression formula, to establish the characteristic value data storehouse;In the characteristic value data storehouse, the characteristic value is occurred according to characteristic value Number carries out descending sort, and the characteristic value after descending sort is associated with regular expression.
In one embodiment of the invention, the characteristic value data storehouse also includes data corresponding with each regular expression Type marks, adds up matching result and statistical match result.
In one embodiment of the invention, if the business data flow read and the regular expressions in the characteristic value data storehouse Formula string matching, while the statistical result is added up, the statistical result is also stored in the characteristic value data storehouse.
In one embodiment of the invention, after received business data flow is read line by line, by itself and the feature Before regular expression in Value Data storehouse carries out string matching, the recognition methods of the data type is also included described in identification Whether business data flow comes from file data, if it is not, the business data flow read is separated by the carriage return character, and is transferred to described incite somebody to action The step of it carries out string matching with the regular expression in the characteristic value data storehouse;If so, directly it is transferred to described by it The step of string matching being carried out with the regular expression in the characteristic value data storehouse.
In one embodiment of the invention, the business data flow and the regular expressions in the characteristic value data storehouse that read The string matching of formula includes business data flow and the complete matching of regular expression and business data flow and regular expression Similarity matching.
Another aspect of the present invention provides a kind of identifying system of data type, for identifying received business data flow Data type;The identifying system of the data type includes:Editor module, for editing characteristic value data storehouse;The characteristic value Database includes being used to judge the regular expression of the data type of business data flow and the spy associated with the regular expression Value indicative;Different business data flows corresponds to different regular expressions;Processing module, for reading received business number line by line Carry out string matching according to stream, and by itself and the regular expression in the characteristic value data storehouse edited, if matching, add up with Statistical match result corresponding to one characteristic value, continue to read next line data, and by next line data and the characteristic value data Regular expression in storehouse carries out string matching, line by line matching until statistical result corresponding with this feature value reaches accumulative on Limit, interrupt match.
In one embodiment of the invention, the processing module, will after received business data flow is read line by line Before it carries out string matching with the regular expression in the characteristic value data storehouse, the processing module is additionally operable to identify institute State whether business data flow comes from file data, if it is not, the business data flow read is separated by the carriage return character, and will read Business data flow and the characteristic value data storehouse in regular expression carry out string matching;If so, it will directly read Business data flow and the characteristic value data storehouse in regular expression carry out string matching.
Another aspect of the invention provides a kind of computer-readable recording medium, is stored thereon with computer program, the program The recognition methods of the data type is realized when being executed by processor.
Last aspect of the present invention provides a kind of equipment, including:Processor and memory;The memory is based on storing Calculation machine program, the processor is used for the computer program for performing the memory storage, so that the equipment performs the number According to the recognition methods of type.
As described above, recognition methods, system, computer-readable recording medium and the equipment of the data type of the present invention, tool There is following beneficial effect:
Stored in the recognition methods of data type provided by the invention, system, computer-readable recording medium and equipment The method that the recognition methods of data type instead of the manual identification cad file type of original dependence engineer's experience, from And reduce because of the error rate of engineer's manual identification, improve the efficiency of identification;Really realize data in manufacturing process from Dynamicization processing, be advantageous to promote the intelligent Process of electronics industry.Pass through the reality of the recognition methods of data type of the present invention Apply and reduced the step of making manual intervention during electronic manufacture, contribute to the cycle for shortening product manufacturing and putting goods on the market, it is real The agile manufactruing of existing electronics industry, the competitive strength of enterprise.
Brief description of the drawings
Figure 1A is shown as schematic flow sheet of the recognition methods of the data type of the present invention in an embodiment.
Figure 1B is shown as the schematic flow sheet of S11 in the recognition methods of the data type of the present invention.
Fig. 2 is shown as the business data flow example of the reception of the present invention.
Fig. 3 is shown as theory structure schematic diagram of the identifying system of the data type of the present invention in an embodiment.
Component label instructions
The identifying system of 3 data types
31 editor modules
32 processing modules
S11~S16 steps
S111~S112 steps
Embodiment
Illustrate embodiments of the present invention below by way of specific instantiation, those skilled in the art can be by this specification Disclosed content understands other advantages and effect of the present invention easily.The present invention can also pass through specific realities different in addition The mode of applying is embodied or practiced, the various details in this specification can also be based on different viewpoints with application, without departing from Various modifications or alterations are carried out under the spirit of the present invention.It should be noted that in the case where not conflicting, following examples and implementation Feature in example can be mutually combined.
It should be noted that the diagram provided in following examples only illustrates the basic structure of the present invention in a schematic way Think, only show the component relevant with the present invention in schema then rather than according to component count, shape and the size during actual implement Draw, kenel, quantity and the ratio of each component can be a kind of random change during its actual implementation, and its assembly layout kenel It is likely more complexity.
Embodiment one
The present embodiment provides a kind of recognition methods of data type, for identifying the data class of received business data flow Type;The recognition methods of the data type includes:
Characteristic value data storehouse is edited in a predefined manner;The characteristic value data storehouse includes being used for the number for judging business data flow Regular expression and the characteristic value associated with the regular expression according to type;Different business data flows correspond to it is different just Then expression formula;
Received business data flow is read line by line, and itself and the regular expression in the characteristic value data storehouse are carried out String matching, if matching, add up statistical match result corresponding with a characteristic value, continue to read next line data, and will Next line data carry out string matching with the regular expression in the characteristic value data storehouse, and matching is until and this feature line by line Statistical result corresponding to value reaches the accumulative upper limit, interrupt match.
The recognition methods of the data type provided below with reference to diagram the present embodiment is described in detail.Refer to Figure 1A, it is shown as schematic flow sheet of the recognition methods of data type in an embodiment.As shown in Figure 1A, the data type Recognition methods specifically include following steps:
S11, editor's characteristic value data storehouse.The characteristic value data storehouse includes being used for the data type for judging business data flow Regular expression (RE), (in the present embodiment, characteristic value is data type to the characteristic value associated with the regular expression Unique identifier ID), the mark (remark) of data type corresponding with each regular expression, accumulative matching result (RESULT_COUNT, in the present embodiment, the accumulative matching result are consistent with the accumulative upper limit) and statistical match result (Result).In the present embodiment, with forms mode editor's characteristic value data storehouse.Characteristic value data storehouse example is as shown in table 1.
Table 1:Characteristic value data storehouse example
In the present embodiment, different business data flows corresponds to different regular expressions.The business data flow includes C ++, C, JAVA, the programming language such as Perl.
Figure 1B is referred to, shows S11 schematic flow sheet.As shown in Figure 1B, the S11 includes:
S111, storage regular expression, characteristic value, the marking of data type, accumulative matching result and statistical match result, To establish the characteristic value data storehouse;
S112, in the characteristic value data storehouse, the characteristic value is subjected to descending sort according to characteristic value occurrence number, It is and the characteristic value after descending sort is associated with regular expression.In the present embodiment, first more than occurrence number need to be followed Match somebody with somebody, to prevent the few first identification of characteristic value quantity from completing, erroneous judgement occurs.
For example, it is as shown in table 2 that the characteristic value data storehouse after descending sort is carried out according to characteristic value occurrence number.
Table 2:The characteristic value data storehouse after descending sort is carried out according to characteristic value occurrence number
S12, received business data flow is read line by line, identifies whether the business data flow comes from file data, if It is no, then S13 is performed, the business data flow read is separated by the carriage return character (" n ", " r "), and be transferred to S14;If so, directly It is transferred to S14.In the present embodiment, if the business data flow received is separated from file data by the carriage return character.
S14, the regular expression in the business data flow read and the characteristic value data storehouse is subjected to character string Match somebody with somebody, if matching, performs S15.In the present embodiment, by the business data flow read with being carried out according to characteristic value occurrence number The regular expression in characteristic value data storehouse after descending sort carries out string matching one by one.If mismatching, S16 is performed, Current Datarow is skipped, continues to read next line data, is transferred to S14.In the present embodiment, the business data flow read and institute Stating the string matching of the regular expression in characteristic value data storehouse includes the complete matching of business data flow and regular expression With business data flow and the Similarity matching of regular expression.
S15, add up statistical match result corresponding with a characteristic value, and continue to read next line data, and return to step S14, matching is until statistical result corresponding with this feature value reaches the accumulative upper limit, interrupt match, and returns to and matched line by line Characteristic value.
For example, the business data flow received is as shown in Fig. 2 by the business data flow read with occurring according to characteristic value The process that the regular expression that number is carried out in the characteristic value data storehouse after descending sort carries out string matching one by one is specific such as Under:
A) Fig. 2 the 1st row " $ HEADER " and table 2 " ^!PADS | ^ * PADS " matching, matching it is unsuccessful;
" $ HEADER " match with " ^MAXIMUMLAYER " of table 2, and matching is unsuccessful;
" ^ s* $ HEADER " and table 2 ^ s* $ HEADER the match is successful, now Result (statistical match result) word Segment mark is designated as 1 (being shown in Table 2).Continue to match other regular expressions, the match is successful.
B) Fig. 2 the 2nd row " GENCAD 1.4 " and table 2 " ^!PADS | ^ * PADS " matching, matching it is unsuccessful; " GENCAD 1.4 " matches with " ^MAXIMUMLAYER " of table 2, and matching is unsuccessful.
Until regular expression " ^A!NET_NAME\!REFDES " is whole to have matched what the match is successful yet.
C) Fig. 2 the 3rd row does not all match any one regular expression to 16 rows.
D) Fig. 2 the 17th row " $ ENDHEADER " and table 2 " ^!PADS | ^ * PADS " matching, matching it is unsuccessful;“$ HEADER " matches with " ^MAXIMUMLAYER " of table 2, and matching is unsuccessful;
Until " ^ s* $ ENDHEADER " the match is successful, now Result (statistical match result) field mark 1 (is shown in Table 2).Continue to match other regular expressions, the match is successful.
E) Fig. 2 the 18th row NULI character is skipped.
F) Fig. 2 the 19th row " $ BOARD " and table 2 " ^!PADS | ^ * PADS " matching, matching it is unsuccessful;“$ HEADER " matches with " ^MAXIMUMLAYER " of table 2, and matching is unsuccessful.
Until " ^ s* $ BOARD " the match is successful, now Result (statistical match result) field mark 1 (being shown in Table 2).
G) now, it is found that ID=2 number of matches has reached the accumulative upper limit 3, interrupt match, return to ID=2 statistics With result.In the present embodiment,
H) data identification is completed.Return value is available for other programs or method call.
The present embodiment also provides a kind of computer-readable recording medium, is stored thereon with computer program, and the program is located Reason device realizes the recognition methods of the data type when performing.One of ordinary skill in the art will appreciate that:Realize above-mentioned each side The all or part of step of method embodiment can be completed by the related hardware of computer program.Foregoing computer program can To be stored in a computer-readable recording medium.The program upon execution, execution the step of including above-mentioned each method embodiment; And foregoing storage medium includes:ROM, RAM, magnetic disc or CD etc. are various can be with the medium of store program codes.
The data class stored in a kind of recognition methods of data type of the present embodiment offer and computer-readable recording medium The method that the recognition methods of type instead of the manual identification cad file type of original dependence engineer's experience, so as to reduce Because of the error rate of engineer's manual identification, the efficiency of identification is improved;Really realize in manufacturing process at the automation of data Reason, be advantageous to promote the intelligent Process of electronics industry.Will by the implementation of the recognition methods of data type described in the present embodiment The step of making manual intervention during electronic manufacture, is reduced, and is contributed to the cycle for shortening product manufacturing and putting goods on the market, is realized electricity The agile manufactruing of sub-industry, the competitive strength of enterprise.
Embodiment two
The present embodiment provides a kind of identifying system of data type, for identifying the data class of received business data flow Type;The identifying system of the data type includes:
Editor module, for editing characteristic value data storehouse in a predefined manner;The characteristic value data storehouse includes being used to judge The regular expression of the data type of business data flow and the characteristic value associated with the regular expression;Different business datums The corresponding different regular expression of stream;
Processing module, for reading received business data flow line by line, and by itself and the characteristic value data storehouse edited In regular expression carry out string matching, if matching, add up statistical match result corresponding with a characteristic value, continue to read Data line is removed, and the regular expression in next line data and the characteristic value data storehouse is subjected to string matching, by Row matching is until statistical result corresponding with this feature value reaches the accumulative upper limit, interrupt match.
The identifying system of the data type provided below with reference to diagram the present embodiment is described in detail.Need Bright is, it should be understood that the division of the modules of system identified above is only a kind of division of logic function, when actually realizing Can completely or partially it be integrated on a physical entity, can also be physically separate.And these modules can be all with software The form called by treatment element is realized;All it can also realize in the form of hardware;Processing can be passed through with part of module The form of element calling software realizes that part of module is realized by the form of hardware.For example, x modules can individually be set up Treatment element, it can also be integrated in some chip of said apparatus and realize, in addition it is also possible to be deposited in the form of program code It is stored in the memory of said apparatus, is called by some treatment element of said apparatus and perform the function of above x modules.Its The realization of its module is similar therewith.In addition these modules can completely or partially integrate, and can also independently realize.Here Described treatment element can be a kind of integrated circuit, have the disposal ability of signal.In implementation process, the above method it is each Step or more modules can pass through the integrated logic circuit of the hardware in processor elements or the instruction of software form Complete.
For example, the above module can be arranged to implement one or more integrated circuits of above method, such as: One or more specific integrated circuits (ApplicationSpecificIntegratedCircuit, abbreviation ASIC), or, one Or multi-microprocessor (digitalsingnalprocessor, abbreviation DSP), or, one or more field-programmable gate array Arrange (FieldProgrammableGateArray, abbreviation FPGA) etc..For another example, some module is dispatched by treatment element more than When the form of program code is realized, the treatment element can be general processor, such as central processing unit (CentralProcessingUnit, abbreviation CPU) or it is other can be with the processor of caller code.For another example, these modules can To integrate, realized in the form of on-chip system (system-on-a-chip, abbreviation SOC).
Referring to Fig. 3, it is shown as theory structure schematic diagram of the identifying system of data type in an embodiment.Such as Fig. 3 Shown, the identifying system 3 of the data type includes editor module 31 and processing module 32.
The editor module 31 is used to edit characteristic value data storehouse in a predefined manner.The characteristic value data storehouse includes being used for Judge the regular expression (RE) of the data type of business data flow, the characteristic value associated with the regular expression (in this reality Apply in example, characteristic value be data type unique identifier ID), the mark of data type corresponding with each regular expression (remark), add up matching result (RESULT_COUNT, in the present embodiment, the accumulative matching result and the accumulative upper limit one Cause) and statistical match result (Result).In the present embodiment, the predetermined way is editor's forms mode.In the present embodiment In, different business data flows corresponds to different regular expressions.The business data flow includes the volume such as C++, C, JAVA, Perl Cheng Yuyan.
The editor module 31 is specifically used for storage regular expression, characteristic value, the marking of data type, accumulative matching knot Fruit and statistical match result, to establish the characteristic value data storehouse;In the characteristic value data storehouse, by the characteristic value according to Characteristic value occurrence number carries out descending sort, and the characteristic value after descending sort is associated with regular expression.In this implementation In example, the first matching more than occurrence number need to be followed, to prevent the few first identification of characteristic value quantity from completing, erroneous judgement occurs.
The processing module 32 coupled with the editor module 31 is used to read received business data flow line by line, identifies institute State whether data flow comes from file data, if not come from file data, by the business data flow read by the carriage return character (" N ", " r ") separate, and the regular expression in the business data flow read and the characteristic value data storehouse is subjected to character string Matching, if matching, add up statistical match result corresponding with a characteristic value, and continue to read next line data, and return and will read The business data flow got carries out string matching with the regular expression in the characteristic value data storehouse, line by line matching until with Statistical result corresponding to this feature value reaches the accumulative upper limit, interrupt match, and returns to the characteristic value matched.If mismatching, Current Datarow is then skipped, continues to read next line data, performs the business data flow that will be read and the characteristic value data Regular expression in storehouse carries out string matching, line by line matching until statistical result corresponding with this feature value reaches accumulative on Limit, interrupt match, and return to the characteristic value matched.
In the present embodiment, after carrying out descending sort by the business data flow read and according to characteristic value occurrence number Regular expression in characteristic value data storehouse carries out string matching one by one.
If coming from file data, the processing module 32 is used for the business data flow that will be read and the characteristic value data Regular expression in storehouse carries out string matching, if matching, adds up statistical match result corresponding with a characteristic value, and continue Next line data are read, and returns to the business data flow that will be read and is carried out with the regular expression in the characteristic value data storehouse String matching, matching is until statistical result corresponding with this feature value reaches the accumulative upper limit, interrupt match, and returns to line by line The characteristic value matched.If mismatching, Current Datarow is skipped, continues to read next line data, performs the industry that will be read The regular expression being engaged in data flow and the characteristic value data storehouse carries out string matching, line by line matching until with this feature value Corresponding statistical result reaches the accumulative upper limit, interrupt match, and returns to the characteristic value matched.
Embodiment three
The present embodiment provides a kind of equipment, including:Processor, memory, transceiver, communication interface and system bus;Deposit Reservoir and communication interface are connected with processor and transceiver by system bus and complete mutual communication, and memory is used to deposit Computer program is stored up, communication interface is used for and other equipment is communicated, and processor and transceiver are used to run computer program, Equipment is set to perform each step of the recognition methods of data type as described above.
System bus mentioned above can be Peripheral Component Interconnect standard (PeripheralPomponentInterconnect, abbreviation PCI) bus or EISA (ExtendedIndustryStandardArchitecture, abbreviation EISA) bus etc..The system bus can be divided into address Bus, data/address bus, controlling bus etc..For ease of representing, only represented in figure with a thick line, it is not intended that only one total Line or a type of bus.Communication interface is used for accessing data base device and other equipment (such as client, read-write storehouse And read-only storehouse) between communication.Memory may include random access memory (RandomAccessMemory, abbreviation RAM), Nonvolatile memory (non-volatilememory), for example, at least a magnetic disk storage may also also be included.
Above-mentioned processor can be general processor, including central processing unit (CentralProcessingUnit, letter Claim CPU), network processing unit (NetworkProcessor, abbreviation NP) etc.;It can also be digital signal processor (DigitalSignalProcessing, abbreviation DSP), application specific integrated circuit (ApplicationSpecificIntegratedCircuit, abbreviation ASIC), field programmable gate array (Field- ProgrammableGateArray, abbreviation FPGA) either other PLDs, discrete gate or transistor logic device Part, discrete hardware components.
In summary, the recognition methods of data type provided by the invention, system, computer-readable recording medium and equipment The recognition methods of the data type of middle storage instead of the manual identification cad file type of original dependence engineer's experience Method, so as to reduce the efficiency for because of the error rate of engineer's manual identification, improving identification;Really realize number in manufacturing process According to automatic business processing, be advantageous to promote electronics industry intelligent Process.Pass through the identification side of data type of the present invention The implementation of method will reduce the step of manual intervention during electronic manufacture, contribute to the week for shortening product manufacturing and putting goods on the market Phase, realize the agile manufactruing of electronics industry, the competitive strength of enterprise.So the present invention effectively overcomes in the prior art Various shortcoming and have high industrial utilization.
The above-described embodiments merely illustrate the principles and effects of the present invention, not for the limitation present invention.It is any ripe Know the personage of this technology all can carry out modifications and changes under the spirit and scope without prejudice to the present invention to above-described embodiment.Cause This, those of ordinary skill in the art is complete without departing from disclosed spirit and institute under technological thought such as Into all equivalent modifications or change, should by the present invention claim be covered.

Claims (10)

1. a kind of recognition methods of data type, it is characterised in that for identifying the data type of received business data flow; The recognition methods of the data type includes:
Edit characteristic value data storehouse;The characteristic value data storehouse includes being used to judge the canonical table of the data type of business data flow Up to formula and the characteristic value associated with the regular expression;Different business data flows corresponds to different regular expressions;
Read received business data flow line by line, and itself and the regular expression in the characteristic value data storehouse edited are carried out String matching, if matching, add up statistical match result corresponding with a characteristic value, continue to read next line data, and will Next line data carry out string matching with the regular expression in the characteristic value data storehouse, and matching is until and this feature line by line Statistical result corresponding to value reaches the accumulative upper limit, interrupt match.
2. the recognition methods of data type according to claim 1, it is characterised in that editor's characteristic value data storehouse Step includes:
Characteristic value and regular expression are stored, to establish the characteristic value data storehouse;
In the characteristic value data storehouse, the characteristic value is subjected to descending sort according to characteristic value occurrence number, and by descending Characteristic value after sequence is associated with regular expression.
3. the recognition methods of data type according to claim 1, it is characterised in that the characteristic value data storehouse also includes Data type corresponding with each regular expression marks, adds up matching result and statistical match result.
4. the recognition methods of data type according to claim 3, it is characterised in that if the business data flow read and institute The regular expression string matching in characteristic value data storehouse is stated, while the statistical result is added up, is also tied the statistics Fruit is stored in the characteristic value data storehouse.
5. the recognition methods of data type according to claim 3, it is characterised in that reading received business line by line After data flow, before it is carried out into string matching with the regular expression in the characteristic value data storehouse, the data class The recognition methods of type also includes identifying whether the business data flow comes from file data, if it is not, the business datum that will be read Stream is separated by the carriage return character, and is transferred to and described itself and regular expression in the characteristic value data storehouse is carried out into string matching Step;If so, directly it is transferred to the step that it is carried out to string matching with the regular expression in the characteristic value data storehouse Suddenly.
6. the recognition methods of data type according to claim 1, it is characterised in that the business data flow read and institute Stating the string matching of the regular expression in characteristic value data storehouse includes the complete matching of business data flow and regular expression With business data flow and the Similarity matching of regular expression.
7. a kind of identifying system of data type, it is characterised in that for identifying the data type of received business data flow; The identifying system of the data type includes:
Editor module, for editing characteristic value data storehouse;The characteristic value data storehouse includes being used for the number for judging business data flow Regular expression and the characteristic value associated with the regular expression according to type;Different business data flows correspond to it is different just Then expression formula;
Processing module, for reading received business data flow line by line, and by its with the characteristic value data storehouse edited Regular expression carries out string matching, if matching, adds up statistical match result corresponding with a characteristic value, continues under reading Data line, and the regular expression in next line data and the characteristic value data storehouse is subjected to string matching, line by line With until statistical result corresponding with this feature value reaches the accumulative upper limit, interrupt match.
8. the identifying system of data type according to claim 7, it is characterised in that
The processing module after received business data flow is read line by line, by its with the characteristic value data storehouse just Before then expression formula carries out string matching, the processing module is additionally operable to identify whether the business data flow comes from number of files According to if it is not, the business data flow read is separated by the carriage return character, and by the business data flow read and the characteristic value number String matching is carried out according to the regular expression in storehouse;If so, directly by the business data flow read and the characteristic value number String matching is carried out according to the regular expression in storehouse.
9. a kind of computer-readable recording medium, is stored thereon with computer program, it is characterised in that the program is held by processor The recognition methods of data type any one of claim 1 to 6 is realized during row.
A kind of 10. equipment, it is characterised in that including:Processor and memory;
The memory is used to store computer program, and the processor is used for the computer journey for performing the memory storage Sequence, so that the equipment performs the recognition methods of the data type as any one of claim 1 to 6.
CN201710910740.2A 2017-09-29 2017-09-29 Recognition methods, system, computer-readable recording medium and the equipment of data type Pending CN107766466A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201710910740.2A CN107766466A (en) 2017-09-29 2017-09-29 Recognition methods, system, computer-readable recording medium and the equipment of data type
PCT/CN2017/119345 WO2019061913A1 (en) 2017-09-29 2017-12-28 Data type identification method and system, computer readable storage medium and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710910740.2A CN107766466A (en) 2017-09-29 2017-09-29 Recognition methods, system, computer-readable recording medium and the equipment of data type

Publications (1)

Publication Number Publication Date
CN107766466A true CN107766466A (en) 2018-03-06

Family

ID=61267010

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710910740.2A Pending CN107766466A (en) 2017-09-29 2017-09-29 Recognition methods, system, computer-readable recording medium and the equipment of data type

Country Status (2)

Country Link
CN (1) CN107766466A (en)
WO (1) WO2019061913A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109615309A (en) * 2018-09-25 2019-04-12 阿里巴巴集团控股有限公司 A kind of data recording method and device, a kind of calculating equipment and storage medium
CN111061777A (en) * 2019-12-10 2020-04-24 广州电力工程监理有限公司 Project data statistical analysis method and system
WO2020232880A1 (en) * 2019-05-21 2020-11-26 平安科技(深圳)有限公司 Data processing method and apparatus, storage medium and terminal device
CN112087486A (en) * 2020-07-30 2020-12-15 山东浪潮通软信息科技有限公司 Data integration method, equipment and medium for Internet of things equipment
CN113515680A (en) * 2021-04-20 2021-10-19 建信金融科技有限责任公司 Financial monitoring data processing method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103778210A (en) * 2014-01-15 2014-05-07 北京京东尚科信息技术有限公司 Method and device for judging specific file type of file to be analyzed
CN104881496A (en) * 2015-06-15 2015-09-02 北京金山安全软件有限公司 File name identification and file cleaning method and device
CN105653531A (en) * 2014-11-12 2016-06-08 中兴通讯股份有限公司 Method and device for data extraction
CN105975575A (en) * 2016-05-04 2016-09-28 电子科技大学 Automatic data type recognition method

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102929596B (en) * 2012-09-21 2016-01-06 华为技术有限公司 Code arrange distinguish method and relevant apparatus
CN104346366B (en) * 2013-07-30 2017-11-24 国际商业机器公司 Extend the method and apparatus of test data
CN107038161B (en) * 2015-07-13 2021-03-26 阿里巴巴集团控股有限公司 Equipment and method for filtering data
CN106855842B (en) * 2015-12-08 2020-12-29 中国航空工业第六一八研究所 Program static analysis method based on regular expression
CN106445795B (en) * 2016-09-26 2019-03-22 中国工商银行股份有限公司 A kind of database SQL Efficiency testing method and device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103778210A (en) * 2014-01-15 2014-05-07 北京京东尚科信息技术有限公司 Method and device for judging specific file type of file to be analyzed
CN105653531A (en) * 2014-11-12 2016-06-08 中兴通讯股份有限公司 Method and device for data extraction
CN104881496A (en) * 2015-06-15 2015-09-02 北京金山安全软件有限公司 File name identification and file cleaning method and device
CN105975575A (en) * 2016-05-04 2016-09-28 电子科技大学 Automatic data type recognition method

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109615309A (en) * 2018-09-25 2019-04-12 阿里巴巴集团控股有限公司 A kind of data recording method and device, a kind of calculating equipment and storage medium
WO2020232880A1 (en) * 2019-05-21 2020-11-26 平安科技(深圳)有限公司 Data processing method and apparatus, storage medium and terminal device
CN111061777A (en) * 2019-12-10 2020-04-24 广州电力工程监理有限公司 Project data statistical analysis method and system
CN112087486A (en) * 2020-07-30 2020-12-15 山东浪潮通软信息科技有限公司 Data integration method, equipment and medium for Internet of things equipment
CN112087486B (en) * 2020-07-30 2022-07-12 山东浪潮通软信息科技有限公司 Data integration method, equipment and medium for Internet of things equipment
CN113515680A (en) * 2021-04-20 2021-10-19 建信金融科技有限责任公司 Financial monitoring data processing method and device

Also Published As

Publication number Publication date
WO2019061913A1 (en) 2019-04-04

Similar Documents

Publication Publication Date Title
CN107766466A (en) Recognition methods, system, computer-readable recording medium and the equipment of data type
US8230370B2 (en) Circuit design assisting apparatus, computer-readable medium storing circuit design assisting program, and circuit design assisting method
CN114818553B (en) Chip integrated design method
CN114861581B (en) Auxiliary programming design method of programmable logic device based on image recognition
CN109977518A (en) Design method, system, computer readable storage medium and the equipment of web plate ladder
CN104021002B (en) A kind of PDM system standards part storage method
CN107766313A (en) The introduction method and its terminal of a kind of data list
CN106649210A (en) Data conversion method and device
US20070234241A1 (en) Data processing system and method
CN109376546A (en) Data packet auditing method, system, device and storage medium based on global rule
CN111506362B (en) Processing method, device, storage medium and system for configuration form of game
CN113407565A (en) Cross-database data query method, device and equipment
CN107895064A (en) Component polarity detection method, system, computer-readable recording medium and equipment
CN109815635B (en) Boiler MFT automatic design system and method
CN116663479A (en) PCB-Package collaborative design method
CN114140232A (en) Accounting data conversion method and device and electronic equipment
CN111739162B (en) Automatic PCBA accurate three-dimensional model generation method based on ECAD interface
CN114328486A (en) Data quality checking method and device based on model
CN108572948A (en) The processing method and processing device of doorplate information
CN115268846A (en) Method and device for adding attribute information and computer readable storage medium
CN102831531A (en) Shop decorating method based on electronic commerce platform
CN112631920A (en) Test method, test device, electronic equipment and readable storage medium
CN114492282A (en) Through signal line layout processing method and device, chip and storage medium
CN111400991A (en) Circuit diagram rapid design method based on hardware electrical interface relation
US8255856B1 (en) DC path checking in a hierarchical circuit design

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20180306