CN110866010A - Formatted order information extraction method and device - Google Patents

Formatted order information extraction method and device Download PDF

Info

Publication number
CN110866010A
CN110866010A CN201911041381.7A CN201911041381A CN110866010A CN 110866010 A CN110866010 A CN 110866010A CN 201911041381 A CN201911041381 A CN 201911041381A CN 110866010 A CN110866010 A CN 110866010A
Authority
CN
China
Prior art keywords
data
order
characteristic
rule
segmentation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201911041381.7A
Other languages
Chinese (zh)
Other versions
CN110866010B (en
Inventor
叶李灶
刘晨光
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Jiadunquansheng Information Technology Co Ltd
Original Assignee
Suzhou Jiadunquansheng Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Jiadunquansheng Information Technology Co Ltd filed Critical Suzhou Jiadunquansheng Information Technology Co Ltd
Priority to CN201911041381.7A priority Critical patent/CN110866010B/en
Publication of CN110866010A publication Critical patent/CN110866010A/en
Application granted granted Critical
Publication of CN110866010B publication Critical patent/CN110866010B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2282Tablespace storage structures; Management thereof
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping [e-shopping]
    • G06Q30/0633Lists, e.g. purchase orders, compilation or processing
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • General Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • Strategic Management (AREA)
  • Marketing (AREA)
  • Economics (AREA)
  • Development Economics (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a formatted order information extraction method, which comprises the following steps: establishing a model base, and predefining a data model; the data model comprises an order fact table, a data entity table and a characteristic link table, and all the tables are connected by order numbers; establishing a rule base, wherein a trigger condition and a storage mechanism for extracting information are defined in the rule base; (2) adding a segmentation identification to the received initial input stream; (3) sequentially comparing each data segment with a rule template in a rule base according to the serial number sequence, and triggering a storage mechanism when the extracted data segment is matched with the rule template; (4) and establishing a data combination rule to form a data entity table according to the application requirements of the system function plate. According to the invention, the order information is safer by respectively storing a certain characteristic, a characteristic number and a characteristic value based on the same order number in the three tables.

Description

Formatted order information extraction method and device
Technical Field
The invention relates to the field of data processing, in particular to a formatted order information extraction method and a device thereof.
Background
In Java, data can only be read in an input stream manner, and during the reading process, it is necessary to continuously determine whether there is any subsequent data, and each time of reading, the same code needs to be written once, so that the readability of the code is poor, and errors are easily caused due to multiple times of writing. In the loop, it is not possible to predict in advance whether or not there is any unread data. And (4) reading data by using circulation, wherein in the reading process, whether the circulation is finished or not needs to be judged all the time. If it is desired to know whether there is any data, the next reading can only be stored by the temporary variable, and when the next reading is performed, it is necessary to determine whether the previous temporary variable has read the data, which is very troublesome. The problems of low reusability and code readability, low reading efficiency and easy error in the writing process exist.
To solve the above problems, publication No. CN104461938 discloses a data reading method and apparatus. The data reading method comprises the following steps: adding a segmentation identification to the received initial input stream; reading an initial input stream, segmenting the initial input stream according to segmentation marks in the reading process to obtain a plurality of data fragments, and adding an end mark to a first data fragment in the plurality of data fragments, wherein the first data fragment is a data fragment positioned at the tail end of the initial input stream; sequentially storing the plurality of data fragments to a data buffer area according to the sequence of the plurality of data fragments in the initial input stream; and reading data from the data buffer. However, since the data fragments still need to be stored in the buffer and read from the buffer, the reading efficiency is relatively low.
Disclosure of Invention
The purpose of the invention is as follows: in view of the above problems, an object of the present invention is to provide a method for reading formatted order information, which can effectively improve the data reading efficiency.
Another object of the present invention is to provide a formatted order information reading apparatus.
The technical scheme is as follows: in order to achieve the above object, the present invention provides a formatted order information extraction method, which comprises the following steps
(1) Establishing a model base, predefining a data model, setting characteristics for the field of the model, and forming the field characteristics of the model base; the data model comprises an order fact table, a data fact table and a characteristic link table, and the tables are connected by order numbers; the characteristic link table takes any characteristic in the order fact table as a connecting key to be connected with the characteristic link table, and the characteristic value in the characteristic link table has a corresponding characteristic number; the data entity table is connected with the order fact table through the order number, and the data entity table is connected with the characteristic link table by taking the characteristic number as a link key to obtain a characteristic value;
establishing a rule base according to format rules of the formatted order, wherein trigger conditions and a storage mechanism for extracting information are defined in the rule base; setting a rule template for comparison in a rule base; triggering a storage mechanism when the extracted data segments are matched with the rule template;
defining respective attribute fields of an order fact table, a data entity table and a characteristic link table; for the data segment corresponding to each rule template, a storage mechanism specifies a table and a field stored by the storage mechanism;
(2) adding a segmentation identification to the received initial input stream;
reading the initial input stream, segmenting the initial input stream according to the segmentation identification in the reading process to obtain a plurality of data fragments, and sequentially numbering each data fragment;
(3) sequentially comparing each data segment with a rule template in a rule base according to the serial number sequence, and triggering a storage mechanism when the extracted data segment is matched with the rule template; the storage mechanism specifies a table and a field to be stored for the data segment corresponding to each rule template; storing the data segments into corresponding tables of a model library; until said data fragment is traversed;
(4) according to the application requirements of the system function blocks, establishing a data combination rule, defining a combination mode of an order fact table, a data entity table and a characteristic link table to the current application requirements, combining the combination modes to form the data entity table, and taking data in the entity table as basic bottom layer data of the application requirements of each function block.
One order number corresponds to one record in the order fact table.
One order fact table can be connected with a plurality of characteristic link tables; a property link table is coupled to a data entity table.
And segmenting the initial input stream by the segmentation mark once from the position of the segmentation mark to obtain the plurality of data segments when one segmentation mark is read.
A formatted order information extraction device comprises a reading unit, a storage unit and a combination unit;
the reading unit is used for reading the initial input stream, segmenting the initial input stream according to the segmentation identification in the reading process to obtain a plurality of data fragments, and sequentially numbering each data fragment;
the storage unit is used for sequentially comparing each data segment with the rules in the rule base according to the numbering sequence until the data segments are traversed, triggering the storage action of the data segments meeting the triggering conditions, and storing the data segments into the corresponding tables of the model base;
and the combination unit is used for establishing a data combination rule according to the application requirements of the system function plate and defining the combination mode of the order fact table, the data entity table and the characteristic link table to the current application requirements.
The method comprises a segmentation module, wherein the segmentation module is used for segmenting the initial input stream once from the position of the segmentation mark to obtain the plurality of data segments when one segmentation mark is read.
The device comprises a receiving unit, a processing unit and a processing unit, wherein the receiving unit is used for receiving a setting instruction before the adding unit adds the segmentation identification to the received initial input stream; and the setting unit is used for setting the segmentation identification according to the setting instruction.
Has the advantages that: compared with the prior art, the invention has the following advantages:
1. the data security is high: the method comprises the steps of establishing a model base, wherein the model base comprises an order fact table, a data entity table and a characteristic link table, and all the tables are connected by order numbers; order data information is stored in the three tables respectively, and data security is improved; the characteristic link table takes any characteristic in the order fact table as a connecting key to be connected with the characteristic link table, and the characteristic in the characteristic link table corresponds to the characteristic number; the data entity table is connected with the order fact table through the order number, and the data entity table is connected with the characteristic link table by taking the characteristic number as a link key to obtain a characteristic value; according to the invention, the order information is safer by respectively storing a certain characteristic, a characteristic number and a characteristic value based on the same order number in the three tables.
2. The data reading speed is fast: reading the initial input stream, segmenting the initial input stream according to the segmentation identification in the reading process to obtain a plurality of data fragments, and sequentially numbering each data fragment; comparing each data segment with the rules in the rule base in sequence according to the numbering sequence until the data segments are traversed; meanwhile, the data fragments meeting the trigger conditions trigger storage actions and are stored in corresponding tables. The whole reading process does not need to judge whether the subsequent data exist or not and whether the reading is finished or not all the time, so that the data reading steps are simplified, the problem of low data reading efficiency in the prior art is solved, and the effects of improving the data reading speed and the data reading efficiency are achieved.
3. The order information is stored in a plurality of tables, a data combination rule is established according to the application requirements of system function blocks, the combination mode of an order fact table, a data entity table and a characteristic link table to the current application requirements is defined, the data entity table is formed by combination in the combination mode, and the data in the entity table is used as the basic bottom layer data of the application requirements of each function block; the order data are split and recombined to generate the basic bottom layer data aiming at the application-free requirements, so that the application range is wider.
Drawings
FIG. 1 is a relational view of a model library according to embodiment 1 of the present invention;
FIG. 2 is a schematic structural diagram of an order information extraction device;
FIG. 3 is a flow chart of an order information extraction method;
Detailed Description
According to a preferred embodiment of the present invention.
Fig. 3 shows a formatted order information extraction method, which includes the following steps
(1) Establishing a model base, predefining a data model, setting characteristics for the field of the model, and forming the field characteristics of the model base; the data model comprises an order fact table, a data fact table and a characteristic link table, and the tables are connected by order numbers; the characteristic link table takes any characteristic in the order fact table as a connecting key to be connected with the characteristic link table, and the characteristic value in the characteristic link table has a corresponding characteristic number; the data entity table is connected with the order fact table through the order number, and the data entity table is connected with the characteristic link table by taking the characteristic number as a link key to obtain a characteristic value; one order number corresponds to one record in the order fact table; one order fact table can be connected with a plurality of characteristic link tables; a property link table is coupled to a data entity table.
Establishing a rule base according to format rules of the formatted order, wherein trigger conditions and a storage mechanism for extracting information are defined in the rule base; setting a rule template for comparison in a rule base; triggering a storage mechanism when the extracted data segments are matched with the rule template;
defining respective attribute fields of an order fact table, a data entity table and a characteristic link table; for the data segment corresponding to each rule template, a storage mechanism specifies a table and a field stored by the storage mechanism;
(2) adding a segmentation identification to the received initial input stream;
adding a segmentation identifier to the received initial input stream, wherein the segmentation identifier is a format separator; reading the initial input stream, segmenting the initial input stream according to the segmentation identification in the reading process to obtain a plurality of data fragments, and sequentially numbering each data fragment;
(3) sequentially comparing each data segment with a rule template in a rule base according to the serial number sequence, and triggering a storage mechanism when the extracted data segment is matched with the rule template; the storage mechanism specifies a table and a field to be stored for the data segment corresponding to each rule template; storing the data segments into corresponding tables of a model library; until said data fragment is traversed;
(4) according to the application requirements of the system function blocks, establishing a data combination rule, defining a combination mode of an order fact table, a data entity table and a characteristic link table to the current application requirements, combining the combination modes to form the data entity table, and taking data in the entity table as basic bottom layer data of the application requirements of each function block.
Example 1
As shown in fig. 1 and fig. 3, a method for extracting formatted order information of apparel accessories,
(1) establishing a model base, predefining a data model, setting characteristics for the field of the model, and forming the field characteristics of the model base; the data model takes an order table as an order fact table, a gargent table as a data entity table and a size table as a characteristic link table, wherein the size table comprises a size number and a size model; the tables are connected by an order number A; the characteristic linkage table size table is connected with the order fact table order table by taking the size in the order fact table as a connecting key;
the Size model in the Size table corresponds to the Size number; the garent table is connected with the order table through the order number A, and the garent table is connected with the size table by taking the size number as a link key to obtain a specific size numerical value;
establishing a rule base according to a format rule of the formatted order A, wherein a trigger condition and a storage mechanism for extracting information are defined in the rule base; setting a rule template for comparison in a rule base; triggering a storage mechanism when the extracted data segments are matched with the rule template; the storage mechanism specifies a table and a field to be stored for the data segment corresponding to each rule template; the order table is defined to include: attribute fields such as order number, money number, category, customer, contact and the like;
the Size table includes: attribute fields such as order number, size model and the like;
the Garent table includes: order number, size number, quantity, port, color and other attribute fields;
and the storage mechanism specifies the stored tables and fields for the data segments corresponding to each rule template.
(2) Receiving a customer order, wherein the order information content of the order A comprises an order number, a money number, a category, a customer, a brand, a color, a size, a proper amount, port information, a contact person and the like; adding a segmentation identifier to the received initial input stream, wherein the segmentation identifier is a format separator;
reading the initial input stream, segmenting the initial input stream according to the segmentation identification in the reading process to obtain a plurality of data fragments, and sequentially numbering each data fragment;
the formatted order reading method provided by the embodiment of the invention comprises the steps of reading the initial input stream, segmenting the initial input stream according to the segmentation identification in the reading process to obtain a plurality of data segments, and numbering each data segment in sequence; comparing each data segment with the rules in the rule base in sequence according to the numbering sequence until the data segments are traversed;
(3) sequentially comparing each data segment with a rule template in a rule base according to the serial number sequence, and triggering a storage mechanism when the extracted data segment is matched with the rule template; the storage mechanism specifies a table and a field to be stored for the data segment corresponding to each rule template; storing the data segments into corresponding tables of a model library; until said data fragment is traversed;
wherein the order table comprises order number, product style number, category, customer, brand
The size table comprises order numbers, sizes and size numbers;
the gargent table comprises an order number, a size, a color, a size, a proper amount, port information and a contact person;
(4) after information extraction is finished, establishing a data combination rule b according to the application requirements of the system function plate, and defining a combination mode of an order table, a size table and a organ table aiming at the current application requirements to form a new entity table Ab;
the table is combined to form a new entity table, so that the problem of incomplete single table information is solved.
According to the invention, under the original table structure of the order information table A, order information data is cut, and then a corresponding data combination rule c is established according to different functional plate requirements, so that entity tables Ac aiming at different requirements are formed, and different order requirements are met.
Establishing a model base, wherein the model base comprises an order fact table, a data entity table and a characteristic link table, and all the tables are connected by order numbers; order data information is stored in the three tables respectively, and data security is improved; the characteristic link table takes any characteristic in the order fact table as a connecting key to be connected with the characteristic link table, and the characteristic in the characteristic link table corresponds to the characteristic number; the data entity table is connected with the order fact table through the order number, and the data entity table is connected with the characteristic link table by taking the characteristic number as a link key to obtain a characteristic value; according to the invention, the order information is safer by respectively storing a certain characteristic, a characteristic number and a characteristic value based on the same order number in the three tables.
The order information is stored in a plurality of tables, a data combination rule is established according to the application requirements of system function blocks, the combination mode of an order fact table, a data entity table and a characteristic link table to the current application requirements is defined, the data entity table is formed by combination in the combination mode, and the data in the entity table is used as the basic bottom layer data of the application requirements of each function block; the order data are split and recombined to generate the basic bottom layer data aiming at the application-free requirements, so that the application range is wider.
Example 2
A formatted order information extraction device comprises a reading unit, a storage unit and a combination unit;
the reading unit is used for reading the initial input stream, and comprises a segmentation module, a data segment obtaining module and a data segment obtaining module, wherein the segmentation module is used for segmenting the initial input stream once from the position of the segmentation mark every time one segmentation mark is read, so that the plurality of data segments are obtained; sequentially numbering each data fragment; comparing each data segment with the rules in the rule base in sequence according to the numbering sequence until the data segments are traversed; segmenting the initial input stream according to the segmentation identification in the reading process to obtain a plurality of data segments, and numbering each data segment in sequence; comparing each data segment with the rules in the rule base in sequence according to the numbering sequence until the data segments are traversed;
the storage unit defines a trigger condition and a storage mechanism for extracting information in the rule base; setting a rule template for comparison in a rule base; triggering a storage mechanism when the extracted data segments are matched with the rule template; the storage mechanism specifies a table and a field to be stored for the data segment corresponding to each rule template;
provision, the order table includes: attribute fields such as order number, money number, category, customer, contact and the like;
the Size table includes: attribute fields such as order number, size model and the like;
the Garent table includes: order number, size number, quantity, port, color and other attribute fields;
defining a trigger condition of a storage mechanism according to the formatted order A, wherein when the trigger condition is that the data segment is matched with the rule template,
the combination unit establishes a data combination rule according to the application requirements of the system function plate, and defines the combination mode of the order table, the size table and the gargent table aiming at the current application requirements.
The device comprises a receiving unit, a processing unit and a processing unit, wherein the receiving unit is used for receiving a setting instruction before the adding unit adds the segmentation identification to the received initial input stream; and the setting unit is used for setting the segmentation identification according to the setting instruction.
The embodiment of the present invention further provides a formatted order information reading device, where the data reading device is mainly used to execute the data reading method provided in the above-mentioned content of the embodiment of the present invention, and the following describes the data reading device provided in the embodiment of the present invention in detail.
Fig. 2 is a schematic diagram of a data reading apparatus according to an embodiment of the present invention, as shown in fig. 2,
the data reading device mainly comprises an adding unit, a reading unit, a storage unit and an analyzing unit, wherein:
the adding unit mainly comprises a segmentation module, and the segmentation module is used for segmenting the initial input stream once from the position where the segmentation mark is located every time one segmentation mark is read to obtain a plurality of data segments. The adding unit adds the segmentation marks at a plurality of positions of the initial input stream, and the segmentation module segments the initial input stream once from the position of the segmentation mark every time one segmentation mark is met in the segmentation process.
The reading unit mainly comprises a traversal module and a judgment module, wherein the traversal module is used for traversing the data buffer area, the judgment module is used for judging whether the end mark is read or not, the traversal module ends traversing the data buffer area under the condition that the end mark is read, and the traversal module continues traversing the data buffer area under the condition that the end mark is not read; and the initial input stream is read, the initial input stream is segmented according to the segmentation identification in the reading process to obtain a plurality of data segments, and each data segment is sequentially numbered.
The storage unit is used for sequentially comparing each data segment with the rules in the rule base according to the numbering sequence until the data segments are traversed, triggering the storage action of the data segments meeting the triggering conditions, and storing the data segments into the corresponding tables of the model base;
and the combination unit is used for establishing a data combination rule according to the application requirements of the system function plate and defining the combination mode of the order fact table, the data entity table and the characteristic link table to the current application requirements.
The data reading device provided by the embodiment of the invention realizes the encapsulation of the input stream in the traverse device by segmenting the initial input stream, adding the ending identifier to the data segment positioned at the tail end of the initial input stream and then storing a plurality of data segments in the data buffer area, sequentially comparing each data segment with the rules in the rule base according to the numbering sequence, triggering the storage action of the data segment meeting the triggering condition, and storing the data segment into the corresponding table of the model base; until said data fragment is traversed; and after the information extraction is finished, generating a data entity to become basic bottom layer data of other functional plates. The data reading steps are simplified, the problem of low data reading efficiency in the prior art is solved, and the effects of improving the data reading speed and the data reading efficiency are achieved. In addition, the data reading method provided by the embodiment of the invention cuts the order information into each data segment by setting the formatted order and setting the cutting identifier, judges whether the data segment meets the stored starting condition in a manner of numbering the data segment, and is beneficial to improving the efficiency and the accuracy of data reading.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes several instructions for causing a computer device (which may be a personal computer, a mobile terminal, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic or optical disk, and other various media capable of storing program codes.
The above description is only a preferred embodiment of the present invention and is not intended to limit the present invention, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (7)

1. A formatted order information extraction method is characterized in that: the method comprises the following steps
(1) Establishing a model base, predefining a data model, setting characteristics for the field of the model, and forming the field characteristics of the model base; the data model comprises an order fact table, a data fact table and a characteristic link table, and the tables are connected by order numbers; the characteristic link table takes any characteristic in the order fact table as a connecting key to be connected with the characteristic link table, and the characteristic value in the characteristic link table has a corresponding characteristic number; the data entity table is connected with the order fact table through the order number, and the data entity table is connected with the characteristic link table by taking the characteristic number as a link key to obtain a characteristic value;
establishing a rule base according to format rules of the formatted order, wherein trigger conditions and a storage mechanism for extracting information are defined in the rule base; setting a rule template for comparison in a rule base; triggering a storage mechanism when the extracted data segments are matched with the rule template;
defining respective attribute fields of an order fact table, a data entity table and a characteristic link table; for the data segment corresponding to each rule template, a storage mechanism specifies a table and a field stored by the storage mechanism;
(2) adding a segmentation identification to the received initial input stream;
reading the initial input stream, segmenting the initial input stream according to the segmentation identification in the reading process to obtain a plurality of data fragments, and sequentially numbering each data fragment;
(3) sequentially comparing each data segment with a rule template in a rule base according to the serial number sequence, and triggering a storage mechanism when the extracted data segment is matched with the rule template; the storage mechanism specifies a table and a field to be stored for the data segment corresponding to each rule template; storing the data segments into corresponding tables of a model library; until said data fragment is traversed;
(4) according to the application requirements of the system function blocks, establishing a data combination rule, defining a combination mode of an order fact table, a data entity table and a characteristic link table to the current application requirements, combining the combination modes to form the data entity table, and taking data in the entity table as basic bottom layer data of the application requirements of each function block.
2. The method of claim 1, wherein the formatted order information comprises: one order number corresponds to one record in the order fact table.
3. The method of claim 1, wherein the formatted order information comprises: one order fact table can be connected with a plurality of characteristic link tables; a property link table is coupled to a data entity table.
4. The method of claim 1, wherein the formatted order information comprises: and segmenting the initial input stream by the segmentation mark once from the position of the segmentation mark to obtain the plurality of data segments when one segmentation mark is read.
5. A formatted order information extraction device, characterized by: comprises a reading unit, a storage unit and a combination unit;
the reading unit is used for reading the initial input stream, segmenting the initial input stream according to the segmentation identification in the reading process to obtain a plurality of data fragments, and sequentially numbering each data fragment;
the storage unit is used for sequentially comparing each data segment with the rules in the rule base according to the numbering sequence until the data segments are traversed, triggering the storage action of the data segments meeting the triggering conditions, and storing the data segments into the corresponding tables of the model base;
and the combination unit is used for establishing a data combination rule according to the application requirements of the system function plate and defining the combination mode of the order fact table, the data entity table and the characteristic link table to the current application requirements.
6. The formatted order information extraction device according to claim 5, wherein: the method comprises a segmentation module, wherein the segmentation module is used for segmenting the initial input stream once from the position of the segmentation mark to obtain the plurality of data segments when one segmentation mark is read.
7. The formatted order information extraction device according to claim 5, wherein: the device comprises a receiving unit, a processing unit and a processing unit, wherein the receiving unit is used for receiving a setting instruction before the adding unit adds the segmentation identification to the received initial input stream; and the setting unit is used for setting the segmentation identification according to the setting instruction.
CN201911041381.7A 2019-10-30 2019-10-30 Formatted order information extraction method and device Active CN110866010B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911041381.7A CN110866010B (en) 2019-10-30 2019-10-30 Formatted order information extraction method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911041381.7A CN110866010B (en) 2019-10-30 2019-10-30 Formatted order information extraction method and device

Publications (2)

Publication Number Publication Date
CN110866010A true CN110866010A (en) 2020-03-06
CN110866010B CN110866010B (en) 2023-05-23

Family

ID=69654296

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911041381.7A Active CN110866010B (en) 2019-10-30 2019-10-30 Formatted order information extraction method and device

Country Status (1)

Country Link
CN (1) CN110866010B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104461938A (en) * 2014-12-11 2015-03-25 北京国双科技有限公司 Method and device for reading data
CN104615736A (en) * 2015-02-10 2015-05-13 上海创景计算机系统有限公司 Quick analysis and storage method of big data based on database
CN105447099A (en) * 2015-11-11 2016-03-30 中国建设银行股份有限公司 Log structured information extraction method and apparatus
US10019535B1 (en) * 2013-08-06 2018-07-10 Intuit Inc. Template-free extraction of data from documents

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10019535B1 (en) * 2013-08-06 2018-07-10 Intuit Inc. Template-free extraction of data from documents
CN104461938A (en) * 2014-12-11 2015-03-25 北京国双科技有限公司 Method and device for reading data
CN104615736A (en) * 2015-02-10 2015-05-13 上海创景计算机系统有限公司 Quick analysis and storage method of big data based on database
CN105447099A (en) * 2015-11-11 2016-03-30 中国建设银行股份有限公司 Log structured information extraction method and apparatus

Also Published As

Publication number Publication date
CN110866010B (en) 2023-05-23

Similar Documents

Publication Publication Date Title
CN109445834B (en) Program code similarity rapid comparison method based on abstract syntax tree
CN107807982B (en) Consistency checking method and device for heterogeneous database
CN110515896B (en) Model resource management method, model file manufacturing method, device and system
CN108228825A (en) A kind of station address data cleaning method based on participle
CN112732655B (en) Online analysis method and system for format-free log
CN109918658A (en) A kind of method and system obtaining target vocabulary from text
CN107797916A (en) DDL sentences checking method and device
CN109902125A (en) A kind of distributed storage method based on existing database
CN109753517A (en) A kind of method, apparatus, computer storage medium and the terminal of information inquiry
CN112597345A (en) Laboratory data automatic acquisition and matching method
CN108009223B (en) Method and device for detecting consistency of transaction data
CN115905630A (en) Graph database query method, device, equipment and storage medium
US20160364474A1 (en) Identifying Data Offsets Using Binary Masks
CN108009049A (en) The offline restoration methods of MYISAM storage engines deletion records, storage medium
CN109344163B (en) Data verification method and device and computer readable medium
CN104778252A (en) Index storage method and index storage device
CN110866010A (en) Formatted order information extraction method and device
KR20140031269A (en) Method and device for determining font
CN107291749A (en) A kind of determination method and device of data target incidence relation
CN115712601A (en) Method for reading fixed-length files in batch based on springbatch
US20220199202A1 (en) Method and apparatus for compressing fastq data through character frequency-based sequence reordering
CN115577147A (en) Visual information map retrieval method and device, electronic equipment and storage medium
CN104866535A (en) Compression method and device of number segment records
CN110336817B (en) Unknown protocol frame positioning method based on TextRank
CN113190644A (en) Method and device for hot updating search engine word segmentation dictionary

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant