CN105389338B - A kind of analytic method of buying acceptance of the bid data - Google Patents
A kind of analytic method of buying acceptance of the bid data Download PDFInfo
- Publication number
- CN105389338B CN105389338B CN201510683420.9A CN201510683420A CN105389338B CN 105389338 B CN105389338 B CN 105389338B CN 201510683420 A CN201510683420 A CN 201510683420A CN 105389338 B CN105389338 B CN 105389338B
- Authority
- CN
- China
- Prior art keywords
- bid
- acceptance
- data
- buying
- attribute
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/25—Integrating or interfacing systems involving database management systems
- G06F16/254—Extract, transform and load [ETL] procedures, e.g. ETL data flows in data warehouses
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses a kind of analytic methods of buying acceptance of the bid data, are related to ETL (data pick-up, conversion and load) field in data warehouse technology.This method includes:Isolate the criteria table data in Html buying acceptance of the bid bulletin texts to be resolved and non-standard list data;Criteria table data and non-standard list data are parsed respectively according to the acceptance of the bid bulletins attribute of buying acceptance of the bid bulletin text, obtain acceptance of the bid record;In the acceptance of the bid record storage to database that parsing is obtained.Analytic method provided by the present invention, by the way that criteria table data and non-standard list data progress separating treatment in acceptance of the bid bulletin text will be purchased, efficient, the accurate parsing to purchasing acceptance of the bid data is realized, the depth to purchase acceptance of the bid data, which is excavated and utilized, to provide the foundation.
Description
Technical field
The present invention relates to ETL (data pick-up, conversion and load) fields in data warehouse technology, and in particular to one kind is adopted
The analytic method of purchase acceptance of the bid data.
Background technology
With the fast development of Internet technology, all kinds of Internet users are (super in a large amount of Html of Web realease daily
Text mark up language) files such as document, picture and video, various reptile engines ceaselessly from all kinds of websites crawl,
Analyze and apply these data.Currently, all kinds of search engines to Html texts by segment etc. processing come supported web page inspection
Rope.
In government procurement field, as the further increase government information of government agencies at all levels discloses dynamics, government website hair
Cloth data more frequently, comprising information are more enriched, but the support due to lacking specific transactions model and analytic method, portions at different levels
The government procurement bulletin of door lacks that unified format, form of presentation are different, and existing search engine is only complete multiple by these bulletins
System is got off, and basic inquiry service is provided by full-text search, can not be to the acceptance of the bid of crawl due to not establishing structural model
Announce Html documents carry out depth excavation and utilization, acceptance of the bid bulletin full-text search result often with user demand gap very
Greatly.
Acceptance of the bid record is the data of most worthy in government procurement, including:Supplier, attached bag number, adopts the acceptance of the bid amount of money
Purchase the attributes such as people, project name, expert.The analytic method of existing general government procurement acceptance of the bid bulletin Html documents is to shift to an earlier date
Safeguard a set of keyword for matching, such as:The keyword of supplier include " highest bidder ", " supplier ", " acceptance of the bid candidate ",
" offerer " etc..The underlying attributes positions such as supplier, the acceptance of the bid amount of money, the first candidate, attached bag number are positioned according to keyword,
It is parsed with conventional keyword match method, success resolution factor needs to use more advanced parsing side often less than 50%
Method promotes resolution factor.
Invention content
In view of the deficiencies in the prior art with the needs of practical application, the purpose of the present invention is to provide a kind of buyings
Acceptance of the bid data efficient, accurate analytic method.
To achieve the above object, the technical solution adopted by the present invention is as follows:
A kind of analytic method of buying acceptance of the bid data, includes the following steps:
(1) the criteria table data in Html buying acceptance of the bid bulletin texts to be resolved and non-standard table number are isolated
According to;
(2) the acceptance of the bid bulletins attribute for announcing text is got the bid respectively to criteria table data and non-standard table number according to buying
According to being parsed, acceptance of the bid record is obtained;
(3) in the acceptance of the bid record storage to database for obtaining parsing.
Further, the analytic method of a kind of buying acceptance of the bid data as described above, in step (2), the acceptance of the bid bulletin category
Property include project name, supplier, acceptance of the bid the amount of money, purchaser and first acceptance of the bid candidate mark.
Further, the analytic method of a kind of buying acceptance of the bid data as described above, in step (1), the criteria table number
According to refer in list data specify acceptance of the bid bulletins attribute be located at same a line, the data of different lines in table;It is described it is specified in
Mark bulletins attribute includes supplier and the acceptance of the bid amount of money.
Further, the analytic method of a kind of buying acceptance of the bid data as described above in step (1), is isolated to be resolved
Criteria table data in Html buying acceptance of the bid bulletin texts and non-standard list data, including:
1) all tables in Html buying acceptance of the bid bulletin texts are isolated according to the form tag table of Html texts;
All tables include sub-table nested in table;
2) judge whether the acceptance of the bid bulletins attribute specified described in table meets same a line and different lines positioned at table, if
It is, it is determined that table is criteria table, if not, it is determined that table is non-standard table.
Further, the analytic method of a kind of buying acceptance of the bid data as described above, in step (2), to criteria table data
It is parsed, including:
1. obtaining the row number for bulletins attribute of respectively getting the bid in criteria table data;
2. every a line in circular treatment table obtains each acceptance of the bid per a line according to the row number of each acceptance of the bid bulletins attribute
The value of bulletins attribute obtains the acceptance of the bid record of every a line.
Further, the analytic method of a kind of buying acceptance of the bid data as described above in step (2), is parsed using text string
Method parses non-standard list data, including:
For a non-standard list data, with the associated prefixes or suffix of get the bid bulletins attribute or bulletins attribute of getting the bid
It is retrieved in non-standard list data for keyword, obtains the attribute value of each acceptance of the bid bulletins attribute, announced according to each acceptance of the bid
Attribute and its attribute are worth to acceptance of the bid record.
Further, the analytic method of a kind of buying acceptance of the bid data as described above, in step (2), to criteria table data
When being parsed with non-standard list data, parsed from the data of innermost layer nested tables according to the nesting order of table,
After the parsing for completing one layer of list data, the list data of respective layer is deleted.
Further, the analytic method of a kind of buying acceptance of the bid data as described above, it is to be resolved isolating in step (1)
Html buying acceptance of the bid bulletin text in criteria table data and non-standard list data before, further include:
Html buying acceptance of the bid bulletin texts to be resolved are pre-processed, are deleted in Html buying acceptance of the bid bulletin texts
The data unrelated with acceptance of the bid content.
Further, in step (3), parsing is obtained for the analytic method of a kind of buying acceptance of the bid data as described above
Before record storage to database of getting the bid, further include:
According to the attribute value of acceptance of the bid bulletins attribute, judge whether acceptance of the bid record is effective, if so, retain acceptance of the bid record,
If it is not, then deleting acceptance of the bid record.
Further, in step (3), parsing is obtained for the analytic method of a kind of buying acceptance of the bid data as described above
Before record storage to database of getting the bid, further include:
Identifying for affiliated table, which is recorded, according to acceptance of the bid judges the weight in recording of getting the bid with the attribute value of its bulletins attribute of getting the bid
Multiple record, and carry out duplicate removal processing;Judgment mode is:If the mark of table is identical belonging to two acceptance of the bid records and its acceptance of the bid is announced
The attribute value of attribute is identical, then judges that two acceptance of the bid records repeat.
The beneficial effects of the present invention are:The analytic method of buying acceptance of the bid data provided by the invention, can will be non-structural
The Html formats buying acceptance of the bid bulletin of change is converted into the acceptance of the bid record of structuring, the analytic method by by criteria table data and
Non-standard list data carries out separation parsing using different analysis modes, effectively increases resolution factor, for buying acceptance of the bid bulletin
The depth of data is excavated and is utilized and provides the foundation.
Description of the drawings
Fig. 1 is a kind of flow chart of the analytic method of buying acceptance of the bid data in specific implementation mode;
Fig. 2 is the process of analysis figure of specific implementation mode Plays list data;
Fig. 3 is the process of analysis figure of non-standard list data in specific implementation mode;
Fig. 4 is the schematic diagram of criteria table data;
Fig. 5 is the schematic diagram of non-standard list data.
Specific implementation mode
The present invention is described in further detail with specific implementation mode with reference to the accompanying drawings of the specification.
Fig. 1 shows a kind of flow chart of the analytic method of buying acceptance of the bid data, the party in the specific embodiment of the invention
Method may comprise steps of:
Step S100:Isolate the list data in Html buying acceptance of the bid bulletin texts to be resolved and non-standard table number
According to;
Html buyings acceptance of the bid bulletin text to be resolved is pre-processed first, delete in buying acceptance of the bid bulletin text with
The unrelated data for content of getting the bid.It is purchased in acceptance of the bid bulletin text in actual Html, has many and actual acceptance of the bid content
Unrelated data, such as show in relation to the data of (font, size, the color of text) or other be not related in essence with text
The content of data is marked, therefore the deletion of these data can be carried out in advance, to improve the efficiency of follow-up data processing.
In practical applications, Html buying acceptance of the bid bulletin texts can be found out according to the display class label in Html texts
In only shown with data in relation to, with the unrelated data of acceptance of the bid content, delete Html buying acceptances of the bid announce in text with acceptance of the bid
The unrelated data of content.Wherein, the display class label includes but not limited to for defining the font of word, size and color
<font>Label, for the section in definition document<span>Label and without meaning space etc..
In present embodiment, the list data includes criteria table data and non-standard list data;The standard scale
Lattice data refer to that the acceptance of the bid bulletins attribute specified in list data is located at same a line, the data of different lines in table, in specified
Mark bulletins attribute includes but not limited to supplier and the acceptance of the bid amount of money.The list data as shown in Fig. 4 is criteria table number
According to offerer's title, that is, supplier and bid amount are to get the bid the amount of money positioned at same a line and difference positioned at table in the table
Row.Data except non-standard list data, that is, criteria table data.
In present embodiment, the acceptance of the bid bulletins attribute include project name, supplier, acceptance of the bid the amount of money, purchaser and
One acceptance of the bid candidate's mark etc., it should be noted that in different acceptance of the bid bulletins, the title for bulletins attribute of getting the bid might have
Institute is different, and acceptance of the bid bulletins attribute can be named according to actual conditions, and if supplier may also be known as offerer, the acceptance of the bid amount of money may
Referred to as bid amount.
In present embodiment, the criteria table data and standard in Html buying acceptance of the bid bulletin texts to be resolved are isolated
The concrete mode of list data is:
1) all tables in Html buying acceptance of the bid bulletin texts are isolated according to the form tag table of Html texts;
All tables include sub-table nested in table;
2) judge whether the acceptance of the bid bulletins attribute specified described in table meets same a line and different lines positioned at table, if
It is, it is determined that table is criteria table, and the data in criteria table are criteria table data, if not, it is determined that table is non-
Criteria table, the data in non-standard table are non-standard list data.
In practical applications, by nesting<table>(<table>Contain<table>) separation be independent N number of son<table
>.Per height<table>Label be all with "<Table " beginning with "</table>" terminate, by keyword "<table>" and
“</table>" position and count, nested son is found out successively<table>Label is detached one by one, obtains each height<
table>The complete character string (list data) of label parses recursive algorithm as suction parameter recursive call data.
All (embedded with not comprising nested)<table>After tag processes, the public text of acceptance of the bid is completed
The separation of Plays list data and non-standard list data, criteria table data as shown in Figure 4 are non-as shown in Figure 5
Criteria table data.In practical applications, the acceptance of the bid bulletins attribute specified with specific reference to which determines criteria table and nonstandard
Whether quasi- table can be selected as needed, in present embodiment, by judging supplier and the acceptance of the bid amount of money in same a line
To determine whether being criteria table, if be in two necessary conditions of same a line:
1) in different lines:Between the position A of supplier and the position B of bid amount comprising cell label "</td>”;
2) in same a line:Between the position A of supplier and the position B of bid amount do not include row label "</tr>”.
Table as shown in Figure 4, wherein offerer's title (supplier) and bid amount meet above-mentioned two necessity item
Part then judges the list data in Fig. 4 for criteria table data.
Step S200:According to buying get the bid bulletin text acceptance of the bid bulletins attribute respectively to criteria table data and non-standard
List data is parsed, and acceptance of the bid record is obtained;
After isolating criteria table data and the non-standard list data in text, respectively to criteria table data and nonstandard
Quasi- list data is parsed.Since there are nest relations in list data, to criteria table data and non-standard list data
It when being parsed, is parsed from the data of innermost layer nested tables according to the nesting order of table, completes one layer of list data
Parsing after, delete the list data of respective layer, parse the outer form data of this layer again later.Using parsing from inside to outside
Mode can ensure not interfered by nested tables label when outer form tag processes, and acceptance of the bid record is obtained with more accurate.
In present embodiment, the concrete mode parsed to criteria table data is as shown in Fig. 2, included the following steps:
1. obtaining the row number for bulletins attribute of respectively getting the bid in criteria table data;With the entitled keyword for bulletins attribute of getting the bid
The accurate profit number residing for each attribute, list data as shown in Figure 4 are oriented in retrieval in list data, and supplier's row number is
2, the row number of the first candidate mark is 5;
2. every a line in circular treatment table obtains each acceptance of the bid per a line according to the row number of each acceptance of the bid bulletins attribute
The value of bulletins attribute obtains the acceptance of the bid record of every a line.Every a line in criteria table data corresponds to an acceptance of the bid record.
The second row in criteria table data as shown in Figure 4, the acceptance of the bid parsed are recorded as:Supplier:Guangzhou
Xingu Electronic Science and Technology Co., Ltd. of city, the acceptance of the bid amount of money are 246000, which is the first candidate.
In present embodiment, non-standard list data is parsed using text string analytic method, the flow of parsing is such as
Shown in Fig. 3, specifically include:
For a non-standard list data, with the associated prefixes or suffix of get the bid bulletins attribute or bulletins attribute of getting the bid
It is retrieved in non-standard list data for keyword, obtains the attribute value of each acceptance of the bid bulletins attribute, announced according to each acceptance of the bid
Attribute and its attribute are worth to an acceptance of the bid record.
In practical applications, it is necessary first to orient the title of supplier, can with " supplier " or " offerer ",
" quotation company " etc. is retrieved for keyword in subpacket data, if can not find, can be according to the association of supplier before
Sew or suffix carries out matched and searched, for example, had as previous in the title of supplier ":" etc. special prefix or one in title
As have suffix such as " companies ", can according to these prefix or suffixes carry out supplier retrieval position.Complete determining for supplier
Behind position, the acceptance of the bid amount of money and other acceptance of the bid bulletins attributes are further parsed, it is similar to supplier's positioning, it can be with bulletin of getting the bid
Title that attribute is (such as " the acceptance of the bid amount of money ") is that keyword is directly retrieved, if retrieval is less than can be according to relevant association
Prefix or suffix is searched (such as the association suffix " volume " of " the acceptance of the bid amount of money ", " member ", " valence ").
In the parsing for completing criteria table data and non-standard list data, after obtaining acceptance of the bid record, in order to ensure to get the bid
The integrality of record can also obtain the entry name of acceptance of the bid record by conventional keyword match method in practical applications
Other relevant informations such as title, expert.
Step S300:In the acceptance of the bid record storage to database that parsing is obtained.
By the parsing in step S200, after the acquisition for completing acceptance of the bid record, acceptance of the bid data are stored into database.
Before actual storage, in order to avoid there is the phenomenon that description repeats in acceptance of the bid data, centering label record is needed
Validity is judged, and carries out the duplicate removal processing of acceptance of the bid record.
In present embodiment, it is underway label record Effective judgement when, can according to acceptance of the bid bulletins attribute attribute
Value judges whether acceptance of the bid record is effective, if so, retaining acceptance of the bid record, if it is not, then deleting acceptance of the bid record.For example, passing through
Judge that supplier verifies whether effectively or whether the acceptance of the bid amount of money is 0 or whether is the modes such as the first candidate supplier to judge to remember
Whether record is effective, generally, if supplier and the acceptance of the bid amount of money do not have apparent problem, it may be considered that an acceptance of the bid record is effective
Acceptance of the bid record.
In present embodiment, it is underway label record duplicate removal processing when, according to acceptance of the bid record belonging to table mark and
The attribute value of its bulletins attribute of getting the bid judges the repetition record in acceptance of the bid record, and carries out duplicate removal processing;Judgment mode is:If two
It is a acceptance of the bid record belonging to table mark it is identical and its get the bid bulletins attribute attribute value it is identical, then judge two acceptance of the bid record weight
It is multiple.Wherein, the mark of the table is for one table of unique identification, in the non-standard list data as shown in Fig. 5, packet
Included three non-standard list datas, table belonging to three non-standard list datas mark is non-be not " packet one ", " wrapping two " and
" packet three ", in general, in the acceptance of the bid bulletin text of Html formats, each table is identified with it, if not provided, this implementation
Can give tacit consent in mode is that each table distributes a unique identification number.
After the validity and duplicate removal processing for completing acceptance of the bid record, the relevant information of effective acceptance of the bid record is saved in number
According in library.
The analytic method of buying acceptance of the bid data provided in present embodiment can get the bid non-structured buying public
The acceptance of the bid record that announcement (Html acceptances of the bid text) is converted into structuring is stored, and it is public that this method is particularly suitable for government procurement acceptance of the bid
The parsing of announcement can effectively identify that 90% or more government procurement is got the bid using this method and record, in greatly improving in practice
Mark the efficiency and accuracy rate of data parsing.
Obviously, various changes and modifications can be made to the invention without departing from essence of the invention by those skilled in the art
God and range.In this way, if these modifications and changes of the present invention belongs to the range of the claims in the present invention and its equivalent technology
Within, then the present invention is also intended to include these modifications and variations.
Claims (7)
1. a kind of analytic method of buying acceptance of the bid data, includes the following steps:
(1) the criteria table data in Htm l buying acceptance of the bid bulletin texts to be resolved and non-standard list data, institute are isolated
It refers to that the acceptance of the bid bulletins attribute specified in list data is located at same a line, the data of different lines in table to state criteria table data;
In step (1), the criteria table data in Htm l buying acceptance of the bid bulletin texts to be resolved and non-standard table are isolated
Data, including:
1) all tables in Htm l buying acceptance of the bid bulletin texts are isolated according to the form tag tab l e of Htm l texts;
All tables include sub-table nested in table;
2) judge whether the acceptance of the bid bulletins attribute specified described in table meets same a line and different lines positioned at table, if so,
Then determine that table is criteria table, if not, it is determined that table is non-standard table;
(2) according to buying get the bid bulletin text acceptance of the bid bulletins attribute respectively to criteria table data and non-standard list data into
Row parsing obtains acceptance of the bid record;
In step (2), criteria table data are parsed, including:
1. obtaining the row number for bulletins attribute of respectively getting the bid in criteria table data;
2. every a line in circular treatment table obtains each acceptance of the bid per a line and announces according to the row number of each acceptance of the bid bulletins attribute
The value of attribute obtains the acceptance of the bid record of every a line;
In step (2), non-standard list data is parsed using text string analytic method, including:
For a non-standard list data, associated prefixes or suffix with get the bid bulletins attribute or bulletins attribute of getting the bid are to close
Key word is retrieved in non-standard list data, the attribute value of each acceptance of the bid bulletins attribute is obtained, according to each acceptance of the bid bulletins attribute
And its attribute is worth to acceptance of the bid record;
(3) in the acceptance of the bid record storage to database for obtaining parsing.
2. a kind of analytic method of buying acceptance of the bid data according to claim 1, it is characterised in that:It is described in step (2)
Acceptance of the bid bulletins attribute includes project name, supplier, the acceptance of the bid amount of money, purchaser and first acceptance of the bid candidate's mark.
3. a kind of analytic method of buying acceptance of the bid data according to claim 2, it is characterised in that:It is described in step (1)
Specified acceptance of the bid bulletins attribute includes supplier and the acceptance of the bid amount of money.
4. a kind of analytic method of buying acceptance of the bid data according to claim 1, it is characterised in that:In step (2), to mark
When quasi- list data and non-standard list data are parsed, according to the nesting order of table from the data of innermost layer nested tables
It is parsed, after the parsing for completing one layer of list data, deletes the list data of respective layer.
5. a kind of analytic method of buying acceptance of the bid data according to claim 1, it is characterised in that:In step (1), dividing
Before separating out criteria table data and the non-standard list data in Htm l buying acceptance of the bid bulletin texts to be resolved, further include:
Htm l buyings acceptance of the bid bulletin text to be resolved is pre-processed, delete in Htm l buying acceptance of the bid bulletin texts with
The unrelated data of content of getting the bid.
6. a kind of analytic method of buying acceptance of the bid data according to claim 1, it is characterised in that:In step (3), it will solve
Before analysing obtained acceptance of the bid record storage to database, further include:
According to the attribute value of acceptance of the bid bulletins attribute, judge whether acceptance of the bid record is effective, if so, retain acceptance of the bid record, if it is not,
Then delete acceptance of the bid record.
7. a kind of analytic method of buying acceptance of the bid data according to claim 1, it is characterised in that:In step (3), it will solve
Before analysing obtained acceptance of the bid record storage to database, further include:
According to the repetition note in the attribute value judgement acceptance of the bid record of the mark of table belonging to acceptance of the bid record and its bulletins attribute of getting the bid
Record, and carry out duplicate removal processing;Judgment mode is:If two acceptance of the bid record belonging to tables mark it is identical and its get the bid bulletins attribute
Attribute value it is identical, then judge two acceptance of the bid record repeat.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510683420.9A CN105389338B (en) | 2015-10-20 | 2015-10-20 | A kind of analytic method of buying acceptance of the bid data |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510683420.9A CN105389338B (en) | 2015-10-20 | 2015-10-20 | A kind of analytic method of buying acceptance of the bid data |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105389338A CN105389338A (en) | 2016-03-09 |
CN105389338B true CN105389338B (en) | 2018-09-04 |
Family
ID=55421628
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510683420.9A Active CN105389338B (en) | 2015-10-20 | 2015-10-20 | A kind of analytic method of buying acceptance of the bid data |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105389338B (en) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106250456A (en) * | 2016-07-28 | 2016-12-21 | 浪潮软件集团有限公司 | Bid winning announcement extraction method and device |
CN110069622A (en) * | 2017-08-01 | 2019-07-30 | 武汉楚鼎信息技术有限公司 | A kind of personal share bulletin abstract intelligent extract method |
CN107832381A (en) * | 2017-10-30 | 2018-03-23 | 北京大数元科技发展有限公司 | A kind of government procurement acceptance of the bid bulletin judging method and system from internet collection |
CN113761926A (en) * | 2021-08-02 | 2021-12-07 | 紫金诚征信有限公司 | Method for extracting bid amount in winning bulletin of enterprise or government based on regular |
CN114357054B (en) * | 2022-03-10 | 2022-06-03 | 广州宸祺出行科技有限公司 | Method and device for processing unstructured data based on ClickHouse |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2001109741A (en) * | 1999-10-13 | 2001-04-20 | Toshiba Corp | Method and system for preparing html data |
CN101576891A (en) * | 2008-05-05 | 2009-11-11 | 北京瑞佳晨科技有限公司 | Method for analyzing web page form object nodes |
CN101908078A (en) * | 2010-08-30 | 2010-12-08 | 深圳市五巨科技有限公司 | Method and device for importing webpage data to EXCEL sheet |
CN102222227A (en) * | 2011-04-25 | 2011-10-19 | 中国华录集团有限公司 | Video identification based system for extracting film images |
CN104468194A (en) * | 2014-11-05 | 2015-03-25 | 北京星网锐捷网络技术有限公司 | Network device compatible method and forwarding server |
CN104717085A (en) * | 2013-12-16 | 2015-06-17 | 中国移动通信集团湖南有限公司 | Log parsing method and device |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8095403B2 (en) * | 2007-08-10 | 2012-01-10 | Kap Holdings, Llc | System and method for provision of maintenance information and products |
-
2015
- 2015-10-20 CN CN201510683420.9A patent/CN105389338B/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2001109741A (en) * | 1999-10-13 | 2001-04-20 | Toshiba Corp | Method and system for preparing html data |
CN101576891A (en) * | 2008-05-05 | 2009-11-11 | 北京瑞佳晨科技有限公司 | Method for analyzing web page form object nodes |
CN101908078A (en) * | 2010-08-30 | 2010-12-08 | 深圳市五巨科技有限公司 | Method and device for importing webpage data to EXCEL sheet |
CN102222227A (en) * | 2011-04-25 | 2011-10-19 | 中国华录集团有限公司 | Video identification based system for extracting film images |
CN104717085A (en) * | 2013-12-16 | 2015-06-17 | 中国移动通信集团湖南有限公司 | Log parsing method and device |
CN104468194A (en) * | 2014-11-05 | 2015-03-25 | 北京星网锐捷网络技术有限公司 | Network device compatible method and forwarding server |
Non-Patent Citations (1)
Title |
---|
基于多代理和XML的供应链集成体系结构研究;郭文才 等;《北京理工大学学报(刹_会科学版)》;20050630;第7卷(第3期);第69-72页 * |
Also Published As
Publication number | Publication date |
---|---|
CN105389338A (en) | 2016-03-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105389338B (en) | A kind of analytic method of buying acceptance of the bid data | |
CN103544255B (en) | Text semantic relativity based network public opinion information analysis method | |
JP5121146B2 (en) | Structured document management apparatus, structured document management program, and structured document management method | |
CN110781670B (en) | Chinese place name semantic disambiguation method based on encyclopedic knowledge base and word vectors | |
CN103064956A (en) | Method, computing system and computer-readable storage media for searching electric contents | |
CN111899089A (en) | Enterprise risk early warning method and system based on knowledge graph | |
CN103886020B (en) | A kind of real estate information method for fast searching | |
US20160110471A1 (en) | Method and system of intelligent generation of structured data and object discovery from the web using text, images, video and other data | |
CN107203526A (en) | A kind of query string semantic requirement analysis method and device | |
Banshal et al. | An altmetric analysis of scholarly articles from India | |
US20150100877A1 (en) | Method or system for automated extraction of hyper-local events from one or more web pages | |
CN111914539A (en) | Channel announcement information extraction method and system based on BilSTM-CRF model | |
Colavizza et al. | Citation mining of humanities journals: the progress to date and the challenges ahead | |
CN109101512B (en) | Construction method of legal database, legal data query method and device | |
Wang et al. | A web text mining approach for the evaluation of regional characteristics at the town level | |
CN117709347A (en) | Multi-mode content key job information timeliness error investigation method, device, equipment and medium | |
CA3063471A1 (en) | Automated classification of network-accessible content | |
CN113377739A (en) | Knowledge graph application method, knowledge graph application platform, electronic equipment and storage medium | |
CN111858938B (en) | Method and device for extracting referee document tag | |
CN113626536B (en) | News geocoding method based on deep learning | |
Oliveira et al. | Gazetteer enrichment for addressing urban areas: A case study | |
JP2010224667A (en) | Device and method for supporting character input | |
JP2000090093A (en) | Method and system for full-text retrieval and record medium recording full-text retrieval program | |
Apostolova et al. | Digital leafleting: Extracting structured data from multimedia online flyers | |
KR101482012B1 (en) | Advertisement method and system for displaying context advertisement |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CP01 | Change in the name or title of a patent holder | ||
CP01 | Change in the name or title of a patent holder |
Address after: 100094 2F, building 11, UFIDA Software Park, 68 Beiqing Road, Haidian District, Beijing Patentee after: Beijing UYU Government Software Co.,Ltd. Address before: 100094 2F, building 11, UFIDA Software Park, 68 Beiqing Road, Haidian District, Beijing Patentee before: YONYOU GOVERNMENT AFFAIRS SOFTWARE Co.,Ltd. |