CN105389338B - A method for parsing procurement winning-bid data - Google Patents

Publication number
CN105389338B
CN105389338B CN201510683420.9A CN201510683420A CN105389338B CN 105389338 B CN105389338 B CN 105389338B CN 201510683420 A CN201510683420 A CN 201510683420A CN 105389338 B CN105389338 B CN 105389338B
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510683420.9A
Other languages
Chinese (zh)
Other versions
CN105389338A (en)
Inventor
陈国强
姬永杰
朱培冬
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing UYU Government Software Co.,Ltd.
Original Assignee
BEIJING UFIDA SOFTWARE CO LTD

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25 Integrating or interfacing systems involving database management systems
    • G06F16/254 Extract, transform and load [ETL] procedures, e.g. ETL data flows in data warehouses

Abstract

The invention discloses a method for parsing procurement winning-bid data and relates to the field of ETL (extract, transform, load) in data warehouse technology. The method comprises: separating the standard table data and the non-standard table data in the HTML procurement winning-bid announcement text to be parsed; parsing the standard table data and the non-standard table data separately, according to the winning-bid announcement attributes of the announcement text, to obtain winning-bid records; and storing the winning-bid records obtained by parsing in a database. By handling the standard table data and the non-standard table data in the announcement text separately, the method parses procurement winning-bid data efficiently and accurately and provides a foundation for in-depth mining and use of that data.

Description

A method for parsing procurement winning-bid data
Technical field
The present invention relates to the field of ETL (extract, transform, load) in data warehouse technology, and in particular to a method for parsing procurement winning-bid data.
Background
With the rapid development of Internet technology, Internet users of all kinds publish large numbers of HTML (HyperText Markup Language) documents, pictures, videos and other files on the Web every day, and crawler engines continuously crawl, analyze and apply these data from websites. At present, search engines support web page retrieval by processing HTML text, for example through word segmentation.
In the field of government procurement, governments at all levels are disclosing more information, so government websites publish data more frequently and with richer content. However, for lack of a specific business model and parsing method, the procurement announcements of different departments have no unified format and are worded differently. Existing search engines simply copy these announcements in full and offer a basic query service through full-text search. Because no structural model is established, the crawled winning-bid announcement HTML documents cannot be mined or exploited in depth, and full-text search results for winning-bid announcements often fall far short of what users need.
The winning-bid record is the most valuable data in government procurement. Its attributes include the supplier, winning bid amount, sub-package number, purchaser, project name, and experts. The existing general method for parsing government procurement winning-bid announcement HTML documents maintains a set of keywords for matching in advance; for example, the keywords for the supplier include "winning bidder", "supplier", "winning-bid candidate", and "bidder". The positions of key attributes such as the supplier, winning bid amount, first candidate and sub-package number are located by keyword. With this conventional keyword matching method the parsing success rate is often below 50%, so a more advanced parsing method is needed to raise it.
Summary of the invention
In view of the deficiencies of the prior art and the needs of practical application, the object of the present invention is to provide an efficient and accurate method for parsing procurement winning-bid data.
To achieve the above object, the present invention adopts the following technical solution:
A method for parsing procurement winning-bid data, comprising the following steps:
(1) separating the standard table data and the non-standard table data in the HTML procurement winning-bid announcement text to be parsed;
(2) parsing the standard table data and the non-standard table data separately, according to the winning-bid announcement attributes of the procurement winning-bid announcement text, to obtain winning-bid records;
(3) storing the winning-bid records obtained by parsing in a database.
Further, in the method described above, in step (2), the winning-bid announcement attributes include the project name, supplier, winning bid amount, purchaser, and first winning-bid candidate flag.
Further, in the method described above, in step (1), standard table data refers to table data in which the specified winning-bid announcement attributes are located in the same row and in different columns of the table; the specified winning-bid announcement attributes include the supplier and the winning bid amount.
Further, in the method described above, in step (1), separating the standard table data and the non-standard table data in the HTML procurement winning-bid announcement text to be parsed comprises:
1) separating out all tables in the HTML procurement winning-bid announcement text according to the HTML table tag <table>; all tables include the sub-tables nested within tables;
2) judging whether the specified winning-bid announcement attributes in a table are located in the same row and in different columns of the table; if so, the table is determined to be a standard table; if not, the table is determined to be a non-standard table.
Further, in the method described above, in step (2), parsing the standard table data comprises:
1. obtaining the column number of each winning-bid announcement attribute in the standard table data;
2. looping over every row of the table and, according to the column number of each winning-bid announcement attribute, obtaining the value of each attribute in that row, thereby obtaining the winning-bid record of each row.
Further, in the method described above, in step (2), the non-standard table data is parsed using a text string parsing method, comprising:
for a piece of non-standard table data, searching the non-standard table data using, as keywords, the winning-bid announcement attributes or the association prefixes or suffixes of those attributes, to obtain the attribute value of each winning-bid announcement attribute, and obtaining a winning-bid record from the attributes and their values.
Further, in the method described above, in step (2), when the standard table data and the non-standard table data are parsed, parsing starts from the data of the innermost nested table and proceeds according to the nesting order of the tables; after the table data of one layer has been parsed, the table data of that layer is deleted.
Further, in the method described above, in step (1), before separating the standard table data and the non-standard table data in the HTML procurement winning-bid announcement text to be parsed, the method further comprises:
preprocessing the HTML procurement winning-bid announcement text to be parsed, deleting from it the data unrelated to the winning-bid content.
Further, in the method described above, in step (3), before the winning-bid records obtained by parsing are stored in the database, the method further comprises:
judging, according to the attribute values of the winning-bid announcement attributes, whether a winning-bid record is valid; if so, retaining the record; if not, deleting it.
Further, in the method described above, in step (3), before the winning-bid records obtained by parsing are stored in the database, the method further comprises:
identifying duplicate winning-bid records according to the identifier of the table each record belongs to and the attribute values of its winning-bid announcement attributes, and removing the duplicates; the judgment is: if two winning-bid records belong to tables with the same identifier and the attribute values of their winning-bid announcement attributes are identical, the two records are judged to be duplicates.
The beneficial effects of the present invention are as follows: the method for parsing procurement winning-bid data provided by the invention converts unstructured procurement winning-bid announcements in HTML format into structured winning-bid records. By separating standard table data from non-standard table data and parsing each with a different approach, the method effectively raises the parsing success rate and provides a foundation for in-depth mining and use of procurement winning-bid announcement data.
Brief description of the drawings
Fig. 1 is a flow chart of a method for parsing procurement winning-bid data in a specific embodiment;
Fig. 2 is a flow chart of parsing standard table data in the specific embodiment;
Fig. 3 is a flow chart of parsing non-standard table data in the specific embodiment;
Fig. 4 is a schematic diagram of standard table data;
Fig. 5 is a schematic diagram of non-standard table data.
Detailed description
The present invention is described in further detail below with reference to the accompanying drawings and a specific embodiment.
Fig. 1 shows a flow chart of a method for parsing procurement winning-bid data in a specific embodiment of the present invention. The method may comprise the following steps:
Step S100: separate the standard table data and the non-standard table data in the HTML procurement winning-bid announcement text to be parsed;
First, the HTML procurement winning-bid announcement text to be parsed is preprocessed to delete the data unrelated to the winning-bid content. An actual HTML procurement winning-bid announcement text contains a great deal of data unrelated to the actual winning-bid content, such as data concerning only text display (the font, size, and color of the text) or other content with no substantive relation to the winning-bid data. These data can therefore be deleted in advance to improve the efficiency of subsequent processing.
In practical applications, the data in the HTML procurement winning-bid announcement text that relate only to display and not to the winning-bid content can be found according to the display-class tags in the HTML text and then deleted. The display-class tags include, but are not limited to, the <font> tag, which defines the font, size and color of text, the <span> tag, which defines a section of the document, and meaningless spaces.
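The preprocessing described above might be sketched as follows. This is a minimal illustration, not the patent's implementation: the function name `strip_display_markup`, the regular expressions, and the choice to treat `&nbsp;` as a meaningless space are all assumptions.

```python
import re

# Hypothetical patterns: only <font> and <span> are named in the text;
# treating &nbsp; as a "meaningless space" is an assumption.
_DISPLAY_TAG = re.compile(r"</?(?:font|span)\b[^>]*>", re.IGNORECASE)
_NBSP = re.compile(r"&nbsp;|\u00a0")
_SPACES = re.compile(r"[ \t\r\n]+")

def strip_display_markup(html: str) -> str:
    """Drop display-only tags (keeping their text) and collapse whitespace."""
    html = _DISPLAY_TAG.sub("", html)   # remove the tags, keep the content
    html = _NBSP.sub(" ", html)         # non-breaking spaces become spaces
    return _SPACES.sub(" ", html).strip()
```

Only the tags are removed; the text they wrap is kept, so attribute values such as supplier names survive for the later parsing steps.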
In this embodiment, the table data comprise standard table data and non-standard table data. Standard table data refers to table data in which the specified winning-bid announcement attributes are located in the same row and in different columns of the table; the specified attributes include, but are not limited to, the supplier and the winning bid amount. The table data shown in Fig. 4 are standard table data: in that table, the bidder name (that is, the supplier) and the bid amount (that is, the winning bid amount) are in the same row and in different columns. Non-standard table data are the table data other than standard table data.
In this embodiment, the winning-bid announcement attributes include the project name, supplier, winning bid amount, purchaser, first winning-bid candidate flag, and so on. Note that the names of these attributes may differ between announcements, and an attribute can be named according to the actual situation; for example, the supplier may also be called the bidder, and the winning bid amount may be called the bid amount.
In this embodiment, the standard table data and the non-standard table data in the HTML procurement winning-bid announcement text to be parsed are separated as follows:
1) all tables in the HTML procurement winning-bid announcement text are separated out according to the HTML table tag <table>; all tables include the sub-tables nested within tables;
2) it is judged whether the specified winning-bid announcement attributes in a table are located in the same row and in different columns of the table; if so, the table is determined to be a standard table and the data in it are standard table data; if not, the table is determined to be a non-standard table and the data in it are non-standard table data.
In practical applications, a nested <table> (a <table> that contains another <table>) is separated into N independent child <table> elements. Each child <table> tag begins with "<table" and ends with "</table>". By locating and counting the keywords "<table>" and "</table>", the nested child <table> tags are found in turn and separated one by one, yielding the complete character string (table data) of each child <table> tag, which is passed as the input parameter in a recursive call to the data parsing algorithm.
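One way to locate and pair the opening and closing table tags is sketched below. It is an assumption-laden illustration: the text describes a recursive call, whereas this sketch uses an explicit stack of open-tag offsets, and the name `split_tables` is hypothetical.

```python
import re

_TABLE_TAG = re.compile(r"<table\b[^>]*>|</table\s*>", re.IGNORECASE)

def split_tables(html: str) -> list[str]:
    """Return every <table>...</table> substring, nested tables included.

    A table is emitted when its closing tag is reached, so an inner table
    always appears in the result before the table that contains it.
    """
    stack = []    # start offsets of the <table> tags that are still open
    tables = []
    for m in _TABLE_TAG.finditer(html):
        if m.group().startswith("</"):
            if stack:  # ignore a stray closing tag with no opener
                tables.append(html[stack.pop():m.end()])
        else:
            stack.append(m.start())
    return tables
```

Each returned string is the complete text of one table, which is the form the later classification and parsing steps take as input.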
After all <table> tags (nested and non-nested) have been processed, the separation of the standard table data and the non-standard table data in the winning-bid announcement text is complete; Fig. 4 shows standard table data and Fig. 5 shows non-standard table data. In practical applications, which specified winning-bid announcement attributes are used to distinguish standard tables from non-standard tables can be chosen as needed. In this embodiment, a table is judged to be a standard table according to whether the supplier and the winning bid amount are in the same row, subject to two necessary conditions:
1) in different columns: a cell tag "</td>" lies between position A of the supplier and position B of the bid amount;
2) in the same row: no row tag "</tr>" lies between position A of the supplier and position B of the bid amount.
In the table shown in Fig. 4, the bidder name (supplier) and the bid amount satisfy both necessary conditions, so the table data in Fig. 4 are judged to be standard table data.
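The two necessary conditions can be checked directly on the table's HTML string, as in the sketch below. The keyword tuples are illustrative English stand-ins for the attribute names an announcement might use, and `is_standard_table` and `_first_pos` are hypothetical names.

```python
def _first_pos(text: str, keywords) -> int:
    """Earliest offset at which any keyword occurs, or -1 if none does."""
    hits = [p for p in (text.find(k) for k in keywords) if p >= 0]
    return min(hits) if hits else -1

def is_standard_table(table_html: str,
                      supplier_keys=("supplier", "bidder"),
                      amount_keys=("bid amount",)) -> bool:
    text = table_html.lower()
    a = _first_pos(text, supplier_keys)   # position A of the supplier
    b = _first_pos(text, amount_keys)     # position B of the bid amount
    if a < 0 or b < 0:
        return False
    between = text[min(a, b):max(a, b)]
    # Condition 1: a "</td>" between A and B (different columns).
    # Condition 2: no "</tr>" between A and B (same row).
    return "</td>" in between and "</tr>" not in between
```

The check follows the text in looking only for "</td>"; a table whose header row uses <th> cells would need the first condition widened, which is left out here.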
Step S200: parse the standard table data and the non-standard table data separately, according to the winning-bid announcement attributes of the procurement winning-bid announcement text, to obtain winning-bid records;
After the standard table data and the non-standard table data in the text have been separated, each is parsed in turn. Because table data can be nested, parsing starts from the data of the innermost nested table and proceeds according to the nesting order of the tables: once the table data of one layer has been parsed, the table data of that layer is deleted, and the table data of the enclosing layer is parsed next. Parsing from the inside out ensures that the processing of an outer table tag is not disturbed by nested table tags, so winning-bid records are obtained more accurately.
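The inside-out order with deletion of each parsed layer might look like the following sketch. The regular expression that finds a table containing no further `<table` tag, the function name `parse_inside_out`, and the `parse_table` callback are assumptions introduced for illustration.

```python
import re

# A <table> element whose body contains no further "<table" tag,
# that is, a table that is currently innermost.
_INNERMOST = re.compile(
    r"<table\b[^>]*>(?:(?!<table\b).)*?</table\s*>",
    re.IGNORECASE | re.DOTALL,
)

def parse_inside_out(html: str, parse_table):
    """Parse innermost tables first, deleting each one after it is parsed.

    parse_table is a caller-supplied function (hypothetical) that takes one
    table's HTML and returns a list of records.
    """
    records = []
    while True:
        m = _INNERMOST.search(html)
        if m is None:
            return records, html        # no tables left
        records.extend(parse_table(m.group()))
        html = html[:m.start()] + html[m.end():]   # delete the parsed layer
```

Because each parsed table is removed before the search repeats, an outer table is handed to `parse_table` only after its nested tables are gone.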
In this embodiment, the standard table data are parsed as shown in Fig. 2, in the following steps:
1. Obtain the column number of each winning-bid announcement attribute in the standard table data. The table data are searched using the names of the winning-bid announcement attributes as keywords to locate the exact column number of each attribute. In the table data shown in Fig. 4, the column number of the supplier is 2 and the column number of the first candidate flag is 5;
2. Loop over every row of the table and, according to the column number of each attribute, obtain the value of each winning-bid announcement attribute in that row, yielding the winning-bid record of each row. Each row of standard table data corresponds to one winning-bid record.
For the second row of the standard table data shown in Fig. 4, the parsed winning-bid record is: supplier: Guangzhou Xingu Electronic Science and Technology Co., Ltd.; winning bid amount: 246000; this supplier is the first candidate.
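The two steps can be sketched as below, assuming the table has a header row and no nested tables remain. The alias table `HEADER_ALIASES`, the regular expressions, and the function names are hypothetical; Fig. 4 is not reproduced here, so any table fed to this sketch is an invented stand-in.

```python
import re
from html import unescape

_ROW = re.compile(r"<tr\b[^>]*>(.*?)</tr\s*>", re.I | re.S)
_CELL = re.compile(r"<t[dh]\b[^>]*>(.*?)</t[dh]\s*>", re.I | re.S)
_TAG = re.compile(r"<[^>]+>")

# Hypothetical header keywords for each announcement attribute.
HEADER_ALIASES = {
    "supplier": ("supplier", "bidder"),
    "amount": ("bid amount",),
    "first_candidate": ("first candidate",),
}

def _rows(table_html):
    """Split a table into rows of plain-text cell values."""
    return [[unescape(_TAG.sub("", c)).strip() for c in _CELL.findall(r)]
            for r in _ROW.findall(table_html)]

def parse_standard_table(table_html):
    rows = _rows(table_html)
    if not rows:
        return []
    header = [h.lower() for h in rows[0]]
    # Step 1: find the (0-based) column index of each attribute by keyword.
    columns = {}
    for attr, aliases in HEADER_ALIASES.items():
        for i, h in enumerate(header):
            if any(a in h for a in aliases):
                columns[attr] = i
                break
    # Step 2: every data row yields one record, read by column index.
    return [{attr: row[i] for attr, i in columns.items() if i < len(row)}
            for row in rows[1:]]
```

Column indexes in the sketch are 0-based, so the supplier in column 2 and the first candidate flag in column 5 of the text correspond to indexes 1 and 4.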
In this embodiment, the non-standard table data are parsed using a text string parsing method; the parsing flow is shown in Fig. 3 and specifically comprises:
For a piece of non-standard table data, the data are searched using, as keywords, the winning-bid announcement attributes or the association prefixes or suffixes of those attributes, to obtain the attribute value of each winning-bid announcement attribute; one winning-bid record is then obtained from the attributes and their values.
In practical applications, the name of the supplier must be located first. The sub-package data can be searched using keywords such as "supplier", "bidder", or "quoting company". If nothing is found, a match can be sought according to the association prefix or suffix of the supplier: for example, the supplier name is usually preceded by a special prefix such as ":", or usually ends with a suffix such as "company", and the supplier can be located by searching for these prefixes or suffixes. After the supplier has been located, the winning bid amount and the other winning-bid announcement attributes are parsed in the same way: the name of the attribute (such as "winning bid amount") is searched for directly as a keyword, and if that fails, the search falls back to the relevant association prefix or suffix (for example, the association suffixes of "winning bid amount" include "amount", "yuan", and "price").
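A keyword search with a suffix fallback might be sketched as follows. The keyword tuples, the suffix pattern, the delimiters that end a value, and the names `parse_nonstandard` and `_value_after` are all assumptions; a real announcement corpus would need its own keyword lists.

```python
import re

_TAG = re.compile(r"<[^>]+>")
# Hypothetical keywords and association suffixes.
SUPPLIER_KEYS = ("supplier", "bidder", "quoting company")
AMOUNT_KEYS = ("winning bid amount", "bid amount")
_SUPPLIER_SUFFIX = re.compile(r"([^\n;；:：]*(?:Co\., ?Ltd\.?|Company|公司))")
_NUMBER = re.compile(r"\d[\d,]*(?:\.\d+)?")

def _value_after(text, keywords):
    """Text that follows the first keyword found, up to ';' or a line break."""
    low = text.lower()
    for key in keywords:
        pos = low.find(key)
        if pos >= 0:
            m = re.match(r"\s*[:：]?\s*([^;；\n]+)", text[pos + len(key):])
            if m:
                return m.group(1).strip()
    return None

def parse_nonstandard(table_html):
    text = _TAG.sub("\n", table_html)          # each tag becomes a line break
    supplier = _value_after(text, SUPPLIER_KEYS)
    if supplier is None:                       # fall back to the name suffix
        m = _SUPPLIER_SUFFIX.search(text)
        supplier = m.group(1).strip() if m else None
    m = _NUMBER.search(_value_after(text, AMOUNT_KEYS) or "")
    amount = float(m.group().replace(",", "")) if m else None
    return {"supplier": supplier, "amount": amount}
```

The supplier is located first and the amount second, mirroring the order in the text; attributes that cannot be found are left as `None` so a later validity check can reject the record.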
After the standard table data and the non-standard table data have been parsed and the winning-bid records obtained, other related information for the records, such as the project name and experts, can also be obtained in practice by the conventional keyword matching method, to ensure the completeness of the records.
Step S300: store the winning-bid records obtained by parsing in a database.
Once the winning-bid records have been obtained through the parsing in step S200, the winning-bid data are stored in the database.
Before the actual storage, to avoid duplicated entries in the winning-bid data, the validity of each winning-bid record is judged and duplicate records are removed.
In this embodiment, when judging the validity of a winning-bid record, whether the record is valid can be judged from the attribute values of its winning-bid announcement attributes; if it is valid the record is retained, otherwise it is deleted. For example, validity can be judged by checking whether the supplier is valid, whether the winning bid amount is 0, or whether the supplier is the first candidate. In general, if the supplier and the winning bid amount show no obvious problem, the record can be considered a valid winning-bid record.
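A validity check along these lines could be as small as the sketch below. The concrete rules (a supplier name of at least two characters, a strictly positive numeric amount) and the record keys are assumptions; the text only says the supplier and amount should show no obvious problem.

```python
def is_valid_record(record: dict) -> bool:
    """Keep a record only if its supplier and amount look plausible."""
    supplier = (record.get("supplier") or "").strip()
    amount = record.get("amount")
    if len(supplier) < 2:                 # missing or implausibly short name
        return False
    if not isinstance(amount, (int, float)) or amount <= 0:
        return False                      # missing, non-numeric, or zero
    return True
```

Usage would be a simple filter, for example `records = [r for r in records if is_valid_record(r)]`.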
In this embodiment, when removing duplicate winning-bid records, duplicates are identified according to the identifier of the table each record belongs to and the attribute values of its winning-bid announcement attributes. The judgment is: if two winning-bid records belong to tables with the same identifier and the attribute values of their winning-bid announcement attributes are identical, the two records are judged to be duplicates. The table identifier uniquely identifies a table. The non-standard table data shown in Fig. 5 comprise three pieces of non-standard table data, whose table identifiers are "Package 1", "Package 2" and "Package 3" respectively. In general, each table in an HTML winning-bid announcement text has its own identifier; if it does not, this embodiment can by default assign each table a unique identification number.
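The duplicate rule maps naturally onto a set of keys, as sketched here. The record keys `table_id`, `supplier` and `amount` are hypothetical, and using only those two attributes in the key is an assumption; more attributes can be added to the tuple.

```python
def deduplicate(records):
    """Drop records whose table identifier and attribute values repeat."""
    seen = set()
    unique = []
    for rec in records:
        key = (rec["table_id"], rec["supplier"], rec["amount"])
        if key not in seen:      # first time this combination appears
            seen.add(key)
            unique.append(rec)
    return unique
```

Records with the same supplier and amount but different table identifiers, such as "Package 1" and "Package 2", are kept as separate records.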
After the validity check and duplicate removal are complete, the relevant information of the valid winning-bid records is saved in the database.
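The storage step might be sketched with Python's built-in `sqlite3` module. The patent does not name a database or a schema, so the table name `winning_bid`, its columns, and the uniqueness constraint are assumptions made for illustration.

```python
import sqlite3

def store_records(conn: sqlite3.Connection, records) -> None:
    """Insert winning-bid records, silently skipping exact repeats."""
    conn.execute(
        """CREATE TABLE IF NOT EXISTS winning_bid (
               table_id TEXT,
               supplier TEXT,
               amount REAL,
               first_candidate INTEGER,
               UNIQUE (table_id, supplier, amount)
           )"""
    )
    conn.executemany(
        "INSERT OR IGNORE INTO winning_bid "
        "VALUES (:table_id, :supplier, :amount, :first_candidate)",
        records,   # an iterable of dicts keyed by the placeholder names
    )
    conn.commit()
```

For a quick trial, `conn = sqlite3.connect(":memory:")` gives a throwaway database; the uniqueness constraint is only a second line of defence behind the duplicate removal described above.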
The method for parsing procurement winning-bid data provided in this embodiment converts unstructured procurement winning-bid announcements (HTML winning-bid text) into structured winning-bid records for storage. The method is particularly suitable for parsing government procurement winning-bid announcements: in practice it effectively identifies more than 90% of government procurement winning-bid records, greatly improving the efficiency and accuracy of winning-bid data parsing.
Obviously, those skilled in the art can make various changes and modifications to the present invention without departing from its spirit and scope. If these modifications and variations fall within the scope of the claims of the present invention and their technical equivalents, the present invention is intended to include them as well.

Claims (7)

1. A method for parsing procurement winning-bid data, comprising the following steps:
(1) separating the standard table data and the non-standard table data in the HTML procurement winning-bid announcement text to be parsed, the standard table data being table data in which the specified winning-bid announcement attributes are located in the same row and in different columns of the table;
in step (1), separating the standard table data and the non-standard table data in the HTML procurement winning-bid announcement text to be parsed comprises:
1) separating out all tables in the HTML procurement winning-bid announcement text according to the HTML table tag <table>; all tables include the sub-tables nested within tables;
2) judging whether the specified winning-bid announcement attributes in a table are located in the same row and in different columns of the table; if so, determining that the table is a standard table; if not, determining that the table is a non-standard table;
(2) parsing the standard table data and the non-standard table data separately, according to the winning-bid announcement attributes of the procurement winning-bid announcement text, to obtain winning-bid records;
in step (2), parsing the standard table data comprises:
1. obtaining the column number of each winning-bid announcement attribute in the standard table data;
2. looping over every row of the table and, according to the column number of each winning-bid announcement attribute, obtaining the value of each attribute in that row, thereby obtaining the winning-bid record of each row;
in step (2), the non-standard table data is parsed using a text string parsing method, comprising:
for a piece of non-standard table data, searching the non-standard table data using, as keywords, the winning-bid announcement attributes or the association prefixes or suffixes of those attributes, to obtain the attribute value of each winning-bid announcement attribute, and obtaining a winning-bid record from the attributes and their values;
(3) storing the winning-bid records obtained by parsing in a database.
2. The method for parsing procurement winning-bid data according to claim 1, characterized in that: in step (2), the winning-bid announcement attributes include the project name, supplier, winning bid amount, purchaser, and first winning-bid candidate flag.
3. The method for parsing procurement winning-bid data according to claim 2, characterized in that: in step (1), the specified winning-bid announcement attributes include the supplier and the winning bid amount.
4. The method for parsing procurement winning-bid data according to claim 1, characterized in that: in step (2), when the standard table data and the non-standard table data are parsed, parsing starts from the data of the innermost nested table and proceeds according to the nesting order of the tables; after the table data of one layer has been parsed, the table data of that layer is deleted.
5. The method for parsing procurement winning-bid data according to claim 1, characterized in that: in step (1), before separating the standard table data and the non-standard table data in the HTML procurement winning-bid announcement text to be parsed, the method further comprises:
preprocessing the HTML procurement winning-bid announcement text to be parsed, deleting from it the data unrelated to the winning-bid content.
6. The method for parsing procurement winning-bid data according to claim 1, characterized in that: in step (3), before the winning-bid records obtained by parsing are stored in the database, the method further comprises:
judging, according to the attribute values of the winning-bid announcement attributes, whether a winning-bid record is valid; if so, retaining the record; if not, deleting it.
7. The method for parsing procurement winning-bid data according to claim 1, characterized in that: in step (3), before the winning-bid records obtained by parsing are stored in the database, the method further comprises:
identifying duplicate winning-bid records according to the identifier of the table each record belongs to and the attribute values of its winning-bid announcement attributes, and removing the duplicates; the judgment is: if two winning-bid records belong to tables with the same identifier and the attribute values of their winning-bid announcement attributes are identical, the two records are judged to be duplicates.
Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510683420.9A 2015-10-20 2015-10-20 A method for parsing procurement winning-bid data

Publications (2)

Publication Number Publication Date
CN105389338A CN105389338A (en) 2016-03-09
CN105389338B true CN105389338B (en) 2018-09-04

Family

ID=55421628

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510683420.9A Active CN105389338B (en) 2015-10-20 2015-10-20 A method for parsing procurement winning-bid data

Country Status (1)

Country Link
CN (1) CN105389338B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106250456A (en) * 2016-07-28 2016-12-21 浪潮软件集团有限公司 Bid winning announcement extraction method and device
CN110069622A (en) * 2017-08-01 2019-07-30 武汉楚鼎信息技术有限公司 A kind of personal share bulletin abstract intelligent extract method
CN107832381A (en) * 2017-10-30 2018-03-23 北京大数元科技发展有限公司 A kind of government procurement acceptance of the bid bulletin judging method and system from internet collection
CN114357054B (en) * 2022-03-10 2022-06-03 广州宸祺出行科技有限公司 Method and device for processing unstructured data based on ClickHouse

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001109741A (en) * 1999-10-13 2001-04-20 Toshiba Corp Method and system for preparing html data
CN101576891A (en) * 2008-05-05 2009-11-11 北京瑞佳晨科技有限公司 Method for analyzing web page form object nodes
CN101908078A (en) * 2010-08-30 2010-12-08 深圳市五巨科技有限公司 Method and device for importing webpage data to EXCEL sheet
CN102222227A (en) * 2011-04-25 2011-10-19 中国华录集团有限公司 Video identification based system for extracting film images
CN104468194A (en) * 2014-11-05 2015-03-25 北京星网锐捷网络技术有限公司 Network device compatible method and forwarding server
CN104717085A (en) * 2013-12-16 2015-06-17 中国移动通信集团湖南有限公司 Log parsing method and device

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8095403B2 (en) * 2007-08-10 2012-01-10 Kap Holdings, Llc System and method for provision of maintenance information and products

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
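Research on a supply chain integration architecture based on multi-agent systems and XML; Guo Wencai et al.; Journal of Beijing Institute of Technology (Social Sciences Edition); 2005-06-30; Vol. 7, No. 3; pp. 69-72 *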

Also Published As

Publication number Publication date
CN105389338A (en) 2016-03-09

Similar Documents

Publication Publication Date Title
CN103544255B (en) Text semantic relativity based network public opinion information analysis method
CN105389338B (en) A kind of analytic method of buying acceptance of the bid data
CN103793372A (en) Extracting semantic relationships from table structures in electronic documents
CN110781670B (en) Chinese place name semantic disambiguation method based on encyclopedic knowledge base and word vectors
CN103064956A (en) Method, computing system and computer-readable storage media for searching electric contents
US20110246462A1 (en) Method and System for Prompting Changes of Electronic Document Content
CN107657048A (en) user identification method and device
JP2007226452A (en) Structured document management device, structured document management program and structured document management method
CN111899089A (en) Enterprise risk early warning method and system based on knowledge graph
US20160110471A1 (en) Method and system of intelligent generation of structured data and object discovery from the web using text, images, video and other data
CN109871424B (en) Chinese academic research hotspot area information automatic extraction and map making method
US20150199402A1 (en) Computerized systems and methods for indexing and serving recurrent calendar events
Banshal et al. An altmetric analysis of scholarly articles from India
US20150100877A1 (en) Method or system for automated extraction of hyper-local events from one or more web pages
CN107203526A (en) A kind of query string semantic requirement analysis method and device
CN111914539A (en) Channel announcement information extraction method and system based on BilSTM-CRF model
Colavizza et al. Citation mining of humanities journals: the progress to date and the challenges ahead
CN109101512B (en) Construction method of legal database, legal data query method and device
US10504145B2 (en) Automated classification of network-accessible content based on events
CN110110044B (en) Method for enterprise information combination screening
US8626766B1 (en) Systems and methods for ranking and importing business listings
Oliveira et al. Gazetteer enrichment for addressing urban areas: A case study
JP2010224667A (en) Device and method for supporting character input
CN112767933B (en) Voice interaction method, device, equipment and medium of highway maintenance management system
KR101589626B1 (en) Method for establishing start-up data or management data from big data based on lexico semantic pattern analysis

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
CP01 Change in the name or title of a patent holder

Address after: 100094 2F, building 11, UFIDA Software Park, 68 Beiqing Road, Haidian District, Beijing

Patentee after: Beijing UYU Government Software Co.,Ltd.

Address before: 100094 2F, building 11, UFIDA Software Park, 68 Beiqing Road, Haidian District, Beijing

Patentee before: YONYOU GOVERNMENT AFFAIRS SOFTWARE Co.,Ltd.