CN108415968A - A kind of acquisition method of information on bidding - Google Patents

A kind of acquisition method of information on bidding Download PDF

Info

Publication number
CN108415968A
CN108415968A CN201810127175.7A CN201810127175A CN108415968A CN 108415968 A CN108415968 A CN 108415968A CN 201810127175 A CN201810127175 A CN 201810127175A CN 108415968 A CN108415968 A CN 108415968A
Authority
CN
China
Prior art keywords
information
bidding
web
web data
index
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810127175.7A
Other languages
Chinese (zh)
Inventor
陈晨
欧凌冰
龚澄源
郑红辉
刘蕊儿
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hunan Hui Ji Network Technology Co Ltd
Original Assignee
Hunan Hui Ji Network Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hunan Hui Ji Network Technology Co Ltd filed Critical Hunan Hui Ji Network Technology Co Ltd
Priority to CN201810127175.7A priority Critical patent/CN108415968A/en
Publication of CN108415968A publication Critical patent/CN108415968A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/08Auctions

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Finance (AREA)
  • Accounting & Taxation (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • Development Economics (AREA)
  • General Business, Economics & Management (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The present invention provides a kind of acquisition methods of information on bidding, including step:S100 information on bidding) is acquired:Using each bid net as information source, the web data of information on bidding bulletin is obtained, then transfers to web crawlers to carry out information collection this web data;S200 information on bidding) is extracted:Advertisement in the web data of web crawlers acquisition, friendly link are filtered out, the effective information in web data is then extracted, each information defines an index, all index compositions indicator lists;S300 information on bidding) is stored:Effective information uses the table in database to store, each index extracted uses a row storage in structured database, using web data, bidding information medium source, affiliated area, the industry, Homepage Publishing time, web retrieval time also as index, it is stored in the row of database one.

Description

A kind of acquisition method of information on bidding
Technical field
The present invention relates to bidding field, more particularly to a kind of acquisition method of information on bidding.
Background technology
Bid is a kind of commonly used in the world, organized market trading activity with bid, is engineering, cargo or clothes The dealing of business trade.Typically purchaser offers and requires in advance, and numerous trading objects is invited to participate in competition simultaneously Conclusion of the business person is therefrom selected according to the program of regulation.Bidding activity to break trade monopoly and barriers between the regions, increase economic efficiency, Ensure that project quality, prevention and reduction corruption etc. have played important function, has become the weight for promoting modern market system construction Want means.
Information-based development brings the new situation in bidding field, and originally bidder mainly obtains item by periodicals and magazines The mode of mesh bidding information is transformed to obtains the information for being suitble to oneself to submit a tender by internet site.One kind of bidder Way is to log in each bidding website of various regions to obtain information, and then being retrieved and being investigated one by one by artificial mode needs The information wanted.Another more efficient way is to log in some large-scale bidding information sites, passes through full-text search The bidding information that mode removal search needs.
However, this mode takes time and effort, while the included search of bidding website cannot guarantee that quality, this is resulted in Mistake misses important information.And by logging in large-scale bidding information site, it is gone by way of full-text search Search for the bidding information needed, it is matched of low quality also due to use fuzzy matching algorithm, caused by the nothing that searches out It imitates data and is more than valid data, more frighteningly miss more valuable informations.
Invention content
To solve the above-mentioned problems, the present invention provides a kind of acquisition methods of information on bidding, including step:
S100 information on bidding) is acquired:Using each bid net as information source, the web data of information on bidding bulletin is obtained, then Web crawlers is transferred to carry out information collection this web data;
S200 information on bidding) is extracted:Advertisement in the web data of web crawlers acquisition, friendly link are filtered out, so The effective information in web data is extracted afterwards, and each information defines an index, all index compositions indicator lists;
S300 information on bidding) is stored:Effective information uses the table in database to store, each index extracted uses A row storage in structured database, web data, bidding information medium source, affiliated area, the industry, webpage are sent out Cloth time, web retrieval time are also stored in the row of database one respectively as index.
Preferably, the step S100 acquisitions information on bidding further includes the screening of web data:Information on bidding is not timing Publication, the frequency acquisition of web crawlers can be higher than the newer maximum frequency of information on bidding, and acquisition is will appear in gatherer process To the situation of duplicate message;Web crawlers judges the address for the web data to be acquired the information with address only needs Acquisition is primary.
Preferably, the information collection frequency of the web crawlers is once a day.
Preferably, the effective information includes:Project name, project number, bid the time, submit a tender the time, the place of the bid submission, Open bid time, opening of bid place, budget amount, procurement unit, procurement unit contact person, procurement unit's contact method, procurement unit Address, agency, agency contact person, agency's contact method, agency address, buying content, attachment documents.
Preferably, the database can be any one in Access, sql server, mysql and oracle.
Beneficial effects of the present invention are:The present invention provides a kind of acquisition methods of information on bidding, select all kinds of bid nets It is information source to stand, and is acquired to webpage information using web crawlers, and extract effective information and stored, and bid letter is improved The acquisition quality and efficiency of breath.
Description of the drawings
In order to more clearly explain the embodiment of the invention or the technical proposal in the existing technology, to embodiment or will show below There is attached drawing needed in technology description to be briefly described, it should be apparent that, the accompanying drawings in the following description is only this Some embodiments of invention without having to pay creative labor, may be used also for those of ordinary skill in the art To obtain other attached drawings according to these attached drawings.
Fig. 1 show a kind of flow chart of the acquisition method of information on bidding provided by the invention;
Fig. 2 show the particular flow sheet of the step S100 of the acquisition method of information on bidding provided by the invention a kind of;
Fig. 3 show the particular flow sheet of the step S200 of the acquisition method of information on bidding provided by the invention a kind of.
Specific implementation mode
The technique effect of the design of the present invention, concrete structure and generation is carried out below with reference to embodiment and attached drawing clear Chu, complete description, to be completely understood by the purpose of the present invention, scheme and effect.It should be noted that the case where not conflicting Under, the features in the embodiments and the embodiments of the present application can be combined with each other.The identical attached drawing mark used everywhere in attached drawing Note indicates same or analogous part.
Fig. 1 show a kind of flow chart of the acquisition method of information on bidding provided by the invention.One according to the present invention Embodiment, a kind of acquisition method of information on bidding, including step:
S100 information on bidding) is acquired:Using each bid net as information source, the web data of information on bidding bulletin is obtained, then Web crawlers is transferred to carry out information collection this web data;
S200 information on bidding) is extracted:Advertisement in the web data of web crawlers acquisition, friendly link are filtered out, so The effective information in web data is extracted afterwards, and each information defines an index, all index compositions indicator lists;
S300 information on bidding) is stored:Effective information uses the table in database to store, each index extracted uses A row storage in structured database, web data, bidding information medium source, affiliated area, the industry, webpage are sent out Cloth time, web retrieval time are also stored in the row of database one respectively as index.
Fig. 2 show the particular flow sheet of the step S100 of the acquisition method of information on bidding provided by the invention a kind of, root According to one embodiment of the present of invention, the step of acquiring information on bidding is further illustrated below:
S110) using each bid net as information source, the web data of information on bidding bulletin is obtained;
S120) judge whether the address of web data had crawled, next net is obtained if having climbed out of Page data carries out next step S130 if not crawling.
S130) web crawlers is transferred to carry out information collection this web data.
Fig. 3 show the particular flow sheet of the step S200 of the acquisition method of information on bidding provided by the invention a kind of, root According to one embodiment of the present of invention, the step of extracting information on bidding is further illustrated:
S210 effective information) is extracted, the advertisement in the web data of web crawlers acquisition, friendly link are filtered out, so The effective information in web data is extracted afterwards.
S220) by effective information structuring, project name, the bid time, the time of submitting a tender, the place of the bid submission, is opened project number Mark time, opening of bid place, budget amount, procurement unit, procurement unit contact person, procurement unit contact method, procurement unit Location, agency, agency contact person, agency's contact method, agency address, buying content, attachment documents, often A information defines an index, all index compositions indicator lists.
According to one embodiment of present invention, the information of bidding is not timing publication, the frequency acquisition of web crawlers The maximum frequency of bidding information update can be higher than, the situation for collecting duplicate message is will appear in gatherer process, we will This frequency acquisition is set as once a day.
According to one embodiment of present invention, the database can be Access, sql server, mysql and Any one in oracle, this method fully take into account the problem of compatibility, adapt to frequently-used data libraries all at present.
The foregoing is merely illustrative of the preferred embodiments of the present invention, is not intended to limit the scope of the present invention.It is all Any modification, equivalent replacement, improvement and so within the spirit and principles in the present invention, are all contained in protection scope of the present invention It is interior.
It should be noted that:
Algorithm and display be not inherently related to any certain computer, virtual bench or miscellaneous equipment provided herein. Various fexible units can also be used together with teaching based on this.As described above, it constructs required by this kind of device Structure be obvious.In addition, the present invention is not also directed to any certain programmed language.It should be understood that can utilize various Programming language realizes the content of invention described herein, and the description done above to language-specific is to disclose this hair Bright preferred forms.
In the instructions provided here, numerous specific details are set forth.It is to be appreciated, however, that the implementation of the present invention Example can be put into practice without these specific details.In some instances, well known method, structure is not been shown in detail And technology, so as not to obscure the understanding of this description.
Similarly, it should be understood that in order to simplify the disclosure and help to understand one or more of each inventive aspect, Above in the description of exemplary embodiment of the present invention, each feature of the invention is grouped together into single implementation sometimes In example, figure or descriptions thereof.However, the method for the disclosure should be construed to reflect following intention:It is i.e. required to protect Shield the present invention claims the more features of feature than being expressly recited in each claim.More precisely, as following Claims reflect as, inventive aspect is all features less than single embodiment disclosed above.Therefore, Thus the claims for following specific implementation mode are expressly incorporated in the specific implementation mode, wherein each claim itself All as a separate embodiment of the present invention.
In addition, it will be appreciated by those of skill in the art that although some embodiments described herein include other embodiments In included certain features rather than other feature, but the combination of the feature of different embodiments means in of the invention Within the scope of and form different embodiments.For example, in the following claims, embodiment claimed is appointed One of meaning mode can use in any combination.
It should be noted that the present invention will be described rather than limits the invention for above-described embodiment, and ability Field technique personnel can design alternative embodiment without departing from the scope of the appended claims.

Claims (5)

1. a kind of acquisition method of information on bidding, which is characterized in that including step:
S100 information on bidding) is acquired:Using each bid net as information source, the web data of information on bidding bulletin is obtained, then by this Web data transfers to web crawlers to carry out information collection;
S200 information on bidding) is extracted:Advertisement in the web data of web crawlers acquisition, friendly link are filtered out, then taken out The effective information in web data, each information is taken to define an index, all index compositions indicator lists;
S300 information on bidding) is stored:Effective information uses the table in database to store, each index extracted uses structure Change a row storage in database, when by web data, bidding information medium source, affiliated area, the industry, Homepage Publishing Between, the web retrieval time also respectively as index, be stored in the row of database one.
2. a kind of acquisition method of information on bidding according to claim 1, which is characterized in that the step S100 acquisitions are recruited Mark information further includes the screening of web data:Information on bidding is not timing publication, and the frequency acquisition of web crawlers, which can be higher than, recruits The maximum frequency for marking information update, will appear the situation for collecting duplicate message in gatherer process;Web crawlers is to be adopted The address of the web data of collection is judged that the information with address need to only acquire once.
3. a kind of acquisition method of information on bidding according to claim 2, which is characterized in that the information of the web crawlers Frequency acquisition is once a day.
4. a kind of acquisition method of information on bidding according to claim 1, which is characterized in that the effective information includes: Project name, project number, bid time, time of submitting a tender, the place of the bid submission, opening of bid time, opening of bid place, budget amount, buying Unit, procurement unit contact person, procurement unit's contact method, procurement unit address, agency, agency contact person, generation Manage authority contact mode, agency address, buying content, attachment documents.
5. a kind of acquisition method of information on bidding according to claim 1, which is characterized in that the database can be Any one in Access, sql server, mysql and oracle.
CN201810127175.7A 2018-02-08 2018-02-08 A kind of acquisition method of information on bidding Pending CN108415968A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810127175.7A CN108415968A (en) 2018-02-08 2018-02-08 A kind of acquisition method of information on bidding

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810127175.7A CN108415968A (en) 2018-02-08 2018-02-08 A kind of acquisition method of information on bidding

Publications (1)

Publication Number Publication Date
CN108415968A true CN108415968A (en) 2018-08-17

Family

ID=63127963

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810127175.7A Pending CN108415968A (en) 2018-02-08 2018-02-08 A kind of acquisition method of information on bidding

Country Status (1)

Country Link
CN (1) CN108415968A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109615469A (en) * 2018-12-05 2019-04-12 贵阳高新数通信息有限公司 The management system and method extracted based on bidding website relevant information
CN110609939A (en) * 2019-09-11 2019-12-24 北京网聘咨询有限公司 Web-based distributed recruitment information acquisition system
CN111047268A (en) * 2018-10-11 2020-04-21 上海汽车集团股份有限公司 Bidding method and device
CN116361594A (en) * 2023-06-01 2023-06-30 北京拓普丰联信息科技股份有限公司 Mining method, device, equipment and medium for bidding information release platform

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106971341A (en) * 2017-03-09 2017-07-21 庞己人 A kind of method and system for pushing information on bidding and user's participation competitive bidding
CN107239891A (en) * 2017-05-26 2017-10-10 山东省科学院情报研究所 A kind of bid checking method based on big data
CN107341619A (en) * 2017-07-22 2017-11-10 江苏省鸿源招标代理股份有限公司 A kind of bid information acquisition system and method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106971341A (en) * 2017-03-09 2017-07-21 庞己人 A kind of method and system for pushing information on bidding and user's participation competitive bidding
CN107239891A (en) * 2017-05-26 2017-10-10 山东省科学院情报研究所 A kind of bid checking method based on big data
CN107341619A (en) * 2017-07-22 2017-11-10 江苏省鸿源招标代理股份有限公司 A kind of bid information acquisition system and method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
冯思平: "Web招标信息搜索及管理系统的设计", 《中国优秀硕士学位论文全文数据库信息科技辑》 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111047268A (en) * 2018-10-11 2020-04-21 上海汽车集团股份有限公司 Bidding method and device
CN109615469A (en) * 2018-12-05 2019-04-12 贵阳高新数通信息有限公司 The management system and method extracted based on bidding website relevant information
CN110609939A (en) * 2019-09-11 2019-12-24 北京网聘咨询有限公司 Web-based distributed recruitment information acquisition system
CN116361594A (en) * 2023-06-01 2023-06-30 北京拓普丰联信息科技股份有限公司 Mining method, device, equipment and medium for bidding information release platform
CN116361594B (en) * 2023-06-01 2023-08-25 北京拓普丰联信息科技股份有限公司 Mining method, device, equipment and medium for bidding information release platform

Similar Documents

Publication Publication Date Title
CN108415968A (en) A kind of acquisition method of information on bidding
CN107239891A (en) A kind of bid checking method based on big data
CN108415969A (en) A kind of information on bidding retrieval analysis method and system
CA2612895A1 (en) Systems and methods for providing search results
CN108427721A (en) A kind of standardized method of the information on bidding based on database and system
CN108304994A (en) A kind of source of houses method for evaluating quality on sale and server
CN108491426A (en) A kind of information on bidding supplying system
Kuruppuarachchi et al. A comparison of major environmental justice screening and mapping tools
Dudin et al. " Green" Logistics as an Instrument for Putting Together a New Model for Professional and Career-Broadening Training in Global Economic Space.
Agami The international accounting course state of the art
Hugar Impact of open access journals in DOAJ: An analysis
Crosetto et al. Assessment in a tight time frame: Using readily available data to evaluate your collection
CN108460109A (en) A kind of information on bidding analysis method based on big data
Harris Economic aspects of military expenditure in developing countries: A survey article
Nickum Elusive no longer? Increasing accessibility to the federally funded technical report literature
Marzuki et al. Progress and promise for science in Indonesia
Firoozbakht et al. Reviewing factors affecting ICT outsourcing services (Case Study of Karaj Municipality)
Dewa et al. Optimizing Public Fund to Finance Smallholder Plantations for Sustainable Palm Oil in Indonesia.
Rayan et al. Review of Religious Tourism in Kingdom of Saudi Arabia During the Covid-19 Pandemic
Budiman et al. Content Analysis on Quick Services Information System (SILAT) at the Indonesia Ministry of Marine and Fisheries
De Stefano Use-based selection for preservation microfilming
GB2454161A (en) A mechanism for improving the effectiveness of an internet search engine
Gondwe Land reform in Malawi
Kershaw et al. Micro-Computer Based Real Estate Decision Making and Information Management-An Integrated Approach
Silvestre et al. Synopsis and recommendations of the ADB/ICLARM workshop on tropical coastal fish stocks in Asia

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180817

RJ01 Rejection of invention patent application after publication