CN108415968A - A kind of acquisition method of information on bidding - Google Patents
A kind of acquisition method of information on bidding Download PDFInfo
- Publication number
- CN108415968A CN108415968A CN201810127175.7A CN201810127175A CN108415968A CN 108415968 A CN108415968 A CN 108415968A CN 201810127175 A CN201810127175 A CN 201810127175A CN 108415968 A CN108415968 A CN 108415968A
- Authority
- CN
- China
- Prior art keywords
- information
- bidding
- web
- web data
- index
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/06—Buying, selling or leasing transactions
- G06Q30/08—Auctions
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Finance (AREA)
- Accounting & Taxation (AREA)
- Economics (AREA)
- Marketing (AREA)
- Strategic Management (AREA)
- Development Economics (AREA)
- General Business, Economics & Management (AREA)
- Entrepreneurship & Innovation (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The present invention provides a kind of acquisition methods of information on bidding, including step:S100 information on bidding) is acquired:Using each bid net as information source, the web data of information on bidding bulletin is obtained, then transfers to web crawlers to carry out information collection this web data;S200 information on bidding) is extracted:Advertisement in the web data of web crawlers acquisition, friendly link are filtered out, the effective information in web data is then extracted, each information defines an index, all index compositions indicator lists;S300 information on bidding) is stored:Effective information uses the table in database to store, each index extracted uses a row storage in structured database, using web data, bidding information medium source, affiliated area, the industry, Homepage Publishing time, web retrieval time also as index, it is stored in the row of database one.
Description
Technical field
The present invention relates to bidding field, more particularly to a kind of acquisition method of information on bidding.
Background technology
Bid is a kind of commonly used in the world, organized market trading activity with bid, is engineering, cargo or clothes
The dealing of business trade.Typically purchaser offers and requires in advance, and numerous trading objects is invited to participate in competition simultaneously
Conclusion of the business person is therefrom selected according to the program of regulation.Bidding activity to break trade monopoly and barriers between the regions, increase economic efficiency,
Ensure that project quality, prevention and reduction corruption etc. have played important function, has become the weight for promoting modern market system construction
Want means.
Information-based development brings the new situation in bidding field, and originally bidder mainly obtains item by periodicals and magazines
The mode of mesh bidding information is transformed to obtains the information for being suitble to oneself to submit a tender by internet site.One kind of bidder
Way is to log in each bidding website of various regions to obtain information, and then being retrieved and being investigated one by one by artificial mode needs
The information wanted.Another more efficient way is to log in some large-scale bidding information sites, passes through full-text search
The bidding information that mode removal search needs.
However, this mode takes time and effort, while the included search of bidding website cannot guarantee that quality, this is resulted in
Mistake misses important information.And by logging in large-scale bidding information site, it is gone by way of full-text search
Search for the bidding information needed, it is matched of low quality also due to use fuzzy matching algorithm, caused by the nothing that searches out
It imitates data and is more than valid data, more frighteningly miss more valuable informations.
Invention content
To solve the above-mentioned problems, the present invention provides a kind of acquisition methods of information on bidding, including step:
S100 information on bidding) is acquired:Using each bid net as information source, the web data of information on bidding bulletin is obtained, then
Web crawlers is transferred to carry out information collection this web data;
S200 information on bidding) is extracted:Advertisement in the web data of web crawlers acquisition, friendly link are filtered out, so
The effective information in web data is extracted afterwards, and each information defines an index, all index compositions indicator lists;
S300 information on bidding) is stored:Effective information uses the table in database to store, each index extracted uses
A row storage in structured database, web data, bidding information medium source, affiliated area, the industry, webpage are sent out
Cloth time, web retrieval time are also stored in the row of database one respectively as index.
Preferably, the step S100 acquisitions information on bidding further includes the screening of web data:Information on bidding is not timing
Publication, the frequency acquisition of web crawlers can be higher than the newer maximum frequency of information on bidding, and acquisition is will appear in gatherer process
To the situation of duplicate message;Web crawlers judges the address for the web data to be acquired the information with address only needs
Acquisition is primary.
Preferably, the information collection frequency of the web crawlers is once a day.
Preferably, the effective information includes:Project name, project number, bid the time, submit a tender the time, the place of the bid submission,
Open bid time, opening of bid place, budget amount, procurement unit, procurement unit contact person, procurement unit's contact method, procurement unit
Address, agency, agency contact person, agency's contact method, agency address, buying content, attachment documents.
Preferably, the database can be any one in Access, sql server, mysql and oracle.
Beneficial effects of the present invention are:The present invention provides a kind of acquisition methods of information on bidding, select all kinds of bid nets
It is information source to stand, and is acquired to webpage information using web crawlers, and extract effective information and stored, and bid letter is improved
The acquisition quality and efficiency of breath.
Description of the drawings
In order to more clearly explain the embodiment of the invention or the technical proposal in the existing technology, to embodiment or will show below
There is attached drawing needed in technology description to be briefly described, it should be apparent that, the accompanying drawings in the following description is only this
Some embodiments of invention without having to pay creative labor, may be used also for those of ordinary skill in the art
To obtain other attached drawings according to these attached drawings.
Fig. 1 show a kind of flow chart of the acquisition method of information on bidding provided by the invention;
Fig. 2 show the particular flow sheet of the step S100 of the acquisition method of information on bidding provided by the invention a kind of;
Fig. 3 show the particular flow sheet of the step S200 of the acquisition method of information on bidding provided by the invention a kind of.
Specific implementation mode
The technique effect of the design of the present invention, concrete structure and generation is carried out below with reference to embodiment and attached drawing clear
Chu, complete description, to be completely understood by the purpose of the present invention, scheme and effect.It should be noted that the case where not conflicting
Under, the features in the embodiments and the embodiments of the present application can be combined with each other.The identical attached drawing mark used everywhere in attached drawing
Note indicates same or analogous part.
Fig. 1 show a kind of flow chart of the acquisition method of information on bidding provided by the invention.One according to the present invention
Embodiment, a kind of acquisition method of information on bidding, including step:
S100 information on bidding) is acquired:Using each bid net as information source, the web data of information on bidding bulletin is obtained, then
Web crawlers is transferred to carry out information collection this web data;
S200 information on bidding) is extracted:Advertisement in the web data of web crawlers acquisition, friendly link are filtered out, so
The effective information in web data is extracted afterwards, and each information defines an index, all index compositions indicator lists;
S300 information on bidding) is stored:Effective information uses the table in database to store, each index extracted uses
A row storage in structured database, web data, bidding information medium source, affiliated area, the industry, webpage are sent out
Cloth time, web retrieval time are also stored in the row of database one respectively as index.
Fig. 2 show the particular flow sheet of the step S100 of the acquisition method of information on bidding provided by the invention a kind of, root
According to one embodiment of the present of invention, the step of acquiring information on bidding is further illustrated below:
S110) using each bid net as information source, the web data of information on bidding bulletin is obtained;
S120) judge whether the address of web data had crawled, next net is obtained if having climbed out of
Page data carries out next step S130 if not crawling.
S130) web crawlers is transferred to carry out information collection this web data.
Fig. 3 show the particular flow sheet of the step S200 of the acquisition method of information on bidding provided by the invention a kind of, root
According to one embodiment of the present of invention, the step of extracting information on bidding is further illustrated:
S210 effective information) is extracted, the advertisement in the web data of web crawlers acquisition, friendly link are filtered out, so
The effective information in web data is extracted afterwards.
S220) by effective information structuring, project name, the bid time, the time of submitting a tender, the place of the bid submission, is opened project number
Mark time, opening of bid place, budget amount, procurement unit, procurement unit contact person, procurement unit contact method, procurement unit
Location, agency, agency contact person, agency's contact method, agency address, buying content, attachment documents, often
A information defines an index, all index compositions indicator lists.
According to one embodiment of present invention, the information of bidding is not timing publication, the frequency acquisition of web crawlers
The maximum frequency of bidding information update can be higher than, the situation for collecting duplicate message is will appear in gatherer process, we will
This frequency acquisition is set as once a day.
According to one embodiment of present invention, the database can be Access, sql server, mysql and
Any one in oracle, this method fully take into account the problem of compatibility, adapt to frequently-used data libraries all at present.
The foregoing is merely illustrative of the preferred embodiments of the present invention, is not intended to limit the scope of the present invention.It is all
Any modification, equivalent replacement, improvement and so within the spirit and principles in the present invention, are all contained in protection scope of the present invention
It is interior.
It should be noted that:
Algorithm and display be not inherently related to any certain computer, virtual bench or miscellaneous equipment provided herein.
Various fexible units can also be used together with teaching based on this.As described above, it constructs required by this kind of device
Structure be obvious.In addition, the present invention is not also directed to any certain programmed language.It should be understood that can utilize various
Programming language realizes the content of invention described herein, and the description done above to language-specific is to disclose this hair
Bright preferred forms.
In the instructions provided here, numerous specific details are set forth.It is to be appreciated, however, that the implementation of the present invention
Example can be put into practice without these specific details.In some instances, well known method, structure is not been shown in detail
And technology, so as not to obscure the understanding of this description.
Similarly, it should be understood that in order to simplify the disclosure and help to understand one or more of each inventive aspect,
Above in the description of exemplary embodiment of the present invention, each feature of the invention is grouped together into single implementation sometimes
In example, figure or descriptions thereof.However, the method for the disclosure should be construed to reflect following intention:It is i.e. required to protect
Shield the present invention claims the more features of feature than being expressly recited in each claim.More precisely, as following
Claims reflect as, inventive aspect is all features less than single embodiment disclosed above.Therefore,
Thus the claims for following specific implementation mode are expressly incorporated in the specific implementation mode, wherein each claim itself
All as a separate embodiment of the present invention.
In addition, it will be appreciated by those of skill in the art that although some embodiments described herein include other embodiments
In included certain features rather than other feature, but the combination of the feature of different embodiments means in of the invention
Within the scope of and form different embodiments.For example, in the following claims, embodiment claimed is appointed
One of meaning mode can use in any combination.
It should be noted that the present invention will be described rather than limits the invention for above-described embodiment, and ability
Field technique personnel can design alternative embodiment without departing from the scope of the appended claims.
Claims (5)
1. a kind of acquisition method of information on bidding, which is characterized in that including step:
S100 information on bidding) is acquired:Using each bid net as information source, the web data of information on bidding bulletin is obtained, then by this
Web data transfers to web crawlers to carry out information collection;
S200 information on bidding) is extracted:Advertisement in the web data of web crawlers acquisition, friendly link are filtered out, then taken out
The effective information in web data, each information is taken to define an index, all index compositions indicator lists;
S300 information on bidding) is stored:Effective information uses the table in database to store, each index extracted uses structure
Change a row storage in database, when by web data, bidding information medium source, affiliated area, the industry, Homepage Publishing
Between, the web retrieval time also respectively as index, be stored in the row of database one.
2. a kind of acquisition method of information on bidding according to claim 1, which is characterized in that the step S100 acquisitions are recruited
Mark information further includes the screening of web data:Information on bidding is not timing publication, and the frequency acquisition of web crawlers, which can be higher than, recruits
The maximum frequency for marking information update, will appear the situation for collecting duplicate message in gatherer process;Web crawlers is to be adopted
The address of the web data of collection is judged that the information with address need to only acquire once.
3. a kind of acquisition method of information on bidding according to claim 2, which is characterized in that the information of the web crawlers
Frequency acquisition is once a day.
4. a kind of acquisition method of information on bidding according to claim 1, which is characterized in that the effective information includes:
Project name, project number, bid time, time of submitting a tender, the place of the bid submission, opening of bid time, opening of bid place, budget amount, buying
Unit, procurement unit contact person, procurement unit's contact method, procurement unit address, agency, agency contact person, generation
Manage authority contact mode, agency address, buying content, attachment documents.
5. a kind of acquisition method of information on bidding according to claim 1, which is characterized in that the database can be
Any one in Access, sql server, mysql and oracle.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810127175.7A CN108415968A (en) | 2018-02-08 | 2018-02-08 | A kind of acquisition method of information on bidding |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810127175.7A CN108415968A (en) | 2018-02-08 | 2018-02-08 | A kind of acquisition method of information on bidding |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108415968A true CN108415968A (en) | 2018-08-17 |
Family
ID=63127963
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810127175.7A Pending CN108415968A (en) | 2018-02-08 | 2018-02-08 | A kind of acquisition method of information on bidding |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108415968A (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109615469A (en) * | 2018-12-05 | 2019-04-12 | 贵阳高新数通信息有限公司 | The management system and method extracted based on bidding website relevant information |
CN110609939A (en) * | 2019-09-11 | 2019-12-24 | 北京网聘咨询有限公司 | Web-based distributed recruitment information acquisition system |
CN111047268A (en) * | 2018-10-11 | 2020-04-21 | 上海汽车集团股份有限公司 | Bidding method and device |
CN116361594A (en) * | 2023-06-01 | 2023-06-30 | 北京拓普丰联信息科技股份有限公司 | Mining method, device, equipment and medium for bidding information release platform |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106971341A (en) * | 2017-03-09 | 2017-07-21 | 庞己人 | A kind of method and system for pushing information on bidding and user's participation competitive bidding |
CN107239891A (en) * | 2017-05-26 | 2017-10-10 | 山东省科学院情报研究所 | A kind of bid checking method based on big data |
CN107341619A (en) * | 2017-07-22 | 2017-11-10 | 江苏省鸿源招标代理股份有限公司 | A kind of bid information acquisition system and method |
-
2018
- 2018-02-08 CN CN201810127175.7A patent/CN108415968A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106971341A (en) * | 2017-03-09 | 2017-07-21 | 庞己人 | A kind of method and system for pushing information on bidding and user's participation competitive bidding |
CN107239891A (en) * | 2017-05-26 | 2017-10-10 | 山东省科学院情报研究所 | A kind of bid checking method based on big data |
CN107341619A (en) * | 2017-07-22 | 2017-11-10 | 江苏省鸿源招标代理股份有限公司 | A kind of bid information acquisition system and method |
Non-Patent Citations (1)
Title |
---|
冯思平: "Web招标信息搜索及管理系统的设计", 《中国优秀硕士学位论文全文数据库信息科技辑》 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111047268A (en) * | 2018-10-11 | 2020-04-21 | 上海汽车集团股份有限公司 | Bidding method and device |
CN109615469A (en) * | 2018-12-05 | 2019-04-12 | 贵阳高新数通信息有限公司 | The management system and method extracted based on bidding website relevant information |
CN110609939A (en) * | 2019-09-11 | 2019-12-24 | 北京网聘咨询有限公司 | Web-based distributed recruitment information acquisition system |
CN116361594A (en) * | 2023-06-01 | 2023-06-30 | 北京拓普丰联信息科技股份有限公司 | Mining method, device, equipment and medium for bidding information release platform |
CN116361594B (en) * | 2023-06-01 | 2023-08-25 | 北京拓普丰联信息科技股份有限公司 | Mining method, device, equipment and medium for bidding information release platform |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108415968A (en) | A kind of acquisition method of information on bidding | |
CN107239891A (en) | A kind of bid checking method based on big data | |
CN108415969A (en) | A kind of information on bidding retrieval analysis method and system | |
CA2612895A1 (en) | Systems and methods for providing search results | |
CN108427721A (en) | A kind of standardized method of the information on bidding based on database and system | |
CN108304994A (en) | A kind of source of houses method for evaluating quality on sale and server | |
CN108491426A (en) | A kind of information on bidding supplying system | |
Kuruppuarachchi et al. | A comparison of major environmental justice screening and mapping tools | |
Dudin et al. | " Green" Logistics as an Instrument for Putting Together a New Model for Professional and Career-Broadening Training in Global Economic Space. | |
Agami | The international accounting course state of the art | |
Hugar | Impact of open access journals in DOAJ: An analysis | |
Crosetto et al. | Assessment in a tight time frame: Using readily available data to evaluate your collection | |
CN108460109A (en) | A kind of information on bidding analysis method based on big data | |
Harris | Economic aspects of military expenditure in developing countries: A survey article | |
Nickum | Elusive no longer? Increasing accessibility to the federally funded technical report literature | |
Marzuki et al. | Progress and promise for science in Indonesia | |
Firoozbakht et al. | Reviewing factors affecting ICT outsourcing services (Case Study of Karaj Municipality) | |
Dewa et al. | Optimizing Public Fund to Finance Smallholder Plantations for Sustainable Palm Oil in Indonesia. | |
Rayan et al. | Review of Religious Tourism in Kingdom of Saudi Arabia During the Covid-19 Pandemic | |
Budiman et al. | Content Analysis on Quick Services Information System (SILAT) at the Indonesia Ministry of Marine and Fisheries | |
De Stefano | Use-based selection for preservation microfilming | |
GB2454161A (en) | A mechanism for improving the effectiveness of an internet search engine | |
Gondwe | Land reform in Malawi | |
Kershaw et al. | Micro-Computer Based Real Estate Decision Making and Information Management-An Integrated Approach | |
Silvestre et al. | Synopsis and recommendations of the ADB/ICLARM workshop on tropical coastal fish stocks in Asia |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20180817 |
|
RJ01 | Rejection of invention patent application after publication |