CN103049537A - Network information collection method - Google Patents
Network information collection method Download PDFInfo
- Publication number
- CN103049537A CN103049537A CN2012105703659A CN201210570365A CN103049537A CN 103049537 A CN103049537 A CN 103049537A CN 2012105703659 A CN2012105703659 A CN 2012105703659A CN 201210570365 A CN201210570365 A CN 201210570365A CN 103049537 A CN103049537 A CN 103049537A
- Authority
- CN
- China
- Prior art keywords
- information
- inquiry
- configuration file
- network
- querying condition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Landscapes
- Information Transfer Between Computers (AREA)
Abstract
The invention relates to the technical field of information processing, in particular to an automatic network information collection method. The method includes: querying conditional formats and setting up configuration files, using the configuration files to read defined query conditions, acquiring and filtering information of appointed websites according to the query conditions, and returning results to a querier. The automatic network information collection method is convenient for users to acquire interested data from the enormous network, saves a great deal of searching time for the users, and can be used for collection of network information.
Description
Technical field
The present invention relates to technical field of information processing, especially a kind of automated network formation gathering method.
Background technology
In our daily life and work, often can be interested in especially the some of them data, particularly for some particularly preferred websites, wish to pay close attention to these part data of these websites.As, there are some colleagues to miss potter in the occupation men's basketball match of Sina website and the Tengxun online browsing U.S. match message about Heat; At this moment, this colleague only has by browser and opens this network address, then at netpage search, click wherein several famous websites, find in this website about the news of U.S.'s occupation men's basketball match, find at last Heat's result of the match wherein.Similarly inquiry also has a lot, pays close attention to the server info in Central Shanxi Plain village, the price movement of paying close attention to the online a certain hardware device in Jingdone district or configuration change etc. such as some colleagues.Suchlike information all needs the user just can obtain corresponding information by network address search slowly, also can not find suitable information in some situation, wastes a large amount of time.
Summary of the invention
The technical matters that the present invention solves is to provide the automated network formation gathering method, realizes automatic collection, feedback to user interest information.
The technical scheme that the present invention solves aforementioned technical problem is:
Carry out according to following steps:
Step 2 reads configuration file, obtains the information subject of inquiry;
Step 3 by configuration file, is obtained the extraction document of inquiry;
Step 4, by configuration file, the network address in acquired information source;
Step 5 by network address, reads the information of this network address;
Step 6 take extraction document as filtercondition, is filtered website information; Residue meets the information of extraction document;
Step 7 sends to the information after filtering the address of inquiry's appointment.
Described querying condition formal definition is to occur with the form of key-value pair, a plurality of conditions with "; " separate.
Configuration file can read defined querying condition, according to querying condition the information of specifying network address is obtained, is filtered, and the result is returned to the inquiry.
Information exchange after the described filtration is crossed E-mail mode and is fed back to the inquiry.
The present invention is by configuration file, and the website that can arrive appointment obtains the interested information of user, finally gathers and the mode by mail sends in the mail of appointment; But this method configuration information collection period makes the timely variation of trace information of user simultaneously.The present invention has changed the information mode to a certain extent, has greatly improved work efficiency, avoids doing every day the work of repetition.
Description of drawings
The present invention is further described below in conjunction with accompanying drawing:
Fig. 1 is the process flow diagram of Network Information Gathering of the present invention;
Fig. 2 is configuration file definition structure synoptic diagram of the present invention.
Embodiment
Network information automatically collecting of the present invention can carry out as follows:
Step 2 reads configuration file, obtains the information subject of inquiry;
Step 3 by configuration file, is obtained the extraction document of inquiry;
Step 4, by configuration file, the network address in acquired information source;
Step 5 by network address, reads the information of this network address;
Step 6 take extraction document as filtercondition, is filtered website information; Residue meets the information of extraction document;
Step 7 is crossed the address that Email etc. sends to inquiry's appointment with the information exchange after filtering.
Described querying condition formal definition is to occur with the form of key-value pair, a plurality of conditions with "; " separate.
Aforesaid configuration file can read defined querying condition, according to querying condition the information of specifying network address is obtained, is filtered, and the result is returned to the inquiry.
Lower mask body is take the score of obtaining Rockets on Dec 19th, 2012 in two P. E Web Sites as example.
As shown in Figure 1, comprise the steps:
The 1st goes on foot, will all publicize first, and agreement automatically replies the mail format of E-mail inquiries massaging device.In this agreement, the automated network information collection apparatus only has the form agreement to querying condition, and the form that form is decided to be approximately with key-value pair occurs, a plurality of conditions with "; " separate.Concrete form is: querying condition 1=query condition value; Querying condition 2=query condition value, wherein, number of parameters can constantly be expanded.Such as, inquire about Rockets's score on Dec 19th, 2012, then this querying condition is: " team=rocket; Match date=2012-12-19 "; Its subject is: score.Concrete structure as shown in Figure 2.
The 2nd step, read configuration file F, obtaining needs the subject information S that obtains, information filtering condition C and Data Source network address As in fact;
The 3rd step, by network address As, read the data I nfo in this network address;
The 4th step, by subject S, obtain among the Info about the purpose information S_Info of this section;
The 5th step, by filtercondition C, filter the data of S_Info, finally be met the information N_Info of user's request;
The 6th step, device are by lettergram mode, the recorded information return mechanism user of inquiry.
Claims (5)
1. Network Information Gathering method is characterized in that: carry out according to following steps:
Step 1, definition querying condition form also arranges configuration file;
Step 2 reads configuration file, obtains the information subject of inquiry;
Step 3 by configuration file, is obtained the extraction document of inquiry;
Step 4, by configuration file, the network address in acquired information source;
Step 5 by network address, reads the information of this network address;
Step 6 take extraction document as filtercondition, is filtered website information; Residue meets the information of extraction document;
Step 7 sends to the information after filtering the address of inquiry's appointment.
2. Network Information Gathering method according to claim 1 is characterized in that: described querying condition formal definition occurs for the form with key-value pair, a plurality of conditions with "; " separate.
3. Network Information Gathering method according to claim 1 and 2, it is characterized in that: configuration file can read defined querying condition, according to querying condition the information of specifying network address is obtained, is filtered, and the result is returned to the inquiry.
4. Network Information Gathering method according to claim 1 and 2, it is characterized in that: the information exchange after the described filtration is crossed E-mail mode and is fed back to the inquiry.
5. Network Information Gathering method according to claim 3, it is characterized in that: the information exchange after the described filtration is crossed E-mail mode and is fed back to the inquiry.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2012105703659A CN103049537A (en) | 2012-12-25 | 2012-12-25 | Network information collection method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2012105703659A CN103049537A (en) | 2012-12-25 | 2012-12-25 | Network information collection method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN103049537A true CN103049537A (en) | 2013-04-17 |
Family
ID=48062178
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2012105703659A Pending CN103049537A (en) | 2012-12-25 | 2012-12-25 | Network information collection method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103049537A (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101042709A (en) * | 2007-04-11 | 2007-09-26 | 芦树鹏 | Active mode search |
CN101751428A (en) * | 2008-12-12 | 2010-06-23 | 汉王科技股份有限公司 | Information search method and device |
CN101957866A (en) * | 2010-10-25 | 2011-01-26 | 中国农业大学 | Network text information integration method and device |
US20110071977A1 (en) * | 2009-09-18 | 2011-03-24 | Apple Inc. | Segmented graphical representations for recommending elements |
WO2011075440A2 (en) * | 2009-12-18 | 2011-06-23 | Sacred Agent, Inc. | A system and method algorithmic movie generation based on audio/video synchronization |
-
2012
- 2012-12-25 CN CN2012105703659A patent/CN103049537A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101042709A (en) * | 2007-04-11 | 2007-09-26 | 芦树鹏 | Active mode search |
CN101751428A (en) * | 2008-12-12 | 2010-06-23 | 汉王科技股份有限公司 | Information search method and device |
US20110071977A1 (en) * | 2009-09-18 | 2011-03-24 | Apple Inc. | Segmented graphical representations for recommending elements |
WO2011075440A2 (en) * | 2009-12-18 | 2011-06-23 | Sacred Agent, Inc. | A system and method algorithmic movie generation based on audio/video synchronization |
CN101957866A (en) * | 2010-10-25 | 2011-01-26 | 中国农业大学 | Network text information integration method and device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102521321B (en) | Video search method based on search term ambiguity and user preferences | |
CN104281622A (en) | Information recommending method and information recommending device in social media | |
CN104077402A (en) | Data processing method and data processing system | |
JP6713331B2 (en) | Program, information processing method, and information processing apparatus | |
CN102930059A (en) | Method for designing focused crawler | |
CN102708174A (en) | Method and device for displaying rich media information in browser | |
CN102426610A (en) | Microblog rank searching method and microblog searching engine | |
CN102314443A (en) | Method for correcting search engine and system | |
CN104166683A (en) | Data mining method | |
CN102811207A (en) | Network information pushing method and system | |
CN106021609A (en) | Method and device for intelligently recommending website videos | |
CN102346751A (en) | Information transmitting method and equipment | |
JP4875911B2 (en) | Content identification method and apparatus | |
CN104077293A (en) | Webpage acquisition method and device | |
CN102760058A (en) | Massive software project sharing method oriented to large-scale collaborative development | |
CN103327049A (en) | Rich content pushing method and system based on browser address bar | |
CN103106234A (en) | Searching method and device of webpage content | |
CN104537080A (en) | Information recommendation method and system | |
CN103914465A (en) | User interest graph based intelligent customization audio listening implementation system and method | |
CN103605742A (en) | Method and device for recognizing network resource entity content page | |
CN103049537A (en) | Network information collection method | |
Dooms et al. | Mining cross-domain rating datasets from structured data on twitter | |
CN103164522A (en) | Method for obtaining linkman by end-user in social software | |
WO2015000083A1 (en) | System and method for ranking online content | |
KR101132431B1 (en) | System and method for providing interest information |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C12 | Rejection of a patent application after its publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20130417 |