CN103049537A - Network information collection method - Google Patents

Network information collection method Download PDF

Info

Publication number
CN103049537A
CN103049537A CN2012105703659A CN201210570365A CN103049537A CN 103049537 A CN103049537 A CN 103049537A CN 2012105703659 A CN2012105703659 A CN 2012105703659A CN 201210570365 A CN201210570365 A CN 201210570365A CN 103049537 A CN103049537 A CN 103049537A
Authority
CN
China
Prior art keywords
information
inquiry
configuration file
network
querying condition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2012105703659A
Other languages
Chinese (zh)
Inventor
关班记
孙傲冰
季统凯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
G Cloud Technology Co Ltd
Original Assignee
G Cloud Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by G Cloud Technology Co Ltd filed Critical G Cloud Technology Co Ltd
Priority to CN2012105703659A priority Critical patent/CN103049537A/en
Publication of CN103049537A publication Critical patent/CN103049537A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Transfer Between Computers (AREA)

Abstract

The invention relates to the technical field of information processing, in particular to an automatic network information collection method. The method includes: querying conditional formats and setting up configuration files, using the configuration files to read defined query conditions, acquiring and filtering information of appointed websites according to the query conditions, and returning results to a querier. The automatic network information collection method is convenient for users to acquire interested data from the enormous network, saves a great deal of searching time for the users, and can be used for collection of network information.

Description

A kind of Network Information Gathering method
Technical field
The present invention relates to technical field of information processing, especially a kind of automated network formation gathering method.
Background technology
In our daily life and work, often can be interested in especially the some of them data, particularly for some particularly preferred websites, wish to pay close attention to these part data of these websites.As, there are some colleagues to miss potter in the occupation men's basketball match of Sina website and the Tengxun online browsing U.S. match message about Heat; At this moment, this colleague only has by browser and opens this network address, then at netpage search, click wherein several famous websites, find in this website about the news of U.S.'s occupation men's basketball match, find at last Heat's result of the match wherein.Similarly inquiry also has a lot, pays close attention to the server info in Central Shanxi Plain village, the price movement of paying close attention to the online a certain hardware device in Jingdone district or configuration change etc. such as some colleagues.Suchlike information all needs the user just can obtain corresponding information by network address search slowly, also can not find suitable information in some situation, wastes a large amount of time.
Summary of the invention
The technical matters that the present invention solves is to provide the automated network formation gathering method, realizes automatic collection, feedback to user interest information.
The technical scheme that the present invention solves aforementioned technical problem is:
Carry out according to following steps:
Step 1, definition querying condition form also arranges configuration file;
Step 2 reads configuration file, obtains the information subject of inquiry;
Step 3 by configuration file, is obtained the extraction document of inquiry;
Step 4, by configuration file, the network address in acquired information source;
Step 5 by network address, reads the information of this network address;
Step 6 take extraction document as filtercondition, is filtered website information; Residue meets the information of extraction document;
Step 7 sends to the information after filtering the address of inquiry's appointment.
Described querying condition formal definition is to occur with the form of key-value pair, a plurality of conditions with "; " separate.
Configuration file can read defined querying condition, according to querying condition the information of specifying network address is obtained, is filtered, and the result is returned to the inquiry.
Information exchange after the described filtration is crossed E-mail mode and is fed back to the inquiry.
The present invention is by configuration file, and the website that can arrive appointment obtains the interested information of user, finally gathers and the mode by mail sends in the mail of appointment; But this method configuration information collection period makes the timely variation of trace information of user simultaneously.The present invention has changed the information mode to a certain extent, has greatly improved work efficiency, avoids doing every day the work of repetition.
Description of drawings
The present invention is further described below in conjunction with accompanying drawing:
Fig. 1 is the process flow diagram of Network Information Gathering of the present invention;
Fig. 2 is configuration file definition structure synoptic diagram of the present invention.
Embodiment
Network information automatically collecting of the present invention can carry out as follows:
Step 1, definition querying condition form also arranges configuration file;
Step 2 reads configuration file, obtains the information subject of inquiry;
Step 3 by configuration file, is obtained the extraction document of inquiry;
Step 4, by configuration file, the network address in acquired information source;
Step 5 by network address, reads the information of this network address;
Step 6 take extraction document as filtercondition, is filtered website information; Residue meets the information of extraction document;
Step 7 is crossed the address that Email etc. sends to inquiry's appointment with the information exchange after filtering.
Described querying condition formal definition is to occur with the form of key-value pair, a plurality of conditions with "; " separate.
Aforesaid configuration file can read defined querying condition, according to querying condition the information of specifying network address is obtained, is filtered, and the result is returned to the inquiry.
Lower mask body is take the score of obtaining Rockets on Dec 19th, 2012 in two P. E Web Sites as example.
As shown in Figure 1, comprise the steps:
The 1st goes on foot, will all publicize first, and agreement automatically replies the mail format of E-mail inquiries massaging device.In this agreement, the automated network information collection apparatus only has the form agreement to querying condition, and the form that form is decided to be approximately with key-value pair occurs, a plurality of conditions with "; " separate.Concrete form is: querying condition 1=query condition value; Querying condition 2=query condition value, wherein, number of parameters can constantly be expanded.Such as, inquire about Rockets's score on Dec 19th, 2012, then this querying condition is: " team=rocket; Match date=2012-12-19 "; Its subject is: score.Concrete structure as shown in Figure 2.
The 2nd step, read configuration file F, obtaining needs the subject information S that obtains, information filtering condition C and Data Source network address As in fact;
The 3rd step, by network address As, read the data I nfo in this network address;
The 4th step, by subject S, obtain among the Info about the purpose information S_Info of this section;
The 5th step, by filtercondition C, filter the data of S_Info, finally be met the information N_Info of user's request;
The 6th step, device are by lettergram mode, the recorded information return mechanism user of inquiry.

Claims (5)

1. Network Information Gathering method is characterized in that: carry out according to following steps:
Step 1, definition querying condition form also arranges configuration file;
Step 2 reads configuration file, obtains the information subject of inquiry;
Step 3 by configuration file, is obtained the extraction document of inquiry;
Step 4, by configuration file, the network address in acquired information source;
Step 5 by network address, reads the information of this network address;
Step 6 take extraction document as filtercondition, is filtered website information; Residue meets the information of extraction document;
Step 7 sends to the information after filtering the address of inquiry's appointment.
2. Network Information Gathering method according to claim 1 is characterized in that: described querying condition formal definition occurs for the form with key-value pair, a plurality of conditions with "; " separate.
3. Network Information Gathering method according to claim 1 and 2, it is characterized in that: configuration file can read defined querying condition, according to querying condition the information of specifying network address is obtained, is filtered, and the result is returned to the inquiry.
4. Network Information Gathering method according to claim 1 and 2, it is characterized in that: the information exchange after the described filtration is crossed E-mail mode and is fed back to the inquiry.
5. Network Information Gathering method according to claim 3, it is characterized in that: the information exchange after the described filtration is crossed E-mail mode and is fed back to the inquiry.
CN2012105703659A 2012-12-25 2012-12-25 Network information collection method Pending CN103049537A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2012105703659A CN103049537A (en) 2012-12-25 2012-12-25 Network information collection method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2012105703659A CN103049537A (en) 2012-12-25 2012-12-25 Network information collection method

Publications (1)

Publication Number Publication Date
CN103049537A true CN103049537A (en) 2013-04-17

Family

ID=48062178

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2012105703659A Pending CN103049537A (en) 2012-12-25 2012-12-25 Network information collection method

Country Status (1)

Country Link
CN (1) CN103049537A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101042709A (en) * 2007-04-11 2007-09-26 芦树鹏 Active mode search
CN101751428A (en) * 2008-12-12 2010-06-23 汉王科技股份有限公司 Information search method and device
CN101957866A (en) * 2010-10-25 2011-01-26 中国农业大学 Network text information integration method and device
US20110071977A1 (en) * 2009-09-18 2011-03-24 Apple Inc. Segmented graphical representations for recommending elements
WO2011075440A2 (en) * 2009-12-18 2011-06-23 Sacred Agent, Inc. A system and method algorithmic movie generation based on audio/video synchronization

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101042709A (en) * 2007-04-11 2007-09-26 芦树鹏 Active mode search
CN101751428A (en) * 2008-12-12 2010-06-23 汉王科技股份有限公司 Information search method and device
US20110071977A1 (en) * 2009-09-18 2011-03-24 Apple Inc. Segmented graphical representations for recommending elements
WO2011075440A2 (en) * 2009-12-18 2011-06-23 Sacred Agent, Inc. A system and method algorithmic movie generation based on audio/video synchronization
CN101957866A (en) * 2010-10-25 2011-01-26 中国农业大学 Network text information integration method and device

Similar Documents

Publication Publication Date Title
CN102521321B (en) Video search method based on search term ambiguity and user preferences
CN104281622A (en) Information recommending method and information recommending device in social media
CN104077402A (en) Data processing method and data processing system
JP6713331B2 (en) Program, information processing method, and information processing apparatus
CN102930059A (en) Method for designing focused crawler
CN102708174A (en) Method and device for displaying rich media information in browser
CN102426610A (en) Microblog rank searching method and microblog searching engine
CN102314443A (en) Method for correcting search engine and system
CN104166683A (en) Data mining method
CN102811207A (en) Network information pushing method and system
CN106021609A (en) Method and device for intelligently recommending website videos
CN102346751A (en) Information transmitting method and equipment
JP4875911B2 (en) Content identification method and apparatus
CN104077293A (en) Webpage acquisition method and device
CN102760058A (en) Massive software project sharing method oriented to large-scale collaborative development
CN103327049A (en) Rich content pushing method and system based on browser address bar
CN103106234A (en) Searching method and device of webpage content
CN104537080A (en) Information recommendation method and system
CN103914465A (en) User interest graph based intelligent customization audio listening implementation system and method
CN103605742A (en) Method and device for recognizing network resource entity content page
CN103049537A (en) Network information collection method
Dooms et al. Mining cross-domain rating datasets from structured data on twitter
CN103164522A (en) Method for obtaining linkman by end-user in social software
WO2015000083A1 (en) System and method for ranking online content
KR101132431B1 (en) System and method for providing interest information

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20130417