CN105389310A - Method of applying web crawlers to household registration management - Google Patents
- Publication number
- CN105389310A (application CN201410446240.4A)
- Authority
- CN
- China
- Prior art keywords
- household registration
- information
- web crawlers
- keyword
- registration information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Abstract
The invention discloses a method of applying web crawlers to household registration management. The method comprises the following steps: step 1, inputting keywords for the household registration information; step 2, the server searching for URLs of household registration information; step 3, the server grabbing the web page contents of the household registration information; and step 4, inputting keywords again to perform a secondary filter. The method has the following beneficial effects: the web pages that the web crawler finds automatically are filtered a second time. A household registration database holds a very large amount of information, so locating a target record conventionally takes a large amount of labor and may still fail to find accurate information. By refining the retrieved household registration information, the method gives users a convenient and effective way to obtain the target household registration information.
Description
Technical field
The present invention relates to the technical field of web crawlers, and specifically to a method of applying web crawlers to household registration management.
Background art
A web crawler is a program that automatically fetches web pages. It downloads pages from the World Wide Web for a search engine and is an important component of search engines. A traditional crawler starts from the URLs of one or more initial pages, obtains the URLs found on those pages, and, while crawling, continually extracts new URLs from the current page and puts them into a queue until a stop condition of the system is met. In other words, a web crawler is a program or script that automatically captures web information according to certain rules. Other, less commonly used names include ant, automatic indexer, emulator, and worm. Web crawler technology can obtain target information from the Internet quickly and accurately. Household registration management often requires searching a huge household registration database for a single piece of information. Traditional search methods are slow, and the accuracy of the retrieved information is low. The characteristics of web crawlers address this problem well. For this reason, we propose a method of applying web crawlers to household registration management.
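The queue-driven crawl described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the `PAGES` dictionary stands in for the web so the example runs without network access, and the names `LinkExtractor`, `crawl`, and `max_pages` are hypothetical. The stop condition here is simply an empty queue or a page limit.

```python
from collections import deque
from html.parser import HTMLParser

# Hypothetical in-memory "web": URL -> HTML body (assumption, replaces real HTTP fetches).
PAGES = {
    "http://example.test/a": '<a href="http://example.test/b">b</a> '
                             '<a href="http://example.test/c">c</a>',
    "http://example.test/b": '<a href="http://example.test/c">c</a>',
    "http://example.test/c": '<a href="http://example.test/a">a</a>',
}


class LinkExtractor(HTMLParser):
    """Collects the href value of every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(seed_urls, max_pages=100):
    """Breadth-first crawl: visit the seeds, queue newly found URLs,
    and stop when the queue is empty or max_pages pages were visited."""
    queue = deque(seed_urls)
    seen = set(seed_urls)
    visited = []
    while queue and len(visited) < max_pages:
        url = queue.popleft()
        html = PAGES.get(url)
        if html is None:  # unknown URL: nothing to fetch
            continue
        visited.append(url)
        parser = LinkExtractor()
        parser.feed(html)
        for link in parser.links:
            if link not in seen:  # never queue the same URL twice
                seen.add(link)
                queue.append(link)
    return visited
```

Calling `crawl(["http://example.test/a"])` visits each of the three sample pages once, in the order they were discovered. The `seen` set is what keeps the cyclic link from page `c` back to page `a` from causing an endless loop.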
Summary of the invention
A method of applying web crawlers to household registration management, with the following concrete steps:
Step 1: input keywords for the household registration information.
Step 2: the server searches for URLs of the household registration information.
Step 3: the server captures the web page contents of the household registration information.
Step 4: input keywords for the household registration information again to perform a secondary filter.
Preferably, the keyword can be a name, age, sex, home address, and so on; either a single keyword or several search keywords can be entered.
Preferably, the initial page information of the URLs found when searching for household registration information is not displayed in any information window.
Preferably, before the keywords are entered again for the secondary filter, an instant keyword input window pops up.
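One possible reading of the keyword options above: a query is a set of field/value pairs, and a record matches when every supplied pair matches. The secondary filter then applies a second query to the first result set. The record fields and the `matches` and `filter_records` helpers below are hypothetical, and the sample records are made up; the patent does not specify a data model or a matching rule.

```python
def matches(record, keywords):
    """True when every supplied keyword equals the record's value for that field."""
    return all(
        str(record.get(field)) == str(value)
        for field, value in keywords.items()
    )


def filter_records(records, keywords):
    """Keep only the records that match all supplied keywords."""
    return [record for record in records if matches(record, keywords)]


# Illustrative records (made-up data, not from the patent).
records = [
    {"name": "Zhang Wei", "age": 34, "sex": "M", "address": "Beijing"},
    {"name": "Zhang Wei", "age": 52, "sex": "M", "address": "Shanghai"},
    {"name": "Li Na", "age": 34, "sex": "F", "address": "Beijing"},
]

# First pass with a single keyword, then a secondary filter on the result.
first_pass = filter_records(records, {"name": "Zhang Wei"})
second_pass = filter_records(first_pass, {"age": 34})

# Several keywords can also be supplied at once.
combined = filter_records(records, {"age": 34, "address": "Beijing"})
```

Exact string equality is used here only to keep the sketch short; a real system would more likely need partial or fuzzy matching for names and addresses.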
Compared with the prior art, the beneficial effects of the invention are as follows. The method filters web pages a second time on top of the web crawler's automatic search. A household registration database holds a very large amount of information, so finding a target record by conventional means takes a great deal of labor and may still fail to locate accurate household registration information. By refining the retrieved household registration information, the method provides a convenient and effective way to obtain the target household registration information.
Brief description of the drawings
Fig. 1 is a flow chart of the present invention.
Embodiment
A method of applying web crawlers to household registration management, with the following concrete steps:
Step 1: input keywords for the household registration information. The keyword can be a name, age, sex, home address, and so on; either a single keyword or several search keywords can be entered.
Step 2: the server searches for URLs of the household registration information. The initial page information of the URLs found is not displayed in any information window.
Step 3: the server captures the web page contents of the household registration information.
Step 4: input keywords for the household registration information again to perform a secondary filter. Before the keywords are entered again, an instant keyword input window pops up.
Step 5: obtain the household registration information. A quick export point is provided for the obtained household registration information.
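The five steps above can be strung together as in the sketch below. Everything here is an assumption made for illustration: `URL_INDEX` and `PAGE_CONTENT` are hypothetical in-memory stand-ins for the server's search and page capture, the page contents are stored already parsed into fields, the records are made up, and CSV is just one plausible form for the "quick export point", which the patent does not describe further.

```python
import csv
import io

# Hypothetical search index: keyword -> URLs of matching pages (step 2).
URL_INDEX = {
    "Zhang Wei": ["http://example.test/r/1", "http://example.test/r/2"],
    "Li Na": ["http://example.test/r/3"],
}

# Hypothetical page store: URL -> page content, already parsed into fields (step 3).
PAGE_CONTENT = {
    "http://example.test/r/1": {"name": "Zhang Wei", "age": "34", "address": "Beijing"},
    "http://example.test/r/2": {"name": "Zhang Wei", "age": "52", "address": "Shanghai"},
    "http://example.test/r/3": {"name": "Li Na", "age": "34", "address": "Beijing"},
}


def search_urls(keyword):
    """Step 2: look up the URLs associated with the first keyword."""
    return URL_INDEX.get(keyword, [])


def fetch_pages(urls):
    """Step 3: capture the content of each URL that can be fetched."""
    return [PAGE_CONTENT[url] for url in urls if url in PAGE_CONTENT]


def secondary_filter(rows, field, value):
    """Step 4: keep only the rows whose field equals the second keyword."""
    return [row for row in rows if row.get(field) == value]


def export_csv(rows, fields):
    """Step 5: serialize the remaining rows as CSV text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fields, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def run(first_keyword, field, second_keyword):
    """Steps 1 to 5 in order; the arguments play the role of the user's input."""
    urls = search_urls(first_keyword)
    rows = fetch_pages(urls)
    refined = secondary_filter(rows, field, second_keyword)
    return export_csv(refined, ["name", "age", "address"])
```

For example, `run("Zhang Wei", "address", "Beijing")` narrows the two pages found for the first keyword down to the single Beijing record and returns it as CSV text with a header row.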
The method of applying web crawlers to household registration management described here is only one embodiment. The above description of the disclosed embodiments enables those skilled in the art to implement or use the present invention. Various modifications to these embodiments will be apparent to those skilled in the art, and the general principles defined herein can be implemented in other embodiments without departing from the spirit or scope of the present invention. Therefore, the present invention is not limited to the embodiments shown herein, but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.
Claims (4)
1. A method of applying web crawlers to household registration management, with the following concrete steps:
Step 1: input keywords for the household registration information.
Step 2: the server searches for URLs of the household registration information.
Step 3: the server captures the web page contents of the household registration information.
Step 4: input keywords for the household registration information again to perform a secondary filter.
2. The method of applying web crawlers to household registration management according to claim 1, characterized in that: the keyword can be a name, age, sex, home address, and so on; either a single keyword or several search keywords can be entered.
3. The method of applying web crawlers to household registration management according to claim 1, characterized in that: the initial page information of the URLs found when searching for household registration information is not displayed in any information window.
4. The method of applying web crawlers to household registration management according to claim 1, characterized in that: before the keywords are entered again for the secondary filter, an instant keyword input window pops up.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410446240.4A CN105389310A (en) | 2014-09-03 | 2014-09-03 | Method of applying web crawlers to household registration management |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410446240.4A CN105389310A (en) | 2014-09-03 | 2014-09-03 | Method of applying web crawlers to household registration management |
Publications (1)
Publication Number | Publication Date |
---|---|
CN105389310A (en) | 2016-03-09 |
Family
ID=55421607
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410446240.4A Pending CN105389310A (en) | 2014-09-03 | 2014-09-03 | Method of applying web crawlers to household registration management |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105389310A (en) |
- 2014-09-03: CN application CN201410446240.4A (patent/CN105389310A/en), status: active, Pending
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107590236A (en) * | 2017-09-09 | 2018-01-16 | 杭州数立方征信有限公司 | A kind of big data acquisition method and system towards enterprise in charge of construction |
CN107590236B (en) * | 2017-09-09 | 2020-08-28 | 数立方(杭州)信息科技有限公司 | Big data acquisition method and system for building construction enterprises |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
AU2009276354B2 (en) | Providing posts to discussion threads in response to a search query | |
CN104077402B (en) | Data processing method and data handling system | |
CN102054028B (en) | Method for implementing web-rendering function by using web crawler system | |
CN102254027B (en) | Method for obtaining webpage contents in batch | |
CN104750704B (en) | A kind of webpage URL address sorts recognition methods and device | |
CN102710795B (en) | Hotspot collecting method and device | |
CN102693271A (en) | Network information recommending method and system | |
CN102930059A (en) | Method for designing focused crawler | |
CN102542061B (en) | Intelligent product classification method | |
CN102411617B (en) | Method for storing and inquiring a large quantity of URLs | |
CN103116635B (en) | Field-oriented method and system for collecting invisible web resources | |
CN103077250A (en) | Method and device for capturing webpage content | |
JP2009048380A5 (en) | ||
CN105812417B (en) | Remote server, router and bad webpage information filtering method | |
CN103279507A (en) | Webpage spider operational method and system | |
CN105302876A (en) | Regular expression based URL filtering method | |
CN103500172A (en) | Image searching system | |
CN104991904A (en) | Page data acquisition method of dynamic webpage | |
CN106302849A (en) | A kind of method carrying out moving solid fusion by carrier data | |
CN103744944A (en) | Method for re-filtering in webpage or data crawling by web crawler | |
CN104298780A (en) | Method and system for pre-obtaining browser webpage information | |
CN105468618A (en) | Web crawler thesis duplicate checking method | |
CN105677921A (en) | Method and system for acquiring Internet public opinion data | |
CN103853771B (en) | A kind of method for pushing and system of search result | |
CN104008213A (en) | Method and device for finding and counting webpage information updating |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
WD01 | Invention patent application deemed withdrawn after publication | Application publication date: 20160309 |