CN104866532B - A kind of method and apparatus for the data search under semiclosed data environment - Google Patents

A kind of method and apparatus for the data search under semiclosed data environment Download PDF

Info

Publication number
CN104866532B
CN104866532B CN201510205529.1A CN201510205529A CN104866532B CN 104866532 B CN104866532 B CN 104866532B CN 201510205529 A CN201510205529 A CN 201510205529A CN 104866532 B CN104866532 B CN 104866532B
Authority
CN
China
Prior art keywords
data
semiclosed
environment
address
access
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510205529.1A
Other languages
Chinese (zh)
Other versions
CN104866532A (en
Inventor
张士益
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qian Xiansheng (beijing) Network Technology Co Ltd
Original Assignee
Qian Xiansheng (beijing) Network Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qian Xiansheng (beijing) Network Technology Co Ltd filed Critical Qian Xiansheng (beijing) Network Technology Co Ltd
Priority to CN201510205529.1A priority Critical patent/CN104866532B/en
Publication of CN104866532A publication Critical patent/CN104866532A/en
Application granted granted Critical
Publication of CN104866532B publication Critical patent/CN104866532B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9537Spatial or temporal dependent retrieval, e.g. spatiotemporal queries

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The present invention relates to a kind of method and apparatus for the data search under semiclosed data environment, comprising: one client browser of building uses the network access address of semiclosed data environment described in default log-on message sign-on access by the browser;The client browser accesses to the preset web address in the semiclosed data environment after logining successfully, and obtains the data file of the correspondence webpage returned from the server of the semiclosed data environment;According to preset data positional information corresponding with the webpage, the data of corresponding position are extracted from the data file.The present invention, which may not need, to be established particular interface the automatic search of double of closed data environment can be realized, it wherein required data will accurately extract, to improve data search efficiency, expand data search range, while also improving the accuracy of data search result.

Description

A kind of method and apparatus for the data search under semiclosed data environment
Technical field
The present invention relates to field of data search more particularly to a kind of sides for the data search under semiclosed data environment Method and device.
Background technique
Search engine is information to be collected from internet with specific computer program, right according to certain strategy After information carries out tissue and processing, retrieval service is provided for user, by the relative information displaying of user search to user.
In the prior art, the course of work of search engine is to utilize " spider " system (or crawler technology), automatic to access Webpage in internet reads the word content in the webpage, and finds other chained addresses for including in the webpage, and edge The chained address access crawl into other webpages, " spider " system constantly repeats this crawling process, and handle in internet All web datas creeped are collected.
Existing " spider " system can choose a station address from initial URL library first, usually those large-scale doors Family website, from these initial network address, spider can access and download the storage of corresponding web page contents into database, and by its In Word Input come out segmented after be stored in index database in, meanwhile, spider system extracts again to be existed in the web page contents Other website links, then repeat the above process.Since there is upper and lower levels between each webpage in a website Linking relationship, and be also likely to be present other websites address link, therefore, using spider system, not only can quickly by Whole webpages of one website all access one time, but also can use the link of those other station addresses and crawl into new net It stands, and obtains the web page contents of new website.
But as can be seen from the above description, web data collected by " spider " system that existing search engine utilizes is equal For the web data of publicity, the data under semiclosed data environment can not be collected.For example, for some The semiclosed data environments such as forum website, microblogging website, the personal internet banking system of member system, in particular for authorization or verification machine The data environment of system can not receive to be similar to outside access as " spider " system, in fact, the address of these websites Link also seldom has an opportunity to appear on the webpage of publicity, even if having, after being obtained and being accessed by spider system, due to not having Access authority, the result returned is also that can not open webpage, can not carry out subsequent creep to obtain data.
However, the data under these semiclosed data environments are not complete private data, but existing search is drawn Holding up automatically can not get these data due to technical for the general public user.Even there is access authority User also automatically can not accurately obtain interested data.In fact, if existing search engine will obtain these partly Data under closed data environment, it is necessary to just can be by individually establishing specific data open interface with these data environments Row access obtains.This is very uneconomic, and if other side, which disagrees, establishes open interface, existing search engine It just can not effectively get these data.
Summary of the invention
In view of the above-mentioned problems, the main purpose of the present invention is to provide a kind of data under semiclosed data environment to search The method and apparatus of rope, to solve the useful data that search engine of the existing technology cannot be searched in semiclosed data environment The problem of.
In order to solve the above-mentioned technical problem, the purpose of the present invention is be achieved through the following technical solutions:
The present invention provides a kind of data search methods under semiclosed data environment, comprising the following steps: building One client browser uses the net of semiclosed data environment described in default log-on message sign-on access by the browser Network access address;The client browser after logining successfully to the preset web address in the semiclosed data environment into Row access, and obtain the data file of the correspondence webpage returned from the server of the semiclosed data environment;According to preset Data positional information corresponding with the webpage, extracts the data of corresponding position from the data file.
Wherein, described according to preset data positional information corresponding with the webpage, it will from the data file The step of data of corresponding position extract, comprising: to the web data text obtained by client browser access Part is analyzed, according to preset file tag information corresponding with the data to be obtained in the webpage, in the number According to the position for positioning the label to match in file;It, will be with the label position from the data file according to the label position Corresponding data are set to extract.
Wherein, described will data corresponding with label position the step of extracting, comprising: operation and the label position Corresponding script data is set, and operation result data are extracted.
Wherein, the method for the invention further comprises: one database of building records the net of the semiclosed data environment Network access address, the default log-on message that the data environment can be logged in, the web page address for needing to access in the data environment, with And file tag information corresponding with the data to be obtained in the webpage.
Wherein, the client browser further comprises after logining successfully: obtaining by the semiclosed data environment Server distribution session token, and carry the session token to the preset web address in the semiclosed data environment into Row access.
The present invention also provides a kind of data serching devices under semiclosed data environment, comprising:
Login module, by the browser, is logged in using default log-on message and is visited for constructing a client browser Ask the server of the semiclosed data environment;
Access modules, for the client browser to default in the semiclosed data environment after logining successfully Web page address accesses, and obtains the data file of the correspondence webpage returned from the server;
Extraction module is used for according to preset data positional information corresponding with the webpage, from the data file It is middle to extract the data of corresponding position.
Wherein, the extraction module includes: positioning unit to the webpage number obtained by client browser access It is analyzed according to file, according to preset file tag information corresponding with the data to be obtained in the webpage, in institute State the position that the label to match is positioned in data file;Extraction unit is according to the label position, from the data file Data corresponding with the label position are extracted.
Wherein, the extraction unit, for running corresponding with label position script data, and by operation result number According to extracting.
Wherein, described device further comprises a database, and the network for recording the semiclosed data environment accesses Address, the default log-on message that the data environment can be logged in, the web page address for needing to access in the data environment, and with The corresponding file tag information of the data to be obtained in the webpage.
Wherein, the access modules, for obtaining by the session token of the server distribution of the semiclosed data environment, And it carries the session token and accesses to the preset web address in the semiclosed data environment.
Using the embodiment of the present invention, it may not need and establish particular interface the automatic of double of closed data environment can be realized Search wherein required data will be extracted accurately, to improve data search efficiency, expand data search model It encloses, while also improving the accuracy of data search result.
Detailed description of the invention
The drawings described herein are used to provide a further understanding of the present invention, constitutes part of this application, this hair Bright illustrative embodiments and their description are used to explain the present invention, and are not constituted improper limitations of the present invention.In the accompanying drawings:
Fig. 1 is the flow chart for the data search method under semiclosed data environment of the embodiment of the present invention;
Fig. 2 is the module map for the data serching device under semiclosed data environment of the embodiment of the present invention.
Specific embodiment
Main thought of the invention is, constructs a client browser, by the browser, is believed using default login Cease the network access address of semiclosed data environment described in sign-on access;The client browser is after logining successfully to described Preset web address in semiclosed data environment accesses, and obtains and return from the server of the semiclosed data environment Correspondence webpage data file;According to preset data positional information corresponding with the webpage, from the data file It is middle to extract the data of corresponding position.
To make the object, technical solutions and advantages of the present invention clearer, below in conjunction with drawings and the specific embodiments, to this Invention is described in further detail.
According to an embodiment of the invention, providing a kind of data search method under semiclosed data environment.
It is the process for the data search method under semiclosed data environment of the embodiment of the present invention with reference to Fig. 1, Fig. 1 Figure.
At step S102, a client browser is constructed, by the browser, is logged in and is visited using default log-on message Ask the network access address of the semiclosed data environment.
The embodiment of the present invention realizes that the mode of data search and existing crawler technology are entirely different.Existing crawler skill Art does not use browser access mode, but is interacted using command request mode with Website server, this is for open data Data search under environment is possible, but data higher for semienclosed data environment, especially security requirement The access of environment, due to some property parameters be in this access mode of command request it is sightless, if still adopted It can not just be accessed with command request mode.
The embodiment of the present invention is using browser access mode, then available existing by one client browser of building The property parameters for having crawler technology that can not obtain, such as session token (Session ID) parameter etc..User can be by this Double of closed data environment of client browser is browsed, but if not having the visit of the server of the semiclosed data environment It asks permission, will result in the case where can not logging in, therefore, the login that setting is directed to the semiclosed data environment can be collected in advance Information, with gain access.
For example, for semiclosed data environments such as microblogging, forums, it can be by way of username and password registered in advance Log-on message is obtained ahead of time;For the semiclosed data environment such as social network sites, the side of name registered in advance and password can be passed through Formula obtains log-on message;It, can be by way of bank's card number registered in advance and password for the semiclosed data environment such as Internetbank Obtain log-on message.
In fact, can also further be carried out to the corresponding semiclosed data environment after log-on message is obtained ahead of time Analysis, is informed in the web page address in the data environment needed to access, and opposite with the data to be obtained in the webpage The information such as the file label answered.Thus, it is possible to be visited by one database of building to record the network of the semiclosed data environment Ask address, the default log-on message that the data environment can be logged in, the web page address for needing to access in the data environment, Yi Jiyu The corresponding file tag information of the data to be obtained in the webpage.Certainly, other than analyzing know in advance, half is being logged in After closed data environment, by its each page is accessed and is analyzed automatically can also know corresponding web page address and The information such as file label.But for search efficiency and accuracy angle, it is clear that execute the effect of access more according to presupposed information It is good.
At step S104, the client browser is after logining successfully to default in the semiclosed data environment Web page address accesses, and obtains the data file of the correspondence webpage returned from the server of the semiclosed data environment.
There are corresponding network access address for the semiclosed data environment, are based on the network access address, client After end browser logs in the server of semiclosed data environment using preset log-on message, so that it may be carried out to its each page Access.
In order to improve access efficiency and accuracy, the embodiment of the present invention executes visit using pre-set web page address It asks.Such as after logging in some social network sites, can directly controlling browser access, there are the pages of data of interest;Again for example After logging in some Internetbank, can directly controlling browser access, there are the pages of product introduction.
Specifically, the pre-set web page address may include single web page address and/or web page address stream.
Further, client browser accesses to preset single web page address, that is, accesses a preset net Page address;And client browser accesses that (the web page address stream includes orderly multiple to preset web page address stream Web page address), i.e. the sequence based on the orderly multiple web page addresses for including in the web page address stream successively executes multiple net Each of page address web page address, to obtain the correspondence webpage returned from the server of the semiclosed data environment Data file, wherein the data file and the last one the web page address row phase being located in orderly multiple web page addresses It is corresponding.
For some pairs of higher data environments of security requirement, often may require that access side carries session token just can be with Access is executed, therefore, according to an embodiment of the invention, can further obtain by the semiclosed data after logining successfully The session token of the server distribution of environment, and carry the session token to the preset web in the semiclosed data environment Location accesses.
At step S106, according to preset data positional information corresponding with the webpage, from the data file It is middle to extract the data of corresponding position.
To by the client browser access obtain web data file analyze, according to it is preset in institute The corresponding file tag information of the data to be obtained in webpage is stated, the position of the label to match is positioned in the data file It sets.
Although text corresponding with interested data can be obtained by automatically analyze to the data file of acquisition Part label information, but consider for efficiency and accuracy, according to embodiments of the present invention, it can store and be directed in the database in advance The specific file tag information of particular webpage address, the label represent position of the specific data in web data file It sets.
For example, can store in the database: 1, the network access address to be accessed: www.facebook.com;2, phase The default log-on message answered: account: mike;Password: 123;3, there are the web page addresses that the needs of data of interest access: 1.facebook.com;4, the corresponding label information of interested data is the 2nd<a>mark in the data file of the webpage Label.
When running of the embodiment of the present invention, the network access address to be accessed can be obtained from database first, then root Login is executed according to corresponding log-on message, can control the web page address of the direct access preset of browser after logining successfully, from obtaining Web data file in the label position that matches positioned according to preset file tag information.
According to the label position, data corresponding with the label position are extracted.
The position for positioning the label to match, it is therefore intended that extract data corresponding with the label position User is showed, so, after positioning the position of the label to match in the data file, it can extract and the label The corresponding data in position.
When extracting data, certain data in data file can directly extract and show user, for example, literary Word content.
But it is also possible that script data (e.g., JS code) in the data file, since script data is executable text Part, so data cannot be directly extracted, in such a case, it is possible to achieve the purpose that extract data by other means, such as Run corresponding with label position script data first to obtain the operation result of the script data, and by operation result data It extracts.
The present invention also provides a kind of data serching device under semiclosed data environment, Fig. 2 show the present invention The module map for the data serching device under semiclosed data environment of embodiment.
The apparatus according to the invention may include login module 210, access modules 230, extraction module 250.
Login module 210 is stepped on by the browser using default log-on message for constructing a client browser Record accesses the network access address of the semiclosed data environment.
One database of building in advance, for recording the network access address of the semiclosed data environment, this can be logged in The default log-on message of data environment, the web page address for needing to access in the data environment, and wanted in the webpage The corresponding file tag information of the data of acquisition.
Using preset log-on message, login module 210 can obtain the access authority of the semiclosed data environment.
Access modules 230, for the client browser in the semiclosed data environment after logining successfully Preset web address accesses, and obtains the data text of the correspondence webpage returned from the server of the semiclosed data environment Part.
For requiring access side to carry the semiclosed data environment that session token can just access, work as login module After 210 successfully log in the server of the semiclosed data environment, which can provide session token, and access modules 230 obtain The session token distributed by the server of the semiclosed data environment, and the session token is carried to the semiclosed data environment In preset web address access.
Extraction module 250 is used for according to preset data positional information corresponding with the webpage, from the data text The data of corresponding position are extracted in part.
It further include positioning unit (not shown) and extraction unit (not shown) in the extraction module 250.
Wherein, positioning unit is used to divide the web data file obtained by client browser access Analysis, according to preset file tag information corresponding with the data to be obtained in the webpage, in the data file Position the position of the label to match.
Extraction unit is used to be extracted data corresponding with the label position according to the label position.
Extraction unit operation and the label if in the data file including script data, in the extraction module 250 The corresponding script data in position, and operation result data are extracted.
The present invention is by way of default and record the useful data in semiclosed data environment, in semiclosed data environment In, data are positioned and extract, finally by the data exhibiting to user, to improve data search efficiency, expand data and search Rope range, while also improving the accuracy of data search result.
The specific embodiment of modules included by the device of the invention as described in Fig. 2 and side of the invention The specific embodiment of step in method be it is corresponding, since Fig. 1 being described in detail, so in order not to mould The application is pasted, no longer the detail of modules is described herein.
The above description is only an embodiment of the present invention, is not intended to restrict the invention, for those skilled in the art For member, the invention may be variously modified and varied.All within the spirits and principles of the present invention, it is made it is any modification, Equivalent replacement, improvement etc., should be included within scope of the presently claimed invention.

Claims (10)

1. a kind of data search method under semiclosed data environment characterized by comprising
A client browser is constructed, by the browser, uses semiclosed data described in default log-on message sign-on access The network access address of environment;
The client browser accesses to the preset web address in the semiclosed data environment after logining successfully, And the data file of the correspondence webpage returned from the server of the semiclosed data environment is obtained, the preset web address packet Single web page address and/or web page address stream are included, the web page address stream includes orderly multiple web page addresses;
According to preset data positional information corresponding with the webpage, by the data of corresponding position from the data file It extracts.
2. the method as described in claim 1, which is characterized in that described according to preset data bit corresponding with the webpage Confidence breath, the step of extracting the data of corresponding position from the data file, comprising:
To by the client browser access obtain web data file analyze, according to it is preset in the net The corresponding file tag information of the data to be obtained in page, positions the position of the label to match in the data file;
According to the label position, data corresponding with the label position are extracted.
3. method according to claim 2, which is characterized in that described to extract data corresponding with the label position The step of, comprising: operation script data corresponding with the label position, and operation result data are extracted.
4. method according to claim 2, which is characterized in that further comprise: one database of building records described semiclosed The network access address of data environment, the default log-on message that can log in the data environment need to access in the data environment Web page address, and file tag information corresponding with the data to be obtained in the webpage.
5. the method as described in claim 1, which is characterized in that the client browser further wraps after logining successfully It includes: obtaining the session token distributed by the server of the semiclosed data environment, and carry the session token and sealed to described half The preset web address closed in data environment accesses.
6. a kind of data serching device under semiclosed data environment characterized by comprising
Login module uses default log-on message sign-on access institute by the browser for constructing a client browser State the network access address of semiclosed data environment;
Access modules, for the client browser to the preset web in the semiclosed data environment after logining successfully Address accesses, and obtains the data file of the correspondence webpage returned from the server of the semiclosed data environment, described Preset web address includes single web page address and/or web page address stream, and the web page address stream includes orderly multiple nets Page address;
Extraction module is used for according to preset data positional information corresponding with the webpage, will from the data file The data of corresponding position extract.
7. device as claimed in claim 6, which is characterized in that the extraction module includes:
Positioning unit, for analyzing the web data file obtained by client browser access, according to pre- If file tag information corresponding with the data to be obtained in the webpage, position and match in the data file Label position;
Extraction unit, for according to the label position, data corresponding with the label position to be extracted.
8. device as claimed in claim 7, which is characterized in that the extraction unit is opposite with the label position for running The script data answered, and operation result data are extracted.
9. device as claimed in claim 7, which is characterized in that it further comprise a database, it is described semiclosed for recording The network access address of data environment, the default log-on message that can log in the data environment need to access in the data environment Web page address, and file tag information corresponding with the data to be obtained in the webpage.
10. device as claimed in claim 6, which is characterized in that the access modules, for obtaining by the semiclosed data The session token of the server distribution of environment, and carry the session token to the preset web in the semiclosed data environment Location accesses.
CN201510205529.1A 2013-04-01 2013-04-01 A kind of method and apparatus for the data search under semiclosed data environment Active CN104866532B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510205529.1A CN104866532B (en) 2013-04-01 2013-04-01 A kind of method and apparatus for the data search under semiclosed data environment

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510205529.1A CN104866532B (en) 2013-04-01 2013-04-01 A kind of method and apparatus for the data search under semiclosed data environment
CN201310111969.1A CN103218422B (en) 2013-04-01 2013-04-01 Data searching method and device used in semi-sealed data environment

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CN201310111969.1A Division CN103218422B (en) 2013-04-01 2013-04-01 Data searching method and device used in semi-sealed data environment

Publications (2)

Publication Number Publication Date
CN104866532A CN104866532A (en) 2015-08-26
CN104866532B true CN104866532B (en) 2019-08-23

Family

ID=48816209

Family Applications (3)

Application Number Title Priority Date Filing Date
CN201510205529.1A Active CN104866532B (en) 2013-04-01 2013-04-01 A kind of method and apparatus for the data search under semiclosed data environment
CN201510206718.0A Active CN104866533B (en) 2013-04-01 2013-04-01 A kind of method and apparatus for the data search under semiclosed data environment
CN201310111969.1A Active CN103218422B (en) 2013-04-01 2013-04-01 Data searching method and device used in semi-sealed data environment

Family Applications After (2)

Application Number Title Priority Date Filing Date
CN201510206718.0A Active CN104866533B (en) 2013-04-01 2013-04-01 A kind of method and apparatus for the data search under semiclosed data environment
CN201310111969.1A Active CN103218422B (en) 2013-04-01 2013-04-01 Data searching method and device used in semi-sealed data environment

Country Status (2)

Country Link
CN (3) CN104866532B (en)
WO (1) WO2014161454A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104866532B (en) * 2013-04-01 2019-08-23 钱咸升(北京)网络科技有限公司 A kind of method and apparatus for the data search under semiclosed data environment
CN106897401A (en) * 2017-02-13 2017-06-27 北京奇虎科技有限公司 A kind of data-storage system and method

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008142614A1 (en) * 2007-05-24 2008-11-27 Nokia Corporation Webpage history view
CN101315695A (en) * 2008-07-09 2008-12-03 北京九恒星科技股份有限公司 Bank information processing method and data extraction component
CN101682638A (en) * 2007-04-27 2010-03-24 技术两合公开有限公司 The digital information service
CN102034178A (en) * 2009-09-29 2011-04-27 上海艾融信息科技有限公司 Cross-mechanism online payment method, system and device
CN102880624A (en) * 2011-07-16 2013-01-16 张文广 Website navigation tool system

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101415010B (en) * 2008-11-26 2012-07-04 涂彦晖 WEB browsing apparatus and operation method
CN101876978B (en) * 2009-04-30 2012-12-26 刘长龙 Website navigation system and method
CN102915360B (en) * 2012-10-17 2016-09-28 北京奇虎科技有限公司 Present the system of the relevant information of website
CN104866532B (en) * 2013-04-01 2019-08-23 钱咸升(北京)网络科技有限公司 A kind of method and apparatus for the data search under semiclosed data environment

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101682638A (en) * 2007-04-27 2010-03-24 技术两合公开有限公司 The digital information service
WO2008142614A1 (en) * 2007-05-24 2008-11-27 Nokia Corporation Webpage history view
CN101315695A (en) * 2008-07-09 2008-12-03 北京九恒星科技股份有限公司 Bank information processing method and data extraction component
CN102034178A (en) * 2009-09-29 2011-04-27 上海艾融信息科技有限公司 Cross-mechanism online payment method, system and device
CN102880624A (en) * 2011-07-16 2013-01-16 张文广 Website navigation tool system

Also Published As

Publication number Publication date
CN103218422A (en) 2013-07-24
CN104866532A (en) 2015-08-26
CN103218422B (en) 2015-06-03
CN104866533A (en) 2015-08-26
CN104866533B (en) 2019-02-05
WO2014161454A1 (en) 2014-10-09

Similar Documents

Publication Publication Date Title
CN104067561B (en) Method and system for dynamic scan WEB application
TWI515588B (en) Machine behavior determination method, web browser and web server
US20160301732A1 (en) Systems and Methods for Recording and Replaying of Web Transactions
US9438659B2 (en) Systems for serving website content according to user status
WO2018053620A1 (en) Digital communications platform for webpage overlay
CN104468790B (en) The processing method and client of cookie data
CN109376291B (en) Website fingerprint information scanning method and device based on web crawler
CN106131047A (en) Account login method and relevant device, account login system
US20130346476A1 (en) Serving Website Content According to User Status
WO2017053802A1 (en) System and method for detecting whether automatic login of user credentials to a web site has succeeded
CN103297469A (en) Method and device of collecting website data
CN107590236B (en) Big data acquisition method and system for building construction enterprises
CN110808868B (en) Test data acquisition method and device, computer equipment and storage medium
CN106446113A (en) Mobile big data analysis method and device
Gheorghe et al. Modern techniques of web scraping for data scientists
CN110555146A (en) method and system for generating network crawler camouflage data
CN107862039A (en) Web data acquisition methods, system and Data Matching method for pushing
CN104915438B (en) A method of obtaining PCU associated data in specific topics microblogging
CN108156118A (en) User Identity method and device
CN104361007B (en) The processing method of browser and its collection
CN104866532B (en) A kind of method and apparatus for the data search under semiclosed data environment
CN108322420A (en) The detection method and device of backdoor file
CN111797297B (en) Page data processing method and device, computer equipment and storage medium
CN104010019A (en) Classified information management system
CN108388796A (en) Dynamic domain name verification method, system, computer equipment and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
EXSB Decision made by sipo to initiate substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant