CN107948052A - Information crawler method, apparatus, electronic equipment and system - Google Patents

Information crawler method, apparatus, electronic equipment and system Download PDF

Info

Publication number
CN107948052A
CN107948052A CN201711125869.9A CN201711125869A CN107948052A CN 107948052 A CN107948052 A CN 107948052A CN 201711125869 A CN201711125869 A CN 201711125869A CN 107948052 A CN107948052 A CN 107948052A
Authority
CN
China
Prior art keywords
wechat
public platform
request
address
server
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711125869.9A
Other languages
Chinese (zh)
Inventor
沈文策
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujian Cnfol Information Technology Co Ltd
Original Assignee
Fujian Cnfol Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujian Cnfol Information Technology Co Ltd filed Critical Fujian Cnfol Information Technology Co Ltd
Priority to CN201711125869.9A priority Critical patent/CN107948052A/en
Publication of CN107948052A publication Critical patent/CN107948052A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L51/00User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
    • H04L51/04Real-time or near real-time messaging, e.g. instant messaging [IM]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/955Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    • G06F16/9566URL specific, e.g. using aliases, detecting broken or misspelled links
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/14Session management
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/56Provisioning of proxy services

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The present invention provides a kind of information crawler method, apparatus, electronic equipment and system, belong to Internet communication technology field.Information crawler method, apparatus, electronic equipment and system provided in an embodiment of the present invention, electronic equipment is set between wechat client and wechat server, obtain the network request address that wechat server provides, it is supplied to web crawlers unit, the data and information in wechat public platform are crawled by web crawlers unit, all articles of wechat public platform can be obtained, crawlers is obtained the content of substantial amounts of wechat public platform in a short time, the performance and efficiency of crawlers are lifted in very big program.

Description

Information crawler method, apparatus, electronic equipment and system
Technical field
The present invention relates to Internet communication technology field, in particular to a kind of information crawler method, apparatus, electronics Equipment and system.
Background technology
The mode of checking of the article in wechat public platform has two kinds at present:A kind of looked into by the function of search of search dog wechat See public platform article, one kind is to check public platform article by handset Wechat APP.Existing wechat public platform crawlers are most Number is crawled by way of search dog searches for wechat public platform, and efficiency is low, and search dog search has stringent anti-reptile rule, no The article of a large amount of public platforms can be obtained in a short time, even if it is also to have quantity limitation to get wechat public platform article.
The content of the invention
For above-mentioned problems of the prior art, the present invention provides a kind of information crawler method, apparatus, electronics to set Standby and system.
In a first aspect, an embodiment of the present invention provides a kind of information crawler method, applied to be arranged on wechat client with On electronic equipment between wechat server, for crawling the content in wechat public platform, including:
When receiving the wechat public platform access request that wechat client is sent, the chain that the access request carries is extracted It is grounded location and wechat public platform mark;
To the chained address, accordingly wechat server sends linking request;The linking request includes the wechat Public platform identifies;
Receive the wechat public platform that wechat server returns and identify corresponding network request address;
The network request address is sent to the web crawlers unit for crawling information, so that the web crawlers list Member crawls the content in the wechat public platform.
With reference to first aspect, an embodiment of the present invention provides the first possible embodiment of first aspect, wherein, to The chained address accordingly wechat server send linking request the step of, including:
Wechat program is opened, accordingly wechat server transmission link please to the chained address by the wechat program Ask.
With reference to first aspect, an embodiment of the present invention provides second of possible embodiment of first aspect, wherein, institute The method of stating further includes:
When receiving the wechat public platform access request that wechat client is sent, the use that the access request carries is extracted Family information;
Wechat program is logged according to the user information, by the wechat program to the chained address accordingly wechat Server sends linking request.
With reference to first aspect, an embodiment of the present invention provides the third possible embodiment of first aspect, wherein, connect After receiving the step of wechat public platform that wechat server returns identifies corresponding network request address, the method is also wrapped Include:
The network request address is sent to the wechat client for initiating wechat public platform access request.
Second aspect, the embodiment of the present invention additionally provide a kind of information crawler device, applied to being arranged on wechat client On electronic equipment between wechat server, for crawling the content in wechat public platform, including:
Chained address acquiring unit, for when receiving the wechat public platform access request that wechat client is sent, carrying Take the chained address and wechat public platform mark that the access request carries;
Linking request transmitting element, for accordingly wechat server to send linking request to the chained address;It is described Linking request includes the wechat public platform mark;
Request address transmitting element, the wechat public platform for receiving wechat server return identify corresponding network Request address;The network request address is sent to the web crawlers unit for crawling information, so that the web crawlers Unit crawls the content in the wechat public platform.
With reference to second aspect, an embodiment of the present invention provides the first possible embodiment of second aspect, wherein, institute Linking request transmitting element is stated, is additionally operable to:
Wechat program is opened, accordingly wechat server transmission link please to the chained address by the wechat program Ask.
With reference to second aspect, an embodiment of the present invention provides second of possible embodiment of second aspect, wherein, institute Request address transmitting element is stated, is additionally operable to:
The network request address is sent to the wechat client for initiating wechat public platform access request.
The third aspect, the embodiment of the present invention additionally provide a kind of electronic equipment, including processor and memory;The storage Device is used to store the program for supporting processor to perform the above method;The processor is configurable for performing in the memory The program of storage.
Fourth aspect, the embodiment of the present invention additionally provide a kind of information crawler system, including one or more wechat clients End, wechat server and above-mentioned electronic equipment;The wechat client connects the wechat service by the electronic equipment Device.
5th aspect, the embodiment of the present invention additionally provide a kind of machinable medium, it is characterised in that are stored with State the computer software instructions used in device.
The embodiment of the present invention brings following beneficial effect:
Information crawler method, apparatus, electronic equipment and system provided in an embodiment of the present invention, in wechat client and wechat Electronic equipment is set between server, obtains the network request address that wechat server provides, there is provided web crawlers unit is given, by Web crawlers unit crawls data and information in wechat public platform, can obtain all articles of wechat public platform, make reptile Program can obtain the content of substantial amounts of wechat public platform in a short time, in very big program lifted crawlers performance with Efficiency.
Other features and advantages of the present invention will illustrate in the following description, also, partly become from specification Obtain it is clear that or being understood by implementing the present invention.The purpose of the present invention and other advantages are in specification, claims And specifically noted structure is realized and obtained in attached drawing.
To enable the above objects, features and advantages of the present invention to become apparent, preferred embodiment cited below particularly, and coordinate Appended attached drawing, is described in detail below.
Brief description of the drawings
, below will be to specific in order to illustrate more clearly of the specific embodiment of the invention or technical solution of the prior art Embodiment or attached drawing needed to be used in the description of the prior art are briefly described, it should be apparent that, in describing below Attached drawing is some embodiments of the present invention, for those of ordinary skill in the art, before not making the creative labor Put, other attached drawings can also be obtained according to these attached drawings.
The flow chart for the information crawler method that Fig. 1 is provided by one embodiment of the invention;
The flow chart for the information crawler method that Fig. 2 is provided by another embodiment of the present invention;
The structure diagram for the information crawler device that Fig. 3 is provided by one embodiment of the invention;
The structure diagram for the information crawler system that Fig. 4 is provided by one embodiment of the invention.
Embodiment
To make the purpose, technical scheme and advantage of the embodiment of the present invention clearer, below in conjunction with attached drawing to the present invention Technical solution be clearly and completely described, it is clear that described embodiment is part of the embodiment of the present invention, rather than Whole embodiments.The component of embodiments of the present invention, which are generally described and illustrated herein in the accompanying drawings can be matched somebody with somebody with a variety of Put to arrange and design.Therefore, the detailed description of the embodiment of the present invention to providing in the accompanying drawings is not intended to limit below The scope of claimed invention, but it is merely representative of the selected embodiment of the present invention.Based on the embodiments of the present invention, originally Field those of ordinary skill all other embodiments obtained without making creative work, belong to the present invention The scope of protection.
For present crawlers crawl wechat public platform article efficiency it is low and can crawl article limited amount the problem of, An embodiment of the present invention provides a kind of information crawler method, apparatus, electronic equipment and system, below first to the information of the present invention Crawling method describes in detail.
Embodiment one
This embodiment offers a kind of information crawler method, applied to being arranged between wechat client and wechat server Electronic equipment on, for crawling the content in wechat public platform.Fig. 1 shows the information crawler method that the present embodiment is provided Flow chart.As shown in Figure 1, this method comprises the following steps:
Step S101, when receiving the wechat public platform access request that wechat client is sent, extraction access request is taken The chained address of band and wechat public platform mark.
In the present embodiment, wechat client needs above-mentioned electronic equipment to be carried out as intermediate equipment with wechat server Connection.The electronic equipment is referred to as proxy server.When wechat client pays close attention to certain wechat public platform or checks certain wechat , it is necessary to send wechat public platform access request during certain article in public platform, after which is received by electronic equipment, Electronic equipment can forward the access request.
Step S102, to chained address, accordingly wechat server sends linking request;Linking request includes wechat public affairs Many numbers marks.
When sending linking request to wechat server, it can be transmitted by third party's program or wechat program.For example, Wechat program is opened, accordingly wechat server sends linking request to chained address by wechat program.
Alternatively, when receiving the wechat public platform access request that wechat client is sent, extraction access request carries User information;Wechat program is logged according to user information, accordingly wechat server is sent out to chained address by wechat program Send linking request.By this operation, wechat server can determine which the request comes from from the linking request received Wechat client.
Step S103, receives the wechat public platform that wechat server returns and identifies corresponding network request address.
, can be according to the wechat public platform identifier lookup wechat public platform or this is micro- after wechat server receives linking request Believe the network request address of public platform article, which is returned back into electronic equipment.
Step S104, network request address is sent to the web crawlers unit for crawling information, so that web crawlers Unit crawls the content in wechat public platform.
Information crawler method provided in this embodiment, electronic equipment is set between wechat client and wechat server, Obtain the network request address that wechat server provides, there is provided give web crawlers unit, wechat public affairs are crawled by web crawlers unit Data and information in many numbers, can obtain all articles of wechat public platform, crawlers is obtained in a short time The content of substantial amounts of wechat public platform, lifts the performance and efficiency of crawlers in very big program.
In one more preferably embodiment, as shown in Fig. 2, information crawler method includes the following steps:
Step S201, when receiving the wechat public platform access request that wechat client is sent, extraction access request is taken The chained address of band and wechat public platform mark;
Step S202, to chained address, accordingly wechat server sends linking request;Linking request includes wechat public affairs Many numbers marks;
Step S203, receives the wechat public platform that wechat server returns and identifies corresponding network request address;
Step S204, network request address is sent to the web crawlers unit for crawling information, so that web crawlers Unit crawls the content in wechat public platform;
Step S205, network request address is sent to the wechat client for initiating wechat public platform access request.
Wherein, the order of step S204 and step S205 can exchange.
A kind of concrete application of information crawler method provided in an embodiment of the present invention is implemented as follows:
Electronic equipment passes through MITM (Man-in-the-middle-attacks, man-in-the-middle attack) technical limit spacing wechat visitor The network request address that the wechat public platform access request and wechat server at family end return.MITM technologies are a kind of indirect aggressions The network technology of computer, although MITM technologies bring definitely Network Security Vulnerabilities, reasonably utilizes MITM technologies, can Greatly to simplify the development process of reptile, the performance and efficiency of crawler technology are lifted.
When the information crawler method of the present embodiment is realized using MITM technologies, first, wechat client (user terminal) needs Open MITM services.Then the proxy server (electronic equipment) of wechat client is set, take the MITM of wechat client Business accesses wechat server by this proxy server.After being provided with, open wechat program in wechat client and log in, Wechat client, which accesses, needs the wechat public platform that reptile crawls.MITM services on proxy server can obtain wechat public affairs The network request address of many numbers, then sends the network request address to web crawlers unit, so that web crawlers unit can To crawl the content of wechat public platform.
Information crawler method provided in this embodiment, electronic equipment is set between wechat client and wechat server, Obtain the network request address that wechat server provides, there is provided give web crawlers unit, wechat public affairs are crawled by web crawlers unit Data and information in many numbers, can obtain all articles of wechat public platform, crawlers is obtained in a short time The content of substantial amounts of wechat public platform, lifts the performance and efficiency of crawlers in very big program.
Embodiment two
With the method embodiment accordingly, a kind of information crawler device is present embodiments provided, applied to being arranged on On electronic equipment between wechat client and wechat server, for crawling the content in wechat public platform.Fig. 3 shows this The structure diagram for the information crawler device that embodiment is provided.As shown in figure 3, the device includes:
Chained address acquiring unit 31, for when receiving the wechat public platform access request that wechat client is sent, Extract chained address and the wechat public platform mark that the access request carries;
Linking request transmitting element 32, for accordingly wechat server to send linking request to the chained address;Institute State linking request and include the wechat public platform mark;
Request address transmitting element 33, the wechat public platform for receiving wechat server return identify corresponding net Network request address;The network request address is sent to the web crawlers unit for crawling information, so that the network is climbed Worm unit crawls the content in the wechat public platform.
Wherein, chained address acquiring unit 31, can be also used for the user information that extraction access request carries, the user Information can include one or more of:User name, password, the pet name, telephone number, mailbox etc..
Linking request transmitting element 32, can be also used for opening wechat program, by the wechat program to the link Accordingly wechat server sends linking request for address;Or for logging in wechat program according to user information, pass through wechat journey To chained address, accordingly wechat server sends linking request to sequence.
Request address transmitting element 33, can be also used for sending to initiation wechat public platform to access by network request address asking The wechat client asked.
Information crawler device provided in an embodiment of the present invention, sets electronics to set between wechat client and wechat server It is standby, obtain the network request address that wechat server provides, there is provided give web crawlers unit, wechat is crawled by web crawlers unit Data and information in public platform, can obtain all articles of wechat public platform, crawlers is obtained in a short time The content of substantial amounts of wechat public platform is taken, the performance and efficiency of crawlers are lifted in very big program.
Embodiment three
This embodiment offers a kind of electronic equipment.The electronic equipment includes processor and memory.
Memory is used for the software program module for storing processor execution.Processor is used to perform institute in above-described embodiment one The method of record.Memory may include high speed random access memory, may also include nonvolatile memory, such as one or more magnetic Property storage device, flash memory or other non-volatile solid state memories.
Further, above-mentioned software program module may also include:Operating system and service module.Wherein operating system, LINUX, UNIX, WINDOWS are may be, for example, it may include various to be used for management system task (such as memory management, storage device Control, power management etc.) component software and/or driving, and can mutually be communicated with various hardware or component software, so as to provide The running environment of other software component.
Service module is operated on the basis of operating system, and is monitored by the network service of operating system come automatic network Request, corresponding data processing is completed according to request, and returns to mobile terminal of the handling result to user.That is, service Module is used to provide network service to the mobile terminal of user.
The electronic equipment further includes the mixed-media network modules mixed-media for receiving and sending network signal.Above-mentioned network signal may include Wireless signal or wire signal.
It is understood that the explanation is only to schematically illustrate, server may also include than above-mentioned more or less Component, or there is different configurations.Above-mentioned each component can use hardware, software or its combination to realize.It is in addition, of the invention Server in embodiment can also include the server of multiple specific difference in functionality.
The technique effect and preceding method embodiment of the electronic equipment that above-described embodiment is provided, its realization principle and generation Identical, to briefly describe, device embodiment part does not refer to part, refers to corresponding contents in preceding method embodiment.
Example IV
This embodiment offers a kind of information crawler system, Fig. 4 shows the structure diagram of the information crawler system.Such as figure Shown in 4, which includes one or more wechat client 100, wechat server 300 and above-mentioned electronic equipments 200;Wechat client 100 connects wechat server 300 by electronic equipment 200.
When wechat client sends wechat public platform access request to wechat server, electronic equipment can be according to the visit Ask that request sends linking request to wechat server, obtain the network request address for the wechat public platform that wechat server returns, The network request address is supplied to web crawlers unit, to crawl the content of wechat public platform.
Further, another embodiment of the present invention additionally provides a kind of machinable medium, is stored with above device Computer software instructions used.
Each embodiment in this specification is described by the way of progressive, what each embodiment stressed be with The difference of other embodiment, between each embodiment identical similar part mutually referring to.
Information crawler method, apparatus, electronic equipment and system provided in an embodiment of the present invention have identical technical characteristic, So can also solve identical technical problem, reach identical technique effect.
It should be noted that in embodiment provided by the present invention, it should be understood that disclosed system and method, can To realize by another way.Device embodiment described above is only schematical, for example, the unit is drawn Point, it is only a kind of division of logic function, there can be other dividing mode when actually realizing, in another example, multiple units or group Part can combine or be desirably integrated into another system, or some features can be ignored, or not perform.It is described to be used as separation unit The unit that part illustrates may or may not be physically separate, can be as the component that unit is shown or also may be used Not to be physical location, you can with positioned at a place, or can also be distributed in multiple network unit.Can be according to reality Need select some or all of unit therein to realize the purpose of this embodiment scheme.
In addition, each functional unit in embodiment provided by the invention can be integrated in a processing unit, also may be used To be that unit is individually physically present, can also two or more units integrate in a unit.
If the function is realized in the form of SFU software functional unit and is used as independent production marketing or in use, can be with It is stored in a computer read/write memory medium.Based on such understanding, technical scheme is substantially in other words The part to contribute to the prior art or the part of the technical solution can be embodied in the form of software product, the meter Calculation machine software product is stored in a storage medium, including some instructions are used so that a computer equipment (can be People's computer, server, or network equipment etc.) perform all or part of step of each embodiment the method for the present invention. And foregoing storage medium includes:USB flash disk, mobile hard disk, read-only storage (ROM, Read-Only Memory), arbitrary access are deposited Reservoir (RAM, Random Access Memory), magnetic disc or CD etc. are various can be with the medium of store program codes.
In addition, term " first ", " second ", " the 3rd " are only used for description purpose, and it is not intended that instruction or implying phase To importance.
Finally it should be noted that:Embodiment described above, is only the embodiment of the present invention, to illustrate the present invention Technical solution, rather than its limitations, protection scope of the present invention is not limited thereto, although with reference to the foregoing embodiments to this hair It is bright to be described in detail, it will be understood by those of ordinary skill in the art that:Any one skilled in the art The invention discloses technical scope in, it can still modify the technical solution described in previous embodiment or can be light It is readily conceivable that change, or equivalent substitution is carried out to which part technical characteristic;And these modifications, change or replacement, do not make The essence of appropriate technical solution departs from the spirit and scope of technical solution of the embodiment of the present invention, should all cover the protection in the present invention Within the scope of.Therefore, protection scope of the present invention answers the scope of the claims of being subject to.

Claims (10)

  1. A kind of 1. information crawler method, it is characterised in that applied to the electricity being arranged between wechat client and wechat server In sub- equipment, for crawling the content in wechat public platform, including:
    When receiving the wechat public platform access request that wechat client is sent, the chain ground connection that the access request carries is extracted Location and wechat public platform mark;
    To the chained address, accordingly wechat server sends linking request;The linking request includes the wechat public Number mark;
    Receive the wechat public platform that wechat server returns and identify corresponding network request address;
    The network request address is sent to the web crawlers unit for crawling information, so that the web crawlers unit is climbed Take the content in the wechat public platform.
  2. 2. according to the method described in claim 1, it is characterized in that, to the chained address accordingly wechat server send chain The step of connecing request, including:
    Wechat program is opened, accordingly wechat server sends linking request to the chained address by the wechat program.
  3. 3. according to the method described in claim 1, it is characterized in that, the method further includes:
    When receiving the wechat public platform access request that wechat client is sent, user's letter that the access request carries is extracted Breath;
    Wechat program is logged according to the user information, by the wechat program to the chained address accordingly wechat service Device sends linking request.
  4. 4. according to the method described in claim 1, it is characterized in that, receive the wechat public platform mark that wechat server returns After the step of knowing corresponding network request address, the method further includes:
    The network request address is sent to the wechat client for initiating wechat public platform access request.
  5. 5. a kind of information crawler device, it is characterised in that applied to the electricity being arranged between wechat client and wechat server In sub- equipment, for crawling the content in wechat public platform, including:
    Chained address acquiring unit, for when receiving the wechat public platform access request that wechat client is sent, extracting institute State chained address and the wechat public platform mark of access request carrying;
    Linking request transmitting element, for accordingly wechat server to send linking request to the chained address;The link Request bag contains the wechat public platform mark;
    Request address transmitting element, the wechat public platform for receiving wechat server return identify corresponding network request Address;The network request address is sent to the web crawlers unit for crawling information, so that the web crawlers unit Crawl the content in the wechat public platform.
  6. 6. device according to claim 5, it is characterised in that the linking request transmitting element, is additionally operable to:
    Wechat program is opened, accordingly wechat server sends linking request to the chained address by the wechat program.
  7. 7. device according to claim 5, it is characterised in that the request address transmitting element, is additionally operable to:
    The network request address is sent to the wechat client for initiating wechat public platform access request.
  8. 8. a kind of electronic equipment, it is characterised in that including processor and memory;The memory, which is used to store, supports processor The program of any one of perform claim requirement 1 to 4 the method;The processor is configurable for performing and is deposited in the memory The program of storage.
  9. 9. a kind of information crawler system, it is characterised in that will including one or more wechat client, wechat server and rights Seek the electronic equipment described in 8;The wechat client connects the wechat server by the electronic equipment.
  10. 10. a kind of machinable medium, it is characterised in that be stored with used in any one of claim 5 to 7 described device Computer software instructions.
CN201711125869.9A 2017-11-14 2017-11-14 Information crawler method, apparatus, electronic equipment and system Pending CN107948052A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711125869.9A CN107948052A (en) 2017-11-14 2017-11-14 Information crawler method, apparatus, electronic equipment and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711125869.9A CN107948052A (en) 2017-11-14 2017-11-14 Information crawler method, apparatus, electronic equipment and system

Publications (1)

Publication Number Publication Date
CN107948052A true CN107948052A (en) 2018-04-20

Family

ID=61932177

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711125869.9A Pending CN107948052A (en) 2017-11-14 2017-11-14 Information crawler method, apparatus, electronic equipment and system

Country Status (1)

Country Link
CN (1) CN107948052A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109388735A (en) * 2018-09-13 2019-02-26 广州丰石科技有限公司 A method of crawling wechat public platform information
CN110188257A (en) * 2019-04-16 2019-08-30 国家计算机网络与信息安全管理中心 A kind of mobile application collecting method and device
CN110677423A (en) * 2019-09-30 2020-01-10 深圳前海环融联易信息科技服务有限公司 Data acquisition method and device based on client agent side and computer equipment
CN110781367A (en) * 2019-09-25 2020-02-11 中国科学院计算技术研究所 Internet data acquisition method and system based on man-in-the-middle
CN115242491A (en) * 2022-07-19 2022-10-25 厦门市美亚柏科信息股份有限公司 APP cloud detection method and system based on web crawler

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103961869A (en) * 2014-04-14 2014-08-06 林云帆 Device control method
CN104035997A (en) * 2014-06-13 2014-09-10 淮阴工学院 Scientific and technical information acquisition and pushing method based on text classification and image deep mining
CN105243122A (en) * 2015-09-29 2016-01-13 浪潮电子信息产业股份有限公司 Social software based data acquisition method and apparatus
CN105320740A (en) * 2015-09-22 2016-02-10 清华大学 WeChat article and official account acquisition method and acquisition system
CN105429865A (en) * 2015-12-31 2016-03-23 深圳中泓在线股份有限公司 WeChat public number data collection method and device based on browser
CN105577528A (en) * 2015-12-31 2016-05-11 深圳中泓在线股份有限公司 Wechat official account data collection method and device based on virtual machine
CN105718587A (en) * 2016-01-26 2016-06-29 王薇 Network content resource evaluation method and evaluation system
CN105790944A (en) * 2014-12-22 2016-07-20 深圳易思智科技有限公司 Wechat-based network authentication method and device
CN106202232A (en) * 2016-06-27 2016-12-07 中国南方电网有限责任公司电网技术研究中心 A kind of analysis method and device of power-off event

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103961869A (en) * 2014-04-14 2014-08-06 林云帆 Device control method
CN104035997A (en) * 2014-06-13 2014-09-10 淮阴工学院 Scientific and technical information acquisition and pushing method based on text classification and image deep mining
CN105790944A (en) * 2014-12-22 2016-07-20 深圳易思智科技有限公司 Wechat-based network authentication method and device
CN105320740A (en) * 2015-09-22 2016-02-10 清华大学 WeChat article and official account acquisition method and acquisition system
CN105243122A (en) * 2015-09-29 2016-01-13 浪潮电子信息产业股份有限公司 Social software based data acquisition method and apparatus
CN105429865A (en) * 2015-12-31 2016-03-23 深圳中泓在线股份有限公司 WeChat public number data collection method and device based on browser
CN105577528A (en) * 2015-12-31 2016-05-11 深圳中泓在线股份有限公司 Wechat official account data collection method and device based on virtual machine
CN105718587A (en) * 2016-01-26 2016-06-29 王薇 Network content resource evaluation method and evaluation system
CN106202232A (en) * 2016-06-27 2016-12-07 中国南方电网有限责任公司电网技术研究中心 A kind of analysis method and device of power-off event

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
LIJINMA: "如何优雅的抓取微信公众号历史文章", 《LARAVEL》 *
飯口組組长: "持续更新,微信公众号文章批量采集系统的构建", 《知乎专栏》 *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109388735A (en) * 2018-09-13 2019-02-26 广州丰石科技有限公司 A method of crawling wechat public platform information
CN110188257A (en) * 2019-04-16 2019-08-30 国家计算机网络与信息安全管理中心 A kind of mobile application collecting method and device
CN110188257B (en) * 2019-04-16 2021-12-31 国家计算机网络与信息安全管理中心 Mobile application data acquisition method and device
CN110781367A (en) * 2019-09-25 2020-02-11 中国科学院计算技术研究所 Internet data acquisition method and system based on man-in-the-middle
CN110781367B (en) * 2019-09-25 2023-10-20 中国科学院计算技术研究所 Internet data acquisition method and system based on middleman
CN110677423A (en) * 2019-09-30 2020-01-10 深圳前海环融联易信息科技服务有限公司 Data acquisition method and device based on client agent side and computer equipment
CN115242491A (en) * 2022-07-19 2022-10-25 厦门市美亚柏科信息股份有限公司 APP cloud detection method and system based on web crawler
CN115242491B (en) * 2022-07-19 2024-04-19 厦门市美亚柏科信息股份有限公司 APP cloud detection method and system based on web crawlers

Similar Documents

Publication Publication Date Title
CN107948052A (en) Information crawler method, apparatus, electronic equipment and system
CN106528432B (en) The construction method and device of test scene data bury a test method
CN104333599B (en) Share the method and system and application service platform of application
CN104283843B (en) A kind of method, apparatus and system that user logs in
CN103607385B (en) Method and apparatus for security detection based on browser
CN107809383A (en) A kind of map paths method and device based on MVC
CN103150513B (en) The method of the implantation information in interception application program and device
CN106933871A (en) Short linking processing method, device and short linked server
CN107404481B (en) User information recognition methods and device
CN106453216A (en) Malicious website interception method, malicious website interception device and client
CN108090091A (en) Web page crawl method and apparatus
CN107147748A (en) File uploading method and device
CN106878368A (en) The implementation method and device of information pushing
CN107807937A (en) A kind of website SEO processing methods, apparatus and system
CN105553968A (en) Method and device for realizing login by multiple accounts
CN107689941A (en) A kind of apparatus and method for preventing same user's repeat logon
CN106681799A (en) Disk inserting method, device and system
CN106572095A (en) Account registration method, device and system
CN103703474A (en) Handling device generated data
CN104618410A (en) Resource push method and resource push device
CN101645021B (en) Integrating method for multisystem single-spot logging under Java application server
CN106603556A (en) Single sign-on method, device and system
CN107294905A (en) A kind of method and device for recognizing user
CN104462488A (en) Database high reliability solution method and device
CN105144073A (en) Removable storage device identity and configuration information

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20180420