CN105577528B - A kind of wechat public platform collecting method and device based on virtual machine - Google Patents

A kind of wechat public platform collecting method and device based on virtual machine Download PDF

Info

Publication number
CN105577528B
CN105577528B CN201511013666.1A CN201511013666A CN105577528B CN 105577528 B CN105577528 B CN 105577528B CN 201511013666 A CN201511013666 A CN 201511013666A CN 105577528 B CN105577528 B CN 105577528B
Authority
CN
China
Prior art keywords
public platform
wechat
wechat public
virtual machine
account
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201511013666.1A
Other languages
Chinese (zh)
Other versions
CN105577528A (en
Inventor
陈志群
李晓亮
马瑞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Zhonghong On-Line Co Ltd
Original Assignee
Shenzhen Zhonghong On-Line Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Zhonghong On-Line Co Ltd filed Critical Shenzhen Zhonghong On-Line Co Ltd
Priority to CN201511013666.1A priority Critical patent/CN105577528B/en
Publication of CN105577528A publication Critical patent/CN105577528A/en
Application granted granted Critical
Publication of CN105577528B publication Critical patent/CN105577528B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L51/00User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
    • H04L51/21Monitoring or handling of messages
    • H04L51/214Monitoring or handling of messages using selective forwarding
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45504Abstract machines for programme code execution, e.g. Java virtual machine [JVM], interpreters, emulators
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L51/00User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
    • H04L51/52User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail for supporting social networking services

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Information Transfer Between Computers (AREA)
  • Computer And Data Communications (AREA)

Abstract

The present invention relates to field of information acquisition, and in particular to a kind of wechat public platform collecting method and device based on virtual machine.The wechat public platform interaction data packet that virtual machine is interacted with internet is monitored and is downloaded when virtual machine is interacted with internet data by virtual machine, Quick Macro simulated operation by this method and device;After virtual machine simulation logs in wechat, effect can keep Entered state not go offline as mobile phone after logging in, if special circumstances, which go offline, to be logged in automatically in the case where restoring proper network access, so more stable;Acquisition target intelligently increases, and can pay close attention to wechat public platform automatically, and solve the problem of manually to pay close attention to cumbersome solve artificial concern amount and increases slowly;Access it is unrestricted, under Entered state, simulation click obtain wechat public number it is unrestricted, guarantee the comprehensive of data acquisition;Timeliness is good, simulated operation, can actively obtain latest data in time.

Description

A kind of wechat public platform collecting method and device based on virtual machine
Technical field
The present invention relates to field of information acquisition, and in particular to a kind of wechat public platform collecting method based on virtual machine And device.
Background technique
Wechat public platform push article can be checked by two kinds of approach, one be search engine wechat search, but Be search information it is too mixed and disorderly, and limited by provider server.Second is concern public platform, and acceptable public platform pushes away The article sent, but the public platform of each wechat account concern is limited.
In general wechat acquisition method, the request of plain engine is searched by simulating, acquisition keyword search comes out micro- Message chapter, this method is the disadvantage is that request frequency can be strictly restricted, and occurring needing to input identifying code could continue to request, journey Sequence can not identify verifying and cause to acquire interruption, information collection low efficiency and information clutter.
Summary of the invention
To overcome drawbacks described above, the purpose of the present invention is to provide a kind of wechat public's number based on virtual machine and adopts Set method and device.
The purpose of the present invention is achieved through the following technical solutions:
A kind of wechat public platform collecting method based on virtual machine of the invention, comprising the following steps:
The wechat public platform account paid close attention to will be needed to be collected;
Log in more than one wechat account simultaneously on a virtual machine, virtual machine will be collected by Quick Macro simulated operation To wechat public platform account be added to the task queue of wechat account and paid close attention to;
Virtual machine clicks wechat public platform activly request wechat public's number by Quick Macro simulated operation, virtual The wechat public platform interaction data packet that virtual machine is interacted with internet is monitored and is downloaded when machine is interacted with internet data;
Wechat relevant information needed for scanning the wechat public platform interaction data packet on virtual machine and extracting;
Wechat relevant information needed for extracting parses wechat public platform index information, accesses the wechat public platform rope Draw the specific wechat information of acquisition of information wechat public platform.
Further, further comprising the steps of:
Wechat public platform interaction data packet on scanning virtual machine extracts other related wechat public platform accounts, by other phases The task queue that pass wechat public platform account is added to wechat account is paid close attention to.
Further, the wechat public platform account paid close attention to will be needed to be collected are as follows: micro- by internet information Text Feature Extraction Signal, and the WeChat ID audited, intervenes, classify, arrangement filters out wechat public platform and is stored in wechat public platform account Database.
Further, the wechat public platform index information is wechat article chained address or wechat public platform account.
Further, the specific wechat information of the wechat public platform includes title, author, issuing time, article content.
Further, the virtual machine is Android virtual machine.
A kind of wechat public platform data acquisition device based on virtual machine, the device are connect with virtual-machine data, the device Including sequentially connected:
The account collector unit that the wechat public platform account paid close attention to will be needed to be collected;
The wechat public platform interaction data packet for interacting virtual machine with internet when virtual machine is interacted with internet data The data monitoring and download unit for being monitored and downloading;
The data scanning of wechat relevant information needed for scanning the wechat public platform interaction data packet on virtual machine and extracting Extraction unit;
Wechat relevant information needed for extracting parses wechat public platform index information, accesses the wechat public platform rope Draw the parsing and access acquiring unit of the specific wechat information of acquisition of information wechat public platform.
Further, which further includes connecting with data monitoring and download unit, scanning wechat public platform on virtual machine Interaction data packet extracts the account extraction unit of other related wechat public platform accounts.
A kind of wechat public platform collecting method and device, this method and device based on virtual machine provided by the invention Virtual machine is interacted with internet when virtual machine is interacted with internet data by virtual machine, Quick Macro simulated operation micro- Letter public platform interaction data packet is monitored and downloads;After virtual machine simulation logs in wechat, effect, can after logging in as mobile phone Entered state is kept not go offline, if special circumstances, which go offline, to be logged in automatically in the case where restoring proper network access, so It is more stable;Acquisition target intelligently increases, and can pay close attention to wechat public platform automatically, and solution is manually paid close attention to cumbersome, solves artificial The problem of concern amount increasess slowly;Access it is unrestricted, under Entered state, simulation click obtain wechat public number it is unrestricted System, guarantees the comprehensive of data acquisition;Timeliness is good, simulated operation, can actively obtain latest data in time.
Detailed description of the invention
The present invention is described in detail by following preferred embodiments and attached drawing for ease of explanation,.
Fig. 1 is a kind of step flow chart of the wechat public platform collecting method based on virtual machine of the present invention;
Fig. 2 is a kind of module frame chart of the wechat public platform data acquisition device based on virtual machine of the present invention.
Specific embodiment
In order to make the objectives, technical solutions, and advantages of the present invention clearer, with reference to the accompanying drawings and embodiments, right The present invention is further elaborated.It should be appreciated that the specific embodiments described herein are merely illustrative of the present invention, and It is not used in the restriction present invention.
Please refer to Fig. 1, a kind of wechat public platform collecting method based on virtual machine of the invention, including following step It is rapid:
The wechat public platform account paid close attention to will be needed to be collected, specifically: pass through internet information Text Feature Extraction wechat Number, and the WeChat ID audited, intervenes, classify, arrangement filters out wechat public platform and is stored in wechat public platform account number According to library;
Android virtual machine is built on the server, and Android wechat client app is installed, and setting needs the wechat acquired Username and password is logged in wechat client by account on Android virtual machine, i.e., logs in more than one simultaneously on a virtual machine Wechat account, while the wechat public platform account being collected into is added to wechat account by Quick Macro simulated operation by virtual machine Number task queue paid close attention to, after wechat account logs in, open entrance for wechat acquisition, but acquisition target is wechat public affairs Many numbers article links, so needing to add to wechat public platform account and pay close attention to, concern needs before acquisition wechat article link The wechat public platform being collected into is added to task queue by the wechat public platform to be acquired, and virtual machine is executed by Quick Macro The button operation set will need the wechat public platform paid close attention to be automatically added to concern list, and the wechat account can in this way To obtain the article information of the wechat public platform of concern;
Virtual machine clicks wechat public platform activly request wechat public's number by Quick Macro simulated operation, virtual The wechat public platform interaction data packet that virtual machine is interacted with internet is monitored and is downloaded when machine is interacted with internet data, I.e. by Quick Macro, simulates and click public platform, activly request wechat data, the historical data including obtaining wechat public platform, During virtual machine is interacted with internet, the data packet between them is monitored, data packet is saved under specified path;
Data packet is parsed, the wechat correlation letter needed for scanning the wechat public platform interaction data packet on virtual machine and extracting Breath, and the wechat relevant information is saved in specified folder;
Specific wechat information downloading, the wechat relevant information needed for extracting parse wechat public platform index information, The specific wechat information that the wechat public platform index information obtains wechat public platform is accessed, which can be with Specific wechat information for wechat article chained address or wechat public platform account, the wechat public platform includes title, author, hair The information such as cloth time, article content, and format storage and save.
Wherein, the wechat public platform collecting method is further comprising the steps of:
Wechat public platform interaction data packet on scanning virtual machine extracts other related wechat public platform accounts, by other phases The task queue that pass wechat public platform account is added to wechat account is paid close attention to, i.e., by monitoring scan data packet, analysis is mentioned Other wechat public platforms are produced, these wechat public platforms are automatically added to the concern list of wechat account, so that data source obtains To expand.
Referring specifically to Fig. 2, a kind of wechat public platform collecting method using above-mentioned based on virtual machine based on virtual The wechat public platform data acquisition device of machine, the device are connect with virtual-machine data, which includes sequentially connected:
The account collector unit that the wechat public platform account paid close attention to will be needed to be collected;
The wechat public platform interaction data packet for interacting virtual machine with internet when virtual machine is interacted with internet data The data monitoring and download unit for being monitored and downloading;
The data scanning of wechat relevant information needed for scanning the wechat public platform interaction data packet on virtual machine and extracting Extraction unit;
Wechat relevant information needed for extracting parses wechat public platform index information, accesses the wechat public platform rope Draw the parsing and access acquiring unit of the specific wechat information of acquisition of information wechat public platform.
Further, which further includes connecting with data monitoring and download unit, scanning wechat public platform on virtual machine Interaction data packet extracts the account extraction unit of other related wechat public platform accounts.
A kind of wechat public platform collecting method and device, this method and device based on virtual machine provided by the invention Virtual machine is interacted with internet when virtual machine is interacted with internet data by virtual machine, Quick Macro simulated operation micro- Letter public platform interaction data packet is monitored and downloads;After virtual machine simulation logs in wechat, effect, can after logging in as mobile phone Entered state is kept not go offline, if special circumstances, which go offline, to be logged in automatically in the case where restoring proper network access, so It is more stable;Acquisition target intelligently increases, and can pay close attention to wechat public platform automatically, and solution is manually paid close attention to cumbersome, solves artificial The problem of concern amount increasess slowly;Access it is unrestricted, under Entered state, simulation click obtain wechat public number it is unrestricted System, guarantees the comprehensive of data acquisition;Timeliness is good, simulated operation, can actively obtain latest data in time.
The foregoing is merely illustrative of the preferred embodiments of the present invention, is not intended to limit the invention, all in essence of the invention Made any modifications, equivalent replacements, and improvements etc., should all be included in the protection scope of the present invention within mind and principle.

Claims (7)

1. a kind of wechat public platform collecting method based on virtual machine, which comprises the following steps:
The wechat public platform account paid close attention to will be needed to be collected;
More than one wechat account is logged on a virtual machine, and virtual machine passes through the wechat that Quick Macro simulated operation will be collected into The task queue that public platform is added to wechat account is paid close attention to;
Virtual machine by Quick Macro simulated operation click wechat public platform activly request wechat public's number, virtual machine with The wechat public platform interaction data packet that virtual machine is interacted with internet is monitored and is downloaded when internet data interaction;
Wechat relevant information needed for scanning the wechat public platform interaction data packet on virtual machine and extracting;
Wechat relevant information needed for extracting parses wechat public platform index information, accesses wechat public platform index letter Breath obtains the specific wechat information of wechat public platform;The wechat public platform index information is wechat article chained address or wechat Public platform account.
2. a kind of wechat public platform collecting method based on virtual machine according to claim 1, which is characterized in that also The following steps are included:
Wechat public platform interaction data packet on scanning virtual machine extracts other related wechat public platform accounts, other correlations are micro- The task queue that letter public platform is added to wechat account is paid close attention to.
3. a kind of wechat public platform collecting method based on virtual machine according to claim 1 or 2, feature exist In the wechat public platform account paid close attention to being needed to be collected are as follows: by internet information Text Feature Extraction WeChat ID, and this is micro- Signal is audited, is intervened, is classified, and arrangement filters out wechat public platform and is stored in wechat public number database.
4. a kind of wechat public platform collecting method based on virtual machine according to claim 1, which is characterized in that institute The specific wechat information for stating wechat public platform includes title, author, issuing time, article content.
5. a kind of wechat public platform collecting method based on virtual machine according to claim 1, which is characterized in that institute Stating virtual machine is Android virtual machine.
6. a kind of wechat public platform data acquisition device based on virtual machine, which is characterized in that the device and virtual-machine data connect It connects, which includes sequentially connected:
The account collector unit that the wechat public platform account paid close attention to will be needed to be collected;
The wechat public platform interaction data packet for interacting virtual machine with internet when virtual machine is interacted with internet data carries out The data monitoring and download unit for monitoring and downloading;
The data scanning of wechat relevant information needed for scanning the wechat public platform interaction data packet on virtual machine and extracting extracts Unit;
Wechat relevant information needed for extracting parses wechat public platform index information, accesses wechat public platform index letter Breath obtains the parsing and access acquiring unit of the specific wechat information of wechat public platform;The wechat public platform index information is wechat Article chained address or wechat public platform account.
7. a kind of wechat public platform data acquisition device based on virtual machine according to claim 6, which is characterized in that should Device further include connect with data monitoring and download unit, the wechat public platform interaction data packet that scans on virtual machine extracts other The account extraction unit of related wechat public platform account.
CN201511013666.1A 2015-12-31 2015-12-31 A kind of wechat public platform collecting method and device based on virtual machine Active CN105577528B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201511013666.1A CN105577528B (en) 2015-12-31 2015-12-31 A kind of wechat public platform collecting method and device based on virtual machine

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201511013666.1A CN105577528B (en) 2015-12-31 2015-12-31 A kind of wechat public platform collecting method and device based on virtual machine

Publications (2)

Publication Number Publication Date
CN105577528A CN105577528A (en) 2016-05-11
CN105577528B true CN105577528B (en) 2019-01-15

Family

ID=55887217

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201511013666.1A Active CN105577528B (en) 2015-12-31 2015-12-31 A kind of wechat public platform collecting method and device based on virtual machine

Country Status (1)

Country Link
CN (1) CN105577528B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106059851A (en) * 2016-05-20 2016-10-26 天津海量信息技术股份有限公司 App data collection method based on cooperative work of mobile end and service end
CN107644021A (en) * 2016-07-20 2018-01-30 北大方正集团有限公司 Information collecting method and information collecting device
CN106341541B (en) * 2016-09-20 2019-06-21 腾讯科技(北京)有限公司 List treating method and apparatus
CN108090072A (en) * 2016-11-22 2018-05-29 上海看榜信息科技有限公司 A kind of minute grade monitoring system for graph text information
CN107257314A (en) * 2017-06-05 2017-10-17 成都知道创宇信息技术有限公司 A kind of message statistics analysis method based on wechat group
CN107948052A (en) * 2017-11-14 2018-04-20 福建中金在线信息科技有限公司 Information crawler method, apparatus, electronic equipment and system
CN110188257B (en) * 2019-04-16 2021-12-31 国家计算机网络与信息安全管理中心 Mobile application data acquisition method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7197491B1 (en) * 1999-09-21 2007-03-27 International Business Machines Corporation Architecture and implementation of a dynamic RMI server configuration hierarchy to support federated search and update across heterogeneous datastores
CN104199953A (en) * 2014-09-15 2014-12-10 浪潮软件集团有限公司 Method for crawling public account information of mobile phone client
CN104639605A (en) * 2014-12-18 2015-05-20 张磊 System, device and method for automatically adding friends for social communication software
CN104794633A (en) * 2015-04-09 2015-07-22 张磊 System, device and method for automatically adding friends to socializing communication software

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7197491B1 (en) * 1999-09-21 2007-03-27 International Business Machines Corporation Architecture and implementation of a dynamic RMI server configuration hierarchy to support federated search and update across heterogeneous datastores
CN104199953A (en) * 2014-09-15 2014-12-10 浪潮软件集团有限公司 Method for crawling public account information of mobile phone client
CN104639605A (en) * 2014-12-18 2015-05-20 张磊 System, device and method for automatically adding friends for social communication software
CN104794633A (en) * 2015-04-09 2015-07-22 张磊 System, device and method for automatically adding friends to socializing communication software

Also Published As

Publication number Publication date
CN105577528A (en) 2016-05-11

Similar Documents

Publication Publication Date Title
CN105577528B (en) A kind of wechat public platform collecting method and device based on virtual machine
CN105404584B (en) LPC static code inspection method, device and system
CN105069355B (en) The static detection method and device of webshell deformations
CN105094889B (en) A kind of application plug loading method and device
CN107341399B (en) Method and device for evaluating security of code file
CN105956180B (en) A kind of filtering sensitive words method
CN106844685B (en) Method, device and server for identifying website
CN106796637A (en) Analytical equipment, analysis method and analysis program
CN101853300A (en) Method and system for identifying and evaluating video downloading service website
CN104601672B (en) The method and apparatus of network resource sharing based on different application client
CN107633433B (en) Advertisement auditing method and device
CN108416034B (en) Information acquisition system based on financial heterogeneous big data and control method thereof
CN103593613A (en) Method, terminal, server and system for computer virus detection
CN108985064A (en) A kind of method and device identifying malice document
CN113038153B (en) Financial live broadcast violation detection method, device, equipment and readable storage medium
CN102984161A (en) Identification method and device for reliable website
CN110020161B (en) Data processing method, log processing method and terminal
CN111447224A (en) Web vulnerability scanning method and vulnerability scanner
KR20190058141A (en) Method for generating data extracted from document and apparatus thereof
CN104080058A (en) Information processing method and device
CN114528457A (en) Web fingerprint detection method and related equipment
CN105429865A (en) WeChat public number data collection method and device based on browser
CN104317847A (en) Method and system for identifying languages in network text information
CN103248513A (en) Network information data collection method and system based on Office suite
CN113568626A (en) Dynamic packaging method, application package starting method, device and electronic equipment

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant