CN105577528A - Wechat official account data collection method and device based on virtual machine - Google Patents

WeChat official account data collection method and device based on virtual machine

Info

Publication number
CN105577528A
Authority
CN
China
Prior art keywords
micro
public number
letter
virtual machine
letter public
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201511013666.1A
Other languages
Chinese (zh)
Other versions
CN105577528B (en)
Inventor
陈志群
李晓亮
马瑞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Zhonghong On-Line Co Ltd
Original Assignee
Shenzhen Zhonghong On-Line Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Zhonghong On-Line Co Ltd
Priority to CN201511013666.1A
Publication of CN105577528A
Application granted
Publication of CN105577528B
Legal status: Active
Anticipated expiration

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L51/00 User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
    • H04L51/21 Monitoring or handling of messages
    • H04L51/214 Monitoring or handling of messages using selective forwarding
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00 Arrangements for program control, e.g. control units
    • G06F9/06 Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44 Arrangements for executing specific programs
    • G06F9/455 Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45504 Abstract machines for programme code execution, e.g. Java virtual machine [JVM], interpreters, emulators
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L51/00 User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
    • H04L51/52 User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail for supporting social networking services

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Information Transfer Between Computers (AREA)
  • Computer And Data Communications (AREA)

Abstract

The invention relates to the field of information collection, and specifically to a WeChat official account data collection method and device based on a virtual machine. While the virtual machine exchanges data with the Internet, the virtual machine and Quick Macro simulate user operations, and the WeChat official account interaction data packets exchanged between the virtual machine and the Internet are monitored and downloaded. Logging in to WeChat on the virtual machine has the same effect as logging in from a mobile phone. After login, the logged-in state is kept without losing the connection; if the connection is lost under special circumstances, login is restored automatically once normal network access recovers, so the WeChat login is stable. The set of collection targets grows automatically: official accounts are followed automatically, which removes the tedium of following them manually and solves the slow growth of the number of manually followed accounts. Access is not restricted: in the logged-in state, obtaining official account data through simulated clicks is not limited, which ensures that data collection is complete. Timeliness is good: because operations are simulated, the newest data can be requested actively and obtained promptly.

Description

A WeChat official account data collection method and device based on a virtual machine
Technical field
The present invention relates to the field of information collection, and specifically to a WeChat official account data collection method and device based on a virtual machine.
Background technology
Articles pushed by WeChat official accounts can be viewed in two ways. The first is the WeChat search of a search engine, but the search results are cluttered and are subject to restrictions imposed by the service provider. The second is to follow an official account in order to receive the articles it pushes, but the number of official accounts that each WeChat account can follow is limited.
In a typical WeChat collection method, search-engine requests are simulated to obtain the WeChat articles returned by a keyword search. The drawback of this method is that the request frequency is strictly limited: a verification code must be entered before requests can continue, the program cannot recognize the verification code, and collection is interrupted. Information collection is therefore inefficient and the collected information is cluttered.
Summary of the invention
To overcome the above defects, the object of the present invention is to provide a WeChat official account data collection method and device based on a virtual machine.
The object of the present invention is achieved through the following technical solutions:
A WeChat official account data collection method based on a virtual machine according to the present invention comprises the following steps:
collecting the WeChat official accounts that need to be followed;
logging in to more than one WeChat account on a virtual machine simultaneously, the virtual machine using QMacro-simulated operations to add the collected official accounts to the task queue of a WeChat account and follow them;
the virtual machine using QMacro-simulated operations to click a WeChat official account and actively request its data, and, while the virtual machine exchanges data with the Internet, monitoring and downloading the WeChat official account interaction data packets exchanged between the virtual machine and the Internet;
scanning the WeChat official account interaction data packets on the virtual machine and extracting the required WeChat-related information;
parsing WeChat official account index information from the extracted WeChat-related information, and accessing this index information to obtain the specific WeChat information of the official account.
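The follow step above can be illustrated with a small task-queue sketch in Python. It is not the patented implementation: the per-login cap value, the names LoginAccount and assign_follow_tasks, and the round-robin policy are all assumptions made for illustration (the text only states that the number of official accounts one WeChat account can follow is limited). The QMacro-simulated clicks that actually perform the follow are not shown.

```python
from collections import deque
from dataclasses import dataclass, field

# Hypothetical cap; the source only says the follow count per account is limited.
MAX_FOLLOWS_PER_LOGIN = 3


@dataclass
class LoginAccount:
    """A WeChat account logged in on the virtual machine (illustrative model)."""
    name: str
    follow_queue: deque = field(default_factory=deque)  # pending follow tasks
    assigned: int = 0  # follow tasks handed to this login so far


def assign_follow_tasks(official_accounts, logins, cap=MAX_FOLLOWS_PER_LOGIN):
    """Distribute official accounts round-robin over logins, respecting the cap.

    Returns the accounts that could not be assigned because every login is full.
    """
    unassigned = []
    seen = set()
    cursor = 0
    for acct in official_accounts:
        if acct in seen:  # skip duplicates in the input
            continue
        seen.add(acct)
        # Try each login at most once, starting from the round-robin cursor.
        for step in range(len(logins)):
            login = logins[(cursor + step) % len(logins)]
            if login.assigned < cap:
                login.follow_queue.append(acct)
                login.assigned += 1
                cursor = (cursor + step + 1) % len(logins)
                break
        else:
            unassigned.append(acct)
    return unassigned
```

Spreading the collected accounts over several logins is one way to work around the per-account follow limit mentioned in the background section; accounts returned as unassigned would need an additional login.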
Further, the method also comprises the following step:
scanning the WeChat official account interaction data packets on the virtual machine to extract other related WeChat official accounts, and adding these related official accounts to the task queue of a WeChat account to be followed.
Further, collecting the WeChat official accounts that need to be followed comprises: extracting WeChat IDs from Internet text, then auditing, manually reviewing, classifying and sorting these WeChat IDs to filter out WeChat official accounts, which are stored in a WeChat official account database.
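The extraction of WeChat IDs from Internet text might look like the following Python sketch. The ID shape (a leading letter followed by letters, digits, underscores or hyphens, 6 to 20 characters in total) and the cue words that precede an ID are assumptions for illustration, since the source does not specify either. The blocklist filter is only a stand-in for the auditing and manual review described above.

```python
import re

# Assumed ID shape and assumed cue words; neither is specified by the source.
WECHAT_ID_PATTERN = re.compile(
    r"(?:微信号|微信公众号|WeChat ID)\s*[:：]?\s*"
    r"([A-Za-z][A-Za-z0-9_-]{5,19})(?![A-Za-z0-9_-])"
)


def extract_candidate_ids(text):
    """Return candidate WeChat IDs found in text, de-duplicated, in first-seen order."""
    seen = {}
    for match in WECHAT_ID_PATTERN.finditer(text):
        seen.setdefault(match.group(1), None)
    return list(seen)


def filter_candidates(candidates, blocklist=()):
    """Drop candidates that appear in a reviewer-maintained blocklist (case-insensitive)."""
    blocked = {b.lower() for b in blocklist}
    return [c for c in candidates if c.lower() not in blocked]
```

The candidates that survive filtering would then be stored in the official account database and fed to the follow task queue.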
Further, the WeChat official account index information is a WeChat article link address or a WeChat official account.
Further, the specific WeChat information of the official account comprises the title, author, publication time and article content.
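As an illustration of how the four fields named above could be formatted and stored, here is a minimal Python sketch using the standard sqlite3 module. The table layout, the use of the article link as the primary key, the ISO 8601 time format and the names ArticleRecord, open_store and save_article are assumptions; the source does not name a database or schema.

```python
import sqlite3
from dataclasses import dataclass, astuple


@dataclass(frozen=True)
class ArticleRecord:
    url: str           # article link, used here as the unique key (assumption)
    title: str
    author: str
    published_at: str  # ISO 8601 string; the source does not fix a format
    content: str


def open_store(path=":memory:"):
    """Open (or create) the article store and make sure the table exists."""
    conn = sqlite3.connect(path)
    conn.execute(
        """CREATE TABLE IF NOT EXISTS articles (
               url TEXT PRIMARY KEY,
               title TEXT NOT NULL,
               author TEXT,
               published_at TEXT,
               content TEXT
           )"""
    )
    return conn


def save_article(conn, record):
    """Insert the record; return True if it was new, False if the URL already existed."""
    cur = conn.execute(
        "INSERT OR IGNORE INTO articles VALUES (?, ?, ?, ?, ?)", astuple(record)
    )
    conn.commit()
    return cur.rowcount == 1
```

Keying on the link keeps repeated collection runs from storing the same article twice.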
Further, the virtual machine is an Android virtual machine.
A WeChat official account data collection device based on a virtual machine, the device being in data connection with the virtual machine and comprising the following units connected in sequence:
an account collection unit that collects the WeChat official accounts that need to be followed;
a data monitoring and download unit that, while the virtual machine exchanges data with the Internet, monitors and downloads the WeChat official account interaction data packets exchanged between the virtual machine and the Internet;
a data scanning and extraction unit that scans the WeChat official account interaction data packets on the virtual machine and extracts the required WeChat-related information;
a parsing and access acquisition unit that parses WeChat official account index information from the extracted WeChat-related information and accesses this index information to obtain the specific WeChat information of the official account.
Further, the device also comprises an account extraction unit, connected to the data monitoring and download unit, that scans the WeChat official account interaction data packets on the virtual machine to extract other related WeChat official accounts.
In the WeChat official account data collection method and device based on a virtual machine provided by the present invention, the virtual machine and QMacro-simulated operations are used so that, while the virtual machine exchanges data with the Internet, the WeChat official account interaction data packets exchanged between them are monitored and downloaded. After the virtual machine logs in to WeChat, the effect is the same as on a mobile phone: the logged-in state is kept without dropping offline, and if the connection is lost under special circumstances, login is restored automatically when normal network access recovers, so the session is more stable. Collection targets grow automatically: official accounts are followed automatically, which removes the tedium of following them manually and solves the slow growth of the manually followed count. Access is unrestricted: in the logged-in state, obtaining official account data through simulated clicks is not limited, which ensures comprehensive data collection. Timeliness is good: with simulated operations, the latest data can be actively obtained in time.
Brief description of the drawings
For ease of explanation, the present invention is described in detail through the following preferred embodiment and accompanying drawings.
Fig. 1 is a flow chart of the steps of the WeChat official account data collection method based on a virtual machine according to the present invention;
Fig. 2 is a module block diagram of the WeChat official account data collection device based on a virtual machine according to the present invention.
Embodiment
In order to make the object, technical solution and advantages of the present invention clearer, the present invention is further described below with reference to the drawings and embodiments. It should be understood that the specific embodiments described herein are only intended to explain the present invention and are not intended to limit it.
Referring to Fig. 1, a WeChat official account data collection method based on a virtual machine according to the present invention comprises the following steps:
The WeChat official accounts that need to be followed are collected. Specifically, WeChat IDs are extracted from Internet text; these IDs are audited, manually reviewed, classified and sorted to filter out WeChat official accounts, which are stored in a WeChat official account database;
An Android virtual machine is built on a server and the Android WeChat client app is installed. The WeChat accounts to be used for collection are configured, and their usernames and passwords are used to log in to the WeChat client on the Android virtual machine; that is, more than one WeChat account is logged in on the virtual machine simultaneously. The virtual machine then uses QMacro-simulated operations to add the collected official accounts to the task queue of a WeChat account and follow them. Logging in a WeChat account opens the entry point for WeChat collection, but the collection targets are the article links of WeChat official accounts, so before article links can be collected, the official accounts to be collected must be followed. The collected official accounts are added to the task queue, and the virtual machine uses QMacro to perform the preset key operations, automatically adding the official accounts that need to be followed to the follow list. The WeChat account can then obtain the article information of the official accounts it follows;
The virtual machine uses QMacro-simulated operations to click a WeChat official account and actively request its data, and while the virtual machine exchanges data with the Internet, the WeChat official account interaction data packets exchanged between the virtual machine and the Internet are monitored and downloaded. That is, QMacro simulates clicking the official account to actively request WeChat data, including the historical data of the official account; during the interaction between the virtual machine and the Internet, the data packets passing between them are monitored and saved under a specified path;
The data packets are parsed: the WeChat official account interaction data packets on the virtual machine are scanned, the required WeChat-related information is extracted, and this information is saved to a specified folder;
The specific WeChat information is downloaded: WeChat official account index information is parsed from the extracted WeChat-related information, and this index information is accessed to obtain the specific WeChat information of the official account. The index information may be a WeChat article link address or a WeChat official account; the specific WeChat information comprises the title, author, publication time, article content and similar information, which is formatted and stored in a database.
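The scanning step in the embodiment above, where saved data packets are searched for index information such as article links, can be sketched in Python as follows. This is a sketch under stated assumptions: the source does not describe the packet file format or the link shape, so the code treats each saved packet as raw bytes and assumes article links live on the mp.weixin.qq.com host. The name scan_packet_dir is hypothetical.

```python
import re
from pathlib import Path

# Assumed link shape (article URLs on mp.weixin.qq.com); not given by the source.
ARTICLE_LINK = re.compile(
    rb"https?://mp\.weixin\.qq\.com/s[?/][A-Za-z0-9_\-=&%#.]+"
)


def scan_packet_dir(packet_dir):
    """Scan every file under packet_dir and return the unique article links found.

    Files are read as raw bytes, so binary packet dumps do not raise decode errors.
    Links are returned in first-seen order, visiting files in sorted path order.
    """
    links = []
    seen = set()
    for path in sorted(Path(packet_dir).rglob("*")):
        if not path.is_file():
            continue
        data = path.read_bytes()
        for match in ARTICLE_LINK.finditer(data):
            link = match.group(0).decode("ascii")
            if link not in seen:
                seen.add(link)
                links.append(link)
    return links
```

Each returned link would then be fetched to obtain the title, author, publication time and article content for storage.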
The WeChat official account data collection method further comprises the following step:
The WeChat official account interaction data packets on the virtual machine are scanned to extract other related WeChat official accounts, and these are added to the task queue of a WeChat account to be followed. That is, by monitoring and scanning the data packets, other WeChat official accounts are identified and extracted and are automatically added to the follow list of the WeChat account, which expands the data sources.
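Because newly discovered accounts are fed back into the same task queue, the queue needs to avoid enqueuing an account twice. The Python sketch below shows one way to do this; the class name FollowQueue and the exact-match de-duplication rule are assumptions for illustration, not details from the source.

```python
from collections import deque


class FollowQueue:
    """FIFO of official accounts to follow that never enqueues the same account twice."""

    def __init__(self, initial=()):
        self._pending = deque()
        self._known = set()  # every account ever enqueued, followed or not
        self.add_many(initial)

    def add_many(self, accounts):
        """Enqueue unseen accounts; return how many were actually added."""
        added = 0
        for acct in accounts:
            key = acct.strip()
            if key and key not in self._known:
                self._known.add(key)
                self._pending.append(key)
                added += 1
        return added

    def next_account(self):
        """Pop the next account to follow, or None when the queue is empty."""
        return self._pending.popleft() if self._pending else None

    def __len__(self):
        return len(self._pending)
```

Accounts extracted from scanned packets can be passed to add_many after every scan; only accounts that were never seen before extend the queue, so the data source grows without repeating follow operations.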
Referring specifically to Fig. 2, a WeChat official account data collection device based on a virtual machine, which applies the above WeChat official account data collection method based on a virtual machine, is in data connection with the virtual machine and comprises the following units connected in sequence:
an account collection unit that collects the WeChat official accounts that need to be followed;
a data monitoring and download unit that, while the virtual machine exchanges data with the Internet, monitors and downloads the WeChat official account interaction data packets exchanged between the virtual machine and the Internet;
a data scanning and extraction unit that scans the WeChat official account interaction data packets on the virtual machine and extracts the required WeChat-related information;
a parsing and access acquisition unit that parses WeChat official account index information from the extracted WeChat-related information and accesses this index information to obtain the specific WeChat information of the official account.
Further, the device also comprises an account extraction unit, connected to the data monitoring and download unit, that scans the WeChat official account interaction data packets on the virtual machine to extract other related WeChat official accounts.
In the WeChat official account data collection method and device based on a virtual machine provided by the present invention, the virtual machine and QMacro-simulated operations are used so that, while the virtual machine exchanges data with the Internet, the WeChat official account interaction data packets exchanged between them are monitored and downloaded. After the virtual machine logs in to WeChat, the effect is the same as on a mobile phone: the logged-in state is kept without dropping offline, and if the connection is lost under special circumstances, login is restored automatically when normal network access recovers, so the session is more stable. Collection targets grow automatically: official accounts are followed automatically, which removes the tedium of following them manually and solves the slow growth of the manually followed count. Access is unrestricted: in the logged-in state, obtaining official account data through simulated clicks is not limited, which ensures comprehensive data collection. Timeliness is good: with simulated operations, the latest data can be actively obtained in time.
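The automatic re-login behaviour described above can be sketched as a polling loop in Python. The source does not say how the connection state is detected or how login is triggered, so the three callables (is_online, is_logged_in, login) are hypothetical hooks supplied by the caller, and the fixed number of polling rounds exists only to keep the sketch finite.

```python
import time


def keep_logged_in(is_online, is_logged_in, login, checks, interval=0.0, sleep=time.sleep):
    """Poll `checks` times; log in again whenever the network is up but the session is not.

    Returns the number of re-logins performed.
    """
    relogins = 0
    for _ in range(checks):
        # Only attempt a login once normal network access has recovered.
        if is_online() and not is_logged_in():
            login()
            relogins += 1
        sleep(interval)
    return relogins
```

In a real deployment the login hook would replay the simulated login operations on the virtual machine; here it is left abstract.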
The foregoing is only a preferred embodiment of the present invention and is not intended to limit it. Any modification, equivalent replacement or improvement made within the spirit and principles of the present invention shall fall within its scope of protection.

Claims (8)

1. A WeChat official account data collection method based on a virtual machine, characterized in that it comprises the following steps:
collecting the WeChat official accounts that need to be followed;
logging in to more than one WeChat account on a virtual machine, the virtual machine using QMacro-simulated operations to add the collected official accounts to the task queue of a WeChat account and follow them;
the virtual machine using QMacro-simulated operations to click a WeChat official account and actively request its data, and, while the virtual machine exchanges data with the Internet, monitoring and downloading the WeChat official account interaction data packets exchanged between the virtual machine and the Internet;
scanning the WeChat official account interaction data packets on the virtual machine and extracting the required WeChat-related information;
parsing WeChat official account index information from the extracted WeChat-related information, and accessing this index information to obtain the specific WeChat information of the official account.
2. The WeChat official account data collection method based on a virtual machine according to claim 1, characterized in that it further comprises the following step:
scanning the WeChat official account interaction data packets on the virtual machine to extract other related WeChat official accounts, and adding these related official accounts to the task queue of a WeChat account to be followed.
3. The WeChat official account data collection method based on a virtual machine according to claim 1 or 2, characterized in that collecting the WeChat official accounts that need to be followed comprises: extracting WeChat IDs from Internet text, then auditing, manually reviewing, classifying and sorting these WeChat IDs to filter out WeChat official accounts, which are stored in a WeChat official account database.
4. The WeChat official account data collection method based on a virtual machine according to claim 1, characterized in that the WeChat official account index information is a WeChat article link address or a WeChat official account.
5. The WeChat official account data collection method based on a virtual machine according to claim 1, characterized in that the specific WeChat information of the official account comprises the title, author, publication time and article content.
6. The WeChat official account data collection method based on a virtual machine according to claim 1, characterized in that the virtual machine is an Android virtual machine.
7. A WeChat official account data collection device based on a virtual machine, characterized in that the device is in data connection with the virtual machine and comprises the following units connected in sequence:
an account collection unit that collects the WeChat official accounts that need to be followed;
a data monitoring and download unit that, while the virtual machine exchanges data with the Internet, monitors and downloads the WeChat official account interaction data packets exchanged between the virtual machine and the Internet;
a data scanning and extraction unit that scans the WeChat official account interaction data packets on the virtual machine and extracts the required WeChat-related information;
a parsing and access acquisition unit that parses WeChat official account index information from the extracted WeChat-related information and accesses this index information to obtain the specific WeChat information of the official account.
8. The WeChat official account data collection device based on a virtual machine according to claim 7, characterized in that the device further comprises an account extraction unit, connected to the data monitoring and download unit, that scans the WeChat official account interaction data packets on the virtual machine to extract other related WeChat official accounts.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201511013666.1A CN105577528B (en) 2015-12-31 2015-12-31 A kind of wechat public platform collecting method and device based on virtual machine

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201511013666.1A CN105577528B (en) 2015-12-31 2015-12-31 A kind of wechat public platform collecting method and device based on virtual machine

Publications (2)

Publication Number Publication Date
CN105577528A true CN105577528A (en) 2016-05-11
CN105577528B CN105577528B (en) 2019-01-15

Family

ID=55887217

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201511013666.1A Active CN105577528B (en) 2015-12-31 2015-12-31 A kind of wechat public platform collecting method and device based on virtual machine

Country Status (1)

Country Link
CN (1) CN105577528B (en)

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7197491B1 (en) * 1999-09-21 2007-03-27 International Business Machines Corporation Architecture and implementation of a dynamic RMI server configuration hierarchy to support federated search and update across heterogeneous datastores
CN104199953A (en) * 2014-09-15 2014-12-10 浪潮软件集团有限公司 Method for crawling public account information of mobile phone client
CN104639605A (en) * 2014-12-18 2015-05-20 张磊 System, device and method for automatically adding friends for social communication software
CN104794633A (en) * 2015-04-09 2015-07-22 张磊 System, device and method for automatically adding friends to socializing communication software

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106059851A (en) * 2016-05-20 2016-10-26 天津海量信息技术股份有限公司 App data collection method based on cooperative work of mobile end and service end
CN107644021A (en) * 2016-07-20 2018-01-30 北大方正集团有限公司 Information collecting method and information collecting device
CN106341541A (en) * 2016-09-20 2017-01-18 腾讯科技(北京)有限公司 List processing method and device
CN106341541B (en) * 2016-09-20 2019-06-21 腾讯科技(北京)有限公司 List treating method and apparatus
CN108090072A (en) * 2016-11-22 2018-05-29 上海看榜信息科技有限公司 A kind of minute grade monitoring system for graph text information
CN107257314A (en) * 2017-06-05 2017-10-17 成都知道创宇信息技术有限公司 A kind of message statistics analysis method based on wechat group
CN107948052A (en) * 2017-11-14 2018-04-20 福建中金在线信息科技有限公司 Information crawler method, apparatus, electronic equipment and system
CN110188257A (en) * 2019-04-16 2019-08-30 国家计算机网络与信息安全管理中心 A kind of mobile application collecting method and device
CN110188257B (en) * 2019-04-16 2021-12-31 国家计算机网络与信息安全管理中心 Mobile application data acquisition method and device
CN112737925A (en) * 2020-12-29 2021-04-30 深圳前海微众银行股份有限公司 Data acquisition method, device and system

Also Published As

Publication number Publication date
CN105577528B (en) 2019-01-15

Similar Documents

Publication Publication Date Title
CN105577528A (en) Wechat official account data collection method and device based on virtual machine
CN103235913B (en) A kind of for identifying, intercept the system of bundled software, Apparatus and method for
CN107895009B (en) Distributed internet data acquisition method and system
CN105930363B (en) HTML5 webpage-based user behavior analysis method and device
CN101605074B (en) Method and system for monitoring Trojan Horse based on network communication behavior characteristic
Dai et al. Networkprofiler: Towards automatic fingerprinting of android apps
CN102662703B (en) A kind of application plug loading method and device
CN107241296B (en) Webshell detection method and device
CN103605738A (en) Webpage access data statistical method and webpage access data statistical device
CN102984161B (en) The recognition methods of a kind of reliable website and device
CN111177779B (en) Database auditing method, device, electronic equipment and computer storage medium
CN104506484A (en) Proprietary protocol analysis and identification method
CN105162676B (en) A kind of wechat data capture method and system
CN108737549A (en) A kind of log analysis method and device of big data quantity
CN106055608A (en) Method and apparatus for automatically collecting and analyzing switch logs
CN103914655A (en) Downloaded file security detection method and device
CN103593613A (en) Method, terminal, server and system for computer virus detection
CN113259467B (en) Webpage asset fingerprint tag identification and discovery method based on big data
US10387370B2 (en) Collecting test results in different formats for storage
CN105376077A (en) Network behavior information processing method, log transmitting method, network behavior information processing device and system
CN105337753A (en) Method and device for monitoring Internet real quality
CN104468790A (en) Method for processing cookie data and client side
CN111740868A (en) Alarm data processing method and device and storage medium
CN105429865A (en) WeChat public number data collection method and device based on browser
CN102984162B (en) The recognition methods of credible website and gathering system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant