CN109710140A - A kind of smart phone social application automatic data collection method - Google Patents

A kind of smart phone social application automatic data collection method Download PDF

Info

Publication number
CN109710140A
CN109710140A CN201811591933.7A CN201811591933A CN109710140A CN 109710140 A CN109710140 A CN 109710140A CN 201811591933 A CN201811591933 A CN 201811591933A CN 109710140 A CN109710140 A CN 109710140A
Authority
CN
China
Prior art keywords
phone
data acquisition
screen
smart phone
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811591933.7A
Other languages
Chinese (zh)
Inventor
庞文俊
伊晓强
汤泰鼎
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Net (hefei) Technology Co Ltd
Original Assignee
Net (hefei) Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Net (hefei) Technology Co Ltd filed Critical Net (hefei) Technology Co Ltd
Priority to CN201811591933.7A priority Critical patent/CN109710140A/en
Publication of CN109710140A publication Critical patent/CN109710140A/en
Pending legal-status Critical Current

Links

Landscapes

  • Telephonic Communication Services (AREA)

Abstract

The invention discloses a kind of smart phone social application automatic data collection method, this method includes the following steps, step 1: initiating data acquisition command;Step 2: data acquisition session is executed;Step 3: operation mission script;Step 4: application state is identified by mission script;It identifies the concrete principle of cell phone application state are as follows: first with the development interface of the Android phone SDK extraction interface data provided, extracts UI feature;UI knowledge base is established later, realizes the automatic identification to mobile phone UI state;Method in through the invention can accomplish to be automatically brought into operation according to App of the script to mobile phone, while can identify the state of application, can obtain data from screen and carry out next step operation, and carry out continual acquisition information needed using circulate operation;The basic API and tool that the present invention utilizes Android platform to provide, are packaged and improve, and realize and directly obtain information from mobile phone screen.

Description

A kind of smart phone social application automatic data collection method
Technical field
The invention belongs to data collecting fields, are related to a kind of mobile phone application collecting method, specifically a kind of intelligence hand Machine social application automatic data collection method.
Background technique
Data acquisition technology is widely used in search engine, public sentiment monitoring, open source information etc. and needs to obtain from open network It takes in the system of mass data.With the depth development of smart phone and social application, the main battle ground of data acquisition is also gradual It is transferred to above mobile phone social application from web page.Traditional data acquisition technology is mainly for web page, how from social activity It is a current problem that App, which efficiently and effectively acquires data, is shown:
(1) smart phone social application communication protocol does not open, and no image of Buddha web page acquires data from protocol layer like that;
(2) smart phone social application is many kinds of, and function is complicated, and no source code is not available the means such as code injection It is broken through;
(3) smart phone social application message form multiplicity, such as text, voice, video etc. needs comprehensively to acquire these Information.
Summary of the invention
The present invention proposes a kind of completely new approach, and the basic API and tool that it utilizes Android platform to provide are packaged and change Into realization directly obtains information from mobile phone screen, and then realizes the automatic collection of smart phone social application information;Of the invention It is designed to provide a kind of smart phone social application automatic data collection method.
The purpose of the present invention can be achieved through the following technical solutions:
A kind of smart phone social application automatic data collection method, this method include the following steps:
Step 1: data acquisition command is initiated;
Step 2: data acquisition session is executed;
Step 3: operation mission script;
Step 4: application state is identified by mission script;It identifies the concrete principle of cell phone application state are as follows:
S1: first with the development interface of the Android phone SDK extraction interface data provided, UI feature is extracted;
S2: establishing UI knowledge base later, realizes the automatic identification to mobile phone UI state;
Step 5: data acquisition directly is carried out from screen;The basic step of data acquisition are as follows:
S1: the screen operator tool ADB provided by Android phone platform tools grasps mobile phone App substantially Make;Basic operation is carried out to cell phone application mainly to realize using UIAutomatorAPI;
S2: carrying out data acquisition later, and data acquiring mode is to be provided by Android phone platform api and tool Word Input, the basic operation of screenshotss and record screen;It is encapsulated as to can be realized text in social App, the letter of audio and video Breath extracts;
Step 6: step 2-step 5 is repeated, until receiving stopping data acquisition command;
Step 7: tenth skill.
Further, the specific steps of UI knowledge base are established in the step 4 S2 are as follows:
Step 1: screen message is got first, and is saved it in local computing;
Step 2: parsing ui.xml file operation is carried out to all screen messages later, UI feature is extracted, establishes UI and know Know library;
Step 3: new UI is identified finally by UI knowledge base.
Further, the UI feature in the step 2 includes xml framework and title.
Further, the basic operation in the step 5 S1 may include entering some specified chat group, exits some and refers to Fixed chat group, is rolled to up-to-date information beginning.
Beneficial effects of the present invention:
Method in through the invention can accomplish to be automatically brought into operation according to App of the script to mobile phone, while can identify The state of application can obtain data from screen and carry out next step operation, and carry out continual obtain using circulate operation Take information needed;The basic API and tool that the present invention utilizes Android platform to provide, are packaged and improve, and realize directly from hand Machine screen obtains information, and then realizes the automatic collection of smart phone social application information;
Allow the invention to that smart phone social application communication protocol is avoided not open, no image of Buddha web page like that from Protocol layer acquires this problem of data, while it is many kinds of to also avoid smart phone social application, and function is complicated, passive generation Code, is not available the means such as code injection and is broken through;Also the social activity that can comprehensively collect smart phone multiplicity form is answered With information, such as text, voice, video etc..
Detailed description of the invention
In order to facilitate the understanding of those skilled in the art, the present invention will be further described below with reference to the drawings.
Fig. 1 is flow chart of the invention;
Fig. 2 is wechat group display interface state recognition figure of the invention.
Specific embodiment
As shown in Figure 1, a kind of smart phone social application automatic data collection method, this method include the following steps:
Step 1: data acquisition command is initiated;
Step 2: data acquisition session is executed;
Step 3: operation mission script;
Step 4: application state is identified by mission script;
As shown in Fig. 2, Android phone SDK provides the development interface for extracting interface data;By taking wechat as an example, the left side It is wechat group display interface, the information of the state at the interface is listed on the right;Information includes interface assembly tree and each node Details;
It identifies the extraction interface that the concrete principle of cell phone application state is as follows, provides first with Android phone SDK The development interface of data extracts UI feature;UI knowledge base is established later, realizes the automatic identification to mobile phone UI state;
Wherein, the specific steps of UI knowledge base are established are as follows: get screen message first, and save it in local computing In;Parsing ui.xml file operation is carried out to all screen messages later, UI feature is extracted, establishes UI knowledge base;UI feature It mainly include xml framework and title;New UI is identified finally by UI knowledge base;
Step 5: data acquisition directly is carried out from screen;
By the screen operator tool ADB that Android phone platform tools provide, can be realized to the basic of mobile phone App Operation, such as click, sliding, roll;It is realized at this time using UIAutomatorAPI and enters some chat group, release some chat Group, is rolled to some routine operations such as up-to-date information beginning;
Data acquisition is carried out later, and data acquiring mode is the text provided by Android phone platform api and tool Word extracts, the basic operation of screenshotss and record screen;It is encapsulated as to can be realized text in social App, the information of audio and video It extracts;Wherein Word Input can use Word Input API:
com.android.uiautomator.core.UiObject.gettext();
Screenshotss order:
adb shell screencap-p/sdcard/screen.png
Recorded screen command:
adb shell screenrecord--time-limit 10/sdcard/demo.mp4
Step 6: step 2-step 5 is repeated, until receiving stopping data acquisition command;
Step 7: tenth skill.
Method in through the invention can accomplish to be automatically brought into operation according to App of the script to mobile phone, while can identify The state of application can obtain data from screen and carry out next step operation, and carry out continual obtain using circulate operation Take information needed;The basic API and tool that the present invention utilizes Android platform to provide, are packaged and improve, and realize directly from hand Machine screen obtains information, and then realizes the automatic collection of smart phone social application information;
Allow the invention to that smart phone social application communication protocol is avoided not open, no image of Buddha web page like that from Protocol layer acquires this problem of data, while it is many kinds of to also avoid smart phone social application, and function is complicated, passive generation Code, is not available the means such as code injection and is broken through;Also the social activity that can comprehensively collect smart phone multiplicity form is answered With information, such as text, voice, video etc..
Above content is only to structure of the invention example and explanation, affiliated those skilled in the art couple Described specific embodiment does various modifications or additions or is substituted in a similar manner, without departing from invention Structure or beyond the scope defined by this claim, is within the scope of protection of the invention.

Claims (4)

1. a kind of smart phone social application automatic data collection method, which is characterized in that this method includes the following steps:
Step 1: data acquisition command is initiated;
Step 2: data acquisition session is executed;
Step 3: operation mission script;
Step 4: application state is identified by mission script;It identifies the concrete principle of cell phone application state are as follows:
S1: first with the development interface of the Android phone SDK extraction interface data provided, UI feature is extracted;
S2: establishing UI knowledge base later, realizes the automatic identification to mobile phone UI state;
Step 5: data acquisition directly is carried out from screen;The basic step of data acquisition are as follows:
S1: the screen operator tool ADB provided by Android phone platform tools, basic operation is carried out to mobile phone App;It is right Cell phone application is carried out basic operation and is mainly realized using UIAutomatorAPI;
S2: carrying out data acquisition later, and data acquiring mode is the text provided by Android phone platform api and tool It extracts, the basic operation of screenshotss and record screen;It is encapsulated as to can be realized text in social App, the information of audio and video mentions It takes;
Step 6: step 2-step 5 is repeated, until receiving stopping data acquisition command;
Step 7: tenth skill.
2. a kind of smart phone social application automatic data collection method according to claim 1, which is characterized in that described The specific steps of UI knowledge base are established in step 4 S2 are as follows:
Step 1: screen message is got first, and is saved it in local computing;
Step 2: parsing ui.xml file operation is carried out to all screen messages later, UI feature is extracted, establishes UI knowledge Library;
Step 3: new UI is identified finally by UI knowledge base.
3. a kind of smart phone social application automatic data collection method according to claim 2, which is characterized in that described UI feature in step 2 includes xml framework and title.
4. a kind of smart phone social application automatic data collection method according to claim 1, which is characterized in that described Basic operation in step 5 S1 may include entering some specified chat group, exits some specified chat group, is rolled to newest letter Cease beginning.
CN201811591933.7A 2018-12-25 2018-12-25 A kind of smart phone social application automatic data collection method Pending CN109710140A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811591933.7A CN109710140A (en) 2018-12-25 2018-12-25 A kind of smart phone social application automatic data collection method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811591933.7A CN109710140A (en) 2018-12-25 2018-12-25 A kind of smart phone social application automatic data collection method

Publications (1)

Publication Number Publication Date
CN109710140A true CN109710140A (en) 2019-05-03

Family

ID=66257526

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811591933.7A Pending CN109710140A (en) 2018-12-25 2018-12-25 A kind of smart phone social application automatic data collection method

Country Status (1)

Country Link
CN (1) CN109710140A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110276281A (en) * 2019-06-10 2019-09-24 浙江工业大学 A kind of screenshotss picture and text identification extracting method and system towards mobile terminal
CN110865851A (en) * 2019-11-18 2020-03-06 中国民航信息网络股份有限公司 Automatic Android application data acquisition method and system
CN111638964A (en) * 2020-06-09 2020-09-08 武汉虹旭信息技术有限责任公司 Centralized internet data acquisition system and acquisition method
CN114443191A (en) * 2021-12-23 2022-05-06 厦门市美亚柏科信息股份有限公司 Method for rapidly extracting application data of Android equipment
CN115277202A (en) * 2022-07-28 2022-11-01 四川封面传媒科技有限责任公司 Automatic data acquisition system and method for android APP

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105430443A (en) * 2015-11-17 2016-03-23 Tcl集团股份有限公司 Method and system for acquiring user behavior data based on smart TV
CN106951520A (en) * 2017-03-02 2017-07-14 内蒙古大学 A kind of bus passenger trip data acquisition system and its application
CN107340954A (en) * 2017-07-03 2017-11-10 国家计算机网络与信息安全管理中心 A kind of information extracting method and device
CN108089967A (en) * 2017-12-12 2018-05-29 成都睿码科技有限责任公司 A kind of method for crawling Android mobile phone App data

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105430443A (en) * 2015-11-17 2016-03-23 Tcl集团股份有限公司 Method and system for acquiring user behavior data based on smart TV
CN106951520A (en) * 2017-03-02 2017-07-14 内蒙古大学 A kind of bus passenger trip data acquisition system and its application
CN107340954A (en) * 2017-07-03 2017-11-10 国家计算机网络与信息安全管理中心 A kind of information extracting method and device
CN108089967A (en) * 2017-12-12 2018-05-29 成都睿码科技有限责任公司 A kind of method for crawling Android mobile phone App data

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
黄威: "移动智能终端应用程序信息分析技术研究及实现", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110276281A (en) * 2019-06-10 2019-09-24 浙江工业大学 A kind of screenshotss picture and text identification extracting method and system towards mobile terminal
CN110865851A (en) * 2019-11-18 2020-03-06 中国民航信息网络股份有限公司 Automatic Android application data acquisition method and system
CN110865851B (en) * 2019-11-18 2023-12-01 中国民航信息网络股份有限公司 Automatic Android application data acquisition method and system
CN111638964A (en) * 2020-06-09 2020-09-08 武汉虹旭信息技术有限责任公司 Centralized internet data acquisition system and acquisition method
CN114443191A (en) * 2021-12-23 2022-05-06 厦门市美亚柏科信息股份有限公司 Method for rapidly extracting application data of Android equipment
CN115277202A (en) * 2022-07-28 2022-11-01 四川封面传媒科技有限责任公司 Automatic data acquisition system and method for android APP

Similar Documents

Publication Publication Date Title
CN109710140A (en) A kind of smart phone social application automatic data collection method
CN103984579B (en) More equipment rooms share the method for current application program real-time running state
CN109429522A (en) Voice interactive method, apparatus and system
US20170249934A1 (en) Electronic device and method for operating the same
CN104317787A (en) Instant communication terminal and information translation method and device thereof
CN102752371B (en) In client, realize method and the client of dodging screen
CN103365840A (en) Web-based screenshot taking method and device
CN103942054A (en) Data evidence obtaining system based on Android
CN108228770A (en) A kind of method and device of application file source inquiry
CN104468941A (en) Information display method and device
CN108228664B (en) Unstructured data processing method and device
CN111176627A (en) Device and method for separating front end from back end based on micro-service
CN106933811A (en) A kind of entry automatic generation method and device
CN102811288A (en) Method and device for recording call information
CN104539509B (en) The method and apparatus that notification channel starts broadcasting
CN103561147A (en) Address-book updating method and system of mobile equipment
CN103605514A (en) Front-end template processing method and device
CN104156430A (en) Device and method for fast extracting Android mobile phone data
CN106358068B (en) System and method for solving network live broadcast drawing by big data
CN107943515A (en) A kind of method and apparatus of cloud management platform management service orchestration template
CN102457498A (en) Method and device for switching instant messaging session
CN110445934A (en) Call-information processing method, system, terminal and readable storage medium storing program for executing
CN108833125B (en) Drawing method, system, computer equipment and storage medium for restoring voice speech path
CN113988954A (en) Financing product marketing method and device
CN112905464B (en) Application running environment data processing method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190503

RJ01 Rejection of invention patent application after publication