CN110704709B - Application data capturing method and device, storage medium and electronic equipment - Google Patents

Application data capturing method and device, storage medium and electronic equipment Download PDF

Info

Publication number
CN110704709B
CN110704709B CN201910796526.8A CN201910796526A CN110704709B CN 110704709 B CN110704709 B CN 110704709B CN 201910796526 A CN201910796526 A CN 201910796526A CN 110704709 B CN110704709 B CN 110704709B
Authority
CN
China
Prior art keywords
grabbing
application
data
processed
configuration data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910796526.8A
Other languages
Chinese (zh)
Other versions
CN110704709A (en
Inventor
潘仕江
柳超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Jindi Technology Co Ltd
Original Assignee
Beijing Jindi Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Jindi Technology Co Ltd filed Critical Beijing Jindi Technology Co Ltd
Priority to CN201910796526.8A priority Critical patent/CN110704709B/en
Publication of CN110704709A publication Critical patent/CN110704709A/en
Application granted granted Critical
Publication of CN110704709B publication Critical patent/CN110704709B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/60Software deployment
    • G06F8/61Installation

Abstract

The embodiment of the disclosure discloses an application data capturing method and device, a storage medium and electronic equipment, wherein the method comprises the following steps: acquiring grabbing configuration data corresponding to the installed application to be processed from a plurality of grabbing configuration data stored in a database; starting the application to be processed, executing at least one operation in the application to be processed, and obtaining at least one operation data; performing a grab operation on the at least one operational data based on the grab configuration data; by installing the application to be processed, the data capture is simple and quick, and the data to be obtained can be obtained by skipping various encryption methods and/or reverse-crawling methods.

Description

Application data capturing method and device, storage medium and electronic equipment
Technical Field
The present disclosure relates to data capture technologies, and in particular, to a method and an apparatus for capturing application data, a storage medium, and an electronic device.
Background
With the rapid development of big data, the importance of the data is further verified, and people acquire the data and also protect the data; along with the increase of data value, the measure for protecting the data is stronger, the protection measure is also called as the back-climbing, and along with the improvement of the back-climbing technology, the difficulty of data capture is larger and larger. Through traditional data capture technology, it is difficult to handle current anti-crawl technique and carry out data capture.
Disclosure of Invention
The present disclosure is proposed to solve the above technical problems. The embodiment of the disclosure provides an application data capturing method and device, a storage medium and electronic equipment.
According to an aspect of the embodiments of the present disclosure, there is provided an application data capturing method, including:
acquiring grabbing configuration data corresponding to the installed application to be processed from a plurality of grabbing configuration data stored in a database;
starting the application to be processed, executing at least one operation in the application to be processed, and obtaining at least one operation data;
performing a grab operation on the at least one operation data based on the grab configuration data.
Optionally, the crawling configuration data comprises at least one crawling keyword and at least one crawling element; each grabbing keyword corresponds to at least one grabbing element;
the performing a grab operation on the at least one operation data based on the grab configuration data comprises:
determining at least one corresponding grab element based on at least one grab keyword in the grab configuration data;
performing a grab operation on the at least one operation data based on the grab element.
Optionally, the determining, based on at least one crawling keyword in the crawling configuration data, the corresponding at least one crawling element includes:
searching and determining at least one element number from the grabbing configuration data based on the grabbing keywords; wherein each captured keyword corresponds to at least one element number;
searching and determining at least one grabbing element from the grabbing configuration data based on the element number; wherein each of the grab elements corresponds to an element number.
Optionally, before the starting the to-be-processed application, executing at least one operation in the to-be-processed application based on the grab configuration data, and obtaining at least one operation data, the method further includes:
judging whether a port corresponding to the application to be processed is occupied by other applications or not;
responding to the port not occupied by other applications, and starting the application to be processed;
and in response to the port being occupied by other applications, closing the other applications occupying the port, and starting the application to be processed.
Optionally, before the obtaining the grab configuration data corresponding to the installed application to be processed from the plurality of grab configuration data stored in the database, the method further includes:
and acquiring an installation package of the application to be processed, and installing the application to be processed based on the installation package.
Optionally, before the obtaining the grab configuration data corresponding to the installed application to be processed from the plurality of grab configuration data stored in the database, the method further includes:
and acquiring the plurality of capture configuration data and storing the capture configuration data in the database.
Optionally, the method further comprises:
whether grabbing is successful is determined based on whether the grabbing operation obtains target data corresponding to the grabbing configuration data;
responding to target data corresponding to the grabbing configuration data obtained based on the grabbing operation, determining that the grabbing operation is successful, and storing the target data;
and determining that the grabbing operation fails in response to not obtaining target data corresponding to the grabbing configuration data based on the grabbing operation.
Optionally, after determining that the grabbing operation fails, the method further includes:
and screenshot is carried out on the display page of the application to be processed corresponding to the grabbing operation and is stored.
According to another aspect of the embodiments of the present disclosure, there is provided an application data capturing apparatus, including:
the data configuration module is used for acquiring the capture configuration data corresponding to the installed application to be processed from the plurality of capture configuration data stored in the database;
the operation execution module is used for starting the application to be processed, executing at least one operation in the application to be processed and obtaining at least one operation data;
and the data grabbing module is used for executing grabbing operation on the at least one operation data based on the grabbing configuration data.
Optionally, the crawling configuration data comprises at least one crawling keyword and at least one crawling element; each grabbing keyword corresponds to at least one grabbing element;
the data capturing module is specifically configured to determine at least one corresponding capturing element based on at least one capturing keyword in the capturing configuration data; performing a grab operation on the at least one operation data based on the grab element.
Optionally, when determining the corresponding at least one grab element based on at least one grab keyword in the grab configuration data, the data grab module is configured to search and determine at least one element number from the grab configuration data based on the grab keyword; wherein each captured keyword corresponds to at least one element number; searching and determining at least one grabbing element from the grabbing configuration data based on the element number; wherein each of the grab elements corresponds to an element number.
Optionally, the apparatus further comprises:
the port judgment module is used for judging whether the port corresponding to the application to be processed is occupied by other applications or not; responding to the port not occupied by other applications, and starting the application to be processed; and in response to the port being occupied by other applications, closing the other applications occupying the port, and starting the application to be processed.
Optionally, the apparatus further comprises:
and the application installation module is used for acquiring the installation package of the application to be processed and installing the application to be processed based on the installation package.
Optionally, the apparatus further comprises:
and the data storage module is used for acquiring the plurality of capture configuration data and storing the capture configuration data into the database.
Optionally, the apparatus further comprises:
the grabbing judgment module is used for determining whether grabbing is successful or not based on whether the grabbing operation obtains the target data corresponding to the grabbing configuration data or not; responding to target data corresponding to the grabbing configuration data obtained based on the grabbing operation, determining that the grabbing operation is successful, and storing the target data; and determining that the grabbing operation fails in response to not obtaining target data corresponding to the grabbing configuration data based on the grabbing operation.
Optionally, after determining that the grabbing operation fails, the grabbing judgment module is further configured to capture and store a screenshot of the display page of the application to be processed corresponding to the grabbing operation.
According to still another aspect of the embodiments of the present disclosure, there is provided a computer-readable storage medium storing a computer program for executing the application data crawling method according to any one of the embodiments.
According to still another aspect of the embodiments of the present disclosure, there is provided an electronic apparatus including:
a processor;
a memory for storing the processor-executable instructions;
the processor is configured to read the executable instruction from the memory and execute the instruction to implement the application data fetching method according to any of the above embodiments.
Based on the application data capturing method and device, the storage medium and the electronic device provided by the embodiment of the disclosure, capturing configuration data corresponding to an installed application to be processed is obtained from a plurality of capturing configuration data stored in a database; starting the application to be processed, executing at least one operation in the application to be processed, and obtaining at least one operation data; performing a grab operation on the at least one operational data based on the grab configuration data; by installing the application to be processed, the data capture is simple and quick, and the data to be obtained can be obtained by skipping various encryption methods and/or reverse-crawling methods.
The technical solution of the present disclosure is further described in detail by the accompanying drawings and examples.
Drawings
The above and other objects, features and advantages of the present disclosure will become more apparent by describing in more detail embodiments of the present disclosure with reference to the attached drawings. The accompanying drawings are included to provide a further understanding of the embodiments of the disclosure and are incorporated in and constitute a part of this specification, illustrate embodiments of the disclosure and together with the description serve to explain the principles of the disclosure and not to limit the disclosure. In the drawings, like reference numbers generally represent like parts or steps.
Fig. 1 is a schematic flowchart of an application data crawling method according to an exemplary embodiment of the present disclosure.
FIG. 2 is a schematic flow chart of step 106 in the embodiment shown in FIG. 1 of the present disclosure.
Fig. 3 is a flowchart illustrating an application data crawling method according to another exemplary embodiment of the present disclosure.
Fig. 4 is a flowchart illustrating an application data crawling method according to yet another exemplary embodiment of the present disclosure.
Fig. 5 is a flowchart illustrating an application data crawling method according to still another exemplary embodiment of the present disclosure.
Fig. 6 is a schematic structural diagram of an application data crawling apparatus according to an exemplary embodiment of the present disclosure.
Fig. 7 is a block diagram of an electronic device provided in an exemplary embodiment of the present disclosure.
Detailed Description
Hereinafter, example embodiments according to the present disclosure will be described in detail with reference to the accompanying drawings. It is to be understood that the described embodiments are merely a subset of the embodiments of the present disclosure and not all embodiments of the present disclosure, with the understanding that the present disclosure is not limited to the example embodiments described herein.
It should be noted that: the relative arrangement of the components and steps, the numerical expressions, and numerical values set forth in these embodiments do not limit the scope of the present disclosure unless specifically stated otherwise.
It will be understood by those of skill in the art that the terms "first," "second," and the like in the embodiments of the present disclosure are used merely to distinguish one element from another, and are not intended to imply any particular technical meaning, nor is the necessary logical order between them.
It is also understood that in embodiments of the present disclosure, "a plurality" may refer to two or more and "at least one" may refer to one, two or more.
It is also to be understood that any reference to any component, data, or structure in the embodiments of the disclosure, may be generally understood as one or more, unless explicitly defined otherwise or stated otherwise.
In addition, the term "and/or" in the present disclosure is only one kind of association relationship describing an associated object, and means that three kinds of relationships may exist, for example, a and/or B may mean: a exists alone, A and B exist simultaneously, and B exists alone. In addition, the character "/" in the present disclosure generally indicates that the former and latter associated objects are in an "or" relationship.
It should also be understood that the description of the various embodiments of the present disclosure emphasizes the differences between the various embodiments, and the same or similar parts may be referred to each other, so that the descriptions thereof are omitted for brevity.
Meanwhile, it should be understood that the sizes of the respective portions shown in the drawings are not drawn in an actual proportional relationship for the convenience of description.
The following description of at least one exemplary embodiment is merely illustrative in nature and is in no way intended to limit the disclosure, its application, or uses.
Techniques, methods, and apparatus known to those of ordinary skill in the relevant art may not be discussed in detail but are intended to be part of the specification where appropriate.
It should be noted that: like reference numbers and letters refer to like items in the following figures, and thus, once an item is defined in one figure, further discussion thereof is not required in subsequent figures.
The disclosed embodiments may be applied to electronic devices such as terminal devices, computer systems, servers, etc., which are operational with numerous other general purpose or special purpose computing system environments or configurations. Examples of well known terminal devices, computing systems, environments, and/or configurations that may be suitable for use with electronic devices, such as terminal devices, computer systems, servers, and the like, include, but are not limited to: personal computer systems, server computer systems, thin clients, thick clients, hand-held or laptop devices, microprocessor-based systems, set top boxes, programmable consumer electronics, network pcs, minicomputer systems, mainframe computer systems, distributed cloud computing environments that include any of the above systems, and the like.
Electronic devices such as terminal devices, computer systems, servers, etc. may be described in the general context of computer system-executable instructions, such as program modules, being executed by a computer system. Generally, program modules may include routines, programs, objects, components, logic, data structures, etc. that perform particular tasks or implement particular abstract data types. The computer system/server may be practiced in distributed cloud computing environments where tasks are performed by remote processing devices that are linked through a communications network. In a distributed cloud computing environment, program modules may be located in both local and remote computer system storage media including memory storage devices.
Summary of the application
In the process of implementing the present disclosure, the inventor finds that the existing data capturing is usually obtained by a crawler crawling manner, but the prior art at least has the following problems: the reverse climbing technology cannot be solved.
Exemplary method
Fig. 1 is a schematic flowchart of an application data crawling method according to an exemplary embodiment of the present disclosure. The embodiment can be applied to a virtual machine, an electronic device, and the like, as shown in fig. 1, and includes the following steps:
and 102, acquiring grabbing configuration data corresponding to the installed application to be processed from a plurality of grabbing configuration data stored in the database.
Optionally, each application to be processed corresponds to one piece of capture configuration data, and the corresponding data in the application to be processed is captured through the configuration in the capture configuration data; in this embodiment, a plurality of capture configuration data may be stored in the local database.
Step 104, starting the application to be processed, executing at least one operation in the application to be processed, and obtaining at least one operation data.
In an embodiment, the application to be processed is locally started, and the operation in the application to be processed is executed, so that at least one piece of operation data corresponding to the application to be processed can be generated through the operation, and the anti-crawling technical means such as signature algorithm cracking can be skipped through the operation.
And 106, executing grabbing operation on the at least one operation data based on the grabbing configuration data.
Optionally, after the application to be processed performs the operation, all the operation data may be captured, but in the embodiment, what needs to be captured is data corresponding to the capture configuration data, so that the capture operation is performed on at least one operation data based on the capture configuration data to obtain the data that needs to be captured.
In the application data capture method provided by the embodiment of the present disclosure, capture configuration data corresponding to an installed application to be processed is obtained from a plurality of capture configuration data stored in a database; starting the application to be processed, executing at least one operation in the application to be processed, and obtaining at least one operation data; performing a grab operation on the at least one operational data based on the grab configuration data; by installing the application to be processed, the data capture is simple and quick, and the data to be obtained can be obtained by skipping various encryption methods and/or reverse-crawling methods.
As shown in fig. 2, based on the embodiment shown in fig. 1, step 106 may include the following steps:
step 1061, determining at least one corresponding grabbing element based on at least one grabbing keyword in the grabbing configuration data.
The grabbing configuration data comprises at least one grabbing keyword and at least one grabbing element; each grabbing keyword corresponds to at least one grabbing element.
Optionally, the capture configuration data may be data information stored in a form of a table, each capture keyword in the table corresponds to at least one capture element, and in this embodiment, at least one capture keyword corresponding to data that needs to be captured by the application to be processed may be obtained in the capture configuration data corresponding to the application to be processed.
Step 1062, performing a grab operation on the at least one operation data based on the grab element.
Optionally, the grabbing elements may include number information corresponding to data to be grabbed, coordinate position information of the data to be grabbed may be determined through the number information, grabbing operations such as clicking, sliding, inputting and the like may be realized through simulation of at least one grabbing element, further operation simulation of the application to be processed is realized, operation data is obtained through simulation operation, and then simulation of the grabbing operations is realized according to the grabbing elements, so that the data to be grabbed is obtained. For example, the process of grabbing specific data for APP may include: APP name- -search keyword- -id of operation (which may include multiple, e.g., id1 of operation, id2 of operation, id3 of operation, etc.) - -id of crawl (which may include multiple, e.g., id1 of crawl, id2 of crawl, id3 of crawl, etc.). Such as: sky eye search-Beijing Jinbei technology Limited company-search corresponding id element-next page element corresponding id-grab element.
Optionally, step 1061 includes:
searching and determining at least one element number from the grabbing configuration data based on the grabbing keywords; each captured keyword corresponds to at least one element number;
searching and determining at least one grabbing element from the grabbing configuration data based on the element number; wherein each grabbing element corresponds to an element number.
In this embodiment, the grasping configuration data includes not only the grasping key words and the grasping elements, but also element numbers (e.g., element IDs), corresponding relationships are established between the grasping key words and the grasping elements through the element numbers, when the grasping elements need to be searched, at least one corresponding element number can be directly determined through the grasping key words, the storage space is saved in the form of the numbers, the searching efficiency is improved, and the corresponding grasping elements are obtained through the corresponding relationships between the element numbers and the grasping elements, so that the utilization rate of the grasping elements can be improved, for example, grasping element a, B, and c corresponds to the grasping key word a, grasping element B, grasping element a, c, and e corresponds to the grasping key word B, at this time, if the corresponding relationships between the grasping key words and the grasping elements are directly stored, a large amount of repeated data can occupy more storage space, and when corresponding is performed through the element, the grabbing elements do not need to be stored in the table corresponding to the grabbing keywords, the corresponding grabbing elements are corresponding to the grabbing keywords through the element numbers, and the grabbing elements corresponding to the element numbers are searched from the stored grabbing elements, so that the searching efficiency is improved.
Fig. 3 is a flowchart illustrating an application data crawling method according to another exemplary embodiment of the present disclosure. As shown in fig. 3, the method comprises the following steps:
and 102, acquiring grabbing configuration data corresponding to the installed application to be processed from a plurality of grabbing configuration data stored in the database.
Step 303, judging whether a port corresponding to the application to be processed is occupied by other applications; if yes, after closing other applications occupying the port, executing step 104; otherwise, step 104 is performed.
Step 104, starting the application to be processed, executing at least one operation in the application to be processed, and obtaining at least one operation data.
And 106, executing grabbing operation on the at least one operation data based on the grabbing configuration data.
In this embodiment, different applications may correspond to the same port, and before the application is started, it is necessary to determine whether the port corresponding to the application is occupied, and if the port is occupied, it is necessary to release the port (close other applications occupying the port), so that before the application to be processed is started, it is determined whether the port corresponding to the application to be processed is occupied by other applications, and if the port is occupied, the application to be processed is started after other applications occupying the port are closed; if not occupied, the application to be processed can be directly started; through the judgment process, the starting success rate of the application to be processed is improved, and the repeated starting of the application is avoided.
For example, an application to be processed is tested by an automatic testing tool (appium, which may be in a virtual machine or a client), and metstat-aon | findstr is executed by os.pop to see whether a port corresponding to the application to be processed is occupied; then closing the pid process of the occupied port through taskill-f-pid; then starting the appium through 'start/b appium-session-override-a' + host + '-p' + port + '-bp' + str (port + 1); and detecting whether the appium is started or not through netstat.
In some optional embodiments, before performing step 102, the method may further include:
and acquiring an installation package of the application to be processed, and installing the application to be processed based on the installation package.
In the embodiment, in order to overcome the anti-crawling technology in the application to be processed, the installation package of the application to be processed is downloaded to the local for installation, the application to be processed is installed to the local and runs locally, processes such as processing of a communication protocol and cracking of a signature algorithm are skipped, and operation data in the application to be processed can be directly obtained locally.
Fig. 4 is a flowchart illustrating an application data crawling method according to yet another exemplary embodiment of the present disclosure. As shown in fig. 4, the method comprises the following steps:
step 401, acquiring a plurality of capture configuration data and storing the capture configuration data in a database.
And 102, acquiring grabbing configuration data corresponding to the installed application to be processed from a plurality of grabbing configuration data stored in the database.
Step 104, starting the application to be processed, executing at least one operation in the application to be processed, and obtaining at least one operation data.
And 106, executing grabbing operation on the at least one operation data based on the grabbing configuration data.
In the embodiment, the capturing configuration data is stored in the local database in advance, so that a plurality of applications to be processed can be captured, and after a certain application is determined to be captured, the capturing configuration data corresponding to the application is searched from the database without repeated configuration, so that the time is saved, and the efficiency of data capturing is improved.
Fig. 5 is a flowchart illustrating an application data crawling method according to still another exemplary embodiment of the present disclosure. As shown in fig. 5, the method comprises the following steps:
and 102, acquiring grabbing configuration data corresponding to the installed application to be processed from a plurality of grabbing configuration data stored in the database.
Step 104, starting the application to be processed, executing at least one operation in the application to be processed, and obtaining at least one operation data.
And 106, executing grabbing operation on the at least one operation data based on the grabbing configuration data.
Step 508, whether the grabbing is successful is determined based on whether the grabbing operation obtains the target data corresponding to the grabbing configuration data, if so, the target data is stored, and the grabbing is finished; otherwise, go to step 510;
and step 510, determining that the grabbing fails, and capturing and storing the display page of the application to be processed corresponding to the grabbing operation.
In the embodiment, whether target data needing to be captured can be obtained as a judgment standard for successful capturing or not is determined, the data obtained after successful capturing is stored, and when capturing fails, in order to quickly find an abnormal problem during capturing, screenshot is supported, the screenshot is stored, and is sent to a set address (for example, a mailbox of a developer).
In some optional embodiments, before acquiring the application to be processed for data capture, environment construction and setting of a client (e.g., a mobile phone) may also be performed: for example, the program needs to build the following environment: comprises the following tools: wherein a version number marked must be installed corresponding to the version number.
Js version number 6.11.3
Appium version number 1.7.2
·Appium-desktop
·Appium-doctor
·Appium-Python-Client
·Python3
JDK version number 1.8.0_05
·Andriod SDK
Redis database
Mysql database.
For another example, when the client is a mobile phone, the mobile phone is configured to be in a developer mode, and the USB debugging is started; downloading and installing a required captured APP installation package; and adding APP information, mobile phone information and appium information in the program configuration file.
Any one of the application data capture methods provided by the embodiments of the present disclosure may be executed by any suitable device having data processing capability, including but not limited to: terminal equipment, a server and the like. Alternatively, any of the application data fetching methods provided by the embodiments of the present disclosure may be executed by a processor, for example, the processor may execute any of the application data fetching methods mentioned in the embodiments of the present disclosure by calling a corresponding instruction stored in a memory. And will not be described in detail below.
Exemplary devices
Fig. 6 is a schematic structural diagram of an application data crawling apparatus according to an exemplary embodiment of the present disclosure. As shown in fig. 6, the present embodiment includes:
and the data configuration module 61 is configured to obtain grab configuration data corresponding to the installed application to be processed from the multiple grab configuration data stored in the database.
And the operation execution module 62 is configured to start the application to be processed, execute at least one operation in the application to be processed, and obtain at least one operation data.
And a data fetching module 63 configured to perform a fetching operation on the at least one operation data based on the fetching configuration data.
The application data capture device provided by the above embodiment of the present disclosure obtains capture configuration data corresponding to an installed application to be processed from a plurality of capture configuration data stored in a database; starting the application to be processed, executing at least one operation in the application to be processed, and obtaining at least one operation data; performing a grab operation on the at least one operational data based on the grab configuration data; by installing the application to be processed, the data capture is simple and quick, and the data to be obtained can be obtained by skipping various encryption methods and/or reverse-crawling methods.
Optionally, the data crawling module 63 is specifically configured to determine, based on at least one crawling keyword in the crawling configuration data, at least one corresponding crawling element; a grab operation is performed on the at least one operation data based on the grab element.
The grabbing configuration data comprises at least one grabbing keyword and at least one grabbing element; each grabbing keyword corresponds to at least one grabbing element.
Optionally, the data crawling module 63 is configured to, when determining the corresponding at least one crawling element based on at least one crawling keyword in the crawling configuration data, search and determine at least one element number from the crawling configuration data based on the crawling keyword; each captured keyword corresponds to at least one element number; searching and determining at least one grabbing element from the grabbing configuration data based on the element number; wherein each grabbing element corresponds to an element number.
In some optional embodiments, the apparatus provided in this embodiment further includes:
the port judgment module is used for judging whether a port corresponding to the application to be processed is occupied by other applications; responding to the fact that the port is not occupied by other applications, and starting the application to be processed; and in response to the port being occupied by other applications, closing the other applications occupying the port and starting the application to be processed.
In this embodiment, different applications may correspond to the same port, and before the application is started, it is necessary to determine whether the port corresponding to the application is occupied, and if the port is occupied, it is necessary to release the port (close other applications occupying the port), so that before the application to be processed is started, it is determined whether the port corresponding to the application to be processed is occupied by other applications, and if the port is occupied, the application to be processed is started after other applications occupying the port are closed; if not occupied, the application to be processed can be directly started; through the judgment process, the starting success rate of the application to be processed is improved, and the repeated starting of the application is avoided.
In some optional embodiments, the apparatus provided in this embodiment further includes:
and the application installation module is used for acquiring an installation package of the application to be processed and installing the application to be processed based on the installation package.
In the embodiment, in order to overcome the anti-crawling technology in the application to be processed, the installation package of the application to be processed is downloaded to the local for installation, the application to be processed is installed to the local and runs locally, processes such as processing of a communication protocol and cracking of a signature algorithm are skipped, and operation data in the application to be processed can be directly obtained locally.
In some optional embodiments, the apparatus provided in this embodiment further includes:
and the data storage module is used for acquiring a plurality of capture configuration data and storing the capture configuration data in a database.
In some optional embodiments, the apparatus provided in this embodiment further includes:
the grabbing judgment module is used for determining whether grabbing is successful or not based on whether grabbing operation obtains target data corresponding to the grabbing configuration data or not; responding to target data corresponding to the grabbing configuration data obtained based on the grabbing operation, determining that the grabbing operation is successful, and storing the target data; and determining that the grabbing operation fails in response to not obtaining the target data corresponding to the grabbing configuration data based on the grabbing operation.
Optionally, the capture judgment module is further configured to capture and store a screenshot of the display page of the to-be-processed application corresponding to the capture operation after determining that the capture operation fails.
Exemplary electronic device
Next, an electronic apparatus according to an embodiment of the present disclosure is described with reference to fig. 7. The electronic device may be either or both of the first device 100 and the second device 200, or a stand-alone device separate from them that may communicate with the first device and the second device to receive the collected input signals therefrom.
FIG. 7 illustrates a block diagram of an electronic device in accordance with an embodiment of the disclosure.
As shown in fig. 7, the electronic device 70 includes one or more processors 71 and a memory 72.
The processor 71 may be a Central Processing Unit (CPU) or other form of processing unit having data processing capabilities and/or instruction execution capabilities, and may control other components in the electronic device 70 to perform desired functions.
Memory 72 may include one or more computer program products that may include various forms of computer-readable storage media, such as volatile memory and/or non-volatile memory. The volatile memory may include, for example, Random Access Memory (RAM), cache memory (cache), and/or the like. The non-volatile memory may include, for example, Read Only Memory (ROM), hard disk, flash memory, etc. One or more computer program instructions may be stored on the computer-readable storage medium and executed by processor 71 to implement the application data crawling methods of the various embodiments of the present disclosure described above and/or other desired functions. Various contents such as an input signal, a signal component, a noise component, etc. may also be stored in the computer-readable storage medium.
In one example, the electronic device 70 may further include: an input device 73 and an output device 74, which are interconnected by a bus system and/or other form of connection mechanism (not shown).
For example, when the electronic device is the first device 100 or the second device 200, the input device 73 may be a microphone or a microphone array as described above for capturing an input signal of a sound source. When the electronic device is a stand-alone device, the input means 73 may be a communication network connector for receiving the acquired input signals from the first device 100 and the second device 200.
The input device 73 may also include, for example, a keyboard, a mouse, and the like.
The output device 74 may output various information including the determined distance information, direction information, and the like to the outside. The output devices 74 may include, for example, a display, speakers, a printer, and a communication network and remote output devices connected thereto, among others.
Of course, for simplicity, only some of the components of the electronic device 70 relevant to the present disclosure are shown in fig. 7, omitting components such as buses, input/output interfaces, and the like. In addition, the electronic device 70 may include any other suitable components, depending on the particular application.
Exemplary computer program product and computer-readable storage Medium
In addition to the above-described methods and apparatus, embodiments of the present disclosure may also be a computer program product comprising computer program instructions that, when executed by a processor, cause the processor to perform the steps in the application data crawling method according to various embodiments of the present disclosure described in the "exemplary methods" section of this specification above.
The computer program product may write program code for carrying out operations for embodiments of the present disclosure in any combination of one or more programming languages, including an object oriented programming language such as Java, C + + or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computing device, partly on the user's device, as a stand-alone software package, partly on the user's computing device and partly on a remote computing device, or entirely on the remote computing device or server.
Furthermore, embodiments of the present disclosure may also be a computer-readable storage medium having stored thereon computer program instructions that, when executed by a processor, cause the processor to perform steps in an application data crawling method according to various embodiments of the present disclosure described in the "exemplary methods" section above in this specification.
The computer-readable storage medium may take any combination of one or more readable media. The readable medium may be a readable signal medium or a readable storage medium. A readable storage medium may include, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or a combination of any of the foregoing. More specific examples (a non-exhaustive list) of the readable storage medium include: an electrical connection having one or more wires, a portable disk, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
The foregoing describes the general principles of the present disclosure in conjunction with specific embodiments, however, it is noted that the advantages, effects, etc. mentioned in the present disclosure are merely examples and are not limiting, and they should not be considered essential to the various embodiments of the present disclosure. Furthermore, the foregoing disclosure of specific details is for the purpose of illustration and description and is not intended to be limiting, since the disclosure is not intended to be limited to the specific details so described.
In the present specification, the embodiments are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same or similar parts in the embodiments are referred to each other. For the system embodiment, since it basically corresponds to the method embodiment, the description is relatively simple, and for the relevant points, reference may be made to the partial description of the method embodiment.
The block diagrams of devices, apparatuses, systems referred to in this disclosure are only given as illustrative examples and are not intended to require or imply that the connections, arrangements, configurations, etc. must be made in the manner shown in the block diagrams. These devices, apparatuses, devices, systems may be connected, arranged, configured in any manner, as will be appreciated by those skilled in the art. Words such as "including," "comprising," "having," and the like are open-ended words that mean "including, but not limited to," and are used interchangeably therewith. The words "or" and "as used herein mean, and are used interchangeably with, the word" and/or, "unless the context clearly dictates otherwise. The word "such as" is used herein to mean, and is used interchangeably with, the phrase "such as but not limited to".
The methods and apparatus of the present disclosure may be implemented in a number of ways. For example, the methods and apparatus of the present disclosure may be implemented by software, hardware, firmware, or any combination of software, hardware, and firmware. The above-described order for the steps of the method is for illustration only, and the steps of the method of the present disclosure are not limited to the order specifically described above unless specifically stated otherwise. Further, in some embodiments, the present disclosure may also be embodied as programs recorded in a recording medium, the programs including machine-readable instructions for implementing the methods according to the present disclosure. Thus, the present disclosure also covers a recording medium storing a program for executing the method according to the present disclosure.
It is also noted that in the devices, apparatuses, and methods of the present disclosure, each component or step can be decomposed and/or recombined. These decompositions and/or recombinations are to be considered equivalents of the present disclosure.
The previous description of the disclosed aspects is provided to enable any person skilled in the art to make or use the present disclosure. Various modifications to these aspects will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other aspects without departing from the scope of the disclosure. Thus, the present disclosure is not intended to be limited to the aspects shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.
The foregoing description has been presented for purposes of illustration and description. Furthermore, this description is not intended to limit embodiments of the disclosure to the form disclosed herein. While a number of example aspects and embodiments have been discussed above, those of skill in the art will recognize certain variations, modifications, alterations, additions and sub-combinations thereof.

Claims (11)

1. An application data crawling method is characterized by comprising the following steps:
acquiring grabbing configuration data corresponding to the installed application to be processed from a plurality of grabbing configuration data stored in a database;
starting the application to be processed, executing at least one operation in the application to be processed, and obtaining at least one operation data;
performing a grab operation on the at least one operational data based on the grab configuration data; wherein the crawling configuration data comprises at least one crawling keyword and at least one crawling element; each grabbing keyword corresponds to at least one grabbing element; the grabbing elements comprise number information corresponding to the data needing to be grabbed, and the coordinate position information of the data needing to be grabbed is determined through the number information.
2. The method of claim 1, wherein performing a grab operation on the at least one operation data based on the grab configuration data comprises:
determining at least one corresponding grab element based on at least one grab keyword in the grab configuration data;
performing a grab operation on the at least one operation data based on the grab element.
3. The method of claim 2, wherein determining the corresponding at least one crawl element based on the at least one crawl keyword in the crawl configuration data comprises:
searching and determining at least one element number from the grabbing configuration data based on the grabbing keywords; wherein each captured keyword corresponds to at least one element number;
searching and determining at least one grabbing element from the grabbing configuration data based on the element number; wherein each of the grab elements corresponds to an element number.
4. The method according to any of claims 1-3, wherein before the launching the application to be processed, performing at least one operation in the application to be processed based on the grab configuration data, and obtaining at least one operation data, further comprising:
judging whether a port corresponding to the application to be processed is occupied by other applications or not;
responding to the port not occupied by other applications, and starting the application to be processed;
and in response to the port being occupied by other applications, closing the other applications occupying the port, and starting the application to be processed.
5. The method according to any one of claims 1 to 3, wherein before obtaining the grab configuration data corresponding to the installed application to be processed from the plurality of grab configuration data stored in the database, the method further comprises:
and acquiring an installation package of the application to be processed, and installing the application to be processed based on the installation package.
6. The method according to any one of claims 1 to 3, wherein before obtaining the grab configuration data corresponding to the installed application to be processed from the plurality of grab configuration data stored in the database, the method further comprises:
and acquiring the plurality of capture configuration data and storing the capture configuration data in the database.
7. The method of any of claims 1-3, further comprising:
whether grabbing is successful is determined based on whether the grabbing operation obtains target data corresponding to the grabbing configuration data;
responding to target data corresponding to the grabbing configuration data obtained based on the grabbing operation, determining that the grabbing operation is successful, and storing the target data;
and determining that the grabbing operation fails in response to not obtaining target data corresponding to the grabbing configuration data based on the grabbing operation.
8. The method of claim 7, wherein after determining that the grab operation failed, further comprising:
and screenshot is carried out on the display page of the application to be processed corresponding to the grabbing operation and is stored.
9. An application data grasping apparatus, comprising:
the data configuration module is used for acquiring the capture configuration data corresponding to the installed application to be processed from the plurality of capture configuration data stored in the database;
the operation execution module is used for starting the application to be processed, executing at least one operation in the application to be processed and obtaining at least one operation data;
a data fetch module for performing fetch operations on the at least one operational data based on the fetch configuration data; wherein the crawling configuration data comprises at least one crawling keyword and at least one crawling element; each grabbing keyword corresponds to at least one grabbing element; the grabbing elements comprise number information corresponding to the data needing to be grabbed, and the coordinate position information of the data needing to be grabbed is determined through the number information.
10. A computer-readable storage medium, characterized in that the storage medium stores a computer program for executing the application data fetching method of any one of the preceding claims 1 to 8.
11. An electronic device, characterized in that the electronic device comprises:
a processor;
a memory for storing the processor-executable instructions;
the processor is used for reading the executable instructions from the memory and executing the instructions to realize the application data grabbing method of any one of the claims 1 to 8.
CN201910796526.8A 2019-08-27 2019-08-27 Application data capturing method and device, storage medium and electronic equipment Active CN110704709B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910796526.8A CN110704709B (en) 2019-08-27 2019-08-27 Application data capturing method and device, storage medium and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910796526.8A CN110704709B (en) 2019-08-27 2019-08-27 Application data capturing method and device, storage medium and electronic equipment

Publications (2)

Publication Number Publication Date
CN110704709A CN110704709A (en) 2020-01-17
CN110704709B true CN110704709B (en) 2021-01-26

Family

ID=69193411

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910796526.8A Active CN110704709B (en) 2019-08-27 2019-08-27 Application data capturing method and device, storage medium and electronic equipment

Country Status (1)

Country Link
CN (1) CN110704709B (en)

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130305222A1 (en) * 2012-05-11 2013-11-14 Microsoft Corporation Development System
CN103049371A (en) * 2012-11-07 2013-04-17 青岛海信传媒网络技术有限公司 Testing method and testing device of Android application programs
CN104598287B (en) * 2013-10-30 2019-02-12 北京猎豹移动科技有限公司 Detection method, device and the client of rogue program
CN105320595A (en) * 2014-07-31 2016-02-10 腾讯科技(深圳)有限公司 Application test method and device
CN109561117B (en) * 2017-09-26 2022-02-25 北京国双科技有限公司 Data acquisition method and device
CN110162682A (en) * 2019-04-12 2019-08-23 深圳壹账通智能科技有限公司 A kind of crawling method of network data, device, storage medium and terminal device

Also Published As

Publication number Publication date
CN110704709A (en) 2020-01-17

Similar Documents

Publication Publication Date Title
US9876812B1 (en) Automatic malware signature extraction from runtime information
US9781144B1 (en) Determining duplicate objects for malware analysis using environmental/context information
US9294486B1 (en) Malware detection and analysis
TW201915809A (en) Safe scanning method and device for mini application and electronic equipment
US9686303B2 (en) Web page vulnerability detection method and apparatus
US20150012924A1 (en) Method and Device for Loading a Plug-In
CN109873804A (en) Service identification method, device, equipment and the readable storage medium storing program for executing of Behavior-based control
US11748473B2 (en) Intrusion detection in micro-services through container telemetry and behavior modeling
US9876777B2 (en) Method and browser for online banking login
US10013251B2 (en) Method and computer program stored in computer-readable medium for converting sort script to reuse JCL in different encoding environment
CN111163095B (en) Network attack analysis method, network attack analysis device, computing device, and medium
EP3111359B1 (en) Method and system for extracting characteristic information
US10372466B2 (en) Rule-based monitoring engine with tracing capabilities for multi-threaded logging
US11108787B1 (en) Securing a network device by forecasting an attack event using a recurrent neural network
CN110765288B (en) Image information synchronization method, device and system and storage medium
US9087137B2 (en) Detection of custom parameters in a request URL
EP2988242B1 (en) Information processing device, and information processing method
WO2014150921A1 (en) Cord and cable identification and tracking system
CN110737891A (en) host intrusion detection method and device
CN110704025A (en) Method and device for generating coding specification tool, storage medium and electronic equipment
US10621063B2 (en) System and method for dynamic domain-specific sequence diagram visualization
JP2017123142A (en) System and method for detection of phishing script
CN110704709B (en) Application data capturing method and device, storage medium and electronic equipment
US10375576B1 (en) Detection of malware apps that hijack app user interfaces
CN115454576B (en) Virtual machine process management method and system and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant