WO2022001564A1 - Operation set obtaining and executing methods and apparatuses, storage medium, and terminal device - Google Patents

Publication number
WO2022001564A1
WO2022001564A1 PCT/CN2021/097922 CN2021097922W WO2022001564A1 WO 2022001564 A1 WO2022001564 A1 WO 2022001564A1 CN 2021097922 W CN2021097922 W CN 2021097922W WO 2022001564 A1 WO2022001564 A1 WO 2022001564A1
Authority
WO
WIPO (PCT)
Prior art keywords
execution
current
operations
information
frame image
Prior art date
Application number
PCT/CN2021/097922
Other languages
French (fr)
Chinese (zh)
Inventor
高宏华
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司
Publication of WO2022001564A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048 Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484 Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • G06F3/04842 Selection of displayed objects or displayed text elements
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/22 Matching criteria, e.g. proximity measures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048 Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484 Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/72 Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724 User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72448 User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/72 Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724 User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72448 User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions
    • H04M1/72454 User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions according to context-related or environment-related conditions
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/72 Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724 User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72466 User interfaces specially adapted for cordless or mobile telephones with selection means, e.g. keys, having functions defined by the mode or the status of the device

Definitions

  • the present disclosure relates to the field of communications, and in particular, to a method and apparatus for acquiring and executing an operation set, a storage medium, and a terminal device.
  • For terminal devices (e.g., mobile phones, tablet computers, notebook computers, and personal computers (PCs)), use and operation are becoming more and more complicated.
  • Embodiments of the present disclosure provide a method and apparatus for acquiring and executing an operation set, a storage medium, and a terminal device, so as to at least solve the problem of how to simplify user operations on the terminal device.
  • a method for obtaining an operation set, including: receiving one or more operations on a terminal device, and obtaining operation information of each of the one or more operations, wherein the operation information includes sequence identification information, used to identify the position of the operation in the sequence of the one or more operations, and operation description data of the operation; and generating an operation set according to the operation information, wherein the operation set includes the operation information of the one or more operations.
  • a method for executing an operation set, including: obtaining the operation set when an operation set execution request corresponding to the operation set is received or when an execution condition corresponding to the operation set is judged to be satisfied, wherein the operation set includes operation information of one or more operations, and the operation information includes sequence identification information, used to identify the position of the operation in the sequence of the one or more operations, and operation description data of the operation; and performing the one or more operations according to the operation description data, in the operation sequence identified by the sequence identification information.
  • an apparatus for obtaining an operation set, including: a first obtaining module configured to receive one or more operations on a terminal device and obtain operation information of each of the one or more operations, wherein the operation information includes sequence identification information, used to identify the position of the operation in the sequence of the one or more operations, and operation description data of the operation; and a generating module configured to generate an operation set according to the operation information, wherein the operation set includes the operation information of the one or more operations.
  • an apparatus for executing an operation set, including: a second obtaining module configured to obtain the operation set when an operation set execution request corresponding to the operation set is received or when an execution condition corresponding to the operation set is judged to be satisfied, wherein the operation set includes operation information of one or more operations, and the operation information includes sequence identification information, used to identify the position of the operation in the sequence of the one or more operations, and operation description data of the operation; and an execution module configured to execute the one or more operations according to the operation description data, in the operation sequence identified by the sequence identification information.
  • a computer-readable storage medium is also provided, in which a computer program is stored, wherein the computer program is configured to execute, when run, the steps in any one of the above method embodiments.
  • a terminal device is also provided, including a memory and a processor, wherein the memory stores a computer program, and the processor is configured to run the computer program to execute the steps in any one of the above method embodiments.
  • the operation information of each of the one or more operations can be acquired and an operation set can be generated from it, so that a series of operations performed by the user is recorded automatically and the terminal device can automatically perform that series of operations according to the recorded operation set when triggered by the user or when the execution condition is satisfied. This addresses the problem of how to simplify user operations on the terminal device and enables custom shortcuts to be recorded and executed.
  • Fig. 1 is a block diagram of the hardware structure of a mobile terminal for the methods for acquiring and executing an operation set
  • Fig. 2 is a flowchart of the method for acquiring an operation set
  • Fig. 3 is a flowchart of the method for executing an operation set
  • Fig. 4 is a structural block diagram of the apparatus for acquiring an operation set
  • Fig. 5 is a structural block diagram of the apparatus for executing an operation set
  • Fig. 6 is a schematic diagram of a change in application coordinates
  • Fig. 7 is a schematic diagram of different switch states.
  • the embodiments of the present disclosure provide a solution that allows users to customize shortcuts according to their own usage habits and preferences. The solution can save a set of the user's operations and generate a shortcut from it, so that the user can operate quickly in subsequent use.
  • the solution also supports sending the shortcut to other users for their use, or adding a timer for scheduled execution, which yields more powerful functions.
  • This solution can be used in a wide range of scenarios, such as: one-click navigation or one-click bicycle code scanning on a mobile phone; having a computer automatically open the required software after startup and enter working mode; remote assistance (such as remotely setting an alarm clock), backup, and other series of operations; scheduled check-in; and so on.
  • FIG. 1 is a block diagram of the hardware structure of a mobile terminal with a method for acquiring and executing an operation set.
  • the mobile terminal may include one or more (only one is shown in FIG. 1 ) processor 102 (the processor 102 may include, but is not limited to, a processing device such as a microprocessor MCU or a programmable logic device FPGA) and a memory 104 for storing data, wherein the above-mentioned mobile terminal may also include a transmission device 106 and an input and output device 108 for communication functions.
  • FIG. 1 is only a schematic diagram, which does not limit the structure of the above-mentioned mobile terminal.
  • the mobile terminal may also include more or fewer components than those shown in FIG. 1 , or have a different configuration than that shown in FIG. 1 .
  • the memory 104 can be used to store computer programs, for example, software programs and modules of application software, such as computer programs corresponding to the acquisition and execution methods of the operation sets in the embodiments of the present disclosure.
  • the processor 102 runs the computer programs stored in the memory 104 so as to perform various functional applications and data processing, that is, to implement the above method.
  • Memory 104 may include high-speed random access memory, and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory.
  • the memory 104 may further include memory located remotely from the processor 102, and these remote memories may be connected to the mobile terminal through a network. Examples of such networks include, but are not limited to, the Internet, an intranet, a local area network, a mobile communication network, and combinations thereof.
  • The transmission device 106 is used to receive or transmit data via a network.
  • a specific example of the above-mentioned network may include a wireless network provided by a communication provider of the mobile terminal.
  • the transmission device 106 includes a network adapter (Network Interface Controller, NIC for short), which can be connected to other network devices through a base station so as to communicate with the Internet.
  • the transmission device 106 may be a radio frequency (RF) module, which is used to communicate with the Internet wirelessly.
  • Fig. 2 is a flowchart of the method for obtaining an operation set. As shown in Fig. 2 , the flow process includes the following steps:
  • Step S202: receive one or more operations on the terminal device, and acquire operation information of each of the one or more operations, wherein the operation information includes sequence identification information, used to identify the position of the operation in the sequence of the one or more operations, and operation description data of the operation;
  • Step S204: generate an operation set according to the operation information, wherein the operation set includes the operation information of the one or more operations.
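  • Steps S202 and S204 can be pictured with a minimal sketch. The class names, field names, and the use of a running sequence number as the sequence identification information are illustrative assumptions, not taken from the disclosure:

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class OperationInfo:
    # Sequence identification information (assumed here to be a sequence number).
    sequence_id: int
    # Operation description data (category, coordinates, duration, ...).
    description: dict[str, Any]


@dataclass
class OperationSet:
    operations: list[OperationInfo] = field(default_factory=list)


def build_operation_set(received: list[dict[str, Any]]) -> OperationSet:
    """S202: attach operation information to each received operation.
    S204: collect the operation information into one operation set."""
    infos = [
        OperationInfo(sequence_id=i, description=dict(op))
        for i, op in enumerate(received, start=1)
    ]
    return OperationSet(operations=infos)
```

  • In this sketch the operations are numbered in the order they are received; a recorder could equally store timestamps.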
  • Through the above steps, the operation information of each of the one or more operations can be acquired and an operation set can be generated from it, so that the user's series of operations is recorded automatically and the terminal device can automatically execute that series of operations according to the recorded operation set when the user triggers it or when the execution condition is met. This addresses the problem of how to simplify user operations on the terminal device and enables custom shortcuts to be recorded.
  • the execution subject of the above steps may be a terminal device or the like, but is not limited thereto.
  • the operation information may further include relevant frame images corresponding to the operation, wherein the relevant frame images include: a pre-execution valid frame image, an execution-time valid frame image, and a post-execution valid frame image.
  • the pre-execution valid frame image may be a screen image at a first predetermined time (e.g., 30 ms) before the operation is performed, and the post-execution valid frame image may be a screen image at a second predetermined time (e.g., 80 ms) after the operation is performed.
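  • As a rough illustration of picking the three relevant frame images from a screen recording, the sketch below assumes the recording is available as a list of (timestamp in ms, image) pairs and selects the frame nearest to each target time. The function names and the nearest-frame rule are assumptions; the 30 ms and 80 ms defaults are the example values given above:

```python
def pick_frame(frames, target_ms):
    """Return the (timestamp_ms, image) pair closest in time to target_ms."""
    return min(frames, key=lambda f: abs(f[0] - target_ms))


def relevant_frames(frames, op_time_ms, before_ms=30, after_ms=80):
    """Select pre-execution, execution-time, and post-execution frames
    for an operation performed at op_time_ms."""
    return {
        "pre": pick_frame(frames, op_time_ms - before_ms),
        "during": pick_frame(frames, op_time_ms),
        "post": pick_frame(frames, op_time_ms + after_ms),
    }
```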
  • the post-execution valid frame image of the previous operation may be the same as the pre-execution valid frame image of the next operation; for operations that last for a certain period of time (for example, making a call), the post-execution valid frame image of the previous operation (the call page image after the call is made) and the pre-execution valid frame image of the next operation (the page image after the call is hung up) may be different.
  • the execution-time valid frame image can assist in acquiring the operation description data: image recognition can be performed on the execution-time valid frame image to obtain the operation description data of the operation.
  • step S202 may include the following operations:
  • in response to a received operation set collection request, the terminal device obtains the relevant frame images corresponding to each of the one or more operations through a screen recording function or a screen capture function, and collects the sequence identification information and the operation description data of each of the one or more operations, until a collection end indication is received.
  • in response to the received operation set collection request, the screen of the terminal device may be controlled to display an initial page; the relevant frame images corresponding to each of the one or more operations are then obtained through a screen recording function or a screen capture function, and the sequence identification information and the operation description data of each of the one or more operations are collected, until a collection end indication is received.
  • the user can issue the operation set collection request (for example, by clicking a control for recording a shortcut on the operation interface) to initiate the shortcut recording process, and the terminal device can then obtain the relevant frame images by recording the screen or taking screenshots.
  • the sequence identification information and the operation description data of each of the one or more operations can be obtained by reading them directly from the system, or by combining direct reading from the system with image recognition of the execution-time valid frame image.
  • the sequence identification information for identifying the position of the operation in the sequence of the one or more operations may include at least one of the following: the operation time of the operation, or the sequence number of the operation among the one or more operations.
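  • Either form of sequence identification information is enough to recover the operation order. A minimal sketch, assuming each recorded operation is a dictionary carrying a hypothetical "seq" key (sequence number) and/or "time_ms" key (operation time):

```python
def execution_order(operations):
    """Order operations by sequence number when every operation has one,
    otherwise fall back to the recorded operation time."""
    if all("seq" in op for op in operations):
        return sorted(operations, key=lambda op: op["seq"])
    return sorted(operations, key=lambda op: op["time_ms"])
```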
  • the operation description data of the operation may include at least one of the following: an operation category, a coordinate parameter, a duration parameter, key identification information, identification information of a sensor that collects biological features, collection parameters, the operation object corresponding to the operation, the execution page description information, and the result page description information.
  • the operation category may include, but is not limited to, at least one of the following: clicking on the screen, sliding the screen, pressing a key, and collecting biological features.
  • for different operation categories, the collected operation description data may also differ, and the specific content of the operation description data can be set according to actual needs.
  • when the operation category includes clicking the screen, the operation description data may include at least one of the following: the coordinates of the click, the duration, the operation object corresponding to the operation, the execution page description information, and the result page description information;
  • when the operation category includes sliding the screen, the operation description data may include at least one of the following: the starting coordinates of the slide, the ending coordinates of the slide, the duration, the operation object corresponding to the operation, the execution page description information, and the result page description information;
  • when the operation category includes pressing a key, the operation description data may include at least one of the following: key identification information for identifying the pressed key, the duration, the operation object corresponding to the operation, the execution page description information, and the result page description information;
  • when the operation category includes collecting biological features, the operation description data may include at least one of the following: sensor identification
  • the operation category may be acquired based on a screen touch signal, a key touch signal, or a system sensor call signal; and/or,
  • the coordinate parameters may be obtained based on a screen touch signal; and/or,
  • the duration parameter may be obtained based on a screen touch signal; and/or,
  • the key identification information may be obtained based on a key touch signal; and/or,
  • the identification information and the acquisition parameters of the sensors that collect biological features may be obtained based on the system sensor call signal; and/or,
  • the operation object and the execution page description information corresponding to the operation can be obtained, based on image recognition technology, from the execution-time valid frame image corresponding to the operation, or from the pre-execution valid frame image corresponding to the operation combined with the coordinate parameters. For example, the execution-time valid frame image often shows the area currently being operated with a visual effect visible to the naked eye (such as which function button is clicked or which slider is slid), so the operation object corresponding to the current operation can be identified from that image, and the execution page description information corresponding to the current operation (such as the text on the function button or the text description around the slider) can then be identified. The same can be done based on the pre-execution valid frame image; the only difference is that the pre-execution valid frame image needs to be combined with the coordinate parameters of the operation to determine which operation object the user is operating, after which the description information on or around that operation object in the execution page is identified; and/or,
  • the description information of the result page corresponding to the operation can be obtained from the valid frame image after execution corresponding to the operation based on image recognition technology.
  • the page description information can be identified in the valid frame image after execution corresponding to the operation based on the image recognition technology.
  • a result description keyword may be further identified in the identified page description information according to an algorithm obtained by machine learning as the result page description information.
  • the method may further include at least one of the following:
  • the execution condition corresponding to the operation set can be set, and the execution condition may include execution time, pre-event or remote trigger, etc., thereby realizing more flexible shortcut triggering.
  • the operation set can also be sent to other terminal devices, so as to realize remote control guidance to other terminal devices.
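  • The execution conditions mentioned above (execution time, pre-event, remote trigger) could be checked along the following lines. This is only a sketch: the class, its fields, and the rule that any one satisfied condition is sufficient are assumptions for illustration:

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ExecutionCondition:
    execute_at: Optional[float] = None  # scheduled time, epoch seconds
    pre_event: Optional[str] = None     # name of an event that must have occurred
    remote_trigger: bool = False        # whether a remote request may trigger it


def condition_met(cond, now, events=frozenset(), remote_request=False):
    """Return True if any configured execution condition is satisfied."""
    if cond.execute_at is not None and now >= cond.execute_at:
        return True
    if cond.pre_event is not None and cond.pre_event in events:
        return True
    if cond.remote_trigger and remote_request:
        return True
    return False
```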
  • FIG. 3 is a flowchart of the method for executing an operation set. As shown in FIG. 3 , the process includes the following steps:
  • Step S302 in the case of receiving the operation set execution request corresponding to the operation set or judging that the execution condition corresponding to the operation set is satisfied, obtain the operation set, wherein the operation set includes: operation information of one or more operations,
  • the operation information includes: sequence identification information for identifying an operation sequence of the operation in the one or more operations, and operation description data of the operation;
  • Step S304 according to the operation sequence identified by the sequence identification information, perform the one or more operations according to the operation description data.
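  • Step S304 amounts to replaying the recorded operations in order. The sketch below assumes each operation is a dictionary with a hypothetical "seq" sequence number and a "description" that carries a "category", and that the caller supplies one handler per category; none of these names come from the disclosure:

```python
def execute_operation_set(operation_set, handlers):
    """Perform each operation in sequence order by dispatching its
    operation description data to the handler for its category.
    Returns the sequence numbers in the order they were performed."""
    performed = []
    for op in sorted(operation_set, key=lambda o: o["seq"]):
        description = op["description"]
        handlers[description["category"]](description)
        performed.append(op["seq"])
    return performed
```

  • On a real device the handlers would inject touch or key events; here they are plain callables so the ordering logic can be exercised on its own.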
  • Through the above steps, in response to an operation set execution request, or when the execution condition is judged to be satisfied, the operation set can be obtained automatically and the one or more operations can be executed according to the operation description data, in the operation sequence identified by the sequence identification information. The terminal device can therefore automatically perform a series of operations according to the shortcut, which addresses the problem of how to simplify user operations on the terminal device and enables execution of the customized shortcut.
  • the execution subject of the above steps may be a terminal device or the like, but is not limited thereto.
  • the operation information may further include relevant frame images corresponding to the operation, wherein the relevant frame images include: a pre-execution valid frame image, an execution-time valid frame image, and a post-execution valid frame image.
  • the pre-execution valid frame image may be a screen image at a first predetermined time (e.g., 30 ms) before the operation is performed, and the post-execution valid frame image may be a screen image at a second predetermined time (e.g., 80 ms) after the operation is performed.
  • the post-execution valid frame image of the previous operation may be the same as the pre-execution valid frame image of the next operation; for operations that last for a certain period of time (for example, making a call), the post-execution valid frame image of the previous operation (the call page image after the call is made) and the pre-execution valid frame image of the next operation (the page image after the call is hung up) may be different.
  • the execution-time valid frame image may assist in acquiring the operation description data: image recognition may be performed on the execution-time valid frame image to obtain the complete operation description data of the operation.
  • step S304 may include:
  • step (2) may include the following processing:
  • when the current pre-execution screen image includes the operation object corresponding to the current operation and the position of that operation object is the same in the current pre-execution screen image and in the pre-execution valid frame image, the current operation is performed according to the operation description data of the current operation; and/or, when the position of the operation object corresponding to the current operation differs between the current pre-execution screen image and the pre-execution valid frame image, the operation description data of the current operation is adjusted according to the position of the operation object in the current pre-execution screen image, and the current operation is performed according to the adjusted operation description data.
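  • The branching described above can be sketched as a small decision function. It assumes the position of the operation object has already been located in both images (for example by image recognition); the function name, the return labels, the optional pixel tolerance, and the handling of a missing object are all illustrative assumptions:

```python
def plan_current_operation(recorded_pos, current_pos, tolerance=0):
    """Decide how to handle the current operation.

    recorded_pos: (x, y) of the operation object in the pre-execution valid frame image.
    current_pos:  (x, y) of the operation object in the current pre-execution
                  screen image, or None if it was not found there.
    """
    if current_pos is None:
        # The source text does not cover this branch here; label is an assumption.
        return "object_not_found"
    dx = abs(current_pos[0] - recorded_pos[0])
    dy = abs(current_pos[1] - recorded_pos[1])
    if dx <= tolerance and dy <= tolerance:
        return "execute_as_recorded"
    return "adjust_then_execute"
```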
  • the operation object corresponding to the current operation used in the above process may be included in the operation description data of the current operation, or obtained, based on image recognition technology, from the execution-time valid frame image corresponding to the current operation, or from the pre-execution valid frame image corresponding to the current operation combined with the coordinate parameters of the current operation.
  • step (1) may include at least one of the following:
  • the technology identifies whether the application icon of the bicycle APP is included in the screen image before the current execution;
  • identifying, based on image recognition technology, the page description information included in the current pre-execution screen image, matching the identified page description information against the execution page description information corresponding to the current operation, and determining according to the matching result whether the current pre-execution screen image includes the operation object corresponding to the current operation. For example, for an operation that starts a bicycle app, the execution page description information corresponding to the current operation can be "bicycle"; image recognition technology can be used to identify whether the word "bicycle" appears in the current pre-execution screen image, and if so, a match is determined.
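  • Once text has been recognized from the current pre-execution screen image, the matching step itself can be very simple. The sketch below assumes the recognized page text is already available as a string (the recognition step is out of scope) and uses a case-insensitive substring test, which is an illustrative choice rather than the matching rule of the disclosure:

```python
def page_matches(recognized_text, execution_page_description):
    """Return True if the execution page description information
    appears in the text recognized from the current screen image."""
    return execution_page_description.casefold() in recognized_text.casefold()
```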
  • the execution page description information corresponding to the current operation used in the above process may be included in the operation description data of the current operation, or obtained, based on image recognition technology, from the execution-time valid frame image corresponding to the current operation, or from the pre-execution valid frame image corresponding to the current operation combined with the coordinate parameters of the current operation.
  • adjusting the operation description data of the current operation according to the position of the operation object corresponding to the current operation in the screen image before the current execution may include one of the following:
  • the coordinate parameters in the operation description data of the current operation are replaced with the coordinate parameters of the operation object corresponding to the current operation in the current pre-execution screen image;
  • the coordinate parameters in the image, (x2, y2) are the coordinate parameters of the operation object in the screen image before the current execution, and (xd1, yd1) are the coordinate parameters in the operation description data before adjustment.
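  • The two adjustment options can be sketched as follows. The formula for the second option is not present in the text above, so the offset rule used here, (xd1 + (x2 - x1), yd1 + (y2 - y1)), with (x1, y1) taken to be the operation object's position in the pre-execution valid frame image, is an assumption, as are the function names and the "x"/"y" keys:

```python
def replace_coordinates(description, current_pos):
    """Option 1: use the operation object's current position directly."""
    adjusted = dict(description)
    adjusted["x"], adjusted["y"] = current_pos
    return adjusted


def shift_coordinates(description, recorded_pos, current_pos):
    """Option 2 (assumed rule): shift the recorded coordinates (xd1, yd1)
    by the displacement of the operation object from (x1, y1) to (x2, y2)."""
    dx = current_pos[0] - recorded_pos[0]
    dy = current_pos[1] - recorded_pos[1]
    adjusted = dict(description)
    adjusted["x"] = description["x"] + dx
    adjusted["y"] = description["y"] + dy
    return adjusted
```

  • Shifting rather than replacing preserves where inside the operation object the user originally touched, which may matter for wide controls such as sliders.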
  • the method further includes one of the following:
  • determining whether the current operation is successfully performed in step (3) may include at least one of the following:
  • the result page description information corresponding to the current operation used in the above process may be included in the operation description data of the current operation, or obtained, based on image recognition technology, from the post-execution valid frame image corresponding to the current operation.
  • the method may further include one of the following:
  • A. Receive the one or more operations on the terminal device, obtain operation information of each operation in the one or more operations, generate the operation set according to the operation information, and save the operation set;
  • Some embodiments of the present disclosure provide an apparatus for acquiring an operation set. The apparatus is used to implement the embodiments and preferred implementations of the above-mentioned method for acquiring an operation set; what has already been described will not be repeated.
  • the term "module" may be a combination of software and/or hardware that implements a predetermined function.
  • although the apparatus described in the following embodiments is preferably implemented in software, implementations in hardware, or in a combination of software and hardware, are also possible and contemplated.
  • Fig. 4 is a structural block diagram of an apparatus for obtaining an operation set. As shown in Fig. 4, the apparatus includes:
  • the first obtaining module 42 is configured to receive one or more operations on the terminal device, and obtain operation information of each of the one or more operations, wherein the operation information includes sequence identification information, used to identify the position of the operation in the sequence of the one or more operations, and operation description data of the operation;
  • the generating module 44 is configured to generate an operation set according to the operation information, wherein the operation set includes: the operation information of the one or more operations.
  • With the above apparatus, one or more operations on the terminal device can be automatically received, operation information of each operation in the one or more operations can be acquired, and an operation set can be generated according to the operation information. A series of user operations can thus be recorded automatically, so that the terminal device can automatically execute that series of operations according to the recorded operation set when triggered by the user or when the execution condition is met. This solves the problem of how to simplify user operations on the terminal device and realizes the recording of customized shortcuts.
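The recording side described above can be sketched roughly as follows. This is only an illustration under stated assumptions: the names `OperationInfo`, `OperationSet`, and `Recorder`, and the dictionary keys used for the operation description data, are hypothetical and do not appear in the disclosure.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class OperationInfo:
    sequence_id: int             # sequence identification information
    description: Dict[str, Any]  # operation description data (type, coordinates, ...)


@dataclass
class OperationSet:
    operations: List[OperationInfo] = field(default_factory=list)


class Recorder:
    """Hypothetical stand-in for the first obtaining module and the generating module."""

    def __init__(self) -> None:
        self._pending: List[OperationInfo] = []

    def receive(self, description: Dict[str, Any]) -> OperationInfo:
        # Number operations in the order they are received, starting at 1.
        info = OperationInfo(len(self._pending) + 1, dict(description))
        self._pending.append(info)
        return info

    def generate(self) -> OperationSet:
        # The operation set carries the operation information of every recorded operation.
        return OperationSet(operations=list(self._pending))


recorder = Recorder()
recorder.receive({"type": "tap", "x": 120, "y": 480})
recorder.receive({"type": "toggle", "target": "bluetooth"})
operation_set = recorder.generate()
```

Keeping the sequence identifier separate from the description data mirrors the two parts of the operation information named in the text; how the description data is actually encoded is left open by the source.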
  • the above-mentioned apparatus may be provided in a terminal device, but is not limited to this.
  • the above modules can be implemented by software or hardware; for the latter, this can be done in the following ways, without limitation: the above modules are all located in the same processor; or the above modules are located, in any combination, in different processors.
  • Some embodiments of the present disclosure provide an apparatus for executing an operation set. The apparatus is used to implement the embodiments and preferred implementations of the above-mentioned method for executing an operation set; what has already been described will not be repeated.
  • As used below, the term "module" may be a combination of software and/or hardware that implements a predetermined function.
  • Although the apparatus described in the following embodiments is preferably implemented in software, implementations in hardware, or in a combination of software and hardware, are also possible and contemplated.
  • Fig. 5 is a structural block diagram of an apparatus for executing an operation set. As shown in Fig. 5, the apparatus includes:
  • the second obtaining module 52 is configured to obtain the operation set when an operation set execution request corresponding to the operation set is received, or when it is judged that the execution condition corresponding to the operation set is satisfied, wherein the operation set includes: operation information of one or more operations, and the operation information includes: sequence identification information used to identify the order of the operation within the one or more operations, and operation description data of the operation;
  • the execution module 54 is configured to execute the one or more operations according to the operation description data, in the operation sequence identified by the sequence identification information.
  • the terminal device can automatically perform a series of operations according to the shortcut, which solves the problem of how to simplify user operations on the terminal device and realizes the execution of the customized shortcut.
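A minimal sketch of the execution side, assuming each operation is stored as a `(sequence id, description)` pair and that a caller-supplied `perform` callback carries out a single operation. The function name, the callback, and the two trigger flags are illustrative assumptions, not names from the disclosure.

```python
from typing import Any, Callable, Dict, List, Tuple

Operation = Tuple[int, Dict[str, Any]]  # (sequence id, operation description data)


def execute_operation_set(
    operations: List[Operation],
    perform: Callable[[Dict[str, Any]], None],
    requested: bool = False,
    condition_met: bool = False,
) -> bool:
    """Run the operations in sequence order if a request or an execution condition triggers it."""
    if not (requested or condition_met):
        return False
    for _, description in sorted(operations, key=lambda op: op[0]):
        perform(description)
    return True


log: List[Dict[str, Any]] = []
ran = execute_operation_set(
    [(2, {"type": "tap"}), (1, {"type": "launch"})],
    log.append,
    requested=True,
)
```

Sorting by the sequence identifier means the stored order of the set does not matter; only the identifiers determine the execution order.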
  • the above-mentioned apparatus may be provided in a terminal device, but is not limited to this.
  • the above modules can be implemented by software or hardware; for the latter, this can be done in the following ways, without limitation: the above modules are all located in the same processor; or the above modules are located, in any combination, in different processors.
  • Embodiments of the present disclosure also provide a computer-readable storage medium, where a computer program is stored in the computer-readable storage medium, wherein the computer program is configured to execute the steps in the method embodiments when running.
  • the above-mentioned computer-readable storage medium may include, but is not limited to, a USB flash drive, a read-only memory (ROM), a random access memory (RAM), a removable hard disk, a magnetic disk, an optical disc, and other media that can store computer programs.
  • Embodiments of the present disclosure also provide a terminal device, including a memory and a processor, where a computer program is stored in the memory, and the processor is configured to run the computer program to execute the steps in the above method embodiments.
  • the above-mentioned terminal device may further include a transmission device and an input/output device, wherein the transmission device is connected to the above-mentioned processor, and the above-mentioned input/output device is connected to the above-mentioned processor.
  • Image recognition technology refers to the use of computers to process, analyze and understand images in order to identify targets and objects of various patterns. It is a practical application of deep learning algorithms. The traditional image recognition process is divided into four steps: image acquisition → image preprocessing → feature extraction → image recognition. Through image recognition technology, the computer can identify the content of a photo.
  • the present exemplary embodiment obtains a screenshot of the user's operation process by means of a screen recording technology, and obtains relevant data of the user's operation by means of an image recognition technology.
  • the user's operation steps are recorded (during recording, user operation description data such as the click event set and the operation intervals are saved, together with key frame images from before, during, and after each operation) and stored to generate a sequence of operations in chronological order, from which the corresponding shortcut is generated.
  • When the shortcut is executed, the system executes the pre-stored set of events in chronological order, much like video playback.
  • When the shortcut performs each step of the user operation, the system uses image recognition technology to compare the image before the operation, the image during the operation, and the image after the operation with the corresponding pre-stored key frame images.
  • Specifically, the system identifies the image content before and after the operation through image recognition technology (for example: here is a switch, and what is the text on it; there is an icon, and which application does it belong to), compares it with the pre-stored key frame images, and judges from the comparison whether the preconditions for executing the current operation are satisfied, whether the current operation is abnormal, and whether the result after execution is normal.
  • If the images before the operation differ, so that the preconditions of the operation are not satisfied, the system can perform appropriate error correction based on image recognition technology. The system also judges the result of the operation; if it judges that the operation failed, the system can perform exception handling.
  • this shortcut can be sent to other users for use, or a timer can be added for regular execution to derive more powerful functions.
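The disclosure does not say how a shortcut is packaged for sending to another user. One simple possibility, shown here purely as an assumption, is to serialize the ordered operation list to JSON; `export_shortcut`, `import_shortcut`, and the `seq`/`name`/`operations` keys are hypothetical.

```python
import json
from typing import Any, Dict, List


def export_shortcut(name: str, operations: List[Dict[str, Any]]) -> str:
    """Serialize a shortcut, with its operations in sequence order, for sharing."""
    ordered = sorted(operations, key=lambda op: op["seq"])
    return json.dumps({"name": name, "operations": ordered}, ensure_ascii=False)


def import_shortcut(payload: str) -> Dict[str, Any]:
    """Rebuild the shortcut on the receiving device."""
    return json.loads(payload)


payload = export_shortcut(
    "scan bicycle",
    [
        {"seq": "0002", "type": "tap", "x": 300, "y": 900},
        {"seq": "0001", "type": "tap", "x": 120, "y": 480},
    ],
)
received = import_shortcut(payload)
```

Key frame images would need their own encoding (for example as separate files referenced from the JSON); that part is omitted here.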
  • This exemplary embodiment can be applied to a variety of scenarios, for example: one-tap navigation or one-tap bicycle scanning on a mobile phone; having a computer automatically open the required software and enter working mode after it is turned on; remote assistance (such as remotely setting an alarm clock), backup, and similar series of operations; scheduled clock-in; and so on.
  • This exemplary embodiment mainly involves four modules, which will be described in detail below.
  • Information collection module: mainly responsible for recording the screen, saving the key frame images before, during, and after each user operation, and collecting the data of each operation, for example the time and category of the operation (such as tapping the screen or pressing a volume key). For a screen tap, the coordinates are also recorded, along with whether it was a long press or a short press, and so on.
  • Storage module: used to store the data collected by the information collection module. Its data structure can be a list, similar to that shown in Table 1 below. Each operation is numbered in chronological order: the operation performed first is 0001, followed by 0002, and so on.
  • Image recognition module: according to the parameters of the operation, combined with the key frame images saved during the operation, it identifies the object the user operated on. Before each operation, it compares the current image with the pre-stored pre-operation key frame and determines whether the two frames differ, for example whether the operation object is still present and whether its position is unchanged; if there is a difference, it notifies the control module to correct the error. After each operation, it compares the current image with the pre-stored post-operation key frame and judges, from whether the two frames differ, whether the current operation succeeded; if the operation failed, it notifies the control module to handle the exception.
  • Control module: when the user starts recording, the control module puts the system into the initial screen-recording state, starts the information collection module to collect information, and then stores the collected information in the storage module. Once a shortcut is launched, it executes the pre-stored operation steps in chronological order with the relevant parameters, starts the image recognition module to compare the current image data with the pre-stored image data in real time, and decides from the recognition result whether to perform error correction, retry, handle an exception, abort the operation, or prompt the user to choose the next action.
  • the information collection process includes the following steps:
  • the information collection module saves a frame of the current screenshot image and waits for user input;
  • the information acquisition module saves a frame of the image during the operation, and stores the coordinates, time, type and other parameters of the operation;
  • the control module generates a shortcut.
  • the shortcut execution process includes the following steps:
  • the new coordinates of the "bicycle" application icon can be identified, and then the control module stores the new coordinates in the corresponding place of the storage module, and performs operation 0001 according to the new coordinates;
  • if the "bicycle" icon is gone, the control module prompts the user that the operation has failed;
  • if step (3) is successfully executed, steps (1) to (3) are repeated for the operation whose serial number is 0002, and so on until all operations are completed or the process terminates abnormally.
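The per-operation loop described by these steps (check the precondition, correct if needed, perform, check the result, move to the next serial number) can be sketched as below. The four callbacks stand in for the image recognition and control modules and are assumptions for illustration; the disclosure does not define such an interface.

```python
from typing import Any, Callable, Dict, List, Optional

Step = Dict[str, Any]


def run_shortcut(
    steps: List[Step],
    precondition_ok: Callable[[Step], bool],
    correct: Callable[[Step], Optional[Step]],
    perform: Callable[[Step], None],
    result_ok: Callable[[Step], bool],
) -> str:
    """Return "completed", or "aborted at <serial number>" when a step cannot proceed."""
    for step in sorted(steps, key=lambda s: s["seq"]):
        if not precondition_ok(step):
            fixed = correct(step)   # e.g. look up the icon's new coordinates
            if fixed is None:       # e.g. the icon is gone
                return f"aborted at {step['seq']}"
            step = fixed
        perform(step)
        if not result_ok(step):
            return f"aborted at {step['seq']}"
    return "completed"


steps = [
    {"seq": "0002", "action": "tap", "x": 40, "y": 90},
    {"seq": "0001", "action": "tap", "x": 10, "y": 20},
]
performed = []
status = run_shortcut(
    steps,
    precondition_ok=lambda s: s["seq"] != "0001",  # pretend the icon for 0001 moved
    correct=lambda s: {**s, "x": 15, "y": 25},     # pretend recognition found it again
    perform=lambda s: performed.append((s["seq"], s["x"], s["y"])),
    result_ok=lambda s: True,
)
```

A fuller version might retry a failed step or ask the user how to continue, as the control module description allows; this sketch simply aborts.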
  • error correction may be needed because the position of the "bicycle" app on the desktop has changed.
  • Using image recognition technology, it is possible to identify which area of the current screenshot is the icon of the "bicycle" app, or to identify (or additionally identify) the two Chinese characters for "bicycle", and thereby obtain the position of the "bicycle" app in the screenshot. The new position of the "bicycle" app in the screenshot gives the new coordinates for the screen click operation.
  • Figure 6 is a schematic diagram of the movement of the application coordinates. The left picture shows the position of the "bicycle" application when the user recorded the shortcut; the user later moved the "bicycle" application for some reason, to the position shown on the right in Figure 6.
  • Taking the upper left corner of the screenshot as (0, 0), the position of the "bicycle" application is (x1, y1) in the left picture of Figure 6 and (x2, y2) in the right picture. Since the icon has a certain width and height, it is recommended to take the coordinates of the center point of the icon.
  • In the usual case, the coordinates in the screenshot and the coordinates of the click on the screen are in one-to-one correspondence; that is, the coordinates of the "bicycle" icon in the screenshot are the coordinates of the user's click on the screen. In that case, (x2, y2) is the new click coordinate of "bicycle";
  • a special case is that the coordinates in the screenshot are not the coordinates of the click on the screen. Suppose the coordinates of "bicycle" in the left picture of Figure 6 are (x1, y1), the coordinates of the user's click are (xd1, yd1), and the coordinates of "bicycle" in the right picture of Figure 6 are (x2, y2); then the new click coordinates (xd2, yd2) can be calculated by the following algorithm:
  • (x1, y1), (x2, y2) are the coordinates obtained by the image recognition technology in the screenshot, and (xd1, yd1) are the coordinates of the user's click on the screen collected by the information collection module when the user is recording.
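The formula itself is not reproduced in this text. One plausible reading, stated here as an assumption rather than as the disclosure's algorithm, is that the click keeps the same offset from the icon that it had during recording, so that xd2 = x2 + (xd1 - x1) and yd2 = y2 + (yd1 - y1). The helper names below are hypothetical.

```python
from typing import Tuple

Point = Tuple[float, float]


def icon_center(left: float, top: float, width: float, height: float) -> Point:
    """Center of an icon's bounding box, with (0, 0) at the top-left of the screenshot."""
    return (left + width / 2, top + height / 2)


def corrected_click(old_icon: Point, old_click: Point, new_icon: Point) -> Point:
    """Assumed rule: preserve the recorded offset between the click and the icon."""
    dx = old_click[0] - old_icon[0]
    dy = old_click[1] - old_icon[1]
    return (new_icon[0] + dx, new_icon[1] + dy)


# (x1, y1) = (100, 200), (xd1, yd1) = (110, 215), (x2, y2) = (300, 400)
new_click = corrected_click((100, 200), (110, 215), (300, 400))
```

When the screenshot and click coordinates coincide, the offset is zero and the result reduces to (x2, y2), which matches the one-to-one case described earlier. If the screenshot and the touch surface use different scales, a scaling factor would also be needed; that is outside this sketch.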
  • FIG. 7 is a schematic diagram of different switch states. As shown in FIG. 7 , the image on the left is an image with the switch off, and the image on the right is an image with the switch on. The difference between the two is obvious.
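The disclosure relies on image recognition to tell such states apart and does not specify the comparison. As a deliberately simple stand-in, the sketch below compares two equally sized grayscale frames pixel by pixel; the function names, the `tolerance` of 16 gray levels, and the 5% threshold are all illustrative assumptions.

```python
from typing import List

Image = List[List[int]]  # grayscale pixels, 0-255, as rows


def difference_ratio(a: Image, b: Image, tolerance: int = 16) -> float:
    """Fraction of pixels whose values differ by more than `tolerance`."""
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("images must have the same dimensions")
    total = sum(len(row) for row in a)
    if total == 0:
        return 0.0
    changed = sum(
        1
        for ra, rb in zip(a, b)
        for pa, pb in zip(ra, rb)
        if abs(pa - pb) > tolerance
    )
    return changed / total


def frames_match(a: Image, b: Image, max_ratio: float = 0.05) -> bool:
    """Treat two frames as the same state when few enough pixels changed."""
    return difference_ratio(a, b) <= max_ratio


switch_off = [[200, 50, 50, 50]]  # knob (bright) on the left
switch_on = [[50, 50, 50, 200]]   # knob on the right
ratio = difference_ratio(switch_off, switch_on)
```

A real implementation would more likely crop to the switch region and use a recognition model or feature matching, since raw pixel differences are sensitive to themes, animations, and unrelated screen changes.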
  • the embodiments of the present disclosure allow users to customize shortcuts, and since the image recognition technology is combined, errors can be corrected during the execution of the shortcuts, and the execution result can be judged.
  • the above modules or steps of the present disclosure can be implemented by a general-purpose computing device; they can be centralized on a single computing device or distributed over a network composed of multiple computing devices; they can be implemented in program code executable by a computing device, so that they can be stored in a storage device and executed by the computing device; in some cases, the steps shown or described can be performed in an order different from that given here; or they can each be made into an individual integrated circuit module, or multiple modules or steps among them can be made into a single integrated circuit module.
  • the present disclosure is not limited to any particular combination of hardware and software.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Computation (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Environmental & Geological Engineering (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The present invention provides operation set obtaining and executing methods and apparatuses, a storage medium, and a terminal device. The obtaining method comprises: receiving one or more operations for the terminal device, and obtaining operation information of each operation in the one or more operations, wherein the operation information comprises: order identification information for identifying an operation order of an operation in the one or more operations, and operation description data of the operation; and generating an operation set according to the operation information, wherein the operation set comprises: the operation information of the one or more operations. According to the present invention, recording of a series of operations of a user is achieved, so that the terminal device can automatically execute a series of operations according to the recorded operation set in the case of a user trigger or satisfying execution conditions, and the problem of how to simplify user operations on the terminal device is solved.

Description

Method and apparatus for obtaining and executing operation set, storage medium and terminal device
Technical Field
The present disclosure relates to the field of communications, and in particular, to a method and apparatus for acquiring and executing an operation set, a storage medium, and a terminal device.
Background
As the functions of terminal devices (for example, mobile phones, tablet computers, notebook computers, personal computers (PCs), and so on) improve, operating them is becoming more and more complicated.
To give a simple example, a user who wants to use a shared bicycle service must perform a series of operations in succession: run the shared bicycle software, or run multi-service software that includes a shared bicycle function and tap to enter the bicycle service; turn on mobile data; turn on location services; turn on Bluetooth; and then tap to scan a bicycle. Only then does the phone enter the bicycle-scanning state. Such operations are too cumbersome for users, and too complicated for groups such as the elderly.
With the emergence of more and more intelligent services, and as user needs become increasingly diverse and complex, the user operations that must be performed on terminal devices to meet those needs are becoming more and more complicated. How to simplify user operations on terminal devices is a problem that urgently needs to be solved.
Summary
Embodiments of the present disclosure provide a method and apparatus for acquiring and executing an operation set, a storage medium, and a terminal device, so as to at least solve the problem of how to simplify user operations on the terminal device.
According to some embodiments of the present disclosure, a method for obtaining an operation set is provided, including: receiving one or more operations on a terminal device, and obtaining operation information of each of the one or more operations, wherein the operation information includes: sequence identification information used to identify the order of the operation within the one or more operations, and operation description data of the operation; and generating an operation set according to the operation information, wherein the operation set includes the operation information of the one or more operations.
According to some embodiments of the present disclosure, a method for executing an operation set is provided, including: obtaining the operation set when an operation set execution request corresponding to the operation set is received, or when it is judged that the execution condition corresponding to the operation set is satisfied, wherein the operation set includes operation information of one or more operations, and the operation information includes: sequence identification information used to identify the order of the operation within the one or more operations, and operation description data of the operation; and executing the one or more operations according to the operation description data, in the operation sequence identified by the sequence identification information.
According to some embodiments of the present disclosure, an apparatus for obtaining an operation set is provided, including: a first obtaining module, configured to receive one or more operations on a terminal device, and obtain operation information of each of the one or more operations, wherein the operation information includes: sequence identification information used to identify the order of the operation within the one or more operations, and operation description data of the operation; and a generating module, configured to generate an operation set according to the operation information, wherein the operation set includes the operation information of the one or more operations.
According to some embodiments of the present disclosure, an apparatus for executing an operation set is provided, including: a second obtaining module, configured to obtain the operation set when an operation set execution request corresponding to the operation set is received, or when it is judged that the execution condition corresponding to the operation set is satisfied, wherein the operation set includes operation information of one or more operations, and the operation information includes: sequence identification information used to identify the order of the operation within the one or more operations, and operation description data of the operation; and an execution module, configured to execute the one or more operations according to the operation description data, in the operation sequence identified by the sequence identification information.
According to some embodiments of the present disclosure, a computer-readable storage medium is also provided, in which a computer program is stored, wherein the computer program is configured to execute, when run, the steps in any one of the above method embodiments.
According to some embodiments of the present disclosure, a terminal device is also provided, including a memory and a processor, wherein the memory stores a computer program, and the processor is configured to run the computer program to execute the steps in any one of the above method embodiments.
Through the embodiments of the present disclosure, one or more operations on the terminal device can be automatically received, the operation information of each of the one or more operations can be acquired, and an operation set can be generated according to the operation information. A series of user operations can thus be recorded automatically, so that the terminal device can automatically execute that series of operations according to the recorded operation set when triggered by the user or when the execution condition is met. This solves the problem of how to simplify user operations on the terminal device and realizes the recording and execution of customized shortcuts.
Description of Drawings
Fig. 1 is a block diagram of the hardware structure of a mobile terminal for a method for acquiring and executing an operation set;
Fig. 2 is a flowchart of the method for acquiring an operation set;
Fig. 3 is a flowchart of the method for executing an operation set;
Fig. 4 is a structural block diagram of the apparatus for acquiring an operation set;
Fig. 5 is a structural block diagram of the apparatus for executing an operation set;
Fig. 6 is a schematic diagram of the movement of the application coordinates;
Fig. 7 is a schematic diagram of different switch states.
Detailed Description
Faced with the problem of how to simplify user operations on a terminal device, one option is to simplify them to some extent by setting up shortcuts. Currently, on a smartphone, a user who wants to set up a shortcut can long-press on the home screen, select "Add Widget" in the pop-up menu, find "Set Shortcut", and locate the desired setting item there to create the shortcut; this lets the user quickly invoke a particular setting item. On a PC, the more common way to set up a shortcut is to right-click and select "Send to Desktop Shortcut", which creates a shortcut on the desktop so that the user can quickly reach a location or run a program.
These shortcuts have the following disadvantages:
(1) Their functions are too simple. A shortcut on a PC either goes to a folder location or starts an application; on a smartphone, only a fixed number of setting items allow the user to set a shortcut. Because the feature is of little practical use, it is rarely used.
(2) They are too rigid. Whether on a PC or a smartphone, the shortcuts are preset at the factory. The user can only choose to use them or not; the function is fixed, and the user cannot customize what a shortcut does.
(3) They cannot be started on a schedule, nor can they be sent from user A to user B.
However, when using a terminal device, users often perform many complex and consecutive operations according to their personal habits. For example, after arriving at work and turning on the computer, a user may open the mailbox, a notepad, and various work-related tools one after another to get the computer into a working state. As another example, a user who wants to use a shared bicycle often needs to run the shared bicycle software, or run multi-service software that includes a shared bicycle function and tap to enter the bicycle service, then turn on mobile data, turn on location services, turn on Bluetooth, and then tap to scan a bicycle before the phone enters the bicycle-scanning state. Clearly, the shortcut techniques mentioned above cannot meet the needs of different users.
To solve the above problems, the embodiments of the present disclosure provide a solution that allows users to customize shortcuts according to their own habits and preferences. The solution can save the user's operation set and generate a shortcut that the user can invoke quickly later. In addition, the solution supports sending the shortcut to other users, or adding a timer for scheduled execution, to derive more powerful functions. The solution applies to a wide range of scenarios, such as: one-tap navigation or one-tap bicycle scanning on a mobile phone; having a computer automatically open the required software and enter working mode after it is turned on; remote assistance (such as remotely setting an alarm clock), backup, and similar series of operations; scheduled clock-in; and so on.
Embodiments of the present disclosure are described in detail below with reference to the accompanying drawings and in conjunction with some examples.
It should be noted that the terms "first", "second" and the like in the description and claims of the present disclosure and in the above drawings are used to distinguish similar objects, and are not necessarily used to describe a specific sequence or order.
The method embodiments provided in the embodiments of the present disclosure may be executed in a mobile terminal, a computer terminal, or a similar terminal device. Taking execution on a mobile terminal as an example, Fig. 1 is a block diagram of the hardware structure of a mobile terminal for a method for acquiring and executing an operation set. As shown in Fig. 1, the mobile terminal may include one or more processors 102 (only one is shown in Fig. 1; the processor 102 may include, but is not limited to, a processing device such as a microcontroller (MCU) or a programmable logic device (FPGA)) and a memory 104 for storing data. The mobile terminal may also include a transmission device 106 for communication functions and an input/output device 108. Those of ordinary skill in the art will understand that the structure shown in Fig. 1 is only schematic and does not limit the structure of the mobile terminal. For example, the mobile terminal may include more or fewer components than shown in Fig. 1, or have a configuration different from that shown in Fig. 1.
The memory 104 can be used to store computer programs, for example, software programs and modules of application software, such as the computer programs corresponding to the methods for acquiring and executing an operation set in the embodiments of the present disclosure. By running the computer programs stored in the memory 104, the processor 102 performs various functional applications and data processing, that is, implements the above methods. The memory 104 may include high-speed random access memory, and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some examples, the memory 104 may further include memory located remotely from the processor 102, and this remote memory may be connected to the mobile terminal through a network. Examples of such networks include, but are not limited to, the Internet, an intranet, a local area network, a mobile communication network, and combinations thereof.
The transmission device 106 is used to receive or send data via a network. A specific example of such a network is a wireless network provided by the communication provider of the mobile terminal. In one example, the transmission device 106 includes a network adapter (Network Interface Controller, NIC), which can be connected to other network devices through a base station so as to communicate with the Internet. In one example, the transmission device 106 may be a radio frequency (RF) module, which is used to communicate with the Internet wirelessly.
Some embodiments of the present disclosure provide a method for obtaining an operation set, which runs on the above terminal device. FIG. 2 is a flowchart of the method for obtaining an operation set. As shown in FIG. 2, the flow includes the following steps:
Step S202: receive one or more operations on the terminal device, and obtain operation information of each of the one or more operations, wherein the operation information includes: sequence identification information that identifies the position of the operation in the order of the one or more operations, and operation description data of the operation;
Step S204: generate an operation set according to the operation information, wherein the operation set includes the operation information of the one or more operations.
Through the above steps, the terminal device can automatically receive one or more operations, obtain the operation information of each of them, and generate an operation set from that information. A series of user operations can thus be recorded automatically, so that the terminal device can later execute the series from the recorded operation set when the user triggers it or when an execution condition is met. This addresses the problem of how to simplify user operations on a terminal device and realizes the recording of user-defined shortcuts.
The above steps may be executed by a terminal device or the like, but are not limited thereto.
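Steps S202 and S204 can be pictured with a small data model. The following is a minimal sketch under stated assumptions, not the disclosed implementation: the class names `OperationInfo` and `OperationSet`, the function `build_operation_set`, and the dictionary keys are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List, Optional


@dataclass
class OperationInfo:
    """Operation information for one recorded operation (hypothetical layout)."""
    sequence_number: int                # sequence identification information
    description: Dict[str, Any]         # operation description data
    timestamp_ms: Optional[int] = None  # optional operation time


@dataclass
class OperationSet:
    """An operation set: the operation information of one or more operations."""
    operations: List[OperationInfo] = field(default_factory=list)


def build_operation_set(raw_events: List[Dict[str, Any]]) -> OperationSet:
    """Number the received operations in arrival order and collect them."""
    ops = [
        OperationInfo(sequence_number=i, description=dict(event))
        for i, event in enumerate(raw_events, start=1)
    ]
    return OperationSet(operations=ops)


# Two made-up recorded operations: a tap followed by a swipe.
recorded = build_operation_set([
    {"category": "tap", "x": 120, "y": 640},
    {"category": "swipe", "start": (100, 900), "end": (100, 300)},
])
```

Here the sequence number alone carries the order; an implementation could equally rely on the operation time, which the disclosure lists as an alternative.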
In at least one exemplary embodiment, the operation information may further include the relevant frame images corresponding to the operation, wherein the relevant frame images include: a pre-execution valid frame image, an execution-time valid frame image, and a post-execution valid frame image. The pre-execution valid frame image may be the screen image at a first predetermined time (for example, 30 ms) before the operation is executed, and the post-execution valid frame image may be the screen image at a second predetermined time (for example, 80 ms) after the operation is executed. For some transient operations (for example, clicking a web page or entering a function page in an app), the post-execution valid frame image of one operation may be the same as the pre-execution valid frame image of the next operation. For operations whose effect lasts for some time (for example, making a phone call), however, the post-execution valid frame image of one operation (the call page after dialing) and the pre-execution valid frame image of the next operation (the page after the call is hung up) may differ.
In some exemplary embodiments, the execution-time valid frame image can assist in obtaining the operation description data: when complete operation description data cannot be obtained by reading the operation directly from the system, the operation description data can be obtained by performing image recognition on the execution-time valid frame image.
In at least one exemplary embodiment, step S202 may include the following operations:
receiving an operation set collection request;
in response to the received operation set collection request, obtaining, by the terminal device through a screen recording function or a screenshot function, the relevant frame images corresponding to each of the one or more operations, and collecting the sequence identification information and the operation description data of each of the one or more operations, until a collection end indication is received. In some exemplary embodiments, the terminal device may also, in response to the received operation set collection request, first control its screen to display an initial page, and then obtain the relevant frame images and collect the sequence identification information and the operation description data in the same way, until a collection end indication is received.
With the above solution, the user can start the shortcut recording process by issuing an operation set collection request (for example, by tapping a record-shortcut control on the operation interface). The terminal device can obtain the relevant frame images of each operation by screen recording or screenshots, and can obtain the sequence identification information and the operation description data of each of the one or more operations either by reading them directly from the system, or by reading from the system combined with image recognition on the execution-time valid frame image.
In at least one exemplary embodiment, the sequence identification information that identifies the position of the operation in the order of the one or more operations may include at least one of the following: the operation time of the operation, and the sequence number of the operation among the one or more operations.
In at least one exemplary embodiment, the operation description data of the operation may include at least one of the following: an operation category, a coordinate parameter, a duration parameter, key identification information, identification information of the sensor that collects a biometric feature, collection parameters for collecting the biometric feature, the operation object corresponding to the operation, execution page description information, and result page description information. The operation category may include, but is not limited to, at least one of the following: tapping the screen, swiping the screen, pressing a key, and collecting a biometric feature.
The collected operation description data may differ with the operation category, and its specific content can be set according to actual needs. For example:
where the operation category includes tapping the screen, the operation description data may include at least one of the following: the coordinates of the tap, the duration, the operation object corresponding to the operation, execution page description information, and result page description information;
where the operation category includes swiping the screen, the operation description data may include at least one of the following: the start coordinates of the swipe, the end coordinates of the swipe, the duration, the operation object corresponding to the operation, execution page description information, and result page description information;
where the operation category includes pressing a key, the operation description data may include at least one of the following: key identification information that identifies the pressed key, the duration, the operation object corresponding to the operation, execution page description information, and result page description information;
where the operation category includes collecting a biometric feature, the operation description data may include at least one of the following: sensor identification information and collection parameters of the sensor used to collect the biometric feature, the operation object corresponding to the operation, execution page description information, and result page description information.
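The per-category contents listed above can be written down as a lookup table. This is a sketch only; the category labels and field names below are hypothetical identifiers invented for illustration, and the disclosure itself leaves the exact content to be set according to actual needs.

```python
from typing import Any, Dict, List

# Fields that the text lists for every operation category.
COMMON_FIELDS = {
    "operation_object",
    "execution_page_description",
    "result_page_description",
}

# Hypothetical field names for the category-specific description data.
CATEGORY_FIELDS = {
    "tap": {"coordinates", "duration"} | COMMON_FIELDS,
    "swipe": {"start_coordinates", "end_coordinates", "duration"} | COMMON_FIELDS,
    "key_press": {"key_id", "duration"} | COMMON_FIELDS,
    "biometric": {"sensor_id", "collection_parameters"} | COMMON_FIELDS,
}


def unexpected_fields(category: str, description: Dict[str, Any]) -> List[str]:
    """Return the description keys that are not listed for this category."""
    allowed = CATEGORY_FIELDS[category]
    return sorted(set(description) - allowed)
```

A recorder could use such a table to reject malformed entries, for instance a tap that carries a key identifier.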
In at least one exemplary embodiment, the operation category may be obtained from a screen touch signal, a key touch signal, or a system sensor call signal; and/or,
the coordinate parameter may be obtained from a screen touch signal; and/or,
the duration parameter may be obtained from a screen touch signal; and/or,
the key identification information may be obtained from a key touch signal; and/or,
the identification information and the collection parameters of the sensor that collects a biometric feature may be obtained from a system sensor call signal; and/or,
the operation object and the execution page description information corresponding to the operation may be obtained by image recognition from the execution-time valid frame image of the operation, or by image recognition from the pre-execution valid frame image of the operation combined with the coordinate parameter. For example, the execution-time valid frame image usually shows the area currently being operated with a visible effect, such as which function button is tapped or which slider is moved, so image recognition on that image can identify the operation object of the current operation and, further, the execution page description information of the current operation, such as the text on a function button or the caption next to a slider. The same process can be based on the pre-execution valid frame image; the only difference is that this image must be combined with the coordinate parameter of the operation to determine which operation object the user is operating, after which the execution page description information on or around that operation object can be identified; and/or,
the result page description information corresponding to the operation may be obtained by image recognition from the post-execution valid frame image of the operation. For example, page description information recognized in the post-execution valid frame image may be used as the result page description information; preferably, a result description keyword may further be identified within the recognized page description information by an algorithm obtained through machine learning, and used as the result page description information.
In at least one exemplary embodiment, after step S204, the method may further include at least one of the following:
A. saving the operation set;
B. setting an execution condition corresponding to the operation set, and then saving the operation set together with that execution condition;
C. sending the operation set;
D. setting an execution condition corresponding to the operation set, and then sending the operation set together with that execution condition.
In this way, an execution condition can be set for the operation set. The execution condition may include an execution time, a preceding event, a remote trigger, and so on, which allows more flexible triggering of the shortcut. The operation set can also be sent to other terminal devices, so that those devices can be guided or controlled remotely.
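One way to represent the execution condition is sketched below. The text names an execution time, a preceding event, and a remote trigger but does not say how they combine; this sketch assumes that any one satisfied trigger is enough. The names `ExecutionCondition` and `condition_met` and the event string are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional, Set


@dataclass
class ExecutionCondition:
    """Hypothetical execution condition attached to a saved operation set."""
    execute_at: Optional[float] = None   # scheduled time, seconds since epoch
    pre_event: Optional[str] = None      # name of a preceding event
    allow_remote_trigger: bool = False   # whether a remote request may start it


def condition_met(
    cond: ExecutionCondition,
    now: float,
    seen_events: Set[str],
    remote_requested: bool = False,
) -> bool:
    """Assumption: the condition is met when any configured trigger fires."""
    if cond.execute_at is not None and now >= cond.execute_at:
        return True
    if cond.pre_event is not None and cond.pre_event in seen_events:
        return True
    return cond.allow_remote_trigger and remote_requested
```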
Some embodiments of the present disclosure provide a method for executing an operation set, which runs on the above terminal device. FIG. 3 is a flowchart of the method for executing an operation set. As shown in FIG. 3, the flow includes the following steps:
Step S302: when an operation set execution request corresponding to an operation set is received, or when it is determined that the execution condition corresponding to the operation set is met, obtain the operation set, wherein the operation set includes operation information of one or more operations, and the operation information includes: sequence identification information that identifies the position of the operation in the order of the one or more operations, and operation description data of the operation;
Step S304: execute the one or more operations according to the operation description data, in the order identified by the sequence identification information.
Through the above steps, the terminal device can automatically respond to an operation set execution request, or to a determination that the execution condition is met, by obtaining the operation set and executing the one or more operations according to the operation description data in the order identified by the sequence identification information. The terminal device can thus execute a series of operations automatically from a shortcut, which addresses the problem of how to simplify user operations on a terminal device and realizes the execution of user-defined shortcuts.
The above steps may be executed by a terminal device or the like, but are not limited thereto.
In at least one exemplary embodiment, the operation information may further include the relevant frame images corresponding to the operation, wherein the relevant frame images include: a pre-execution valid frame image, an execution-time valid frame image, and a post-execution valid frame image. The pre-execution valid frame image may be the screen image at a first predetermined time (for example, 30 ms) before the operation is executed, and the post-execution valid frame image may be the screen image at a second predetermined time (for example, 80 ms) after the operation is executed. For some transient operations (for example, clicking a web page or entering a function page in an app), the post-execution valid frame image of one operation may be the same as the pre-execution valid frame image of the next operation. For operations whose effect lasts for some time (for example, making a phone call), however, the post-execution valid frame image of one operation (the call page after dialing) and the pre-execution valid frame image of the next operation (the page after the call is hung up) may differ.
In some exemplary embodiments, the execution-time valid frame image can assist in obtaining the operation description data: when the operation description data is incomplete and the operation therefore cannot be executed accurately, the complete operation description data can be obtained by performing image recognition on the execution-time valid frame image.
In at least one exemplary embodiment, step S304 may include:
(1) determining the current operation to be executed according to the sequence identification information;
(2) determining, from the current pre-execution screen image and the pre-execution valid frame image corresponding to the current operation, whether the precondition for executing the current operation is met, and executing the current operation if it is met;
(3) determining whether the current operation was executed successfully, and if so, continuing to determine and execute the next current operation, until the one or more operations have all been executed.
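The three sub-steps form a simple loop. The sketch below assumes the precondition check, the actual execution, and the success check are supplied as callbacks; the function name `run_operation_set`, the callback names, and the `sequence_number` key are hypothetical. It stops at the first failure, which is only one of the failure-handling options the disclosure mentions.

```python
from typing import Any, Callable, Dict, Iterable

Operation = Dict[str, Any]


def run_operation_set(
    operations: Iterable[Operation],
    precondition_ok: Callable[[Operation], bool],
    perform: Callable[[Operation], None],
    succeeded: Callable[[Operation], bool],
) -> bool:
    """Execute operations in sequence order; return True if all succeed."""
    # (1) pick the next current operation by its sequence identification
    for op in sorted(operations, key=lambda o: o["sequence_number"]):
        # (2) check the precondition, then execute the current operation
        if not precondition_ok(op):
            return False
        perform(op)
        # (3) verify the result before moving on to the next operation
        if not succeeded(op):
            return False
    return True
```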
In at least one exemplary embodiment, step (2) may include the following processing:
determining whether the current pre-execution screen image includes the operation object corresponding to the current operation;
if the current pre-execution screen image includes the operation object corresponding to the current operation: where the position of that operation object is the same in the current pre-execution screen image and in the pre-execution valid frame image, executing the current operation according to the operation description data of the current operation; and/or, where the position of that operation object differs between the current pre-execution screen image and the pre-execution valid frame image, adjusting the operation description data of the current operation according to the position of the operation object in the current pre-execution screen image, and executing the current operation according to the adjusted operation description data.
In at least one exemplary embodiment, the operation object corresponding to the current operation that is relied on in the above process may be included in the operation description data of the current operation, or may be obtained by image recognition from the execution-time valid frame image corresponding to the current operation, or by image recognition from the pre-execution valid frame image corresponding to the current operation combined with the coordinate parameter of the current operation.
Through the above process, a pre-check can be performed before each operation is executed, to decide whether to execute the operation directly or to execute it after adjustment and correction.
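The outcome of this pre-check can be reduced to a three-way decision. The sketch below assumes image recognition has already located the operation object (or failed to); the function name `precheck` and the result labels are hypothetical.

```python
from typing import Optional, Tuple

Point = Tuple[int, int]


def precheck(recorded_pos: Point, current_pos: Optional[Point]) -> str:
    """Decide how to proceed before executing the current operation.

    recorded_pos: object position in the pre-execution valid frame image.
    current_pos:  object position in the current pre-execution screen image,
                  or None if the object was not found on the screen.
    """
    if current_pos is None:
        return "missing"   # object absent: fall back to failure handling
    if current_pos == recorded_pos:
        return "execute"   # same position: use the recorded description data
    return "adjust"        # position changed: adjust the data, then execute
```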
In at least one exemplary embodiment, step (1) may include at least one of the following:
identifying, by image recognition, whether the current pre-execution screen image includes the icon of the operation object corresponding to the current operation (icons may include application icons, in-app control icons, and other icons on which the user can perform operations, including but not limited to icons that can be tapped, swiped, or operated on the screen in other ways), and determining from the recognition result whether the current pre-execution screen image includes the operation object corresponding to the current operation. For example, for the operation of launching a bicycle app, image recognition can be used to identify whether the current pre-execution screen image includes the application icon of the bicycle app;
identifying, by image recognition, the page description information included in the current pre-execution screen image, matching the recognized page description information against the execution page description information corresponding to the current operation, and determining from the matching result whether the current pre-execution screen image includes the operation object corresponding to the current operation. For example, for the operation of launching a bicycle app, the execution page description information corresponding to the current operation may be "单车" (Chinese for "bicycle"); image recognition can be used to identify whether the text "单车" appears in the current pre-execution screen image, and if so, a match is determined. In addition, if image recognition finds the text "Bike", "Bicycle", or "Danche" in the current pre-execution screen image, this is also treated as a match. The same app may exist in versions for different languages, and the languages correspond to one another; matching across multiple language versions of an app can be based on this correspondence. This avoids the situation where an operation recorded on a Chinese app cannot be recognized because the executing terminal runs an English system and shows the app under an English name, which makes the method more intelligent.
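The cross-language matching in the example can be sketched with an alias table. The table `ALIASES` and the function `page_text_matches` are hypothetical, and the recognized text is assumed to be already available as a string; how the text is recognized and how the language correspondence is built are outside this sketch.

```python
from typing import Dict, Set

# Hypothetical alias table built from the example in the text.
ALIASES: Dict[str, Set[str]] = {
    "单车": {"单车", "bike", "bicycle", "danche"},
}


def page_text_matches(recognized_text: str, expected: str) -> bool:
    """Return True if the recognized page text contains the expected
    description or any of its known aliases (case-insensitive)."""
    candidates = ALIASES.get(expected, {expected})
    text = recognized_text.lower()
    return any(alias.lower() in text for alias in candidates)
```

With this table, a description recorded as "单车" on a Chinese-language system would still match a screen that shows "Bike" on an English-language system.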
In at least one exemplary embodiment, the operation object corresponding to the current operation that is relied on in the above process may be included in the operation description data of the current operation, or may be obtained by image recognition from the execution-time valid frame image corresponding to the current operation, or by image recognition from the pre-execution valid frame image corresponding to the current operation combined with the coordinate parameter of the current operation.
In at least one exemplary embodiment, the execution page description information corresponding to the current operation that is relied on in the above process may be included in the operation description data of the current operation, or may be obtained by image recognition from the execution-time valid frame image corresponding to the current operation, or by image recognition from the pre-execution valid frame image corresponding to the current operation combined with the coordinate parameter of the current operation.
In at least one exemplary embodiment, adjusting the operation description data of the current operation according to the position of the operation object corresponding to the current operation in the current pre-execution screen image may include one of the following:
where the coordinate parameter of the operation object in the current pre-execution screen image is consistent with the coordinate parameter of the tap on the operation object on the screen, replacing the coordinate parameter in the operation description data of the current operation with the coordinate parameter of the operation object in the current pre-execution screen image;
where the coordinate parameter of the operation object in the current pre-execution screen image is not consistent with the coordinate parameter of the tap on the operation object on the screen, determining the coordinate parameter (xd2, yd2) for tapping the operation object at its changed position according to the following formula: (x1, y1)/(xd1, yd1) = (x2, y2)/(xd2, yd2), where (x1, y1) is the coordinate parameter of the operation object in the pre-execution valid frame image, (x2, y2) is the coordinate parameter of the operation object in the current pre-execution screen image, and (xd1, yd1) is the coordinate parameter in the operation description data before adjustment.
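The formula divides one coordinate pair by another without saying how. The sketch below assumes it is meant per component, that is, x1/xd1 = x2/xd2 and y1/yd1 = y2/yd2, which gives xd2 = xd1 * x2 / x1 and yd2 = yd1 * y2 / y1. That reading, and the function name `adjust_click`, are assumptions made for illustration.

```python
from typing import Tuple

Point = Tuple[float, float]


def adjust_click(recorded_obj: Point, current_obj: Point, recorded_click: Point) -> Point:
    """Scale the recorded click coordinates to the object's new position.

    recorded_obj:   (x1, y1), object position in the pre-execution valid frame
    current_obj:    (x2, y2), object position in the current screen image
    recorded_click: (xd1, yd1), click coordinates before adjustment
    Returns (xd2, yd2) under the assumed component-wise reading.
    """
    x1, y1 = recorded_obj
    x2, y2 = current_obj
    xd1, yd1 = recorded_click
    if x1 == 0 or y1 == 0:
        # The ratio is undefined when a recorded coordinate is zero.
        raise ValueError("recorded object coordinates must be non-zero")
    return (xd1 * x2 / x1, yd1 * y2 / y1)
```

For example, if the object moved from (100, 200) to (200, 400) and the recorded click was at (110, 220), the adjusted click is at (220, 440).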
In at least one exemplary embodiment, where the current pre-execution screen image does not include the operation object corresponding to the current operation, the method further includes one of the following:
A. confirming that execution of the one or more operations has failed;
B. going back and repeating the operation that precedes the current operation;
C. prompting the user to carry out the remaining unfinished operations of the one or more operations;
D. prompting the user to perform the current operation, and after the current operation is completed, continuing to execute the unfinished operations of the one or more operations according to the operation description data, in the order identified by the sequence identification information.
With this solution, if the pre-check before an operation finds that the icon is absent and the operation cannot be performed automatically, the method can retry, hand over to the user to perform the current operation or all subsequent operations, or declare failure.
In at least one exemplary embodiment, determining in step (3) whether the current operation was executed successfully may include at least one of the following:
identifying, by image recognition, whether the current post-execution screen image is consistent with the post-execution valid frame image corresponding to the current operation, and determining that the current operation was executed successfully if they are consistent;
identifying, by image recognition, whether the current post-execution screen image includes the icon of the operation object corresponding to the operation that follows the current operation, and determining that the current operation was executed successfully if it does;
identifying, by image recognition, the page description information included in the current post-execution screen image, determining whether the recognized page description information matches the result page description information corresponding to the current operation, and determining that the current operation was executed successfully if they match.
With this method, a check can be made after each operation is executed, which gives more effective and accurate control of the operation flow.
In at least one exemplary embodiment, the result page description information corresponding to the current operation that is used in the above process may be included in the operation description data of the current operation, or may be obtained by image recognition from the post-execution valid frame image corresponding to the current operation.
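The three success checks can be combined as in the sketch below. It assumes the current post-execution screen has already been analysed into an image fingerprint, a set of detected icon names, and recognized text; that dictionary layout, the function name `operation_succeeded`, and the rule that any one passing check counts as success are all assumptions made for illustration.

```python
from typing import Any, Dict, Optional


def operation_succeeded(
    current: Dict[str, Any],
    expected_after_hash: Optional[str] = None,
    next_icon: Optional[str] = None,
    expected_text: Optional[str] = None,
) -> bool:
    """Apply whichever checks have reference data; succeed if any passes.

    current: hypothetical analysis of the current post-execution screen with
             keys "image_hash" (str), "icons" (set of str), and "text" (str).
    """
    checks = []
    if expected_after_hash is not None:
        # Check 1: screen is consistent with the post-execution valid frame.
        checks.append(current["image_hash"] == expected_after_hash)
    if next_icon is not None:
        # Check 2: the next operation's object icon is already visible.
        checks.append(next_icon in current["icons"])
    if expected_text is not None:
        # Check 3: page text matches the result page description.
        checks.append(expected_text in current["text"])
    return any(checks)
```

A stricter variant could require every configured check to pass by replacing `any` with `all`; the disclosure does not fix this choice.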
In at least one exemplary embodiment, before step S302, the method may further include one of the following:
A. receiving the one or more operations on the terminal device, obtaining the operation information of each of the one or more operations, generating the operation set from the operation information, and saving the operation set;
B. receiving the operation set sent by another terminal device;
C. receiving the one or more operations on the terminal device, obtaining the operation information of each of the one or more operations, generating the operation set from the operation information, setting the execution condition corresponding to the operation set, and saving the operation set together with its execution condition;
D. receiving the operation set and the execution condition corresponding to the operation set sent by another terminal device.
From the description of the above embodiments, those skilled in the art can clearly understand that the method according to the embodiments of the present disclosure can be implemented by software plus a necessary general-purpose hardware platform, and can of course also be implemented in hardware, although in many cases the former is the better implementation. Based on this understanding, the technical solution of the present disclosure, in essence or in the part that contributes to the prior art, can be embodied in the form of a software product. The computer software product is stored in a storage medium (such as ROM/RAM, a magnetic disk, or an optical disc) and includes several instructions that cause a terminal device (which may be a mobile phone, a computer, a server, a network device, or the like) to execute the method described in the embodiments of the present disclosure.
Some embodiments of the present disclosure provide an apparatus for acquiring an operation set. The apparatus is used to implement the embodiments and preferred implementations of the above method for acquiring an operation set; what has already been described is not repeated. As used below, the term "module" may refer to a combination of software and/or hardware that implements a predetermined function. Although the apparatus described in the following embodiments is preferably implemented in software, implementations in hardware, or in a combination of software and hardware, are also possible and contemplated.
Fig. 4 is a structural block diagram of the apparatus for acquiring an operation set. As shown in Fig. 4, the apparatus includes:
a first obtaining module 42, configured to receive one or more operations on a terminal device and obtain operation information of each of the one or more operations, wherein the operation information includes: sequence identification information identifying the position of the operation in the order of the one or more operations, and operation description data of the operation;
a generating module 44, configured to generate an operation set from the operation information, wherein the operation set includes the operation information of the one or more operations.
With the above apparatus, one or more operations on the terminal device can be received automatically, the operation information of each of the one or more operations can be obtained, and an operation set can be generated from the operation information. A series of user operations can thus be recorded automatically, so that the terminal device can later execute the series of operations from the recorded operation set when triggered by the user or when an execution condition is met. This addresses the problem of how to simplify user operations on a terminal device and realizes the recording of user-defined shortcuts.
The above apparatus may be provided in a terminal device, but is not limited thereto.
It should be noted that each of the above modules can be implemented in software or hardware. In the latter case, this can be done in the following ways, without limitation: the above modules are all located in the same processor; or the above modules are located in different processors in any combination.
Some embodiments of the present disclosure provide an apparatus for executing an operation set. The apparatus is used to implement the embodiments and preferred implementations of the above method for executing an operation set; what has already been described is not repeated. As used below, the term "module" may refer to a combination of software and/or hardware that implements a predetermined function. Although the apparatus described in the following embodiments is preferably implemented in software, implementations in hardware, or in a combination of software and hardware, are also possible and contemplated.
Fig. 5 is a structural block diagram of the apparatus for executing an operation set. As shown in Fig. 5, the apparatus includes:
a second obtaining module 52, configured to obtain an operation set upon receiving an operation set execution request corresponding to the operation set or upon determining that the execution condition corresponding to the operation set is met, wherein the operation set includes operation information of one or more operations, and the operation information includes: sequence identification information identifying the position of the operation in the order of the one or more operations, and operation description data of the operation;
an execution module 54, configured to execute the one or more operations according to the operation description data, in the operation order identified by the sequence identification information.
With the above apparatus, the operation set can be obtained automatically in response to an operation set execution request or upon determining that the execution condition is met, and the one or more operations can be executed according to the operation description data in the operation order identified by the sequence identification information. The terminal device can thus automatically execute a series of operations from a shortcut, which addresses the problem of how to simplify user operations on a terminal device and realizes the execution of user-defined shortcuts.
The above apparatus may be provided in a terminal device, but is not limited thereto.
It should be noted that each of the above modules can be implemented in software or hardware. In the latter case, this can be done in the following ways, without limitation: the above modules are all located in the same processor; or the above modules are located in different processors in any combination.
Embodiments of the present disclosure also provide a computer-readable storage medium storing a computer program, wherein the computer program is configured to execute the steps of the method embodiments when run.
In some exemplary embodiments, the above computer-readable storage medium may include, but is not limited to, a USB flash drive, a read-only memory (ROM), a random access memory (RAM), a removable hard disk, a magnetic disk, an optical disc, or any other medium that can store a computer program.
Embodiments of the present disclosure also provide a terminal device including a memory and a processor, wherein the memory stores a computer program and the processor is configured to run the computer program to execute the steps of the above method embodiments.
In some exemplary embodiments, the above terminal device may further include a transmission device and an input/output device, wherein the transmission device is connected to the processor and the input/output device is connected to the processor.
For specific examples of the computer-readable storage medium and the terminal device, refer to the examples described in the foregoing embodiments and exemplary implementations; they are not repeated here.
The following takes as an example an implementation in which the user's operation set is acquired by screen recording in order to generate a shortcut, and describes in detail the technical solutions of the methods for acquiring and executing an operation set according to the embodiments of the present disclosure. It should be noted that the following exemplary embodiments are exemplary implementations of the solutions described in the foregoing embodiments. They may be understood as further explanation of those solutions, but do not unduly limit the foregoing embodiments.
Image recognition technology refers to the use of computers to process, analyze, and understand images in order to identify targets and objects of various patterns; it is a practical application of deep learning algorithms. The traditional image recognition process has four steps: image acquisition → image preprocessing → feature extraction → image recognition. With image recognition, a computer can identify the content of a picture. This exemplary embodiment uses screen recording to obtain screenshots during the user's operations, and uses image recognition to obtain the data related to those operations.
In this exemplary embodiment, the user's operation steps are recorded by screen recording and stored (during recording, the user operation description data, such as the set of tap events and the intervals between operations, is saved, together with the key frame images before, during, and after each user operation). An operation sequence is generated in chronological order, from which the corresponding shortcut is generated.
When a shortcut is executed, the system replays the previously stored set of events in chronological order, much like playing back a recording. Notably, when executing each user operation, the shortcut uses image recognition to compare the image before the operation, the image during the operation, and the image after the operation with the corresponding previously stored images. Specifically, the system uses image recognition to identify the image content before and after the operation (for example, that one element is a switch and what text it carries, or that another element is an icon and which application it belongs to), compares it with the stored key frame images, and uses the comparison result to determine whether the preconditions for executing the current operation are met, whether an exception occurred during the current operation, and whether the result after execution is normal. If the image before execution differs such that the preconditions of the operation are not met, the system can perform appropriate error correction based on image recognition. If the image after execution differs, the system can judge the operation result from the recognition output; if the result is judged to be a failure, the system can perform exception handling.
Furthermore, the shortcut can be sent to other users, or combined with a timer for scheduled execution, which enables more powerful functions. This exemplary embodiment is applicable to a variety of scenarios, for example: one-tap navigation or one-tap scanning of a shared bicycle on a mobile phone; having a computer automatically open the required software after boot and enter a working mode; remote assistance (such as setting an alarm clock remotely); backing up a series of operations; scheduled clock-in; and so on.
This exemplary embodiment mainly involves four modules, each described in detail below.
(1) Information collection module: mainly responsible for screen recording, saving the key frame images before, during, and after each user operation, and collecting the data of each operation, such as its time and category (for example, a screen tap or a volume key press). For a screen tap, the coordinates and whether it is a long press or a short press are also recorded.
(2) Storage module: stores the data collected by the information collection module. Its data structure may be a list similar to Table 1 below. Operations are numbered in chronological order: the first operation executed is 0001, the next is 0002, and so on.
Table 1: Data stored by the storage module
Figure PCTCN2021097922-appb-000001
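Because Table 1 is only available as an image, the following is a hedged sketch of what such a list of operation records might look like. The field names in `OperationRecord` and the helper `number_operations` are hypothetical; they are derived from the data the information collection module is described as gathering (time, category, coordinates, long or short press, and the three key frames), not from the table itself.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class OperationRecord:
    """One row of the stored operation list (field names are assumptions)."""
    timestamp: float                          # time of the operation
    category: str                             # e.g. "tap" or "volume_key"
    coords: Optional[Tuple[int, int]] = None  # only recorded for screen taps
    long_press: bool = False                  # long press vs. short press
    frame_before: Optional[str] = None        # key frame saved before the operation
    frame_during: Optional[str] = None        # key frame saved during the operation
    frame_after: Optional[str] = None         # key frame saved after the operation
    seq: str = ""                             # sequence number, assigned below


def number_operations(records: List[OperationRecord]) -> List[OperationRecord]:
    """Sort records chronologically and number them 0001, 0002, and so on."""
    ordered = sorted(records, key=lambda r: r.timestamp)
    for index, record in enumerate(ordered, start=1):
        record.seq = f"{index:04d}"
    return ordered
```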
(3) Image recognition module: identifies the object of the user's operation from the operation parameters combined with the key frame image saved during the operation. Before each operation, it compares the current image with the stored pre-operation key frame to determine whether the two frames differ, for example whether the operation object is still present and whether its position is unchanged; if there is a difference, it notifies the control module to perform error correction. After each operation, it compares the current image with the stored post-operation key frame and judges from any difference whether the current operation succeeded; if the operation failed, it notifies the control module to perform exception handling.
(4) Control module: when the user records the screen, it puts the system into the initial screen recording state, starts the information collection module to collect information, and then stores the collected information in the storage module. After a shortcut is launched, it executes the stored operation steps in chronological order with their parameters, starts the image recognition module to compare the current image data with the stored image data in real time, and decides from the recognition result whether to perform error correction, retry, handle an exception, abort the operation, or prompt the user to choose the next action.
The following uses the scenario of a user scanning a shared bicycle with the "Bicycle" (单车) APP to illustrate the implementation flow of the embodiments of the present disclosure. The flow includes an information collection flow, a shortcut execution flow, an error correction flow, and a running result judgment flow, each described in detail below. It should be noted that the following exemplary implementations are specific implementations, in a specific scenario, of the solutions described in the foregoing embodiments and exemplary embodiments. They may be understood as further explanation of those solutions, but do not unduly limit the foregoing embodiments and exemplary embodiments.
1. The information collection flow includes the following steps:
(1) An entry for recording a shortcut is added to the interface; the user starts the custom shortcut function through this entry;
(2) The system switches to the initial screen recording state, which is generally the home screen of the mobile phone;
(3) The information collection module saves one frame of the current screenshot and waits for user input;
(4) The user launches the "Bicycle" APP by tapping the screen;
(5) At the moment of the tap, the information collection module saves one frame of the image during the operation and stores the coordinates, time, type, and other parameters of the operation;
(6) When the operation ends and the "Bicycle" APP has launched successfully, the information collection module saves one more frame of the image after the operation;
(7) Steps (3) to (6) are repeated until the user ends the recording;
(8) The control module generates the shortcut.
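Steps (3) to (7) of the collection flow can be sketched as a simple loop. This is a simplified, synchronous sketch rather than the disclosed implementation: a real recorder would be event driven, and the names `record_shortcut`, `user_events`, and `capture_screen`, as well as the dictionary keys, are hypothetical. The end of the event stream stands in for the user ending the recording.

```python
from typing import Any, Callable, Dict, Iterable, List


def record_shortcut(
    user_events: Iterable[Dict[str, Any]],
    capture_screen: Callable[[], Any],
) -> List[Dict[str, Any]]:
    """Record one entry per user event, with the three key frames around it."""
    operations: List[Dict[str, Any]] = []
    events = iter(user_events)
    while True:
        frame_before = capture_screen()  # step (3): save a frame, then wait for input
        event = next(events, None)       # steps (4)/(7): None means recording ended
        if event is None:
            break
        frame_during = capture_screen()  # step (5): frame at the moment of the operation
        frame_after = capture_screen()   # step (6): frame once the operation has finished
        operations.append({
            "seq": f"{len(operations) + 1:04d}",  # chronological sequence number
            "params": dict(event),                # coordinates, time, type, and so on
            "frame_before": frame_before,
            "frame_during": frame_during,
            "frame_after": frame_after,
        })
    return operations
```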
2. The shortcut execution flow includes the following steps:
(1) An entry for launching a shortcut is added to the interface, or the user can set an automatic launch time for the shortcut. When the shortcut needs to be executed, the control module retrieves the operation with sequence number 0001 from the storage module, together with its parameters and key frame images;
(2) The information collection module captures the current screenshot, which is compared with the pre-operation key frame of operation 0001. Image recognition is used to determine whether the "Bicycle" (单车) APP icon is still present (an APP icon labeled "Danche", "Bicycle", or "Bike" is also treated as the icon of the "Bicycle" (单车) APP) and whether its position has changed:
If nothing has changed, operation 0001 is executed;
If the position of the "Bicycle" icon has changed, error correction is required. Image recognition identifies the new coordinates of the "Bicycle" application icon; the control module then stores the new coordinates in the corresponding place in the storage module and executes operation 0001 using the new coordinates;
If the "Bicycle" icon is no longer present, the control module informs the user that the operation failed;
(3) After operation 0001 has been executed, one frame is captured and compared with the stored post-operation key frame of operation 0001 to determine whether the operation succeeded:
If there is no difference, execution succeeded and the next operation can proceed;
If the "Bicycle" application failed to launch, the user is informed that the operation failed and the flow terminates abnormally;
(4) If step (3) succeeded, steps (1) to (3) are repeated for the operation with sequence number 0002, and so on until all operations have been executed or the flow terminates abnormally.
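The execution flow above can be sketched as follows. This is a minimal sketch under stated assumptions: `run_shortcut` and the four injected callables (`capture`, `locate`, `matches`, `perform`) are hypothetical names, the operation dictionaries use made-up keys, and the image recognition itself is left to the caller.

```python
from typing import Any, Callable, Dict, List, Optional, Tuple

Point = Tuple[int, int]


def run_shortcut(
    operations: List[Dict[str, Any]],
    capture: Callable[[], Any],
    locate: Callable[[Any, Any], Optional[Point]],
    matches: Callable[[Any, Any], bool],
    perform: Callable[[Dict[str, Any]], None],
) -> Tuple[str, Optional[str]]:
    """Replay operations in sequence order with a check before and after each one.

    Returns ("done", None) on success, or ("failed", seq) for the failing operation.
    """
    for op in sorted(operations, key=lambda o: o["seq"]):
        # Step (2): find the operation object on the current screen.
        position = locate(capture(), op["target"])
        if position is None:
            return ("failed", op["seq"])   # the icon is no longer present
        if position != op["coords"]:
            op["coords"] = position        # error correction: store the new coordinates
        perform(op)                        # execute the operation at op["coords"]
        # Step (3): compare the result with the stored post-operation key frame.
        if not matches(capture(), op["frame_after"]):
            return ("failed", op["seq"])   # flow terminates abnormally
    return ("done", None)
```

The sketch stops at the first failure; retrying or prompting the user, which the control module may also do, would slot in where the function currently returns `"failed"`.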
3. Error correction flow:
Error correction may be needed because the position of the "Bicycle" APP on the home screen has changed. Real-time image recognition can identify which region of the current screenshot is the icon of the "Bicycle" APP, or can identify (or additionally identify) the two Chinese characters "单车", and thereby obtain the position of the "Bicycle" APP in the screenshot. The new position of the "Bicycle" APP in the screenshot gives the new coordinates at which the screen tap is to be performed.
As an example, Fig. 6 is a schematic diagram of an application whose coordinates have moved. As shown in Fig. 6, the left image shows the position of the "Bicycle" application when the user recorded the shortcut; for some reason the user later moved the "Bicycle" application to the position shown in the right image of Fig. 6. Taking the upper left corner of the screenshot as (0, 0), image recognition gives the position of the "Bicycle" application as (x1, y1) in the left image of Fig. 6 and (x2, y2) in the right image. Because the icon has a width and a height, it is recommended to use the coordinates of the icon's center point.
The coordinates for the right image can be obtained from the coordinates for the left image of Fig. 6 as follows.
In the general case, coordinates in the screenshot correspond one-to-one to tap coordinates on the screen; that is, the coordinates of the "Bicycle" icon in the screenshot are the coordinates at which the user taps the screen. In this case (x2, y2) is the new tap coordinate for "Bicycle";
In the special case, the coordinates in the screenshot are not the tap coordinates on the screen. Suppose the coordinates of "Bicycle" in the left image of Fig. 6 are (x1, y1), the user's tap coordinates are (xd1, yd1), and the coordinates of "Bicycle" in the right image of Fig. 6 are (x2, y2). Because screenshot coordinates and tap coordinates are related, (xd2, yd2) can be calculated from the following relation:
(x1, y1)/(xd1, yd1) = (x2, y2)/(xd2, yd2)
where (x1, y1) and (x2, y2) are the coordinates obtained from the screenshots by image recognition, and (xd1, yd1) are the coordinates of the user's screen tap captured by the information collection module during recording.
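Reading the relation above component by component (an interpretation, since the source writes it on coordinate pairs) gives xd2 = x2 · xd1 / x1 and yd2 = y2 · yd1 / y1. A minimal sketch of that calculation follows; the function name `corrected_tap` is hypothetical, and the guard against a zero coordinate is an addition, since the ratio is undefined there.

```python
from typing import Tuple

Point = Tuple[float, float]


def corrected_tap(icon_old: Point, tap_old: Point, icon_new: Point) -> Point:
    """Scale the recorded tap so it keeps the same ratio to the icon position.

    icon_old = (x1, y1), tap_old = (xd1, yd1), icon_new = (x2, y2);
    returns (xd2, yd2) with x1/xd1 == x2/xd2 and y1/yd1 == y2/yd2.
    """
    (x1, y1), (xd1, yd1), (x2, y2) = icon_old, tap_old, icon_new
    if x1 == 0 or y1 == 0:
        raise ValueError("icon_old must have non-zero coordinates")
    return (x2 * xd1 / x1, y2 * yd1 / y1)
```

In the one-to-one case, where the recorded tap equals the recorded icon position, the function returns the new icon position unchanged, which agrees with the general case described earlier.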
4. Running result judgment flow:
Using the previously recorded image of the interface shown when the user's normal operation opened the "Bicycle" application and it ran normally, image comparison can determine whether the image after the current operation is consistent with it. With image recognition, it is easy to determine whether the "Bicycle" application launched successfully.
This is not limited to launching an application: whether other types of operations succeeded can also be determined by image recognition. For example, the state of a switch can be determined by image recognition, revealing whether the switch is currently on or off. Fig. 7 is a schematic diagram of different switch states. As shown in Fig. 7, the left image shows the switch off and the right image shows the switch on; the difference between the two is obvious.
In short, image recognition gives the system something like a pair of "eyes", allowing it to determine whether the preconditions for each operation are met and whether the result of each operation is as expected.
In summary, the embodiments of the present disclosure allow users to define their own shortcuts. Because image recognition is incorporated, errors can be corrected and execution results can be judged while a shortcut is being executed.
Obviously, those skilled in the art should understand that the modules or steps of the present disclosure described above can be implemented with a general-purpose computing device. They can be concentrated on a single computing device or distributed over a network of multiple computing devices. They can be implemented as program code executable by a computing device, so that they can be stored in a storage device and executed by the computing device; in some cases the steps shown or described can be executed in an order different from the one given here. Alternatively, they can each be made into a separate integrated circuit module, or several of the modules or steps can be made into a single integrated circuit module. The present disclosure is thus not limited to any specific combination of hardware and software.
The above are merely embodiments and exemplary embodiments of the present disclosure and are not intended to limit it. For those skilled in the art, various modifications and changes to the present disclosure are possible. Any modification, equivalent replacement, improvement, or the like made within the principles of the present disclosure shall fall within its scope of protection.

Claims (17)

  1. A method for acquiring an operation set, comprising:
    receiving one or more operations on a terminal device, and obtaining operation information of each of the one or more operations, wherein the operation information includes: sequence identification information identifying the position of the operation in the order of the one or more operations, and operation description data of the operation;
    generating an operation set from the operation information, wherein the operation set includes the operation information of the one or more operations.
  2. The method according to claim 1, wherein the operation information further includes: related frame images corresponding to the operation, wherein the related frame images include: a pre-execution valid frame image, an execution-time valid frame image, and a post-execution valid frame image.
  3. The method according to claim 2, wherein receiving one or more operations on the terminal device and obtaining operation information of each of the one or more operations comprises:
    in response to a received operation set collection request, the terminal device obtaining, through a screen recording function or a screenshot function, the related frame images corresponding to each of the one or more operations, and collecting the sequence identification information and the operation description data of each of the one or more operations, until a collection end indication is received.
  4. The method according to claim 1, wherein
    the sequence identification information identifying the position of the operation in the operation order of the one or more operations comprises at least one of: an operation time of the operation, and a sequence number of the operation among the one or more operations;
    and/or,
    the operation description data of the operation comprises at least one of: an operation category, a coordinate parameter, a duration parameter, key identification information, identification information of a sensor that collects a biometric feature, collection parameters for collecting the biometric feature, an operation object corresponding to the operation, execution page description information, and result page description information.
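As an illustration of claims 1 and 4, the operation information could be held in a record like the one below. This is a minimal sketch under stated assumptions, not the applicant's implementation: the names `OperationInfo` and `build_operation_set` and every field name are hypothetical, and only some of the claim 4 fields are shown.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class OperationInfo:
    """One recorded operation (all field names are hypothetical)."""
    sequence_number: int                           # sequence identification information
    operation_time: float                          # alternative sequence identifier, in seconds
    category: str                                  # operation category, e.g. "tap" or "key"
    coordinates: Optional[Tuple[int, int]] = None  # coordinate parameter
    duration_ms: Optional[int] = None              # duration parameter
    key_id: Optional[str] = None                   # key identification information
    operation_object: Optional[str] = None         # e.g. label of the tapped icon


def build_operation_set(operations: List[OperationInfo]) -> List[OperationInfo]:
    """Return the operations ordered by sequence number, then by operation time."""
    return sorted(operations, key=lambda op: (op.sequence_number, op.operation_time))
```

Sorting at build time means a later executor can simply iterate over the set in order, whichever of the two sequence identifiers was recorded.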
  5. The method according to claim 4, wherein
    the operation category is obtained based on a screen touch signal, a key touch signal, or a system sensor call signal; and/or,
    the coordinate parameter is obtained based on the screen touch signal; and/or,
    the duration parameter is obtained based on the screen touch signal; and/or,
    the key identification information is obtained based on the key touch signal; and/or,
    the identification information of the sensor that collects the biometric feature and the collection parameters are obtained based on the system sensor call signal; and/or,
    the operation object corresponding to the operation and the execution page description information are obtained by image recognition from the execution-time valid frame image corresponding to the operation, or by image recognition from the pre-execution valid frame image corresponding to the operation in combination with the coordinate parameter; and/or,
    the result page description information corresponding to the operation is obtained by image recognition from the post-execution valid frame image corresponding to the operation.
  6. The method according to any one of claims 1 to 5, further comprising, after generating the operation set according to the operation information, at least one of:
    saving the operation set;
    setting an execution condition corresponding to the operation set, and then saving the operation set and the execution condition corresponding to the operation set;
    sending the operation set;
    setting an execution condition corresponding to the operation set, and then sending the operation set and the execution condition corresponding to the operation set.
  7. A method for executing an operation set, comprising:
    obtaining the operation set upon receiving an operation set execution request corresponding to the operation set, or upon determining that an execution condition corresponding to the operation set is satisfied, wherein the operation set comprises operation information of one or more operations, and the operation information comprises: sequence identification information identifying the position of the operation in the operation order of the one or more operations, and operation description data of the operation;
    executing the one or more operations according to the operation description data, in the operation order identified by the sequence identification information.
  8. The method according to claim 7, wherein the operation information further comprises relevant frame images corresponding to the operation, the relevant frame images comprising: a pre-execution valid frame image, an execution-time valid frame image, and a post-execution valid frame image.
  9. The method according to claim 8, wherein executing the one or more operations according to the operation description data, in the operation order identified by the sequence identification information, comprises:
    determining a current operation to be executed according to the sequence identification information;
    determining, according to a current pre-execution screen image and the pre-execution valid frame image corresponding to the current operation, whether a precondition for executing the current operation is satisfied, and executing the current operation if the precondition is satisfied;
    determining whether the current operation has been executed successfully and, if so, determining and executing the next current operation to be executed, until the one or more operations have all been executed.
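The loop described in claim 9 can be sketched as follows. This is only an illustrative outline: the function `replay`, the `"seq"` dictionary key, and the three callbacks are hypothetical names, and stopping at the first failed check is an assumption (the claims list other options for handling a missing operation object).

```python
from typing import Callable, Iterable, List


def replay(
    operations: Iterable[dict],
    precondition_met: Callable[[dict], bool],
    perform: Callable[[dict], None],
    succeeded: Callable[[dict], bool],
) -> List[int]:
    """Replay operations in sequence order and stop at the first failed check.

    Returns the sequence numbers of the operations that completed.
    """
    completed = []
    for op in sorted(operations, key=lambda o: o["seq"]):
        if not precondition_met(op):  # compare current screen with the pre-execution frame
            break
        perform(op)                   # inject the tap, key press, and so on
        if not succeeded(op):         # compare resulting screen with the post-execution frame
            break
        completed.append(op["seq"])
    return completed
```

Passing the checks in as callbacks keeps the ordering logic separate from whatever image recognition a real terminal device would use.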
  10. The method according to claim 9, wherein determining, according to the current pre-execution screen image and the pre-execution valid frame image corresponding to the current operation, whether the precondition for the current operation is satisfied, and executing the current operation if the precondition is satisfied, comprises:
    determining whether the current pre-execution screen image includes the operation object corresponding to the current operation;
    if the current pre-execution screen image includes the operation object corresponding to the current operation: where the position of the operation object is the same in the current pre-execution screen image and in the pre-execution valid frame image, executing the current operation according to the operation description data of the current operation; and/or, where the position of the operation object differs between the current pre-execution screen image and the pre-execution valid frame image, adjusting the operation description data of the current operation according to the position of the operation object in the current pre-execution screen image, and executing the current operation according to the adjusted operation description data.
  11. The method according to claim 10, wherein determining whether the current pre-execution screen image includes the operation object corresponding to the current operation comprises at least one of:
    identifying, by image recognition, whether the current pre-execution screen image includes an icon of the operation object corresponding to the current operation, and determining from the recognition result whether the current pre-execution screen image includes the operation object corresponding to the current operation;
    identifying, by image recognition, the page description information included in the current pre-execution screen image, matching the identified page description information against the execution page description information corresponding to the current operation, and determining from the matching result whether the current pre-execution screen image includes the operation object corresponding to the current operation.
  12. The method according to claim 10, wherein adjusting the operation description data of the current operation according to the position of the operation object corresponding to the current operation in the current pre-execution screen image comprises one of:
    where the coordinate parameter of the operation object in the current pre-execution screen image coincides with the coordinate parameter at which the operation object is clicked on the screen, replacing the coordinate parameter in the operation description data of the current operation with the coordinate parameter of the operation object in the current pre-execution screen image;
    where the coordinate parameter of the operation object in the current pre-execution screen image does not coincide with the coordinate parameter at which the operation object is clicked on the screen, determining the coordinate parameter (xd2, yd2) for clicking the operation object at its changed position according to the formula (x1, y1)/(xd1, yd1) = (x2, y2)/(xd2, yd2), where (x1, y1) is the coordinate parameter of the operation object in the pre-execution valid frame image, (x2, y2) is the coordinate parameter of the operation object in the current pre-execution screen image, and (xd1, yd1) is the coordinate parameter in the operation description data before adjustment.
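A worked version of the claim 12 formula may help. The sketch below assumes the formula is read componentwise, i.e. x1/xd1 = x2/xd2 and y1/yd1 = y2/yd2, which gives xd2 = xd1 · x2 / x1 and yd2 = yd1 · y2 / y1. The function name `adjust_click` is hypothetical, and the rejection of zero coordinates is an added safeguard rather than something the claim states.

```python
from typing import Tuple

Point = Tuple[float, float]


def adjust_click(recorded_obj: Point, current_obj: Point, recorded_click: Point) -> Point:
    """Solve (x1, y1)/(xd1, yd1) = (x2, y2)/(xd2, yd2) for (xd2, yd2), componentwise."""
    x1, y1 = recorded_obj      # object position in the pre-execution valid frame image
    x2, y2 = current_obj       # object position in the current pre-execution screen image
    xd1, yd1 = recorded_click  # click coordinates stored in the operation description data
    if x1 == 0 or y1 == 0:
        raise ValueError("recorded object coordinates must be non-zero")
    return (xd1 * x2 / x1, yd1 * y2 / y1)
```

For example, with the object recorded at (100, 200), now found at (150, 300), and a recorded click at (110, 220), the adjusted click lands at (165.0, 330.0).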
  13. The method according to claim 10, wherein, where the current pre-execution screen image does not include the operation object corresponding to the current operation, the method further comprises one of:
    confirming that execution of the one or more operations has failed;
    going back and re-executing the operation preceding the current operation;
    prompting the user to carry out the operations among the one or more operations that have not yet been completed;
    prompting the user to perform the current operation and, after the current operation has been completed, continuing to execute the uncompleted operations among the one or more operations according to the operation description data, in the operation order identified by the sequence identification information.
  14. The method according to claim 9, wherein determining whether the current operation has been executed successfully comprises at least one of:
    identifying, by image recognition, whether a current post-execution screen image is consistent with the post-execution valid frame image corresponding to the current operation, and determining that the current operation has been executed successfully if they are consistent;
    identifying, by image recognition, whether the current post-execution screen image includes an icon of the operation object corresponding to the operation following the current operation, and determining that the current operation has been executed successfully if it does;
    identifying, by image recognition, the page description information included in the current post-execution screen image, determining whether the identified page description information matches the result page description information corresponding to the current operation, and determining that the current operation has been executed successfully if they match.
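The first option in claim 14 compares the current post-execution screen with the recorded post-execution frame. The claims do not say how "consistent" is decided, so the sketch below substitutes a plain per-pixel comparison with a tolerance for the image recognition step. The function name `frames_consistent`, the grayscale frame representation, and both thresholds are assumptions made for illustration.

```python
from typing import Sequence

Frame = Sequence[Sequence[int]]  # grayscale pixels: rows of values in 0..255


def frames_consistent(
    current: Frame,
    expected: Frame,
    max_mismatch: float = 0.02,
    tolerance: int = 8,
) -> bool:
    """Return True if at most `max_mismatch` of the pixels differ by more than `tolerance`."""
    # Frames of different shape are never treated as consistent.
    if len(current) != len(expected):
        return False
    if any(len(a) != len(b) for a, b in zip(current, expected)):
        return False
    total = sum(len(row) for row in expected)
    if total == 0:
        return True
    mismatched = sum(
        1
        for row_a, row_b in zip(current, expected)
        for a, b in zip(row_a, row_b)
        if abs(a - b) > tolerance
    )
    return mismatched / total <= max_mismatch
```

A tolerance is used because two captures of the same page rarely match bit for bit (clock digits, animations, compression noise); a real system would likely mask such regions or use a more robust matcher.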
  15. The method according to any one of claims 7 to 14, further comprising, before obtaining the operation set, one of:
    receiving the one or more operations on a terminal device, obtaining operation information of each of the one or more operations, generating the operation set according to the operation information, and saving the operation set;
    receiving the operation set sent by another terminal device;
    receiving the one or more operations on a terminal device, obtaining operation information of each of the one or more operations, generating the operation set according to the operation information, setting the execution condition corresponding to the operation set, and saving the operation set and the execution condition corresponding to the operation set;
    receiving the operation set and the execution condition corresponding to the operation set sent by another terminal device.
  16. A computer-readable storage medium storing a computer program, wherein the computer program is configured to perform, when run, the method according to any one of claims 1 to 6 or the method according to any one of claims 7 to 15.
  17. A terminal device, comprising a memory and a processor, wherein the memory stores a computer program, and the processor is configured to run the computer program to perform the method according to any one of claims 1 to 6 or the method according to any one of claims 7 to 15.
PCT/CN2021/097922 2020-06-30 2021-06-02 Operation set obtaining and executing methods and apparatuses, storage medium, and terminal device WO2022001564A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010617365.4 2020-06-30
CN202010617365.4A CN113946257A (en) 2020-06-30 2020-06-30 Operation set acquisition and execution method and device, storage medium and terminal equipment

Publications (1)

Publication Number Publication Date
WO2022001564A1 true WO2022001564A1 (en) 2022-01-06

Family

ID=79317400

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/097922 WO2022001564A1 (en) 2020-06-30 2021-06-02 Operation set obtaining and executing methods and apparatuses, storage medium, and terminal device

Country Status (2)

Country Link
CN (1) CN113946257A (en)
WO (1) WO2022001564A1 (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103530133A (en) * 2013-10-29 2014-01-22 广东欧珀移动通信有限公司 Custom operation method and device for terminal
CN108304105A (en) * 2017-12-20 2018-07-20 维沃移动通信有限公司 A kind of application interface starts method, mobile terminal
CN108323239A (en) * 2016-11-29 2018-07-24 华为技术有限公司 Recording, playback method, record screen terminal and the playback terminal of film recording
CN108681483A (en) * 2018-05-16 2018-10-19 维沃移动通信有限公司 A kind of task processing method and device
US20190102139A1 (en) * 2017-09-29 2019-04-04 Spotify Ab Systems and methods of associating media content with contexts
CN110442401A (en) * 2018-05-03 2019-11-12 腾讯科技(北京)有限公司 Function jump method, system and function recording, back method, device, equipment

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116910393A (en) * 2023-09-13 2023-10-20 戎行技术有限公司 Large-batch news data acquisition method based on recurrent neural network
CN116910393B (en) * 2023-09-13 2023-12-12 戎行技术有限公司 Large-batch news data acquisition method based on recurrent neural network

Also Published As

Publication number Publication date
CN113946257A (en) 2022-01-18

Similar Documents

Publication Publication Date Title
JP6893606B2 (en) Image tagging methods, devices and electronics
CN105204745B (en) Screen capturing method and device for mobile terminal
EP2874383B1 (en) System and method for controlling slide operation auxiliary input in portable terminal devices
CN105425941B (en) A kind of method and device preventing application program in error starting mobile terminal
CN104618577B (en) A kind of response method and device of button request
TWI452527B (en) Method and system for application program execution based on augmented reality and cloud computing
CN105549868A (en) Mobile terminal operation processing method and apparatus and mobile terminal
WO2012065518A1 (en) Method for changing user operation interface and terminal
WO2022022566A1 (en) Graphic code identification method and apparatus and electronic device
EP3260998A1 (en) Method and device for setting profile picture
WO2020135334A1 (en) Television application theme switching method, television, readable storage medium, and device
CN109814801A (en) Using login method, device, terminal and storage medium
WO2015078126A1 (en) Positioning method and device
JP2021034003A (en) Human object recognition method, apparatus, electronic device, storage medium, and program
CN109857787B (en) Display method and terminal
WO2020192215A1 (en) Interactive method and wearable interactive device
CN108205455B (en) Application function implementation method and device and terminal
WO2022001564A1 (en) Operation set obtaining and executing methods and apparatuses, storage medium, and terminal device
CN111047147B (en) Automatic business process acquisition method and intelligent terminal
WO2018040733A1 (en) Virtual keyboard input method and device, and robot
CN113938733A (en) Shortcut key control method and device for remote control equipment, storage medium and device
US10588433B2 (en) Data setting method for body information analysis apparatus
WO2016019794A1 (en) Picture selection method and apparatus therefor
WO2018107422A1 (en) Electronic apparatus and information reading control method
CN105335088A (en) File sharing method and device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21833566

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 19/05/2023)

122 Ep: pct application non-entry in european phase

Ref document number: 21833566

Country of ref document: EP

Kind code of ref document: A1