WO2019233318A1 - Content identification method and device, and mobile terminal - Google Patents

Content identification method and device, and mobile terminal Download PDF

Info

Publication number
WO2019233318A1
WO2019233318A1 PCT/CN2019/088874 CN2019088874W WO2019233318A1 WO 2019233318 A1 WO2019233318 A1 WO 2019233318A1 CN 2019088874 W CN2019088874 W CN 2019088874W WO 2019233318 A1 WO2019233318 A1 WO 2019233318A1
Authority
WO
WIPO (PCT)
Prior art keywords
screenshot
recognition
content
user interface
text
Prior art date
Application number
PCT/CN2019/088874
Other languages
French (fr)
Chinese (zh)
Inventor
段丽霞
Original Assignee
Oppo广东移动通信有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Oppo广东移动通信有限公司 filed Critical Oppo广东移动通信有限公司
Publication of WO2019233318A1 publication Critical patent/WO2019233318A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0481Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0487Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser
    • G06F3/0488Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser using a touch-screen or digitiser, e.g. input of commands through traced gestures

Definitions

  • the present application relates to the technical field of mobile terminals, and more particularly, to a content recognition method, device, and mobile terminal.
  • the display screen of the mobile terminal can display various contents. If a user wants to obtain detailed information of some of the displayed contents, the corresponding content needs to be copied to a browser search box, and the operation process is tedious.
  • this application proposes a content recognition method, device, and mobile terminal, which are used to identify the content of the user interface, simplify the recognition process, and improve the user experience.
  • an embodiment of the present application provides a content recognition method.
  • the method includes: when a recognition touch is received on a user interface, performing content recognition on the user interface; Failure, displaying an adjustable screenshot frame in the user interface; identifying the content within the screenshot frame.
  • an embodiment of the present application provides a content recognition device.
  • the device includes: a first recognition module, configured to perform content recognition on the user interface when receiving a recognition touch on the user interface; A module configured to display an adjustable screenshot frame in the user interface if the content identification of the user interface fails; a second identification module is configured to identify the content in the screenshot frame.
  • an embodiment of the present application provides a mobile terminal including a display screen, a memory, and a processor.
  • the display screen and the memory are coupled to the processor.
  • the memory stores instructions. The method described above by the processor when executed by the processor.
  • an embodiment of the present application provides a computer-readable storage medium having a processor-executable program code, where the program code causes the processor to perform the foregoing method.
  • the content recognition method, device and mobile terminal provided in this application display an adjustable screenshot frame and identify the content of the screenshot frame in the case that the content recognition of the user interface fails in response to the recognition touch.
  • User interface for content identification simple and convenient operation.
  • FIG. 1 shows a flowchart of a content identification method according to an embodiment of the present application
  • FIG. 2 shows a first display schematic diagram provided by an embodiment of the present application
  • FIG. 3 shows a second display schematic diagram provided by an embodiment of the present application
  • FIG. 4 shows a third display schematic diagram provided by an embodiment of the present application.
  • FIG. 5 is a flowchart of a content identification method according to another embodiment of the present application.
  • FIG. 6 shows a fourth display schematic diagram provided by an embodiment of the present application.
  • FIG. 7 shows a fifth display schematic diagram provided by an embodiment of the present application.
  • FIG. 8 shows a sixth display schematic diagram provided by an embodiment of the present application.
  • FIG. 9 shows a seventh display schematic diagram provided by an embodiment of the present application.
  • FIG. 10 shows a functional module diagram of a content recognition device according to an embodiment of the present application.
  • FIG. 11 shows a structural block diagram of a mobile terminal according to an embodiment of the present application.
  • FIG. 12 is a schematic structural diagram of a mobile terminal according to an embodiment of the present application.
  • FIG. 13 shows a block diagram of a mobile terminal for performing a content recognition method according to an embodiment of the present application.
  • the displayed content may be selected by using pressure sensing and other technologies and the selected content may be identified to obtain a recognition result, so as to improve the speed of obtaining information.
  • the inventors have discovered through a large number of studies that, in response to the user's touch on the user interface for content recognition, the recognition may fail and the recognition result desired by the user may not be obtained.
  • an embodiment of the present application proposes a content recognition method, device, and mobile terminal.
  • recognition failure an adjustable screenshot frame is displayed on the user interface, so that the content can be selected by re-selecting the content in the user interface. Recognize and get recognition results that fit the needs of the user.
  • an embodiment of the present application provides a content identification method.
  • the content identification method is used to identify all or part of content in a user interface displayed on a display screen.
  • the content recognition method is applied to a content recognition device as shown in FIG. 10 and a mobile terminal 400 (FIG. 11 and FIG. 12) corresponding to the content recognition device 300.
  • the above content identification method may specifically include the following steps:
  • Step S110 When a recognition touch on the user interface is received, content recognition is performed on the user interface.
  • the user can perform recognition touch on the user interface.
  • the specific touch operation corresponding to the recognition touch is not limited in the embodiments of the present application, such as single-finger long press, two-finger long press, multi-finger long press, knuckle long press, single-finger tap, two-finger Tap, multi-finger tap, knuckle tap, single-finger large-area compression, two-finger large-area compression, multi-finger large-area compression, single-finger, two-finger, or multi-finger sliding on a preset trajectory, etc.
  • the touch operation is to slide according to a preset trajectory, the sliding trajectory may be a closed graphic to identify the content in the closed graphic.
  • the mobile terminal When the mobile terminal receives the recognition touch, it recognizes the content corresponding to the recognition touch in the user interface.
  • the identified display content may be all content in the user interface. In this embodiment, all content except a fixed field in the mobile terminal can be identified. If an application is displayed on the display of the mobile terminal, all content currently displayed on the display by the application is identified.
  • the identified display content may be content corresponding to the touch position of the touch operation in the user interface.
  • the specific content corresponding to the touch position may be a text paragraph where the touch position is located, a picture where the touch position is located, and a control where the touch position is located.
  • the displayed interface is a touched user interface
  • the circle A indicates the touch position
  • the text segment where the circle A is located is used as the display content to be identified.
  • the recognized display content is a text displayed in a text control corresponding to a touch position.
  • the method may include: determining a text control corresponding to the touch position of the recognition touch; and acquiring text in the text control for recognition.
  • the text control corresponding to the touch position may be a text control closest to the touch position.
  • the touched text control is used as the text control to be recognized.
  • a circle A indicates a touch position
  • a text control B corresponding to the chat information is a text control touched by the touch position, and the chat information in the text control B is identified.
  • the touch position is at a position other than the text control
  • the text in the text control closest to the touch position can be recognized.
  • circle A indicates a touch position, and no touch is on the text control.
  • the text control B corresponding to the chat information is the text control closest to the touch position. Identification of chat messages.
  • the recognition of the displayed content may use some existing recognition methods, such as word segmentation and semantic recognition, which are not limited here.
  • the display content can also be identified through background screenshots. After taking screenshots of the content that needs to be identified, it can be identified through image analysis, such as OCR (Optical Character Recognition). Specifically, determining a touch position of the recognition touch in a user interface; in the user interface, taking a screenshot within a preset range of the touch position; and performing text recognition on an image obtained by the screenshot.
  • OCR Optical Character Recognition
  • the size of the preset range is not limited in the embodiments of the present application, and may be a rectangular range with a preset length and width, a circular range with a preset radius, or other preset shapes and a range with a preset size.
  • Step S120 if the content identification of the user interface fails, an adjustable screenshot frame is displayed on the user interface.
  • the identification of the content of the user interface may fail for various reasons. There may be multiple reasons for recognition failure, for example, network problems such as unstable network connection, mobile terminal not connected to the network, etc., lead to failure to recognize or timeout;
  • the content to be identified is unhealthy information, etc., and is not exhaustive here.
  • an adjustable screenshot frame can be displayed in the user interface.
  • the screenshot frame K may be a rectangular frame as shown in the figure, or may be a closed shape of other shapes, such as any shape such as a circle, a prism, a triangle, and a polygon.
  • the user interface is a user interface displayed on a display screen when a touch is recognized.
  • Step S130 identify the content in the screenshot frame.
  • the content of the user interface is corresponding to the screenshot box K.
  • the content in the screenshot box indicates the content that needs to be identified. Therefore, the content in the screenshot box can be identified.
  • the adjustable screenshot frame is displayed on the user interface, and the user interface is framed by the screenshot frame. Select, and then identify the contents of the box selection. Therefore, in the case of recognition failure, the frame selection button can be used to reframe and identify the recognition content in the user interface again.
  • the adjustable screenshot frame can be adjusted by the user according to requirements, so that the content in the screenshot frame is the content that the user wants to identify.
  • the method provided in the embodiment of the present application includes:
  • Step S210 When a recognition touch on the user interface is received, content recognition is performed on the user interface.
  • the user can initiate a recognition touch operation to identify the displayed content in the user interface.
  • the user interface may be a chat interface, a web interface, a video interface, a user interface of various applications, and the like, which are not limited in the embodiments of the present application.
  • Step S220 If the content identification of the user interface fails, an adjustable screenshot frame is displayed on the user interface.
  • the recognition fails, the recognition result of the content in the touched user interface is not obtained, a screenshot frame is displayed on the user interface, and the user interface is selected and identified.
  • the interface displayed on the display screen is the user interface touched when the touch is recognized, and is displayed on the user interface.
  • Screenshot box K is the user interface touched when the touch is recognized, and is displayed on the user interface.
  • the touch-sensitive user interface may be reduced and displayed, and the screenshot frame K is displayed in the reduced user interface.
  • auxiliary content can be displayed outside the reduced user interface.
  • the auxiliary content can be one or more types of recognition options such as identifying QR codes, identifying products, and identifying text.
  • the auxiliary content can be a prompt for adjusting the screenshot box.
  • Information, such as "select an area to be identified" in FIG. 6, and the auxiliary content may also be a control button for exiting the screenshot identification.
  • the screenshot box when the screenshot box is displayed, only the screenshot box can be operated in the user interface, and other locations cannot be operated.
  • the position where the screenshot frame is first displayed is not limited.
  • the screenshot frame may be displayed at a preset position in a preset size.
  • the preset size can be a fixed size that is set in advance, or a preset size that is proportional to the user interface.
  • the preset position may be a certain fixed position of the display screen, or may be any certain position of the touched user interface.
  • a display position of the screenshot frame may be determined according to a touch position that recognizes a touch. Specifically, it may be a screenshot frame that includes a touch position frame.
  • the screenshot frame may be a preset size, or a minimum screenshot frame including a touch area frame corresponding to a touch position.
  • the screenshot frame may include the control frame touched by the touch position. Specifically, a control corresponding to the touch position for identifying the touch may be determined, and the correspondence may indicate that in a user interface, the position where the control is located overlaps the touch position. Then, an adjustable screenshot frame including the control frame is displayed on the user interface.
  • the screenshot frame may be a screenshot frame that is larger than the control, including the touched control frame. It can also be the smallest screenshot box that can select the control box where the touch position is located, that is, the screenshot box only selects the control box that it touches.
  • the screenshot box can be a preset shape, such as a rectangle, A rounded rectangle or a shape that conforms to the shape and outline of the control where the touch location is located.
  • a prompt message may be displayed to prompt the user if the recognition fails, whether to enter the box selection recognition. If the user selects Yes, an adjustable screenshot box is displayed in the user interface; if the user selects No, you can exit During the recognition process, the recognition of the user interface is ended, and no recognition result is obtained.
  • Step S230 identify the content in the screenshot frame.
  • the content selected by the screenshot frame can be identified.
  • the identification may be directly obtaining the content in the screenshot box, such as including a text control in the screenshot box, obtaining the text in the text control, including a picture control in the screenshot box, and directly obtaining a picture in the picture control.
  • the recognition may also be that the content selected by the screenshot frame is taken as an image, for example, the edge of the screenshot frame is used as the interception edge, and all the content in the screenshot frame is taken to obtain the intercepted image, and then the Content is identified.
  • the recognition process can first obtain the content in the picture through image processing, such as OCR (Optical Character Recognition, Optical Character Recognition) processing, etc. There is no restriction here, and the text, pictures, and two-dimensional code in the image can be obtained through the existing And so on.
  • the user interface displays a screenshot frame to identify all or part of the content in the screenshot frame.
  • the identification type indicates the type to which the content to be identified belongs, such as a QR code, a product, a text, a picture, and the like .
  • the mobile terminal may receive a target recognition type selected by the user from one or more recognition types; and identify content corresponding to the target recognition type in the screenshot frame. That is, a recognition type selected by the user from one or more recognition types is received, and the recognition type selected by the user is used as the target recognition type to identify the content in the screenshot frame that belongs to the target recognition type.
  • Figure 6 shows three optional recognition types: QR code, product, and text.
  • the recognition of different recognition types may be different in recognition content.
  • text recognition only parses the text content in the screenshot box to obtain the recognition result
  • picture recognition only parses the picture in the screenshot box to obtain the recognition result. result.
  • It can also be a different recognition method, such as recognition of text, pictures, and two-dimensional codes, which is implemented by the corresponding recognition processing server.
  • the content in the screenshot box is sent to a dedicated text
  • the recognized server, or the text in the screenshot box is sent to a server dedicated to text recognition, which recognizes the text in the screenshot box
  • the selected target recognition type is picture
  • the content in the screenshot box is sent To a server dedicated to image recognition, or to send a picture in a screenshot box to a server dedicated to picture recognition, and the server recognizes the picture in the screenshot box
  • the identification of the product can be linked to a third party Shopping platforms, such as Taobao, pass the content in the screenshot box to a third-party shopping platform for identification, in order to obtain product information and purchase links from the third-party shopping platform.
  • the recognition results may also be displayed differently. For example, the recognition of text, products, pictures, etc. can be displayed directly through cards in the form of word segmentation, introduction, and links. The recognition of products can be displayed through third-party shopping platforms.
  • the recognition process may be analyzing the content in the screenshot frame, obtaining the content corresponding to the target recognition type from it, and identifying the content of the target recognition type.
  • the target recognition type is a two-dimensional code
  • analyze the content in the screenshot box to obtain the two-dimensional code therein, and then identify the two-dimensional code to obtain the information contained in the two-dimensional code.
  • the target recognition type is text
  • perform text segmentation, parsing, and semantic search operations on the text in the screenshot box and feedback the recognition result of the text.
  • the text content can be obtained by analyzing the screenshot box, and then the text content is filtered. After filtering out the garbled characters, valid text is obtained, and then the valid text is obtained. Perform analytical identification.
  • the garbled text may be text other than a preset type of text. For example, if the preset type of text is Chinese characters, English characters, and selected common punctuation, other characters, punctuation other than common punctuation, etc. Judged as garbled.
  • one of the recognition types may be used as the default recognition type to identify the content of the default recognition type in the screenshot box.
  • the recognition may not be performed. After the user selects the recognition type, the content corresponding to the selected recognition type is recognized.
  • all types can be recognized.
  • the screenshot frame is an adjustable screenshot frame, and the adjustment includes adjustment of position and adjustment of size.
  • the mobile terminal may receive adjustments to the size or position of the screenshot frame; and identify the content within the adjusted screenshot frame.
  • Figure 4 and Figure 6 show the selection of the screenshot frame in the user interface after entering the frame selection interface.
  • the user can adjust the size of the screenshot frame by pulling the corner of the screenshot frame in different directions.
  • the screenshot box K after being reduced in FIG. 4.
  • the user can also change the position of the screenshot frame by dragging the screenshot frame by pressing the border of the screenshot frame, the inner area, etc., as shown in FIG. 8, which is adjusted relative to the position of FIG. 7.
  • the shape of the screenshot frame can also be changed, and a user's request for changing the shape of the screenshot frame is received.
  • the shape of the screenshot frame is changed to a circle, a triangle, or any other polygon.
  • the specific change of the shape can be determined by the user, that is, the shape of the screenshot frame can be changed to the shape specified by the change request.
  • each time the user clicks the button of a change request to change the shape the shape of the screenshot box is changed in the preset shape order; or when the user's change request is received, the displayable shape options are displayed.
  • the selected shape is used as the shape of the screenshot box to meet the needs of different users.
  • a shape selection button may be provided corresponding to the screenshot frame K, and when the button is pressed, a selectable shape is displayed. If the user selects a triangle, the shape of the current screenshot frame is changed to a triangle.
  • the screenshot frame when the screenshot frame is displayed in response to the recognition touch recognition failure, if the adjustment of the screenshot frame has not been received, the display of the screenshot frame is maintained, and the identification is not performed for the time being. , Or after waiting for the preset time, if the adjustment of the screenshot box has not been received, the content in the screenshot box is identified; after receiving the adjustment of the screenshot box, the content in the adjusted screenshot box is identified Recognize and get the recognition result.
  • the content in the screenshot frame can be identified.
  • the user may make multiple adjustments to the screenshot frame, that is, one adjustment has not reached the adjustment result desired by the user, and multiple adjustments are required.
  • wait for a preset time If the adjustment of the screenshot frame is not received within the preset time, the content in the screenshot frame starts to be identified. If the adjustment of the screenshot frame is received within a preset time, the shape or position of the screenshot frame is changed in response to the adjustment operation. It can be understood that the preset time should not be too long, for example, it can be 1 second. Specifically, a timer can be set. When the screenshot box starts to be displayed, the timer starts timing.
  • the timer does not receive the adjustment of the screenshot frame when the timer reaches the preset time length, the timer is set to zero to identify the content in the screenshot frame. If an adjustment operation on the screenshot frame is received before the timer reaches the preset time length, the timer is set to zero, and the screenshot frame is adjusted according to the adjustment operation. After the operation of adjusting the screenshot frame ends, the timer starts counting. When the timer does not receive the adjustment of the screenshot frame when it reaches the preset time length again, the timer is set to zero to identify the contents of the screenshot frame; if the timer receives the time before the preset time length is received again When the screenshot frame is adjusted, the timer is set to zero, and the screenshot frame is adjusted according to the adjustment operation. After the adjustment operation of the screenshot frame is finished, the timer is counted, and so on. The timer of the controller determines whether to start recognition.
  • the end of the adjustment operation of the screenshot frame may be the end of the touch on the adjustment.
  • the adjustment of the screenshot frame starts, and the size of the screenshot frame is adjusted according to the dragging of the border line, during which the finger keeps in contact with the display screen; when it is determined that the finger is away Display, the adjustment is over.
  • step S230 when the screenshot frame is displayed in response to the failure of identifying touch recognition, the content in the screenshot frame is identified to obtain a recognition result. If the adjustment of the screenshot frame is received, the content of the adjusted screenshot frame is identified, and the recognition result is updated according to the identification of the adjusted screenshot frame. That is, as shown in FIG. 5, in this embodiment, after step S230, it may further include step S240: receiving adjustment to the size or position of the screenshot frame. Step S250: Perform screenshot identification on the adjusted screenshot frame.
  • step S250 Perform screenshot identification on the adjusted screenshot frame.
  • whether to start recognition can also be determined by the timer.
  • the timer starts counting from zero. If the timer again receives the adjustment operation of the screenshot frame before the preset time length expires, the timer is set to zero, the screenshot frame is adjusted according to the adjustment operation, and the adjustment operation of the screenshot frame ends. After that, the timer starts to count; if the timer does not receive adjustments to the screenshot frame when the timer reaches a preset length of time, the timer is set to zero to identify the content in the screenshot frame.
  • whether the recognition is started or not can be determined by a user.
  • recognition may be started.
  • the user adjusts the screenshot frame, it is determined that the content in the screenshot frame includes the content to be identified, and a command to start the identification is issued, and the mobile terminal starts the identification.
  • Receiving the relevant command for the start of recognition may be: receiving a selection of a target type; providing a touch button dedicated to recognize the start, receiving a touch to the touch button; or receiving a gesture corresponding to the start of recognition Operations, such as clicking or double-clicking in the screenshot box.
  • the recognition in the screenshot box may include word segmentation and corresponding search for the corresponding content
  • the recognition result may include the word segmentation result, the introduction, link, location map of the film and television, books, characters, etc., and the purchase channel of the product.
  • schedule information, courier information, and the like are not limited in the embodiments of the present application, and may be any interpretation information of the displayed content.
  • FIG. 9 shows a specific manner of displaying a recognition result.
  • the displayed recognition result may include segmentation of corresponding content. The user may select a word from the segmentation result and then copy, select all, translate, or search.
  • the recognition result may be displayed by a card.
  • the card C displays the recognition result.
  • the card is a carrier for displaying information, and may be a control or a combination of multiple controls.
  • the information displayed by the card in the embodiment of the present application may be information corresponding to the recognition result.
  • different recognition results of the same display content can be displayed, or different recognition results of the same display content can be displayed on different cards.
  • a prompt message that the recognition fails may be displayed.
  • the content recognition method when a recognition touch is received, content recognition is performed on a user interface. If the identification fails, a screenshot frame is provided for selecting the user interface.
  • the content of the adjusted screenshot box is identified, so that users can adjust the size and position of the screenshot box according to their own needs, and select the content box that they want to identify in the screenshot Recognize in the frame to get the desired recognition result.
  • An embodiment of the present application further provides a content recognition device 300.
  • the device 300 includes: a first recognition module 310 configured to perform content processing on the user interface when receiving a recognition touch on the user interface. Identify.
  • a frame selection module 320 is configured to display an adjustable screenshot frame on the user interface if the content identification of the user interface fails.
  • the second identification module 330 is configured to identify content in the screenshot frame.
  • the first recognition module 310 may include a control determining unit for determining a text control corresponding to a touch position for recognizing a touch, and a recognition unit for acquiring text in the text control for recognition.
  • the first recognition module may include a position determination unit for determining a touch position for recognizing a touch, and an image acquisition unit for presetting the touch position in the user interface. Take a screenshot within the scope; a recognition unit, configured to perform text recognition on the picture obtained by the screenshot.
  • the device may further include: a type determination module configured to receive a target recognition type selected by the user from one or more recognition types; a second recognition module configured to correspond to the content corresponding to the target recognition type in the screenshot frame For identification.
  • a type determination module configured to receive a target recognition type selected by the user from one or more recognition types
  • a second recognition module configured to correspond to the content corresponding to the target recognition type in the screenshot frame For identification.
  • the device may further include: an adjustment module, configured to receive adjustment of the size or position of the screenshot frame.
  • the second recognition module is further configured to perform screenshot recognition on the adjusted screenshot frame.
  • the frame selection module 320 may be further configured to display the screenshot frame at a preset position in a preset size. Alternatively, it is used to determine a control corresponding to the touch position for identifying touch; and an adjustable screenshot frame including the control frame is displayed on the user interface.
  • a manual screenshot box is displayed so that the user can manually select the area that needs to be captured. Then, the captured picture is identified.
  • the user can select whether it is a two-dimensional code recognition, a text recognition, or an article recognition. Among them, preliminary analysis can be performed on the parsed results, and garbled characters and characters are filtered. If there is no valid text after filtering, a process that does not recognize any content is taken, and if there is valid text after filtering, it is passed to text recognition.
  • an embodiment of the present application further provides a mobile terminal 400.
  • the mobile terminal 400 includes a display screen 120, a memory 104, and a processor 102.
  • the display screen 120 and the memory 104 are coupled to the processor 102.
  • the display screen 120 is used to display content.
  • Parsing recognition results, etc. the memory 104 stores instructions, and when the instructions are executed by the processor 102, the processor 102 executes the method provided in the embodiment of the present application.
  • the mobile terminal 400 may include an electronic body portion 10, and the electronic body portion 10 includes a casing 12 and a display screen 120 disposed on the casing 12.
  • the casing 12 can be made of metal, such as steel and aluminum alloy.
  • the display screen 120 generally includes a display panel 111, and may also include a circuit for responding to a touch operation on the display panel 111, and the like.
  • the display panel 111 may be a liquid crystal display (Liquid Crystal Display, LCD). In some embodiments, the display panel 111 is a touch screen 109 at the same time.
  • the mobile terminal 400 can be used as a smart phone terminal.
  • the electronic body portion 10 usually further includes one or more (only shown in the figure) (A) The processor 102, the memory 104, an RF (Radio Frequency) module 106, an audio circuit 110, a sensor 114, an input module 118, and a power module 122.
  • a person of ordinary skill in the art can understand that the structure shown in FIG. 13 is only for illustration, and it does not limit the structure of the electronic body portion 10.
  • the electronic body portion 10 may further include more or fewer components than those shown in FIG. 13, or have a different correspondence from that shown in FIG. 13.
  • peripheral interface 124 may be implemented based on the following standards: Universal Asynchronous Receiver / Transmitter (UART), General Input / Output (GPIO), Serial Peripheral Interface , SPI), Inter-Integrated Circuit (I2C), but not limited to the above standards.
  • UART Universal Asynchronous Receiver / Transmitter
  • GPIO General Input / Output
  • SPI Serial Peripheral Interface
  • I2C Inter-Integrated Circuit
  • the peripheral interface 124 may only include a bus; in other examples, the peripheral interface 124 may further include other elements, such as one or more controllers, for example, for connecting the display panel.
  • these controllers can also be separated from the peripheral interface 124 and integrated into the processor 102 or a corresponding peripheral.
  • the memory 104 may be used to store software programs and modules, and the processor 102 executes various functional applications and data processing by running the software programs and modules stored in the memory 104.
  • the memory 104 may include a high-speed random access memory, and may further include a non-volatile memory, such as one or more magnetic storage devices, a flash memory, or other non-volatile solid-state memory.
  • the memory 104 may further include memories remotely disposed with respect to the processor 102, and these remote memories may be connected to the electronic body portion 10 or the display screen 120 through a network. Examples of the above network include, but are not limited to, the Internet, an intranet, a local area network, a mobile communication network, and combinations thereof.
  • the RF module 106 is used to receive and send electromagnetic waves, to realize mutual conversion between electromagnetic waves and electrical signals, and to communicate with a communication network or other equipment.
  • the RF module 106 may include various existing circuit elements for performing these functions, such as an antenna, a radio frequency transceiver, a digital signal processor, an encryption / decryption chip, a subscriber identity module (SIM) card, a memory, and the like .
  • the RF module 106 can communicate with various networks such as the Internet, an intranet, and a wireless network, or communicate with other devices through a wireless network.
  • the wireless network may include a cellular telephone network, a wireless local area network, or a metropolitan area network.
  • the above wireless network can use various communication standards, protocols and technologies, including but not limited to Global System for Mobile Communication (GSM), Enhanced Data Communication Technology (GSM) Environment, EDGE, Broadband Code Division multiple access technology (wideband code division multiple access, W-CDMA), code division multiple access technology (Code division access, CDMA), time division multiple access technology (time division multiple access, TDMA), wireless fidelity technology (Wireless, Fidelity , WiFi) (such as the American Institute of Electrical and Electronics Engineers standards IEEE 802.10A, IEEE 802.11b, IEEE802.11g, and / or IEEE 802.11n), Voice over Internet (Internet Protocol, VoIP), Global Microwave Interoperability (Worldwide Interoperability for Microwave Access (Wi-Max), other protocols for mail, instant messaging, and short messaging, and any other suitable communication protocol, even those that have not yet been developed.
  • GSM Global System for Mobile Communication
  • GSM Global System for Mobile Communication
  • GSM Global System for Mobile Communication
  • GSM Global System for Mobile Communication
  • EDGE Broadband
  • the audio circuit 110, the speaker 101, the microphone 103, and the microphone 105 collectively provide an audio interface between the user and the electronic body portion 10 or the display screen 120.
  • the sensor 114 is disposed in the electronic body portion 10 or the display screen 120.
  • Examples of the sensor 114 include, but are not limited to, an acceleration sensor 114F, a gyroscope 114G, a magnetometer 114H, and other sensors.
  • the input module 118 may include the touch screen 109 provided on the display screen 120, and the touch screen 109 may collect a touch operation performed by the user on or near the touch screen (such as a user using a finger, a stylus, etc.) Any suitable object or accessory is operated on or near the touch screen 109), so that the user ’s touch gesture can be obtained, and the corresponding connection device is driven according to a preset program, so the user can The touch operation of the display selects the target area.
  • the touch screen 109 may include a touch detection device and a touch controller. The touch detection device detects a user's touch position, and detects a signal caused by a touch operation, and transmits the signal to the touch controller.
  • the touch controller receives touch information from the touch detection device, and The touch information is converted into touch point coordinates, and then sent to the processor 102, and can receive and execute commands sent by the processor 102.
  • various types such as resistive, capacitive, infrared, and surface acoustic wave can be used to implement the touch detection function of the touch screen 109.
  • the input module 118 may further include other input devices, such as keys 107.
  • the keys 107 may include, for example, character keys for inputting characters, and control keys for triggering control functions. Examples of the control buttons include a "return to the home screen" button, an on / off button, and the like.
  • the display screen 120 is used to display information input by the user, information provided to the user, and various graphical user interfaces of the electronic body portion 10, and these graphical user interfaces may include graphics, text, icons, numbers, videos, and other Composed of any combination.
  • the touch screen 109 may be disposed on the display panel 111 so as to form a whole with the display panel 111.
  • the power module 122 is configured to provide power to the processor 102 and other components.
  • the power module 122 may include a power management system, one or more power sources (such as a battery or AC power), a charging circuit, a power failure detection circuit, an inverter, a power status indicator, and any other electronic components Components related to the generation, management and distribution of power in the display 10 or the display 120.
  • the mobile terminal 400 further includes a locator 119, which is configured to determine an actual location where the mobile terminal 400 is located.
  • the locator 119 uses a positioning service to implement positioning of the mobile terminal 400.
  • the positioning service should be understood as obtaining position information (such as latitude and longitude coordinates) of the mobile terminal 400 through a specific positioning technology. ), A technology or service for marking the location of an object on an electronic map.
  • the above-mentioned mobile terminal 400 is not limited to a smart phone terminal, and it should refer to a computer device that can be used in mobile. Specifically, the mobile terminal 400 refers to a mobile computer device equipped with a smart operating system.
  • the mobile terminal 400 includes, but is not limited to, a smart phone, a smart watch, a tablet computer, and the like.
  • first and second are used for descriptive purposes only and cannot be understood as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Therefore, the features defined as “first” and “second” may explicitly or implicitly include at least one of the features. In the description of the present application, the meaning of "plurality” is at least two, for example, two, three, etc., unless it is specifically and specifically defined otherwise.
  • Any process or method description in a flowchart or otherwise described herein can be understood as representing a module, fragment, or portion of code that includes one or more executable instructions for implementing a particular logical function or step of a process
  • the scope of the preferred embodiments of the present application includes additional implementations, in which the functions may be performed out of the order shown or discussed, including performing functions in a substantially simultaneous manner or in the reverse order according to the functions involved, which should It is understood by those skilled in the art to which the embodiments of the present application pertain.
  • a sequenced list of executable instructions that can be considered to implement a logical function can be embodied in any computer-readable medium,
  • the instruction execution system, device, or device such as a computer-based system, a system including a processor, or other system that can fetch and execute instructions from the instruction execution system, device, or device), or combine these instruction execution systems, devices, or devices Or equipment.
  • a "computer-readable medium” may be any device that can contain, store, communicate, propagate, or transmit a program for use by or in connection with an instruction execution system, apparatus, or device.
  • computer-readable media include the following: electrical connections (mobile terminals) with one or more wirings, portable computer disk enclosures (magnetic devices), random access memory (RAM), Read-only memory (ROM), erasable and editable read-only memory (EPROM or flash memory), fiber optic devices, and portable optical disk read-only memory (CDROM).
  • the computer-readable medium may even be paper or other suitable medium on which the program can be printed, because, for example, by optically scanning the paper or other medium, followed by editing, interpretation, or other suitable Processing to obtain the program electronically and then store it in computer memory.
  • each part of the application may be implemented by hardware, software, firmware, or a combination thereof.
  • multiple steps or methods may be implemented by software or firmware stored in a memory and executed by a suitable instruction execution system.
  • a suitable instruction execution system For example, if implemented in hardware, as in another embodiment, it may be implemented using any one or a combination of the following techniques known in the art: Discrete logic circuits, application specific integrated circuits with suitable combinational logic gate circuits, programmable gate arrays (PGA), field programmable gate arrays (FPGA), etc.
  • each functional unit in each embodiment of the present application may be integrated into one processing module, or each unit may exist separately physically, or two or more units may be integrated into one module.
  • the above integrated modules can be implemented in the form of hardware or software functional modules. If the integrated module is implemented in the form of a software functional module and sold or used as an independent product, it may also be stored in a computer-readable storage medium.
  • the aforementioned storage medium may be a read-only memory, a magnetic disk, or an optical disk.

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • User Interface Of Digital Computer (AREA)
  • Telephone Function (AREA)

Abstract

Embodiments of the present application relate to the technical field of mobile terminals. Disclosed are a content identification method and device, and a mobile terminal. The method comprises: performing content identification on a user interface when an identification touch for the user interface is received; displaying an adjustable screenshot box on the user interface if the content identification of the user interface is unsuccessful; and identifying content in the screenshot box. In the method, the content identification can be directly performed on the user interface, so that operation is simple and convenient.

Description

内容识别方法、装置及移动终端Content recognition method, device and mobile terminal
本申请要求于2018年6月08日提交的申请号为201810588338.1的中国专利申请的优先权,在此通过引用将其全部内容并入本文。This application claims priority from Chinese Patent Application No. 201810588338.1, filed on June 08, 2018, the entire contents of which are incorporated herein by reference.
技术领域Technical field
本申请涉及移动终端技术领域,更具体地,涉及一种内容识别方法、装置及移动终端。The present application relates to the technical field of mobile terminals, and more particularly, to a content recognition method, device, and mobile terminal.
背景技术Background technique
移动终端的显示屏可以显示各种内容,若用户想要获取其中某些显示内容的详细信息,则需要将相应的内容复制到浏览器搜索框,操作过程繁琐。The display screen of the mobile terminal can display various contents. If a user wants to obtain detailed information of some of the displayed contents, the corresponding content needs to be copied to a browser search box, and the operation process is tedious.
发明内容Summary of the Invention
鉴于上述问题,本申请提出了一种内容识别方法、装置及移动终端,用于对用户界面的内容进行识别,简化了识别过程,提高用户体验。In view of the above problems, this application proposes a content recognition method, device, and mobile terminal, which are used to identify the content of the user interface, simplify the recognition process, and improve the user experience.
第一方面,本申请实施例提供了一种内容识别方法,所述方法包括:接收到对用户界面的识别触控时,对所述用户界面进行内容识别;若对所述用户界面的内容识别失败,在所述用户界面显示可调整的截图框;对所述截图框内的内容进行识别。According to a first aspect, an embodiment of the present application provides a content recognition method. The method includes: when a recognition touch is received on a user interface, performing content recognition on the user interface; Failure, displaying an adjustable screenshot frame in the user interface; identifying the content within the screenshot frame.
第二方面,本申请实施例提供了一种内容识别装置,所述装置包括:第一识别模块,用于接收到对用户界面的识别触控时,对所述用户界面进行内容识别;框选模块,用于若对所述用户界面的内容识别失败,在所述用户界面显示可调整的截图框;第二识别模块,用于对所述截图框内的内容进行识别。In a second aspect, an embodiment of the present application provides a content recognition device. The device includes: a first recognition module, configured to perform content recognition on the user interface when receiving a recognition touch on the user interface; A module configured to display an adjustable screenshot frame in the user interface if the content identification of the user interface fails; a second identification module is configured to identify the content in the screenshot frame.
第三方面,本申请实施例提供了一种移动终端,包括显示屏、存储器及处理器,所述显示屏及所述存储器耦接到所述处理器,所述存储器存储指令,当所述指令由所述处理器执行时所述处理器上述的方法。According to a third aspect, an embodiment of the present application provides a mobile terminal including a display screen, a memory, and a processor. The display screen and the memory are coupled to the processor. The memory stores instructions. The method described above by the processor when executed by the processor.
第四方面,本申请实施例提供了一种具有处理器可执行的程序代码的计算机可读存储介质,所述程序代码使所述处理器执行上述的方法。In a fourth aspect, an embodiment of the present application provides a computer-readable storage medium having a processor-executable program code, where the program code causes the processor to perform the foregoing method.
本申请提供的内容识别方法、装置及移动终端,在响应于识别触控对用户界面的内容识别失败的情况下,显示可调整的截图框,并对截图框的内容进行识别,从而可以直接在用户界面进行内容识别,操作简单方便。The content recognition method, device and mobile terminal provided in this application display an adjustable screenshot frame and identify the content of the screenshot frame in the case that the content recognition of the user interface fails in response to the recognition touch. User interface for content identification, simple and convenient operation.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
为了更清楚地说明本申请实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得 其他的附图。In order to explain the technical solutions in the embodiments of the present application more clearly, the drawings used in the description of the embodiments will be briefly introduced below. Obviously, the drawings in the following description are just some embodiments of the application. For those skilled in the art, other drawings can be obtained based on these drawings without paying creative labor.
图1示出了本申请一实施例提出的内容识别方法的流程图;FIG. 1 shows a flowchart of a content identification method according to an embodiment of the present application;
图2示出了本申请实施例提出的第一显示示意图;FIG. 2 shows a first display schematic diagram provided by an embodiment of the present application; FIG.
图3示出了本申请实施例提出的第二显示示意图;FIG. 3 shows a second display schematic diagram provided by an embodiment of the present application;
图4示出了本申请实施例提出的第三显示示意图;FIG. 4 shows a third display schematic diagram provided by an embodiment of the present application;
图5示出了本申请另一实施例提出的内容识别方法的流程图;FIG. 5 is a flowchart of a content identification method according to another embodiment of the present application; FIG.
图6示出了本申请实施例提出的第四显示示意图;FIG. 6 shows a fourth display schematic diagram provided by an embodiment of the present application;
图7示出了本申请实施例提出的第五显示示意图;FIG. 7 shows a fifth display schematic diagram provided by an embodiment of the present application;
图8示出了本申请实施例提出的第六显示示意图;FIG. 8 shows a sixth display schematic diagram provided by an embodiment of the present application;
图9示出了本申请实施例提出的第七显示示意图;FIG. 9 shows a seventh display schematic diagram provided by an embodiment of the present application;
图10示出了本申请实施例提出的内容识别装置的功能模块图;FIG. 10 shows a functional module diagram of a content recognition device according to an embodiment of the present application; FIG.
图11示出了本申请实施例提出的移动终端的一种结构框图;FIG. 11 shows a structural block diagram of a mobile terminal according to an embodiment of the present application;
图12示出了本申请实施例提出的移动终端的一种结构示意图;FIG. 12 is a schematic structural diagram of a mobile terminal according to an embodiment of the present application; FIG.
图13示出了本申请实施例的用于执行根据本申请实施例的内容识别方法的移动终端的框图。FIG. 13 shows a block diagram of a mobile terminal for performing a content recognition method according to an embodiment of the present application.
具体实施方式Detailed ways
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are only a part of the embodiments of the present application, but not all of the embodiments. Based on the embodiments in this application, all other embodiments obtained by a person of ordinary skill in the art without creative efforts shall fall within the protection scope of this application.
目前,用户在通过移动终端上网聊天、阅读文字、查看图片或者观看视频时,经常会对其中的一些内容产生兴趣并进行搜索获取更加详细的信息。此时,用户首先需要复制感兴趣的内容或牢记感兴趣的内容,然后打开浏览器,并将复制的内容粘贴到浏览器的搜索框中或将牢记的内容输入到浏览器的搜索框中进行搜索以获得详细信息,导致操作过程十分的繁琐,耗时较长且容易产生错误。At present, when users use the mobile terminal to chat, read text, view pictures, or watch videos on the Internet, they often become interested in some of them and search for more detailed information. At this time, the user first needs to copy the content of interest or remember the content of interest, then open the browser, and paste the copied content into the browser's search box or enter the content of the memory into the browser's search box. Searching for detailed information results in tedious, time-consuming and error-prone operations.
进一步地,为了解决搜索的操作过程繁琐的问题,可以通过压力感应等技术对显示的内容进行选取并将选取的内容进行识别,获得识别结果,以提升信息的获取速度。但是,发明人经过大量的研究发现,直接响应于用户在用户界面的触控进行内容的识别,其识别可能失败,无法获得用户想要的识别结果。Further, in order to solve the problem of tedious operation process of searching, the displayed content may be selected by using pressure sensing and other technologies and the selected content may be identified to obtain a recognition result, so as to improve the speed of obtaining information. However, the inventors have discovered through a large number of studies that, in response to the user's touch on the user interface for content recognition, the recognition may fail and the recognition result desired by the user may not be obtained.
针对上述技术问题,本申请实施例提出了一种内容识别方法、装置及移动终端,在识别失败的情况下,在用户界面显示可调整的截图框,从而可以通过在用户界面重新框选内容进行识别,获得契合于用户需求的识别结果。In view of the above technical problems, an embodiment of the present application proposes a content recognition method, device, and mobile terminal. In the case of recognition failure, an adjustable screenshot frame is displayed on the user interface, so that the content can be selected by re-selecting the content in the user interface. Recognize and get recognition results that fit the needs of the user.
下面将结合附图并通过具体的实施例对本申请实施例提供的内容识别方法、装置及移动终端进行说明。The content identification method, device, and mobile terminal provided in the embodiments of the present application will be described below with reference to the drawings and specific embodiments.
请参阅图1,本申请实施例提供了一种内容识别方法,所述内容识别方法用于对显示 屏显示的用户界面中的全部或部分内容进行识别。在具体的实施例中,所述内容识别方法应用于如图10所示的内容识别装置以及对应有内容识别装置300的移动终端400(图11、图12)。上述的内容识别方法具体可以包括以下步骤:Referring to FIG. 1, an embodiment of the present application provides a content identification method. The content identification method is used to identify all or part of content in a user interface displayed on a display screen. In a specific embodiment, the content recognition method is applied to a content recognition device as shown in FIG. 10 and a mobile terminal 400 (FIG. 11 and FIG. 12) corresponding to the content recognition device 300. The above content identification method may specifically include the following steps:
步骤S110:接收到对用户界面的识别触控时,对所述用户界面进行内容识别。Step S110: When a recognition touch on the user interface is received, content recognition is performed on the user interface.
用户想要对用户界面的某些内容进行识别以获得该内容更详细的信息时,可以在用户界面进行识别触控。其中,该识别触控具体所对应的触控操作在本申请实施例中并不限定,如单指长按、双指长按、多指长按、指关节长按、单指点击、双指点击、多指点击、指关节敲击、单指大面积按压、双指大面积按压、多指大面积按压,单指、双指或多指按预设轨迹滑动等。若该触控操作为按预设轨迹滑动,其滑动轨迹可以是封闭图形,以对封闭图形内的内容进行识别。When a user wants to identify certain content of the user interface to obtain more detailed information of the content, the user can perform recognition touch on the user interface. The specific touch operation corresponding to the recognition touch is not limited in the embodiments of the present application, such as single-finger long press, two-finger long press, multi-finger long press, knuckle long press, single-finger tap, two-finger Tap, multi-finger tap, knuckle tap, single-finger large-area compression, two-finger large-area compression, multi-finger large-area compression, single-finger, two-finger, or multi-finger sliding on a preset trajectory, etc. If the touch operation is to slide according to a preset trajectory, the sliding trajectory may be a closed graphic to identify the content in the closed graphic.
移动终端在接收到识别触控时,对用户界面中该识别触控所对应的内容进行识别。作为一种具体的实施方式,识别的显示内容可以是该用户界面中的所有内容。在该实施方式中,可以对移动终端中固定栏位以外的所有内容进行识别,如移动终端显示屏显示某应用程序,则识别该应用程序当前在显示屏中显示的所有内容。When the mobile terminal receives the recognition touch, it recognizes the content corresponding to the recognition touch in the user interface. As a specific implementation manner, the identified display content may be all content in the user interface. In this embodiment, all content except a fixed field in the mobile terminal can be identified. If an application is displayed on the display of the mobile terminal, all content currently displayed on the display by the application is identified.
作为一种具体的实施方式,识别的显示内容可以是该触控操作在用户界面中触控位置所对应的内容。其中,触控位置具体对应的内容可以是,触控位置所在的文字段落,触控位置所在的图片,触控位置所在的控件。例如图2所示,显示的界面为被触控的用户界面,圆圈A表示触控位置,以圆圈A所在的文字段落作为需要识别的显示内容。As a specific implementation manner, the identified display content may be content corresponding to the touch position of the touch operation in the user interface. The specific content corresponding to the touch position may be a text paragraph where the touch position is located, a picture where the touch position is located, and a control where the touch position is located. For example, as shown in FIG. 2, the displayed interface is a touched user interface, the circle A indicates the touch position, and the text segment where the circle A is located is used as the display content to be identified.
作为一种具体的实施方式,识别的显示内容为触控位置对应的文本控件中显示的文字。具体可以包括:确定所述识别触控的触控位置对应的文本控件;获取所述文本控件中的文本进行识别。其中,触控位置对应的文本控件,可以是离触控位置最近的文本控件。As a specific implementation manner, the recognized display content is a text displayed in a text control corresponding to a touch position. Specifically, the method may include: determining a text control corresponding to the touch position of the recognition touch; and acquiring text in the text control for recognition. The text control corresponding to the touch position may be a text control closest to the touch position.
也就是说,触控位置若触控在文本控件上,则以该被触控的文本控件作为待识别的文本控件。例如图2所示的聊天界面中,圆圈A表示触控位置,聊天信息对应的文本控件B为该触控位置所触控到的文本控件,则对该文本控件B中的聊天信息进行识别。That is, if the touch position is touched on the text control, the touched text control is used as the text control to be recognized. For example, in the chat interface shown in FIG. 2, a circle A indicates a touch position, and a text control B corresponding to the chat information is a text control touched by the touch position, and the chat information in the text control B is identified.
触控位置若触控在非文本控件的位置,则可以对离该触控位置最近的文本控件中的文本进行识别。例如图3所示的聊天界面中,圆圈A表示触控位置,未触控在文本控件上,聊天信息对应的文本控件B为离该触控位置最近的文本控件,则对该文本控件B中的聊天信息进行识别。If the touch position is at a position other than the text control, the text in the text control closest to the touch position can be recognized. For example, in the chat interface shown in FIG. 3, circle A indicates a touch position, and no touch is on the text control. The text control B corresponding to the chat information is the text control closest to the touch position. Identification of chat messages.
其中对显示内容的识别可以采用一些现有的识别方式,如分词、语义识别等,在此不做限定。The recognition of the displayed content may use some existing recognition methods, such as word segmentation and semantic recognition, which are not limited here.
另外,对显示内容的识别也可以是通过后台截图,对需要识别的内容进行截图后,通过图片分析进行识别,如通过OCR(Optical Character Recognition,光学字符识别)识别。具体如,确定所述识别触控在用户界面中的触控位置;在所述用户界面中,在所述触控位置的预设范围内进行截图;对所述截图获得的图片进行文本识别。该预设范围的大小在本申请实施例中并不限定,可以是预设长宽的矩形范围、预设半径的圆形范围或者其他预设形状 以及预设大小的范围。In addition, the display content can also be identified through background screenshots. After taking screenshots of the content that needs to be identified, it can be identified through image analysis, such as OCR (Optical Character Recognition). Specifically, determining a touch position of the recognition touch in a user interface; in the user interface, taking a screenshot within a preset range of the touch position; and performing text recognition on an image obtained by the screenshot. The size of the preset range is not limited in the embodiments of the present application, and may be a rectangular range with a preset length and width, a circular range with a preset radius, or other preset shapes and a range with a preset size.
步骤S120:若对所述用户界面的内容识别失败,在用户界面显示可调整的截图框。Step S120: if the content identification of the user interface fails, an adjustable screenshot frame is displayed on the user interface.
由于各种原因,对用户界面的内容的识别可能失败。其中识别失败的原因可能有多种,例如,网络连接不稳定、移动终端未连接网络等网络问题导致无法识别或者识别超时;或如,响应于识别触控,无法获得有效的识别内容;或如需要识别的内容为不健康信息等,在此不做穷举。The identification of the content of the user interface may fail for various reasons. There may be multiple reasons for recognition failure, for example, network problems such as unstable network connection, mobile terminal not connected to the network, etc., lead to failure to recognize or timeout; The content to be identified is unhealthy information, etc., and is not exhaustive here.
若识别失败,未获得识别结果,可以在用户界面显示可调整的截图框。如图4,该截图框K可以是如图所示的矩形框,也可以是其他形状的封闭图形,如圆形、棱形、三角形、多边形等任意形状。其中,该用户界面为识别触控时显示屏显示的用户界面。If the recognition fails and no recognition result is obtained, an adjustable screenshot frame can be displayed in the user interface. As shown in FIG. 4, the screenshot frame K may be a rectangular frame as shown in the figure, or may be a closed shape of other shapes, such as any shape such as a circle, a prism, a triangle, and a polygon. The user interface is a user interface displayed on a display screen when a touch is recognized.
步骤S130:对所述截图框内的内容进行识别。Step S130: identify the content in the screenshot frame.
如图4所示,截图框K中对应有用户界面的内容。截图框中的内容表示需要进行识别的内容,因此,可以对截图框内的内容进行识别。As shown in FIG. 4, the content of the user interface is corresponding to the screenshot box K. The content in the screenshot box indicates the content that needs to be identified. Therefore, the content in the screenshot box can be identified.
在本申请实施例中,在接收到对用户界面的识别触控,对用户界面进行内容识别却识别失败的情况下,通过在用户界面显示可调整的截图框,通过截图框对用户界面进行框选,再对框选的内容进行识别。从而,在识别失败的情况下,可以通过框选按键重新在用户界面进行识别内容的重新框选并识别。In the embodiment of the present application, when an identification touch on the user interface is received, but the content identification of the user interface fails, but the recognition fails, the adjustable screenshot frame is displayed on the user interface, and the user interface is framed by the screenshot frame. Select, and then identify the contents of the box selection. Therefore, in the case of recognition failure, the frame selection button can be used to reframe and identify the recognition content in the user interface again.
在本申请实施例中,可调整的截图框可以由用户根据需求进行调整,使截图框内的内容为用户想要识别的内容。具体的,请参见图5,在本申请实施例所提供的方法中,包括:In the embodiment of the present application, the adjustable screenshot frame can be adjusted by the user according to requirements, so that the content in the screenshot frame is the content that the user wants to identify. Specifically, referring to FIG. 5, the method provided in the embodiment of the present application includes:
步骤S210:接收到对用户界面的识别触控时,对所述用户界面进行内容识别。Step S210: When a recognition touch on the user interface is received, content recognition is performed on the user interface.
用户可以在用户界面发起对显示内容进行识别的识别触控操作。其中,如前所述,该用户界面可以是聊天界面、网页界面、视频界面、各种应用程序的用户界面等,在本申请实施例中并不限定。接收到用户的触控时,在所触控的用户界面上进行内容识别。The user can initiate a recognition touch operation to identify the displayed content in the user interface. Wherein, as described above, the user interface may be a chat interface, a web interface, a video interface, a user interface of various applications, and the like, which are not limited in the embodiments of the present application. When a user touch is received, content recognition is performed on the touched user interface.
步骤S220:若对所述用户界面的内容识别失败,在用户界面显示可调整的截图框。Step S220: If the content identification of the user interface fails, an adjustable screenshot frame is displayed on the user interface.
若识别失败,未获得对所触控的用户界面中的内容的识别结果,在用户界面显示截图框,对用户界面进行框选识别。If the recognition fails, the recognition result of the content in the touched user interface is not obtained, a screenshot frame is displayed on the user interface, and the user interface is selected and identified.
作为一种具体的实施方式,在用户界面显示可调整的截图框时,可以是如图4所示,在显示屏显示的界面为识别触控时所触控的用户界面,在该用户界面显示截图框K。As a specific implementation manner, when the adjustable screenshot frame is displayed on the user interface, as shown in FIG. 4, the interface displayed on the display screen is the user interface touched when the touch is recognized, and is displayed on the user interface. Screenshot box K.
作为一种具体的实施方式,如图6所示,在用户界面显示可调整的截图框时,可以将触控的用户界面缩小显示,在该缩小的用户界面中显示截图框K。在缩小的用户界面以外的位置,可以显示辅助内容,辅助内容可以是如识别二维码、识别商品、识别文本等一种或多种识别类型的可选项,辅助内容可以是截图框调整的提示信息,如图6中的“选择需要识别的区域”,辅助内容也可以是退出截图识别的控制按钮。As a specific implementation, as shown in FIG. 6, when an adjustable screenshot frame is displayed on the user interface, the touch-sensitive user interface may be reduced and displayed, and the screenshot frame K is displayed in the reduced user interface. Outside the reduced user interface, auxiliary content can be displayed. The auxiliary content can be one or more types of recognition options such as identifying QR codes, identifying products, and identifying text. The auxiliary content can be a prompt for adjusting the screenshot box. Information, such as "select an area to be identified" in FIG. 6, and the auxiliary content may also be a control button for exiting the screenshot identification.
可选的,在显示截图框时,用户界面中只有截图框可以操作,其他位置不可操作。Optionally, when the screenshot box is displayed, only the screenshot box can be operated in the user interface, and other locations cannot be operated.
在本申请实施例中,在识别失败开始显示截图框时,截图框首先显示的位置并不限定。In the embodiment of the present application, when the screenshot frame starts to be displayed when the recognition fails, the position where the screenshot frame is first displayed is not limited.
作为一种实施方式,可以将截图框以预设大小显示在预设位置。该预设大小可以是预 先设置的固定大小,也可以是预先设置的与用户界面的比例大小。该预设位置可以是在显示屏的某个固定位置,也可以是在被触控的用户界面的任意某个位置。As an implementation manner, the screenshot frame may be displayed at a preset position in a preset size. The preset size can be a fixed size that is set in advance, or a preset size that is proportional to the user interface. The preset position may be a certain fixed position of the display screen, or may be any certain position of the touched user interface.
作为一种实施方式,截图框的显示位置可以根据识别触控的触控位置确定。具体的,可以是,将触控位置框选在内的截图框。可选的,该截图框可以是预设大小,也可以是将触控位置对应的触控区域框选在内的最小截图框等。As an implementation manner, a display position of the screenshot frame may be determined according to a touch position that recognizes a touch. Specifically, it may be a screenshot frame that includes a touch position frame. Optionally, the screenshot frame may be a preset size, or a minimum screenshot frame including a touch area frame corresponding to a touch position.
在该实施方式中,也可以是,若触控位置在控件上,截图框将该触控位置所触控到的控件框选在内。具体的,可以确定所述识别触控的触控位置所对应的控件,该对应可以表示在用户界面中,该控件所在位置与触控位置相重叠。再在所述用户界面显示将该控件框选在内的可调整的截图框。In this embodiment, if the touch position is on the control, the screenshot frame may include the control frame touched by the touch position. Specifically, a control corresponding to the touch position for identifying the touch may be determined, and the correspondence may indicate that in a user interface, the position where the control is located overlaps the touch position. Then, an adjustable screenshot frame including the control frame is displayed on the user interface.
在该实施方式中,可选的,若触控位置在控件上,该截图框可以是将触控到的控件框选在内的、比该控件大的截图框。也可以是可将触控位置所在的控件框选在内的最小的截图框,即该截图框仅将触控到的控件框选在内,该截图框可以是预设的形状,如矩形、圆角矩形、或者和触控位置所在的控件的形状轮廓一致的形状。In this embodiment, optionally, if the touch position is on the control, the screenshot frame may be a screenshot frame that is larger than the control, including the touched control frame. It can also be the smallest screenshot box that can select the control box where the touch position is located, that is, the screenshot box only selects the control box that it touches. The screenshot box can be a preset shape, such as a rectangle, A rounded rectangle or a shape that conforms to the shape and outline of the control where the touch location is located.
可选的,在显示截图框之前,可以显示提示信息,提示用户识别失败,是否进入框选识别,若用户选择是,则在用户界面显示可调整的截图框;若用户选择否,则可以退出识别过程,对用户界面的识别结束,未获得识别结果。Optionally, before the screenshot box is displayed, a prompt message may be displayed to prompt the user if the recognition fails, whether to enter the box selection recognition. If the user selects Yes, an adjustable screenshot box is displayed in the user interface; if the user selects No, you can exit During the recognition process, the recognition of the user interface is ended, and no recognition result is obtained.
步骤S230:对所述截图框内的内容进行识别。Step S230: identify the content in the screenshot frame.
在本申请实施例中,可以对截图框所框选的内容进行识别。该识别可以是,直接获取截图框中的内容,如截图框中包括文本控件,获取文本控件中的文本,截图框中包括图片控件,直接获取该图片控件中的图片等。另外,该识别也可以是,将所述截图框框选的内容截取为图片,例如,以截图框的边缘作为截取边缘,将截图框中的所有内容进行截图获得截取的图片,再对图片中的内容进行识别。该识别过程可以首先通过图像处理获得图片中的内容,如OCR(Optical Character Recognition,光学字符识别)处理等,在此不做限制,可以通过现有的可以获得图像中文本、图片、二维码等各种内容的处理方式。In the embodiment of the present application, the content selected by the screenshot frame can be identified. The identification may be directly obtaining the content in the screenshot box, such as including a text control in the screenshot box, obtaining the text in the text control, including a picture control in the screenshot box, and directly obtaining a picture in the picture control. In addition, the recognition may also be that the content selected by the screenshot frame is taken as an image, for example, the edge of the screenshot frame is used as the interception edge, and all the content in the screenshot frame is taken to obtain the intercepted image, and then the Content is identified. The recognition process can first obtain the content in the picture through image processing, such as OCR (Optical Character Recognition, Optical Character Recognition) processing, etc. There is no restriction here, and the text, pictures, and two-dimensional code in the image can be obtained through the existing And so on.
作为一种具体的实施方式,用户界面显示截图框,对截图框中的全部或部分内容进行识别。As a specific implementation manner, the user interface displays a screenshot frame to identify all or part of the content in the screenshot frame.
作为一种具体的实施方式,在显示截图框时,同时提供一种或多种识别类型的可选项,该识别类型表示所要识别的内容所属的类型,如二维码、商品、文本、图片等。移动终端可以接收用户从一个或多个识别类型中选择的目标识别类型;对所述截图框中目标识别类型对应的内容进行识别。也就是说,接收到用户从一个或多个识别类型中选择的一个识别类型,以用户选择的识别类型作为目标识别类型,对截图框中的内容中属于目标识别类型的内容进行识别。图6示出了二维码、商品、文本三个可选的识别类型。As a specific implementation manner, when the screenshot frame is displayed, one or more optional types of identification types are provided at the same time, and the identification type indicates the type to which the content to be identified belongs, such as a QR code, a product, a text, a picture, and the like . The mobile terminal may receive a target recognition type selected by the user from one or more recognition types; and identify content corresponding to the target recognition type in the screenshot frame. That is, a recognition type selected by the user from one or more recognition types is received, and the recognition type selected by the user is used as the target recognition type to identify the content in the screenshot frame that belongs to the target recognition type. Figure 6 shows three optional recognition types: QR code, product, and text.
在该实施方式中,对不同识别类型的识别,可以是识别内容的不同,如文本识别仅对截图框中的文本内容进行解析获取识别结果,图片识别仅对截图框中的图片进行解析获取识别结果。也可以是识别方式的不同,如对文本、图片、二维码的识别,由相应识别处理 的服务器实现,如选择的目标识别类型为文本,则将截图框中的内容发送到专门用于文本识别的服务器,或者将截图框中的文本发送到专门用于文本识别的服务器,由该服务器对截图框中的文本进行识别;如选择的目标识别类型为图片,则将截图框中的内容发送到专门用于图片识别的服务器,或者将截图框中的图片发送到专门用于图片识别的服务器,由该服务器对截图框中的图片进行识别;对商品的识别则可以跳转链接到第三方购物平台,如淘宝等,将截图框中的内容传递给第三方购物平台进行识别,以从第三方购物平台获得商品的信息及购买链接。也可以是识别结果的展现不同,例如,对文本、商品、图片等的识别,可以通过分词、简介、链接等形式直接通过卡片进行展示,对商品的识别,可以通过第三方购物平台展示。In this embodiment, the recognition of different recognition types may be different in recognition content. For example, text recognition only parses the text content in the screenshot box to obtain the recognition result, and picture recognition only parses the picture in the screenshot box to obtain the recognition result. result. It can also be a different recognition method, such as recognition of text, pictures, and two-dimensional codes, which is implemented by the corresponding recognition processing server. If the selected target recognition type is text, the content in the screenshot box is sent to a dedicated text The recognized server, or the text in the screenshot box is sent to a server dedicated to text recognition, which recognizes the text in the screenshot box; if the selected target recognition type is picture, the content in the screenshot box is sent To a server dedicated to image recognition, or to send a picture in a screenshot box to a server dedicated to picture recognition, and the server recognizes the picture in the screenshot box; the identification of the product can be linked to a third party Shopping platforms, such as Taobao, pass the content in the screenshot box to a third-party shopping platform for identification, in order to obtain product information and purchase links from the third-party shopping platform. The recognition results may also be displayed differently. For example, the recognition of text, products, pictures, etc. can be displayed directly through cards in the form of word segmentation, introduction, and links. The recognition of products can be displayed through third-party shopping platforms.
具体的,识别过程可以是,对截图框中的内容进行分析,从中获得目标识别类型所对应的内容,对目标识别类型的内容进行识别。例如,若目标识别类型为二维码,则对截图框中的内容进行分析获得其中的二维码后,对二维码进行识别,获取二维码所包含的信息。又如,若目标识别类型为文本,则对截图框中的文本进行分词、解析、语义搜索等操作,并反馈该文本的识别结果。Specifically, the recognition process may be analyzing the content in the screenshot frame, obtaining the content corresponding to the target recognition type from it, and identifying the content of the target recognition type. For example, if the target recognition type is a two-dimensional code, analyze the content in the screenshot box to obtain the two-dimensional code therein, and then identify the two-dimensional code to obtain the information contained in the two-dimensional code. For another example, if the target recognition type is text, perform text segmentation, parsing, and semantic search operations on the text in the screenshot box, and feedback the recognition result of the text.
其中,可选的,当目标识别类型是文本时,可以通过对截图框进行分析获得其中的文本内容,再对文本内容进行过滤,滤除其中的乱码后,获得有效文本,再对该有效文本进行解析识别。在该实施方式中,乱码可以是预设类型文本以外的文本,如预设类型的文本为中文字符、英文字符以及选定的常用标点,则其他文字、常用标点以外的其他标点等,都被判定为乱码。Among them, optionally, when the target recognition type is text, the text content can be obtained by analyzing the screenshot box, and then the text content is filtered. After filtering out the garbled characters, valid text is obtained, and then the valid text is obtained. Perform analytical identification. In this embodiment, the garbled text may be text other than a preset type of text. For example, if the preset type of text is Chinese characters, English characters, and selected common punctuation, other characters, punctuation other than common punctuation, etc. Judged as garbled.
在该实施方式中,可选的,若用户未选择目标识别类型,可以以其中一种识别类型作为默认识别类型,对截图框中默认识别类型的内容进行识别。可选的,若用户未选择目标识别类型,也可以不进行识别,在用户选择识别类型后,再对选择的识别类型对应的内容进行识别。可选的,若用户未选择目标识别类型,也可以对所有类型进行识别。In this embodiment, optionally, if the user has not selected the target recognition type, one of the recognition types may be used as the default recognition type to identify the content of the default recognition type in the screenshot box. Optionally, if the user does not select the target recognition type, the recognition may not be performed. After the user selects the recognition type, the content corresponding to the selected recognition type is recognized. Optionally, if the user has not selected the target recognition type, all types can be recognized.
在本申请实施例中,截图框为可调整的截图框,该调整包括位置的调整以及大小的调整。移动终端可以接收对所述截图框的大小或者位置的调整;对调整后的所述截图框内的内容进行识别。In the embodiment of the present application, the screenshot frame is an adjustable screenshot frame, and the adjustment includes adjustment of position and adjustment of size. The mobile terminal may receive adjustments to the size or position of the screenshot frame; and identify the content within the adjusted screenshot frame.
例如图4及图6所示的为进入框选界面后截图框在用户界面的框选,用户可以通过对截图框的角的不同方向拉动,调整截图框的大小,如图7所示为相对于图4调小后的截图框K。用户也可以通过按压截图框的边线、内部区域等拖动截图框改变位置,如图8所示为相对于图7位置调整后的截图框。For example, Figure 4 and Figure 6 show the selection of the screenshot frame in the user interface after entering the frame selection interface. The user can adjust the size of the screenshot frame by pulling the corner of the screenshot frame in different directions. The screenshot box K after being reduced in FIG. 4. The user can also change the position of the screenshot frame by dragging the screenshot frame by pressing the border of the screenshot frame, the inner area, etc., as shown in FIG. 8, which is adjusted relative to the position of FIG. 7.
在本申请实施例中,还可以改变截图框的形状,接收到用户对截图框形状改变的请求,根据该请求,将截图框的形状改变为圆形、三角形或其它任意的多边形等。具体改变为什么形状可以由用户确定,即可以将所述截图框的形状改变为所述改变请求指定的形状。例如,用户每点击一次对形状进行改变的改变请求的按钮,按预设的形状顺序改变一次截图框形状;或者接收到用户的改变请求,显示可显示的形状选项,用户选择形状后,以用户 选择的形状作为截图框的形状,以满足不同用户的需求。例如可以对应截图框K提供形状选择按钮,接收到对该按钮的按压时,显示可选择的形状。若用户选择了三角形,则将当前截图框的形状改变为三角形。In the embodiment of the present application, the shape of the screenshot frame can also be changed, and a user's request for changing the shape of the screenshot frame is received. According to the request, the shape of the screenshot frame is changed to a circle, a triangle, or any other polygon. The specific change of the shape can be determined by the user, that is, the shape of the screenshot frame can be changed to the shape specified by the change request. For example, each time the user clicks the button of a change request to change the shape, the shape of the screenshot box is changed in the preset shape order; or when the user's change request is received, the displayable shape options are displayed. The selected shape is used as the shape of the screenshot box to meet the needs of different users. For example, a shape selection button may be provided corresponding to the screenshot frame K, and when the button is pressed, a selectable shape is displayed. If the user selects a triangle, the shape of the current screenshot frame is changed to a triangle.
作为一种具体的实施方式,在本申请实施例中,在响应于识别触控识别失败显示截图框时,若还未接收到对截图框的调整,则保持截图框的显示,暂时不进行识别,或者在等待预设时间后,若仍然未接收到对截图框的调整,则对截图框中的内容进行识别;在接收到对截图框的调整后,对调整后的截图框内的内容进行识别,获取识别结果。As a specific implementation manner, in the embodiment of the present application, when the screenshot frame is displayed in response to the recognition touch recognition failure, if the adjustment of the screenshot frame has not been received, the display of the screenshot frame is maintained, and the identification is not performed for the time being. , Or after waiting for the preset time, if the adjustment of the screenshot box has not been received, the content in the screenshot box is identified; after receiving the adjustment of the screenshot box, the content in the adjusted screenshot box is identified Recognize and get the recognition result.
可选的,在该实施方式中,接收到对截图框的调整完成,即每次调整结束,则可以开始对截图框中的内容进行识别。Optionally, in this embodiment, after the adjustment of the screenshot frame is received, that is, each time the adjustment ends, the content in the screenshot frame can be identified.
或者是,可选的,用户可能对截图框进行多次调整,即一次调整还未达到用户想要的调整结果,需要进行多次的调整,则在每次调整后,都等待预设时间。若预设时间内未接收到对截图框的调整,则开始对截图框内的内容进行识别。若预设时间内接收到截图框的调整,则响应于调整操作更改截图框的形状或位置。可以理解的,该预设时间不宜过长,例如可以是1秒。具体的,可以设置定时器,当截图框开始显示时,定时器开始计时。若定时器计时到预设时间长度时未接收到对所述截图框的调整,定时器置零,对所述截图框中的内容进行识别。若定时器计时到预设时间长度之前接收到对所述截图框的调整操作,定时器置零,根据所述调整操作对所述截图框进行调整。在对所述截图框的调整操作结束后,定时器开始计时。当定时器再次计时到预设时间长度时未接收到对所述截图框的调整,定时器置零,对所述截图框中的内容进行识别;若定时器再次计时到预设时间长度之前接收到对所述截图框的调整操作,定时器置零,根据所述调整操作对所述截图框进行调整,在对所述截图框的调整操作结束后,定时器计时,以此类推,通过定时器的计时确定是否开始识别。Alternatively, optionally, the user may make multiple adjustments to the screenshot frame, that is, one adjustment has not reached the adjustment result desired by the user, and multiple adjustments are required. After each adjustment, wait for a preset time. If the adjustment of the screenshot frame is not received within the preset time, the content in the screenshot frame starts to be identified. If the adjustment of the screenshot frame is received within a preset time, the shape or position of the screenshot frame is changed in response to the adjustment operation. It can be understood that the preset time should not be too long, for example, it can be 1 second. Specifically, a timer can be set. When the screenshot box starts to be displayed, the timer starts timing. If the timer does not receive the adjustment of the screenshot frame when the timer reaches the preset time length, the timer is set to zero to identify the content in the screenshot frame. If an adjustment operation on the screenshot frame is received before the timer reaches the preset time length, the timer is set to zero, and the screenshot frame is adjusted according to the adjustment operation. After the operation of adjusting the screenshot frame ends, the timer starts counting. When the timer does not receive the adjustment of the screenshot frame when it reaches the preset time length again, the timer is set to zero to identify the contents of the screenshot frame; if the timer receives the time before the preset time length is received again When the screenshot frame is adjusted, the timer is set to zero, and the screenshot frame is adjusted according to the adjustment operation. After the adjustment operation of the screenshot frame is finished, the timer is counted, and so on. The timer of the controller determines whether to start recognition.
其中,对截图框的调整操作结束,可以是对调整的触控结束。例如,接收到手指在显示屏对截图框的边线的触控时,对截图框的调整开始,根据对边线的拖动调整截图框的大小,期间手指一直与显示屏保持接触;当判定手指离开显示屏,则本次调整结束。The end of the adjustment operation of the screenshot frame may be the end of the touch on the adjustment. For example, when a finger touches the border of the screenshot frame on the display screen, the adjustment of the screenshot frame starts, and the size of the screenshot frame is adjusted according to the dragging of the border line, during which the finger keeps in contact with the display screen; when it is determined that the finger is away Display, the adjustment is over.
作为一种具体的实施方式,在响应于识别触控识别失败显示截图框时,对截图框中的内容进行识别,获取识别结果。若接收到对截图框的调整,则对调整后的截图框中的内容进行识别,根据对调整后的截图框的识别更新识别结果。也就是说,如图5所示,在该实施方式中,在步骤S230之后,还可以包括步骤S240:接收对所述截图框的大小或者位置的调整。步骤S250:对调整后的所述截图框进行截图识别。可选的,在该实施方式中,在截图框的大小或位置重新调整以更新识别结果的过程中,也可以通过定时器的计时确定是否开始识别。例如,若接收到对所述截图框的调整操作,一次调整操作结束后,定时器从零开始计时。若定时器计时到预设时间长度之前再次接收到对所述截图框的调整操作,定时器置零,根据所述调整操作对所述截图框进行调整,在对所述截图框的调整操作结束后,定时器开始计时;若定时器计时到预设时间长度时未接收到对所述截图框的调整,定时器 置零,对所述截图框内的内容进行识别。As a specific implementation manner, when the screenshot frame is displayed in response to the failure of identifying touch recognition, the content in the screenshot frame is identified to obtain a recognition result. If the adjustment of the screenshot frame is received, the content of the adjusted screenshot frame is identified, and the recognition result is updated according to the identification of the adjusted screenshot frame. That is, as shown in FIG. 5, in this embodiment, after step S230, it may further include step S240: receiving adjustment to the size or position of the screenshot frame. Step S250: Perform screenshot identification on the adjusted screenshot frame. Optionally, in this embodiment, in the process of readjusting the size or position of the screenshot frame to update the recognition result, whether to start recognition can also be determined by the timer. For example, if an adjustment operation on the screenshot frame is received, after one adjustment operation ends, the timer starts counting from zero. If the timer again receives the adjustment operation of the screenshot frame before the preset time length expires, the timer is set to zero, the screenshot frame is adjusted according to the adjustment operation, and the adjustment operation of the screenshot frame ends. After that, the timer starts to count; if the timer does not receive adjustments to the screenshot frame when the timer reaches a preset length of time, the timer is set to zero to identify the content in the screenshot frame.
作为一种具体的实施方式,识别的开始与否可以由用户确定。具体的,可以在接收到识别开始的相关命令后,开始进行识别。如用户在通过截图框的调整后,确定截图框中的内容包括要识别的内容,发出识别开始的命令,移动终端开始进行识别。接收到该识别开始的相关命令可以是,接收到对目标类型的选择;设置有专门用于识别开始的触控按钮,接收到对该触控按钮的触控;或者接收到识别开始对应的手势操作,如在截图框内单击或双击等。As a specific implementation manner, whether the recognition is started or not can be determined by a user. Specifically, after receiving a command related to recognition start, recognition may be started. For example, after the user adjusts the screenshot frame, it is determined that the content in the screenshot frame includes the content to be identified, and a command to start the identification is issued, and the mobile terminal starts the identification. Receiving the relevant command for the start of recognition may be: receiving a selection of a target type; providing a touch button dedicated to recognize the start, receiving a touch to the touch button; or receiving a gesture corresponding to the start of recognition Operations, such as clicking or double-clicking in the screenshot box.
在本申请实施例中,对截图框中的识别可以包括对相应内容的分词以及对应搜索,识别结果可以包括分词结果,影视、图书、人物等的简介、链接,地点的地图,商品的购买通道,日程信息、快递信息等一种或多种,在本申请实施例中并不限制,可以是显示内容的任何解释信息。如图9示出了一种具体的识别结果显示方式,显示的识别结果可以包括对相应的内容的分词,用户可以从分词结果中选择词语后进行复制、全选、翻译或者搜索等。In the embodiment of the present application, the recognition in the screenshot box may include word segmentation and corresponding search for the corresponding content, and the recognition result may include the word segmentation result, the introduction, link, location map of the film and television, books, characters, etc., and the purchase channel of the product. One or more of schedule information, courier information, and the like are not limited in the embodiments of the present application, and may be any interpretation information of the displayed content. FIG. 9 shows a specific manner of displaying a recognition result. The displayed recognition result may include segmentation of corresponding content. The user may select a word from the segmentation result and then copy, select all, translate, or search.
可选的,在本申请实施例中,可以将所述识别结果通过卡片进行显示,如图9所示,卡片C显示识别结果。其中,卡片为显示信息的载体,可以是一个控件或者多个控件的组合。本申请实施例中卡片显示的信息可以是识别结果对应的信息。同一个卡片中,可以显示同一显示内容的不同识别结果,或者同一显示内容的不同识别结果可以显示于不同的卡片。Optionally, in the embodiment of the present application, the recognition result may be displayed by a card. As shown in FIG. 9, the card C displays the recognition result. The card is a carrier for displaying information, and may be a control or a combination of multiple controls. The information displayed by the card in the embodiment of the present application may be information corresponding to the recognition result. In the same card, different recognition results of the same display content can be displayed, or different recognition results of the same display content can be displayed on different cards.
可选的,在本申请实施例中,若对截图框中的内容识别不成功,可以显示识别失败的提示信息。Optionally, in the embodiment of the present application, if the content in the screenshot frame is unsuccessfully identified, a prompt message that the recognition fails may be displayed.
综上所述,本申请实施例提供的内容识别方法中,在接收到识别触控时,对用户界面进行内容识别。若识别失败,提供对用户界面进行框选的截图框。当接收到对截图框的大小或者位置等调整,对调整后的截图框中的内容进行识别,从而实现用户可以根据自身需求调整截图框的大小和位置,将想要识别的内容框选在截图框内进行识别,获得想要的识别结果。In summary, in the content recognition method provided in the embodiment of the present application, when a recognition touch is received, content recognition is performed on a user interface. If the identification fails, a screenshot frame is provided for selecting the user interface. When receiving adjustments to the size or position of the screenshot box, the content of the adjusted screenshot box is identified, so that users can adjust the size and position of the screenshot box according to their own needs, and select the content box that they want to identify in the screenshot Recognize in the frame to get the desired recognition result.
在本申请实施例中,上述的各种实施方式可以在符合逻辑的情况下任意结合,本申请实施例对各种结合方案不再进行一一赘述。In the embodiment of the present application, the above-mentioned various implementation manners can be arbitrarily combined under logical circumstances, and the embodiments of the present application will not go into details of the various combining schemes.
本申请实施例还提供了一种内容识别装置300,请参见图10,该装置300包括:第一识别模块310,用于接收到对用户界面的识别触控时,对所述用户界面进行内容识别。框选模块320,用于若对所述用户界面的内容识别失败,在用户界面显示可调整的截图框。第二识别模块330,用于对所述截图框内的内容进行识别。An embodiment of the present application further provides a content recognition device 300. Referring to FIG. 10, the device 300 includes: a first recognition module 310 configured to perform content processing on the user interface when receiving a recognition touch on the user interface. Identify. A frame selection module 320 is configured to display an adjustable screenshot frame on the user interface if the content identification of the user interface fails. The second identification module 330 is configured to identify content in the screenshot frame.
可选的,第一识别模块310可以包括控件确定单元,用于确定所述识别触控的触控位置对应的文本控件;识别单元,用于获取所述文本控件中的文本进行识别。Optionally, the first recognition module 310 may include a control determining unit for determining a text control corresponding to a touch position for recognizing a touch, and a recognition unit for acquiring text in the text control for recognition.
可选的,第一识别模块可以包括,位置确定单元,用于确定所述识别触控的触控位置;图像获取单元,用于在所述用户界面中,在所述触控位置的预设范围内进行截图;识 别单元,用于对所述截图获得的图片进行文本识别。Optionally, the first recognition module may include a position determination unit for determining a touch position for recognizing a touch, and an image acquisition unit for presetting the touch position in the user interface. Take a screenshot within the scope; a recognition unit, configured to perform text recognition on the picture obtained by the screenshot.
可选的,该装置还可以包括:类型确定模块,用于接收用户从一个或多个识别类型中选择的目标识别类型;第二识别模块用于对所述截图框中目标识别类型对应的内容进行识别。Optionally, the device may further include: a type determination module configured to receive a target recognition type selected by the user from one or more recognition types; a second recognition module configured to correspond to the content corresponding to the target recognition type in the screenshot frame For identification.
可选的,该装置还可以包括:调整模块,用于接收对所述截图框的大小或者位置的调整。第二识别模块还用于对调整后的所述截图框进行截图识别。Optionally, the device may further include: an adjustment module, configured to receive adjustment of the size or position of the screenshot frame. The second recognition module is further configured to perform screenshot recognition on the adjusted screenshot frame.
可选的,框选模块320还可以用于将所述截图框以预设大小显示在预设位置。或者用于,确定所述识别触控的触控位置所对应的控件;在所述用户界面显示将所述控件框选在内的可调整的截图框。Optionally, the frame selection module 320 may be further configured to display the screenshot frame at a preset position in a preset size. Alternatively, it is used to determine a control corresponding to the touch position for identifying touch; and an adjustable screenshot frame including the control frame is displayed on the user interface.
综上所述,本申请实施例中,当用户在聊天或者在浏览器中查阅文本时,在识别用户所选择的区域的文本失败以后,显示手动截图框,以便用户手动选择需要截图的区域,再对所截取的图片进行识别,其中,对于截图的图片进行识别时,用户可以选择是二维码识别,还是文本识别,还是物品识别。其中,可以对解析出来的结果做初步筛选,对乱码和字符进行过滤,过滤后如果无有效文本,则走未识别到任何内容的流程,如果过滤后有有效文本,则传递给文本识别。In summary, in the embodiment of the present application, when the user is chatting or viewing text in the browser, after the text in the area selected by the user fails to be recognized, a manual screenshot box is displayed so that the user can manually select the area that needs to be captured. Then, the captured picture is identified. When the screenshot picture is identified, the user can select whether it is a two-dimensional code recognition, a text recognition, or an article recognition. Among them, preliminary analysis can be performed on the parsed results, and garbled characters and characters are filtered. If there is no valid text after filtering, a process that does not recognize any content is taken, and if there is valid text after filtering, it is passed to text recognition.
请再次参阅图11,基于上述的内容识别方法及装置,本申请实施例还提供一种移动终端400。如图11所示,该移动终端400包括显示屏120、存储器104及处理器102,所述显示屏120及所述存储器104耦接到所述处理器102,所述显示屏120用于显示内容、解析识别结果等,所述存储器104存储指令,当所述指令由所述处理器102执行时所述处理器102执行本申请实施例提供的方法。Please refer to FIG. 11 again. Based on the foregoing content identification method and device, an embodiment of the present application further provides a mobile terminal 400. As shown in FIG. 11, the mobile terminal 400 includes a display screen 120, a memory 104, and a processor 102. The display screen 120 and the memory 104 are coupled to the processor 102. The display screen 120 is used to display content. , Parsing recognition results, etc., the memory 104 stores instructions, and when the instructions are executed by the processor 102, the processor 102 executes the method provided in the embodiment of the present application.
具体的,如图12所示,该移动终端400可以包括电子本体部10,所述电子本体部10包括壳体12及设置在所述壳体12上的显示屏120。所述壳体12可采用金属、如钢材、铝合金制成。本实施例中,所述显示屏120通常包括显示面板111,也可包括用于响应对所述显示面板111进行触控操作的电路等。所述显示面板111可以为一个液晶显示面板(Liquid Crystal Display,LCD),在一些实施例中,所述显示面板111同时为一个触摸屏109。Specifically, as shown in FIG. 12, the mobile terminal 400 may include an electronic body portion 10, and the electronic body portion 10 includes a casing 12 and a display screen 120 disposed on the casing 12. The casing 12 can be made of metal, such as steel and aluminum alloy. In this embodiment, the display screen 120 generally includes a display panel 111, and may also include a circuit for responding to a touch operation on the display panel 111, and the like. The display panel 111 may be a liquid crystal display (Liquid Crystal Display, LCD). In some embodiments, the display panel 111 is a touch screen 109 at the same time.
请同时参阅图13,在实际的应用场景中,所述移动终端400可作为智能手机终端进行使用,在这种情况下所述电子本体部10通常还包括一个或多个(图中仅示出一个)处理器102、存储器104、RF(Radio Frequency,射频)模块106、音频电路110、传感器114、输入模块118、电源模块122。本领域普通技术人员可以理解,图13所示的结构仅为示意,其并不对所述电子本体部10的结构造成限定。例如,所述电子本体部10还可包括比图13中所示更多或者更少的组件,或者具有与图13所示不同的对应。Please refer to FIG. 13 at the same time. In an actual application scenario, the mobile terminal 400 can be used as a smart phone terminal. In this case, the electronic body portion 10 usually further includes one or more (only shown in the figure) (A) The processor 102, the memory 104, an RF (Radio Frequency) module 106, an audio circuit 110, a sensor 114, an input module 118, and a power module 122. A person of ordinary skill in the art can understand that the structure shown in FIG. 13 is only for illustration, and it does not limit the structure of the electronic body portion 10. For example, the electronic body portion 10 may further include more or fewer components than those shown in FIG. 13, or have a different correspondence from that shown in FIG. 13.
本领域普通技术人员可以理解,相对于所述处理器102来说,所有其他的组件均属于外设,所述处理器102与这些外设之间通过多个外设接口124相耦合。所述外设接口124可基于以下标准实现:通用异步接收/发送装置(Universal Asynchronous  Receiver/Transmitter,UART)、通用输入/输出(General Purpose Input Output,GPIO)、串行外设接口(Serial Peripheral Interface,SPI)、内部集成电路(Inter-Integrated Circuit,I2C),但不并限于上述标准。在一些实例中,所述外设接口124可仅包括总线;在另一些实例中,所述外设接口124还可包括其他元件,如一个或者多个控制器,例如用于连接所述显示面板111的显示控制器或者用于连接存储器的存储控制器。此外,这些控制器还可以从所述外设接口124中脱离出来,而集成于所述处理器102内或者相应的外设内。Those of ordinary skill in the art can understand that, with respect to the processor 102, all other components are peripherals, and the processor 102 and these peripherals are coupled through multiple peripheral interfaces 124. The peripheral interface 124 may be implemented based on the following standards: Universal Asynchronous Receiver / Transmitter (UART), General Input / Output (GPIO), Serial Peripheral Interface , SPI), Inter-Integrated Circuit (I2C), but not limited to the above standards. In some examples, the peripheral interface 124 may only include a bus; in other examples, the peripheral interface 124 may further include other elements, such as one or more controllers, for example, for connecting the display panel. A display controller of 111 or a memory controller for connecting a memory. In addition, these controllers can also be separated from the peripheral interface 124 and integrated into the processor 102 or a corresponding peripheral.
所述存储器104可用于存储软件程序以及模块,所述处理器102通过运行存储在所述存储器104内的软件程序以及模块,从而执行各种功能应用以及数据处理。所述存储器104可包括高速随机存储器,还可包括非易失性存储器,如一个或者多个磁性存储装置、闪存、或者其他非易失性固态存储器。在一些实例中,所述存储器104可进一步包括相对于所述处理器102远程设置的存储器,这些远程存储器可以通过网络连接至所述电子本体部10或所述显示屏120。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The memory 104 may be used to store software programs and modules, and the processor 102 executes various functional applications and data processing by running the software programs and modules stored in the memory 104. The memory 104 may include a high-speed random access memory, and may further include a non-volatile memory, such as one or more magnetic storage devices, a flash memory, or other non-volatile solid-state memory. In some examples, the memory 104 may further include memories remotely disposed with respect to the processor 102, and these remote memories may be connected to the electronic body portion 10 or the display screen 120 through a network. Examples of the above network include, but are not limited to, the Internet, an intranet, a local area network, a mobile communication network, and combinations thereof.
所述RF模块106用于接收以及发送电磁波,实现电磁波与电信号的相互转换,从而与通讯网络或者其他设备进行通讯。所述RF模块106可包括各种现有的用于执行这些功能的电路元件,例如,天线、射频收发器、数字信号处理器、加密/解密芯片、用户身份模块(SIM)卡、存储器等等。所述RF模块106可与各种网络如互联网、企业内部网、无线网络进行通讯或者通过无线网络与其他设备进行通讯。上述的无线网络可包括蜂窝式电话网、无线局域网或者城域网。上述的无线网络可以使用各种通信标准、协议及技术,包括但并不限于全球移动通信系统(Global System for Mobile Communication,GSM)、增强型移动通信技术(Enhanced Data GSM Environment,EDGE),宽带码分多址技术(wideband code division multiple access,W-CDMA),码分多址技术(Code division access,CDMA)、时分多址技术(time division multiple access,TDMA),无线保真技术(Wireless,Fidelity,WiFi)(如美国电气和电子工程师协会标准IEEE 802.10A,IEEE 802.11b,IEEE802.11g和/或IEEE 802.11n)、网络电话(Voice over internet protocal,VoIP)、全球微波互联接入(Worldwide Interoperability for Microwave Access,Wi-Max)、其他用于邮件、即时通讯及短消息的协议,以及任何其他合适的通讯协议,甚至可包括那些当前仍未被开发出来的协议。The RF module 106 is used to receive and send electromagnetic waves, to realize mutual conversion between electromagnetic waves and electrical signals, and to communicate with a communication network or other equipment. The RF module 106 may include various existing circuit elements for performing these functions, such as an antenna, a radio frequency transceiver, a digital signal processor, an encryption / decryption chip, a subscriber identity module (SIM) card, a memory, and the like . The RF module 106 can communicate with various networks such as the Internet, an intranet, and a wireless network, or communicate with other devices through a wireless network. The wireless network may include a cellular telephone network, a wireless local area network, or a metropolitan area network. The above wireless network can use various communication standards, protocols and technologies, including but not limited to Global System for Mobile Communication (GSM), Enhanced Data Communication Technology (GSM) Environment, EDGE, Broadband Code Division multiple access technology (wideband code division multiple access, W-CDMA), code division multiple access technology (Code division access, CDMA), time division multiple access technology (time division multiple access, TDMA), wireless fidelity technology (Wireless, Fidelity , WiFi) (such as the American Institute of Electrical and Electronics Engineers standards IEEE 802.10A, IEEE 802.11b, IEEE802.11g, and / or IEEE 802.11n), Voice over Internet (Internet Protocol, VoIP), Global Microwave Interoperability (Worldwide Interoperability for Microwave Access (Wi-Max), other protocols for mail, instant messaging, and short messaging, and any other suitable communication protocol, even those that have not yet been developed.
音频电路110、扬声器101、传声器103、麦克风105共同提供用户与所述电子本体部10或所述显示屏120之间的音频接口。The audio circuit 110, the speaker 101, the microphone 103, and the microphone 105 collectively provide an audio interface between the user and the electronic body portion 10 or the display screen 120.
所述传感器114设置在所述电子本体部10内或所述显示屏120内,所述传感器114的实例包括但并不限于:加速度传感器114F、陀螺仪114G、磁力计114H以及其他传感器。The sensor 114 is disposed in the electronic body portion 10 or the display screen 120. Examples of the sensor 114 include, but are not limited to, an acceleration sensor 114F, a gyroscope 114G, a magnetometer 114H, and other sensors.
本实施例中,所述输入模块118可包括设置在所述显示屏120上的所述触摸屏109, 所述触摸屏109可收集用户在其上或附近的触摸操作(比如用户使用手指、触笔等任何适合的物体或附件在所述触摸屏109上或在所述触摸屏109附近的操作),从而可以获得用户的触摸手势,并根据预先设定的程序驱动相应的连接装置,因此,用户可以通过在显示屏的触控操作选定目标区域。可选的,所述触摸屏109可包括触摸检测装置和触摸控制器。其中,所述触摸检测装置检测用户的触摸方位,并检测触摸操作带来的信号,将信号传送给所述触摸控制器;所述触摸控制器从所述触摸检测装置上接收触摸信息,并将该触摸信息转换成触点坐标,再送给所述处理器102,并能接收所述处理器102发来的命令并加以执行。此外,可以采用电阻式、电容式、红外线以及表面声波等多种类型实现所述触摸屏109的触摸检测功能。除了所述触摸屏109,在其它变更实施方式中,所述输入模块118还可以包括其他输入设备,如按键107。所述按键107例如可包括用于输入字符的字符按键,以及用于触发控制功能的控制按键。所述控制按键的实例包括“返回主屏”按键、开机/关机按键等等。In this embodiment, the input module 118 may include the touch screen 109 provided on the display screen 120, and the touch screen 109 may collect a touch operation performed by the user on or near the touch screen (such as a user using a finger, a stylus, etc.) Any suitable object or accessory is operated on or near the touch screen 109), so that the user ’s touch gesture can be obtained, and the corresponding connection device is driven according to a preset program, so the user can The touch operation of the display selects the target area. Optionally, the touch screen 109 may include a touch detection device and a touch controller. The touch detection device detects a user's touch position, and detects a signal caused by a touch operation, and transmits the signal to the touch controller. The touch controller receives touch information from the touch detection device, and The touch information is converted into touch point coordinates, and then sent to the processor 102, and can receive and execute commands sent by the processor 102. In addition, various types such as resistive, capacitive, infrared, and surface acoustic wave can be used to implement the touch detection function of the touch screen 109. In addition to the touch screen 109, in other modified embodiments, the input module 118 may further include other input devices, such as keys 107. The keys 107 may include, for example, character keys for inputting characters, and control keys for triggering control functions. Examples of the control buttons include a "return to the home screen" button, an on / off button, and the like.
所述显示屏120用于显示由用户输入的信息、提供给用户的信息以及所述电子本体部10的各种图形用户接口,这些图形用户接口可以由图形、文本、图标、数字、视频和其任意组合来构成。在一个实例中,所述触摸屏109可设置于所述显示面板111上从而与所述显示面板111构成一个整体。The display screen 120 is used to display information input by the user, information provided to the user, and various graphical user interfaces of the electronic body portion 10, and these graphical user interfaces may include graphics, text, icons, numbers, videos, and other Composed of any combination. In one example, the touch screen 109 may be disposed on the display panel 111 so as to form a whole with the display panel 111.
所述电源模块122用于向所述处理器102以及其他各组件提供电力供应。具体地,所述电源模块122可包括电源管理系统、一个或多个电源(如电池或者交流电)、充电电路、电源失效检测电路、逆变器、电源状态指示灯以及其他任意与所述电子本体部10或所述显示屏120内电力的生成、管理及分布相关的组件。The power module 122 is configured to provide power to the processor 102 and other components. Specifically, the power module 122 may include a power management system, one or more power sources (such as a battery or AC power), a charging circuit, a power failure detection circuit, an inverter, a power status indicator, and any other electronic components Components related to the generation, management and distribution of power in the display 10 or the display 120.
所述移动终端400还包括定位器119,所述定位器119用于确定所述移动终端400所处的实际位置。本实施例中,所述定位器119采用定位服务来实现所述移动终端400的定位,所述定位服务,应当理解为通过特定的定位技术来获取所述移动终端400的位置信息(如经纬度坐标),在电子地图上标出被定位对象的位置的技术或服务。The mobile terminal 400 further includes a locator 119, which is configured to determine an actual location where the mobile terminal 400 is located. In this embodiment, the locator 119 uses a positioning service to implement positioning of the mobile terminal 400. The positioning service should be understood as obtaining position information (such as latitude and longitude coordinates) of the mobile terminal 400 through a specific positioning technology. ), A technology or service for marking the location of an object on an electronic map.
应当理解的是,上述的移动终端400并不局限于智能手机终端,其应当指可以在移动中使用的计算机设备。具体而言,移动终端400,是指搭载了智能操作系统的移动计算机设备,移动终端400包括但不限于智能手机、智能手表、平板电脑,等等。It should be understood that the above-mentioned mobile terminal 400 is not limited to a smart phone terminal, and it should refer to a computer device that can be used in mobile. Specifically, the mobile terminal 400 refers to a mobile computer device equipped with a smart operating system. The mobile terminal 400 includes, but is not limited to, a smart phone, a smart watch, a tablet computer, and the like.
需要说明的是,本说明书中的各个实施例均采用递进的方式描述,每个实施例重点说明的都是与其他实施例的不同之处,各个实施例之间相同相似的部分互相参见即可。对于装置类实施例而言,由于其与方法实施例基本相似,所以描述的比较简单,相关之处参见方法实施例的部分说明即可。对于方法实施例中的所描述的任意的处理方式,在装置实施例中均可以通过相应的处理模块实现,装置实施例中不再一一赘述。It should be noted that each embodiment in this specification is described in a progressive manner. Each embodiment focuses on the differences from other embodiments. For the same and similar parts between the embodiments, refer to each other. can. As for the device embodiment, since it is basically similar to the method embodiment, the description is relatively simple. For the relevant part, refer to the description of the method embodiment. Any of the processing methods described in the method embodiments can be implemented by corresponding processing modules in the device embodiments, and will not be described in detail in the device embodiments.
在本说明书的描述中,参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本申请的至少一个实施例或示例中。在本说明书中,对上述术语的示意性表述不必须 针对的是相同的实施例或示例。而且,描述的具体特征、结构、材料或者特点可以在任一个或多个实施例或示例中以合适的方式结合。此外,在不相互矛盾的情况下,本领域的技术人员可以将本说明书中描述的不同实施例或示例以及不同实施例或示例的特征进行结合和组合。In the description of this specification, the description with reference to the terms “one embodiment”, “some embodiments”, “examples”, “specific examples”, or “some examples” and the like means specific features described in conjunction with the embodiments or examples , Structure, materials, or features are included in at least one embodiment or example of the present application. In this specification, the schematic expressions of the above terms are not necessarily directed to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. In addition, without any contradiction, those skilled in the art may combine and combine different embodiments or examples and features of the different embodiments or examples described in this specification.
此外,术语“第一”、“第二”仅用于描述目的,而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二”的特征可以明示或者隐含地包括至少一个该特征。在本申请的描述中,“多个”的含义是至少两个,例如两个,三个等,除非另有明确具体的限定。In addition, the terms "first" and "second" are used for descriptive purposes only and cannot be understood as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Therefore, the features defined as "first" and "second" may explicitly or implicitly include at least one of the features. In the description of the present application, the meaning of "plurality" is at least two, for example, two, three, etc., unless it is specifically and specifically defined otherwise.
流程图中或在此以其他方式描述的任何过程或方法描述可以被理解为,表示包括一个或更多个用于实现特定逻辑功能或过程的步骤的可执行指令的代码的模块、片段或部分,并且本申请的优选实施方式的范围包括另外的实现,其中可以不按所示出或讨论的顺序,包括根据所涉及的功能按基本同时的方式或按相反的顺序,来执行功能,这应被本申请的实施例所属技术领域的技术人员所理解。Any process or method description in a flowchart or otherwise described herein can be understood as representing a module, fragment, or portion of code that includes one or more executable instructions for implementing a particular logical function or step of a process And, the scope of the preferred embodiments of the present application includes additional implementations, in which the functions may be performed out of the order shown or discussed, including performing functions in a substantially simultaneous manner or in the reverse order according to the functions involved, which should It is understood by those skilled in the art to which the embodiments of the present application pertain.
在流程图中表示或在此以其他方式描述的逻辑和/或步骤,例如,可以被认为是用于实现逻辑功能的可执行指令的定序列表,可以具体实现在任何计算机可读介质中,以供指令执行系统、装置或设备(如基于计算机的系统、包括处理器的系统或其他可以从指令执行系统、装置或设备取指令并执行指令的系统)使用,或结合这些指令执行系统、装置或设备而使用。就本说明书而言,"计算机可读介质"可以是任何可以包含、存储、通信、传播或传输程序以供指令执行系统、装置或设备或结合这些指令执行系统、装置或设备而使用的装置。计算机可读介质的更具体的示例(非穷尽性列表)包括以下:具有一个或多个布线的电连接部(移动终端),便携式计算机盘盒(磁装置),随机存取存储器(RAM),只读存储器(ROM),可擦除可编辑只读存储器(EPROM或闪速存储器),光纤装置,以及便携式光盘只读存储器(CDROM)。另外,计算机可读介质甚至可以是可在其上打印所述程序的纸或其他合适的介质,因为可以例如通过对纸或其他介质进行光学扫描,接着进行编辑、解译或必要时以其他合适方式进行处理来以电子方式获得所述程序,然后将其存储在计算机存储器中。The logic and / or steps represented in the flowchart or otherwise described herein, for example, a sequenced list of executable instructions that can be considered to implement a logical function, can be embodied in any computer-readable medium, For the instruction execution system, device, or device (such as a computer-based system, a system including a processor, or other system that can fetch and execute instructions from the instruction execution system, device, or device), or combine these instruction execution systems, devices, or devices Or equipment. For the purposes of this specification, a "computer-readable medium" may be any device that can contain, store, communicate, propagate, or transmit a program for use by or in connection with an instruction execution system, apparatus, or device. More specific examples (non-exhaustive list) of computer-readable media include the following: electrical connections (mobile terminals) with one or more wirings, portable computer disk enclosures (magnetic devices), random access memory (RAM), Read-only memory (ROM), erasable and editable read-only memory (EPROM or flash memory), fiber optic devices, and portable optical disk read-only memory (CDROM). In addition, the computer-readable medium may even be paper or other suitable medium on which the program can be printed, because, for example, by optically scanning the paper or other medium, followed by editing, interpretation, or other suitable Processing to obtain the program electronically and then store it in computer memory.
应当理解,本申请的各部分可以用硬件、软件、固件或它们的组合来实现。在上述实施方式中,多个步骤或方法可以用存储在存储器中且由合适的指令执行系统执行的软件或固件来实现。例如,如果用硬件来实现,和在另一实施方式中一样,可用本领域公知的下列技术中的任一项或他们的组合来实现:具有用于对数据信号实现逻辑功能的逻辑门电路的离散逻辑电路,具有合适的组合逻辑门电路的专用集成电路,可编程门阵列(PGA),现场可编程门阵列(FPGA)等。It should be understood that each part of the application may be implemented by hardware, software, firmware, or a combination thereof. In the above embodiments, multiple steps or methods may be implemented by software or firmware stored in a memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, it may be implemented using any one or a combination of the following techniques known in the art: Discrete logic circuits, application specific integrated circuits with suitable combinational logic gate circuits, programmable gate arrays (PGA), field programmable gate arrays (FPGA), etc.
本技术领域的普通技术人员可以理解实现上述实施例方法携带的全部或部分步骤是可以通过程序来指令相关的硬件完成,所述的程序可以存储于一种计算机可读存储介质中,该程序在执行时,包括方法实施例的步骤之一或其组合。此外,在本申请各个实施例中的各功能单元可以集成在一个处理模块中,也可以是各个单元单独物理存在,也可以两个或 两个以上单元集成在一个模块中。上述集成的模块既可以采用硬件的形式实现,也可以采用软件功能模块的形式实现。所述集成的模块如果以软件功能模块的形式实现并作为独立的产品销售或使用时,也可以存储在一个计算机可读取存储介质中。A person of ordinary skill in the art can understand that all or part of the steps carried by the methods in the foregoing embodiments may be implemented by a program instructing related hardware. The program may be stored in a computer-readable storage medium. The program is When executed, one or a combination of the steps of the method embodiment is included. In addition, each functional unit in each embodiment of the present application may be integrated into one processing module, or each unit may exist separately physically, or two or more units may be integrated into one module. The above integrated modules can be implemented in the form of hardware or software functional modules. If the integrated module is implemented in the form of a software functional module and sold or used as an independent product, it may also be stored in a computer-readable storage medium.
上述提到的存储介质可以是只读存储器,磁盘或光盘等。尽管上面已经示出和描述了本申请的实施例,可以理解的是,上述实施例是示例性的,不能理解为对本申请的限制,本领域的普通技术人员在本申请的范围内可以对上述实施例进行变化、修改、替换和变型。The aforementioned storage medium may be a read-only memory, a magnetic disk, or an optical disk. Although the embodiments of the present application have been shown and described above, it can be understood that the above embodiments are exemplary and should not be construed as limitations on the present application. Those skilled in the art can interpret the above within the scope of the present application. Embodiments are subject to change, modification, substitution, and modification.
最后应说明的是:以上实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不驱使相应技术方案的本质脱离本申请各实施例技术方案的精神和范围。Finally, it should be noted that the above embodiments are only used to illustrate the technical solution of the present application, but not limited thereto. Although the present application has been described in detail with reference to the foregoing embodiments, those skilled in the art should understand that they can still Modifications to the technical solutions described in the foregoing embodiments, or equivalent replacements of some of the technical features thereof; and these modifications or replacements do not drive the essence of the corresponding technical solutions from the spirit and scope of the technical solutions of the embodiments of the present application.

Claims (20)

  1. 一种内容识别方法,其特征在于,所述方法包括:A content identification method, characterized in that the method includes:
    接收到对用户界面的识别触控时,对所述用户界面进行内容识别;When a recognition touch to the user interface is received, performing content recognition on the user interface;
    若对所述用户界面的内容识别失败,在所述用户界面显示可调整的截图框;If the content identification of the user interface fails, an adjustable screenshot frame is displayed on the user interface;
    对所述截图框内的内容进行识别。Identify the content in the screenshot box.
  2. 根据权利要求1所述的方法,其特征在于,所述识别触控对应的触控操作的滑动轨迹为封闭图形。The method according to claim 1, wherein the sliding track of the touch operation corresponding to the recognition touch is a closed graphic.
  3. 根据权利要求1或2所述的方法,其特征在于,所述对所述用户界面进行内容识别,包括:The method according to claim 1 or 2, wherein the performing content identification on the user interface comprises:
    确定所述识别触控的触控位置对应的文本控件;Determining a text control corresponding to the touch position of the recognition touch;
    获取所述文本控件中的文本进行识别。Acquire text in the text control for recognition.
  4. 根据权利要求3所述的方法,其特征在于,所述确定所述识别触控的触控位置对应的文本控件,包括:The method according to claim 3, wherein the determining a text control corresponding to a touch position for recognizing a touch comprises:
    若所述触控位置在文本控件上,以所述触控位置所在的文本控件作为所述触控位置对应的文本控件;If the touch position is on a text control, using the text control where the touch position is as a text control corresponding to the touch position;
    若所述触控位置不在文本控件上,以距离该触控位置最近的文本控件作为所述触控位置对应的文本控件。If the touch position is not on the text control, the text control closest to the touch position is used as the text control corresponding to the touch position.
  5. 根据权利要求1-4任一项所述的方法,其特征在于,所述对所述用户界面进行内容识别,包括:The method according to any one of claims 1-4, wherein the performing content identification on the user interface comprises:
    确定所述识别触控的触控位置;Determining a touch position of the recognition touch;
    在所述用户界面中,在所述触控位置的预设范围内进行截图;Taking a screenshot in a preset range of the touch position in the user interface;
    对所述截图获得的图片进行文本识别。Text recognition is performed on the picture obtained by the screenshot.
  6. 根据权利要求1-5任一项所述的方法,其特征在于,所述在所述用户界面显示可调整的截图框,包括:The method according to any one of claims 1-5, wherein the displaying an adjustable screenshot frame on the user interface comprises:
    将所述用户界面缩小显示,在该缩小的用户界面中显示所述截图框;Reducing the display of the user interface, and displaying the screenshot frame in the reduced user interface;
    在缩小的用户界面以外的位置,显示辅助内容,所述辅助内容包括一个或多个识别类型。Outside the reduced user interface, auxiliary content is displayed, the auxiliary content including one or more recognition types.
  7. 根据权利要求1-5任一项所述的方法,其特征在于,所述方法还包括:The method according to any one of claims 1-5, further comprising:
    接收用户从一个或多个识别类型中选择的目标识别类型;Receiving a target recognition type selected by the user from one or more recognition types;
    对所述截图框中目标识别类型对应的内容进行识别。The content corresponding to the target recognition type in the screenshot box is identified.
  8. 根据权利要求7所述的方法,其特征在于,所述识别类型包括:The method according to claim 7, wherein the identification type comprises:
    二维码、商品或文本。QR code, product or text.
  9. 根据权利要求7或8所述的方法,其特征在于,对所述截图框中目标识别类型对应的内容进行识别,包括:The method according to claim 7 or 8, wherein identifying the content corresponding to the target recognition type in the screenshot frame comprises:
    若所述目标识别类型为文本,将所述截图框中的内容发送到专门用于文本识别的服务器,由该服务器对截图框中的文本进行识别;If the target recognition type is text, sending the content in the screenshot box to a server dedicated to text recognition, and the server recognizes the text in the screenshot box;
    若所述目标识别类型为图片,将所述截图框中的内容发送到专门用于图片识别的服务器,由该服务器对截图框中的图片进行识别;If the target recognition type is a picture, sending the content in the screenshot box to a server dedicated to picture recognition, and the server recognizes the picture in the screenshot box;
    若所述目标识别类型为商品,将截图框中的内容传递给第三方购物平台进行识别,以从第三方购物平台获得商品的信息及购买链接。If the target identification type is a product, the content in the screenshot box is passed to a third-party shopping platform for identification, so as to obtain product information and a purchase link from the third-party shopping platform.
  10. 根据权利要求1-9任一项所述的方法,其特征在于,预先设置有定时器,所述对所述截图框内的内容进行识别,包括:The method according to any one of claims 1-9, wherein a timer is set in advance, and the identifying the content in the screenshot frame comprises:
    当截图框开始显示时,定时器开始计时;When the screenshot box starts to display, the timer starts counting;
    若定时器计时到预设时间长度之前接收到对所述截图框的调整操作,定时器置零,根据所述调整操作对所述截图框进行调整,If an adjustment operation on the screenshot frame is received before the timer reaches the preset time length, the timer is set to zero, and the screenshot frame is adjusted according to the adjustment operation.
    在对所述截图框的调整操作结束后,定时器开始计时;After the operation of adjusting the screenshot frame ends, the timer starts counting;
    若定时器计时到预设时间长度时未接收到对所述截图框的调整,定时器置零,对所述截图框内的内容进行识别。If the timer does not receive the adjustment of the screenshot frame when the timer reaches the preset time length, the timer is set to zero to identify the content in the screenshot frame.
  11. 根据权利要求1-10任一项所述的方法,其特征在于,所述对所述截图框内的内容进行识别之后,还包括:The method according to any one of claims 1 to 10, wherein after identifying the content in the screenshot frame, further comprising:
    接收对所述截图框的大小或者位置的调整;Receiving adjustments to the size or position of the screenshot frame;
    对调整后的所述截图框进行截图识别。Perform screenshot recognition on the adjusted screenshot frame.
  12. 根据权利要求1-11任一项所述的方法,其特征在于,所述在所述用户界面显示可调整的截图框包括:The method according to any one of claims 1-11, wherein the displaying an adjustable screenshot frame on the user interface comprises:
    将所述截图框以预设大小以及预设形状显示在预设位置。Displaying the screenshot frame at a preset position in a preset size and a preset shape.
  13. 根据权利要求1-11任一项所述的方法,其特征在于,所述在所述用户界面显示可调整的截图框包括:The method according to any one of claims 1-11, wherein the displaying an adjustable screenshot frame on the user interface comprises:
    确定所述识别触控的触控位置所对应的控件;Determining a control corresponding to the touch position of the recognition touch;
    在所述用户界面显示将所述控件框选在内的可调整的截图框。An adjustable screenshot frame selected from the control frame is displayed on the user interface.
  14. 根据权利要求13所述的方法,其特征在于,所述在所述用户界面显示将所述控件框选在内的可调整的截图框,包括:The method according to claim 13, wherein the displaying on the user interface an adjustable screenshot frame including the control frame selection comprises:
    显示将触控位置所在的控件框选在内的最小的截图框。Shows the smallest screenshot box with the control box where the touch position is located.
  15. 根据权利要求14所述的方法,其特征在于,显示将触控位置对应的触控区域框选在内的最小截图框中,截图框的形状与触控位置所在控件的形状轮廓一致。The method according to claim 14, wherein a smallest screenshot frame including a touch area frame corresponding to the touch position is displayed, and a shape of the screenshot frame is consistent with a shape contour of a control where the touch position is located.
  16. 根据权利要求12-14任一项所述的方法,其特征在于,所述在所述用户界面显示可调整的截图框之后,还包括The method according to any one of claims 12 to 14, wherein after displaying an adjustable screenshot frame in the user interface, the method further comprises
    接收对所述截图框的形状进行改变的改变请求;Receiving a change request for changing the shape of the screenshot frame;
    将所述截图框的形状改变为所述改变请求指定的形状。Changing the shape of the screenshot frame to a shape specified by the change request.
  17. 根据权利要求1-16任一项所述的方法,其特征在于,所述对所述截图框内的内容进行识别,包括:The method according to any one of claims 1-16, wherein the identifying the content in the screenshot frame comprises:
    对截图框进行分析获得其中的文本内容;Analyze the screenshot box to obtain the text content;
    滤除所述文本内容中的乱码后,获得有效文本,所述乱码为预设类型文本以外的文本,所述预设类型的文本为中文字符、英文字符以及选定的常用标点;After filtering garbled characters in the text content, valid text is obtained, the garbled text is text other than a preset type of text, and the preset type of text is Chinese characters, English characters, and selected common punctuation;
    对所述有效文本进行解析识别。Analyze and identify the valid text.
  18. 一种内容识别装置,其特征在于,所述装置包括:A content recognition device, characterized in that the device includes:
    第一识别模块,用于接收到对用户界面的识别触控时,对所述用户界面进行内容识别;A first recognition module, configured to perform content recognition on the user interface when receiving a recognition touch on the user interface;
    框选模块,用于若对所述用户界面的内容识别失败,在所述用户界面显示可调整的截图框;A frame selection module, configured to display an adjustable screenshot frame on the user interface if the content identification of the user interface fails;
    第二识别模块,用于对所述截图框内的内容进行识别。The second identification module is configured to identify content in the screenshot frame.
  19. 一种移动终端,其特征在于,包括显示屏、存储器及处理器,所述显示屏及所述存储器耦接到所述处理器,所述存储器存储指令,当所述指令由所述处理器执行时所述处理器执行权利要求1至17任一项所述的方法。A mobile terminal is characterized by comprising a display screen, a memory and a processor, the display screen and the memory are coupled to the processor, the memory stores instructions, and when the instructions are executed by the processor When the processor executes the method according to any one of claims 1 to 17.
  20. 一种具有处理器可执行的程序代码的计算机可读存储介质,其特征在于,所述程序代码使所述处理器执行权利要求1至17任一项所述的方法。A computer-readable storage medium having a processor-executable program code, wherein the program code causes the processor to perform the method according to any one of claims 1 to 17.
PCT/CN2019/088874 2018-06-08 2019-05-28 Content identification method and device, and mobile terminal WO2019233318A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810588338.1A CN108958576B (en) 2018-06-08 2018-06-08 Content identification method and device and mobile terminal
CN201810588338.1 2018-06-08

Publications (1)

Publication Number Publication Date
WO2019233318A1 true WO2019233318A1 (en) 2019-12-12

Family

ID=64494007

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/088874 WO2019233318A1 (en) 2018-06-08 2019-05-28 Content identification method and device, and mobile terminal

Country Status (2)

Country Link
CN (1) CN108958576B (en)
WO (1) WO2019233318A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108958576B (en) * 2018-06-08 2021-02-02 Oppo广东移动通信有限公司 Content identification method and device and mobile terminal
CN109933275A (en) * 2019-02-12 2019-06-25 努比亚技术有限公司 A kind of knowledge screen method, terminal and computer readable storage medium
CN110647640B (en) * 2019-09-30 2023-01-10 京东方科技集团股份有限公司 Computer system, method for operating a computing device and system for operating a computing device
CN111310482A (en) * 2020-01-20 2020-06-19 北京无限光场科技有限公司 Real-time translation method, device, terminal and storage medium
CN112596656A (en) * 2020-12-28 2021-04-02 北京小米移动软件有限公司 Content identification method, device and storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107797750A (en) * 2017-10-27 2018-03-13 珠海市魅族科技有限公司 A kind of screen content identifying processing method, apparatus, terminal and medium
CN108958576A (en) * 2018-06-08 2018-12-07 Oppo广东移动通信有限公司 content identification method, device and mobile terminal

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6320982B2 (en) * 2014-11-26 2018-05-09 ネイバー コーポレーションNAVER Corporation Translated sentence editor providing apparatus and translated sentence editor providing method
CN105005551A (en) * 2015-06-29 2015-10-28 东南(福建)汽车工业有限公司 Method for implementing rapid acquisition of picture characters in document revision
CN106020694B (en) * 2016-05-24 2023-01-31 北京京东尚科信息技术有限公司 Electronic equipment, and method and device for dynamically adjusting selected area
CN106325688B (en) * 2016-08-17 2020-01-14 北京字节跳动网络技术有限公司 Text processing method and device
CN111381751A (en) * 2016-10-18 2020-07-07 北京字节跳动网络技术有限公司 Text processing method and device
CN107358226A (en) * 2017-06-23 2017-11-17 联想(北京)有限公司 The recognition methods of electronic equipment and electronic equipment
CN107632773A (en) * 2017-10-17 2018-01-26 北京百度网讯科技有限公司 For obtaining the method and device of information

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107797750A (en) * 2017-10-27 2018-03-13 珠海市魅族科技有限公司 A kind of screen content identifying processing method, apparatus, terminal and medium
CN108958576A (en) * 2018-06-08 2018-12-07 Oppo广东移动通信有限公司 content identification method, device and mobile terminal

Also Published As

Publication number Publication date
CN108958576A (en) 2018-12-07
CN108958576B (en) 2021-02-02

Similar Documents

Publication Publication Date Title
WO2019233318A1 (en) Content identification method and device, and mobile terminal
WO2019233212A1 (en) Text identification method and device, mobile terminal, and storage medium
US20200341600A1 (en) Method for Page Displaying and Terminals
JP6167245B2 (en) COMMUNICATION MESSAGE IDENTIFICATION METHOD, COMMUNICATION MESSAGE IDENTIFICATION DEVICE, PROGRAM, AND RECORDING MEDIUM
EP3142107A1 (en) Voice recognition apparatus and controlling method thereof
US20190056904A1 (en) Method, apparatus, and mobile terminal for screen mirroring
US20140062962A1 (en) Text recognition apparatus and method for a terminal
US9560188B2 (en) Electronic device and method for displaying phone call content
CN109101498B (en) Translation method and device and mobile terminal
WO2019233316A1 (en) Data processing method and device, mobile terminal, and storage medium
CN109085982B (en) Content identification method and device and mobile terminal
CN104123093A (en) Information processing method and device
US9921735B2 (en) Apparatuses and methods for inputting a uniform resource locator
WO2022111394A1 (en) Information processing method and apparatus, and electronic devices
CN109032491A (en) Data processing method, device and mobile terminal
US9367225B2 (en) Electronic apparatus and computer-readable recording medium
CN105095366A (en) Method and device for processing character messages
WO2019228370A1 (en) Data processing method and device, mobile terminal and storage medium
US10963121B2 (en) Information display method, apparatus and mobile terminal
WO2019201109A1 (en) Word processing method and apparatus, and mobile terminal and storage medium
WO2019223484A1 (en) Information display method and apparatus, and mobile terminal and storage medium
CN108803961B (en) Data processing method and device and mobile terminal
CN109101163B (en) Long screen capture method and device and mobile terminal
CN108810262B (en) Application configuration method, terminal and computer readable storage medium
CN108811177B (en) Communication method and terminal

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19814091

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19814091

Country of ref document: EP

Kind code of ref document: A1