WO2022267696A1 - Content recognition method and apparatus, electronic device, and storage medium - Google Patents

Content recognition method and apparatus, electronic device, and storage medium Download PDF

Info

Publication number
WO2022267696A1
WO2022267696A1 PCT/CN2022/090382 CN2022090382W WO2022267696A1 WO 2022267696 A1 WO2022267696 A1 WO 2022267696A1 CN 2022090382 W CN2022090382 W CN 2022090382W WO 2022267696 A1 WO2022267696 A1 WO 2022267696A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
displayed
recognition result
content
electronic device
Prior art date
Application number
PCT/CN2022/090382
Other languages
French (fr)
Chinese (zh)
Inventor
徐思琪
Original Assignee
Oppo广东移动通信有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Oppo广东移动通信有限公司 filed Critical Oppo广东移动通信有限公司
Publication of WO2022267696A1 publication Critical patent/WO2022267696A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0481Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0481Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
    • G06F3/0482Interaction with lists of selectable items, e.g. menus
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • G06F3/04847Interaction techniques to control parameter settings, e.g. interaction with sliders or dials
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0487Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser
    • G06F3/0488Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser using a touch-screen or digitiser, e.g. input of commands through traced gestures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/30Scenes; Scene-specific elements in albums, collections or shared content, e.g. social network photos or video
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/60Type of objects
    • G06V20/62Text, e.g. of license plates, overlay texts or captions on TV images
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/70Labelling scene content, e.g. deriving syntactic or semantic representations

Definitions

  • the present application relates to the technical field of electronic equipment, and more specifically, to a content identification method, device, electronic equipment, and storage medium.
  • the present application proposes a content identification method, device, electronic device and storage medium to improve the above problems.
  • the present application provides a method for content recognition, which is applied to electronic equipment, and the method includes: displaying the captured image in real time; if the displayed image includes specified content, displaying a prompt at the specified content identification; in response to a touch operation acting on the prompt identification, identify the specified content; and output an identification result.
  • the present application provides a content recognition device that runs on electronic equipment, and the device includes: an image display unit, configured to display captured images in real time; and a content identification unit, configured to display images that include If there is specified content, a prompt mark is displayed at the specified content; a recognition unit is configured to identify the specified content in response to a touch operation acting on the prompt mark; a content output unit is configured to output a recognition result.
  • the present application provides an electronic device, including one or more processors and a memory; one or more programs are stored in the memory and configured to be executed by the one or more processors, The one or more programs are configured to perform the methods described above.
  • the present application provides a computer-readable storage medium storing program code executable by a processor, the computer-readable storage medium includes the stored program code, wherein, when the program code is running, the above-mentioned Methods.
  • FIG. 1 shows a schematic diagram of a scene of a content recognition method proposed by the present application
  • FIG. 2 shows a flow chart of a content identification method proposed by an embodiment of the present application
  • FIG. 3 shows a schematic diagram of an image collected by an electronic device in the present application
  • Fig. 4 shows a schematic diagram of a reminder mark in this application
  • Fig. 5 shows a schematic diagram of a variety of specified content in this application with a corresponding reminder for each specified content
  • Fig. 6 shows a schematic diagram of a display mode of a recognition result in this application
  • Fig. 7 shows a schematic diagram of another display mode of recognition results in this application.
  • FIG. 8 shows a flow chart of a content identification method proposed by another embodiment of the present application.
  • FIG. 10 shows another schematic diagram of displaying recognition results based on a full-screen mode in this application.
  • FIG. 11 shows a flow chart of a content identification method proposed by another embodiment of the present application.
  • Fig. 12 shows a flow chart of a content identification method proposed by another embodiment of the present application.
  • Fig. 13 shows a schematic diagram of the locking interface in this application
  • FIG. 14 shows a flow chart of a content identification method proposed by another embodiment of the present application.
  • Figure 15 shows a schematic diagram of the operation menu in this application
  • FIG. 16 shows a flow chart of a content identification method proposed by another embodiment of the present application.
  • Fig. 17 shows a schematic diagram of a complete display of objects in this application.
  • Figure 18 shows a schematic diagram of the size comparison of objects after increasing the focal length in the present application
  • FIG. 19 shows a flow chart of a content identification method proposed by another embodiment of the present application.
  • Fig. 20 shows a structural block diagram of a content identification device proposed in this application.
  • Fig. 21 shows a structural block diagram of another content identification device proposed by the present application.
  • FIG. 22 shows a structural block diagram of an electronic device for performing a content identification method according to an embodiment of the present application
  • Fig. 23 is a storage unit for saving or carrying program codes for realizing the content identification method according to the embodiment of the present application according to the embodiment of the present application.
  • the WiFi password will be printed on some paper, or will be posted on the wall, then in this case, the user will first operate the electronic device to take pictures of the WiFi password, and get The image including the WiFi password, and then operate other applications to perform text recognition on the image including the WiFi password, thereby extracting the WiFi password.
  • the inventor found that in the related identification process, it is necessary to first take pictures of the electronic device to obtain the image to be identified, and then operate the electronic device to obtain the image to be identified by the photo.
  • Image content recognition which in turn results in a cumbersome recognition process and poor user experience.
  • the inventor proposes a content recognition method, device, electronic device, and storage medium in the present application.
  • the collected images are displayed in real time, and when the displayed images include specified content
  • a prompt mark may be displayed at the specified content, and then the specified content may be identified in response to a touch operation acting on the prompt mark, and a recognition result may be output. Therefore, through the above method, when the electronic device is displaying the collected image in real time, after the electronic device automatically identifies the specified content appearing in the image displayed in real time by means of prompting and marking, it acts on the The touch operation of the prompt mark can directly trigger the electronic device to recognize the specified content, thereby simplifying the operation process of triggering the recognition of the image and improving the user experience.
  • the method provided in this embodiment may further include the following process: acquiring a background image, where the background image includes an image displayed by the electronic device when the touch operation acts on the prompt sign; The recognition result is displayed with the background image as the background.
  • the method provided in this embodiment may further include the following process: acquiring an image to be processed, the image to be processed is an image displayed by the electronic device when the touch operation acts on the prompt mark image; perform blurring processing on the image to be processed, and use the blurred image as a background image; display the recognition result with the background image as the background.
  • the method provided in this embodiment may further include the following process: displaying a first trigger control; and displaying the recognition result based on a full-screen mode in response to a touch operation acting on the first trigger control.
  • the method provided in this embodiment may further include the following process: displaying the recognition result; displaying the second trigger control; displaying a locking interface in response to a touch operation acting on the second trigger control, and the locking
  • the interface includes an image displayed by the electronic device when a touch operation acts on the prompt mark, and a prompt mark corresponding to the specified content in the displayed image.
  • the method provided in this embodiment may further include the following process: in response to the first operation, resume real-time display of the collected images.
  • the method provided in this embodiment may further include the following procedures: displaying the recognition result; displaying a third trigger control; displaying an operation menu in response to a touch operation acting on the third trigger control, and the operation
  • the menu includes at least one operation control, and each operation control corresponds to a different operation; in response to the touch operation acting on the operation control, the operation corresponding to the operation control with the touch operation is used as the target operation; the recognition result Execute the target action.
  • the method provided in this embodiment may further include the following process: performing zoom processing on the captured image in response to the zoom request.
  • the method provided in this embodiment may further include the following process: if there is an area selection operation acting on the image displayed in real time, detect whether the object in the selected area is completely displayed; if it is completely displayed, Then generate a size-increasing zoom request, and the size-increasing zoom request is used to make the objects in the selected area be completely displayed with the first target size; in response to the size-increasing zoom request, the collected The image is zoomed.
  • the method provided in this embodiment may further include the following process: if it is not fully displayed, generate a reduced-size zoom request, and the reduced-size zoom request is used to make the selected area
  • the object is displayed with the second target size; in response to the zoom request for reducing the size, the captured image is zoomed.
  • the specified content includes: text content or a target object.
  • the method provided in this embodiment may further include the following process: if the specified content is text content, use the scene image corresponding to the scene expressed by the semantics of the recognition result as the background image.
  • the method provided in this embodiment may further include the following process: if the specified content is text content, use an image corresponding to a keyword in the recognition result as a background image.
  • the method provided in this embodiment may further include the following process: merging the recognition result and the background image into one image to obtain a fused image; displaying the fused image .
  • the method provided in this embodiment may further include the following process: displaying the background image, and suspending the recognition result on the displayed background image.
  • the method provided in this embodiment may further include the following process: if the specified content is text content, the recognition result is enlarged in size and then displayed.
  • the method provided in this embodiment may further include the following process: start the camera program in response to the user's operation; after starting the camera program, display the captured image in real time in the interface of the displayed camera program .
  • the content identification method provided in the embodiment of the present application may be independently executed by an electronic device.
  • the electronic device collects images through its own image acquisition device, then displays the image collected by the image acquisition device in real time, and executes the content recognition method provided by the embodiment of the present application on the image displayed in real time.
  • the content identification method provided in the embodiment of the present application may be executed cooperatively by at least two electronic devices. As shown in FIG. 1 , in the scene shown in FIG.
  • the electronic equipment 100 and electronic equipment 200 there are electronic equipment 100 and electronic equipment 200, wherein the electronic equipment 200 can perform image acquisition through its configured image acquisition device, and then transmit the acquired image to itself
  • the network module of the electronic device 100 can use the network module to transmit the captured image to the network module of the electronic device 100, and the processor of the electronic device 100 can obtain the image received by its own network module, and then execute the application embodiment provided by the obtained image content identification method.
  • the electronic devices for example, the electronic device 100 and the electronic device 200
  • the electronic devices may be mobile phones, tablet computers and other devices.
  • the two electronic devices may be of the same type, or may be of different types.
  • the two electronic devices shown in FIG. 1 are both smart phones, it may also include that one electronic device is a smart phone and the other electronic device is a smart watch.
  • a content identification method provided by this application is applied to electronic equipment, and the method includes:
  • S110 Display the collected images in real time.
  • the displayed image is collected by an image acquisition device (for example, a camera), then displaying the image in real time can be understood as displaying the image collected by the image acquisition device in real time, so that the user can view the current electronic equipment.
  • the images collected by the image acquisition device are previewed.
  • the displayed content will also change synchronously.
  • the electronic device can start the camera program in response to the user's operation, and after the camera program is started, the captured image can be displayed in real time on the displayed interface of the camera program.
  • the designated content may be text content, or may be a target object.
  • the target object may include a human face and the like.
  • a prompt logo may be displayed at the specified content.
  • the prompt identifier can be a frame surrounding the specified content.
  • the user uses an electronic device to shoot content in the remote projection screen 10 , and correspondingly, the electronic device will process the captured image in the interface 11 Real-time display, and then the projection screen 10 and the content in the projection screen 10 will be displayed synchronously on the interface 11 .
  • the electronic device will display a frame 12 surrounding the text content as a prompt mark at the area where the text content is included.
  • the specific style of the prompt mark is not specifically limited in this embodiment of the application, and it may be other styles besides the box shown in FIG. 3 .
  • the prompt mark can also be a transparent layer 13 with color, and the transparent layer 13 can cover the specified content, so that the user can see the specified content through the transparent layer 13 .
  • the color of the transparent layer may be yellow or green.
  • different types of specified content may have different prompting signs corresponding to them, so that the user can quickly distinguish the specified content through the prompting mark.
  • the face of the characters will be determined as a specified content
  • the bus stop Since the card contains text content, it can also be identified as a specified content.
  • the frame 14 can be used as a prompt mark
  • the transparent layer 15 can be used as a prompt mark.
  • a transparent layer may also be used as a corresponding prompting mark
  • a frame may also be used as a prompting mark for text content.
  • the touch operation may be a click operation.
  • the touch operation may be a double-tap operation.
  • the touch operation that triggers the recognition of the specified content can also be configured by the user according to his own needs, so as to meet the personalized needs of the user.
  • the content identification methods corresponding to different specified content may be different.
  • the specified content is text content
  • the corresponding recognition of the text content can be understood as extracting the text content from the image.
  • the electronic device may extract text content from the image based on optical character recognition (Optical Character Recognition, OCR).
  • optical character recognition refers to the process of analyzing and recognizing image files of text materials to obtain text and layout information. That is to recognize the text in the image and return it in the form of text. It should be noted that, taking the specified content as an example of text content, in the aforementioned S120, recognizing that there is specified content in the image displayed in real time can be understood as recognizing a certain area in the image as an area containing text content, but the electronic The device does not yet know what the specific content of the text content in this area is. Then, by identifying the specified content, the specific content of the text content can be determined.
  • the specified content is a human face
  • the electronic device detects a human face in the picture displayed in real time
  • the electronic device only detects a human face in the image, but does not know who the human face represents.
  • the identity information may include age or gender.
  • outputting the recognition result may include displaying the recognition result.
  • the electronic device may display the recognition result on a certain interface.
  • FIG. 6 there are two types of prompt signs, the transparent layer 15 and the frame 14, displayed in the left image of FIG.
  • the device can recognize the text content on the transparent layer 15 in response to the touch operation acting on the transparent layer 15 , and then display the recognition result in the interface 16 as shown in the right image of FIG. 6 .
  • the recognition result may be displayed after being enlarged when displaying the recognition result, so that the user can see the recognition result more clearly.
  • the electronic device can recognize the face at the frame 14 in response to the touch operation acting on the frame 14, and directly
  • the recognition result is displayed in the original interface, and the recognition result includes the recognized gender and age.
  • the recognition result may be displayed after being enlarged in size.
  • the multiple of the size magnification may be a multiple of a screen adapted to the electronic device, so that the user can have a better reading experience.
  • the electronic device may output the recognition result by sending it to an external device.
  • the electronic device can directly send the content in the recognition result to the external device.
  • the external device may include a server, or may be an electronic device of another user.
  • the electronic device can directly upload the recognition result to the server, and instruct the server to store the uploaded content and the account corresponding to the electronic device, so that the electronic device can use the corresponding account from The uploaded content is retrieved from the server.
  • the electronic device can The communication method establishes a communication connection and sends the recognition result to the external device. If there is an association between the account of the external device and the account of the electronic device, the electronic device may send the identification result to the external device through the association between the accounts.
  • WiFi short-range wireless communication
  • the electronic device may determine which manner to use for output by triggering a touch operation for identifying specified content.
  • the click operation may be configured to correspond to outputting the recognition result by displaying, and the double-click operation may be configured to correspond to outputting the recognition result by sending it to an external device.
  • the click operation may also be configured to output the recognition result by sending it to an external device, and the double-click operation may be configured to output the recognition result by displaying it.
  • a default output manner may be configured in a setting interface of the electronic device. Then, when the electronic device needs to output the recognition result, it will read the default output mode in the electronic device, and then output the recognition result according to the default output mode. For example, if the default output method is output by display, the electronic device will only display the recognition result.
  • the interface for displaying the recognition result can also be configured with a trigger to jump to the
  • the controls of the interface for displaying the collected images in real time, the configuration mode and the trigger mode of the controls are not specifically limited in this embodiment of the present application.
  • This embodiment provides a content recognition method.
  • the captured image will be displayed in real time, and if the displayed image contains specified content, a prompt mark can be displayed at the specified content, and then respond A touch operation acting on the prompt mark identifies the specified content and outputs a recognition result. Therefore, through the above method, when the electronic device is displaying the collected image in real time, after the electronic device automatically identifies the specified content appearing in the image displayed in real time by means of prompting and marking, it acts on the The touch operation of the prompt mark can directly trigger the electronic device to recognize the specified content, thereby simplifying the operation process of triggering the recognition of the image and improving the user experience.
  • a content identification method provided by the present application is applied to electronic equipment, and the method includes:
  • S210 Display the collected images in real time.
  • S230 Identify the specified content in response to a touch operation acting on the prompt mark.
  • S240 Acquire a background image, where the background image includes an image displayed by the electronic device when a touch operation is performed on the prompt sign.
  • the image currently displayed by the electronic device may be used as a corresponding background image for subsequent recognition result display.
  • the corresponding background image may also be determined according to the content of the recognition result.
  • the scene image corresponding to the scene expressed by the semantics of the recognition result may be used as the background image.
  • the expressed scene may be a teaching scene, and then the scene image corresponding to the teaching scene may be determined as the background image.
  • the scene image corresponding to the teaching scene may be an image whose content is a classroom, or an image whose content is a blackboard.
  • the semantic expression of the recognition result is a bus as shown in FIG. 5
  • the expressed scene can be a traffic scene, and then the scene image corresponding to the traffic scene can be determined as the background image.
  • the scene image corresponding to the traffic scene may be a vehicle.
  • the electronic device can obtain the scene expressed by the recognition result through the neural network model obtained through pre-training.
  • the image corresponding to the keyword in the recognition result may be used as the background image.
  • the recognition result includes the keyword "mobile phone”, then the image corresponding to the mobile phone can be used as the background image.
  • the recognition result includes "bus”, then the image corresponding to the bus may be used as the background image.
  • the correspondence between keywords and images may be stored in the electronic device, so that after the electronic device obtains the keywords in the recognition result, the image corresponding to the keywords may be obtained through the correspondence.
  • the recognition result when displaying the recognition result, the recognition result may be displayed based on the background image.
  • displaying the recognition result based on the background image may include: fusing the recognition result and the background image into one image, and then displaying the fused image. In this way, after the recognition result is obtained, the background image can be obtained first, and then the recognition result is integrated into the background image to obtain the background image integrated with the recognition result.
  • displaying the recognition result based on the background image may include: displaying the background image, and suspending the recognition result on the displayed background image.
  • the outputting the recognition result further includes: displaying a first trigger control; after displaying the recognition result with the background image as the background, it further includes: responding to a touch acting on the first trigger control control operation, and display the recognition result based on the full screen mode.
  • the full-screen mode can be understood as displaying the recognition result based on a full-screen display.
  • the electronic device may only display the background image and the recognition result.
  • the electronic device in addition to displaying background images and recognition results, the electronic device can also display a status bar set on the top of the electronic device.
  • the status column may include at least one of a battery status and a wireless signal status.
  • the recognition result may be displayed in the style shown in the left image in FIG. 9.
  • a status bar 17 and an operation area 18 are also displayed.
  • a first trigger control named “Reading Mode” can be displayed in the status bar 17 . If it is detected that the first trigger control named "reading mode” is active, the electronic device switches to the style shown in the right image in FIG. 9 to display the recognition result. In the right image of FIG. 9 , the status bar 17 and the operation area 18 are canceled, and the full-screen display interface 19 is used to display the recognition result, thereby realizing full-screen display of the recognition result.
  • a background image (not shown in the figure) may be displayed on the interface 19 .
  • the interface 19 for displaying the recognition results will not cover all areas of the display screen as shown in FIG.
  • a status bar 191 may also be displayed.
  • the battery status and the wireless signal status can be displayed.
  • the wireless signal state may include a WiFi state and a mobile communication signal state.
  • This embodiment provides a method for content recognition, so that in the state where the electronic device is displaying the collected images in real time, the electronic device automatically recognizes the images displayed in real time by means of prompting and marking After the specified content appears, the electronic device can be directly triggered to recognize the specified content through the touch operation acting on the prompt mark, thereby simplifying the operation process of triggering image recognition and improving user experience. Moreover, in this embodiment, when the recognition result is displayed, the recognition result will be displayed with the background image as the background, thereby helping to reduce the degree of interface change perceived by the user.
  • a content identification method provided by the present application is applied to electronic devices, and the method includes:
  • S260 Display the collected images in real time.
  • S262 Identify the specified content in response to a touch operation acting on the prompt mark.
  • S263 Acquire an image to be processed, where the image to be processed is an image displayed by the electronic device when a touch operation is performed on the prompt sign.
  • S264 Perform blurring processing on the image to be processed, and use the blurred image as a background image.
  • S265 Display the recognition result with the background image as the background.
  • This embodiment provides a method for content recognition, so that in the state where the electronic device is displaying the collected images in real time, the electronic device automatically recognizes the images displayed in real time by means of prompting and marking After the specified content appears, the electronic device can be directly triggered to recognize the specified content through the touch operation acting on the prompt mark, thereby simplifying the operation process of triggering image recognition and improving user experience. Blurring the image to be processed can make the final background image visually blurred, which makes it easier for the user to see the content of the recognition result, and also allows the user to pay more attention to the recognition result itself.
  • a content identification method provided by the present application is applied to electronic equipment, and the method includes:
  • S310 Display the collected images in real time.
  • S330 Identify the specified content in response to a touch operation acting on the prompt mark.
  • displaying the second trigger control and displaying the recognition result can be performed simultaneously, so that the user can visually perceive that the second trigger control and the recognition result are displayed together on the electronic device. on the screen.
  • S360 In response to the touch operation acting on the second trigger control, display a lock interface, where the lock interface includes an image displayed by the electronic device when the touch operation acts on the prompt sign, and the The prompt identifier corresponding to the specified content in the displayed image.
  • the electronic device can specify the corresponding The content is recognized, and the recognition result is displayed through the style shown in the middle image of Figure 13.
  • the second trigger control 20 is also displayed therein. If a touch operation on the second trigger control 20 is detected, the electronic device will display the lock interface 21 shown in the right image of FIG. 13 . As shown in the right image of FIG. 13, when the image content in the locking interface 21 and the touch operation acting on the prompt mark are in effect, the image content displayed by the electronic device (that is, the image content shown in the left image of FIG. 13 content) are the same.
  • the image content in the locking interface 21 is a static image
  • the content shown in the left image of FIG. 13 is an image captured by the image acquisition device of the electronic device in real time.
  • the static image in the locking interface 21 can be understood as that even if the image captured by the image acquisition device of the electronic device changes, the image content in the locking interface 21 remains unchanged.
  • the content of the image displayed by the electronic device changes in real time as the image captured by the image capture device changes.
  • the electronic device may stop image collection during image collection, so as to reduce power consumption.
  • the method further includes: in response to the first operation, resuming real-time display of the collected images.
  • the first operation may be a double-click operation acting on the display screen.
  • the electronic device can display a recovery control that triggers the recovery of the real-time display of the collected image, so that the electronic device detects that the touch that acts on the recovery control When the control operation is performed, the real-time display of the collected images is resumed.
  • resuming the real-time display of the collected images may include: starting the image acquisition device, and performing the image acquisition on the images collected by the image acquisition device after starting real-time display. If in the process of displaying the locked interface, the image acquisition device is still collecting images and buffering the acquired images to a designated area, but does not read the buffered images from the designated area for real-time display, then the electronic device In the process of resuming the real-time display of the collected images, it can be known that the resuming is executed to read the images from the designated area for display.
  • This embodiment provides a method for content recognition, so that in the state where the electronic device is displaying the collected images in real time, the electronic device automatically recognizes the images displayed in real time by means of prompting and marking After the specified content appears, the electronic device can be directly triggered to recognize the specified content through the touch operation acting on the prompt mark, thereby simplifying the operation process of triggering image recognition and improving user experience.
  • the second trigger control can also be displayed, so that the user can touch the second trigger control to cause the electronic device to display the touch operation function including the prompt mark,
  • the image displayed by the electronic device, and the lock interface of the prompt logo corresponding to the specified content in the displayed image so that when there are multiple prompt logos, the user can trigger other prompts again in the lock interface Prompt for identification to identify additional specified content and display the identification result.
  • a content identification method provided by the present application is applied to electronic devices, and the method includes:
  • S410 Display the collected images in real time.
  • S430 Identify the specified content in response to a touch operation acting on the prompt mark.
  • displaying the third trigger control and displaying the recognition result can be performed simultaneously, so that the user can visually perceive that the third trigger control and the recognition result are displayed together on the electronic device. on the screen.
  • S460 Display an operation menu in response to a touch operation acting on the third trigger control, where the operation menu includes at least one operation control, and each operation control corresponds to a different operation.
  • the third trigger control 22 may be displayed simultaneously when the recognition result is displayed. Then, in response to the touch operation acting on the third trigger control 22 , an operation menu 23 may be displayed.
  • the operation menu 23 includes an operation control named send, an operation control named copy, an operation control named save as document, and an operation control named save as sticky note.
  • the operation corresponding to the touch operation named sending includes sending the recognition result to a third-party application program.
  • the third-party application program may include an instant messaging application program or a short message program.
  • the operation corresponding to the operation control named copy includes copying the recognition result, so that after the copy operation is performed, the electronic device can input the recognition result by pasting in other positions where text input is possible.
  • the operation corresponding to the operation control named as saving the document may include storing the recognition result in the form of a document.
  • the operation corresponding to the operation control named "Save Memo" may include storing the recognition result in the form of Memo.
  • S470 In response to a touch operation acting on the operation control, use an operation corresponding to the operation control with the touch operation as a target operation.
  • the electronic device when the electronic device displays the recognition result based on the full-screen mode, the electronic device can also display the third touch control at the same time, and the function of the third touch control displayed in the full-screen mode is the same as The functions of the third touch controls shown in FIG. 15 are the same.
  • This embodiment provides a method for content recognition, so that in the state where the electronic device is displaying the collected images in real time, the electronic device automatically recognizes the images displayed in real time by means of prompting and marking After the specified content appears, the electronic device can be directly triggered to recognize the specified content through the touch operation acting on the prompt mark, thereby simplifying the operation process of triggering image recognition and improving user experience.
  • a third trigger control that triggers further operations on the recognition result will also be displayed, so that the user can call up the operation menu by directly operating the third trigger control, and based on the operation menu The operation controls in to perform further operations on the recognition results.
  • a content identification method provided by the present application is applied to electronic devices, and the method includes:
  • S510 Display the collected images in real time.
  • the size of the object in the collected image is too small, so that the electronic device cannot effectively recognize the object in the image, or the electronic device cannot effectively recognize the text on the object .
  • the objects in the collected images cannot be completely displayed on the screen, which will also cause the electronic device to fail to effectively identify the objects.
  • the zooming process in the embodiment of the present application may include zooming by changing the focal length of the image acquisition device, or may be zoomed by digital zooming.
  • the electronic device increases the area of each pixel in the captured image through a processor, so as to achieve the purpose of zooming.
  • the zooming processing of the collected image in response to the zoom request includes: if there is an area selection operation acting on the image displayed in real time, detecting whether the object in the selected area is completely displayed; if is completely displayed, then generate a zoom request of increasing size, and the zoom request of increasing size is used to make the object in the selected area be completely displayed with the first target size; in response to the zoom request of increasing size, Perform zoom processing on the captured image.
  • the complete display of the object can be understood as that the overall outline of the object is within the collection range of the image collection device.
  • the left image of Figure 17 includes a girl and the bus stop sign next to the girl, and in the illustration on the left side of Figure 17, the overall outline of the girl and the bus stop sign are all in Within the collection range of the image collection device, the girl and the bus stop sign are completely displayed.
  • the girl's feet nor the right side of the bus stop can be seen, so in the right image of Figure 17, neither the girl nor the bus stop are fully displayed
  • the zoom request for reducing the size is used to display the object in the selected area with a second target size; responding to the zoom request for reducing the size , to perform zoom processing on the captured image.
  • the first target size in the embodiment of the present application is the corresponding maximum size when the objects in the selected area can be completely displayed. It can be understood that when the focal length is increased, the size of the object in the captured image will increase accordingly. Exemplarily, as shown in Figure 18, the image on the left side of Figure 18 shows the image before increasing the focal length, and the image on the right side of Figure 18 shows the image after increasing the focal length, in the image after increasing the focal length The size of the stop sign will be larger than the size in the image on the right, which is beneficial for the text content in the bus stop sign to be detected. However, when the object increases to a certain extent, the object may not be completely displayed.
  • the second target size is the size when the objects in the selected area can be displayed in the most complete state. It is understandable that during the process of reducing the focal length, it may be reduced to the minimum focal length supported by the image acquisition device in time, and the object cannot be completely displayed, but compared with before the focal length is reduced, the range of the object displayed on the screen will be smaller is larger, thereby benefiting the probability that the specified content at the object is successfully detected.
  • S540 Identify the specified content in response to a touch operation acting on the prompt mark.
  • This embodiment provides a method for content recognition, so that in the state where the electronic device is displaying the collected images in real time, the electronic device automatically recognizes the images displayed in real time by means of prompting and marking After the specified content appears, the electronic device can be directly triggered to recognize the specified content through the touch operation acting on the prompt mark, thereby simplifying the operation process of triggering image recognition and improving user experience.
  • the electronic device may zoom the image acquisition device for capturing images in response to the zoom request triggered by the area selection operation, so that objects in the area selected by the area selection operation can be displayed at a target size. displayed in order to increase the probability of being detected from the live image.
  • the multiple modes can be configured in the camera.
  • the multiple modes may include a magnification mode.
  • the electronic device can detect whether there is target information in the image displayed in the viewfinder frame.
  • the target information here can be understood as the specified content in the foregoing embodiments.
  • the image displayed in the viewfinder frame is the image collected by the image acquisition device displayed in real time.
  • S630 A target information pre-recognition frame appears in the viewfinder frame.
  • S631 Obtain the location circled by the user.
  • the location circled by the user in S631 can be understood as the area selection operation in the foregoing embodiments, then after acquiring the circled location of the user, the operation performed by the electronic device is the same as the response to the region selection operation in the foregoing embodiments The operations performed afterwards are the same.
  • the box with the pre-identification can be understood as a kind of prompt mark pointed out in the foregoing embodiments.
  • the electronic device After the electronic device detects the operation of clicking the pre-recognition frame, it can recognize the target information at the pre-recognition frame.
  • S650 The recognition content is amplified and output.
  • S651 may be executed: performing further operations through the operation menu.
  • the further operation may include operations corresponding to the operation controls in the operation menu in the foregoing embodiments.
  • the reading mode can be understood as entering a full-screen mode in the foregoing embodiments to display the recognition result.
  • S661 can be executed: performing further operations through the operation menu.
  • the further operation may include operations corresponding to the operation controls in the operation menu in the foregoing embodiments.
  • a content identification device 600 provided by the present application runs on an electronic device, and the device 600 includes:
  • the image display unit 610 is configured to display the collected images in real time.
  • the image display unit 610 is specifically configured to start the camera program in response to the user's operation; after starting the camera program, display the captured image in real time on the displayed interface of the camera program.
  • the content identification unit 620 is configured to display a prompt mark at the specified content if the displayed image includes the specified content.
  • the identifying unit 630 is configured to identify the specified content in response to a touch operation acting on the prompt mark.
  • the content output unit 640 is configured to output the recognition result.
  • the content output unit 640 is specifically configured to obtain a background image, the background image includes an image displayed by the electronic device when the touch operation acts on the prompt sign; the background image is The background displays the recognition result.
  • the content output unit 640 is specifically configured to acquire an image to be processed, the image to be processed is an image displayed by the electronic device when a touch operation is applied to the prompt mark; The image is blurred, and the blurred image is used as a background image.
  • the content output unit 640 is also specifically configured to display a first trigger control; after displaying the recognition result with the background image as the background, it further includes: responding to a touch operation acting on the first trigger control, based on The full-screen mode displays the recognition result.
  • the content output unit 640 is specifically configured to display the recognition result.
  • the content output unit 640 is further configured to display a second trigger control; in response to a touch operation acting on the second trigger control, display a lock interface, and the lock interface includes a , the image displayed by the electronic device, and a prompt identifier corresponding to the specified content in the displayed image.
  • the content output unit 640 is further configured to resume real-time display of the collected images in response to the first operation.
  • the content output unit 640 is specifically configured to display the recognition result.
  • the content output unit 640 is further configured to display a third trigger control; in response to a touch operation acting on the third trigger control, an operation menu is displayed, and the operation menu includes at least one operation control, and each operation control corresponds to The operations are different; in response to the touch operation acting on the operation control, the operation corresponding to the operation control with the touch operation is used as the target operation; and the target operation is performed on the recognition result.
  • the content output unit 640 is specifically configured to use the scene image corresponding to the scene expressed by the semantics of the recognition result as the background image if the specified content is text content.
  • the content output unit 640 is specifically configured to use an image corresponding to a keyword in the recognition result as a background image if the specified content is text content.
  • the content output unit 640 is specifically configured to fuse the recognition result and the background image into one image to obtain a fused image; and display the fused image.
  • the content output unit 640 is specifically configured to display the background image, and suspend the recognition result on the displayed background image.
  • the content output unit 640 is specifically configured to, if the specified content is text content, enlarge the size of the recognition result before displaying it.
  • the device further includes: a zoom unit 650 configured to perform zoom processing on the captured image in response to a zoom request.
  • the zoom unit 650 is specifically configured to detect whether the object in the selected area is fully displayed if there is an area selection operation acting on the image displayed in real time; if it is fully displayed, generate a zoom request for increasing the size , the size-increasing zoom request is used to completely display the object in the selected area with a first target size; in response to the size-increasing zoom request, zoom processing is performed on the captured image.
  • the zoom request for reducing the size is used to display the object in the selected area with a second target size; responding to the zoom request for reducing the size , to perform zoom processing on the captured image.
  • the specified content includes: text content or a target object.
  • the content recognition device provided by the present application can display the collected image in real time, and when the displayed image contains specified content, it can display a prompt mark at the specified content, and then respond to the action on the specified content.
  • the touch operation of the above-mentioned prompt mark is used to identify the specified content and output the recognition result. Therefore, through the above method, when the electronic device is displaying the collected image in real time, after the electronic device automatically identifies the specified content appearing in the image displayed in real time by means of prompting and marking, it acts on the The touch operation of the prompt mark can directly trigger the electronic device to recognize the specified content, thereby simplifying the operation process of triggering the recognition of the image and improving the user experience.
  • the electronic device 1000 includes one or more (only one is shown in the figure) processors 102 , memory 104 , network module 106 , sensor module 108 , image acquisition device 110 and screen 112 coupled to each other.
  • the memory 104 stores programs capable of executing the contents of the foregoing embodiments, and the processor 102 can execute the programs stored in the memory 104 .
  • the processor 102 may include one or more cores for processing data.
  • the processor 102 uses various interfaces and circuits to connect various parts of the entire electronic device 1000, and executes or executes instructions, programs, code sets, or instruction sets stored in the memory 104, and calls data stored in the memory 104 to execute Various functions of the electronic device 1000 and processing data.
  • the processor 102 may adopt at least one of Digital Signal Processing (Digital Signal Processing, DSP), Field-Programmable Gate Array (Field-Programmable Gate Array, FPGA), and Programmable Logic Array (Programmable Logic Array, PLA). implemented in the form of hardware.
  • DSP Digital Signal Processing
  • FPGA Field-Programmable Gate Array
  • PLA Programmable Logic Array
  • the processor 102 may integrate one or a combination of a central processing unit (Central Processing Unit, CPU), an image processor (Graphics Processing Unit, GPU), a modem, and the like.
  • CPU Central Processing Unit
  • GPU Graphics Processing Unit
  • the CPU mainly handles the operating system, user interface and application programs, etc.
  • the GPU is used to render and draw the displayed content
  • the modem is used to handle wireless communication. It can be understood that the above-mentioned modem may not be integrated into the processor 102, but may be realized by a communication chip alone.
  • the memory 104 may include random access memory (Random Access Memory, RAM), and may also include read-only memory (Read-Only Memory). Memory 104 may be used to store instructions, programs, codes, sets of codes, or sets of instructions.
  • the memory 104 may include a program storage area and a data storage area, wherein the program storage area may store instructions for implementing an operating system, instructions for implementing at least one function (such as a touch function, a sound playback function, an image playback function, etc.) , instructions for implementing the following method embodiments, and the like.
  • content identification means may be stored in the memory 104 .
  • the device for content identification may be the aforementioned device 600 .
  • the storage data area can also store data created by the electronic device 1000 during use (such as phonebook, audio and video data, chat record data) and the like.
  • the network module 106 is used to receive and send electromagnetic waves, realize mutual conversion between electromagnetic waves and electrical signals, and communicate with communication networks or other devices, such as audio playback devices.
  • the network module 106 may include various existing circuit elements for performing these functions, such as antennas, radio frequency transceivers, digital signal processors, encryption/decryption chips, Subscriber Identity Module (SIM) cards, memory, etc. .
  • SIM Subscriber Identity Module
  • the network module 106 can communicate with various networks such as the Internet, intranet, wireless network or communicate with other devices through the wireless network.
  • the wireless network mentioned above may include a cellular telephone network, a wireless local area network or a metropolitan area network.
  • the network module 106 can perform information exchange with the base station.
  • the sensor module 108 may include at least one sensor.
  • the sensor module 108 may include, but is not limited to: a light sensor, a motion sensor, a pressure sensor, an infrared heat sensor, a distance sensor, an acceleration sensor, and other sensors.
  • the pressure sensor may be a sensor for detecting pressure generated by pressing on the electronic device 1000 . That is, the pressure sensor detects pressure generated by contact or press between the user and the electronic device, eg, contact or press between the user's ear and the mobile terminal. Therefore, the pressure sensor can be used to determine whether contact or pressure occurs between the user and the electronic device 1000, and the magnitude of the pressure.
  • the acceleration sensor can detect the magnitude of acceleration in various directions (generally three axes), and can detect the magnitude and direction of gravity when it is stationary, and can be used to identify the application of the posture of the electronic device 1000 (such as horizontal and vertical screen switching, related games, Magnetometer posture calibration), vibration recognition related functions (such as pedometer, tapping), etc.
  • the electronic device 1000 may also be configured with other sensors such as a gyroscope, a barometer, a hygrometer, and a thermometer, which will not be repeated here.
  • the image acquisition device 110 can be used for image acquisition, so that the electronic device 1000 can display the acquired image on the screen 112 .
  • FIG. 23 shows a structural block diagram of a computer-readable storage medium provided by an embodiment of the present application.
  • Program codes are stored in the computer-readable storage medium 1100, and the program codes can be invoked by a processor to execute the methods described in the foregoing method embodiments.
  • the computer readable storage medium 1100 may be an electronic memory such as flash memory, EEPROM (Electrically Erasable Programmable Read Only Memory), EPROM, hard disk, or ROM.
  • the computer-readable storage medium 1100 includes a non-transitory computer-readable storage medium (non-transitory computer-readable storage medium).
  • the computer-readable storage medium 1100 has a storage space for program code 1110 for executing any method steps in the above methods. These program codes can be read from or written into one or more computer program products.
  • Program code 1110 may, for example, be compressed in a suitable form.
  • the present application provides a content identification method, device, electronic equipment, and storage medium.
  • the collected image is displayed in real time, and when the displayed image includes specified content, a prompt mark can be displayed at the specified content, and then in response to the touch operation acting on the prompt mark, Recognize the specified content and output the recognition result. Therefore, through the above method, when the electronic device is displaying the collected image in real time, after the electronic device automatically identifies the specified content appearing in the image displayed in real time by means of prompting and marking, it acts on the The touch operation of the prompt mark can directly trigger the electronic device to recognize the specified content, thereby simplifying the operation process of triggering the recognition of the image and improving the user experience.
  • first and second are used for descriptive purposes only, and cannot be interpreted as indicating or implying relative importance or implicitly specifying the quantity of indicated technical features.
  • the features defined as “first” and “second” may explicitly or implicitly include at least one of these features.
  • “plurality” means at least two, such as two, three, etc., unless otherwise specifically defined.
  • each part of the present application may be realized by hardware, software, firmware or a combination thereof.
  • various steps or methods may be implemented by software or firmware stored in memory and executed by a suitable instruction execution system.
  • a suitable instruction execution system For example, if implemented in hardware, as in another embodiment, it can be implemented by any one or combination of the following techniques known in the art: Discrete logic circuits, ASICs with suitable combinational logic gates, programmable gate arrays (PGAs), field programmable gate arrays (FPGAs), etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

Embodiments of the present application disclose a content recognition method and apparatus, an electronic device, and a storage medium. The method comprises: displaying a collected image in real time; if the displayed image includes specified content, displaying a prompting identifier on the specified content; recognizing the specified content in response to a touch operation acting on the prompting identifier; and outputting a recognition result. Thus, by using the described means, in the state in which an electronic device displays a collected image in real time, the electronic device automatically uses a means of a prompting identifier to recognize specified content appearing in the image displayed in real time, and then, by means of a touch operation acting on the prompting identifier, the electronic device can be directly triggered to recognize the specified content, thus the operation process of triggering the recognition of an image is simplified, and user experience is improved.

Description

内容识别方法、装置、电子设备以及存储介质Content identification method, device, electronic device and storage medium
相关申请的交叉引用Cross References to Related Applications
本申请要求于2021年6月24日提交的申请号为202110706063.9的中国申请的优先权,其在此出于所有目的通过引用将其全部内容并入本文。This application claims priority to Chinese application No. 202110706063.9 filed on June 24, 2021, which is hereby incorporated by reference in its entirety for all purposes.
技术领域technical field
本申请涉及电子设备技术领域,更具体地,涉及一种内容识别方法、装置、电子设备以及存储介质。The present application relates to the technical field of electronic equipment, and more specifically, to a content identification method, device, electronic equipment, and storage medium.
背景技术Background technique
随着内容识别技术的发展,更多的电子设备都支持基于图像的内容识别。With the development of content recognition technology, more electronic devices support image-based content recognition.
发明内容Contents of the invention
鉴于上述问题,本申请提出了一种内容识别方法、装置、电子设备以及存储介质,以改善上述问题。In view of the above problems, the present application proposes a content identification method, device, electronic device and storage medium to improve the above problems.
第一方面,本申请提供了一种内容识别方法,应用于电子设备,所述方法包括:对采集的图像进行实时显示;若所显示的图像包括有指定内容,在所述指定内容处显示提示标识;响应作用于所述提示标识的触控操作,对所述指定内容进行识别;输出识别结果。In the first aspect, the present application provides a method for content recognition, which is applied to electronic equipment, and the method includes: displaying the captured image in real time; if the displayed image includes specified content, displaying a prompt at the specified content identification; in response to a touch operation acting on the prompt identification, identify the specified content; and output an identification result.
第二方面,本申请提供了一种内容识别装置,运行于电子设备,所述装置包括:图像显示单元,用于对采集的图像进行实时显示;内容标识单元,用于若所显示的图像包括有指定内容,在所述指定内容处显示提示标识;识别单元,用于响应作用于所述提示标识的触控操作,对所述指定内容进行识别;内容输出单元,用于输出识别结果。In a second aspect, the present application provides a content recognition device that runs on electronic equipment, and the device includes: an image display unit, configured to display captured images in real time; and a content identification unit, configured to display images that include If there is specified content, a prompt mark is displayed at the specified content; a recognition unit is configured to identify the specified content in response to a touch operation acting on the prompt mark; a content output unit is configured to output a recognition result.
第三方面,本申请提供了一种电子设备,包括一个或多个处理器以及存储器;一个或多个程序被存储在所述存储器中并被配置为由所述一个或多个处理器执行,所述一个或多个程序配置用于执行上述的方法。In a third aspect, the present application provides an electronic device, including one or more processors and a memory; one or more programs are stored in the memory and configured to be executed by the one or more processors, The one or more programs are configured to perform the methods described above.
第四方面,本申请提供的一种存储有处理器可执行的程序代码的计算机可读存储介质,所述计算机可读存储介质包括存储的程序代码,其中,在所述程序代码运行时执行上述的方法。In a fourth aspect, the present application provides a computer-readable storage medium storing program code executable by a processor, the computer-readable storage medium includes the stored program code, wherein, when the program code is running, the above-mentioned Methods.
附图说明Description of drawings
为了更清楚地说明本申请实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present application, the drawings that need to be used in the description of the embodiments will be briefly introduced below. Obviously, the drawings in the following description are only some embodiments of the present application. For those skilled in the art, other drawings can also be obtained based on these drawings without any creative effort.
图1示出了本申请提出的一种内容识别方法的的场景示意图;FIG. 1 shows a schematic diagram of a scene of a content recognition method proposed by the present application;
图2示出了本申请一实施例提出的一种内容识别方法的流程图;FIG. 2 shows a flow chart of a content identification method proposed by an embodiment of the present application;
图3示出了本申请中的电子设备采集图像的示意图;FIG. 3 shows a schematic diagram of an image collected by an electronic device in the present application;
图4示出了本申请中的一种提示标识的示意图;Fig. 4 shows a schematic diagram of a reminder mark in this application;
图5示出了本申请中的多种指定内容以每个指定内容各自对应的提示标识的示意图;Fig. 5 shows a schematic diagram of a variety of specified content in this application with a corresponding reminder for each specified content;
图6示出了本申请中的一种识别结果的显示方式的示意图;Fig. 6 shows a schematic diagram of a display mode of a recognition result in this application;
图7示出了本申请中的另一种识别结果的显示方式的示意图;Fig. 7 shows a schematic diagram of another display mode of recognition results in this application;
图8示出了本申请另一实施例提出的一种内容识别方法的流程图;FIG. 8 shows a flow chart of a content identification method proposed by another embodiment of the present application;
图9示出了本申请中一种基于全屏模式显示识别结果的示意图;FIG. 9 shows a schematic diagram of displaying recognition results based on a full-screen mode in the present application;
图10示出了本申请中另一种基于全屏模式显示识别结果的示意图;FIG. 10 shows another schematic diagram of displaying recognition results based on a full-screen mode in this application;
图11示出了本申请再一实施例提出的一种内容识别方法的流程图;FIG. 11 shows a flow chart of a content identification method proposed by another embodiment of the present application;
图12示出了本申请再一实施例提出的一种内容识别方法的流程图;Fig. 12 shows a flow chart of a content identification method proposed by another embodiment of the present application;
图13示出了本申请中锁定界面的示意图;Fig. 13 shows a schematic diagram of the locking interface in this application;
图14示出了本申请又一实施例提出的一种内容识别方法的流程图;FIG. 14 shows a flow chart of a content identification method proposed by another embodiment of the present application;
图15示出了本申请中操作菜单的示意图;Figure 15 shows a schematic diagram of the operation menu in this application;
图16示出了本申请又一实施例提出的一种内容识别方法的流程图;FIG. 16 shows a flow chart of a content identification method proposed by another embodiment of the present application;
图17示出了本申请中物体被完整的显示的示意图;Fig. 17 shows a schematic diagram of a complete display of objects in this application;
图18示出了本申请中增加焦距后物体的尺寸对比的示意图;Figure 18 shows a schematic diagram of the size comparison of objects after increasing the focal length in the present application;
图19示出了本申请又一实施例提出的一种内容识别方法的流程图;FIG. 19 shows a flow chart of a content identification method proposed by another embodiment of the present application;
图20示出了本申请提出的一种内容识别装置的结构框图;Fig. 20 shows a structural block diagram of a content identification device proposed in this application;
图21示出了本申请提出的另一种内容识别装置的结构框图;Fig. 21 shows a structural block diagram of another content identification device proposed by the present application;
图22示出了本申请的用于执行根据本申请实施例的内容识别方法的电子设备的结构框图;FIG. 22 shows a structural block diagram of an electronic device for performing a content identification method according to an embodiment of the present application;
图23是本申请实施例的用于保存或者携带实现根据本申请实施例的内容识别方法的程序代码的存储单元。Fig. 23 is a storage unit for saving or carrying program codes for realizing the content identification method according to the embodiment of the present application according to the embodiment of the present application.
具体实施方式detailed description
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The following will clearly and completely describe the technical solutions in the embodiments of the application with reference to the drawings in the embodiments of the application. Apparently, the described embodiments are only some of the embodiments of the application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of this application.
随着基于图像的内容识别技术在电子设备中的普及,在更多的场景下,电子设备都可以用于进行内容识别。例如,在一些场景中,WiFi密码会被打印在一些纸张上,或者是会被张贴在墙上,那么在这种情况下,用户会先操作电子设备通过拍照的方式将WiFi密码拍摄下来,得到包括有WiFi密码的图像,然后再操作其他的应用程序对包括有WiFi密码的图像进行文本识别,从而提取到WiFi密码。但是,发明人对相关的识别操作流程进行研究后发现,在相关的识别过程中,需要通过电子设备先拍照得到待进行识别的图像,然后再操作电子设备来对该拍照得到的待进行识别的图像进行内容识别,进而就造成了识别过程比较繁琐,用户体验不佳。With the popularity of image-based content recognition technology in electronic devices, electronic devices can be used for content recognition in more scenarios. For example, in some scenarios, the WiFi password will be printed on some paper, or will be posted on the wall, then in this case, the user will first operate the electronic device to take pictures of the WiFi password, and get The image including the WiFi password, and then operate other applications to perform text recognition on the image including the WiFi password, thereby extracting the WiFi password. However, after researching the related identification operation process, the inventor found that in the related identification process, it is necessary to first take pictures of the electronic device to obtain the image to be identified, and then operate the electronic device to obtain the image to be identified by the photo. Image content recognition, which in turn results in a cumbersome recognition process and poor user experience.
因此,发明人提出了本申请中的一种内容识别方法、装置、电子设备以及存储介质,在该方法中,会对采集的图像进行实时显示,并且在所显示的图像包括有指定内容的情况下,可以在指定内容处显示提示标识,然后响应作用于所述提示标识的触控操作,对所述指定内容进行识别,并输出识别结果。从而通过上述方式使得在电子设备在实时的对所采集的图像进行显示的状态下,在电子设备自动通过提示标识的方式,对实时显示的图像中所出现的指定内容进行标识后,通过作用于提示标识的触控操作,就可以直接触发电子设备对指定内容进行识别,从而简化了触发对图像进行识别的操作过程,提升了用户体验。Therefore, the inventor proposes a content recognition method, device, electronic device, and storage medium in the present application. In this method, the collected images are displayed in real time, and when the displayed images include specified content In this case, a prompt mark may be displayed at the specified content, and then the specified content may be identified in response to a touch operation acting on the prompt mark, and a recognition result may be output. Therefore, through the above method, when the electronic device is displaying the collected image in real time, after the electronic device automatically identifies the specified content appearing in the image displayed in real time by means of prompting and marking, it acts on the The touch operation of the prompt mark can directly trigger the electronic device to recognize the specified content, thereby simplifying the operation process of triggering the recognition of the image and improving the user experience.
在一种实施方式中,本实施例提供的方法还可以包括如下流程:获取背景图像,所述背景图像包括作用于所述提示标识的触控操作作用时,所述电子设备所显示的图像;以所述背景图像为背景对所述识别结果进行显示。In one implementation manner, the method provided in this embodiment may further include the following process: acquiring a background image, where the background image includes an image displayed by the electronic device when the touch operation acts on the prompt sign; The recognition result is displayed with the background image as the background.
在一种实施方式中,本实施例提供的方法还可以包括如下流程:获取待处理图像,所述待处理图像为作用于所述提示标识的触控操作作用时,所述电子设备所显示的图像;对所述待处理图像进行虚化处理,并将所述虚化处理后的图像作为背景图像;以所述背景图像为背景对所述识别结果进行显示。In an implementation manner, the method provided in this embodiment may further include the following process: acquiring an image to be processed, the image to be processed is an image displayed by the electronic device when the touch operation acts on the prompt mark image; perform blurring processing on the image to be processed, and use the blurred image as a background image; display the recognition result with the background image as the background.
在一种实施方式中,本实施例提供的方法还可以包括如下流程:显示第一触发控件;响应作用于所述第一触发控件的触控操作,基于全屏模式对所述识别结果进行显示。In an implementation manner, the method provided in this embodiment may further include the following process: displaying a first trigger control; and displaying the recognition result based on a full-screen mode in response to a touch operation acting on the first trigger control.
在一种实施方式中,本实施例提供的方法还可以包括如下流程:显示识别结果;显示第二触发控件;响应作用于所述第二触发控件的触控操作,显示锁定界面,所述锁定界面中包括作用于所述提示标识的触控操作作用时,所述电子设备所显示的图像,以及所述所显示的图像中的指定内容对应的提示标识。In one implementation, the method provided in this embodiment may further include the following process: displaying the recognition result; displaying the second trigger control; displaying a locking interface in response to a touch operation acting on the second trigger control, and the locking The interface includes an image displayed by the electronic device when a touch operation acts on the prompt mark, and a prompt mark corresponding to the specified content in the displayed image.
在一种实施方式中,本实施例提供的方法还可以包括如下流程:响应于第一操作,恢复对采集的图像进行实时显示。In an implementation manner, the method provided in this embodiment may further include the following process: in response to the first operation, resume real-time display of the collected images.
在一种实施方式中,本实施例提供的方法还可以包括如下流程:显示识别结果;显示第三触发控件;响应作用于所述第三触发控件的触控操作,显示操作菜单,所述操作菜单中包括有至少一个操作控件,每个操作控件对应的操作不同;响应作用于所述操作控件的触控操作,将有触控操作的操作控件对应的操作作为目标操作;对所述识别结果执行所述目标操作。In an implementation manner, the method provided in this embodiment may further include the following procedures: displaying the recognition result; displaying a third trigger control; displaying an operation menu in response to a touch operation acting on the third trigger control, and the operation The menu includes at least one operation control, and each operation control corresponds to a different operation; in response to the touch operation acting on the operation control, the operation corresponding to the operation control with the touch operation is used as the target operation; the recognition result Execute the target action.
在一种实施方式中,本实施例提供的方法还可以包括如下流程:响应于变焦请求,对所采集的图像进行变焦处理。In an implementation manner, the method provided in this embodiment may further include the following process: performing zoom processing on the captured image in response to the zoom request.
在一种实施方式中,本实施例提供的方法还可以包括如下流程:若有作用于实时显示的图像中的区域选择操作,检测所选择区域中的物体是否被完整显示;若被完整显示,则生成增大尺寸的变焦请求,所述增大尺寸的变焦请求用于使得所选择区域中的物体以第一目标尺寸被完整显示;响应于所述增大尺寸的变焦请求,对所采集的图像进行变焦处理。In one embodiment, the method provided in this embodiment may further include the following process: if there is an area selection operation acting on the image displayed in real time, detect whether the object in the selected area is completely displayed; if it is completely displayed, Then generate a size-increasing zoom request, and the size-increasing zoom request is used to make the objects in the selected area be completely displayed with the first target size; in response to the size-increasing zoom request, the collected The image is zoomed.
在一种实施方式中,本实施例提供的方法还可以包括如下流程:若没有被完整显示,则生成减小尺寸的变焦请求,所述减小尺寸的变焦请求用于使得所选择区域中的物体以第二目标尺寸进行显 示;响应于所述减小尺寸的变焦请求,对所采集的图像进行变焦处理。In an implementation manner, the method provided in this embodiment may further include the following process: if it is not fully displayed, generate a reduced-size zoom request, and the reduced-size zoom request is used to make the selected area The object is displayed with the second target size; in response to the zoom request for reducing the size, the captured image is zoomed.
在一些实施方式中,所述指定内容包括:文本内容或者目标物体。In some implementations, the specified content includes: text content or a target object.
在一种实施方式中,本实施例提供的方法还可以包括如下流程:若所述指定内容为文本内容,将所述识别结果的语义所表达的场景对应的场景图像作为背景图像。In an implementation manner, the method provided in this embodiment may further include the following process: if the specified content is text content, use the scene image corresponding to the scene expressed by the semantics of the recognition result as the background image.
在一种实施方式中,本实施例提供的方法还可以包括如下流程:若所述指定内容为文本内容,将所述识别结果中的关键词对应的图像作为背景图像。In an implementation manner, the method provided in this embodiment may further include the following process: if the specified content is text content, use an image corresponding to a keyword in the recognition result as a background image.
在一种实施方式中,本实施例提供的方法还可以包括如下流程:将所述识别结果与所述背景图像融合为一张图像,得到融合后的图像;对所述融合后的图像进行显示。In one embodiment, the method provided in this embodiment may further include the following process: merging the recognition result and the background image into one image to obtain a fused image; displaying the fused image .
在一种实施方式中,本实施例提供的方法还可以包括如下流程:对所述背景图像进行显示,并将所述识别结果悬浮于所显示的所述背景图像上。In an implementation manner, the method provided in this embodiment may further include the following process: displaying the background image, and suspending the recognition result on the displayed background image.
在一种实施方式中,本实施例提供的方法还可以包括如下流程:若所述指定内容为文本内容,对所述识别结果进行尺寸放大后再进行显示。In an implementation manner, the method provided in this embodiment may further include the following process: if the specified content is text content, the recognition result is enlarged in size and then displayed.
在一种实施方式中,本实施例提供的方法还可以包括如下流程:响应于用户的操作启动相机程序;在启动相机程序后,在所显示的相机程序的界面中对采集的图像进行实时显示。In one embodiment, the method provided in this embodiment may further include the following process: start the camera program in response to the user's operation; after starting the camera program, display the captured image in real time in the interface of the displayed camera program .
下面将先对本申请所涉及的应用环境进行介绍。The application environment involved in this application will first be introduced below.
作为一种方式,本申请实施例提供的内容识别方法可以由一个电子设备单独进行执行。在这种方式中,电子设备通过自身配置的图像采集器件进行图像采集,然后对图像采集器件所采集的图像进行实时显示,并对实时显示的图像执行本申请实施例提供的内容识别方法。作为另外一种方式,本申请实施例提供的内容识别方法可以由至少两个电子设备协同进行执行。如图1所示,在图1所示的场景中,包括有电子设备100和电子设备200,其中,电子设备200可以通过其配置的图像采集器件进行图像采集,然后将采集的图像传输给自身的网络模块,以利用网络模块将采集的图像传输给电子设备100的网络模块,电子设备100的处理器则可以获取到自身网络模块所接收到的图像,然后对获取的图像执行申请实施例提供的内容识别方法。其中,电子设备(例如,电子设备100和电子设备200)可为手机、平板电脑等设备。在由两个电子设备协同执行本申请实施例提供的内容识别方法的情况下,两个电子设备可以为相同类型的设备,也可以为不同类型的设备。例如,除了可以为图1中所示的两个电子设备均为智能手机外,还可以包括一个电子设备为智能手机,而另一个电子设备为智能手表等方式。As a manner, the content identification method provided in the embodiment of the present application may be independently executed by an electronic device. In this manner, the electronic device collects images through its own image acquisition device, then displays the image collected by the image acquisition device in real time, and executes the content recognition method provided by the embodiment of the present application on the image displayed in real time. As another manner, the content identification method provided in the embodiment of the present application may be executed cooperatively by at least two electronic devices. As shown in FIG. 1 , in the scene shown in FIG. 1 , there are electronic equipment 100 and electronic equipment 200, wherein the electronic equipment 200 can perform image acquisition through its configured image acquisition device, and then transmit the acquired image to itself The network module of the electronic device 100 can use the network module to transmit the captured image to the network module of the electronic device 100, and the processor of the electronic device 100 can obtain the image received by its own network module, and then execute the application embodiment provided by the obtained image content identification method. Wherein, the electronic devices (for example, the electronic device 100 and the electronic device 200 ) may be mobile phones, tablet computers and other devices. In the case where two electronic devices cooperate to execute the content identification method provided by the embodiment of the present application, the two electronic devices may be of the same type, or may be of different types. For example, in addition to the fact that the two electronic devices shown in FIG. 1 are both smart phones, it may also include that one electronic device is a smart phone and the other electronic device is a smart watch.
下面将结合附图具体描述本申请的各实施例。Various embodiments of the present application will be described in detail below with reference to the accompanying drawings.
请参阅图2,本申请提供的一种内容识别方法,应用于电子设备,所述方法包括:Please refer to Figure 2, a content identification method provided by this application is applied to electronic equipment, and the method includes:
S110:对采集的图像进行实时显示。S110: Display the collected images in real time.
其中,所显示的图像是由图像采集器件(例如,摄像头)进行采集的,那么对图像进行实时显示可以理解为对图像采集器件所采集的图像进行实时显示,以便于用户可以对当前电子设备的图像采集器件所采集的图像进行预览。其中,在对采集的图像进行实时显示的过程中,若图像采集器件进行图像采集的区域发生变化,那么所显示的内容也会同步的发生变换。Wherein, the displayed image is collected by an image acquisition device (for example, a camera), then displaying the image in real time can be understood as displaying the image collected by the image acquisition device in real time, so that the user can view the current electronic equipment. The images collected by the image acquisition device are previewed. Wherein, in the process of real-time displaying the collected images, if the area where the image collection device collects the images changes, the displayed content will also change synchronously.
作为一种方式,电子设备可以响应于用户的操作而启动相机程序,在启动相机程序后,在所显示的相机程序的界面中对采集的图像进行实时显示。As a manner, the electronic device can start the camera program in response to the user's operation, and after the camera program is started, the captured image can be displayed in real time on the displayed interface of the camera program.
S120:若所显示的图像包括有指定内容,在所述指定内容处显示提示标识。S120: If the displayed image includes specified content, display a prompt logo at the specified content.
在本申请实施例中,该指定内容可以为文本内容,也可以为目标物体。其中,目标物体可以包括人脸等。在检测到所显示的图像中包括有指定内容后,作为一种提示用户的方式,可以在指定内容处显示提示标识。可选的,该提示标识可以为包围指定内容的框。示例性的,如图3所示,在图3所示的场景中,用户使用电子设备拍摄远处投影幕10中的内容,对应的,电子设备中会在界面11中对所采集的图像进行实时显示,进而在界面11中会同步的显示投影幕10以及投影幕10中的内容。若投影幕10中包括有文本内容,那么则电子设备会在包括有文本内容的区域处显示围绕该文本内容的框12作为提示标识。其中,需要说明的是,提示标识的具体样式在本申请实施例中不做具体的限定,除了可以为图3中所示的框之外,还可以为其他样式。例如,如图4所示,提示标识还可以为带有颜色的透明图层13,该透明图层13可以覆盖在指定内容上,从而使得用户可以透过该透明图层13看到指定内容。其中,透明图层的颜色可以为黄色或者绿色等。In the embodiment of the present application, the designated content may be text content, or may be a target object. Wherein, the target object may include a human face and the like. After it is detected that the displayed image includes the specified content, as a way of prompting the user, a prompt logo may be displayed at the specified content. Optionally, the prompt identifier can be a frame surrounding the specified content. Exemplarily, as shown in FIG. 3 , in the scene shown in FIG. 3 , the user uses an electronic device to shoot content in the remote projection screen 10 , and correspondingly, the electronic device will process the captured image in the interface 11 Real-time display, and then the projection screen 10 and the content in the projection screen 10 will be displayed synchronously on the interface 11 . If the projection screen 10 includes text content, the electronic device will display a frame 12 surrounding the text content as a prompt mark at the area where the text content is included. Wherein, it should be noted that the specific style of the prompt mark is not specifically limited in this embodiment of the application, and it may be other styles besides the box shown in FIG. 3 . For example, as shown in FIG. 4 , the prompt mark can also be a transparent layer 13 with color, and the transparent layer 13 can cover the specified content, so that the user can see the specified content through the transparent layer 13 . Wherein, the color of the transparent layer may be yellow or green.
作为一种方式,不同类型的指定内容所对应的提示标识可以不同,从而使得用户可以通过提示标识就够较快的区分出指定内容。示例性的,如图5所示,在电子设备所采集的图像中,包括有人物和人物身旁的公交站牌,那么对于人物的人脸则会被确定为一种指定内容,而公交站牌处因为包括有文本内容,则也可以被识别为一种指定内容。对于人脸类型的指定内容,则可以将框14作为提示标识,对于文本类型的指定内容,则可以将透明图层15作为提示标识。再者,对于人脸类型的指定内容,也可以以透明图层作为对应的提示标识,对于文本内容也可以以框作为提示标识。并且,对于不同类型的指定内容,也可以以 对应有相同的提示标识。As a way, different types of specified content may have different prompting signs corresponding to them, so that the user can quickly distinguish the specified content through the prompting mark. Exemplarily, as shown in Figure 5, in the image collected by the electronic device, there are characters and bus stop signs beside the characters, then the face of the characters will be determined as a specified content, and the bus stop Since the card contains text content, it can also be identified as a specified content. For the specified content of the face type, the frame 14 can be used as a prompt mark, and for the specified content of the text type, the transparent layer 15 can be used as a prompt mark. Furthermore, for the specified content of the face type, a transparent layer may also be used as a corresponding prompting mark, and a frame may also be used as a prompting mark for text content. Moreover, for different types of specified content, there may also be corresponding to the same prompt mark.
S130:响应作用于所述提示标识的触控操作,对所述指定内容进行识别。S130: Identify the specified content in response to a touch operation acting on the prompt mark.
在本申请实施例中,触控操作可以有多种实施方式。作为一种方式,触控操作可以为点击操作。再者,作为另外一种方式,触控操作可以为双击操作。并且,该触发对指定内容进行识别的触控操作也可以由用户根据自己的需要进行配置,从而满足用户的个性化需求。其中,对于不同的指定内容所对应的内容识别方式可以不同。可选的,若指定内容为文本内容,那么对应的对文本内容进行识别则可以理解为从图像中提取出文本内容。可选的,电子设备可以基于光学字符识别(Optical Character Recognition,OCR)的方式从图像中提取出文本内容。其中,光学字符识别是指对文本资料的图像文件进行分析识别处理,获取文字及版面信息的过程。亦即将图像中的文字进行识别,并以文本的形式返回。需要说明的是,以指定内容为文本内容为例,在前述S120中,识别到实时显示的图像中有指定内容可以理解为识别到图像中的某个区域为包括有文本内容的区域,但是电子设备还并不知道该区域的文本内容具体的内容是什么。那么通过对指定内容进行识别,则可以确定文本内容的具体内容。In the embodiment of the present application, there may be various implementation manners for the touch operation. As a manner, the touch operation may be a click operation. Furthermore, as another manner, the touch operation may be a double-tap operation. Moreover, the touch operation that triggers the recognition of the specified content can also be configured by the user according to his own needs, so as to meet the personalized needs of the user. Wherein, the content identification methods corresponding to different specified content may be different. Optionally, if the specified content is text content, the corresponding recognition of the text content can be understood as extracting the text content from the image. Optionally, the electronic device may extract text content from the image based on optical character recognition (Optical Character Recognition, OCR). Among them, optical character recognition refers to the process of analyzing and recognizing image files of text materials to obtain text and layout information. That is to recognize the text in the image and return it in the form of text. It should be noted that, taking the specified content as an example of text content, in the aforementioned S120, recognizing that there is specified content in the image displayed in real time can be understood as recognizing a certain area in the image as an area containing text content, but the electronic The device does not yet know what the specific content of the text content in this area is. Then, by identifying the specified content, the specific content of the text content can be determined.
类似的,若指定内容为人脸,那么电子设备在检测到实时显示的图中有人脸的情况下,电子设备也只是检测到图像中有人脸,但是并不知道该人脸表征的是谁。那么通过对人脸进行识别,则可以进一步的获取到该人脸所属人物的身份信息。该身份信息可以包括年龄或者性别等。Similarly, if the specified content is a human face, when the electronic device detects a human face in the picture displayed in real time, the electronic device only detects a human face in the image, but does not know who the human face represents. Then, by recognizing the face, the identity information of the person to whom the face belongs can be further obtained. The identity information may include age or gender.
S140:输出识别结果。S140: output the recognition result.
在本申请实施例中,输出识别结果的方式可以有多种。作为一种方式,输出识别结果可以包括通过对识别结果进行显示的方式进行输出,那么在这种方式下,电子设备可以在某个界面中对识别结果进行显示。示例性的,如图6所示,在图6的左侧图像中显示有透明图层15和框14这两种类型的提示标识,若有作用于透明图层15的触控操作,则电子设备可以响应于作用于透明图层15的触控操作对透明图层15处的文本内容进行识别,进而如图6右侧图像所示的在界面16中对识别结果进行显示。可选的,在指定内容为文本内容的情况下,在显示识别结果时可以将识别结果进行放大后再进行显示,从而有利于用户能够更清楚的看到识别结果。再者,若有作用于框14的触控操作,则电子设备可以响应于作用于框14的触控操作对框14处的人脸进行识别,并如图7的右侧图像所示的直接在原界面中显示识别结果,该识别结果包括识别出的性别以及年龄。可选的,在指定内容为文本内容的情况下,可以对识别结果进行尺寸放大后再进行显示。其中,尺寸放大的倍数可以为适配电子设备的屏幕的倍数,从而使得用户能够有更好的阅读体验。In the embodiment of the present application, there may be multiple ways of outputting the recognition result. As a manner, outputting the recognition result may include displaying the recognition result. In this manner, the electronic device may display the recognition result on a certain interface. Exemplarily, as shown in FIG. 6 , there are two types of prompt signs, the transparent layer 15 and the frame 14, displayed in the left image of FIG. The device can recognize the text content on the transparent layer 15 in response to the touch operation acting on the transparent layer 15 , and then display the recognition result in the interface 16 as shown in the right image of FIG. 6 . Optionally, when the specified content is text content, the recognition result may be displayed after being enlarged when displaying the recognition result, so that the user can see the recognition result more clearly. Furthermore, if there is a touch operation acting on the frame 14, the electronic device can recognize the face at the frame 14 in response to the touch operation acting on the frame 14, and directly The recognition result is displayed in the original interface, and the recognition result includes the recognized gender and age. Optionally, in the case that the specified content is text content, the recognition result may be displayed after being enlarged in size. Wherein, the multiple of the size magnification may be a multiple of a screen adapted to the electronic device, so that the user can have a better reading experience.
作为另外一种方式,电子设备可以通过将识别结果发送给外部设备的方式进行输出。在这种方式下,电子设备可以直接将识别结果中的内容发送给外部设备。其中,外部设备可以包括服务器,也可以为其他用户的电子设备。在外部设备为服务器的这种情况下,电子设备可以将识别结果直接上传给服务器,并且指示服务器将所上传的内容与电子设备对应的帐号进行对应存储,从而使得电子设备可以通过对应的帐号从服务器中获取到所上传的内容。在外部设备为其他用户的电子设备的这种情况下,若在外部设备与电子设备之间有通过近距离无线通信(WiFi或者蓝牙)的方式建立通信连接,则电子设备可以通过该近距离无线通信的方式建立通信连接将识别结果发送给外部设备。若外部设备的帐号与电子设备的帐号之间具有关联关系,那么电子设备可以通过帐号之间的关联关系来将识别结果发送给该外部设备。As another manner, the electronic device may output the recognition result by sending it to an external device. In this way, the electronic device can directly send the content in the recognition result to the external device. Wherein, the external device may include a server, or may be an electronic device of another user. In the case where the external device is a server, the electronic device can directly upload the recognition result to the server, and instruct the server to store the uploaded content and the account corresponding to the electronic device, so that the electronic device can use the corresponding account from The uploaded content is retrieved from the server. In the case where the external device is an electronic device of another user, if a communication connection is established between the external device and the electronic device through short-range wireless communication (WiFi or Bluetooth), the electronic device can The communication method establishes a communication connection and sends the recognition result to the external device. If there is an association between the account of the external device and the account of the electronic device, the electronic device may send the identification result to the external device through the association between the accounts.
需要说明的是,在输出方式有多种的情况下,电子设备中有多种方式来确定具体采用哪种方式来对指定内容进行识别。It should be noted that, when there are multiple output methods, there are multiple methods in the electronic device to determine which method is specifically adopted to identify the specified content.
作为一种方式,电子设备可以通过触发对指定内容进行识别的触控操作来确定具体采用哪种方式进行输出。可选的,在这种方式中,可以配置点击操作对应于将识别结果通过显示的方式进行输出,配置双击操作对应于将识别结果通过发送给外部设备的方式进行输出。示例性的,请再参阅图6,若有作用于透明图层15的点击操作,电子设备则会以图6的右侧图像中所示的方式来对识别结果进行输出,若有作用于透明图层15的双击操作,那么电子设备则不会对识别结果进行显示,而是直接发送给外部设备。对应的,也可以配置点击操作对应于将识别结果通过发送给外部设备的方式进行输出,配置双击操作对应于将识别结果通过显示的方式进行输出。As a manner, the electronic device may determine which manner to use for output by triggering a touch operation for identifying specified content. Optionally, in this manner, the click operation may be configured to correspond to outputting the recognition result by displaying, and the double-click operation may be configured to correspond to outputting the recognition result by sending it to an external device. Exemplarily, please refer to FIG. 6 again. If there is a click operation acting on the transparent layer 15, the electronic device will output the recognition result in the manner shown in the right image of FIG. Double-click operation on layer 15, the electronic device will not display the recognition result, but directly send it to the external device. Correspondingly, the click operation may also be configured to output the recognition result by sending it to an external device, and the double-click operation may be configured to output the recognition result by displaying it.
作为另外一种方式,在电子设备中的设置界面中可以配置默认的输出方式。那么电子设备在需要对识别结果进行输出时,则会读取电子设备中的默认的输出方式,进而根据默认的输出方式来对识别结果进行输出。例如,若默认的输出方式为通过显示的方式进行输出,那么则电子设备仅会对识别结果进行显示。As another manner, a default output manner may be configured in a setting interface of the electronic device. Then, when the electronic device needs to output the recognition result, it will read the default output mode in the electronic device, and then output the recognition result according to the default output mode. For example, if the default output method is output by display, the electronic device will only display the recognition result.
需要说明的是,若是通过用于对采集的图像进行实时显示的界面以外的界面,来对识别结果进行显示的情况下,该显示识别结果的界面中还可以配置有触发跳转到用于对采集的图像进行实时显示的界面的控件,该控件的配置方式以及触发方式在本申请实施例中不做具体的限定。It should be noted that if the recognition result is displayed through an interface other than the interface for real-time display of the collected images, the interface for displaying the recognition result can also be configured with a trigger to jump to the The controls of the interface for displaying the collected images in real time, the configuration mode and the trigger mode of the controls are not specifically limited in this embodiment of the present application.
本实施例提供的一种内容识别方法,在该方法中,会对采集的图像进行实时显示,并且在所显示的图像包括有指定内容的情况下,可以在指定内容处显示提示标识,然后响应作用于所述提示标识的触控操作,对所述指定内容进行识别,并输出识别结果。从而通过上述方式使得在电子设备在实时的对所采集的图像 进行显示的状态下,在电子设备自动通过提示标识的方式,对实时显示的图像中所出现的指定内容进行标识后,通过作用于提示标识的触控操作,就可以直接触发电子设备对指定内容进行识别,从而简化了触发对图像进行识别的操作过程,提升了用户体验。This embodiment provides a content recognition method. In this method, the captured image will be displayed in real time, and if the displayed image contains specified content, a prompt mark can be displayed at the specified content, and then respond A touch operation acting on the prompt mark identifies the specified content and outputs a recognition result. Therefore, through the above method, when the electronic device is displaying the collected image in real time, after the electronic device automatically identifies the specified content appearing in the image displayed in real time by means of prompting and marking, it acts on the The touch operation of the prompt mark can directly trigger the electronic device to recognize the specified content, thereby simplifying the operation process of triggering the recognition of the image and improving the user experience.
请参阅图8,本申请提供的一种内容识别方法,应用于电子设备,所述方法包括:Please refer to Figure 8, a content identification method provided by the present application is applied to electronic equipment, and the method includes:
S210:对采集的图像进行实时显示。S210: Display the collected images in real time.
S220:若所显示的图像包括有指定内容,在所述指定内容处显示提示标识。S220: If the displayed image includes specified content, display a prompt logo at the specified content.
S230:响应作用于所述提示标识的触控操作,对所述指定内容进行识别。S230: Identify the specified content in response to a touch operation acting on the prompt mark.
S240:获取背景图像,所述背景图像包括作用于所述提示标识的触控操作作用时,所述电子设备所显示的图像。S240: Acquire a background image, where the background image includes an image displayed by the electronic device when a touch operation is performed on the prompt sign.
在S240中,当检测到有作用于提示标识的触控操作时,则可以将电子设备当前所显示的图像作为后续进行识别结果显示时对应的背景图像。In S240, when it is detected that there is a touch operation for prompting the logo, the image currently displayed by the electronic device may be used as a corresponding background image for subsequent recognition result display.
再者,需要说明的是,在本申请实施例中,除了S240中所示的确定背景图像的方式外,还可以有其他的确定背景图像的方式。可选的,还可以根据识别结果的内容来确定对应的背景图像。Furthermore, it should be noted that, in the embodiment of the present application, in addition to the manner of determining the background image shown in S240, there may be other manners of determining the background image. Optionally, the corresponding background image may also be determined according to the content of the recognition result.
作为一种方式,若指定内容为文本内容,那么可以将识别结果的语义所表达的场景对应的场景图像作为背景图像。例如,若识别结果的语义表达的是教学类的内容,那么所表达的场景则可以为教学场景,进而可以确定教学场景对应的场景图像作为背景图像。可选的,教学场景对应的场景图像可以为一张内容为教室的图像,或者内容为黑板的图像。再例如,若识别结果的语义表达的是如图5中所示的公交车,那么所表达的场景则可以为交通场景,进而可以确定交通场景对应的场景图像作为背景图像。可选的,交通场景对应的场景图像可以为一个交通工具。其中,电子设备可以通过预先训练得到的神经网络模型来获取识别结果所表达的场景。As a manner, if the specified content is text content, then the scene image corresponding to the scene expressed by the semantics of the recognition result may be used as the background image. For example, if the semantics of the recognition result express teaching content, then the expressed scene may be a teaching scene, and then the scene image corresponding to the teaching scene may be determined as the background image. Optionally, the scene image corresponding to the teaching scene may be an image whose content is a classroom, or an image whose content is a blackboard. For another example, if the semantic expression of the recognition result is a bus as shown in FIG. 5 , then the expressed scene can be a traffic scene, and then the scene image corresponding to the traffic scene can be determined as the background image. Optionally, the scene image corresponding to the traffic scene may be a vehicle. Wherein, the electronic device can obtain the scene expressed by the recognition result through the neural network model obtained through pre-training.
作为另外一种方式,若指定内容为文本内容,可以将识别结果中的关键词对应的图像作为背景图像。示例性的,若识别结果中包括有关键词“手机”,那么则可以将手机对应的图像作为背景图像。再例如,若识别结果中包括有“公交车”,那么则可以将公交车对应的图像作为背景图像。其中,可以在电子设备中存储有关键词与图像的对应关系,从而电子设备在得到识别结果中的关键词后,可以通过该对应关系来获取得到关键词所对应的图像。As another way, if the specified content is text content, the image corresponding to the keyword in the recognition result may be used as the background image. Exemplarily, if the recognition result includes the keyword "mobile phone", then the image corresponding to the mobile phone can be used as the background image. For another example, if the recognition result includes "bus", then the image corresponding to the bus may be used as the background image. Wherein, the correspondence between keywords and images may be stored in the electronic device, so that after the electronic device obtains the keywords in the recognition result, the image corresponding to the keywords may be obtained through the correspondence.
S250:以所述背景图像为背景对识别结果进行显示。S250: Display the recognition result with the background image as the background.
其中,在对识别结果进行显示时,可以基于背景图像来对识别结果进行显示。其中,作为一种方式,基于背景图像对识别结果进行显示可以包括:将识别结果与背景图像融合为一张图像,然后对融合后的图像进行显示。在这种方式中,在得到识别结果后,可以先获取到背景图像,然后将识别结果融入到背景图像中,得到融入识别结果的背景图像。作为另外一种方式,基于背景图像对识别结果进行显示可以包括:对背景图像进行显示,并将识别结果悬浮于所显示的背景图像上。Wherein, when displaying the recognition result, the recognition result may be displayed based on the background image. Wherein, as a manner, displaying the recognition result based on the background image may include: fusing the recognition result and the background image into one image, and then displaying the fused image. In this way, after the recognition result is obtained, the background image can be obtained first, and then the recognition result is integrated into the background image to obtain the background image integrated with the recognition result. As another manner, displaying the recognition result based on the background image may include: displaying the background image, and suspending the recognition result on the displayed background image.
作为一种方式,所述输出识别结果还包括:显示第一触发控件;所述以所述背景图像为背景对所述识别结果进行显示之后还包括:响应作用于所述第一触发控件的触控操作,基于全屏模式对所述识别结果进行显示。其中,全屏模式可以理解为基于全屏显示的方式对识别结果进行显示。可选的,在一种全屏模式中,电子设备可以只显示背景图像和识别结果。在另外一种全屏显示模式中,电子设备除了显示背景图像和识别结果外,还将可以显示设置在电子设备顶部的状态栏。该状态栏中可以包括有电量状态以及无线信号状态中的至少一个。As a manner, the outputting the recognition result further includes: displaying a first trigger control; after displaying the recognition result with the background image as the background, it further includes: responding to a touch acting on the first trigger control control operation, and display the recognition result based on the full screen mode. Wherein, the full-screen mode can be understood as displaying the recognition result based on a full-screen display. Optionally, in a full-screen mode, the electronic device may only display the background image and the recognition result. In another full-screen display mode, in addition to displaying background images and recognition results, the electronic device can also display a status bar set on the top of the electronic device. The status column may include at least one of a battery status and a wireless signal status.
示例性的,如图9所示,在S250中可以以图9中的左侧图像所示的样式对识别结果进行显示,在图9的左侧图像所示的样式中,除了用于显示识别结果的界面16外,还会显示状态栏17以及操作区域18。在这种方式下,可以在状态栏17中显示名称为“阅读模式”的第一触发控件。若检测到有作用于该名称为“阅读模式”的第一触发控件后,则电子设备切换为图9中右侧图像所示的样式对识别结果进行显示。在图9的右侧图像中,则会取消显示状态栏17和操作区域18,并且使用全屏显示的界面19来对识别结果进行显示,从而实现对识别结果的全屏显示。其中,在界面19中可以显示有背景图像(图中未示出)。如图10所示,在另一种全屏显示模式中,用于展示识别结果的界面19则不会如图9中所示的覆盖显示屏的所有区域,进而除了可以显示用于展示识别结果的界面19外,还可以显示有状态栏191。在该状态栏191中可以显示有电量状态和无线信号状态。该无线信号状态可以包括有WiFi状态和移动通信信号的状态。Exemplarily, as shown in FIG. 9, in S250, the recognition result may be displayed in the style shown in the left image in FIG. 9. In the style shown in the left image of FIG. In addition to the result interface 16, a status bar 17 and an operation area 18 are also displayed. In this way, a first trigger control named “Reading Mode” can be displayed in the status bar 17 . If it is detected that the first trigger control named "reading mode" is active, the electronic device switches to the style shown in the right image in FIG. 9 to display the recognition result. In the right image of FIG. 9 , the status bar 17 and the operation area 18 are canceled, and the full-screen display interface 19 is used to display the recognition result, thereby realizing full-screen display of the recognition result. Wherein, a background image (not shown in the figure) may be displayed on the interface 19 . As shown in FIG. 10, in another full-screen display mode, the interface 19 for displaying the recognition results will not cover all areas of the display screen as shown in FIG. In addition to the interface 19, a status bar 191 may also be displayed. In the status bar 191 , the battery status and the wireless signal status can be displayed. The wireless signal state may include a WiFi state and a mobile communication signal state.
需要说明的是,本实施例中与其他实施例相同的步骤的具体说明,可以参见其他实施例中的相关内容,本实施例中不再赘述。It should be noted that for specific descriptions of steps in this embodiment that are the same as those in other embodiments, reference may be made to relevant content in other embodiments, and details are not repeated in this embodiment.
本实施例提供的一种内容识别方法,从而通过上述方式使得在电子设备在实时的对所采集的图像进行显示的状态下,在电子设备自动通过提示标识的方式,对实时显示的图像中所出现的指定内容进行标识后,通过作用于提示标识的触控操作,就可以直接触发电子设备对指定内容进行识别,从而简化了触发对图像进行识别的操作过程,提升了用户体验。并且,在本实施例中,在对识别结果进行显示时,会以背景图像 为背景对所述识别结果进行显示,从而有利于减少用户所感知到的界面变化程度。This embodiment provides a method for content recognition, so that in the state where the electronic device is displaying the collected images in real time, the electronic device automatically recognizes the images displayed in real time by means of prompting and marking After the specified content appears, the electronic device can be directly triggered to recognize the specified content through the touch operation acting on the prompt mark, thereby simplifying the operation process of triggering image recognition and improving user experience. Moreover, in this embodiment, when the recognition result is displayed, the recognition result will be displayed with the background image as the background, thereby helping to reduce the degree of interface change perceived by the user.
请参阅图11,本申请提供的一种内容识别方法,应用于电子设备,所述方法包括:Please refer to FIG. 11 , a content identification method provided by the present application is applied to electronic devices, and the method includes:
S260:对采集的图像进行实时显示。S260: Display the collected images in real time.
S261:若所显示的图像包括有指定内容,在所述指定内容处显示提示标识。S261: If the displayed image includes specified content, display a prompt logo at the specified content.
S262:响应作用于所述提示标识的触控操作,对所述指定内容进行识别。S262: Identify the specified content in response to a touch operation acting on the prompt mark.
S263:获取待处理图像,所述待处理图像为作用于所述提示标识的触控操作作用时,所述电子设备所显示的图像。S263: Acquire an image to be processed, where the image to be processed is an image displayed by the electronic device when a touch operation is performed on the prompt sign.
S264:对所述待处理图像进行虚化处理,并将所述虚化处理后的图像作为背景图像。S264: Perform blurring processing on the image to be processed, and use the blurred image as a background image.
S265:以所述背景图像为背景对所述识别结果进行显示。S265: Display the recognition result with the background image as the background.
本实施例提供的一种内容识别方法,从而通过上述方式使得在电子设备在实时的对所采集的图像进行显示的状态下,在电子设备自动通过提示标识的方式,对实时显示的图像中所出现的指定内容进行标识后,通过作用于提示标识的触控操作,就可以直接触发电子设备对指定内容进行识别,从而简化了触发对图像进行识别的操作过程,提升了用户体验。对待处理图像进行虚化处理,可以使得最终所得到的背景图像在视觉上会有模糊的感觉,进而可以便于用户可以更加容易的看清楚识别结果的内容,也使得用户可以更加关注识别结果本身。This embodiment provides a method for content recognition, so that in the state where the electronic device is displaying the collected images in real time, the electronic device automatically recognizes the images displayed in real time by means of prompting and marking After the specified content appears, the electronic device can be directly triggered to recognize the specified content through the touch operation acting on the prompt mark, thereby simplifying the operation process of triggering image recognition and improving user experience. Blurring the image to be processed can make the final background image visually blurred, which makes it easier for the user to see the content of the recognition result, and also allows the user to pay more attention to the recognition result itself.
请参阅图12,本申请提供的一种内容识别方法,应用于电子设备,所述方法包括:Please refer to FIG. 12, a content identification method provided by the present application is applied to electronic equipment, and the method includes:
S310:对采集的图像进行实时显示。S310: Display the collected images in real time.
S320:若所显示的图像包括有指定内容,在所述指定内容处显示提示标识。S320: If the displayed image includes specified content, display a prompt logo at the specified content.
S330:响应作用于所述提示标识的触控操作,对所述指定内容进行识别。S330: Identify the specified content in response to a touch operation acting on the prompt mark.
S340:显示识别结果。S340: Display the recognition result.
S350:显示第二触发控件。S350: Displaying the second trigger control.
需要说明的是,在本申请实施例中,显示第二触发控件和显示识别结果可以是同时执行的,从而使得用户在视觉上可以感知到第二触发控件和识别结果是一起显示在电子设备的屏幕上。It should be noted that, in the embodiment of the present application, displaying the second trigger control and displaying the recognition result can be performed simultaneously, so that the user can visually perceive that the second trigger control and the recognition result are displayed together on the electronic device. on the screen.
S360:响应作用于所述第二触发控件的触控操作,显示锁定界面,所述锁定界面中包括作用于所述提示标识的触控操作作用时,所述电子设备所显示的图像,以及所述所显示的图像中的指定内容对应的提示标识。S360: In response to the touch operation acting on the second trigger control, display a lock interface, where the lock interface includes an image displayed by the electronic device when the touch operation acts on the prompt sign, and the The prompt identifier corresponding to the specified content in the displayed image.
示例性的,如图13所示,若在如图13左侧图像所示的情况下,检测到有作用于透明图层15的触控操作后,电子设备可以对透明图层15对应的指定内容进行识别,并通过图13的中间图像所示的样式来显识别结果。并且,其中还会显示第二触发控件20。若检测到有作用于第二触发控件20的触控操作,电子设备则会显示图13的右侧图像中所示的锁定界面21。如图13的右侧图像所示,锁定界面21中的图像内容与作用于所述提示标识的触控操作作用时,电子设备所显示的图像内容(即图13的左侧图像中所示的内容)是相同的。而不同的是,锁定界面21中的图像内容为静态图像,而在图13的左侧图像中所示的内容为电子设备的图像采集器件所实时采集的图像。其中,锁定界面21中的静态图像可以理解为即使电子设备的图像采集器件所采集的图像发生变换,而锁定界面21中的图像内容依然保持不变。对应的,图13的左侧图像中所示情况中,电子设备所显示的图像内容是会实时的随着图像采集器件所采集的图像的变换而随之变化的。可选的,电子设备在显示锁定界面时,电子设备在图像采集期间可以停止进行图像采集,以便降低功耗。Exemplarily, as shown in FIG. 13 , if the touch operation on the transparent layer 15 is detected in the situation shown in the left image of FIG. 13 , the electronic device can specify the corresponding The content is recognized, and the recognition result is displayed through the style shown in the middle image of Figure 13. Moreover, the second trigger control 20 is also displayed therein. If a touch operation on the second trigger control 20 is detected, the electronic device will display the lock interface 21 shown in the right image of FIG. 13 . As shown in the right image of FIG. 13, when the image content in the locking interface 21 and the touch operation acting on the prompt mark are in effect, the image content displayed by the electronic device (that is, the image content shown in the left image of FIG. 13 content) are the same. The difference is that the image content in the locking interface 21 is a static image, while the content shown in the left image of FIG. 13 is an image captured by the image acquisition device of the electronic device in real time. Wherein, the static image in the locking interface 21 can be understood as that even if the image captured by the image acquisition device of the electronic device changes, the image content in the locking interface 21 remains unchanged. Correspondingly, in the situation shown in the left image of FIG. 13 , the content of the image displayed by the electronic device changes in real time as the image captured by the image capture device changes. Optionally, when the electronic device displays the locking interface, the electronic device may stop image collection during image collection, so as to reduce power consumption.
作为一种方式,所述响应作用于所述第二触发控件的触控操作,显示锁定界面之后还包括:响应于第一操作,恢复对采集的图像进行实时显示。其中,第一操作可以为作用于显示屏的双击操作。再者,作为另外一种方式,可以在显示锁定界面的情况下,电子设备可以显示一个触发恢复对采集的图像进行实时显示的恢复控件,从而使得电子设备检测到有作用于该恢复控件的触控操作时,恢复对采集的图像进行实时显示。其中,若在显示锁定界面的过程中,图像采集器件停止进行图像采集,那么恢复对采集的图像进行实时显示则可以包括:启动图像采集器件,并将启动后的图像采集器件所采集的图像进行实时显示。若在显示锁定界面的过程中,图像采集器件依然在进行图像采集,并将所采集的图像缓存到指定区域,但是并未从该指定区域读取所缓存的图像进行实时显示,那么则电子设备在恢复对采集的图像进行实时显示的过程中,则可知就执行恢复从该指定区域读取图像进行显示。As a manner, after displaying the lock interface in response to the touch operation acting on the second trigger control, the method further includes: in response to the first operation, resuming real-time display of the collected images. Wherein, the first operation may be a double-click operation acting on the display screen. Furthermore, as another way, when the lock interface is displayed, the electronic device can display a recovery control that triggers the recovery of the real-time display of the collected image, so that the electronic device detects that the touch that acts on the recovery control When the control operation is performed, the real-time display of the collected images is resumed. Wherein, if in the process of displaying the locking interface, the image acquisition device stops image acquisition, then resuming the real-time display of the collected images may include: starting the image acquisition device, and performing the image acquisition on the images collected by the image acquisition device after starting real-time display. If in the process of displaying the locked interface, the image acquisition device is still collecting images and buffering the acquired images to a designated area, but does not read the buffered images from the designated area for real-time display, then the electronic device In the process of resuming the real-time display of the collected images, it can be known that the resuming is executed to read the images from the designated area for display.
需要说明的是,本实施例中与其他实施例相同的步骤的具体说明,可以参见其他实施例中的相关内容,本实施例中不再赘述。It should be noted that for specific descriptions of steps in this embodiment that are the same as those in other embodiments, reference may be made to relevant content in other embodiments, and details are not repeated in this embodiment.
本实施例提供的一种内容识别方法,从而通过上述方式使得在电子设备在实时的对所采集的图像进行显示的状态下,在电子设备自动通过提示标识的方式,对实时显示的图像中所出现的指定内容进行标识后,通过作用于提示标识的触控操作,就可以直接触发电子设备对指定内容进行识别,从而简化了触发对图像进行识别的操作过程,提升了用户体验。并且,在本实施例中,在显示识别结果的同时,还可以显示第二 触发控件,以便用户可以通过触控第二触发控件来使得电子设备显示包括作用于提示标识的触控操作作用时,电子设备所显示的图像,以及所述所显示的图像中的指定内容对应的提示标识的锁定界面,从而使得可以在提示标识有多个的情况下,用户可以通过在锁定界面中再次触发其他的提示标识,以对另外的指定内容进行识别并显示识别结果。This embodiment provides a method for content recognition, so that in the state where the electronic device is displaying the collected images in real time, the electronic device automatically recognizes the images displayed in real time by means of prompting and marking After the specified content appears, the electronic device can be directly triggered to recognize the specified content through the touch operation acting on the prompt mark, thereby simplifying the operation process of triggering image recognition and improving user experience. Moreover, in this embodiment, while displaying the recognition result, the second trigger control can also be displayed, so that the user can touch the second trigger control to cause the electronic device to display the touch operation function including the prompt mark, The image displayed by the electronic device, and the lock interface of the prompt logo corresponding to the specified content in the displayed image, so that when there are multiple prompt logos, the user can trigger other prompts again in the lock interface Prompt for identification to identify additional specified content and display the identification result.
请参阅图14,本申请提供的一种内容识别方法,应用于电子设备,所述方法包括:Please refer to Figure 14, a content identification method provided by the present application is applied to electronic devices, and the method includes:
S410:对采集的图像进行实时显示。S410: Display the collected images in real time.
S420:若所显示的图像包括有指定内容,在所述指定内容处显示提示标识。S420: If the displayed image includes specified content, display a prompt logo at the specified content.
S430:响应作用于所述提示标识的触控操作,对所述指定内容进行识别。S430: Identify the specified content in response to a touch operation acting on the prompt mark.
S440:显示识别结果。S440: Display the recognition result.
S450:显示第三触发控件。S450: Displaying a third trigger control.
需要说明的是,在本申请实施例中,显示第三触发控件和显示识别结果可以是同时执行的,从而使得用户在视觉上可以感知到第三触发控件和识别结果是一起显示在电子设备的屏幕上。It should be noted that, in the embodiment of the present application, displaying the third trigger control and displaying the recognition result can be performed simultaneously, so that the user can visually perceive that the third trigger control and the recognition result are displayed together on the electronic device. on the screen.
S460:响应作用于所述第三触发控件的触控操作,显示操作菜单,所述操作菜单中包括有至少一个操作控件,每个操作控件对应的操作不同。S460: Display an operation menu in response to a touch operation acting on the third trigger control, where the operation menu includes at least one operation control, and each operation control corresponds to a different operation.
如图15所示,在图15的右侧图中,在显示识别结果时可以同时显示第三触发控件22。那么响应于作用于该第三触发控件22的触控操作,则可以显示操作菜单23。在操作菜单23中包括有名称为发送的操作控件、名称为复制的操作控件、名称为保存为文档的操作控件以及名称为保存为便签的操作控件。其中,名称为发送的触控操作对应的操作包括将识别结果发送到第三方应用程序中。该第三方应用程序可以包括即时通信类应用程序或者短信类程序等。名称为复制的操作控件对应的操作包括对识别结果进行复制操作,从而使得电子设备在执行该复制操作后,可以在其他可以进行文本输入的位置通过粘贴的方式将识别结果进行输入。名称为保存问文档的操作控件对应的操作可以包括将识别结果通过文档的方式进行存储。名称为保存问便签的操作控件对应的操作可以包括将识别结果通过便签的方式进行存储。As shown in FIG. 15 , in the right diagram of FIG. 15 , the third trigger control 22 may be displayed simultaneously when the recognition result is displayed. Then, in response to the touch operation acting on the third trigger control 22 , an operation menu 23 may be displayed. The operation menu 23 includes an operation control named send, an operation control named copy, an operation control named save as document, and an operation control named save as sticky note. Wherein, the operation corresponding to the touch operation named sending includes sending the recognition result to a third-party application program. The third-party application program may include an instant messaging application program or a short message program. The operation corresponding to the operation control named copy includes copying the recognition result, so that after the copy operation is performed, the electronic device can input the recognition result by pasting in other positions where text input is possible. The operation corresponding to the operation control named as saving the document may include storing the recognition result in the form of a document. The operation corresponding to the operation control named "Save Memo" may include storing the recognition result in the form of Memo.
S470:响应作用于所述操作控件的触控操作,将有触控操作的操作控件对应的操作作为目标操作。S470: In response to a touch operation acting on the operation control, use an operation corresponding to the operation control with the touch operation as a target operation.
S480:对所述识别结果执行所述目标操作。S480: Execute the target operation on the recognition result.
需要说明的是,在电子设备在基于全屏模式对识别结果进行显示的情况下,电子设备也可以同时显示第三触控控件,并且,在全屏模式下所显示的第三触控控件的功能与图15所示的第三触控控件的功能是相同的。It should be noted that, when the electronic device displays the recognition result based on the full-screen mode, the electronic device can also display the third touch control at the same time, and the function of the third touch control displayed in the full-screen mode is the same as The functions of the third touch controls shown in FIG. 15 are the same.
需要说明的是,本实施例中与其他实施例相同的步骤的具体说明,可以参见其他实施例中的相关内容,本实施例中不再赘述。It should be noted that for specific descriptions of steps in this embodiment that are the same as those in other embodiments, reference may be made to relevant content in other embodiments, and details are not repeated in this embodiment.
本实施例提供的一种内容识别方法,从而通过上述方式使得在电子设备在实时的对所采集的图像进行显示的状态下,在电子设备自动通过提示标识的方式,对实时显示的图像中所出现的指定内容进行标识后,通过作用于提示标识的触控操作,就可以直接触发电子设备对指定内容进行识别,从而简化了触发对图像进行识别的操作过程,提升了用户体验。并且,在本实施例中,在显示识别结果后,还会显示触发对识别结果进行进一步操作的第三触发控件,以便用户可以通过直接操作第三触发控件来调出操作菜单,并基于操作菜单中的操作控件来对识别结果进行进一步的操作。This embodiment provides a method for content recognition, so that in the state where the electronic device is displaying the collected images in real time, the electronic device automatically recognizes the images displayed in real time by means of prompting and marking After the specified content appears, the electronic device can be directly triggered to recognize the specified content through the touch operation acting on the prompt mark, thereby simplifying the operation process of triggering image recognition and improving user experience. Moreover, in this embodiment, after the recognition result is displayed, a third trigger control that triggers further operations on the recognition result will also be displayed, so that the user can call up the operation menu by directly operating the third trigger control, and based on the operation menu The operation controls in to perform further operations on the recognition results.
请参阅图16,本申请提供的一种内容识别方法,应用于电子设备,所述方法包括:Please refer to FIG. 16, a content identification method provided by the present application is applied to electronic devices, and the method includes:
S510:对采集的图像进行实时显示。S510: Display the collected images in real time.
S520:响应于变焦请求,对所采集的图像进行变焦处理。S520: In response to the zoom request, perform zoom processing on the captured image.
需要说明的是,在一些情况下,所采集图像中的物体尺寸太小,从而造成电子设备无法对图像中的物体进行有效的识别,或者电子设备无法有效的对物体上的文本进行有效的识别。而在另外一些情况下,所采集图像中的物体无法完整的显示在屏幕中,从而也会造成电子设备无法对物体进行有效的识别。那么通过变焦处理,可以改变所采集图像中的物体显示在屏幕中的范围,从而使得物体能够更大概率的被完整的进行显示。其中,本申请实施例中的变焦处理可以包括通过改变图像采集器件的焦距来实现变焦,也可以通过数码变焦的方式来进行变焦。其中,在数码变焦的过程中,电子设备是通过处理器,把所采集图像内的每个像素面积增大,从而达到变焦放大目的。It should be noted that, in some cases, the size of the object in the collected image is too small, so that the electronic device cannot effectively recognize the object in the image, or the electronic device cannot effectively recognize the text on the object . In other cases, the objects in the collected images cannot be completely displayed on the screen, which will also cause the electronic device to fail to effectively identify the objects. Then, through the zoom processing, the display range of the object in the captured image can be changed on the screen, so that the object can be completely displayed with a higher probability. Wherein, the zooming process in the embodiment of the present application may include zooming by changing the focal length of the image acquisition device, or may be zoomed by digital zooming. Wherein, in the process of digital zooming, the electronic device increases the area of each pixel in the captured image through a processor, so as to achieve the purpose of zooming.
作为一种方式,所述响应于变焦请求,对所采集的图像进行变焦处理,包括:若有作用于实时显示的图像中的区域选择操作,检测所选择区域中的物体是否被完整显示;若被完整显示,则生成增大尺寸的变焦请求,所述增大尺寸的变焦请求用于使得所选择区域中的物体以第一目标尺寸被完整显示;响应于所述增大尺寸的变焦请求,对所采集的图像进行变焦处理。其中,物体被完整显示可以理解为物体整体轮廓位于图像采集器件的采集范围内。示例性的,如图17所示,在图17的左侧图像中包括有一位女生和女生身旁的公交站牌,在图17左侧图示中,女生和公交站牌的整体轮廓都在图像采集器件的采集范围内,从而使得女生和公交站牌均被完整的显示。而在图17的右侧图像中, 女生的脚部以及公交站牌的右侧均无法被看到,那么则在图17的右侧图像中,女生和公交站牌均未被完整的显示As a manner, the zooming processing of the collected image in response to the zoom request includes: if there is an area selection operation acting on the image displayed in real time, detecting whether the object in the selected area is completely displayed; if is completely displayed, then generate a zoom request of increasing size, and the zoom request of increasing size is used to make the object in the selected area be completely displayed with the first target size; in response to the zoom request of increasing size, Perform zoom processing on the captured image. Wherein, the complete display of the object can be understood as that the overall outline of the object is within the collection range of the image collection device. Exemplarily, as shown in Figure 17, the left image of Figure 17 includes a girl and the bus stop sign next to the girl, and in the illustration on the left side of Figure 17, the overall outline of the girl and the bus stop sign are all in Within the collection range of the image collection device, the girl and the bus stop sign are completely displayed. In the right image of Figure 17, neither the girl's feet nor the right side of the bus stop can be seen, so in the right image of Figure 17, neither the girl nor the bus stop are fully displayed
若没有被完整显示,则生成减小尺寸的变焦请求,所述减小尺寸的变焦请求用于使得所选择区域中的物体以第二目标尺寸进行显示;响应于所述减小尺寸的变焦请求,对所采集的图像进行变焦处理。If it is not fully displayed, generate a zoom request for reducing the size, and the zoom request for reducing the size is used to display the object in the selected area with a second target size; responding to the zoom request for reducing the size , to perform zoom processing on the captured image.
需要说明的是,本申请实施例中的第一目标尺寸为所选择区域中的物体能够完整的显示时对应的最大的尺寸。可以理解的是,在增加焦距的情况下,所采集图像中的物体的尺寸会随着增大。示例性的,如图18所示,图18的左侧图像所示的为增加焦距之前的图像,图18的右侧图像所示的为增加焦距之后的图像,在增加焦距之后的图像中公交站牌的尺寸会相比右侧图像中的尺寸更大,从而有利使得公交站牌中的文本内容被检测出。但是,在物体增大到一定程度的情况下,物体可能无法被完整的进行显示,从而在增加焦距的过程中,同时保证物体的完整的显示有利于提升物体处的指定内容被成功检测到的概率。第二目标尺寸为所选择区域中的物体能够以最为完整的状态进行显示时的尺寸。可以理解的是,在进行焦距减小的过程中,可能及时减小到图像采集器件所支持的最小焦距,物体也无法完整的进行显示,但是相比缩小焦距之前,物体显示在屏幕的范围会更大,从而有利于物体处的指定内容被成功检测到的概率。It should be noted that the first target size in the embodiment of the present application is the corresponding maximum size when the objects in the selected area can be completely displayed. It can be understood that when the focal length is increased, the size of the object in the captured image will increase accordingly. Exemplarily, as shown in Figure 18, the image on the left side of Figure 18 shows the image before increasing the focal length, and the image on the right side of Figure 18 shows the image after increasing the focal length, in the image after increasing the focal length The size of the stop sign will be larger than the size in the image on the right, which is beneficial for the text content in the bus stop sign to be detected. However, when the object increases to a certain extent, the object may not be completely displayed. Therefore, in the process of increasing the focal length, ensuring the complete display of the object at the same time is conducive to improving the success detection of the specified content at the object. probability. The second target size is the size when the objects in the selected area can be displayed in the most complete state. It is understandable that during the process of reducing the focal length, it may be reduced to the minimum focal length supported by the image acquisition device in time, and the object cannot be completely displayed, but compared with before the focal length is reduced, the range of the object displayed on the screen will be smaller is larger, thereby benefiting the probability that the specified content at the object is successfully detected.
S530:若所显示的图像包括有指定内容,在所述指定内容处显示提示标识。S530: If the displayed image includes specified content, display a prompt logo at the specified content.
S540:响应作用于所述提示标识的触控操作,对所述指定内容进行识别。S540: Identify the specified content in response to a touch operation acting on the prompt mark.
S550:输出识别结果。S550: Outputting a recognition result.
需要说明的是,本实施例中与其他实施例相同的步骤的具体说明,可以参见其他实施例中的相关内容,本实施例中不再赘述。It should be noted that for specific descriptions of steps in this embodiment that are the same as those in other embodiments, reference may be made to relevant content in other embodiments, and details are not repeated in this embodiment.
本实施例提供的一种内容识别方法,从而通过上述方式使得在电子设备在实时的对所采集的图像进行显示的状态下,在电子设备自动通过提示标识的方式,对实时显示的图像中所出现的指定内容进行标识后,通过作用于提示标识的触控操作,就可以直接触发电子设备对指定内容进行识别,从而简化了触发对图像进行识别的操作过程,提升了用户体验。并且,在本实施例中,电子设备可以响应于区域选择操作而触发的变焦请求而对采集图像的图像采集器件进行变焦,以便可以使得区域选择操作所选择的区域中的物体可以目标尺寸被进行显示,以便提升被从实时图像中被检测出的概率。This embodiment provides a method for content recognition, so that in the state where the electronic device is displaying the collected images in real time, the electronic device automatically recognizes the images displayed in real time by means of prompting and marking After the specified content appears, the electronic device can be directly triggered to recognize the specified content through the touch operation acting on the prompt mark, thereby simplifying the operation process of triggering image recognition and improving user experience. Moreover, in this embodiment, the electronic device may zoom the image acquisition device for capturing images in response to the zoom request triggered by the area selection operation, so that objects in the area selected by the area selection operation can be displayed at a target size. displayed in order to increase the probability of being detected from the live image.
下面再通过一个场景来对本申请实施例进行一下说明,如图19所示,在该场景中方法包括:Next, a scenario is used to describe the embodiment of the present application, as shown in Figure 19, the method in this scenario includes:
S610:打开相机。S610: Turn on the camera.
S620:进入放大模式。S620: Enter the enlargement mode.
电子设备在启动相机后,在相机中可以配置有多种模式。该多种模式中可以包括有放大模式。在进入放大模式后,电子设备则可以对取景框中所显示的图像中是否有目标信息进行检测。其中,这里的目标信息可以理解为前述实施例中的指定内容。而其中取景框中所显示的图像则为实时显示的图像采集器件所采集的图像。After the electronic device starts the camera, multiple modes can be configured in the camera. The multiple modes may include a magnification mode. After entering the magnification mode, the electronic device can detect whether there is target information in the image displayed in the viewfinder frame. Wherein, the target information here can be understood as the specified content in the foregoing embodiments. The image displayed in the viewfinder frame is the image collected by the image acquisition device displayed in real time.
S630:取景框中出现目标信息预识别框。S630: A target information pre-recognition frame appears in the viewfinder frame.
其中,还可以包括:S631:获取用户圈出的位置。在S631中用户圈出的位置可以理解为前述实施例中的区域选择操作,那么在获取用户的圈出的位置后,电子设备所执行的操作,则和前述实施例中的响应于区域选择操作后所执行的操作是相同的。Wherein, it may further include: S631: Obtain the location circled by the user. The location circled by the user in S631 can be understood as the area selection operation in the foregoing embodiments, then after acquiring the circled location of the user, the operation performed by the electronic device is the same as the response to the region selection operation in the foregoing embodiments The operations performed afterwards are the same.
其中,与预识别框可以理解为一种前述实施例中所指出的提示标识。Wherein, the box with the pre-identification can be understood as a kind of prompt mark pointed out in the foregoing embodiments.
S640:点击预识别框。S640: Click the pre-identification box.
在电子设备检测到点击预识别框的操作后,则可以对预识别框处的目标信息进行识别。After the electronic device detects the operation of clicking the pre-recognition frame, it can recognize the target information at the pre-recognition frame.
S650:识别内容被放大输出。S650: The recognition content is amplified and output.
在识别内容被放大输出后,可以执行S651:通过操作菜单进行进一步的操作。该进一步的操作可以包括前述实施例中的操作菜单中的操作控件所对应的操作。After the recognition content is amplified and output, S651 may be executed: performing further operations through the operation menu. The further operation may include operations corresponding to the operation controls in the operation menu in the foregoing embodiments.
S660:进入阅读模式。S660: Enter the reading mode.
其中,阅读模式可以理解为前述实施例中的进入全屏模式对识别结果进行显示。在进入阅读模式后,可以执行S661:通过操作菜单进行进一步的操作。该进一步的操作可以包括前述实施例中的操作菜单中的操作控件所对应的操作。Wherein, the reading mode can be understood as entering a full-screen mode in the foregoing embodiments to display the recognition result. After entering the reading mode, S661 can be executed: performing further operations through the operation menu. The further operation may include operations corresponding to the operation controls in the operation menu in the foregoing embodiments.
请参阅图20,本申请提供的一种内容识别装置600,运行于电子设备,所述装置600包括:Please refer to FIG. 20 , a content identification device 600 provided by the present application runs on an electronic device, and the device 600 includes:
图像显示单元610,用于对采集的图像进行实时显示。可选的,图像显示单元610具体用于响应于用户的操作启动相机程序;在启动相机程序后,在所显示的相机程序的界面中对采集的图像进行实时显示。内容标识单元620,用于若所显示的图像包括有指定内容,在所述指定内容处显示提示标识。识别单元630,用于响应作用于所述提示标识的触控操作,对所述指定内容进行识别。内容输出单元640,用于输出识别结果。作为一种方式,内容输出单元640,具体用于获取背景图像,所述背景图像包括作用于 所述提示标识的触控操作作用时,所述电子设备所显示的图像;以所述背景图像为背景对所述识别结果进行显示。可选的,内容输出单元640,具体用于获取待处理图像,所述待处理图像为作用于所述提示标识的触控操作作用时,所述电子设备所显示的图像;对所述待处理图像进行虚化处理,并将所述虚化处理后的图像作为背景图像。内容输出单元640,还具体用于显示第一触发控件;所述以所述背景图像为背景对所述识别结果进行显示之后还包括:响应作用于所述第一触发控件的触控操作,基于全屏模式对所述识别结果进行显示。The image display unit 610 is configured to display the collected images in real time. Optionally, the image display unit 610 is specifically configured to start the camera program in response to the user's operation; after starting the camera program, display the captured image in real time on the displayed interface of the camera program. The content identification unit 620 is configured to display a prompt mark at the specified content if the displayed image includes the specified content. The identifying unit 630 is configured to identify the specified content in response to a touch operation acting on the prompt mark. The content output unit 640 is configured to output the recognition result. As one manner, the content output unit 640 is specifically configured to obtain a background image, the background image includes an image displayed by the electronic device when the touch operation acts on the prompt sign; the background image is The background displays the recognition result. Optionally, the content output unit 640 is specifically configured to acquire an image to be processed, the image to be processed is an image displayed by the electronic device when a touch operation is applied to the prompt mark; The image is blurred, and the blurred image is used as a background image. The content output unit 640 is also specifically configured to display a first trigger control; after displaying the recognition result with the background image as the background, it further includes: responding to a touch operation acting on the first trigger control, based on The full-screen mode displays the recognition result.
作为一种方式,内容输出单元640,具体用于显示识别结果。内容输出单元640,还用于显示第二触发控件;响应作用于所述第二触发控件的触控操作,显示锁定界面,所述锁定界面中包括作用于所述提示标识的触控操作作用时,所述电子设备所显示的图像,以及所述所显示的图像中的指定内容对应的提示标识。内容输出单元640,还用于响应于第一操作,恢复对采集的图像进行实时显示。As a manner, the content output unit 640 is specifically configured to display the recognition result. The content output unit 640 is further configured to display a second trigger control; in response to a touch operation acting on the second trigger control, display a lock interface, and the lock interface includes a , the image displayed by the electronic device, and a prompt identifier corresponding to the specified content in the displayed image. The content output unit 640 is further configured to resume real-time display of the collected images in response to the first operation.
作为一种方式,内容输出单元640,具体用于显示识别结果。内容输出单元640,还用于显示第三触发控件;响应作用于所述第三触发控件的触控操作,显示操作菜单,所述操作菜单中包括有至少一个操作控件,每个操作控件对应的操作不同;响应作用于所述操作控件的触控操作,将有触控操作的操作控件对应的操作作为目标操作;对所述识别结果执行所述目标操作。As a manner, the content output unit 640 is specifically configured to display the recognition result. The content output unit 640 is further configured to display a third trigger control; in response to a touch operation acting on the third trigger control, an operation menu is displayed, and the operation menu includes at least one operation control, and each operation control corresponds to The operations are different; in response to the touch operation acting on the operation control, the operation corresponding to the operation control with the touch operation is used as the target operation; and the target operation is performed on the recognition result.
可选的,内容输出单元640,具体用于若所述指定内容为文本内容,将所述识别结果的语义所表达的场景对应的场景图像作为背景图像。Optionally, the content output unit 640 is specifically configured to use the scene image corresponding to the scene expressed by the semantics of the recognition result as the background image if the specified content is text content.
可选的,内容输出单元640,具体用于若所述指定内容为文本内容,将所述识别结果中的关键词对应的图像作为背景图像。Optionally, the content output unit 640 is specifically configured to use an image corresponding to a keyword in the recognition result as a background image if the specified content is text content.
可选的,内容输出单元640,具体用于将所述识别结果与所述背景图像融合为一张图像,得到融合后的图像;对所述融合后的图像进行显示。Optionally, the content output unit 640 is specifically configured to fuse the recognition result and the background image into one image to obtain a fused image; and display the fused image.
可选的,内容输出单元640,具体用于对所述背景图像进行显示,并将所述识别结果悬浮于所显示的所述背景图像上。Optionally, the content output unit 640 is specifically configured to display the background image, and suspend the recognition result on the displayed background image.
可选的,内容输出单元640,具体用于若所述指定内容为文本内容,对所述识别结果进行尺寸放大后再进行显示。Optionally, the content output unit 640 is specifically configured to, if the specified content is text content, enlarge the size of the recognition result before displaying it.
作为一种方式,如图21所示,所述装置,还包括:变焦单元650,用于响应于变焦请求,对所采集的图像进行变焦处理。可选的,变焦单元650,具体用于若有作用于实时显示的图像中的区域选择操作,检测所选择区域中的物体是否被完整显示;若被完整显示,则生成增大尺寸的变焦请求,所述增大尺寸的变焦请求用于使得所选择区域中的物体以第一目标尺寸被完整显示;响应于所述增大尺寸的变焦请求,对所采集的图像进行变焦处理。若没有被完整显示,则生成减小尺寸的变焦请求,所述减小尺寸的变焦请求用于使得所选择区域中的物体以第二目标尺寸进行显示;响应于所述减小尺寸的变焦请求,对所采集的图像进行变焦处理。其中,指定内容包括:文本内容或者目标物体。As one manner, as shown in FIG. 21 , the device further includes: a zoom unit 650 configured to perform zoom processing on the captured image in response to a zoom request. Optionally, the zoom unit 650 is specifically configured to detect whether the object in the selected area is fully displayed if there is an area selection operation acting on the image displayed in real time; if it is fully displayed, generate a zoom request for increasing the size , the size-increasing zoom request is used to completely display the object in the selected area with a first target size; in response to the size-increasing zoom request, zoom processing is performed on the captured image. If it is not fully displayed, generate a zoom request for reducing the size, and the zoom request for reducing the size is used to display the object in the selected area with a second target size; responding to the zoom request for reducing the size , to perform zoom processing on the captured image. Wherein, the specified content includes: text content or a target object.
本申请提供的一种内容识别装置,在该装置会对采集的图像进行实时显示,并且在所显示的图像包括有指定内容的情况下,可以在指定内容处显示提示标识,然后响应作用于所述提示标识的触控操作,对所述指定内容进行识别,并输出识别结果。从而通过上述方式使得在电子设备在实时的对所采集的图像进行显示的状态下,在电子设备自动通过提示标识的方式,对实时显示的图像中所出现的指定内容进行标识后,通过作用于提示标识的触控操作,就可以直接触发电子设备对指定内容进行识别,从而简化了触发对图像进行识别的操作过程,提升了用户体验。The content recognition device provided by the present application can display the collected image in real time, and when the displayed image contains specified content, it can display a prompt mark at the specified content, and then respond to the action on the specified content. The touch operation of the above-mentioned prompt mark is used to identify the specified content and output the recognition result. Therefore, through the above method, when the electronic device is displaying the collected image in real time, after the electronic device automatically identifies the specified content appearing in the image displayed in real time by means of prompting and marking, it acts on the The touch operation of the prompt mark can directly trigger the electronic device to recognize the specified content, thereby simplifying the operation process of triggering the recognition of the image and improving the user experience.
需要说明的是,本申请中装置实施例与前述方法实施例是相互对应的,装置实施例中各个单元的具体实施原理与前述方法实施例中的原理是相似的,装置实施例中的具体内容可以参见方法实施例,而在装置实施例中不再赘述。It should be noted that the device embodiments in this application correspond to the foregoing method embodiments, and the specific implementation principles of each unit in the device embodiments are similar to those in the foregoing method embodiments. The specific content of the device embodiments Reference may be made to the method embodiments, and details are not repeated in the device embodiments.
下面将结合图22对本申请提供的一种电子设备进行说明。An electronic device provided by the present application will be described below with reference to FIG. 22 .
请参阅图22,基于上述的文本处理方法、装置,本申请实施例还提供的另一种可以执行前述文本处理方法的电子设备1000。电子设备1000包括相互耦合的一个或多个(图中仅示出一个)处理器102、存储器104、网络模块106、传感器模块108、图像采集器件110以及屏幕112。其中,该存储器104中存储有可以执行前述实施例中内容的程序,而处理器102可以执行该存储器104中存储的程序。Please refer to FIG. 22 , based on the above text processing method and apparatus, another electronic device 1000 that can implement the above text processing method is provided in the embodiment of the present application. The electronic device 1000 includes one or more (only one is shown in the figure) processors 102 , memory 104 , network module 106 , sensor module 108 , image acquisition device 110 and screen 112 coupled to each other. Wherein, the memory 104 stores programs capable of executing the contents of the foregoing embodiments, and the processor 102 can execute the programs stored in the memory 104 .
其中,处理器102可以包括一个或者多个用于处理数据的核。处理器102利用各种接口和线路连接整个电子设备1000内的各个部分,通过运行或执行存储在存储器104内的指令、程序、代码集或指令集,以及调用存储在存储器104内的数据,执行电子设备1000的各种功能和处理数据。可选地,处理器102可以采用数字信号处理(Digital Signal Processing,DSP)、现场可编程门阵列(Field-Programmable Gate Array,FPGA)、可编程逻辑阵列(Programmable Logic Array,PLA)中的至少一种硬件形式来实现。处理器102可集成中央处理器(Central Processing Unit,CPU)、图像处理器(Graphics Processing Unit,GPU)和调制解调器等中的一种或几种的组合。其中,CPU主要处理操作系统、用户界面和应用程序等;GPU用于负责显示内容的渲染和绘制;调制解调器用于处理无线通信。可以理解的是,上述调制解调器也可以 不集成到处理器102中,单独通过一块通信芯片进行实现。Wherein, the processor 102 may include one or more cores for processing data. The processor 102 uses various interfaces and circuits to connect various parts of the entire electronic device 1000, and executes or executes instructions, programs, code sets, or instruction sets stored in the memory 104, and calls data stored in the memory 104 to execute Various functions of the electronic device 1000 and processing data. Optionally, the processor 102 may adopt at least one of Digital Signal Processing (Digital Signal Processing, DSP), Field-Programmable Gate Array (Field-Programmable Gate Array, FPGA), and Programmable Logic Array (Programmable Logic Array, PLA). implemented in the form of hardware. The processor 102 may integrate one or a combination of a central processing unit (Central Processing Unit, CPU), an image processor (Graphics Processing Unit, GPU), a modem, and the like. Among them, the CPU mainly handles the operating system, user interface and application programs, etc.; the GPU is used to render and draw the displayed content; the modem is used to handle wireless communication. It can be understood that the above-mentioned modem may not be integrated into the processor 102, but may be realized by a communication chip alone.
存储器104可以包括随机存储器(Random Access Memory,RAM),也可以包括只读存储器(Read-Only Memory)。存储器104可用于存储指令、程序、代码、代码集或指令集。存储器104可包括存储程序区和存储数据区,其中,存储程序区可存储用于实现操作系统的指令、用于实现至少一个功能的指令(比如触控功能、声音播放功能、图像播放功能等)、用于实现下述各个方法实施例的指令等。例如,存储器104中可以存储有内容识别的装置。该内容识别的装置可以为前述的装置600。存储数据区还可以存储电子设备1000在使用中所创建的数据(比如电话本、音视频数据、聊天记录数据)等。The memory 104 may include random access memory (Random Access Memory, RAM), and may also include read-only memory (Read-Only Memory). Memory 104 may be used to store instructions, programs, codes, sets of codes, or sets of instructions. The memory 104 may include a program storage area and a data storage area, wherein the program storage area may store instructions for implementing an operating system, instructions for implementing at least one function (such as a touch function, a sound playback function, an image playback function, etc.) , instructions for implementing the following method embodiments, and the like. For example, content identification means may be stored in the memory 104 . The device for content identification may be the aforementioned device 600 . The storage data area can also store data created by the electronic device 1000 during use (such as phonebook, audio and video data, chat record data) and the like.
所述网络模块106用于接收以及发送电磁波,实现电磁波与电信号的相互转换,从而与通讯网络或者其他设备进行通讯,例如和音频播放设备进行通讯。所述网络模块106可包括各种现有的用于执行这些功能的电路元件,例如,天线、射频收发器、数字信号处理器、加密/解密芯片、用户身份模块(SIM)卡、存储器等等。所述网络模块106可与各种网络如互联网、企业内部网、无线网络进行通讯或者通过无线网络与其他设备进行通讯。上述的无线网络可包括蜂窝式电话网、无线局域网或者城域网。例如,网络模块106可以与基站进行信息交互。The network module 106 is used to receive and send electromagnetic waves, realize mutual conversion between electromagnetic waves and electrical signals, and communicate with communication networks or other devices, such as audio playback devices. The network module 106 may include various existing circuit elements for performing these functions, such as antennas, radio frequency transceivers, digital signal processors, encryption/decryption chips, Subscriber Identity Module (SIM) cards, memory, etc. . The network module 106 can communicate with various networks such as the Internet, intranet, wireless network or communicate with other devices through the wireless network. The wireless network mentioned above may include a cellular telephone network, a wireless local area network or a metropolitan area network. For example, the network module 106 can perform information exchange with the base station.
传感器模块108可以包括至少一种传感器。具体地,传感器模块108可包括但并不限于:光传感器、运动传感器、压力传感器、红外热传感器、距离传感器、加速度传感器、以及其他传感器。The sensor module 108 may include at least one sensor. Specifically, the sensor module 108 may include, but is not limited to: a light sensor, a motion sensor, a pressure sensor, an infrared heat sensor, a distance sensor, an acceleration sensor, and other sensors.
其中,压力传感器可以检测由按压在电子设备1000产生的压力的传感器。即,压力传感器检测由用户和电子设备之间的接触或按压产生的压力,例如由用户的耳朵与移动终端之间的接触或按压产生的压力。因此,压力传感器可以用来确定在用户与电子设备1000之间是否发生了接触或者按压,以及压力的大小。Wherein, the pressure sensor may be a sensor for detecting pressure generated by pressing on the electronic device 1000 . That is, the pressure sensor detects pressure generated by contact or press between the user and the electronic device, eg, contact or press between the user's ear and the mobile terminal. Therefore, the pressure sensor can be used to determine whether contact or pressure occurs between the user and the electronic device 1000, and the magnitude of the pressure.
其中,加速度传感器可检测各个方向上(一般为三轴)加速度的大小,静止时可检测出重力的大小及方向,可用于识别所述电子设备1000姿态的应用(比如横竖屏切换、相关游戏、磁力计姿态校准)、振动识别相关功能(比如计步器、敲击)等。另外,所述电子设备1000还可配置陀螺仪、气压计、湿度计、温度计等其他传感器,在此不再赘述。Among them, the acceleration sensor can detect the magnitude of acceleration in various directions (generally three axes), and can detect the magnitude and direction of gravity when it is stationary, and can be used to identify the application of the posture of the electronic device 1000 (such as horizontal and vertical screen switching, related games, Magnetometer posture calibration), vibration recognition related functions (such as pedometer, tapping), etc. In addition, the electronic device 1000 may also be configured with other sensors such as a gyroscope, a barometer, a hygrometer, and a thermometer, which will not be repeated here.
图像采集器件110可以用于进行图像采集,从而使得电子设备1000可以将所采集的图像在屏幕112中进行显示。The image acquisition device 110 can be used for image acquisition, so that the electronic device 1000 can display the acquired image on the screen 112 .
请参考图23,其示出了本申请实施例提供的一种计算机可读存储介质的结构框图。该计算机可读存储介质1100中存储有程序代码,所述程序代码可被处理器调用执行上述方法实施例中所描述的方法。Please refer to FIG. 23 , which shows a structural block diagram of a computer-readable storage medium provided by an embodiment of the present application. Program codes are stored in the computer-readable storage medium 1100, and the program codes can be invoked by a processor to execute the methods described in the foregoing method embodiments.
计算机可读存储介质1100可以是诸如闪存、EEPROM(电可擦除可编程只读存储器)、EPROM、硬盘或者ROM之类的电子存储器。可选地,计算机可读存储介质1100包括非易失性计算机可读介质(non-transitory computer-readable storage medium)。计算机可读存储介质1100具有执行上述方法中的任何方法步骤的程序代码1110的存储空间。这些程序代码可以从一个或者多个计算机程序产品中读出或者写入到这一个或者多个计算机程序产品中。程序代码1110可以例如以适当形式进行压缩。The computer readable storage medium 1100 may be an electronic memory such as flash memory, EEPROM (Electrically Erasable Programmable Read Only Memory), EPROM, hard disk, or ROM. Optionally, the computer-readable storage medium 1100 includes a non-transitory computer-readable storage medium (non-transitory computer-readable storage medium). The computer-readable storage medium 1100 has a storage space for program code 1110 for executing any method steps in the above methods. These program codes can be read from or written into one or more computer program products. Program code 1110 may, for example, be compressed in a suitable form.
综上所述,本申请提供的一种内容识别方法、装置、电子设备以及存储介质。在该方法中,会对采集的图像进行实时显示,并且在所显示的图像包括有指定内容的情况下,可以在指定内容处显示提示标识,然后响应作用于所述提示标识的触控操作,对所述指定内容进行识别,并输出识别结果。从而通过上述方式使得在电子设备在实时的对所采集的图像进行显示的状态下,在电子设备自动通过提示标识的方式,对实时显示的图像中所出现的指定内容进行标识后,通过作用于提示标识的触控操作,就可以直接触发电子设备对指定内容进行识别,从而简化了触发对图像进行识别的操作过程,提升了用户体验。In summary, the present application provides a content identification method, device, electronic equipment, and storage medium. In this method, the collected image is displayed in real time, and when the displayed image includes specified content, a prompt mark can be displayed at the specified content, and then in response to the touch operation acting on the prompt mark, Recognize the specified content and output the recognition result. Therefore, through the above method, when the electronic device is displaying the collected image in real time, after the electronic device automatically identifies the specified content appearing in the image displayed in real time by means of prompting and marking, it acts on the The touch operation of the prompt mark can directly trigger the electronic device to recognize the specified content, thereby simplifying the operation process of triggering the recognition of the image and improving the user experience.
在本说明书的描述中,参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本申请的至少一个实施例或示例中。在本说明书中,对上述术语的示意性表述不必须针对的是相同的实施例或示例。而且,描述的具体特征、结构、材料或者特点可以在任一个或多个实施例或示例中以合适的方式结合。此外,在不相互矛盾的情况下,本领域的技术人员可以将本说明书中描述的不同实施例或示例以及不同实施例或示例的特征进行结合和组合。In the description of this specification, descriptions referring to the terms "one embodiment", "some embodiments", "example", "specific examples", or "some examples" mean that specific features described in connection with the embodiment or example , structure, material or characteristic is included in at least one embodiment or example of the present application. In this specification, the schematic representations of the above terms are not necessarily directed to the same embodiment or example. Furthermore, the described specific features, structures, materials or characteristics may be combined in any suitable manner in any one or more embodiments or examples. In addition, those skilled in the art can combine and combine different embodiments or examples and features of different embodiments or examples described in this specification without conflicting with each other.
此外,术语“第一”、“第二”仅用于描述目的,而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二”的特征可以明示或者隐含地包括至少一个该特征。在本申请的描述中,“多个”的含义是至少两个,例如两个,三个等,除非另有明确具体的限定。In addition, the terms "first" and "second" are used for descriptive purposes only, and cannot be interpreted as indicating or implying relative importance or implicitly specifying the quantity of indicated technical features. Thus, the features defined as "first" and "second" may explicitly or implicitly include at least one of these features. In the description of the present application, "plurality" means at least two, such as two, three, etc., unless otherwise specifically defined.
流程图中或在此以其他方式描述的任何过程或方法描述可以被理解为,表示包括一个或更多个用于实现特定逻辑功能或过程的步骤的可执行指令的代码的模块、片段或部分,并且本申请的优选实施方式的范围包括另外的实现,其中可以不按所示出或讨论的顺序,包括根据所涉及的功能按基本同时的方式或按相反的顺序,来执行功能,这应被本申请的实施例所属技术领域的技术人员所理解。Any process or method descriptions in flowcharts or otherwise described herein may be understood to represent modules, segments or portions of code comprising one or more executable instructions for implementing specific logical functions or steps of the process , and the scope of preferred embodiments of the present application includes additional implementations in which functions may be performed out of the order shown or discussed, including in substantially simultaneous fashion or in reverse order depending on the functions involved, which shall It should be understood by those skilled in the art to which the embodiments of the present application belong.
在流程图中表示或在此以其他方式描述的逻辑和/或步骤,例如,可以被认为是用于实现逻辑功能的可执行指令的定序列表,可以具体实现在任何计算机可读介质中,以供指令执行系统、装置或设备(如基于计 算机的系统、包括处理器的系统或其他可以从指令执行系统、装置或设备取指令并执行指令的系统)使用,或结合这些指令执行系统、装置或设备而使用。The logic and/or steps represented in the flowcharts or otherwise described herein, for example, can be considered as a sequenced listing of executable instructions for implementing logical functions, can be embodied in any computer-readable medium, For use with instruction execution systems, devices, or devices (such as computer-based systems, systems including processors, or other systems that can fetch instructions from instruction execution systems, devices, or devices and execute instructions), or in conjunction with these instruction execution systems, devices or equipment used.
应当理解,本申请的各部分可以用硬件、软件、固件或它们的组合来实现。在上述实施方式中,多个步骤或方法可以用存储在存储器中且由合适的指令执行系统执行的软件或固件来实现。例如,如果用硬件来实现,和在另一实施方式中一样,可用本领域公知的下列技术中的任一项或他们的组合来实现:具有用于对数据信号实现逻辑功能的逻辑门电路的离散逻辑电路,具有合适的组合逻辑门电路的专用集成电路,可编程门阵列(PGA),现场可编程门阵列(FPGA)等。It should be understood that each part of the present application may be realized by hardware, software, firmware or a combination thereof. In the embodiments described above, various steps or methods may be implemented by software or firmware stored in memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, it can be implemented by any one or combination of the following techniques known in the art: Discrete logic circuits, ASICs with suitable combinational logic gates, programmable gate arrays (PGAs), field programmable gate arrays (FPGAs), etc.
最后应说明的是:以上实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征图进行等同替换;而这些修改或者替换,并不驱使相应技术方案的本质脱离本申请各实施例技术方案的精神和范围。Finally, it should be noted that: the above embodiments are only used to illustrate the technical solutions of the present application, but not to limit them; although the present application has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that: it can still Modifications are made to the technical solutions described in the foregoing embodiments, or equivalent replacements are made to some of the technical feature diagrams; and these modifications or replacements do not drive the essence of the corresponding technical solutions away from the spirit and scope of the technical solutions of the various embodiments of the application .

Claims (20)

  1. 一种内容识别方法,其中,应用于电子设备,所述方法包括:A content identification method, which is applied to an electronic device, the method comprising:
    对采集的图像进行实时显示;Real-time display of the collected images;
    若所显示的图像包括有指定内容,在所述指定内容处显示提示标识;If the displayed image includes specified content, a prompt logo is displayed at the specified content;
    响应作用于所述提示标识的触控操作,对所述指定内容进行识别;Responding to a touch operation acting on the prompt mark, identifying the specified content;
    输出识别结果。Output the recognition result.
  2. 根据权利要求1所述的方法,其中,所述输出识别结果,包括:The method according to claim 1, wherein said outputting the recognition result comprises:
    获取背景图像,所述背景图像包括作用于所述提示标识的触控操作作用时,所述电子设备所显示的图像;Acquiring a background image, where the background image includes an image displayed by the electronic device when the touch operation acts on the prompt sign;
    以所述背景图像为背景对所述识别结果进行显示。The recognition result is displayed with the background image as the background.
  3. 根据权利要求1所述的方法,其中,所述输出识别结果,包括:The method according to claim 1, wherein said outputting the recognition result comprises:
    获取待处理图像,所述待处理图像为作用于所述提示标识的触控操作作用时,所述电子设备所显示的图像;Acquiring an image to be processed, where the image to be processed is an image displayed by the electronic device when the touch operation acts on the prompt sign;
    对所述待处理图像进行虚化处理,并将所述虚化处理后的图像作为背景图像;Perform blurring processing on the image to be processed, and use the blurred image as a background image;
    以所述背景图像为背景对所述识别结果进行显示。The recognition result is displayed with the background image as the background.
  4. 根据权利要求2或3所述的方法,其中,所述输出识别结果还包括:The method according to claim 2 or 3, wherein the output recognition result further comprises:
    显示第一触发控件;Display the first trigger control;
    所述以所述背景图像为背景对所述识别结果进行显示之后还包括:After displaying the recognition result with the background image as the background, it also includes:
    响应作用于所述第一触发控件的触控操作,基于全屏模式对所述识别结果进行显示。In response to a touch operation acting on the first trigger control, the recognition result is displayed based on a full-screen mode.
  5. 根据权利要求2或3所述的方法,其中,所述方法还包括:The method according to claim 2 or 3, wherein the method further comprises:
    若所述指定内容为文本内容,将所述识别结果的语义所表达的场景对应的场景图像作为背景图像。If the specified content is text content, use the scene image corresponding to the scene expressed by the semantics of the recognition result as the background image.
  6. 根据权利要求2或3所述的方法,其中,所述方法还包括:The method according to claim 2 or 3, wherein the method further comprises:
    若所述指定内容为文本内容,将所述识别结果中的关键词对应的图像作为背景图像。If the specified content is text content, the image corresponding to the keyword in the recognition result is used as the background image.
  7. 根据权利要求2或3所述的方法,其中,所述以所述背景图像为背景对所述识别结果进行显示,包括:The method according to claim 2 or 3, wherein the displaying the recognition result with the background image as the background includes:
    将所述识别结果与所述背景图像融合为一张图像,得到融合后的图像;merging the recognition result and the background image into one image to obtain a fused image;
    对所述融合后的图像进行显示。The fused image is displayed.
  8. 根据权利要求2或3所述的方法,其中,所述以所述背景图像为背景对所述识别结果进行显示,包括:The method according to claim 2 or 3, wherein the displaying the recognition result with the background image as the background includes:
    对所述背景图像进行显示,并将所述识别结果悬浮于所显示的所述背景图像上。The background image is displayed, and the recognition result is suspended on the displayed background image.
  9. 根据权利要求1-8任一所述的方法,其中,所述输出识别结果,包括:The method according to any one of claims 1-8, wherein said outputting the recognition result comprises:
    显示识别结果;Display the recognition result;
    所述响应作用于所述提示标识的触控操作,对所述指定内容进行识别之后还包括:The response acts on the touch operation of the prompt mark, and after identifying the specified content, it also includes:
    显示第二触发控件;Show the second trigger control;
    响应作用于所述第二触发控件的触控操作,显示锁定界面,所述锁定界面中包括作用于所述提示标识的触控操作作用时,所述电子设备所显示的图像,以及所述所显示的图像中的指定内容对应的提示标识。In response to the touch operation acting on the second trigger control, a locking interface is displayed, and the locking interface includes the image displayed by the electronic device when the touch operation acting on the prompt sign is acted on, and the The prompt ID corresponding to the specified content in the displayed image.
  10. 根据权利要求9所述的方法,其中,所述响应作用于所述第二触发控件的触控操作,显示锁定界面之后还包括:The method according to claim 9, wherein, after displaying the locking interface in response to the touch operation acting on the second trigger control, further comprising:
    响应于第一操作,恢复对采集的图像进行实时显示。In response to the first operation, real-time display of the acquired images is resumed.
  11. 根据权利要求1-8任一所述的方法,其中,所述输出识别结果,包括:The method according to any one of claims 1-8, wherein said outputting the recognition result comprises:
    显示识别结果;Display the recognition result;
    所述响应作用于所述提示标识的触控操作,对所述指定内容进行识别之后还包括:The response acts on the touch operation of the prompt mark, and after identifying the specified content, it also includes:
    显示第三触发控件;Display the third trigger control;
    响应作用于所述第三触发控件的触控操作,显示操作菜单,所述操作菜单中包括有 至少一个操作控件,每个操作控件对应的操作不同;In response to the touch operation acting on the third trigger control, an operation menu is displayed, the operation menu includes at least one operation control, and each operation control corresponds to a different operation;
    响应作用于所述操作控件的触控操作,将有触控操作的操作控件对应的操作作为目标操作;In response to the touch operation acting on the operation control, using the operation corresponding to the operation control with the touch operation as the target operation;
    对所述识别结果执行所述目标操作。performing the target operation on the recognition result.
  12. 根据权利要求9或11所述的方法,其中,所述显示识别结果,包括:The method according to claim 9 or 11, wherein said displaying the recognition result comprises:
    若所述指定内容为文本内容,对所述识别结果进行尺寸放大后再进行显示。If the specified content is text content, the recognition result is enlarged in size and then displayed.
  13. 根据权利要求1-12任一所述的方法,其中,所述响应作用于所述提示标识的触控操作,对所述指定内容进行识别之前还包括:The method according to any one of claims 1-12, wherein the response to the touch operation acting on the prompt mark, before identifying the specified content, further includes:
    响应于变焦请求,对所采集的图像进行变焦处理。In response to the zoom request, the captured image is zoomed.
  14. 根据权利要求13所述的方法,其中,所述响应于变焦请求,对所采集的图像进行变焦处理,包括:The method according to claim 13, wherein said performing zoom processing on the captured image in response to the zoom request comprises:
    若有作用于实时显示的图像中的区域选择操作,检测所选择区域中的物体是否被完整显示;If there is an area selection operation in the image displayed in real time, detect whether the object in the selected area is completely displayed;
    若被完整显示,则生成增大尺寸的变焦请求,所述增大尺寸的变焦请求用于使得所选择区域中的物体以第一目标尺寸被完整显示;If it is fully displayed, then generate a zoom request for increasing the size, and the zoom request for increasing the size is used to make the object in the selected area be completely displayed with the first target size;
    响应于所述增大尺寸的变焦请求,对所采集的图像进行变焦处理。In response to the zoom request for increasing the size, zoom processing is performed on the captured image.
  15. 根据权利要求14所述的方法,其中,所述所述响应于变焦请求,对所采集的图像进行变焦处理还包括:The method according to claim 14, wherein said performing zoom processing on the captured image in response to the zoom request further comprises:
    若没有被完整显示,则生成减小尺寸的变焦请求,所述减小尺寸的变焦请求用于使得所选择区域中的物体以第二目标尺寸进行显示;If not completely displayed, generating a zoom request for reducing the size, the zoom request for reducing the size is used to display the object in the selected area with a second target size;
    响应于所述减小尺寸的变焦请求,对所采集的图像进行变焦处理。In response to the size-reducing zoom request, zoom processing is performed on the captured image.
  16. 根据权利要求1-15任一所述的方法,其中,所述对采集的图像进行实时显示,包括:The method according to any one of claims 1-15, wherein the real-time display of the collected images comprises:
    响应于用户的操作启动相机程序;launch the camera program in response to user actions;
    在启动相机程序后,在所显示的相机程序的界面中对采集的图像进行实时显示。After the camera program is started, the collected images are displayed in real time in the interface of the displayed camera program.
  17. 根据权利要求1-16任一所述的方法,其中,所述指定内容包括:文本内容或者目标物体。The method according to any one of claims 1-16, wherein the specified content includes: text content or a target object.
  18. 一种内容识别装置,其中,运行于电子设备,所述装置包括:A content identification device, wherein, running on an electronic device, the device includes:
    图像显示单元,用于对采集的图像进行实时显示;An image display unit is used to display the collected images in real time;
    内容标识单元,用于若所显示的图像包括有指定内容,在所述指定内容处显示提示标识;A content identification unit, configured to display a prompt mark at the specified content if the displayed image includes specified content;
    识别单元,用于响应作用于所述提示标识的触控操作,对所述指定内容进行识别;An identification unit, configured to identify the specified content in response to a touch operation acting on the prompt mark;
    内容输出单元,用于输出识别结果。The content output unit is used to output the recognition result.
  19. 一种电子设备,其中,包括一个或多个处理器以及存储器;An electronic device comprising one or more processors and memory;
    一个或多个程序被存储在所述存储器中并被配置为由所述一个或多个处理器执行,所述一个或多个程序配置用于执行权利要求1-11任一所述的方法。One or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs configured to perform the method of any one of claims 1-11.
  20. 一种存储有处理器可执行的程序代码的计算机可读存储介质,其中,所述计算机可读存储介质包括存储的程序代码,其中,在所述程序代码运行时执行权利要求1-11任一所述的方法。A computer-readable storage medium storing program codes executable by a processor, wherein the computer-readable storage medium includes stored program codes, wherein any one of claims 1-11 is executed when the program codes run. the method described.
PCT/CN2022/090382 2021-06-24 2022-04-29 Content recognition method and apparatus, electronic device, and storage medium WO2022267696A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110706063.9 2021-06-24
CN202110706063.9A CN115527135A (en) 2021-06-24 2021-06-24 Content identification method and device and electronic equipment

Publications (1)

Publication Number Publication Date
WO2022267696A1 true WO2022267696A1 (en) 2022-12-29

Family

ID=84545220

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/090382 WO2022267696A1 (en) 2021-06-24 2022-04-29 Content recognition method and apparatus, electronic device, and storage medium

Country Status (2)

Country Link
CN (1) CN115527135A (en)
WO (1) WO2022267696A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100067738A1 (en) * 2008-09-16 2010-03-18 Robert Bosch Gmbh Image analysis using a pre-calibrated pattern of radiation
CN109034115A (en) * 2018-08-22 2018-12-18 Oppo广东移动通信有限公司 Video knows drawing method, device, terminal and storage medium
CN111126301A (en) * 2019-12-26 2020-05-08 腾讯科技(深圳)有限公司 Image processing method and device, computer equipment and storage medium

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100067738A1 (en) * 2008-09-16 2010-03-18 Robert Bosch Gmbh Image analysis using a pre-calibrated pattern of radiation
CN109034115A (en) * 2018-08-22 2018-12-18 Oppo广东移动通信有限公司 Video knows drawing method, device, terminal and storage medium
CN111126301A (en) * 2019-12-26 2020-05-08 腾讯科技(深圳)有限公司 Image processing method and device, computer equipment and storage medium

Also Published As

Publication number Publication date
CN115527135A (en) 2022-12-27

Similar Documents

Publication Publication Date Title
US20230393721A1 (en) Method and Apparatus for Dynamically Displaying Icon Based on Background Image
US11847314B2 (en) Machine translation method and electronic device
CN111541845B (en) Image processing method and device and electronic equipment
WO2021104197A1 (en) Object tracking method and electronic device
CN108234875B (en) Shooting display method and device, mobile terminal and storage medium
US9319632B2 (en) Display apparatus and method for video calling thereof
US9654942B2 (en) System for and method of transmitting communication information
CN109670427B (en) Image information processing method and device and storage medium
CN111164983B (en) The interconnection terminal lends local processing capability
US11893767B2 (en) Text recognition method and apparatus
CN111031398A (en) Video control method and electronic equipment
US10893203B2 (en) Photographing method and apparatus, and terminal device
WO2021179804A1 (en) Image processing method, image processing device, storage medium, and electronic apparatus
WO2018184260A1 (en) Correcting method and device for document image
WO2021104266A1 (en) Object display method and electronic device
WO2021179856A1 (en) Content recognition method and apparatus, electronic device, and storage medium
US20230224574A1 (en) Photographing method and apparatus
CN114037692A (en) Image processing method, mobile terminal and storage medium
CN116916151B (en) Shooting method, electronic device and storage medium
CN113946302B (en) Method and device for opening file
US11756302B1 (en) Managing presentation of subject-based segmented video feed on a receiving device
EP4284009A1 (en) Method for acquiring image, and electronic device
WO2022267696A1 (en) Content recognition method and apparatus, electronic device, and storage medium
US20210377454A1 (en) Capturing method and device
CN116700477A (en) Display method and electronic equipment

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22827194

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE