WO2021120420A1 - Reading assistance method and apparatus, and electronic device - Google Patents

Reading assistance method and apparatus, and electronic device Download PDF

Info

Publication number
WO2021120420A1
WO2021120420A1 PCT/CN2020/079181 CN2020079181W WO2021120420A1 WO 2021120420 A1 WO2021120420 A1 WO 2021120420A1 CN 2020079181 W CN2020079181 W CN 2020079181W WO 2021120420 A1 WO2021120420 A1 WO 2021120420A1
Authority
WO
WIPO (PCT)
Prior art keywords
character string
target
recognized
target area
reading
Prior art date
Application number
PCT/CN2020/079181
Other languages
French (fr)
Chinese (zh)
Inventor
钟波
肖适
王鑫
余金清
Original Assignee
成都极米科技股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 成都极米科技股份有限公司 filed Critical 成都极米科技股份有限公司
Publication of WO2021120420A1 publication Critical patent/WO2021120420A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/451Execution arrangements for user interfaces

Definitions

  • This application relates to the field of image processing technology, and in particular, to a reading assistance method, device, and electronic equipment.
  • the current reading mode is generally: 1) physical books carry reading content; 2) electronic devices display the content that needs to be read.
  • electronic devices display the content that needs to be read.
  • the reading efficiency of the above two methods is relatively low.
  • the purpose of the embodiments of the present application is to provide a reading assistance method, device, and electronic equipment. It can assist the user to process the reading, thereby improving the reading effect.
  • an embodiment provides a reading assistance method, including:
  • the target character string is displayed in the target area.
  • the step of displaying the target character string in the target area includes:
  • the target character string is projected to the blank area of the target area for display.
  • the reading assistance method provided by the embodiment of the present application can also display the target character string in a blank area, which can prevent the target character string from blocking the content that the user may need to read, reduce the display effect of the reading, and affect the customer experience.
  • the method further includes:
  • the reading assistance method provided by the embodiment of the present application can also store the updated content when there is updated content, which can facilitate the user to subsequently query the content generated during the reading process.
  • the step of storing the updated content includes:
  • the updated content is stored in association with the electronic reading.
  • the updated content is stored in association with the electronic reading, so that it is convenient for the user to query the updated content when viewing the electronic reading.
  • the step of displaying the target character string in the target area includes:
  • the target character string is projected to the display surface for display.
  • the physical reading object may not be a standard plane
  • the target character string is displayed on the plane, it may cause the character string misalignment, etc., so that the display surface is determined first, and then based on the display surface.
  • the target character string is displayed, so that the display effect can be more in line with the visual effect required by the human eye.
  • the step of performing designation processing on the character string to be recognized to determine a target character string corresponding to the character string to be recognized includes:
  • the interpretation document corresponding to the character string to be recognized is retrieved, and the interpretation document is used as the target character string corresponding to the character string to be recognized.
  • the reading assistance method provided by the embodiment of the present application can also translate or interpret the character string to be recognized, which can reduce the user's query operations during the reading process and improve the reading experience.
  • the method further includes:
  • Projecting an electronic reading into the target area, and the character string to be identified is a character string in the electronic reading.
  • the reading assistance method provided by the embodiment of the present application can also directly project the electronic reading material that needs to be read, so that it is convenient for the user to read more content.
  • an embodiment provides a reading aid device, including:
  • An acquisition module configured to acquire first image data in a target area, the first image data including an instruction action image of the target object
  • a recognition module configured to recognize the instruction action image to determine the character string to be recognized corresponding to the instruction action image
  • a processing module configured to perform designated processing on the character string to be recognized to determine a target character string corresponding to the character string to be recognized;
  • the first projection module is configured to display the target character string in the target area.
  • an embodiment provides an electronic device including: a processor and a memory, the memory stores machine-readable instructions executable by the processor, and when the electronic device is running, the machine-readable instructions are When the processor is executed, the steps of the method described in any of the foregoing embodiments are executed.
  • an embodiment provides a computer-readable storage medium with a computer program stored on the computer-readable storage medium, and the computer program executes the steps of the method described in any of the foregoing embodiments when the computer program is run by a processor.
  • the reading assistance method, device, electronic equipment, and computer-readable storage medium provided by the embodiments of the present application can realize the specified processing of the character string to be recognized through instruction actions, which can facilitate the user to obtain the corresponding character string to be recognized when reading
  • the target character string can improve the understanding of the character string to be recognized when reading.
  • FIG. 1 is a schematic block diagram of an electronic device provided by an embodiment of the application.
  • FIG. 2 is a flowchart of a reading assistance method provided by an embodiment of the application.
  • FIG. 3 is a detailed flowchart of step 204 of the reading assistance method provided by an embodiment of the application.
  • FIG. 4 is a detailed flowchart of step 204 of the reading assistance method provided by an embodiment of the application.
  • FIG. 5 is a partial flowchart of a reading assistance method provided by an embodiment of this application.
  • FIG. 6 is a schematic diagram of functional modules of a reading aid device provided by an embodiment of the application.
  • the electronic device 100 may include a memory 111, a storage controller 112, a processor 113, a peripheral interface 114, an input and output unit 115, a collection unit 116, a projector 117, and a radio frequency unit 118.
  • a memory 111 may include a main memory 111, a main memory 111, a main memory 111, a main memory 111, a main memory 111, a main memory 111, a main memory 111, a main memory 111, a main memory 111, a main memory 111, a processor 113, a peripheral interface 114, an input and output unit 115, a collection unit 116, a projector 117, and a radio frequency unit 118.
  • FIG. 1 is only for illustration, and does not limit the structure of the electronic device 100.
  • the electronic device 100 may also include more or fewer components than those shown in FIG. 1 or have a different configuration from that shown in FIG. 1.
  • the aforementioned components of the memory 111, the storage controller 112, the processor 113, the peripheral interface 114, the input output unit 115, and the collection unit 116 are directly or indirectly electrically connected to each other to realize data transmission or interaction.
  • these components can be electrically connected to each other through one or more communication buses or signal lines.
  • the aforementioned processor 113 is used to execute executable modules stored in the memory.
  • the memory 111 may be, but is not limited to, random access memory (Random Access Memory, RAM for short), Read Only Memory (ROM for short), Programmable Read-Only Memory (PROM for short) ), Erasable Programmable Read-Only Memory (EPROM), Electrical Erasable Programmable Read-Only Memory (EEPROM), etc.
  • the memory 111 is used to store a program, and the processor 113 executes the program after receiving an execution instruction.
  • the method executed by the electronic device 100 of the process definition disclosed in any embodiment of the present application can be applied to processing In the processor 113, or implemented by the processor 113.
  • the aforementioned processor 113 may be an integrated circuit chip with signal processing capability.
  • the aforementioned processor 113 may be a general-purpose processor, including a central processing unit (Central Processing Unit, CPU for short), a network processor (Network Processor, NP for short), etc.; it may also be a digital signal processor (DSP for short). ), Application Specific Integrated Circuit (ASIC), Field Programmable Gate Array (FPGA) or other programmable logic devices, discrete gates or transistor logic devices, discrete hardware components.
  • the methods, steps, and logical block diagrams disclosed in the embodiments of the present application can be implemented or executed.
  • the general-purpose processor may be a microprocessor or the processor may also be any conventional processor or the like.
  • peripheral interface 114 couples various input/output devices to the processor 113 and the memory 111.
  • the peripheral interface 114, the processor 113, and the storage controller 112 may be implemented in a single chip. In some other instances, they can be implemented by independent chips.
  • the aforementioned input and output unit 115 is used to provide input data to the user.
  • the input and output unit 115 may be, but is not limited to, a mouse, a keyboard, and the like.
  • the aforementioned acquisition unit 116 is used to capture images (for example, photos, videos, etc.), and to store the captured images for use by other components.
  • the acquisition unit 116 may be an RGB-D (Red Green Blue-Deep) camera.
  • the acquisition unit 116 can be used to capture depth images.
  • the electronic device 100 in this embodiment may further include a projector 117, and the projector 117 includes a light source, a projection optical system, and other projection elements, which are used to realize image projection.
  • the projector 117 includes a light source, a projection optical system, and other projection elements, which are used to realize image projection.
  • a radio frequency (RF) unit 118 is used to receive and send electromagnetic waves, realize the mutual conversion between electromagnetic waves and electrical signals, and communicate with communication networks or other devices.
  • the radio frequency unit 118 may include various existing circuit elements for performing these functions, for example, an antenna, a radio frequency transceiver, a digital signal processor, an encryption/decryption chip, a subscriber identity module (SIM) card, a memory, and so on.
  • the radio frequency unit 118 can communicate with various networks such as the Internet, an intranet, and a wireless network, or communicate with other devices through a wireless network.
  • the aforementioned wireless network may include a cellular telephone network, a wireless local area network, or a metropolitan area network.
  • the above-mentioned wireless network can use various communication standards, protocols and technologies, including but not limited to Global System for Mobile Communication (GSM), Enhanced Data GSM Environment, EDGE, and broadband code Wideband code division multiple access (W-CDMA), code division multiple access (CDMA), time division multiple access (TDMA), wireless fidelity technology (Wireless, Fidelity) , WiFi) (such as the American Institute of Electrical and Electronics Engineers standards IEEE 802.11a, IEEE 802.11b, IEEE802.11g and/or IEEE 802.11n), Internet telephony (Voice over internet protocol, VoIP), Worldwide Interoperability for Microwave Access, Wi-Max), other protocols used for mail, instant messaging and short messages, and any other appropriate communication protocols, even those that have not yet been developed.
  • the above-mentioned radio frequency unit 118 can realize the communication between the electronic device 100 and external devices.
  • the electronic device 100 in this embodiment can be used to execute each step in each method provided in the embodiment of the present application.
  • the implementation process of the reading assistance method will be described in detail below through several embodiments.
  • FIG. 2 is a flowchart of a reading assistance method provided by an embodiment of the present application. The specific process shown in FIG. 2 will be described in detail below.
  • Step 201 Collect first image data in the target area.
  • the first image data includes an instruction action image of the target object.
  • the above-mentioned target object can be any object that can point to the content in the target area.
  • the target object may be a pen, a baton, a user's finger, and so on.
  • the indicating action image can be that the target object touches a character string; it can also be that the target object is located below a character string.
  • the reading content is displayed in the target area.
  • the reading content may be the content of the electronic reading displayed by projection, or the content printed in the physical reading placed in the target area.
  • the aforementioned target area may be a desktop.
  • Physical reading objects can be placed on the desktop.
  • the entity reading material may be a novel book, a foreign document, learning materials, and the like.
  • electronic reading materials may be displayed in the above-mentioned target area.
  • the content represented by the electronic book may be learning content, novel fragments, and so on.
  • the target area carrying the electronic reading can be any interface that can be used to display the projection screen, such as a solid-color wall surface, a solid-color white paper, or the like.
  • the method in this embodiment further includes: projecting the electronic reading into the target area.
  • the first image data may include a character string in the electronic book.
  • Step 202 Recognizing the indicating action image to determine the character string to be recognized corresponding to the indicating action image.
  • step 202 may include: recognizing the location of the target object to determine the target location pointed to by the target object; performing text recognition on the content around the target location to extract the character string to be recognized.
  • the following describes the detection of the position of the target object by taking the target object as the user's finger as an example.
  • the indicating action image can be detected by edge detection to determine the edge of the user's finger.
  • the determined specified orientation of the edge of the user's finger may be used as the target position.
  • the detected upper edge can be used as the target position.
  • the determined designated position of the area where the user's finger is located may be used as the target position.
  • the upper left edge of the area where the user's finger is located can be used as the target position.
  • performing text recognition on the content around the target location to extract the character string to be recognized may be implemented as: recognizing the content around the target location using a neural network model to extract the character string to be recognized.
  • performing text recognition on the content around the target location to extract the character string to be recognized can be implemented as: using an OCR (Optical Character Recognition, Chinese name: optical character recognition) model to perform text recognition on the content around the target location Recognition to extract the string to be recognized.
  • OCR Optical Character Recognition, Chinese name: optical character recognition
  • text recognition can be performed on the area around the target location that is not covered by the target object.
  • only the line of character strings closest to the target location can be identified.
  • the above-mentioned character string to be identified is the character string in the electronic reading.
  • the above-mentioned character string to be identified is a character string extracted from the physical reading object.
  • Step 203 Perform designation processing on the character string to be recognized to determine a target character string corresponding to the character string to be recognized.
  • step 203 may include: translating the character string to be recognized to obtain a target character string of the character string to be recognized in a target language.
  • the character string to be recognized may be a character in the first language
  • the target character string may be a character in the second language.
  • the above-mentioned designation process may be to translate the character string to be recognized.
  • the character string to be recognized may be characters in languages such as English, French, and Italian.
  • the target string can be Chinese characters.
  • the aforementioned character string to be identified can be "patent", and the target character string can be "patent". It can be known that the language types corresponding to the above-mentioned character string to be recognized and the target character string are merely exemplary, and the embodiment of the present application does not limit the language types corresponding to the recognized character string and the target character string.
  • translating the character string to be recognized to obtain the target character string of the character string to be recognized in the target language may include: translating the character string to be recognized through the translation application.
  • the recognized character string is translated offline to obtain the target character string of the character string to be recognized in the target language.
  • the above-mentioned translating the character string to be recognized to obtain the target character string of the character string to be recognized in the target language may include: translating the character string to the target language through the translation application.
  • the character string to be recognized is translated online to obtain the target character string of the character string to be recognized in the target language.
  • step 203 may include: searching for the interpretation document corresponding to the character string to be recognized, and using the interpretation document as the target character string corresponding to the character string to be recognized.
  • the character string to be recognized may be an abbreviation of a proper noun
  • the target character string may be the full name corresponding to the abbreviation.
  • the character string to be identified may be "IP”
  • the corresponding target character string may be "Internet Protocol (IP)”.
  • IP Internet Protocol
  • the string to be identified can be "CNN”
  • the corresponding target string can be "Cable News Network (CNN)” or “Convolutional Neural Networks (CNN)” Wait.
  • the character string to be recognized may be an idiom
  • the target character string may be the meaning, allusion, or source corresponding to the idiom.
  • the character string to be recognized can be "sounds like a bell", and the corresponding target character string can be "speaking or singing loudly, like striking a bell”.
  • the target string may also include the source of the idiom.
  • the target string in the above example can also include "Ming Feng Menglong's "Eastern Zhou Dynasty" Chapter 72: Recalling Xu's eyes is like lightning, and the sound is like a bell.”
  • the character string to be recognized may be a professional word
  • the target character string may be the corresponding interpretation of the idiom.
  • the character string to be identified can be the "leverage principle" in the field of physics
  • the target string can be an explanation of the lever principle. Condition”. To balance the lever, the two moments (the product of the force and the arm) acting on the lever must be equal in size.”
  • a search engine may be installed in the electronic device.
  • the aforementioned retrieval of the interpretation document corresponding to the character string to be identified, and using the interpretation document as the target character string corresponding to the character string to be identified may include: using a search engine to retrieve For the interpretation document corresponding to the character string to be recognized, the interpretation document is used as the target character string corresponding to the character string to be recognized.
  • a local database may also be stored in the electronic device.
  • searching for the interpretation document corresponding to the character string to be recognized, and using the interpretation document as the target character string corresponding to the character string to be recognized may include: querying a local database corresponding to the character string to be recognized Interpretation document, the interpretive document is used as the target character string corresponding to the character string to be identified.
  • Step 204 Display the target character string in the target area.
  • the target character string may be displayed in an area that does not obscure the electronic reading displayed in the target area, or the text part of the physical reading.
  • the target character string can also be displayed in the read area, for example, the target character string can be displayed above the position of the character string to be recognized.
  • the target character string can be projected to the target area for display by means of projection.
  • the target area may be an electronic display, and the target character string may be displayed on the electronic display.
  • step 204 may include the following steps.
  • Step 2041 Collect second image data of the target area.
  • Step 2042 Determine whether the target area includes a blank area by identifying the second image data.
  • the aforementioned blank area may indicate an area in the target area where no characters or images are mapped.
  • the blank area of the target area can be determined by recognizing the color of each pixel in the second image data.
  • the color of the target area may be a designated color.
  • the designated color can be white, green, black, etc.
  • the color value is determined to be the specified color in the pixel point corresponding to the specified color, and all pixels in the area formed by the specified number of pixels are determined to be the specified color, it can be determined that the specified number of pixels are formed
  • the area is the designated color area, that is, it can be judged as a blank area.
  • Step 2043 If the target area includes a blank area, project the target character string to the blank area of the target area for display.
  • Step 2044 If the target area does not include a blank area, project the target character string to the position of the character string in the previous designated line of the character string to be recognized in the target area.
  • the projection optical system of the projector of the electronic device can be adjusted to adjust the projection angle of the projected target character string, so as to realize the projecting of the target character string to the blank area of the target area or the character to be recognized The position of the specified line string before the string is displayed.
  • the target character string can be matched with the surface carrying the display target character string.
  • the target character string can be displayed on the flat surface according to the display standard of the flat surface.
  • the target character string can be mapped and displayed on the curved surface according to the shape of the curved surface.
  • step 204 may include the following steps.
  • Step 2045 If it is detected that a physical reading object is placed in the target area, collect fourth image data of the target area.
  • the fourth image data may be a depth image.
  • Each pixel in the fourth image data represents the distance between the collection device and the object in the fourth image data.
  • a depth image of the target area can be collected to determine whether a physical reading object is placed in the target area.
  • the collecting unit may collect a target depth image when any plane in the target area containing the character string to be read is parallel, and judge whether the pixel value of each pixel corresponding to the target area in the depth image is equal. If the values are not all equal, it means that there is a physical reading material in the target area; if the pixel value of each pixel is equal, it means that there is no object in the target area, and the target area may have an electronic reading material projected.
  • Step 2046 Determine the display surface corresponding to the physical reading object according to the fourth image data.
  • the location of the physical reading object and the plane formed by the surface of the physical reading object are determined by the pixels of each pixel.
  • Step 2047 Project the target character string to the display surface for display.
  • the above-mentioned display surface can be subdivided into grids, the pre-distortion matrix of the projector can be solved, and geometric correction can be performed in real time.
  • each triangle mesh uses its own three vertices and at least two adjacent nodes to obtain the corresponding pre-distortion matrix M using the least square method, and finally use the The screen-like method pre-distorted the corresponding area in the buffer before projection.
  • the reading assistance method in this embodiment may further include the following steps.
  • Step 205 Collect third image data of the target area according to a preset period.
  • Step 206 When the third image data is collected, identify whether the third image data has updated content relative to the image data collected at the previous moment of the current moment.
  • the third image data can be compared with the image data collected at the previous moment of the current moment by pixel to determine whether there is a difference between the third image data and the image data collected at the previous moment of the current moment. If there is a difference, it can be determined that there is updated content in the third image data.
  • the first image feature of the third image data can be extracted; the second image feature of the image data collected at the previous moment of the current time can be extracted; and then the first image feature and the second image feature can be calculated. If the Euclidean distance is greater than the preset value, it can be determined that there is updated content in the third image data.
  • the aforementioned preset value may be a positive number greater than zero.
  • the size of the preset value can be set according to requirements, and the embodiment of the present application is not limited to the size of the preset value.
  • the updated content may be notes made in the target area.
  • the aforementioned update content may not include the target object mentioned in step 201.
  • Step 207 If there is updated content, store the updated content.
  • step 207 may include: if it is detected that the projected electronic reading is displayed in the target area, then storing the updated content in association with the electronic reading.
  • the generation time of the updated content can also be stored in association with the updated content.
  • step 207 can also be implemented as: if there is updated content and the specified action is detected in the third image data, the updated content is stored.
  • the designated action may be directed to update content.
  • the designated action may be directed to the time to update the content for a designated period of time.
  • the third image data may include multiple pictures, and when the specified number of images continuously collected all include the target object pointing to the updated content, it can be determined that the time pointing to the updated content lasts for the specified period of time.
  • the third image data may be a video, and when the specified duration of the video includes the target object pointing to the updated content, it can be determined that the time pointing to the updated content lasts for the specified duration.
  • the method provided in the embodiment of the present application can be used for teaching; the target area can be a blackboard or whiteboard for teaching.
  • Teaching materials are currently projected in the target area.
  • the annotation corresponding to the content can be displayed.
  • an English text is displayed in the current target area, and when the teaching stick points to an English word, the Chinese meaning corresponding to the English word can be displayed on the blank area of the blackboard or whiteboard.
  • the teacher writes some teaching notes on the blackboard, the teaching notes can be stored in association with the teaching materials currently displayed on the projection.
  • the method provided by the embodiment of this application can be used for personal reading; a book is placed in the target area, and when the user’s finger points to some words and sentences in the book, it can be retrieved from various search websites Query the corresponding meaning of the word and sentence, and display the found content in the blank area of the book. Further, if the user writes some notes in a blank area of the book, the notes can be saved to the storage space pointed to by the designated account.
  • the designated account may be an account of a cloud storage space.
  • the specified processing of the character string to be recognized can be realized through the instruction action, which can facilitate the user to obtain the target character string corresponding to the character string to be recognized when reading, and can improve the assisting understanding of the character string to be recognized when reading.
  • String the new information generated during the reading or teaching process can also be stored, which can be the user's ability to view useful information generated during the reading or teaching at any time in the future.
  • the embodiment of the application also provides a reading assistance device corresponding to the reading assistance method. Since the device in the embodiment of the application solves the problem in principle similar to the foregoing embodiment of the reading assistance method, in this embodiment
  • the implementation of the device can refer to the description in the embodiment of the above method, and the repetitive parts will not be repeated.
  • FIG. 6 is a schematic diagram of functional modules of a reading aid device provided by an embodiment of the present application.
  • Each module in the reading aid device in this embodiment is used to execute each step in the foregoing method embodiment.
  • the reading aid device includes: an acquisition module 301, an identification module 302, a processing module 303, and a first projection module 304; among them,
  • the collection module 301 is configured to collect first image data in a target area, where the first image data includes an instruction action image of a target object, and the reading content is displayed in the target area;
  • the recognition module 302 is configured to recognize the instruction action image to determine the character string to be recognized corresponding to the instruction action image;
  • the processing module 303 is configured to perform designated processing on the character string to be recognized to determine a target character string corresponding to the character string to be recognized;
  • the first projection module 304 is configured to display the target character string in the target area.
  • the first projection module 304 is configured to:
  • the target character string is projected to the blank area of the target area for display.
  • the reading aid device may further include a storage module 305 for:
  • the storage module 305 is also used for:
  • the updated content is stored in association with the electronic reading.
  • the first projection module 304 is configured to:
  • the target character string is projected to the display surface for display.
  • the processing module 303 is configured to:
  • the interpretation document corresponding to the character string to be recognized is retrieved, and the interpretation document is used as the target character string corresponding to the character string to be recognized.
  • the reading aid device may further include:
  • the second projection module 306 is configured to project an electronic reading into the target area, and the character string to be recognized is a character string in the electronic reading.
  • embodiments of the present application also provide a computer-readable storage medium having a computer program stored on the computer-readable storage medium, and the computer program executes the steps of the reading assistance method described in the above method embodiment when the computer program is run by a processor. .
  • the computer program product of the reading assistance method provided by the embodiment of the present application includes a computer-readable storage medium storing program code, and the instructions included in the program code can be used to execute the steps of the reading assistance method described in the above method embodiment
  • the above method embodiment which will not be repeated here.
  • each block in the flowchart or block diagram may represent a module, program segment, or part of the code, and the module, program segment, or part of the code contains one or more functions for realizing the specified logical function. Executable instructions. It should also be noted that in some alternative implementations, the functions marked in the block may also occur in a different order from the order marked in the drawings.
  • each block in the block diagram and/or flowchart, and the combination of the blocks in the block diagram and/or flowchart can be implemented by a dedicated hardware-based system that performs the specified functions or actions Or it can be realized by a combination of dedicated hardware and computer instructions.
  • the functional modules in the various embodiments of the present application may be integrated together to form an independent part, or each module may exist alone, or two or more modules may be integrated to form an independent part.
  • the function is implemented in the form of a software function module and sold or used as an independent product, it can be stored in a computer readable storage medium.
  • the technical solution of the present application essentially or the part that contributes to the existing technology or the part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium, including Several instructions are used to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the methods described in the various embodiments of the present application.
  • the aforementioned storage media include: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disks or optical disks and other media that can store program codes.
  • ROM read-only memory
  • RAM random access memory
  • relational terms such as first and second are only used to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply one of these entities or operations. There is any such actual relationship or order between.
  • the terms “include”, “include” or any other variants thereof are intended to cover non-exclusive inclusion, so that a process, method, article or device including a series of elements not only includes those elements, but also includes those that are not explicitly listed Other elements of, or also include elements inherent to this process, method, article or equipment. If there are no more restrictions, the element defined by the sentence "including" does not exclude the existence of other same elements in the process, method, article or equipment that includes the element.

Abstract

Provided in the present application are a reading assistance method and apparatus, and an electronic device. The method comprises: collecting first image data in a target region, the first image data comprising an indication action image of a target object, and reading content being displayed in the target region; recognizing the indication action image so as to determine a character string to be recognized that corresponds to the indication action image; performing designated processing on said character string so as to determine a target character string corresponding to the character string to be recognized; and displaying the target character string in the target region.

Description

阅读辅助方法、装置及电子设备Reading assistance method, device and electronic equipment 技术领域Technical field
本申请涉及图像处理技术领域,具体而言,涉及一种阅读辅助方法、装置及电子设备。This application relates to the field of image processing technology, and in particular, to a reading assistance method, device, and electronic equipment.
背景技术Background technique
目前的阅读模式一般是:1)实体书本承载阅读内容;2)电子设备上显示需要阅读的内容,上述的两种阅读方式,在遇到陌生的内容可以通过手机等电子设备查阅相关内容。就阅读而言,但是上面两种方式的阅读效率相对较低。The current reading mode is generally: 1) physical books carry reading content; 2) electronic devices display the content that needs to be read. In the above two reading methods, when encountering unfamiliar content, you can check related content through electronic devices such as mobile phones. As far as reading is concerned, the reading efficiency of the above two methods is relatively low.
发明内容Summary of the invention
有鉴于此,本申请实施例的目的在于提供一种阅读辅助方法、装置及电子设备。能够达到能够协助用户处理阅读物,从而提高阅读效果。In view of this, the purpose of the embodiments of the present application is to provide a reading assistance method, device, and electronic equipment. It can assist the user to process the reading, thereby improving the reading effect.
第一方面,实施例提供一种阅读辅助方法,包括:In the first aspect, an embodiment provides a reading assistance method, including:
采集目标区域中的第一图像数据,所述第一图像数据中包括目标对象的指示动作图像,所述目标区域中显示有阅读内容;Acquiring first image data in a target area, where the first image data includes an instruction action image of a target object, and reading content is displayed in the target area;
对所述指示动作图像进行识别,以确定出所述指示动作图像对应的待识别字符串;Recognizing the instruction action image to determine the character string to be recognized corresponding to the instruction action image;
将所述待识别字符串进行指定处理,以确定出与所述待识别字符串对应的目标字符串;Performing designation processing on the character string to be recognized to determine a target character string corresponding to the character string to be recognized;
将所述目标字符串在所述目标区域中进行显示。The target character string is displayed in the target area.
在可选的实施方式中,所述将所述目标字符串在所述目标区域中进行 显示的步骤,包括:In an optional implementation manner, the step of displaying the target character string in the target area includes:
采集所述目标区域的第二图像数据;Collecting second image data of the target area;
通过识别所述第二图像数据,以确定所述目标区域是否包含空白区域;By identifying the second image data to determine whether the target area includes a blank area;
若所述目标区域包含空白区域,则将所述目标字符串投射至所述目标区域的所述空白区域进行显示。If the target area includes a blank area, the target character string is projected to the blank area of the target area for display.
本申请实施例提供的阅读辅助方法,还可以将目标字符串显示在空白区域,可以避免目标字符串挡住用户可能需要阅读的内容,降低阅读物的显示效果,影响客户的体验。The reading assistance method provided by the embodiment of the present application can also display the target character string in a blank area, which can prevent the target character string from blocking the content that the user may need to read, reduce the display effect of the reading, and affect the customer experience.
在可选的实施方式中,所述方法还包括:In an optional embodiment, the method further includes:
按照预设周期采集所述目标区域的第三图像数据;Collecting the third image data of the target area according to a preset period;
在采集到的所述第三图像数据时,识别所述第三图像数据相对于当前时刻的前一时刻采集到的图像数据是否存在更新内容;When the third image data is collected, identifying whether the third image data has updated content relative to the image data collected at the previous moment of the current moment;
若存在更新内容,将所述更新内容进行存储。If there is updated content, store the updated content.
本申请实施例提供的阅读辅助方法,还可以当存在更新内容时,可以将更新内容进行存储,可以方便用户后续查询阅读过程中产生的内容。The reading assistance method provided by the embodiment of the present application can also store the updated content when there is updated content, which can facilitate the user to subsequently query the content generated during the reading process.
在可选的实施方式中,所述若存在更新内容,将所述更新内容进行存储的步骤,包括:In an optional implementation manner, if there is updated content, the step of storing the updated content includes:
若检测到所述目标区域中显示有投影的电子读物,则将所述更新内容与所述电子读物关联存储。If it is detected that the projected electronic reading is displayed in the target area, the updated content is stored in association with the electronic reading.
本申请实施例提供的阅读辅助方法,通过将更新内容与电子读物关联存储,从而可以方便用户在查阅电子读物时,也能够关联查询到更新内容。In the reading assistance method provided by the embodiments of the present application, the updated content is stored in association with the electronic reading, so that it is convenient for the user to query the updated content when viewing the electronic reading.
在可选的实施方式中,所述将所述目标字符串在所述目标区域中进行显示的步骤,包括:In an optional implementation manner, the step of displaying the target character string in the target area includes:
若检测到所述目标区域中摆放有实体阅读物,则采集所述目标区域的第四图像数据;If it is detected that a physical reading object is placed in the target area, collecting fourth image data of the target area;
根据所述第四图像数据确定出所述实体阅读物对应的显示面;Determining the display surface corresponding to the physical reading object according to the fourth image data;
将所述目标字符串投射至所述显示面进行显示。The target character string is projected to the display surface for display.
本申请实施例提供的阅读辅助方法,由于实体的阅读物可能不是标准的平面,如果按照平面现实目标字符串可能会导致字符串错位等现象,从而通过先对显示面的确定,再基于显示面显示目标字符串,从而可以使显示效果更符合人眼需要的视觉效果。In the reading assistance method provided by the embodiments of the present application, since the physical reading object may not be a standard plane, if the target character string is displayed on the plane, it may cause the character string misalignment, etc., so that the display surface is determined first, and then based on the display surface. The target character string is displayed, so that the display effect can be more in line with the visual effect required by the human eye.
在可选的实施方式中,所述将所述待识别字符串进行指定处理,以确定出与所述待识别字符串对应的目标字符串的步骤,包括:In an optional implementation manner, the step of performing designation processing on the character string to be recognized to determine a target character string corresponding to the character string to be recognized includes:
将所述待识别字符串进行翻译,以得到所述待识别字符串在目标语种下的目标字符串,或/及,Translating the character string to be recognized to obtain the target character string of the character string to be recognized in the target language, or/and,
检索所述待识别字符串对应的解释文献,将所述解释文献作为所述待识别字符串对应的目标字符串。The interpretation document corresponding to the character string to be recognized is retrieved, and the interpretation document is used as the target character string corresponding to the character string to be recognized.
本申请实施例提供的阅读辅助方法,还可以对待识别字符串进行翻译或解释,能够减少用户在阅读过程中的查询操作,提高阅读体验。The reading assistance method provided by the embodiment of the present application can also translate or interpret the character string to be recognized, which can reduce the user's query operations during the reading process and improve the reading experience.
在可选的实施方式中,所述方法还包括:In an optional embodiment, the method further includes:
将电子读物投影至所述目标区域中,所述待识别字符串为所述电子读物中的字符串。Projecting an electronic reading into the target area, and the character string to be identified is a character string in the electronic reading.
本申请实施例提供的阅读辅助方法,还可以直接对需要阅读的电子读物进行投影,从而可以方便用户阅读更多内容。The reading assistance method provided by the embodiment of the present application can also directly project the electronic reading material that needs to be read, so that it is convenient for the user to read more content.
第二方面,实施例提供一种阅读辅助装置,包括:In a second aspect, an embodiment provides a reading aid device, including:
采集模块,用于采集目标区域中的第一图像数据,所述第一图像数据中包括目标对象的指示动作图像;An acquisition module, configured to acquire first image data in a target area, the first image data including an instruction action image of the target object;
识别模块,用于对所述指示动作图像进行识别,以确定出所述指示动作图像对应的待识别字符串;A recognition module, configured to recognize the instruction action image to determine the character string to be recognized corresponding to the instruction action image;
处理模块,用于将所述待识别字符串进行指定处理,以确定出与所述待识别字符串对应的目标字符串;A processing module, configured to perform designated processing on the character string to be recognized to determine a target character string corresponding to the character string to be recognized;
第一投影模块,用于将所述目标字符串在所述目标区域中进行显示。The first projection module is configured to display the target character string in the target area.
第三方面,实施例提供一种电子设备,包括:处理器、存储器,所述存储器存储有所述处理器可执行的机器可读指令,当电子设备运行时,所述机器可读指令被所述处理器执行时执行如前述实施方式任一所述的方法的步骤。In a third aspect, an embodiment provides an electronic device including: a processor and a memory, the memory stores machine-readable instructions executable by the processor, and when the electronic device is running, the machine-readable instructions are When the processor is executed, the steps of the method described in any of the foregoing embodiments are executed.
第四方面,实施例提供一种计算机可读存储介质,该计算机可读存储介质上存储有计算机程序,该计算机程序被处理器运行时执行如前述实施方式任一所述的方法的步骤。In a fourth aspect, an embodiment provides a computer-readable storage medium with a computer program stored on the computer-readable storage medium, and the computer program executes the steps of the method described in any of the foregoing embodiments when the computer program is run by a processor.
本申请实施例提供的阅读辅助方法、装置、电子设备及计算机可读存储介质,采用通过指示动作就能够实现对待识别字符串的指定处理,可以方便用户阅读时,得到与待识别字符串对应的目标字符串,可以提高阅读时辅助了解待识别字符串。The reading assistance method, device, electronic equipment, and computer-readable storage medium provided by the embodiments of the present application can realize the specified processing of the character string to be recognized through instruction actions, which can facilitate the user to obtain the corresponding character string to be recognized when reading The target character string can improve the understanding of the character string to be recognized when reading.
为使本申请的上述目的、特征和优点能更明显易懂,下文特举实施例,并配合所附附图,作详细说明如下。In order to make the above-mentioned objectives, features and advantages of the present application more obvious and understandable, the following embodiments are specially cited in conjunction with the accompanying drawings, and detailed descriptions are made as follows.
附图说明Description of the drawings
为了更清楚地说明本申请实施例的技术方案,下面将对实施例中所需要使用的附图作简单地介绍,应当理解,以下附图仅示出了本申请的某些实施例,因此不应被看作是对范围的限定,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他相关的附图。In order to more clearly illustrate the technical solutions of the embodiments of the present application, the following will briefly introduce the drawings needed in the embodiments. It should be understood that the following drawings only show certain embodiments of the present application, and therefore do not It should be regarded as a limitation of the scope. For those of ordinary skill in the art, other related drawings can be obtained based on these drawings without creative work.
图1为本申请实施例提供的电子设备的方框示意图。FIG. 1 is a schematic block diagram of an electronic device provided by an embodiment of the application.
图2为本申请实施例提供的阅读辅助方法的流程图。FIG. 2 is a flowchart of a reading assistance method provided by an embodiment of the application.
图3为本申请实施例提供的阅读辅助方法的步骤204的详细流程图。FIG. 3 is a detailed flowchart of step 204 of the reading assistance method provided by an embodiment of the application.
图4为本申请实施例提供的阅读辅助方法的步骤204的详细流程图。FIG. 4 is a detailed flowchart of step 204 of the reading assistance method provided by an embodiment of the application.
图5为本申请实施例提供的阅读辅助方法的部分流程图。FIG. 5 is a partial flowchart of a reading assistance method provided by an embodiment of this application.
图6为本申请实施例提供的阅读辅助装置的功能模块示意图。FIG. 6 is a schematic diagram of functional modules of a reading aid device provided by an embodiment of the application.
具体实施方式Detailed ways
下面将结合本申请实施例中附图,对本申请实施例中的技术方案进行描述。The technical solutions in the embodiments of the present application will be described below in conjunction with the drawings in the embodiments of the present application.
应注意到:相似的标号和字母在下面的附图中表示类似项,因此,一旦某一项在一个附图中被定义,则在随后的附图中不需要对其进行进一步定义和解释。同时,在本申请的描述中,术语“第一”、“第二”等仅用于区分描述,而不能理解为指示或暗示相对重要性。It should be noted that similar reference numerals and letters indicate similar items in the following figures. Therefore, once a certain item is defined in one figure, it does not need to be further defined and explained in subsequent figures. At the same time, in the description of this application, the terms "first", "second", etc. are only used to distinguish the description, and cannot be understood as indicating or implying relative importance.
实施例一Example one
为便于对本实施例进行理解,首先对执行本申请实施例所公开的一种阅读辅助方法的电子设备进行详细介绍。To facilitate the understanding of this embodiment, first, an electronic device that executes a reading assistance method disclosed in the embodiment of this application is introduced in detail.
如图1所示,是电子设备的方框示意图。电子设备100可以包括存储器111、存储控制器112、处理器113、外设接口114、输入输出单元115、采集单元116投影仪117及射频单元118。本领域普通技术人员可以理解,图1所示的结构仅为示意,其并不对电子设备100的结构造成限定。例如,电子设备100还可包括比图1中所示更多或者更少的组件,或者具有与图1所示不同的配置。As shown in Figure 1, it is a block diagram of an electronic device. The electronic device 100 may include a memory 111, a storage controller 112, a processor 113, a peripheral interface 114, an input and output unit 115, a collection unit 116, a projector 117, and a radio frequency unit 118. Those of ordinary skill in the art can understand that the structure shown in FIG. 1 is only for illustration, and does not limit the structure of the electronic device 100. For example, the electronic device 100 may also include more or fewer components than those shown in FIG. 1 or have a different configuration from that shown in FIG. 1.
上述的存储器111、存储控制器112、处理器113、外设接口114、输入输出单元115及采集单元116各元件相互之间直接或间接地电性连接,以实现数据的传输或交互。例如,这些元件相互之间可通过一条或多条通讯总线或信号线实现电性连接。上述的处理器113用于执行存储器中存储的可执行模块。The aforementioned components of the memory 111, the storage controller 112, the processor 113, the peripheral interface 114, the input output unit 115, and the collection unit 116 are directly or indirectly electrically connected to each other to realize data transmission or interaction. For example, these components can be electrically connected to each other through one or more communication buses or signal lines. The aforementioned processor 113 is used to execute executable modules stored in the memory.
其中,存储器111可以是,但不限于,随机存取存储器(Random Access Memory,简称RAM),只读存储器(Read Only Memory,简称ROM),可编程只读存储器(Programmable Read-Only Memory,简称PROM),可擦除只读存储器(Erasable Programmable Read-Only Memory,简称EPROM), 电可擦除只读存储器(Electric Erasable Programmable Read-Only Memory,简称EEPROM)等。其中,存储器111用于存储程序,所述处理器113在接收到执行指令后,执行所述程序,本申请实施例任一实施例揭示的过程定义的电子设备100所执行的方法可以应用于处理器113中,或者由处理器113实现。The memory 111 may be, but is not limited to, random access memory (Random Access Memory, RAM for short), Read Only Memory (ROM for short), Programmable Read-Only Memory (PROM for short) ), Erasable Programmable Read-Only Memory (EPROM), Electrical Erasable Programmable Read-Only Memory (EEPROM), etc. The memory 111 is used to store a program, and the processor 113 executes the program after receiving an execution instruction. The method executed by the electronic device 100 of the process definition disclosed in any embodiment of the present application can be applied to processing In the processor 113, or implemented by the processor 113.
上述的处理器113可能是一种集成电路芯片,具有信号的处理能力。上述的处理器113可以是通用处理器,包括中央处理器(Central Processing Unit,简称CPU)、网络处理器(Network Processor,简称NP)等;还可以是数字信号处理器(digital signal processor,简称DSP)、专用集成电路(Application Specific Integrated Circuit,简称ASIC)、现场可编程门阵列(FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。可以实现或者执行本申请实施例中的公开的各方法、步骤及逻辑框图。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。The aforementioned processor 113 may be an integrated circuit chip with signal processing capability. The aforementioned processor 113 may be a general-purpose processor, including a central processing unit (Central Processing Unit, CPU for short), a network processor (Network Processor, NP for short), etc.; it may also be a digital signal processor (DSP for short). ), Application Specific Integrated Circuit (ASIC), Field Programmable Gate Array (FPGA) or other programmable logic devices, discrete gates or transistor logic devices, discrete hardware components. The methods, steps, and logical block diagrams disclosed in the embodiments of the present application can be implemented or executed. The general-purpose processor may be a microprocessor or the processor may also be any conventional processor or the like.
上述的外设接口114将各种输入/输出装置耦合至处理器113以及存储器111。在一些实施例中,外设接口114,处理器113以及存储控制器112可以在单个芯片中实现。在其他一些实例中,他们可以分别由独立的芯片实现。The aforementioned peripheral interface 114 couples various input/output devices to the processor 113 and the memory 111. In some embodiments, the peripheral interface 114, the processor 113, and the storage controller 112 may be implemented in a single chip. In some other instances, they can be implemented by independent chips.
上述的输入输出单元115用于提供给用户输入数据。所述输入输出单元115可以是,但不限于,鼠标和键盘等。The aforementioned input and output unit 115 is used to provide input data to the user. The input and output unit 115 may be, but is not limited to, a mouse, a keyboard, and the like.
上述的采集单元116用于拍摄图像(例如照片、视频等),并且将所拍摄的图像进行存储,以供其它组件使用。可选地,采集单元116可以是一RGB-D(Red Green Blue-Deep)相机。该采集单元116可用于拍摄深度图像。The aforementioned acquisition unit 116 is used to capture images (for example, photos, videos, etc.), and to store the captured images for use by other components. Optionally, the acquisition unit 116 may be an RGB-D (Red Green Blue-Deep) camera. The acquisition unit 116 can be used to capture depth images.
可选地,本实施例中的电子设备100还可以包括投影仪117,该投影仪117包括光源、投影光学系统等投影元件,用于实现画面的投影。Optionally, the electronic device 100 in this embodiment may further include a projector 117, and the projector 117 includes a light source, a projection optical system, and other projection elements, which are used to realize image projection.
射频(Radio Frequency,简称:RF)单元118用于接收以及发送电磁 波,实现电磁波与电信号的相互转换,从而与通讯网络或者其他设备进行通讯。射频单元118可包括各种现有的用于执行这些功能的电路元件,例如,天线、射频收发器、数字信号处理器、加密/解密芯片、用户身份模块(SIM)卡、存储器等。射频单元118可与各种网络如互联网、企业内部网、无线网络进行通讯或者通过无线网络与其他设备进行通讯。上述的无线网络可包括蜂窝式电话网、无线局域网或者城域网。上述的无线网络可以使用各种通信标准、协议及技术,包括但并不限于全球移动通信系统(Global System for Mobile Communication,GSM)、增强型移动通信技术(Enhanced Data GSM Environment,EDGE),宽带码分多址技术(wideband code division multiple access,W-CDMA),码分多址技术(Code division access,CDMA)、时分多址技术(time division multiple access,TDMA),无线保真技术(Wireless,Fidelity,WiFi)(如美国电气和电子工程师协会标准IEEE 802.11a,IEEE 802.11b,IEEE802.11g和/或IEEE 802.11n)、网络电话(Voice over internet protocal,VoIP)、全球微波互联接入(Worldwide Interoperability for Microwave Access,Wi-Max)、其他用于邮件、即时通讯及短消息的协议,以及任何其他合适的通讯协议,甚至可包括那些当前仍未被开发出来的协议。本实施例中,通过上述的射频单元118可以实现电子设备100与外界设备的通信。A radio frequency (RF) unit 118 is used to receive and send electromagnetic waves, realize the mutual conversion between electromagnetic waves and electrical signals, and communicate with communication networks or other devices. The radio frequency unit 118 may include various existing circuit elements for performing these functions, for example, an antenna, a radio frequency transceiver, a digital signal processor, an encryption/decryption chip, a subscriber identity module (SIM) card, a memory, and so on. The radio frequency unit 118 can communicate with various networks such as the Internet, an intranet, and a wireless network, or communicate with other devices through a wireless network. The aforementioned wireless network may include a cellular telephone network, a wireless local area network, or a metropolitan area network. The above-mentioned wireless network can use various communication standards, protocols and technologies, including but not limited to Global System for Mobile Communication (GSM), Enhanced Data GSM Environment, EDGE, and broadband code Wideband code division multiple access (W-CDMA), code division multiple access (CDMA), time division multiple access (TDMA), wireless fidelity technology (Wireless, Fidelity) , WiFi) (such as the American Institute of Electrical and Electronics Engineers standards IEEE 802.11a, IEEE 802.11b, IEEE802.11g and/or IEEE 802.11n), Internet telephony (Voice over internet protocol, VoIP), Worldwide Interoperability for Microwave Access, Wi-Max), other protocols used for mail, instant messaging and short messages, and any other appropriate communication protocols, even those that have not yet been developed. In this embodiment, the above-mentioned radio frequency unit 118 can realize the communication between the electronic device 100 and external devices.
本实施例中的电子设备100可以用于执行本申请实施例提供的各个方法中的各个步骤。下面通过几个实施例详细描述阅读辅助方法的实现过程。The electronic device 100 in this embodiment can be used to execute each step in each method provided in the embodiment of the present application. The implementation process of the reading assistance method will be described in detail below through several embodiments.
实施例二Example two
请参阅图2,是本申请实施例提供的阅读辅助方法的流程图。下面将对图2所示的具体流程进行详细阐述。Please refer to FIG. 2, which is a flowchart of a reading assistance method provided by an embodiment of the present application. The specific process shown in FIG. 2 will be described in detail below.
步骤201,采集目标区域中的第一图像数据。Step 201: Collect first image data in the target area.
其中,第一图像数据中包括目标对象的指示动作图像。上述的目标对 象可以是任意能够指向目标区域中的内容的对象。示例性地,目标对象可以是笔、指挥棒、用户手指等。指示动作图像可以是目标对象接触到一字符串;也可以是目标对象位于一字符串的下方等。Wherein, the first image data includes an instruction action image of the target object. The above-mentioned target object can be any object that can point to the content in the target area. Illustratively, the target object may be a pen, a baton, a user's finger, and so on. The indicating action image can be that the target object touches a character string; it can also be that the target object is located below a character string.
本实施例中,目标区域中显示有阅读内容。示例性地,该阅读内容可以是投影显示的电子读物中的内容,也可以是摆放在目标区域中的实体阅读物中印刷的内容。In this embodiment, the reading content is displayed in the target area. Exemplarily, the reading content may be the content of the electronic reading displayed by projection, or the content printed in the physical reading placed in the target area.
可选地,上述的目标区域可以是一桌面。该桌面上可以摆设有实体阅读物。示例性地,该实体阅读物可以是小说书、外文书、学习资料等。Optionally, the aforementioned target area may be a desktop. Physical reading objects can be placed on the desktop. Exemplarily, the entity reading material may be a novel book, a foreign document, learning materials, and the like.
可选地,上述的目标区域可以显示有电子读物。该电子读物表征的内容可以是学习内容、小说片段等。可选地,承载该电子读物的目标区域可以是纯色墙面、纯色白纸等任意可以用于显示投影画面的界面。Optionally, electronic reading materials may be displayed in the above-mentioned target area. The content represented by the electronic book may be learning content, novel fragments, and so on. Optionally, the target area carrying the electronic reading can be any interface that can be used to display the projection screen, such as a solid-color wall surface, a solid-color white paper, or the like.
在一实施方式中,若目标区域中显示有投影,本实施例中的方法所述方法还包括:将电子读物投影至所述目标区域中。In one embodiment, if a projection is displayed in the target area, the method in this embodiment further includes: projecting the electronic reading into the target area.
示例性地,第一图像数据中则可以包括该电子读物中的字符串。Exemplarily, the first image data may include a character string in the electronic book.
步骤202,对指示动作图像进行识别,以确定出所述指示动作图像对应的待识别字符串。Step 202: Recognizing the indicating action image to determine the character string to be recognized corresponding to the indicating action image.
在一可选的实施方式中,步骤202可以包括:对目标对象的位置进行识别,确定出目标对象指向的目标位置;对该目标位置周边的内容进行文字识别,以提取出待识别字符串。In an alternative embodiment, step 202 may include: recognizing the location of the target object to determine the target location pointed to by the target object; performing text recognition on the content around the target location to extract the character string to be recognized.
示例性地,下面以目标对象为用户手指为例,对目标对象的位置进行检测进行描述。Exemplarily, the following describes the detection of the position of the target object by taking the target object as the user's finger as an example.
可选地,可以通过边缘检测对指示动作图像进行检测,以确定出用户手指的边缘。示例性地,可以将确定出的用户手指的边缘的指定方位作为目标位置。例如,可以将检测出的上方边缘作为目标位置。Optionally, the indicating action image can be detected by edge detection to determine the edge of the user's finger. Exemplarily, the determined specified orientation of the edge of the user's finger may be used as the target position. For example, the detected upper edge can be used as the target position.
可选地,也可以通过使用通过神经网络实现的分类模型,对指示动作图像中的内容进行分类,以筛选出指示动作图像中的用户手指,以及用户 手指所在区域。示例性地,可以将确定出的用户手指所在区域的指定位置作为目标位置。例如,可以将用户手指所在区域的左上方边缘作为目标位置。Optionally, it is also possible to classify the content in the indicating action image by using a classification model implemented by a neural network, so as to filter out the user's finger in the indicating action image and the area where the user's finger is located. Exemplarily, the determined designated position of the area where the user's finger is located may be used as the target position. For example, the upper left edge of the area where the user's finger is located can be used as the target position.
可选地,对该目标位置周边的内容进行文字识别,以提取出待识别字符串可以被实施为:使用神经网络模型对该目标位置周边的内容进行识别,以提取出待识别字符串。Optionally, performing text recognition on the content around the target location to extract the character string to be recognized may be implemented as: recognizing the content around the target location using a neural network model to extract the character string to be recognized.
可选地,对该目标位置周边的内容进行文字识别,以提取出待识别字符串可以被实施为:使用OCR(Optical Character Recognition,中文称:光学字符识别)模型对该目标位置周边的内容进行识别,以提取出待识别字符串。Optionally, performing text recognition on the content around the target location to extract the character string to be recognized can be implemented as: using an OCR (Optical Character Recognition, Chinese name: optical character recognition) model to perform text recognition on the content around the target location Recognition to extract the string to be recognized.
可选地,可以对目标位置周边未被目标对象覆盖的区域进行文字识别。可选地,可以仅对离目标位置最近的一行字符串进行识别。可选地,还可以仅对离目标位置最近的一个词、或一个句子进行文字识别。Optionally, text recognition can be performed on the area around the target location that is not covered by the target object. Optionally, only the line of character strings closest to the target location can be identified. Optionally, it is also possible to perform text recognition on only the word or sentence closest to the target location.
在一实施方式中,若目标区域可以显示有电子读物,则上述的待识别字符串为所述电子读物中的字符串。In one embodiment, if an electronic reading can be displayed in the target area, the above-mentioned character string to be identified is the character string in the electronic reading.
在一实施方式中,若目标区域摆设有实体阅读物,则上述的待识别字符串为从该实体阅读物中提取到的字符串。In one embodiment, if a physical reading object is placed in the target area, the above-mentioned character string to be identified is a character string extracted from the physical reading object.
步骤203,将所述待识别字符串进行指定处理,以确定出与所述待识别字符串对应的目标字符串。Step 203: Perform designation processing on the character string to be recognized to determine a target character string corresponding to the character string to be recognized.
在一种实施方式中,步骤203可以包括:将所述待识别字符串进行翻译,以得到所述待识别字符串在目标语种下的目标字符串。In an embodiment, step 203 may include: translating the character string to be recognized to obtain a target character string of the character string to be recognized in a target language.
示例性地,待识别字符串可以是第一种语言的文字,目标字符串可以是第二种语言的文字。上述的指定处理可以是对待识别字符串进行翻译。可选地,待识别字符串可以是英语、法语、意大利语等语种的文字。目标字符串可以是中文文字。例如,上述的待识别字符串可以是“patent”,目标字符串则可以是“专利”。可以知道的是,上述的待识别字符串和目标字 符串对应的语种仅仅是示例性地,本申请实施例并不对待识别字符串和目标字符串对应的语种为限。Exemplarily, the character string to be recognized may be a character in the first language, and the target character string may be a character in the second language. The above-mentioned designation process may be to translate the character string to be recognized. Optionally, the character string to be recognized may be characters in languages such as English, French, and Italian. The target string can be Chinese characters. For example, the aforementioned character string to be identified can be "patent", and the target character string can be "patent". It can be known that the language types corresponding to the above-mentioned character string to be recognized and the target character string are merely exemplary, and the embodiment of the present application does not limit the language types corresponding to the recognized character string and the target character string.
可选地,电子设备中可以安全有一翻译应用程序,还可以存储语言数据库。示例性地,在非网络环境中,上述的将所述待识别字符串进行翻译,以得到所述待识别字符串在目标语种下的目标字符串可以包括:通过该翻译应用程序将所述待识别字符串进行离线翻译,以得到所述待识别字符串在目标语种下的目标字符串。示例性地,若电子设备处于联网状态时,上述的将所述待识别字符串进行翻译,以得到所述待识别字符串在目标语种下的目标字符串可以包括:通过该翻译应用程序将所述待识别字符串进行在线翻译,以得到所述待识别字符串在目标语种下的目标字符串。Optionally, there may be a translation application in the electronic device, and a language database may also be stored. Exemplarily, in a non-network environment, translating the character string to be recognized to obtain the target character string of the character string to be recognized in the target language may include: translating the character string to be recognized through the translation application. The recognized character string is translated offline to obtain the target character string of the character string to be recognized in the target language. Exemplarily, if the electronic device is in a networked state, the above-mentioned translating the character string to be recognized to obtain the target character string of the character string to be recognized in the target language may include: translating the character string to the target language through the translation application. The character string to be recognized is translated online to obtain the target character string of the character string to be recognized in the target language.
在另一种实施方式中,步骤203可以包括:检索所述待识别字符串对应的解释文献,将所述解释文献作为所述待识别字符串对应的目标字符串。In another embodiment, step 203 may include: searching for the interpretation document corresponding to the character string to be recognized, and using the interpretation document as the target character string corresponding to the character string to be recognized.
示例性地,待识别字符串可以是专用名词的缩写,目标字符串可以是该缩写对应的全称。例如,待识别字符串可以是“IP”,则对应的目标字符串可以是“网际互连协议(Internet Protocol,简称IP)”。示例性地,如果一个缩写对应多项全称,则可以将所有的全称均显示。例如,待识别字符串可以是“CNN”,则对应的目标字符串可以是“美国有线电视新闻网(Cable News Network,简称CNN)”、“卷积神经网络(Convolutional Neural Networks,简称CNN)”等。Exemplarily, the character string to be recognized may be an abbreviation of a proper noun, and the target character string may be the full name corresponding to the abbreviation. For example, the character string to be identified may be "IP", and the corresponding target character string may be "Internet Protocol (IP)". Illustratively, if one abbreviation corresponds to multiple full names, all full names can be displayed. For example, the string to be identified can be "CNN", and the corresponding target string can be "Cable News Network (CNN)" or "Convolutional Neural Networks (CNN)" Wait.
示例性地,待识别字符串可以是成语,目标字符串可以是该成语对应的含义、或典故、或出处。例如,待识别字符串可以是“声如洪钟”,则对应的目标字符串可以是“形容说话或歌唱的声音洪亮,如同敲击大钟似的”。可选地,目标字符串还可以包括成语的出处。例如,上述实例中的目标字符串还可以包括“明·冯梦龙《东周列国志》第七十二回:忆胥目如闪电,声如洪钟。”Exemplarily, the character string to be recognized may be an idiom, and the target character string may be the meaning, allusion, or source corresponding to the idiom. For example, the character string to be recognized can be "sounds like a bell", and the corresponding target character string can be "speaking or singing loudly, like striking a bell". Optionally, the target string may also include the source of the idiom. For example, the target string in the above example can also include "Ming Feng Menglong's "Eastern Zhou Dynasty" Chapter 72: Recalling Xu's eyes is like lightning, and the sound is like a bell."
示例性地,待识别字符串可以是专业词,目标字符串可以是该成语对 应的解释。例如,待识别字符串可以是物理学领域中的“杠杆原理”,目标字符串可以是杠杆原理的解释“杠杆又分称费力杠杆、省力杠杆和等臂杠杆,杠杆原理也称为“杠杆平衡条件”。要使杠杆平衡,作用在杠杆上的两个力矩(力与力臂的乘积)大小必须相等”。Exemplarily, the character string to be recognized may be a professional word, and the target character string may be the corresponding interpretation of the idiom. For example, the character string to be identified can be the "leverage principle" in the field of physics, and the target string can be an explanation of the lever principle. Condition". To balance the lever, the two moments (the product of the force and the arm) acting on the lever must be equal in size."
可选地,电子设备中可以安装有搜索引擎。示例性地,若电子设备处于联网状态时,上述的检索所述待识别字符串对应的解释文献,将所述解释文献作为所述待识别字符串对应的目标字符串可以包括:使用搜索引擎检索所述待识别字符串对应的解释文献,将所述解释文献作为所述待识别字符串对应的目标字符串。Optionally, a search engine may be installed in the electronic device. Exemplarily, if the electronic device is in a networked state, the aforementioned retrieval of the interpretation document corresponding to the character string to be identified, and using the interpretation document as the target character string corresponding to the character string to be identified may include: using a search engine to retrieve For the interpretation document corresponding to the character string to be recognized, the interpretation document is used as the target character string corresponding to the character string to be recognized.
可选地,电子设备中也可以存储有本地数据库。示例性地,上述的检索所述待识别字符串对应的解释文献,将所述解释文献作为所述待识别字符串对应的目标字符串可以包括:在本地数据库中查询所述待识别字符串对应的解释文献,将所述解释文献作为所述待识别字符串对应的目标字符串。Optionally, a local database may also be stored in the electronic device. Exemplarily, searching for the interpretation document corresponding to the character string to be recognized, and using the interpretation document as the target character string corresponding to the character string to be recognized may include: querying a local database corresponding to the character string to be recognized Interpretation document, the interpretive document is used as the target character string corresponding to the character string to be identified.
步骤204,将所述目标字符串在所述目标区域中进行显示。Step 204: Display the target character string in the target area.
可选地,目标字符串可以显示在不遮挡目标区域中显示的电子读物,或实体阅读物中的文字部分的区域。可选地,目标字符串也可以显示在已阅读的区域中,例如,可以将目标字符串显示在待识别字符串所在位置的上方。Optionally, the target character string may be displayed in an area that does not obscure the electronic reading displayed in the target area, or the text part of the physical reading. Optionally, the target character string can also be displayed in the read area, for example, the target character string can be displayed above the position of the character string to be recognized.
可选地,可以通过投影的方式将目标字符串投影至目标区域进行显示。可选地,目标区域可以是一电子显示器,则可以将目标字符串显示在该电子显示器中。Optionally, the target character string can be projected to the target area for display by means of projection. Optionally, the target area may be an electronic display, and the target character string may be displayed on the electronic display.
可选地,如图3所示,步骤204包括可以包括以下步骤。Optionally, as shown in FIG. 3, step 204 may include the following steps.
步骤2041,采集所述目标区域的第二图像数据。Step 2041: Collect second image data of the target area.
步骤2042,通过识别所述第二图像数据,以确定所述目标区域是否包含空白区域。Step 2042: Determine whether the target area includes a blank area by identifying the second image data.
示例性地,上述的空白区域可以表示目标区域中未映射有字符或图像的区域。Exemplarily, the aforementioned blank area may indicate an area in the target area where no characters or images are mapped.
可选地,可以通过对第二图像数据中的各个像素点颜色的识别,确定出目标区域的空白区域。示例性地,目标区域的颜色可以是指定颜色。该指定颜色可以是白色、绿色、黑色等。例如,颜色数值在该指定颜色对应的数值区间内的像素点确定为指定颜色,则指定数量的像素点形成的区域中的所有像素被确定为指定颜色时,则可以判断指定数量的像素点形成的区域为指定颜色区域,也就是可判定为空白区域。Optionally, the blank area of the target area can be determined by recognizing the color of each pixel in the second image data. Illustratively, the color of the target area may be a designated color. The designated color can be white, green, black, etc. For example, if the color value is determined to be the specified color in the pixel point corresponding to the specified color, and all pixels in the area formed by the specified number of pixels are determined to be the specified color, it can be determined that the specified number of pixels are formed The area is the designated color area, that is, it can be judged as a blank area.
步骤2043,若所述目标区域包含空白区域,则将所述目标字符串投射至所述目标区域的所述空白区域进行显示。Step 2043: If the target area includes a blank area, project the target character string to the blank area of the target area for display.
步骤2044,若所述目标区域不包含空白区域,则将所述目标字符串投射至所述目标区域的所述待识别字符串的前指定行字符串所在位置。Step 2044: If the target area does not include a blank area, project the target character string to the position of the character string in the previous designated line of the character string to be recognized in the target area.
可选地,可以调整电子设备的投影仪的投影光学系统,以调整投影的目标字符串的投影角度,从而实现将所述目标字符串投射至所述目标区域的所述空白区域或待识别字符串的前指定行字符串所在位置进行显示。Optionally, the projection optical system of the projector of the electronic device can be adjusted to adjust the projection angle of the projected target character string, so as to realize the projecting of the target character string to the blank area of the target area or the character to be recognized The position of the specified line string before the string is displayed.
通过上述的显示方式,可以避免显示的目标字符串将用户需要阅读的区域遮挡。Through the above-mentioned display mode, it is possible to prevent the displayed target character string from obscuring the area that the user needs to read.
可选地,目标字符串可以与承载显示目标字符串的表面匹配。例如,承载显示目标字符串的表面为平面时,则可以按照平面的显示标准将目标字符串显示在平面上。再例如,承载显示目标字符串的表面为曲面时,则可以按照曲面的形状,将目标字符串映射在曲面上显示。Optionally, the target character string can be matched with the surface carrying the display target character string. For example, when the surface bearing the display target character string is a flat surface, the target character string can be displayed on the flat surface according to the display standard of the flat surface. For another example, when the surface bearing the display target character string is a curved surface, the target character string can be mapped and displayed on the curved surface according to the shape of the curved surface.
可选地,若目标区域中摆放有实体阅读物,则实体阅读物为打开状态时,可能会导致可用于投影画面的实体阅读物体上的面为非平面,在此基础上,可以将投影的图像按照实体阅读物体所呈现的非平面进行校正。如图4所示,步骤204可以包括以下步骤。Optionally, if a physical reading object is placed in the target area, when the physical reading object is open, it may cause the surface of the physical reading object that can be used to project the screen to be non-planar. On this basis, the projection can be The image of is corrected according to the non-plane presented by the entity reading object. As shown in FIG. 4, step 204 may include the following steps.
步骤2045,若检测到目标区域中摆放有实体阅读物,则采集所述目标 区域的第四图像数据。Step 2045: If it is detected that a physical reading object is placed in the target area, collect fourth image data of the target area.
可选地,第四图像数据可以是深度图像。第四图像数据中的各个像素表示采集设备与第四图像数据中的物体的距离。Optionally, the fourth image data may be a depth image. Each pixel in the fourth image data represents the distance between the collection device and the object in the fourth image data.
可选地,可以通过采集目标区域的深度图像确定目标区域中是否摆放有实体阅读物。例如,采集单元可以在于目标区域中的任意包含有待阅读的字符串的平面平行时,采集一目标深度图像,判断该深度图像中目标区域对应的各个像素的像素值是否相等,若各个像素的像素值不都相等,则表示目标区域中摆放有实体阅读物;若各个像素的像素值相等,则表示目标区域中未摆放任何物品,目标区域可能投影有电子读物。Optionally, a depth image of the target area can be collected to determine whether a physical reading object is placed in the target area. For example, the collecting unit may collect a target depth image when any plane in the target area containing the character string to be read is parallel, and judge whether the pixel value of each pixel corresponding to the target area in the depth image is equal. If the values are not all equal, it means that there is a physical reading material in the target area; if the pixel value of each pixel is equal, it means that there is no object in the target area, and the target area may have an electronic reading material projected.
步骤2046,根据所述第四图像数据确定出所述实体阅读物对应的显示面。Step 2046: Determine the display surface corresponding to the physical reading object according to the fourth image data.
可选地,通过每个像素点的像素确定出实体阅读物所在位置,以及实体阅读物的表面形成的平面。Optionally, the location of the physical reading object and the plane formed by the surface of the physical reading object are determined by the pixels of each pixel.
步骤2047,将所述目标字符串投射至所述显示面进行显示。Step 2047: Project the target character string to the display surface for display.
示例性地,可以将上述的显示面进行网格细分,求解投影仪的预扭曲矩阵,并实时进行几何校正。Exemplarily, the above-mentioned display surface can be subdivided into grids, the pre-distortion matrix of the projector can be solved, and geometric correction can be performed in real time.
示例性地,在对应显示面中,针对每个三角形网格,利用其本身的三个顶点,以及至少两个相邻节点,使用最小二乘法求取对应的预扭曲矩阵M,最后采用与平面幕类似的方法对投影前缓冲区内的对应区域进行预扭曲。Exemplarily, in the corresponding display surface, for each triangle mesh, use its own three vertices and at least two adjacent nodes to obtain the corresponding pre-distortion matrix M using the least square method, and finally use the The screen-like method pre-distorted the corresponding area in the buffer before projection.
可选地,如图5所示,本实施例中的阅读辅助方法还可以包括以下步骤。Optionally, as shown in FIG. 5, the reading assistance method in this embodiment may further include the following steps.
步骤205,按照预设周期采集所述目标区域的第三图像数据。Step 205: Collect third image data of the target area according to a preset period.
步骤206,在采集到的所述第三图像数据时,识别所述第三图像数据相对于当前时刻的前一时刻采集到的图像数据是否存在更新内容。Step 206: When the third image data is collected, identify whether the third image data has updated content relative to the image data collected at the previous moment of the current moment.
在一种实施方式中,可以将第三图像数据与当前时刻的前一时刻采集 到的图像数据进行像素对比,以确定第三图像数据与当前时刻的前一时刻采集到的图像数据是否存在差异,若存在差异,可以判定第三图像数据中存在更新内容。In one embodiment, the third image data can be compared with the image data collected at the previous moment of the current moment by pixel to determine whether there is a difference between the third image data and the image data collected at the previous moment of the current moment. If there is a difference, it can be determined that there is updated content in the third image data.
在另一种实施方式中,可以提取第三图像数据的第一图像特征;提取当前时刻的前一时刻采集到的图像数据的第二图像特征;然后计算第一图像特征及第二图像特征之间的欧式距离,若该欧式距离大于预设值,则可以判定第三图像数据中存在更新内容。In another embodiment, the first image feature of the third image data can be extracted; the second image feature of the image data collected at the previous moment of the current time can be extracted; and then the first image feature and the second image feature can be calculated. If the Euclidean distance is greater than the preset value, it can be determined that there is updated content in the third image data.
示例性地,若第三图像数据与当前时刻的前一时刻采集到的图像数据完全一样,则计算得到的欧氏距离为零。但是由于不同时间采集的图像可能会存在光线误差,因此,即使两张完全一样的画面,在不同时间采集的图像也可能存在差异。因此,上述预设值可以是一大于零的正数。其中,预设值的大小可以按照需求设置,本申请实施例并不以预设值的大小为限。Exemplarily, if the third image data is exactly the same as the image data collected at the moment before the current moment, the calculated Euclidean distance is zero. However, since the images collected at different times may have light errors, even if two images are exactly the same, the images collected at different times may be different. Therefore, the aforementioned preset value may be a positive number greater than zero. The size of the preset value can be set according to requirements, and the embodiment of the present application is not limited to the size of the preset value.
示例性地,更新内容可以是在目标区域作的笔记。示例性地,上述的更新内容可以不包括步骤201中提到的目标对象。Exemplarily, the updated content may be notes made in the target area. Exemplarily, the aforementioned update content may not include the target object mentioned in step 201.
步骤207,若存在更新内容,将所述更新内容进行存储。Step 207: If there is updated content, store the updated content.
可选地,步骤207可以包括:若检测到所述目标区域中显示有投影的电子读物,则将所述更新内容与所述电子读物关联存储。Optionally, step 207 may include: if it is detected that the projected electronic reading is displayed in the target area, then storing the updated content in association with the electronic reading.
可选地,还可以将更新内容的产生时间与该更新内容关联存储。Optionally, the generation time of the updated content can also be stored in association with the updated content.
可选地,步骤207也可以被实施为:若存在更新内容,且第三图像数据中检测到指定动作时,才将更新内容进行存储。Optionally, step 207 can also be implemented as: if there is updated content and the specified action is detected in the third image data, the updated content is stored.
可选地,指定动作可以是指向更新内容。Optionally, the designated action may be directed to update content.
可选地,指定动作可以是指向更新内容的时间持续指定时长。示例性地,第三图像数据可以包括多张图片,则连续采集的指定数量张图像均包括目标对象指向该更新内容时,则可以判定指向更新内容的时间持续指定时长。示例性地,第三图像数据可以是一视频,则指定时长的视频中均包括目标对象指向该更新内容时,则可以判定指向更新内容的时间持续指定 时长。Optionally, the designated action may be directed to the time to update the content for a designated period of time. Exemplarily, the third image data may include multiple pictures, and when the specified number of images continuously collected all include the target object pointing to the updated content, it can be determined that the time pointing to the updated content lasts for the specified period of time. Exemplarily, the third image data may be a video, and when the specified duration of the video includes the target object pointing to the updated content, it can be determined that the time pointing to the updated content lasts for the specified duration.
通过将纸质的更新内容电子化,并将电子化的更新内容进行存储,可以方便用户后续查询阅读过程中产生的内容。By digitizing the paper-based update content and storing the electronic update content, it is convenient for users to inquire about the content generated during the reading process.
下面通过两个实际应用场景描述本实施例中的方法的详细过程。The detailed process of the method in this embodiment is described below through two actual application scenarios.
在一个实例中,本申请实施例提供的方法可以用于教学;目标区域可以是教学用黑板或白板。该目标区域中当前投影有教学资料。当教棍指向任意内容时,可以显示该内容对应的注释。例如,当前目标区域中显示有英文课文,当教棍指向一英文单词,则可以在黑板或白板的空白区域上显示该英文单词对应的中文含义。进一步地,若老师在黑板上写了一些教学批注,则可以将该教学批注与当前投影显示的教学资料关联存储。In an example, the method provided in the embodiment of the present application can be used for teaching; the target area can be a blackboard or whiteboard for teaching. Teaching materials are currently projected in the target area. When the teaching stick points to any content, the annotation corresponding to the content can be displayed. For example, an English text is displayed in the current target area, and when the teaching stick points to an English word, the Chinese meaning corresponding to the English word can be displayed on the blank area of the blackboard or whiteboard. Further, if the teacher writes some teaching notes on the blackboard, the teaching notes can be stored in association with the teaching materials currently displayed on the projection.
在另一个实施例中,本申请实施例提供的方法可以用于个人阅读;目标区域中摆放有一本书籍,当用户的手指点向书中一些字词句时,则可以从各个搜索网站上查询该字词句对应的含义,并将查找到的内容显示在书籍的空白区域中。进一步地,若用户在书本的空白区域中写下一些笔记,则可以将该笔记保存至指定账号指向的存储空间。示例性地,该指定账号可以是以云存储空间的账号。In another embodiment, the method provided by the embodiment of this application can be used for personal reading; a book is placed in the target area, and when the user’s finger points to some words and sentences in the book, it can be retrieved from various search websites Query the corresponding meaning of the word and sentence, and display the found content in the blank area of the book. Further, if the user writes some notes in a blank area of the book, the notes can be saved to the storage space pointed to by the designated account. Exemplarily, the designated account may be an account of a cloud storage space.
通过本实施例中的上述方法,采用通过指示动作就能够实现对待识别字符串的指定处理,可以方便用户阅读时,得到与待识别字符串对应的目标字符串,可以提高阅读时辅助了解待识别字符串。进一步地,还可以将在阅读或教学过程中产生的新的信息进行存储,可以是用户能够在后续还能随时查看阅读或教学时产生的有用信息。Through the above method in this embodiment, the specified processing of the character string to be recognized can be realized through the instruction action, which can facilitate the user to obtain the target character string corresponding to the character string to be recognized when reading, and can improve the assisting understanding of the character string to be recognized when reading. String. Further, the new information generated during the reading or teaching process can also be stored, which can be the user's ability to view useful information generated during the reading or teaching at any time in the future.
实施例三Example three
基于同一申请构思,本申请实施例中还提供了与阅读辅助方法对应的阅读辅助装置,由于本申请实施例中的装置解决问题的原理与前述的阅读辅助方法实施例相似,因此本实施例中的装置的实施可以参见上述方法的 实施例中的描述,重复之处不再赘述。Based on the same application concept, the embodiment of the application also provides a reading assistance device corresponding to the reading assistance method. Since the device in the embodiment of the application solves the problem in principle similar to the foregoing embodiment of the reading assistance method, in this embodiment The implementation of the device can refer to the description in the embodiment of the above method, and the repetitive parts will not be repeated.
请参阅图6,是本申请实施例提供的阅读辅助装置的功能模块示意图。本实施例中的阅读辅助装置中的各个模块用于执行上述方法实施例中的各个步骤。阅读辅助装置包括:采集模块301、识别模块302、处理模块303及第一投影模块304;其中,Please refer to FIG. 6, which is a schematic diagram of functional modules of a reading aid device provided by an embodiment of the present application. Each module in the reading aid device in this embodiment is used to execute each step in the foregoing method embodiment. The reading aid device includes: an acquisition module 301, an identification module 302, a processing module 303, and a first projection module 304; among them,
采集模块301,用于采集目标区域中的第一图像数据,所述第一图像数据中包括目标对象的指示动作图像,目标区域中显示有阅读内容;The collection module 301 is configured to collect first image data in a target area, where the first image data includes an instruction action image of a target object, and the reading content is displayed in the target area;
识别模块302,用于对所述指示动作图像进行识别,以确定出所述指示动作图像对应的待识别字符串;The recognition module 302 is configured to recognize the instruction action image to determine the character string to be recognized corresponding to the instruction action image;
处理模块303,用于将所述待识别字符串进行指定处理,以确定出与所述待识别字符串对应的目标字符串;The processing module 303 is configured to perform designated processing on the character string to be recognized to determine a target character string corresponding to the character string to be recognized;
第一投影模块304,用于将所述目标字符串在所述目标区域中进行显示。The first projection module 304 is configured to display the target character string in the target area.
一种可能的实施方式中,第一投影模块304,用于:In a possible implementation manner, the first projection module 304 is configured to:
采集所述目标区域的第二图像数据;Collecting second image data of the target area;
通过识别所述第二图像数据,以确定所述目标区域是否包含空白区域;By identifying the second image data to determine whether the target area includes a blank area;
若所述目标区域包含空白区域,则将所述目标字符串投射至所述目标区域的所述空白区域进行显示。If the target area includes a blank area, the target character string is projected to the blank area of the target area for display.
一种可能的实施方式中,阅读辅助装置还可以包括存储模块305,用于:In a possible implementation manner, the reading aid device may further include a storage module 305 for:
按照预设周期采集所述目标区域的第三图像数据;Collecting the third image data of the target area according to a preset period;
在采集到的所述第三图像数据时,识别所述第三图像数据相对于当前时刻的前一时刻采集到的图像数据是否存在更新内容;When the third image data is collected, identifying whether the third image data has updated content relative to the image data collected at the previous moment of the current moment;
若存在更新内容,将所述更新内容进行存储。If there is updated content, store the updated content.
一种可能的实施方式中,存储模块305,还用于:In a possible implementation manner, the storage module 305 is also used for:
若检测到所述目标区域中显示有投影的电子读物,则将所述更新内容与所述电子读物关联存储。If it is detected that the projected electronic reading is displayed in the target area, the updated content is stored in association with the electronic reading.
一种可能的实施方式中,第一投影模块304,用于:In a possible implementation manner, the first projection module 304 is configured to:
若检测到所述目标区域中摆放有实体阅读物,则采集所述目标区域的第四图像数据;If it is detected that a physical reading object is placed in the target area, collecting fourth image data of the target area;
根据所述第四图像数据确定出所述实体阅读物对应的显示面;Determining the display surface corresponding to the physical reading object according to the fourth image data;
将所述目标字符串投射至所述显示面进行显示。The target character string is projected to the display surface for display.
一种可能的实施方式中,处理模块303,用于:In a possible implementation manner, the processing module 303 is configured to:
将所述待识别字符串进行翻译,以得到所述待识别字符串在目标语种下的目标字符串,或/及,Translating the character string to be recognized to obtain the target character string of the character string to be recognized in the target language, or/and,
检索所述待识别字符串对应的解释文献,将所述解释文献作为所述待识别字符串对应的目标字符串。The interpretation document corresponding to the character string to be recognized is retrieved, and the interpretation document is used as the target character string corresponding to the character string to be recognized.
一种可能的实施方式中,阅读辅助装置还可以包括:In a possible implementation manner, the reading aid device may further include:
第二投影模块306,用于将电子读物投影至所述目标区域中,所述待识别字符串为所述电子读物中的字符串。The second projection module 306 is configured to project an electronic reading into the target area, and the character string to be recognized is a character string in the electronic reading.
此外,本申请实施例还提供一种计算机可读存储介质,该计算机可读存储介质上存储有计算机程序,该计算机程序被处理器运行时执行上述方法实施例中所述的阅读辅助方法的步骤。In addition, the embodiments of the present application also provide a computer-readable storage medium having a computer program stored on the computer-readable storage medium, and the computer program executes the steps of the reading assistance method described in the above method embodiment when the computer program is run by a processor. .
本申请实施例所提供的阅读辅助方法的计算机程序产品,包括存储了程序代码的计算机可读存储介质,所述程序代码包括的指令可用于执行上述方法实施例中所述的阅读辅助方法的步骤,具体可参见上述方法实施例,在此不再赘述。The computer program product of the reading assistance method provided by the embodiment of the present application includes a computer-readable storage medium storing program code, and the instructions included in the program code can be used to execute the steps of the reading assistance method described in the above method embodiment For details, please refer to the above method embodiment, which will not be repeated here.
在本申请所提供的几个实施例中,应该理解到,所揭露的装置和方法,也可以通过其它的方式实现。以上所描述的装置实施例仅仅是示意性的,例如,附图中的流程图和框图显示了根据本申请的多个实施例的装置、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段或代码的一部分,所述模块、程序段或代码的一部分包含一个或多个用于实现规定的逻辑功能 的可执行指令。也应当注意,在有些作为替换的实现方式中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个连续的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或动作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。In the several embodiments provided in this application, it should be understood that the disclosed device and method may also be implemented in other ways. The device embodiments described above are merely illustrative. For example, the flowcharts and block diagrams in the accompanying drawings show the possible implementation architectures, functions, and functions of the devices, methods, and computer program products according to multiple embodiments of the present application. operating. In this regard, each block in the flowchart or block diagram may represent a module, program segment, or part of the code, and the module, program segment, or part of the code contains one or more functions for realizing the specified logical function. Executable instructions. It should also be noted that in some alternative implementations, the functions marked in the block may also occur in a different order from the order marked in the drawings. For example, two consecutive blocks can actually be executed substantially in parallel, or they can sometimes be executed in the reverse order, depending on the functions involved. It should also be noted that each block in the block diagram and/or flowchart, and the combination of the blocks in the block diagram and/or flowchart, can be implemented by a dedicated hardware-based system that performs the specified functions or actions Or it can be realized by a combination of dedicated hardware and computer instructions.
另外,在本申请各个实施例中的各功能模块可以集成在一起形成一个独立的部分,也可以是各个模块单独存在,也可以两个或两个以上模块集成形成一个独立的部分。In addition, the functional modules in the various embodiments of the present application may be integrated together to form an independent part, or each module may exist alone, or two or more modules may be integrated to form an independent part.
所述功能如果以软件功能模块的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。需要说明的是,在本文中,诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。If the function is implemented in the form of a software function module and sold or used as an independent product, it can be stored in a computer readable storage medium. Based on this understanding, the technical solution of the present application essentially or the part that contributes to the existing technology or the part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium, including Several instructions are used to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the methods described in the various embodiments of the present application. The aforementioned storage media include: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disks or optical disks and other media that can store program codes. . It should be noted that in this article, relational terms such as first and second are only used to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply one of these entities or operations. There is any such actual relationship or order between. Moreover, the terms "include", "include" or any other variants thereof are intended to cover non-exclusive inclusion, so that a process, method, article or device including a series of elements not only includes those elements, but also includes those that are not explicitly listed Other elements of, or also include elements inherent to this process, method, article or equipment. If there are no more restrictions, the element defined by the sentence "including..." does not exclude the existence of other same elements in the process, method, article or equipment that includes the element.
以上所述仅为本申请的优选实施例而已,并不用于限制本申请,对于本领域的技术人员来说,本申请可以有各种更改和变化。凡在本申请的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本申请的保护范围之内。应注意到:相似的标号和字母在下面的附图中表示类似项,因此,一旦某一项在一个附图中被定义,则在随后的附图中不需要对其进行进一步定义和解释。The above descriptions are only preferred embodiments of the application, and are not intended to limit the application. For those skilled in the art, the application can have various modifications and changes. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of this application shall be included in the protection scope of this application. It should be noted that similar reference numerals and letters indicate similar items in the following figures. Therefore, once a certain item is defined in one figure, it does not need to be further defined and explained in subsequent figures.
以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以权利要求的保护范围为准。The above are only specific implementations of this application, but the protection scope of this application is not limited to this. Any person skilled in the art can easily think of changes or substitutions within the technical scope disclosed in this application. Should be covered within the scope of protection of this application. Therefore, the protection scope of this application shall be subject to the protection scope of the claims.

Claims (10)

  1. 一种阅读辅助方法,其特征在于,包括:A reading assistance method, characterized in that it comprises:
    采集目标区域中的第一图像数据,所述第一图像数据中包括目标对象的指示动作图像,所述目标区域中显示有阅读内容;Acquiring first image data in a target area, where the first image data includes an instruction action image of a target object, and reading content is displayed in the target area;
    对所述指示动作图像进行识别,以确定出所述指示动作图像对应的待识别字符串;Recognizing the instruction action image to determine the character string to be recognized corresponding to the instruction action image;
    将所述待识别字符串进行指定处理,以确定出与所述待识别字符串对应的目标字符串;Performing designation processing on the character string to be recognized to determine a target character string corresponding to the character string to be recognized;
    将所述目标字符串在所述目标区域中进行显示。The target character string is displayed in the target area.
  2. 根据权利要求1所述的方法,其特征在于,所述将所述目标字符串在所述目标区域中进行显示的步骤,包括:The method according to claim 1, wherein the step of displaying the target character string in the target area comprises:
    采集所述目标区域的第二图像数据;Collecting second image data of the target area;
    通过识别所述第二图像数据,以确定所述目标区域是否包含空白区域;By identifying the second image data to determine whether the target area includes a blank area;
    若所述目标区域包含空白区域,则将所述目标字符串投射至所述目标区域的所述空白区域进行显示。If the target area includes a blank area, the target character string is projected to the blank area of the target area for display.
  3. 根据权利要求1所述的方法,其特征在于,所述方法还包括:The method according to claim 1, wherein the method further comprises:
    按照预设周期采集所述目标区域的第三图像数据;Collecting the third image data of the target area according to a preset period;
    在采集到的所述第三图像数据时,识别所述第三图像数据相对于当前时刻的前一时刻采集到的图像数据是否存在更新内容;When the third image data is collected, identifying whether the third image data has updated content relative to the image data collected at the previous moment of the current moment;
    若存在更新内容,将所述更新内容进行存储。If there is updated content, store the updated content.
  4. 根据权利要求3所述的方法,其特征在于,所述若存在更新内容,将所述更新内容进行存储的步骤,包括:The method according to claim 3, characterized in that, if there is updated content, the step of storing the updated content comprises:
    若检测到所述目标区域中显示有投影的电子读物,则将所述更新内容与所述电子读物关联存储。If it is detected that the projected electronic reading is displayed in the target area, the updated content is stored in association with the electronic reading.
  5. 根据权利要求1所述的方法,其特征在于,所述将所述目标字符串 在所述目标区域中进行显示的步骤,包括:The method according to claim 1, wherein the step of displaying the target character string in the target area comprises:
    若检测到所述目标区域中摆放有实体阅读物,则采集所述目标区域的第四图像数据;If it is detected that a physical reading object is placed in the target area, collecting fourth image data of the target area;
    根据所述第四图像数据确定出所述实体阅读物对应的显示面;Determining the display surface corresponding to the physical reading object according to the fourth image data;
    将所述目标字符串投射至所述显示面进行显示。The target character string is projected to the display surface for display.
  6. 根据权利要求1所述的方法,其特征在于,所述将所述待识别字符串进行指定处理,以确定出与所述待识别字符串对应的目标字符串的步骤,包括:The method according to claim 1, wherein the step of performing designation processing on the character string to be recognized to determine a target character string corresponding to the character string to be recognized comprises:
    将所述待识别字符串进行翻译,以得到所述待识别字符串在目标语种下的目标字符串,或/及,Translating the character string to be recognized to obtain the target character string of the character string to be recognized in the target language, or/and,
    检索所述待识别字符串对应的解释文献,将所述解释文献作为所述待识别字符串对应的目标字符串。The interpretation document corresponding to the character string to be recognized is retrieved, and the interpretation document is used as the target character string corresponding to the character string to be recognized.
  7. 根据权利要求1所述的方法,其特征在于,所述方法还包括:The method according to claim 1, wherein the method further comprises:
    将电子读物投影至所述目标区域中,所述待识别字符串为所述电子读物中的字符串。Projecting an electronic reading into the target area, and the character string to be identified is a character string in the electronic reading.
  8. 一种阅读辅助装置,其特征在于,包括:A reading aid device, characterized in that it comprises:
    采集模块,用于采集目标区域中的第一图像数据,所述第一图像数据中包括目标对象的指示动作图像,所述目标区域中显示有阅读内容;An acquisition module, configured to acquire first image data in a target area, where the first image data includes an instruction action image of a target object, and reading content is displayed in the target area;
    识别模块,用于对所述指示动作图像进行识别,以确定出所述指示动作图像对应的待识别字符串;A recognition module, configured to recognize the instruction action image to determine the character string to be recognized corresponding to the instruction action image;
    处理模块,用于将所述待识别字符串进行指定处理,以确定出与所述待识别字符串对应的目标字符串;A processing module, configured to perform designated processing on the character string to be recognized to determine a target character string corresponding to the character string to be recognized;
    第一投影模块,用于将所述目标字符串在所述目标区域中进行显示。The first projection module is configured to display the target character string in the target area.
  9. 一种电子设备,其特征在于,包括:处理器、存储器,所述存储器存储有所述处理器可执行的机器可读指令,当电子设备运行时,所述机器可读指令被所述处理器执行时执行如权利要求1至7任一所述的方法的步 骤。An electronic device, comprising: a processor and a memory, the memory storing machine-readable instructions executable by the processor, and when the electronic device is running, the machine-readable instructions are used by the processor When executed, the steps of the method according to any one of claims 1 to 7 are executed.
  10. 一种计算机可读存储介质,其特征在于,该计算机可读存储介质上存储有计算机程序,该计算机程序被处理器运行时执行如权利要求1至7任一所述的方法的步骤。A computer-readable storage medium, characterized in that a computer program is stored on the computer-readable storage medium, and the computer program executes the steps of the method according to any one of claims 1 to 7 when the computer program is run by a processor.
PCT/CN2020/079181 2019-12-16 2020-03-13 Reading assistance method and apparatus, and electronic device WO2021120420A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201911305405.5A CN110990107A (en) 2019-12-16 2019-12-16 Reading assistance method and device and electronic equipment
CN201911305405.5 2019-12-16

Publications (1)

Publication Number Publication Date
WO2021120420A1 true WO2021120420A1 (en) 2021-06-24

Family

ID=70094885

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/079181 WO2021120420A1 (en) 2019-12-16 2020-03-13 Reading assistance method and apparatus, and electronic device

Country Status (2)

Country Link
CN (1) CN110990107A (en)
WO (1) WO2021120420A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2015158869A (en) * 2014-02-25 2015-09-03 カシオ計算機株式会社 projection display device and program
CN107003714A (en) * 2014-09-12 2017-08-01 惠普发展公司,有限责任合伙企业 Contextual information is developed from image
CN108665742A (en) * 2018-05-11 2018-10-16 亮风台(上海)信息科技有限公司 A kind of method and apparatus read by arrangement for reading
CN108681393A (en) * 2018-04-16 2018-10-19 优视科技有限公司 Translation display methods, device, computing device and medium based on augmented reality

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7239747B2 (en) * 2002-01-24 2007-07-03 Chatterbox Systems, Inc. Method and system for locating position in printed texts and delivering multimedia information
JP6290922B2 (en) * 2012-12-28 2018-03-07 メタイオ ゲゼルシャフト ミット ベシュレンクテル ハフツングmetaio GmbH Method and system for projecting digital information on a real object in a real environment

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2015158869A (en) * 2014-02-25 2015-09-03 カシオ計算機株式会社 projection display device and program
CN107003714A (en) * 2014-09-12 2017-08-01 惠普发展公司,有限责任合伙企业 Contextual information is developed from image
CN108681393A (en) * 2018-04-16 2018-10-19 优视科技有限公司 Translation display methods, device, computing device and medium based on augmented reality
CN108665742A (en) * 2018-05-11 2018-10-16 亮风台(上海)信息科技有限公司 A kind of method and apparatus read by arrangement for reading

Also Published As

Publication number Publication date
CN110990107A (en) 2020-04-10

Similar Documents

Publication Publication Date Title
US11645826B2 (en) Generating searchable text for documents portrayed in a repository of digital images utilizing orientation and text prediction neural networks
WO2021128578A1 (en) Image processing method and apparatus, electronic device, and storage medium
US10176409B2 (en) Method and apparatus for image character recognition model generation, and vertically-oriented character image recognition
JP6479266B1 (en) Intelligent identification and presentation of digital documents
KR102567285B1 (en) Mobile video search
WO2020005731A1 (en) Text entity detection and recognition from images
KR20200109239A (en) Image processing method, device, server and storage medium
JP7206729B2 (en) Information processing device and program
EP3175375A1 (en) Image based search to identify objects in documents
US10482393B2 (en) Machine-based learning systems, methods, and apparatus for interactively mapping raw data objects to recognized data objects
WO2021159843A1 (en) Object recognition method and apparatus, and electronic device and storage medium
JP2021034003A (en) Human object recognition method, apparatus, electronic device, storage medium, and program
US10460192B2 (en) Method and system for optical character recognition (OCR) of multi-language content
US10311330B2 (en) Proactive input selection for improved image analysis and/or processing workflows
WO2021223629A1 (en) Method and device for analyzing image material
TWM457241U (en) Picture character recognition system by combining augmented reality
WO2016155643A1 (en) Input-based candidate word display method and device
CN111881900B (en) Corpus generation method, corpus translation model training method, corpus translation model translation method, corpus translation device, corpus translation equipment and corpus translation medium
WO2021120420A1 (en) Reading assistance method and apparatus, and electronic device
US20230048495A1 (en) Method and platform of generating document, electronic device and storage medium
US20220343662A1 (en) Method and apparatus for recognizing text, device and storage medium
WO2022105120A1 (en) Text detection method and apparatus from image, computer device and storage medium
WO2020000966A1 (en) Method for generating wireless access point information, device, and computer readable medium
CN110019661A (en) Text search method, apparatus and electronic equipment based on office documents
US20230206668A1 (en) Vision processing and model training method, device, storage medium and program product

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20902367

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20902367

Country of ref document: EP

Kind code of ref document: A1