WO2018171560A1 - Method and apparatus for quickly inserting recognized text - Google Patents

Method and apparatus for quickly inserting recognized text

Info

Publication number
WO2018171560A1
Authority
WO
WIPO (PCT)
Prior art keywords
picture
text
document
camera
instruction
Application number
PCT/CN2018/079489
Other languages
English (en)
French (fr)
Inventor
区钺坚
黄志军
高延平
王峰
杨松
Original Assignee
北京金山办公软件股份有限公司
珠海金山办公软件有限公司
广州金山移动科技有限公司
Application filed by 北京金山办公软件股份有限公司, 珠海金山办公软件有限公司, 广州金山移动科技有限公司
Priority to EP18772598.1A: EP3605277A4 (en)
Priority to SG11201908705W: SG11201908705WA (en)
Priority to US16/496,080: US20200042581A1 (en)
Priority to JP2020500950A: JP2020515996A (ja)
Publication of WO2018171560A1: WO2018171560A1 (zh)

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/166 Editing, e.g. inserting or deleting
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition
    • G06V30/14 Image acquisition
    • G06V30/148 Segmentation of character regions
    • G06V30/153 Segmentation of character regions using recognition of characters or words
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition
    • G06V30/22 Character recognition characterised by the type of writing
    • G06V30/226 Character recognition characterised by the type of writing of cursive writing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40 Document-oriented image-based pattern recognition
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition

Definitions

  • The present application relates to the field of electronic document editing, and in particular to a method and apparatus for quickly inserting recognized text.
  • Text that cannot be copied from an external carrier sometimes needs to be inserted into a document, such as text in a picture, text in a video, or text in a copy-protected electronic document.
  • The existing method first converts a carrier that is not in a picture format into a picture by using format conversion software, then starts a separate picture recognition program to recognize the text in the picture, and finally copies the recognized text into the document to be edited.
  • The purpose of the embodiments of the present application is to provide a method and apparatus for quickly inserting recognized text, so as to improve work efficiency.
  • The specific technical solutions are as follows:
  • The embodiment of the present application discloses a method for quickly inserting recognized text, including:
  • the required text is moved to the document to be edited.
  • the image obtaining instruction includes:
  • obtaining a picture containing the required text according to the picture acquisition instruction includes:
  • obtaining the picture interception range includes:
  • the picture acquisition instruction includes:
  • obtaining a picture containing the required text according to the user's picture acquisition instruction includes:
  • when the picture acquisition instruction is an instruction to acquire a picture by using a camera, starting the camera and confirming that the visible area of the camera includes the required text
  • moving the required text into the document to be edited includes:
  • the required text is moved to a position to be inserted in the document to be edited, wherein the position to be inserted is a position where the mouse cursor is located, or a position where the touch screen cursor is located.
  • The embodiment of the present application further discloses an apparatus for quickly inserting recognized text, including:
  • an instruction acquisition module configured to acquire a picture acquisition instruction of the user;
  • a picture obtaining module configured to obtain a picture containing the required text according to the picture acquisition instruction;
  • a recognition module configured to recognize the required text in the picture in the first document editing software;
  • a text moving module configured to move the required text into the document to be edited.
  • the image obtaining instruction includes:
  • the image obtaining module includes:
  • a picture interception range obtaining sub-module configured to acquire a picture interception range when the picture acquisition instruction is an instruction to acquire a picture by screenshot, where the picture interception range includes the required text;
  • a picture interception sub-module configured to capture the picture within the picture interception range.
  • the picture interception scope obtaining submodule is specifically configured to:
  • the image obtaining instruction includes:
  • the image obtaining module includes:
  • a camera activation sub-module configured to, when the picture acquisition instruction is an instruction to acquire a picture by using a camera, start the camera and confirm that the required text is included in the visible area of the camera;
  • a shooting sub-module configured to capture the picture in the visible area of the camera.
  • the text moving module is specifically configured to:
  • the required text is moved to a position to be inserted in the document to be edited, wherein the position to be inserted is a position where the mouse cursor is located, or a position where the touch screen cursor is located.
  • The embodiment of the present application further discloses an electronic device, including a processor and a memory:
  • the memory is configured to store a computer program;
  • the processor is configured to implement any of the above methods for quickly inserting recognized text when executing the program stored in the memory.
  • The embodiment of the present application further discloses a computer-readable storage medium storing a computer program which, when executed by a processor, implements any of the above methods for quickly inserting recognized text.
  • The embodiment of the present application also discloses executable program code which, when run, performs any of the above methods for quickly inserting recognized text.
  • With the method and apparatus for quickly inserting recognized text, when the required text in an external carrier cannot be copied, a picture acquisition instruction of the user is acquired. A picture containing the required text is then obtained according to the picture acquisition instruction. The required text in the picture is then recognized in the first document editing software. Finally, the required text is added to the document to be edited.
  • The embodiment of the present application can thus obtain a picture containing the required text, recognize the required text in the picture, and automatically insert it into the document to be edited.
  • The prior art requires opening multiple pieces of software and programs and manually copying the required text, or inserting the text by manual typing. By contrast, the embodiment of the present application inserts the recognized text automatically by using the first document editing software, which improves work efficiency.
  • FIG. 1 is a schematic flowchart of a method for quickly inserting recognized text according to an embodiment of the present application;
  • FIG. 2 is a schematic diagram of a picture containing the required text according to an embodiment of the present application;
  • FIG. 3 is a schematic flowchart of an example based on the method shown in FIG. 1;
  • FIG. 4 is a schematic flowchart of still another example based on the method shown in FIG. 1;
  • FIG. 5 is a schematic structural diagram of an apparatus for quickly inserting recognized text according to an embodiment of the present application;
  • FIG. 6 is a schematic structural diagram of an example based on the apparatus shown in FIG. 5;
  • FIG. 7 is a schematic structural diagram of still another example based on the apparatus shown in FIG. 5;
  • FIG. 8 is a schematic structural diagram of an electronic device according to an embodiment of the present application.
  • The embodiment of the present application discloses a method and an apparatus for quickly inserting recognized text, which can improve work efficiency.
  • The method for quickly inserting recognized text disclosed in the embodiment of the present application includes: acquiring a picture acquisition instruction of a user; obtaining a picture containing the required text according to the picture acquisition instruction; recognizing the required text in the picture in the first document editing software; and adding the required text to the document to be edited in the first document editing software. The prior art requires opening multiple pieces of software and programs and manually copying the required text, or inserting the text by manual typing. By contrast, the embodiment of the present application inserts the recognized text automatically by using the first document editing software, which improves work efficiency.
  • FIG. 1 is a flowchart of a method for quickly inserting recognized text according to an embodiment of the present application, including the following steps:
  • Step 101: Open the document to be edited.
  • The embodiment of the present application is executed by a processor of a terminal device, such as a computer, a mobile phone, a tablet computer, or another device capable of editing electronic documents.
  • A document can be opened by the first document editing software as the document to be edited; if a document is already open, step 101 need not be performed.
  • The first document editing software is software installed on the terminal device for editing electronic documents, such as the Kingsoft office software WPS Office. Every step in the embodiment of the present application may be completed in the first document editing software, or steps 102 and 103 may be implemented by other software.
  • The first document editing software in the embodiment of the present application may include a screenshot function and a picture recognition function.
  • The first document editing software may integrate a screen capture program, which can capture a picture of a selected area, and may also integrate a picture recognition program, which can recognize the text in a picture.
  • Step 102: Acquire a picture acquisition instruction of the user.
  • Multiple ways of acquiring a picture may be provided, such as acquiring a picture by screenshot or by using a camera. The picture acquisition instruction is the instruction by which the user selects one of these ways; that is, when an instruction from the user selecting a way of acquiring a picture is received, the user's picture acquisition instruction is considered to have been obtained.
  • The embodiment of the present application may establish a user selection window in the first document editing software in advance, offering options for the available ways of acquiring a picture, and then detect the user's click selecting one of them.
  • Steps 101 and 102 in the embodiment of the present application may be performed in either order. The document to be edited may be opened with the first document editing software first and the user's picture acquisition instruction obtained afterwards; this usually suits a scenario in which the user edits the document first and then fetches the required text. Alternatively, the picture acquisition instruction may be obtained first and the document to be edited opened afterwards; this usually suits a scenario in which the carrier of the required text is already known and the required text is obtained first.
  • In the latter scenario, multiple documents to be edited may be opened and the required text inserted into each of them. The order of steps 101 and 102 depends on the user's habits or the specific usage scenario.
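  • The mode selection described in step 102 can be sketched as a small dispatch table. This is only an illustrative sketch, not the implementation used by the applicant: the names `AcquisitionMode`, `capture_by_screenshot`, `capture_by_camera`, and `handle_picture_acquisition_instruction` are hypothetical, and the capture functions return placeholder strings instead of real pictures.

```python
from enum import Enum
from typing import Callable, Dict


class AcquisitionMode(Enum):
    """Hypothetical identifiers for the options shown in the selection window."""
    SCREENSHOT = "screenshot"
    CAMERA = "camera"


def capture_by_screenshot() -> str:
    # Placeholder: a real editor would call its integrated screen capture program.
    return "picture-from-screenshot"


def capture_by_camera() -> str:
    # Placeholder: a real editor would start the device camera.
    return "picture-from-camera"


HANDLERS: Dict[AcquisitionMode, Callable[[], str]] = {
    AcquisitionMode.SCREENSHOT: capture_by_screenshot,
    AcquisitionMode.CAMERA: capture_by_camera,
}


def handle_picture_acquisition_instruction(option: str) -> str:
    """Map the option the user clicked to the matching way of acquiring a picture."""
    mode = AcquisitionMode(option)  # raises ValueError for an unknown option
    return HANDLERS[mode]()
```

  • Adding a further acquisition mode in this sketch only requires a new enum member and a new entry in `HANDLERS`.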
  • Step 103: Obtain a picture containing the required text according to the picture acquisition instruction.
  • The picture acquisition instruction in the embodiment of the present application depends on the carrier where the required text is located. The required text is the text that the user actually needs to insert into the document to be edited.
  • The required text may exist in an electronic carrier already on the terminal device, such as a picture, a video, or an electronic document that cannot be copied. In that case the picture acquisition instruction may be an instruction to acquire a picture by taking a screenshot of the electronic carrier.
  • For example, an instruction from the user to acquire a picture by screenshot is received while an electronic carrier containing the required text is displayed on the screen of the terminal device. The screenshot range may then be determined according to the instruction, and a picture containing the required text is captured from the currently displayed electronic carrier according to that range.
  • The required text may also exist in a physical carrier outside the terminal device, such as text in a paper book, on a wall poster, or on a television screen.
  • For such a physical carrier, the picture acquisition instruction may be an instruction to acquire a picture by using a camera. According to that instruction, the camera photographs the physical carrier where the required text is located to obtain a picture containing the required text.
  • The picture containing the required text in the embodiment of the present application can therefore come from a wide range of sources, which makes the method broadly applicable.
  • Step 104: Recognize the required text in the picture in the first document editing software.
  • The picture recognition program integrated in the first document editing software may be used to recognize the required text in the picture.
  • A program interface may also be preset in the first document editing software so that different picture recognition programs can be swapped in.
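  • One way to picture such a preset program interface is a small protocol that every recognition program satisfies. The sketch below is an assumption made for illustration: `PictureRecognizer`, `Editor`, and both stub recognizers are hypothetical names, and the stubs merely decode bytes rather than perform real character recognition.

```python
from typing import Protocol


class PictureRecognizer(Protocol):
    """Hypothetical interface that any pluggable recognition program must satisfy."""

    def recognize(self, picture: bytes) -> str:
        ...


class PlainStubRecognizer:
    # Stand-in for a real recognition engine: treats the bytes as UTF-8 text.
    def recognize(self, picture: bytes) -> str:
        return picture.decode("utf-8")


class UppercaseStubRecognizer:
    # A second stand-in, to show that implementations can be exchanged freely.
    def recognize(self, picture: bytes) -> str:
        return picture.decode("utf-8").upper()


class Editor:
    """Hypothetical stand-in for the first document editing software."""

    def __init__(self, recognizer: PictureRecognizer) -> None:
        self.recognizer = recognizer

    def recognize_required_text(self, picture: bytes) -> str:
        # The editor depends only on the interface, not on a concrete program.
        return self.recognizer.recognize(picture)
```

  • Replacing the recognition program then amounts to passing a different object to `Editor`, with no change to the editing code itself.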
  • Step 105: Move the required text into the document to be edited.
  • Step 105 may be: adding the required text to the document to be edited in the first document editing software.
  • The recognized text may be added at a preset fixed position in the document to be edited, at an arbitrary position, or at a position to be inserted that the user has set in the document to be edited.
  • The text may be added synchronously with recognition: as soon as a piece of text is recognized, it is added to the document to be edited. This synchronous approach lets the user use or edit the already recognized part of the text as early as possible. Alternatively, the required text may be added as a whole after all of the required text in the picture has been recognized. This whole-text approach helps preserve the integrity of the required text and is better suited to using or editing the required text as a whole.
  • The text may be added from the picture to the document to be edited in various visual forms, such as sliding, scrolling, or bouncing.
  • The embodiment of the present application does not limit the specific form in which the text is added.
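  • The difference between synchronous and whole-text addition can be sketched as follows. The function names and the representation of a document as a list of strings are assumptions made for illustration only.

```python
from typing import Iterable, List


def insert_synchronously(recognized_pieces: Iterable[str], document: List[str]) -> int:
    """Add each piece to the document as soon as it is recognized.

    Returns the number of pieces added, so a caller can report progress.
    """
    count = 0
    for piece in recognized_pieces:
        document.append(piece)  # the user can already edit this piece
        count += 1
    return count


def insert_as_whole(recognized_pieces: Iterable[str], document: List[str],
                    separator: str = " ") -> None:
    """Wait until recognition has finished, then add the text in one operation."""
    document.append(separator.join(recognized_pieces))
```

  • In this sketch the synchronous variant leaves one entry per recognized piece, while the whole-text variant leaves a single entry that keeps the required text together.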
  • The text recognized in the picture is inserted into the document to be edited automatically, without manual copy, paste, drag, or similar move operations.
  • the embodiment of the present application can prevent the user from manually copying the recognized text into the document to be edited, can realize automatic insertion, and can improve work efficiency.
  • The embodiment of the present application provides a method for quickly inserting recognized text, which acquires a picture acquisition instruction of the user, obtains a picture containing the required text according to the picture acquisition instruction, recognizes the required text in the picture in the first document editing software, and finally adds the required text to the document to be edited.
  • The embodiment of the present application can thus obtain a picture containing the required text, recognize the required text in the picture, and move it into the document to be edited, so that the required text is inserted automatically.
  • The prior art requires opening multiple pieces of software and programs and manually copying the required text, or inserting the text by manual typing. By contrast, the embodiment of the present application inserts the recognized text automatically by using the first document editing software, which improves work efficiency.
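  • The flow of steps 103 to 105 can be summarized in a minimal sketch. Everything named here (`Document`, `insert_recognized_text`, and the two callables) is hypothetical; picture acquisition and recognition are passed in as functions because the application does not prescribe a particular capture or recognition engine.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Document:
    """Hypothetical stand-in for the document to be edited."""
    paragraphs: List[str] = field(default_factory=list)


def insert_recognized_text(
    acquire_picture: Callable[[], bytes],
    recognize: Callable[[bytes], str],
    document: Document,
) -> str:
    """Run acquisition, recognition, and insertion inside a single program."""
    picture = acquire_picture()       # step 103: obtain a picture with the required text
    text = recognize(picture)         # step 104: recognize the required text
    document.paragraphs.append(text)  # step 105: add it to the document to be edited
    return text


doc = Document(["Existing paragraph."])
insert_recognized_text(
    acquire_picture=lambda: b"placeholder-picture",
    recognize=lambda picture: "recognized text",
    document=doc,
)
```

  • The point of the sketch is that no step leaves the editor: the user never switches to a separate conversion or recognition program.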
  • FIG. 2 is a picture containing the required text.
  • While editing a document with the document editing software, the user may also be viewing a PDF (Portable Document Format) document and find that part of its text is the required text (see the text in the box in FIG. 2), which the user wants to insert into the document to be edited.
  • With some prior-art solutions, external format conversion software or an external program must first convert the PDF document into a picture, that is, the picture shown in FIG. 2. Another picture recognition software or program then recognizes all the characters in the picture, and finally the user manually copies the text in the box into the document to be edited.
  • In the embodiment of the present application, the user's picture acquisition instruction is obtained first.
  • The user may be offered multiple options in the first document editing software, each corresponding to a way of acquiring a picture, such as by screenshot or by using a camera.
  • The way of acquiring the picture selected by the user is determined, and the picture containing the required text is obtained in that way.
  • For example, suppose the user chooses to acquire the picture by screenshot. The screen capture program integrated in the first document editing software is used to capture the boxed part of the PDF document: the user frame-selects the text in the boxed part of FIG. 2 with the mouse.
  • The screenshot range is determined from the user's operation so that it includes the boxed text, and the screenshot is taken according to that range to obtain a picture containing the text in the box in FIG. 2.
  • The picture recognition program integrated in the first document editing software then recognizes the required text in the picture, that is, the text in the box. Finally, the required text is added to the document to be edited.
  • The embodiment of the present application completes the entire process by using only the first document editing software and can automatically recognize and insert part of the characters in a picture, thereby improving work efficiency.
  • FIG. 3 is a flowchart of an example based on the method shown in FIG. 1, including the following steps:
  • Step 301: Open the document to be edited.
  • The first document editing software may be used to open a document as the document to be edited; if a document is already open, step 301 need not be performed.
  • The embodiment of the present application uses the first document editing software to open the document to be edited.
  • The terminal device receives the user's instruction to open a document, such as a click on the first document editing software icon, a click on the icon of the document to be edited, or a voice operation instruction.
  • The processor of the terminal device opens the document to be edited according to the instruction to open the document.
  • For a click on the first document editing software icon, the processor of the terminal device first starts the first document editing software and then receives the user's instruction selecting a document; for example, after obtaining the user's selection of a document, it opens that document as the document to be edited.
  • For a click on the icon of the document to be edited, the processor of the terminal device opens the document to be edited with the first document editing software.
  • For a voice operation instruction, for example an instruction to open the document named “File 1”, the processor of the terminal device finds the document named “File 1” and opens it with the first document editing software as the document to be edited.
  • Step 302: Acquire the user's instruction to obtain a picture by screenshot.
  • The user selection window preset in the first document editing software offers the user an option for acquiring the picture by screenshot. When the user's click on that option is detected, the user's picture acquisition instruction is considered to have been obtained.
  • This option can be located in the options window of the tool menu bar of the first document editing software or in a user dialog window outside the tool menu bar.
  • Steps 301 and 302 in the embodiment of the present application may be performed in either order. The first document editing software may open the document to be edited first and then obtain the user's instruction to acquire a picture by screenshot, or it may obtain that instruction first and then open the document to be edited. The order depends on the user's habits or the specific usage scenario.
  • Step 303: Obtain the picture interception range.
  • Step 303 may be: determining the screenshot range according to the picture acquisition instruction.
  • The embodiment of the present application obtains the picture interception range in a way suited to the terminal device. For a terminal device that uses a mouse, the picture range frame-selected with the mouse is taken as the picture interception range, that is, the screenshot range.
  • For example, a desktop computer usually uses a mouse, so on a desktop computer the picture range that the user frame-selects with the mouse is taken as the picture interception range.
  • The picture interception range contains the required text.
  • The picture range frame-selected with the mouse may be the area enclosed by the track of the mouse as it is dragged over the electronic carrier containing the required text. The area may be a rectangle, a circle, or an arbitrary irregular shape.
  • As shown in FIG. 2, the rectangular area formed by pressing and dragging the mouse over the PDF document, that is, the box in FIG. 2, is taken as the picture interception range, that is, the screenshot range.
  • The embodiment of the present application can also preset the shape of the mouse selection frame, such as a rectangle. When the user drags the mouse, a rectangular frame of varying size appears, and the user only needs to adjust it until it contains the required text. This makes it easy to standardize the interception range, and because the screenshot is taken in a regular shape such as a rectangle, capturing the picture is less difficult and more efficient.
  • The embodiment of the present application can also preset the thickness and color of the mouse selection frame, choosing a thickness and color that are conspicuous and easy to see according to the user's habits or the background color of the electronic carrier.
  • The embodiment of the present application can also provide a confirmation step after obtaining the picture range frame-selected with the mouse, such as a user dialog window prompting the user to confirm, to avoid user misoperation.
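  • Deriving a rectangular screenshot range from a mouse drag can be sketched as below. The names `Rect` and `rect_from_drag` are hypothetical, and coordinates are assumed to be screen pixels with the origin at the top-left corner.

```python
from typing import NamedTuple, Tuple


class Rect(NamedTuple):
    """Hypothetical screenshot range in screen pixels (right and bottom exclusive)."""
    left: int
    top: int
    right: int
    bottom: int


def rect_from_drag(press: Tuple[int, int], release: Tuple[int, int]) -> Rect:
    """Build the rectangular range from the points where the drag starts and ends.

    The user may drag in any direction, so the corners are normalized.
    """
    (x0, y0), (x1, y1) = press, release
    return Rect(min(x0, x1), min(y0, y1), max(x0, x1), max(y0, y1))
```

  • Normalizing the corners means a drag from bottom-right to top-left yields the same range as the reverse drag.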
  • For a terminal device that uses a touch screen, the picture range frame-selected by the touch track on the touch screen is taken as the picture interception range, that is, the screenshot range.
  • In other words, the picture range enclosed by the user's touch track is obtained as the picture interception range.
  • The picture interception range contains the required text.
  • The picture range frame-selected by the touch track may be the area enclosed by the track of a finger or another tool on the electronic carrier containing the required text. The area may be a rectangle, a circle, or an irregular shape.
  • As shown in FIG. 2, the rectangular area formed by the user's finger touching and dragging over the PDF document can be obtained, as shown by the box in FIG. 2.
  • The range of the box is taken as the picture interception range.
  • The picture range frame-selected by a touch track is often not a regular shape.
  • The embodiment of the present application may therefore preset the shape of the touch track frame, such as a rectangle or a circle; that is, the user's irregular touch track frame is converted into the preset regular shape that matches it best. For example, if the user's touch track frame is irregular, a preset regular shape that can contain the required text enclosed by the touch track frame is found.
  • This makes it easy to standardize the interception range, and because the screenshot is taken in a regular shape such as a rectangle, capturing the picture is less difficult and more efficient.
  • The embodiment of the present application can also preset the thickness and color of the touch track frame, choosing a thickness and color that are conspicuous and easy to see according to the user's habits or the background color of the electronic carrier.
  • The embodiment of the present application may further provide a confirmation step after obtaining the picture range frame-selected by the touch track, such as a user dialog window prompting the user to confirm, to avoid user misoperation.
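  • The application does not specify how the best-matching regular shape is computed. As one simple assumption, the sketch below converts an irregular touch track into its axis-aligned bounding rectangle, which always contains everything the track encloses. The function name `bounding_rect` is hypothetical.

```python
from typing import Sequence, Tuple


def bounding_rect(track: Sequence[Tuple[int, int]]) -> Tuple[int, int, int, int]:
    """Return (left, top, right, bottom) of the smallest axis-aligned rectangle
    that contains every sampled point of the touch track."""
    if not track:
        raise ValueError("touch track contains no points")
    xs = [x for x, _ in track]
    ys = [y for _, y in track]
    return (min(xs), min(ys), max(xs), max(ys))
```

  • A bounding rectangle may include some surrounding content the user did not circle; a stricter matching rule would need a different technique than the one shown here.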
  • A full-screen interception mode may also be provided, which captures the entire picture without selecting a picture interception range. This mode suits scenarios in which all the characters in the entire picture are the required text.
  • The embodiment of the present application may set a corresponding trigger for the full-screen interception mode, such as the full-screen capture shortcut key of a mobile phone.
  • The picture interception range may be obtained for an electronic carrier containing the required text that already exists on the terminal device where the first document editing software is located.
  • Alternatively, an electronic carrier containing the required text may first be obtained from outside that terminal device, for example via Bluetooth or the Internet, and a screenshot of that carrier is then taken to capture a picture containing the required text. As a further alternative, another terminal device may be connected to and controlled remotely over the Internet.
  • In that case a screenshot is taken directly of the electronic carrier containing the required text on the other device to obtain a picture containing the required text. The subsequent steps then recognize the required text and move it to the position to be inserted in the document to be edited on the user's terminal device.
  • The embodiments of the present application can thus obtain a picture interception range for many kinds of electronic carriers containing the required text, and can be applied to terminal devices that use either a mouse or a touch screen.
  • The method of the present application therefore has a wide range of applications and is highly practical.
  • Step 304: Capture the picture within the picture interception range.
  • Step 304 may be: capturing a picture containing the required text according to the screenshot range.
  • The screenshot program in the first document editing software captures the picture containing the required text according to the screenshot range determined in step 303.
  • The embodiment of the present application may also capture the picture in the full-screen interception mode.
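  • Capturing within a range versus capturing the full screen can be sketched on a toy picture stored as rows of pixel values. The function `capture` and the list-of-rows representation are assumptions made for illustration; a real screenshot program would operate on the screen buffer.

```python
from typing import List, Optional, Tuple

Pixels = List[List[int]]


def capture(screen: Pixels,
            rect: Optional[Tuple[int, int, int, int]] = None) -> Pixels:
    """Return the part of the screen inside rect, or the whole screen if rect is None.

    rect is (left, top, right, bottom) with right and bottom exclusive.
    """
    if rect is None:
        # Full-screen interception mode: copy every row unchanged.
        return [row[:] for row in screen]
    left, top, right, bottom = rect
    return [row[left:right] for row in screen[top:bottom]]


screen = [
    [0, 1, 2, 3],
    [4, 5, 6, 7],
    [8, 9, 10, 11],
]
boxed = capture(screen, (1, 0, 3, 2))
```

  • Both modes return a new picture, so later recognition cannot alter what is shown on screen.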
  • Step 305: Recognize the required text in the picture in the first document editing software.
  • The required text in the picture is recognized by using the picture recognition program in the first document editing software.
  • Step 306: Move the required text to the position to be inserted in the document to be edited.
  • Step 306 may be: adding the required text to the position to be inserted in the document to be edited in the first document editing software.
  • If the terminal device uses a mouse, the acquired position to be inserted is the position where the mouse cursor is located. If the terminal device uses a touch screen, the acquired position to be inserted is the position where the touch screen cursor is located.
  • The embodiment of the present application may also provide a confirmation step, such as a user dialog window prompting the user to confirm the position to be inserted, to avoid user misoperation. The required text is then moved to the position to be inserted in the document to be edited.
  • The way the text is moved and the specific form in which the text is added from the picture to the document to be edited are as described in step 105 and are not repeated here.
  • the embodiment of the present application may also obtain the to-be-inserted position after the step 301, that is, after the step 301 opens the to-be-edited document, the embodiment of the present application may detect the position where the mouse cursor is located or the position where the touch screen cursor is located as the position to be inserted. The embodiment of the present application may further provide a confirmation link after the detection, such as setting a user dialog window to prompt the user to confirm the position to be inserted, etc., to avoid user misoperation and the like. Then at step 306, the demand text is moved directly to the position to be inserted in the document to be edited.
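The insertion at the cursor position, with the optional confirmation step, can be sketched as follows. The `Document` class, the character-offset cursor, and the `confirm` callback standing in for the user dialog window are all hypothetical; they are not taken from the application or from any real editor API.

```python
from dataclasses import dataclass


@dataclass
class Document:
    """Hypothetical in-memory document: plain text plus a cursor offset."""
    text: str
    cursor: int = 0


def insert_at_cursor(doc: Document, recognized: str, confirm=lambda pos: True) -> bool:
    """Insert recognized text at the cursor; `confirm` stands in for the dialog window."""
    pos = max(0, min(doc.cursor, len(doc.text)))  # clamp to a valid offset
    if not confirm(pos):
        return False  # user rejected the position; the document is left unchanged
    doc.text = doc.text[:pos] + recognized + doc.text[pos:]
    doc.cursor = pos + len(recognized)  # cursor ends up after the inserted text
    return True
```

Reading the cursor once, right after the document is opened, and passing it in later matches the variant in which the position to be inserted is acquired after step 301.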
  • It can be seen that the method for quickly inserting recognized text provided by the embodiment of the present application acquires the user's instruction to acquire a picture by screenshot, then acquires the picture capture range, then captures the picture within that range. The required text in the picture is then recognized in the first document editing software. Finally, the required text is added to the position to be inserted in the document to be edited.
  • The embodiment of the present application is directed to many kinds of electronic carriers containing required text. When a document is edited in the first document editing software, a picture containing the required text can be obtained by screenshot, and the required text in the picture is recognized and automatically inserted at the position to be inserted in the document to be edited.
  • The embodiment of the present application uses only the first document editing software to insert the recognized text automatically at the position to be inserted in the document to be edited. Compared with the prior-art approach of opening multiple pieces of software and programs and manually copying the required text, or the prior-art approach of inserting the text by manual typing, this improves work efficiency.
  • FIG. 4 is a flowchart of still another example based on the method shown in FIG. 1, including the following steps:
  • Step 401: open the document to be edited.
  • In this embodiment, if no document is already open, the first document editing software may be used to open a document as the document to be edited; if an open document already exists, step 401 need not be performed.
  • The embodiment of the present application uses the first document editing software to open the document to be edited.
  • Specifically, the terminal device receives the user's instruction to open a document, such as a click on the icon of the first document editing software, a click on the icon of the document to be edited, or a voice operation instruction.
  • The processor of the terminal device opens the document to be edited according to the instruction to open the document.
  • For example, if the instruction is a click on the icon of the first document editing software, the processor of the terminal device first starts the first document editing software and then receives the user's instruction selecting a document; after the user's click on a document is acquired, that document is opened as the document to be edited.
  • For example, if the instruction is a click on the icon of the document to be edited, the processor of the terminal device uses the first document editing software to open that document.
  • For example, if the instruction is a voice operation instruction, such as an instruction to open the document named "File 1", the processor of the terminal device finds the document named "File 1" and uses the first document editing software to open it as the document to be edited.
  • Step 402: acquire the user's instruction to acquire a picture using a camera.
  • In this embodiment, an option may be provided in a user selection window preset in the first document editing software so that the user can choose to acquire a picture using the camera; when the user's click on this option is detected, the user's picture acquisition instruction is considered to have been acquired.
  • This option may be located in the options window of the tool menu bar of the first document editing software, or in a user dialog window outside that tool menu bar.
  • It should be noted that steps 401 and 402 may be performed in either order. That is, the embodiment may use the first document editing software to open the document to be edited and then acquire the user's instruction to acquire a picture using the camera, or it may acquire that instruction first and then open the document to be edited. The order depends on the user's habits or the specific usage scenario.
  • Step 403: start the camera and confirm that the visible shooting area of the camera contains the required text.
  • Steps 403 and 404 may be: shooting a picture containing the required text using a camera.
  • The camera in this embodiment may be a camera of the terminal device on which the first document editing software runs, such as the camera of the user's computer or mobile phone, or a camera outside that terminal device, such as the camera of another user's computer or mobile phone, a camera of a traffic monitoring system, or a camera of corridor monitoring equipment.
  • The embodiment may start the camera of the user's terminal device, or may start another camera connected to the user's terminal device through the Internet, a local area network, Bluetooth, or the like.
  • In this embodiment, the user may confirm that the visible shooting area of the camera contains the required text, as sharply as possible, to facilitate the subsequent shot. If the visible shooting area cannot contain all of the required text, or the sharpness of the required text does not meet the shooting requirements, the user can adjust the physical carrier containing the required text or adjust the camera.
  • For example, the position of a paper book printed with the required text within the visible shooting area can be adjusted manually, or a preset camera adjustment program can adjust the parameters of the camera, including the distance between the camera and the paper book, the shooting angle of the camera, and the focal length of the camera, until the visible shooting area contains the required text and the sharpness meets the shooting requirements.
  • Alternatively, the terminal device on which the first document editing software runs may itself determine whether the current shooting area of the camera contains the required text. If it does, the camera is controlled to shoot the current shooting area and a picture containing the required text is obtained. If it does not, the position of the camera is adjusted and the step of determining whether the current shooting area contains the required text is performed again.
  • That is, the camera can be adjusted by the terminal device on which the first document editing software runs, so that the camera can shoot a picture containing the required text.
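The determine, shoot, or adjust-and-retry loop described above can be sketched as below. The three callbacks (`preview`, `contains_required_text`, `adjust`) are hypothetical stand-ins for the camera and detection logic, which the application leaves open, and the `max_attempts` bound is an added assumption so that the sketch always terminates; the source describes the loop without a limit.

```python
from typing import Callable, Optional


def shoot_when_text_visible(
    preview: Callable[[], str],
    contains_required_text: Callable[[str], bool],
    adjust: Callable[[], None],
    max_attempts: int = 5,
) -> Optional[str]:
    """Return the first preview frame that contains the required text, else None."""
    for _ in range(max_attempts):
        frame = preview()                  # current shooting area
        if contains_required_text(frame):  # the determination step
            return frame                   # "shoot": keep this frame as the picture
        adjust()                           # reposition the camera, then check again
    return None
```

Frames are modeled as strings here only to keep the example self-contained; a real frame would be image data.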
  • Step 404: shoot a picture of the visible shooting area of the camera.
  • In this embodiment, the camera of the user's terminal device may shoot the picture of its visible shooting area, or another camera outside the user's terminal device may be connected through the Internet, a local area network, Bluetooth, or the like and used to shoot the picture of its visible shooting area. For example, a surveillance camera in a corridor may be connected and used to shoot the picture of its visible shooting area.
  • The embodiment of the present application can use a camera to shoot pictures of many kinds of physical carriers containing required text, such as paper books, wall posters, and billboards.
  • The method thus extends the carriers of required text to many kinds of physical carriers, giving it a wider range of application and greater practicality.
  • Step 405: recognize the required text in the picture in the first document editing software.
  • In this embodiment, the required text in the picture is recognized by the picture recognition program in the first document editing software.
  • Step 406: move the required text to the position to be inserted in the document to be edited.
  • Step 406 may be: adding the required text to the position to be inserted in the document to be edited in the first document editing software.
  • For example, if the terminal device uses a mouse, the acquired position to be inserted is the position of the mouse cursor; if the terminal device uses a touch screen, the acquired position to be inserted is the position of the touch screen cursor.
  • The embodiment of the present application may also provide a confirmation step, such as a user dialog window that prompts the user to confirm the position to be inserted, to avoid user misoperation. The required text is then moved to the position to be inserted in the document to be edited.
  • The embodiment of the present application may also acquire the position to be inserted after step 401. That is, after step 401 opens the document to be edited, the position of the mouse cursor or of the touch screen cursor may be detected and taken as the position to be inserted. A confirmation step may also be provided after the detection, such as a user dialog window that prompts the user to confirm the position to be inserted, to avoid user misoperation. Then, at step 406, the required text is moved directly to the position to be inserted in the document to be edited.
  • It can be seen that the device for quickly inserting recognized text provided by the embodiment of the present application acquires the user's instruction to acquire a picture using the camera, then uses the camera to shoot a picture containing the required text. The required text in the picture is then recognized in the first document editing software. Finally, the required text is added to the position to be inserted in the document to be edited.
  • The embodiment of the present application is directed to many kinds of physical carriers containing required text. When a document is edited in the first document editing software, a picture containing the required text can be obtained by shooting with the camera, and the required text in the picture is recognized and automatically inserted at the position to be inserted in the document to be edited.
  • The embodiment of the present application uses only the first document editing software to insert the recognized text automatically at the position to be inserted in the document to be edited. Compared with the prior-art approach of opening multiple pieces of software and programs and manually copying the required text, or the prior-art approach of inserting the text by manual typing, this improves work efficiency.
  • FIG. 5 is a structural diagram of a device for quickly inserting recognized text according to an embodiment of the present application, including:
  • the opening module 501, configured to open a document to be edited.
  • the instruction acquisition module 502, configured to acquire a picture acquisition instruction from the user.
  • the picture obtaining module 503, configured to obtain a picture containing the required text according to the picture acquisition instruction.
  • the identification module 504, configured to recognize the required text in the picture in the first document editing software.
  • the text moving module 505, configured to move the required text into the document to be edited.
  • the text moving module 505 is specifically configured to: add the required text to the document to be edited in the first document editing software.
  • It can be seen that, in the device provided by the embodiment of the present application, the document to be edited is first opened in the first document editing software. Second, the user's picture acquisition instruction is acquired. Third, a picture containing the required text is obtained according to the picture acquisition instruction. The required text in the picture is then recognized in the first document editing software. Finally, the required text is moved into the document to be edited.
  • When a document is edited in the first document editing software, the embodiment of the present application can obtain a picture containing the required text, recognize the required text in the picture, and automatically insert it into the document to be edited.
  • The embodiment of the present application uses only the first document editing software to insert the recognized text automatically, unlike the prior art, which requires opening multiple pieces of software and programs and manually copying the required text; the embodiment can therefore improve work efficiency.
  • It should be noted that the device in the embodiment of the present application is a device that applies the above method for quickly inserting recognized text, so all embodiments of that method are applicable to the device and can achieve the same or similar beneficial effects.
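One way to picture the five modules of FIG. 5 working together is as a small pipeline of interchangeable callables. This is only an illustrative sketch: the class name `QuickInsertDevice`, the callable signatures, and the modeling of a document as a list of strings are assumptions, not the structure claimed in the application.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class QuickInsertDevice:
    open_document: Callable[[], List[str]]        # opening module 501
    get_instruction: Callable[[], str]            # instruction acquisition module 502
    get_picture: Callable[[str], bytes]           # picture obtaining module 503
    recognize: Callable[[bytes], str]             # identification module 504
    move_text: Callable[[List[str], str], None]   # text moving module 505

    def run(self) -> List[str]:
        """Run the modules in the order described for the device of FIG. 5."""
        document = self.open_document()
        instruction = self.get_instruction()
        picture = self.get_picture(instruction)
        text = self.recognize(picture)
        self.move_text(document, text)
        return document
```

Swapping the `get_picture` callable for a screenshot-based or camera-based implementation would correspond to the variants of FIG. 6 and FIG. 7.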
  • On the basis of FIG. 5, as a preferred embodiment corresponding to the method shown in FIG. 3, FIG. 6 is a structural diagram of an example based on the device shown in FIG. 5.
  • In this embodiment, the picture acquisition instruction is: an instruction to acquire a picture by screenshot.
  • In this embodiment, the picture obtaining module 603 includes:
  • the picture capture range obtaining sub-module 6031, configured to determine the screenshot range according to the picture acquisition instruction.
  • the picture capturing sub-module 6032, configured to capture a picture containing the required text according to the screenshot range.
  • the picture capture range obtaining sub-module 6031 is specifically configured to: acquire the picture range selected by a mouse frame as the screenshot range; or acquire the picture range selected by a touch screen touch trajectory frame as the screenshot range.
  • the text moving module 605 is specifically configured to: move the required text to the position to be inserted in the document to be edited, where the position to be inserted is the position of the mouse cursor or the position of the touch screen cursor.
  • It can be seen that, in the device provided by the embodiment of the present application, the document to be edited is first opened in the first document editing software. Second, the user's instruction to acquire a picture by screenshot is acquired. Third, the picture capture range is acquired. Next, the picture within the picture capture range is captured. The required text in the picture is then recognized in the first document editing software. Finally, the required text is moved to the position to be inserted in the document to be edited.
  • The embodiment of the present application is directed to many kinds of electronic carriers containing required text. When a document is edited in the first document editing software, a picture containing the required text can be obtained by screenshot, and the required text in the picture is recognized and automatically inserted at the position to be inserted in the document to be edited.
  • The embodiment of the present application uses only the first document editing software to insert the recognized text automatically at the position to be inserted in the document to be edited. This differs from the prior art, which requires opening multiple pieces of software and programs, manually selecting the required text from the text recognized in the picture, and copying it to the position to be inserted in the document to be edited; the embodiment can therefore improve work efficiency.
  • On the basis of FIG. 5, as a preferred embodiment corresponding to the method shown in FIG. 4, FIG. 7 is a structural diagram of still another example based on the device shown in FIG. 5.
  • In this embodiment, the picture acquisition instruction is: an instruction to acquire a picture using a camera.
  • the picture obtaining module 703 is specifically configured to: shoot a picture containing the required text using a camera.
  • the picture obtaining module 703 is specifically configured to: determine whether the current shooting area of the camera contains the required text; if it does, control the camera to shoot the current shooting area to obtain a picture containing the required text; if it does not, adjust the position of the camera and return to the step of determining whether the current shooting area of the camera contains the required text.
  • the text moving module 705 is specifically configured to: add the required text to the position to be inserted in the document to be edited, where the position to be inserted is the position of the mouse cursor or the position of the touch screen cursor.
  • It can be seen that, in the device provided by the embodiment of the present application, the document to be edited is first opened in the first document editing software. Second, the user's instruction to acquire a picture using the camera is acquired. Third, the camera is started and it is confirmed that its visible shooting area contains the required text. Next, a picture of the camera's visible shooting area is shot. The required text in the picture is then recognized in the first document editing software. Finally, the required text is moved to the position to be inserted in the document to be edited.
  • The embodiment of the present application is directed to many kinds of physical carriers containing required text. When a document is edited in the first document editing software, a picture containing the required text can be obtained by shooting with the camera, and the required text in the picture is recognized and automatically inserted at the position to be inserted in the document to be edited.
  • The embodiment of the present application uses only the first document editing software to insert the recognized text automatically at the position to be inserted in the document to be edited. This differs from the prior art, which requires opening multiple pieces of software and programs, manually selecting the required text from the text recognized in the picture, and copying it to the position to be inserted in the document to be edited; the embodiment can therefore improve work efficiency.
  • The embodiment of the present application further discloses an electronic device, as shown in FIG. 8, including a processor 801 and a memory 802.
  • the memory 802 is configured to store a computer program;
  • the processor 801 is configured to implement any of the above methods for quickly inserting recognized text when executing the program stored in the memory 802.
  • The embodiment of the present application further discloses a computer-readable storage medium storing a computer program that, when executed by a processor, implements any of the above methods for quickly inserting recognized text.
  • The embodiment of the present application further discloses executable program code that, when run, performs any of the above methods for quickly inserting recognized text.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • User Interface Of Digital Computer (AREA)
  • Character Discrimination (AREA)
  • Processing Or Creating Images (AREA)
  • Character Input (AREA)
  • Document Processing Apparatus (AREA)

Abstract

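The embodiments of the present application provide a method and device for quickly inserting recognized text. The method includes: opening a document to be edited; acquiring a picture acquisition instruction from a user; obtaining, according to the picture acquisition instruction, a picture containing required text; recognizing the required text in the picture in first document editing software; and moving the required text into the document to be edited. Applying the method of the embodiments of the present application can improve work efficiency.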

Description

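Method and device for quickly inserting recognized text
This application claims priority to Chinese patent application No. 201710165750.8, filed with the Chinese Patent Office on March 20, 2017 and entitled "Method and device for quickly inserting recognized text", the entire contents of which are incorporated herein by reference.
Technical Field
The present application relates to the field of electronic document editing, and in particular to a method and device for quickly inserting recognized text.
Background
When editing a document with document editing software on a terminal device such as a computer or mobile phone, a user sometimes needs to insert into the document a passage of non-copyable text from an external carrier, such as text in a picture, text in a video, or text in a non-copyable electronic document. Existing methods first use format conversion software to convert a non-picture carrier into picture format, then start a separate existing picture recognition program to recognize the text in the picture, and finally copy the recognized text into the document to be edited.
It can be seen that the prior art requires manually starting multiple pieces of software and programs before text in an external carrier can be inserted into the document to be edited, and requires manually copying the recognized text into the document to be edited, so work efficiency is low.
In other solutions, if the user needs to insert a passage of non-copyable text into a document, the user adds the content of that text to the document to be edited by typing it manually, which is inefficient.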
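Summary
The purpose of the embodiments of the present application is to provide a method and device for quickly inserting recognized text, so as to improve work efficiency. The specific technical solutions are as follows:
An embodiment of the present application discloses a method for quickly inserting recognized text, including:
opening a document to be edited;
acquiring a picture acquisition instruction from a user;
obtaining, according to the picture acquisition instruction, a picture containing required text;
recognizing the required text in the picture in first document editing software;
moving the required text into the document to be edited.
Optionally, the picture acquisition instruction includes:
an instruction to acquire a picture by screenshot.
Optionally, obtaining, according to the picture acquisition instruction, a picture containing required text includes:
when the picture acquisition instruction is an instruction to acquire a picture by screenshot, acquiring a picture capture range, wherein the picture capture range contains the required text;
capturing the picture within the picture capture range.
Optionally, acquiring the picture capture range includes:
acquiring the picture range selected by a mouse frame as the picture capture range; or
acquiring the picture range selected by a touch screen touch trajectory frame as the picture capture range.
Optionally, the picture acquisition instruction includes:
an instruction to acquire a picture using a camera.
Optionally, obtaining, according to the user's picture acquisition instruction, a picture containing required text includes:
when the picture acquisition instruction is an instruction to acquire a picture using a camera, starting the camera and confirming that the visible shooting area of the camera contains the required text;
shooting a picture of the visible shooting area of the camera.
Optionally, moving the required text into the document to be edited includes:
moving the required text to a position to be inserted in the document to be edited, wherein the position to be inserted is the position of the mouse cursor or the position of the touch screen cursor.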
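An embodiment of the present application further discloses a device for quickly inserting recognized text, including:
an opening module, configured to open a document to be edited;
an instruction acquisition module, configured to acquire a picture acquisition instruction from a user;
a picture obtaining module, configured to obtain, according to the picture acquisition instruction, a picture containing required text;
an identification module, configured to recognize the required text in the picture in first document editing software;
a text moving module, configured to move the required text into the document to be edited.
Optionally, the picture acquisition instruction includes:
an instruction to acquire a picture by screenshot.
Optionally, the picture obtaining module includes:
a picture capture range obtaining sub-module, configured to acquire a picture capture range when the picture acquisition instruction is an instruction to acquire a picture by screenshot, wherein the picture capture range contains the required text;
a picture capturing sub-module, configured to capture the picture within the picture capture range.
Optionally, the picture capture range obtaining sub-module is specifically configured to:
acquire the picture range selected by a mouse frame as the picture capture range; or
acquire the picture range selected by a touch screen touch trajectory frame as the picture capture range.
Optionally, the picture acquisition instruction includes:
an instruction to acquire a picture using a camera.
Optionally, the picture obtaining module includes:
a camera starting sub-module, configured to start the camera when the picture acquisition instruction is an instruction to acquire a picture using a camera, and to confirm that the visible shooting area of the camera contains the required text;
a shooting sub-module, configured to shoot a picture of the visible shooting area of the camera.
Optionally, the text moving module is specifically configured to:
move the required text to a position to be inserted in the document to be edited, wherein the position to be inserted is the position of the mouse cursor or the position of the touch screen cursor.
An embodiment of the present application further discloses an electronic device including a processor and a memory,
the memory being configured to store a computer program;
the processor being configured to implement any of the above methods for quickly inserting recognized text when executing the program stored in the memory.
An embodiment of the present application further discloses a computer-readable storage medium storing a computer program that, when executed by a processor, implements any of the above methods for quickly inserting recognized text.
An embodiment of the present application further discloses executable program code that, when run, performs any of the above methods for quickly inserting recognized text.
With the method and device for quickly inserting recognized text provided by the embodiments of the present application, when non-copyable required text from an external carrier is to be inserted, the user's picture acquisition instruction is acquired. A picture containing the required text is then obtained according to the picture acquisition instruction. Next, the required text in the picture is recognized in the first document editing software. Finally, the required text is added to the document to be edited. When a document is edited in the first document editing software, the embodiments can obtain a picture containing the required text, recognize the required text in the picture, and automatically insert it into the document to be edited. The embodiments use only the first document editing software to insert the recognized text automatically. Compared with the prior-art approach of opening multiple pieces of software and programs and manually copying the required text, or the prior-art approach of inserting the text by manual typing, this improves work efficiency.
Of course, a product or method implementing the present application does not necessarily need to achieve all of the advantages described above at the same time.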
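Brief Description of the Drawings
To explain the technical solutions of the embodiments of the present application and of the prior art more clearly, the drawings used in the embodiments and the prior art are briefly introduced below. Obviously, the drawings described below are only some embodiments of the present application, and a person of ordinary skill in the art can obtain other drawings from them without creative effort.
FIG. 1 is a schematic flowchart of the method for quickly inserting recognized text provided by an embodiment of the present application;
FIG. 2 is a schematic diagram of a picture containing required text provided by an embodiment of the present application;
FIG. 3 is a schematic flowchart of an example based on the method shown in FIG. 1;
FIG. 4 is a schematic flowchart of still another example based on the method shown in FIG. 1;
FIG. 5 is a schematic structural diagram of the device for quickly inserting recognized text provided by an embodiment of the present application;
FIG. 6 is a schematic structural diagram of an example based on the device shown in FIG. 5;
FIG. 7 is a schematic structural diagram of still another example based on the device shown in FIG. 5;
FIG. 8 is a schematic structural diagram of an electronic device provided by an embodiment of the present application.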
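Detailed Description
To make the purposes, technical solutions, and advantages of the present application clearer, the application is described in further detail below with reference to the drawings and embodiments. Obviously, the described embodiments are only some, not all, of the embodiments of the present application. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments in this application without creative effort fall within the scope of protection of this application.
The embodiments of the present application disclose a method and device for quickly inserting recognized text, which can improve work efficiency.
The method for quickly inserting recognized text disclosed in the embodiments includes: acquiring a picture acquisition instruction from a user; obtaining, according to the picture acquisition instruction, a picture containing required text; recognizing the required text in the picture in first document editing software; and adding the required text to the document to be edited in the first document editing software. It can be seen that the embodiments use only the first document editing software to insert the recognized text automatically. Compared with the prior-art approach of opening multiple pieces of software and programs and manually copying the required text, or the prior-art approach of inserting the text by manual typing, this improves work efficiency.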
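Referring to FIG. 1, FIG. 1 is a flowchart of the method for quickly inserting recognized text according to an embodiment of the present application, including the following steps:
Step 101: open the document to be edited.
The embodiment is executed by the processor of a terminal device; terminal devices include computers, mobile phones, tablet computers, and other devices capable of editing electronic documents.
In this embodiment, if no document is already open, the first document editing software may be used to open a document as the document to be edited; if an open document already exists, step 101 need not be performed.
The first document editing software is software installed on the terminal device for editing electronic documents, such as Kingsoft's office suite WPS Office. All steps in this embodiment may be completed in the first document editing software, or steps 102 and 103 may be implemented by other software.
The first document editing software in this embodiment may include a screenshot function and a picture recognition function. For example, it may integrate a screenshot program, through which the image of a selected area can be captured, and a picture recognition program, through which the text in a picture can be recognized.
Step 102: acquire the user's picture acquisition instruction.
In this embodiment, several ways of acquiring a picture may be provided, such as by screenshot or by shooting with a camera. In this case, the picture acquisition instruction is the instruction by which the user selects among these ways; in other words, if an instruction selecting a way of acquiring a picture is received from the user, the user's picture acquisition instruction is considered to have been acquired. A user selection window may be set up in advance in the first document editing software, offering options for the several ways of acquiring a picture, and the user's click on one of these options is then acquired.
It should be noted that steps 101 and 102 may be performed in either order. That is, the embodiment may use the first document editing software to open the document to be edited and then acquire the user's picture acquisition instruction, which typically suits scenarios where the document is edited first and the required text is obtained afterwards. Alternatively, it may acquire the picture acquisition instruction first and then open the document to be edited, which typically suits scenarios where the carrier of the required text is known and the required text needs to be obtained first; in that scenario, multiple documents to be edited may also be opened so that the required text can be inserted into several of them. The order of steps 101 and 102 depends on the user's habits or the specific usage scenario.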
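Step 103: obtain, according to the picture acquisition instruction, a picture containing the required text.
The picture acquisition instruction in this embodiment is related to the carrier on which the required text resides; the required text is the text that the user actually needs to insert into the document to be edited. For example, the required text may exist in an existing electronic carrier, such as a picture, a video, or a non-copyable electronic document already on the terminal device. For such an electronic carrier, the picture acquisition instruction may be an instruction to acquire a picture by screenshot.
Suppose an instruction to acquire a picture by screenshot is received from the user and the screen of the terminal device currently displays an electronic carrier that includes the required text. In this case, the screenshot range can be determined according to the instruction, and a picture containing the required text can be captured from the displayed electronic carrier according to that range.
In some cases, the required text may instead exist in an existing physical carrier outside the terminal device, such as text in a paper book, on a wall poster, or on a television screen. For such a physical carrier, the picture acquisition instruction may be an instruction to acquire a picture using a camera; according to that instruction, the camera photographs the physical carrier on which the required text resides, yielding a picture containing the required text.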
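Selecting the acquisition routine from the picture acquisition instruction can be sketched as a simple dispatch. The enum `PictureInstruction`, the function `obtain_picture`, and the handler table are hypothetical names chosen for illustration; the two modes are the ones named in the text, screenshot and camera.

```python
from enum import Enum
from typing import Callable, Dict


class PictureInstruction(Enum):
    SCREENSHOT = "screenshot"  # required text is in an electronic carrier
    CAMERA = "camera"          # required text is in a physical carrier


def obtain_picture(
    instruction: PictureInstruction,
    handlers: Dict[PictureInstruction, Callable[[], bytes]],
) -> bytes:
    """Run the acquisition routine registered for the mode the user selected."""
    try:
        handler = handlers[instruction]
    except KeyError:
        raise ValueError(f"no acquisition mode registered for {instruction}") from None
    return handler()
```

Keeping the handlers in a table means further acquisition modes could be added without changing the dispatch itself.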
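It can be seen that the pictures containing required text in this embodiment can come from a wide range of sources, making the approach more broadly applicable.
Step 104: recognize the required text in the picture in the first document editing software.
For example, the picture recognition program integrated in the first document editing software may be used to recognize the required text in the picture. A program interface may be preset in the first document editing software so that different picture recognition programs can be swapped in.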
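A preset program interface that lets different picture recognition programs be swapped in could look like the following sketch. The `TextRecognizer` protocol and the `RecognizerRegistry` class are assumptions made for illustration; the application only states that such an interface may exist, not what it looks like.

```python
from typing import Dict, Protocol


class TextRecognizer(Protocol):
    """Hypothetical interface every pluggable recognition program implements."""

    def recognize(self, picture: bytes) -> str: ...


class RecognizerRegistry:
    """Holds interchangeable recognizers behind one interface."""

    def __init__(self) -> None:
        self._recognizers: Dict[str, TextRecognizer] = {}
        self._active: str = ""

    def register(self, name: str, recognizer: TextRecognizer) -> None:
        self._recognizers[name] = recognizer
        if not self._active:
            self._active = name  # first registered recognizer becomes the default

    def use(self, name: str) -> None:
        if name not in self._recognizers:
            raise KeyError(name)
        self._active = name

    def recognize(self, picture: bytes) -> str:
        if not self._active:
            raise RuntimeError("no recognizer registered")
        return self._recognizers[self._active].recognize(picture)
```

The editor would call only `RecognizerRegistry.recognize`, so replacing the recognition program is a matter of registering another implementation and calling `use`.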
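Step 105: move the required text into the document to be edited.
Step 105 may be: adding the required text to the document to be edited in the first document editing software.
The embodiment adds the recognized text to the document to be edited; it may be added at a preset fixed position, at a random position, or at a position to be inserted set by the user in the document to be edited.
The text may be added while the picture is being recognized; that is, as soon as a character is recognized it is immediately added to the document to be edited. This synchronous mode lets the user use or edit the already recognized part of the text as early as possible. Alternatively, the required text may be added as a whole after all of it has been recognized in the picture. This whole-text mode helps preserve the integrity of the required text and is better suited to cases where the required text is used or edited as a whole.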
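The difference between the synchronous mode and the whole-text mode can be sketched with a stream of recognized characters. The function names and the modeling of the document as a list of appended chunks are assumptions made for illustration only.

```python
from typing import Iterable, List


def add_incrementally(document: List[str], recognized: Iterable[str]) -> int:
    """Append each character as soon as it is recognized; return the update count."""
    updates = 0
    for char in recognized:
        document.append(char)  # the user can already see and edit this character
        updates += 1
    return updates


def add_as_whole(document: List[str], recognized: Iterable[str]) -> int:
    """Wait for recognition to finish, then append the text in a single update."""
    text = "".join(recognized)
    if not text:
        return 0
    document.append(text)
    return 1
```

Both functions end with the same text in the document; they differ in how many intermediate states the user sees, which is the trade-off the paragraph above describes.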
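The specific form in which the text is added from the picture to the document to be edited may be sliding, scrolling, jumping, or other forms; the embodiment does not limit the specific form. In all these forms, the text recognized in the picture is inserted automatically into the document to be edited, without manual movement operations such as copying, pasting, or dragging.
Therefore, the embodiment spares the user from manually copying the recognized text into the document to be edited, achieves automatic insertion, and can improve work efficiency.
It can be seen that the method for quickly inserting recognized text provided by the embodiment acquires the user's picture acquisition instruction, then obtains a picture containing the required text according to that instruction, then recognizes the required text in the picture in the first document editing software, and finally adds the required text to the document to be edited. When a document is edited in the first document editing software, the embodiment can obtain a picture containing the required text and move the required text recognized in the picture into the document to be edited, so that the required text in the picture is recognized and automatically inserted. The embodiment uses only the first document editing software to insert the recognized text automatically. Compared with the prior-art approach of opening multiple pieces of software and programs and manually copying the required text, or the prior-art approach of inserting the text by manual typing, this improves work efficiency.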
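The implementation of the embodiment is illustrated below by example. Referring to FIG. 2, FIG. 2 is a picture containing required text. Specifically, while editing a document with document editing software, the user is also reading a PDF (Portable Document Format) document and finds that part of its text is required text (see the text in the box in FIG. 2), which the user wishes to insert into the document to be edited. With some prior-art solutions, external format conversion software or a program must first convert the PDF document into picture format, giving the picture shown in FIG. 2; another picture recognition software or program then recognizes all the text in the picture; finally the user manually copies and pastes the required text in the box into the document to be edited. This process requires opening the document editing software, the format conversion software or program, and the picture recognition software or program, and after all the text in the picture has been recognized the user must manually select and copy the required text, so work efficiency is low. With other prior-art solutions, the user must type the required text in the box manually, which is also inefficient.
With the method of the embodiment, while the first document editing software is used to edit a document, the user's picture acquisition instruction can be acquired. For example, the first document editing software may offer the user several options corresponding to several ways of acquiring a picture, such as by screenshot or by using a camera. The way selected by the user is determined from the user's click on one of these options, and a picture containing the required text is obtained in that way.
Continuing with FIG. 2, if the picture is acquired by screenshot, the screenshot program integrated in the first document editing software obtains a picture of the boxed part of the PDF document. For example, if the user selects the text in the box in FIG. 2 with the mouse, the screenshot range is determined from the user's operation to include the boxed text, and a screenshot is taken according to that range to obtain a picture containing the boxed text in FIG. 2. The picture recognition program integrated in the first document editing software then recognizes the required text in the picture, that is, the text in the box. Finally, the required text is added to the document to be edited. The embodiment completes the whole process using only the first document editing software and can automatically recognize and insert a part of the text in a picture, which improves work efficiency.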
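Referring to FIG. 3, FIG. 3 is a flowchart of an example based on the method shown in FIG. 1, including the following steps:
Step 301: open the document to be edited.
In this embodiment, if no document is already open, the first document editing software may be used to open a document as the document to be edited; if an open document already exists, step 301 need not be performed.
The embodiment uses the first document editing software to open the document to be edited. Specifically, the terminal device receives the user's instruction to open a document, such as a click on the icon of the first document editing software, a click on the icon of the document to be edited, or a voice operation instruction. The processor of the terminal device opens the document to be edited according to the instruction to open the document.
For example, if the instruction is a click on the icon of the first document editing software, the processor of the terminal device first starts the first document editing software and then receives the user's instruction selecting a document; after the user's selection of a document is acquired, that document is opened as the document to be edited.
For example, if the instruction is a click on the icon of the document to be edited, the processor of the terminal device uses the first document editing software to open that document.
For example, if the instruction is a voice operation instruction, such as an instruction to open the document named "File 1", the processor of the terminal device finds the document named "File 1" and uses the first document editing software to open it as the document to be edited.
The document to be edited may be opened in many ways, which may be combined with one another; they are not listed one by one here.
Step 302: acquire the user's instruction to acquire a picture by screenshot.
In this embodiment, an option may be provided in a user selection window preset in the first document editing software so that the user can choose to acquire a picture by screenshot; when the user's click on this option is detected, the user's picture acquisition instruction is considered to have been acquired. This option may be located in the options window of the tool menu bar of the first document editing software, or in a user dialog window outside that tool menu bar.
It should be noted that steps 301 and 302 may be performed in either order. That is, the embodiment may use the first document editing software to open the document to be edited and then acquire the user's instruction to acquire a picture by screenshot, or it may acquire that instruction first and then open the document to be edited. The order depends on the user's habits or the specific usage scenario.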
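Step 303: acquire the picture capture range.
Step 303 may be: determining the screenshot range according to the picture acquisition instruction.
The embodiment uses a method of acquiring the picture capture range that matches the type of terminal device. For a terminal device that uses a mouse, the picture range selected by the mouse frame is acquired as the picture capture range, that is, the screenshot range. For example, among common terminal devices, a desktop computer usually uses a mouse, so for such a computer the picture range that the user selects with the mouse frame is acquired as the picture capture range. The picture capture range contains the required text. The picture range selected by the mouse frame may be the region traced by dragging the mouse freely over the electronic carrier containing the required text, and may be a rectangle, a circle, or any irregular shape. Taking FIG. 2 as an example, a rectangular region formed by a continuous click-and-drag of the mouse over the PDF document can be acquired, namely the box in FIG. 2, and the range of the box is taken as the picture capture range.
The embodiment may also preset the shape of the mouse frame, such as a rectangle; that is, rectangular frames of different sizes appear as the user drags the mouse, and the user only needs to adjust the frame to a size that contains the required text. This helps standardize the capture range, because capturing with a regular shape such as a rectangle is easier and more efficient. The thickness and color of the mouse frame may also be preset, chosen to be conspicuous and easy to see according to the user's habits or the background color of the electronic carrier. A confirmation step may also be provided after the picture range selected by the mouse frame is acquired, such as a user dialog window prompting the user to confirm, to avoid user misoperation.
Alternatively, for a terminal device that uses a touch screen, the picture range selected by the touch trajectory frame is acquired as the picture capture range, that is, the screenshot range. For example, for a common touch screen mobile phone, the picture range selected by the user's touch trajectory frame is acquired as the picture capture range. The picture capture range contains the required text. The picture range selected by the touch trajectory frame may be the region traced by a finger or other tool dragged freely over the electronic carrier containing the required text, and may be a rectangle, a circle, or any irregular shape. Taking FIG. 2 as an example, a rectangular region formed by a continuous touch-and-drag of the user's finger over the PDF document can be acquired, such as the box in FIG. 2, and the range of the box is taken as the picture capture range. In general the range selected by a touch trajectory frame will not be a regular shape, but the embodiment may preset regular shapes for the touch trajectory frame, such as a rectangle or circle, and convert the user's irregular trajectory frame into the best-matching regular shape. For example, if the user's trajectory frame is irregular, a preset regular shape such as a rectangle that contains the required text enclosed by the trajectory frame is found. This helps standardize the capture range, because capturing with a regular shape such as a rectangle is easier and more efficient. The thickness and color of the touch trajectory frame may also be preset, chosen to be conspicuous and easy to see according to the user's habits or the background color of the electronic carrier. A confirmation step may also be provided after the picture range selected by the touch trajectory frame is acquired, such as a user dialog window prompting the user to confirm, to avoid user misoperation.
The embodiment may also provide a full-screen capture mode for capturing the whole picture without selecting a picture capture range; this often suits scenarios where all the text in the whole picture is to be recognized as required text. For terminal devices that use a mouse or a touch screen, a corresponding full-screen capture mode may be provided, such as a full-screen capture shortcut key on a mobile phone.
Of course, in practice there are many other terminal devices that use a mouse or touch screen; the corresponding ways of acquiring the picture capture range are not described one by one.
In this embodiment, the picture capture range may be acquired for an existing electronic carrier containing required text on the terminal device on which the first document editing software runs. Alternatively, an electronic carrier containing required text may first be obtained from outside that terminal device, for example by Bluetooth transfer or over the Internet, and then a screenshot of that carrier is taken to capture a picture containing the required text. As a further alternative, a screenshot may be taken directly of an electronic carrier containing required text on another terminal device through remote connection and control over the Internet. The subsequent steps are then performed to recognize the required text and move it to the position to be inserted in the document to be edited on the user's terminal device.
The embodiment can acquire the picture capture range for many kinds of electronic carriers that contain required text and are obtained through many channels, and is applicable to many kinds of terminal devices that use a mouse or a touch screen. The method has a wide range of application and is highly practical.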
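Step 304: capture the picture within the picture capture range.
Step 304 may be: capturing a picture containing the required text according to the screenshot range.
In this embodiment, the screenshot program in the first document editing software captures the picture containing the required text according to the screenshot range determined in step 303. The embodiment may also capture the picture in the full-screen capture mode.
Step 305: recognize the required text in the picture in the first document editing software.
In this embodiment, the required text in the picture is recognized by the picture recognition program in the first document editing software.
Step 306: move the required text to the position to be inserted in the document to be edited.
Step 306 may be: adding the required text to the position to be inserted in the document to be edited in the first document editing software.
For example, if the terminal device uses a mouse, the acquired position to be inserted is the position of the mouse cursor; if the terminal device uses a touch screen, the acquired position to be inserted is the position of the touch screen cursor.
The embodiment may also provide a confirmation step, such as a user dialog window that prompts the user to confirm the position to be inserted, to avoid user misoperation. The required text is then moved to the position to be inserted in the document to be edited.
The text movement mode and the specific form in which the text is added from the picture to the document to be edited may be as described in step 105 and are not repeated here.
The embodiment may also acquire the position to be inserted after step 301. That is, after step 301 opens the document to be edited, the position of the mouse cursor or of the touch screen cursor may be detected and taken as the position to be inserted. A confirmation step may also be provided after the detection, such as a user dialog window that prompts the user to confirm the position to be inserted, to avoid user misoperation. Then, at step 306, the required text is moved directly to the position to be inserted in the document to be edited.
It can be seen that the method for quickly inserting recognized text provided by the embodiment acquires the user's instruction to acquire a picture by screenshot, then acquires the picture capture range, then captures the picture within that range, then recognizes the required text in the picture in the first document editing software, and finally adds the required text to the position to be inserted in the document to be edited. The embodiment is directed to many kinds of electronic carriers containing required text. When a document is edited in the first document editing software, a picture containing the required text can be obtained by screenshot, and the required text in the picture is recognized and automatically inserted at the position to be inserted in the document to be edited. The embodiment uses only the first document editing software to insert the recognized text automatically at the position to be inserted. Compared with the prior-art approach of opening multiple pieces of software and programs and manually copying the required text, or the prior-art approach of inserting the text by manual typing, this improves work efficiency.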
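Referring to FIG. 4, FIG. 4 is a flowchart of still another example based on the method shown in FIG. 1, including the following steps:
Step 401: open the document to be edited.
In this embodiment, if no document is already open, the first document editing software may be used to open a document as the document to be edited; if an open document already exists, step 401 need not be performed.
The embodiment uses the first document editing software to open the document to be edited. Specifically, the terminal device receives the user's instruction to open a document, such as a click on the icon of the first document editing software, a click on the icon of the document to be edited, or a voice operation instruction. The processor of the terminal device opens the document to be edited according to the instruction to open the document.
For example, if the instruction is a click on the icon of the first document editing software, the processor of the terminal device first starts the first document editing software and then receives the user's instruction selecting a document; after the user's click on a document is acquired, that document is opened as the document to be edited.
For example, if the instruction is a click on the icon of the document to be edited, the processor of the terminal device uses the first document editing software to open that document.
For example, if the instruction is a voice operation instruction, such as an instruction to open the document named "File 1", the processor of the terminal device finds the document named "File 1" and uses the first document editing software to open it as the document to be edited.
The document to be edited may be opened in many ways, which may be combined with one another; they are not listed one by one here.
Step 402: acquire the user's instruction to acquire a picture using a camera.
In this embodiment, an option may be provided in a user selection window preset in the first document editing software so that the user can choose to acquire a picture using the camera; when the user's click on this option is detected, the user's picture acquisition instruction is considered to have been acquired. This option may be located in the options window of the tool menu bar of the first document editing software, or in a user dialog window outside that tool menu bar.
It should be noted that steps 401 and 402 may be performed in either order. That is, the embodiment may use the first document editing software to open the document to be edited and then acquire the user's instruction to acquire a picture using the camera, or it may acquire that instruction first and then open the document to be edited. The order depends on the user's habits or the specific usage scenario.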
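Step 403: start the camera and confirm that the visible shooting area of the camera contains the required text.
Steps 403 and 404 may be: shooting a picture containing the required text using a camera.
The camera in this embodiment may be a camera of the terminal device on which the first document editing software runs, such as the camera of the user's computer or mobile phone, or a camera outside that terminal device, such as the camera of another user's computer or mobile phone, a camera of a traffic monitoring system, or a camera of corridor monitoring equipment.
The embodiment may start the camera of the user's terminal device, or may start another camera connected to the user's terminal device through the Internet, a local area network, Bluetooth, or the like.
In this embodiment, the user may confirm that the visible shooting area of the camera contains the required text, as sharply as possible, to facilitate the subsequent shot. If the visible shooting area cannot contain all of the required text, or the sharpness of the required text does not meet the shooting requirements, the user can adjust the physical carrier containing the required text or adjust the camera. For example, the position of a paper book printed with the required text within the visible shooting area can be adjusted manually, or a preset camera adjustment program can adjust the parameters of the camera, including the distance between the camera and the paper book, the shooting angle of the camera, and the focal length of the camera, until the visible shooting area contains the required text and the sharpness meets the shooting requirements.
Alternatively, the terminal device on which the first document editing software runs may itself determine whether the current shooting area of the camera contains the required text. If it does, the camera is controlled to shoot the current shooting area and a picture containing the required text is obtained. If it does not, the position of the camera is adjusted and the step of determining whether the current shooting area contains the required text is performed again.
That is, the camera can be adjusted by the terminal device on which the first document editing software runs, so that the camera can shoot a picture containing the required text.
Step 404: shoot a picture of the visible shooting area of the camera.
In this embodiment, the camera of the user's terminal device may shoot the picture of its visible shooting area, or another camera outside the user's terminal device may be connected through the Internet, a local area network, Bluetooth, or the like and used to shoot the picture of its visible shooting area. For example, a surveillance camera in a corridor may be connected and used to shoot the picture of its visible shooting area.
The embodiment can use a camera to shoot pictures of many kinds of physical carriers containing required text, such as paper books, wall posters, and billboards. The method thus extends the carriers of required text to many kinds of physical carriers, giving it a wider range of application and greater practicality.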
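Step 405: recognize the required text in the picture in the first document editing software.
In this embodiment, the required text in the picture is recognized by the picture recognition program in the first document editing software.
Step 406: move the required text to the position to be inserted in the document to be edited.
Step 406 may be: adding the required text to the position to be inserted in the document to be edited in the first document editing software.
For example, if the terminal device uses a mouse, the acquired position to be inserted is the position of the mouse cursor; if the terminal device uses a touch screen, the acquired position to be inserted is the position of the touch screen cursor. The embodiment may also provide a confirmation step, such as a user dialog window that prompts the user to confirm the position to be inserted, to avoid user misoperation. The required text is then moved to the position to be inserted in the document to be edited.
The embodiment may also acquire the position to be inserted after step 401. That is, after step 401 opens the document to be edited, the position of the mouse cursor or of the touch screen cursor may be detected and taken as the position to be inserted. A confirmation step may also be provided after the detection, such as a user dialog window that prompts the user to confirm the position to be inserted, to avoid user misoperation. Then, at step 406, the required text is moved directly to the position to be inserted in the document to be edited.
It can be seen that the device for quickly inserting recognized text provided by the embodiment acquires the user's instruction to acquire a picture using the camera, then uses the camera to shoot a picture containing the required text, then recognizes the required text in the picture in the first document editing software, and finally adds the required text to the position to be inserted in the document to be edited. The embodiment is directed to many kinds of physical carriers containing required text. When a document is edited in the first document editing software, a picture containing the required text can be obtained by shooting with the camera, and the required text in the picture is recognized and automatically inserted at the position to be inserted in the document to be edited. The embodiment uses only the first document editing software to insert the recognized text automatically at the position to be inserted. Compared with the prior-art approach of opening multiple pieces of software and programs and manually copying the required text, or the prior-art approach of inserting the text by manual typing, this improves work efficiency.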
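Referring to FIG. 5, FIG. 5 is a structural diagram of the device for quickly inserting recognized text according to an embodiment of the present application, including:
the opening module 501, configured to open a document to be edited.
the instruction acquisition module 502, configured to acquire a picture acquisition instruction from the user.
the picture obtaining module 503, configured to obtain a picture containing the required text according to the picture acquisition instruction.
the identification module 504, configured to recognize the required text in the picture in the first document editing software.
the text moving module 505, configured to move the required text into the document to be edited.
The text moving module 505 is specifically configured to: add the required text to the document to be edited in the first document editing software.
It can be seen that, in the device provided by the embodiment, the document to be edited is first opened in the first document editing software. Second, the user's picture acquisition instruction is acquired. Third, a picture containing the required text is obtained according to the picture acquisition instruction. The required text in the picture is then recognized in the first document editing software. Finally, the required text is moved into the document to be edited. When a document is edited in the first document editing software, the embodiment can obtain a picture containing the required text, recognize the required text in the picture, and automatically insert it into the document to be edited. The embodiment uses only the first document editing software to insert the recognized text automatically, unlike the prior art, which requires opening multiple pieces of software and programs and manually copying the required text; the embodiment can therefore improve work efficiency.
It should be noted that the device in this embodiment is a device that applies the above method for quickly inserting recognized text, so all embodiments of that method are applicable to the device and can achieve the same or similar beneficial effects.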
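On the basis of FIG. 5, as a preferred embodiment corresponding to the method shown in FIG. 3, refer to FIG. 6, which is a structural diagram of an example based on the device shown in FIG. 5, including:
In this embodiment, the picture acquisition instruction is: an instruction to acquire a picture by screenshot;
In this embodiment, the picture obtaining module 603 includes:
the picture capture range obtaining sub-module 6031, configured to determine the screenshot range according to the picture acquisition instruction.
the picture capturing sub-module 6032, configured to capture a picture containing the required text according to the screenshot range.
In this embodiment, the picture capture range obtaining sub-module 6031 is specifically configured to:
acquire the picture range selected by a mouse frame as the screenshot range; or
acquire the picture range selected by a touch screen touch trajectory frame as the screenshot range.
In this embodiment, the text moving module 605 is specifically configured to:
move the required text to the position to be inserted in the document to be edited, where the position to be inserted is the position of the mouse cursor or the position of the touch screen cursor.
It can be seen that, in the device provided by the embodiment, the document to be edited is first opened in the first document editing software. Second, the user's instruction to acquire a picture by screenshot is acquired. Third, the picture capture range is acquired. Next, the picture within the picture capture range is captured. The required text in the picture is then recognized in the first document editing software. Finally, the required text is moved to the position to be inserted in the document to be edited. The embodiment is directed to many kinds of electronic carriers containing required text. When a document is edited in the first document editing software, a picture containing the required text can be obtained by screenshot, and the required text in the picture is recognized and automatically inserted at the position to be inserted in the document to be edited. The embodiment uses only the first document editing software to insert the recognized text automatically at the position to be inserted. This differs from the prior art, which requires opening multiple pieces of software and programs, manually selecting the required text from the text recognized in the picture, and copying it to the position to be inserted in the document to be edited; the embodiment can therefore improve work efficiency.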
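On the basis of FIG. 5, as a preferred embodiment corresponding to the method shown in FIG. 4, refer to FIG. 7. FIG. 7 is a structural diagram of another example based on the apparatus shown in FIG. 5, including the following:
In this embodiment of the present application, the picture acquisition instruction is:
An instruction to obtain a picture with a camera.
In this embodiment of the present application, the picture acquiring module 703 is specifically configured to use the camera to photograph a picture containing the required text.
In this embodiment of the present application, the picture acquiring module 703 is specifically configured to:
Determine whether the current shooting area of the camera contains the required text;
If it does, control the camera to photograph the current shooting area to obtain a picture containing the required text;
If it does not, adjust the position of the camera and return to the step of determining whether the current shooting area of the camera contains the required text.
In this embodiment of the present application, the text moving module 705 is specifically configured to:
Add the required text to the insertion position in the document to be edited, where the insertion position is the position of the mouse cursor or the position of the touch-screen cursor.
It can be seen that the apparatus for quickly inserting recognized text provided by this embodiment first opens the document to be edited in the first document editing software, then acquires the user's instruction to obtain a picture with a camera, then starts the camera and confirms that the area visible to the camera contains the required text, then photographs the picture within that visible area, then recognizes the required text in the picture in the first document editing software, and finally moves the required text to the insertion position in the document to be edited. For the various physical carriers that contain required text, this embodiment can, while a document is being edited in the first document editing software, obtain a picture containing the required text by photographing with the camera, recognize the required text in the picture, and automatically insert it at the insertion position in the document to be edited. This embodiment uses only the first document editing software to insert the recognized text at the insertion position automatically, unlike the prior-art approach of opening multiple pieces of software or programs, manually selecting the required text from the text recognized in the picture, and copying it to the insertion position in the document to be edited, and can therefore improve work efficiency.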
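The judge, shoot, or adjust loop of the picture acquiring module 703 can be sketched as below. The camera is represented by four injected callables, and the function name `capture_when_text_visible` as well as the `max_attempts` limit are assumptions of this sketch; the application itself describes returning to the judging step without stating any bound.

```python
from typing import Callable, Optional, TypeVar

Frame = TypeVar("Frame")
Picture = TypeVar("Picture")


def capture_when_text_visible(
    preview: Callable[[], Frame],
    contains_text: Callable[[Frame], bool],
    adjust: Callable[[], None],
    shoot: Callable[[], Picture],
    max_attempts: int = 5,
) -> Optional[Picture]:
    """Shoot once the current shooting area contains the required text.

    Returns None if the text was not seen within `max_attempts` checks
    (the attempt limit is an addition of this sketch).
    """
    for _ in range(max_attempts):
        frame = preview()
        if contains_text(frame):
            # The current shooting area contains the required text: shoot it.
            return shoot()
        # Otherwise adjust the camera position and judge again.
        adjust()
    return None
```

How `contains_text` decides is left open here; in practice it could be a lightweight text detector run on the preview frame, or simply a confirmation from the user.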
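An embodiment of the present application further discloses an electronic device which, as shown in FIG. 8, includes a processor 801 and a memory 802,
The memory 802 is configured to store a computer program;
The processor 801 is configured to implement any one of the above methods for quickly inserting recognized text when executing the program stored in the memory 802.
An embodiment of the present application further discloses a computer-readable storage medium in which a computer program is stored, where the computer program, when executed by a processor, implements any one of the above methods for quickly inserting recognized text.
An embodiment of the present application further discloses executable program code, where the executable program code is configured to be run to perform any one of the above methods for quickly inserting recognized text.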
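It should be noted that, in this document, relational terms such as first and second are used only to distinguish one entity or operation from another, and do not necessarily require or imply any such actual relationship or order between these entities or operations. Moreover, the terms "include", "comprise", or any other variant thereof are intended to cover non-exclusive inclusion, so that a process, method, article, or device that includes a series of elements includes not only those elements but also other elements not expressly listed, or elements inherent to such a process, method, article, or device. Without further limitation, an element defined by the phrase "including a ..." does not exclude the existence of additional identical elements in the process, method, article, or device that includes the element.
The embodiments in this specification are described in a related manner; for the same or similar parts, the embodiments may be referred to one another, and each embodiment focuses on its differences from the others. In particular, the apparatus embodiments shown in FIGS. 5-7, the electronic device embodiment shown in FIG. 8, the above computer-readable storage medium embodiment, and the above executable program code embodiment are described relatively briefly because they are substantially similar to the method embodiments shown in FIGS. 1-4; for related details, refer to the corresponding description of the method embodiments shown in FIGS. 1-4.
The above are merely preferred embodiments of the present application and are not intended to limit it. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present application shall fall within the scope of protection of the present application.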

Claims (15)

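  1. A method for quickly inserting recognized text, characterized by comprising:
    acquiring a picture acquisition instruction of a user;
    obtaining, according to the picture acquisition instruction, a picture containing required text;
    recognizing the required text in the picture in first document editing software;
    adding the required text to a document to be edited in the first document editing software.
  2. The method according to claim 1, characterized in that the picture acquisition instruction is an instruction to obtain a picture by taking a screenshot;
    the obtaining, according to the picture acquisition instruction, a picture containing required text comprises:
    determining a screenshot range according to the picture acquisition instruction;
    capturing, according to the screenshot range, a picture containing the required text.
  3. The method according to claim 2, characterized in that the determining a screenshot range according to the picture acquisition instruction comprises:
    acquiring a picture range selected by a mouse selection box as the screenshot range; or
    acquiring a picture range selected by a touch-screen touch-trajectory box as the screenshot range.
  4. The method according to claim 1, characterized in that the picture acquisition instruction is an instruction to obtain a picture with a camera;
    the obtaining, according to the user's picture acquisition instruction, a picture containing required text comprises:
    using the camera to photograph a picture containing the required text.
  5. The method according to claim 4, characterized in that the using the camera to photograph a picture containing the required text comprises:
    determining whether a current shooting area of the camera contains the required text;
    if it does, controlling the camera to photograph the current shooting area to obtain a picture containing the required text;
    if it does not, adjusting the position of the camera and returning to the step of determining whether the current shooting area of the camera contains the required text.
  6. The method according to claim 1, characterized in that the adding the required text to a document to be edited in the first document editing software comprises:
    adding the required text to an insertion position in the document to be edited, wherein the insertion position is the position of a mouse cursor or the position of a touch-screen cursor.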
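  7. An apparatus for quickly inserting recognized text, characterized by comprising:
    an instruction acquiring module, configured to acquire a picture acquisition instruction of a user;
    a picture acquiring module, configured to obtain, according to the picture acquisition instruction, a picture containing required text;
    a recognition module, configured to recognize the required text in the picture in first document editing software;
    a text moving module, configured to add the required text to a document to be edited in the first document editing software.
  8. The apparatus according to claim 7, characterized in that the picture acquisition instruction is an instruction to obtain a picture by taking a screenshot; the picture acquiring module comprises:
    a screenshot range acquiring submodule, configured to determine a screenshot range according to the picture acquisition instruction;
    a picture capturing submodule, configured to capture, according to the screenshot range, a picture containing the required text.
  9. The apparatus according to claim 8, characterized in that the screenshot range acquiring submodule is specifically configured to:
    acquire a picture range selected by a mouse selection box as the screenshot range; or
    acquire a picture range selected by a touch-screen touch-trajectory box as the screenshot range.
  10. The apparatus according to claim 7, characterized in that the picture acquisition instruction is an instruction to obtain a picture with a camera;
    the picture acquiring module is specifically configured to use the camera to photograph a picture containing the required text.
  11. The apparatus according to claim 10, characterized in that the picture acquiring module is specifically configured to:
    determine whether a current shooting area of the camera contains the required text;
    if it does, control the camera to photograph the current shooting area to obtain a picture containing the required text;
    if it does not, adjust the position of the camera and return to the step of determining whether the current shooting area of the camera contains the required text.
  12. The apparatus according to claim 7, characterized in that the text moving module is specifically configured to:
    add the required text to an insertion position in the document to be edited, wherein the insertion position is the position of a mouse cursor or the position of a touch-screen cursor.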
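  13. An electronic device, characterized by comprising a processor and a memory,
    the memory being configured to store a computer program;
    the processor being configured to implement the method steps of any one of claims 1-6 when executing the program stored in the memory.
  14. A computer-readable storage medium, characterized in that a computer program is stored in the computer-readable storage medium, and the computer program, when executed by a processor, implements the method steps of any one of claims 1-6.
  15. Executable program code, characterized in that the executable program code is configured to be run to perform the method steps of any one of claims 1-6.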
PCT/CN2018/079489 2017-03-20 2018-03-19 一种快速插入识别文字的方法及装置 WO2018171560A1 (zh)

Priority Applications (4)

Application Number Priority Date Filing Date Title
EP18772598.1A EP3605277A4 (en) 2017-03-20 2018-03-19 METHOD AND DEVICE FOR QUICKLY INSERTING RECOGNIZED WORDS
SG11201908705W SG11201908705WA (en) 2017-03-20 2018-03-19 Method and device for quickly inserting recognized word
US16/496,080 US20200042581A1 (en) 2017-03-20 2018-03-19 Method and Device for Quickly Inserting Recognized Word
JP2020500950A JP2020515996A (ja) 2017-03-20 2018-03-19 認識した語を迅速に挿入する方法およびデバイス

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710165750.8 2017-03-20
CN201710165750.8A CN108628814A (zh) 2017-03-20 2017-03-20 一种快速插入识别文字的方法及装置

Publications (1)

Publication Number Publication Date
WO2018171560A1 true WO2018171560A1 (zh) 2018-09-27

Family

ID=63584165

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/079489 WO2018171560A1 (zh) 2017-03-20 2018-03-19 一种快速插入识别文字的方法及装置

Country Status (6)

Country Link
US (1) US20200042581A1 (zh)
EP (1) EP3605277A4 (zh)
JP (1) JP2020515996A (zh)
CN (1) CN108628814A (zh)
SG (1) SG11201908705WA (zh)
WO (1) WO2018171560A1 (zh)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109874051A (zh) * 2019-02-21 2019-06-11 百度在线网络技术(北京)有限公司 视频内容处理方法、装置及设备
CN110166621B (zh) * 2019-04-17 2020-09-15 维沃移动通信有限公司 一种文字处理方法及终端设备
CN110275667B (zh) * 2019-06-25 2021-12-17 努比亚技术有限公司 内容显示方法、移动终端及计算机可读存储介质
CN111611945A (zh) * 2020-05-25 2020-09-01 江西金格科技股份有限公司 一种通用的AutoCAD图框识别方法
CN113448461A (zh) * 2020-06-24 2021-09-28 北京新氧科技有限公司 信息处理方法、装置及设备

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1505431A (zh) * 2002-09-11 2004-06-16 三星电子株式会社 用于从图象屏识别字符图象的装置和方法
CN1964413A (zh) * 2006-11-25 2007-05-16 王永顺 小型图书馆文字图像扫描摘录系统
CN101262513A (zh) * 2008-04-24 2008-09-10 陶建新 有微型扫描仪的拍照手机
US20100188419A1 (en) * 2009-01-28 2010-07-29 Google Inc. Selective display of ocr'ed text and corresponding images from publications on a client device
CN101881999A (zh) * 2010-06-21 2010-11-10 安阳师范学院 甲骨文视频输入系统及实现方法
CN106156761A (zh) * 2016-08-10 2016-11-23 北京交通大学 面向移动终端拍摄的图像表格检测与识别方法

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08329190A (ja) * 1995-03-24 1996-12-13 Fuji Xerox Co Ltd 文字認識装置
JP2001175572A (ja) * 1999-12-20 2001-06-29 Minolta Co Ltd 携帯電子メール端末および電子メール文書作成方法
JP2003331217A (ja) * 2002-03-08 2003-11-21 Nec Corp 文字入力装置、文字入力方法及び文字入力プログラム
JP2005346628A (ja) * 2004-06-07 2005-12-15 Omron Corp 文字入力方法、文字入力装置、及びプログラム
JP2010205136A (ja) * 2009-03-05 2010-09-16 Fujitsu Ltd 音声読み上げ装置、携帯電話機及びコンピュータプログラム
JP2011248669A (ja) * 2010-05-27 2011-12-08 Ricoh Co Ltd 文書管理プログラム、記録媒体、情報処理装置、及び文書管理方法
JP5554177B2 (ja) * 2010-08-17 2014-07-23 ヤフー株式会社 情報表示装置及び方法
JP6442336B2 (ja) * 2015-03-18 2018-12-19 セコム株式会社 異常検知端末及びプログラム
CN106445144A (zh) * 2016-09-27 2017-02-22 宇龙计算机通信科技(深圳)有限公司 一种笔记方法、装置及终端

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP3605277A4

Also Published As

Publication number Publication date
US20200042581A1 (en) 2020-02-06
SG11201908705WA (en) 2019-10-30
JP2020515996A (ja) 2020-05-28
EP3605277A1 (en) 2020-02-05
CN108628814A (zh) 2018-10-09
EP3605277A4 (en) 2020-04-01

Similar Documents

Publication Publication Date Title
WO2018171560A1 (zh) 一种快速插入识别文字的方法及装置
CN107659416B (zh) 一种会议记录分享的方法、装置、会议终端和存储介质
US20210056253A1 (en) Method and apparatus for generating image file
JP7125834B2 (ja) 画像取得方法および装置
US7966558B2 (en) Snipping tool
WO2018058749A1 (zh) 一种内容分享的方法及装置
US20180210634A1 (en) Method and device for generating captured image for display windows
US20140043255A1 (en) Electronic device and image zooming method thereof
CN112673617B (zh) 针对图像的多区域检测
WO2018171561A1 (zh) 一种快速插入语音载体中文字的方法及装置
US11190653B2 (en) Techniques for capturing an image within the context of a document
US11057558B2 (en) Using change of scene to trigger automatic image capture
US11733831B2 (en) Devices and methods of intelligent interaction, and storage media
US10686983B1 (en) Automatic image capture mode based on changes in a target region
US20160247323A1 (en) Head mounted display, information processing system and information processing method
WO2022228301A1 (zh) 文档生成方法、装置和电子设备
WO2017011680A1 (en) Device and method for processing data
US11328120B2 (en) Importing text into a draft email
CN112132762A (zh) 一种数据处理方法、装置和录音设备
JP2010039538A (ja) 情報処理装置、情報表示処理システム、情報処理方法、および情報処理プログラム
TWI598757B (zh) 嵌入選取內容至檔案的方法
JP2015121845A (ja) 情報処理装置、情報処理方法及びプログラム
JP2017162080A (ja) 機器、情報処理システム、表示制御方法、及びプログラム
WO2022200879A1 (en) Systems and methods for managing digital notes for collaboration
CN117193610A (zh) 窗口拖拽插入截图的方法、系统及智能交互平板

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18772598

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2020500950

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2018772598

Country of ref document: EP

Effective date: 20191021