CN110929480A - Document editing method and device, computer storage medium and terminal - Google Patents

Document editing method and device, computer storage medium and terminal

Info

Publication number
CN110929480A
Authority
CN
China
Prior art keywords
view area
editing
determining
document
character recognition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811095007.0A
Other languages
Chinese (zh)
Inventor
邓斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Kingsoft Office Software Inc
Zhuhai Kingsoft Office Software Co Ltd
Guangzhou Kingsoft Mobile Technology Co Ltd
Original Assignee
Beijing Kingsoft Office Software Inc
Zhuhai Kingsoft Office Software Co Ltd
Guangzhou Kingsoft Mobile Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Kingsoft Office Software Inc, Zhuhai Kingsoft Office Software Co Ltd, Guangzhou Kingsoft Mobile Technology Co Ltd
Priority to CN201811095007.0A
Publication of CN110929480A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/20 Image preprocessing
    • G06V10/22 Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition

Abstract

A document editing method and device, a computer storage medium and a terminal. The method comprises: determining a view area needing character recognition; performing character recognition on the determined view area; and editing the document using the recognized characters. In the embodiment of the invention, the characters in a view area are recognized after the view area needing character recognition has been selected, which improves document editing efficiency.

Description

Document editing method and device, computer storage medium and terminal
Technical Field
The present disclosure relates to, but is not limited to, information editing technologies, and in particular to a method, an apparatus, a computer storage medium, and a terminal for document editing.
Background
In daily life and work, the electronic documents that users consult come in various formats, including Word files and Portable Document Format (PDF) files; some of these files are scanned files obtained by scanning.
For a scanned file, when a user needs part of the text information it contains, converting the format of the scanned file using related technology introduces both layout changes and conversion errors, and both reduce the user's document editing efficiency.
Disclosure of Invention
The following is a summary of the subject matter described in detail herein. This summary is not intended to limit the scope of the claims.
The embodiment of the invention provides a method and a device for editing a document, a computer storage medium and a terminal, which can improve the document editing efficiency.
The embodiment of the invention provides a method for editing a document, which comprises the following steps:
determining a view area needing character recognition;
performing character recognition on the determined view area;
and editing the document using the recognized characters.
Optionally, determining the view area needing character recognition includes determining the view area in one of the following manners:
determining a current view as the view area needing character recognition through a first preset operation;
and determining a partial area in the current view as the view area through a second preset operation.
Optionally, editing the document using the recognized characters includes:
transferring the recognized characters to a preset text interaction box;
and performing document editing processing through the characters transferred into the text interaction box.
Optionally, the view area includes:
an image area located within the scanned document.
In another aspect, an embodiment of the present invention further provides a device for editing a document, including a determining unit, an identifying unit and an editing unit, wherein:
the determining unit is configured to determine a view area needing character recognition;
the identifying unit is configured to perform character recognition on the determined view area;
the editing unit is configured to edit the document using the recognized characters.
Optionally, the determining unit includes a first determining module and a second determining module, wherein:
the first determining module is configured to determine a current view as the view area needing character recognition through a first preset operation;
the second determining module is configured to determine a partial area in the current view as the view area through a second preset operation.
Optionally, the editing unit is specifically configured to:
transferring the recognized characters to a preset text interaction box;
and performing document editing processing through the characters transferred into the text interaction box.
Optionally, the view area includes:
an image area located within the scanned document.
In still another aspect, an embodiment of the present invention further provides a computer storage medium, where computer-executable instructions are stored in the computer storage medium, and the computer-executable instructions are used to execute the above method for editing a document.
In another aspect, an embodiment of the present invention further provides a terminal, including a memory and a processor, wherein:
the processor is configured to execute program instructions in the memory;
the program instructions, when read and executed by the processor, perform the following operations:
determining a view area needing character recognition;
performing character recognition on the determined view area;
and editing the document using the recognized characters.
Compared with the related art, the technical solution of the present application includes: determining a view area needing character recognition; performing character recognition on the determined view area; and editing the document using the recognized characters. In the embodiment of the invention, the characters in a view area are recognized after the view area needing character recognition has been selected, which improves document editing efficiency.
Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
Drawings
The accompanying drawings are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the example serve to explain the principles of the invention and not to limit the invention.
FIG. 1 is a flow diagram of a method of document editing according to an embodiment of the present invention;
FIG. 2 is a block diagram of a device for document editing according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, embodiments of the present invention will be described in detail below with reference to the accompanying drawings. It should be noted that the embodiments and features of the embodiments in the present application may be arbitrarily combined with each other without conflict.
The steps illustrated in the flowcharts of the figures may be performed in a computer system, for example as a set of computer-executable instructions. Also, while a logical order is shown in the flowcharts, in some cases the steps shown or described may be performed in an order different from the one shown here.
FIG. 1 is a flowchart of a document editing method according to an embodiment of the present invention. As shown in FIG. 1, the method includes:
Step 101, determining a view area needing character recognition;
It should be noted that the view area in the embodiment of the present invention may include a partial scanned-image area determined from a scanned file. For example, after a paper document is scanned to obtain a scanned file, if part of the text information in the scanned file needs to be obtained, the area containing that text information may be the view area needing character recognition in the embodiment of the present invention.
Optionally, determining the view area needing character recognition in the embodiment of the present invention includes determining the view area in one of the following manners:
determining a current view as the view area needing character recognition through a first preset operation;
It should be noted that, when the entire current view is taken as the view area, the current view may include one page view displayed at normal size, or several page views displayed at reduced size.
Determining a partial area in the current view as the view area through a second preset operation.
It should be noted that, taking a scanned file as an example of a document processed by the embodiment of the present invention, the first preset operation and the second preset operation in the embodiment of the present invention may include any operation that does not conflict with the prior art.
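The two ways of determining the view area can be sketched as follows. This is a minimal illustration, not the method the application prescribes: the `Rect` type, the function name `determine_view_area`, and the convention that "no selection" stands for the first preset operation while "a selection rectangle" stands for the second are all assumptions introduced here, since the source leaves both preset operations unspecified.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Rect:
    """Axis-aligned rectangle in view coordinates (hypothetical helper type)."""
    left: int
    top: int
    width: int
    height: int


def determine_view_area(current_view: Rect, selection: Optional[Rect] = None) -> Rect:
    """Return the view area needing character recognition.

    selection is None  -> stands in for the first preset operation:
                          the whole current view is the view area.
    selection is given -> stands in for the second preset operation:
                          the part of the selection inside the current view.
    """
    if selection is None:
        return current_view

    # Clip the selection to the bounds of the current view.
    left = max(current_view.left, selection.left)
    top = max(current_view.top, selection.top)
    right = min(current_view.left + current_view.width,
                selection.left + selection.width)
    bottom = min(current_view.top + current_view.height,
                 selection.top + selection.height)
    if right <= left or bottom <= top:
        raise ValueError("selection does not overlap the current view")
    return Rect(left, top, right - left, bottom - top)
```

Clipping the selection to the current view is a design choice of this sketch; an implementation could equally reject selections that extend past the view.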
Step 102, performing character recognition on the determined view area;
It should be noted that the character recognition method in the embodiment of the present invention may include image processing methods in the related art that are known to those skilled in the art; details are not described herein.
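Because the source does not fix a recognition method, the sketch below only shows the surrounding shape of step 102: the determined view area is cropped out of the page image and handed to a recognizer supplied by the caller. The image representation (a list of pixel rows), the `(left, top, width, height)` area tuple, and the names `crop` and `recognize_view_area` are assumptions made for illustration; any real OCR engine would be plugged in as the `recognizer` argument.

```python
from typing import Callable, List, Sequence, Tuple

Image = List[List[int]]            # grayscale pixels, row-major (assumed format)
Area = Tuple[int, int, int, int]   # (left, top, width, height), assumed format


def crop(image: Sequence[Sequence[int]], area: Area) -> Image:
    """Return only the pixels that fall inside the view area."""
    left, top, width, height = area
    return [list(row[left:left + width]) for row in image[top:top + height]]


def recognize_view_area(
    image: Sequence[Sequence[int]],
    area: Area,
    recognizer: Callable[[Image], str],
) -> str:
    """Run character recognition on the view area only, not on the whole page.

    `recognizer` is any callable that maps a cropped image to text; it is a
    placeholder for whatever recognition method an implementation chooses.
    """
    return recognizer(crop(image, area))
```

Restricting recognition to the cropped area is what keeps the rest of the scanned page, and its layout, untouched.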
Step 103, editing the document using the recognized characters.
Optionally, editing the document using the recognized characters in the embodiment of the present invention includes:
transferring the recognized characters to a preset text interaction box;
and performing document editing processing through the characters transferred into the text interaction box.
It should be noted that the text interaction box in the embodiment of the present invention includes a text box known to those skilled in the art, and may also include other editing boxes that are built on a text box and can display text, for example a prompt box or an annotation-style information editing box. In addition, the transfer in the embodiment of the present invention may include processing operations such as copying and cutting; during the transfer, the font and format of the recognized characters can be adjusted according to default settings or automatically. The editing processing of the embodiment of the invention may include document editing operations such as copying and cutting, that is, the characters in the text box are moved, by copying or cutting, into another document that is in an editing state. Through this processing, the embodiment of the invention can recognize and copy the characters in a local area of a scanned file, improving both the user's document editing efficiency and the user's document editing experience.
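The transfer into a text interaction box and the subsequent copy or cut into another document can be sketched as below. The class names `TextInteractionBox` and `EditableDocument` and their methods are hypothetical stand-ins; the source names the concepts but does not define any interface, and font or format adjustment is omitted here.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class TextInteractionBox:
    """Hypothetical text interaction box that holds the recognized characters."""
    text: str = ""

    def receive(self, recognized: str) -> None:
        # Transfer the recognized characters into the box.
        self.text = recognized

    def copy(self) -> str:
        # Copy: the box keeps its content.
        return self.text

    def cut(self) -> str:
        # Cut: the box is emptied once the content is taken.
        taken, self.text = self.text, ""
        return taken


@dataclass
class EditableDocument:
    """Hypothetical target document that is in an editing state."""
    paragraphs: List[str] = field(default_factory=list)

    def paste(self, text: str) -> None:
        self.paragraphs.append(text)
```

A typical use would be `box.receive(recognized_text)` followed by `document.paste(box.copy())`, or `document.paste(box.cut())` when the box should be cleared afterwards.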
Optionally, the view area in the embodiment of the present invention includes:
an image area located within the scanned document.
It should be noted that a scanned file is only one optional object on which the embodiment of the present invention performs view area determination and character recognition; the embodiment of the present invention may also be applied to other documents that exist in image form and need character recognition, and details are not described herein.
FIG. 2 is a block diagram of a device for document editing according to an embodiment of the present invention. As shown in FIG. 2, the device includes a determining unit, an identifying unit and an editing unit, wherein:
the determining unit is configured to determine a view area needing character recognition;
It should be noted that the view area in the embodiment of the present invention may include a partial scanned-image area determined from a scanned file. For example, after a paper document is scanned to obtain a scanned file, if part of the text information in the scanned file needs to be obtained, the area containing that text information may be the view area needing character recognition in the embodiment of the present invention.
the identifying unit is configured to perform character recognition on the determined view area;
It should be noted that the character recognition method in the embodiment of the present invention may include image processing methods in the related art that are known to those skilled in the art; details are not described herein.
the editing unit is configured to edit the document using the recognized characters.
Optionally, the determining unit in the embodiment of the present invention includes a first determining module and a second determining module, wherein:
the first determining module is configured to determine a current view as the view area needing character recognition through a first preset operation;
It should be noted that, when the entire current view is taken as the view area, the current view may include one page view displayed at normal size, or several page views displayed at reduced size.
the second determining module is configured to determine a partial area in the current view as the view area through a second preset operation.
It should be noted that, taking a scanned file as an example of a document processed by the embodiment of the present invention, the first preset operation and the second preset operation in the embodiment of the present invention may include any operation that does not conflict with the prior art.
Optionally, the editing unit in the embodiment of the present invention is specifically configured to:
transferring the recognized characters to a preset text interaction box;
and performing document editing processing through the characters transferred into the text interaction box.
It should be noted that the text interaction box in the embodiment of the present invention includes a text box known to those skilled in the art, and may also include other editing boxes that are built on a text box and can display text, for example a prompt box or an annotation-style information editing box. In addition, the transfer in the embodiment of the present invention may include processing operations such as copying and cutting; during the transfer, the font and format of the recognized characters can be adjusted according to default settings or automatically. The editing processing of the embodiment of the invention may include document editing operations such as copying and cutting, that is, the characters in the text box are moved, by copying or cutting, into another document that is in an editing state. Through this processing, the embodiment of the invention can recognize and copy the characters in a local area of a scanned file, improving both the user's document editing efficiency and the user's document editing experience.
Optionally, the view area in the embodiment of the present invention includes:
an image area located within the scanned document.
It should be noted that a scanned file is only one optional object on which the embodiment of the present invention performs view area determination and character recognition; the embodiment of the present invention may also be applied to other documents that exist in image form and need character recognition, and details are not described herein.
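The division of the device into a determining unit, an identifying unit and an editing unit can be pictured as three pluggable callables run in sequence. The sketch below is an assumption about how such units might be wired together in software; the class name `DocumentEditingDevice`, the `run` method and the callable signatures are illustrative and do not come from the source.

```python
from dataclasses import dataclass
from typing import Callable, Generic, TypeVar

View = TypeVar("View")   # whatever represents the current view
Area = TypeVar("Area")   # whatever represents the determined view area


@dataclass
class DocumentEditingDevice(Generic[View, Area]):
    """Hypothetical composition of the three units described for FIG. 2."""
    determining_unit: Callable[[View], Area]   # view -> view area
    identifying_unit: Callable[[Area], str]    # view area -> recognized characters
    editing_unit: Callable[[str], None]        # recognized characters -> edit

    def run(self, view: View) -> str:
        area = self.determining_unit(view)
        text = self.identifying_unit(area)
        self.editing_unit(text)
        return text
```

Keeping each unit behind a plain callable mirrors the statement that each module or unit may be implemented in hardware or in software: the composition does not care which.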
Compared with the related art, the technical solution of the present application includes: determining a view area needing character recognition; performing character recognition on the determined view area; and editing the document using the recognized characters. In the embodiment of the invention, the characters in a view area are recognized after the view area needing character recognition has been selected, which improves document editing efficiency.
The embodiment of the invention also provides a computer storage medium, wherein the computer storage medium stores computer executable instructions, and the computer executable instructions are used for executing the document editing method.
An embodiment of the present invention further provides a terminal, including a memory and a processor, wherein:
the processor is configured to execute program instructions in the memory;
the program instructions, when read and executed by the processor, perform the following operations:
determining a view area needing character recognition;
performing character recognition on the determined view area;
and editing the document using the recognized characters.
It will be understood by those skilled in the art that all or part of the steps of the above methods may be implemented by a program instructing associated hardware (e.g., a processor) to perform the steps, and the program may be stored in a computer readable storage medium, such as a read only memory, a magnetic or optical disk, and the like. Alternatively, all or part of the steps of the above embodiments may be implemented using one or more integrated circuits. Accordingly, each module/unit in the above embodiments may be implemented in hardware, for example, by an integrated circuit to implement its corresponding function, or in software, for example, by a processor executing a program/instruction stored in a memory to implement its corresponding function. The present invention is not limited to any specific form of combination of hardware and software.
Although the embodiments of the present invention have been described above, the above description is only for the convenience of understanding the present invention, and is not intended to limit the present invention. It will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.

Claims (10)

1. A method of document editing, comprising:
determining a view area needing character recognition;
performing character recognition on the determined view area;
and editing the document using the recognized characters.
2. The method of claim 1, wherein determining the view area needing character recognition comprises determining the view area in one of the following manners:
determining a current view as the view area needing character recognition through a first preset operation;
and determining a partial area in the current view as the view area through a second preset operation.
3. The method according to claim 1 or 2, wherein editing the document using the recognized characters comprises:
transferring the recognized characters to a preset text interaction box;
and performing document editing processing through the characters transferred into the text interaction box.
4. The method according to claim 1 or 2, wherein the view area comprises:
an image area located within the scanned document.
5. An apparatus for document editing, comprising a determining unit, an identifying unit and an editing unit, wherein:
the determining unit is configured to determine a view area needing character recognition;
the identifying unit is configured to perform character recognition on the determined view area;
the editing unit is configured to edit the document using the recognized characters.
6. The apparatus of claim 5, wherein the determining unit comprises a first determining module and a second determining module, wherein:
the first determining module is configured to determine a current view as the view area needing character recognition through a first preset operation;
the second determining module is configured to determine a partial area in the current view as the view area through a second preset operation.
7. The apparatus according to claim 5 or 6, wherein the editing unit is specifically configured to:
transferring the recognized characters to a preset text interaction box;
and performing document editing processing through the characters transferred into the text interaction box.
8. The apparatus of claim 5 or 6, wherein the view area comprises:
an image area located within the scanned document.
9. A computer storage medium having computer-executable instructions stored therein for performing the method of document editing of any of claims 1-4.
10. A terminal, comprising a memory and a processor, wherein:
the processor is configured to execute program instructions in the memory;
the program instructions, when read and executed by the processor, perform the following operations:
determining a view area needing character recognition;
performing character recognition on the determined view area;
and editing the document using the recognized characters.
CN201811095007.0A 2018-09-19 2018-09-19 Document editing method and device, computer storage medium and terminal Pending CN110929480A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811095007.0A CN110929480A (en) 2018-09-19 2018-09-19 Document editing method and device, computer storage medium and terminal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811095007.0A CN110929480A (en) 2018-09-19 2018-09-19 Document editing method and device, computer storage medium and terminal

Publications (1)

Publication Number Publication Date
CN110929480A true CN110929480A (en) 2020-03-27

Family

ID=69855937

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811095007.0A Pending CN110929480A (en) 2018-09-19 2018-09-19 Document editing method and device, computer storage medium and terminal

Country Status (1)

Country Link
CN (1) CN110929480A (en)

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1452121A (en) * 2002-04-16 2003-10-29 富士通株式会社 On-line handwrited script mode identifying editing device and method
US20110194770A1 (en) * 2010-02-05 2011-08-11 Samsung Electronics Co., Ltd. Document editing apparatus and method
CN106951893A (en) * 2017-05-08 2017-07-14 奇酷互联网络科技(深圳)有限公司 Text information acquisition methods, device and mobile terminal

Similar Documents

Publication Publication Date Title
US20080050019A1 (en) Image processing apparatus, and computer program product
JP2007183742A (en) Image processor, image processing method and computer program
JP5412903B2 (en) Document image processing apparatus, document image processing method, and document image processing program
US8682075B2 (en) Removing character from text in non-image form where location of character in image of text falls outside of valid content boundary
US11146705B2 (en) Character recognition device, method of generating document file, and storage medium
RU2648636C2 (en) Storage of the content in converted documents
CN111198664B (en) Document printing method and device, computer storage medium and terminal
US10602019B2 (en) Methods and systems for enhancing image quality for documents with highlighted content
CN110941947A (en) Document editing method and device, computer storage medium and terminal
CN110929480A (en) Document editing method and device, computer storage medium and terminal
US9692936B2 (en) Image processing apparatus and image processing method for clipping, from a second image, an area at a position corresponding to designated position in a first image
JP2007328432A (en) Business form processor, business form processing method, and program
JP2022014856A (en) OCR recognition accuracy improvement support system and program
JP7342518B2 (en) Image processing device and image processing program
CN110929481A (en) Document editing method and device, computer storage medium and terminal
JP2006165863A (en) Information processing system
US20230102476A1 (en) Information processing apparatus, non-transitory computer readable medium storing program, and information processing method
CN111414258A (en) Information processing method and device, computer storage medium and terminal
JP2001312691A (en) Method/device for processing picture and storage medium
US11206336B2 (en) Information processing apparatus, method, and non-transitory computer readable medium
JP2008181383A (en) Character recognition apparatus, and method and program for controlling the same
JP2008244612A (en) Image processing apparatus and method
CN111414734A (en) Document editing method and device, computer storage medium and terminal
CN111581921A (en) Text editing method and device, computer storage medium and terminal
CN110941400A (en) Method and device for processing annotation, computer storage medium and terminal

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination