CN113936281A - Document scanning method and device, electronic equipment and storage medium - Google Patents

Document scanning method and device, electronic equipment and storage medium Download PDF

Info

Publication number
CN113936281A
CN113936281A CN202111188958.4A CN202111188958A CN113936281A CN 113936281 A CN113936281 A CN 113936281A CN 202111188958 A CN202111188958 A CN 202111188958A CN 113936281 A CN113936281 A CN 113936281A
Authority
CN
China
Prior art keywords
picture
text
display area
area
new
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111188958.4A
Other languages
Chinese (zh)
Inventor
高钧亮
彭连
薛云冀
余新闻
梁小迷
董江鹏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN202111188958.4A priority Critical patent/CN113936281A/en
Publication of CN113936281A publication Critical patent/CN113936281A/en
Pending legal-status Critical Current

Links

Images

Abstract

The disclosure provides a document scanning method, a document scanning device, an electronic device and a storage medium, and relates to the field of artificial intelligence, in particular to an OCR scanning technology. The specific implementation scheme comprises the following steps: performing text recognition on the picture to be processed to obtain a text recognition result; and displaying the picture to be processed in a picture display area of a word processing interface, and displaying the text recognition result in a text display area of the word processing interface. According to the scheme, the text recognition result and the image to be processed are synchronously displayed, so that a user can conveniently check the text recognition result by referring to the picture.

Description

Document scanning method and device, electronic equipment and storage medium
Technical Field
The present disclosure relates to the field of artificial intelligence, and in particular, to an OCR scanning technique, and more particularly, to a document scanning method, apparatus, electronic device, storage medium, and computer program product.
Background
OCR (Optical Character Recognition) refers to a process in which an electronic device (e.g., a scanner or a digital camera) checks a Character printed on paper, determines its shape by detecting dark and light patterns, and then translates the shape into a computer text by a Character Recognition method; the method is characterized in that characters in a paper document are converted into an image file with a black-white dot matrix in an optical mode aiming at print characters, and the characters in the image are converted into a text format through recognition software for further editing and processing by word processing software.
Disclosure of Invention
The disclosure provides a document scanning method, an apparatus, an electronic device, a storage medium, and a computer program product.
According to an aspect of the present disclosure, there is provided a document scanning method including:
performing text recognition on the picture to be processed to obtain a text recognition result;
and displaying the picture to be processed in a picture display area of the word processing interface, and displaying the text recognition result in a text display area of the word processing interface.
According to another aspect of the present disclosure, there is provided an electronic device including:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein the content of the first and second substances,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform a document scanning method of any embodiment of the disclosure.
According to another aspect of the present disclosure, there is provided a non-transitory computer-readable storage medium storing computer instructions for causing a computer to perform a document scanning method of any embodiment of the present disclosure.
According to another aspect of the present disclosure, there is provided a computer program product comprising a computer program which, when executed by a processor, implements the document scanning method of any embodiment of the present disclosure.
According to the technology disclosed by the invention, the text recognition result and the image to be processed are synchronously displayed, so that a user can conveniently check the text recognition result by referring to the picture.
It should be understood that the statements in this section do not necessarily identify key or critical features of the embodiments of the present disclosure, nor do they limit the scope of the present disclosure. Other features of the present disclosure will become apparent from the following description.
Drawings
The drawings are included to provide a better understanding of the present solution and are not to be construed as limiting the present disclosure. Wherein:
FIG. 1a is a schematic flow chart diagram of a document scanning method provided in accordance with an embodiment of the present disclosure;
FIG. 1b is a schematic diagram of a word processing interface provided in accordance with an embodiment of the present disclosure;
FIG. 2a is a schematic flow chart diagram of another document scanning method provided in accordance with an embodiment of the present disclosure;
FIG. 2b is a schematic illustration of yet another word processing interface provided in accordance with an embodiment of the present disclosure;
FIG. 2c is a schematic illustration of yet another word processing interface provided in accordance with an embodiment of the present disclosure;
FIG. 2d is a schematic diagram of providing yet another word processing interface according to an embodiment of the present disclosure;
FIG. 3a is a schematic flow chart diagram illustrating a document scanning method according to an embodiment of the disclosure;
FIG. 3b is a schematic illustration of yet another word processing interface provided in accordance with an embodiment of the present disclosure;
FIG. 4 is a schematic flow chart diagram of yet another document scanning method provided in accordance with an embodiment of the present disclosure;
FIG. 5 is a schematic structural diagram of a document scanning device according to an embodiment of the present disclosure;
FIG. 6 is a block diagram of an electronic device for implementing a document scanning method of an embodiment of the disclosure.
Detailed Description
Exemplary embodiments of the present disclosure are described below with reference to the accompanying drawings, in which various details of the embodiments of the disclosure are included to assist understanding, and which are to be considered as merely exemplary. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the present disclosure. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.
Fig. 1a is a schematic flow chart of a document scanning method according to an embodiment of the present disclosure, which is applicable to a case where a document is scanned by using a scanning function in an input method. The method may be performed by a document scanning device implemented in software and/or hardware and integrated on an electronic device, such as a mobile terminal.
Specifically, referring to fig. 1a, the document scanning method is as follows:
s101, performing text recognition on the picture to be processed to obtain a text recognition result.
In the embodiment of the disclosure, the user sometimes has an appeal of scanning a certain segment of characters in the chat process, so that the document scanning function is implanted into the input method used by the user, the user can use the document scanning function without quitting the chat interface, and the convenience of character scanning of the user is ensured. After a user triggers a document scanning function, text content in the picture can be scanned by shooting the picture. Therefore, the picture to be processed can be selected from a document picture shot after the user triggers the document scanning function. And then recognizing the text content in the picture to be processed by an OCR recognition technology to obtain a text recognition result.
S102, displaying the picture to be processed in the picture display area of the word processing interface, and displaying the text recognition result in the text display area of the word processing interface.
In the embodiment of the present disclosure, the word processing interface is an interface for a user to process (e.g., copy, cut, or share) the text recognition result, that is, the word processing interface is a display interface of the scanning result. The word processing interface at least comprises two display areas, namely a picture display area and a text display area. The image display area is used for displaying the image to be processed, and the text display area is used for displaying the text recognition result.
Illustratively, referring to fig. 1b, a schematic diagram of a word processing interface is shown, wherein to meet the user's operating habits, a picture display area is located in a top region of the word processing interface and a text display area is located in a bottom region of the word processing interface. After the picture to be processed is displayed in the picture display area and the text recognition result is displayed in the text display area, the user can refer to the picture to be processed to view the text recognition result. It should be noted that, for convenience of reading, the text recognition result is made into a word segmentation form with small gaps.
Furthermore, after the text recognition result and the picture to be processed are synchronously displayed to the user, the user can select partial characters to perform processing such as copying, cutting or sharing.
In the embodiment of the disclosure, the picture to be processed and the text recognition result are synchronously displayed in the word processing interface, so that a user can conveniently check the text recognition result by referring to the picture. Compared with the method that only the text recognition result is displayed in the word processing interface, the accuracy of the text recognition result can be determined in time through comparison and check by the user.
Fig. 2a is a schematic flow chart of another document scanning method according to an embodiment of the present disclosure, which is optimized based on the above embodiment, and referring to fig. 2a, the document scanning method is specifically as follows:
s201, performing text recognition on the picture to be processed to obtain a text recognition result.
S202, displaying the picture to be processed in the picture display area of the word processing interface, and displaying the text recognition result in the text display area of the word processing interface.
In the embodiment of the disclosure, because the to-be-processed image and the text recognition result are synchronously displayed in the word processing interface, a user can check the accuracy of the recognition result through comparison, and if some text recognition result is inaccurate, re-recognition can be performed through the steps of S203-S205.
S203, responding to the touch operation of the picture reselection control in the picture display area, and acquiring a new picture area reselected from the picture display area.
In the embodiment of the present disclosure, referring to fig. 2b, a schematic diagram of another word processing interface is shown, where a picture reselection control is disposed in a picture display area of the word processing interface, and a user may select a new picture area to be identified by touching the picture reselection control. It should be noted that the setting of the picture reselection control in the picture display area is only an example, and may also be set in other positions of the word processing interface, which is not specifically limited herein.
In the embodiment of the present disclosure, the picture reselection control optionally includes at least one of the following: a frame selection control, a line-by-line control and a smearing control. If the picture reselection control is a line-by-line control, selecting a new picture area according to the following steps: responding to touch operation of a picture reselection control in a picture display area, and dividing the picture display area to obtain at least two sub-areas to be selected; optionally, a region surrounding each line of text in the picture is used as a sub-region to be selected, for example, the regions A, B, C in fig. 2c are all sub-regions to be selected; and acquiring a new picture area reselected from the picture display area according to the selection operation of the user on at least two sub-areas to be selected, for example, if the user clicks the area A, taking the area A as the new picture area.
If the picture reselection control is a frame selection control, selecting a new picture area according to the following steps: in response to the touch operation on the picture reselecting control in the picture display area, displaying a picture pick-up frame in the picture display area, for example, the picture pick-up frame is a frame D in fig. 2D; and acquiring a new picture area reselected from the picture display area according to the position information of the picture selecting frame, namely taking the picture area surrounded by the frame D as the new picture area.
In addition, if the picture reselection control is the smearing control, the smearing region of the user is detected and is used as a new picture region.
It should be noted that, by setting different picture reselection controls, it can be ensured that a user flexibly selects a new image region that needs to be re-identified.
And S204, performing text recognition on the new picture area again to obtain a new text recognition result of the new picture area.
And S205, displaying the new text recognition result in the text display area.
After the new text recognition result is obtained, the new text recognition result is directly displayed in the text display area, so that the user can be ensured to view the recognition result after the recognition area is changed in real time.
It should be noted that if only the text recognition result is displayed in the word processing interface (i.e., the scan result display interface), the document scan is performed again, and only the image capture interface is skipped, then the recognition range of the image is reselected, and after the recognition is completed, the document scan jumps to the scan result display interface corresponding to the recognition range, that is, the re-recognition can be completed only by multiple page jumps. According to the scheme, the to-be-processed picture and the text recognition result are synchronously displayed in the word processing interface, so that a new recognition result can be displayed in the text display area in real time only by reselecting the recognition range of the to-be-processed picture in the word processing interface during re-recognition, and therefore page skipping is not needed in the re-recognition process.
In the embodiment of the disclosure, when re-identification is carried out, a user can flexibly select a new picture area according to a picture re-selection control in a word processing interface; and after the picture range is adjusted, a new text recognition result can be immediately displayed in a text display area of the word processing interface, so that page skipping is not needed in the re-recognition process, and a user can conveniently see the text recognition result after the recognition area is changed in real time.
Fig. 3a is a schematic flow chart of another document scanning method according to an embodiment of the present disclosure, which is optimized based on the above embodiment, and referring to fig. 3a, the document scanning method is specifically as follows:
s301, performing text recognition on the picture to be processed to obtain a text recognition result.
S302, displaying the picture to be processed in the picture display area of the word processing interface, and displaying the text recognition result in the text display area of the word processing interface.
In the embodiment of the present disclosure, referring to fig. 3b, a schematic diagram of another word processing interface is shown, where a translation control is further included in the text display area, so that a user with a translation appeal can translate words in the text recognition result through the translation control. It should be noted that the setting of the translation control in the text display area is only an example, and may also be set at any position of the word processing interface, which is not specifically limited herein. See S303-S305 for a specific translation process.
And S303, acquiring the text to be processed selected by the user from the text recognition result.
In the embodiment of the present disclosure, the user may select the text to be processed by long-pressing a part of the text in the text recognition result, or may select the text to be processed in other manners, which is not specifically limited herein.
S304, responding to the touch operation of the translation control, translating the text to be processed to obtain text translation information.
And responding to the touch operation of the translation control, and requesting a translation interface to translate the text to be processed to obtain text translation information.
And S305, displaying the text translation information in the text display area.
Optionally, the text translation information is synchronously displayed below the text to be processed, or the text to be processed is directly replaced by the text translation information, so that the user can process (for example, cut, copy, share, etc.) the text translation information.
In the embodiment of the disclosure, after the text recognition result is displayed in the word processing interface, a part of text in the text recognition result can be translated through the translation control so as to meet the translation appeal of a user.
Fig. 4 is a schematic flow chart of another document scanning method according to an embodiment of the present disclosure, which is optimized based on the above embodiment, and referring to fig. 4, the document scanning method is specifically as follows:
s401, performing text recognition on the picture to be processed to obtain a text recognition result.
S402, displaying the picture to be processed in the picture display area of the word processing interface, and displaying the text recognition result in the text display area of the word processing interface.
And S403, adjusting the display parameters of the picture display area and the text display area in the word processing interface.
In embodiments of the present disclosure, the presentation parameters include position and/or size. Optionally, an area adjustment control is set in the word processing interface, and in response to a touch operation of the area adjustment control by a user, positions and/or sizes of the picture display area and the text display area in the word processing interface are adjusted, for example, the size of the picture display area is increased or decreased, or the picture display area and the text display area are adjusted.
In the embodiment of the disclosure, the personalized display of the text recognition result and the picture is realized by adjusting the display parameters of the picture display area and the text display area in the word processing interface.
Further, for the picture to be processed, responding to a touch operation of a user for translating the picture to be processed, obtaining a translation result picture, and covering the translation result picture on the document picture so as to display the translation result picture to the user.
Fig. 5 is a schematic structural diagram of a document scanning apparatus according to an embodiment of the present disclosure, which is applicable to a case where a document is scanned using a scanning function in an input method. As shown in fig. 5, the apparatus specifically includes:
the recognition module 501 is configured to perform text recognition on the picture to be processed to obtain a text recognition result;
the display module 502 is configured to display the to-be-processed picture in a picture display area of the word processing interface, and display the text recognition result in a text display area of the word processing interface.
On the basis of the above embodiment, optionally, the apparatus further includes:
the picture reselection module is used for responding to touch operation of the picture reselection control in the picture display area and acquiring a new picture area reselected from the picture display area;
the re-recognition module is used for re-performing text recognition on the new picture area to obtain a new text recognition result of the new picture area;
and the result updating module is used for displaying the new text recognition result in the text display area.
On the basis of the foregoing embodiment, optionally, the picture reselection module includes:
the image display device comprises a region dividing unit, a selection unit and a selection unit, wherein the region dividing unit is used for dividing an image display region to obtain at least two sub-regions to be selected;
and the first new picture area acquisition unit is used for acquiring a new picture area reselected from the picture display area according to the selection operation of the at least two sub-areas to be selected.
On the basis of the foregoing embodiment, optionally, the picture reselection module includes:
the picture selecting frame display unit is used for displaying the picture selecting frame in the picture display area;
and the second new picture area acquisition unit is used for acquiring a new picture area reselected from the picture display area according to the position information of the picture selection frame.
On the basis of the above embodiment, optionally, the text display area further includes a translation control;
the device still includes:
the text to be processed selection module is used for acquiring a text to be processed selected by a user from the text recognition result;
the text translation module is used for responding to the touch operation on the translation control and translating the text to be processed to obtain text translation information;
and the translation information display module is used for displaying the text translation information in the text display area.
On the basis of the above embodiment, optionally, the method further includes:
the region adjusting module is used for adjusting display parameters of the picture display region and the text display region in the word processing interface; the presentation parameters include position and/or size.
The document scanning device provided by the embodiment of the disclosure can execute the document scanning method provided by any embodiment of the disclosure, and has corresponding functional modules and beneficial effects of the execution method. Reference may be made to the description of any method embodiment of the disclosure for a matter not explicitly described in this embodiment.
In the technical scheme of the disclosure, the acquisition, storage, application and the like of the personal information of the related user all accord with the regulations of related laws and regulations, and do not violate the good customs of the public order.
The present disclosure also provides an electronic device, a readable storage medium, and a computer program product according to embodiments of the present disclosure.
FIG. 6 illustrates a schematic block diagram of an example electronic device 600 that can be used to implement embodiments of the present disclosure. Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other appropriate computers. The electronic device may also represent various forms of mobile devices, such as personal digital processing, cellular phones, smart phones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions, are meant to be examples only, and are not meant to limit implementations of the disclosure described and/or claimed herein.
As shown in fig. 6, the apparatus 600 includes a computing unit 601, which can perform various appropriate actions and processes according to a computer program stored in a Read Only Memory (ROM)602 or a computer program loaded from a storage unit 608 into a Random Access Memory (RAM) 603. In the RAM 603, various programs and data required for the operation of the device 600 can also be stored. The calculation unit 601, the ROM 602, and the RAM 603 are connected to each other via a bus 604. An input/output (I/O) interface 605 is also connected to bus 604.
A number of components in the device 600 are connected to the I/O interface 605, including: an input unit 606 such as a keyboard, a mouse, or the like; an output unit 607 such as various types of displays, speakers, and the like; a storage unit 608, such as a magnetic disk, optical disk, or the like; and a communication unit 609 such as a network card, modem, wireless communication transceiver, etc. The communication unit 609 allows the device 600 to exchange information/data with other devices via a computer network such as the internet and/or various telecommunication networks.
The computing unit 601 may be a variety of general and/or special purpose processing components having processing and computing capabilities. Some examples of the computing unit 601 include, but are not limited to, a Central Processing Unit (CPU), a Graphics Processing Unit (GPU), various dedicated Artificial Intelligence (AI) computing chips, various computing units running machine learning model algorithms, a Digital Signal Processor (DSP), and any suitable processor, controller, microcontroller, and so forth. The calculation unit 601 executes the respective methods and processes described above, such as the document scanning method. For example, in some embodiments, the document scanning method may be implemented as a computer software program tangibly embodied in a machine-readable medium, such as storage unit 608. In some embodiments, part or all of the computer program may be loaded and/or installed onto the device 600 via the ROM 602 and/or the communication unit 609. When the computer program is loaded into the RAM 603 and executed by the computing unit 601, one or more steps of the document scanning method described above may be performed. Alternatively, in other embodiments, the computing unit 601 may be configured to perform the document scanning method by any other suitable means (e.g., by means of firmware).
Various implementations of the systems and techniques described here above may be implemented in digital electronic circuitry, integrated circuitry, Field Programmable Gate Arrays (FPGAs), Application Specific Integrated Circuits (ASICs), Application Specific Standard Products (ASSPs), system on a chip (SOCs), load programmable logic devices (CPLDs), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may include: implemented in one or more computer programs that are executable and/or interpretable on a programmable system including at least one programmable processor, which may be special or general purpose, receiving data and instructions from, and transmitting data and instructions to, a storage system, at least one input device, and at least one output device.
Program code for implementing the methods of the present disclosure may be written in any combination of one or more programming languages. These program codes may be provided to a processor or controller of a general purpose computer, special purpose computer, or other programmable data processing apparatus, such that the program codes, when executed by the processor or controller, cause the functions/operations specified in the flowchart and/or block diagram to be performed. The program code may execute entirely on the machine, partly on the machine, as a stand-alone software package partly on the machine and partly on a remote machine or entirely on the remote machine or server.
In the context of this disclosure, a machine-readable medium may be a tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. The machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. A machine-readable medium may include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples of a machine-readable storage medium would include an electrical connection based on one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
To provide for interaction with a user, the systems and techniques described here can be implemented on a computer having: a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to a user; and a keyboard and a pointing device (e.g., a mouse or a trackball) by which a user can provide input to the computer. Other kinds of devices may also be used to provide for interaction with a user; for example, feedback provided to the user can be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and input from the user may be received in any form, including acoustic, speech, or tactile input.
The systems and techniques described here can be implemented in a computing system that includes a back-end component (e.g., as a data server), or that includes a middleware component (e.g., an application server), or that includes a front-end component (e.g., a user computer having a graphical user interface or a web browser through which a user can interact with an implementation of the systems and techniques described here), or any combination of such back-end, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include: local Area Networks (LANs), Wide Area Networks (WANs), and the Internet.
The computer system may include clients and servers. A client and server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other. The server may be a cloud server, a server of a distributed system, or a server with a combined blockchain.
It should be understood that various forms of the flows shown above may be used, with steps reordered, added, or deleted. For example, the steps described in the present disclosure may be executed in parallel, sequentially, or in different orders, as long as the desired results of the technical solutions disclosed in the present disclosure can be achieved, and the present disclosure is not limited herein.
The above detailed description should not be construed as limiting the scope of the disclosure. It should be understood by those skilled in the art that various modifications, combinations, sub-combinations and substitutions may be made in accordance with design requirements and other factors. Any modification, equivalent replacement, and improvement made within the spirit and principle of the present disclosure should be included in the scope of protection of the present disclosure.

Claims (15)

1. A document scanning method, comprising:
performing text recognition on the picture to be processed to obtain a text recognition result;
and displaying the picture to be processed in a picture display area of a word processing interface, and displaying the text recognition result in a text display area of the word processing interface.
2. The method of claim 1, further comprising:
responding to touch operation of a picture reselection control in the picture display area, and acquiring a new picture area reselected from the picture display area;
performing text recognition on the new picture area again to obtain a new text recognition result of the new picture area;
and displaying the new text recognition result in the text display area.
3. The method of claim 2, wherein said obtaining a new picture region reselected from the picture presentation region comprises:
dividing the picture display area to obtain at least two sub-areas to be selected;
and acquiring a new picture area reselected from the picture display area according to the selection operation of the at least two sub-areas to be selected.
4. The method of claim 2, wherein said obtaining a new picture region reselected from the picture presentation region comprises:
displaying a picture selection frame in the picture display area;
and acquiring a new picture area reselected from the picture display area according to the position information of the picture selecting frame.
5. The method of claim 1, wherein the text presentation area further comprises a translation control;
the method further comprises the following steps:
acquiring a text to be processed selected by a user from the text recognition result;
responding to the touch operation of the translation control, translating the text to be processed to obtain text translation information;
and displaying the text translation information in the text display area.
6. The method of claim 1, further comprising:
adjusting the display parameters of the picture display area and the text display area in a word processing interface; the presentation parameters include position and/or size.
7. A document scanning device, comprising:
the recognition module is used for performing text recognition on the picture to be processed to obtain a text recognition result;
and the display module is used for displaying the picture to be processed in a picture display area of the word processing interface and displaying the text recognition result in a text display area of the word processing interface.
8. The apparatus of claim 7, further comprising:
the picture reselection module is used for responding to touch operation of a picture reselection control in the picture display area and acquiring a new picture area reselected from the picture display area;
the re-identification module is used for re-performing text identification on the new picture area to obtain a new text identification result of the new picture area;
and the result updating module is used for displaying the new text recognition result in the text display area.
9. The apparatus of claim 8, wherein the picture reselection module comprises:
the area dividing unit is used for dividing the picture display area to obtain at least two sub-areas to be selected;
and the first new picture area acquisition unit is used for acquiring a new picture area reselected from the picture display area according to the selection operation of the at least two sub-areas to be selected.
10. The apparatus of claim 8, wherein the picture reselection module comprises:
the picture selecting frame display unit is used for displaying the picture selecting frame in the picture display area;
and the second new picture area acquisition unit is used for acquiring a new picture area reselected from the picture display area according to the position information of the picture selection frame.
11. The apparatus of claim 7, wherein the text presentation area further comprises a translation control;
the device further comprises:
the text to be processed selection module is used for acquiring a text to be processed selected by a user from the text recognition result;
the text translation module is used for responding to the touch operation on the translation control and translating the text to be processed to obtain text translation information;
and the translation information display module is used for displaying the text translation information in the text display area.
12. The apparatus of claim 7, further comprising:
the region adjusting module is used for adjusting display parameters of the picture display region and the text display region in a word processing interface; the presentation parameters include position and/or size.
13. An electronic device, comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein the content of the first and second substances,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of any one of claims 1-6.
14. A non-transitory computer readable storage medium having stored thereon computer instructions for causing the computer to perform the method of any one of claims 1-6.
15. A computer program product comprising a computer program which, when executed by a processor, implements the method according to any one of claims 1-6.
CN202111188958.4A 2021-10-12 2021-10-12 Document scanning method and device, electronic equipment and storage medium Pending CN113936281A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111188958.4A CN113936281A (en) 2021-10-12 2021-10-12 Document scanning method and device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111188958.4A CN113936281A (en) 2021-10-12 2021-10-12 Document scanning method and device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
CN113936281A true CN113936281A (en) 2022-01-14

Family

ID=79279009

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111188958.4A Pending CN113936281A (en) 2021-10-12 2021-10-12 Document scanning method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN113936281A (en)

Similar Documents

Publication Publication Date Title
US9104261B2 (en) Method and apparatus for notification of input environment
EP2704061A2 (en) Apparatus and method for recognizing a character in terminal equipment
CN110942004A (en) Handwriting recognition method and device based on neural network model and electronic equipment
US10395131B2 (en) Apparatus, method and non-transitory storage medium for changing position coordinates of a character area stored in association with a character recognition result
CN112395038B (en) Method and device for adjusting characters during desktop sharing
JP7389824B2 (en) Object identification method and device, electronic equipment and storage medium
EP3553702A1 (en) Image recognition-based communication method and device
CN108256523B (en) Identification method and device based on mobile terminal and computer readable storage medium
KR20210040305A (en) Method and apparatus for generating images
US10452943B2 (en) Information processing apparatus, control method of information processing apparatus, and storage medium
CN113392660A (en) Page translation method and device, electronic equipment and storage medium
US9582230B1 (en) Method and system for automated form document fill-in via image processing
CN113626441A (en) Text management method, device and equipment based on scanning equipment and storage medium
CN110119459A (en) Image data retrieval method and image data retrieving apparatus
CN113936281A (en) Document scanning method and device, electronic equipment and storage medium
CN115101069A (en) Voice control method, device, equipment, storage medium and program product
US9692936B2 (en) Image processing apparatus and image processing method for clipping, from a second image, an area at a position corresponding to designated position in a first image
CN114419636A (en) Text recognition method, device, equipment and storage medium
CN112269467A (en) Translation method based on AR and AR equipment
JP5612975B2 (en) Serif data generation apparatus, serif data generation method, and program
CN113051235A (en) Document loading method and device, terminal and storage medium
US20200272688A1 (en) Information processing apparatus and non-transitory computer readable medium
CN113038184A (en) Data processing method, device, equipment and storage medium
CN112749769A (en) Graphic code detection method and device, computer equipment and storage medium
CN113657366B (en) Control method, device, equipment and storage medium of scanning pen

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination