CN113641439A - Text recognition and display method, device, electronic equipment and medium - Google Patents


Publication number
CN113641439A
Authority
CN
China
Prior art keywords
text recognition
text
candidate
scanning
client
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110938868.6A
Other languages
Chinese (zh)
Other versions
CN113641439B (en)
Inventor
梁霄
张铭阳
蒋峰
唐红羚
柳舒芳
张国旺
杨彦哲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Baidu Online Network Technology Beijing Co Ltd
Original Assignee
Baidu Online Network Technology Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Baidu Online Network Technology Beijing Co Ltd
Priority to CN202110938868.6A
Publication of CN113641439A
Application granted
Publication of CN113641439B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00 Arrangements for program control, e.g. control units
    • G06F9/06 Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44 Arrangements for executing specific programs
    • G06F9/451 Execution arrangements for user interfaces

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The disclosure provides a text recognition and display method and apparatus, an electronic device, and a medium. It relates to the field of computer technologies, and in particular to text recognition, cloud computing, and cloud services. The specific implementation scheme is as follows: determining a binding relationship between candidate clients and a scanning device according to an acquired text recognition instruction; determining a target server from candidate servers according to the binding relationship; and sending an acquired text scanning result to the target server, so that the target server determines a text recognition result according to the text scanning result. The present disclosure improves the diversity and richness of text recognition results.

Description

Text recognition and display method, device, electronic equipment and medium
Technical Field
The present disclosure relates to the field of computer technologies, and in particular, to a method and an apparatus for text recognition and display, an electronic device, and a medium.
Background
With the development of science and technology, more and more intelligent learning devices are being integrated into students' learning, greatly improving their learning efficiency. The dictionary pen is a new type of intelligent learning device: by scanning text with the dictionary pen, a student can obtain information such as its pronunciation, definition, and translation without repeatedly consulting a printed dictionary.
Most current dictionary pens generate query results using a locally mounted processor.
Disclosure of Invention
The disclosure provides a method, an apparatus, an electronic device, and a medium for text recognition and display.
According to an aspect of the present disclosure, there is provided a text recognition method including:
determining a binding relationship between candidate clients and a scanning device according to an acquired text recognition instruction; and
determining a target server from candidate servers according to the binding relationship, and sending an acquired text scanning result to the target server, so that the target server determines a text recognition result according to the text scanning result.
According to another aspect of the present disclosure, there is provided a text presentation method including:
acquiring a text recognition result from a target server, wherein the target server is determined from candidate servers according to a binding relationship between candidate clients and a scanning device; and
displaying the text recognition result.
According to another aspect of the present disclosure, there is provided an electronic device including:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of any of the present disclosure.
According to another aspect of the present disclosure, there is provided a non-transitory computer readable storage medium having stored thereon computer instructions for causing the computer to perform the method of any one of the present disclosure.
It should be understood that the statements in this section do not necessarily identify key or critical features of the embodiments of the present disclosure, nor do they limit the scope of the present disclosure. Other features of the present disclosure will become apparent from the following description.
Drawings
The drawings are included to provide a better understanding of the present solution and are not to be construed as limiting the present disclosure. Wherein:
FIG. 1 is a flow chart of a text recognition method disclosed in accordance with an embodiment of the present disclosure;
FIG. 2A is a flow chart of a text recognition method disclosed in accordance with an embodiment of the present disclosure;
FIG. 2B is a schematic diagram of opening a user center interface disclosed in accordance with an embodiment of the present disclosure;
FIG. 2C is a schematic diagram of a user center interface disclosed in accordance with an embodiment of the present disclosure;
FIG. 2D is a schematic diagram of switch styles disclosed in accordance with an embodiment of the present disclosure;
FIG. 2E is a schematic diagram of binding prompt information disclosed in accordance with an embodiment of the present disclosure;
FIG. 2F is a schematic diagram of network disconnection prompt information disclosed in accordance with an embodiment of the present disclosure;
FIG. 2G is a schematic illustration of a shortcut icon disclosed in accordance with an embodiment of the present disclosure;
FIG. 3A is a flow chart of a method of text presentation disclosed in accordance with an embodiment of the present disclosure;
FIG. 3B is an interface schematic diagram of a function page disclosed in accordance with an embodiment of the present disclosure;
FIG. 3C is a schematic diagram illustrating a text recognition result display according to an embodiment of the disclosure;
FIG. 3D is a schematic diagram illustrating a text recognition result presentation according to an embodiment of the disclosure;
FIG. 4 is a schematic structural diagram of a text recognition apparatus according to an embodiment of the disclosure;
FIG. 5 is a schematic structural diagram of a text display device according to an embodiment of the disclosure;
FIG. 6 is a block diagram of an electronic device for implementing the text recognition method and/or the text presentation method disclosed in the embodiments of the present disclosure.
Detailed Description
Exemplary embodiments of the present disclosure are described below with reference to the accompanying drawings, in which various details of the embodiments of the disclosure are included to assist understanding, and which are to be considered as merely exemplary. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the present disclosure. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.
During research and development, the applicant found that existing text scanning devices usually perform text recognition on a text scanning result using a locally mounted processor to obtain the corresponding text recognition result. This conventional approach has the following disadvantages: 1) the computing power of the processor carried by the scanning device is limited, so text recognition is slow and a text recognition result cannot be obtained quickly; 2) the scanning device supports only a single local text recognition method, so the obtained text recognition results lack diversity and richness, and high-quality text recognition results cannot be provided to users.
Fig. 1 is a flowchart of a text recognition method disclosed in an embodiment of the present disclosure, and this embodiment may be applied to a case of performing text recognition based on a server. The method of the present embodiment may be executed by the text recognition apparatus disclosed in the embodiments of the present disclosure, and the apparatus may be implemented by software and/or hardware, and may be integrated on any electronic device with computing capability.
As shown in fig. 1, the text recognition method disclosed in this embodiment may include:
S101, determining a binding relationship between the candidate clients and the scanning device according to the acquired text recognition instruction.
The scanning device is a device with an image acquisition function. It can capture images of any physical text carrier containing text information, such as books, test papers, and newspapers, to obtain an image acquisition result, namely the text scanning result. The text recognition instruction is an instruction that triggers the scanning device to perform text scanning. A candidate client is a client that has established mutual trust with the scanning device in advance, that is, a client that is authorized to acquire the text recognition result corresponding to the text scanning result. Candidate clients fall into two types: those that have a binding relationship with the scanning device and can acquire the text recognition result based on that relationship, and those that do not have a binding relationship with the scanning device, cannot yet acquire the text recognition result, and must first be bound to the scanning device.
In one embodiment, the user issues a text recognition instruction to the scanning device, for example by tapping a text recognition button on the scanning device's own touch display screen, which triggers generation of the text recognition instruction. The scanning device acquires the instruction and, according to it, checks whether each candidate client has established a binding relationship with the scanning device. For example, the scanning device polls the candidate clients one by one; if a candidate client has established a binding relationship with the scanning device, it is marked as bound, and otherwise it is marked as unbound.
Determining the binding relationship between the candidate clients and the scanning device according to the acquired text recognition instruction lays the foundation for subsequently determining, according to the binding relationship, the target server that performs text recognition.
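The polling check described in this embodiment can be sketched as follows. This is a minimal illustration rather than the implementation of the disclosure: the names `CandidateClient`, `BindingState`, and `determine_binding_relationships`, and the representation of existing bindings as a set of client identifiers, are assumptions made for the example.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Dict, Iterable, Set


class BindingState(Enum):
    BOUND = "bound"
    UNBOUND = "unbound"


@dataclass(frozen=True)
class CandidateClient:
    client_id: str  # hypothetical identifier for a trusted client


def determine_binding_relationships(
    candidates: Iterable[CandidateClient], bound_ids: Set[str]
) -> Dict[str, BindingState]:
    """Poll every candidate client and mark it as bound or unbound."""
    marks = {}
    for client in candidates:
        if client.client_id in bound_ids:
            marks[client.client_id] = BindingState.BOUND
        else:
            marks[client.client_id] = BindingState.UNBOUND
    return marks


# Example: client "A" is already bound to the scanning device, "B" is not.
marks = determine_binding_relationships(
    [CandidateClient("A"), CandidateClient("B")], bound_ids={"A"}
)
```

The resulting marks are the input that the next step would use to choose target servers.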
S102, determining a target server from the candidate servers according to the binding relationship, and sending the obtained text scanning result to the target server, so that the target server determines a text recognition result according to the text scanning result.
The candidate servers are determined according to the candidate clients, and an association exists between the two: each candidate client is associated with at least one candidate server. When a new candidate client is added, its corresponding server is added to the existing candidate servers; that is, the set of candidate servers in this embodiment is extensible.
In one implementation, the candidate clients are screened according to the determined binding relationships to find those that have established a binding relationship with the scanning device, and at least one candidate server associated with each such candidate client is taken as a target server. After the target server is determined, the scanning device prompts the user to scan; the user operates the scanning device to scan the text to be recognized, and the scanning device obtains a text scanning result. Using its network transmission function, the scanning device sends the text scanning result over the network to a master server, which calls the interface of each target server and transmits the text scanning result to each of them.
After receiving the text scanning result, each target server performs text recognition on the text scanning result based on a text recognition algorithm and a database carried by the target server, and determines a text recognition result corresponding to the text scanning result, where the text recognition result includes, but is not limited to, OCR (optical character recognition) information, text part-of-speech information, text paraphrase information, text phonetic symbol information, and the like, and the embodiment does not limit specific contents included in the text recognition result. Because each target server is independent, the carried text recognition algorithm and the database are different, so that the diversity and the richness of the obtained text recognition result are higher.
Each target server sends its text recognition result to the master server, and the master server sends the text recognition result to a target client over the network. The target client may be the client corresponding to a target server, in which case each target client receives the text recognition result of its corresponding target server. The target client may also be an independent client, in which case the master server aggregates the text recognition results obtained by the target servers and sends the aggregated results to the target client. At the same time, the master server sends the text recognition result to the scanning device.
After receiving the text recognition result, the target client displays it in a designated display area. After receiving the text recognition result, the scanning device displays it in a designated area of its own display screen. The client and the scanning device thus display the result synchronously, which makes it convenient for the user to view.
In this embodiment, the binding relationship between the candidate clients and the scanning device is determined according to the acquired text recognition instruction, the target server is determined from the candidate servers according to the binding relationship, and the acquired text scanning result is sent to the target server so that the target server determines the text recognition result according to the text scanning result. Because the server has strong computing power, text recognition is faster and the user's waiting time is reduced. In addition, this embodiment provides multiple candidate servers, which improves the diversity and richness of the text recognition results; because the candidate servers are extensible, more candidate servers can be added later to further improve text recognition quality.
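The data flow of S102, in which a master server forwards the text scanning result to every target server and collects the results, might look like the sketch below. The recognizer functions stand in for the target servers' interfaces; their names and the fields they return are hypothetical, and a real deployment would make network calls instead of local function calls.

```python
from typing import Callable, Dict

# A target server is modelled as a callable that takes the text scanning
# result (raw image bytes) and returns a recognition result. Hypothetical.
Recognizer = Callable[[bytes], dict]


def dispatch_scan_result(
    scan_result: bytes, target_servers: Dict[str, Recognizer]
) -> Dict[str, dict]:
    """Forward the scan result to each target server and aggregate the results."""
    aggregated = {}
    for server_name, recognize in target_servers.items():
        aggregated[server_name] = recognize(scan_result)
    return aggregated


def ocr_server(scan_result: bytes) -> dict:
    # Stand-in for a server that returns OCR information.
    return {"kind": "ocr", "bytes_received": len(scan_result)}


def paraphrase_server(scan_result: bytes) -> dict:
    # Stand-in for a server that returns paraphrase information.
    return {"kind": "paraphrase", "bytes_received": len(scan_result)}


results = dispatch_scan_result(
    b"\x00\x01\x02",
    {"server A": ocr_server, "server B": paraphrase_server},
)
```

Keeping one entry per target server in the aggregate mirrors the point that independent servers contribute different kinds of recognition results.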
On the basis of the above embodiment, optionally, the scanning device is an intelligent dictionary pen.
The intelligent dictionary pen has a high-speed camera at its tip that captures approximately one hundred images per second, and the images are stitched together to form the text scanning result. The intelligent dictionary pen also has a built-in network transmission module, which sends the text scanning result to the target server over the network, and an external touch display screen, through which the user issues text recognition instructions and views text recognition results.
Using the intelligent dictionary pen as the scanning device broadens the range of application of the text recognition method provided by this embodiment, improves the text recognition speed and quality of the intelligent dictionary pen, and provides users with better learning assistance.
Fig. 2A is a flowchart of a text recognition method disclosed according to an embodiment of the present disclosure, which is further optimized and expanded based on the above technical solution, and can be combined with the above optional embodiments.
As shown in fig. 2A, the text recognition method disclosed in this embodiment may include:
S201, displaying a text recognition function switch, and generating a text recognition instruction according to a user's control operation on the text recognition function switch.
In one embodiment, a user clicks a "user center" button on a touch display of the scanning device, and the touch display correspondingly presents a user center interface to the user, wherein the user center interface includes a text recognition function switch. The user can select to turn on or turn off the text recognition function switch according to the self requirement, and after the user turns on the text recognition function switch, the text recognition function is turned on, so that a text recognition instruction is generated; correspondingly, when the user closes the text recognition function switch, the text recognition function is closed.
Fig. 2B is a schematic diagram of opening a user center interface according to an embodiment of the disclosure, as shown in fig. 2B, 200 represents a desktop page of a touch display screen of a scanning device, 201 represents a "user center" button, and when the button 201 is clicked, the touch display screen displays the user center interface.
Fig. 2C is a schematic diagram of a user center interface disclosed according to an embodiment of the present disclosure, as shown in fig. 2C, 202 represents the user center interface, 203 represents a text recognition function switch, a user can select to turn on or off the text recognition function switch 203 according to a requirement of the user, and when the user turns on the text recognition function switch 203, the text recognition function is turned on, so as to generate a text recognition instruction; accordingly, when the user turns off the text recognition function switch 203, the text recognition function is turned off.
Optionally, "displaying a text recognition function switch" in S201 includes:
controlling the text recognition function switch to be displayed in a first style when the text recognition function switch is in an on state and the network state of the scanning device is a disconnected state; controlling the text recognition function switch to be displayed in a second style when the text recognition function switch is in the on state and the network state of the scanning device is a connected state; and controlling the text recognition function switch to be displayed in a third style when the text recognition function switch is in an off state; wherein the first style, the second style, and the third style are different from one another.
In one embodiment, the scanning device detects the network state and the state of the text recognition function switch in real time. When the switch is in the on state and the network state of the scanning device is the disconnected state, the text recognition function is turned on but data cannot be transmitted to the target server because the network is disconnected, so the scanning device displays the switch to the user in the first style.
When the switch is in the on state and the network state of the scanning device is the connected state, the text recognition function works normally, and the scanning device displays the switch to the user in the second style.
When the switch is in the off state, the text recognition function is turned off and text recognition cannot be performed, and the scanning device displays the switch to the user in the third style.
The first style, the second style, and the third style represent different styles, that is, the text recognition function switch of the first style, the text recognition function switch of the second style, and the text recognition function switch of the third style have different visual appearances.
FIG. 2D is a schematic diagram of switch styles disclosed in accordance with an embodiment of the present disclosure. As shown in FIG. 2D, 204 represents a text recognition function switch in the first style, with a striped background fill and the switch button on the right side of the switch; 205 represents a text recognition function switch in the second style, with a solid background fill and the switch button on the right side of the switch; 206 represents a text recognition function switch in the third style, with no background fill and the switch button on the left side of the switch. This embodiment uses the above styles only as examples to explain the first, second, and third styles, and does not limit them.
Displaying the text recognition function switch in the first style when the switch is on and the network is disconnected, in the second style when the switch is on and the network is connected, and in the third style when the switch is off, with the three styles differing from one another, differentiates the display of the switch according to whether the scanning device can currently perform text recognition normally. The user can see at a glance whether text recognition is available, which improves the user experience.
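The mapping from switch state and network state to display style is a small decision table, sketched below. The enum `SwitchStyle` and the function `select_switch_style` are illustrative names that do not appear in the disclosure; the concrete visual appearance of each style is left to the user interface.

```python
from enum import Enum


class SwitchStyle(Enum):
    FIRST = 1   # switch on, network disconnected
    SECOND = 2  # switch on, network connected
    THIRD = 3   # switch off


def select_switch_style(switch_on: bool, network_connected: bool) -> SwitchStyle:
    """Choose the display style of the text recognition function switch."""
    if not switch_on:
        # The network state is irrelevant once the function is turned off.
        return SwitchStyle.THIRD
    if network_connected:
        return SwitchStyle.SECOND
    return SwitchStyle.FIRST
```

Checking the switch state first reflects the described behaviour that the third style applies whenever the switch is off, regardless of the network state.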
S202, determining the binding relationship between the candidate client and the scanning equipment according to the acquired text recognition instruction.
Optionally, after S202, the method further includes:
when no candidate client has a binding relationship with the scanning device, displaying binding identification information, the binding identification information being used to bind at least one candidate client to the scanning device.
In one embodiment, the scanning device determines the binding relationship between each candidate client and the scanning device. If none of the candidate clients is bound to the scanning device, that is, no candidate client has a binding relationship with the scanning device, binding prompt information is displayed to tell the user that at least one candidate client must be bound to the scanning device before text recognition can be performed. After the user taps the identification button, the binding identification information is displayed, and the user binds at least one candidate client to the scanning device according to it. The binding identification information includes, but is not limited to, a two-dimensional code, a barcode, or an identification code. For example, the user invokes the scanning function of any candidate client to scan the two-dimensional code, thereby binding that candidate client to the scanning device.
FIG. 2E is a schematic diagram of binding prompt information disclosed in accordance with an embodiment of the present disclosure. As shown in FIG. 2E, when no candidate client has a binding relationship with the scanning device, binding prompt information 207 pops up on the user center interface, for example, "Please display the binding identification information in the upper right corner, scan it with the XX client, and then turn on the function". This embodiment uses the binding prompt information 207 only as an example and does not limit the specific content of the binding prompt information.
Displaying the binding identification information when no candidate client has a binding relationship with the scanning device, so that at least one candidate client can be bound to the scanning device, guides the user to bind a candidate client and ensures that text recognition can be carried out smoothly.
S203, determining at least one target client having a binding relationship with the scanning device from the candidate clients, and taking the candidate server associated with the target client as the target server.
Exemplarily, it is assumed that the candidate clients of the scanning device include a candidate client a, a candidate client B, a candidate client C, and a candidate client D, and the candidate servers respectively associated with the candidate clients are a candidate server a, a candidate server B, a candidate server C, and a candidate server D. And if the candidate client A and the candidate client B have the binding relation with the scanning equipment, taking the candidate client A and the candidate client B as target clients, and taking the candidate server A and the candidate server B as target servers.
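The selection in S203 can be illustrated with the same four clients and servers. The sketch below assumes, for the purpose of the example only, that associations are held in a dictionary mapping each candidate client to its list of candidate servers; `select_targets` is a hypothetical helper name.

```python
from typing import Dict, List, Set, Tuple


def select_targets(
    associations: Dict[str, List[str]], bound_clients: Set[str]
) -> Tuple[List[str], List[str]]:
    """Return the target clients and the target servers associated with them."""
    target_clients = [c for c in associations if c in bound_clients]
    target_servers: List[str] = []
    for client in target_clients:
        for server in associations[client]:
            if server not in target_servers:  # avoid listing a shared server twice
                target_servers.append(server)
    return target_clients, target_servers


associations = {
    "client A": ["server A"],
    "client B": ["server B"],
    "client C": ["server C"],
    "client D": ["server D"],
}
target_clients, target_servers = select_targets(
    associations, bound_clients={"client A", "client B"}
)
```

With clients A and B bound, the target clients are A and B and the target servers are A and B, matching the example in the text.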
S204, sending the acquired text scanning result to the target server, so that the target server determines a text recognition result according to the text scanning result.
Optionally, before S204, the method further includes:
determining a network state of the scanning device; and, when the network state is a disconnected state, displaying network disconnection prompt information, the network disconnection prompt information being used to prompt connecting the scanning device to the network.
In one embodiment, the scanning device checks whether its own network transmission function is working normally. If it is not, that is, the network state is the disconnected state, the scanning device displays network disconnection prompt information to prompt the user to reconnect the scanning device to the network.
FIG. 2F is a schematic diagram of network disconnection prompt information disclosed in accordance with an embodiment of the present disclosure. As shown in FIG. 2F, when the network state of the scanning device is the disconnected state, network disconnection prompt information 208 pops up on the user center interface, for example, "The network is disconnected. Please connect to the network first and then turn on the function". This embodiment uses the network disconnection prompt information 208 only as an example and does not limit the specific content of the network disconnection prompt information.
Determining the network state of the scanning device and displaying the network disconnection prompt information when the network state is the disconnected state guides the user to connect the scanning device to the network and ensures that text recognition can be carried out smoothly.
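Taken together, the binding check after S202 and the network check before S204 act as preconditions for sending the text scanning result. One possible way to combine them is sketched below. The order of the two checks follows the order of the steps in this embodiment, but treating them as a single function is an assumption, and the `Prompt` identifiers are hypothetical placeholders for the prompt information shown to the user.

```python
from enum import Enum
from typing import Optional


class Prompt(Enum):
    BINDING_REQUIRED = "binding_required"          # placeholder for the binding prompt
    NETWORK_DISCONNECTED = "network_disconnected"  # placeholder for the network prompt


def prompt_before_sending(
    has_bound_client: bool, network_connected: bool
) -> Optional[Prompt]:
    """Return the prompt to display, or None if the scan result can be sent."""
    if not has_bound_client:
        return Prompt.BINDING_REQUIRED
    if not network_connected:
        return Prompt.NETWORK_DISCONNECTED
    return None
```

Returning `None` when both checks pass lets the caller proceed directly to sending the text scanning result.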
In this embodiment, displaying the text recognition function switch and generating the text recognition instruction according to the user's control operation on the switch lets the user turn the text recognition function on or off, which improves the user experience. Determining, from the candidate clients, at least one target client that has a binding relationship with the scanning device, and taking the candidate server associated with the target client as the target server, identifies the server that performs text recognition and ensures that text recognition can be carried out smoothly. Because this embodiment provides multiple candidate servers, the diversity and richness of the text recognition results are also improved.
On the basis of the above embodiment, optionally, the text recognition function switch may also be displayed in the form of a shortcut icon in a shortcut setting page of the scanning device.
FIG. 2G is a schematic diagram of a shortcut icon disclosed in accordance with an embodiment of the present disclosure. As shown in FIG. 2G, 209 is the shortcut icon of the text recognition function switch in the shortcut settings page, and the user can operate the shortcut icon 209 to turn the text recognition function on or off. When the text recognition function switch is in the on state and the network state of the scanning device is the disconnected state, the shortcut icon 209 is displayed in a fourth style; when the text recognition function switch is in the on state and the network state of the scanning device is the connected state, the shortcut icon 209 is displayed in a fifth style; when the text recognition function switch is in the off state, the shortcut icon 209 is displayed in a sixth style; wherein the fourth style, the fifth style, and the sixth style are different from one another.
Through the shortcut icon of the text recognition function switch displayed in the shortcut setting page, a user can conveniently and quickly control the text recognition function to be turned on or turned off, and the efficiency is improved.
During research and development, the applicant found that most existing methods for displaying a text recognition result use a short-range communication technology such as Wi-Fi or Bluetooth to transmit the text recognition result obtained by the scanning device to the client for display. However, such methods require the scanning device and the terminal to which the client belongs to remain close to each other, which is a significant limitation.
Fig. 3A is a flowchart of a text presentation method according to an embodiment of the present disclosure, which may be applied to a case where a client presents a text recognition result. The method of this embodiment may be executed by the text display apparatus disclosed in the embodiments of the present disclosure, and the apparatus may be implemented by software and/or hardware and may be integrated on any electronic device with computing capability.
As shown in fig. 3A, the text presentation method disclosed in this embodiment may include:
S301, acquiring a text recognition result from a target server, where the target server is determined from the candidate servers according to the binding relationship between the candidate clients and the scanning device.
In one implementation mode, at least one target client having a binding relationship with the scanning device is determined from the candidate clients, the candidate server associated with the target client is used as the target server, and the client acquires a text recognition result from the target server.
In an actual scenario, a user clicks a "real-time display" button in a function page of a client to pop up prompt information, for example, "please confirm that a bound scanning device is in a networking state", and after the user clicks the confirmation button, the client determines whether the client and the scanning device are in an available state.
Fig. 3B is an interface schematic diagram of a function page disclosed according to an embodiment of the present disclosure, and as shown in fig. 3B, 300 represents a function interface of a client, and after a user clicks a real-time display button 301, a prompt message pops up, for example, "please confirm that a bound scanning device is in a networking state", and after the user clicks a confirmation button, the client determines whether the client and the scanning device are in an available state.
If the network state of the terminal to which the client belongs is the connected state, determining that the client is in the available state; and if the text recognition function switch of the scanning equipment is turned on, determining that the scanning equipment is in an available state.
When the scanning device is in the unavailable state, guidance information is displayed, for example, "please check whether the text recognition function switch of the scanning device is turned on". When the client is in the unavailable state, guidance information is displayed, such as "please check the mobile phone network and try again".
When the client itself and the scanning device are both in the available state, guidance information is displayed, for example, "start scanning the word-searching bar, and the recognition result is displayed here synchronously". And then, the client starts to acquire a text recognition result from the target server.
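The availability check and the guidance messages described above can be sketched as follows. This is a minimal sketch under stated assumptions: the function name `check_availability` and the constant names are hypothetical, and checking the client before the scanning device is an assumption, since the disclosure does not specify which message wins when both are unavailable.

```python
from typing import Tuple

# Guidance texts taken from the examples in the description.
SCANNER_HINT = "please check whether the text recognition function switch of the scanning device is turned on"
CLIENT_HINT = "please check the mobile phone network and try again"
READY_HINT = "start scanning the word-searching bar, and the recognition result is displayed here synchronously"


def check_availability(client_network_connected: bool, scanner_switch_on: bool) -> Tuple[bool, str]:
    """Return (may_start_fetching, guidance_message) for the client UI."""
    # Assumption: the client is checked first when both sides are unavailable.
    if not client_network_connected:
        return False, CLIENT_HINT
    if not scanner_switch_on:
        return False, SCANNER_HINT
    # Both available: the client may start acquiring results from the target server.
    return True, READY_HINT
```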
S302, displaying the text recognition result.
In one embodiment, the client first acquires the OCR result from the target server and displays it on screen, and then sequentially acquires other types of text recognition results from the target server. Displaying the OCR result first gives the user a fast display of the recognition result and reduces the user's waiting time.
Fig. 3C is a schematic diagram of text recognition result presentation according to an embodiment of the disclosure, and as shown in fig. 3C, an OCR result 302 is presented in the client, and a prompt message 303 "in query" is presented to prompt the user to patiently wait for other types of text recognition results.
After the client obtains the other types of text recognition results from the target server, it displays the complete text recognition result.
Fig. 3D is a schematic diagram of text recognition result presentation according to an embodiment of the present disclosure. As shown in Fig. 3D, the client displays the complete text recognition result.
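The OCR-first display order can be illustrated with a short sketch. Everything here is an assumption made for illustration: `present_progressively`, the `fetch` and `render` callbacks, and the result type name `"translation"` in the usage example are hypothetical and do not come from the disclosure.

```python
from typing import Callable, Dict, Sequence

Screen = Dict[str, str]


def present_progressively(
    fetch: Callable[[str], str],
    other_types: Sequence[str],
    render: Callable[[Screen], None],
) -> Screen:
    """Show the OCR result first, then the complete result once all types arrive."""
    # First frame: OCR text plus the "in query" prompt (302 and 303 in Fig. 3C).
    screen: Screen = {"ocr": fetch("ocr"), "prompt": "in query"}
    render(dict(screen))

    # Acquire the remaining result types one after another.
    for kind in other_types:
        screen[kind] = fetch(kind)

    # Final frame: the complete result without the waiting prompt (Fig. 3D).
    del screen["prompt"]
    render(dict(screen))
    return screen
```

For example, calling `present_progressively(fetch, ["translation"], render)` renders two frames: one with the OCR text and the waiting prompt, and one with the OCR text and the translation.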
During display, the network of the scanning device or the network of the terminal to which the client belongs may be disconnected. In this case, the client displays prompt information, for example, "please check the network states of the mobile phone and the scanning device". After the user has restored the network connections of both the terminal to which the client belongs and the scanning device, the user clicks the "reacquire" button to trigger the client to reacquire the text recognition result from the target server and display it.
By acquiring the text recognition result from the target server, where the target server is determined from the candidate servers according to the binding relationship between the candidate clients and the scanning device, and displaying the text recognition result, the text recognition result is presented to the user in the client, so that the user can conveniently view it. Because the text recognition result is sent by the server, the terminal to which the client belongs does not need to stay close to the scanning device, which greatly widens the applicability of the method and reduces its limitations. Meanwhile, a plurality of candidate servers are provided in this embodiment, which improves the diversity and richness of text recognition results.
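The prompt-and-reacquire behavior might look like the sketch below. It assumes, purely for illustration, that a dropped network surfaces as Python's built-in `ConnectionError`; the function name `acquire_or_prompt` and the `fetch` callback are hypothetical.

```python
from typing import Callable, Optional, Tuple

NETWORK_PROMPT = "please check the network states of the mobile phone and the scanning device"


def acquire_or_prompt(fetch: Callable[[], str]) -> Tuple[Optional[str], Optional[str]]:
    """Return (result, None) on success, or (None, prompt) if the network is down."""
    try:
        return fetch(), None
    except ConnectionError:
        # Assumption: a disconnected network is reported as ConnectionError.
        return None, NETWORK_PROMPT
```

A handler for the "reacquire" button can simply call `acquire_or_prompt` again with the same `fetch` callback once the user has restored both network connections.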
Fig. 4 is a schematic structural diagram of a text recognition apparatus according to an embodiment of the present disclosure, which may be applied to a case of performing text recognition based on a server. The device of the embodiment can be implemented by software and/or hardware, and can be integrated on any electronic equipment with computing capability.
As shown in Fig. 4, the text recognition apparatus 40 disclosed in this embodiment may include a binding relationship determining module 41 and a scanning result sending module 42, where:
the binding relationship determining module 41 is configured to determine a binding relationship between the candidate clients and the scanning device according to the obtained text recognition instruction; and
the scanning result sending module 42 is configured to determine a target server from the candidate servers according to the binding relationship, and send the obtained text scanning result to the target server, so that the target server determines a text recognition result according to the text scanning result.
Optionally, the scanning result sending module 42 is specifically configured to:
determining at least one target client having a binding relationship with the scanning device from the candidate clients;
and taking the candidate server associated with the target client as the target server.
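The two steps above (find the bound target clients, then take their associated servers) can be sketched with plain dictionaries. The data shapes and the name `select_target_servers` are hypothetical; the disclosure does not specify how binding relationships or client-server associations are stored.

```python
from typing import Dict, List, Set


def select_target_servers(
    device_id: str,
    candidate_clients: List[str],
    bindings: Dict[str, Set[str]],
    client_to_server: Dict[str, str],
) -> List[str]:
    """Return the servers associated with clients bound to the scanning device.

    bindings maps a client id to the set of device ids bound to it.
    client_to_server maps a client id to its associated candidate server;
    every candidate client is assumed to have an entry.
    """
    # Target clients: candidates that have a binding relationship with this device.
    target_clients = [c for c in candidate_clients if device_id in bindings.get(c, set())]

    # Target servers: the servers associated with those clients, without duplicates.
    servers: List[str] = []
    for client in target_clients:
        server = client_to_server[client]
        if server not in servers:
            servers.append(server)
    return servers
```

An empty return value corresponds to the case in which no candidate client is bound to the scanning device.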
Optionally, the apparatus further includes a switch display module, specifically configured to:
displaying a text recognition function switch;
and generating the text recognition instruction according to the user control operation of the text recognition function switch.
Optionally, the switch display module is further specifically configured to:
controlling the text recognition function switch to be in a first style in the case where the text recognition function switch is in an on state and the network state of the scanning device is a disconnected state;
controlling the text recognition function switch to be in a second style in the case where the text recognition function switch is in an on state and the network state of the scanning device is a connected state;
controlling the text recognition function switch to be in a third style in the case where the text recognition function switch is in an off state;
wherein the first style, the second style, and the third style are different from one another.
Optionally, the apparatus further includes a binding identifier information display module, specifically configured to:
in the case where none of the candidate clients has a binding relationship with the scanning device, displaying binding identification information, so that at least one candidate client is bound with the scanning device according to the binding identification information.
Optionally, the device further includes a prompt information display module, specifically configured to:
determining a network status of the scanning device;
and in the case where the network state is a disconnected state, displaying network disconnection prompt information, so that the scanning device is connected to the network according to the network disconnection prompt information.
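The network check performed before sending can be sketched as a simple gate. The prompt wording, the `send` callback, and the function name `send_scan_result` are all hypothetical; the disclosure only states that prompt information is displayed when the network is disconnected.

```python
from typing import Callable, List, Optional

# Hypothetical wording; the disclosure does not give the exact prompt text.
DISCONNECTED_PROMPT = "Network disconnected: please connect the scanning device to a network"


def send_scan_result(
    network_connected: bool,
    scan_result: bytes,
    target_servers: List[str],
    send: Callable[[str, bytes], None],
) -> Optional[str]:
    """Send the scan result to every target server, or return a prompt to display."""
    if not network_connected:
        # Nothing is sent while disconnected; the caller displays the prompt.
        return DISCONNECTED_PROMPT
    for server in target_servers:
        send(server, scan_result)
    return None
```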
Optionally, the scanning device is an intelligent dictionary pen.
The text recognition apparatus 40 disclosed in the embodiment of the present disclosure can execute the text recognition method disclosed in the embodiments of the present disclosure, and has the functional modules and beneficial effects corresponding to the executed method. For details not explicitly described in this embodiment, reference may be made to the description of any method embodiment of the present disclosure.
Fig. 5 is a schematic structural diagram of a text display apparatus according to an embodiment of the present disclosure, which may be suitable for a case where a client displays a text recognition result. The device of the embodiment can be implemented by software and/or hardware, and can be integrated on any electronic equipment with computing capability.
As shown in fig. 5, the text display apparatus 50 disclosed in this embodiment may include a recognition result obtaining module 51 and a recognition result displaying module 52, where:
a recognition result obtaining module 51, configured to obtain a text recognition result from the target server; the target server side is determined from the candidate server sides according to the binding relation between the candidate client sides and the scanning equipment;
and the recognition result display module 52 is configured to display the text recognition result.
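One way to picture the two modules of the text display apparatus 50 is as two small classes composed by a third. This is a structural sketch only: the class names and the injected `fetch` and `render` callbacks are hypothetical, and the disclosure notes that the apparatus may be implemented by software and/or hardware.

```python
from typing import Callable


class RecognitionResultObtainingModule:
    """Sketch of module 51: obtains the text recognition result from the target server."""

    def __init__(self, fetch_from_target_server: Callable[[], str]) -> None:
        self._fetch = fetch_from_target_server

    def obtain(self) -> str:
        return self._fetch()


class RecognitionResultDisplayModule:
    """Sketch of module 52: displays the text recognition result."""

    def __init__(self, render: Callable[[str], None]) -> None:
        self._render = render

    def display(self, result: str) -> None:
        self._render(result)


class TextDisplayApparatus:
    """Sketch of apparatus 50: wires module 51 to module 52."""

    def __init__(
        self,
        obtaining: RecognitionResultObtainingModule,
        displaying: RecognitionResultDisplayModule,
    ) -> None:
        self.obtaining = obtaining
        self.displaying = displaying

    def run(self) -> str:
        result = self.obtaining.obtain()
        self.displaying.display(result)
        return result
```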
The text display apparatus 50 disclosed in the embodiment of the present disclosure can execute the text presentation method disclosed in the embodiments of the present disclosure, and has the functional modules and beneficial effects corresponding to the executed method. For details not explicitly described in this embodiment, reference may be made to the description of any method embodiment of the present disclosure.
In the technical solution of the present disclosure, the collection, storage, use, processing, transmission, provision, and disclosure of the personal information of the users involved all comply with the relevant laws and regulations and do not violate public order and good customs.
The present disclosure also provides an electronic device, a readable storage medium, and a computer program product according to embodiments of the present disclosure.
FIG. 6 illustrates a schematic block diagram of an example electronic device 600 that can be used to implement embodiments of the present disclosure. Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other appropriate computers. The electronic device may also represent various forms of mobile devices, such as personal digital assistants, cellular phones, smart phones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions, are meant to be examples only, and are not meant to limit implementations of the disclosure described and/or claimed herein.
As shown in Fig. 6, the device 600 includes a computing unit 601, which can perform various appropriate actions and processes according to a computer program stored in a Read Only Memory (ROM) 602 or a computer program loaded from a storage unit 608 into a Random Access Memory (RAM) 603. In the RAM 603, various programs and data required for the operation of the device 600 can also be stored. The computing unit 601, the ROM 602, and the RAM 603 are connected to each other via a bus 604. An input/output (I/O) interface 605 is also connected to the bus 604.
A number of components in the device 600 are connected to the I/O interface 605, including: an input unit 606 such as a keyboard, a mouse, or the like; an output unit 607 such as various types of displays, speakers, and the like; a storage unit 608, such as a magnetic disk, optical disk, or the like; and a communication unit 609 such as a network card, modem, wireless communication transceiver, etc. The communication unit 609 allows the device 600 to exchange information/data with other devices via a computer network such as the internet and/or various telecommunication networks.
The computing unit 601 may be a variety of general and/or special purpose processing components having processing and computing capabilities. Some examples of the computing unit 601 include, but are not limited to, a Central Processing Unit (CPU), a Graphics Processing Unit (GPU), various dedicated Artificial Intelligence (AI) computing chips, various computing units running machine learning model algorithms, a Digital Signal Processor (DSP), and any suitable processor, controller, microcontroller, and so forth. The computing unit 601 performs the respective methods and processes described above, such as the text recognition method and/or the text presentation method. For example, in some embodiments, the text recognition method and/or the text presentation method may be implemented as a computer software program tangibly embodied in a machine-readable medium, such as the storage unit 608. In some embodiments, part or all of the computer program may be loaded and/or installed onto the device 600 via the ROM 602 and/or the communication unit 609. When the computer program is loaded into the RAM 603 and executed by the computing unit 601, one or more steps of the text recognition method and/or the text presentation method described above may be performed. Alternatively, in other embodiments, the computing unit 601 may be configured to perform the text recognition method and/or the text presentation method by any other suitable means (e.g., by means of firmware).
Various implementations of the systems and techniques described above may be implemented in digital electronic circuitry, integrated circuitry, Field Programmable Gate Arrays (FPGAs), Application Specific Integrated Circuits (ASICs), Application Specific Standard Products (ASSPs), systems on a chip (SOCs), complex programmable logic devices (CPLDs), computer hardware, firmware, software, and/or combinations thereof. These various implementations may include being implemented in one or more computer programs that are executable and/or interpretable on a programmable system including at least one programmable processor, which may be special or general purpose, receiving data and instructions from, and transmitting data and instructions to, a storage system, at least one input device, and at least one output device.
Program code for implementing the methods of the present disclosure may be written in any combination of one or more programming languages. The program code may be provided to a processor or controller of a general purpose computer, special purpose computer, or other programmable data processing apparatus, such that the program code, when executed by the processor or controller, causes the functions/operations specified in the flowchart and/or block diagram to be performed. The program code may execute entirely on the machine, partly on the machine, as a stand-alone software package partly on the machine and partly on a remote machine, or entirely on the remote machine or server.
In the context of this disclosure, a machine-readable medium may be a tangible medium that can contain or store a program for use by or in connection with an instruction execution system, apparatus, or device. The machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. A machine-readable medium may include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples of a machine-readable storage medium would include an electrical connection based on one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
To provide for interaction with a user, the systems and techniques described here can be implemented on a computer having: a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to a user; and a keyboard and a pointing device (e.g., a mouse or a trackball) by which a user can provide input to the computer. Other kinds of devices may also be used to provide for interaction with a user; for example, feedback provided to the user can be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and input from the user may be received in any form, including acoustic, speech, or tactile input.
The systems and techniques described here can be implemented in a computing system that includes a back-end component (e.g., as a data server), or that includes a middleware component (e.g., an application server), or that includes a front-end component (e.g., a user computer having a graphical user interface or a web browser through which a user can interact with an implementation of the systems and techniques described here), or any combination of such back-end, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include: local Area Networks (LANs), Wide Area Networks (WANs), blockchain networks, and the internet.
The computer system may include clients and servers. A client and a server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other. The server may be a cloud server, a server of a distributed system, or a server combined with a blockchain.
It should be understood that various forms of the flows shown above may be used, with steps reordered, added, or deleted. For example, the steps described in the present disclosure may be executed in parallel, sequentially, or in different orders, as long as the desired results of the technical solutions disclosed in the present disclosure can be achieved, and the present disclosure is not limited herein.
The above detailed description should not be construed as limiting the scope of the disclosure. It should be understood by those skilled in the art that various modifications, combinations, sub-combinations and substitutions may be made in accordance with design requirements and other factors. Any modification, equivalent replacement, and improvement made within the spirit and principle of the present disclosure should be included in the scope of protection of the present disclosure.

Claims (19)

1. A text recognition method, comprising:
determining a binding relationship between candidate clients and a scanning device according to an acquired text recognition instruction; and
determining a target server from candidate servers according to the binding relationship, and sending an obtained text scanning result to the target server, so that the target server determines a text recognition result according to the text scanning result.
2. The method of claim 1, wherein the determining a target server from the candidate servers according to the binding relationship comprises:
determining at least one target client having a binding relationship with the scanning device from the candidate clients;
and taking the candidate server associated with the target client as the target server.
3. The method of claim 1, before determining the binding relationship between the candidate client and the scanning device according to the obtained text recognition instruction, further comprising:
displaying a text recognition function switch;
and generating the text recognition instruction according to the user control operation of the text recognition function switch.
4. The method of claim 3, wherein displaying the text recognition function switch comprises:
controlling the text recognition function switch to be in a first style in the case where the text recognition function switch is in an on state and the network state of the scanning device is a disconnected state;
controlling the text recognition function switch to be in a second style in the case where the text recognition function switch is in an on state and the network state of the scanning device is a connected state;
controlling the text recognition function switch to be in a third style in the case where the text recognition function switch is in an off state;
wherein the first style, the second style, and the third style are different from one another.
5. The method of claim 1, after determining the binding relationship between the candidate client and the scanning device, further comprising:
in the case where none of the candidate clients has a binding relationship with the scanning device, displaying binding identification information, so that at least one candidate client is bound with the scanning device according to the binding identification information.
6. The method of claim 1, before sending the obtained text scanning result to the target server, further comprising:
determining a network status of the scanning device;
and in the case where the network state is a disconnected state, displaying network disconnection prompt information, so that the scanning device is connected to the network according to the network disconnection prompt information.
7. The method of claim 1, wherein the scanning device is a smart dictionary pen.
8. A text presentation method, comprising:
acquiring a text recognition result from a target server, wherein the target server is determined from candidate servers according to a binding relationship between candidate clients and a scanning device; and
displaying the text recognition result.
9. A text recognition apparatus comprising:
a binding relationship determining module, configured to determine a binding relationship between candidate clients and a scanning device according to an acquired text recognition instruction; and
a scanning result sending module, configured to determine a target server from candidate servers according to the binding relationship, and send an obtained text scanning result to the target server, so that the target server determines a text recognition result according to the text scanning result.
10. The apparatus according to claim 9, wherein the scan result sending module is specifically configured to:
determining at least one target client having a binding relationship with the scanning device from the candidate clients;
and taking the candidate server associated with the target client as the target server.
11. The device of claim 9, further comprising a switch presentation module, specifically configured to:
displaying a text recognition function switch;
and generating the text recognition instruction according to the user control operation of the text recognition function switch.
12. The apparatus of claim 11, wherein the switch presentation module is further specifically configured to:
control the text recognition function switch to be in a first style in the case where the text recognition function switch is in an on state and the network state of the scanning device is a disconnected state;
control the text recognition function switch to be in a second style in the case where the text recognition function switch is in an on state and the network state of the scanning device is a connected state;
control the text recognition function switch to be in a third style in the case where the text recognition function switch is in an off state;
wherein the first style, the second style, and the third style are different from one another.
13. The apparatus according to claim 9, further comprising a binding identifier information presentation module, specifically configured to:
in the case where none of the candidate clients has a binding relationship with the scanning device, display binding identification information, so that at least one candidate client is bound with the scanning device according to the binding identification information.
14. The device of claim 9, further comprising a prompt message presentation module specifically configured to:
determining a network status of the scanning device;
and in the case where the network state is a disconnected state, displaying network disconnection prompt information, so that the scanning device is connected to the network according to the network disconnection prompt information.
15. The apparatus of claim 9, wherein the scanning device is a smart dictionary pen.
16. A text presentation device comprising:
a recognition result obtaining module, configured to acquire a text recognition result from a target server, wherein the target server is determined from candidate servers according to a binding relationship between candidate clients and a scanning device; and
a recognition result display module, configured to display the text recognition result.
17. An electronic device, comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of any one of claims 1-7 and/or claim 8.
18. A non-transitory computer readable storage medium having stored thereon computer instructions for causing a computer to perform the method of any one of claims 1-7 and/or claim 8.
19. A computer program product comprising a computer program which, when executed by a processor, implements the method according to any one of claims 1-7 and/or claim 8.
CN202110938868.6A 2021-08-16 2021-08-16 Text recognition and display method, device, electronic equipment and medium Active CN113641439B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110938868.6A CN113641439B (en) 2021-08-16 2021-08-16 Text recognition and display method, device, electronic equipment and medium

Publications (2)

Publication Number Publication Date
CN113641439A true CN113641439A (en) 2021-11-12
CN113641439B CN113641439B (en) 2023-08-29

Family

ID=78422151

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110938868.6A Active CN113641439B (en) 2021-08-16 2021-08-16 Text recognition and display method, device, electronic equipment and medium

Country Status (1)

Country Link
CN (1) CN113641439B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115086501A (en) * 2022-05-19 2022-09-20 阿波罗智联(北京)科技有限公司 Scanning method, scanning device, electronic equipment and storage medium

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102904909A (en) * 2011-07-25 2013-01-30 上海博路信息技术有限公司 OCR (Optical Character Recognition) method based on cloud model
CN102902968A (en) * 2011-07-25 2013-01-30 上海博路信息技术有限公司 Method for quickly acquiring publication content by scanning with mobile phone
CN103176964A (en) * 2011-12-21 2013-06-26 上海博路信息技术有限公司 Translation auxiliary system based on OCR
US20140172408A1 (en) * 2012-12-14 2014-06-19 Microsoft Corporation Text overlay techniques in realtime translation
US20150120279A1 (en) * 2013-10-28 2015-04-30 Linkedin Corporation Techniques for translating text via wearable computing device
CN110611685A (en) * 2019-10-30 2019-12-24 南宁市指搜信息技术有限公司 Internet site login system based on intelligent equipment monitoring and user identity recognition
CN111862940A (en) * 2020-07-15 2020-10-30 百度在线网络技术(北京)有限公司 Earphone-based translation method, device, system, equipment and storage medium
CN112382286A (en) * 2020-11-10 2021-02-19 苏州思必驰信息科技有限公司 Method, device and system for realizing online evaluation of learning effect based on intelligent voice control

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
WEIXIN_42824978: "OCR在扫描笔中的技术应用", Retrieved from the Internet <URL:https://blog.csdn.net/weixin_42824978/article/details/114784253> *

Also Published As

Publication number Publication date
CN113641439B (en) 2023-08-29

CN113992729B (en) Cloud mobile phone control method, related device and computer program product
EP4207714A1 (en) Page display method and electronic device

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant