CN113743102B - Method and device for recognizing characters and electronic equipment - Google Patents

Method and device for recognizing characters and electronic equipment Download PDF

Info

Publication number
CN113743102B
CN113743102B CN202110950635.8A CN202110950635A CN113743102B CN 113743102 B CN113743102 B CN 113743102B CN 202110950635 A CN202110950635 A CN 202110950635A CN 113743102 B CN113743102 B CN 113743102B
Authority
CN
China
Prior art keywords
character
recognized
text
target
characters
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110950635.8A
Other languages
Chinese (zh)
Other versions
CN113743102A (en
Inventor
张铭阳
蒋峰
张志达
胡晓雨
张国鹏
陈轶博
高丰
谢卓
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN202110950635.8A priority Critical patent/CN113743102B/en
Publication of CN113743102A publication Critical patent/CN113743102A/en
Application granted granted Critical
Publication of CN113743102B publication Critical patent/CN113743102B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/242Dictionaries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Abstract

The disclosure provides a method and a device for recognizing characters and electronic equipment, relates to the field of data processing, and particularly relates to the field of word processing. The specific implementation scheme is as follows: acquiring at least one character to be recognized, which is obtained after scanning a text to be recognized; displaying an initial recognition result of character error correction on at least one character to be recognized; determining a target character from at least one character to be recognized in response to a selection instruction of the at least one character to be recognized contained in the initial recognition result; and identifying the target character to obtain an identification result. By the method and the device, the problem of low accuracy of character recognition in the prior art is at least solved.

Description

Method and device for recognizing characters and electronic equipment
Technical Field
The disclosure relates to the technical field of data processing, and in particular relates to a method and device for recognizing characters and electronic equipment.
Background
With the development of internet technology, various text recognition devices enter into daily life of people, and the text recognition devices can help people to perform quick scanning inquiry of Chinese and English, including: word searching, sentence searching and translation are beneficial to improving the working efficiency of people.
In the conventional various text recognition devices, a user is required to perform stroke scanning on a text by using a pen point, so that text contents shot by a camera on the pen point are spliced and OCR (optical character recognition, text recognition), and then word searching, translation and other processes are performed on an OCR result, and a final recognition result is obtained.
However, the result of OCR can only be recognized at a character level, that is, the OCR follows the principle that the OCR is obtained when the OCR is seen in the text recognition process, and does not perform any semantic correction and prediction of the actual intention of the user, on the basis of which, the text recognition device directly retrieves and translates the result of OCR, the recognized character is often inaccurate, and thus, the user often has a sense of unconsciousness in the use process.
In view of the above problems, no effective solution has been proposed at present.
Disclosure of Invention
The disclosure provides a method and a device for recognizing characters and electronic equipment, so as to at least solve the problem of low character recognition accuracy in the prior art.
According to an aspect of the present disclosure, there is provided a method of recognizing a character, including: acquiring at least one character to be recognized, which is obtained after scanning a text to be recognized; displaying an initial recognition result of character error correction on at least one character to be recognized; in response to a selection instruction of at least one character to be recognized contained in the initial recognition result, determining a target character from the at least one character to be recognized, and recognizing the target character to obtain a recognition result.
Further, the method for recognizing characters further comprises: and acquiring at least one character to be recognized, which is obtained after the document to be recognized is scanned by a text scanning unit, wherein the text recognition equipment at least comprises the text scanning unit and a text display unit, and the text display unit is at least used for displaying the initial recognition result, the target character and the recognition result.
Further, the method for recognizing characters further comprises: and identifying the target character to obtain the identification result, acquiring the semantic meaning of the target character in the text to be identified, and identifying the target character according to the semantic meaning to obtain the identification result.
Further, the method for recognizing characters further comprises: and displaying an initial recognition result obtained by performing natural language processing on at least one character to be recognized in a text display unit.
Further, the method for recognizing characters further comprises: before at least one character to be recognized, which is obtained after a text to be recognized is scanned, is obtained, when the text scanning unit scans abnormally, prompt information is displayed on the text display unit; and when the display time of the prompt information is longer than the second preset time, hiding the prompt information in the text display unit.
Further, the method for recognizing characters further comprises: before determining a target character from at least one character to be recognized in response to a selection instruction of the at least one character to be recognized contained in the initial recognition result, detecting a character type corresponding to the at least one character to be recognized; and responding to a splitting instruction of at least one character to be recognized, and displaying at least one splitting character after splitting operation is performed on the at least one character to be recognized according to the character type in a text display unit.
Further, the method for recognizing characters further comprises: splitting at least one character to be recognized according to any one or more modes of the following; splitting at least one character to be recognized based on characters contained in a preset word stock; splitting at least one character to be recognized according to the number of units contained in each character to be recognized, wherein each character to be recognized consists of at least one unit; and splitting the at least one character to be recognized according to the word frequency of the at least one character to be recognized.
Further, the method for recognizing characters further comprises: detecting whether semantics corresponding to target characters exist in a preset word stock or not; when the semantics corresponding to the target characters do not exist in the preset word stock, performing character adding operation and/or character deleting operation on the target characters to obtain first target characters; inquiring the semantics corresponding to the first target character in a preset word stock, and displaying the semantics corresponding to the first target character in a text display unit.
Further, the method for recognizing characters further comprises: detecting whether semantics corresponding to target characters exist in a preset word stock or not; when the semantics corresponding to the target characters exist in the preset word stock, highlighting the semantics corresponding to the target characters in the text display unit; performing character adding operation and/or character deleting operation on the target characters to obtain second target characters; inquiring the semantics corresponding to the second target character in a preset word stock, and displaying the second target character and the semantics corresponding to the second target character in a text display unit.
Further, the method for recognizing characters further comprises: in response to a touch operation of a voice playing unit in the text recognition device, playing the target character and/or a recognition result of the target character.
Further, the method for recognizing characters further comprises: after detecting that the text scanning unit scans at least one character, the at least one character is displayed on the text display unit.
Further, the method for recognizing characters further comprises: acquiring the distance between a scanning unit of text recognition equipment and a text to be recognized; recording a first duration with a distance greater than a preset distance; when the first time length is smaller than a first preset time length, acquiring a first character scanned by the text recognition device before the first time length and a second character scanned after the first time length; continuing the first character and the second character to obtain at least one character to be recognized; at least one character to be recognized is displayed on the text display unit.
According to an aspect of the present disclosure, there is also provided an apparatus for recognizing a character, including: the acquisition module is used for acquiring at least one character to be identified, which is obtained after the text to be identified is scanned; the display module is used for displaying an initial recognition result of character error correction on at least one character to be recognized; the response module is used for responding to the selection instruction of at least one character to be recognized contained in the initial recognition result, and determining a target character from the at least one character to be recognized; and the identification module is used for identifying the target character to obtain the identification result.
According to another aspect of the present disclosure, there is provided an electronic device including: at least one processor; and a memory communicatively coupled to the at least one processor; the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of recognizing characters.
According to another aspect of the present disclosure, there is provided a non-transitory computer-readable storage medium storing computer instructions for causing a computer to perform the above-described method of recognizing characters.
According to another aspect of the present disclosure, there is provided a computer program product comprising a computer program which, when executed by a processor, implements the above-described method of recognizing characters.
According to another aspect of the present disclosure, there is provided a text recognition apparatus including: the text scanning unit is used for scanning the text to be recognized to obtain at least one character to be recognized; the text recognition unit is used for carrying out character error correction processing on at least one character to be recognized, determining a target character from the at least one character to be recognized after the character error correction processing, and recognizing the target character according to the semantics of the target character in the text to be recognized to obtain a recognition result; and the text display unit is used for displaying at least one character to be recognized and a recognition result.
In the method, a mode of identifying target characters according to the semantics of the target characters in a text to be identified is adopted, and finally, a display identification result is obtained, at least one character to be identified obtained after the text to be identified is scanned is obtained, and an initial identification result of character error correction is displayed on the at least one character to be identified, so that the target characters are determined from the at least one character to be identified in response to a selection instruction of the at least one character to be identified contained in the initial identification result, and the target characters are identified, so that the identification result is obtained.
In the process, the method and the device for identifying the text to be identified firstly scan the text to be identified to obtain the character to be identified, error correction is carried out on the character to obtain the target character, and the target character is identified again according to the semantic meaning of the target character in the text to be identified on the basis of the target character, so that the semantic error correction is carried out on the target character, the real intention of a user is predicted more accurately, a corresponding display identification result is given, the problem that in the prior art, the character identification accuracy is low due to the fact that the semantic identification cannot be carried out on the character in the text to be identified is solved, and the effect of improving the user experience is achieved.
Therefore, the scheme provided by the disclosure achieves the purpose of improving the accuracy of character recognition, so that the technical effect of improving the use experience of the user is achieved, and the problem of low accuracy of character recognition in the prior art is solved.
It should be understood that the description in this section is not intended to identify key or critical features of the embodiments of the disclosure, nor is it intended to be used to limit the scope of the disclosure. Other features of the present disclosure will become apparent from the following specification.
Drawings
The drawings are for a better understanding of the present solution and are not to be construed as limiting the present disclosure. Wherein:
FIG. 1 is a flow chart of a method of recognizing characters according to embodiment 1 of the present disclosure;
FIG. 2 is a flow chart of a method of recognizing characters according to embodiment 1 of the present disclosure;
FIG. 3 is a flow chart of a method of recognizing characters according to embodiment 1 of the present disclosure;
FIG. 4 is a flow chart of a method of recognizing characters according to embodiment 1 of the present disclosure;
FIG. 5 is a flow chart of a method of recognizing characters according to embodiment 1 of the present disclosure;
FIG. 6 is a flow chart of a method of recognizing characters according to embodiment 1 of the present disclosure;
FIG. 7 is a schematic diagram of a method of recognizing characters according to embodiment 1 of the present disclosure;
FIG. 8 is a schematic diagram of a method of recognizing characters according to embodiment 1 of the present disclosure;
FIG. 9 is a schematic diagram of a method of recognizing characters according to embodiment 1 of the present disclosure;
FIG. 10 is a schematic diagram of a method of recognizing characters according to embodiment 1 of the present disclosure;
FIG. 11 is a schematic diagram of a method of recognizing characters according to embodiment 1 of the present disclosure;
FIG. 12 is a schematic diagram of a method of recognizing characters according to embodiment 1 of the present disclosure;
FIG. 13 is a schematic diagram of a method of recognizing characters according to embodiment 1 of the present disclosure;
FIG. 14 is a schematic diagram of a method of recognizing characters according to embodiment 1 of the present disclosure;
FIG. 15 is a schematic diagram of a method of recognizing characters according to embodiment 1 of the present disclosure;
FIG. 16 is a schematic diagram of a method of recognizing characters according to embodiment 1 of the present disclosure;
FIG. 17 is a schematic diagram of a method of recognizing characters according to embodiment 1 of the present disclosure;
FIG. 18 is a schematic diagram of a method of recognizing characters according to embodiment 1 of the present disclosure;
FIG. 19 is a schematic diagram of a method of recognizing characters according to embodiment 1 of the present disclosure;
FIG. 20 is a schematic diagram of a predictive device for recognizing characters according to embodiment 2 of the present disclosure;
fig. 21 is a block diagram of an electronic device for implementing a method of recognizing characters according to an embodiment of the present disclosure.
Detailed Description
Exemplary embodiments of the present disclosure are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present disclosure to facilitate understanding, and should be considered as merely exemplary. Accordingly, one of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the present disclosure. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.
It should be noted that the terms "first," "second," and the like in the description and claims of the present disclosure and in the foregoing figures are used for distinguishing between similar objects and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used may be interchanged where appropriate such that the embodiments of the disclosure described herein may be capable of operation in sequences other than those illustrated or described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
Example 1
In accordance with the disclosed embodiments, there is provided an embodiment of a character recognition method, it being noted that the steps shown in the flowcharts of the figures may be performed in a computer system such as a set of computer executable instructions, and although a logical order is shown in the flowcharts, in some cases the steps shown or described may be performed in an order other than that shown or described herein.
In addition, it should be further noted that the method provided by the embodiment of the present disclosure may be applied to a text recognition device, where the text recognition device includes at least a text scanning unit and a text display unit.
Fig. 1 is a flowchart of a text recognition method according to an embodiment of the present disclosure, as shown in fig. 1, including the steps of:
step S102, obtaining at least one character to be recognized, which is obtained after scanning the text to be recognized.
In an alternative embodiment, the text to be recognized may be scanned by a text scanning device in the text recognition device, and at least one character to be recognized is obtained, where the text scanning device may be a miniature camera, and the text recognition device may be a dictionary pen, and the miniature camera is embedded on a pen point of the dictionary pen.
Optionally, as shown in fig. 2, a spring leg is mounted on the tip of the dictionary pen for fixing the tip and preventing shake. In addition, as shown in fig. 3, the pen point of the dictionary pen is further provided with a pen point alignment line for approving text content in the text recognition process. The user can use the dictionary pen to input the content to be queried or translated in a mode of point pressing, single-line scanning, multi-line scanning and the like, a corresponding content result is obtained, namely, the character to be recognized is read, after the character to be recognized is read, the dictionary pen can further check or perform corresponding operation on the character to be recognized, for example, the user clicks a pronunciation button on the dictionary pen to make a sentence pronounce, clicks a follow-reading button to enter follow-reading and the like, at this time, the process of reading the character to be recognized once can be called a user session control, further, no matter the user performs the point pressing operation, or the operations of single-line scanning, multi-line scanning and the like, and the input of the user session control represents the complete operation of inputting the text content which is wanted to be selected by the user.
In addition, after one user session control is generated, the dictionary pen system can buffer, and when the next session control is generated or the user clicks the entry such as the history record to enter the display interface of another session control, the previous session control buffer is cleaned. When the session control cache exists, searching or translating and other operations are performed, and the dictionary pen can read the session control content in the cache and display the session control content.
Step S104, displaying an initial recognition result of character error correction on at least one character to be recognized.
In an alternative embodiment, the user uses a dictionary pen to select the content desired to be queried/translated on the physical material in the modes of point press, single line scan, multi-line scan, etc., wherein the physical material includes but is not limited to: books, newspapers, magazines, computer screens, etc. When a user wants to translate a section of words on a book, a dictionary pen is used for pressing the pen on the book to start scanning the text, the alignment line of the pen point is aligned with the initial center line of the text to be recognized on the physical material, the spring support leg of the pen point is pressed down, the spring support leg of the pen point is kept pressed down and slides to the right, the center line of the text to be recognized is enabled to be aligned with the alignment line of the pen point in the sliding process, and after the pen point slides to the right side of the last word of the text to be recognized, the pen is lifted to start scanning and reading the text to be recognized until the reading is finished. Wherein a text display unit, e.g. a screen, is arranged on the dictionary pen. As shown in fig. 4, after the pen is started, a cursor in a starting state is displayed on the screen, and the cursor always keeps flashing during scanning. In addition, during the scanning process, the result during the scanning process will appear in the screen, and as shown in fig. 5, the cursor with the intermediate result will flash behind the last character.
Alternatively, the text recognition device may correct the character of the character to be recognized, and in the sliding process of the text device, may correct the character of the character to be recognized by OCR to obtain an initial recognition result, for example, in the recognition process, the word name is recognized as s0me (the letter o is recognized as the number 0), and correct english spelling name is corrected, so that the correct initial recognition result name is displayed on the text display unit.
It should be noted that, in step S104, character error correction is performed on the character to be recognized to obtain an initial recognition result, so that the problem of character recognition errors is effectively avoided, and the character recognition accuracy is improved.
Step S106, responding to a selection instruction of at least one character to be recognized contained in the initial recognition result, and determining a target character from the at least one character to be recognized.
In an alternative embodiment, the user may select the initial recognition result, the text recognition device responds to the selection instruction, where the initial recognition result has performed character correction on the characters to be recognized, the text recognition device may continue to select at least one character to be recognized included in the initial recognition result, the selecting process includes performing error correction processing on the at least one character to be recognized by the text recognition device through NLP (Neuro-Linguistic Programming, neuro-linguistic), and further includes actively performing split and other selections on the at least one character to be recognized by the user through the text recognition device, that is, performing reselection on the at least one character to be recognized included in the initial recognition result according to the intention of the predicted user, to obtain the target character, and displaying the target character in the text display unit.
Through the process, the character to be recognized is selected again on the basis of the initial recognition result, so that the target character is determined according to the predicted user intention, the real intention of the user is predicted more accurately, the corresponding display recognition result is given, and the effect of improving the character recognition accuracy is achieved.
Step S108, identifying the target character to obtain an identification result.
In an alternative embodiment, the text display unit may be used to display the above recognition result, where the text display unit may be a screen on the text recognition device, or may be another display device connected to the text recognition device, for example, a display screen with a projection function, where the target character is recognized according to the meaning of the target character in the text to be recognized, so as to obtain the recognition result, for example, when the target character is "white-sun-to-all", the target character is a paleoverse in the text to be recognized, from the meaning, the split recognition and interpretation of the target character are inaccurate, so that the embodiment of the disclosure will follow the meaning of the target character in the text to be recognized, interpret the target character as a sentence, and display the paraphrase.
In the process, the target character is identified again according to the semantic meaning of the target character in the text to be identified on the basis of the target character, so that the error correction is carried out on the target character according to the voice, and the effect of improving the accuracy of the corresponding paraphrasing of the character is realized.
Based on the above-mentioned schemes defined in steps S102 to S108, it can be known that, in the embodiment of the present disclosure, a manner of recognizing a character according to the semantic meaning of the character in the text to be recognized is adopted, by obtaining at least one character to be recognized obtained after scanning the text to be recognized, and displaying an initial recognition result of performing character error correction on the at least one character to be recognized in a text display unit, so as to determine a target character from the at least one character to be recognized in response to a selection instruction of the at least one character to be recognized included in the initial recognition result, and recognize the target character, thereby obtaining a recognition result.
It is easy to note that in the above process, the present disclosure scans the text to be recognized to obtain the character to be recognized, corrects the error of the character to obtain the target character, and recognizes the target character again based on the target character according to the semantic meaning of the target character in the text to be recognized, thereby correcting the semantic meaning of the target character, predicting the true intention of the user more accurately and providing the corresponding display recognition result, further solving the problem of low accuracy of character recognition in the prior art, and realizing the effect of improving the user experience.
Therefore, the scheme provided by the disclosure achieves the purpose of improving the accuracy of character recognition, so that the technical effect of improving the use experience of the user is achieved, and the problem of low accuracy of character recognition in the prior art is solved.
In an alternative embodiment, the text recognition device displays the at least one character on the text display unit after detecting that the text scanning unit scans the at least one character.
Alternatively, the text recognition device may detect the text by means of a single-line scan, for example, the user holds the dictionary pen right, aligns the pen point alignment with the beginning midline of the text to be recognized on the physical material, presses down the pen point spring leg, keeps the press and slides to the right, and the sliding process aligns the pen point with the text midline to be recognized, slides to the right of the last word of the text to be recognized, and lifts the pen. The user can inevitably shake hands to float up and down in the sliding process, and shake can be effectively avoided through the spring support legs on the pen head.
In the process, the text recognition device can display the characters on the text display unit, so that a user can intuitively see the characters recognized by the text recognition device, and the user can actively correct the characters, thereby being beneficial to improving the use experience of the user.
In an alternative embodiment, the text recognition device obtains at least one character to be recognized obtained after the text scanning unit scans the file to be recognized, wherein the text recognition device at least comprises a text scanning unit and a text display unit, and the text display unit is at least used for displaying an initial recognition result, a target character and a recognition result.
Alternatively, the text scanning unit may be a micro camera installed on the text recognition device, and the text display unit may be a display screen installed on the text recognition device or other display devices connected to the text recognition device, for example, a display screen with a projection function, through which the text is photographed and scanned.
Through the text scanning unit and the text display unit, the effect of intuitively displaying the initial recognition result, the target characters and the recognition result is realized.
In an alternative embodiment, the text recognition device obtains the semantics of the target character in the text to be recognized, and recognizes the target character according to the semantics to obtain a recognition result.
Optionally, the text recognition device may recognize the target character according to the semantic meaning of the target character in the text to be recognized, so as to obtain a recognition result, for example, when the target character is "white-sun-free", the target character is a paleoverse in the text to be recognized, and from the semantic point of view, the splitting recognition and interpretation of the target character are inaccurate.
In the process, the target character is identified again according to the semantic meaning of the target character in the text to be identified on the basis of the target character, so that the error correction is carried out on the target character according to the voice, and the effect of improving the accuracy of the corresponding paraphrasing of the character is realized.
In an alternative embodiment, the text recognition device obtains a distance between a scanning unit of the text recognition device and the text to be recognized; recording a first duration with a distance greater than a preset distance; when the first time length is smaller than a first preset time length, acquiring a first character scanned by the text recognition device before the first time length and a second character scanned after the first time length; continuing the first character and the second character to obtain at least one character to be recognized; at least one character to be recognized is displayed on the text display unit.
Optionally, the text recognition device may detect the text by using a multi-line scanning manner, as shown in fig. 6, after the text recognition device detects that the user scans and lifts the pen last time, a user session control is formed, so as to obtain a first character "wake" within a preset duration, for example, the user drops the pen again to scan the content within 2 seconds, so as to obtain a second character "away", then the recognition content and the last scanned content perform character continuation, as shown in fig. 7, so as to obtain at least one character "wake away" to be recognized, and display the character on the text display unit, where the situation of the multi-line text also determines whether to connect the characters according to the preset duration.
In addition, unlike the multi-line scanning, in the use process of the text recognition device, there is often a click operation, that is, the text scanning device on the text recognition device, for example, a camera, shoots the text to be recognized, the shooting range is the preset distance, as shown in fig. 8, the preset distance can be set to be 1.6 cm, and the camera can transmit the recognition result of all characters in the shooting range to the text recognition device.
It should be noted that, in the above process, the character to be recognized obtained after the multi-line scanning and the spot pressing operation can be displayed on the text display unit, the multi-line scanning can improve the character recognition efficiency, and the character can be intuitively displayed to the user, so that the effect of improving the text detection efficiency is achieved.
In an alternative embodiment, the text recognition device displays an initial recognition result obtained by natural language processing of at least one character to be recognized in the text display unit.
Alternatively, the initial recognition result displayed by the text recognition apparatus in the text display unit is obtained through natural language processing, which may be error correction processing through NLP (Neuro-Linguistic Programming, neuro-linguistic). For example, when the text recognition device gives the character to be recognized to the NLP, the user's intention may be predicted and located twice, and the first presentation that the user appears after lifting the pen in one user session is the predicted and located of the first intention of the user by the NLP. The user clicks the label or text word segmentation content in one user session, and then the NLP automatically locates the secondary intention of the user.
Alternatively, as shown in fig. 9, the character recognition device may add the "dot difference" identifier to all the characters to be recognized, so that the content of the characters to be recognized may be referred to, for example, to a dictionary to inquire the characters to be recognized. The specific process is shown in fig. 10, the character recognition device can select the letters of the query result and sort the letters by calculating the length of the letters, if no result exists, the method directly performs explanation, sorts and places the most probable explanation on the forefront default highlight, and the guessing process of the user intention can make guess according to the word frequency system. In addition, the interpreted results should all be translatable in a dictionary or in a translation software development kit. Ordering according to word frequency. If the condition that the page cannot be found or translated out is not found, a spam page as shown in FIG. 11 is found.
Alternatively, as shown in fig. 12, the text recognition device may autonomously locate the secondary intention of the user through the NLP, for example, when the character to be recognized is "take off", the NLP queries the dictionary, and the character can be directly found, and then the character and the paraphrase in the dictionary are displayed, that is, the explanation, and at this time, the user clicks the text area again, the text is segmented into take/off, and the label of "word" is added in front. When the character to be recognized is s family d, the NLP is obtained by searching in a dictionary, s, family and d can be found, the default family is in a highlighting mode (because the word frequency is highest), at the moment, the user clicks the text area again, the text is segmented into s/family/d, the user clicks the label of the sentence, and the whole sentence can be translated, so that the NLP can automatically position the secondary intention of the user, for example, whether the user is to translate the whole sentence or the word is judged.
It is noted that, in the above process, natural language processing is performed on at least one character to be recognized, so that the real intention of the user is predicted more accurately, and a corresponding display recognition result is given, thereby solving the problem of low character recognition accuracy caused by incapability of performing semantic recognition on characters in the text to be recognized in the prior art, and realizing the effect of improving the use experience of the user.
In an alternative embodiment, the text recognition device displays prompt information on the text display unit when the text scanning unit scans abnormally before reading at least one character to be recognized obtained after the text scanning unit scans the text to be recognized; and when the display time of the prompt information is longer than the second preset time, hiding the prompt information in the text display unit.
Alternatively, when the text recognition device scans the text scanning unit abnormally, a prompt message is displayed on the text display unit, for example, as shown in fig. 13, if the user triggers the pressing and lifting of the pen point spring support leg, but the dictionary pen does not recognize any content, the user can enter the input guide chart and pop up the prompt message "not recognized, please retry. Hiding prompt information after lasting a preset time, wherein the preset time can be set to be 4 seconds.
In addition, the above-mentioned input guidance chart may guide the user to use the text recognition device, for example, when the user first queries to translate the text to be recognized without any user session control buffer, as shown in fig. 14, how the user performs operations to input, specifically including: and returning control labeling, schematic drawing labeling and guiding the text to align the alignment line of the pen point with the center line of the text and scan the text as far as possible perpendicular to the paper surface.
Through the process, the fault reasons can be intuitively displayed for the user, and the user can be guided in an operation manner, so that the use experience of the user can be improved.
In an alternative embodiment, the text recognition device detects a character type corresponding to at least one character to be recognized before determining a target character from the at least one character to be recognized in response to a selection instruction of the at least one character to be recognized included in the initial recognition result; and responding to a splitting instruction of at least one character to be recognized, and displaying at least one splitting character after splitting operation is performed on the at least one character to be recognized according to the character type in a text display unit.
Optionally, the character recognition device may detect a character type of the character to be recognized, and respond to a splitting instruction for recognizing the character, and display at least one split character after splitting the at least one character to be recognized according to the character type on the text display unit.
Optionally, as shown in fig. 15, the character to be recognized may be split by using a node word segmentation tree policy, where after performing NLP on the OCR result scanned by the user, the node word segmentation tree performs infinite downward splitting and searchable policy logic based on the local dictionary as shown in fig. 15, and splits a sentence splitting unit "I need the report as soon as possible", where the user clicks the sentence, the sentence is split into a plurality of phrases "I", "need", "report", "as soon as possible", and the user continues clicking, where the phrase is split into a plurality of minimum units (words). Therefore, translation from a sentence to each phrase and then to a word is realized, and an infinite splitting experience is provided for a user.
In addition, the text recognition device also supports line feed logic, that is, before the user performs the operation of splitting the character to be recognized, the character to be recognized is displayed in a plurality of lines, and after the user clicks to split the character to be recognized, the character to be recognized is changed from a plurality of lines to one line, as shown in fig. 18, so that the left-right sliding view can be supported.
Through the process, the character recognition equipment can split the character to be recognized, so that the character recognition equipment can split a semantic unit into a plurality of minimum units, search and translate sentences, phrases and words in the splitting process, and the character query speed is improved.
In an alternative embodiment, the text recognition device performs the splitting operation on at least one character to be recognized according to any one or more of the following manners; splitting at least one character to be recognized based on characters contained in a preset word stock; splitting at least one character to be recognized according to the number of units contained in each character to be recognized, wherein each character to be recognized consists of at least one unit; and splitting the at least one character to be recognized according to the word frequency of the at least one character to be recognized.
Optionally, the preset word stock may be a chinese word stock, including but not limited to: chinese-Chinese dictionary and simple Chinese-English dictionary; english vocabulary entry word libraries are also possible, including but not limited to: oxford dictionary and simple English-Chinese dictionary
The text recognition device performs splitting operation on at least one character to be recognized based on characters contained in a preset word stock, namely, a dictionary entry in the preset word stock is prioritized, namely, a split unit can be searched in a local dictionary; the text recognition equipment performs splitting operation on at least one character to be recognized according to the number of units contained in each character to be recognized, wherein the character to be recognized can be split from more to less according to the contained minimum number of units, and the character to be recognized is preferentially split into more words; the text recognition device performs splitting operation on at least one character to be recognized according to the word frequency of the at least one character to be recognized, and the word frequency can be split according to the principle that the word frequency is high to low.
Alternatively, as shown in fig. 16, a splitting unit may be a semantic unit that may be split by the node word segmentation policy, for example, a sentence "i want to design drawing" in fig. 16. A minimum unit may be a Chinese character, such as "I", "want" in FIG. 16, or a continuous letter string with spaces before and after English, such as the scan identification content is sfomily d, and then three English minimum units, s/family/d. The splitting mode can be backward recursion splitting, namely multi-layer splitting is performed from back to front until the splitting is completed to the minimum unit. The character recognition equipment can make splitting trees for characters to be recognized (including punctuation) output by OCR and NLP through a node word segmentation strategy, so that the characters to be recognized are split according to the principle that word frequency is high to low.
Through the process, the character recognition device can infinitely split the minimum unit when the user performs secondary intention positioning, so that the effect of improving the character query speed is achieved.
In an alternative embodiment, the text recognition device detects whether the semantics corresponding to the target characters exist in a preset word stock; when the semantics corresponding to the target characters do not exist in the preset word stock, performing character adding operation and/or character deleting operation on the target characters to obtain first target characters; inquiring the semantics corresponding to the first target character in a preset word stock, and displaying the semantics corresponding to the first target character in a text display unit.
Optionally, when the text recognition device detects whether the semantics corresponding to the target character exist in the preset word stock and the semantics corresponding to the target character do not exist in the preset word stock, performing character adding operation and/or character deleting operation on the target character to obtain the first target character, for example, when the target character is 'hoeing day time', the text recognition device cannot detect the corresponding semantics in the preset word stock, and can automatically supplement the added character to be 'hoeing day time noon', namely the first target character. If the target character is "white-sun-mountain-full" as shown in fig. 12, the target character can be directly inquired in the poetry dictionary, and the corresponding semantics are directly displayed in the text display unit.
Through the above process, when the semantics corresponding to the target characters do not exist in the preset word stock, the character adding operation and/or the character deleting operation are performed on the target characters, so that the fault tolerance of the user can be improved, and when the situation that the characters are lost or irrelevant characters are added occurs, the character recognition equipment can automatically perform character adding or deleting, so that the fault tolerance of the user can be improved, and the effect of improving the accuracy of the correct matching semantics of the character recognition is realized.
In an alternative embodiment, the text recognition device detects whether the semantics corresponding to the target characters exist in a preset word stock; when the semantics corresponding to the target characters exist in the preset word stock, highlighting the semantics corresponding to the target characters in the text display unit; performing character adding operation and/or character deleting operation on the target characters to obtain second target characters; inquiring the semantics corresponding to the second target character in a preset word stock, and displaying the second target character and the semantics corresponding to the second target character in a text display unit.
Optionally, the highlighting of the semantics corresponding to the target character in the text display unit may be highlighting of the voice, where the preset word stock includes but is not limited to: a chinese-to-chinese dictionary, a chinese-to-english dictionary, an english-to-english dictionary, an chinese-to-english translation online software development kit, and an chinese-to-english translation offline software development kit, wherein the various pre-set word libraries may be ordered, e.g., when a target character hits multiple dictionaries, the results are presented in the following order: in the Chinese dictionary, when a target character hits a poem text or a title, the poem dictionary is interpreted as a first place, then the Chinese-Chinese dictionary is executed, and finally the Chinese-English dictionary is executed; the English dictionary is characterized in that the English dictionary is the first English-Chinese dictionary, and then the English-English dictionary. In addition, the Chinese-English translation online software development kit is invoked when the text recognition device is online, and the Chinese-English translation offline software development kit is invoked when the text recognition device is offline.
The text recognition equipment to be described can perform character adding operation and/or character deleting operation on the target characters to obtain second characters, and further inquire the semantics corresponding to the second target characters, so that the fault tolerance of users can be improved, and the effect of improving the accuracy of correct matching semantics of character recognition is achieved.
In an alternative embodiment, the text recognition device plays the target character and/or the recognition result of the target character in response to a touch operation of a voice playing unit in the text recognition device.
Optionally, the user may click on a pronunciation button of the text recognition device to make the target character and/or the recognition result of the target character pronounce, and click on a follow-up button to enter follow-up, where the text recognition device may also pronounce automatically, and the text recognition device recalls the content presentation directly after a user session control operation, and whether the process pronounces depends on whether the user sets automatic pronunciation in the text recognition device. Wherein, the automatic pronunciation can pronounce the content language input by the user. When the input content is English, even if the user sets English pronunciation, the English pronunciation can be used when the preset dictionary has English real person audio recall result, if no English real person audio exists, off-line TTS (Text To Speech) is used for American pronunciation. Wherein the pronunciation buttons include, but are not limited to: pinyin button, english/american button, and original/translated button
Through the above process, the text recognition device can play the target character and/or the recognition result of the target character through the voice playing unit, and in some special scenes, for example, vision disorder personnel acquire information, or read the content in the book for infants, the text recognition device can be used for completing the conversion from the character to the voice, so that the applicability of the text recognition device is improved.
Fig. 19 shows a flowchart of a character recognition method according to embodiment 1 of the present disclosure, to further explain the overall operation flow of the embodiment of the present disclosure, specifically as follows:
optionally, when the text recognition device processes a user session control, first performing an input operation of recognizing a user and contents of a text to be recognized, where the input operation of the user includes: the method comprises the steps of point pressing operation, single-line scanning and multi-line scanning, and further, the content of the text to be recognized is recognized through OCR by utilizing the processing strategies corresponding to the various operations. Then, the text recognition equipment carries out NLP error correction on the character to be recognized, and comprises the steps of predicting the user intention, firstly predicting and positioning the user intention through NLP, carrying out default highlighting on the predicted character to be recognized, then carrying out endowing parameter values on the character to be recognized, adding a 'spot check' tag, listening to the 'spot check' tag to interpret the character to be recognized to obtain a corresponding interpretation, adding tags such as 'sentence', 'word' and the like on the character to be recognized, and carrying out splitting operation on the character to be recognized by the user to carry out secondary intention confirmation, wherein the character to be recognized can be actively highlighted according to the selection of the user during the secondary intention confirmation.
Further, the character recognition device performs calling and recall operations on the characters, wherein the calling refers to the action that the character recognition device queries a local or cloud preset dictionary through the target characters to obtain corresponding paraphrases or translation results in the preset dictionary. And recalling the action of returning the local or cloud preset dictionary to the word recognition equipment query result. The preset dictionary can be a Chinese-Chinese dictionary, a Chinese-English dictionary, an English-Chinese dictionary, an on-line software development kit for Chinese-English translation and an off-line software development kit for Chinese-English translation, and sets up priorities for different preset dictionaries to display the same, and the method comprises the following steps: text presentation, audio playback, TTS playback, sentence follow-up, collection, and the like. The text recognition device is provided with corresponding buttons for the presentation function.
Through the above process, the scheme provided by the embodiment of the disclosure achieves the purpose of improving the accuracy of character recognition, thereby realizing the technical effect of improving the user experience, and further solving the problem of low accuracy of character recognition in the prior art.
Example 2
There is further provided, in accordance with an embodiment of the present disclosure, an apparatus embodiment for recognizing a character, wherein fig. 20 is a schematic diagram of an apparatus for recognizing a character according to embodiment 2 of the present disclosure, the apparatus including: an acquiring module 2001, configured to acquire at least one character to be recognized obtained after scanning a text to be recognized; a display module 2003 for displaying an initial recognition result of performing character error correction on at least one character to be recognized; a response module 2005 for determining a target character from the at least one character to be recognized in response to a selection instruction of the at least one character to be recognized included in the initial recognition result; the recognition module 2007 is configured to recognize a target character to obtain a recognition result.
It should be noted that the above-mentioned obtaining module 2001, display module 2003, response module 2005 and identification module 2007 correspond to steps S102 to S108 in the above-mentioned embodiment, and the four modules are the same as examples and application scenarios implemented by the corresponding steps, but are not limited to those disclosed in the above-mentioned embodiment 1.
Optionally, the device for recognizing characters further includes: the detection module is used for: after detecting that the text scanning unit scans at least one character, the at least one character is displayed on the text display unit.
Optionally, the device for recognizing characters further includes: the device comprises a first acquisition module, a second acquisition module, a connection module and a first display module, wherein the first acquisition module is used for acquiring the distance between a scanning unit of text recognition equipment and a text to be recognized; the second acquisition module is used for recording a first duration with a distance larger than a preset distance; the continuing module is used for acquiring a first character scanned by the text recognition device before the first time length and a second character scanned after the first time length when the first time length is smaller than the first preset time length; the first display module is used for carrying out continuous processing on the first character and the second character to obtain at least one character to be identified; at least one character to be recognized is displayed on the text display unit.
Optionally, the display module further includes: and the processing module is used for displaying an initial recognition result obtained by performing natural language processing on at least one character to be recognized in the text display unit.
Optionally, the device for recognizing characters further includes: the first display module and the first processing module. The first display module is used for displaying prompt information on the text display unit when the text scanning unit scans abnormally before at least one character to be recognized, which is obtained after the text scanning unit scans the text to be recognized, is read; the first processing module is used for hiding the prompt information at the text display unit when the display time of the prompt information is longer than the second preset time.
Optionally, the device for recognizing characters further includes: the device comprises a first detection module and a first response module, wherein the first detection module is used for detecting a character type corresponding to at least one character to be identified before determining a target character from the at least one character to be identified in response to a selection instruction of the at least one character to be identified contained in the initial identification result; the first response module is used for responding to the splitting instruction of the at least one character to be recognized, and displaying at least one splitting character after splitting operation is performed on the at least one character to be recognized according to the character type on the text display unit.
Optionally, the device for recognizing characters further includes: the device comprises a first splitting module, a second splitting module and a third splitting module. The first splitting module is used for splitting at least one character to be recognized based on characters contained in a preset word stock; the second splitting module is used for splitting at least one character to be recognized according to the number of units contained in each character to be recognized, wherein each character to be recognized consists of at least one unit; and the third splitting module is used for splitting the at least one character to be recognized according to the word frequency of the at least one character to be recognized.
Optionally, the identification module further includes: the system comprises a second detection module, a first adding and deleting module and a query module. The second detection module is used for detecting whether the semantics corresponding to the target characters exist in the preset word stock or not; when the semantics corresponding to the target characters do not exist in the preset word stock, performing character adding operation and/or character deleting operation on the target characters to obtain first target characters; the query module is used for querying the semantics corresponding to the first target character in the preset word stock and displaying the semantics corresponding to the first target character in the text display unit.
Optionally, the identification module further includes: the system comprises a third detection module, a second adding and deleting module, a second display module and a first query module. The third detection module is used for detecting whether the semantics corresponding to the target characters exist in the preset word stock or not; the second display module is used for highlighting the semantics corresponding to the target characters in the text display unit when the semantics corresponding to the target characters exist in the preset word stock; the second adding and deleting module performs character adding operation and/or character deleting operation on the target characters to obtain second target characters; the first query module is used for querying the semantics corresponding to the second target character in the preset word stock and displaying the second target character and the semantics corresponding to the second target character in the text display unit.
Optionally, the device for recognizing characters further includes: and the second response module is used for responding to the touch operation of the voice playing unit in the text recognition equipment and playing the target characters and/or the recognition results of the target characters.
Optionally, the acquiring module further includes: the third acquisition module is used for acquiring at least one character to be recognized, which is obtained after the text scanning unit scans the file to be recognized, wherein the text recognition equipment at least comprises a text scanning unit and a text display unit, and the text display unit is at least used for displaying an initial recognition result, a target character and a recognition result.
Optionally, the display module further includes: and the fourth acquisition module and the first identification module. The fourth acquisition module is used for acquiring the semantics of the target character in the text to be identified; the first recognition module is used for recognizing the target character according to the semantics to obtain a recognition result.
Example 3
According to another aspect of the embodiments of the present disclosure, there is also provided an electronic device including: at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of recognizing characters of embodiment 1 described above.
Example 4
According to another aspect of the embodiments of the present disclosure, there is also provided a non-transitory computer-readable storage medium storing computer instructions for causing a computer to execute the method of recognizing characters in embodiment 1 described above.
Example 5
According to another aspect of the disclosed embodiments, there is also provided a computer program product comprising a computer program which, when executed by a processor, implements the method of recognizing characters in embodiment 1 described above.
In the technical scheme of the disclosure, the acquisition, storage, application and the like of the related user personal information all conform to the regulations of related laws and regulations, and the public sequence is not violated.
Fig. 21 shows a schematic block diagram of an example electronic device 800 that may be used to implement embodiments of the present disclosure. Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other appropriate computers. The electronic device may also represent various forms of mobile devices, such as personal digital processing, cellular telephones, smartphones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions, are meant to be exemplary only, and are not meant to limit implementations of the disclosure described and/or claimed herein.
As shown in fig. 21, the apparatus 800 includes a computing unit 801 that can perform various appropriate actions and processes according to a computer program stored in a Read Only Memory (ROM) 802 or a computer program loaded from a storage unit 808 into a Random Access Memory (RAM) 803. In the RAM 803, various programs and data required for the operation of the device 800 can also be stored. The computing unit 801, the ROM 802, and the RAM 803 are connected to each other by a bus 804. An input/output (I/O) interface 805 is also connected to the bus 804.
Various components in device 800 are connected to I/O interface 805, including: an input unit 806 such as a keyboard, mouse, etc.; an output unit 807 such as various types of displays, speakers, and the like; a storage unit 808, such as a magnetic disk, optical disk, etc.; and a communication unit 809, such as a network card, modem, wireless communication transceiver, or the like. The communication unit 809 allows the device 800 to exchange information/data with other devices via a computer network such as the internet and/or various telecommunication networks.
The computing unit 801 may be a variety of general and/or special purpose processing components having processing and computing capabilities. Some examples of computing unit 801 include, but are not limited to, a Central Processing Unit (CPU), a Graphics Processing Unit (GPU), various specialized Artificial Intelligence (AI) computing chips, various computing units running machine learning model algorithms, a Digital Signal Processor (DSP), and any suitable processor, controller, microcontroller, etc. The calculation unit 801 performs the respective methods and processes described above, for example, a prediction method of a vehicle trajectory. For example, in some embodiments, the method of predicting vehicle trajectories may be implemented as a computer software program tangibly embodied on a machine-readable medium, such as the storage unit 808. In some embodiments, part or all of the computer program may be loaded and/or installed onto device 800 via ROM 802 and/or communication unit 809. When the computer program is loaded into the RAM 803 and executed by the computing unit 801, one or more steps of the above-described vehicle trajectory prediction method may be performed. Alternatively, in other embodiments, the computing unit 801 may be configured to perform the method of predicting the vehicle trajectory in any other suitable way (e.g., by means of firmware).
Various implementations of the systems and techniques described here above may be implemented in digital electronic circuitry, integrated circuit systems, field Programmable Gate Arrays (FPGAs), application Specific Integrated Circuits (ASICs), application Specific Standard Products (ASSPs), systems On Chip (SOCs), load programmable logic devices (CPLDs), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may include: implemented in one or more computer programs, the one or more computer programs may be executed and/or interpreted on a programmable system including at least one programmable processor, which may be a special purpose or general-purpose programmable processor, that may receive data and instructions from, and transmit data and instructions to, a storage system, at least one input device, and at least one output device.
Program code for carrying out methods of the present disclosure may be written in any combination of one or more programming languages. These program code may be provided to a processor or controller of a general purpose computer, special purpose computer, or other programmable data processing apparatus such that the program code, when executed by the processor or controller, causes the functions/operations specified in the flowchart and/or block diagram to be implemented. The program code may execute entirely on the machine, partly on the machine, as a stand-alone software package, partly on the machine and partly on a remote machine or entirely on the remote machine or server.
In the context of this disclosure, a machine-readable medium may be a tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. The machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. The machine-readable medium may include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples of a machine-readable storage medium would include an electrical connection based on one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
To provide for interaction with a user, the systems and techniques described here can be implemented on a computer having: a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to a user; and a keyboard and pointing device (e.g., a mouse or trackball) by which a user can provide input to the computer. Other kinds of devices may also be used to provide for interaction with a user; for example, feedback provided to the user may be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and input from the user may be received in any form, including acoustic input, speech input, or tactile input.
The systems and techniques described here can be implemented in a computing system that includes a background component (e.g., as a data server), or that includes a middleware component (e.g., an application server), or that includes a front-end component (e.g., a user computer having a graphical user interface or a web browser through which a user can interact with an implementation of the systems and techniques described here), or any combination of such background, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include: local Area Networks (LANs), wide Area Networks (WANs), and the internet.
The computer system may include a client and a server. The client and server are typically remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other. The server may be a cloud server, a server of a distributed system, or a server incorporating a blockchain.
Example 6
According to another aspect of the embodiments of the present disclosure, there is also provided a text recognition apparatus including: the text scanning unit is used for scanning the text to be recognized to obtain at least one character to be recognized; the text recognition unit is used for carrying out character error correction processing on at least one character to be recognized, determining a target character from the at least one character to be recognized after the character error correction processing, and recognizing the target character according to the semantics of the target character in the text to be recognized to obtain a recognition result; and the text display unit is used for displaying at least one character to be recognized and a recognition result.
It should be appreciated that various forms of the flows shown above may be used to reorder, add, or delete steps. For example, the steps recited in the present disclosure may be performed in parallel or sequentially or in a different order, provided that the desired results of the technical solutions of the present disclosure are achieved, and are not limited herein.
The above detailed description should not be taken as limiting the scope of the present disclosure. It will be apparent to those skilled in the art that various modifications, combinations, sub-combinations and alternatives are possible, depending on design requirements and other factors. Any modifications, equivalent substitutions and improvements made within the spirit and principles of the present disclosure are intended to be included within the scope of the present disclosure.

Claims (15)

1. A method of recognizing characters, the method comprising:
acquiring at least one character to be recognized, which is obtained after scanning a text to be recognized;
displaying an initial recognition result of character error correction on the at least one character to be recognized;
responding to a selection instruction of at least one character to be recognized contained in the initial recognition result, and determining a target character from the at least one character to be recognized;
identifying the target character to obtain the identification result;
wherein the method further comprises:
the method comprises the steps that the distance between a text scanning unit of text recognition equipment and a text to be recognized is obtained, wherein the text recognition equipment at least comprises the text scanning unit and a text display unit, the text scanning unit is at least used for detecting the text to be recognized in a multi-line scanning mode, and the text display unit is at least used for displaying an initial recognition result, target characters and the recognition result;
recording a first duration that the distance is greater than a preset distance;
when the first time length is smaller than a first preset time length, acquiring a first character scanned by the text recognition device before the first time length and a second character scanned after the first time length;
Continuing the first character and the second character to obtain at least one character to be recognized;
and displaying the at least one character to be recognized on the text display unit.
2. The method of claim 1, obtaining at least one character to be recognized obtained after scanning text to be recognized, comprising: and acquiring at least one character to be recognized, which is obtained after the text to be recognized is scanned by the text scanning unit.
3. The method of claim 1, identifying the target character to obtain the identification result, comprising:
acquiring the semantics of the target character in the text to be recognized;
and identifying the target character according to the semantics to obtain the identification result.
4. The method of claim 2, wherein the displaying the initial recognition result of character error correction for the at least one character to be recognized comprises:
and displaying an initial recognition result obtained by performing natural language processing on the at least one character to be recognized in the text display unit.
5. The method of claim 2, further comprising, prior to obtaining the at least one character to be recognized obtained after scanning the text to be recognized:
When the text scanning unit scans abnormality, prompt information is displayed on the text display unit;
and when the display time of the prompt information is longer than the second preset time, hiding the prompt information in the text display unit.
6. The method of claim 2, further comprising, prior to determining a target character from at least one character to be recognized contained in the initial recognition result in response to a selection instruction for the at least one character to be recognized:
detecting a character type corresponding to the at least one character to be recognized;
and responding to a splitting instruction of the at least one character to be recognized, and displaying at least one splitting character after splitting the at least one character to be recognized according to the character type on the text display unit.
7. The method of claim 6, the method further comprising:
splitting the at least one character to be recognized according to any one or more modes of the following;
splitting the at least one character to be recognized based on characters contained in a preset word stock;
splitting the at least one character to be recognized according to the number of units contained in each character to be recognized, wherein each character to be recognized consists of at least one unit;
And splitting the at least one character to be recognized according to the word frequency of the at least one character to be recognized.
8. The method of claim 2, wherein the identifying the target character to obtain the identification result includes:
detecting whether the semantics corresponding to the target characters exist in a preset word stock or not;
when the semantics corresponding to the target characters do not exist in the preset word stock, performing character adding operation and/or character deleting operation on the target characters to obtain first target characters;
inquiring the semantics corresponding to the first target character in the preset word stock, and displaying the semantics corresponding to the first target character in the text display unit.
9. The method of claim 2, wherein the identifying the target character to obtain the identification result includes:
detecting whether the semantics corresponding to the target characters exist in a preset word stock or not;
when the semantics corresponding to the target characters exist in the preset word stock, highlighting the semantics corresponding to the target characters in the text display unit;
performing character adding operation and/or character deleting operation on the target characters to obtain second target characters;
Inquiring the semantics corresponding to the second target character in the preset word stock, and displaying the second target character and the semantics corresponding to the second target character in the text display unit.
10. The method of claim 2, after the identifying the target character, the method further comprising:
and playing the target character and/or the recognition result of the target character in response to the touch operation of a voice playing unit in the text recognition device.
11. The method of claim 2, the method further comprising:
after detecting that the text scanning unit scans at least one character, the at least one character is displayed on the text display unit.
12. An apparatus for recognizing characters, the apparatus comprising:
the acquisition module is used for acquiring at least one character to be identified, which is obtained after the text to be identified is scanned;
the display module is used for displaying an initial recognition result of character error correction on the at least one character to be recognized;
the response module is used for responding to a selection instruction of at least one character to be recognized contained in the initial recognition result, and determining a target character from the at least one character to be recognized;
The recognition module is used for recognizing the target character to obtain the recognition result;
the acquisition module is further used for acquiring the distance between a text scanning unit of the text recognition device and the text to be recognized, wherein the text recognition device at least comprises the text scanning unit and a text display unit, the text scanning unit is at least used for detecting the text to be recognized in a multi-line scanning mode, and the text display unit is at least used for displaying the initial recognition result, the target characters and the recognition result;
the apparatus further comprises: the recording module is used for recording a first duration of which the distance is larger than a preset distance;
the acquisition module is further configured to acquire a first character scanned by the text recognition device before the first time period and a second character scanned after the first time period when the first time period is less than a first preset time period;
the apparatus further comprises: the processing module is used for carrying out continuous processing on the first character and the second character to obtain the at least one character to be recognized;
the display module is further used for displaying the at least one character to be recognized on the text display unit.
13. An electronic device, comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein, the liquid crystal display device comprises a liquid crystal display device,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of recognizing characters of any one of claims 1 to 11.
14. A non-transitory computer-readable storage medium storing computer instructions for causing the computer to perform the method of recognizing characters according to any one of claims 1 to 11.
15. A text recognition device, comprising:
the text scanning unit is used for scanning the text to be recognized to obtain at least one character to be recognized;
the text recognition unit is used for carrying out character error correction processing on the at least one character to be recognized, determining a target character from the at least one character to be recognized after the character error correction processing, and recognizing the target character according to the semantic meaning of the target character in the text to be recognized to obtain a recognition result;
a text display unit, configured to display the at least one character to be recognized and the recognition result;
Wherein, the text scanning unit is further used for:
acquiring the distance between a text scanning unit of the text recognition device and the text to be recognized, wherein the text scanning unit is at least used for detecting the text to be recognized in a multi-line scanning mode;
recording a first duration that the distance is greater than a preset distance;
when the first time length is smaller than a first preset time length, acquiring a first character scanned by the text recognition device before the first time length and a second character scanned after the first time length;
and continuing the first character and the second character to obtain the at least one character to be recognized.
CN202110950635.8A 2021-08-18 2021-08-18 Method and device for recognizing characters and electronic equipment Active CN113743102B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110950635.8A CN113743102B (en) 2021-08-18 2021-08-18 Method and device for recognizing characters and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110950635.8A CN113743102B (en) 2021-08-18 2021-08-18 Method and device for recognizing characters and electronic equipment

Publications (2)

Publication Number Publication Date
CN113743102A CN113743102A (en) 2021-12-03
CN113743102B true CN113743102B (en) 2023-09-01

Family

ID=78731689

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110950635.8A Active CN113743102B (en) 2021-08-18 2021-08-18 Method and device for recognizing characters and electronic equipment

Country Status (1)

Country Link
CN (1) CN113743102B (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102982326A (en) * 2011-09-02 2013-03-20 汉王科技股份有限公司 A method and a device for word processing and an electronic translation pen
CN103019834A (en) * 2011-09-22 2013-04-03 汉王科技股份有限公司 Scanning mode switching method based on entry equipment and entry equipment
CN105335356A (en) * 2015-10-28 2016-02-17 阿坝师范学院 Semantic recognition-oriented paper translation method and translation pen device
CN109657738A (en) * 2018-10-25 2019-04-19 平安科技(深圳)有限公司 Character identifying method, device, equipment and storage medium
CN109711412A (en) * 2018-12-27 2019-05-03 信雅达系统工程股份有限公司 A kind of optical character identification error correction method based on dictionary
CN111368918A (en) * 2020-03-04 2020-07-03 拉扎斯网络科技(上海)有限公司 Text error correction method and device, electronic equipment and storage medium
CN112085011A (en) * 2020-09-27 2020-12-15 中国建设银行股份有限公司 OCR recognition result error correction method, device and storage medium
CN112905026A (en) * 2021-03-30 2021-06-04 完美世界控股集团有限公司 Method, device, storage medium and computer equipment for displaying word suggestions

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060081714A1 (en) * 2004-08-23 2006-04-20 King Martin T Portable scanning device
US20170177189A1 (en) * 2015-12-19 2017-06-22 Radean T. Anvari Method and System for Capturing Data on Display Using Scanner Pen

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102982326A (en) * 2011-09-02 2013-03-20 汉王科技股份有限公司 A method and a device for word processing and an electronic translation pen
CN103019834A (en) * 2011-09-22 2013-04-03 汉王科技股份有限公司 Scanning mode switching method based on entry equipment and entry equipment
CN105335356A (en) * 2015-10-28 2016-02-17 阿坝师范学院 Semantic recognition-oriented paper translation method and translation pen device
CN109657738A (en) * 2018-10-25 2019-04-19 平安科技(深圳)有限公司 Character identifying method, device, equipment and storage medium
CN109711412A (en) * 2018-12-27 2019-05-03 信雅达系统工程股份有限公司 A kind of optical character identification error correction method based on dictionary
CN111368918A (en) * 2020-03-04 2020-07-03 拉扎斯网络科技(上海)有限公司 Text error correction method and device, electronic equipment and storage medium
CN112085011A (en) * 2020-09-27 2020-12-15 中国建设银行股份有限公司 OCR recognition result error correction method, device and storage medium
CN112905026A (en) * 2021-03-30 2021-06-04 完美世界控股集团有限公司 Method, device, storage medium and computer equipment for displaying word suggestions

Also Published As

Publication number Publication date
CN113743102A (en) 2021-12-03

Similar Documents

Publication Publication Date Title
US8504350B2 (en) User-interactive automatic translation device and method for mobile device
US9484034B2 (en) Voice conversation support apparatus, voice conversation support method, and computer readable medium
US20150199341A1 (en) Speech translation apparatus, method and program
CN109522564B (en) Voice translation method and device
US20080077393A1 (en) Virtual keyboard adaptation for multilingual input
US20090326938A1 (en) Multiword text correction
US11907671B2 (en) Role labeling method, electronic device and storage medium
EP2102761A1 (en) Web-based collocation error proofing
US10915697B1 (en) Computer-implemented presentation of synonyms based on syntactic dependency
CN101044494A (en) An electronic device and method for visual text interpretation
US11640503B2 (en) Input method, input device and apparatus for input
US20190362713A1 (en) Dynamic extraction of contextually-coherent text blocks
CN112528681A (en) Cross-language retrieval and model training method, device, equipment and storage medium
US10025772B2 (en) Information processing apparatus, information processing method, and program
CN113626441A (en) Text management method, device and equipment based on scanning equipment and storage medium
CN107797676B (en) Single character input method and device
CN113743102B (en) Method and device for recognizing characters and electronic equipment
CN113268981B (en) Information processing method and device and electronic equipment
CN111475129A (en) Method and equipment for displaying candidate homophones through voice recognition
CN114090885B (en) Product title core word extraction method, related device and computer program product
CN112837398A (en) Text annotation method and device, electronic equipment and storage medium
CN106708797B (en) Word processing method and device
CN114462364B (en) Method and device for inputting information
CN113642559A (en) Text acquisition method, device and equipment based on scanning equipment and storage medium
CN107239441B (en) Dictionary paraphrasing method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant