CN115562505A - Method, device, equipment, storage medium and product for determining calibration characters - Google Patents

Method, device, equipment, storage medium and product for determining calibration characters Download PDF

Info

Publication number
CN115562505A
CN115562505A CN202211231864.5A CN202211231864A CN115562505A CN 115562505 A CN115562505 A CN 115562505A CN 202211231864 A CN202211231864 A CN 202211231864A CN 115562505 A CN115562505 A CN 115562505A
Authority
CN
China
Prior art keywords
input
characters
character
standard
pinyin
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211231864.5A
Other languages
Chinese (zh)
Inventor
龚孟旭
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Bank of China Ltd
Original Assignee
Bank of China Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Bank of China Ltd filed Critical Bank of China Ltd
Priority to CN202211231864.5A priority Critical patent/CN115562505A/en
Publication of CN115562505A publication Critical patent/CN115562505A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/02Input arrangements using manually operated switches, e.g. using keyboards or dials
    • G06F3/023Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
    • G06F3/0233Character input methods
    • G06F3/0237Character input methods using prediction or retrieval techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/232Orthographic correction, e.g. spell checking or vowelisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Document Processing Apparatus (AREA)

Abstract

The invention discloses a method, a device, equipment, a storage medium and a product for determining calibration characters, wherein the method comprises the following steps: judging whether the input characters are any standard characters recorded in a character table or not; if yes, inputting characters without errors, and determining calibration characters; if not, splitting the input characters into at least one input single character; comparing at least one input single character with at least one standard single character in each standard character to obtain a correlation value between each standard character and the input character; selecting standard characters with correlation values larger than a set threshold value from all the standard characters as related characters; comparing the similarity of the pinyin of the related characters and the input characters one by one, and determining the similarity between the related characters and the input characters; and determining the relevant character with the highest similarity with the input character as the calibration character. The invention can determine the calibration character actually corresponding to the error character caused by the pinyin, and improve the accuracy of information acquisition through the calibration character.

Description

Method, device, equipment, storage medium and product for determining calibration characters
Technical Field
The invention relates to the technical field of information verification, in particular to a method, a device, equipment, a storage medium and a product for determining calibration characters.
Background
At present, information of a user needs to be acquired in many scenes of daily work and life, and a common method for acquiring the information of the user is that the user fills in the information through a mobile phone page or a computer page, the information is submitted after the information is filled in, and the terminal can receive the information filled in by the user.
One of the common ways for the user to input information is to input information through an input box, and the user fills information through the input box and then directly sends the information filled by the user to the terminal. However, in practice, there are many reasons for erroneous input that may occur during the process of inputting information by the user, and particularly when inputting information by the pinyin input method, there are cases where the pinyin is erroneous, for example, the character that the user wants to input is "football", and as a result, "zu" is spelled into "cu" in the actual spelling, and "kicking ball" is input.
Therefore, there is a need for a method for determining calibration words, which can determine calibration words actually corresponding to error words caused by pinyin, and improve the accuracy of information acquisition by using the calibration words.
Disclosure of Invention
The embodiment of the invention provides a method, a device, equipment, a storage medium and a product for determining calibration characters, which are used for determining calibration characters actually corresponding to error characters caused by pinyin and improving the accuracy of information acquisition by the calibration characters, wherein the method comprises the following steps:
judging whether the input characters are any standard characters recorded in a character table;
if yes, the input characters are correct, and calibration characters do not need to be determined;
if not, splitting the input characters into at least one input single character;
comparing the at least one input single character with at least one standard single character in each standard character to obtain a correlation value between each standard character and the input character;
selecting standard characters with correlation values larger than a set threshold value from all the standard characters as related characters;
comparing the similarity of the pinyin of the related characters and the input characters one by one, and determining the similarity between the related characters and the input characters;
and determining the relevant character with the highest similarity with the input character as the calibration character.
Preferably, the comparing the at least one input word with the at least one standard word in each standard word to obtain the correlation value between each standard word and the input word further includes:
comparing the at least one input single character with at least one standard single character in each standard character, and determining the standard single character which is the same as the input single character in each standard character as the same single character;
and calculating the ratio of the number of the same single characters to the total number of all the standard single characters in each standard character, and taking the ratio as a correlation value between each standard character and the input character.
Preferably, the comparing the similarity of the pinyin between the relevant characters and the input characters, and determining the similarity between the relevant characters and the input characters further includes:
acquiring related pinyin corresponding to the related characters and input pinyin corresponding to the input characters;
splitting the input pinyin into an input initial and an input final;
judging whether the input initial consonant is a similar initial consonant of the related pinyin and/or whether the input final is a similar final of the related pinyin;
if yes, determining that the similarity between the related characters and the input characters is 100%;
if not, splitting the input pinyin into at least one input tone, and comparing the at least one input tone with at least one related tone in the related pinyin to obtain the similarity between the related characters and the input characters.
Preferably, the method for determining similar initial consonants includes:
obtaining a first target letter forming the initial according to the initial of the relevant pinyin;
determining letters adjacent to the first target letter in the 26-key input method and the 9-key input method and the first target letter as similar initial consonants of the related pinyin;
the method for determining the similar finals comprises the following steps:
obtaining a second target letter forming a final according to the final of the related pinyin;
and determining letters adjacent to the second target letter in the 26-key input method and the 9-key input method and the second target letter as similar vowels of the related pinyin.
Preferably, the similar initial consonant is an initial consonant which is confused with the pronunciation of the initial consonant of the relevant pinyin; the similar vowels are vowels which are confused with the vowel pronunciation of the related pinyin.
Preferably, the comparing the at least one input tone with at least one related tone in the related pinyin to obtain the similarity between the related text and the input text further comprises:
comparing the at least one input tone with at least one related tone in the related Pinyin, and determining a related tone in the related Pinyin, which is the same as the input tone, as a same tone;
and calculating the ratio of the number of the same tones to the total number of all the relevant tones in the relevant pinyin, and taking the ratio as the similarity between the relevant characters and the input characters.
An embodiment of the present invention further provides a device for determining calibration text, including:
a judging module: judging whether the input characters are any standard characters recorded in a character table or not;
a first branch module: if yes, the input characters are correct, and calibration characters do not need to be determined;
a second branch module: if not, the method comprises the following steps:
splitting the submodule: splitting an input character into at least one input single character;
a first comparison submodule: comparing the at least one input single character with at least one standard single character in each standard character to obtain a correlation value between each standard character and the input character;
selecting a submodule: selecting standard characters with correlation values larger than a set threshold value from all the standard characters as related characters;
a second comparison submodule: comparing the similarity of the pinyin of the related characters and the input characters one by one, and determining the similarity between the related characters and the input characters;
determining a sub-module: and determining the relevant character with the highest similarity with the input character as the calibration character.
The embodiment of the present invention further provides a computer device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, and the processor implements the method when executing the computer program.
An embodiment of the present invention further provides a computer-readable storage medium, in which a computer program is stored, and the computer program, when executed by a processor, implements the above method.
An embodiment of the present invention further provides a computer program product, where the computer program product includes a computer program, and when the computer program is executed by a processor, the computer program implements the method described above.
Through the method, related characters can be screened out from all standard characters through single character comparison, the similarity between the related characters and the input characters is determined through pinyin similarity comparison, and the related characters with the highest similarity are determined as calibration characters. Therefore, the input characters input by the user in the input box by mistake can be calibrated through the calibration characters, and the accuracy of information acquisition is improved.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the embodiments or the prior art descriptions will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and other drawings can be obtained by those skilled in the art without creative efforts. In the drawings:
fig. 1 is a schematic flowchart illustrating a method for determining calibration texts, provided in an embodiment of the present disclosure;
FIG. 2 is a flow chart illustrating a method for obtaining a correlation value between each standard word and an input word provided by an embodiment of the present disclosure;
FIG. 3 is a flow chart illustrating a method for determining similarity between a relevant word and the input word provided by embodiments herein;
fig. 4 is a flowchart illustrating a method for determining similar initial consonants provided in the embodiments herein;
fig. 5 is a flowchart illustrating a method for determining similar finals provided in an embodiment of the present disclosure;
FIG. 6 is a schematic flow chart of obtaining similarity between related text and the input text according to an embodiment of the present disclosure;
fig. 7 is a schematic block diagram illustrating a device for determining calibration text according to an embodiment of the present disclosure;
fig. 8 shows a schematic structural diagram of a computer device provided in an embodiment herein.
Description of the figures the symbols:
100. a judgment module;
200. a first branching module;
300. a second branching module;
400. splitting the submodule;
500. a first comparison sub-module;
600. selecting a submodule;
700. a second comparison submodule;
800. determining a submodule;
802. a computer device;
804. a processor;
806. a memory;
808. a drive mechanism;
810. an input/output module;
812. an input device;
814. an output device;
816. a presentation device;
818. a graphical user interface;
820. a network interface;
822. a communication link;
824. a communication bus.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention more apparent, the embodiments of the present invention are further described in detail below with reference to the accompanying drawings. The exemplary embodiments and descriptions of the present invention are provided to explain the present invention, but not to limit the present invention.
At present, information of a user needs to be acquired in many scenes of daily work and life, and a common method for acquiring the information of the user is that the user fills in the information through a mobile phone page or a computer page, and the information is submitted after the information is filled in, so that the terminal can receive the information filled in by the user.
One of the common ways for the user to fill in the information is to input the information through an input box, and the user fills in the information through the input box and then directly sends the information filled in by the user to the terminal. However, in the process of inputting information by a user, wrong input may occur, and the wrong input may occur due to many reasons, especially when inputting information by a pinyin input method, pinyin may be wrong, for example, the character that the user wants to input is "football", and as a result, "zu" is spelled into "cu" in actual spelling, resulting in "kicking ball" being input.
In order to solve the above problem, embodiments herein provide a method for determining calibration texts. Fig. 1 is a flow chart of a method for determining calibration text provided in the embodiments herein, and the present specification provides the method operation steps as described in the embodiments or the flow chart, but may include more or less operation steps based on conventional or non-inventive labor. The order of steps recited in the embodiments is merely one manner of performing the steps in a multitude of orders and does not represent the only order of execution. When an actual system or apparatus product executes, it can execute sequentially or in parallel according to the method shown in the embodiment or the figures.
It should be noted that the terms "first," "second," and the like in the description and claims herein and in the above-described drawings are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used is interchangeable under appropriate circumstances such that the embodiments herein described are capable of operation in sequences other than those illustrated or described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, apparatus, article, or device that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or device.
Referring to fig. 1, a method for determining calibration text is provided, including:
s101: judging whether the input characters are any standard characters recorded in a character table;
s102: if yes, the input characters are correct, and calibration characters do not need to be determined;
s103: if not, splitting the input characters into at least one input single character;
s104: comparing the at least one input single character with at least one standard single character in each standard character to obtain a correlation value between each standard character and the input character;
s105: selecting standard characters with correlation values larger than a set threshold value from all the standard characters as related characters;
s106: comparing the similarity of the pinyin of the related characters and the input characters one by one, and determining the similarity between the related characters and the input characters;
s107: and determining the relevant character with the highest similarity with the input character as the calibration character.
The input characters are characters in an input box input by a user, the input box corresponds to a character table, and the standard characters recorded in the character table are characters allowed to be input in the input box. If the input text is correct, the input text should be any standard text recorded in the text table, but if the input text is incorrect, the calibration text corresponding to the input text needs to be determined through steps S103 to S107.
When determining the calibration characters, firstly splitting the input characters into at least one input single character, splitting the standard characters into at least one standard single character in advance, and comparing the at least one input single character with at least one standard single character in each standard character respectively to obtain the correlation value between each standard character and the input characters.
And taking the standard characters with the correlation values larger than the set threshold value in all the standard characters as the related characters, performing pinyin similarity comparison on the related characters and the input characters to determine the similarity between the related characters and the input characters, and determining the related characters with the highest similarity as the calibration characters.
Through the method, related characters can be screened out from all standard characters through single character comparison, the similarity between the related characters and the input characters is determined through pinyin similarity comparison, and the related characters with the highest similarity are determined as calibration characters. Therefore, the input characters input by the user in the input box by mistake can be calibrated through the calibration characters, and the accuracy of information acquisition is improved.
In this embodiment, referring to fig. 2, the comparing the at least one input word with the at least one standard word in each standard word to obtain the correlation value between each standard word and the input word further comprises:
s201: comparing the at least one input single character with at least one standard single character in each standard character, and determining the standard single character which is the same as the input single character in each standard character as the same single character;
s202: and calculating the ratio of the number of the same single characters to the total number of all the standard single characters in each standard character, and taking the ratio as a correlation value between each standard character and the input character.
For example, the standard word is football, and the corresponding standard words are foot and ball; the input characters are 'kick ball', and the corresponding input characters are 'kick' and 'ball'. The ball is the same single character, the number of the same single character is 1, the total number of all the standard single characters in the standard characters is 2, the ratio of the number of the same single characters to the total number of all the standard single characters is 1/2, and the ratio is used as the correlation value between the standard character football and the input character kicking ball.
For example, if the threshold is set to 0.49,1/2 is 0.5, and if the threshold is larger than the threshold, the standard word "football" is the relevant word.
Referring to fig. 3, the comparing the similarity of the pinyin between the relevant word and the input word, and determining the similarity between the relevant word and the input word further includes:
s301: acquiring related pinyin corresponding to the related characters and input pinyin corresponding to the input characters;
s302: splitting the input pinyin into an input initial consonant and an input final;
s303: judging whether the input initial consonant is a similar initial consonant of the related pinyin and/or whether the input final is a similar final of the related pinyin;
s304: if yes, determining that the similarity between the related characters and the input characters is 100%;
s305: if not, splitting the input pinyin into at least one input tone, and comparing the at least one input tone with at least one related tone in the related pinyin to obtain the similarity between the related characters and the input characters.
The relative pinyin corresponding to the football is zu qiu, the input pinyin corresponding to the input characters is cu qiu, the input pinyin is divided into input initials and input finals, and the input initials are as follows: c and q, and inputting vowels u and iu.
Referring to fig. 4, the method for determining similar initial consonants includes:
s401: obtaining a first target letter forming the initial consonant according to the initial consonant of the related pinyin;
s402: and determining the letters adjacent to the first target letter in the 26-key input method and the 9-key input method as similar initial consonants of the related pinyin.
For example, the initial consonants of the relevant pinyin are "z" and "q", the first target letters are "z" and "q", and the letters adjacent to the position of "z" in the first target letter may include, for example, the 26-key: "a", "s", "d", "x", and "c", the letters adjacent to the "q" position in the first target letter may include: "w", "s" and "a"; thus, similar initials for "z" can be obtained as "z", "a", "s", "d", "x", and "c", and similar initials for "q" can be obtained as "q", "w", "s", and "a".
Since the input initial consonant "c" of the associated text is a similar initial consonant of "z", the input initial consonant "q" of the associated text is a similar initial consonant of "q".
For the 26-key input method, the adjacent letters at the front, back, left and right or the adjacent letters at intervals belong to adjacent letters; for the 9-key input method, abc is located in the same key, so abc is adjacent letters.
Referring to fig. 5, the method for determining similar finals includes:
s501: obtaining a second target letter forming the final according to the final of the related pinyin;
s502: and determining letters adjacent to the second target letter position in the 26-key input method and the 9-key input method as similar vowels of the related pinyin.
Similarly, the input vowel "u" of the related characters is a similar vowel "u", and "iu" is a similar vowel "iu". Thus, the similarity between the relevant character "football" and the input character "kick ball" is 100%.
Besides the determination method of similar initial consonants and the determination method of similar vowels, the similar initial consonants can also be the initial consonants confused with the pronunciation of the initial consonants of the relevant pinyin; the similar finals can also be finals confused with the finals of the related pinyin.
Preferably, the similar initials can be obtained by the steps S401 to S402 and by synthesizing initials confused with the initials of the relevant pinyin, and the similar finals can be obtained by the steps S501 to S502 and by synthesizing finals confused with the finals of the relevant pinyin.
Specifically, the consonants confused with the pronunciation of the consonants of the relevant pinyin can be obtained by synthesizing different pronunciation habits in different regions and different pronunciations formed by personal physiological structures. For example, a pronunciation-confusing initial such as initial "z" may include "zh", a pronunciation-confusing initial such as initial "s" may include "sh", and so on.
In this embodiment, referring to fig. 6, the comparing the at least one input tone with at least one relevant tone in the relevant pinyin to obtain the similarity between the relevant word and the input word further comprises:
s601: comparing the at least one input tone with at least one related tone in the related Pinyin, and determining a related tone in the related Pinyin, which is the same as the input tone, as a same tone;
s602: and calculating the ratio of the number of the same tones to the total number of all the related tones in the related pinyin, and taking the ratio as the similarity between the related characters and the input characters.
If the input initial consonant is not the similar initial consonant of the related pinyin, and the input final is not the similar final of the related pinyin, for example, if the input character is 'kick ball', the related character is 'basketball', the input pinyin is 'cu qiu', and the input initial consonants are 'c' and 'q'; inputting vowels of the sounds as u and iu; the related pinyin is "lan qiu", wherein initials of the related pinyin are "l" and "q", and finals of the related pinyin are "an" and "iu". Although the input initial consonant "q" of the associated character is a similar initial consonant of "q", the input initial consonant "c" of the associated character is not a similar initial consonant of "l", and it can be determined that the input initial consonant is a similar initial consonant of the associated pinyin only if all the input initial consonants are similar initial consonants of the associated pinyin.
Further, the input pinyin needs to be split into at least one input tone, and the input tone is embodied in the form of an initial consonant or a final sound. The input tones obtained by splitting the input pinyin "cu qiu" are "c", "u", "q" and "iu", and the relevant tones of the relevant pinyin "lan qiu" are "l", "an", "q" and "iu". Wherein, 2 same tones are provided, the ratio of the number of the same tones to the total number of all the relevant tones is 2/4=1/2, and the similarity between the relevant character "basketball" and the input character "kick ball" is 50%.
In summary, "football" is more similar than "basketball", so the calibration text should be "football". By the calibration method, even if the user input is 'kick on ball', the corresponding calibration character can be determined to be 'football', and the accuracy of information acquisition is improved.
It should be noted that the user information (including but not limited to user device information, user personal information, etc.) and data (including but not limited to data for analysis, stored data, displayed data, etc.) referred to in the present application are information and data authorized by the user or sufficiently authorized by each party. In addition, the technical scheme described in the embodiment of the application conforms to relevant regulations of national laws and regulations in terms of data acquisition, storage, use, processing and the like.
The embodiment of the invention also provides a device for determining the calibration characters, which is described in the following embodiment. Because the principle of the device for solving the problems is similar to the method for determining the calibration characters, the implementation of the device can be referred to the implementation of the method, and repeated details are not repeated.
Referring to fig. 7, an apparatus for determining calibration text includes:
the judging module 100: judging whether the input characters are any standard characters recorded in a character table;
the first branching module 200: if yes, inputting characters without errors, and determining calibration characters;
the second branch module 300: if not, the method comprises the following steps:
splitting sub-module 400: splitting an input character into at least one input single character;
first comparison submodule 500: comparing the at least one input single character with at least one standard single character in each standard character to obtain a correlation value between each standard character and the input character;
selecting a submodule 600: selecting standard characters with correlation values larger than a set threshold value from all the standard characters as related characters;
second comparison submodule 700: comparing the similarity of the pinyin of the related characters and the input characters one by one, and determining the similarity between the related characters and the input characters;
determination submodule 800: and determining the relevant character with the highest similarity with the input character as the calibration character.
Referring to fig. 8, based on the method for determining calibration text, an embodiment of the present disclosure further provides a computer device 802, where the method is executed on the computer device 802. Computer device 802 may include one or more processors 804, such as one or more Central Processing Units (CPUs) or Graphics Processors (GPUs), each of which may implement one or more hardware threads. The computer device 802 may also include any memory 806 for storing any kind of information, such as code, settings, data, etc., and in a particular embodiment a computer program on the memory 806 and executable on the processor 804, which computer program, when executed by the processor 804, may perform instructions according to the above-described method.
For example, and without limitation, memory 806 may include any one or combination of the following: any type of RAM, any type of ROM, flash memory devices, hard disks, optical disks, etc. More generally, any memory may use any technology to store information. Further, any memory may provide volatile or non-volatile retention of information. Further, any memory may represent fixed or removable components of computer device 802. In one case, when the processor 804 executes the associated instructions, which are stored in any memory or combination of memories, the computer device 802 can perform any of the operations of the associated instructions. The computer device 802 also includes one or more drive mechanisms 808, such as a hard disk drive mechanism, an optical disk drive mechanism, etc., for interacting with any memory.
Computer device 802 may also include an input/output module 810 (I/O) for receiving various inputs (via input device 812) and for providing various outputs (via output device 814). One particular output mechanism may include a presentation device 816 and an associated graphical user interface 818 (GUI). In other embodiments, input/output module 810 (I/O), input device 812, and output device 814 may also be excluded as just one computer device in a network. Computer device 802 may also include one or more network interfaces 820 for exchanging data with other devices via one or more communication links 822. One or more communication buses 824 couple the above-described components together.
Communication link 822 may be implemented in any manner, such as over a local area network, a wide area network (e.g., the Internet), a point-to-point connection, etc., or any combination thereof. The communication link 822 may include any combination of hardwired links, wireless links, routers, gateway functions, name servers, etc., governed by any protocol or combination of protocols.
The embodiment of the present invention further provides a computer device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, and the processor implements the method when executing the computer program.
An embodiment of the present invention further provides a computer-readable storage medium, where a computer program is stored, and when the computer program is executed by a processor, the computer program implements the foregoing method.
An embodiment of the present invention further provides a computer program product, where the computer program product includes a computer program, and when the computer program is executed by a processor, the computer program implements the method described above.
It should be understood that, in various embodiments herein, the sequence numbers of the above-mentioned processes do not mean the execution sequence, and the execution sequence of each process should be determined by its function and inherent logic, and should not constitute any limitation to the implementation process of the embodiments herein.
It should also be understood that, in the embodiments herein, the term "and/or" is only one kind of association relation describing an associated object, meaning that three kinds of relations may exist. For example, a and/or B, may represent: a exists alone, A and B exist simultaneously, and B exists alone. In addition, the character "/" herein generally indicates that the former and latter related objects are in an "or" relationship.
Those of ordinary skill in the art will appreciate that the elements and algorithm steps of the examples described in connection with the embodiments disclosed herein may be embodied in electronic hardware, computer software, or combinations of both, and that the components and steps of the examples have been described in a functional general in the foregoing description for the purpose of illustrating clearly the interchangeability of hardware and software. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the implementation. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present disclosure.
It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the above-described systems, apparatuses and units may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
In the several embodiments provided herein, it should be understood that the disclosed system, apparatus, and method may be implemented in other ways. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the units is only one type of logical functional division, and other divisions may be realized in practice, for example, multiple units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may also be an electric, mechanical or other form of connection.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purposes of the embodiments herein.
In addition, functional units in the embodiments herein may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit may be implemented in the form of hardware, or may also be implemented in the form of a software functional unit.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer readable storage medium.
Based on such understanding, the technical solutions of the present invention may be implemented in a form of a software product, which is stored in a storage medium and includes several instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the methods described in the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
The principles and embodiments of this document are explained herein using specific examples, which are presented only to aid in understanding the methods and their core concepts; meanwhile, for a person skilled in the art, according to the idea of the present disclosure, there may be variations in the specific embodiments and the application scope, and in summary, the content of the present disclosure should not be construed as a limitation to the present disclosure.

Claims (10)

1. A method for determining calibration text, comprising:
judging whether the input characters are any standard characters recorded in a character table;
if yes, inputting characters without errors, and determining calibration characters;
if not, splitting the input characters into at least one input single character;
comparing the at least one input single character with at least one standard single character in each standard character to obtain a correlation value between each standard character and the input character;
selecting standard characters with correlation values larger than a set threshold value from all the standard characters as related characters;
comparing the similarity of the pinyin of the related characters and the input characters one by one, and determining the similarity between the related characters and the input characters;
and determining the relevant character with the highest similarity with the input character as the calibration character.
2. The method for determining calibration words according to claim 1, wherein the comparing the at least one input word with at least one standard word in each standard word to obtain the correlation value between each standard word and the input word further comprises:
comparing the at least one input single character with at least one standard single character in each standard character, and determining the standard single character which is the same as the input single character in each standard character as the same single character;
and calculating the ratio of the number of the same single characters to the total number of all the standard single characters in each standard character, and taking the ratio as a correlation value between each standard character and the input character.
3. The method of determining calibration words according to claim 2, wherein said comparing the similarity of pinyin between said associated word and said input word, determining the similarity between said associated word and said input word further comprises:
acquiring related pinyin corresponding to the related characters and input pinyin corresponding to the input characters;
splitting the input pinyin into an input initial and an input final;
judging whether the input initial consonant is a similar initial consonant of the related pinyin and/or whether the input final is a similar final of the related pinyin;
if yes, determining that the similarity between the related characters and the input characters is 100%;
if not, splitting the input pinyin into at least one input tone, and comparing the at least one input tone with at least one related tone in the related pinyin to obtain the similarity between the related characters and the input characters.
4. The method for determining calibration letters according to claim 3, wherein the method for determining similar initial consonants comprises:
obtaining a first target letter forming the initial consonant according to the initial consonant of the related pinyin;
determining letters adjacent to the first target letter in the 26-key input method and the 9-key input method and the first target letter as similar initial consonants of the related pinyin;
the method for determining the similar finals comprises the following steps:
obtaining a second target letter forming the final according to the final of the related pinyin;
and determining letters adjacent to the second target letter in the 26-key input method and the 9-key input method and the second target letter as similar vowels of the related pinyin.
5. The method for determining calibration word according to claim 3, wherein the similar initial is an initial confused with the pronunciation of the initial of the associated pinyin; the similar vowels are vowels which are confused with the vowel pronunciation of the related pinyin.
6. The method of claim 3, wherein the comparing the at least one input tone with at least one associated tone in the associated pinyin to obtain the similarity between the associated text and the input text further comprises:
comparing the at least one input tone with at least one related tone in the related pinyin, and determining the related tone which is the same as the input tone in the related pinyin as the same tone;
and calculating the ratio of the number of the same tones to the total number of all the relevant tones in the relevant pinyin, and taking the ratio as the similarity between the relevant characters and the input characters.
7. An apparatus for determining calibration text, comprising:
a judging module: judging whether the input characters are any standard characters recorded in a character table;
a first branch module: if yes, inputting characters without errors, and determining calibration characters;
a second branch module: if not, the method comprises the following steps:
splitting the submodule: splitting an input character into at least one input single character;
a first comparison submodule: comparing the at least one input single character with at least one standard single character in each standard character to obtain a correlation value between each standard character and the input character;
selecting a submodule: selecting standard characters with correlation values larger than a set threshold value from all the standard characters as related characters;
a second comparison submodule: comparing the similarity of the pinyin of the related characters and the input characters one by one, and determining the similarity between the related characters and the input characters;
determining a submodule: and determining the relevant character with the highest similarity with the input character as the calibration character.
8. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor implements the method of any one of claims 1 to 6 when executing the computer program.
9. A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program which, when executed by a processor, implements the method of any of claims 1 to 6.
10. A computer program product, characterized in that the computer program product comprises a computer program which, when being executed by a processor, carries out the method of any one of claims 1 to 6.
CN202211231864.5A 2022-09-30 2022-09-30 Method, device, equipment, storage medium and product for determining calibration characters Pending CN115562505A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211231864.5A CN115562505A (en) 2022-09-30 2022-09-30 Method, device, equipment, storage medium and product for determining calibration characters

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211231864.5A CN115562505A (en) 2022-09-30 2022-09-30 Method, device, equipment, storage medium and product for determining calibration characters

Publications (1)

Publication Number Publication Date
CN115562505A true CN115562505A (en) 2023-01-03

Family

ID=84745537

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211231864.5A Pending CN115562505A (en) 2022-09-30 2022-09-30 Method, device, equipment, storage medium and product for determining calibration characters

Country Status (1)

Country Link
CN (1) CN115562505A (en)

Similar Documents

Publication Publication Date Title
US8812302B2 (en) Techniques for inserting diacritical marks to text input via a user device
US7398200B2 (en) Token stream differencing with moved-block detection
JP5245255B2 (en) Specific expression extraction program, specific expression extraction method, and specific expression extraction apparatus
JP2020533654A (en) Holographic anti-counterfeit code inspection method and equipment
CN101208689A (en) Method and apparatus for creating a language model and kana-kanji conversion
CN105549760A (en) Information input method and device
CN109582775B (en) Information input method, device, computer equipment and storage medium
CN106843141A (en) Numerical control device
CN111199146A (en) Test report generation method and device, computer equipment and storage medium
CN110209780B (en) Question template generation method and device, server and storage medium
CN112162670B (en) Online learning method and device, electronic equipment and storage medium
CN115562505A (en) Method, device, equipment, storage medium and product for determining calibration characters
CN112527967A (en) Text matching method, device, terminal and storage medium
CN112487334A (en) Method, apparatus, computer device and medium for front end page language translation
US20090177690A1 (en) Determining an Optimal Solution Set Based on Human Selection
US8482529B2 (en) Computer input system and input method thereof
JP2010165141A (en) Method for extracting specific location from text log, and program
KR101491549B1 (en) Information retrieval device and information retrieval method
CN113282837A (en) Event analysis method and device, computer equipment and storage medium
CN112329424A (en) Service data processing method and device, storage medium and electronic equipment
CN108509057B (en) Input method and related equipment
CN110309127B (en) Data processing method and device and electronic equipment
CN114676685B (en) Voice text error processing method and device, electronic equipment and storage medium
CN102193967B (en) The relatively value of bounded domain
JP7211297B2 (en) Correction Support Program, Correction Support Method, and Information Processing Device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination