CN113138677A - Method and device for determining candidate words of input method, electronic equipment and storage medium - Google Patents

Info

Publication number
CN113138677A
Authority
CN
China
Prior art keywords
determining
words
input method
list
input
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110449912.7A
Other languages
Chinese (zh)
Inventor
胡恒一
庄杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing ByteDance Network Technology Co Ltd
Original Assignee
Beijing ByteDance Network Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing ByteDance Network Technology Co Ltd filed Critical Beijing ByteDance Network Technology Co Ltd
Priority to CN202110449912.7A priority Critical patent/CN113138677A/en
Publication of CN113138677A publication Critical patent/CN113138677A/en
Pending legal-status Critical Current

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/02 Input arrangements using manually operated switches, e.g. using keyboards or dials
    • G06F3/023 Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
    • G06F3/0233 Character input methods
    • G06F3/0237 Character input methods using prediction or retrieval techniques
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/237 Lexical tools

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

Embodiments of the present disclosure provide a method, an apparatus, an electronic device, and a storage medium for determining candidate words of an input method. The method comprises: receiving characters input by a user through an input method; according to the characters, determining a first vocabulary list from a dedicated database matched with the application in which the input method is currently used, and determining a second vocabulary list from a general database corresponding to the input method; and determining at least one target candidate word corresponding to the characters based on the first vocabulary list and the second vocabulary list. The method improves the accuracy with which candidate words are determined and can reduce how often the user has to page through the candidate words to find the target word.

Description

Method and device for determining candidate words of input method, electronic equipment and storage medium
Technical Field
The present disclosure relates to the field of input method technologies, and in particular, to a method and an apparatus for determining candidate words of an input method, an electronic device, and a storage medium.
Background
With the rapid development of mobile terminals, mobile terminals have become an indispensable tool in human life. Inputting information based on a virtual keyboard of a mobile terminal is one of the necessary functions of the mobile terminal.
In order to improve the information input efficiency, most of the existing input methods have an intelligent recommendation function, and after a user inputs characters, candidate information is intelligently recommended based on the input of the user.
However, the recommendation accuracy of the intelligent recommendation function of the existing input method still needs to be improved continuously.
Disclosure of Invention
To solve the above technical problem, or at least partially solve it, embodiments of the present disclosure provide a method, an apparatus, an electronic device, and a storage medium for determining candidate words of an input method, which improve the accuracy with which candidate words are determined and thereby reduce how often a user has to page through the candidate words to find a target word.
In a first aspect, an embodiment of the present disclosure provides a method for determining an input method candidate word, where the method includes:
receiving characters input by a user through an input method;
according to the characters, determining a first vocabulary list from a dedicated database matched with the application in which the input method is currently used, and determining a second vocabulary list from a general database corresponding to the input method;
determining at least one target candidate word corresponding to the characters based on the first vocabulary list and the second vocabulary list.
In a second aspect, an embodiment of the present disclosure further provides an apparatus for determining an input method candidate word, where the apparatus includes:
a receiving module, configured to receive characters input by a user through an input method;
a first determining module, configured to determine, according to the characters, a first vocabulary list from a dedicated database matched with the application in which the input method is currently used, and to determine a second vocabulary list from a general database corresponding to the input method;
a second determining module, configured to determine at least one target candidate word corresponding to the characters based on the first vocabulary list and the second vocabulary list.
In a third aspect, an embodiment of the present disclosure further provides an electronic device, where the electronic device includes:
one or more processors;
storage means for storing one or more programs;
when executed by the one or more processors, the one or more programs cause the one or more processors to implement a method for determining input method candidate words according to any embodiment of the present disclosure.
In a fourth aspect, this disclosure further provides a computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the method for determining the input method candidate word according to any embodiment of the disclosure.
In a fifth aspect, the present disclosure further provides a computer program product, where the computer program product includes a computer program or instructions, and the computer program or instructions, when executed by a processor, implement the method for determining the input method candidate word according to any embodiment of the present disclosure.
Compared with the prior art, the technical scheme provided by the embodiment of the disclosure has at least the following advantages:
In the method for determining candidate words of an input method provided by the embodiments of the present disclosure, candidate words are queried from both the dedicated database and the general database according to the characters input by the user, with reference to the type of the application in which the input method is used. This helps determine candidate words that meet the user's expectations, improves the accuracy of the candidate words, reduces how often the user has to page through the candidates, and improves the user experience.
Drawings
The above and other features, advantages and aspects of various embodiments of the present disclosure will become more apparent by referring to the following detailed description when taken in conjunction with the accompanying drawings. Throughout the drawings, the same or similar reference numbers refer to the same or similar elements. It should be understood that the drawings are schematic and that elements and features are not necessarily drawn to scale.
Fig. 1 is a flowchart of a method for determining candidate words of an input method in an embodiment of the present disclosure;
Fig. 2 is a schematic structural diagram of an apparatus for determining candidate words of an input method in an embodiment of the present disclosure;
Fig. 3 is a schematic structural diagram of an electronic device in an embodiment of the present disclosure.
Detailed Description
Embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While certain embodiments of the present disclosure are shown in the drawings, it is to be understood that the present disclosure may be embodied in various forms and should not be construed as limited to the embodiments set forth herein, but rather are provided for a more thorough and complete understanding of the present disclosure. It should be understood that the drawings and embodiments of the disclosure are for illustration purposes only and are not intended to limit the scope of the disclosure.
It should be understood that the various steps recited in the method embodiments of the present disclosure may be performed in a different order, and/or performed in parallel. Moreover, method embodiments may include additional steps and/or omit performing the illustrated steps. The scope of the present disclosure is not limited in this respect.
The term "include" and variations thereof as used herein are open-ended, i.e., "including but not limited to". The term "based on" is "based, at least in part, on". The term "one embodiment" means "at least one embodiment"; the term "another embodiment" means "at least one additional embodiment"; the term "some embodiments" means "at least some embodiments". Relevant definitions for other terms will be given in the following description.
It should be noted that the terms "first", "second", and the like in the present disclosure are only used for distinguishing different devices, modules or units, and are not used for limiting the order or interdependence relationship of the functions performed by the devices, modules or units.
It should be noted that the modifiers "a", "an", and "the" in this disclosure are illustrative rather than limiting, and those skilled in the art will understand them to mean "one or more" unless the context clearly dictates otherwise.
The names of messages or information exchanged between devices in the embodiments of the present disclosure are for illustrative purposes only, and are not intended to limit the scope of the messages or information.
Fig. 1 is a flowchart of a method for determining candidate words of an input method in an embodiment of the present disclosure. The method is applicable to scenarios in which, while a user is inputting information, a complete word is recommended based on the limited characters the user has typed so far, which improves input efficiency and the user experience. The method can be executed by an apparatus for determining candidate words of an input method. The apparatus can be implemented in software and/or hardware and can be configured in an electronic device such as a server or a terminal. The terminal includes, but is not limited to, an electronic device with an input interface, such as a smart phone, a palmtop computer, a tablet computer, a wearable device with a display screen, a desktop computer, a notebook computer, an all-in-one machine, or a smart home device.
As shown in fig. 1, the method may specifically include the following steps:
Step 110, receiving characters input by a user through an input method.
The characters may be pinyin or English letters input by the user through the input method, and they express the user's current input intention. For example, to input the text "mathematics", the user calls up the input method keyboard in the input box and types the pinyin "shuxue". To speed up input, the input method automatically displays candidate words once the user has typed only some of the characters. For example, when the user has typed "shux" but has not yet finished typing "shuxue", the input method displays the text "mathematics"; the user can select it directly without typing the remaining characters "ue", which saves time and improves input efficiency. In another application scenario, suppose the user has quickly typed the complete string "shuxue". Based on the user's historical input record or the input context (for example, the text "language" was input immediately before "shuxue"), the input method displays the text "mathematics" near the front of the candidate word list. The user can then find the desired text quickly without paging through the list, which improves information input efficiency.
Similarly, to input the English word "enjoy", the user types the characters "enjoy" on the input method keyboard. The input method typically displays candidate words once the user has typed only some of the characters. For example, when the user has typed "en" but has not finished typing "enjoy", the input method displays the English word "enjoy"; the user can select it directly without typing the remaining characters "joy", which saves time and improves input efficiency.
However, when the user inputs the characters "shux", deciding whether the user wants "mathematics", "familiar", or some other text such as "writing" requires combining several kinds of information. The purpose of the scheme provided by this embodiment is to determine accurately what the user actually wants to input and to display it so that the user can find and enter it quickly.
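The ambiguity described above can be illustrated with a minimal sketch of prefix matching. The lexicon, the function name `complete`, and the spellings "shuxi" and "shuxie" are assumptions made for illustration (the English glosses stand in for the Chinese words); the disclosure does not specify how the lookup is implemented.

```python
# Hypothetical toy lexicon of (spelling, word) pairs. The English glosses stand
# in for the Chinese words; "shuxi" and "shuxie" are assumed spellings.
LEXICON = [
    ("shuxue", "mathematics"),
    ("shuxi", "familiar"),
    ("shuxie", "writing"),
    ("enjoy", "enjoy"),
]


def complete(chars: str) -> list[str]:
    """Return every word whose spelling starts with the characters typed so far."""
    return [word for spelling, word in LEXICON if spelling.startswith(chars)]


print(complete("shux"))  # ['mathematics', 'familiar', 'writing']
print(complete("en"))    # ['enjoy']
```

With only "shux" typed, three words match, so additional information is needed to decide which one to show first.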
Step 120, determining, according to the characters, a first vocabulary list from the dedicated database matched with the application in which the input method is currently used, and determining a second vocabulary list from the general database corresponding to the input method.
The application in which the input method is currently used is the application that is calling the input method. For example, when a user chats with a friend in a chat application, the input method is used inside the chat application to enter the chat messages, so the chat application is the application in which the input method is currently used. Likewise, when a user queries information through a browser, the input method is called to type the search term into the browser's search window, so the browser is the application in which the input method is currently used.
In one embodiment, the dedicated database matched with the application in which the input method is currently used is determined based on the application type of that application. For example, when the user chats with a friend in a chat application, the application type may be determined to be the social category. When the user queries information through a browser, the application type may be determined to be the query category. When the user uses the input method in a learning application, the application type may be determined to be the education category. When the user uses the input method in a game application, the application type may be determined to be the game category. When the user uses the input method in a home application, the application type may be determined to be the life category, and so on. The range of candidate words a user needs may differ between application types: in a social application the user may need more words of politeness, praise, and the like, while in a learning or educational application the user may need more technical or literary terms. That is, for the same input characters, the information the user wants to enter may differ between application types.
For example, when the user enters the characters "shux" in an educational application, the desired input is more likely to be "mathematics"; when the user enters "shux" in a social application, the desired input is more likely to be "familiar". Determining candidate words with reference to the type of the application in which the input method is currently used can therefore improve their accuracy.
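The effect of the application type on the lookup can be sketched as follows. The dictionary `DEDICATED_WORDS`, its keys, and the function name are hypothetical and only mirror the "shux" example; they are not taken from the disclosure.

```python
# Hypothetical dedicated word banks keyed by application type. The entries
# mirror the "shux" example; English glosses stand in for the Chinese words.
DEDICATED_WORDS = {
    "education": {"shux": ["mathematics"]},
    "social": {"shux": ["familiar"]},
}


def first_vocabulary_list(app_type: str, chars: str) -> list[str]:
    """Look up the typed characters in the word bank for the given application type."""
    return DEDICATED_WORDS.get(app_type, {}).get(chars, [])


print(first_vocabulary_list("education", "shux"))  # ['mathematics']
print(first_vocabulary_list("social", "shux"))     # ['familiar']
```

An unknown application type simply yields an empty list here, which leaves the general database as the only source of candidates.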
In one embodiment, common applications may be classified into application types based on business practice, and a mapping relationship among the application identifier of each application, the type identifier of the application type to which it belongs, and the database identifier of the matched dedicated database may be stored (for example, in the form of a list). In use, the type identifier is determined from the application identifier of the application in which the input method is currently used, the database identifier of the matched dedicated database is determined from the type identifier, and the corresponding dedicated database is then queried to determine the first vocabulary list corresponding to the characters input by the user. The advantage of storing the mapping in this way is that applications with different application identifiers may belong to the same application type, that is, several applications may share one type identifier. Establishing one mapping between application identifiers and type identifiers and another between type identifiers and database identifiers therefore reduces the amount of mapping data that involves database identifiers, compared with mapping every application identifier directly to a database identifier.
Specifically, if the application belongs to the social category, the matched dedicated database may be a social word bank that stores mainly words associated with social interaction. For example, the word corresponding to the characters "pengy" is "friend", the word corresponding to "shux" is "familiar", and the word corresponding to "you" is "friendship" rather than "useful". If the application belongs to the education category, the matched dedicated database may be an education word bank that stores mainly words related to education. For example, the word corresponding to "shux" is "mathematics", the word corresponding to "yuw" is "language" rather than "rain fog", and the word corresponding to "yingy" is "english". If the application belongs to the game category, the matched dedicated database may be a game word bank that stores mainly words associated with game terms. For example, the words corresponding to the characters "wangz" are "king" or "king glory", rather than words with little connection to game terms such as "look at" or "website"; "king" and "king glory" then form the first vocabulary list.
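The two-level mapping described above can be sketched as two small lookup tables. All identifiers below (`com.example.chat`, `db_social`, and so on) are invented for illustration; the disclosure only says the mapping may be stored, for example, as a list.

```python
from typing import Optional

# Hypothetical identifiers: several applications share one type identifier,
# and each type identifier maps to exactly one dedicated database identifier.
APP_TO_TYPE = {
    "com.example.chat": "social",
    "com.example.messenger": "social",
    "com.example.school": "education",
    "com.example.arena": "game",
}
TYPE_TO_DATABASE = {
    "social": "db_social",
    "education": "db_education",
    "game": "db_game",
}


def resolve_database(app_id: str) -> Optional[str]:
    """Resolve application identifier -> type identifier -> database identifier."""
    type_id = APP_TO_TYPE.get(app_id)
    if type_id is None:
        return None  # unknown application: no dedicated database is matched
    return TYPE_TO_DATABASE.get(type_id)


print(resolve_database("com.example.chat"))       # db_social
print(resolve_database("com.example.messenger"))  # db_social
```

In this sketch database identifiers appear only three times, once per type, no matter how many applications are registered, which is the data reduction the paragraph above describes.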
It will be appreciated that in another embodiment, only the mapping relationship between the application identifier of the application and the database identifier of the matched dedicated database may be stored, without adding the type identifier of the application type to which the application belongs, that is, establishing a direct association between the application and the dedicated database, rather than establishing an indirect association between the application and the dedicated database through the application type to which the application belongs.
The general database is a database applicable to any application scenario. It stores commonly used words in advance without classifying them, that is, it contains words of any category, generally collected according to how frequently people have used them in the past. A user may sometimes want to input, in a specific application, information that has little to do with that application. To meet this need, the technical scheme of this embodiment queries the dedicated database matched with the specific application to obtain the first vocabulary list and also queries the general database to obtain the second vocabulary list. A certain number of candidate words are then determined from each of the two lists for the user to choose from. The determined candidate words therefore include both dedicated words that match the application type of the application in which the input method is currently used and general words that are unrelated to that application type, which further improves the accuracy of the candidate words and reduces how often the user has to page through them to find the target word.
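A minimal sketch of querying both sources for the same input is given below. The two in-memory lists stand in for the dedicated and general databases, and the prefix match is an assumed lookup strategy; the disclosure does not prescribe a storage format or matching algorithm.

```python
# Hypothetical in-memory stand-ins for the two databases, as (spelling, word) pairs.
DEDICATED_EDUCATION = [("shuxue", "mathematics")]
GENERAL = [("shuxi", "familiar"), ("shuxue", "mathematics"), ("shuxie", "writing")]


def query(database: list[tuple[str, str]], chars: str) -> list[str]:
    """Return the words in one database whose spelling starts with the typed characters."""
    return [word for spelling, word in database if spelling.startswith(chars)]


def determine_lists(chars, dedicated, general):
    """Return (first vocabulary list, second vocabulary list) for the typed characters."""
    return query(dedicated, chars), query(general, chars)


first_list, second_list = determine_lists("shux", DEDICATED_EDUCATION, GENERAL)
print(first_list)   # ['mathematics']
print(second_list)  # ['familiar', 'mathematics', 'writing']
```

Note that "mathematics" appears in both lists in this example, so the two results cannot simply be concatenated without handling repeated words.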
Step 130, determining at least one target candidate word corresponding to the character based on the first vocabulary list and the second vocabulary list.
Illustratively, determining at least one target candidate word corresponding to the character based on the first list of words and the second list of words includes: determining at least one first candidate word corresponding to the character from a first vocabulary list; determining at least one second candidate word corresponding to the character from a second vocabulary list; determining the at least one target candidate word based on the first candidate word and the second candidate word.
In one embodiment, determining at least one first candidate word corresponding to the characters from the first vocabulary list includes randomly selecting a certain number of words from the first vocabulary list as the first candidate words. The specific number may be determined based on the total number of words that can be displayed on a single page; for example, if a single page can display 10 words, 3 or 4 high-frequency words may be determined from the first vocabulary list. Alternatively, all the words in the first vocabulary list are sorted according to a set sorting rule, and at least one word is selected as a first candidate word according to its position in the sorted list. Compared with random selection, selection by sorted position lets the user find the desired text more quickly and improves the accuracy of the target candidate words. The set sorting rule includes at least one of the following: sorting by frequency of use, sorting by time of use, and sorting by initial letter. Sorting by frequency of use may work as follows: the first vocabulary list includes "king" and "king glory", and "king glory" is used more frequently than "king", so "king glory" is placed in front of "king". Sorting by time of use may work as follows: the first vocabulary list includes "king" and "king glory", and the user entered "king" most recently, so "king" is placed in front of "king glory".
Sorting by initial letter may work as follows: when the first vocabulary list includes two words, the word whose initial letter comes first is placed in front of the other. The purpose of sorting is to place the words that are used more frequently or were used most recently (such as the word the user entered last time) at the front. These are usually the words the user actually wants to input, so placing them at the front lets the user find the target word as soon as possible and reduces how often the user has to page through the candidate words.
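The three sorting rules can be sketched as interchangeable sort keys. The `Entry` fields, the rule names, and the sample numbers are assumptions for illustration, and using the first letter of the English gloss for the initial-letter rule is a simplification.

```python
from dataclasses import dataclass


@dataclass
class Entry:
    word: str
    frequency: int    # hypothetical usage count
    last_used: float  # hypothetical timestamp of the most recent use


# Each rule maps an entry to a sort key; smaller keys are displayed first.
SORT_RULES = {
    "frequency": lambda e: -e.frequency,  # most frequently used first
    "recency": lambda e: -e.last_used,    # most recently used first
    "initial": lambda e: e.word[:1],      # alphabetical by first letter
}


def rank(entries: list[Entry], rule: str) -> list[str]:
    """Sort the vocabulary list by the chosen rule and return the words in display order."""
    return [e.word for e in sorted(entries, key=SORT_RULES[rule])]


entries = [Entry("king", 5, 200.0), Entry("king glory", 9, 100.0)]
print(rank(entries, "frequency"))  # ['king glory', 'king']
print(rank(entries, "recency"))    # ['king', 'king glory']
```

Because `sorted` is stable, words that tie under a rule keep their original relative order.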
Similarly, in one embodiment, determining at least one second candidate word corresponding to the characters from the second vocabulary list includes randomly selecting a certain number of words from the second vocabulary list as the second candidate words. The specific number may be determined based on the total number of words that can be displayed on a single page; for example, if a single page can display 10 words, 5 or 6 high-frequency words may be determined from the second vocabulary list. Alternatively, all the words in the second vocabulary list are sorted according to a set sorting rule, and at least one word is selected as a second candidate word according to its position in the sorted list. Compared with random selection, selection by sorted position lets the user find the desired text more quickly and improves the accuracy of the target candidate words.
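The two selection modes, random selection and selection by sorted position, can be sketched as follows. The function names, the optional `seed` parameter, and the sample words are hypothetical; the count of 5 follows the "5 or 6 words on a page of 10" example.

```python
import random
from typing import Optional


def pick_random(words: list[str], count: int, seed: Optional[int] = None) -> list[str]:
    """Randomly select up to `count` distinct words from a vocabulary list."""
    rng = random.Random(seed)
    return rng.sample(words, min(count, len(words)))


def pick_top(ranked_words: list[str], count: int) -> list[str]:
    """Select the first `count` words of an already sorted vocabulary list."""
    return ranked_words[:count]


# Hypothetical second vocabulary list, already sorted by a set sorting rule.
second_list = ["w1", "w2", "w3", "w4", "w5", "w6", "w7"]
print(pick_top(second_list, 5))          # ['w1', 'w2', 'w3', 'w4', 'w5']
print(len(pick_random(second_list, 5)))  # 5
```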
Further, in one embodiment, determining the at least one target candidate word based on the first candidate words and the second candidate words includes the following. All of the first candidate words are taken as target candidate words and displayed at relatively front positions, either in random order or in order of their sorted positions. All of the second candidate words are taken as target candidate words and displayed at relatively rear positions, either in random order or in order of their sorted positions. The target candidate words on subsequent pages may all be words from the second vocabulary list. When the text the user wants to input is closely related to the current application, this implementation lets the user find the text quickly without paging, which improves input efficiency. In another embodiment, only the first few of the first candidate words are displayed in order at relatively front positions as target candidate words. The specific number may be determined based on the total number of words that can be displayed on a single page; for example, if a single page can display 10 words, 3 or 4 high-frequency words may be determined from the first vocabulary list. All of the second candidate words are displayed at relatively rear positions as target candidate words, either in random order or in order of their sorted positions, and the target candidate words on subsequent pages may all be words from the second vocabulary list. Alternatively, the first candidate words and the second candidate words are merged, the merged candidate words are sorted according to a set sorting rule, and the candidate words are then displayed in order of their sorted positions.
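Two of the display strategies above, "first candidates in front" and "merge, then sort", can be sketched together with a simple pagination helper. The function names, the page size of 3, and the frequency table are assumptions chosen to keep the example small.

```python
def first_then_second(first: list[str], second: list[str]) -> list[str]:
    """Show the first candidate words in front and the second candidate words after them."""
    return first + second


def merged_and_sorted(first: list[str], second: list[str], frequency: dict[str, int]) -> list[str]:
    """Merge both groups, then sort the merged candidates by a hypothetical frequency table."""
    return sorted(first + second, key=lambda word: -frequency.get(word, 0))


def paginate(words: list[str], page_size: int) -> list[list[str]]:
    """Split the display order into pages of at most `page_size` words."""
    return [words[i:i + page_size] for i in range(0, len(words), page_size)]


first = ["king glory", "king"]
second = ["website", "prince", "look at"]
print(paginate(first_then_second(first, second), 3))
# [['king glory', 'king', 'website'], ['prince', 'look at']]
print(merged_and_sorted(first, second, {"website": 10, "king glory": 7, "king": 3}))
# ['website', 'king glory', 'king', 'prince', 'look at']
```

In the first strategy the later pages contain only words from the second group, as the paragraph above notes; in the merged strategy a very frequent general word can overtake the dedicated words.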
It is understood that the first candidate words and the second candidate words may contain the same word. A deduplication step can therefore be added so that only one copy of each repeated word is kept, which prevents the same word from being displayed more than once.
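One simple way to keep a single copy of each repeated word while preserving display order is sketched below. The disclosure does not name a specific deduplication algorithm, so this order-preserving approach is an assumption.

```python
def deduplicate(words: list[str]) -> list[str]:
    """Keep the first occurrence of each word and preserve the display order."""
    # dict keys are unique and keep insertion order, so later repeats are dropped.
    return list(dict.fromkeys(words))


first_candidates = ["mathematics"]
second_candidates = ["familiar", "mathematics", "writing"]
print(deduplicate(first_candidates + second_candidates))
# ['mathematics', 'familiar', 'writing']
```

Keeping the first occurrence means a word found in both groups stays at the position it had among the first candidate words.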
Specifically, in one embodiment, when receiving a character input by a user, determining a dedicated database identifier matched with an application where an input method is currently located from a mapping relationship between a prestored application identifier and a dedicated database identifier based on an application identifier where the input method is currently located, querying from the matched dedicated database based on the dedicated database identifier, and sorting vocabularies in a queried first vocabulary list; then, inquiring from the general database, sequencing all the words in the inquired second word list, then carrying out de-duplication processing on all the words in the first word list and all the words in the second word list, and deleting the same words in the first word list and the second word list from the first word list or the second word list so as to avoid returning repeated candidate words; and finally, sequentially displaying the first 8 pieces of data in the first sorted vocabulary list, all the data in the second sorted vocabulary list and the rest data except the first 8 pieces of data in the first sorted vocabulary list. The first 8 pieces of data in the first vocabulary list are preferentially displayed and are displayed at the front position of the candidate word list so as to be seen by a user at first sight, then all data in the second vocabulary list after sequencing are sequentially displayed, and finally the rest data in the first vocabulary list are sequentially displayed. The data is inquired from the matched special database according to the application type of the current application of the input method, so that the precision of the determined candidate word can be improved, the determined candidate word is the target word which the user really wants to input, and the information input efficiency of the user and the use experience of the user are improved. 
Data retrieved from the general database supplements these results, covering the case where a user working in a specific application enters information unrelated to that application. The retrieved candidate word list is further sorted by use frequency or use time, so that words used more frequently or more recently are placed toward the front. This further speeds up the user's selection and improves the user's input efficiency.
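As a minimal sketch of the two sorting criteria just mentioned, the following Python ranks entries by use frequency or by most recent use. The `Entry` record and its field names are assumptions made for illustration; the disclosure does not specify how usage statistics are stored.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Entry:
    word: str
    use_count: int    # assumed: how many times the user has selected the word
    last_used: float  # assumed: timestamp of the most recent selection


def rank_by_frequency(entries: List[Entry]) -> List[str]:
    """Most frequently used words first."""
    ordered = sorted(entries, key=lambda e: e.use_count, reverse=True)
    return [e.word for e in ordered]


def rank_by_recency(entries: List[Entry]) -> List[str]:
    """Most recently used words first."""
    ordered = sorted(entries, key=lambda e: e.last_used, reverse=True)
    return [e.word for e in ordered]
```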
According to the technical solution of this embodiment, in addition to querying the dedicated database matched with the specific application to obtain the first vocabulary list, the general database is also queried to obtain the second vocabulary list, and a certain number of candidate words are then determined from each list for the user to choose from. The determined candidate words therefore include both dedicated words that match the application type of the application in which the input method is currently running and general words that are less related to that application type. This further improves the accuracy of candidate word determination and reduces how often the user has to page through the candidate words to find the target word.
Fig. 2 is a schematic structural diagram of an apparatus for determining candidate words of an input method in an embodiment of the present disclosure. As shown in fig. 2, the apparatus for determining candidate words of an input method specifically includes: a receiving module 210, a first determining module 220, and a second determining module 230.
The receiving module 210 is configured to receive a character input by a user through an input method; the first determining module 220 is configured to determine, according to the characters, a first vocabulary list from a dedicated database matched with an application where the input method is currently located, and determine a second vocabulary list from a general database corresponding to the input method; the second determining module 230 is configured to determine at least one target candidate word corresponding to the character based on the first vocabulary list and the second vocabulary list.
Optionally, the second determining module 230 includes:
a first determining unit, configured to determine at least one first candidate word corresponding to the character from the first vocabulary list;
a second determining unit, configured to determine at least one second candidate word corresponding to the character from the second vocabulary list;
a third determining unit, configured to determine the at least one target candidate word based on the first candidate word and the second candidate word.
Optionally, the first determining unit includes:
a first sorting subunit, configured to sort all the words in the first vocabulary list according to a set sorting rule;
a first selection subunit, configured to select at least one word as the first candidate word according to its sorting position.
Optionally, the second determining unit includes:
a second sorting subunit, configured to sort all the words in the second vocabulary list according to a set sorting rule;
a second selection subunit, configured to select at least one word as the second candidate word according to its sorting position.
Optionally, the set sorting rule includes at least one of: ranking based on frequency of use, ranking based on time of use, and ranking based on first letter.
Optionally, the apparatus further comprises:
a display module, configured to display the target candidate words so that a user can select a target word from the target candidate words as the input word.
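The module decomposition listed above might be mirrored in code roughly as follows. This is a hedged sketch only: the class names, the `Lookup` callables (assumed to return lists that are already sorted, standing in for the sorting subunits), and the default of three candidates per list are hypothetical and do not come from the disclosure.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

# Assumed interface: characters in, already-sorted vocabulary list out.
Lookup = Callable[[str], List[str]]


@dataclass
class FirstDeterminingModule:
    dedicated_lookup: Lookup  # queries the dedicated database
    general_lookup: Lookup    # queries the general database

    def determine_lists(self, chars: str) -> Tuple[List[str], List[str]]:
        return self.dedicated_lookup(chars), self.general_lookup(chars)


@dataclass
class SecondDeterminingModule:
    first_count: int = 3   # hypothetical number of first candidate words
    second_count: int = 3  # hypothetical number of second candidate words

    def determine_targets(self, first_list: List[str],
                          second_list: List[str]) -> List[str]:
        first_candidates = first_list[:self.first_count]     # first unit
        second_candidates = second_list[:self.second_count]  # second unit
        # Third unit: merge both groups, keeping the first occurrence of a word.
        targets: List[str] = []
        for word in first_candidates + second_candidates:
            if word not in targets:
                targets.append(word)
        return targets


@dataclass
class CandidateApparatus:
    first_module: FirstDeterminingModule
    second_module: SecondDeterminingModule

    def receive(self, chars: str) -> List[str]:
        """Plays the receiving module's role and returns the target candidates."""
        first_list, second_list = self.first_module.determine_lists(chars)
        return self.second_module.determine_targets(first_list, second_list)
```

A display module would then present the returned list; it is left out here because rendering depends on the platform.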
With the apparatus for determining candidate words of an input method provided by the embodiment of the present disclosure, in addition to querying the dedicated database matched with the specific application to obtain the first vocabulary list, the general database is also queried to obtain the second vocabulary list, and a certain number of candidate words are then determined from each list for the user to choose from. The determined candidate words therefore include both dedicated words that match the application type of the application in which the input method is currently running and general words that are less related to that application type. This helps determine candidate words that meet the user's expectations, improves the accuracy of candidate word determination, and improves the user experience. When the user works in different applications, the target word to be input appears with high probability in the region where the candidate words are displayed, without the user having to turn pages once or several times, so that the frequency with which the user pages through the candidate words to find the target word is reduced.
The apparatus for determining input method candidate words provided by the embodiment of the present disclosure can perform the steps of the method for determining input method candidate words provided by the embodiment of the present disclosure; the steps and their beneficial effects are not repeated here.
Fig. 3 is a schematic structural diagram of an electronic device in an embodiment of the present disclosure. Referring now specifically to fig. 3, a schematic diagram of an electronic device 500 suitable for use in implementing embodiments of the present disclosure is shown. The electronic device 500 in the embodiments of the present disclosure may include, but is not limited to, mobile terminals such as a mobile phone, a notebook computer, a digital broadcast receiver, a PDA (personal digital assistant), a PAD (tablet), a PMP (portable multimedia player), a vehicle-mounted terminal (e.g., a car navigation terminal), a wearable electronic device, and the like, and fixed terminals such as a digital TV, a desktop computer, a smart home device, and the like. The electronic device shown in fig. 3 is only an example, and should not bring any limitation to the functions and the scope of use of the embodiments of the present disclosure.
As shown in fig. 3, the electronic device 500 may include a processing means (e.g., a central processing unit, a graphic processor, etc.) 501, which may perform various appropriate actions and processes according to a program stored in a Read Only Memory (ROM)502 or a program loaded from a storage means 508 into a Random Access Memory (RAM)503 to implement the determination method of an input method candidate word according to the embodiments of the present disclosure. In the RAM 503, various programs and data necessary for the operation of the electronic apparatus 500 are also stored. The processing device 501, the ROM 502, and the RAM 503 are connected to each other through a bus 504. An input/output (I/O) interface 505 is also connected to bus 504.
Generally, the following devices may be connected to the I/O interface 505: input devices 506 including, for example, a touch screen, touch pad, keyboard, mouse, camera, microphone, accelerometer, gyroscope, etc.; output devices 507 including, for example, a Liquid Crystal Display (LCD), speakers, vibrators, and the like; storage devices 508 including, for example, magnetic tape, hard disk, etc.; and a communication device 509. The communication means 509 may allow the electronic device 500 to communicate with other devices wirelessly or by wire to exchange data. While fig. 3 illustrates an electronic device 500 having various means, it is to be understood that not all illustrated means are required to be implemented or provided. More or fewer devices may alternatively be implemented or provided.
In particular, according to an embodiment of the present disclosure, the processes described above with reference to the flowcharts may be implemented as computer software programs. For example, an embodiment of the present disclosure includes a computer program product comprising a computer program carried on a non-transitory computer readable medium, the computer program containing program code for executing the method illustrated in the flowchart, thereby implementing the method for determining input method candidate words as described above. In such an embodiment, the computer program may be downloaded and installed from a network via the communication means 509, or installed from the storage means 508, or installed from the ROM 502. The computer program performs the above-described functions defined in the methods of the embodiments of the present disclosure when executed by the processing device 501.
It should be noted that the computer readable medium in the present disclosure can be a computer readable signal medium or a computer readable storage medium or any combination of the two. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples of the computer readable storage medium may include, but are not limited to: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the present disclosure, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. In contrast, in the present disclosure, a computer readable signal medium may comprise a propagated data signal with computer readable program code embodied therein, either in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to: electrical wires, optical cables, RF (radio frequency), etc., or any suitable combination of the foregoing.
In some embodiments, clients and servers may communicate using any currently known or future developed network protocol, such as HTTP (HyperText Transfer Protocol), and may be interconnected with any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include a local area network ("LAN"), a wide area network ("WAN"), an internetwork (e.g., the Internet), and peer-to-peer networks (e.g., ad hoc peer-to-peer networks), as well as any currently known or future developed network.
The computer readable medium may be embodied in the electronic device; or may exist separately without being assembled into the electronic device.
The computer readable medium carries one or more programs which, when executed by the electronic device, cause the electronic device to: receiving characters input by a user through an input method; according to the characters, determining a first vocabulary list from a special database which is matched with the current application of the input method, and determining a second vocabulary list from a general database corresponding to the input method; determining at least one target candidate word corresponding to the character based on the first list of words and the second list of words.
Optionally, when the one or more programs are executed by the electronic device, the electronic device may further perform other steps described in the above embodiments.
Computer program code for carrying out operations of the present disclosure may be written in any combination of one or more programming languages, including but not limited to object-oriented programming languages such as Java, Smalltalk, and C++, and conventional procedural programming languages such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any type of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet service provider).
The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
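As one illustration of blocks being executed substantially concurrently, the two database queries could be issued in parallel. The disclosure does not state that the queries run concurrently; the sketch below only shows that such an ordering is possible, and `dedicated_lookup` and `general_lookup` are hypothetical callables supplied by the caller.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List, Tuple

Lookup = Callable[[str], List[str]]


def query_both(chars: str, dedicated_lookup: Lookup,
               general_lookup: Lookup) -> Tuple[List[str], List[str]]:
    """Run the dedicated and general lookups on two worker threads."""
    with ThreadPoolExecutor(max_workers=2) as pool:
        first_future = pool.submit(dedicated_lookup, chars)
        second_future = pool.submit(general_lookup, chars)
        # result() blocks until each lookup has finished.
        return first_future.result(), second_future.result()
```

The returned pair keeps the same roles (first vocabulary list, second vocabulary list) regardless of which lookup finishes first.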
The units described in the embodiments of the present disclosure may be implemented by software or hardware. In some cases, the name of a unit does not constitute a limitation on the unit itself.
The functions described herein above may be performed, at least in part, by one or more hardware logic components. For example, without limitation, exemplary types of hardware logic components that may be used include: Field Programmable Gate Arrays (FPGAs), Application Specific Integrated Circuits (ASICs), Application Specific Standard Products (ASSPs), systems on a chip (SOCs), Complex Programmable Logic Devices (CPLDs), and the like.
In the context of this disclosure, a machine-readable medium may be a tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. The machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. A machine-readable medium may include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples of a machine-readable storage medium would include an electrical connection based on one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
According to one or more embodiments of the present disclosure, there is provided a method for determining an input method candidate word, including:
receiving characters input by a user through an input method;
according to the characters, determining a first vocabulary list from a special database which is matched with the current application of the input method, and determining a second vocabulary list from a general database corresponding to the input method;
determining at least one target candidate word corresponding to the character based on the first list of words and the second list of words.
According to one or more embodiments of the present disclosure, in the method for determining candidate words for an input method provided by the present disclosure, the determining at least one target candidate word corresponding to the character based on the first vocabulary list and the second vocabulary list includes: determining at least one first candidate word corresponding to the character from the first vocabulary list; determining at least one second candidate word corresponding to the character from the second vocabulary list; determining the at least one target candidate word based on the first candidate word and the second candidate word.
According to one or more embodiments of the present disclosure, in the method for determining an input method candidate word, determining at least one first candidate word corresponding to the character from the first vocabulary list includes: sorting all the words in the first vocabulary list according to a set sorting rule; and selecting at least one word as the first candidate word according to its sorting position.
According to one or more embodiments of the present disclosure, in the method for determining an input method candidate word, determining at least one second candidate word corresponding to the character from the second vocabulary list includes: sorting all the words in the second vocabulary list according to a set sorting rule; and selecting at least one word as the second candidate word according to its sorting position.
According to one or more embodiments of the present disclosure, in the method for determining an input method candidate word, the set sorting rule includes at least one of: ranking based on frequency of use, ranking based on time of use, and ranking based on first letter.
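Because the set sorting rule may include "at least one of" the three criteria, one possible reading is that several criteria are combined, with later ones breaking ties left by earlier ones. The sketch below illustrates that reading only; the tuple layout, the rule names, and the tie-breaking interpretation are assumptions and are not stated in the disclosure.

```python
from typing import Callable, Dict, List, Sequence, Tuple

# Assumed record layout: (word, use_count, last_used_timestamp)
Entry = Tuple[str, int, float]

# Each rule maps an entry to a sort key; smaller keys sort first.
RULES: Dict[str, Callable[[Entry], object]] = {
    "frequency": lambda e: -e[1],                # more uses first
    "time": lambda e: -e[2],                     # more recent first
    "first_letter": lambda e: e[0][:1].lower(),  # alphabetical by first letter
}


def sort_words(entries: Sequence[Entry], rules: Sequence[str]) -> List[str]:
    """Sort entries by the named rules, applied in order of priority."""
    if not rules:
        raise ValueError("at least one sorting rule is required")
    ordered = sorted(entries, key=lambda e: tuple(RULES[r](e) for r in rules))
    return [e[0] for e in ordered]
```

For example, `sort_words(entries, ["frequency", "first_letter"])` ranks by use count and orders equally frequent words by their first letter.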
According to one or more embodiments of the present disclosure, the method for determining an input method candidate word further includes: displaying the target candidate words so that the user can select a target word from the target candidate words as the input word.
According to one or more embodiments of the present disclosure, there is provided an apparatus for determining an input method candidate word, including: the receiving module is used for receiving characters input by a user through an input method; the first determining module is used for determining a first vocabulary list from a special database which is matched with the current application of the input method according to the characters and determining a second vocabulary list from a general database corresponding to the input method; a second determining module to determine at least one target candidate word corresponding to the character based on the first vocabulary list and the second vocabulary list.
According to one or more embodiments of the present disclosure, in the apparatus for determining an input method candidate word provided by the present disclosure, the second determining module 230 includes: a first determining unit, configured to determine at least one first candidate word corresponding to the character from the first vocabulary list; a second determining unit, configured to determine at least one second candidate word corresponding to the character from the second vocabulary list; and a third determining unit, configured to determine the at least one target candidate word based on the first candidate word and the second candidate word.
According to one or more embodiments of the present disclosure, in the apparatus for determining an input method candidate word provided by the present disclosure, the first determining unit includes: a first sorting subunit, configured to sort all the words in the first vocabulary list according to a set sorting rule; and a first selection subunit, configured to select at least one word as the first candidate word according to its sorting position.
According to one or more embodiments of the present disclosure, in the apparatus for determining an input method candidate word provided by the present disclosure, the second determining unit includes: a second sorting subunit, configured to sort all the words in the second vocabulary list according to a set sorting rule; and a second selection subunit, configured to select at least one word as the second candidate word according to its sorting position.
According to one or more embodiments of the present disclosure, in the apparatus for determining an input method candidate word provided by the present disclosure, the set sorting rule includes at least one of: ranking based on frequency of use, ranking based on time of use, and ranking based on first letter.
According to one or more embodiments of the present disclosure, the apparatus for determining an input method candidate word provided by the present disclosure further includes: a display module, configured to display the target candidate words so that a user can select a target word from the target candidate words as the input word.
In accordance with one or more embodiments of the present disclosure, there is provided an electronic device including:
one or more processors;
a memory for storing one or more programs;
wherein the one or more programs, when executed by the one or more processors, cause the one or more processors to implement any of the methods for determining input method candidate words provided by the present disclosure.
According to one or more embodiments of the present disclosure, there is provided a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, implements any of the methods for determining input method candidate words provided by the present disclosure.
The embodiments of the present disclosure also provide a computer program product, which includes a computer program or instructions, and when the computer program or instructions are executed by a processor, the method for determining the input method candidate word as described above is implemented.
The foregoing description is merely an illustration of the preferred embodiments of the disclosure and of the technical principles employed. Those skilled in the art will appreciate that the scope of the disclosure is not limited to technical solutions formed by the particular combination of features described above, but also covers other technical solutions formed by any combination of those features or their equivalents without departing from the spirit of the disclosure, for example, a technical solution formed by replacing the above features with (but not limited to) features having similar functions disclosed in this disclosure.
Further, while operations are depicted in a particular order, this should not be understood as requiring that such operations be performed in the particular order shown or in sequential order. Under certain circumstances, multitasking and parallel processing may be advantageous. Likewise, while several specific implementation details are included in the above discussion, these should not be construed as limitations on the scope of the disclosure. Certain features that are described in the context of separate embodiments can also be implemented in combination in a single embodiment. Conversely, various features that are described in the context of a single embodiment can also be implemented in multiple embodiments separately or in any suitable subcombination.
Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are disclosed as example forms of implementing the claims.

Claims (10)

1. A method for determining candidate words of an input method is characterized by comprising the following steps:
receiving characters input by a user through an input method;
according to the characters, determining a first vocabulary list from a special database which is matched with the current application of the input method, and determining a second vocabulary list from a general database corresponding to the input method;
determining at least one target candidate word corresponding to the character based on the first list of words and the second list of words.
2. The method of claim 1, wherein determining at least one target candidate word corresponding to the character based on the first list of words and the second list of words comprises:
determining at least one first candidate word corresponding to the character from the first vocabulary list;
determining at least one second candidate word corresponding to the character from the second vocabulary list;
determining the at least one target candidate word based on the at least one first candidate word and the at least one second candidate word.
3. The method of claim 2, wherein determining at least one first candidate word corresponding to the character from the first vocabulary list comprises:
sorting all the words in the first vocabulary list according to a set sorting rule;
and selecting at least one word as the first candidate word according to its sorting position.
4. The method of claim 2, wherein determining at least one second candidate word corresponding to the character from the second vocabulary list comprises:
sorting all the words in the second vocabulary list according to a set sorting rule;
and selecting at least one word as the second candidate word according to its sorting position.
5. The method of claim 3 or 4, wherein the set sorting rule comprises at least one of: ranking based on frequency of use, ranking based on time of use, and ranking based on first letter.
6. The method according to any one of claims 1-4, further comprising:
and displaying the target candidate words so that the user selects the target words from the target candidate words as input words.
7. An apparatus for determining candidate words for an input method, comprising:
the receiving module is used for receiving characters input by a user through an input method;
the first determining module is used for determining a first vocabulary list from a special database which is matched with the current application of the input method according to the characters and determining a second vocabulary list from a general database corresponding to the input method;
a second determining module to determine at least one target candidate word corresponding to the character based on the first vocabulary list and the second vocabulary list.
8. An electronic device, characterized in that the electronic device comprises:
one or more processors;
storage means for storing one or more programs;
wherein the one or more programs, when executed by the one or more processors, cause the one or more processors to implement the method of determining input method candidate words of any of claims 1-6.
9. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, implements the method for determining an input method candidate word according to any one of claims 1 to 6.
10. A computer program product comprising a computer program or instructions, which, when executed by a processor, implements the method of determining input method candidate words according to any one of claims 1-6.
CN202110449912.7A 2021-04-25 2021-04-25 Method and device for determining candidate words of input method, electronic equipment and storage medium Pending CN113138677A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110449912.7A CN113138677A (en) 2021-04-25 2021-04-25 Method and device for determining candidate words of input method, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110449912.7A CN113138677A (en) 2021-04-25 2021-04-25 Method and device for determining candidate words of input method, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
CN113138677A true CN113138677A (en) 2021-07-20

Family

ID=76813514

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110449912.7A Pending CN113138677A (en) 2021-04-25 2021-04-25 Method and device for determining candidate words of input method, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN113138677A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114791769A (en) * 2022-06-24 2022-07-26 湖北云享客数字智能科技有限公司 Big database establishment method for user behavior prediction result
CN115437510A (en) * 2022-09-23 2022-12-06 联想(北京)有限公司 Data display method and device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101079060A (en) * 2007-03-26 2007-11-28 腾讯科技(深圳)有限公司 Chinese character input simple 'pinyin' implementation method and system
CN101546226A (en) * 2009-05-08 2009-09-30 腾讯科技(深圳)有限公司 English input method and system
CN103455160A (en) * 2012-05-29 2013-12-18 阿里巴巴集团控股有限公司 Method and device for recommending candidate words according to geographic position
CN107102746A (en) * 2016-02-19 2017-08-29 北京搜狗科技发展有限公司 Candidate word generation method, device and the device generated for candidate word
US20190332663A1 (en) * 2016-07-22 2019-10-31 Huawei Technologies Co., Ltd. Candidate-item presentation method and terminal device

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination