CN115344127A - Method, device, equipment and storage medium for determining candidate word display sequence - Google Patents

Method, device, equipment and storage medium for determining candidate word display sequence Download PDF

Info

Publication number
CN115344127A
CN115344127A CN202210884529.9A CN202210884529A CN115344127A CN 115344127 A CN115344127 A CN 115344127A CN 202210884529 A CN202210884529 A CN 202210884529A CN 115344127 A CN115344127 A CN 115344127A
Authority
CN
China
Prior art keywords
candidate word
candidate
determining
words
ranking factor
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210884529.9A
Other languages
Chinese (zh)
Inventor
李婷
王乔飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN202210884529.9A priority Critical patent/CN115344127A/en
Publication of CN115344127A publication Critical patent/CN115344127A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/02Input arrangements using manually operated switches, e.g. using keyboards or dials
    • G06F3/023Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
    • G06F3/0233Character input methods
    • G06F3/0236Character input methods using selection techniques to select from displayed items

Abstract

The disclosure provides a method, a device, equipment and a storage medium for determining a candidate word display sequence, and relates to the field of computers, in particular to a word input method in the field of network computers. The specific implementation scheme is as follows: acquiring a candidate word set corresponding to a specified character, and determining the display times of each candidate word in the candidate word set; determining the screen-on times of each candidate word in the candidate word set; and determining the display sequence of each candidate word in the candidate word set based on the display times and the screen-on times.

Description

Method, device, equipment and storage medium for determining candidate word display sequence
Technical Field
The present disclosure relates to the field of computer technology, and in particular, to a word input method in the field of network computers.
Background
In the activities of the user on the internet, the most important communication mode is text communication through the mobile device. The input method is used as a common character input tool, and for example, corresponding candidate items can be displayed according to a character string input by a user, and the candidate items selected by the user are displayed on a screen.
In the related art, the input method may separately maintain candidate words to be presented for different character strings, and for a plurality of candidate words corresponding to a certain character string, the presentation order among the candidate words may be adjusted by the input method according to a corresponding configuration manner.
Disclosure of Invention
The disclosure provides a method, a device, equipment and a storage medium for determining a candidate word display sequence.
According to an aspect of the present disclosure, a method of determining a candidate word presentation order is provided.
Acquiring a candidate word set corresponding to a designated character, and determining the display times of each candidate word in the candidate word set; determining the screen-on times of each candidate word in the candidate word set; and determining the display sequence of each candidate word in the candidate word set based on the display times and the screen-on times.
According to another aspect of the present disclosure, an apparatus for determining a candidate word presentation order is provided.
The acquisition module is used for acquiring a candidate word set corresponding to the specified character; the determining module is used for determining the display times of the candidate words in the candidate word set; determining the screen-on times of each candidate word in the candidate word set; and the display order of each candidate word in the candidate word set is determined based on the display times and the screen-on times.
According to another aspect of the present disclosure, there is provided an electronic device including:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein the content of the first and second substances,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method referred to above.
According to another aspect of the present disclosure, there is provided a non-transitory computer readable storage medium having stored thereon computer instructions for causing the computer to perform the method referred to above.
According to another aspect of the present disclosure, there is provided a computer program product comprising a computer program which, when executed by a processor, implements the method referred to above.
It should be understood that the statements in this section are not intended to identify key or critical features of the embodiments of the present disclosure, nor are they intended to limit the scope of the present disclosure. Other features of the present disclosure will become apparent from the following description.
Drawings
The drawings are included to provide a better understanding of the present solution and are not to be construed as limiting the present disclosure. Wherein:
FIG. 1 is a flow chart of a method of determining a candidate word presentation order shown in the present disclosure;
fig. 2 is a flowchart of a method for determining a presentation order of candidate words in a candidate word set based on presentation times and screen times, according to the present disclosure;
fig. 3 is a flowchart illustrating another method for determining a presentation order of candidate words in a candidate word set based on presentation times and screen times according to the present disclosure;
fig. 4 is a flowchart illustrating another method for determining a display order of candidate words in a candidate word set based on the display times and the screen-up times according to the present disclosure;
fig. 5 is a flowchart illustrating another method for determining a display order of candidate words in a candidate word set based on the display times and the screen-up times according to the present disclosure;
FIG. 6 is a flow chart illustrating a method for candidate word masking according to rank factor in accordance with the present disclosure;
FIG. 7 is a flow diagram illustrating a method for candidate word masking according to a rank factor according to the present disclosure;
FIG. 8 is a schematic flow chart illustrating presenting candidate words to a user according to the present disclosure;
FIG. 9 is a block diagram of an apparatus for determining a candidate word presentation order according to the present disclosure;
FIG. 10 is a block diagram of another apparatus for determining a candidate word presentation order according to the present disclosure;
FIG. 11 shows a schematic block diagram of an example electronic device that may be used to implement embodiments of the present disclosure.
Detailed Description
Exemplary embodiments of the present disclosure are described below with reference to the accompanying drawings, in which various details of the embodiments of the disclosure are included to assist understanding, and which are to be considered as merely exemplary. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the disclosure. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.
The method for determining the display sequence of the candidate words, provided by the embodiment of the disclosure, can be applied to a scene in which words are input through an input method. For example, in the case where the user inputs a corresponding character, the input method presents the corresponding candidate word based on the character input by the user.
In the activities of the user on the internet, the most important communication mode is character communication through the mobile device. The input method is a commonly used character input tool, and for example, corresponding candidate items can be displayed according to a character string input by a user, and the candidate items selected by the user are displayed on a screen.
In the related art, the input method may separately maintain candidate words to be presented for different character strings, and for a plurality of candidate words corresponding to a certain character string, the presentation order among the candidate words may be adjusted by the input method according to a corresponding configuration manner. However, in the related art, the adjustment of the display order among the candidate words cannot meet the actual use requirement of the user. For example, in a candidate word sorting method in the related art, after a user screens a wrong word by mistake, the word is added to a learned word, and the wrong word may be sorted in the front and appear in an input method candidate column multiple times, which affects user experience and is difficult to remove. For another example, in another candidate word sorting method in the related art, a default configured system word is preferentially considered, and for a learning word generated based on user input and on screen, even if the learning word has the same prefix as the system word, the learning word is still sorted to the rear of the system word and cannot appear in an input method candidate column, so that the user is difficult to input the word, and the input efficiency is low. In addition, another candidate word sorting method is also related in the related art, and the method sorts the candidate words according to the screen-on times of each candidate word, but because further prioritization is still required between candidate words with the same screen-on times (for example, a candidate word with a display time of 10 and a screen-on time of 8 generally needs to be ranked before a candidate word with a display time of 100 and a screen-on time of 8), the method still cannot optimally achieve candidate word sorting.
In view of the above, the present disclosure provides a method for determining a display order of candidate words, which ranks the candidate words by counting the display times and the screen-on times of the candidate words, so as to reduce the possibility that error words that have been screened only once or a very few times are ranked ahead. In addition, the method mainly refers to the display times and the screen-on times of each candidate word. Therefore, for the system words and the learning words with the same prefix, the problem that the learning words can only be arranged behind the system words can be avoided, the method can enable the candidate word ordering to be more suitable for the user requirements, and the user experience can be improved. In addition, the screen-up times of each candidate word are considered, the display times of the candidate words are also taken into consideration, and further sequencing of a plurality of candidate words with the same screen-up times can be additionally achieved.
Fig. 1 is a flowchart illustrating a method for determining a candidate word presentation order according to the present disclosure, and as shown in fig. 1, the method includes the following steps S101 to S103.
In step S101, a candidate word set corresponding to the designated character is obtained, and the number of times of displaying each candidate word in the candidate word set is determined.
In the embodiment of the present disclosure, the specified character may be a single character or a character string composed of a plurality of characters. Further, the candidate word set corresponding to the designated character also refers to a plurality of candidate words maintained individually for the designated character. For example, if a character is designated as a character string composed of "h" and "a", the corresponding candidate word set may include a plurality of candidate words such as "had", "have", and "happy". For another example, if a character is a character string composed of "b" and "a", the corresponding candidate word set may include a plurality of candidate words such as "back", "bad", and "bat".
In step S102, the screen-up times of each candidate word in the candidate word set is determined.
For example, the screen-on times referred to above in this disclosure refer to the total number of times that a candidate word is screened in a history. Accordingly, the above mentioned number of presentations also refers to the total number of times the candidate word is presented in the history
In step S103, a display order of each candidate word in the candidate word set is determined based on the display times and the screen-on times.
According to the method for determining the display sequence of the candidate words, the display sequence of the candidate words can be determined according to the display times and the screen-on times of the candidate words, so that a user can conveniently and quickly find the required candidate words when performing input operation based on the candidate words.
For example, the number of times of displaying and the number of times of displaying each candidate word may be maintained through Beta distribution. For example, can be obtained by
Figure BDA0003762747170000041
And maintaining the display times and the screen-on times of the candidate words. Wherein a represents the number of times the candidate word is displayed, and b represents the number of times the candidate word is not displayed. On the basis, a + b is the display times of the candidate words. In addition, the screen-on times and the display times of the candidate words can be updated in a mode of Beta (p | a, b) + count (m 1, m 2) = Beta (p | a + m1, b + m 2) in the using process of the input method. And m1 is used for updating the screen-on times of the candidate words, and m2 is used for updating the screen-off times of the candidate words. For example, if the input method shows the candidate word and the user screens the candidate word, the value of m1 is 1, and the value of m2 is 0.
For example, for each candidate word, a ratio between the number of times of screen-on and the number of times of display may be determined, and then a ranking factor for ranking may be obtained according to the determined ratio. For convenience of description, the ratio between the number of times of screen-on of the candidate word and the number of times of display of the candidate word is referred to as a first ratio.
Fig. 2 is a flowchart of a method for determining a display order of candidate words in a candidate word set based on display times and screen-on times, as shown in fig. 2, including the following steps.
In step S201, a first ratio is determined for each candidate word in the candidate word set.
In step S202, a ranking factor is determined based on the first ratio.
In step S203, based on the ranking factor, ranking the candidate words in the candidate word set to obtain a display order of the candidate words in the candidate word set.
In the disclosed embodiment, the first percentage may approximately represent the likelihood that the candidate word is on-screen by the user. In other words, if the first percentage is higher, it indicates that the subsequent word is likely to be displayed on the screen by the user. Therefore, the ranking factor is determined through the first proportion, the candidate words frequently used by the user can be ranked in the front, and therefore the operation cost generated by the user for finding the candidate words is reduced.
As a possible implementation, the first ratio can be directly used as the ranking factor.
Fig. 3 is a flowchart of another method for determining a presentation order of each candidate word in a candidate word set based on the presentation times and the screen-on times shown in this disclosure, as shown in fig. 3, the execution methods of step S301 and step S303 in this disclosure embodiment are similar to the execution methods of step S201 and step S203 in fig. 2, and are not described herein again.
In step S302, the first ratio is used as a ranking factor.
According to the method provided by the embodiment of the disclosure, the first ratio is used as a sorting factor to sort the candidate words, so that the sorting mode of the candidate words is fitted with the use requirement of the user, and the convenience of the user operation is improved.
Generally, the number of times of display of candidate words having the same first percentage may have a large difference. For example, the number of times of presentation of the candidate word a is 10, the number of times of screen-up is 8, and the first ratio is 0.8. The display frequency of the candidate word B is 100, the screen-on frequency is 80, and the first proportion is also 0.8. It can be seen that, although the candidate word a has the same first percentage of the value as the candidate word B, the candidate word a should have a higher priority than the candidate word B as can be seen from the analysis of the number of times of screen display and the number of times of display. Therefore, in the above embodiment, the first ratio is directly used as the ranking factor, and the special candidate words cannot be ranked preferentially. In view of this, in an embodiment of the present disclosure, the number of times of displaying the candidate word may be further introduced based on the first percentage, so as to determine the ranking factor through the first percentage and the number of times of displaying.
Fig. 4 is a flowchart of another method for determining a presentation order of each candidate word in a candidate word set based on the presentation times and the screen-on times shown in this disclosure, and as shown in fig. 4, steps S401 and S403 in this disclosure are similar to the execution method of steps S201 and S203 in fig. 2, and are not repeated here.
In step S402, a ranking factor is determined based on the first ratio and the number of impressions.
In the embodiment of the disclosure, on the basis of the first proportion, the display times are used as auxiliary references, so that the confidence of the ranking factor can be improved, and further the display sequence of the candidate words can be more suitable for the user requirements.
In one embodiment, the determination of the ranking factor based on the first ratio and the number of impressions may be implemented, for example, by using the following formula.
By way of example, may be
Figure BDA0003762747170000061
A ranking factor is determined. Wherein j is used for identifying candidate words, T represents the current moment, and T j,t Representing the number of presentations of the candidate word j by the current time,
Figure BDA0003762747170000062
representing a first fraction, sj, of candidate words j by the current time t Representing the ranking factor by which the candidate word j is present. In the above embodiment, for a certain time t, 2lnt can be regarded as a constantAnd (4) counting. Based on the above formula, on one hand, for the case that the number of times of displaying the candidate word is large,
Figure BDA0003762747170000063
the value of the term is small, when the ranking factor S is jt In proportion to the first
Figure BDA0003762747170000064
The smaller the difference between them. In other words, in the case that the number of times of displaying the candidate word is large, the first ratio has a high confidence, and the ranking factor mainly refers to the first ratio. On the other hand, for the case that the number of the candidate words is less,
Figure BDA0003762747170000065
the value of the term is large, at which time the ordering factor S jt Quilt
Figure BDA0003762747170000066
The term effect will be significantly different from the first term. In other words, with fewer candidate words being presented, the user can select the candidate word by
Figure BDA0003762747170000071
The items are used as auxiliary parameters, the confidence of the ranking factor can be guaranteed under the condition that the confidence of the first proportion is low, and then the display sequence of the candidate words is more suitable for the requirements of the user.
In addition, at the initial stage of the input method, each candidate word corresponding to the designated character has only a small number of display times, even is not displayed. In this case, the confidence degrees of the first proportion and the number of presentations are both low, and even if the ranking factor is determined by the first proportion and the number of presentations, the higher confidence degree of the ranking factor cannot be guaranteed. In view of this, in an embodiment of the present disclosure, based on the first ratio and the number of times of display, a ratio between the number of times of screen display of the candidate word and a total number of times of screen display of all candidate words in the candidate word set (for convenience of description, this ratio is referred to as a second ratio in the present disclosure) may be further introduced as an auxiliary parameter, and the ranking factor may be determined according to the first ratio, the number of times of display, and the second ratio.
Fig. 5 is a flowchart of another method for determining a presentation order of each candidate word in a candidate word set based on the presentation times and the screen-on times shown in this disclosure, as shown in fig. 5, steps S501 and S503 in this embodiment of the disclosure are similar to the execution method of steps S201 and S203 in fig. 2, and are not repeated here.
In step S502, a ranking factor is determined based on the first ratio, the number of presentations, and the second ratio.
And the second ratio is the ratio of the screen-on times of the candidate words to the total screen-on times of all the candidate words in the candidate word set.
In the embodiment of the disclosure, the second proportion is introduced on the basis of the first proportion and the display times, so that the confidence of the ranking factor can be further improved. On the basis, the input method can display the candidate words according to the display sequence meeting the requirements of the user in the early stage of use.
In one embodiment, the determination of the ranking factor based on the first ratio, the number of presentations, and the second ratio may be implemented, for example, by using the following formula.
By way of example, may be
Figure BDA0003762747170000072
A ranking factor is determined. Wherein j is used for identifying candidate words, T represents the current moment, and T j,t Representing the number of presentations of the candidate word j by the current time,
Figure BDA0003762747170000073
representing a first ratio, N, of candidate words j by the current time jt Indicating a second ratio, sj, of candidate words j by the current time t And (4) representing the sorting factor of the candidate word j till the current moment, wherein A and 0.6 are correction coefficients.
In addition, for the wrong word of the user mistakenly touching the screen, the first proportion, the display times and the second proportion are all smaller, and the finally obtained ranking factor is also smaller. In other words, in the embodiment of the present disclosure, the wrong word that the user wrongly screens is sorted in the later order of the candidate words in the candidate word set. However, for the user, the actual requirement of the user cannot be met only by mistakenly displaying the wrong word on the screen. In view of this, in one embodiment of the present disclosure, candidate words with ranking factors less than or equal to the ranking factor threshold may be masked, so that the masked candidate words are no longer shown.
Fig. 6 is a flowchart of a method for shielding candidate words according to ranking factors shown in the present disclosure, and as shown in fig. 6, the execution methods of step S601, step S602, and step S603 in the embodiment of the present disclosure are similar to the execution methods of step S201, step S202, and step S203 in fig. 2, and are not repeated herein.
In step S604, candidate words with ranking factors less than or equal to the ranking factor threshold are masked, so that the masked candidate words are no longer presented.
In the embodiment of the disclosure, the wrong words on the screen by mistake of the user can be shielded, so that the wrong words on the screen by mistake are not displayed any more. The wrong words caused by the fact that the user mistakenly puts on the screen are usually learning words obtained by the input method based on user input, but are not system words configured by the input method in a default mode. Therefore, as a preferable mode, word screening and word masking may be performed only on the learning words in the candidate word set.
In a feasible manner, candidate word masking may be implemented by deleting candidate words. In addition, the candidate words to be screened out are considered, and not only may be wrong words on which the user wrongly screens, but also may be candidate words which are not commonly used by the user. Therefore, in another possible way, the candidate word to be masked may not be deleted, and only the candidate word to be masked may be configured not to be presented. On this basis, the masked candidate words and relevant data of the masked candidate words (e.g., the first and second ratios and the ranking factor) may be retained for unmasking if a subsequent ranking factor is again greater than the ranking factor threshold. For example, for a candidate word that is masked, a user may increase a first percentage and a second percentage of the candidate word by inputting and screening the candidate word multiple times, and further increase a ranking factor of the candidate word, so that the masking for the candidate word is released.
Further, in the case that the user inputs a specified character, the input method may display the candidate words in the candidate word set that are not masked according to the determined display order.
Fig. 7 is a flowchart of a method for shielding candidate words according to ranking factors shown in this disclosure, and as shown in fig. 7, the execution methods of step S701, step S702, step S703, and step S704 in this embodiment of the disclosure are similar to the execution methods of step S601, step S602, step S603, and step S604 in fig. 6, and are not described herein again.
In step S705, in response to determining that the user inputs the designated character, the candidate words in the candidate word set that are not masked are presented in the determined presentation order.
By the method provided by the embodiment of the disclosure, a user can acquire a plurality of candidate words which are displayed in the display sequence and are not shielded in a mode of inputting the designated characters, and then select the corresponding candidate words according to actual requirements to execute screen-on operation, so that word input is completed.
Based on the same conception, the embodiment of the disclosure also provides a device for determining the display sequence of the candidate words.
It is understood that, in order to implement the above functions, the apparatus for determining a candidate word presentation sequence provided in the embodiments of the present disclosure includes a hardware structure and/or a software module for performing each function. The disclosed embodiments can be implemented in hardware or a combination of hardware and computer software, in combination with the exemplary modules and algorithm steps disclosed in the disclosed embodiments. Whether a function is performed as hardware or computer software drives hardware depends upon the particular application and design constraints imposed on the solution. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the subject matter of the embodiments of the present disclosure.
Fig. 8 is a schematic flow chart illustrating presenting candidate words for a user according to the present disclosure.
For example, as shown in fig. 8, a candidate word corresponding to a specified character may be targetedAnd centralizing all candidate words, respectively recording the display times of displaying all the candidate words to the user, and recording the screen-up times of operating the screen-up candidate words in the history record by the user, and further respectively maintaining Beta distribution values aiming at the screen-up times and the screen-down times for all the candidate words. Correspondingly, the ranking factor of each candidate word can be calculated according to the maintained screen-on times and the maintained display times, and then the candidate words with the ranking factors smaller than the threshold value of the ranking factors are used as the candidate words to be shielded. Further, under the condition that the user inputs the specified character, a candidate word set corresponding to the specified character can be determined, and candidate words needing to be shielded are screened out from the candidate word set. For example, for candidate word W 1 Candidate word W 2 To candidate word W n If the candidate word W is a plurality of candidate words 2 Is less than or equal to the ranking factor threshold, the candidate word W is selected 2 Masking, subsequently only the candidate word W 1 Candidate word W 3 Candidate word W 4 To candidate word W n And displaying to the user. Furthermore, for each candidate word that is not masked, the candidate words may be ranked according to the ranking factor of each candidate word, and the ranked candidate words are displayed to the user. For example, if the word W is a candidate 1 Candidate word W 3 To candidate word W n The ranking factor of each candidate word is B 1 、B 3 To B n And the magnitude relationship of each ordering factor is B 3 >B n >...>B 1 Then the display sequence among the candidate words is B 3 、B n To B 1 The method provides candidate word sorting according to the display sequence which can meet the requirements of the user, and can improve the convenience of inputting words by the user.
Fig. 9 is a block diagram of an apparatus for determining a candidate word presentation order according to the present disclosure. Referring to fig. 9, the apparatus 800 includes
An obtaining module 801, configured to obtain a candidate word set corresponding to a specified character. The determining module 802 is configured to determine the number of times that each candidate word in the candidate word set is displayed. And determining the screen-on times of each candidate word in the candidate word set. And the display order determining unit is used for determining the display order of each candidate word in the candidate word set based on the display times and the screen-on times.
In one embodiment, the determining module 802 determines the display order of each candidate word in the candidate word set based on the display times and the screen-on times in the following manner: and respectively determining a first ratio of the screen-on times of the candidate words to the display times of the candidate words aiming at each candidate word in the candidate word set. Based on the first ratio, a ranking factor is determined. And sequencing the candidate words in the candidate word set based on the sequencing factor to obtain the display sequence of the candidate words in the candidate word set.
In one embodiment, the determining module 802 determines the ranking factor based on the first ratio in the following manner: the first ratio is taken as a ranking factor.
In one embodiment, the determining module 802 determines the ranking factor based on the first ratio in the following manner: based on the first ratio and the number of presentations, a ranking factor is determined.
In one embodiment, the determining module 802 determines the ranking factor based on the first percentage and the number of impressions using the following formula:
Figure BDA0003762747170000101
j is used for identifying the candidate words, t represents the current time, represents the display times of the candidate words j ending to the current time, represents the first proportion of the candidate words j ending to the current time, and represents the ranking factor of the candidate words j ending to the current time.
In one embodiment, the determining module 802 determines the ranking factor based on the first ratio in the following manner: and determining a ranking factor based on the first ratio, the display times and the second ratio. And the second ratio is the ratio of the screen-on times of the candidate words to the total screen-on times of all the candidate words in the candidate word set.
In one embodiment, the determining module 802 determines the ranking factor based on the first ratio, the number of presentations, and the second ratio using the following formula:
Figure BDA0003762747170000111
wherein j is used for identifying candidate words, and t is whenThe previous moment represents the display times of the candidate word j ending to the current moment, represents a first proportion of the candidate word j ending to the current moment, represents a second proportion of the candidate word j ending to the current moment, and represents a ranking factor of the candidate word j ending to the current moment.
In one embodiment, the apparatus further comprises a processing module.
Fig. 10 is a block diagram of another apparatus for determining a candidate word presentation order according to the present disclosure. As shown in fig. 10, the apparatus 800 further comprises a processing module 803.
The processing module 803 is configured to mask candidate words whose ranking factor is less than or equal to the ranking factor threshold, so that the masked candidate words are no longer displayed.
In one embodiment, the processing module 803 is further configured to: and in response to determining that the user inputs the designated character, displaying the candidate words in the candidate word set which are not masked according to the determined display sequence.
With regard to the apparatus in the above-described embodiment, the specific manner in which each module performs the operation has been described in detail in the embodiment related to the method, and will not be elaborated here.
The present disclosure also provides an electronic device, a readable storage medium, and a computer program product according to embodiments of the present disclosure.
FIG. 11 illustrates a schematic block diagram of an example electronic device 900 that can be used to implement embodiments of the present disclosure. Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other appropriate computers. Electronic devices may also represent various forms of mobile devices, such as personal digital processors, cellular telephones, smart phones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions, are meant to be examples only, and are not intended to limit implementations of the disclosure described and/or claimed herein.
As shown in fig. 11, the apparatus 900 includes a computing unit 901 which can perform various appropriate actions and processes in accordance with a computer program stored in a Read Only Memory (ROM) 902 or a computer program loaded from a storage unit 908 into a Random Access Memory (RAM) 903. In the RAM 903, various programs and data required for the operation of the device 900 can also be stored. The calculation unit 901, ROM 902, and RAM 903 are connected to each other via a bus 904. An input/output (I/O) interface 905 is also connected to bus 904.
A number of components in the device 900 are connected to the I/O interface 905, including: an input unit 906 such as a keyboard, a mouse, and the like; an output unit 907 such as various types of displays, speakers, and the like; a storage unit 908 such as a magnetic disk, optical disk, or the like; and a communication unit 909 such as a network card, a modem, a wireless communication transceiver, and the like. The communication unit 909 allows the device 900 to exchange information/data with other devices through a computer network such as the internet and/or various telecommunication networks.
The computing unit 901 may be a variety of general and/or special purpose processing components with processing and computing capabilities. Some examples of the computing unit 901 include, but are not limited to, a Central Processing Unit (CPU), a Graphics Processing Unit (GPU), various dedicated Artificial Intelligence (AI) computing chips, various computing units running machine learning model algorithms, a Digital Signal Processor (DSP), and any suitable processor, controller, microcontroller, and so forth. The calculation unit 901 performs the respective methods and processes described above, such as a method of determining a candidate word presentation order. For example, in some embodiments, the method of determining a candidate word presentation order may be implemented as a computer software program tangibly embodied on a machine-readable medium, such as storage unit 908. In some embodiments, part or all of the computer program may be loaded and/or installed onto device 900 via ROM 902 and/or communications unit 909. When loaded into RAM 903 and executed by computing unit 901, may perform one or more of the steps of the above-described method of determining a candidate word presentation order. Alternatively, in other embodiments, the computing unit 901 may be configured to perform the method of determining the candidate word presentation order by any other suitable means (e.g., by means of firmware).
Various implementations of the systems and techniques described here above may be implemented in digital electronic circuitry, integrated circuitry, field Programmable Gate Arrays (FPGAs), application Specific Integrated Circuits (ASICs), application Specific Standard Products (ASSPs), system on a chip (SOCs), load programmable logic devices (CPLDs), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may include: implemented in one or more computer programs that are executable and/or interpretable on a programmable system including at least one programmable processor, which may be special or general purpose, receiving data and instructions from, and transmitting data and instructions to, a storage system, at least one input device, and at least one output device.
Program code for implementing the methods of the present disclosure may be written in any combination of one or more programming languages. These program codes may be provided to a processor or controller of a general purpose computer, special purpose computer, or other programmable data processing apparatus, such that the program codes, when executed by the processor or controller, cause the functions/operations specified in the flowchart and/or block diagram to be performed. The program code may execute entirely on the machine, partly on the machine, as a stand-alone software package, partly on the machine and partly on a remote machine or entirely on the remote machine or server.
In the context of this disclosure, a machine-readable medium may be a tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. The machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. A machine-readable medium may include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples of a machine-readable storage medium would include an electrical connection based on one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
To provide for interaction with a user, the systems and techniques described here can be implemented on a computer having: a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to a user; and a keyboard and a pointing device (e.g., a mouse or a trackball) by which a user can provide input to the computer. Other kinds of devices may also be used to provide for interaction with a user; for example, feedback provided to the user can be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and input from the user may be received in any form, including acoustic, speech, or tactile input.
The systems and techniques described here can be implemented in a computing system that includes a back-end component (e.g., as a data server), or that includes a middleware component (e.g., an application server), or that includes a front-end component (e.g., a user computer having a graphical user interface or a web browser through which a user can interact with an implementation of the systems and techniques described here), or any combination of such back-end, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include: local Area Networks (LANs), wide Area Networks (WANs), and the Internet.
The computer system may include clients and servers. A client and server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other. The server may be a cloud server, a server of a distributed system, or a server with a combined blockchain.
It should be understood that various forms of the flows shown above may be used, with steps reordered, added, or deleted. For example, the steps described in the present disclosure may be executed in parallel, sequentially, or in different orders, as long as the desired results of the technical solutions disclosed in the present disclosure can be achieved, and the present disclosure is not limited herein.
The above detailed description should not be construed as limiting the scope of the disclosure. It should be understood by those skilled in the art that various modifications, combinations, sub-combinations and substitutions may be made in accordance with design requirements and other factors. Any modification, equivalent replacement, and improvement made within the spirit and principle of the present disclosure should be included in the protection scope of the present disclosure.

Claims (21)

1. A method for determining a candidate word presentation order comprises the following steps:
acquiring a candidate word set corresponding to a specified character, and determining the display times of each candidate word in the candidate word set;
determining the screen-on times of each candidate word in the candidate word set;
and determining the display sequence of each candidate word in the candidate word set based on the display times and the screen-on times.
2. The method of claim 1, wherein the determining a presentation order of the candidate words in the candidate word set based on the presentation times and the screen-up times comprises:
respectively determining a first ratio for each candidate word in the candidate word set, wherein the first ratio is a ratio of the screen-on times of the candidate word to the display times of the candidate word;
determining a ranking factor based on the first fraction;
and sequencing the candidate words in the candidate word set based on the sequencing factor to obtain the display sequence of the candidate words in the candidate word set.
3. The method of claim 2, wherein the determining a ranking factor based on the first fraction comprises:
and taking the first ratio as a ranking factor.
4. The method of claim 2, wherein the determining a ranking factor based on the first fraction comprises:
determining a ranking factor based on the first percentage and the number of impressions.
5. The method of claim 4, wherein the ranking factor is determined based on the first fraction and the number of impressions using the following formula:
Figure FDA0003762747160000011
wherein j is used for identifying candidate words, T represents the current moment, and T j,t Representing the number of presentations of the candidate word j by the current time,
Figure FDA0003762747160000012
representing a first ratio, S, of candidate words j by the current time jt Representing the ranking factor by which the candidate word j is present.
6. The method of claim 2, wherein the determining a ranking factor based on the first fraction comprises:
determining a ranking factor based on the first proportion, the display times and a second proportion;
and the second ratio is the ratio of the screen-on times of the candidate words to the total screen-on times of all the candidate words in the candidate word set.
7. The method of claim 6, wherein a ranking factor is determined based on the first fraction, the number of presentations, and the second fraction using the following formula:
Figure FDA0003762747160000021
wherein j is used for identifying candidate words, T represents the current moment, and T j,t Representing the number of presentations of the candidate word j by the current time,
Figure FDA0003762747160000022
representing a first ratio, N, of candidate words j by the current time jt Representing a second ratio, S, of candidate words j by the current time jt Representing the ranking factor of the candidate word j by the current time.
8. The method of any of claims 2 to 7, further comprising:
and screening the candidate words with the ranking factors smaller than or equal to the ranking factor threshold value, so that the screened candidate words are not shown any more.
9. The method of claim 8, further comprising:
and in response to determining that the specified characters are input by the user, displaying the candidate words which are not masked in the candidate word set according to the determined display sequence.
10. An apparatus for determining a presentation order of candidate words, comprising:
the acquisition module is used for acquiring a candidate word set corresponding to the specified character;
the determining module is used for determining the display times of the candidate words in the candidate word set; determining the screen-on times of each candidate word in the candidate word set; and the display order of each candidate word in the candidate word set is determined based on the display times and the screen-on times.
11. The apparatus of claim 10, wherein the determining module determines the presentation order of the candidate words in the candidate word set based on the number of presentations and the number of screen shots as follows:
respectively determining a first ratio for each candidate word in the candidate word set, wherein the first ratio is a ratio of the screen-on times of the candidate word to the display times of the candidate word;
determining a ranking factor based on the first fraction;
and sequencing the candidate words in the candidate word set based on the sequencing factor to obtain the display sequence of the candidate words in the candidate word set.
12. The apparatus of claim 11, wherein the means for determining determines a ranking factor based on the first fraction as follows:
and taking the first proportion as a ranking factor.
13. The apparatus of claim 11, wherein the means for determining determines a ranking factor based on the first fraction as follows:
and determining a ranking factor based on the first proportion and the display times.
14. The apparatus of claim 13, wherein the means for determining determines the ranking factor based on the first fraction and the number of impressions using the following formula:
Figure FDA0003762747160000031
wherein j is used for identifying candidate words, T represents the current moment, and T j,t Representing the number of presentations of the candidate word j by the current time,
Figure FDA0003762747160000032
representing a first ratio, S, of candidate words j by the current time jt Representing the ranking factor by which the candidate word j is present.
15. The apparatus of claim 11, wherein the means for determining determines a ranking factor based on the first fraction as follows:
determining a ranking factor based on the first proportion, the display times and a second proportion;
and the second ratio is the ratio of the screen-on times of the candidate words to the total screen-on times of all the candidate words in the candidate word set.
16. The apparatus of claim 15, wherein the means for determining determines the ranking factor based on the first fraction, the number of impressions, and the second fraction using the following formula:
Figure FDA0003762747160000033
wherein j is used for identifying candidate words, T represents the current moment, and T j,t Representing the number of presentations of the candidate word j by the current time,
Figure FDA0003762747160000041
representing a first ratio, N, of candidate words j by the current time jt Representing a second ratio, S, of candidate words j by the current time jt Representing the ranking factor by which the candidate word j is present.
17. The apparatus of any of claims 11 to 16, further comprising a processing module to:
and screening the candidate words with the ranking factors smaller than or equal to the ranking factor threshold value, so that the screened candidate words are not shown any more.
18. The apparatus of claim 17, the processing module further to:
and in response to determining that the specified characters are input by the user, displaying the candidate words which are not masked in the candidate word set according to the determined display sequence.
19. An electronic device, comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein the content of the first and second substances,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of any one of claims 1-9.
20. A non-transitory computer readable storage medium having stored thereon computer instructions for causing the computer to perform the method of any one of claims 1-9.
21. A computer program product comprising a computer program which, when executed by a processor, implements the method according to any one of claims 1-9.
CN202210884529.9A 2022-07-25 2022-07-25 Method, device, equipment and storage medium for determining candidate word display sequence Pending CN115344127A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210884529.9A CN115344127A (en) 2022-07-25 2022-07-25 Method, device, equipment and storage medium for determining candidate word display sequence

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210884529.9A CN115344127A (en) 2022-07-25 2022-07-25 Method, device, equipment and storage medium for determining candidate word display sequence

Publications (1)

Publication Number Publication Date
CN115344127A true CN115344127A (en) 2022-11-15

Family

ID=83950090

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210884529.9A Pending CN115344127A (en) 2022-07-25 2022-07-25 Method, device, equipment and storage medium for determining candidate word display sequence

Country Status (1)

Country Link
CN (1) CN115344127A (en)

Similar Documents

Publication Publication Date Title
US11880382B2 (en) Systems and methods for generating tables from print-ready digital source documents
US20120290947A1 (en) Methods and systems to filter and display electronic messages
CN112084366A (en) Method, apparatus, device and storage medium for retrieving image
CN111859997A (en) Model training method and device in machine translation, electronic equipment and storage medium
CN113590796A (en) Training method and device of ranking model and electronic equipment
CN111563198B (en) Material recall method, device, equipment and storage medium
CN108959259B (en) New word discovery method and system
CN111368153A (en) Searching method and device
CN112560499A (en) Pre-training method and device of semantic representation model, electronic equipment and storage medium
CN112749300A (en) Method, apparatus, device, storage medium and program product for video classification
CN112380847A (en) Interest point processing method and device, electronic equipment and storage medium
WO2022156534A1 (en) Video quality assessment method and device
CN114330249A (en) Information editing method, device, equipment and storage medium
CN111435406A (en) Method and device for correcting database statement spelling errors
CN112329453A (en) Sample chapter generation method, device, equipment and storage medium
CN111680599A (en) Face recognition model processing method, device, equipment and storage medium
CN115344127A (en) Method, device, equipment and storage medium for determining candidate word display sequence
CN111665956B (en) Candidate character string processing method and device, electronic equipment and storage medium
CN114021642A (en) Data processing method and device, electronic equipment and storage medium
CN113408632A (en) Method and device for improving image classification accuracy, electronic equipment and storage medium
CN112560987A (en) Image sample processing method, device, equipment, storage medium and program product
CN111783872A (en) Method and device for training model, electronic equipment and computer readable storage medium
CN111522863A (en) Topic concept mining method, device, equipment and storage medium
CN111695036A (en) Content recommendation method and device
CN111367424A (en) Intelligent adjustment method and device for information candidate items and computer readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination