WO2011127788A1 - Method, device, server and system for character input by user - Google Patents

Method, device, server and system for character input by user Download PDF

Info

Publication number
WO2011127788A1
WO2011127788A1 PCT/CN2011/072268 CN2011072268W WO2011127788A1 WO 2011127788 A1 WO2011127788 A1 WO 2011127788A1 CN 2011072268 W CN2011072268 W CN 2011072268W WO 2011127788 A1 WO2011127788 A1 WO 2011127788A1
Authority
WO
WIPO (PCT)
Prior art keywords
user
network
input
corpus
term
Prior art date
Application number
PCT/CN2011/072268
Other languages
French (fr)
Chinese (zh)
Inventor
戴帅湘
张晓禹
瞿轶蒙
Original Assignee
百度在线网络技术(北京)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from CN2010101485637A external-priority patent/CN102063450A/en
Priority claimed from CN2010101485726A external-priority patent/CN102063451A/en
Priority claimed from CN 201010148571 external-priority patent/CN102063194A/en
Priority claimed from CN 201010187082 external-priority patent/CN102063452A/en
Application filed by 百度在线网络技术(北京)有限公司 filed Critical 百度在线网络技术(北京)有限公司
Publication of WO2011127788A1 publication Critical patent/WO2011127788A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/02Input arrangements using manually operated switches, e.g. using keyboards or dials
    • G06F3/023Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes

Definitions

  • the present invention relates to computer network technology, and more particularly to a method, device, server and system for utilizing a computer network for text input by a user.
  • These input methods are based on a key sequence input by a user on a user device such as a desktop computer, a notebook computer, a mobile phone, a PDA, etc., and search for candidate terms in the local corpus of the user device and then displayed to the user for selection.
  • These input methods dynamically update the local corpus and the order or priority of occurrence of each candidate term in the corpus based on the user's selection of candidate terms.
  • the priority of each entry in the local corpus is The adjustment is more and more in line with the input habits of the particular user, so the text input speed will be faster and faster.
  • these existing input methods also have the following disadvantages:
  • Some input methods can keep the user corpus of the registered user on the web server.
  • the user corpus retained on the web server can be requested. Synchronize with the corpus of the local user device.
  • the corpus may contain some sensitive information that the user often enters. If the user performs a synchronization operation on another person's computer or Internet cafe computer, it will cause sensitive information in the corpus to be downloaded to another person or a public computer. And the information of the original corpus on the other person or on the public computer is also synchronized to the user corpus reserved by the web server for the user. This inevitably leads to confusion and leaks in the user corpus.
  • Local corpus has limited capacity, untimely updates, and limited local device processing capabilities:
  • the web server of these input methods can periodically collect hotspot words on the network and synchronize to the local corpus of the user equipment.
  • the local corpus cannot be too large, and it is impossible to put all these words on the network, such as new online hot words, daily spoken words, new product brands, and latest movies. Emerging words or phrases are synchronized to the local corpus. According to statistics, if the corpus formed by collecting vocabulary or idiomatic phrases and language models appearing on the network may exceed 4G capacity, it is almost impossible to synchronize the 4G network corpus to the local device, especially for handheld This is especially true for thin clients like devices.
  • the input method has a single function and does not provide other additional functions that are convenient for users, such as the web search function.
  • An object of the present invention is to provide a character input method for solving the above problems, and an apparatus, a network server and a system using the character input method.
  • a method for inputting text by a user of the present invention includes the following steps:
  • a user equipment for text input by a user of the present invention includes: means for detecting an input sequence;
  • a network server for text input by a user of the present invention comprises: means for receiving an input sequence on a user equipment via a network;
  • the network entry option for obtaining is returned to the device of the user device.
  • a system for inputting text by a user of the present invention includes the above-described device, and the above-described network server.
  • a cross-platform, installation-free input method is implemented.
  • the centralized large-scale thesaurus and algorithm model maintained by the network server can bring high-accuracy and high-authority input.
  • for user equipment with limited local storage space. In addition, it can save valuable resources;
  • New terms such as new online hot words, slang, new product brands, latest movies, etc. will appear in the user's candidate terms in a timely manner;
  • the user can fully combine the rapid response of the local corpus and the high precision of the large-capacity network corpus in the text entry to improve the accuracy and efficiency of the entry;
  • the overhead of constructing the classified thesaurus is reduced, so that the efficient input method can be realized at a lower cost.
  • FIG. 1 is a flowchart of a method for inputting characters on a user equipment side of a network server communication according to a first embodiment of the present invention
  • FIG. 2 is a flowchart of a method for assisting a user to perform character input on a web server according to the first embodiment of the present invention
  • FIG. 3 is a flow chart showing specific steps of performing a matching query in a network corpus in a method for assisting a user to input text on a web server according to the first embodiment of the present invention
  • FIG. 4 is a block diagram of a user equipment for communicating with a web server for text input by a user in the first embodiment of the present invention; Block diagram of the device;
  • FIG. 6 is a flow chart showing a method of inputting characters on a user equipment terminal that communicates with a network server in a second embodiment of the present invention
  • FIG. 7 is a flowchart of a method for assisting a user to perform character input on a web server according to a second embodiment of the present invention.
  • Figure 8 is a block diagram of a system for assisting a user to perform character input in a second embodiment of the present invention.
  • Figure 10 is a schematic diagram of a group of users registered on a network server in a second embodiment of the present invention.
  • FIG. 11 is a schematic diagram showing an attribute list of a group entry in a group corpus maintained on a network server in a second embodiment of the present invention.
  • Figure 12 is a schematic diagram of a computer network in a third embodiment of the present invention.
  • 13a-13d are schematic diagrams of a human-machine interface displayed at a user equipment for setting an input method according to a third embodiment of the present invention.
  • FIG. 14 is a flowchart of a method for a user to perform character input in a user equipment connected to a network server in a third embodiment of the present invention
  • 15 is a flowchart of a method for assisting a user of a user equipment to perform text input in a network server according to a third embodiment of the present invention.
  • Figure 16 is a screenshot of a text input implemented by a web-based browser in a third embodiment of the present invention.
  • 17 is a block diagram of a user equipment connected to a network server for text input by a user in a third embodiment of the present invention.
  • FIG. 18 is a block diagram of a network server for assisting a user of a user equipment to perform character input in a third embodiment of the present invention.
  • a schematic diagram of a user equipment for inputting information related to search related information a schematic diagram of a user equipment and a network device for inputting information related to search related information; a schematic diagram of user equipment and network equipment for inputting information related to searching related information;
  • FIG. 22 is a flowchart of a method for simultaneously providing search related information related to input information when a user performs character input in a user equipment according to a fourth embodiment of the present invention
  • FIG. 23 is a flowchart of a method for providing search related information related to input information when a user performs text input in cooperation with a network device according to a fourth embodiment of the present invention
  • FIG. 24 is a user in a fourth embodiment of the present invention
  • the mouse and the keyboard are used as input devices for human-computer interaction
  • the display of the user device is used as an output device for human-computer interaction
  • the present invention does not exclude the use of other input devices and
  • an output device for example, the user inputs through a tablet, the user device passes through a speaker as an output device, and the like.
  • the present invention proposes a method, user equipment, network server and system for assisting text entry by using a network server.
  • the present invention is directed to a first embodiment of a method, apparatus, server and system for text input by a user:
  • a user corpus 1403 is stored on the user equipment 140 of the first embodiment of the present invention.
  • the local corpus 1403 stores a basic vocabulary set, a basic language model, and a vocabulary set generated by the user during the input method.
  • the local corpus 1403 can also store some auxiliary information: for example, various setting properties of the input method by the user, including but not limited to fuzzy sound, multi-body, double spell, full spell, barrel spell, etc.; And user attribute information, including but not limited to occupations, hobbies, professional fields, calendars, ages, etc. This auxiliary information helps to optimize the ordering of candidate terms.
  • the user device 140 also has a keyboard 1401 for inputting pinyin letters or stroke sequences of text by the user.
  • the matching device 1402 in the user device 140 finds a matching local term option in the local corpus 1403 based on the entered pinyin letter or stroke sequence and displays it through the display device 1406 for selection by the user.
  • the keyboard can be a pure numeric keyboard or a full alphanumeric keyboard (QWERTY keyboard), or it can be a physical keyboard or a virtual keyboard.
  • the network communication device 1404 and the aggregation device 1405 are added to the user device 140 of the present invention.
  • the network communication device 1404 communicates with the web server 150 via the Internet or a local area network, and transmits the pinyin letters and stroke sequences input through the keyboard 1401 to the web server 150.
  • Web server 150 leverages a vast network corpus and powerful processing power to find suitable term options.
  • the network term option obtained by the web server 150 is returned to the network communication device 1404, and the received network term option is transmitted by the network communication device 1404 to the summary device 1405.
  • the summary device 1405 receives local entry options from the matching device 1402 and from the network communication device 1404 The network entry options are summarized and displayed on the display device 1406 for selection by the user.
  • the web server 150 may be a plurality of web servers 1501 ... 150n distributed over the Internet. These network servers 1501...150n work together to form a server cloud that provides services to a large number of users.
  • the web server 150 can also be one or more servers located on the corporate local area network.
  • step S1101 the key input sequence of the user on the keyboard 1401 of the user device is detected.
  • the key sequence can be one or more phrases or even a sentence of spelling or full spelling.
  • the user has to input "I like to use Baidu search engine", and can input each word initials "wxhybdssyq", and can input each The word “woxihuanyongbaidusousuoyinqing" can also be input into the mixed input "woxhuanybaidssyinq”.
  • the candidate words are more precise, reducing the number of page-turning searches, but more input is required. If all the input spells are spelled, the weight is more, which leads to longer page search time and less efficiency. Therefore, it is usually more effective to mix and match the input.
  • the local corpus 1403 will synchronously add the new entry to the local corpus 1403 according to the user's choice, and the next time the user enters the entry again, the entry can be quickly entered by simply using the spell.
  • step S1102 after obtaining the user's key input sequence, the input sequence is performed in the local corpus 1403 of the user equipment 140 to perform a matching query to obtain one or more matching local entry options.
  • step S1103 the key input sequence is transmitted to the web server 150.
  • steps S1102 and S1103 may be performed sequentially or simultaneously.
  • step S1105 In order to quickly display the obtained entry options, after obtaining the local entry option in step S1102, it is immediately possible to proceed to step S1105, and the obtained local entry options are summarized and displayed to the user for selection.
  • web server 150 receives the key input sequence from user device 140 and looks for a matching network term option in the network corpus.
  • the user equipment 140 receives the network entry option from the web server 150 and sends it to the summary device 1405.
  • the local entry option from the matching device 1402 and the network entry option from the web server 150 are summarized in the summary device 1405 and provided to the display device 1406 for display, which is selected by the user.
  • the term option after summarizing with the network entry option is more precise.
  • step S1105 is followed by a step S1101 to detect the key input of the user equipment.
  • steps S1102 and S1103 can be reversed, and the detected key input sequence is first sent to the network server.
  • the summary device 1405 will generally receive the local entry option before receiving the network entry option.
  • the local entry can be immediately
  • the options are provided to display device 1406 for user selection without having to be displayed with the network entry option.
  • the summary device 1405 in order to quickly display the candidate term, obtains the obtained network term option before receiving the network term option.
  • the local entry options are displayed to the user in the entry bar according to their priority order, wherein the higher the priority, the higher the input term option is displayed.
  • the matching device 1402 can determine the priority level according to the selection frequency of each term option in the user input history and the semantic relevance between each term in each term option.
  • the matching device 1402 can also determine the priority level based on the input preference selection set by the user.
  • the user may have selected a partial entry in the previously displayed local entry option, or has turned the page through a partial local entry. Item.
  • some of the entries in the network term option received may be the same as the previously obtained local term option. It is therefore necessary to remove these entries that have been selected and/or repeated from the network entry options.
  • the remaining network entry options are inserted into the current and subsequent displayed local entry options according to the priority of the entry without changing the order of the local candidate entries.
  • the entry “I like” is inserted into the appropriate position of the subsequent displayed entry option according to its priority.
  • the network entry option "I like” that has been inserted in the local entry option is displayed.
  • the term "I like to use Baidu search engine” may be directly returned, instead of turning the page selection words one by one, so even if the network feedback is slightly delayed, it will still be greatly Speed up the input.
  • the network entry option is a supplement to the local entry option. It only needs to select the network entry option that is not in the local entry option to be inserted in a certain order. It can be in the currently displayed entry. This does not change the order in which the local entry options are displayed, just add the network that is not in the local entry options. Entry options.
  • the second example of the first embodiment of the present invention is similar to the first example.
  • the summary device 1405 after obtaining the local term option, before receiving the network term option, is also The obtained local entry options are first displayed to the user in the entry bar according to their priority order.
  • the difference from the first example is that after receiving the network term option, the network term option remaining after the already displayed term is removed from the network term option is currently displayed and not yet displayed.
  • the local entry options are rearranged according to the priority of these terms and displayed for the user to select.
  • the already displayed term described here refers to the term option that has been viewed before the currently displayed entry is excluded from page turning.
  • the first few terms are "no signal, miniaturization, infinitely good, jokes, and so on.” Since the displayed item has not been selected by the user, the previously displayed item is first removed from the received network entry option, and the remaining network entry options are displayed with the current display and the local display that has not yet been displayed. The term options are rearranged according to the priority of these terms. Since the correct term "I like" has a higher priority in the network entry option, it is rearranged and adjusted to the currently displayed term option.
  • An advantage of this embodiment is the ability to quickly adjust the correct term to the top or top of the term option. Since the same term may have different priorities in the local term option and the network term option, the new order may be determined according to the weighted average of the two priorities of the term when rearranging.
  • the network entry options received later are rearranged according to certain rules with the currently displayed and undisplayed local entry options. At this time, the local entry option will be changed first. Post-order.
  • the third example of the first embodiment of the present invention is similar to the second example. The only difference is that the rearrangement is performed in accordance with the priority of the network entry option, and details are not described herein again.
  • the local entry option is no longer displayed after the received network entry option, but the network entry option is not displayed previously.
  • the fourth example of the first embodiment of the present invention is different from the first to third examples, and after obtaining the local entry option, the obtained local entry option is not immediately displayed to the user according to its priority for its Selecting, but waiting to receive the network entry option, after being arranged according to the priority of these terms, is displayed to the user for selection.
  • the term option displayed in this embodiment is relatively accurate, especially for the case where the entire sentence is continuously input and then the term selection is more advantageous, because the continuous input time of the whole sentence is relatively long, and the delay of the network response does not cause too much Impact, and the matching of the whole sentence requires the support of a larger corpus, language matching model, and processing power, so waiting to receive the network entry option is displayed to the user selection will provide more accurate results.
  • the local term and the network term option are sorted together (the need to eliminate the repeat option), in the case where the network response is very fast and the entire sentence is continuously input. More favorable.
  • the matching device 1402 may be based on whether the entry was previously selected, the chronological order in which the entry was previously selected, the number of times the entry was previously selected, the user-preset input preference option, and/or the number of times the term was searched on the network. To determine the priority of local entry options.
  • the network corpus on the web server 150 can include a user network corpus 1501 and a public network corpus 1504, respectively, corresponding to each user.
  • the user network corpus 1501 is a local corpus 1403 of each registered user backed up on the web server 150.
  • the user equipment 140 further includes a local synchronization device (not shown) for uploading the local corpus 1403 on the user device to the network server 150 or retaining the user on the network according to the user's selection after the registered user logs in to the network server 150.
  • the user network corpus 1501 on the server 150 is synchronized with the local corpus 1403.
  • User equipment 140 also includes A local update device (not shown) is provided for updating the local corpus 1403 based on user selection of the terms and transmitting the selection to the web server 150 to update the user network corpus 1501. Since some words will be repeatedly input in the context, it is necessary to update the local thesaurus 1403 and the user network corpus 1501 in time to increase the priority of the recently entered entry to speed up the input.
  • the user network corpus 1501 also stores a basic vocabulary set, a basic language model, and a vocabulary set generated by the user during the input method. It is also possible to store some auxiliary information: for example, various setting properties of the input method by the user, including but not limited to fuzzy sounds, cascading, double spelling, full spelling, barrel spelling, etc.; and user attribute information, including but not Limited to occupations, hobbies, areas of expertise, calendars, ages, etc.
  • the user network corpus is stored on the web server 150, regardless of which terminal device the user uses, as long as it can connect to the web server 150, it can be quickly entered by synchronizing the local corpus 1403 after login or using the user network corpus 1501 online.
  • the public network corpus 1504 is formed based on analysis and statistics on public documents, publications, input from a large number of users, retrieval vocabulary of a large number of users on a web search engine, index keywords of a large number of web pages, and/or keyword advertisement information, which reflects users.
  • the commonality or hotspot of the group is formed based on analysis and statistics on public documents, publications, input from a large number of users, retrieval vocabulary of a large number of users on a web search engine, index keywords of a large number of web pages, and/or keyword advertisement information, which reflects users.
  • the commonality or hotspot of the group is formed based on analysis and statistics on public documents, publications, input from a large number of users, retrieval vocabulary of a large number of users on a web search engine, index keywords of a large number of web pages, and/or keyword advertisement information, which reflects users.
  • the commonality or hotspot of the group is formed based on analysis and statistics on public documents, publications, input from a large number of users
  • the network server 150 of the first embodiment of the present invention includes a user network corpus 1501, a matching device 1502, a network communication device 1503, a public network corpus 1504, a corpus update device 1505, a keyword advertisement library 1506, and a synchronization device 1507. .
  • the network communication device 1503 is connected to one or more user devices 140 via a network for receiving a key input sequence of the user on the user device via the network, and feeding back the network entry option obtained based on the key input sequence back to the user Equipment, for users to choose.
  • the matching device 1502 is connected to the user network corpus 1501, the public network corpus 1504, and the matching device 1502, for performing matching queries in the user network corpus 1501 and the public network corpus 1504 based on the key input sequence to obtain one or more matching network words. Bar options.
  • the matching device 1502 further includes priority determining means (not shown) for selecting whether the term has been previously selected, the time series in which the term was previously selected, the number of times the term was previously selected, the user-preset input preference option. And/or the number of times the term is searched on the network to determine the priority of each of the matching term options.
  • the synchronization device 1507 is connected to the user network corpus 1501 and the network communication device 1503, for when the user logs in to the network server 150 through the user device 140, after receiving the corpus synchronization instruction from the user device, the user of the user
  • the web corpus 1501 is synchronized with the local corpus 1403 in the user device.
  • the corpus update device 1505 is coupled to the user network corpus 1501 and the public network corpus 1504 for updating the user network corpus 1501 according to user input and word selection, and based on input to a large number of users, a large number of users searching words on a web search engine.
  • the index keywords and/or keyword advertisement information of a plurality of web pages are analyzed and updated to update the public network corpus 1504.
  • the keyword advertisement library 1506 is for providing an advertisement link related to a keyword. Some manufacturers can buy a number of keywords or letter combinations, for example, Baidu can buy "Baidu”, “search engine,” such keywords, you can also buy “bd”, “baidu”, “ssyq”, etc. Such a combination of letters, when the matching device 1502 matches a term such as “Baidu” or “Search Engine” in the network corpus based on the key sequence from the user device 140, or receives "bd”, “baidu”, “ssyq” In such a combination, the corresponding keyword advertisement information "Baidu.com” and its link are found from the keyword advertisement library 1506. The keyword advertisement information is returned to the user device 140 through the network communication device 1503 and displayed in the entry option.
  • the user selects the advertisement information to jump to the corresponding website link.
  • the "selection" here includes a mouse click, and also includes selecting a corresponding digital selection button directly through the keyboard.
  • the priority determining device in the matching device 1502 can be a keyword
  • the advertisement information is assigned a higher priority to ensure that the advertisement information is arranged in the first round of entry options displayed to the user or currently displayed. Display options.
  • the keyword advertisement library 1506 can be incorporated into the public network corpus 1504 as needed. Or merged into the user network corpus 1501.
  • the new keyword advertisement information only needs to be added to the user network corpus 1501 in the web server 150, and downloaded to the local device device 140 in a synchronized manner.
  • the local corpus 1403 includes a keyword advertisement library.
  • the keyword advertisement information is displayed as a local entry option, and the keyword advertisement information has a link, when the user selects the location. You can jump to the relevant link when displaying the ad information. Even if the user device 140 is not connected to the Internet, the entry option of the advertisement information may appear when the text is input, thereby increasing the exposure rate of the advertisement.
  • FIG. 2 is a flow chart of a method for assisting a user to perform text input on a web server in accordance with a first embodiment of the present invention.
  • step S1201 the network communication device 1503 of the web server 150 receives a key input sequence of the user on the user device 140 via the network;
  • step S1202 performing a matching query in the network corpus based on the key input sequence to obtain one or more matching network term options
  • step S1203 the obtained network term option is fed back to the user equipment 140 for user selection.
  • the user can register as a registered user on the web server 150.
  • the registered user can log in to the web server 150 and retain the user web corpus 1501 on the web server 150.
  • the user can also choose whether to synchronize the user network corpus 1501 with the user device's local corpus 1403.
  • the matching device 1502 can perform different matching operations to provide accurate network entry options.
  • Figure 3 shows the processing steps in these cases.
  • FIG. 3 is a flow chart showing the specific steps of performing a matching query in a network corpus in a method for assisting a user to perform text input on a web server according to the first embodiment of the present invention.
  • step S1301 is performed to determine whether the user logs in to the network server 150. If not, the network server 150 cannot determine the user's body. If the user does not use the user network corpus 1501 retained by the user on the network server 150 to perform a matching query, the process proceeds to step S1503, and only in the public network corpus when receiving the key input sequence of the user on the user device. retrieve matching network term options.
  • log in There are various ways to log in, such as logging in with a username and password, automatically logging in with the MAC address of the user device, automatically logging in using the fixed IP address of the user device, and so on.
  • step S1302 determines whether the user's user network corpus 1501 is synchronized with the local corpus 1403 of the user device. If the user is using another person's or public computer, he does not want to synchronize his user's web corpus 1501 to the local device, nor does it want to synchronize the other people's thesaurus on the local device to his user's web corpus 1501, so the user can Choose not to sync. However, users still want to be supported by their user web corpus 1501.
  • step S1302 when it is determined in step S1302 that there is no synchronization, the matching network term option is retrieved in the public network corpus 1504 and the user's user network corpus 1501 upon receiving the user's key input sequence on the user device. If it is determined in step S1302 that no user network corpus has been synchronized with the local corpus of the user equipment, since the user equipment will first find the matching term option in the local corpus, the web server 150 does not have to be repeated in the same user network corpus 1501. The matching term option is found, so that only the matching network term option is retrieved in the public network corpus 1504 upon receiving the user's key input sequence on the user device.
  • the matching device 1402 and the matching device 1502 described above are at the word level according to the user input sequence.
  • the summary device 1405 displays the plurality of matched input term options provided by the matching device 1402 and the matching device 1502 in the item column to the user in priority order, wherein the higher the priority, the higher the input term option is. display.
  • the matching device 1402 and the matching device 1502 can determine the priority level according to the selection frequency of each term option in the user history record and the semantic relevance between each term in each term option.
  • the matching device 1402 and the matching device 1502 can also determine the priority according to the input preference selection set by the user, for example, when the user sets
  • the input preferences are: 1) Priority: Computer vocabulary > Electronic vocabulary > General vocabulary; 2) Priority level: Chinese > English, the input sequence "woshiyongwindows” can be judged “I use Windows software” has the highest priority, "I Use the window “Second, "I use windows” again.
  • the matching device 1402 and the matching device 1502 can also determine the region in which the user equipment is located according to the IP address of the current user equipment, so that the priority of the vocabulary related to the region in the input sequence can be determined, for example, but the user input sequence is "Woxihuanbund” , the translation of "bund” has "1 Embankment 2 Terminal 3 Alliance 4 (Shanghai) Bund", but the network communication device 1503 is known based on the IP address of the user equipment and is currently located in Shanghai, China, so that the "bund” corresponding translation can be determined.
  • “Shanghai Bund” or “Bund” has the highest priority, so the following entry options are available: 1 I like to go to the Bund; 2 I like the Bund; 3 I like the pier; 4 I like the embankment; 5 I like the league.
  • a first embodiment of the present invention further provides a system for a user to input characters, including a user equipment for inputting characters according to the first embodiment of the present invention and an auxiliary user equipment according to the first embodiment of the present invention. Enter the text server.
  • the present invention is directed to a second embodiment of a method, apparatus, server and system for a user to input text,
  • the second embodiment is based on the first embodiment, and the concept of "group corpus" is added to better improve the hit rate of the input method preferences and improve the input efficiency:
  • Multi-users of intranets or local area networks such as enterprise customers, Internet cafes, and translation services, usually have obvious commonalities. This commonality may be the same or similar work content, the same or similar hobbies, the same or similar age stages, the same or similar geographic areas, and the like. A convergence or similarity.
  • the method provides a word frequency adjustment function, which adjusts the display order of the alternative words according to the latest history of the input selection.
  • the adjustment must be learned or trained.
  • the target is a word that is not commonly used or even relatively uncommon, this kind of The tediousness is self-evident.
  • this existing word frequency adjustment is also unstable. For example, when the user changes another computer, it will need to re-learn and train. Therefore, if the previous input of other users having greater commonality can be utilized through the network, the auxiliary user can perform text entry on the user terminal, which will undoubtedly greatly improve the accuracy and efficiency of the text entry.
  • the user equipment for communicating with the web server for text input by the user is the same as that disclosed in the first embodiment.
  • step S2101 the key input sequence of the user on the keyboard 1401 of the user device is detected.
  • step S1101 the first embodiment of the present invention.
  • the user can further input the input, and even if only the first letter of each word is used, the desired result can be quickly obtained because the available group The term is already trained by other users who have something in common with it.
  • step S2102 after obtaining the key input sequence of the user, the input sequence is performed in the local corpus 1403 of the user equipment 140 to perform a matching query to obtain one or more matching local entry options.
  • step S2103 the key input sequence is sent to a web server.
  • steps S2102 and S2103 may be performed sequentially or simultaneously.
  • step S2105 In order to quickly display the obtained term option, after obtaining the local term option in step S2102, it is immediately possible to proceed to step S2105, and the obtained local term options are summarized and displayed to the user for selection.
  • the web server receives the keystroke from the user device 140. Enter the sequence and find the matching group entry option in the group corpus associated with the group to which it belongs.
  • the user equipment 140 receives the group entry option from the web server and sends it to the summary device 1405. Then, proceeding to step S2105, the local entry option from the matching device 1402 and the group entry option from the web server are summarized in the summary device 1405 and provided to the display device 1406 for display, which is selected by the user. Due to the lag of network transmission and server processing, the summary device 1405 will generally receive the local entry option before receiving the group entry option. When the network server has not returned the group entry option, it can immediately localize. The entry options are provided to display device 1406 for user selection without having to be displayed with the group entry option. Of course, the term option after summarizing with the group entry option is more precise.
  • step S2105 is followed by a step S2101 to detect the key input of the user equipment.
  • step S2102 and S2103 can be reversed, and the detected key input sequence is first sent to the web server.
  • the above method may further include a registration step S2106 on the user equipment side, that is, the user may be registered to one or more of the network servers by the network communication device 1404 before the text input is performed.
  • a user group the user group being associated with the group corpus.
  • Such a registration process may employ, for example, a group registration function as is known in the art.
  • Step S2107 of user identity information so that the network server determines the group of users associated with it, and thereby determines the group corpus associated with it.
  • this step is also not an essential step for the web server to determine the user group.
  • the web server itself is an intranet server that is only applicable to one or some groups of users, the user can be considered to be able to utilize the group corpus on the web server without any authentication procedures.
  • the foregoing method may further include the following step S2108: the user sends the selected item to the network server, so that the network server is updated to be associated with the user group to which the user belongs.
  • Group corpus According to this function, each member of the user group can provide its own contribution to the group corpus, such as new terms, self-entrancing input habits, etc. These resources can be collected in the appropriate language in the group corpus.
  • a greater weight is assigned to contributions of members with higher authority or dominance in the user group. For example, new terms provided by department heads have higher priority in feedback.
  • web server 250 includes one or more group corpora 2501 (only one of which is shown for the collapsed view), matching device 2502, network communication device 2503, group management device 2504, and group Corpus management and update device 2505.
  • the network communication device 2503 is connected to one or more user devices 140 via a network for receiving a key input sequence of the user on the user device via the network, and the group obtained based on the key input sequence.
  • the term option is fed back to the user device for selection by the user.
  • the network communication device 2503 further includes identity information receiving means for receiving optional user identity information and forwarding it to the group management device 2504 for determining the management thereof. Household group.
  • the network communication device 2503 further includes a term receiving means for receiving the term finally selected by the user upon input, for forwarding it to the group corpus management and updating means 2505 for updating the terms with the terms.
  • the term receiving device is further configured to receive material related to the user group to forward it to the group corpus management and update device 2505 to use the materials to initially construct the user group to which the user belongs. Associated group corpus.
  • the matching device 2502 is coupled to the user group corpus 2501, the matching device 2502, and the group management device 2504, for performing a matching query in the user group corpus 2501 based on the key input sequence to obtain one or more matching group entries.
  • the option then sends the group entry option to the network communication device 2503 for return to the user device 140.
  • the matching device 2502 selects one or more group corpora corresponding to the input user from the plurality of group corpora 2501 according to the user group information determined by the group management device 2503 to perform a matching query.
  • the matching device 2502 further includes priority determining means (not shown) for prioritizing the user according to the entry source, whether the entry was previously selected, the time sequence in which the entry was previously selected, and the entry being previously The number of selections, the user-preset input preference options, and/or the number of times the term is searched on the network determines the priority of each of the matching term options.
  • the group management device 2504 is connected to the matching device 2502, the network communication device 2503, and the group corpus management and update device 2505.
  • the group management device 2504 is responsible for managing user groups, including receiving user registration information from the network communication device 2503, and registering the user with one or more user groups; maintaining user group information, such as group name, group member ID The group corpus number corresponding to the group, etc.; determining the group to which the group belongs according to the identity of the user, and transmitting the determination result to the matching device 2502 to help it select the group corpus 2501 for performing the matching query.
  • the group management apparatus also assists the group corpus management and update apparatus 2505 in managing and updating the thesaurus, for example, transmitting priority information of a certain user in the user group to the group corpus management and update apparatus 2505, the latter according to This information adjusts the priority attributes of related terms and the like.
  • the group corpus management and update device 2505 is connected to the network communication device 2503 for receiving The term sent by the user is updated to the term of the group corpus 2501 or its attribute.
  • the group corpus management and update device 2505 can also receive material related to the user group from the network communication device 2503, and initialize or update the group corpus 2501 by learning or training it. This feature is very useful for further downsizing user input and reducing the overhead of building a categorical lexicon. For example, for a group working on translation of patent documents in the semiconductor field, users can upload a file containing common semiconductor field terms such as "etching", "vapor deposition", “coating", and the like.
  • the group corpus management and update device 505 uses the material to initialize a group corpus 2501 for the group, thereby eliminating the need for training labor for members to enter the relevant terms for the first time.
  • the group corpus 2501 is an important concept introduced by the present invention, which directly corresponds to a user group, and typically each user group corresponds to a group corpus.
  • the group corpus 2501 contains the most commonly used terms of the group members in the corresponding user group, and sharing the corpus in the group members can save these member users a lot of time-consuming and laborious input method training steps, in other The input of the member directly obtains the input result that you want.
  • the composition of the group corpus 2501 and the attributes of the group entry will be described in detail below with reference to FIG.
  • Fig. 7 is a flow chart showing a method of assisting a user to perform character input on a web server according to the present embodiment.
  • step S2201 the network communication device 2503 of the web server 250 receives a key input sequence of the user on the user device 140 via the network;
  • step S2202 a matching query is performed in the corresponding group corpus based on the key input sequence and the group joined by the user to obtain one or more matching group entry options;
  • step S2203 the obtained group term option is fed back to the user equipment 140 for selection by the user.
  • the foregoing method may further include a registration step S2204 on the network server side, that is, before the text input, the registration information of the user may be received by the network communication device 2503, and the user is registered to one or more user groups.
  • the user group is related to the group corpus Union.
  • Such a registration process may employ, for example, a group registration function as is known in the art.
  • user groups can be established according to various criteria, ie, any commonality that the user has, such as engaging in related work, completing the same task, having similar hobbies, or living in the same city. An example of a user group is detailed below in conjunction with FIG.
  • the web server can immediately determine that it belongs to the business department group based on its identity, and searches for the relevant group entry from the corresponding group corpus.
  • the method may further include the step S2205 of receiving user identity information sent by the user, to determine a group of users associated with the user group, and thereby determining a group corpus associated with the group.
  • this step is also not an essential step for the web server 250 to determine the user group.
  • the web server 250 itself is an intranet server that is only applicable to one or some groups of users, the user can be considered to utilize the group corpus 2501 on the web server 250 without any authentication procedures.
  • the method may further include the step S2206 of receiving the user returning the entry to update the group corpus 2501.
  • each member of the user group can provide its own contribution to the group corpus, such as new terms, self-entrancing input habits, etc. These resources can be collected in the appropriate language in the group corpus.
  • the bar or term attribute for other members of the group to refer to or directly use.
  • a greater weight is assigned to contributions of members with higher authority or dominance in the user group. For example, new terms provided by department heads have higher priority in feedback.
  • FIG. 8 is a block diagram of a system 230 for assisting a user to perform text input in accordance with the present embodiment.
  • system 230 includes a web server 250 and user equipment 140, a network server. 250 and user equipment 140 are connected through a network.
  • the network represents a worldwide network and set of gateways that communicate with each other using, for example, the TCP/IP suite of protocols, which may be the Internet centered on the backbone of high-speed data communication lines between the primary nodes or host computers, which are made up of thousands Tens of thousands of businesses, governments, education, and other computer systems that route data and messages.
  • TCP/IP suite of protocols which may be the Internet centered on the backbone of high-speed data communication lines between the primary nodes or host computers, which are made up of thousands Tens of thousands of businesses, governments, education, and other computer systems that route data and messages.
  • the network can also be implemented as a number of different types of networks, such as, for example, an intranet, a local area network (LAN), or a wide area network (WAN).
  • FIG. 8 is intended as an example and is not a structural limitation of the various exemplary embodiments.
  • FIG. 10 shows a schematic diagram of a group of users registered on a network server according to the present embodiment.
  • the figure shows four user groups, namely "Basketball League” group 2601, “Zhenghua Garden Community” group 2602, “Semiconductor Domain Translation” group 2603, and “Expo Tour” group 2604.
  • These groups represent user groups that are divided according to the user's common interests, living areas, work, and short-term concerns. In fact, there are more grouping criteria, such as players in the same online game, students in the same university, and so on. In real life, users may be inextricably linked, resulting in some commonality among multiple users. Any commonality can be used as the basis for classifying user groups in the input method of the present invention.
  • the user groups 2601 ⁇ 2604 each include their group members, and the group members can register with the group using their real names or network IDs and the like. Members included in different groups may have repetitions or intersections.
  • the "Semiconductor Domain Translation” group 2603 and the “Expo Tour” group 2604 each include a member “Li Meng”
  • all members of the "Basketball League” group 2601 are members of the "Zhenghua Garden” group 2602 because The basketball league is an organization between the owners of the community.
  • members of each user group that are highlighted (indicated by a bold font in the figure) e.g., Zhang Xiaoliang in group 2601) have a higher priority, and other members have a lower priority.
  • This priority distinction is primarily used to help group members prioritize the contribution of terms when contributing to the group corpus.
  • the priority of the term will be assigned or updated according to the priority of the source user, the current priority of the entry, the most recently selected time or number of times.
  • the term priority is an important item in the term attribute and is used to determine the order in which the group terms provided to the user appear in the candidate box. For example, the captain of the "Basketball League” group recently proposed to promote the flag football game to the group members to improve the physical fitness of the members.
  • the term "yq-waist flag" provided by the source user will be given more than a member.
  • the entry "yq- ⁇ " has a higher priority.
  • the group corpus corresponding to the "Zhenghua Garden” group contains the words “xfxch - Shuangfu car wash line”, “syj - Sanyouju hotel", “lj - Lijia baby”
  • FIG. 1 A schematic diagram of an attribute list 2700 of group entries in a group corpus maintained on a web server in accordance with one embodiment of the present invention is shown in FIG. Multiple entries for the group corpus of a basketball league group are listed in this list 2700.
  • the first column 2701 is the content of the entry, such as "zb - walking (basketball technical terminology)", “lb - rebounding (basketball technical terminology)”, “bkl - Barkley (basketball star name)” and “huren - Lakers (American NBA basketball team name).
  • the group corpus also includes "zhhb - Zhenghua Cup (community event name)", “yq - waist flag (group member activity)” Specific group terms.
  • the second to seventh columns 2702 - 2707 of list 2700 list the various attributes of the entry, which are group identification, priority level, number of times selected, time of most recent selection, source user, and/or target user. Among them, the key points are: priority level, source user and/or target user. Attributes.
  • the priority level determines the order in which the user is provided with the entry after the multiple entries are matched, the higher priority entry is placed in the front position, and optionally the group entry can be set. Better than the priority of local terms.
  • the source user is a member user who provides the entry to the corpus management and update device, and this attribute will affect the priority setting of the entry.
  • the target user it refers to the range of users who can provide the term to it.
  • the target user of some terms may be all members, and the target users of other terms may be only a part of the staff, and the association with the entry is excluded. Ordinary staff for the choice of the term.
  • the target users of the terms “bdtb - Baidu Post Bar”, “ssyq - search engine” can be all members, and the terms “zx - compose", “qq”
  • the target user of _infringement can be limited to only members of the company's intellectual property management function to further improve the accuracy of group term feedback and user input efficiency.
  • the term priority attribute can be dynamically adjusted according to the selected time, number of times, source user, etc., so that the order of the returned terms is more in line with the input habits of the group members, and the most accurate terms will be The option is returned to the user.
  • many input methods assign the most convenient space key to the first-order alternative words, and the following alternative words need to be selected by numeric keys, " + " and " - " or other shortcut keys.
  • the later words even need to use the page-turning key. If you can put the most accurate terms in the front position or even the first place, it will undoubtedly save the user's labor and improve the user's input efficiency.
  • a second embodiment of the present invention further provides a system for a user to input characters, including a user equipment for inputting characters and a second embodiment of the present invention for assisting user equipment according to the second embodiment of the present invention. Enter the text server.
  • the third embodiment of the method, device, server and system for the user to input text is provided by the present invention: 12 is a schematic diagram of a typical computer network implementing the present embodiment and the present invention.
  • the various network elements shown include user equipments 311 and 312, Internet 321, 331 and network servers 341-343.
  • the user device 311 is also referred to as a computer 311
  • the user device 312 is also referred to as a mobile phone 312.
  • the number of various network elements shown in FIG. 12 for the sake of clarity may be less than the number in a real network, but this omission is undoubtedly clear and sufficient to the present invention.
  • the disclosure is premised.
  • Those skilled in the art will also appreciate that some other types of network elements are also omitted from the figure, such as a modem, an access device (e.g., a DSLAM), etc., typically located between the user equipment and the Internet.
  • Fig. 12 although three network servers 341 to 343 are shown as being routed by the same router 331, the respective network servers 341 to 343 belong to different routers depending on the geographical area, and serve differently. User equipment in the area. Typically, one such server can be provided for a city, a province, or even a country. Of course, considering the heterogeneity of the distribution of end users, the distribution of servers can be subject to the distribution of end users. In summary, any changes in the network layout are within the scope and spirit of the invention. Schematic diagram of the human-machine interface set by the line.
  • each object indicated by the reference numeral may be a plurality of buttons provided in the Windows operating system, wherein the button is pressed by 3 ⁇ 4 3201, which can be seen in conjunction with Figs. 13a to 13c, when the user controls the mouse.
  • 3 ⁇ 4 3201 Click the left button to change the display state of 3 ⁇ 4 3201 and change the state of the input method.
  • the representation shown in Figure 13c is activated.
  • the color according to 3 ⁇ 4 3201 is distinguished from the 3 ⁇ 4 3201 which is not activated in Figure 13a.
  • the user equipment based on the present embodiment and The interactive input method of the web server becomes the input method for easy representation.
  • FIG 13a it is also shown that by 3 ⁇ 4 3202, a mouse click can be used to switch between Chinese and English input.
  • the present invention is not limited to the input of Chinese and/or English (e.g., according to 3 ⁇ 4 3202'), and to the characters of almost any language that can be input through the user equipment, those skilled in the art. It can also be understood by reading the context that when the input method is used to input Chinese, it is not limited to pinyin input, but also other Chinese input methods such as strokes, five strokes, and the input methods formed by them.
  • the Chinese and English input switching of this input method can also be controlled by a specific button on the keyboard.
  • CapsLock CapsLK
  • this input method will provide the user with a selection list of Chinese characters or words.
  • the user adds the input result to the corresponding input box through further selection of the keyboard and the mouse; correspondingly, when CapsLK is in uppercase state, the input method will not provide a selection list including Chinese characters, without loss of generality, directly Add the English alphabet sequence corresponding to the user's key sequence to the corresponding input box.
  • Fig. 13a it is also shown that by 3 ⁇ 4 3203, the mouse can be switched between the full angle and the half angle.
  • the crescent shape of the 3 ⁇ 4 3203 in Fig. 13a represents the half angle
  • Fig. 13b The circle in 3 ⁇ 4 3203' indicates the full angle.
  • a button 3204 is also shown, which can be switched between Chinese punctuation marks and English punctuation marks by a mouse click.
  • the Chinese punctuation marks are represented by 3 ⁇ 4 3204 in Fig. 13a
  • the pressing in Fig. 13b is shown in Fig. 13b. 3 ⁇ 4 3204, indicating English punctuation.
  • Shown in any of Figures 13a-13c is a control bar that defaults to the lower right corner of the user device screen and is preferably draggable by the mouse.
  • Figure 13d shows another human-computer interaction interface for the user to configure the on/off of the input method, which is specifically a part of a web-based browser, in which "Whether you want to use Baidu online input method (This input method) After the item, two options for opening and closing are provided. By clicking, you can manually control whether this input method is activated.
  • the entry method is activated, even if the computer 311 has other input methods pre-installed, this input method becomes the current default input method.
  • the computer 311 preferably automatically activates other pre-installed input methods such as Google input.
  • the activation of other input methods can also be manually controlled by the user to select other input methods that he wishes to switch to.
  • a popup menu appears, which lists Other available input methods are available for the user to select.
  • a flag indicating the newly selected other input method will appear at the position of 3 ⁇ 4 3201. This is not a problem.
  • the input method can also be the only input method available to the computer 311.
  • the input method can be activated under other conditions, for example, when the user turns on the online input method in the personality setting of FIG. 13d, each time the user accesses a predetermined network address set using the browser. This input method is activated at any address.
  • FIG. 14 and FIG. 12 a method for providing text input to a user in a user equipment exemplified by the computer 311 is introduced.
  • the computer 311 is taken as an example, those skilled in the art understand that the same process can also be performed.
  • Figure 14 is a flow chart showing a method for a user to input text in a user equipment connected to a network server according to the present embodiment.
  • the user for example, Zhang San has activated the input method.
  • Figure 16 It should be understood that the manner of connection between the user equipment and the network server is not limited to a high speed, stable wired connection, but also includes a wireless connection or a wired and wireless hybrid.
  • step S3301 the computer 311 obtains input information provided by the user.
  • Zhang San opens the IE browser, and the home page www.baidu.com is automatically opened, so that the content as shown in FIG. 16 is presented.
  • the search bar 3501 should be empty and the map is located below the search bar. The content should be ignored.
  • Zhang San moves the cursor to a position near or in the search bar 3501, clicks the mouse, and can then input in the search bar 3501.
  • Zhang San Tap the following buttons on the keyboard in turn, each of which has a tap count of 1 and the press time is below a threshold: C, A, 0, M, E, I, W, A, N, G.
  • the information provided by each of the above-mentioned taps of Zhang San is regarded as one input information, that is, when the three presses the C key, one input information indicating that the C key is pressed is Computer 311 is obtained.
  • the input information is identified by a key or a combination of keys indicated by the input information, and is in the form of input information "XX", and the double quoted part indicates the corresponding key or key combination.
  • the input information indicating that the button C is pressed is referred to as input information "C”
  • the input information of the buttons C, A, 0, M, E, and I is sequentially pressed, and is referred to as input information "CAOMEI".
  • step S3301 may be performed in such a manner that the browser retrieves input information provided by the user through a script or function therein. That is, the functional modules of the input method distributed on the computer 311 for obtaining input information are implemented by a web-based browser. Therefore, if Zhang San uses the corresponding browser that provides this input method, then when downloading and installing the browser, he downloads and installs the function module that implements the input method on the client, and can start based on the input method. Enter text.
  • the computer 311 may install a browser-independent application, which is similar to the client software of the input method.
  • the client software is mainly responsible for Subsequent operations such as acquisition of the above input information and transmission of input information which will be mentioned later.
  • step S3302 the computer 311 transmits the input information "C" to the web server.
  • network server the input information specifically depends on depends on the specific configuration of the network shown, as well as routing algorithms and the like. The most direct way is to specify a network server for each user equipment of the IP address segment. Therefore, when the computer 311 clears its own IP address, it can know which network server to send the input information to, or The correspondence between the IP address segment and the network server is stored in the routing device.
  • the network server as the destination can be determined;
  • input information can be sent to multiple network services
  • the network servers communicate with each other to determine a network server that performs subsequent operations; as another of the various alternatives, the network servers shown in FIG. 12 have a division of labor, when Zhang San passes through FIG. 13a.
  • the human-machine interface of 13c sets the input language
  • the input information sent by the computer 311 can carry the corresponding language, such as the Chinese identifier, and then the Chinese input server (if any) is responsible for the subsequent operations.
  • the present invention does not limit the manner of selection of the network server and the manner of interaction between the user equipment and the network server. For example, such interaction may be based on the IP protocol, or may be based on other communication protocols for the Internet.
  • the input information sent from the computer 311 is received by a network server such as the network server 341 (hereinafter referred to as the server 341) in Fig. 12 in step S3401.
  • a network server such as the network server 341 (hereinafter referred to as the server 341) in Fig. 12 in step S3401.
  • the server 341 performs a matching query in the dictionary database based on the input information "C" to generate an alternative set of input items.
  • the server 341 uses different algorithms to translate the input information "C", which includes general English alphabet input, Chinese pinyin, Chinese strokes and the like. Taking the English input as an example, if the association input is not considered, the server 341 will generate a set of alternative entries containing an alternate entry, the English letter C. If you consider association input, then this collection will include at least one word beginning with the letter c. If Chinese Pinyin is used, the set of alternative entries will include the Chinese characters with the initial letter C in Pinyin.
  • step S3403 the server 341 sends the generated set of alternative entries back to the computer 311.
  • the sending process in step S3403 can be implemented based on WEB, and thus, the set of alternative input items will be encapsulated in the transmission unit under the http protocol for transmission.
  • the transmission may also be sent in the form of instant messaging (IM), such as the interaction between the small i-robot and the client.
  • IM instant messaging
  • Zhang San performs Chinese input so this collection includes "from, this, talent, place, eat, out, into, car, poor...,, etc. Chinese characters, each of which is in Chinese. Became an alternative input item or a cartridge input item.
  • the computer 311 notifies the set to the set.
  • This step can employ any known technical means for the computer to provide human readable information, such as screen display, speaker playback, and the like. Without loss of generality, this example uses a screen display as an example.
  • the collection "C” contains more Chinese characters, it may be difficult to display them completely in a prompt bar (as indicated by reference numeral 3505 in Figure 16). Therefore, 5 alternative inputs can be displayed in each line in the prompt bar. Item, and give the sequence number mark before, so that the user can select by pressing the number keys on the keyboard.
  • the prompt bar will also include a 3 ⁇ 4, which is convenient for the user to click on the mouse to display the next line of alternative entries.
  • the user can also command to display an alternate entry for the next line by pressing a designated key on the keyboard, such as a pagedown key.
  • Zhang San will select one of these alternative inputs by mouse and keyboard operation, and confirm by pressing the left mouse button or the corresponding numeric key, thereby giving the computer 311 an indication information, for example, when the mouse is hovering
  • the computer 311 is given an instruction indicating that "From" is the selected entry.
  • step S3305 the computer 311 receives the indication information provided by Zhang San, and accordingly, in step S3306, the Chinese character "from” is used as the input result of the input, and is displayed when the user inputs the designation.
  • the location for example, in the search bar of the browser.
  • step S3307 the computer 311 also transmits this information to the web server 341.
  • the web server 341 saves itself according to the information selected by Zhang San from the alternative input item set "C" (hereinafter referred to as the set "C").
  • the dictionary database is trained and updated. Therefore, when the input method has a considerable user group, the user's choice can be learned, and the vocabulary can be dynamically updated, for example, a new word with more than a predetermined number of users input for a period of time, such as "sharp brother". In the thesaurus.
  • Zhang San does not make a selection from the set "C”, but instead presses the A key on the keyboard, as is known to those skilled in the art, the alternative entry at this time Collection will Will converge.
  • the alternatives as one of the alternatives:
  • the computer 311 performs step S3301 again to obtain input information "A";
  • step S3302 the computer 311 also transmits the input information "A" to the web server 341; thereafter, step S3402 is performed, wherein the web server 341 integrates the previous input information "C” with the new input information "A” to obtain The result "CA” of the integration result "CA”, or a new input information alternative entry, is returned to the computer 311.
  • the process thereafter is similar to the above, and it is not awkward here.
  • the computer 311 caches the set "C” when the step S3303 is first executed.
  • the cached set "C” is Clear.
  • computer 311 will rely on the set "C” in the cache to respond to further input.
  • This kind of cache-dependent scene for example: Zhang San enters “C", “A”, "0” in turn and then selects an input item such as "grass,,, or Zhang” to input “C", "A", "I”. ", then enter the backspace, enter "0", and so on. In this way, the interaction between the client and the server on the communication link under the input method can be appropriately reduced to reduce the occupation of network resources.
  • the computer 311 will take the sequence that has been previously input as the input information when the user prompts it to intercept the input information, and sends it to the server 341.
  • the user inputs C, A, 0, M, E, I, W, A, N, G in sequence, and finally presses a space.
  • This operation of pressing a space triggers the computer 311 to "C, A, 0, M. , E, I, W, A, N, G" are collectively transmitted as input information to the server 341.
  • the screen display on the computer 311 is not lost in general as shown in FIG. 16, wherein the search bar 3501 is the position where the user moves the cursor, and input here, and can be seen.
  • the "caomeiwang” in the input information bar 3502 is also displayed on the screen along with the set of alternative entries returned by the server 341, and is also used between the syllable and the syllable.
  • the mark ",” is separated to give the user a clearer experience.
  • the various alternative entries in the prompt bar 3505, such as 3503 and 3504, are identified by sequential numbers, respectively. If the user selects "Strawberry Net", the three Chinese characters “Strawberry Net" will eventually appear in the search bar 3501. Preferably, the cursor will be located after the "Net” word.
  • the operation of the input method can also be associated with the identity information of the user, which will be described below with reference to Figs. 14 and 15.
  • the input method or the application associated with the input method provides the user with a human-machine interface indicating the identity information thereof.
  • step S3308 is performed to provide a login interface, and the user passes Entering the user name and password to authenticate to the web server 341, and after obtaining the identity information of the user, sending it to the web server 341 in step S3309, if the authentication is passed, the web server 341 will respond to the text input operation of the user thereafter, and this A process also considers the identity information of the user.
  • the manner in which the web server 341 obtains the identity information of the user is not limited thereto.
  • the computer 311 can access user identity information in the operating system or other applications, and if such access can be allowed, the identity of the user can be obtained therefrom.
  • the information is reported to the server 341, which is more suitable for the case where the computer 311 is used for home or other private use.
  • the identity information of Zhang San is received by the server 341 in step S3406.
  • the input history record of Zhang San can be retrieved in step S3407, which is all the characters input by the server 341 in the past period of time, and the input of the input method by the input method, including the three input times from the alternate input. Those inputs selected in the item collection.
  • the input history record retrieved in step S3407 may be applied to the generation process of the candidate input item set in step S3402.
  • the computer 311 first performs a matching query in the dictionary database based on the input information such as "C”. , get the preliminary query result, the content is the same as the alternative input item set "C" in the above example.
  • the computer 311 processes the preliminary query result according to the input history of Zhang San to generate the alternative input item set "C" in the present example.
  • computer 311 compares the input history with the preliminary query results,
  • the contents (input items) also included in the input history in the preliminary query result are arranged in a position in preference to other contents (input items), and the alternative input item set "c" in this example.
  • the input history records may correspond to the input information at the time, and thus, when the preliminary query result is compared with the input history record, only the input history records corresponding to the current input information may be used to arrange each of the input history records. The location of the input item in turn generates a set of alternative input items.
  • help server 341 updates the input history of the user according to the indication information received in step S3404, for example, adding the input indicated by the indication information to the input history. Recording, or, adding the input information to the input history in association with the input indicated by the indication information.
  • FIG. 17 is a block diagram of a user equipment connected to a network server for text input by a user according to the embodiment.
  • the user equipment 311 shown in FIG. 12 is taken as an example, and includes:
  • Obtaining device 3111 configured to obtain input information provided by a user
  • a first sending device 3112 configured to send the input information to the network server, where the network server provides feedback information to the user equipment based on the input information;
  • the first receiving device 3113 is configured to receive feedback information sent back by the network server, and the notification device 3114 is configured to notify the user of the feedback information for further human-computer interaction.
  • the user equipment 311 further includes:
  • An identity obtaining device 3115 configured to acquire identity information of the user
  • the first sending device 3112 is further configured to send the identity information of the user to the network server.
  • the user equipment 311 further includes:
  • a second receiving device 3116 configured to receive indication information provided by the user, used to represent an input item selected by the user in the set of candidate input items, and use the input item as the user input Input result;
  • the first sending device 3112 is further configured to send the indication information to the network server.
  • the user equipment 311 is used by the user to perform text input in the WEB-based application.
  • the user equipment 311 is used by the user to perform text input in the WEB-based browser program.
  • 18 is a block diagram of a network server for assisting a user of a user equipment to perform text input, such as the server 341 shown in FIG. 12, according to the present embodiment, including:
  • a third receiving device 3411 configured to receive input information provided by the user equipment and sent by the user;
  • a generating device 3412 configured to perform a matching query in the dictionary database based on the input information to generate an alternative input item set
  • the second sending device 3413 is configured to send the set of candidate input items to the user equipment.
  • the third receiving device 3411 is further configured to:
  • the generating device 3412 is further configured to: integrate the new input information with previously received input information to obtain an integration result; perform a matching query in the dictionary database based on the integration result to generate a new preparation Select a set of input items;
  • the second transmitting device 3413 is further configured to send the new set of candidate input items to the user equipment.
  • the third receiving device 3411 is further configured to: receive identity information of the user sent by the user equipment;
  • the server 341 further includes: a retrieval device 3414, configured to retrieve an input history record of the user according to the identity information of the user;
  • the generating device 3412 further includes: a querying device 34121, configured to perform a matching query in the dictionary database based on the input information to obtain a preliminary query result; and the processing device 34122 is configured to: according to the input history record of the user, The preliminary query results are processed to generate the set of candidate inputs.
  • a querying device 34121 configured to perform a matching query in the dictionary database based on the input information to obtain a preliminary query result
  • the processing device 34122 is configured to: according to the input history record of the user, The preliminary query results are processed to generate the set of candidate inputs.
  • the processing device 34122 is further configured to: compare the input history record with the preliminary query result, and include the first query result in the input history record.
  • the content is arranged in a position that is prioritized over other content to generate the set of alternative entries.
  • the third receiving device 3411 is further configured to receive indication information from the user equipment, where is used to indicate an input item selected by the user in the set of candidate input items;
  • the server 341 further includes an updating means 3415 for performing at least one of the following items according to the input item indicated by the indication information: - updating the input history of the user; - storing the network server
  • the dictionary database is trained and updated.
  • a third embodiment of the present invention further provides a system for a user to input characters, including a user equipment for inputting characters and a third embodiment of the present invention for assisting user equipment according to the third embodiment of the present invention. Enter the text server.
  • the fourth embodiment of the method, device, server and system for the user to input text is provided by the present invention:
  • Fig. 19 shows a user device 41 for simultaneously providing search related information related to input information when a user performs character input, according to an aspect of the present embodiment. That is, when the user performs text input on the user device 41, the user device 41 provides a corresponding input term option according to the user input sequence, and also searches for related search related information, such as advertisement information and webpage information, according to the input sequence of the user device. , travel information or map information.
  • related search related information such as advertisement information and webpage information
  • travel information or map information travel information or map information.
  • the advertisement information can be any electronic product that can interact with the user through a keyboard, a remote controller, a touch panel, or a voice control device, such as a computer, a smart phone, a PDA, Game consoles, or IPTV, etc.
  • the user equipment 41 includes a first obtaining means 411, a querying means 412, a providing means 413, a storage means 414 for storing the local thesaurus (for the sake of clarity, the following dictionary is referred to as the thesaurus 414), and the user saves the keyword advertisement library.
  • the storage device 414 (for the sake of clarity, the following is referred to as a local advertisement library 414, or referred to as a search information base 414,).
  • a local advertisement library 414 for the sake of clarity, the following is referred to as a search information base 414,).
  • storage devices 414 and 414 may be separate or identical, or may be implemented by a set of memory arrays, respectively.
  • the first obtaining device 411 acquires an input sequence that the user is input in real time through any interactive device that can perform human-computer interaction with the user.
  • the interactive device can be a keyboard, Remote control, touchpad or voice control device. Taking the keyboard as an example, when the user taps the key in the keyboard to input, the first obtaining device 411 acquires the key sequence of the user's tap in real time (for the sake of clarity, the input sequence is still referred to below).
  • the querying device 412 matches the user input sequence provided by the first obtaining device 411 with the thesaurus 414 to obtain one or more matching input term options.
  • the following takes Chinese as an example for description.
  • the present invention allows the user to input Chinese in the manner of full spelling, double spelling, and five strokes.
  • the querying device 412 also searches in the keyword advertisement library 414 based on the user input sequence to obtain the relevant one or more advertisement information options. For example, when the user taps the button to input "woaiwaitan", the query device 412 queries in the thesaurus 414 to obtain a combination of terms such as "1 I love the Bund; 2 I love", and in the advertisement library 414, the query is obtained with "The Bund”.
  • the relevant advertising information includes landmarks such as “Bund No. 3” and “Bund 18", so the advertisement information option “3 Bund No. 3; 4 Bund No. 18" is provided.
  • the process of querying the advertisement information (or searching for related information) related to the input sequence may employ various intelligent or fuzzy search algorithms that are currently known, and will not be described herein.
  • the process of querying the advertisement information (or searching for related information) related to the input sequence may employ various intelligent or fuzzy search algorithms that are currently known, and will not be described herein.
  • the providing means 413 then provides the user with one or more matching input term options obtained in the order in which they are selected for specific input. For example, by displaying to the user in an input window bar of the display, multiple entry options can be displayed in the input sequence, and multiple entry options can be included in the next column for the user to select. Preferably, only one line item option may be displayed in the term column, and the number of the line item option may be default or user-settable, and the previous line or the next line item option is displayed by the user pressing a specific function key.
  • the specific function keys can be, for example, "+" and "-".
  • the advertisement information option may adopt different display manners in the entry bar, such as different colors or gray scales.
  • a web page IP address or a uniform resource identifier (URL) related to the advertisement information is built in the advertisement information option.
  • the user can press the corresponding number of the option Select the ad information option by either the word key or by hovering the mouse to the option to hover or click.
  • the URL targeting device (not shown) in the user device 41 can be directed to its corresponding webpage URL through the network, for example, in a browser open situation, and connected to the web server corresponding to the webpage via the network. And display its web page to the user in the browser.
  • the first obtaining means 411 and the inquiring means 412 and the providing means 413 are continuously operated. Specifically, the first obtaining means 411 acquires the input sequence of the user in real time and continuously supplies it to the querying means 412, for example "w”, "wo” ... “wo” ... "woai” ...
  • the query device 412 also performs a matching query on the user input sequence continuously provided by the first obtaining device 411 in real time to continuously acquire the term options corresponding to the above input sequences, for example, "w” corresponds to "1 me, 2 ⁇ , 3 grips, 4 nests;; “woai” corresponds to “1 I love, 2 ⁇ , 3 grips, 4 nests”; “woaiwaitan” corresponds to "1 I love the Bund, 2 Bund 3, 3 Bund 18,
  • continuous refers to the action that is performed until the user finally selects an entry option. For example, the user may pause for a while after tapping the key sequence "woai", such as 0.5 second. Continue to tap the subsequent buttons.
  • the querying device 412 also obtains its respective priority when performing a matching query in the thesaurus 414 and the advertising library 414 based on the user input sequence to obtain a plurality of input term options and advertising information options.
  • the providing device 413 displays the plurality of matched input term options and advertisement information options provided by the querying device 412 in the item column to the user in a priority order, wherein the higher the priority, the input term option or the advertisement information The more the option is displayed.
  • the preferred input item option of priority is generally placed at the foremost position, so that the user can select by pressing "ENTER" or the space bar, and the advertisement information option is usually placed in each position. The position of the option at the end of the line.
  • the querying device 412 can query in the thesaurus 414 and the advertisement library 414 according to the user characteristics to obtain matching input term options and advertisement information options.
  • the priority level can also be determined based on user characteristics.
  • User characteristics include user's input history, user-set personal preferences, users Attributes, user addresses, etc.
  • User attributes include information about the user's occupation, gender, international, place of birth, age, etc. that reflect personal characteristics.
  • the querying device 412 can also determine the priority level according to the frequency of selection of the vocabulary in each term option or the term option in the user input history, and the semantic relevance between each vocabulary in each term option.
  • the querying device 412 can also determine the priority according to the personal preference selected by the user.
  • the device obtains a plurality of landmark buildings or tourist attractions located on the Bund corresponding to "waitan", such as China Merchants Headquarters, HSBC Building, Citibank, Bund No. 3, Bund 18, etc., and then according to users.
  • the set personal preference can be judged as the highest priority of "Bund No. 3" and "Bund 18," such as shopping and catering.
  • the query device 412 can also judge according to the IP address of the current user equipment.
  • the user device 41 can store the user input history, user set input preferences, and various associations between vocabulary in the local memory.
  • the user equipment 41 may also update information such as saved user input history, input preferences, and inter-vocabulary relevance.
  • the user equipment 41 further includes a second obtaining means 415 and an updating means 416.
  • the second obtaining means 415 obtains the selection of the plurality of input term options provided by the providing device 413 by the user through further interaction with the user.
  • the update device 416 updates the thesaurus and the user input history, vocabulary according to the user selection provided by the second obtaining device 415.
  • the new term option and the priority of the existing term option can be added to the thesaurus 414, user characteristics. More preferably, if the user equipment can access the Internet, the second obtaining means 415 can also search for a new combination of terms on the Internet and update the thesaurus 414 and the like.
  • the ad library 414 in the user device 41 can be actively updated at any time and periodically, e.g., the user device 41 is connected to and synchronized with one or more network devices via a network.
  • the advertisement library 414 may be located outside the user equipment 41, for example, at a network device or distributed among multiple network devices, and the user equipment 41 may be connected to the network device via the network, so as to query and The user enters a sequence of related advertising information options.
  • the network 20 illustrates a user device 41 and a network device 42 for simultaneously providing search related information related to input information when a user performs character input according to another aspect of the present embodiment, wherein the user device 41 is associated with the network device 42 via a network.
  • the network can be the Internet, intranet, etc. That is, when the user performs text input on the user equipment 41, the user equipment 41 sends a query request to the network device 42 via the network, requesting the network device 42 to search for relevant search related information, such as advertisement information, webpage information, according to the user input sequence. The travel information or the map information is then provided to the user along with the search related information fed back by the network device together with the input term option obtained by the network device query.
  • relevant search related information such as advertisement information, webpage information
  • the user equipment 41 includes a first obtaining device 411, a first transmitting device 417, a first receiving device 418, and a providing device 413.
  • the network device 42 includes a second receiving device 421, a querying device 422, a second transmitting device 423, a storage device 424 for saving the network lexicon (for the sake of clarity, the following is called a network vocabulary 424) and for storing keywords
  • the storage device 424 of the advertisement library (for the sake of clarity, the following is called the network advertisement library 424).
  • the first obtaining device 411 acquires an input sequence that the user is input in real time through any interactive device that can perform human-computer interaction with the user.
  • the interactive device can be a keyboard, a remote control, a touch pad or a voice control device, and the like. Take the keyboard as an example, but the user presses the button on the keyboard.
  • the first obtaining means 411 acquires the key sequence of the user's tap in real time (for the sake of clarity, the input sequence is still referred to below).
  • the first transmitting device 417 in the user device 41 transmits the user input sequence provided by the first obtaining device 411 to the network device 42 in real time and continuously.
  • the second receiving device 421 in the network device 42 receives the input sequence and provides it to the querying device 422.
  • the querying device 422 performs a matching query on the user input sequence with the thesaurus 424 to obtain one or more matching input term options.
  • the following is a description of the Chinese language.
  • the present invention allows the user to input Chinese in the manner of full spelling, double spelling, and five strokes.
  • the querying device 422 also searches in the keyword advertisement library 424 according to the user input sequence to obtain related one or more advertisement information options.
  • the query device 422 queries the vocabulary 424 to obtain a combination of terms such as “1 I love the Bund; 2 I love", and in the advertisement library 424, the query is obtained with the "Bund".
  • Relevant advertising information includes landmarks such as “Bund No. 3” and “Bund 18", so the advertisement information option "3 Bund No. 3; 4 Bund No. 18" is provided.
  • the process of querying the advertisement information (or searching for related information) related to the input sequence may employ various intelligent or fuzzy search algorithms that are currently known, and will not be described herein.
  • the second transmitting device 423 in the network device 42 also transmits the input term options provided by the querying device 422 to the user device 41 in real time and continuously.
  • the first receiving device 419 in the user device 41 receives the input term option and provides it to the providing device 413 in real time and continuously, and the providing device 413 then sequentially obtains one or more matching input term options in a certain order.
  • the format is provided to the user for selection to make specific input. For example, by displaying to the user in an input window bar of the display, multiple entry options can be displayed in the input sequence, and multiple entry options can be included in the next column for the user to select.
  • only one line item option may be displayed in the term column, and the number of the line item option may be default or user-settable, and the previous line or the next line item option is displayed by the user pressing a specific function key.
  • the specific function keys for example, can be "+" and "-".
  • the advertisement information option can be displayed differently in the entry bar. Ways, such as different colors or grayscale.
  • a web page IP address or a uniform resource identifier (URL) related to the advertisement information is built in the advertisement information option.
  • the user can select the ad information option by pressing the corresponding number key for the option or by hovering the mouse to hover or click on the option.
  • the user equipment 1 can be directed to its corresponding webpage URL through the network, for example, in the browser open situation, connected to the web server corresponding to the webpage via the network, and display the webpage to the user in the browser. .
  • the first obtaining means 411, the first transmitting means 417, the first receiving means, and the second receiving means 421, the inquiring means 412 and the second transmitting means 423 of the network device 42 are continuously in between Constantly cooperate with the work.
  • the first obtaining device 411 is
  • the first transmitting device 417 also transmits various input sequences to the network device 42 in real time and continuously.
  • the second receiving device in the network device 42 sends the query device 422, and the query device 422 then performs a matching query on the user input sequence continuously provided by the first receiving device 421 in real time to continuously acquire the words corresponding to the above input sequences.
  • “w” corresponds to "1 me, 2 ⁇ , 3 grips, 4 nests”
  • "woai” corresponds to "1 I love, 2 ⁇ , 3 grips, 4 nests”
  • "woaiwaitan” corresponds to "1 I love the Bund” 2, Bund 3, 3 Bund 18,.
  • buttons “continued”, refers to the action that is always performed before the user finally selects an entry option, such as the user tapping the key sequence. After “woai”, you may pause for a while, such as 0.5 seconds, and then continue to tap the subsequent buttons.
  • the querying device 422 also obtains its respective priority when performing a matching query in the network vocabulary 424 and the network advertisement library 424 according to the user input sequence to obtain a plurality of input term options and advertisement information options.
  • the providing device 413 in the user equipment 41 displays a plurality of matching input term options and advertisement information options provided by the network device 42 in the item column to the user in a priority order, wherein the higher the priority, the input word The top option or ad information option is displayed.
  • the priority is the best
  • the entry options are generally placed in the front position so that the user can select by pressing ''ENTER' or the space bar, and the ad information option is usually placed in the last option position in each line.
  • the querying device 422 of the network device 42 can acquire the user feature according to the ID of the user login.
  • the ID of the user login For example, user input history, user-specific user vocabulary, user-set personal preferences, user attribute information, and the like.
  • the user features may be stored in the network device 42 or in other network devices to which the network device 42 is connected.
  • the querying device 422 can query the network vocabulary 424 and the network advertisement library 424 according to the user characteristics to obtain matching input term options and advertisement information options. Specifically, the querying device 422 can determine the priority level according to the frequency of selection of the vocabulary in each term option or the term option in the user input history, and the semantic relevance between each vocabulary in each term option. The querying device 422 can also determine the priority according to the personal preference selected by the user.
  • the query The device queries the network advertisement library 424 to obtain a plurality of landmark buildings or tourist attractions located on the Bund corresponding to "waitan", such as China Merchants Headquarters, HSBC Building, Citibank, Bund No. 3, Bund 18, etc., and then according to users.
  • the set personal preference can be judged as the highest priority for shopping, dining and other architectural attractions such as "The Bund No. 3" and "The Bund 18".
  • the querying device 422 can also determine the region in which the user is located according to the IP address of the current user equipment, so that the priority of the vocabulary related to the region in the input sequence can be determined, for example, but the user input sequence is "woxihuanbund", where The translation of "bund” has “1 Embankment 2 Terminal 3 Alliance 4 (Shanghai) Bund", when the inquiry device 412 is known based on the IP address of the user equipment, currently located in Shanghai, China, so that the "bund” corresponding translation in the "Shanghai Bund” or “The Bund” has the highest priority, so the following entry options are available: “I like to go to the Bund; 2 I like the Bund; 3 I like the pier; 4 I like the embankment; 5 I like the league.”
  • the network device 42 may also update information such as saved user input history, input preferences, and inter-vocabulary associations.
  • the user equipment 41 further includes a second obtaining device 415 and a third transmitting device 418.
  • the network device 42 further includes a second receiving device 425 and an updating device 426.
  • the second obtaining means 415 of the user equipment 41 acquires the selection of the plurality of input term options provided by the providing device 413 by the user through further interaction with the user, and is transmitted by the third transmitting device 418 to the network device.
  • the update device 426 updates the thesaurus and the user input history, the association between the vocabularies, and the like according to the user selection received by the second receiving device 425. For example, a new term option and an existing term option may be added to the network thesaurus 424. Priority, user characteristics. More preferably, the network device 42 may further include a third obtaining means (not shown), which may also search for a new combination of terms on the Internet and update the network vocabulary 424 and the like.
  • the advertisement library 414 may be located outside the network device 42, for example at another network device or distributed among other network devices, and the network device 42 may be connected to the other network via the network.
  • the devices are connected to query the advertising information options associated with the user input sequence.
  • Fig. 21 shows another preferred example according to the present embodiment, wherein the user device 41 itself also includes a query means 412 and a memory 414 (hereinafter referred to as a local thesaurus 414) for storing a local thesaurus, a local thesaurus 414 and The user-specific user vocabulary in the network vocabulary of the network device 42 is synchronized at any time or periodically.
  • a query means 412 and a memory 414 (hereinafter referred to as a local thesaurus 414) for storing a local thesaurus, a local thesaurus 414 and
  • the user-specific user vocabulary in the network vocabulary of the network device 42 is synchronized at any time or periodically.
  • the first obtaining device 411 may first provide the user input sequence to the query device 412 of the user equipment 41 for matching query.
  • the specific query process is as described above with reference to FIGS. 19-20.
  • the content of the description is not described herein; the first obtaining device 411 can also send the user input sequence to the network device 42 through the third sending device 418, and the query device 422 performs a matching query to obtain one or more
  • the input term option and the advertisement information option related to the user input sequence are specifically referred to as described above with reference to FIG. 20, and the content reference is not described herein.
  • the user device 41 also includes a merging device 420 that will enter one or more input term options from the query device 412 itself and from the network.
  • One or more input term options provided by the querying device 422 of the device 42 are merged, the repeating options therein are deleted, and a plurality of terminology options resulting from the final merge and the feedback and input from the network device 42 are determined according to certain rules.
  • the prioritized order of the sequence-related advertising information options is then provided to the providing means 413, which provides them to the user in a corresponding priority order.
  • the input term option provided by the network device 42 should be more accurate, so the priority is higher than the local query to obtain the input term option, and similarly, the ad information option is usually placed on each line so as not to affect the user's text input. The option position at the end of the middle.
  • Fig. 22 is a flowchart showing a method of simultaneously providing search related information related to input information when a user performs character input in a user equipment according to an aspect of the embodiment. That is, when the user performs text input on the user device 41, the user device 41 provides a corresponding input term option according to the user input sequence, and also searches for related search related information, such as advertisement information and webpage information, according to the input sequence of the user device. , travel information or map information.
  • related search related information such as advertisement information and webpage information
  • step S41 the user device 41 acquires the input sequence that the user is input in real time through any interactive device that can perform human-computer interaction with the user.
  • the interactive device can be a keyboard, a remote control, a touch pad or a voice control device, and the like. Taking the keyboard as an example, when the user taps a key in the keyboard to input, the user device 41 acquires the key sequence of the user's tap in real time (for the sake of clarity, the input sequence is still referred to below).
  • the lexicon hereinafter referred to as the local lexicon) performs a matching query to obtain one or more matching input terms.
  • the following takes Chinese as an example for description. This embodiment allows the user to input Chinese in the manner of full spelling, double spelling, and five strokes.
  • the user equipment also searches in the locally stored keyword advertisement library (hereinafter referred to as a local advertisement library) according to the user input sequence, and obtains one or more related advertisement information options.
  • a local advertisement library hereinafter referred to as a local advertisement library
  • the user device 41 queries the local vocabulary to obtain a combination of terms such as "1 I love the Bund; 2 I love", and in the advertisement library 414, the query is obtained with "The Bund”.
  • “The relevant advertising information includes "Bund No. 3" and "Bund 18".
  • the landmark building therefore offers the advertising information option "3 Bund 3; 4 Bund 18".
  • the process of querying the advertisement information (or searching for related information) related to the input sequence may adopt various intelligent or fuzzy search algorithms that are currently known, and will not be described herein.
  • the user device 41 provides the obtained one or more matching input term options to the user in a certain order and format for selection. For example, by displaying to the user in an input window column in the display of the user device 41, multiple entry options and the input sequence can be displayed in columns, and multiple entry options can be included in the next column for the user to select. .
  • the specific function keys can be, for example, "+" and "-".
  • the advertisement information option may use different display modes in the entry bar, such as different colors or gray scales.
  • the advertisement information option has a web page IP address or a uniform resource identifier (URL) associated with the advertisement information.
  • URL uniform resource identifier
  • the user and user device 41 can make further human-computer interaction according to the provided input term options.
  • the user can select the advertisement information option by pressing the corresponding numeric key of the option on the keyboard of the user device 41 or by moving the cursor of the user device 41 to hover or click at the option.
  • the user equipment 41 can be directed to its corresponding webpage URL through the network, for example, in a browser open situation, connected to the web server corresponding to the webpage via the network, and the webpage is displayed in the browser. user.
  • the steps S41 to S43 are continuously cycled.
  • the query is performed locally in the step.
  • the user continuously inputs “w”, “wo” ... “wo” ... “woai” ... “woaiwaitan”, and in step S42, the user device 41 also Matching queries based on the continuously acquired user input sequence in real time to continuously obtain the entry options corresponding to the above input sequences, for example, “w” corresponds to "1 me, 2 ⁇ , 3 grips, 4 nests";”woai"Correspondence" 1 I love, 2 ⁇ , 3 grips, 4 nests;; “woaiwaitan” corresponds to "1 I love the Bund, 2 Bund three No. 3, No.
  • the user equipment 41 also obtains its own priority when performing a matching query in the thesaurus and the advertisement library according to the user input sequence to obtain a plurality of input term options and advertisement information options.
  • the user equipment 41 displays the plurality of matched input term options and advertisement information options obtained by the query in the item column to the user in priority order, wherein the higher the priority, the input term option Or the higher the ad information option is displayed.
  • the preferred input item option of priority is generally placed at the foremost position, so that the user can select by pressing "ENTER" or the space bar, and the advertisement information option is usually placed in each position. The position of the option at the end of the line.
  • the user equipment 41 may further perform a query in the thesaurus and the advertisement library according to the user characteristics, and obtain matching input term options and advertisement information options.
  • the priority level can also be determined based on user characteristics.
  • User characteristics include the user's input history, user-set personal preference selection, user attributes, user address, etc.
  • User attributes include the user's occupation, gender, international, place of birth, age, etc., reflecting personal characteristics.
  • the user equipment 1 may determine the priority level according to the frequency of selection of the vocabulary in each term option or the term option in the user input history, and the semantic relevance between each vocabulary in each term option.
  • the user equipment 41 can also determine the priority level according to the personal preference selection set by the user.
  • the user The device 41 queries the advertisement library to obtain a plurality of landmark buildings or tourist attractions located on the Bund corresponding to "waitan", such as China Merchants Headquarters, HSBC Building, Citibank, Bund No. 3, and Bund 18, etc.
  • the personal preference may determine that the "Bund No. 3", "The Bund 18," and other architectural attractions that are mainly shopping and catering have the highest priority.
  • the user equipment 41 may also be based on the current user equipment. IP address to determine the region in which it is located, so that the input sequence can be determined
  • the priority of the vocabulary associated with the territory in the column for example, but the user input sequence is
  • the user device 41 can store the user input history, user set input preferences, and various associations between vocabulary in the local memory. Preferably, the user device 41 may also update information such as saved user input history, input preferences, and inter-vocabulary relevance.
  • the user device 41 also obtains the user's selection of the provided plurality of input term options by further interaction with the user, and then updates the thesaurus and user input according to the acquired user selection. Historical records, vocabulary associations, etc., for example, can add new entry options and prioritization of existing term options in the thesaurus, user characteristics. More preferably, if the user equipment can access the Internet, in step S45, the user equipment 41 can also search for a new combination of terms on the Internet and use it to update the thesaurus and the like.
  • the advertisement library in the user device 41 can be actively updated at any time and periodically, for example, the user device 41 is connected to one or more network devices via a network, and in another preferred example, the advertisement library It may be located outside the user equipment 41, for example at a network device or distributed among a plurality of network devices.
  • the user device 41 may be connected to the network device via the network to query an advertisement related to the user input sequence. Information option.
  • the user equipment 41 is connected to the network device 42 via a network, and the network may be the Internet, an intranet, or the like. That is, when the user performs text input on the user equipment 41, the user equipment 41 sends a query request to the network device 42 via the network, requesting the network device 42 to search for relevant search related information, such as advertisement information, webpage information, according to the user input sequence.
  • relevant search related information such as advertisement information, webpage information, according to the user input sequence.
  • the travel information or the map information is then provided to the user along with the search related information fed back by the network device together with the input term option obtained by the network device query.
  • network device 42 maintains a network vocabulary and a keyword advertisement library (for the sake of clarity, the following is referred to as a network advertisement library or a search related information library).
  • the user equipment 41 may be a keyboard, a remote controller, a touch pad or a voice control device or the like by any one of the interactive devices.
  • the user device 1 acquires the key sequence of the user's tap in real time (for the sake of clarity, the input sequence is still referred to below).
  • step S42 the user equipment 41 transmits the acquired user input sequence to the network device 42 in real time and continuously.
  • step S43 the network device 42 performs a matching query in the network lexicon according to the received user input sequence to obtain one or more matching input term options.
  • the following takes Chinese as an example for description.
  • the present invention allows the user to input Chinese in the manner of full spelling, double spelling, and five strokes.
  • web or multiple ad information options For example, when the user taps the button to input "woaiwaitan", the network device 42 queries the network vocabulary to obtain a combination of terms such as "1 I love the Bund; 2 I love", and inquires in the online advertisement library to obtain the "Bund".
  • Relevant advertising information includes landmarks such as "Bund No. 3" and "Bund 18", so the advertisement information option "3 Bund No. 3; 4 Bund No. 18" is provided.
  • the process of querying the advertisement information (or searching for related information) associated with the input sequence may employ various currently known intelligent or fuzzy search algorithms and will not be described herein.
  • step S45 the network device 42 also inputs the queried entry in real time and continuously.
  • the option is sent to the user device 41.
  • step S47 the user equipment 41 receives the input term option from the network device 42 in real time and continuously and provides it to the user, and the user device 41 can press the obtained one or more matching input term options to be determined.
  • the order and format are provided to the user for selection or specific interaction. For example, by displaying to the user in an input window column of the display, multiple entry options can be displayed in the input sequence column, and multiple entry options can be included in the next column for the user to select.
  • only one line item option may be displayed in the term column, and the number of the line item option may be default or user-settable, and the previous line or the next line item option is displayed by the user pressing a specific function key.
  • the specific function keys can be, for example, "+,, and "-”.
  • the advertisement information option may use different display modes in the entry bar, such as different colors or gray scales.
  • the advertisement information option has a web page IP address or a uniform resource identifier (URL) associated with the advertisement information.
  • URL uniform resource identifier
  • step S48 the user and user device 41 can perform further human-computer interaction according to the provided input term options.
  • the user can select this ad information option by pressing the corresponding number key for that option or by hovering the mouse to hover or click on the option.
  • the user device 41 can be directed to its corresponding webpage URL through the network, for example, in a browser open situation, connected to the web server corresponding to the webpage via the network, and displaying the webpage in the browser. To the user.
  • the steps S41 to S47 are continuously cycled. Specifically, at step
  • Network device 42 such as “w”, “wo” ... “wo” ... “woai” ... “woaiwaitan”, network device 42 also performs matching queries in real time and continuously according to the user input sequence, and The sequence of the entered input vocabulary is continuously sent back to the user device 41, for example, “w” corresponds to “1 me, 2 ⁇ , 3 grips, 4 nests”;”woai” corresponds to "1 I love, 2 ⁇ , 3 grips, 4 nests; “woaiwaitan” corresponds to "1 I love the Bund, 2 Bund 3, 3 Bund 18".
  • buttons refer to a manner of action that is always performed before the user finally selects an entry option, such as a user tapping.
  • button sequence "woai” it may pause for a while, such as 0.5 seconds, and then continue to tap the subsequent buttons.
  • the network device 42 also obtains its respective priority when performing a matching query in the network thesaurus and the network advertisement library according to the user input sequence to obtain a plurality of input term options and advertisement information options.
  • the user equipment 41 displays the plurality of matched input term options and advertisement information options provided by the network device 42 in the entry column to the user in a priority order, wherein the higher the priority, the input word The top option or ad information option is displayed.
  • the preferred input entry option is generally placed at the foremost position, so that the user can select by pressing ' 'ENTER' or the space bar, and the advertisement information option is usually placed. The last option position in each line.
  • the network device 42 can also acquire the user feature according to the ID of the user login.
  • the user inputs a history record, a user-specific user vocabulary, a user-set personal preference, user attribute information, and the like.
  • the user features may be stored in the network device 42 or in other network devices to which the network device 42 is connected.
  • the network device 42 may perform a query in the network thesaurus and the network advertisement library according to the user characteristics to obtain matching input term options and advertisement information options. Specifically, in step S43, the network device 42 may determine, according to the frequency of selection of the vocabulary in each term option or the term option in the user input history record, and the semantic relevance between each vocabulary in each term option. Its priority is high. The network device 42 can also determine the priority according to the personal preference selected by the user.
  • the network device 42 queries the network advertisement library to obtain a plurality of landmark buildings or tourist attractions located on the Bund corresponding to "waitan", such as China Merchants Headquarters, HSBC Building, Citibank, Bund No. 3, Bund 18, etc., and then according to users.
  • the set personal preference can judge that the "Bund No. 3", “The Bund 18," and other architectural attractions that are mainly shopping and dining have the highest priority.
  • the network device 42 can also be based on the current user equipment.
  • IP address to determine the region in which it is located, so that it can be determined that the input sequence is related to the region
  • the priority of the closed vocabulary for example, but the user input sequence is "woxihuanbund", where the translation of "bund” has “1 embankment 2 dock 3 alliance 4 (Shanghai) Bund", in step S43, when the network device 42 is based on the user
  • the IP address of the device is currently located in Shanghai, China, so that it can be determined that the "Bund” corresponds to the highest level of "Shanghai Bund” or "The Bund”.
  • network device 42 may also update information such as saved user input history, input preferences, and inter-vocabulary associations.
  • the user device 41 acquires the user's selection of the provided plurality of input term options by further interaction with the user and transmits to the network device; in step S410 (not shown)
  • the network device 42 updates the thesaurus and the user input history, the association between the vocabularies, and the like according to the received user selection.
  • the new term option and the existing term option may be added to the network thesaurus. feature.
  • the network device 42 can also search for new combinations of terms on the Internet and update the network vocabulary and the like.
  • the network advertisement library may be located outside the network device 42, for example at another network device or distributed at other multiple network devices, and the network device 42 may be connected to the other network device via the network, thereby querying and querying the user. Enter the ad information options associated with the sequence.
  • Fig. 24 shows another preferred example according to the present embodiment, in which the user equipment 41 itself also holds a local vocabulary, and can synchronize with the user-specific user vocabulary in the network vocabulary of the network device 42 at any time or periodically. .
  • step S44 the user equipment 41 performs a matching query in the local vocabulary according to the user input sequence.
  • the specific query process is as described in the foregoing step S42 described with reference to FIG. 44, and the content is referenced here. Do not repeat them.
  • Steps S41 to S43 are the contents of steps S41 to S43 described above with reference to FIG. 23, and the content references are not described herein.
  • steps S41 to S43 and step S44 may be performed synchronously, the completion time of which mainly depends on the processing speed of the user equipment 41 and the network device 42 and the network transmission delay between the user equipment 41 and the network equipment 42.
  • step S46 the user equipment 41 merges one or more input term options that are locally queried with one or more input term options from the network device 42, deletes the repeating options therein, and according to certain rules. Determining a priority order of the plurality of term options obtained by the final merge and the advertisement information options related to the input sequence fed back from the network device 42, and then, in step S48, pressing the input term option and the advertisement information option accordingly
  • the priority order is provided to the user for selection or further human interaction.
  • the input term option provided by the network device 42 should be more accurate, so the priority is higher than the local query to obtain the input term option, and similarly, the ad information option is usually placed on each line so as not to affect the user's text input. The option position at the end of the middle.
  • the network vocabulary in the network device 42 may also be the user.
  • a specific user vocabulary may also be the user.

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A method, a device, a server and a system for character input by a user are disclosed, the method involves: an input sequence in a user device is detected (S11Ol); the input sequence is sent to a network server (S 1103); at least one candidate entry option based on matching the input sequence is received from the network server (S1104); the candidate entry option is showed ( SI1105 ). When the character is input by the user, the quick response of a local corpus and the high precision of a large capacity network corpus could be fully integrated, and the input accuracy, the input efficiency, the input speed and the preferred word hit rate are increased.

Description

用于供用户进行文字输入的方法、 设备、 服务器及系统 本申请要求了申请日为 2010年 4月 16日,申请号为 201010148563.7, 发明名称为"一种基于网络的供用户进行文字输入的方法和设备"、申请日为 2010年 4月 16日, 申请号为 201010148571.1 , 发明名称为"用于供用户进 行文字输入的方法、 设备、 服务器和系统"、 申请日为 2010年 4月 16 曰, 申请号为 201010148572.6, 发明名称为"供用户进行文字输入及同时提供搜 索相关信息的方法和设备"、 申请曰为 2010 年 5 月 31 曰, 申请号为 201010187082.7 , 发明名称为"用于供用户进行文字输入的方法、 设备、 服 务器和系统"的中国专利申请的优先权, 其全部内容通过引用结合在本申请 中。  Method, device, server and system for user inputting texts The application filed on April 16, 2010, the application number is 201010148563.7, and the invention name is "a network-based method for text input by users" And the equipment", the application date is April 16, 2010, the application number is 201010148571.1, and the invention name is "method, equipment, server and system for users to input text". The application date is April 16, 2010. The application number is 201010148572.6, and the invention name is "method and equipment for users to input text and provide relevant information for search". The application number is May 31, 2010, the application number is 201010187082.7, and the invention name is "for users." The priority of the Chinese Patent Application, the entire disclosure of which is incorporated herein by reference.
【技术领域】 [Technical Field]
本发明涉及计算机网络技术, 尤其涉及利用计算机网络来供用户进行 文字输入的方法、 设备、 服务器及系统。  The present invention relates to computer network technology, and more particularly to a method, device, server and system for utilizing a computer network for text input by a user.
【背景技术】 【Background technique】
目前比较流行的中文输入法包括拼音加加、 Google输入法以及搜狗输 入法。 这些输入法将词典数据库 (筒称词库)和基于词库的转化方法或称 算法模型实现保存在本地, 也即存储在用户设备的存储设备如硬盘、 闪存 等之中。 在下文中将词库和算法模型的总和称为语料库。  Currently popular Chinese input methods include Pinyin Plus, Google Input, and Sogou input. These input methods store the dictionary database (the vocabulary) and the lexicon-based conversion method or algorithm model implementation locally, that is, in the storage device of the user device such as a hard disk, a flash memory, or the like. The sum of the thesaurus and the algorithm model is hereinafter referred to as a corpus.
这些输入法都是基于用户在例如台式机、 笔记本电脑、 手机、 PDA等 用户设备上输入的按键序列在用户设备的本地语料库中搜索候选词条的然 后显示给用户进行选择。 这些输入法根据用户对候选词条的选择动态地更 新本地语料库以及语料库中各个候选词条的出现次序或优先级。 随着用户 长时间地使用该输入法, 不断磨合, 本地语料库中的各个词条的优先级被 调整为越来越符合该特定用户的输入习惯, 因此文字输入速度也会越来越 快。 但是, 这些现有输入法还存在如下缺点: These input methods are based on a key sequence input by a user on a user device such as a desktop computer, a notebook computer, a mobile phone, a PDA, etc., and search for candidate terms in the local corpus of the user device and then displayed to the user for selection. These input methods dynamically update the local corpus and the order or priority of occurrence of each candidate term in the corpus based on the user's selection of candidate terms. As the user uses the input method for a long time and keeps running, the priority of each entry in the local corpus is The adjustment is more and more in line with the input habits of the particular user, so the text input speed will be faster and faster. However, these existing input methods also have the following disadvantages:
1.在网络文化日益发展的今天,各种语言如中、英文的字词日新月异, 虽然上述输入法中的搜狗输入法能够定期更新其本地词典数据库, 但这种 更新的频率也难以紧跟网络语言的发展, 由此导致用户的输入不能带来最 理想和权威的候选结果列表, 更勿论那些不支持本地词库在线更新的输入 法。  1. Today, with the growing development of online culture, the words of various languages such as Chinese and English are changing with each passing day. Although the Sogou input method in the above input method can update its local dictionary database regularly, the frequency of this update is difficult to keep up with the network. The development of language, which leads to the user's input can not bring the most ideal and authoritative list of candidate results, let alone those input methods that do not support the online update of the local thesaurus.
2. 本地语料库的备份和转移不方便: 假设这样一种情形, 用户 A经常 使用办公室的台式机 a进行文字录入,因此存储在台式机 a上的本地语料库 al非常适合用户 A的输入习惯。如果他暂时需要用其同事 B的笔记本电脑 b进行文字录入, 该笔记本电脑 b中存储有该同事 B的本地语料库 bl。 为 了继续使用自己的语料库 al提高录入速度, 用户 A需要: 1 )从台式机 a 中备份出语料库 al ; 2 )从笔记本电脑 b中备份出同事 B的语料库 bl , 以 免导入语料库 al时覆盖笔记本电脑 b的语料库 bl ; 3 )将语料库 al导入笔 记本电脑 b, 然后开始使用该输入法; 4 )使用完笔记本电脑 b后, 还需要 把语料库 bl导回到笔记本电脑 b。 如此繁瑣的操作即使对于精通电脑的人 士来说也是非常麻烦的。  2. Local corpus backup and transfer is not convenient: Suppose a situation where user A often uses the office desktop a for text entry, so the local corpus stored on desktop a is very suitable for user A's input habits. If he temporarily needs to use his colleague B's laptop b for text entry, the laptop b stores the colleague B's local corpus bl. In order to continue to use their own corpus to improve the entry speed, User A needs to: 1) backup the corpus a from desktop a; 2) back up the colleague bl of colleague B from laptop b to avoid overwriting the laptop when importing corpus al b) corpus bl; 3) import the corpus a into the laptop b, and then start using the input method; 4) after using the laptop b, you also need to bring the corpus bl back to the laptop b. Such cumbersome operations are very troublesome even for those who are proficient in computers.
3. 本地语料库的同步和更新不方便: 一些输入法可以在网络服务器上 保留注册用户的用户语料库, 当用户登录网络服务器时, 为了使用自己的 用户语料库, 可以要求将网络服务器上保留的用户语料库与本地用户设备 的语料库同步。但是,语料库中可能包含该用户经常录入的一些敏感信息, 如果该用户在他人的电脑或网吧的电脑上执行同步操作时, 会导致将其语 料库中的敏感信息下载到他人或公共的电脑上, 并且他人或公共电脑上的 原有语料库的信息也会被同步到网络服务器为该用户保留的用户语料库中。 这样不可避免地造成用户语料库的混乱和泄密。  3. It is inconvenient to synchronize and update the local corpus: Some input methods can keep the user corpus of the registered user on the web server. When the user logs in to the web server, in order to use his own user corpus, the user corpus retained on the web server can be requested. Synchronize with the corpus of the local user device. However, the corpus may contain some sensitive information that the user often enters. If the user performs a synchronization operation on another person's computer or Internet cafe computer, it will cause sensitive information in the corpus to be downloaded to another person or a public computer. And the information of the original corpus on the other person or on the public computer is also synchronized to the user corpus reserved by the web server for the user. This inevitably leads to confusion and leaks in the user corpus.
4. 本地语料库容量有限、 更新不及时以及本地设备处理能力有限: 一 些输入法的网络服务器可以定期地收集网络上的热点词汇, 并且同步到用 户设备的本地语料库。 但是, 由于用户设备的存储容量和处理速度所限, 本地语料库不能太大, 而网络上每日不断涌现新的网络热词、 日常口语、 新产品品牌、 最新电影等词汇, 不可能把所有这些新出现的词汇或短语都 同步到本地语料库中。 据统计, 如果把网络上出现的词汇或惯用短语以及 语言模型都收集起来形成的语料库可能超过 4G容量, 而把 4G的网络语料 库都同步到本地设备上几乎是不可能实现的, 特别是对于手持设备这样的 瘦客户端来说尤其如此。 即便能够把如此大的语料库存储在本地设备, 要 在这样的语料库中搜索出匹配的词条对本地设备的处理能力提出很高的要 求。 我们知道, 语料库或语料库越大, 所获得的候选词条就越准确。 而基 于有限的本地语料库进行匹配所获得的词条将受限制, 并且准确度较低。 由于例如手持电话这样的本地设备处理能力的限制不能提供复杂的功能, 如中英文混合输入功能、 或者多种语言混合输入功能。 4. Local corpus has limited capacity, untimely updates, and limited local device processing capabilities: The web server of these input methods can periodically collect hotspot words on the network and synchronize to the local corpus of the user equipment. However, due to the limited storage capacity and processing speed of the user equipment, the local corpus cannot be too large, and it is impossible to put all these words on the network, such as new online hot words, daily spoken words, new product brands, and latest movies. Emerging words or phrases are synchronized to the local corpus. According to statistics, if the corpus formed by collecting vocabulary or idiomatic phrases and language models appearing on the network may exceed 4G capacity, it is almost impossible to synchronize the 4G network corpus to the local device, especially for handheld This is especially true for thin clients like devices. Even if such a large corpus can be stored in a local device, searching for a matching entry in such a corpus places high demands on the processing power of the local device. We know that the larger the corpus or corpus, the more accurate the candidate terms obtained. The terms obtained by matching based on a limited local corpus will be limited and the accuracy will be lower. Due to limitations in the processing power of local devices such as hand-held phones, complex functions such as Chinese-English mixed input functions or multi-language mixed input functions cannot be provided.
5. 输入法功能单一, 并未能提供其他地方便用户使用的附加功能, 如 网络搜索功能等。  5. The input method has a single function and does not provide other additional functions that are convenient for users, such as the web search function.
为了解决上述问题, 需要提供一种新的文字输入方法。  In order to solve the above problems, it is necessary to provide a new text input method.
【发明内容】 [Summary of the Invention]
本发明的目的在于提供一种解决上述问题的文字输入方法, 以及使用 该文字输入方法的设备、 网络服务器和系统。  SUMMARY OF THE INVENTION An object of the present invention is to provide a character input method for solving the above problems, and an apparatus, a network server and a system using the character input method.
为了实现上述目的, 本发明的一种用于供用户进行文字输入的方法, 包括下述步骤:  In order to achieve the above object, a method for inputting text by a user of the present invention includes the following steps:
51、 检测用户设备上的输入序列;  51. Detect an input sequence on the user equipment;
52、 将所述输入序列发送至网络服务器;  52. Send the input sequence to a network server.
S3、 接收来自所述网络服务器返回的与所述输入序列匹配的至少一个 网络词条选项; S4、 展现所述网络词条选项。 S3. Receive at least one network entry option returned from the network server that matches the input sequence. S4. Display the network term option.
相应地, 本发明的一种用于供用户进行文字输入的用户设备, 包括: 用于检测输入序列的装置;  Correspondingly, a user equipment for text input by a user of the present invention includes: means for detecting an input sequence;
用于将所述输入序列发送至网络服务器的装置;  Means for transmitting the input sequence to a web server;
用于接收来自所述网络服务器返回的与所述输入序列匹配的至少一个 网络词条选项;  </ RTI> for receiving at least one network entry option returned from the web server that matches the input sequence;
用于展现所述网络词条选项的装置。  Means for presenting the network entry options.
相应地,本发明的一种用于供用户进行文字输入的网络服务器,包括: 用于经由网络接收用户设备上输入序列的装置;  Correspondingly, a network server for text input by a user of the present invention comprises: means for receiving an input sequence on a user equipment via a network;
用于基于所述输入序列在网络服务器中进行匹配查询获得至少一个网 络词条选项的装置;  Means for obtaining at least one network term option based on the input sequence performing a matching query in a web server;
用于获得的所述网络词条选项返回至所述用户设备的装置。  The network entry option for obtaining is returned to the device of the user device.
相应地, 本发明的一种用于供用户进行文字输入的系统, 包括上述的 设备, 以及上述的网络服务器。  Accordingly, a system for inputting text by a user of the present invention includes the above-described device, and the above-described network server.
与现有技术相比, 本发明的优势在于:  The advantages of the present invention over the prior art are:
1. 实现了跨平台、免安装的输入法,由网络服务器维护的集中式大型词 库、 算法模型能够带来高准确度、 高权威性的输入, 另外, 对于本地存储 空间十分有限的用户设备而言, 还能够节约出宝贵的资源;  1. A cross-platform, installation-free input method is implemented. The centralized large-scale thesaurus and algorithm model maintained by the network server can bring high-accuracy and high-authority input. In addition, for user equipment with limited local storage space. In addition, it can save valuable resources;
2. 只要连接到互联网, 用户可以在任何设备上使用自己的语料库进行 录入;  2. Users can use their own corpus to enter on any device as long as they are connected to the Internet;
3.用户使用自己的语料库时,并不一定要把该语料库同步到本地设备; 3. When users use their own corpus, they do not have to synchronize the corpus to the local device;
4. 新的网络热词、 曰常口语、 新产品品牌、 最新电影等等新词条将及 时出现在用户的候选词条中; 4. New terms such as new online hot words, slang, new product brands, latest movies, etc. will appear in the user's candidate terms in a timely manner;
5. 用户在进行文字录入时充分结合本地语料库的快速反应以及大容量 网络语料库的高精确度, 提高录入的准确性和效率;  5. The user can fully combine the rapid response of the local corpus and the high precision of the large-capacity network corpus in the text entry to improve the accuracy and efficiency of the entry;
6. 在用户端将文字输入功能与信息搜索功能很好地结合起来, 无论用 户是在诸如 WORD、 WPS等文字编辑软件、 或在起拟电子邮件时、 或在浏 览非搜索网站网页时均可获得与其输入信息相关的搜索相关信息, 而且通 过将搜索相关信息按一定规则和显示方式置于输入词条选项栏较后位置, 在提供搜索相关信息功能同时不会影响用户的文字输入速度; 6. Combine text input function with information search function on the user side, no matter The user can obtain search related information related to the input information when writing the software such as WORD, WPS, or when starting an email, or when browsing a non-search website, and by searching the relevant information according to certain rules and The display mode is placed in the lower position of the input term option bar, and the function of searching for related information is provided without affecting the text input speed of the user;
7. 利用不同用户之间可能具有的某种共性, 实现多用户协同输入。 同 累进的输入习惯, 相当于减少了用户自身对输入法的训练次数, 由此提高 了输入速度和首选词命中率;  7. Achieve multi-user collaborative input by taking advantage of some commonality between different users. The same progressive input habits are equivalent to reducing the user's own training times on the input method, thereby increasing the input speed and the preferred word hit rate;
8. 基于各种标准定义用户群组, 同一用户可以参加一个或多个群组, 从而在有目的地共享一类词语的同时, 赋予用户选择词语的丰富度和灵活 度, 更符合人类用户具有多重属性的基本事实;  8. Define user groups based on various criteria. The same user can participate in one or more groups, so that when users share a certain type of words purposefully, the user is given the richness and flexibility of selecting words, which is more in line with the human users. Basic facts of multiple attributes;
9. 根据需要为不同用户赋予不同权限或级别, 实现分层次的词条共享, 进一步提高了该输入法的共享效率。  9. Different users are given different permissions or levels according to their needs to realize hierarchical sharing of terms, which further improves the sharing efficiency of the input method.
10. 对群组用户可能共同占有的某种材料进行学习和训练, 以形成用 于该群组的初始共享语料库。 通过集中获取与用户共性有关的词条资源, 减少系统构建分类词库的开销, 从而可以以较低成本实现该高效的输入法。  10. Learn and train a material that a group of users may share in common to form an initial shared corpus for the group. By centrally obtaining the term resources related to the commonality of the user, the overhead of constructing the classified thesaurus is reduced, so that the efficient input method can be realized at a lower cost.
【附图说明】 [Description of the Drawings]
图 1 是本发明第一实施方式中在网络服务器通信的用户设备端输入文 字的方法的流程图;  1 is a flowchart of a method for inputting characters on a user equipment side of a network server communication according to a first embodiment of the present invention;
图 2是本发明第一实施方式中在网络服务器上辅助用户进行文字输入 的方法的流程图;  2 is a flowchart of a method for assisting a user to perform character input on a web server according to the first embodiment of the present invention;
图 3是本发明第一实施方式中在网络服务器上辅助用户进行文字输入 的方法中在网络语料库中进行匹配查询的具体步骤的流程图;  3 is a flow chart showing specific steps of performing a matching query in a network corpus in a method for assisting a user to input text on a web server according to the first embodiment of the present invention;
图 4是本发明第一实施方式中与网络服务器进行通信的供用户进行文 字输入的用户设备的方框图; 器的方框图; 4 is a block diagram of a user equipment for communicating with a web server for text input by a user in the first embodiment of the present invention; Block diagram of the device;
图 6是本发明第二实施方式中在与网络服务器通信的用户设备端输入 文字的方法的流程图;  6 is a flow chart showing a method of inputting characters on a user equipment terminal that communicates with a network server in a second embodiment of the present invention;
图 7是本发明第二实施方式中在网络服务器上辅助用户进行文字输入 的方法的流程图;  7 is a flowchart of a method for assisting a user to perform character input on a web server according to a second embodiment of the present invention;
图 8是本发明第二实施方式中辅助用户进行文字输入的系统的框图; 器的方框图;  Figure 8 is a block diagram of a system for assisting a user to perform character input in a second embodiment of the present invention;
图 10是本发明第二实施方式中在网络服务器上注册的用户群组的示意 图;  Figure 10 is a schematic diagram of a group of users registered on a network server in a second embodiment of the present invention;
图 11是本发明第二实施方式中在网络服务器上维护的群组语料库中群 组词条的属性列表的示意图;  11 is a schematic diagram showing an attribute list of a group entry in a group corpus maintained on a network server in a second embodiment of the present invention;
图 12是本发明第三实施方式中计算机网络的示意图;  Figure 12 is a schematic diagram of a computer network in a third embodiment of the present invention;
图 13a-13d是在用户设备处显示的用于对根据本发明第三实施方式的 输入法进行设置的人机界面示意图;  13a-13d are schematic diagrams of a human-machine interface displayed at a user equipment for setting an input method according to a third embodiment of the present invention;
图 14是本发明第三实施方式中在连接到一个网络服务器的用户设备中 供用户进行文字输入的方法流程图;  14 is a flowchart of a method for a user to perform character input in a user equipment connected to a network server in a third embodiment of the present invention;
图 15是本发明第三实施方式中在网络服务器中用于辅助一个用户设备 的用户来进行文字输入的方法流程图;  15 is a flowchart of a method for assisting a user of a user equipment to perform text input in a network server according to a third embodiment of the present invention;
图 16是本发明第三实施方式中通过基于 WEB的浏览器实现的文字输 入的屏幕截图;  Figure 16 is a screenshot of a text input implemented by a web-based browser in a third embodiment of the present invention;
图 17是本发明第三实施方式中连接到网络服务器的供用户进行文字输 入的用户设备的框图;  17 is a block diagram of a user equipment connected to a network server for text input by a user in a third embodiment of the present invention;
图 18是本发明第三实施方式中用于辅助一个用户设备的用户来进行文 字输入的网络服务器的框图; 输入信息相关的搜索相关信息的用户设备的示意图; 输入信息相关的搜索相关信息的用户设备和网络设备的示意图; 输入信息相关的搜索相关信息的用户设备和网络设备的示意图; 18 is a block diagram of a network server for assisting a user of a user equipment to perform character input in a third embodiment of the present invention; A schematic diagram of a user equipment for inputting information related to search related information; a schematic diagram of a user equipment and a network device for inputting information related to search related information; a schematic diagram of user equipment and network equipment for inputting information related to searching related information;
图 22是本发明第四实施方式中在用户设备中当用户进行文字输入时同 时提供与输入信息相关的搜索相关信息的方法流程图;  22 is a flowchart of a method for simultaneously providing search related information related to input information when a user performs character input in a user equipment according to a fourth embodiment of the present invention;
图 23是本发明第四实施方式中用户设备与网络设备相配合来当用户进 行文字输入时同时提供与输入信息相关的搜索相关信息的方法流程图; 图 24是本发明第四实施方式中用户设备与网络设备相配合来当用户进 行文字输入时同时提供与输入信息相关的搜索相关信息的方法流程图。  23 is a flowchart of a method for providing search related information related to input information when a user performs text input in cooperation with a network device according to a fourth embodiment of the present invention; FIG. 24 is a user in a fourth embodiment of the present invention; A method flow diagram in which the device cooperates with the network device to provide search related information related to the input information when the user performs text input.
【具体实施方式】 【detailed description】
现在, 将参考附图来详细地描述本发明的具体实施方式。  DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Now, specific embodiments of the present invention will be described in detail with reference to the accompanying drawings.
以下对本发明的具体实施例进行了描述。 需要理解的是, 本发明并不 局限于下述特定实施方式, 本领域技术人员可以在所附权利要求的范围内 做出各种变形或修改。  Specific embodiments of the invention are described below. It is to be understood that the invention is not limited to the specific embodiments described below, and those skilled in the art can make various modifications or changes within the scope of the appended claims.
应当理解, 本申请之任一流程图中所示的方法步骤并不要求严格按照 图示的顺序执行。某些步骤可以在另一些步骤之前执行,或并入其它步骤, 有些步骤可以同步执行, 诸如此类。  It should be understood that the method steps shown in any one of the flowcharts of the present application are not required to be performed strictly in the order illustrated. Some steps may be performed before other steps, or incorporated into other steps, some steps may be performed concurrently, and the like.
虽然以下的各具体实施例中, 均以鼠标、 键盘作为人机交互时的输入 设备, 并以用户设备的显示器作为人机交互的输出设备, 应当理解, 本发 明并不排除使用其它输入设备和输出设备的情形, 例如, 用户通过手写板 进行输入, 用户设备通过扬声器作为输出设备等。  Although in the following specific embodiments, the mouse and the keyboard are used as input devices for human-computer interaction, and the display of the user device is used as an output device for human-computer interaction, it should be understood that the present invention does not exclude the use of other input devices and In the case of an output device, for example, the user inputs through a tablet, the user device passes through a speaker as an output device, and the like.
当前的用户终端大多能够通过以太网、 WIFI、 3G或 2G随时连接到互 联网, 因此如果能够利用广泛分布于互联网上的服务器的强大处理能力和 庞大的语料库来辅助用户在用户终端上进行文字录入将能够大大地提高文 字录入的准确度和效率。 因此本发明提出一种利用网络服务器辅助进行文 字录入的方法、 用户设备、 网络服务器和系统。 Most of the current user terminals can connect to each other at any time via Ethernet, WIFI, 3G or 2G. Networking, so if you can take advantage of the powerful processing power of a server widely distributed on the Internet and a large corpus to assist users in text entry on the user terminal, the accuracy and efficiency of text entry can be greatly improved. Therefore, the present invention proposes a method, user equipment, network server and system for assisting text entry by using a network server.
本发明用于供用户进行文字输入的方法、 设备、 服务器及系统的第一 实施例:  The present invention is directed to a first embodiment of a method, apparatus, server and system for text input by a user:
如图 4所示, 在本发明第一实施方式的用户设备 140上存储有本地语 料库 1403 , 该本地语料库 1403中存储有基础词汇集、 基础语言模型、 用户 使用输入法过程中生成的词汇集。  As shown in FIG. 4, a user corpus 1403 is stored on the user equipment 140 of the first embodiment of the present invention. The local corpus 1403 stores a basic vocabulary set, a basic language model, and a vocabulary set generated by the user during the input method.
除此之外, 该本地语料库 1403还可以存储一些辅助信息: 例如, 用户 对输入法的各种设置属性, 包括但不限于模糊音、 繁筒体、 双拼、 全拼、 筒拼等等; 以及用户的属性信息, 包括但不限于职业、 爱好、 专业领域、 筒历、 年龄等等。 这些辅助的信息有助于对候选词条进行优化排序。  In addition, the local corpus 1403 can also store some auxiliary information: for example, various setting properties of the input method by the user, including but not limited to fuzzy sound, multi-body, double spell, full spell, barrel spell, etc.; And user attribute information, including but not limited to occupations, hobbies, professional fields, calendars, ages, etc. This auxiliary information helps to optimize the ordering of candidate terms.
用户设备 140还具有键盘 1401 , 用于由用户输入文字的拼音字母或者 笔画序列。 用户设备 140中的匹配装置 1402基于所输入的拼音字母或笔画 序列在本地语料库 1403中查找匹配的本地词条选项,并通过显示装置 1406 显示出来,供用户选择。该键盘可以是纯数字键盘或全字母键盘(QWERTY 键盘), 也可以是实体键盘或虚拟键盘。  The user device 140 also has a keyboard 1401 for inputting pinyin letters or stroke sequences of text by the user. The matching device 1402 in the user device 140 finds a matching local term option in the local corpus 1403 based on the entered pinyin letter or stroke sequence and displays it through the display device 1406 for selection by the user. The keyboard can be a pure numeric keyboard or a full alphanumeric keyboard (QWERTY keyboard), or it can be a physical keyboard or a virtual keyboard.
为了在输入过程中通过互联网获得来自网络服务器 150 的协助, 本发 明的用户设备 140中增加了网络通信装置 1404和汇总装置 1405。网络通信 装置 1404通过互联网或局域网与网络服务器 150进行通信, 把通过键盘 1401 输入的拼音字母和笔画序列发送到网络服务器 150。 网络服务器 150 利用庞大的网络语料库和强大的处理能力来查找适合的词条选项。 由网络 服务器 150获得的网络词条选项被返回到网络通信装置 1404, 并且由网络 通信装置 1404将接收到的网络词条选项传输给汇总装置 1405。 汇总装置 1405接收来自匹配装置 1402的本地词条选项和来自网络通信装置 1404的 网络词条选项, 经过汇总后在显示装置 1406上显示出来, 供用户选择。 网络服务器 150 可以是分布在互联网上的多个网络服务器 1501......150n。 这些网络服务器 1501......150η协同工作, 构成一个服务器 云, 为大量用户提供服务。 网络服务器 150也可以是位于企业局域网上的 一个或多个服务器。 In order to obtain assistance from the web server 150 via the Internet during the input process, the network communication device 1404 and the aggregation device 1405 are added to the user device 140 of the present invention. The network communication device 1404 communicates with the web server 150 via the Internet or a local area network, and transmits the pinyin letters and stroke sequences input through the keyboard 1401 to the web server 150. Web server 150 leverages a vast network corpus and powerful processing power to find suitable term options. The network term option obtained by the web server 150 is returned to the network communication device 1404, and the received network term option is transmitted by the network communication device 1404 to the summary device 1405. The summary device 1405 receives local entry options from the matching device 1402 and from the network communication device 1404 The network entry options are summarized and displayed on the display device 1406 for selection by the user. The web server 150 may be a plurality of web servers 1501 ... 150n distributed over the Internet. These network servers 1501...150n work together to form a server cloud that provides services to a large number of users. The web server 150 can also be one or more servers located on the corporate local area network.
图 1 为根据本发明第一实施方式, 在与网络服务器通信的用户设备端 输入文字的方法的流程图。 如图所示, 在步骤 S1101 中, 检测用户在用户 设备的键盘 1401上的按键输入序列。 该按键序列可以是一个或多个词组甚 至一句话的筒拼或全拼,例如用户要输入"我喜欢用百度搜索引擎",可以输 入每个字声母的筒拼 "wxhybdssyq" , 可以输入每个字的全拼 "woxihuanyongbaidusousuoyinqing" , 还可以输入筒拼和全拼的混合输入 "woxhuanybaidssyinq'O 一般来说, 全部输入全拼的话, 候选词更加精确, 减少翻页查找的次数, 但是需要输入较多字符。 如果全部输入筒拼, 则重 码较多, 导致翻页查找时间较长, 效率不高。 因此通常全拼和筒拼混合输 入比较有效。 特别是, 如果用户输入一个新词条, 本地语料库 1403会根据 用户的选词同步地把新词条加入到本地语料库 1403 , 下次用户再次输入该 词条时可以仅仅用筒拼即可快速输入该词条。  1 is a flow chart showing a method of inputting text at a user equipment terminal that communicates with a web server according to a first embodiment of the present invention. As shown, in step S1101, the key input sequence of the user on the keyboard 1401 of the user device is detected. The key sequence can be one or more phrases or even a sentence of spelling or full spelling. For example, the user has to input "I like to use Baidu search engine", and can input each word initials "wxhybdssyq", and can input each The word "woxihuanyongbaidusousuoyinqing" can also be input into the mixed input "woxhuanybaidssyinq". In general, if all inputs are spelled together, the candidate words are more precise, reducing the number of page-turning searches, but more input is required. If all the input spells are spelled, the weight is more, which leads to longer page search time and less efficiency. Therefore, it is usually more effective to mix and match the input. In particular, if the user enters a new entry, the local corpus 1403 will synchronously add the new entry to the local corpus 1403 according to the user's choice, and the next time the user enters the entry again, the entry can be quickly entered by simply using the spell.
接着, 在步骤 S1102 中, 获得用户的按键输入序列后, 将所述输入序 列在用户设备 140的本地语料库 1403中进行匹配查询, 以获得一个或多个 匹配的本地词条选项。  Next, in step S1102, after obtaining the user's key input sequence, the input sequence is performed in the local corpus 1403 of the user equipment 140 to perform a matching query to obtain one or more matching local entry options.
在步骤 S1103中, 将所述按键输入序列发送至网络服务器 150。  In step S1103, the key input sequence is transmitted to the web server 150.
上述步骤 S1102和 S1103可以先后执行, 也可以同时执行。  The above steps S1102 and S1103 may be performed sequentially or simultaneously.
为了快速地显示所获得的词条选项, 在步骤 S1102 中获得本地词条选 项之后, 可以立即转到步骤 S1105 ,将所获得的本地词条选项汇总并显示给 用户, 供其选择。 与此同时, 网络服务器 150收到来自用户设备 140的按 键输入序列, 并在网络语料库中查找匹配的网络词条选项。 在步骤 S1104,用户设备 140接收到来自网络服务器 150的网络词条选 项并发送到汇总装置 1405。 然后转到步骤 S1105 , 在汇总装置 1405中将来 自匹配装置 1402的本地词条选项和来自网络服务器 150的网络词条选项进 行汇总后提供给显示装置 1406显示, 由用户进行选择。 经过与网络词条选 项汇总后的词条选项更加精确。 In order to quickly display the obtained entry options, after obtaining the local entry option in step S1102, it is immediately possible to proceed to step S1105, and the obtained local entry options are summarized and displayed to the user for selection. At the same time, web server 150 receives the key input sequence from user device 140 and looks for a matching network term option in the network corpus. At step S1104, the user equipment 140 receives the network entry option from the web server 150 and sends it to the summary device 1405. Then, proceeding to step S1105, the local entry option from the matching device 1402 and the network entry option from the web server 150 are summarized in the summary device 1405 and provided to the display device 1406 for display, which is selected by the user. The term option after summarizing with the network entry option is more precise.
由于输入的过程是动态连续的过程, 词条选项也是随着用户按键输入 而不断变化的, 因此步骤 S1105之后又转回步骤 S1101检测用户设备的按 键输入。  Since the input process is a dynamic continuous process, the entry option is also constantly changed as the user inputs the key, so step S1105 is followed by a step S1101 to detect the key input of the user equipment.
上述步骤之间的次序是可以调换的, 而不影响本发明的实现。 例如为 了更快地获得网络词条选项, 可以调换步骤 S1102和 S1103的次序, 先把 检测到的按键输入序列发送到所述网络服务器。  The order between the above steps is interchangeable without affecting the implementation of the present invention. For example, to obtain the network entry option more quickly, the order of steps S1102 and S1103 can be reversed, and the detected key input sequence is first sent to the network server.
在汇总装置 1405收到本地词条选项和网络词条选项之后如何进行汇总 和显示是本发明要解决的一个问题。  How to summarize and display after the summary device 1405 receives the local entry option and the network entry option is a problem to be solved by the present invention.
由于网络传输和服务器处理的滞后, 汇总装置 1405—般会先收到本地 词条选项然后才收到网络词条选项, 当网络服务器还没有反馈回网络词条 选项时, 可以立即将本地词条选项提供给显示装置 1406供用户选择, 不必 与网络词条选项一同显示。  Due to the lag of network transmission and server processing, the summary device 1405 will generally receive the local entry option before receiving the network entry option. When the network server has not returned the network entry option, the local entry can be immediately The options are provided to display device 1406 for user selection without having to be displayed with the network entry option.
根据本发明第一实施方式中的第一实例, 为了快速地显示出候选词条, 汇总装置 1405在获得所述本地词条选项之后, 在接收到所述网络词条选项 之前, 将所获得的本地词条选项按照其优先级排序在词条栏中显示给所述 用户, 其中优先级越高, 该输入词条选项越靠前显示。 具体地, 匹配装置 1402可以根据用户输入历史记录中对各个词条选项的选择频度、 各词条选 项中各个词汇间的文义关联性来确定其优先级高低。 匹配装置 1402也可根 据用户设定的输入偏好选择来确定优先级高低。  According to the first example in the first embodiment of the present invention, in order to quickly display the candidate term, the summary device 1405, after obtaining the local term option, obtains the obtained network term option before receiving the network term option The local entry options are displayed to the user in the entry bar according to their priority order, wherein the higher the priority, the higher the input term option is displayed. Specifically, the matching device 1402 can determine the priority level according to the selection frequency of each term option in the user input history and the semantic relevance between each term in each term option. The matching device 1402 can also determine the priority level based on the input preference selection set by the user.
在汇总装置 1405接收到所述网络词条选项之后, 用户可能已经在先前 显示的本地词条选项中选定部分词条, 或者已经翻页浏览部分本地词条选 项。 这时所接收到的网络词条选项中可能有部分词条与先前获得的本地词 条选项相同。 因此需要从网络词条选项中剔除这些已经被选定和 /或重复的 词条。 接着, 将剩余的网络词条选项按照该词条的优先级插入到当前和后 续显示的本地词条选项中, 而不改变本地候选词条排列的先后次序。 这样 处理的优点是, 在加入网络候选词条时, 用户当前浏览的词条选项栏上的 词条位置不会有太大的变化。 After the summary device 1405 receives the network entry option, the user may have selected a partial entry in the previously displayed local entry option, or has turned the page through a partial local entry. Item. At this time, some of the entries in the network term option received may be the same as the previously obtained local term option. It is therefore necessary to remove these entries that have been selected and/or repeated from the network entry options. Next, the remaining network entry options are inserted into the current and subsequent displayed local entry options according to the priority of the entry without changing the order of the local candidate entries. The advantage of this processing is that when joining a network candidate term, the position of the term on the term option bar currently browsed by the user does not change much.
假设,要用筒拼" wxhybdssyq"输入"我喜欢用百度搜索引擎",从本地语 料库中检索出 "无信号, 微型化, 无限好, 玩笑话, ..."等本地词条选项对应 于字母组合" wxh" ,随后收到对应于" wxh"的网络词条选项"无信号,微型化, 我喜欢, 无限好, 玩笑话..."。 网络词条选项比本地词条选项多了一个"我喜 欢"。 如果此时用户还没有对字母组合" wxh"选定词条, 则将网络词条选项 中重复的词条剔除后剩下的词条"我喜欢"插入到当前显示的选项中,并不改 变本地候选词条排列的先后次序。如果词条"我喜欢 "的排序优先级比较当前 显示的本地词条选项更低,则将该词条"我喜欢 "按照其优先级插入到后续显 示的词条选项的适当位置。 当用户在词条选项栏上向后翻页时会显示出已 经插入在本地词条选项中的网络词条选项"我喜欢"。  Assume that you want to use the "wxhybdssyq" input "I like to use Baidu search engine" to retrieve the "no signal, miniaturization, infinitely good, joke, ..." and other local entry options from the local corpus. Combine "wxh" and then receive the network entry option corresponding to "wxh" "No signal, miniaturization, I like, infinitely good, joke...". The online entry option has one more "I like it" than the local entry option. If the user has not selected the entry for the letter combination "wxh" at this time, the remaining words "I like" after the repeated entry in the network entry option are inserted into the currently displayed option do not change. The order in which local candidate terms are arranged. If the sort priority of the entry "I like" is lower than the currently displayed local entry option, the entry "I like" is inserted into the appropriate position of the subsequent displayed entry option according to its priority. When the user flips the page backwards on the entry options bar, the network entry option "I like" that has been inserted in the local entry option is displayed.
如果本地词条选项中也有选项"我喜欢", 并且在收到对应于" wxh"网络 词条选项时, 用户已经在本地词条选项中选择了 "我喜欢", 则与字母组合 "wxh"对应的词条被确定, 则剔除对应 "wxh"的所有网络词条选项。  If there is also the option "I like" in the local entry option, and the user has selected "I like" in the local entry option when receiving the "wxh" network entry option, then the combination with the letter "wxh" If the corresponding entry is determined, all network entry options corresponding to "wxh" are removed.
由于网络语料库非常庞大, 匹配的结果会更加准确, 例如可能直接返 回"我喜欢用百度搜索引擎"这个词条, 而不需逐个词组进行翻页选词, 因此 即便网络反馈稍有滞后仍然会大大加快输入速度。  Since the network corpus is very large, the matching result will be more accurate. For example, the term "I like to use Baidu search engine" may be directly returned, instead of turning the page selection words one by one, so even if the network feedback is slightly delayed, it will still be greatly Speed up the input.
在上述实例中, 由于网络滞后, 一些本地词条选项已经被显示, 网络 词条选项作为对本地词条选项的补充, 只需要选择本地词条选项中没有的 网络词条选项按照一定次序插入到当前显示的词条中即可。 这时并不改变 本地词条选项显示的先后次序, 只是补充一下本地词条选项中没有的网络 词条选项。 In the above example, due to the network lag, some local entry options have been displayed. The network entry option is a supplement to the local entry option. It only needs to select the network entry option that is not in the local entry option to be inserted in a certain order. It can be in the currently displayed entry. This does not change the order in which the local entry options are displayed, just add the network that is not in the local entry options. Entry options.
本发明第一实施方式的第二实例与第一实例类似, 为了快速地显示出 候选词条, 汇总装置 1405在获得所述本地词条选项之后, 在接收到所述网 络词条选项之前, 也是先将所获得的本地词条选项按照其优先级排序在词 条栏中显示给所述用户。  The second example of the first embodiment of the present invention is similar to the first example. In order to quickly display the candidate term, the summary device 1405, after obtaining the local term option, before receiving the network term option, is also The obtained local entry options are first displayed to the user in the entry bar according to their priority order.
与第一实例不同之处在于, 在接收到所述网络词条选项之后, 将所述 网络词条选项中剔除已经显示的词条后剩余的网络词条选项与当前显示和 还未被显示的本地词条选项一同按照这些词条的优先级重新排列后显示, 供用户选择。 这里所述的已经显示的词条是指当前显示的词条之前被浏览 过的、 被翻页排除的词条选项。  The difference from the first example is that after receiving the network term option, the network term option remaining after the already displayed term is removed from the network term option is currently displayed and not yet displayed. The local entry options are rearranged according to the priority of these terms and displayed for the user to select. The already displayed term described here refers to the term option that has been viewed before the currently displayed entry is excluded from page turning.
举例来说, 假设要用筒拼" wxhybdssyq"输入"我喜欢用百度搜索引擎", 从本地语料库中检索出"无信号, 微型化, 无限好, 玩笑话, ..., 我喜欢" 等本地词条选项对应于字母组合" wxh", 其中正确的词条 "我喜欢,,优先级比 较靠后,需要多几翻页才能找到。随后收到对应于" wxh"的网络词条选项"我 喜欢, 无信号, 微型化, 无限好, 玩笑话 ...", 由于匹配精度较高, 在网络 词条选项中 "我喜欢"的优先级比较靠前。这时用户正在浏览本地词条选项的 前几个词条"无信号, 微型化, 无限好, 玩笑话,,等无关词条。 已经显示过的 词条由于没有被用户所选择, 因此先从收到的网络词条选项中剔除已经显 示过的词条, 将剩下的网络词条选项与当前显示和还未被显示的本地词条 选项一同按照这些词条的优先级重新排列后,由于正确的词条 "我喜欢"在网 络词条选项中优先级较高, 重新排列后被调整到当前显示的词条选项中。 该实施例的优点是能够迅速地把正确的词条调整到词条选项的前列或首位。 由于相同词条在本地词条选项和网络词条选项中的优先级可能不同, 因此 在重新排列时可以按照该词条的两个优先级的加权平均来决定新的次序。  For example, suppose you want to use the "wxhybdssyq" input "I like to use Baidu search engine" to retrieve "no signal, miniaturization, infinitely good, joke, ..., I like" from the local corpus. The entry option corresponds to the letter combination "wxh", where the correct entry "I like it, the priority is lower, it takes a few more pages to find. Then I receive the network entry option corresponding to "wxh"" Like, no signal, miniaturization, infinitely good, joke...", due to the high matching accuracy, the priority of "I like" in the network entry option is higher. At this time, the user is browsing the local term option. The first few terms are "no signal, miniaturization, infinitely good, jokes, and so on." Since the displayed item has not been selected by the user, the previously displayed item is first removed from the received network entry option, and the remaining network entry options are displayed with the current display and the local display that has not yet been displayed. The term options are rearranged according to the priority of these terms. Since the correct term "I like" has a higher priority in the network entry option, it is rearranged and adjusted to the currently displayed term option. An advantage of this embodiment is the ability to quickly adjust the correct term to the top or top of the term option. Since the same term may have different priorities in the local term option and the network term option, the new order may be determined according to the weighted average of the two priorities of the term when rearranging.
在第二实例中, 由于后收到的网络词条选项与当前显示和未显示的本 地词条选项按照一定规则重新排列。 这时并会改变本地词条选项显示的先 后次序。 In the second example, the network entry options received later are rearranged according to certain rules with the currently displayed and undisplayed local entry options. At this time, the local entry option will be changed first. Post-order.
本发明第一实施方式的第三实例与第二实例类似, 区别仅仅在于重新 排列时完全按照该网络词条选项的优先级进行重排, 在此不再赘述。  The third example of the first embodiment of the present invention is similar to the second example. The only difference is that the rearrangement is performed in accordance with the priority of the network entry option, and details are not described herein again.
在第三实例中, 收到的网络词条选项后不再显示本地词条选项, 而是 显示先前未显示网络词条选项。  In the third example, the local entry option is no longer displayed after the received network entry option, but the network entry option is not displayed previously.
本发明第一实施方式的第四实例与第一至第三实例不同, 在获得所述 本地词条选项之后, 并不立即将所获得的本地词条选项按照其优先级显示 给用户以供其选择, 而是等待接收到所述网络词条选项之后才按照这些词 条的优先级整体排列后显示给用户进行选择。 该实施例显示的词条选项比 较准确, 特别是对于整句话连续输入再进行词条选择的情况更加有利, 因 为整句话连续输入的时间比较长, 网络响应的延迟不会造成太大的影响, 而整句话的匹配需要更大语料库、 语言匹配模型和处理能力的支持, 因此 等待接收到网络词条选项才显示给用户选择将提供更加准确的结果。  The fourth example of the first embodiment of the present invention is different from the first to third examples, and after obtaining the local entry option, the obtained local entry option is not immediately displayed to the user according to its priority for its Selecting, but waiting to receive the network entry option, after being arranged according to the priority of these terms, is displayed to the user for selection. The term option displayed in this embodiment is relatively accurate, especially for the case where the entire sentence is continuously input and then the term selection is more advantageous, because the continuous input time of the whole sentence is relatively long, and the delay of the network response does not cause too much Impact, and the matching of the whole sentence requires the support of a larger corpus, language matching model, and processing power, so waiting to receive the network entry option is displayed to the user selection will provide more accurate results.
在第四实例中, 等待接收到网络词条选项后才一同排序和显示本地词 条和网络词条选项(需要剔除重复选项), 在网络响应速度非常快的情况以 及整句连续输入的情况下比较有利。  In the fourth example, waiting for the network term option to be received, the local term and the network term option are sorted together (the need to eliminate the repeat option), in the case where the network response is very fast and the entire sentence is continuously input. More favorable.
匹配装置 1402可以根据词条先前是否被选择过、 词条先前被选择的时 间先后、 词条先前被选择的次数、 用户预置的输入偏好选项和 /或词条在网 络上的被搜索的次数来确定本地词条选项的优先级。  The matching device 1402 may be based on whether the entry was previously selected, the chronological order in which the entry was previously selected, the number of times the entry was previously selected, the user-preset input preference option, and/or the number of times the term was searched on the network. To determine the priority of local entry options.
在网络服务器 150上的网络语料库可以包括分别对应于每个用户的用 户网络语料库 1501以及公共网络语料库 1504。  The network corpus on the web server 150 can include a user network corpus 1501 and a public network corpus 1504, respectively, corresponding to each user.
用户网络语料库 1501是每个注册的用户的本地语料库 1403在网络服 务器 150上备份。 用户设备 140还包括本地同步装置(未示出), 用于当注 册用户登录网络服务器 150后, 可以根据用户的选择将用户设备上的本地 语料库 1403上传到网络服务器 150或者将该用户保留在网络服务器 150上 的用户网络语料库 1501与本地语料库 1403进行同步。 用户设备 140还包 括本地更新装置(未示出), 用于根据用户对词条的选择更新所述本地语料 库 1403 , 并且将该选择发送到网络服务器 150以更新所述用户网络语料库 1501。 由于一些词在上下文中将被重复输入, 需要及时更新本地词库 1403 和用户网络语料库 1501 , 提高最近被输入词条的优先级以加快输入。 The user network corpus 1501 is a local corpus 1403 of each registered user backed up on the web server 150. The user equipment 140 further includes a local synchronization device (not shown) for uploading the local corpus 1403 on the user device to the network server 150 or retaining the user on the network according to the user's selection after the registered user logs in to the network server 150. The user network corpus 1501 on the server 150 is synchronized with the local corpus 1403. User equipment 140 also includes A local update device (not shown) is provided for updating the local corpus 1403 based on user selection of the terms and transmitting the selection to the web server 150 to update the user network corpus 1501. Since some words will be repeatedly input in the context, it is necessary to update the local thesaurus 1403 and the user network corpus 1501 in time to increase the priority of the recently entered entry to speed up the input.
与本地语料库 1403相同,用户网络语料库 1501也存储有基础词汇集、 基础语言模型、 用户使用输入法过程中生成的词汇集。 还可以存储一些辅 助信息: 例如, 用户对输入法的各种设置属性, 包括但不限于模糊音、 繁 筒体、 双拼、 全拼、 筒拼等等; 以及用户的属性信息, 包括但不限于职业、 爱好、 专业领域、 筒历、 年龄等等。  Like the local corpus 1403, the user network corpus 1501 also stores a basic vocabulary set, a basic language model, and a vocabulary set generated by the user during the input method. It is also possible to store some auxiliary information: for example, various setting properties of the input method by the user, including but not limited to fuzzy sounds, cascading, double spelling, full spelling, barrel spelling, etc.; and user attribute information, including but not Limited to occupations, hobbies, areas of expertise, calendars, ages, etc.
由于网络服务器 150上保存有用户网络语料库, 因此无论用户用哪个 终端设备, 只要能够连接到网络服务器 150都可以通过登录后同步本地语 料库 1403或在线使用用户网络语料库 1501来快速地进行录入。  Since the user network corpus is stored on the web server 150, regardless of which terminal device the user uses, as long as it can connect to the web server 150, it can be quickly entered by synchronizing the local corpus 1403 after login or using the user network corpus 1501 online.
公共网络语料库 1504基于对公开文献、 出版物、 大量用户的输入、 大 量用户在网络搜索引擎上的检索词汇、 大量网页的索引关键词和 /或关键词 广告信息进行分析统计而形成, 其反映用户群体的共性或热点。  The public network corpus 1504 is formed based on analysis and statistics on public documents, publications, input from a large number of users, retrieval vocabulary of a large number of users on a web search engine, index keywords of a large number of web pages, and/or keyword advertisement information, which reflects users. The commonality or hotspot of the group.
下面参照图 2、 图 3和图 5描述网络服务器 150的结构和其操作流程。 如图 5所示, 本发明第一实施方式的网络服务器 150包括用户网络语 料库 1501、 匹配装置 1502、 网络通信装置 1503、 公共网络语料库 1504、 语料库更新装置 1505、 关键词广告库 1506以及同步装置 1507。  The structure of the web server 150 and its operational flow will be described below with reference to Figs. 2, 3 and 5. As shown in FIG. 5, the network server 150 of the first embodiment of the present invention includes a user network corpus 1501, a matching device 1502, a network communication device 1503, a public network corpus 1504, a corpus update device 1505, a keyword advertisement library 1506, and a synchronization device 1507. .
如上文所述, 用户网络语料库 1501和公共网络语料库 1504合称为网 络语料库。 网络通信装置 1503通过网络连接到一个或多个用户设备 140, 用于经由网络接收用户在用户设备上的按键输入序列, 并将基于该按键输 入序列所获得的网络词条选项反馈回所述用户设备, 供用户选择。  As described above, the user network corpus 1501 and the public network corpus 1504 are collectively referred to as a network corpus. The network communication device 1503 is connected to one or more user devices 140 via a network for receiving a key input sequence of the user on the user device via the network, and feeding back the network entry option obtained based on the key input sequence back to the user Equipment, for users to choose.
匹配装置 1502连接到用户网络语料库 1501、 公共网络语料库 1504和 匹配装置 1502,用于基于所述按键输入序列在用户网络语料库 1501和公共 网络语料库 1504中进行匹配查询获得一个或多个匹配的网络词条选项。 该 匹配装置 1502还包括优先级确定装置 (未示出), 用于根据词条先前是否被 选择过、 词条先前被选择的时间先后、 词条先前被选择的次数、 用户预置 的输入偏好选项和 /或词条在网络上的被搜索的次数来确定所匹配的词条选 项中各个词条的优先级。 The matching device 1502 is connected to the user network corpus 1501, the public network corpus 1504, and the matching device 1502, for performing matching queries in the user network corpus 1501 and the public network corpus 1504 based on the key input sequence to obtain one or more matching network words. Bar options. The The matching device 1502 further includes priority determining means (not shown) for selecting whether the term has been previously selected, the time series in which the term was previously selected, the number of times the term was previously selected, the user-preset input preference option. And/or the number of times the term is searched on the network to determine the priority of each of the matching term options.
同步装置 1507连接到用户网络语料库 1501和网络通信装置 1503 , 用 于当用户通过所述用户设备 140登录网络服务器 150时, 在接收到来自用 户设备的语料库同步指令之后, 将该用户的所述用户网络语料库 1501与该 用户设备中的本地语料库 1403进行同步。  The synchronization device 1507 is connected to the user network corpus 1501 and the network communication device 1503, for when the user logs in to the network server 150 through the user device 140, after receiving the corpus synchronization instruction from the user device, the user of the user The web corpus 1501 is synchronized with the local corpus 1403 in the user device.
语料库更新装置 1505连接到用户网络语料库 1501和公共网络语料库 1504, 用于根据用户的输入和选词来更新用户网络语料库 1501 , 以及根据 对大量用户的输入、 大量用户在网络搜索引擎上的检索词汇、 大量网页的 索引关键词和 /或关键词广告信息进行分析统计来更新公共网络语料库 1504。  The corpus update device 1505 is coupled to the user network corpus 1501 and the public network corpus 1504 for updating the user network corpus 1501 according to user input and word selection, and based on input to a large number of users, a large number of users searching words on a web search engine. The index keywords and/or keyword advertisement information of a plurality of web pages are analyzed and updated to update the public network corpus 1504.
关键词广告库 1506用于提供与关键词相关的广告链接。 一些厂商可以 买下若干关键词或者字母组合, 例如, 百度公司可以买下"百度"、 "搜索引 擎,,这样的关键词,也可以买下" bd"、 "baidu"、 "ssyq"等等这样的字母组合, 当匹配装置 1502基于来自用户设备 140的按键序列在网络语料库中匹配到 "百度 "或"搜索引擎"这样的词条, 或者收到" bd"、 "baidu"、 "ssyq"这样的组 合时, 从关键词广告库 1506中找到对应的关键词广告信息"百度网"及其链 接。 将该关键词广告信息通过网络通信装置 1503返回到用户设备 140并显 示在词条选项中。 用户选择该广告信息就可以跳转到相应的网址链接。 这 里的"选择"包括鼠标点击, 也包括直接通过键盘选择对应的数字选择按键。 匹配装置 1502中的优先级确定装置可以对关键词广告信息分配较高的优先 级, 以保证该广告信息被安排在显示给用户的第一轮词条选项中或当前显 示的选项中显示。  The keyword advertisement library 1506 is for providing an advertisement link related to a keyword. Some manufacturers can buy a number of keywords or letter combinations, for example, Baidu can buy "Baidu", "search engine," such keywords, you can also buy "bd", "baidu", "ssyq", etc. Such a combination of letters, when the matching device 1502 matches a term such as "Baidu" or "Search Engine" in the network corpus based on the key sequence from the user device 140, or receives "bd", "baidu", "ssyq" In such a combination, the corresponding keyword advertisement information "Baidu.com" and its link are found from the keyword advertisement library 1506. The keyword advertisement information is returned to the user device 140 through the network communication device 1503 and displayed in the entry option. The user selects the advertisement information to jump to the corresponding website link. The "selection" here includes a mouse click, and also includes selecting a corresponding digital selection button directly through the keyboard. The priority determining device in the matching device 1502 can be a keyword The advertisement information is assigned a higher priority to ensure that the advertisement information is arranged in the first round of entry options displayed to the user or currently displayed. Display options.
根据需要, 关键词广告库 1506可以合并到公共网络语料库 1504中, 或者合并到用户网络语料库 1501中, 当需要更新关键词广告信息时, 只需 把新的关键词广告信息在网络服务器 150加入到用户网络语料库 1501 中, 通过同步的方式下载到用户设备 140的本地语料库 1403。 这样本地语料库 1403 中包含关键词广告库, 在本地语料库中找到关键词广告信息时, 将该 关键词广告信息作为本地词条选项显示出来, 所述关键词广告信息带有链 接, 当用户选择所显示的广告信息时可以跳转到相关链接。 即使用户设备 140没有连接到互联网, 也可以在输入文字时出现广告信息的词条选项, 增 加广告的曝光率。 The keyword advertisement library 1506 can be incorporated into the public network corpus 1504 as needed. Or merged into the user network corpus 1501. When the keyword advertisement information needs to be updated, the new keyword advertisement information only needs to be added to the user network corpus 1501 in the web server 150, and downloaded to the local device device 140 in a synchronized manner. Corpus 1403. Thus, the local corpus 1403 includes a keyword advertisement library. When the keyword advertisement information is found in the local corpus, the keyword advertisement information is displayed as a local entry option, and the keyword advertisement information has a link, when the user selects the location. You can jump to the relevant link when displaying the ad information. Even if the user device 140 is not connected to the Internet, the entry option of the advertisement information may appear when the text is input, thereby increasing the exposure rate of the advertisement.
图 2为根据本发明第一实施方式在网络服务器上辅助用户进行文字输 入的方法的流程图。  2 is a flow chart of a method for assisting a user to perform text input on a web server in accordance with a first embodiment of the present invention.
如图 2中所示,在步骤 S1201中,网络服务器 150的网络通信装置 1503 经由网络接收用户在用户设备 140上的按键输入序列;  As shown in FIG. 2, in step S1201, the network communication device 1503 of the web server 150 receives a key input sequence of the user on the user device 140 via the network;
在步骤 S1202 中, 基于所述按键输入序列在网络语料库中进行匹配查 询获得一个或多个匹配的网络词条选项;  In step S1202, performing a matching query in the network corpus based on the key input sequence to obtain one or more matching network term options;
在步骤 S1203 中, 将所获得的所述网络词条选项反馈回所述用户设备 140, 供用户选择。  In step S1203, the obtained network term option is fed back to the user equipment 140 for user selection.
如上文所述, 用户可以在网络服务器 150上注册为注册用户。 注册用 户可以登录到网络服务器 150,并在网络服务器 150上保留用户网络语料库 1501。 在已登录的情况下, 用户还可以选择是否同步用户网络语料库 1501 和用户设备的本地语料库 1403。 根据用户是否登录以及是否执行语料库的 同步, 匹配装置 1502可以执行不同的匹配操作, 以提供精确的网络词条选 项。 图 3示出在这些情况下的处理步骤。  As described above, the user can register as a registered user on the web server 150. The registered user can log in to the web server 150 and retain the user web corpus 1501 on the web server 150. In the case of a login, the user can also choose whether to synchronize the user network corpus 1501 with the user device's local corpus 1403. Depending on whether the user is logged in and whether the corpus synchronization is performed, the matching device 1502 can perform different matching operations to provide accurate network entry options. Figure 3 shows the processing steps in these cases.
图 3 为根据本发明第一实施方式在网络服务器上辅助用户进行文字输 入的方法中在网络语料库中进行匹配查询的具体步骤的流程图。  3 is a flow chart showing the specific steps of performing a matching query in a network corpus in a method for assisting a user to perform text input on a web server according to the first embodiment of the present invention.
如图 3所示, 在步骤 S1201之后, 执行步骤 S1301 , 判断用户是否登录 网络服务器 150, 如果没有登录, 由于网络服务器 150不能判断该用户的身 份, 则不能利用该用户保留在网络服务器 150上的用户网络语料库 1501进 行匹配查询, 而是转到步骤 S1503 ,在接收到用户在用户设备上的按键输入 序列时仅仅在所述公共网络语料库中检索匹配的网络词条选项。 登录的方 式可以有多种, 例如利用用户名和密码登录、利用用户设备的 MAC地址自 动登录、 利用用户设备的固定 IP地址自动登录等等。 As shown in FIG. 3, after step S1201, step S1301 is performed to determine whether the user logs in to the network server 150. If not, the network server 150 cannot determine the user's body. If the user does not use the user network corpus 1501 retained by the user on the network server 150 to perform a matching query, the process proceeds to step S1503, and only in the public network corpus when receiving the key input sequence of the user on the user device. Retrieve matching network term options. There are various ways to log in, such as logging in with a username and password, automatically logging in with the MAC address of the user device, automatically logging in using the fixed IP address of the user device, and so on.
如果用户已经登录, 则进行到步骤 S1302,判断该用户的用户网络语料 库 1501是否与用户设备的本地语料库 1403同步。 如果用户是用他人的或 公共的电脑, 他不希望把自己的用户网络语料库 1501 同步到本地设备上, 也不希望本地设备上他人的词库同步到自己的用户网络语料库 1501上, 因 此用户可以选择不同步。但是,用户还是希望能获得其用户网络语料库 1501 的支持。 因此, 当步骤 S1302判断没有同步的情况下, 在接收到用户在用 户设备上的按键输入序列时在所述公共网络语料库 1504和该用户的用户网 络语料库 1501中检索匹配的网络词条选项。如果步骤 S1302判断没有用户 网络语料库已经与用户设备的本地语料库同步的情况下, 由于用户设备会 首先在本地语料库中查找匹配词条选项, 因此网络服务器 150没有必要重 复在相同的用户网络语料库 1501中查找匹配的词条选项, 因此这时在接收 到用户在用户设备上的按键输入序列时仅仅在所述公共网络语料库 1504中 检索匹配的网络词条选项。  If the user has logged in, it proceeds to step S1302 to determine whether the user's user network corpus 1501 is synchronized with the local corpus 1403 of the user device. If the user is using another person's or public computer, he does not want to synchronize his user's web corpus 1501 to the local device, nor does it want to synchronize the other people's thesaurus on the local device to his user's web corpus 1501, so the user can Choose not to sync. However, users still want to be supported by their user web corpus 1501. Therefore, when it is determined in step S1302 that there is no synchronization, the matching network term option is retrieved in the public network corpus 1504 and the user's user network corpus 1501 upon receiving the user's key input sequence on the user device. If it is determined in step S1302 that no user network corpus has been synchronized with the local corpus of the user equipment, since the user equipment will first find the matching term option in the local corpus, the web server 150 does not have to be repeated in the same user network corpus 1501. The matching term option is found, so that only the matching network term option is retrieved in the public network corpus 1504 upon receiving the user's key input sequence on the user device.
另外, 上述的匹配装置 1402、 匹配装置 1502在根据用户输入序列在词 先级。 汇总装置 1405将匹配装置 1402和匹配装置 1502提供的多个匹配的 输入词条选项按优先级顺序在词条栏中显示给所述用户, 其中优先级越高, 该输入词条选项越靠前显示。 具体地, 匹配装置 1402和匹配装置 1502可 以根据用户历史记录中对各个词条选项的选择频度、 各词条选项中各个词 汇间的文义关联性来确定其优先级高低。 匹配装置 1402和匹配装置 1502 也可根据用户设定的输入偏好选择来确定优先级高低, 例如, 当用户设定 输入偏好为: 1 )优先级高低: 计算机词汇 >电子词汇>普通词汇; 2 )优先 级高低: 中文 >英文, 则在输入序列 "woshiyongwindows"可判断 "我使用视 窗软件"优先级最高, "我使用视窗"次之、 "我使用 windows"再次之。另外, 匹配装置 1402和匹配装置 1502还可根据目前用户设备的 IP地址来判断其 所处的地域, 从而可以确定输入序列中与该地域相关的词汇的优先级, 例 如, 但用户输入序列为 "woxihuanbund" , 其中 "bund"的译文有 " 1堤岸 2码 头 3同盟 4 (上海)外滩", 但网络通信装置 1503根据用户设备 IP地址获 知目前位于中国上海市,从而可确定" bund"对应译文中"上海外滩"或"外滩" 优先级最高, 因而可提供如下输入词条选项" 1 我喜欢上海外滩; 2我喜欢 外滩; 3我喜欢码头; 4我喜欢堤岸; 5我喜欢同盟"。 In addition, the matching device 1402 and the matching device 1502 described above are at the word level according to the user input sequence. The summary device 1405 displays the plurality of matched input term options provided by the matching device 1402 and the matching device 1502 in the item column to the user in priority order, wherein the higher the priority, the higher the input term option is. display. Specifically, the matching device 1402 and the matching device 1502 can determine the priority level according to the selection frequency of each term option in the user history record and the semantic relevance between each term in each term option. The matching device 1402 and the matching device 1502 can also determine the priority according to the input preference selection set by the user, for example, when the user sets The input preferences are: 1) Priority: Computer vocabulary > Electronic vocabulary > General vocabulary; 2) Priority level: Chinese > English, the input sequence "woshiyongwindows" can be judged "I use Windows software" has the highest priority, "I Use the window "Second, "I use windows" again. In addition, the matching device 1402 and the matching device 1502 can also determine the region in which the user equipment is located according to the IP address of the current user equipment, so that the priority of the vocabulary related to the region in the input sequence can be determined, for example, but the user input sequence is "Woxihuanbund" , the translation of "bund" has "1 Embankment 2 Terminal 3 Alliance 4 (Shanghai) Bund", but the network communication device 1503 is known based on the IP address of the user equipment and is currently located in Shanghai, China, so that the "bund" corresponding translation can be determined. "Shanghai Bund" or "Bund" has the highest priority, so the following entry options are available: 1 I like to go to the Bund; 2 I like the Bund; 3 I like the pier; 4 I like the embankment; 5 I like the league.
本发明的第一实施方式还提供一种用于供用户进行文字输入的系统, 其中包括本发明第一实施方式的用于输入文字的用户设备和本发明第一实 施方式的用于辅助用户设备输入文字的网络 务器。  A first embodiment of the present invention further provides a system for a user to input characters, including a user equipment for inputting characters according to the first embodiment of the present invention and an auxiliary user equipment according to the first embodiment of the present invention. Enter the text server.
本发明用于供用户进行文字输入的方法、 设备、 服务器及系统的第二 实施方式,  The present invention is directed to a second embodiment of a method, apparatus, server and system for a user to input text,
该第二实施方式是在第一实施方式的基础上,增加了 "群组语料库"的概 念, 以更好的提升输入法首选项的命中率, 提高输入效率:  The second embodiment is based on the first embodiment, and the concept of "group corpus" is added to better improve the hit rate of the input method preferences and improve the input efficiency:
内部网或局域网的多用户, 如企业客户、 网吧、 翻译服务等类型的用 户通常具有较明显的共性。 这种共性可能是相同或相似的工作内容、 相同 或相似的兴趣爱好、 相同或相似的年龄阶段、 相同或相似的地理区域等。 出一种趋同或相似。 例如, 同样是键入" shzh"这样的缩略输入, 对于深圳市 的居民,命中率最高的首选词项是"深圳"; 对于河北省深州市的居民,命中 率最高的首选词项则 "深州";对于航天产业的从业人员,命中率最高的首选 词项则很有可能是"神舟(N号 ) "; 而对于以上共性皆不具备的普通用户来 说, 命中率最高的首选词项有可能是"神州(大地)"。 尽管现在大多数输入 法都提供词频调整功能, 即根据输入选择的最新历史记录调整备选词项的 显示顺序, 然而这种调整必须进行学习或训练, 当目标是一个并不常用甚 至较为生僻的词时, 这种繁瑣是不言而喻的。 并且这种现有的词频调整也 是不稳定的,例如当用户换了另一台计算机,将需要重新进行学习和训练。 因此, 如果能够通过网络对具有较大共性的其他用户的先前输入进行利用, 辅助用户在用户终端上进行文字录入, 无疑将大大地提高文字录入的准确 度和效率。 Multi-users of intranets or local area networks, such as enterprise customers, Internet cafes, and translation services, usually have obvious commonalities. This commonality may be the same or similar work content, the same or similar hobbies, the same or similar age stages, the same or similar geographic areas, and the like. A convergence or similarity. For example, the same type of sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh sh Shenzhou"; for the aerospace industry practitioners, the most preferred term for the highest hit rate is likely to be "Shenzhou (N)"; for ordinary users who do not have the same commonality, the highest preferred hit rate The item may be "Shenzhou (the earth)". Despite most of the input now The method provides a word frequency adjustment function, which adjusts the display order of the alternative words according to the latest history of the input selection. However, the adjustment must be learned or trained. When the target is a word that is not commonly used or even relatively uncommon, this kind of The tediousness is self-evident. And this existing word frequency adjustment is also unstable. For example, when the user changes another computer, it will need to re-learn and train. Therefore, if the previous input of other users having greater commonality can be utilized through the network, the auxiliary user can perform text entry on the user terminal, which will undoubtedly greatly improve the accuracy and efficiency of the text entry.
在本实施方式中, 与网络服务器进行通信的供用户进行文字输入的用 户设备与第一实施方式所揭示的相同, 在此可参考说明书第一实施方式中 结合图 4的阐述, 在此不再赘述。  In the present embodiment, the user equipment for communicating with the web server for text input by the user is the same as that disclosed in the first embodiment. Here, reference may be made to the first embodiment in conjunction with the explanation of FIG. 4, and no longer Narration.
图 6为根据本发明第二实施方式在与网络服务器通信的用户设备端输 入文字的方法的流程图。 如图所示, 在步骤 S2101 中, 检测用户在用户设 备的键盘 1401 上的按键输入序列。 该步骤和本发明第一实施方式中步骤 S1101相似,其具体应用举例可参本发明第一实施方式步骤 S1101中所记载 的内容, 在此不再赘述。 但利用根据本实施方式的文字输入方法, 用户可 以更进一步地筒化输入, 甚至在只用每个字的首字母的情况下, 也能很快 得到想要的结果, 因为可利用的群组词条, 是已经由与之具有共性的其他 用户训练过的。  6 is a flow chart of a method of inputting text at a user equipment terminal in communication with a web server in accordance with a second embodiment of the present invention. As shown in the figure, in step S2101, the key input sequence of the user on the keyboard 1401 of the user device is detected. This step is similar to the step S1101 in the first embodiment of the present invention. For the specific application example, reference may be made to the content described in step S1101 of the first embodiment of the present invention, and details are not described herein again. However, with the text input method according to the present embodiment, the user can further input the input, and even if only the first letter of each word is used, the desired result can be quickly obtained because the available group The term is already trained by other users who have something in common with it.
接着, 在步骤 S2102 中, 获得用户的按键输入序列后, 将所述输入序 列在用户设备 140的本地语料库 1403中进行匹配查询, 以获得一个或多个 匹配的本地词条选项。  Next, in step S2102, after obtaining the key input sequence of the user, the input sequence is performed in the local corpus 1403 of the user equipment 140 to perform a matching query to obtain one or more matching local entry options.
在步骤 S2103中, 将所述按键输入序列发送至网络服务器。  In step S2103, the key input sequence is sent to a web server.
上述步骤 S2102和 S2103可以先后执行, 也可以同时执行。  The above steps S2102 and S2103 may be performed sequentially or simultaneously.
为了快速地显示所获得的词条选项, 在步骤 S2102 中获得本地词条选 项之后, 可以立即转到步骤 S2105 ,将所获得的本地词条选项汇总并显示给 用户, 供其选择。 与此同时, 网络服务器收到来自用户设备 140 的按键输 入序列, 并在与之所属群组相关联的群组语料库中查找匹配的群组词条选 项。 In order to quickly display the obtained term option, after obtaining the local term option in step S2102, it is immediately possible to proceed to step S2105, and the obtained local term options are summarized and displayed to the user for selection. At the same time, the web server receives the keystroke from the user device 140. Enter the sequence and find the matching group entry option in the group corpus associated with the group to which it belongs.
在步骤 S2104,用户设备 140接收到来自网络服务器的群组词条选项并 发送到汇总装置 1405。 然后转到步骤 S2105 , 在汇总装置 1405中将来自匹 配装置 1402的本地词条选项和来自网络服务器的群组词条选项进行汇总后 提供给显示装置 1406显示, 由用户进行选择。 由于网络传输和服务器处理 的滞后, 汇总装置 1405—般会先收到本地词条选项然后才收到群组词条选 项, 当网络服务器还没有反馈回群组词条选项时, 可以立即将本地词条选 项提供给显示装置 1406供用户选择,不必与群组词条选项一同显示。当然, 经过与群组词条选项汇总后的词条选项更加精确。  At step S2104, the user equipment 140 receives the group entry option from the web server and sends it to the summary device 1405. Then, proceeding to step S2105, the local entry option from the matching device 1402 and the group entry option from the web server are summarized in the summary device 1405 and provided to the display device 1406 for display, which is selected by the user. Due to the lag of network transmission and server processing, the summary device 1405 will generally receive the local entry option before receiving the group entry option. When the network server has not returned the group entry option, it can immediately localize. The entry options are provided to display device 1406 for user selection without having to be displayed with the group entry option. Of course, the term option after summarizing with the group entry option is more precise.
由于输入的过程是动态连续的过程, 词条选项也是随着用户按键输入 而不断变化的, 因此步骤 S2105之后又转回步骤 S2101检测用户设备的按 键输入。  Since the input process is a dynamic continuous process, the entry option is also constantly changed as the user inputs the key, so step S2105 is followed by a step S2101 to detect the key input of the user equipment.
上述步骤之间的次序是可以调换的, 而不影响本发明的实现。 例如为 了更快地获得群组词条选项, 可以调换步骤 S2102和 S2103的次序, 先把 检测到的按键输入序列发送到所述网络服务器。  The order between the above steps is interchangeable without affecting the implementation of the present invention. For example, to obtain the group entry option more quickly, the order of steps S2102 and S2103 can be reversed, and the detected key input sequence is first sent to the web server.
在本实施方式的第一实例中, 上述方法还可以包括用户设备侧的注册 步骤 S2106, 即在进行文字输入之前, 可以通过网络通信装置 1404将所述 用户注册到网络服务器上的一个或多个用户群组, 所述用户群组与所述群 组语料库相关联。 这种注册过程例如可以采用本领域公知的群组注册功能。 但是,也可能不需要这样的注册步骤即可将用户与某个群组相关联。例如, 由于用户属于某个业务部门,针对该业务部门的群组在(例如由部门主管) 创建时, 部门成员包括该用户即被自动添加到了该群组。 这样, 当用户在 进行文字输入时, 网络服务器基于其身份可以立即确定其属于该业务部门 群组, 并从所对应的群组语料库中为其搜索相关群组词条。 用户身份信息的步骤 S2107,以便所述网络服务器确定与之关联的用户群组, 进而确定与之关联的群组语料库。 然而该步骤也并非网络服务器用于确定 用户群组所必不可少的步骤。 例如, 当网络服务器本身是仅适用于某个或 某些用户群组的内部网服务器, 则用户无需任何认证程序即可被认为可以 利用该网络服务器上的群组语料库。 In the first example of the present embodiment, the above method may further include a registration step S2106 on the user equipment side, that is, the user may be registered to one or more of the network servers by the network communication device 1404 before the text input is performed. a user group, the user group being associated with the group corpus. Such a registration process may employ, for example, a group registration function as is known in the art. However, it may not be necessary to associate a user with a group without such a registration step. For example, since a user belongs to a business unit, when a group for that business unit is created (for example, by a department head), the department member including the user is automatically added to the group. In this way, when the user inputs text, the web server can immediately determine that it belongs to the business department group based on its identity, and searches for the relevant group entry from the corresponding group corpus. Step S2107 of user identity information, so that the network server determines the group of users associated with it, and thereby determines the group corpus associated with it. However, this step is also not an essential step for the web server to determine the user group. For example, when the web server itself is an intranet server that is only applicable to one or some groups of users, the user can be considered to be able to utilize the group corpus on the web server without any authentication procedures.
在本实施方式中,上述方法还可以包括如下步骤 S2108: 所述用户将自 己选定的词条发送到所述网络服务器, 以便所述网络服务器更新至与所述 用户所属用户群组相关联的群组语料库。 根据这一功能, 用户群组中的每 个成员可以向群组语料库提供自己的贡献, 例如新的词条、 自身累进的输 入习惯等, 这些资源可以以适当形式被收集在群组语料库的词条或词条属 性中, 供群组中其他成员参考或直接利用。 优选地, 对于用户群组中具有 较高权限或占主导地位的成员的贡献, 赋予较大的权重。 例如, 部门主管 所提供的新词条, 在反馈时具有较高优先级。  In this implementation manner, the foregoing method may further include the following step S2108: the user sends the selected item to the network server, so that the network server is updated to be associated with the user group to which the user belongs. Group corpus. According to this function, each member of the user group can provide its own contribution to the group corpus, such as new terms, self-entrancing input habits, etc. These resources can be collected in the appropriate language in the group corpus. In the bar or term attribute, for other members of the group to refer to or directly use. Preferably, a greater weight is assigned to contributions of members with higher authority or dominance in the user group. For example, new terms provided by department heads have higher priority in feedback.
在图 6 中, 以虚线方框以及虚线连接线来表示上述可选步骤 S2106-S2108。  In Fig. 6, the above optional steps S2106-S2108 are indicated by dashed boxes and dashed connecting lines.
下面参照图 7、图 9描述本实施方式中网络服务器的结构和其操作流程。 框图。  The structure of the network server and the operation flow thereof in the present embodiment will be described below with reference to FIGS. 7 and 9. block diagram.
如图 9中所示, 网络服务器 250包括一个或多个群组语料库 2501 (为 筒化示图只示出了其中一个)、 匹配装置 2502、 网络通信装置 2503、 群组 管理装置 2504以及群组语料库管理和更新装置 2505。  As shown in FIG. 9, web server 250 includes one or more group corpora 2501 (only one of which is shown for the collapsed view), matching device 2502, network communication device 2503, group management device 2504, and group Corpus management and update device 2505.
如图 9中所示, 网络通信装置 2503通过网络连接到一个或多个用户设 备 140, 用于经由网络接收用户在用户设备上的按键输入序列, 并将基于该 按键输入序列所获得的群组词条选项反馈回所述用户设备, 供用户选择。 如上所述, 网络通信装置 2503还包括身份信息接收装置, 用于接收可选的 用户身份信息, 并将其转发给群组管理装置 2504以便其确定与之管理的用 户群组。 网络通信装置 2503还包括词条接收装置, 用于接收所述用户在输 入时最终选定的词条, 以便将其转发给群组语料库管理和更新装置 2505 , 使其利用这些词条来更新与所述用户所属用户群组相关联的群组语料库。 优选地, 该词条接收装置还用于接收与用户群组有关的材料, 以便将其转 发给群组语料库管理和更新装置 2505 , 使其利用这些材料来初始地构建所 述用户所属用户群组相关联的群组语料库。 As shown in FIG. 9, the network communication device 2503 is connected to one or more user devices 140 via a network for receiving a key input sequence of the user on the user device via the network, and the group obtained based on the key input sequence. The term option is fed back to the user device for selection by the user. As described above, the network communication device 2503 further includes identity information receiving means for receiving optional user identity information and forwarding it to the group management device 2504 for determining the management thereof. Household group. The network communication device 2503 further includes a term receiving means for receiving the term finally selected by the user upon input, for forwarding it to the group corpus management and updating means 2505 for updating the terms with the terms. A group corpus associated with the user group to which the user belongs. Preferably, the term receiving device is further configured to receive material related to the user group to forward it to the group corpus management and update device 2505 to use the materials to initially construct the user group to which the user belongs. Associated group corpus.
匹配装置 2502连接到用户群组语料库 2501、 匹配装置 2502和群组管 理装置 2504,用于基于所述按键输入序列在用户群组语料库 2501中进行匹 配查询获得一个或多个匹配的群组词条选项, 然后将该群组词条选项发送 至网络通信装置 2503 , 以返回给用户设备 140。 其中匹配装置 2502根据群 组管理装置 2503所确定的用户群组信息从多个群组语料库 2501 中选择输 入用户所对应的一个或多个群组语料库来进行匹配查询。 另外, 该匹配装 置 2502还包括优先级确定装置 (未示出),用于根据词条来源用户的优先级、 词条先前是否被选择过、 词条先前被选择的时间先后、 词条先前被选择的 次数、 用户预置的输入偏好选项和 /或词条在网络上的被搜索的次数来确定 所匹配的词条选项中各个词条的优先级。  The matching device 2502 is coupled to the user group corpus 2501, the matching device 2502, and the group management device 2504, for performing a matching query in the user group corpus 2501 based on the key input sequence to obtain one or more matching group entries. The option then sends the group entry option to the network communication device 2503 for return to the user device 140. The matching device 2502 selects one or more group corpora corresponding to the input user from the plurality of group corpora 2501 according to the user group information determined by the group management device 2503 to perform a matching query. In addition, the matching device 2502 further includes priority determining means (not shown) for prioritizing the user according to the entry source, whether the entry was previously selected, the time sequence in which the entry was previously selected, and the entry being previously The number of selections, the user-preset input preference options, and/or the number of times the term is searched on the network determines the priority of each of the matching term options.
群组管理装置 2504连接到匹配装置 2502、 网络通信装置 2503以及群 组语料库管理和更新装置 2505。群组管理装置 2504负责管理用户群组, 包 括从网络通信装置 2503接收用户注册信息, 并将用户注册到一个或多个用 户群组; 维护用户群组信息, 例如群组名称、 群组成员 ID、 群组所对应的 群组语料库编号等; 根据用户身份确定其所属群组, 并将确定结果发送至 匹配装置 2502帮助其选择进行匹配查询的群组语料库 2501。 另外,群组管 理装置还辅助群组语料库管理和更新装置 2505管理和更新词库, 例如将某 个用户在用户群组中的优先级别信息发送至群组语料库管理和更新装置 2505 , 后者根据该信息调整相关词条的优先级属性等。  The group management device 2504 is connected to the matching device 2502, the network communication device 2503, and the group corpus management and update device 2505. The group management device 2504 is responsible for managing user groups, including receiving user registration information from the network communication device 2503, and registering the user with one or more user groups; maintaining user group information, such as group name, group member ID The group corpus number corresponding to the group, etc.; determining the group to which the group belongs according to the identity of the user, and transmitting the determination result to the matching device 2502 to help it select the group corpus 2501 for performing the matching query. In addition, the group management apparatus also assists the group corpus management and update apparatus 2505 in managing and updating the thesaurus, for example, transmitting priority information of a certain user in the user group to the group corpus management and update apparatus 2505, the latter according to This information adjusts the priority attributes of related terms and the like.
群组语料库管理和更新装置 2505连接到网络通信装置 2503 ,用于接收 用户所发送的词条, 将其更新到群组语料库 2501的词条或其属性中。 优选 地, 群组语料库管理和更新装置 2505还可以从网络通信装置 2503接收与 用户群组有关的材料, 通过对其进行学习或训练, 来对群组语料库 2501进 行初始化或更新。 该功能对于进一步筒化用户输入, 减少系统构建分类词 库的开销来说是非常有用的。 例如, 对于一个从事半导体领域专利文件翻 译的群组, 用户可以上传一份包含常用半导体领域词汇如"蚀刻"、 "汽相沉 积"、 "涂覆 "等的资料。 群组语料库管理和更新装置 505利用该资料为该群 组初始化一个群组语料库 2501 , 从而不再需要其中成员在首次输入相关词 条时要付出的训练劳动。 The group corpus management and update device 2505 is connected to the network communication device 2503 for receiving The term sent by the user is updated to the term of the group corpus 2501 or its attribute. Preferably, the group corpus management and update device 2505 can also receive material related to the user group from the network communication device 2503, and initialize or update the group corpus 2501 by learning or training it. This feature is very useful for further downsizing user input and reducing the overhead of building a categorical lexicon. For example, for a group working on translation of patent documents in the semiconductor field, users can upload a file containing common semiconductor field terms such as "etching", "vapor deposition", "coating", and the like. The group corpus management and update device 505 uses the material to initialize a group corpus 2501 for the group, thereby eliminating the need for training labor for members to enter the relevant terms for the first time.
群组语料库 2501是本发明所引入的重要概念,其直接对应于用户群组, 通常每个用户群组对应于一个群组语料库。 群组语料库 2501包含其所对应 的用户群组中的群组成员最常用的词条, 将该语料库在群组成员中共享可 以使这些成员用户节省很多耗时费力的输入法训练步骤, 在其他成员的输 入基础上直接获得自己想要的输入结果。 在下文中将结合图 11对群组语料 库 2501的构成以及群组词条的属性进行详细说明。  The group corpus 2501 is an important concept introduced by the present invention, which directly corresponds to a user group, and typically each user group corresponds to a group corpus. The group corpus 2501 contains the most commonly used terms of the group members in the corresponding user group, and sharing the corpus in the group members can save these member users a lot of time-consuming and laborious input method training steps, in other The input of the member directly obtains the input result that you want. The composition of the group corpus 2501 and the attributes of the group entry will be described in detail below with reference to FIG.
图 7为根据本实施方式在网络服务器上辅助用户进行文字输入的方法 的流程图。  Fig. 7 is a flow chart showing a method of assisting a user to perform character input on a web server according to the present embodiment.
如图 7中所示,在步骤 S2201中,网络服务器 250的网络通信装置 2503 经由网络接收用户在用户设备 140上的按键输入序列;  As shown in FIG. 7, in step S2201, the network communication device 2503 of the web server 250 receives a key input sequence of the user on the user device 140 via the network;
在步骤 S2202 中, 基于所述按键输入序列和用户所加入的群组在对应 的群组语料库中进行匹配查询获得一个或多个匹配的群组词条选项;  In step S2202, a matching query is performed in the corresponding group corpus based on the key input sequence and the group joined by the user to obtain one or more matching group entry options;
在步骤 S2203中, 将所获得的群组词条选项反馈回所述用户设备 140, 供用户选择。  In step S2203, the obtained group term option is fed back to the user equipment 140 for selection by the user.
在本实施方式中,上述方法还可以包括网络服务器侧的注册步骤 S2204, 即在文字输入之前, 可以通过网络通信装置 2503接收用户的注册信息, 并 将用户注册到一个或多个用户群组, 所述用户群组与所述群组语料库相关 联。 这种注册过程例如可以采用本领域公知的群组注册功能。 优选地, 可 以根据各种标准即用户所具有的任何共性来建立用户群组, 例如从事相关 的工作、 完成同一项任务、 具有相似的爱好或者居住于同一个城市。 下文 中结合图 10详述了用户群组的例子。 In this embodiment, the foregoing method may further include a registration step S2204 on the network server side, that is, before the text input, the registration information of the user may be received by the network communication device 2503, and the user is registered to one or more user groups. The user group is related to the group corpus Union. Such a registration process may employ, for example, a group registration function as is known in the art. Preferably, user groups can be established according to various criteria, ie, any commonality that the user has, such as engaging in related work, completing the same task, having similar hobbies, or living in the same city. An example of a user group is detailed below in conjunction with FIG.
但是, 也可能不需要这样的注册步骤即可将用户与某个群组相关联。 例如, 由于用户属于某个业务部门, 针对该业务部门的群组在 (例如由部 门主管)创建时, 部门成员包括该用户即被自动添加到了该群组。 这样, 当用户在进行文字输入时, 网络服务器基于其身份可以立即确定其属于该 业务部门群组, 并从所对应的群组语料库中为其搜索相关群组词条。  However, it may not be necessary to associate a user with a group without such a registration step. For example, since a user belongs to a business unit and a group for that business unit is created (for example, by a department head), the department member including the user is automatically added to the group. Thus, when the user is typing text, the web server can immediately determine that it belongs to the business department group based on its identity, and searches for the relevant group entry from the corresponding group corpus.
在本实施方式中, 上述方法还可以包括接收所述用户发送的用户身份 信息的步骤 S2205 , 以便确定与之关联的用户群组,进而确定与之关联的群 组语料库。 然而该步骤也并非网络服务器 250用于确定用户群组所必不可 少的步骤。 例如, 当网络服务器 250本身是仅适用于某个或某些用户群组 的内部网服务器, 则用户无需任何认证程序即可被认为可以利用该网络服 务器 250上的群组语料库 2501。  In this embodiment, the method may further include the step S2205 of receiving user identity information sent by the user, to determine a group of users associated with the user group, and thereby determining a group corpus associated with the group. However, this step is also not an essential step for the web server 250 to determine the user group. For example, when the web server 250 itself is an intranet server that is only applicable to one or some groups of users, the user can be considered to utilize the group corpus 2501 on the web server 250 without any authentication procedures.
在本实施方式中, 上述方法还可以包括接收用户返回词条以更新群组 语料库 2501的步骤 S2206。 根据这一功能, 用户群组中的每个成员可以向 群组语料库提供自己的贡献, 例如新的词条、 自身累进的输入习惯等, 这 些资源可以以适当形式被收集在群组语料库的词条或词条属性中, 供群组 中其他成员参考或直接利用。 优选地, 对于用户群组中具有较高权限或占 主导地位的成员的贡献, 赋予较大的权重。 例如, 部门主管所提供的新词 条, 在反馈时具有较高优先级。  In this embodiment, the method may further include the step S2206 of receiving the user returning the entry to update the group corpus 2501. According to this function, each member of the user group can provide its own contribution to the group corpus, such as new terms, self-entrancing input habits, etc. These resources can be collected in the appropriate language in the group corpus. In the bar or term attribute, for other members of the group to refer to or directly use. Preferably, a greater weight is assigned to contributions of members with higher authority or dominance in the user group. For example, new terms provided by department heads have higher priority in feedback.
在图 7 中, 以虚线方框以及虚线连接线来表示上述可选步骤 S2204-S2206。  In Fig. 7, the above optional steps S2204-S2206 are indicated by dashed boxes and dashed connecting lines.
图 8为根据本实施方式的辅助用户进行文字输入的系统 230的框图。 如图 8所示, 系统 230包括网络服务器 250和用户设备 140, 网络服务器 250和用户设备 140通过网络连接。 网络代表使用例如 TCP/IP协议集来彼 此通信的全球范围内的网络和网关集合, 其可以是以主要节点或主机计算 机之间的高速数据通信线路的骨干网为核心的因特网, 其由成千上万的商 业、 政府、 教育和对数据和消息进行路由的其他计算机系统组成。 网络还 可以实现为大量不同类型的网络, 诸如, 例如, 内联网、 局域网 (LAN ) 或广域网(WAN )。 图 8意在作为一个示例, 而不是对不同示范性实施方式 的结构性限制。 FIG. 8 is a block diagram of a system 230 for assisting a user to perform text input in accordance with the present embodiment. As shown in FIG. 8, system 230 includes a web server 250 and user equipment 140, a network server. 250 and user equipment 140 are connected through a network. The network represents a worldwide network and set of gateways that communicate with each other using, for example, the TCP/IP suite of protocols, which may be the Internet centered on the backbone of high-speed data communication lines between the primary nodes or host computers, which are made up of thousands Tens of thousands of businesses, governments, education, and other computer systems that route data and messages. The network can also be implemented as a number of different types of networks, such as, for example, an intranet, a local area network (LAN), or a wide area network (WAN). FIG. 8 is intended as an example and is not a structural limitation of the various exemplary embodiments.
图 10示出了根据本实施方式在网络服务器上注册的用户群组的示意图。 图中示出了 4个用户群组, 分别为 "篮球同盟"群组 2601、 "正华花园小区" 群组 2602、 "半导体领域翻译"群组 2603以及"世博旅游"群组 2604。这些群 组分别代表了根据用户共同的兴趣爱好、 居住区域、 从事工作、 短期关注 来划分的用户群组。 实际上, 还可以有更多的分组标准, 例如同一个网络 游戏的玩家、 同一个大学的学生等等。 在现实生活中, 用户之间可能具有 千丝万缕的联系, 导致多个用户之间能够具有某种共性, 任何共性都可以 用来作为本发明输入方法中分类用户群组的基础, 区别仅仅在于该分类对 于群组成员的输入法效率的促进效果。 对于共同拥有很多为群组之外的人 所不了解的资源的群组来说, 这种群组之间的共享是最有效果的。 例如, 上面例子中专业性较强的半导体群组以及同玩网络游戏的玩家群组, 这些 群组中的群组成员之间的共同语言对于一般输入法来说常常是比较生僻的 词条, 诸如"无面素树脂"(半导体术语)、 "虫族、 神族"(网络游戏用语) 等。  FIG. 10 shows a schematic diagram of a group of users registered on a network server according to the present embodiment. The figure shows four user groups, namely "Basketball League" group 2601, "Zhenghua Garden Community" group 2602, "Semiconductor Domain Translation" group 2603, and "Expo Tour" group 2604. These groups represent user groups that are divided according to the user's common interests, living areas, work, and short-term concerns. In fact, there are more grouping criteria, such as players in the same online game, students in the same university, and so on. In real life, users may be inextricably linked, resulting in some commonality among multiple users. Any commonality can be used as the basis for classifying user groups in the input method of the present invention. It is the effect of the classification on the efficiency of the input method of the group members. For groups that share a lot of resources that are not known to people outside the group, sharing between such groups is most effective. For example, in the above example, the more professional semiconductor group and the group of players playing the online game, the common language between the group members in these groups is often a relatively uncommon entry for the general input method. Such as "no noodle resin" (semiconductor terminology), "Zerg, Protoss" (online game language).
如图所示, 用户群组 2601 ~2604各自包括其群组成员, 群组成员可以 利用自己的真实名字或网络 ID等来注册到群组。 不同的群组所包括的成员 可以有重复或交叉。 例如, "半导体领域翻译"群组 2603和"世博旅游"群组 2604均包括成员"李梦", "篮球同盟"群组 2601中的成员全部是"正华花园" 群组 2602的成员, 因为该篮球同盟就是该小区业主之间的一个组织。 如上所述, 各用户群组中被突出显示 (图中利用加粗字体表示) 的成 员 (例如, 群组 2601中的张小亮)具有较高优先级, 其他成员具有较低优 先级。 如果需要, 还可以设置更细化的优先级级别。 这种优先级区别主要 用于群组成员向群组语料库贡献词条时帮助确定词条的优先级。 网络服务 器在接收到群组成员反馈或贡献的词条时, 将根据其来源用户的优先级、 该词条的当前优先级、 最近被选择时间或次数来分配或更新该词条的优先 级。 词条优先级是词条属性中重要的一项, 用于确定向用户提供的群组词 条在备选框中出现的顺序。例如, "篮球同盟"群组的队长最近拟向群组成员 推广腰旗橄榄球运动来提高成员身体素质, 其作为来源用户提供的词条 "yq -腰旗,,将被赋予比某一成员提供的词条" yq -延庆"具有更高的优先级。 As shown, the user groups 2601 ~ 2604 each include their group members, and the group members can register with the group using their real names or network IDs and the like. Members included in different groups may have repetitions or intersections. For example, the "Semiconductor Domain Translation" group 2603 and the "Expo Tour" group 2604 each include a member "Li Meng", and all members of the "Basketball League" group 2601 are members of the "Zhenghua Garden" group 2602 because The basketball league is an organization between the owners of the community. As described above, members of each user group that are highlighted (indicated by a bold font in the figure) (e.g., Zhang Xiaoliang in group 2601) have a higher priority, and other members have a lower priority. You can also set a more granular priority level if needed. This priority distinction is primarily used to help group members prioritize the contribution of terms when contributing to the group corpus. When the web server receives the entry or contribution of the group member, the priority of the term will be assigned or updated according to the priority of the source user, the current priority of the entry, the most recently selected time or number of times. The term priority is an important item in the term attribute and is used to determine the order in which the group terms provided to the user appear in the candidate box. For example, the captain of the "Basketball League" group recently proposed to promote the flag football game to the group members to improve the physical fitness of the members. The term "yq-waist flag" provided by the source user will be given more than a member. The entry "yq-延庆" has a higher priority.
在图 10中的下半部分中, 还给出了这些用户群组所针对的群组语料库 所包含的部分词条, 以补充说明这种用户分组对于输入法的促进意义。 例 如,其中 "正华花园"群组所对应的群组语料库所包含词条有" xfxch -双福洗 车行"、 "syj -三友居饭店"、 "lj -丽家宝贝"这些小区周边商户的名称, 以 及" psy -潘山屹(物业经理名字) "、 "rjq -任坚强 (业务会主任) "这样的 小区业主熟知常用的词条。  In the lower part of Fig. 10, some of the terms contained in the group corpus for these user groups are also given to supplement the significance of this user grouping for the input method. For example, the group corpus corresponding to the "Zhenghua Garden" group contains the words "xfxch - Shuangfu car wash line", "syj - Sanyouju hotel", "lj - Lijia baby" The name, as well as the "psy - Pan Shanyu (property manager name)", "rjq - Ren Jianqiang (business director)" such community owners familiar with commonly used terms.
图 11中示出了根据本发明一个实施例的在网络服务器上维护的群组语 料库中群组词条的属性列表 2700的示意图。 该列表 2700中列出了用于某 个篮球同盟群组的群组语料库的多个词条。 第一列 2701 为该词条的内容, 如" zb -走步 (篮球技术术语 ) "、 "lb -篮板(篮球技术术语 ) "、 "bkl -巴 克利 (篮球明星名字) "以及 "huren -湖人(美国 NBA篮球队名字"。 除了 这些常用的篮球相关词汇, 该群组语料库还包括 "zhhb -正华杯(小区赛事 名称) " 、 "yq -腰旗(群组成员活动) "这样的特定群组用语。  A schematic diagram of an attribute list 2700 of group entries in a group corpus maintained on a web server in accordance with one embodiment of the present invention is shown in FIG. Multiple entries for the group corpus of a basketball league group are listed in this list 2700. The first column 2701 is the content of the entry, such as "zb - walking (basketball technical terminology)", "lb - rebounding (basketball technical terminology)", "bkl - Barkley (basketball star name)" and "huren - Lakers (American NBA basketball team name). In addition to these commonly used basketball related vocabulary, the group corpus also includes "zhhb - Zhenghua Cup (community event name)", "yq - waist flag (group member activity)" Specific group terms.
列表 2700中第二列至第第七列 2702 - 2707列出了词条的各个属性, 分别为群组标识、 优先级别、 被选次数、 最近被选择时间、 来源用户和 /或 目标用户。 其中需要重点说明的是, 优先级别、 来源用户和 /或目标用户三 个属性。 其中优先级别确定了当匹配到多个词条后以何种顺序向用户提供 词条, 优先级别较高的词条被排在靠前的位置, 并且可选地可以设定群组 词条均优于本地词条的优先级。 如前所述, 来源用户是指向语料库管理和 更新装置提供该词条的成员用户, 该属性将影响该词条的优先级设定。 至 于目标用户, 则是指可以向其提供该词条的用户范围。 例如, 可以设定, 仅将" xincheng -新成广场"提供给每周参加活动的篮球队成员, 而同样喜欢 篮球的外围成员则可以不共享该词条以免不必要地打扰这些用户。 又例如, 对于一个以工作内容为基础形成的群组, 某些词条的目标用户可以是全体 成员, 而另一些词条的目标用户可以仅仅是一部分职员, 而排除了与该词 条无关联的普通职员对于该词条的选择。例如,在针对"百度"公司群组的群 组语料库中, 词条" bdtb -百度贴吧"、 "ssyq -搜索引擎"的目标用户可以是 全体成员, 而词条 "zx -撰写"、 "qq _侵权"的目标用户则可以仅限定为该公 司知识产权管理职能部门成员, 以进一步提高群组词条反馈的准确性和用 户的输入效率。 The second to seventh columns 2702 - 2707 of list 2700 list the various attributes of the entry, which are group identification, priority level, number of times selected, time of most recent selection, source user, and/or target user. Among them, the key points are: priority level, source user and/or target user. Attributes. The priority level determines the order in which the user is provided with the entry after the multiple entries are matched, the higher priority entry is placed in the front position, and optionally the group entry can be set. Better than the priority of local terms. As mentioned earlier, the source user is a member user who provides the entry to the corpus management and update device, and this attribute will affect the priority setting of the entry. As for the target user, it refers to the range of users who can provide the term to it. For example, it can be set that only "xincheng-Xincheng Square" is provided to the basketball team members who participate in the activity every week, while peripheral members who also like basketball can not share the term to avoid unnecessarily disturbing these users. For another example, for a group formed on the basis of work content, the target user of some terms may be all members, and the target users of other terms may be only a part of the staff, and the association with the entry is excluded. Ordinary staff for the choice of the term. For example, in the group corpus for the "Baidu" company group, the target users of the terms "bdtb - Baidu Post Bar", "ssyq - search engine" can be all members, and the terms "zx - compose", "qq" The target user of _infringement can be limited to only members of the company's intellectual property management function to further improve the accuracy of group term feedback and user input efficiency.
前面提到, 词条优先级属性可以根据被选时间、 次数、 来源用户等来 进行动态调整, 以使得所返回的词条的顺序更符合该群组成员的输入习惯, 将最准确的词条选项返回给用户。 众所周知, 很多输入法为第一顺位的备 选词分配了最方便使用的空格键, 而下面顺序的备选词则需要通过数字键、 " + "和" - "或者其他快捷键来选择,而在重码很多的情况下更后面的词条甚 至需要用到翻页键。 如果能够将最准确的词条排在靠前位置甚至第一位, 无疑将大大节省用户的劳动, 提高用户的输入效率。  As mentioned earlier, the term priority attribute can be dynamically adjusted according to the selected time, number of times, source user, etc., so that the order of the returned terms is more in line with the input habits of the group members, and the most accurate terms will be The option is returned to the user. As we all know, many input methods assign the most convenient space key to the first-order alternative words, and the following alternative words need to be selected by numeric keys, " + " and " - " or other shortcut keys. In the case of a lot of re-codes, the later words even need to use the page-turning key. If you can put the most accurate terms in the front position or even the first place, it will undoubtedly save the user's labor and improve the user's input efficiency.
本发明的第二实施方式还提供一种用于供用户进行文字输入的系统, 其中包括本发明第二实施方式的用于输入文字的用户设备和本发明第二实 施方式的用于辅助用户设备输入文字的网络 务器。  A second embodiment of the present invention further provides a system for a user to input characters, including a user equipment for inputting characters and a second embodiment of the present invention for assisting user equipment according to the second embodiment of the present invention. Enter the text server.
本发明用于供用户进行文字输入的方法、 设备、 服务器及系统的第三 实施方式: 图 12为实现本实施方式以及本发明的一个典型的计算机网络的示意图, 示出的各类网络元素包括用户设备 311和 312, 互联网 321 , 路由器 331和 网络服务器 341~343。 下文中, 用户设备 311也称电脑 311 , 用户设备 312 也称手机 312。 本领域技术人员理解, 图 12中仅为筒明起见而示出的各类 网络元素的数量可能小于一个实际网络中的数量, 但这种省略无疑地是以 不会影响对本发明进行清楚、 充分的公开为前提的。 本领域技术人员还理 解, 图中还省略了一些其它类型的网络元素, 例如, 通常位于用户设备与 互联网之间的调制解调器(modern ), 接入设备(如 DSLAM )等。 The third embodiment of the method, device, server and system for the user to input text is provided by the present invention: 12 is a schematic diagram of a typical computer network implementing the present embodiment and the present invention. The various network elements shown include user equipments 311 and 312, Internet 321, 331 and network servers 341-343. Hereinafter, the user device 311 is also referred to as a computer 311, and the user device 312 is also referred to as a mobile phone 312. It will be understood by those skilled in the art that the number of various network elements shown in FIG. 12 for the sake of clarity may be less than the number in a real network, but this omission is undoubtedly clear and sufficient to the present invention. The disclosure is premised. Those skilled in the art will also appreciate that some other types of network elements are also omitted from the figure, such as a modem, an access device (e.g., a DSLAM), etc., typically located between the user equipment and the Internet.
图 12中, 虽然将三个网络服务器 341~343表示为由同一路由器 331提 供路由, 的适用: 各个网络 服务器 341~343 因所处地域不同的原因而归属于不同的路由器, 并分别服 务于不同区域的用户设备。 典型地, 可以为一个城市、 省份甚至一个国家 提供一个这样的服务器, 当然, 考虑到终端用户的分布的不均勾性, 服务 器的分布可以服从于终端用户的分布情况。 总之, 网络布局的任何变化都 落入本发明的范围和精神之内。 行设置的人机界面示意图。  In Fig. 12, although three network servers 341 to 343 are shown as being routed by the same router 331, the respective network servers 341 to 343 belong to different routers depending on the geographical area, and serve differently. User equipment in the area. Typically, one such server can be provided for a city, a province, or even a country. Of course, considering the heterogeneity of the distribution of end users, the distribution of servers can be subject to the distribution of end users. In summary, any changes in the network layout are within the scope and spirit of the invention. Schematic diagram of the human-machine interface set by the line.
如前所述, 本发明提供了一种为用户提供文字输入的解决方案, 实则 提供一种新的输入法, 这种输入法可以是计算机 311 使用的唯一的一种输 中进行选择。参看图 13a, 由附图标记标示的各个对象可以是 windows操作 系统中提供的若干按 ¾,其中,按¾ 3201是输入法的开关按鈕,结合图 13a~ 图 13c可以看出, 当用户控制鼠标 3205移至按 ¾ 3201上后, 点击左键, 即可改变按 ¾ 3201的显示状态并改变了该输入法的状态, 为了清楚地向用 户表明这种改变,图 13c中所示的表示激活的按¾ 3201,的颜色区别于图 13a 中表示未激活的按 ¾ 3201。 下文中, 将根据本实施方式的基于用户设备与 网络服务器的互动的输入法成为本输入法, 以便于表述。 As previously stated, the present invention provides a solution for providing text input to a user, and in fact provides a new input method that can be selected by the only type of input used by computer 311. Referring to Fig. 13a, each object indicated by the reference numeral may be a plurality of buttons provided in the Windows operating system, wherein the button is pressed by 3⁄4 3201, which can be seen in conjunction with Figs. 13a to 13c, when the user controls the mouse. After 3205 is moved to press 3⁄4 3201, click the left button to change the display state of 3⁄4 3201 and change the state of the input method. In order to clearly indicate the change to the user, the representation shown in Figure 13c is activated. The color according to 3⁄4 3201 is distinguished from the 3⁄4 3201 which is not activated in Figure 13a. Hereinafter, the user equipment based on the present embodiment and The interactive input method of the web server becomes the input method for easy representation.
在图 13a中, 还示出了按 ¾ 3202, 通过鼠标点击, 可以在中文和英文 输入之间进行切换。 本领域技术人员通过阅读上下文能够理解, 本发明并 不限于中文和 /或英文(如按 ¾ 3202' )的输入, 而及于几乎各种可以通过用 户设备输入的语言的文字, 本领域技术人员通过阅读上下文还能理解, 本 输入法用于输入中文时, 并不限于拼音输入, 而及于笔画、 五笔等其它中 文输入方式以及它们组合后形成的输入方式。  In Figure 13a, it is also shown that by 3⁄4 3202, a mouse click can be used to switch between Chinese and English input. Those skilled in the art will understand from the reading of the context that the present invention is not limited to the input of Chinese and/or English (e.g., according to 3⁄4 3202'), and to the characters of almost any language that can be input through the user equipment, those skilled in the art. It can also be understood by reading the context that when the input method is used to input Chinese, it is not limited to pinyin input, but also other Chinese input methods such as strokes, five strokes, and the input methods formed by them.
典型地, 本输入法的中英文输入切换也可以用键盘上的特定按键来控 制, 例如, 当键盘上的 CapsLock (CapsLK) 处于小写状态时, 本输入法将 为用户提供汉字或词语的选择列表, 用户通过键盘、 鼠标的进一步选择将 输入结果添加至相应的输入框中; 相对应的, 当 CapsLK处于大写状态时, 本输入法将不提供包括汉字的选择列表, 不失一般性地, 直接将用户按键 顺序所对应的英文字母序列添加在相应的输入框中。  Typically, the Chinese and English input switching of this input method can also be controlled by a specific button on the keyboard. For example, when CapsLock (CapsLK) on the keyboard is in a lowercase state, this input method will provide the user with a selection list of Chinese characters or words. The user adds the input result to the corresponding input box through further selection of the keyboard and the mouse; correspondingly, when CapsLK is in uppercase state, the input method will not provide a selection list including Chinese characters, without loss of generality, directly Add the English alphabet sequence corresponding to the user's key sequence to the corresponding input box.
在图 13a中, 还示出了按 ¾ 3203 , 通过鼠标点击, 可以在全角和半角 之间进行切换, 不失一般性地, 图 13a中的按 ¾ 3203的新月形图形表示半 角, 图 13b中的按 ¾ 3203'中的圆形表示全角。  In Fig. 13a, it is also shown that by 3⁄4 3203, the mouse can be switched between the full angle and the half angle. Without loss of generality, the crescent shape of the 3⁄4 3203 in Fig. 13a represents the half angle, Fig. 13b The circle in 3⁄4 3203' indicates the full angle.
在图 13a中, 还示出了按鈕 3204, 通过鼠标点击, 可以在中文标点符 号与英文标点符号之间进行切换, 典型地, 图 13a中的按 ¾ 3204表示中文 标点符号, 图 13b中的按 ¾ 3204,表示英文标点符号。  In Fig. 13a, a button 3204 is also shown, which can be switched between Chinese punctuation marks and English punctuation marks by a mouse click. Typically, the Chinese punctuation marks are represented by 3⁄4 3204 in Fig. 13a, and the pressing in Fig. 13b is shown in Fig. 13b. 3⁄4 3204, indicating English punctuation.
图 13a~13c 中任一附图所示的是一个控制栏, 其默认地展现在用户设 备屏幕的右下角, 并优选地可以被鼠标拖动。  Shown in any of Figures 13a-13c is a control bar that defaults to the lower right corner of the user device screen and is preferably draggable by the mouse.
图 13d所示的是另一种供用户对本输入法的开 /关进行配置的人机交互 界面, 其中具体的是一个基于 web的浏览器的一部分, 其中, 在"是否希望 使用百度在线输入法(本输入法) "一项之后, 提供了表示开启与关闭的两 个选项, 通过点选, 即可人工地控制本输入法是否激活。  Figure 13d shows another human-computer interaction interface for the user to configure the on/off of the input method, which is specifically a part of a web-based browser, in which "Whether you want to use Baidu online input method (This input method) After the item, two options for opening and closing are provided. By clicking, you can manually control whether this input method is activated.
本领域技术人员理解, 当用户通过以上方式或其等同替换方式将本输 入法激活, 即使计算机 311 预先安装了其它的输入法, 本输入法仍成为当 前的默认输入方式。 相对应地, 当本输入法进入关闭状态时, 为了保证计 算机 311上有可用的输入法, 计算机 311优选地自动将其它预先安装的输 入法例如 Google输入法激活。 当然, 其它输入法的激活也可以由用户来手 动控制, 从而选择希望切换至的其它输入法, 典型地, 参看图 13a, 当鼠标 右键点击按 ¾ 3201时, 出现一个弹出菜单, 其中列出了其它可用的各种输 入法供用户选择, 当用户选择了其中之一后, 按¾ 3201的位置将出现一个 表示新近选择的其它输入法的标志。 对此不赘。 作为一种替代方式, 本输 入法也可以是计算机 311唯一可用的输入法。 Those skilled in the art understand that when the user passes the above manner or its equivalent replacement The entry method is activated, even if the computer 311 has other input methods pre-installed, this input method becomes the current default input method. Correspondingly, when the input method enters the off state, in order to ensure that there is an input method available on the computer 311, the computer 311 preferably automatically activates other pre-installed input methods such as Google input. Of course, the activation of other input methods can also be manually controlled by the user to select other input methods that he wishes to switch to. Typically, referring to Figure 13a, when the right mouse button clicks 3⁄4 3201, a popup menu appears, which lists Other available input methods are available for the user to select. When the user selects one of them, a flag indicating the newly selected other input method will appear at the position of 3⁄4 3201. This is not a problem. As an alternative, the input method can also be the only input method available to the computer 311.
根据本实施方式, 本输入法可以在其它一些条件下被激活, 例如, 当 用户在图 13d的个性设置中开启了在线输入法后, 每当用户使用浏览器访 问一个预定的网络地址集合中的任一地址时, 本输入法即被激活。  According to the present embodiment, the input method can be activated under other conditions, for example, when the user turns on the online input method in the personality setting of FIG. 13d, each time the user accesses a predetermined network address set using the browser. This input method is activated at any address.
以下结合图 14并参照图 12对以计算机 311为例的用户设备中用于为 用户提供文字输入的方法进行介绍, 虽然以计算机 311 为例, 但本领域技 术人员理解, 同样的流程也可以在手机 312 中实现。 下文的描述会涉及到 网络服务器的一些内容, 这些内容还会在下文中有专门的详述。  Referring to FIG. 14 and FIG. 12, a method for providing text input to a user in a user equipment exemplified by the computer 311 is introduced. Although the computer 311 is taken as an example, those skilled in the art understand that the same process can also be performed. Implemented in mobile phone 312. The following description will cover some of the contents of the web server, which are also detailed below.
图 14所示的是根据本实施方式的在连接到一个网络服务器的用户设备 中供用户进行文字输入的方法流程图, 本实施方式中, 用户例如张三已经 将本输入法激活。 为了更形象地介绍这一过程, 还参照图 16。 应当理解, 用户设备与网络服务器之间的连接方式不限于高速、 稳定的有线连接, 也 包括无线连接或有线与无线混合的方式。  Figure 14 is a flow chart showing a method for a user to input text in a user equipment connected to a network server according to the present embodiment. In the present embodiment, the user, for example, Zhang San has activated the input method. To introduce this process more visually, reference is also made to Figure 16. It should be understood that the manner of connection between the user equipment and the network server is not limited to a high speed, stable wired connection, but also includes a wireless connection or a wired and wireless hybrid.
首先, 在步骤 S3301 中, 计算机 311获得用户提供的输入信息。 具体 地, 张三打开 IE浏览器, 首页 www.baidu.com自动被打开, 于是呈现出大 致如图 16所示的内容, 当然其中的搜索栏 3501应为空且图中位于搜索栏 下方的相关内容应忽略。 张三将光标移至搜索栏 3501 附近或其中的某处, 点击鼠标, 于是可以在搜索栏 3501中进行输入。 不失一般性地, 假设张三 依次敲击键盘上的以下按键, 其中每个按键敲击次数为 1 且按下时间低于 一个阈值: C,A,0,M,E,I,W,A,N,G。 First, in step S3301, the computer 311 obtains input information provided by the user. Specifically, Zhang San opens the IE browser, and the home page www.baidu.com is automatically opened, so that the content as shown in FIG. 16 is presented. Of course, the search bar 3501 should be empty and the map is located below the search bar. The content should be ignored. Zhang San moves the cursor to a position near or in the search bar 3501, clicks the mouse, and can then input in the search bar 3501. Without loss of generality, suppose Zhang San Tap the following buttons on the keyboard in turn, each of which has a tap count of 1 and the press time is below a threshold: C, A, 0, M, E, I, W, A, N, G.
根据本实施方式的第一实例, 张三的上述每一次敲击所提供的信息都 视为一个输入信息, 也即, 当张三按下 C键, 一个表示 C键被按下的输入 信息被计算机 311 获得。 以下, 用输入信息所表示的按键或按键组合来标 识该输入信息, 并采用输入信息" XX"的形式, 双引号部分表示相应的按键 或按键组合。 例如, 表示按下按键 C的输入信息, 称为输入信息 "C", 表示 依次按下按键 C,A,0,M,E,I的输入信息, 称为输入信息 "CAOMEI"。  According to the first example of the present embodiment, the information provided by each of the above-mentioned taps of Zhang San is regarded as one input information, that is, when the three presses the C key, one input information indicating that the C key is pressed is Computer 311 is obtained. Hereinafter, the input information is identified by a key or a combination of keys indicated by the input information, and is in the form of input information "XX", and the double quoted part indicates the corresponding key or key combination. For example, the input information indicating that the button C is pressed is referred to as input information "C", and the input information of the buttons C, A, 0, M, E, and I is sequentially pressed, and is referred to as input information "CAOMEI".
具体地, 步骤 S3301 的实现可以采用这样的方式, 由浏览器通过其中 的一段脚本或函数来检索用户提供的输入信息。 也即, 本输入法分布在计 算机 311上的用于获得输入信息的功能模块是由基于 web的浏览器来实现 的。 于是, 如果张三使用提供本输入法的相应浏览器, 那么他在下载和安 装该浏览器的同时, 也就下载和安装了实现该输入法在客户端的功能模块, 即可开始基于本输入法进行文字输入。  Specifically, the implementation of step S3301 may be performed in such a manner that the browser retrieves input information provided by the user through a script or function therein. That is, the functional modules of the input method distributed on the computer 311 for obtaining input information are implemented by a web-based browser. Therefore, if Zhang San uses the corresponding browser that provides this input method, then when downloading and installing the browser, he downloads and installs the function module that implements the input method on the client, and can start based on the input method. Enter text.
上述的浏览器的特定脚本或函数是可以替代的, 例如, 计算机 311 可 以安装独立于浏览器的应用程序, 其类似于本输入法的客户端软件, 根据 本实施例, 该客户端软件主要负责上述的输入信息的获取以及下文中还将 提到的输入信息发送等后续操作。  The specific script or function of the above browser may be replaced. For example, the computer 311 may install a browser-independent application, which is similar to the client software of the input method. According to the embodiment, the client software is mainly responsible for Subsequent operations such as acquisition of the above input information and transmission of input information which will be mentioned later.
接着,在步骤 S3302中,计算机 311将输入信息" C"发送给网络服务器。 参看图 12, 该输入信息具体到达哪个网络服务器将取决于所示网络的具体 配置, 以及路由算法等等。 最直接的一种做法是, 为每一个 IP地址段的用 户设备指定一个网络服务器, 于是, 当计算机 311明确了自己的 IP地址以 后, 就可以知道该向哪个网络服务器发送输入信息, 或者, 预先将 IP地址 段与网络服务器的对应关系保存在路由设备中, 在从来自计算机 311的 IP 包中获知计算机 311的 IP地址后, 查询该对应关系, 即可确定作为目的地 的网络服务器; 作为多种替代方式之一, 输入信息可以发往多个网络服务 器, 再由这些网络服务器之间进行通信来确定一个执行后续操作的网络服 务器; 作为多种替代方式中的另一种, 图 12所示的网络服务器互有分工, 当张三通过图 13a~13c的人机界面设置了输入语言时,计算机 311发出的输 入信息可以携带相应的语言例如中文的标识, 于是, 由中文输入服务器(如 有)来负责后续操作。 另外, 应当理解, 本发明不限制对网络服务器的选 择方式及用户设备与网络服务器之间的交互方式, 例如, 这种交互可以基 于 IP协议, 也可以基于其它用于互联网的通信协议。 Next, in step S3302, the computer 311 transmits the input information "C" to the web server. Referring to Figure 12, which network server the input information specifically depends on depends on the specific configuration of the network shown, as well as routing algorithms and the like. The most direct way is to specify a network server for each user equipment of the IP address segment. Therefore, when the computer 311 clears its own IP address, it can know which network server to send the input information to, or The correspondence between the IP address segment and the network server is stored in the routing device. After the IP address of the computer 311 is obtained from the IP packet from the computer 311, the corresponding relationship is queried, and the network server as the destination can be determined; One of the alternatives, input information can be sent to multiple network services And then, the network servers communicate with each other to determine a network server that performs subsequent operations; as another of the various alternatives, the network servers shown in FIG. 12 have a division of labor, when Zhang San passes through FIG. 13a. When the human-machine interface of 13c sets the input language, the input information sent by the computer 311 can carry the corresponding language, such as the Chinese identifier, and then the Chinese input server (if any) is responsible for the subsequent operations. In addition, it should be understood that the present invention does not limit the manner of selection of the network server and the manner of interaction between the user equipment and the network server. For example, such interaction may be based on the IP protocol, or may be based on other communication protocols for the Internet.
参看图 15 , 其中, 计算机 311发出的输入信息在步骤 S3401中由一个 网络服务器例如图 12中的网络服务器 341 (以下筒称服务器 341 )接收。  Referring to Fig. 15, the input information sent from the computer 311 is received by a network server such as the network server 341 (hereinafter referred to as the server 341) in Fig. 12 in step S3401.
随后的步骤 S3402中,服务器 341基于输入信息 "C"来在词典数据库中 进行匹配查询, 以生成备选输入项集合。 其中, 根据不同的输入方式, 服 务器 341使用不同的算法来对输入信息 "C"进行翻译, 这些输入方式包括一 般的英文字母输入、 中文拼音、 中文笔画等等。 以英文输入为例, 如不考 虑联想输入, 那么服务器 341将生成包含一个备选输入项即英文字母 C的 备选输入项集合。 如果考虑联想输入, 那么这个集合将包括以字母 c开头 的至少一个单词。 如果采用中文拼音输入, 则备选输入项集合将包括拼音 时首字母为 C的各个中文字。  In the subsequent step S3402, the server 341 performs a matching query in the dictionary database based on the input information "C" to generate an alternative set of input items. Among them, according to different input modes, the server 341 uses different algorithms to translate the input information "C", which includes general English alphabet input, Chinese pinyin, Chinese strokes and the like. Taking the English input as an example, if the association input is not considered, the server 341 will generate a set of alternative entries containing an alternate entry, the English letter C. If you consider association input, then this collection will include at least one word beginning with the letter c. If Chinese Pinyin is used, the set of alternative entries will include the Chinese characters with the initial letter C in Pinyin.
在步骤 S3403中, 服务器 341将生成的备选输入项集合发送回计算机 311。 同样的, 步骤 S3403中的发送过程可以基于 WEB实现, 于是, 备选 输入项集合将被封装在 http协议下的传输单元中进行发送。 可选地, 这种 发送也可以以即时信息 (IM ) 的方式发送, 例如小 i机器人与用户端之间 的互动方式。本例中,不妨假设张三进行中文输入,于是这个集合包括"从、 此、 才、 处、 、 吃、 出、 成、 车、 差...,,等中文字, 其中的每一个中文字 成为一个备选输入项或筒称输入项。 在下文中, 为输入信息和通过输入该 输入信息而得到的备选输入项集合使用相同的标识方式, 以筒要地明确它 们的对应关系, 于是上述集合称为集合" C"。 参看图 14, 服务器 341发回的备选输入项发回计算机 311后, 在步骤 S3303中由计算机 311接收。 In step S3403, the server 341 sends the generated set of alternative entries back to the computer 311. Similarly, the sending process in step S3403 can be implemented based on WEB, and thus, the set of alternative input items will be encapsulated in the transmission unit under the http protocol for transmission. Alternatively, the transmission may also be sent in the form of instant messaging (IM), such as the interaction between the small i-robot and the client. In this case, let's assume that Zhang San performs Chinese input, so this collection includes "from, this, talent, place, eat, out, into, car, poor...,, etc. Chinese characters, each of which is in Chinese. Became an alternative input item or a cartridge input item. In the following, the same identification method is used for the input information and the set of alternative input items obtained by inputting the input information, so as to clarify their correspondences, so The collection is called the collection "C". Referring to Figure 14, after the alternate entry sent back by the server 341 is sent back to the computer 311, it is received by the computer 311 in step S3303.
随后的步骤 S3304中, 计算机 311将该集合通知给张三, 这一步骤可 以采用任何已知的计算机提供人可读信息的技术手段, 典型的例子如屏幕 显示、 扬声器播放等。 不失一般性地本例以屏幕显示为例。  In the subsequent step S3304, the computer 311 notifies the set to the set. This step can employ any known technical means for the computer to provide human readable information, such as screen display, speaker playback, and the like. Without loss of generality, this example uses a screen display as an example.
由于集合" C"中包含较多的中文字, 可能难以在一个提示栏 (如图 16 中附图标记 3505所指) 中完整显示, 因此, 可以在提示栏中每行显示 5个 备选输入项, 并在之前给予顺序号标记, 以方便用户通过按下键盘上的数 字键进行选择。 另外, 提示栏还将包括一个按¾ , 方便用户通过鼠标点击 来显示下一行的备选输入项。 优选地, 用户也可以通过按下键盘上的指定 键来命令显示下一行的备选输入项, 例如 pagedown键。  Since the collection "C" contains more Chinese characters, it may be difficult to display them completely in a prompt bar (as indicated by reference numeral 3505 in Figure 16). Therefore, 5 alternative inputs can be displayed in each line in the prompt bar. Item, and give the sequence number mark before, so that the user can select by pressing the number keys on the keyboard. In addition, the prompt bar will also include a 3⁄4, which is convenient for the user to click on the mouse to display the next line of alternative entries. Preferably, the user can also command to display an alternate entry for the next line by pressing a designated key on the keyboard, such as a pagedown key.
之后,张三会通过鼠标、键盘操作来从这些备选的输入项中选择一个, 并通过按下鼠标左键或相应数字键的方式确认, 从而给予计算机 311 一个 指示信息, 例如, 当鼠标悬停在中文字 "从"上时点击左键, 就给予计算机 311—个指示"从"为被选择的输入项的指示信息。  After that, Zhang San will select one of these alternative inputs by mouse and keyboard operation, and confirm by pressing the left mouse button or the corresponding numeric key, thereby giving the computer 311 an indication information, for example, when the mouse is hovering When the left button is clicked on the Chinese character "From", the computer 311 is given an instruction indicating that "From" is the selected entry.
于是,在步骤 S3305中,计算机 311接收到张三提供的这一指示信息, 并据此在步骤 S3306 中将中文字"从"作为此次输入的输入结果, 将其显示 在用户进行输入时指定的位置, 例如, 浏览器的搜索栏中。  Then, in step S3305, the computer 311 receives the indication information provided by Zhang San, and accordingly, in step S3306, the Chinese character "from" is used as the input result of the input, and is displayed when the user inputs the designation. The location, for example, in the search bar of the browser.
在步骤 S3307中,计算机 311还将这一信息发送给网络服务器 341。根 据本发明的一个具体实施例, 参看图 15 , 网络服务器 341根据张三从备选 输入项集合" C" (以下筒称集合" C" ) 中选择 "从"这一信息, 来对自身保存 的词典数据库进行训练和更新。于是,在本输入法拥有可观的用户群体时, 就能学习全体用户的选择, 动态地更新词库, 例如, 将一段时间内有超过 预定数量的用户输入的一个新词如"犀利哥 "加入到词库中。  In step S3307, the computer 311 also transmits this information to the web server 341. According to a specific embodiment of the present invention, referring to FIG. 15, the web server 341 saves itself according to the information selected by Zhang San from the alternative input item set "C" (hereinafter referred to as the set "C"). The dictionary database is trained and updated. Therefore, when the input method has a considerable user group, the user's choice can be learned, and the vocabulary can be dynamically updated, for example, a new word with more than a predetermined number of users input for a period of time, such as "sharp brother". In the thesaurus.
根据本发明的另一具体实例, 张三并未从集合" C"中做出选择, 而是又 按下了键盘上的 A键, 如本领域技术人员所知的, 此时备选输入项集合将 会收敛。 具体地: 作为相互替代的方式之一: According to another embodiment of the present invention, Zhang San does not make a selection from the set "C", but instead presses the A key on the keyboard, as is known to those skilled in the art, the alternative entry at this time Collection will Will converge. Specifically: as one of the alternatives:
计算机 311再次执行步骤 S3301 , 得到输入信息 "A";  The computer 311 performs step S3301 again to obtain input information "A";
进入步骤 S3302,计算机 311将输入信息" A"也发送至网络服务器 341; 此后, 执行步骤 S3402, 其中, 网络服务器 341将之前的输入信息" C" 和新的输入信息" A"进行整合,得到整合结果 "CA",或称一个新的输入信息 备选输入项所构成的集合" CA"返回给计算机 311。此后的过程与上述类似, 在此不赘。  Proceeding to step S3302, the computer 311 also transmits the input information "A" to the web server 341; thereafter, step S3402 is performed, wherein the web server 341 integrates the previous input information "C" with the new input information "A" to obtain The result "CA" of the integration result "CA", or a new input information alternative entry, is returned to the computer 311. The process thereafter is similar to the above, and it is not awkward here.
作为相互替代的方式之一, 计算机 311在首次执行步骤 S3303时, 将 集合" C"緩存起来, 优选地, 当张三在步骤 S3305中提供了指示信息的情况 下, 緩存的集合" C"被清除。 反之, 计算机 311将会依赖緩存中的集合" C" 来响应张三进一步的输入。 这种依赖緩存的场景例如: 张三依次输入 "C", "A", "0"而后才选择一个输入项例如"草,,,或者张三依次输入 "C", "A", "I", 而后输入退格, 又输入 "0", 诸如此类。 采用这种方式, 可以适当减少本输 入法下的用户端与服务器端在通信链路上的交互, 以降低对网络资源的占 用。  As one of the alternative ways, the computer 311 caches the set "C" when the step S3303 is first executed. Preferably, when the third provides the indication information in step S3305, the cached set "C" is Clear. Conversely, computer 311 will rely on the set "C" in the cache to respond to further input. This kind of cache-dependent scene, for example: Zhang San enters "C", "A", "0" in turn and then selects an input item such as "grass,,, or Zhang" to input "C", "A", "I". ", then enter the backspace, enter "0", and so on. In this way, the interaction between the client and the server on the communication link under the input method can be appropriately reduced to reduce the occupation of network resources.
在以上的例子中, 用户每次按键的操作都触发了一个对应于单次按键 的输入信息的获得和发送。 以下介绍这个例子的一个变化例。  In the above example, the user's operation each time a key triggers the acquisition and transmission of an input message corresponding to a single key. A variation of this example is described below.
具体地, 计算机 311 将在用户提示它截取输入信息时, 才将之前已经 输入的序列作为输入信息, 并发送给服务器 341。 例如, 用户依次输入 C,A,0,M,E,I,W,A,N,G, 最后按下空格, 这个按下空格的操作即可触发计算 机 311将" C,A,0,M,E,I,W,A,N,G"整体地作为输入信息, 发送给服务器 341。 本实施方式中, 执行步骤 S3304后, 计算机 311上的屏幕显示不失一般性 地如图 16所示, 其中, 搜索栏 3501就是用户将光标移动到的位置, 并在 此进行输入,可以看到,输入信息栏 3502中的" caomeiwang"也随服务器 341 返回的备选输入项集合一起显示在屏幕上, 并且, 音节与音节之间还用上 标", "分隔开, 以给用户更加清晰的体验。提示栏 3505中各个备选输入项如 3503和 3504分别以顺序编号加以标识。 如果用户选择 "草莓网", 则"草莓 网"这三个中文字将会最终出现在搜索栏 3501中,优选地,光标将会位于 "网" 字之后。 Specifically, the computer 311 will take the sequence that has been previously input as the input information when the user prompts it to intercept the input information, and sends it to the server 341. For example, the user inputs C, A, 0, M, E, I, W, A, N, G in sequence, and finally presses a space. This operation of pressing a space triggers the computer 311 to "C, A, 0, M. , E, I, W, A, N, G" are collectively transmitted as input information to the server 341. In this embodiment, after performing step S3304, the screen display on the computer 311 is not lost in general as shown in FIG. 16, wherein the search bar 3501 is the position where the user moves the cursor, and input here, and can be seen. The "caomeiwang" in the input information bar 3502 is also displayed on the screen along with the set of alternative entries returned by the server 341, and is also used between the syllable and the syllable. The mark "," is separated to give the user a clearer experience. The various alternative entries in the prompt bar 3505, such as 3503 and 3504, are identified by sequential numbers, respectively. If the user selects "Strawberry Net", the three Chinese characters "Strawberry Net" will eventually appear in the search bar 3501. Preferably, the cursor will be located after the "Net" word.
根据本实施方式中, 本输入法的操作过程还可与用户的身份信息相关 联, 以下仍参照图 14和图 15进行说明。  According to the present embodiment, the operation of the input method can also be associated with the identity information of the user, which will be described below with reference to Figs. 14 and 15.
优选地, 本输入法或者与本输入法相关联的应用程序为用户提供表示 其身份信息的人机界面,例如,在用户希望激活本输入法时,执行步骤 S3308 , 提供一个登陆界面, 用户通过输入用户名和密码来向网络服务器 341 进行 认证, 在获取用户的身份信息后, 在步骤 S3309中发送给网络服务器 341 , 如果认证通过, 则网络服务器 341 将会响应用户此后的文字输入操作, 且 这一过程还考虑该用户的身份信息。 以下还会对此进行详述。 当然, 网络 服务器 341得到用户的身份信息的方式不限于此, 例如, 计算机 311可以 访问操作系统或其它应用程序中的用户身份信息, 如果这种访问可以被允 许, 则即可从中得到用户的身份信息, 并报告给服务器 341 , 此种方式更适 用于计算机 311用于家庭或其它私人用途的情况。  Preferably, the input method or the application associated with the input method provides the user with a human-machine interface indicating the identity information thereof. For example, when the user wishes to activate the input method, step S3308 is performed to provide a login interface, and the user passes Entering the user name and password to authenticate to the web server 341, and after obtaining the identity information of the user, sending it to the web server 341 in step S3309, if the authentication is passed, the web server 341 will respond to the text input operation of the user thereafter, and this A process also considers the identity information of the user. This will be detailed below. Of course, the manner in which the web server 341 obtains the identity information of the user is not limited thereto. For example, the computer 311 can access user identity information in the operating system or other applications, and if such access can be allowed, the identity of the user can be obtained therefrom. The information is reported to the server 341, which is more suitable for the case where the computer 311 is used for home or other private use.
张三的身份信息在步骤 S3406中由服务器 341接收到。 根据这一身份 信息, 可以在步骤 S3407中调取张三的输入历史记录, 这是服务器 341保 存的过去一段时间内张三用本输入法输入的所有文字, 其中包括张三历次 从备选输入项集合中选择的那些输入项。  The identity information of Zhang San is received by the server 341 in step S3406. According to this identity information, the input history record of Zhang San can be retrieved in step S3407, which is all the characters input by the server 341 in the past period of time, and the input of the input method by the input method, including the three input times from the alternate input. Those inputs selected in the item collection.
步骤 S3407中调取的输入历史记录可以作用于步骤 S3402中的备选输 入项集合的生成过程, 具体地: 在步骤 S3402中, 计算机 311首先基于输 入信息例如" C"在词典数据库中进行匹配查询, 得到初步查询结果, 其内容 与上述实例中的备选输入项集合" C"相同。 接着, 计算机 311根据张三的输 入历史记录, 对初步查询结果进行处理, 以生成本实例中的备选输入项集 合" C"。 典型地, 计算机 311将输入历史记录与初步查询结果进行对比, 将 初步查询结果中的同样包含在输入历史记录中的内容(输入项)排列在优 先于其它内容(输入项)的位置, 以本实例中的备选输入项集合" c"。 可选 地, 这些输入历史记录可以与当时的输入信息相对应, 于是, 在进行初步 查询结果与输入历史记录的比对时, 可以仅参照对应于当前的输入信息的 那些输入历史记录来排列各个输入项的位置进而生成备选输入项集合。 The input history record retrieved in step S3407 may be applied to the generation process of the candidate input item set in step S3402. Specifically, in step S3402, the computer 311 first performs a matching query in the dictionary database based on the input information such as "C". , get the preliminary query result, the content is the same as the alternative input item set "C" in the above example. Next, the computer 311 processes the preliminary query result according to the input history of Zhang San to generate the alternative input item set "C" in the present example. Typically, computer 311 compares the input history with the preliminary query results, The contents (input items) also included in the input history in the preliminary query result are arranged in a position in preference to other contents (input items), and the alternative input item set "c" in this example. Optionally, the input history records may correspond to the input information at the time, and thus, when the preliminary query result is compared with the input history record, only the input history records corresponding to the current input information may be used to arrange each of the input history records. The location of the input item in turn generates a set of alternative input items.
用户的身份信息的另一示范性作用在于, 帮助服务器 341 根据步骤 S3404中接收到的指示信息, 来对该用户的输入历史记录进行更新, 例如, 将指示信息所标示的输入项加入到输入历史记录, 或者, 将输入信息与指 示信息所标示的输入项相关联地加入到输入历史记录。  Another exemplary function of the user's identity information is that the help server 341 updates the input history of the user according to the indication information received in step S3404, for example, adding the input indicated by the indication information to the input history. Recording, or, adding the input information to the input history in association with the input indicated by the indication information.
图 17为根据本实施方式的连接到网络服务器的供用户进行文字输入的 用户设备的框图, 以图 12所示的用户设备 311为例, 其中包括:  FIG. 17 is a block diagram of a user equipment connected to a network server for text input by a user according to the embodiment. The user equipment 311 shown in FIG. 12 is taken as an example, and includes:
获得装置 3111 , 用于获得用户提供的输入信息;  Obtaining device 3111, configured to obtain input information provided by a user;
第一发送装置 3112, 用于将所述输入信息发送至所述网络服务器, 所 述网络服务器基于该输入信息来向所述用户设备提供反馈信息;  a first sending device 3112, configured to send the input information to the network server, where the network server provides feedback information to the user equipment based on the input information;
第一接收装置 3113, 用于接收所述网络服务器发回的反馈信息; 通知装置 3114, 用于将所述反馈信息通知所述用户, 以用于进一步的 人机交互。  The first receiving device 3113 is configured to receive feedback information sent back by the network server, and the notification device 3114 is configured to notify the user of the feedback information for further human-computer interaction.
进一步地, 用户设备 311还包括:  Further, the user equipment 311 further includes:
身份获取装置 3115, 用于获取所述用户的身份信息;  An identity obtaining device 3115, configured to acquire identity information of the user;
第一发送装置 3112还用于, 将所述用户的身份信息发送给所述网络服 务器。  The first sending device 3112 is further configured to send the identity information of the user to the network server.
进一步地, 用户设备 311还包括:  Further, the user equipment 311 further includes:
第二接收装置 3116, 用于接收所述用户提供的指示信息, 其用于表示 所述用户在所述备选输入项集合中所选择的输入项, 并将所述输入项作为 所述用户输入的输入结果;  a second receiving device 3116, configured to receive indication information provided by the user, used to represent an input item selected by the user in the set of candidate input items, and use the input item as the user input Input result;
第一发送装置 3112还用于, 将所述指示信息发送给所述网络服务器。 其中, 用户设备 311供用户在基于 WEB的应用程序中进行文字输入。 其中,用户设备 311供用户在基于 WEB的浏览器程序中进行文字输入。 图 18为根据本实施方式的用于辅助一个用户设备的用户来进行文字输 入的网络服务器的框图, 例如图 12所示的服务器 341 , 其中包括: The first sending device 3112 is further configured to send the indication information to the network server. The user equipment 311 is used by the user to perform text input in the WEB-based application. The user equipment 311 is used by the user to perform text input in the WEB-based browser program. 18 is a block diagram of a network server for assisting a user of a user equipment to perform text input, such as the server 341 shown in FIG. 12, according to the present embodiment, including:
第三接收装置 3411 , 用于接收所述用户设备发来的由所述用户提供的 输入信息;  a third receiving device 3411, configured to receive input information provided by the user equipment and sent by the user;
生成装置 3412, 用于基于所述输入信息来在词典数据库中进行匹配查 询, 以生成一个备选输入项集合;  a generating device 3412, configured to perform a matching query in the dictionary database based on the input information to generate an alternative input item set;
第二发送装置 3413 ,用于将所述备选输入项集合发送给所述用户设备。 其中, 第三接收装置 3411还用于:  The second sending device 3413 is configured to send the set of candidate input items to the user equipment. The third receiving device 3411 is further configured to:
-接收所述用户设备发来的新的输入信息;  Receiving new input information sent by the user equipment;
生成装置 3412还用于: 将所述新的输入信息与之前接收到的输入信息 进行整合, 以得到整合结果; 基于所述整合结果来在所述词典数据库中进 行匹配查询, 以生成新的备选输入项集合;  The generating device 3412 is further configured to: integrate the new input information with previously received input information to obtain an integration result; perform a matching query in the dictionary database based on the integration result to generate a new preparation Select a set of input items;
第二发送装置 3413还用于, 将所述新的备选输入项集合发送给所述用 户设备。  The second transmitting device 3413 is further configured to send the new set of candidate input items to the user equipment.
进一步地, 其中, 第三接收装置 3411还用于, 接收所述用户设备发来 的所述用户的身份信息;  Further, the third receiving device 3411 is further configured to: receive identity information of the user sent by the user equipment;
服务器 341还包括: 调取装置 3414, 用于根据所述用户的身份信息, 调取所述用户的输入历史记录;  The server 341 further includes: a retrieval device 3414, configured to retrieve an input history record of the user according to the identity information of the user;
生成装置 3412还包括: 查询装置 34121 , 用于基于所述输入信息在所 述词典数据库中进行匹配查询, 得到初步查询结果; 处理装置 34122, 用于 根据所述用户的输入历史记录, 对所述初步查询结果进行处理, 以生成所 述备选输入项集合。  The generating device 3412 further includes: a querying device 34121, configured to perform a matching query in the dictionary database based on the input information to obtain a preliminary query result; and the processing device 34122 is configured to: according to the input history record of the user, The preliminary query results are processed to generate the set of candidate inputs.
其中, 处理装置 34122还用于: 将所述输入历史记录与所述初步查询 结果进行对比, 将所述初步查询结果中的同样包含在所述输入历史记录中 的内容排列在优先于其它内容的位置, 以生成所述备选输入项集合。 The processing device 34122 is further configured to: compare the input history record with the preliminary query result, and include the first query result in the input history record. The content is arranged in a position that is prioritized over other content to generate the set of alternative entries.
其中,第三接收装置 3411还用于,接收来自所述用户设备的指示信息, 其用于表示所述用户在所述备选输入项集合中所选择的输入项;  The third receiving device 3411 is further configured to receive indication information from the user equipment, where is used to indicate an input item selected by the user in the set of candidate input items;
该服务器 341还包括更新装置 3415 , 用于根据所述指示信息所表示的 输入项, 执行以下各项中的至少一项: -更新所述用户的输入历史记录; - 对所述网络服务器所存储的词典数据库进行训练和更新。  The server 341 further includes an updating means 3415 for performing at least one of the following items according to the input item indicated by the indication information: - updating the input history of the user; - storing the network server The dictionary database is trained and updated.
本发明的第三实施方式还提供一种用于供用户进行文字输入的系统, 其中包括本发明第三实施方式的用于输入文字的用户设备和本发明第三实 施方式的用于辅助用户设备输入文字的网络 务器。  A third embodiment of the present invention further provides a system for a user to input characters, including a user equipment for inputting characters and a third embodiment of the present invention for assisting user equipment according to the third embodiment of the present invention. Enter the text server.
本发明用于供用户进行文字输入的方法、 设备、 服务器及系统的第四 实施方式:  The fourth embodiment of the method, device, server and system for the user to input text is provided by the present invention:
图 19示出根据本实施方式一个方面的用于当用户进行文字输入时同时 提供与输入信息相关的搜索相关信息的用户设备 41。 也即, 当用户在用户 设备 41上进行文字输入时, 用户设备 41在根据用户输入序列提供相应输 入词条选项, 还根据用户设备的输入序列搜索相关的搜索相关信息, 如广 告信息、 网页信息、 旅游信息或地图信息。 以下以广告信息为例进行说明: 其中, 用户设备 41可以是任何一种可与用户通过键盘、 遥控器、 触摸 板、或声控设备进行人机交互的电子产品, 例如计算机、 智能手机、 PDA、 游戏机、 或 IPTV等。  Fig. 19 shows a user device 41 for simultaneously providing search related information related to input information when a user performs character input, according to an aspect of the present embodiment. That is, when the user performs text input on the user device 41, the user device 41 provides a corresponding input term option according to the user input sequence, and also searches for related search related information, such as advertisement information and webpage information, according to the input sequence of the user device. , travel information or map information. The following is an example of the advertisement information: wherein the user equipment 41 can be any electronic product that can interact with the user through a keyboard, a remote controller, a touch panel, or a voice control device, such as a computer, a smart phone, a PDA, Game consoles, or IPTV, etc.
用户设备 41中包括第一获取装置 411、 查询装置 412、 提供装置 413、 用于保存本地词库的存储装置 414 (为筒明起见, 以下筒称词库 414 )、 和 用户保存关键词广告库的存储装置 414, (为筒明起见, 以下筒称本地广告 库 414,, 或称为搜索信息库 414,)。 本领域技术人员应能理解, 存储装置 414和 414,可以为分离或同一存储器, 也可分别由一组存储器阵列来实现。  The user equipment 41 includes a first obtaining means 411, a querying means 412, a providing means 413, a storage means 414 for storing the local thesaurus (for the sake of clarity, the following dictionary is referred to as the thesaurus 414), and the user saves the keyword advertisement library. The storage device 414, (for the sake of clarity, the following is referred to as a local advertisement library 414, or referred to as a search information base 414,). Those skilled in the art will appreciate that storage devices 414 and 414 may be separate or identical, or may be implemented by a set of memory arrays, respectively.
具体地, 第一获取装置 411 通过任何一种可与用户进行人机交互的交 互设备来实时地获取用户正在输入的输入序列。 该交互设备可以是键盘、 遥控器、 触摸板或声控设备等。 以键盘为例, 但用户敲击键盘中按键进行 输入时,第一获取装置 411实时地获取用户敲击的按键序列(为筒明起见, 以下仍称输入序列 )。 Specifically, the first obtaining device 411 acquires an input sequence that the user is input in real time through any interactive device that can perform human-computer interaction with the user. The interactive device can be a keyboard, Remote control, touchpad or voice control device. Taking the keyboard as an example, when the user taps the key in the keyboard to input, the first obtaining device 411 acquires the key sequence of the user's tap in real time (for the sake of clarity, the input sequence is still referred to below).
查询装置 412将第一获取装置 411提供的用户输入序列与词库 414进 行匹配查询, 获得一个或多个匹配的输入词条选项。 以下以中文为例进行 说明, 本发明允许用户在按全拼、 双拼、 五笔等方法输入中文。 同时, 查 询装置 412还根据用户输入序列在关键词广告库 414,中进行搜索, 获得相 关的一个或多个广告信息选项。与例如, 当用户敲击按键输入 "woaiwaitan", 查询装置 412在词库 414中查询获得 "1 我爱外滩; 2 我爱"等词条组合, 同时在广告库 414,中查询获得与 "外滩 "有关的广告信息有 "外滩三号"、 "外 滩 18 号"等地标性建筑, 因此提供广告信息选项" 3 外滩三号; 4 外滩 18 号"。 本领域技术人员应理解, 查询与输入序列相关的广告信息 (或搜索相 关信息) 的过程可以采用目前公知的各种智能或模糊搜索算法, 在此不作 赘述。 本领域技术人员应理解, 查询与输入序列相关的广告信息 (或搜索 相关信息) 的过程可以采用目前公知的各种智能或模糊搜索算法, 在此不 作赘述。  The querying device 412 matches the user input sequence provided by the first obtaining device 411 with the thesaurus 414 to obtain one or more matching input term options. The following takes Chinese as an example for description. The present invention allows the user to input Chinese in the manner of full spelling, double spelling, and five strokes. At the same time, the querying device 412 also searches in the keyword advertisement library 414 based on the user input sequence to obtain the relevant one or more advertisement information options. For example, when the user taps the button to input "woaiwaitan", the query device 412 queries in the thesaurus 414 to obtain a combination of terms such as "1 I love the Bund; 2 I love", and in the advertisement library 414, the query is obtained with "The Bund". "The relevant advertising information includes landmarks such as "Bund No. 3" and "Bund 18", so the advertisement information option "3 Bund No. 3; 4 Bund No. 18" is provided. Those skilled in the art should understand that the process of querying the advertisement information (or searching for related information) related to the input sequence may employ various intelligent or fuzzy search algorithms that are currently known, and will not be described herein. Those skilled in the art should understand that the process of querying the advertisement information (or searching for related information) related to the input sequence may employ various intelligent or fuzzy search algorithms that are currently known, and will not be described herein.
提供装置 413 随后将查询装置获得的一个或多个匹配的输入词条选项 按一定顺序和格式提供给所述用户, 供其选择以作具体输入。 例如, 通过 在显示器的一个输入窗口栏中显示给用户是, 可将多个词条选项与输入序 列分栏显示, 多个词条选项可全部列入下一栏中供用户选择。 优选地, 可 以在词条栏中仅显示一行词条选项, 该行词条选项数目可以是缺省的也可 由用户设定, 通过由用户按动特定功能键显示上一行或下一行词条选项, 该特定功能键例如可以是" +"和" -"。  The providing means 413 then provides the user with one or more matching input term options obtained in the order in which they are selected for specific input. For example, by displaying to the user in an input window bar of the display, multiple entry options can be displayed in the input sequence, and multiple entry options can be included in the next column for the user to select. Preferably, only one line item option may be displayed in the term column, and the number of the line item option may be default or user-settable, and the previous line or the next line item option is displayed by the user pressing a specific function key. , the specific function keys can be, for example, "+" and "-".
优选地, 为便于用户注意, 广告信息选项在词条栏中可采用不同显示 方式, 例如不同颜色或灰度。 而且广告信息选项中内置有与该广告信息相 关的网页 IP地址或统一资源标示符(URL )。用户可通过按该选项相应的数 字键或通过鼠标移动光标至该选项处悬停或点击来选择该广告信息选项。 当用户选择该广告信息选项, 用户设备 41中的网址定向装置(未示出 )可 通过网络定向到其对应的网页网址, 例如在浏览器打开情形, 经由网络连 接到该网址对应的网页服务器, 并在浏览器中显示其网页给用户。 Preferably, for the convenience of the user, the advertisement information option may adopt different display manners in the entry bar, such as different colors or gray scales. Moreover, a web page IP address or a uniform resource identifier (URL) related to the advertisement information is built in the advertisement information option. The user can press the corresponding number of the option Select the ad information option by either the word key or by hovering the mouse to the option to hover or click. When the user selects the advertisement information option, the URL targeting device (not shown) in the user device 41 can be directed to its corresponding webpage URL through the network, for example, in a browser open situation, and connected to the web server corresponding to the webpage via the network. And display its web page to the user in the browser.
在本实施方式一实例中, 第一获取装置 411和查询装置 412和提供装 置 413之间是持续不断地工作。 具体地, 第一获取装置 411 实时地获取用 户的输入序列并持续不断地提供给查询装置 412 , 例如 "w"、 "wo" ... "wo" ... "woai" ... "woaiwaitan" , 查询装置 412也实时地对第一获取装 置 411 持续不断地提供的用户输入序列进行匹配查询, 以持续获取与上述 各输入序列相对应的词条选项,例如" w"对应" 1我、 2喔、 3握、 4窝"; "woai" 对应" 1我爱、 2喔、 3握、 4窝"; "woaiwaitan"对应" 1我爱外滩、 2外滩三 号、 3 外滩 18号,,。 在此, 本领域技术人员应理解"持续"是指在用户最终选 择一个词条选项前一直进行的动作方式, 例如用户在敲击按键序列" woai" 后可能稍停片刻, 如 0.5秒, 再继续敲击随后的按键。  In the first embodiment of the present embodiment, the first obtaining means 411 and the inquiring means 412 and the providing means 413 are continuously operated. Specifically, the first obtaining means 411 acquires the input sequence of the user in real time and continuously supplies it to the querying means 412, for example "w", "wo" ... "wo" ... "woai" ... "woaiwaitan The query device 412 also performs a matching query on the user input sequence continuously provided by the first obtaining device 411 in real time to continuously acquire the term options corresponding to the above input sequences, for example, "w" corresponds to "1 me, 2喔, 3 grips, 4 nests;; "woai" corresponds to "1 I love, 2喔, 3 grips, 4 nests"; "woaiwaitan" corresponds to "1 I love the Bund, 2 Bund 3, 3 Bund 18, Here, those skilled in the art should understand that "continuation" refers to the action that is performed until the user finally selects an entry option. For example, the user may pause for a while after tapping the key sequence "woai", such as 0.5 second. Continue to tap the subsequent buttons.
在另一个实例中, 查询装置 412在根据用户输入序列在词库 414和广 告库 414,中进行匹配查询获得多个输入词条选项和广告信息选项时还获得 其各自的优先级。 提供装置 413将查询装置 412提供的多个匹配的输入词 条选项和广告信息选项按优先级顺序在词条栏中显示给所述用户, 其中优 先级越高, 该输入词条选项或广告信息选项越靠前显示。 优选地, 为便于 用户进行文字输入, 优先级最好的输入词条选项一般置于最前位置, 使得 用户可通过筒单地按 "ENTER"或空格键来选择, 而广告信息选项通常置于 每行中较末尾选项位置。  In another example, the querying device 412 also obtains its respective priority when performing a matching query in the thesaurus 414 and the advertising library 414 based on the user input sequence to obtain a plurality of input term options and advertising information options. The providing device 413 displays the plurality of matched input term options and advertisement information options provided by the querying device 412 in the item column to the user in a priority order, wherein the higher the priority, the input term option or the advertisement information The more the option is displayed. Preferably, in order to facilitate the user to input text, the preferred input item option of priority is generally placed at the foremost position, so that the user can select by pressing "ENTER" or the space bar, and the advertisement information option is usually placed in each position. The position of the option at the end of the line.
具体地, 查询装置 412可以根据用户特征来在词库 414和广告库 414, 中进行查询, 获得匹配的输入词条选项和广告信息选项。 在获得匹配的多 个输入词条选项和广告信息选项后, 还可根据用户特征来确定其优先级高 低。 用户特征包括用户的输入历史记录、 用户设定的个人偏好选择、 用户 属性、 用户地址等, 用户属性包括用户的职业、 性别、 国际、 出生地、 年 龄等体现个人特性的信息。 查询装置 412还可根据用户输入历史记录中对 各个词条选项或词条选项中的词汇的选择频度、 各词条选项中各个词汇间 的文义关联性来确定其优先级高低。 查询装置 412也可根据用户设定的个 人偏好选择来确定优先级高低, 例如, 当用户设定输入偏好为: 优先级高 低: 购物 >饮食 >旅游, 则获取用户输入序列 "woaiwaitan"后, 查询装置在广 告库 14,中查询获得与 "waitan"对应的多个位于外滩的地标性建筑或旅游景 点, 如招商局总部、 汇丰大厦、 花旗银行、 外滩三号、 外滩 18号等, 随后 根据用户设定的个人偏好可判断"外滩三号"、 "外滩 18号,,等以购物、 餐饮 为主的建筑景点的优先级最高。 另外, 查询装置 412还可根据目前用户设 备的 IP地址来判断其所处的地域, 从而可以确定输入序列中与该地域相关 的词汇的优先级, 例如, 但用户输入序列为 "woxihuanbund" , 其中 "bund" 的译文有" 1堤岸 2码头 3同盟 4 (上海)外滩", 当查询装置 412根据用户 设备 IP地址获知目前位于中国上海市,从而可确定" bund"对应译文中"上海 外滩"或"外滩"优先级最高, 因而可提供如下输入词条选项" 1我喜欢上海外 滩; 2我喜欢外滩; 3我喜欢码头; 4我喜欢堤岸; 5我喜欢同盟"。 为筒明 起见,我们可将用户输入历史记录,用户设定个人偏好、计算机 IP地址(或 用户地址)等统称为用户特征, 且本领域技术人员应能理解, 用户特征包 括但不限于上述内容。 Specifically, the querying device 412 can query in the thesaurus 414 and the advertisement library 414 according to the user characteristics to obtain matching input term options and advertisement information options. After obtaining matching multiple input term options and advertising information options, the priority level can also be determined based on user characteristics. User characteristics include user's input history, user-set personal preferences, users Attributes, user addresses, etc. User attributes include information about the user's occupation, gender, international, place of birth, age, etc. that reflect personal characteristics. The querying device 412 can also determine the priority level according to the frequency of selection of the vocabulary in each term option or the term option in the user input history, and the semantic relevance between each vocabulary in each term option. The querying device 412 can also determine the priority according to the personal preference selected by the user. For example, when the user sets the input preference as: Priority: Shopping > Diet > Travel, after the user input sequence "woaiwaitan" is obtained, the query In the advertisement library 14, the device obtains a plurality of landmark buildings or tourist attractions located on the Bund corresponding to "waitan", such as China Merchants Headquarters, HSBC Building, Citibank, Bund No. 3, Bund 18, etc., and then according to users. The set personal preference can be judged as the highest priority of "Bund No. 3" and "Bund 18," such as shopping and catering. The query device 412 can also judge according to the IP address of the current user equipment. The region in which it is located, thereby determining the priority of the vocabulary associated with the region in the input sequence, for example, but the user input sequence is "woxihuanbund", where the translation of "bund" has "1 Embankment 2 Dock 3 Alliance 4 (Shanghai "Bund", when the query device 412 is known based on the IP address of the user equipment, currently located in Shanghai, China, and thus can determine the "bund" corresponding translation in "Shanghai "Or "The Bund" has the highest priority, so the following entry options are available: 1 I like to go to the Bund; 2 I like the Bund; 3 I like the pier; 4 I like the embankment; 5 I like the league.) For the sake of the tube, We may refer to user input history, user set personal preferences, computer IP address (or user address), etc. as user features, and those skilled in the art will appreciate that user features include, but are not limited to, the above.
本领域技术人员应能理解, 用户设备 41可以在本地存储器中保存上述 用户输入历史记录、用户设定输入偏好以及词汇间的各种关联性。优选地, 用户设备 41还可对所保存的用户输入历史记录、 输入偏好及词汇间关联性 等信息进行更新。 如图 20所示, 用户设备 41还包括第二获取装置 415、 更 新装置 416。其中第二获取装置 415通过与用户的进一步交互来获取该用户 对提供装置 413所提供多个输入词条选项的选择。 更新装置 416根据第二 获取装置 415提供的用户选择来更新词库和用户输入历史记录、 词汇间的 关联性等,例如可在词库 414中增加新词条选项和已有词条选项的优先级, 用户特征。 更优选地, 如用户设备可接入互联网, 第二获取装置 415还可 自行在互联网中搜寻新的词条组合, 并用以更新词库 414等。 Those skilled in the art will appreciate that the user device 41 can store the user input history, user set input preferences, and various associations between vocabulary in the local memory. Preferably, the user equipment 41 may also update information such as saved user input history, input preferences, and inter-vocabulary relevance. As shown in FIG. 20, the user equipment 41 further includes a second obtaining means 415 and an updating means 416. The second obtaining means 415 obtains the selection of the plurality of input term options provided by the providing device 413 by the user through further interaction with the user. The update device 416 updates the thesaurus and the user input history, vocabulary according to the user selection provided by the second obtaining device 415. Relevance, etc., for example, the new term option and the priority of the existing term option can be added to the thesaurus 414, user characteristics. More preferably, if the user equipment can access the Internet, the second obtaining means 415 can also search for a new combination of terms on the Internet and update the thesaurus 414 and the like.
在一个优选实例中, 用户设备 41中的广告库 414,可随时与周期性地进 行主动更新, 例如用户设备 41经由网络与一个或多个网络设备相连接, 并 同步。  In a preferred embodiment, the ad library 414 in the user device 41 can be actively updated at any time and periodically, e.g., the user device 41 is connected to and synchronized with one or more network devices via a network.
在另一优选实例中, 广告库 414,可以是位于用户设备 41的外部, 例如 位于一个网络设备处或分布于多个网络设备处, 用户设备 41可经由网络与 网络设备相连接, 从而查询与用户输入序列相关的广告信息选项。  In another preferred example, the advertisement library 414 may be located outside the user equipment 41, for example, at a network device or distributed among multiple network devices, and the user equipment 41 may be connected to the network device via the network, so as to query and The user enters a sequence of related advertising information options.
图 20示出根据本实施方式另一方面的用于当用户进行文字输入时同时 提供与输入信息相关的搜索相关信息的用户设备 41和网络设备 42,其中用 户设备 41经由网络与网络设备 42相连接, 该网络可以为互联网、 内部网 等。 也即, 当用户在用户设备 41上进行文字输入时, 用户设备 41经由网 络向网络设备 42发送查询请求, 请求网络设备 42根据用户输入序列搜索 相关的搜索相关信息, 如广告信息、 网页信息、 旅游信息或地图信息, 然 后将网络设备反馈的搜索相关信息与网络设备查询获得的输入词条选项一 起提供给用户。 以下以广告信息为例进行说明:  20 illustrates a user device 41 and a network device 42 for simultaneously providing search related information related to input information when a user performs character input according to another aspect of the present embodiment, wherein the user device 41 is associated with the network device 42 via a network. Connection, the network can be the Internet, intranet, etc. That is, when the user performs text input on the user equipment 41, the user equipment 41 sends a query request to the network device 42 via the network, requesting the network device 42 to search for relevant search related information, such as advertisement information, webpage information, according to the user input sequence. The travel information or the map information is then provided to the user along with the search related information fed back by the network device together with the input term option obtained by the network device query. The following is an example of advertising information:
在一个实例中, 用户设备 41中包括第一获取装置 411、 第一发送装置 417、第一接收装置 418、提供装置 413。网络设备 42包括第二接收装置 421、 查询装置 422、 第二发送装置 423、 用于保存网络词库的存储装置 424 (为 筒明起见, 以下筒称网络词库 424 )和用于保存关键词广告库的存储装置 424, (为筒明起见, 以下筒称网络广告库 424,)。  In one example, the user equipment 41 includes a first obtaining device 411, a first transmitting device 417, a first receiving device 418, and a providing device 413. The network device 42 includes a second receiving device 421, a querying device 422, a second transmitting device 423, a storage device 424 for saving the network lexicon (for the sake of clarity, the following is called a network vocabulary 424) and for storing keywords The storage device 424 of the advertisement library, (for the sake of clarity, the following is called the network advertisement library 424).
具体地, 第一获取装置 411 通过任何一种可与用户进行人机交互的交 互设备来实时地获取用户正在输入的输入序列。 该交互设备可以是键盘、 遥控器、 触摸板或声控设备等。 以键盘为例, 但用户敲击键盘中按键进行 输入时,第一获取装置 411实时地获取用户敲击的按键序列(为筒明起见, 以下仍称输入序列 )。 Specifically, the first obtaining device 411 acquires an input sequence that the user is input in real time through any interactive device that can perform human-computer interaction with the user. The interactive device can be a keyboard, a remote control, a touch pad or a voice control device, and the like. Take the keyboard as an example, but the user presses the button on the keyboard. When inputting, the first obtaining means 411 acquires the key sequence of the user's tap in real time (for the sake of clarity, the input sequence is still referred to below).
用户设备 41中的第一发送装置 417实时并持续不断地将第一获取装置 411提供的用户输入序列发送至网络设备 42。 网络设备 42中的第二接收装 置 421接收到该输入序列并提供给查询装置 422。查询装置 422将用户输入 序列与词库 424进行匹配查询, 获得一个或多个匹配的输入词条选项。 以 下以中文为例进行说明, 本发明允许用户在按全拼、 双拼、 五笔等方法输 入中文。 同时, 查询装置 422还根据用户输入序列在关键词广告库 424中 进行搜索, 获得相关的一个或多个广告信息选项。 与例如, 当用户敲击按 键输入 "woaiwaitan", 查询装置 422在词库 424中查询获得 "1 我爱外滩; 2 我爱"等词条组合, 同时在广告库 424中查询获得与 "外滩 "有关的广告信息 有"外滩三号"、 "外滩 18号"等地标性建筑, 因此提供广告信息选项" 3 外滩 三号; 4 外滩 18 号"。 本领域技术人员应理解, 查询与输入序列相关的广 告信息 (或搜索相关信息) 的过程可以采用目前公知的各种智能或模糊搜 索算法, 在此不作赘述。  The first transmitting device 417 in the user device 41 transmits the user input sequence provided by the first obtaining device 411 to the network device 42 in real time and continuously. The second receiving device 421 in the network device 42 receives the input sequence and provides it to the querying device 422. The querying device 422 performs a matching query on the user input sequence with the thesaurus 424 to obtain one or more matching input term options. The following is a description of the Chinese language. The present invention allows the user to input Chinese in the manner of full spelling, double spelling, and five strokes. At the same time, the querying device 422 also searches in the keyword advertisement library 424 according to the user input sequence to obtain related one or more advertisement information options. For example, when the user taps the button to input "woaiwaitan", the query device 422 queries the vocabulary 424 to obtain a combination of terms such as "1 I love the Bund; 2 I love", and in the advertisement library 424, the query is obtained with the "Bund". Relevant advertising information includes landmarks such as "Bund No. 3" and "Bund 18", so the advertisement information option "3 Bund No. 3; 4 Bund No. 18" is provided. Those skilled in the art should understand that the process of querying the advertisement information (or searching for related information) related to the input sequence may employ various intelligent or fuzzy search algorithms that are currently known, and will not be described herein.
网络设备 42 中的第二发送装置 423 也实时和持续不断地将查询装置 422提供的输入词条选项发送至用户设备 41。 用户设备 41中的第一接收装 置 419接收到所述输入词条选项并实时和持续地提供给提供装置 413 ,提供 装置 413 随后将获得的一个或多个匹配的输入词条选项按一定顺序和格式 提供给所述用户, 供其选择以作具体输入。 例如, 通过在显示器的一个输 入窗口栏中显示给用户是, 可将多个词条选项与输入序列分栏显示, 多个 词条选项可全部列入下一栏中供用户选择。 优选地, 可以在词条栏中仅显 示一行词条选项, 该行词条选项数目可以是缺省的也可由用户设定, 通过 由用户按动特定功能键显示上一行或下一行词条选项, 该特定功能键, 例 如可以是" +"和" -"。  The second transmitting device 423 in the network device 42 also transmits the input term options provided by the querying device 422 to the user device 41 in real time and continuously. The first receiving device 419 in the user device 41 receives the input term option and provides it to the providing device 413 in real time and continuously, and the providing device 413 then sequentially obtains one or more matching input term options in a certain order. The format is provided to the user for selection to make specific input. For example, by displaying to the user in an input window bar of the display, multiple entry options can be displayed in the input sequence, and multiple entry options can be included in the next column for the user to select. Preferably, only one line item option may be displayed in the term column, and the number of the line item option may be default or user-settable, and the previous line or the next line item option is displayed by the user pressing a specific function key. , the specific function keys, for example, can be "+" and "-".
优选地, 为便于用户注意, 广告信息选项在词条栏中可采用不同显示 方式, 例如不同颜色或灰度。 而且广告信息选项中内置有与该广告信息相 关的网页 IP地址或统一资源标示符(URL )。用户可通过按该选项相应的数 字键或通过鼠标移动光标至该选项处悬停或点击来选择该广告信息选项。 当用户选择该广告信息选项, 用户设备 1 可通过网络定向到其对应的网页 网址,例如在浏览器打开情形,经由网络连接到该网址对应的网页服务器, 并在浏览器中显示其网页给用户。 Preferably, for the user's attention, the advertisement information option can be displayed differently in the entry bar. Ways, such as different colors or grayscale. Moreover, a web page IP address or a uniform resource identifier (URL) related to the advertisement information is built in the advertisement information option. The user can select the ad information option by pressing the corresponding number key for the option or by hovering the mouse to hover or click on the option. When the user selects the advertisement information option, the user equipment 1 can be directed to its corresponding webpage URL through the network, for example, in the browser open situation, connected to the web server corresponding to the webpage via the network, and display the webpage to the user in the browser. .
优选地, 用户设备 41 中的第一获取装置 411、 第一发送装置 417、 第 一接收装置、 和网络设备 42中的第二接收装置 421、 查询装置 412和第二 发送装置 423之间是持续不断地配合工作。 具体地, 第一获取装置 411 实  Preferably, the first obtaining means 411, the first transmitting means 417, the first receiving means, and the second receiving means 421, the inquiring means 412 and the second transmitting means 423 of the network device 42 are continuously in between Constantly cooperate with the work. Specifically, the first obtaining device 411 is
"wo" ... "wo" ... "woai" ... "woaiwaitan" , 第一发送装置 417也实时和持续不 断地将各种输入序列发送给网络设备 42。 网络设备 42 中的第二接收装置 给查询装置 422,查询装置 422随即实时地对第一接收装置 421持续不断地 提供的用户输入序列进行匹配查询, 以持续获取与上述各输入序列相对应 的词条选项, 例如" w"对应" 1我、 2喔、 3握、 4窝"; "woai"对应" 1我爱、 2喔、 3握、 4窝"; "woaiwaitan"对应" 1我爱外滩、 2外滩三号、 3 外滩 18 号,,。 在此,本领域技术人员应理解 "持续,,是指在用户最终选择一个词条选 项前一直进行的动作方式, 例如用户在敲击 按键序列" woai"后可能稍停片 刻, 如 0.5秒, 再继续敲击随后的按键。 "wo" ... "wo" ... "woai" ... "woaiwaitan", the first transmitting device 417 also transmits various input sequences to the network device 42 in real time and continuously. The second receiving device in the network device 42 sends the query device 422, and the query device 422 then performs a matching query on the user input sequence continuously provided by the first receiving device 421 in real time to continuously acquire the words corresponding to the above input sequences. For example, "w" corresponds to "1 me, 2喔, 3 grips, 4 nests"; "woai" corresponds to "1 I love, 2喔, 3 grips, 4 nests"; "woaiwaitan" corresponds to "1 I love the Bund" 2, Bund 3, 3 Bund 18,. Here, those skilled in the art should understand that "continued, refers to the action that is always performed before the user finally selects an entry option, such as the user tapping the key sequence. After "woai", you may pause for a while, such as 0.5 seconds, and then continue to tap the subsequent buttons.
在本实施方式一个优选实例中, 查询装置 422在根据用户输入序列在 网络词库 424和网络广告库 424中进行匹配查询获得多个输入词条选项和 广告信息选项时还获得其各自的优先级。 用户设备 41中的提供装置 413将 网络设备 42提供的多个匹配的输入词条选项和广告信息选项按优先级顺序 在词条栏中显示给所述用户, 其中优先级越高, 该输入词条选项或广告信 息选项越靠前显示。 优选地, 为便于用户进行文字输入, 优先级最好的输 入词条选项一般置于最前位置, 使得用户可通过筒单地按' 'ENTER"或空格 键来选择, 而广告信息选项通常置于每行中较末尾选项位置。 In a preferred embodiment of the present embodiment, the querying device 422 also obtains its respective priority when performing a matching query in the network vocabulary 424 and the network advertisement library 424 according to the user input sequence to obtain a plurality of input term options and advertisement information options. . The providing device 413 in the user equipment 41 displays a plurality of matching input term options and advertisement information options provided by the network device 42 in the item column to the user in a priority order, wherein the higher the priority, the input word The top option or ad information option is displayed. Preferably, in order to facilitate the user to input text, the priority is the best The entry options are generally placed in the front position so that the user can select by pressing ''ENTER' or the space bar, and the ad information option is usually placed in the last option position in each line.
优选地, 当用户通过用户设备 41登录网络设备时, 网络设备 42的查 询装置 422可根据用户登录的 ID来获取用户特征。例如用户输入历史记录, 用户特定的用户词库、 用户设定的个人偏好、 用户属性信息等。 所述用户 特征可以保存在网络设备 42中, 也可保存在于网络设备 42相连接的其他 网络设备中。  Preferably, when the user logs in to the network device through the user device 41, the querying device 422 of the network device 42 can acquire the user feature according to the ID of the user login. For example, user input history, user-specific user vocabulary, user-set personal preferences, user attribute information, and the like. The user features may be stored in the network device 42 or in other network devices to which the network device 42 is connected.
随后, 查询装置 422可以根据用户特征来在网络词库 424和网络广告 库 424 中进行查询, 获得匹配的输入词条选项和广告信息选项。 具体地, 查询装置 422可根据用户输入历史记录中对各个词条选项或词条选项中的 词汇的选择频度、 各词条选项中各个词汇间的文义关联性来确定其优先级 高低。查询装置 422也可根据用户设定的个人偏好选择来确定优先级高低, 例如, 当用户设定输入偏好为: 优先级高低: 购物 >饮食 >旅游, 则获取用 户输入序列" woaiwaitan"后, 查询装置在网络广告库 424 中查询获得与 "waitan"对应的多个位于外滩的地标性建筑或旅游景点, 如招商局总部、 汇 丰大厦、 花旗银行、 外滩三号、 外滩 18号等, 随后根据用户设定的个人偏 好可判断"外滩三号"、 "外滩 18号"等以购物、 餐饮为主的建筑景点的优先 级最高。 另外, 查询装置 422还可根据目前用户设备的 IP地址来判断其所 处的地域,从而可以确定输入序列中与该地域相关的词汇的优先级,例如, 但用户输入序列为" woxihuanbund", 其中" bund"的译文有" 1堤岸 2码头 3 同盟 4 (上海)外滩", 当查询装置 412根据用户设备 IP地址获知目前位于 中国上海市,从而可确定" bund"对应译文中"上海外滩"或"外滩"优先级最高, 因而可提供如下输入词条选项" 1我喜欢上海外滩; 2我喜欢外滩; 3我喜欢 码头; 4我喜欢堤岸; 5 我喜欢同盟"。 为筒明起见, 我们可将用户输入历 史记录, 用户设定个人偏好、 计算机 IP地址(或用户地址)等统称为用户 特征, 且本领域技术人员应能理解, 用户特征包括但不限于上述内容。 优选地, 网络设备 42还可对所保存的用户输入历史记录、 输入偏好及 词汇间关联性等信息进行更新。 如图 21所示, 用户设备 41还包括第二获 取装置 415、 第三发送装置 418; 网络设备 42还包括第二接收装置 425和 更新装置 426。 其中用户设备 41中的第二获取装置 415通过与用户的进一 步交互来获取该用户对提供装置 413所提供多个输入词条选项的选择, 并 由第三发送装置 418发送至网络设备。更新装置 426根据第二接收装置 425 所接收的用户选择来更新词库和用户输入历史记录、 词汇间的关联性等, 例如可在网络词库 424 中增加新词条选项和已有词条选项的优先级, 用户 特征。 更优选地, 网络设备 42还可包括第三获取装置 (未示出), 其还可 自行在互联网中搜寻新的词条组合, 并用以更新网络词库 424等。 Subsequently, the querying device 422 can query the network vocabulary 424 and the network advertisement library 424 according to the user characteristics to obtain matching input term options and advertisement information options. Specifically, the querying device 422 can determine the priority level according to the frequency of selection of the vocabulary in each term option or the term option in the user input history, and the semantic relevance between each vocabulary in each term option. The querying device 422 can also determine the priority according to the personal preference selected by the user. For example, when the user sets the input preference to: Priority: Shopping > Diet > Travel, after the user input sequence "woaiwaitan" is obtained, the query The device queries the network advertisement library 424 to obtain a plurality of landmark buildings or tourist attractions located on the Bund corresponding to "waitan", such as China Merchants Headquarters, HSBC Building, Citibank, Bund No. 3, Bund 18, etc., and then according to users. The set personal preference can be judged as the highest priority for shopping, dining and other architectural attractions such as "The Bund No. 3" and "The Bund 18". In addition, the querying device 422 can also determine the region in which the user is located according to the IP address of the current user equipment, so that the priority of the vocabulary related to the region in the input sequence can be determined, for example, but the user input sequence is "woxihuanbund", where The translation of "bund" has "1 Embankment 2 Terminal 3 Alliance 4 (Shanghai) Bund", when the inquiry device 412 is known based on the IP address of the user equipment, currently located in Shanghai, China, so that the "bund" corresponding translation in the "Shanghai Bund" or "The Bund" has the highest priority, so the following entry options are available: "I like to go to the Bund; 2 I like the Bund; 3 I like the pier; 4 I like the embankment; 5 I like the league." For the sake of clarity, we may refer to user input history, user setting personal preferences, computer IP address (or user address), etc. as user features, and those skilled in the art will appreciate that user features include, but are not limited to, the above. . Preferably, the network device 42 may also update information such as saved user input history, input preferences, and inter-vocabulary associations. As shown in FIG. 21, the user equipment 41 further includes a second obtaining device 415 and a third transmitting device 418. The network device 42 further includes a second receiving device 425 and an updating device 426. The second obtaining means 415 of the user equipment 41 acquires the selection of the plurality of input term options provided by the providing device 413 by the user through further interaction with the user, and is transmitted by the third transmitting device 418 to the network device. The update device 426 updates the thesaurus and the user input history, the association between the vocabularies, and the like according to the user selection received by the second receiving device 425. For example, a new term option and an existing term option may be added to the network thesaurus 424. Priority, user characteristics. More preferably, the network device 42 may further include a third obtaining means (not shown), which may also search for a new combination of terms on the Internet and update the network vocabulary 424 and the like.
在本实施方式另一优选实例中, 广告库 414,可以是位于网络设备 42以 外, 例如位于另一个网络设备处或分布于其他多个网络设备处, 网络设备 42可经由网络与所述其他网络设备相连接, 从而查询与用户输入序列相关 的广告信息选项。  In another preferred embodiment of the present embodiment, the advertisement library 414 may be located outside the network device 42, for example at another network device or distributed among other network devices, and the network device 42 may be connected to the other network via the network. The devices are connected to query the advertising information options associated with the user input sequence.
图 21示出根据本实施方式的另一个优选实例, 其中用户设备 41本身 也包括查询装置 412和用于保存本地词库的存储器 414(以下筒称本地词库 414 ), 本地词库 414并可随时或定期地与网络设备 42的网络词库中该用户 特定的用户词库进行同步。  Fig. 21 shows another preferred example according to the present embodiment, wherein the user device 41 itself also includes a query means 412 and a memory 414 (hereinafter referred to as a local thesaurus 414) for storing a local thesaurus, a local thesaurus 414 and The user-specific user vocabulary in the network vocabulary of the network device 42 is synchronized at any time or periodically.
如图 21所示, 第一获取装置 411在获取用户输入序列后, 可将所述用 户输入序列先提供给用户设备 41的查询装置 412进行匹配查询, 具体查询 过程如前面参照图 19~20所描述的内容, 该内容引用在此不作赘述; 第一 获取装置 411还可通过第三发送装置 418将用户输入序列发送至网络设备 42, 由其中的查询装置 422进行匹配查询, 获得一个或多个与用户输入序 列有关的输入词条选项和广告信息选项具体查询过程如前面参照图 20所描 述的内容,该内容引用在此不作赘述。用户设备 41还包括一个合并装置 420, 其将来自本身的查询装置 412提供的一个或多个输入词条选项和来自网络 设备 42的查询装置 422提供的一个或多个输入词条选项进行合并处理, 删 除其中的重复选项, 并根据一定规则来确定最终合并得到的多个词条选项 和来自网络设备 42反馈的与输入序列有关的广告信息选项的优先级顺序, 随后提供给提供装置 413 , 由其按相应的优先级顺序提供给用户。 通常, 网 络设备 42提供的输入词条选项应该更为准确, 因此优先级较本地查询获得 输入词条选项为高, 而同样地, 为不影响用户的文字输入, 广告信息选项 通常置于每行中较末尾选项位置。 As shown in FIG. 21, after acquiring the user input sequence, the first obtaining device 411 may first provide the user input sequence to the query device 412 of the user equipment 41 for matching query. The specific query process is as described above with reference to FIGS. 19-20. The content of the description is not described herein; the first obtaining device 411 can also send the user input sequence to the network device 42 through the third sending device 418, and the query device 422 performs a matching query to obtain one or more The input term option and the advertisement information option related to the user input sequence are specifically referred to as described above with reference to FIG. 20, and the content reference is not described herein. The user device 41 also includes a merging device 420 that will enter one or more input term options from the query device 412 itself and from the network. One or more input term options provided by the querying device 422 of the device 42 are merged, the repeating options therein are deleted, and a plurality of terminology options resulting from the final merge and the feedback and input from the network device 42 are determined according to certain rules. The prioritized order of the sequence-related advertising information options is then provided to the providing means 413, which provides them to the user in a corresponding priority order. In general, the input term option provided by the network device 42 should be more accurate, so the priority is higher than the local query to obtain the input term option, and similarly, the ad information option is usually placed on each line so as not to affect the user's text input. The option position at the end of the middle.
图 22为根据本实施方式一个方面的在用户设备中当用户进行文字输入 时同时提供与输入信息相关的搜索相关信息的方法流程图。 也即, 当用户 在用户设备 41上进行文字输入时, 用户设备 41在根据用户输入序列提供 相应输入词条选项, 还根据用户设备的输入序列搜索相关的搜索相关信息, 如广告信息、 网页信息、 旅游信息或地图信息。 以下以广告信息为例进行 说明:  Fig. 22 is a flowchart showing a method of simultaneously providing search related information related to input information when a user performs character input in a user equipment according to an aspect of the embodiment. That is, when the user performs text input on the user device 41, the user device 41 provides a corresponding input term option according to the user input sequence, and also searches for related search related information, such as advertisement information and webpage information, according to the input sequence of the user device. , travel information or map information. The following is an example of advertising information:
在步骤 S41中, 用户设备 41通过任何一种可与用户进行人机交互的交 互设备来实时地获取用户正在输入的输入序列。 该交互设备可以是键盘、 遥控器、 触摸板或声控设备等。 以键盘为例, 但用户敲击键盘中按键进行 输入时, 用户设备 41实时地获取用户敲击的按键序列 (为筒明起见, 以下 仍称输入序列 )。 词库 (以下筒称本地词库 )进行匹配查询, 获得一个或多个匹配的输入词 条选项。 以下以中文为例进行说明,本实施方式允许用户在按全拼、双拼、 五笔等方法输入中文。 同时, 用户设备还根据用户输入序列在本地保存的 关键词广告库 (以下筒称本地广告库) 中进行搜索, 获得相关的一个或多 个广告信息选项。 与例如, 当用户敲击按键输入 "woaiwaitan", 用户设备 41 在本地词库中查询获得 "1 我爱外滩; 2 我爱"等词条组合, 同时在广告库 414,中查询获得与"外滩"有关的广告信息有"外滩三号"、 "外滩 18 号"等地 标性建筑, 因此提供广告信息选项" 3 外滩三号; 4 外滩 18号"。 本领域技 术人员应理解, 查询与输入序列相关的广告信息 (或搜索相关信息) 的过 程可以采用目前公知的各种智能或模糊搜索算法, 在此不作赘述。 In step S41, the user device 41 acquires the input sequence that the user is input in real time through any interactive device that can perform human-computer interaction with the user. The interactive device can be a keyboard, a remote control, a touch pad or a voice control device, and the like. Taking the keyboard as an example, when the user taps a key in the keyboard to input, the user device 41 acquires the key sequence of the user's tap in real time (for the sake of clarity, the input sequence is still referred to below). The lexicon (hereinafter referred to as the local lexicon) performs a matching query to obtain one or more matching input terms. The following takes Chinese as an example for description. This embodiment allows the user to input Chinese in the manner of full spelling, double spelling, and five strokes. At the same time, the user equipment also searches in the locally stored keyword advertisement library (hereinafter referred to as a local advertisement library) according to the user input sequence, and obtains one or more related advertisement information options. For example, when the user taps the button to input "woaiwaitan", the user device 41 queries the local vocabulary to obtain a combination of terms such as "1 I love the Bund; 2 I love", and in the advertisement library 414, the query is obtained with "The Bund". "The relevant advertising information includes "Bund No. 3" and "Bund 18". The landmark building, therefore offers the advertising information option "3 Bund 3; 4 Bund 18". Those skilled in the art should understand that the process of querying the advertisement information (or searching for related information) related to the input sequence may adopt various intelligent or fuzzy search algorithms that are currently known, and will not be described herein.
在步骤 S43中, 用户设备 41将所获得的一个或多个匹配的输入词条选 项按一定顺序和格式提供给所述用户, 供其选择以作具体输入。 例如, 通 过在用户设备 41的显示器中一个输入窗口栏中显示给用户是, 可将多个词 条选项与输入序列分栏显示, 多个词条选项可全部列入下一栏中供用户选 择。 优选地, 可以在词条栏中仅显示一行词条选项, 该行词条选项数目可 以是缺省的也可由用户设定, 通过由用户按动特定功能键显示上一行或下 一行词条选项, 该特定功能键例如可以是" +"和" -"。  In step S43, the user device 41 provides the obtained one or more matching input term options to the user in a certain order and format for selection. For example, by displaying to the user in an input window column in the display of the user device 41, multiple entry options and the input sequence can be displayed in columns, and multiple entry options can be included in the next column for the user to select. . Preferably, only one line item option may be displayed in the term column, and the number of the line item option may be default or user-settable, and the previous line or the next line item option is displayed by the user pressing a specific function key. , the specific function keys can be, for example, "+" and "-".
优选地, 为便于用户注意, 广告信息选项在词条栏中可采用不同显示 方式, 例如不同颜色或灰度。 而且广告信息选项中内置有与该广告信息相 关的网页 IP地址或统一资源标示符( URL )。  Preferably, for the user's attention, the advertisement information option may use different display modes in the entry bar, such as different colors or gray scales. Also, the advertisement information option has a web page IP address or a uniform resource identifier (URL) associated with the advertisement information.
在步骤 S44中, 用户和用户设备 41可根据所提供的输入词条选项做进 一步人机交互。 用户可通过在用户设备 41的键盘上按该选项相应的数字键 或通过用户设备 41的鼠标移动光标至该选项处悬停或点击来选择该广告信 息选项。 而当用户选择该广告信息选项, 用户设备 41可通过网络定向到其 对应的网页网址, 例如在浏览器打开情形, 经由网络连接到该网址对应的 网页服务器, 并在浏览器中显示其网页给用户。  In step S44, the user and user device 41 can make further human-computer interaction according to the provided input term options. The user can select the advertisement information option by pressing the corresponding numeric key of the option on the keyboard of the user device 41 or by moving the cursor of the user device 41 to hover or click at the option. When the user selects the advertisement information option, the user equipment 41 can be directed to its corresponding webpage URL through the network, for example, in a browser open situation, connected to the web server corresponding to the webpage via the network, and the webpage is displayed in the browser. user.
优选地, 步骤 S41至 S43之间是是持续不断地循环。 具体地, 在步骤 本 地 进 行 查 询 , 例 如 , 用 户 持 续 地 输 入 "w" 、 "wo" ... "wo" ... "woai" ... "woaiwaitan" , 在步骤 S42中, 用户设备 41也实时 地对根据持续获取的用户输入序列进行匹配查询, 以持续获取与上述各输 入序列相对应的词条选项, 例如" w"对应" 1我、 2喔、 3握、 4窝"; "woai" 对应" 1我爱、 2喔、 3握、 4窝"; "woaiwaitan"对应" 1我爱外滩、 2外滩三 号、 3 外滩 18号,,。 在此, 本领域技术人员应理解"持续"是指在用户最终选 择一个词条选项前一直进行的动作方式, 例如用户在敲击按键序列" woai" 后可能稍停片刻, 如 0.5秒, 再继续敲击随后的按键。 Preferably, the steps S41 to S43 are continuously cycled. Specifically, the query is performed locally in the step. For example, the user continuously inputs "w", "wo" ... "wo" ... "woai" ... "woaiwaitan", and in step S42, the user device 41 also Matching queries based on the continuously acquired user input sequence in real time to continuously obtain the entry options corresponding to the above input sequences, for example, "w" corresponds to "1 me, 2喔, 3 grips, 4 nests";"woai"Correspondence" 1 I love, 2喔, 3 grips, 4 nests;; "woaiwaitan" corresponds to "1 I love the Bund, 2 Bund three No. 3, No. 18 on the Bund,. Here, those skilled in the art should understand that "continuous" refers to the action that is performed until the user finally selects an entry option. For example, the user may pause for a while after tapping the key sequence "woai", such as 0.5 second. Continue to tap the subsequent buttons.
优选地, 在步骤 S42中, 用户设备 41在根据用户输入序列在词库和广 告库中进行匹配查询获得多个输入词条选项和广告信息选项时还获得其各 自的优先级。 在步骤 S43中, 用户设备 41将查询获取的多个匹配的输入词 条选项和广告信息选项按优先级顺序在词条栏中显示给所述用户, 其中优 先级越高, 该输入词条选项或广告信息选项越靠前显示。 优选地, 为便于 用户进行文字输入, 优先级最好的输入词条选项一般置于最前位置, 使得 用户可通过筒单地按 "ENTER"或空格键来选择, 而广告信息选项通常置于 每行中较末尾选项位置。  Preferably, in step S42, the user equipment 41 also obtains its own priority when performing a matching query in the thesaurus and the advertisement library according to the user input sequence to obtain a plurality of input term options and advertisement information options. In step S43, the user equipment 41 displays the plurality of matched input term options and advertisement information options obtained by the query in the item column to the user in priority order, wherein the higher the priority, the input term option Or the higher the ad information option is displayed. Preferably, in order to facilitate the user to input text, the preferred input item option of priority is generally placed at the foremost position, so that the user can select by pressing "ENTER" or the space bar, and the advertisement information option is usually placed in each position. The position of the option at the end of the line.
优选地, 在步骤 S42中, 用户设备 41还可以根据用户特征来在词库和 广告库中进行查询, 获得匹配的输入词条选项和广告信息选项。 在获得匹 配的多个输入词条选项和广告信息选项后, 还可根据用户特征来确定其优 先级高低。用户特征包括用户的输入历史记录、用户设定的个人偏好选择、 用户属性、 用户地址等, 用户属性包括用户的职业、 性别、 国际、 出生地、 年龄等体现个人特性的信息。 具体地, 用户设备 1 可根据用户输入历史记 录中对各个词条选项或词条选项中的词汇的选择频度、 各词条选项中各个 词汇间的文义关联性来确定其优先级高低。 用户设备 41也可根据用户设定 的个人偏好选择来确定优先级高低, 例如, 当用户设定输入偏好为: 优先 级高低: 购物 >饮食 >旅游, 则获取用户输入序列 "woaiwaitan"后, 用户设备 41 在广告库中查询获得与" waitan"对应的多个位于外滩的地标性建筑或旅 游景点, 如招商局总部、 汇丰大厦、 花旗银行、 外滩三号、 外滩 18号等, 随后根据用户设定的个人偏好可判断"外滩三号"、 "外滩 18号,,等以购物、 餐饮为主的建筑景点的优先级最高。 另外, 在步骤 S42中, 用户设备 41还 可根据目前用户设备的 IP地址来判断其所处的地域, 从而可以确定输入序 列中与该地域相关的词汇的优先级, 例如, 但用户输入序列为Preferably, in step S42, the user equipment 41 may further perform a query in the thesaurus and the advertisement library according to the user characteristics, and obtain matching input term options and advertisement information options. After obtaining matching multiple input term options and advertising information options, the priority level can also be determined based on user characteristics. User characteristics include the user's input history, user-set personal preference selection, user attributes, user address, etc. User attributes include the user's occupation, gender, international, place of birth, age, etc., reflecting personal characteristics. Specifically, the user equipment 1 may determine the priority level according to the frequency of selection of the vocabulary in each term option or the term option in the user input history, and the semantic relevance between each vocabulary in each term option. The user equipment 41 can also determine the priority level according to the personal preference selection set by the user. For example, when the user sets the input preference as: Priority level: Shopping > Diet > Travel, after the user input sequence "woaiwaitan" is obtained, the user The device 41 queries the advertisement library to obtain a plurality of landmark buildings or tourist attractions located on the Bund corresponding to "waitan", such as China Merchants Headquarters, HSBC Building, Citibank, Bund No. 3, and Bund 18, etc. The personal preference may determine that the "Bund No. 3", "The Bund 18," and other architectural attractions that are mainly shopping and catering have the highest priority. In addition, in step S42, the user equipment 41 may also be based on the current user equipment. IP address to determine the region in which it is located, so that the input sequence can be determined The priority of the vocabulary associated with the territory in the column, for example, but the user input sequence is
"woxihuanbund", 其中" bund,,的译文有 "1堤岸 2码头 3同盟 4 (上海)夕卜 滩", 当用户设备 41根据用户设备 IP地址获知目前位于中国上海市, 从而 可确定 "bund"对应译文中"上海外滩,,或"外滩,,优先级最高, 因而可提供如下 输入词条选项 "1我喜欢上海外滩; 2我喜欢外滩; 3我喜欢码头; 4我喜欢 堤岸; 5我喜欢同盟"。 为筒明起见, 我们可将用户输入历史记录, 用户设 定个人偏好、 计算机 IP地址(或用户地址)等统称为用户特征, 且本领域 技术人员应能理解, 用户特征包括但不限于上述内容。 "woxihuanbund", where "bund," has "1 embankment 2 dock 3 alliance 4 (Shanghai) Xibtan", when user equipment 41 is known based on the user equipment IP address is currently located in Shanghai, China, thus determining "bund" Corresponding translations in "Shanghai Bund," or "The Bund," have the highest priority, so the following entry options are available: "I like to go to the Bund; 2 I like the Bund; 3 I like the pier; 4 I like the embankment; 5 I like it. Alliance." For the sake of clarity, we can refer to user input history, user setting personal preferences, computer IP address (or user address), etc. as user features, and those skilled in the art should understand that user features include but not Limited to the above.
本领域技术人员应能理解, 用户设备 41可以在本地存储器中保存上述 用户输入历史记录、用户设定输入偏好以及词汇间的各种关联性。优选地, 用户设备 41还可对所保存的用户输入历史记录、 输入偏好及词汇间关联性 等信息进行更新。 在步骤 S45 (未示出) 中, 用户设备 41还通过与用户的 进一步交互来获取该用户对所提供多个输入词条选项的选择, 然后根据所 获取的用户选择来更新词库和用户输入历史记录、 词汇间的关联性等, 例 如可在词库中增加新词条选项和已有词条选项的优先级, 用户特征。 更优 选地, 如用户设备可接入互联网, 在步骤 S45中, 用户设备 41还可自行在 互联网中搜寻新的词条组合, 并用以更新词库等。  Those skilled in the art will appreciate that the user device 41 can store the user input history, user set input preferences, and various associations between vocabulary in the local memory. Preferably, the user device 41 may also update information such as saved user input history, input preferences, and inter-vocabulary relevance. In step S45 (not shown), the user device 41 also obtains the user's selection of the provided plurality of input term options by further interaction with the user, and then updates the thesaurus and user input according to the acquired user selection. Historical records, vocabulary associations, etc., for example, can add new entry options and prioritization of existing term options in the thesaurus, user characteristics. More preferably, if the user equipment can access the Internet, in step S45, the user equipment 41 can also search for a new combination of terms on the Internet and use it to update the thesaurus and the like.
在一个优选实例中, 用户设备 41中的广告库可随时与周期性地进行主 动更新, 例如用户设备 41经由网络与一个或多个网络设备相连接, 并随时 在另一优选实例中, 广告库可以是位于用户设备 41的外部, 例如位于 一个网络设备处或分布于多个网络设备处, 在步骤 S42中, 用户设备 41可 经由网络与网络设备相连接, 从而查询与用户输入序列相关的广告信息选 项。  In a preferred embodiment, the advertisement library in the user device 41 can be actively updated at any time and periodically, for example, the user device 41 is connected to one or more network devices via a network, and in another preferred example, the advertisement library It may be located outside the user equipment 41, for example at a network device or distributed among a plurality of network devices. In step S42, the user device 41 may be connected to the network device via the network to query an advertisement related to the user input sequence. Information option.
图 23为根据本实施方式另一方面的用户设备与网络设备相配合来当用 户进行文字输入时同时提供与输入信息相关的搜索相关信息的方法流程图。 其中用户设备 41经由网络与网络设备 42相连接, 该网络可以为互联 网、 内部网等。 也即, 当用户在用户设备 41上进行文字输入时, 用户设备 41经由网络向网络设备 42发送查询请求, 请求网络设备 42根据用户输入 序列搜索相关的搜索相关信息, 如广告信息、 网页信息、 旅游信息或地图 信息, 然后将网络设备反馈的搜索相关信息与网络设备查询获得的输入词 条选项一起提供给用户。 以下以广告信息为例进行说明: 23 is a flowchart of a method in which a user equipment cooperates with a network device to provide search related information related to input information when a user performs character input according to another aspect of the embodiment. The user equipment 41 is connected to the network device 42 via a network, and the network may be the Internet, an intranet, or the like. That is, when the user performs text input on the user equipment 41, the user equipment 41 sends a query request to the network device 42 via the network, requesting the network device 42 to search for relevant search related information, such as advertisement information, webpage information, according to the user input sequence. The travel information or the map information is then provided to the user along with the search related information fed back by the network device together with the input term option obtained by the network device query. The following is an example of advertising information:
在一个实例中, 网络设备 42保存网络词库和关键词广告库(为筒明起 见, 以下筒称网络广告库或搜索相关信息库)。  In one example, network device 42 maintains a network vocabulary and a keyword advertisement library (for the sake of clarity, the following is referred to as a network advertisement library or a search related information library).
具体地, 如图 23所示, 在步骤 S41中, 用户设备 41通过任何一种可 该交互设备可以是键盘、 遥控器、 触摸板或声控设备等。 以键盘为例, 但 用户敲击键盘中按键进行输入时, 用户设备 1 实时地获取用户敲击的按键 序列 (为筒明起见, 以下仍称输入序列)。  Specifically, as shown in FIG. 23, in step S41, the user equipment 41 may be a keyboard, a remote controller, a touch pad or a voice control device or the like by any one of the interactive devices. Taking the keyboard as an example, when the user taps a key on the keyboard to input, the user device 1 acquires the key sequence of the user's tap in real time (for the sake of clarity, the input sequence is still referred to below).
在步骤 S42中, 用户设备 41实时并持续不断地将获取的用户输入序列 发送至网络设备 42。  In step S42, the user equipment 41 transmits the acquired user input sequence to the network device 42 in real time and continuously.
在步骤 S43中, 网络设备 42根据接收的用户输入序列在网络词库中进 行匹配查询, 获得一个或多个匹配的输入词条选项。 以下以中文为例进行 说明, 本发明允许用户在按全拼、 双拼、 五笔等方法输入中文。 同时, 网 或多个广告信息选项。 与例如, 当用户敲击按键输入 "woaiwaitan", 网络设 备 42在网络词库中查询获得 "1 我爱外滩; 2 我爱"等词条组合, 同时在网 络广告库中查询获得与"外滩"有关的广告信息有"外滩三号"、 "外滩 18号" 等地标性建筑, 因此提供广告信息选项" 3 外滩三号; 4 外滩 18号"。 本领 域技术人员应理解, 查询与输入序列相关的广告信息 (或搜索相关信息) 的过程可以采用目前公知的各种智能或模糊搜索算法, 在此不作赘述。  In step S43, the network device 42 performs a matching query in the network lexicon according to the received user input sequence to obtain one or more matching input term options. The following takes Chinese as an example for description. The present invention allows the user to input Chinese in the manner of full spelling, double spelling, and five strokes. At the same time, web or multiple ad information options. For example, when the user taps the button to input "woaiwaitan", the network device 42 queries the network vocabulary to obtain a combination of terms such as "1 I love the Bund; 2 I love", and inquires in the online advertisement library to obtain the "Bund". Relevant advertising information includes landmarks such as "Bund No. 3" and "Bund 18", so the advertisement information option "3 Bund No. 3; 4 Bund No. 18" is provided. Those skilled in the art will appreciate that the process of querying the advertisement information (or searching for related information) associated with the input sequence may employ various currently known intelligent or fuzzy search algorithms and will not be described herein.
在步骤 S45中, 网络设备 42也实时和持续不断地将所查询的输入词条 选项发送至用户设备 41。在步骤 S47中,用户设备 41将接收到的来自网络 设备 42实时和持续地所述输入词条选项并提供给用户, 用户设备 41可将 获得的一个或多个匹配的输入词条选项按一定顺序和格式提供给所述用户, 供其选择以作具体输入或进一步交互。 例如, 通过在显示器的一个输入窗 口栏中显示给用户是, 可将多个词条选项与输入序列分栏显示, 多个词条 选项可全部列入下一栏中供用户选择。 优选地, 可以在词条栏中仅显示一 行词条选项, 该行词条选项数目可以是缺省的也可由用户设定, 通过由用 户按动特定功能键显示上一行或下一行词条选项, 该特定功能键例如可以 是 "+,,和 "―"。 In step S45, the network device 42 also inputs the queried entry in real time and continuously. The option is sent to the user device 41. In step S47, the user equipment 41 receives the input term option from the network device 42 in real time and continuously and provides it to the user, and the user device 41 can press the obtained one or more matching input term options to be determined. The order and format are provided to the user for selection or specific interaction. For example, by displaying to the user in an input window column of the display, multiple entry options can be displayed in the input sequence column, and multiple entry options can be included in the next column for the user to select. Preferably, only one line item option may be displayed in the term column, and the number of the line item option may be default or user-settable, and the previous line or the next line item option is displayed by the user pressing a specific function key. , the specific function keys can be, for example, "+,, and "-".
优选地, 为便于用户注意, 广告信息选项在词条栏中可采用不同显示 方式, 例如不同颜色或灰度。 而且广告信息选项中内置有与该广告信息相 关的网页 IP地址或统一资源标示符( URL )。  Preferably, for the user's attention, the advertisement information option may use different display modes in the entry bar, such as different colors or gray scales. Also, the advertisement information option has a web page IP address or a uniform resource identifier (URL) associated with the advertisement information.
在步骤 S48中, 用户和用户设备 41可根据所提供的输入词条选项做进 一步人机交互。 用户可通过按该选项相应的数字键或通过鼠标移动光标至 该选项处悬停或点击来选择该广告信息选项。 而, 当用户选择该广告信息 选项, 用户设备 41可通过网络定向到其对应的网页网址, 例如在浏览器打 开情形, 经由网络连接到该网址对应的网页服务器, 并在浏览器中显示其 网页给用户。  In step S48, the user and user device 41 can perform further human-computer interaction according to the provided input term options. The user can select this ad information option by pressing the corresponding number key for that option or by hovering the mouse to hover or click on the option. However, when the user selects the advertisement information option, the user device 41 can be directed to its corresponding webpage URL through the network, for example, in a browser open situation, connected to the web server corresponding to the webpage via the network, and displaying the webpage in the browser. To the user.
优选地, 步骤 S41至 S47之间是持续不断地循环工作。 具体地, 在步  Preferably, the steps S41 to S47 are continuously cycled. Specifically, at step
络设备 42, 例如" w"、 "wo" ... "wo" ... "woai" ... "woaiwaitan" , 网络设备 42 也根据用户输入序列后实时和持续不断进行匹配查询, 并将查询到的输入 词条序列持续不断地发送回用户设备 41 , 例如" w"对应" 1我、 2喔、 3握、 4窝"; "woai"对应" 1我爱、 2喔、 3握、 4窝"; "woaiwaitan"对应" 1我爱外 滩、 2外滩三号、 3 外滩 18号"。 在此, 本领域技术人员应理解"持续"是 指在用户最终选择一个词条选项前一直进行的动作方式, 例如用户在敲击 按键序列" woai"后可能稍停片刻, 如 0.5秒, 再继续敲击随后的按键。 Network device 42, such as "w", "wo" ... "wo" ... "woai" ... "woaiwaitan", network device 42 also performs matching queries in real time and continuously according to the user input sequence, and The sequence of the entered input vocabulary is continuously sent back to the user device 41, for example, "w" corresponds to "1 me, 2 喔, 3 grips, 4 nests";"woai" corresponds to "1 I love, 2 喔, 3 grips, 4 nests; "woaiwaitan" corresponds to "1 I love the Bund, 2 Bund 3, 3 Bund 18". Here, those skilled in the art should understand that "continuation" refers to a manner of action that is always performed before the user finally selects an entry option, such as a user tapping. After the button sequence "woai", it may pause for a while, such as 0.5 seconds, and then continue to tap the subsequent buttons.
优选地, 在步骤 S43 , 网络设备 42在根据用户输入序列在网络词库和 网络广告库中进行匹配查询获得多个输入词条选项和广告信息选项时还获 得其各自的优先级。 在步骤 S47中, 用户设备 41将网络设备 42提供的多 个匹配的输入词条选项和广告信息选项按优先级顺序在词条栏中显示给所 述用户, 其中优先级越高, 该输入词条选项或广告信息选项越靠前显示。 优选地, 为便于用户进行文字输入, 优先级最好的输入词条选项一般置于 最前位置, 使得用户可通过筒单地按' 'ENTER"或空格键来选择, 而广告信 息选项通常置于每行中较末尾选项位置。  Preferably, in step S43, the network device 42 also obtains its respective priority when performing a matching query in the network thesaurus and the network advertisement library according to the user input sequence to obtain a plurality of input term options and advertisement information options. In step S47, the user equipment 41 displays the plurality of matched input term options and advertisement information options provided by the network device 42 in the entry column to the user in a priority order, wherein the higher the priority, the input word The top option or ad information option is displayed. Preferably, in order to facilitate the user to input text, the preferred input entry option is generally placed at the foremost position, so that the user can select by pressing ' 'ENTER' or the space bar, and the advertisement information option is usually placed. The last option position in each line.
优选地, 当用户通过用户设备 41登录网络设备时, 在步骤 S43中, 网 络设备 42还可根据用户登录的 ID来获取用户特征。 例如用户输入历史记 录, 用户特定的用户词库、 用户设定的个人偏好、 用户属性信息等。 所述 用户特征可以保存在网络设备 42中, 也可保存在于网络设备 42相连接的 其他网络设备中。  Preferably, when the user logs in to the network device through the user equipment 41, in step S43, the network device 42 can also acquire the user feature according to the ID of the user login. For example, the user inputs a history record, a user-specific user vocabulary, a user-set personal preference, user attribute information, and the like. The user features may be stored in the network device 42 or in other network devices to which the network device 42 is connected.
随后, 在步骤 S43中, 网络设备 42可以根据用户特征来在网络词库和 网络广告库中进行查询, 获得匹配的输入词条选项和广告信息选项。 具体 地, 在步骤 S43中, 网络设备 42可根据用户输入历史记录中对各个词条选 项或词条选项中的词汇的选择频度、 各词条选项中各个词汇间的文义关联 性来确定其优先级高低。 网络设备 42也可根据用户设定的个人偏好选择来 确定优先级高低, 例如, 当用户设定输入偏好为: 优先级高低: 购物 >饮食 > 旅游, 则获取用户输入序列 "woaiwaitan"后, 网络设备 42在网络广告库中 查询获得与" waitan"对应的多个位于外滩的地标性建筑或旅游景点, 如招商 局总部、 汇丰大厦、 花旗银行、 外滩三号、 外滩 18号等, 随后根据用户设 定的个人偏好可判断 "外滩三号"、 "外滩 18号,,等以购物、 餐饮为主的建筑 景点的优先级最高。 另外, 在步骤 S43中, 网络设备 42还可根据目前用户 设备的 IP地址来判断其所处的地域, 从而可以确定输入序列中与该地域相 关的词汇的优先级, 例如, 但用户输入序列为 " woxihuanbund" , 其中 "bund" 的译文有" 1堤岸 2码头 3同盟 4 (上海)外滩", 在步骤 S43中, 当网络 设备 42根据用户设备 IP地址获知目前位于中国上海市,从而可确定" bund" 对应译文中"上海外滩"或"外滩"优先级最高,因而可提供如下输入词条选项 "1我喜欢上海外滩; 2我喜欢外滩; 3我喜欢码头; 4我喜欢堤岸; 5我喜 欢同盟"。 为筒明起见, 我们可将用户输入历史记录, 用户设定个人偏好、 计算机 IP地址(或用户地址)等统称为用户特征, 且本领域技术人员应能 理解, 用户特征包括但不限于上述内容。 Subsequently, in step S43, the network device 42 may perform a query in the network thesaurus and the network advertisement library according to the user characteristics to obtain matching input term options and advertisement information options. Specifically, in step S43, the network device 42 may determine, according to the frequency of selection of the vocabulary in each term option or the term option in the user input history record, and the semantic relevance between each vocabulary in each term option. Its priority is high. The network device 42 can also determine the priority according to the personal preference selected by the user. For example, when the user sets the input preference to: Priority: Shopping > Diet > Travel, after obtaining the user input sequence "woaiwaitan", the network The device 42 queries the network advertisement library to obtain a plurality of landmark buildings or tourist attractions located on the Bund corresponding to "waitan", such as China Merchants Headquarters, HSBC Building, Citibank, Bund No. 3, Bund 18, etc., and then according to users. The set personal preference can judge that the "Bund No. 3", "The Bund 18," and other architectural attractions that are mainly shopping and dining have the highest priority. In addition, in step S43, the network device 42 can also be based on the current user equipment. IP address to determine the region in which it is located, so that it can be determined that the input sequence is related to the region The priority of the closed vocabulary, for example, but the user input sequence is "woxihuanbund", where the translation of "bund" has "1 embankment 2 dock 3 alliance 4 (Shanghai) Bund", in step S43, when the network device 42 is based on the user The IP address of the device is currently located in Shanghai, China, so that it can be determined that the "Bund" corresponds to the highest level of "Shanghai Bund" or "The Bund". Therefore, the following entry options are available: "I like to go to the Bund; 2 I like the Bund." 3 I like the pier; 4 I like the embankment; 5 I like the league." For the sake of clarity, we may refer to user input history, user setting personal preferences, computer IP address (or user address), etc. as user features, and those skilled in the art will appreciate that user features include, but are not limited to, the above. .
优选地, 网络设备 42还可对所保存的用户输入历史记录、 输入偏好及 词汇间关联性等信息进行更新。 在步骤 S49 (未示出) 中, 用户设备 41通 过与用户的进一步交互来获取该用户对所提供多个输入词条选项的选择, 并发送至网络设备; 在步骤 S410 (未示出)中, 网络设备 42根据所接收的 用户选择来更新词库和用户输入历史记录、 词汇间的关联性等, 例如可在 网络词库中增加新词条选项和已有词条选项的优先级, 用户特征。 更优选 地, 网络设备 42还可自行在互联网中搜寻新的词条组合, 并用以更新网络 词库等。  Preferably, network device 42 may also update information such as saved user input history, input preferences, and inter-vocabulary associations. In step S49 (not shown), the user device 41 acquires the user's selection of the provided plurality of input term options by further interaction with the user and transmits to the network device; in step S410 (not shown) The network device 42 updates the thesaurus and the user input history, the association between the vocabularies, and the like according to the received user selection. For example, the new term option and the existing term option may be added to the network thesaurus. feature. More preferably, the network device 42 can also search for new combinations of terms on the Internet and update the network vocabulary and the like.
优选地, 网络广告库可以是位于网络设备 42以外, 例如位于另一个网 络设备处或分布于其他多个网络设备处, 网络设备 42可经由网络与所述其 他网络设备相连接, 从而查询与用户输入序列相关的广告信息选项。  Preferably, the network advertisement library may be located outside the network device 42, for example at another network device or distributed at other multiple network devices, and the network device 42 may be connected to the other network device via the network, thereby querying and querying the user. Enter the ad information options associated with the sequence.
图 24示出根据本实施方式的另一个优选实例, 其中用户设备 41本身 也保存有本地词库, 并可随时或定期地与网络设备 42的网络词库中该用户 特定的用户词库进行同步。  Fig. 24 shows another preferred example according to the present embodiment, in which the user equipment 41 itself also holds a local vocabulary, and can synchronize with the user-specific user vocabulary in the network vocabulary of the network device 42 at any time or periodically. .
如图 24所示, 在步骤 S44中, 用户设备 41根据所述用户输入序列在 本地词库中进行匹配查询, 具体查询过程如前面参照图 44 所描述的步骤 S42的内容, 该内容引用在此不作赘述。 步骤 S41至 S43如前面参照图 23 所描述的步骤 S41~S43的内容, 该内容引用在此不作赘述。 本领域技术人 员应能理解, 步骤 S41至 S43和步骤 S44可以是同步进行, 其完成时间主 要取决于用户设备 41和网络设备 42的处理速度以及用户设备 41与网络设 备 42之间的网络传输延时。 在步骤 S46中, 用户设备 41将本地查询到的 一个或多个输入词条选项和来自网络设备 42的一个或多个输入词条选项进 行合并处理, 删除其中的重复选项, 并根据一定规则来确定最终合并得到 的多个词条选项和来自网络设备 42反馈的与输入序列有关的广告信息选项 的优先级顺序, 随后, 在步骤 S48 中, 将所述输入词条选项和广告信息选 项按相应的优先级顺序提供给用户,供其选择或作进一步人机交互。通常, 网络设备 42提供的输入词条选项应该更为准确, 因此优先级较本地查询获 得输入词条选项为高, 而同样地, 为不影响用户的文字输入, 广告信息选 项通常置于每行中较末尾选项位置。 As shown in FIG. 24, in step S44, the user equipment 41 performs a matching query in the local vocabulary according to the user input sequence. The specific query process is as described in the foregoing step S42 described with reference to FIG. 44, and the content is referenced here. Do not repeat them. Steps S41 to S43 are the contents of steps S41 to S43 described above with reference to FIG. 23, and the content references are not described herein. Technical person It should be understood that steps S41 to S43 and step S44 may be performed synchronously, the completion time of which mainly depends on the processing speed of the user equipment 41 and the network device 42 and the network transmission delay between the user equipment 41 and the network equipment 42. In step S46, the user equipment 41 merges one or more input term options that are locally queried with one or more input term options from the network device 42, deletes the repeating options therein, and according to certain rules. Determining a priority order of the plurality of term options obtained by the final merge and the advertisement information options related to the input sequence fed back from the network device 42, and then, in step S48, pressing the input term option and the advertisement information option accordingly The priority order is provided to the user for selection or further human interaction. In general, the input term option provided by the network device 42 should be more accurate, so the priority is higher than the local query to obtain the input term option, and similarly, the ad information option is usually placed on each line so as not to affect the user's text input. The option position at the end of the middle.
本领域技术人员应理解, 在以上参照图 20、 21、 23和 24所描述的实 施方式中, 当用户经由用户设备 41登录网络设备 42时, 网络设备 42中的 网络词库也可为该用户特定的用户词库。  It should be understood by those skilled in the art that in the embodiments described above with reference to FIGS. 20, 21, 23 and 24, when the user logs in to the network device 42 via the user device 41, the network vocabulary in the network device 42 may also be the user. A specific user vocabulary.
上面以中文为例来描述本发明的实施方式, 本领域技术人员应能理解 本发明还可适用于另一种文字进行输入的情形,例如韩文、 日文、 法文、 德 文或意大利文等, 需要更改和调整的仅仅是将中文输入规则替换为所述另 一种文字的输入规则, 以及更换相应的词库和用户设定输入偏好等。  The embodiments of the present invention are described above by taking Chinese as an example, and those skilled in the art should understand that the present invention is also applicable to the case where another text is input, such as Korean, Japanese, French, German, or Italian, etc. The only changes and adjustments are to replace the Chinese input rules with the input rules of the other text, and to replace the corresponding thesaurus and user-set input preferences.
对于本领域技术人员而言, 显然本发明不限于上述示范性实施例的细 节, 而且在不背离本发明的精神或基本特征的情况下, 能够以其他的具体 形式实现本发明。 因此, 无论从哪一点来看, 均应将实施例看作是示范性 的, 而且是非限制性的, 本发明的范围由所附权利要求而不是上述说明限 定, 因此旨在将落在权利要求的等同要件的含义和范围内的所有变化嚢括 在本发明内。 不应将权利要求中的任何附图标记视为限制所涉及的权利要 求。 此外, 显然"包括"一词不排除其他单元或步骤, 单数不排除复数。 系统 权利要求中陈述的多个单元或装置也可以由一个单元或装置通过软件或者 硬件来实现。 第一, 第二等词语用来表示名称, 而并不表示任何特定的顺 序。 It is obvious to those skilled in the art that the present invention is not limited to the details of the above-described exemplary embodiments, and the present invention can be embodied in other specific forms without departing from the spirit or essential characteristics of the invention. Therefore, the present embodiments are to be considered as illustrative and not restrictive, and the scope of the invention is defined by the appended claims All changes in the meaning and scope of equivalent elements are included in the present invention. Any reference signs in the claims should not be construed as limiting the claim. In addition, it is obvious that the word "comprising" does not exclude other elements or steps, and the singular does not exclude the plural. The plurality of units or devices recited in the system claims may also be by a unit or device via software or Hardware to achieve. The first, second, etc. words are used to denote names, and do not denote any particular order.

Claims

权利 要求 Rights request
1. 一种用于供用户进行文字输入的方法, 其特征在于, 包括下述步骤:A method for a user to input text, characterized in that it comprises the following steps:
51、 检测用户设备上的输入序列; 51. Detect an input sequence on the user equipment;
52、 将所述输入序列发送至网络服务器;  52. Send the input sequence to a network server.
S3、 接收来自所述网络服务器返回的与所述输入序列匹配的至少一个 网络词条选项;  S3. Receive at least one network entry option returned from the network server that matches the input sequence.
S4、 展现所述网络词条选项。  S4. Display the network term option.
2. 根据权利要求 1所述的方法,其特征在于,在步骤 S2之前或之后, 还包括步骤:  2. The method according to claim 1, wherein before or after step S2, the method further comprises the steps of:
将所述输入序列在用户设备的本地语料库中进行匹配查询, 以获得一 个或多个匹配的本地词条选项。  The input sequence is matched into the local corpus of the user device to obtain one or more matching local term options.
3. 根据权利要求 2所述的方法, 其特征在于, 其中, 在获得所述本地 词条选项之后, 在接收到所述网络词条选项之前, 将所获得的本地词条选 项按照其优先级显示给用户以供其选择;  The method according to claim 2, wherein, after obtaining the local term option, the obtained local term option is prioritized before receiving the network term option Displayed to the user for their choice;
在接收到所述网络词条选项之后, 将所述网络词条选项中剔除已经被 选定的词条和 /或与本地词条选项重复的词条, 然后将剩余的网络词条选项 按照该词条的优先级插入到当前和后续显示的本地词条选项中。  After receiving the network term option, the network term option is excluded from the selected term and/or the term that is repeated with the local term option, and then the remaining network term options are followed. The priority of the entry is inserted into the local entry options for the current and subsequent displays.
4. 根据权利要求 2所述的方法, 其特征在于, 其中, 在获得所述本地 词条选项之后, 在接收到所述网络词条选项之前, 将所获得的本地词条选 项按照其优先级显示给用户以供其选择;  The method according to claim 2, wherein, after obtaining the local term option, the obtained local term option is prioritized before receiving the network term option Displayed to the user for their choice;
在接收到所述网络词条选项之后, 在当前和后续的词条显示中, 按照 —网络词条选项的优先级显示  After receiving the network term option, in the current and subsequent entry display, according to the priority of the network term option
词条选项外的其他网络词条选项。 Other network entry options other than the entry option.
5. 根据权利要求 2所述的方法, 其特征在于, 所述网络服务器包括: 分别对应于每个用户的用户网络语料库; 以及 公共网络语料库。 The method according to claim 2, wherein the network server comprises: a user network corpus corresponding to each user respectively; Public network corpus.
6. 根据权利要求 5所述的方法, 其特征在于, 基于用户对本地词条或 网络词条的选择更新所述本地语料库, 并且根据所述本地语料库更新所述 用户网络语料库。  6. The method of claim 5, wherein the local corpus is updated based on a user selection of a local term or a network term, and the user network corpus is updated based on the local corpus.
7. 根据权利要求 5所述的方法, 其特征在于, 当用户通过所述用户设 备登录所述网络服务器时, 在接收到来自所述用户设备的语料库同步指令 之后, 将所述用户网络语料库与所述本地语料库进行同步。  The method according to claim 5, wherein when the user logs in to the network server through the user equipment, after receiving the corpus synchronization instruction from the user equipment, the user network corpus is The local corpus is synchronized.
8. 根据权利要求 5所述的方法, 其特征在于, 如果用户没有登录所述 网络服务器, 则在接收到用户在所述用户设备上的输入序列时仅仅在所述 公共网络语料库中检索匹配的网络词条选项;  8. The method according to claim 5, wherein if the user does not log in to the network server, only the matching in the public network corpus is retrieved when receiving the input sequence of the user on the user equipment Network entry option;
如果用户已经登录网络服务器, 并且该用户的所述用户网络语料库已 经与该用户设备上的本地语料库同步, 则在接收到用户在用户设备上的输 入序列时仅仅在所述公共网络语料库中检索匹配的网络词条选项; 以及 如果用户已经登录网络服务器, 但是该用户的所述用户网络语料库没 有与该用户设备上的本地语料库同步, 则在接收到用户在用户设备上的输 入序列时在所述公共网络语料库和该用户的用户网络语料库中检索匹配的 网络词条选项。  If the user has logged into the web server and the user's web corpus of the user has been synchronized with the local corpus on the user device, then only the matching in the public network corpus is retrieved when the user's input sequence on the user device is received Network term option; and if the user has logged into the web server, but the user's web corpus of the user is not synchronized with the local corpus on the user device, then upon receiving the input sequence of the user on the user device A matching network term option is retrieved from the public network corpus and the user's user network corpus.
9. 根据权利要求 1至 8中任意一项所述的方法, 其特征在于, 反馈回 所述用户设备网络词条选项的根据所述网络词条先前是否被选择过、 先前 被选择的时间先后、 先前被选择的次数、 用户预置的输入偏好选项和 /或词 条在网络上的被搜索的次数来确定所匹配的所述网络词条选项中各个网络 词条的优先级。  The method according to any one of claims 1 to 8, wherein the feedback to the user equipment network term option is based on whether the network term has been previously selected or previously selected. The number of times previously selected, the user-preset input preference options, and/or the number of times the term is searched on the network determines the priority of each of the network term options in the matched network entry.
10. 根据权利要求 1所述的方法, 其特征在于, 所述 S3步骤具体为: 接收来自所述网络服务器返回的基于所述输入序列在与用户对应的群 组语料库中进行匹配查询所获得的至少一个网络词条选项。  The method according to claim 1, wherein the step S3 is specifically: receiving, by the network server, a matching query obtained by performing a matching query in a group corpus corresponding to a user based on the input sequence At least one web entry option.
11.根据权利要求 10所述的方法,其特征在于,所述方法还包括步骤: 在进行文字输入之前将所述用户注册到一个或多个用户群组, 所述用户群 组与所述群组语料库相关联。 The method according to claim 10, wherein the method further comprises the steps of: The user is registered to one or more user groups prior to text entry, the user group being associated with the group corpus.
12.根据权利要求 10所述的方法,其特征在于,所述方法还包括步骤: 所述用户向所述网络服务器发送用户身份信息, 以便所述网络服务器确定 与之关联的用户群组, 进而确定与之关联的群组语料库。  The method according to claim 10, wherein the method further comprises the step of: the user sending user identity information to the network server, so that the network server determines a user group associated with the network server, and further Determine the group corpus associated with it.
13. 根据权利要求 10所述的方法, 其特征在于, 所述用户将自己选定 的词条发送到所述网络服务器, 以便所述网络服务器更新至与所述用户所 属用户群组相关联的群组语料库。  13. The method according to claim 10, wherein the user sends the selected item to the network server, so that the network server is updated to be associated with the user group to which the user belongs. Group corpus.
14. 根据权利要求 10所述的方法, 其特征在于, 所述群组语料库中的 群组词条具有一个或多个属性, 所述属性包括群组标识、 优先级别、 被选 次数、 最近被选择时间、 来源用户和 /或目标用户。  The method according to claim 10, wherein the group term in the group corpus has one or more attributes, the attribute includes a group identifier, a priority level, a selected number of times, and a recent Select time, source user, and/or target user.
15. 根据权利要求 14所述的方法, 其特征在于, 所述用户所获得网络 词条选项范围和顺序基于所述群组词条的属性。  15. The method according to claim 14, wherein the range and order of network term options obtained by the user are based on attributes of the group entry.
16. 根据权利要求 14所述的方法, 其特征在于, 用户在所述用户群组 中具有不同优先级, 所述网络服务器根据某个网络词条来源用户的优先级、 该网络词条的当前优先级、 最近被选择时间和 /或被选择次数向该网络词条 分配不同的优先级或更新该网络词条的优先级。  The method according to claim 14, wherein the user has different priorities in the user group, and the network server according to a network entry source user priority, the network entry current The priority, the most recently selected time, and/or the selected number of times assign different priorities to the network term or update the priority of the network term.
17. 根据权利要求 10所述的方法, 其特征在于, 通过对与所述用户有 关的材料进行学习或训练来对所述群组语料库进行初始化。  17. The method of claim 10, wherein the group corpus is initialized by learning or training materials related to the user.
18. 根据权利要求 1 所述的方法, 其特征在于, 所述方法供所述用户 在基于 WEB的应用程序中进行文字输入。  18. The method according to claim 1, wherein the method is for the user to perform text input in a WEB-based application.
19. 根据权利要求 1 所述的方法, 其特征在于, 所述方法供所述用户 在基于 WEB的浏览器程序中进行文字输入。  19. The method according to claim 1, wherein the method is for the user to perform text input in a web-based browser program.
20. 根据权利要求 1所述的方法, 其特征在于, 在所述 S3步骤之前还 包括:  The method according to claim 1, wherein before the step S3, the method further comprises:
接收所述用户设备发来的所述用户的身份信息; 根据所述用户的身份信息, 调取所述用户的输入历史记录; 所述 S3步骤还包括: Receiving identity information of the user sent by the user equipment; Determining an input history record of the user according to the identity information of the user; the step S3 further includes:
531. 基于所述输入序列在所述网络服务器中进行匹配查询, 得到初步 查询结果;  531. Perform a matching query in the network server based on the input sequence, and obtain a preliminary query result;
532. 将所述输入历史记录与所述初步查询结果进行对比, 将所述初步 查询结果中的同样包含在所述输入历史记录中的内容排列在优先于其它内 容的位置, 以生成所述网络词条选项。  532. Comparing the input history record with the preliminary query result, and arranging the content also included in the input history record in the preliminary query result in a position prior to other content to generate the network. Entry options.
21. 根据权利要求 1所述的方法, 其特征在于, 在所述 S4步骤后, 还 包括:  The method according to claim 1, wherein after the step S4, the method further comprises:
55. 接收来自所述用户设备的指示信息,其用于表示所述用户在所述网 络词条选项中选择的输入项;  55. Receive indication information from the user equipment, which is used to indicate an input item selected by the user in the network entry option;
56. 根据所述指示信息所表示的输入项,执行以下各项操作中的至少一 项:  56. Perform at least one of the following operations according to the input indicated by the indication information:
更新所述用户的输入历史记录;  Updating the input history of the user;
对所述网络服务器所存储的词典数据库进行训练和更新。  Training and updating the dictionary database stored by the web server.
22. 根据权利要求 1 所述的方法, 其特征在于, 所述网络词条选项包 括至少一个搜索相关信息选项。  22. The method of claim 1, wherein the network term option comprises at least one search related information option.
23.根据权利要求 1所述的方法,其特征在于,所述 S3步骤具体包括: 获取该用户的用户特征信息;  The method according to claim 1, wherein the step S3 comprises: acquiring user characteristic information of the user;
根据所述用户特征信息确定在所述网络服务器中所述用户特定的词库; 根据所述输入序列信息, 并基于所述用户特征信息在所述用户特定的 词库中进行匹配查询和在所述网络服务器的搜索信息库进行匹配查询, 以 获得一个或多与所述输入序列匹配的网络词条选项, 所述网络词条选项包 括至少一个搜索相关信息选项。  Determining, according to the user characteristic information, the user-specific vocabulary in the network server; performing matching query and in the user-specific vocabulary based on the input sequence information according to the user characteristic information; The search information base of the web server performs a matching query to obtain one or more network term options matching the input sequence, the network term option including at least one search related information option.
24. 根据权利要求 23所述的方法, 其特征在于, 所述用户特征信息包 括以下各项中的至少一项: 用户的历史输入记录; 24. The method according to claim 23, wherein the user characteristic information comprises at least one of the following: User's history input record;
用户设定的个人偏好;  User-set personal preferences;
用户的属性信息;  User attribute information;
用户的地址信息。  User's address information.
25. 根据权利要求 24所述的方法, 其特征在于, 所述方法还包括: 接收来自所述用户设备的该用户对所述一个或多个网络词条选项的选 择;  25. The method of claim 24, wherein the method further comprises: receiving, by the user of the user equipment, selection of the one or more network entry options;
根据该用户对所述一个或多个网络词条选项的选择来更新所述网络服 务器的所述用户特定的词库和 /或所述搜索相关信息库和 /或所述用户的历 史输入记录。  The user-specific vocabulary of the network server and/or the search-related information base and/or the user's historical input record are updated based on the user's selection of the one or more network entry options.
26. 一种用于供用户进行文字输入的用户设备, 其特征在于, 所述用 户设备包括:  26. A user equipment for text input by a user, wherein the user equipment comprises:
用于检测输入序列的装置;  Means for detecting an input sequence;
用于将所述输入序列发送至网络服务器的装置;  Means for transmitting the input sequence to a web server;
用于接收来自所述网络服务器返回的与所述输入序列匹配的至少一个 网络词条选项的装置;  Means for receiving at least one network term option returned from the web server that matches the input sequence;
用于展现所述网络词条选项的装置。  Means for presenting the network entry options.
27. 根据权利要求 26所述的用户设备, 其特征在于, 所述用户设备还 包括:  The user equipment according to claim 26, wherein the user equipment further comprises:
用于存储本地语料库的装置;  a device for storing a local corpus;
用于将所述输入序列在用户设备的本地语料库中进行匹配查询, 以获 得一个或多个匹配的本地词条选项的装置。  Means for performing a matching query in the local corpus of the user device to obtain one or more matching local term options.
28. 根据权利要求 27 所述的用户设备, 其特征在于, 所述"用于展现 所述网络词条选项的装置"还可用于展现所述本地词条。  28. The user equipment of claim 27, wherein the "means for presenting the network term option" is further operable to present the local term.
29. 根据权利要求 28所述的用户设备, 其特征在于, 所述用户设备还 包括: 用于在获得所述本地词条选项之后, 在接收到所述网络词条选项之前, 将所获得的本地词条选项按照其优先级显示给用户以供其选择的装置; 用于在接收到所述网络词条选项之后, 将所述网络词条选项中剔除已 经被选定的词条和 /或与本地词条选项重复的词条, 然后将剩余的网络词条 选项按照该词条的优先级插入到当前和后续显示的本地词条选项中的装置The user equipment according to claim 28, wherein the user equipment further comprises: Means for displaying, after receiving the local entry option, the obtained local entry option to the user for selection according to its priority before receiving the network entry option; for receiving After the network term option, the network term option is excluded from the selected term and/or the term that is repeated with the local term option, and then the remaining network term options are followed by the term. A device whose priority is inserted into the current and subsequent displayed local entry options
30. 根据权利要求 28所述的用户设备, 其特征在于, 所述用户设备还 包括: The user equipment according to claim 28, wherein the user equipment further comprises:
用于在获得所述本地词条选项之后, 在接收到所述网络词条选项之前, 将所获得的本地词条选项按照其优先级显示给用户以供其选择的装置; 用于在接收到所述网络词条选项之后, 在后续的词条显示中, 按照所 条选项外的其他网络词条选项的装置。  Means for displaying, after receiving the local entry option, the obtained local entry option to the user for selection according to its priority before receiving the network entry option; for receiving After the network term option, in the subsequent entry display, the device is selected according to other network term options other than the selected option.
31. 根据权利要求 26所述的用户设备, 其特征在于, 所述网络服务器 包括:  The user equipment according to claim 26, wherein the network server comprises:
分别对应于每个用户的用户网络语料库; 以及  a user network corpus corresponding to each user;
公共网络语料库。  Public network corpus.
32. 根据权利要求 31所述的用户设备, 其特征在于, 所述用户设备还 包括: 语料库, 并且根据所述本地语料库更新所述用户网络语料库。  The user equipment according to claim 31, wherein the user equipment further comprises: a corpus, and updating the user network corpus according to the local corpus.
33. 根据权利要求 31所述的用户设备, 其特征在于, 所述用户设备还 包括:  The user equipment according to claim 31, wherein the user equipment further comprises:
本地同步装置, 用于当用户设备连接并登录到网络服务器时, 在接收 到来自所述用户设备的语料库同步指令之后, 将所述用户网络语料库与所 述本地语料库进行同步。  The local synchronization device is configured to synchronize the user network corpus with the local corpus after receiving the corpus synchronization instruction from the user equipment when the user equipment connects and logs in to the network server.
34.根据权利要求 26至 33中任意一项所述的用户设备,其特征在于, 所述用户设备还包括用于根据词条先前是否被选择过、 词条先前被选择的 时间先后、 词条先前被选择的次数、 用户预置的输入偏好选项和 /或词条在 网络上的被搜索的次数来确定所匹配的词条选项中各个词条的优先级的装 置。 The user equipment according to any one of claims 26 to 33, characterized in that The user equipment also includes a time sequence for depending on whether the entry was previously selected, a time when the entry was previously selected, a number of times the entry was previously selected, a user-preset input preference option, and/or a term on the network. The number of times the number of searches is to determine the priority of each of the matching term options.
35. 根据权利要求 26所述的用户设备, 其特征在于, 所述网络词条为 基于所述输入序列在与用户对应的群组语料库中进行匹配查询所获得的网 络群组词条。  The user equipment according to claim 26, wherein the network term is a network group entry obtained by performing a matching query in a group corpus corresponding to the user based on the input sequence.
36. 根据权利要求 35所述的用户设备, 其特征在于, 所述用户设备还 包括:  The user equipment according to claim 35, wherein the user equipment further comprises:
用户注册装置, 用于在进行文字输入之前将所述用户注册到一个或多 个用户群组, 所述用户群组与所述群组语料库相关联。  User registration means for registering the user to one or more user groups prior to text entry, the user group being associated with the group corpus.
37. 根据权利要求 35所述的用户设备, 其特征在于, 所述用户设备还 包括:  The user equipment according to claim 35, wherein the user equipment further comprises:
身份信息发送装置, 用于向所述网络服务器发送用户身份信息, 用于 由所述网络服务器确定与之关联的用户群组, 进而确定与之关联的群组语 料库。  The identity information sending device is configured to send user identity information to the network server, for determining, by the network server, a group of users associated with the network server, and further determining a group corpus associated with the network corpus.
38. 根据权利要求 35所述的用户设备, 其特征在于, 所述用户设备还 包括:  The user equipment according to claim 35, wherein the user equipment further comprises:
词条发送装置, 用于将所述用户选定的词条发送到所述网络服务器, 以便所述网络服务器更新至与所述用户所属用户群组相关联的群组语料库。  The term sending means is configured to send the selected item of the user to the network server, so that the network server updates to a group corpus associated with the user group to which the user belongs.
39. 根据权利要求 35所述的用户设备, 其特征在于, 所述群组语料库 中的群组词条具有一个或多个属性, 所述属性包括群组标识、 优先级别、 被选次数、 最近被选择时间、 来源用户和 /或目标用户。  39. The user equipment according to claim 35, wherein the group term in the group corpus has one or more attributes, the attribute includes a group identifier, a priority level, a selected number of times, and a most recent The time, source user, and/or target user are selected.
40. 根据权利要求 39所述的用户设备, 其特征在于, 所述用户所获得 词条选项范围和顺序基于所述群组词条的属性。  40. The user equipment according to claim 39, wherein the range and order of terms obtained by the user are based on attributes of the group entry.
41. 根据权利要求 39所述的用户设备, 其特征在于, 所述用户在所述 用户群组中具有不同优先级, 所述网络服务器根据某个词条来源用户的优 先级、 该词条的当前优先级、 最近被选择时间和 /或被选择次数向该词条分 配不同的优先级或更新该词条的优先级。 The user equipment according to claim 39, wherein the user is in the The user group has different priorities, and the network server assigns different priorities to the term according to the priority of the user of a certain entry, the current priority of the entry, the most recently selected time, and/or the selected number of times. Level or update the priority of the term.
42. 根据权利要求 26所述的用户设备, 其特征在于, 所述用户设备供 所述用户在基于 WEB的应用程序中进行文字输入。  The user equipment according to claim 26, wherein the user equipment is provided by the user for text input in a WEB-based application.
43. 根据权利要求 26所述的用户设备, 其特征在于, 所述用户设备供 所述用户在基于 WEB的浏览器程序中进行文字输入。  43. The user equipment according to claim 26, wherein the user equipment is provided by the user for text input in a web-based browser program.
44. 根据权利要求 26所述的用户设备, 其特征在于, 所述网络词条选 项包括至少一个搜索相关信息选项。  44. The user equipment of claim 26, wherein the network entry option comprises at least one search related information option.
45. 根据权利要求 26所述的用户设备, 其特征在于, 所述用户设备还 包括:  The user equipment according to claim 26, wherein the user equipment further comprises:
特征获取装置, 用于获取该用户的用户特征信息;  a feature acquiring device, configured to acquire user feature information of the user;
匹配查询装置, 用于根据所述输入序列信息, 并基于所述用户特征信 息在所述用户特定的词库中进行匹配查询和在所述网络服务器的搜索信息 库进行匹配查询, 以获得一个或多个与所述输入序列匹配的网络词条选项 , 所述网络词条选项包括至少一个搜索相关信息选项。  Matching query means, configured to perform a matching query in the user-specific vocabulary based on the input sequence information, and perform a matching query in the search information base of the web server according to the user characteristic information to obtain one or A plurality of network term options that match the input sequence, the network term options including at least one search related information option.
46. 根据权利要求 45所述的用户设备, 其特征在于, 所述用户特征信 息包括以下各项中的至少一项:  46. The user equipment according to claim 45, wherein the user characteristic information comprises at least one of the following:
用户的历史输入记录;  User's history input record;
用户设定的个人偏好;  User-set personal preferences;
用户的属性信息;  User attribute information;
用户的地址信息。  User's address information.
47. 根据权利要求 46所述的网络设备, 其特征在于, 所述用户设备还 包括:  The network device according to claim 46, wherein the user equipment further comprises:
更新装置, 用于根据所述用户对所述一个或多个网络词条选项的选择 来更新所述网络服务器的所述用户特定的词库和 /或所述搜索相关信息库和 /或所述用户的历史输入记录。 An update device, configured to update the user-specific vocabulary and/or the search-related information base of the network server according to the user's selection of the one or more network entry options / or the user's history input record.
48. —种用于供用户进行文字输入的网络服务器, 其特征在于, 所述 网络服务器包括:  48. A network server for text input by a user, wherein the network server comprises:
用于经由网络接收用户设备上输入序列的装置;  Means for receiving an input sequence on a user equipment via a network;
用于基于所述输入序列在网络服务器中进行匹配查询获得至少一个网 络词条选项的装置;  Means for obtaining at least one network term option based on the input sequence performing a matching query in a web server;
用于获得的所述网络词条选项返回至所述用户设备的装置。  The network entry option for obtaining is returned to the device of the user device.
49. 根据权利要求 48所述的网络服务器, 其特征在于, 所述网络服务 器包括:  49. The network server according to claim 48, wherein the network server comprises:
分别对应于每个用户的用户网络语料库; 以及  a user network corpus corresponding to each user;
公共网络语料库。  Public network corpus.
50. 根据权利要求 49所述的网络服务器, 其特征在于, 所述网络服务 器还包括网络同步装置, 用于当用户通过所述用户设备登录网络服务器时, 在接收到来自用户设备的语料库同步指令之后, 将所述用户网络语料库与 所述用户设备中的本地语料库进行同步。  The network server according to claim 49, wherein the network server further comprises network synchronization means, configured to receive a corpus synchronization instruction from the user equipment when the user logs in to the network server through the user equipment. Thereafter, the user network corpus is synchronized with a local corpus in the user device.
51. 根据权利要求 50所述的网络服务器, 其特征在于, 如果用户没有 登录网络服务器, 则在接收到用户在用户设备上的按键输入序列时仅仅在 所述公共网络语料库中检索匹配的网络词条选项的装置;  51. The network server according to claim 50, wherein if the user does not log in to the network server, only the matching network words are retrieved in the public network corpus upon receiving the key input sequence of the user on the user equipment. Device of options;
如果用户已经登录网络服务器, 并且该用户的所述用户网络语料库已 经与该用户设备上的本地语料库同步, 则在接收到用户在用户设备上的按 键输入序列时仅仅在所述公共网络语料库中检索匹配的网络词条选项; 以 及  If the user has logged into the web server and the user's web corpus of the user has been synchronized with the local corpus on the user device, only the public network corpus is retrieved upon receiving the key input sequence of the user on the user device Matching network term options; and
如果用户已经登录网络服务器, 但是该用户的所述用户网络语料库没 有与该用户设备上的本地语料库同步, 则在接收到用户在用户设备上的按 键输入序列时在所述公共网络语料库和该用户的用户网络语料库中检索匹 配的网络词条选项。 If the user has logged into the web server, but the user's web corpus of the user is not synchronized with the local corpus on the user device, then the public network corpus and the user are received upon receiving the key input sequence of the user on the user device Retrieve matching network entry options in the user's web corpus.
52. 根据权利要求 48至 51 中任意一项所述的网络服务器, 其特征在 于, 所述网络服务器还包括优先级确定装置, 用于根据词条先前是否被选 择过、 词条先前被选择的时间先后、 词条先前被选择的次数、 用户预置的 输入偏好选项和 /或词条在网络上的被搜索的次数来确定所匹配的词条选项 中各个词条的优先级。 The network server according to any one of claims 48 to 51, wherein the network server further comprises priority determining means for determining whether the entry was previously selected according to whether the entry was previously selected. The chronological order, the number of times the term was previously selected, the user-preset input preference options, and/or the number of times the term was searched on the network determines the priority of each of the matching term options.
53. 根据权利要求 48所述的网络服务器, 其特征在于, 所述网络词条 为基于所述输入序列在与用户对应的群组语料库中进行匹配查询所获得的 网络群组词条。  53. The network server according to claim 48, wherein the network term is a network group entry obtained by performing a matching query in a group corpus corresponding to the user based on the input sequence.
54. 根据权利要求 53所述的网络服务器, 其特征在于, 所述网络服务 器还包括服务器侧注册装置, 用于将所述用户注册到一个或多个用户群组, 所述用户群组与所述群组语料库相关联。  The network server according to claim 53, wherein the network server further comprises server side registration means for registering the user to one or more user groups, the user group and the The group corpus is associated.
55. 根据权利要求 53所述的网络服务器, 其特征在于, 所述网络服务 器还包括身份信息接收装置, 用于接收所述用户的用户身份信息, 以便确 定与之关联的用户群组, 进而确定与之关联的群组语料库。  The network server according to claim 53, wherein the network server further comprises identity information receiving means, configured to receive user identity information of the user, to determine a user group associated with the user group, and further determine The group corpus associated with it.
56. 根据权利要求 53所述的网络服务器, 其特征在于, 所述网络服务 器还包括词条接收装置, 用于接收所述用户在输入时最终选定的词条, 以 便更新与所述用户所属用户群组相关联的群组语料库。  The network server according to claim 53, wherein the network server further comprises a term receiving device, configured to receive a term finally selected by the user when inputting, so as to update and belong to the user. The group corpus associated with the user group.
57. 根据权利要求 53所述的网络服务器, 其特征在于, 所述群组语料 库中的群组词条具有一个或多个属性,所述属性包括群组标识、优先级别、 被选次数、 最近被选择时间、 来源用户和 /或目标用户。  57. The network server according to claim 53, wherein the group term in the group corpus has one or more attributes, the attribute includes a group identifier, a priority level, a selected number of times, and a most recent The time, source user, and/or target user are selected.
58. 根据权利要求 57所述的网络服务器, 其特征在于, 根据所述群组 词条的属性向正在进行输入的用户提供所述群组词条。  58. The network server according to claim 57, wherein the group entry is provided to a user who is making an input according to an attribute of the group entry.
59. 根据权利要求 57所述的网络服务器, 其特征在于, 用于为所述用 户赋予在所述用户群组中的不同优先级, 以便根据某个词条来源用户的优 先级、 该词条的当前优先级、 最近被选择时间和 /或被选择次数向该词条分 配不同的优先级或更新该词条的优先级的装置。 59. The network server according to claim 57, wherein: the user is given different priorities in the user group, so that the priority of the user according to a certain entry, the entry The current priority, the most recently selected time, and/or the number of times selected to assign a different priority to the term or to update the priority of the term.
60. 根据权利要求 53所述的网络服务器, 其特征在于, 所述网络服务 语料库进行初始化的装置。 60. The network server according to claim 53, wherein the network service corpus performs initialization.
61. 根据权利要求 48所述的网络服务器, 其特征在于, 所述网络服务 器还包括:  The network server according to claim 48, wherein the network server further comprises:
用于接收所述用户设备发来的所述用户的身份信息的装置;  Means for receiving identity information of the user sent by the user equipment;
用于根据所述用户的身份信息, 调取所述用户的输入历史记录的装置; 用于基于所述输入信息在所述词典数据库中进行匹配查询, 得到初步 查询结果的装置;  Means for retrieving an input history record of the user according to the identity information of the user; means for performing a matching query in the dictionary database based on the input information to obtain a preliminary query result;
用于根据所述用户的输入历史记录, 对所述初步查询结果进行处理, 以生成所述备选输入项集合, 将所述输入历史记录与所述初步查询结果进 行对比, 将所述初步查询结果中的同样包含在所述输入历史记录中的内容 排列在优先于其它内容的位置, 以生成所述备选输入项集合的装置。  And processing, according to the input history record of the user, processing the preliminary query result to generate the candidate input item set, comparing the input history record with the preliminary query result, and performing the preliminary query The content of the result that is also included in the input history is arranged in a position that is prioritized over other content to generate the set of candidate input items.
62. 根据权利要求 48或 61所述的网络服务器, 其特征在于, 所述网 络服务器还包括:  The network server according to claim 48 or claim 61, wherein the network server further comprises:
更新装置, 用于根据用户的指示信息所表示的输入项, 执行以下各项 中的至少一项:  And an updating device, configured to perform at least one of the following according to the input item indicated by the user's instruction information:
更新所述用户的输入历史记录;  Updating the input history of the user;
对所述网络服务器所存储的词典数据库进行训练和更新。  Training and updating the dictionary database stored by the web server.
63. 根据权利要求 48所述的网络服务器, 其特征在于, 所述网络词条 选项包括至少一个搜索相关信息选项。  63. The network server of claim 48, wherein the network term option comprises at least one search related information option.
64. 根据权利要求 63所述的网络服务器, 其特征在于, 所述网络服务 器还包括:  64. The network server according to claim 63, wherein the network server further comprises:
特征获取装置, 用于获取该用户的用户特征信息;  a feature acquiring device, configured to acquire user feature information of the user;
确定装置, 用于根据所述用户特征信息确定在该网络设备中所述用户 特定的词库; 匹配查询装置, 用于根据所述输入序列信息, 并基于所述用户特征信 息在所述用户特定的词库中进行匹配查询和在该网络设备的搜索信息库进 行匹配查询, 以获得一个或多与之匹配的输入词条选项, 其中包括至少一 个与所述输入序列信息和用户特征信息相关的搜索相关信息。 Determining means, configured to determine, according to the user characteristic information, the user-specific vocabulary in the network device; a matching query device, configured to perform a matching query in the user-specific lexicon and perform a matching query in the search information base of the network device according to the input sequence information, to obtain one or more The input term option matched thereto includes at least one search related information related to the input sequence information and the user feature information.
65. 根据权利要求 64所述的网络服务器, 其特征在于, 所述用户特征 信息包括以下各项中的至少一项:  65. The network server according to claim 64, wherein the user characteristic information comprises at least one of the following:
用户的历史输入记录;  User's history input record;
用户设定的个人偏好;  User-set personal preferences;
用户的身份信息;  User identity information;
用户的地址信息。  User's address information.
66. 根据权利要求 48所述的网络服务器, 其特征在于, 所述网络服务 器还包括:  The network server according to claim 48, wherein the network server further comprises:
更新装置, 用于根据该用户对所述一个或多个输入词条选项的选择来 更新该网络设备的所述用户特定的词库和 /或所述搜索相关信息库和 /或所 述用户的历史输入记录。  An update device, configured to update the user-specific vocabulary of the network device and/or the search-related information base and/or the user according to the user's selection of the one or more input term options History input record.
67. 一种用于供用户进行文字输入的系统, 其特征在于, 所述系统包 括权利要求 26至 47中任意一项所述的用户设备, 以及权利要求 48至 66 中任意一项所述的网络服务器。  67. A system for a text input by a user, the system comprising the user equipment of any one of claims 26 to 47, and the method of any one of claims 48 to 66 Network Server.
PCT/CN2011/072268 2010-04-16 2011-03-29 Method, device, server and system for character input by user WO2011127788A1 (en)

Applications Claiming Priority (8)

Application Number Priority Date Filing Date Title
CN2010101485637A CN102063450A (en) 2010-04-16 2010-04-16 Method and equipment for user to input words on basis of network
CN2010101485726A CN102063451A (en) 2010-04-16 2010-04-16 Method and equipment for inputting characters by user and providing relevant search information
CN201010148563.7 2010-04-16
CN201010148571.1 2010-04-16
CN 201010148571 CN102063194A (en) 2010-04-16 2010-04-16 Method, equipment, server and system for inputting characters by user
CN201010148572.6 2010-04-16
CN201010187082.7 2010-05-31
CN 201010187082 CN102063452A (en) 2010-05-31 2010-05-31 Method, equipment, server and system for inputting characters by user

Publications (1)

Publication Number Publication Date
WO2011127788A1 true WO2011127788A1 (en) 2011-10-20

Family

ID=44798278

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2011/072268 WO2011127788A1 (en) 2010-04-16 2011-03-29 Method, device, server and system for character input by user

Country Status (1)

Country Link
WO (1) WO2011127788A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10009222B2 (en) 2016-03-30 2018-06-26 International Business Machines Corporation Input method engine management for edge services
CN109144285A (en) * 2017-06-16 2019-01-04 北京搜狗科技发展有限公司 A kind of input method and device
US10218566B2 (en) 2016-03-30 2019-02-26 International Business Machines Corporation Proactive input method engine management for edge services based on crowdsourcing data
US10382264B2 (en) 2016-12-15 2019-08-13 International Business Machines Corporation Fog computing for machine translation
US11977830B2 (en) 2022-09-13 2024-05-07 International Business Machines Corporation Demand-based deployment of font server to an edge device

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101013342A (en) * 2007-01-22 2007-08-08 魏新成 Chinese online input method based on Chinese network word base
CN101178737A (en) * 2006-12-12 2008-05-14 北京搜狗科技发展有限公司 Method and system for publishing information related to internet key character
CN101520786A (en) * 2008-02-27 2009-09-02 北京搜狗科技发展有限公司 Method for realizing input method dictionary and input method system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101178737A (en) * 2006-12-12 2008-05-14 北京搜狗科技发展有限公司 Method and system for publishing information related to internet key character
CN101013342A (en) * 2007-01-22 2007-08-08 魏新成 Chinese online input method based on Chinese network word base
CN101520786A (en) * 2008-02-27 2009-09-02 北京搜狗科技发展有限公司 Method for realizing input method dictionary and input method system

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10009222B2 (en) 2016-03-30 2018-06-26 International Business Machines Corporation Input method engine management for edge services
US10218566B2 (en) 2016-03-30 2019-02-26 International Business Machines Corporation Proactive input method engine management for edge services based on crowdsourcing data
US10382264B2 (en) 2016-12-15 2019-08-13 International Business Machines Corporation Fog computing for machine translation
CN109144285A (en) * 2017-06-16 2019-01-04 北京搜狗科技发展有限公司 A kind of input method and device
CN109144285B (en) * 2017-06-16 2022-09-06 北京搜狗科技发展有限公司 Input method and device
US11977830B2 (en) 2022-09-13 2024-05-07 International Business Machines Corporation Demand-based deployment of font server to an edge device

Similar Documents

Publication Publication Date Title
US11461003B1 (en) User interface for presenting suggestions from a local search corpus
WO2012139394A1 (en) Resource candidate sequencing result determination method, apparatus and equipment
JP5997350B2 (en) Structured search query based on social graph information
US8825694B2 (en) Mobile device retrieval and navigation
JP5827380B2 (en) Non-standard position-based text input
US10152541B2 (en) Method of and system for conducting personalized federated search and presentation of results therefrom
US7966003B2 (en) Disambiguating ambiguous characters
US8996507B2 (en) Location in search queries
US7966309B2 (en) Providing relevance-ordered categories of information
US7548899B1 (en) Method and system for information retrieval based on menu selections
WO2011150730A1 (en) Method and device for mixed input in english and another kind of language
CN102063450A (en) Method and equipment for user to input words on basis of network
WO2012000335A1 (en) Input method and device combined with application interfaces
AU2008206127A1 (en) Presentation of location related and category related search results
WO2008080192A1 (en) Process and apparatus for selecting an item from a database
RU2677379C2 (en) Method of forming a user query
CN102063194A (en) Method, equipment, server and system for inputting characters by user
WO2011127788A1 (en) Method, device, server and system for character input by user
CN107123318B (en) Foreign language writing learning system based on input method device
JP5827449B2 (en) Personalized structured search queries for online social networks
JP2010033240A (en) Information access device
AU2016204569A1 (en) Improved process and apparatus for selecting an item from a database
TW201447615A (en) Social entity previews in query formulation

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11768400

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 05/03/2013)

122 Ep: pct application non-entry in european phase

Ref document number: 11768400

Country of ref document: EP

Kind code of ref document: A1