CN102209083B - Method and server for synchronous update of user lexicon and input method system - Google Patents

Method and server for synchronous update of user lexicon and input method system Download PDF

Info

Publication number
CN102209083B
CN102209083B CN201010137311.4A CN201010137311A CN102209083B CN 102209083 B CN102209083 B CN 102209083B CN 201010137311 A CN201010137311 A CN 201010137311A CN 102209083 B CN102209083 B CN 102209083B
Authority
CN
China
Prior art keywords
user
subset
user thesaurus
thesaurus
entry
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201010137311.4A
Other languages
Chinese (zh)
Other versions
CN102209083A (en
Inventor
王天一
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Sogou Technology Development Co Ltd
Original Assignee
Beijing Sogou Technology Development Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Sogou Technology Development Co Ltd filed Critical Beijing Sogou Technology Development Co Ltd
Priority to CN201010137311.4A priority Critical patent/CN102209083B/en
Publication of CN102209083A publication Critical patent/CN102209083A/en
Application granted granted Critical
Publication of CN102209083B publication Critical patent/CN102209083B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Document Processing Apparatus (AREA)

Abstract

The invention provides a method and a server for synchronous update of a user lexicon and an input method system. The method comprises the following steps that: different storage space is maintained aiming at different users, and at least two separation equipment user lexicons are stored based on different equipments under a user name in a storage space of the user; when a lexicon of a present equipment terminal under the user name is needed to be updated, at least one separation equipment user lexicon, which satisfies the demand of the present equipment terminal, is selected from the separation equipment user lexicons in the storage space of the user; and a entry and/or a parameter, which is/are required and used for updating, is/ are obtained from the separation equipment user lexicon, and then the lexicon of the present equipment terminal is updated synchronously by the entry and/or the parameter. According to the invention, information of a general input habit of a user can be conveniently transmitted to each equipment terminal of the user by the server; at the same time, at least two separation equipment lexicons are maintained for the user at the server. Therefore, the individual input demand and the input habit of the user on different equipment terminals can be reflected and satisfied.

Description

A kind of updating user vocabulary synchronouslly method, update server and input method system
Technical field
The present invention relates to input method technique field, particularly relate to a kind of method of updating user vocabulary synchronouslly, update server and a kind of input method system.
Background technology
Current input method system (comprising Chinese, Japanese and Korean etc.) is generally all for user provides candidate word and sequence thereof in Information Inputting Process based on the word frequency in its word bank system and word bank system.Candidate word and sequence thereof are important indicators of user's first-selected word hit rate height in Information Inputting Process.Certainly, for Chinese character coding input method, technically, input method system itself cannot know that word is that user needs most; But in vast as the open sea Chinese words, the use of each words and the frequency of occurrences are different, by words higher for frequency of occurrences sequence in the front first-selected word hit rate that just greatly can improve input method system, namely can improve from probability the possibility that the preceding vocabulary of sequence meets user's needs.
In the prior art, input method system generally comprises a system dictionary, for meeting the use habit of most of user, also comprises a user thesaurus, for recording the use habit of specific user, better to improve the input efficiency of this user.But along with the develop rapidly of computer technology, existing input method user generally likely uses multiple equipment end, such as, office computer, home computer and mobile notebook or other people computer etc.; And likely frequent transitions between multiple stage computer.Therefore, the user thesaurus that user wishes in each equipment end can both reflect the use habit of this user, if relearn record in each equipment end, obtains user thesaurus, then inefficiency, and the input having a strong impact on user is experienced.
Publication number is the Chinese patent literature of CN101030157, disclose a kind of solution of user thesaurus being carried out back up at server end, what the program can ensure that user uses on different devices is all the same user thesaurus that can reflect this user's use habit, even if this user switches use on different devices, also its experience can not be affected.
But above-mentioned solution still exists certain problem, although because a user has some use habits communicated in each equipment end, for different equipment end, in fact also there are some specially for the use habit of an equipment end in this user.Such as, for office computer and home computer, except the words use habit that some are substantially identical, office computer may exist some, and to be exclusively used in work relevant (such as, specialized vocabulary etc.) words use habit, and home computer may exist, and some are special with amusement with the words use habit of game.Such as, " expectation " word is higher in the frequency of utilization of home computer, and " language material " word is higher in the frequency of utilization of office computer, and the two is different.If simply direct, user thesaurus is united by server, then can overemphasize the general character for this user of user thesaurus in each equipment end, and ignore the individual character of distinct device end.
Especially when PC input method is applied on the mobile terminals such as mobile phone gradually, due to the input of the Mobile terminal keyboard such as mobile phone, the not convenient property selecting word, above-mentioned to ignore the problem that individual character causes more outstanding.
In a word, the technical problem needing those skilled in the art urgently to solve is exactly: how can improve existing server-side user dictionary backup scenario, meets the common requirement of user on distinct device end and individual needs.
Summary of the invention
Technical problem to be solved by this invention is to provide a kind of method of input method updating user vocabulary synchronouslly, it can download for different equipment end the entry information meeting this device requirement, namely can ensure the general character input habit of this user, also can take into account the individual character input habit of this user on current device.
Accordingly, present invention also offers a kind of user thesaurus update server and a kind of input method system, to implement the above described methods, take into account the general character input habit of user and the individual character input habit on current device.
In order to solve the problem, the invention discloses a kind of method of updating user vocabulary synchronouslly, comprise: safeguard different memory spaces for different user, in the memory space of a user, store at least two subset user thesaurus for the distinct device under this user name; When need be a user name under a current device end carry out Word library updating time, from the subset user thesaurus described user storage space selector be fated before at least one subset user thesaurus of equipment end demand; From selected subset user thesaurus, obtain entry and/or the parameter of required renewal, synchronized update is to described current device end.
Preferably, described synchronous updating method, can also comprise: the entry and/or the parameter that receive the needs renewal that current device end is uploaded; The described entry that receives and/or parameter are updated to subset user thesaurus corresponding to this current equipment end.
Preferably, described synchronous updating method, can also comprise: the download demand being obtained current device end by the download attribute of current device end, wherein, described download attribute is preset in server end, or described download attribute is attached in the update request that current device end sends; In described subset user thesaurus from user storage space selector be fated before at least one subset user thesaurus of equipment end demand comprise: according to the download attribute of described current device end, from the described subset user thesaurus this user storage space selector be fated before at least one equipment user's dictionary of equipment end demand.
Preferably, the user thesaurus that described selection meets demand comprises: the single subset user thesaurus selecting described current device end corresponding is the user thesaurus meeting demand; Or, select at least one other subset user thesaurus except subset dictionary corresponding to described current device end to be the user thesaurus meeting demand; Or, select at least two the subset user thesaurus belonging to same device class to be the user thesaurus meeting demand.
Preferably, described multiple subset user thesaurus for distinct device are multiple dictionary files independently; Each dictionary file record has the identification parameter of entry source device; Or described multiple subset user thesaurus for distinct device are a dictionary file, wherein store the identification parameter of its source device for entry, to form the virtual multiple subset user thesaurus for distinct device.
Preferably, a described identification parameter represents an entity hardware device; Or arrange according to user, described identification parameter representative belongs to multiple entity hardware devices of a class.
According to another embodiment of the present invention, also disclose a kind of update server of user thesaurus, comprise memory cell, communication unit and update process unit, wherein:
Memory cell, for safeguarding different memory space for different user, stores at least two subset user thesaurus for the distinct device under this user name in the memory space of a user;
Communication unit, for the request of receiving equipment end down loading updating user thesaurus; And the entry that the needs obtained by update process unit upgrade and/or parameter synchronization are updated to current device end;
Update process unit, at least one subset user thesaurus of equipment end demand before being fated for selector from the subset user thesaurus stored of memory cell, obtains entry and/or the parameter of wherein required renewal.
Preferably, the entry that the needs that described communication unit is also uploaded for receiving equipment end upgrade and/or parameter; Described update process unit is also for being updated to subset user thesaurus corresponding to this current equipment end of described memory cell by received entry and/or parameter.
Preferably, the subset user thesaurus meeting demand described in comprises: the single subset user thesaurus that current device end is corresponding; Or, at least one other subset user thesaurus except the subset dictionary that current device end is corresponding; Or, belong at least two subset user thesaurus of same device class.
According to another embodiment of the present invention, also disclose a kind of input method system, be positioned at an equipment end, comprise: for recording the system dictionary of basic words and parameter thereof; And be included in the local user vocabulary of at least two subset user thesaurus under same user name, described at least two subset user thesaurus are respectively for the distinct device that this input method user uses; Download unit, for sending down loading updating request to server end, and receiving the entry and/or parameter downloaded, being updated to local user vocabulary.
Preferably, described input method system can also comprise: uploading unit, for uploading in current device end the entry and/or parameter that need to upgrade to server end.
Preferably, described input method system can also comprise:
Equipment end interactive unit, for the synchronized update information with the mutual user thesaurus separately of the input method system of another equipment end;
Staging server unit, for according to mutual user thesaurus update status, determines whether to select current device end as staging server; If so, then from local user vocabulary, select at least one the subset user thesaurus meeting demand, the entry and/or the parameter that obtain wherein required renewal are sent to another equipment end described.
Preferably, described input method system can also comprise: weight unit, for when same entry repeats at multiple subset user thesaurus respectively, then give different weight according to presetting rule for the parameter of this entry in each subset user thesaurus, and calculate final argument; Described parameter is used for candidate item sequence.
Preferably, the subset user thesaurus meeting demand described in comprises: the single subset user thesaurus that current device end is corresponding; Or, at least one other subset user thesaurus except the subset dictionary that current device end is corresponding; Or, belong at least two subset user thesaurus of same device class.
Preferably, described multiple subset user thesaurus for distinct device are multiple dictionary files independently; Each dictionary file record has the identification parameter of entry source device;
Or described multiple subset user thesaurus for distinct device are a dictionary file, wherein store the identification parameter of its source device for entry, to form the virtual multiple subset user thesaurus for distinct device.
According to another embodiment of the present invention, also disclose a kind of method of updating user vocabulary synchronouslly, comprising: the synchronized update information of the user thesaurus of mutual first equipment end and the second equipment end; According to mutual comparative result, choose one of them equipment end as staging server; Wherein, described first equipment end and the second equipment end store the local user vocabulary comprising at least two subset user thesaurus, and described at least two subset user thesaurus are respectively for the distinct device that current input method user uses; Staging server receives the request of another equipment end down loading updating user thesaurus; At least one the subset user thesaurus meeting another equipment end demand is selected from the subset user thesaurus that staging server stores; From selected subset user thesaurus, obtain entry and/or the parameter of required renewal, synchronized update is to another equipment end.
Preferably, the user thesaurus that described selection meets demand comprises: the single subset user thesaurus selecting current download equipment end corresponding is the user thesaurus meeting demand; Or, select at least one other subset user thesaurus except subset dictionary corresponding to current device end to be the user thesaurus meeting demand; Or, select at least two the subset user thesaurus belonging to same device class to be the user thesaurus meeting demand.
Compared with prior art, the present invention has the following advantages:
The present invention, in order to meet the general character input demand of user in multiple equipment end, adopts server end maintenance for the user thesaurus mode of this user, can be passed in each equipment end of this user by the general character input habit of user; Further, the present invention, in order to meet the individual character input demand of user on distinct device end, maintains multiple subset dictionary (at least two) for a user account, to embody this user input habit on different devices.
Such as, etymology facility information can be added in safeguarded user thesaurus entry attribute, to realize subset management.Due to the existence of the etymology facility information of entry, make when equipment end information upload, can subset dictionary only corresponding to backup updating oneself, and other subset dictionary can not be changed; When downloading, then can download by the dictionary obtained needed for current device end from multiple subset dictionary, such as, subset dictionary of identical category etc.
Further, due to the existence of the etymology facility information of entry, when equipment end specifically inputs, can by modes such as weights to embody the individual character input habit of user on current device (such as, improve or reduce the candidate item sorting position that etymology facility information is the entry of current device end); And owing to also having downloaded other equipment neologisms simultaneously, so the vocabulary quantity extended in current device end subscriber dictionary and scope simultaneously, namely still ensure that the general character input habit of this user when distinct device end inputs.
Accompanying drawing explanation
Fig. 1 is the method flow diagram of a kind of updating user vocabulary synchronouslly described in the embodiment of the present invention;
Fig. 2 is the schematic diagram that described in the embodiment of the present invention, a server end dictionary stores;
Fig. 3 is the schematic diagram that another server end dictionary described in the embodiment of the present invention stores;
Fig. 4 is the first equipment end described in the embodiment of the present invention and the synchronous updating method flow chart between the second equipment end two equipment end;
Fig. 5 is the structured flowchart of a kind of user thesaurus server example described in the embodiment of the present invention;
Fig. 6 is the structured flowchart of a kind of input method system embodiment described in the embodiment of the present invention.
Embodiment
For enabling above-mentioned purpose of the present invention, feature and advantage become apparent more, and below in conjunction with the drawings and specific embodiments, the present invention is further detailed explanation.
First user thesaurus involved in the present invention is simply introduced.Entry record in user thesaurus of the present invention can comprise: the existing words of user's input and corresponding property parameters; And/or, the self-made characters word of user's input and corresponding property parameters.Data store organisation for a record of user thesaurus can be:
(entry; Property parameters 1; Property parameters 2; ...; Property parameters n)
Wherein, property parameters can be word frequency information, rise time, last service time, binary crelation etc.Input method user manually can operate and modify to the entry in dictionary record, property parameters.Input method system also can, by automatically changing corresponding entry parameter according to presetting rule the detection of user's input behavior information, such as, select word to increase word frequency according to user; Or, according to the time, word frequency information is decayed.
All only adopting the most frequently used word frequency information to be described for the property parameters of entry in embodiment below, but for those of ordinary skills, extended to other property parameters, should be apparent.
With reference to Fig. 1, show the method flow diagram of a kind of updating user vocabulary synchronouslly of the present invention, specifically can comprise:
S101, the down loading updating request of server receiving equipment end;
Server end safeguards its corresponding user thesaurus for each user.Equipment end of the present invention all refers to the equipment of user side.
In practical application, a user may use multiple equipment end, such as home computer, office machine, PDA etc.The feature of server of the present invention is, it can also be safeguarded respectively for multiple user thesaurus of multiple equipment for a user.Any existing feasible mode can be adopted to realize the storage of user thesaurus at server end, such as, the mode of database or the mode of data file etc.
Due to server stores for the dictionary of different user, therefore usually need to comprise user totem information (such as when initiating update request, the user name of user's registration, or the identification number etc. of server-assignment), and in order to the personalization input demand meeting current device end, the identification information (hardware identifier of such as current device end or address designation etc.) of current device end can also be comprised.
In another preferred embodiment of the present invention, described update request also can not comprise user totem information, and only comprise the identification information of current device end, as long as equipment identification information can ensure uniqueness, because when user is constant, determines current device and also just determine active user.
In order to more clearly demonstrate, with reference to Fig. 2, show the schematic diagram that a server end dictionary stores, under a user account, storing equipment 1 user thesaurus, equipment 2 user thesaurus, equipment 3 user thesaurus, equipment 4 user thesaurus and equipment 5 user thesaurus.In fig. 2, above-mentioned 5 subset user thesaurus, can be 5 dictionary files independently, each dictionary file records the identification parameter of entry source device respectively, namely this dictionary file is identified for which equipment, to record the user personality custom on this equipment.
With reference to Fig. 3, above-mentioned 5 subset user thesaurus can also be different groups according to category division, only to upgrade the dictionary information in a certain group, and the dictionary information do not affected in other groups, namely the present invention not only can record the input habit of individual equipment, can also record and belong to other input habit of same class.
In a preferred embodiment of the invention, also can not adopt the mode shown in Fig. 2, and adopt all entry information in above-mentioned 5 dictionaries of dictionary file storage.Certainly, in order to the multiple subset user thesaurus for distinct device can be formed in a dictionary file, then can a property parameters be set for each entry in this dictionary---the identification parameter of entry source device; Or, also can for one group of entry record corresponding property parameters---the identification parameter of entry source device, identify all entries in this group and parameter all derives from this equipment.Although thus the user thesaurus of server end exists with the form of a dictionary file, can be formed virtual respectively for multiple subset user thesaurus of multiple equipment.
It should be noted that, in practical application, also the mode being less than 5 dictionary files can be adopted, such as, adopt 2 dictionary files, wherein, dictionary file 1 is the entry information of originating for memory device 1, equipment 2, and dictionary file 2 is the entry information of originating for memory device 3, equipment 4, equipment 5.
S102, from least two subset user thesaurus (for multiple user thesaurus of multiple equipment same user name) described in server end, selects the subset user thesaurus meeting demand;
Because the present invention have recorded the input habit of individual equipment when safeguarding dictionary, can ensure that generated subset dictionary is relatively independent on the one hand, simultaneously, in a step 102, also according to the change of service condition, according to different condition, each subset dictionary can be integrated at any time, or collaborative work.
Concrete, server end can obtain the download demand of current device end by the download attribute of current device end; Wherein, described download attribute can be preset in server end, or described download attribute also can be attached in the update request of current device end.The situation being in advance preset in server end to download attribute is below described.
Such as, when using some equipment, wish that the dictionary of this equipment keeps independent, then can set the attribute of a separate backup reduction when server end sets up this equipment user's dictionary, namely the download attribute of current device end is the subset user thesaurus only corresponding to down loading updating current device end.When the equipment end with this attribute initiates update request, server only needs by its corresponding equipment user's Word library updating to current device end, and the dictionary of miscellaneous equipment end can not disturb current device end.Namely the subset user thesaurus meeting demand described in is: the single subset user thesaurus that current device end is corresponding.
Again such as, when using some equipment, wish that the user thesaurus between the equipment of this equipment identical category (as work desktop computer and notebook) is shared, and and between the equipment dictionary of other classifications, keep independent, then can set a category attribute when server end sets up this equipment user's dictionary, namely the download attribute of current device end is need to download and the subset user thesaurus under this equipment end identical category.When the equipment end with this attribute initiates update request, by other equipment user's Word library updatings in its generic to current device end, the equipment user's dictionary in other classifications can not disturb current device end.Namely the subset user thesaurus meeting demand described in is: belong to each subset user thesaurus under same device class.
Again such as, when using some equipment, wishing that the user thesaurus of this equipment and other equipment rooms is all shared, then can set a full Update attribute when server end sets up this equipment user's dictionary; Namely the download attribute of current device end is all subset user thesaurus needed to upgrade under this user name.When the equipment end with this attribute initiates update request, by this equipment user's dictionary and server sync, and by other equipment user's Word library updatings under this user name to current device end.Namely the subset user thesaurus meeting demand described in is: each subset user thesaurus of all devices under this user name of server end.
It should be noted that, under normal circumstances, general in down loading updating process, do not need to download the corresponding subset user thesaurus of current device end at server end, because the corresponding subset user thesaurus that server end stores is exactly that backup uploaded by local device, this locality is identical (even renewal) compared with server file, therefore only needs other equipment user's dictionaries of down loading updating.But, in some cases, such as, equipment refitting system, local user vocabulary damage or lose, the hand-operated forced refreshing local user vocabulary of user etc., selected meeting in the subset user thesaurus of demand also can comprise current device end server end corresponding subset user thesaurus.
In another preferred embodiment of the present invention, download demand is more flexible, described update request can also comprise the personalized download request (the download attribute by current download equipment end is attached in described update request) of user, such as, user specifies certain classification before the update, then can download equipment user's dictionary of specified classification, this appointment classification may be different with the classification of current device end; Or, user's designated equipment mark etc.
S103, obtains the wherein required entry upgraded, corresponding entry and/or parameter synchronization is updated to current device end.
After selecting one or more equipment user's dictionary needed for current download equipment end, obtain entry and/or the parameter of wherein required renewal, by its down loading updating to current device end.Simply, the entry of required renewal can be all entries in one or more selected equipment user's dictionary.Certainly, also can adopt the pattern of incremental update, namely the required entry upgraded also can be in selected subset user thesaurus, from the entry that last time has changed since synchronized update.Such as, the entry-word caused due to user's input behavior changes frequently.If some there occurs change in the property parameters of this entry, then its all properties parameter all can be upgraded, also only can upgrade the property parameters of changing unit.
Server end can independently store subset dictionary, and its renewal is fairly simple, and each subset dictionary correspondence upgrades the entry wherein needing to upgrade.
And server end also can adopt entry to merge situation about storing, such as, same entry appears in two equipment dictionaries, then an entry can be it can be used as to carry out record at server end, retain word frequency, the binary crelation data in the last service time of that record rearward, and last service time, but still record birth device id and the date of birth (entry source-information) of rise time more last word.Certainly, this entry logically still belongs to the entry in two subset dictionaries.
Server and the synchronous flow process of equipment end are completed to embody rule entry incremental update mode below, be briefly described.
Equipment end is to the backup procedure of server end:
Each entry in local file, multiple data field of its word frequency is once change, and each entry needs corresponding change mark and marks.When needs upload server, by change mark, the data increment changed between twice simultaneous operation is transferred to server end, be transmitted the rear change mark cancelling correspondence.
Server end is to the renewal process of equipment end:
In general, because the dictionary on server is local backup character, can not than the Word library updating of this locality; But special, when the corresponding multiple equipment of a dictionary, distinct device intersects in renewal process, and the data on server can be caused to have part new for current local device dictionary.Now, in the subset dictionary of this server end, also need the data different from equipment end to mark.
In another embodiment of the invention, can distinguish with equipment change entry at server end.Such as, when a local device A is after server uploading data, then upload the more new logo of the entry interpolation device A of renewal for local device A, updated by device A to represent this entry, miscellaneous equipment lower subsynchronous time, namely need to download the more new data uploaded by device A specifically.When equipment end B downloads, then by this entry and parameter downloads thereof to local, and on this entry, add the mark that equipment end B upgraded, when secondary device end B upgrades instantly, just can not upgrade this entry again; Undertaken synchronously by respective mark by each equipment when namely upgrading.
In another embodiment of the invention, also can while server end add change mark to change entry data, to each entry mark-on update time; Meanwhile, the also respective markers transformation period of each entry of local device dictionary.When uploading synchronous, need the entry uploaded to upload onto the server mark, then by server by contrasting update time, merge (as retained newer data, or merging two data etc. by predetermined Weight algorithm) by certain principle; When downloading dictionary, by local device lock in time last time, compare with the update time of entry in server backup, the data down transmission being later than lock in time last time update time is preserved to local device.
In a preferred embodiment of the invention, the property parameters of " last service time " can also be comprised in entry parameter; If dictionary form has " last service time " parameter, then also directly can be multiplexed with timestamp information, as the final updating time.
In actual applications, sync mark can also comprise change category attribute, to represent newly-increased, deletion, the state such as change.Concrete, because sync mark belongs to the known technology of synchronizing information technical field, be not described in detail in this.
Mainly describe the process of equipment end from server end down loading updating above, generally, the embodiment of the present invention also can comprise: the entry that needs back up by equipment end and/or parameter upload onto the server end step; Namely backup procedure completes in once communicating with renewal process.Such as, aforesaid synchronized update embodiment can also comprise: the request of received server-side equipment end upload user dictionary; Equipment end uploads in current device end the entry parameter needing to upgrade, by described server end synchronized update to user thesaurus corresponding to this current equipment end.
Because equipment end generally only uploads the entry parameter changed on this equipment, therefore, only need by upper once upload backup after the change entry of (as deleted, newly-increased, change etc.) and parameter thereof all to upload onto the server end, server is updated to corresponding equipment user's dictionary.Certainly, for the purpose of simple, also can upload onto the server whole entry and parameter thereof end, but significantly, can increase the pressure of network transmission resource.The entry parameter uploaded can comprise sync mark, to facilitate the more newly downloaded of other equipment.
It should be noted that, for when uploading backup first, need to set this device attribute first on the server, described device attribute can comprise device id, device type (equipment of the global synchronization that such as, can comprise self-synchronizer, belong to the inter-sync equipment of certain class, participate in) etc.; Described device attribute can also comprise: artificial regulation network is for the limit current value limiting each communication, the synchronous trigger condition (once or when new term amount reaches 100 automatically upgrading as automatic synchronization week about upgrades) of manually specifying.Preferably, for backing up first, the end that directly all uploaded onto the server by local device user thesaurus carries out storing, and can not need to carry out selectivity by sync mark and upload.
Certainly, in actual applications, for some equipment end, it must not need to perform uploading step.Such as, for a kind of special installation: gadget, this type of device attribute for only from server end download needed for equipment user's dictionary (other equipment are uploaded), and this equipment end does not need to upload the subset user thesaurus of oneself to server, server end does not need to safeguard the user thesaurus for this equipment yet.Namely uploading step is for some equipment end, and it is not necessary.
In above-described embodiment, server end safeguards there is corresponding subset user thesaurus for each entity hardware device, such as, distinguished by device id; In fact, the present invention also can be arranged by user, the subset user thesaurus that server end is safeguarded, can to should multiple entity hardware devices of using of user.Such as, for by office desktop computer and office notebook, be all recorded as a device identification at server end, the two down loading updating request initiated or upload backup request and be considered as same equipment and send.
Also it should be noted that, down loading updating process above in embodiment completes at server end and equipment end, and in fact, the present invention also can be applied between two equipment end, namely one of them equipment end serves as virtual server, thus when incorporeity server, also the updating user vocabulary synchronouslly of equipment end to equipment end can be carried out by LAN.Concrete, can comprise: the first equipment end and the second equipment end carry out the mutual of dictionary synchronizing information; According to the user thesaurus update status of described first equipment end and the second equipment end, choose one of them equipment end as staging server end.
With reference to Fig. 4, show the synchronous updating method flow chart between the first equipment end and the second equipment end two equipment end.
S401, sets up the connection of the first equipment end and the second equipment end;
It should be noted that, described first equipment end and the second equipment end store the local user vocabulary comprising multiple subset user thesaurus, and described multiple subset user thesaurus is respectively for the distinct device that current input method user uses;
S402, the synchronizing information of interactive user dictionary, compares update status;
First equipment end and the second equipment end send synchronization request respectively to the other side, and described synchronization request can comprise the synchronizing information (such as, the update status of each equipment dictionary and server end) of self dictionary; After receiving synchronizing information, the update status of the other user's dictionary is done corresponding comparison to the update status of oneself active user's dictionary, as the last update date etc.
S403, judges whether to need synchronized update;
According to above-mentioned comparative result, judge that current two equipment end be connected are the need of synchronized update, if update status is different, then perform S404; If update status is identical, then illustrate does not need synchronized update between these two equipment end, then perform S405, disconnect current connection.
S404, according to mutual comparative result, as staging server, emulating server completes synchronized update, then performs S405 to choose one of them equipment end (such as, update date is up-to-date);
Describe in detail in the embodiment of concrete synchronized update process above, be simply described as follows at this:
Staging server receives the request of another equipment end down loading updating user thesaurus;
At least one the subset user thesaurus meeting another equipment end demand is selected from the subset user thesaurus that staging server stores;
From selected subset user thesaurus, obtain entry and/or the parameter of required renewal, synchronized update is to another equipment end
S405, disconnects current connection.
In the above-described embodiments, because concrete down loading updating process is similar to the reciprocal process of equipment end with aforesaid server end, so do not repeat them here.
Concrete, described in meet demand user thesaurus comprise: the single subset user thesaurus that current download equipment end is corresponding is the user thesaurus meeting demand; Or at least one other subset user thesaurus except the subset dictionary that current device end is corresponding are the user thesaurus meeting demand; Or at least one the subset user thesaurus belonging to same device class is the user thesaurus meeting demand.
When an equipment does not re-use because of various situation, user can delete this equipment at server end by arranging interface.While this equipment of deletion, can select only to delete this device attribute, and the user thesaurus of its correspondence still retains, continue to use; Or also can select whole deletion.Such as, if the user thesaurus of this equipment is self-synchronization properties, then corresponding subset user thesaurus can be deleted; If select to retain user thesaurus, then the etymology device attribute of this user thesaurus can be inherited by other selected equipment, certainly, also can directly change to without main equipment entry (or without etymology equipment entry), to be continued to use by other equipment.
When user by a certain equipment end to server end upload backup needs upgrade entry and/or parameter, comprising when deleting word record, then server end first can record and delete word information, after by the time other equipment also complete synchronous down loading updating, then performs and concrete deletes word operation.Otherwise above-mentioned word information of deleting only may delete entry in the corresponding subset user thesaurus of server end, and the corresponding entry downloaded in each equipment end then continues to retain.Simply, the present invention at received server-side to after deleting word information, directly can initiatively initiate the synchronization removal request for this user's all devices, makes each equipment of this user can this entry of synchronization removal.
With reference to Fig. 5, show the structured flowchart of a kind of user thesaurus server example of the present invention, comprise memory cell, communication unit and update process unit, wherein:
Memory cell 501, for safeguarding different memory space for different user, stores at least two subset user thesaurus for the distinct device under this user name in the memory space of a user;
Communication unit 502, for the request of receiving equipment end down loading updating user thesaurus; And the entry that the needs obtained by update process unit upgrade and/or parameter synchronization are updated to current device end;
Update process unit 503, at least one subset user thesaurus of equipment end demand before being fated for selector in the subset user thesaurus that stores from memory cell, obtains the wherein required entry that upgrades and/or parameter.
Preferably, when equipment end connection server end carries out uploading backup operation, the communication unit 502 of the server shown in Fig. 5 can also be used for the request that receiving equipment end uploads entry and/or parameter; And the entry that upgrades of the needs uploaded of receiving equipment end and/or parameter; Described update process unit 503 can also be used for uploaded entry and/or parameter being updated to subset user thesaurus corresponding to this equipment end in described memory cell.
User thesaurus is set up for each equipment at server end, can ensure that the generated subset user thesaurus for each equipment is relatively independent, according to the change of service condition or can to integrate at any time according to different condition or collaborative, dictionary can immediate updating to the state meeting user and use.Concrete, described in meet demand subset user thesaurus can comprise: the single subset user thesaurus that current device end is corresponding; Or, at least one other subset user thesaurus except the subset dictionary that current device end is corresponding; Or, belong at least two subset user thesaurus of same device class.
Specifically how to store the multiple subset user thesaurus for distinct device about server end, describe in detail in the aforementioned embodiment, do not repeat them here.
With reference to Fig. 6, show the structured flowchart of a kind of input method system embodiment of the present invention, described input method is installed in a certain equipment end, and it can comprise:
Input interface unit 601, for receiving the input information of user;
Information conversion unit 602, for according to the input information received, retrieves, obtains corresponding candidate item and sort in dictionary 605;
Represent unit 603, for sequentially representing candidate item;
Result output unit 604, for receiving the selection information of user, exports the candidate item or network address of specifying;
Wherein said dictionary 605 comprises: for recording the system dictionary 6051 of basic words and parameter thereof, and local user vocabulary; Local user vocabulary can be divided into multiple subset user thesaurus 6052 of the distinct device used for this input method user respectively; Such as, two subset user thesaurus respectively for equipment 1 and equipment 2 are shown in Fig. 6;
And download unit 606, for sending down loading updating request to server end, and receiving the entry and/or parameter downloaded, being updated to local user vocabulary.Such as, the entry downloaded from server end and/or parameter are updated to corresponding subset dictionary respectively; Because server end also stores multiple subset user thesaurus for this user.
Preferably, the input method system shown in Fig. 6, can also comprise: uploading unit 607, for sending the request of upload user dictionary to server end; And upload in current device end the entry and/or parameter that need to upgrade.
In preferred embodiments more of the present invention, when there is no server end, between the input method system of two equipment end, also can realize the synchronous of user thesaurus.Because the update status of two equipment end user thesaurus is different, such as, the situation that an equipment end upgrades relatively newer (such as update time is close to now) relatively more complete (the subset dictionary upgraded from server end is many) is there is in two equipment end, therefore can by the synchronized update between two equipment end, to meet when server end cannot connect, realize sharing of user thesaurus.Now, the input method system described in Fig. 6 may further include:
Equipment end interactive unit 608, for the synchronizing information with the mutual user thesaurus separately of the input method system of another equipment end;
Staging server unit 609, for according to mutual user thesaurus update status, determines whether to select current device end as staging server; If so, then from from local user vocabulary, select at least one the subset user thesaurus meeting demand, the entry and/or the parameter that obtain wherein required renewal are sent to another equipment end described.
When same entry repeats at multiple subset user thesaurus respectively, in input process, its entry parameter there will be conflict, therefore in a preferred embodiment of the invention, input method system described in Fig. 6 can also comprise: weight unit 610, for when same entry repeats at multiple subset user thesaurus respectively, then give different weight according to presetting rule for the parameter of this entry in each subset user thesaurus, and calculate final argument; Described parameter is used for candidate item sequence.
Such as, when same entry repeats at multiple subset user thesaurus respectively, then the parameter of giving this entry in the subset user thesaurus of corresponding current device end is weight limit, calculates final argument.For word frequency, suppose that same entry " language material " occurs in the user thesaurus respectively for three device A, B, C (A is current device end), its word frequency is respectively a, b, c, then can directly using word frequency corresponding for equipment end A as final argument, or calculate (a*0.9+b*0.05+c*0.05), obtain final argument.
Again such as, when same entry repeats at multiple subset user thesaurus respectively, then also by the weight calculating each word frequency parameter service time, and last word frequency can be drawn.Such as, same entry " language material " occurs in the user thesaurus respectively for three device A, B, C (A is current device end), its word frequency is respectively a, b, c, its down time (last service time is apart from now) is respectively Ta=20, Tb=5, Tc=2, then in candidate item sequence, the word frequency COMPREHENSIVE CALCULATING result of this entry can be:
(a×90%+b×97%+c×100%)/3
Wherein, suppose that the time period corresponding to Tb=5, Tc=2, weight is set to 1-10%, 1-3%, 1-0% respectively for three down time Ta=20.Certainly, in actual applications, for the purpose of simple, also directly can arrange update time nearest weight is 100%, namely directly adopts word frequency c to carry out the candidate item sequence of entry " language material ".
The multiple subset user thesaurus for distinct device are stored in the equipment end input method system shown in Fig. 6, aforesaid various implementation (server end) can be adopted, such as one or more dictionary file, in a word, record corresponding entry source device information, thus can be obtained respectively for the user thesaurus of each equipment by etymology classification of equipment, to embody the personalized input habit of each equipment.
Each embodiment in this specification all adopts the mode of going forward one by one to describe, and what each embodiment stressed is the difference with other embodiments, between each embodiment identical similar part mutually see.For device embodiment, due to itself and embodiment of the method basic simlarity, so description is fairly simple, relevant part illustrates see the part of embodiment of the method.
Above to the method for a kind of updating user vocabulary synchronouslly provided by the present invention, user thesaurus update server and a kind of equipment end input method system, be described in detail, apply specific case herein to set forth principle of the present invention and execution mode, the explanation of above embodiment just understands method of the present invention and core concept thereof for helping; Meanwhile, for one of ordinary skill in the art, according to thought of the present invention, all will change in specific embodiments and applications, in sum, this description should not be construed as limitation of the present invention.

Claims (17)

1. a method for updating user vocabulary synchronouslly, is characterized in that, comprising:
Safeguard different memory spaces for different user, in the memory space of a user, store at least two subset user thesaurus for the distinct device under this user name;
When need be a user name under a current device end carry out Word library updating time, from the subset user thesaurus described user storage space selector be fated before at least one subset user thesaurus of equipment end demand;
From selected subset user thesaurus, obtain entry and/or the parameter of required renewal, synchronized update is to described current device end.
2. the method for claim 1, is characterized in that, also comprises:
Receive entry and/or the parameter of the needs renewal that current device end is uploaded;
The described entry that receives and/or parameter are updated to subset user thesaurus corresponding to this current equipment end.
3. the method for claim 1, is characterized in that, also comprises:
Obtained the download demand of current device end by the download attribute of current device end, wherein, described download attribute is preset in server end, or described download attribute is attached in the update request that current device end sends;
In described subset user thesaurus from user storage space selector be fated before at least one subset user thesaurus of equipment end demand comprise: according to the download attribute of described current device end, from the described subset user thesaurus this user storage space selector be fated before at least one equipment user's dictionary of equipment end demand.
4. the method for claim 1, is characterized in that, the user thesaurus that described selection meets demand comprises:
The single subset user thesaurus selecting described current device end corresponding is the user thesaurus meeting demand;
Or, select at least one other subset user thesaurus except subset dictionary corresponding to described current device end to be the user thesaurus meeting demand;
Or, select at least two the subset user thesaurus belonging to same device class to be the user thesaurus meeting demand.
5. the method for claim 1, is characterized in that,
Described multiple subset user thesaurus for distinct device are multiple dictionary files independently; Each dictionary file record has the identification parameter of entry source device;
Or described multiple subset user thesaurus for distinct device are a dictionary file, wherein store the identification parameter of its source device for entry, to form the virtual multiple subset user thesaurus for distinct device.
6. method as claimed in claim 5, is characterized in that,
A described identification parameter represents an entity hardware device;
Or arrange according to user, described identification parameter representative belongs to multiple entity hardware devices of a class.
7. a update server for user thesaurus, is characterized in that, comprises memory cell, communication unit and update process unit, wherein:
Memory cell, for safeguarding different memory space for different user, stores at least two subset user thesaurus for the distinct device under this user name in the memory space of a user;
Communication unit, for the request of receiving equipment end down loading updating user thesaurus; And the entry that the needs obtained by update process unit upgrade and/or parameter synchronization are updated to current device end;
Update process unit, at least one subset user thesaurus of equipment end demand before being fated for selector from the subset user thesaurus stored of memory cell, obtains entry and/or the parameter of wherein required renewal.
8. server as claimed in claim 7, is characterized in that,
The entry that the needs that described communication unit is also uploaded for receiving equipment end upgrade and/or parameter;
Described update process unit is also for being updated to subset user thesaurus corresponding to this current equipment end of described memory cell by received entry and/or parameter.
9. server as claimed in claim 7, is characterized in that, described in meet demand subset user thesaurus comprise:
The single subset user thesaurus that current device end is corresponding;
Or, at least one other subset user thesaurus except the subset dictionary that current device end is corresponding;
Or, belong at least two subset user thesaurus of same device class.
10. an input method system, is positioned at an equipment end, it is characterized in that, comprising:
For recording the system dictionary of basic words and parameter thereof;
And be included in the local user vocabulary of at least two subset user thesaurus under same user name, described at least two subset user thesaurus are respectively for the distinct device that this input method user uses;
Download unit, for sending down loading updating request to server end, and receiving the entry and/or parameter downloaded, being updated to local user vocabulary.
11. input method systems as claimed in claim 10, is characterized in that, also comprise:
Uploading unit, for uploading in current device end the entry and/or parameter that need to upgrade to server end.
12. input method systems as claimed in claim 10, is characterized in that, also comprise:
Equipment end interactive unit, for the synchronized update information with the mutual user thesaurus separately of the input method system of another equipment end;
Staging server unit, for according to mutual user thesaurus update status, determines whether to select current device end as staging server; If so, then from local user vocabulary, select at least one the subset user thesaurus meeting demand, the entry and/or the parameter that obtain wherein required renewal are sent to another equipment end described.
13. input method systems as claimed in claim 10, is characterized in that, also comprise:
Weight unit, for when same entry repeats at multiple subset user thesaurus respectively, then gives different weight according to presetting rule for the parameter of this entry in each subset user thesaurus, and calculates final argument; Described parameter is used for candidate item sequence.
14. input method systems as claimed in claim 12, is characterized in that, described in meet demand subset user thesaurus comprise:
The single subset user thesaurus that current device end is corresponding;
Or, at least one other subset user thesaurus except the subset dictionary that current device end is corresponding;
Or, belong at least two subset user thesaurus of same device class.
15. input method systems as claimed in claim 10, is characterized in that,
Described multiple subset user thesaurus for distinct device are multiple dictionary files independently; Each dictionary file record has the identification parameter of entry source device;
Or described multiple subset user thesaurus for distinct device are a dictionary file, wherein store the identification parameter of its source device for entry, to form the virtual multiple subset user thesaurus for distinct device.
The method of 16. 1 kinds of updating user vocabulary synchronousllies, is characterized in that, comprising:
The synchronized update information of the user thesaurus of mutual first equipment end and the second equipment end;
According to mutual comparative result, choose one of them equipment end as staging server; Wherein, described first equipment end and the second equipment end store the local user vocabulary comprising at least two subset user thesaurus, and described at least two subset user thesaurus are respectively for the distinct device that current input method user uses;
Staging server receives the request of another equipment end down loading updating user thesaurus;
At least one the subset user thesaurus meeting another equipment end demand is selected from the subset user thesaurus that staging server stores;
From selected subset user thesaurus, obtain entry and/or the parameter of required renewal, synchronized update is to another equipment end.
17. methods as claimed in claim 16, it is characterized in that, the user thesaurus that described selection meets demand comprises:
The single subset user thesaurus selecting current download equipment end corresponding is the user thesaurus meeting demand;
Or, select at least one other subset user thesaurus except subset dictionary corresponding to current device end to be the user thesaurus meeting demand;
Or, select at least two the subset user thesaurus belonging to same device class to be the user thesaurus meeting demand.
CN201010137311.4A 2010-03-31 2010-03-31 Method and server for synchronous update of user lexicon and input method system Active CN102209083B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201010137311.4A CN102209083B (en) 2010-03-31 2010-03-31 Method and server for synchronous update of user lexicon and input method system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201010137311.4A CN102209083B (en) 2010-03-31 2010-03-31 Method and server for synchronous update of user lexicon and input method system

Publications (2)

Publication Number Publication Date
CN102209083A CN102209083A (en) 2011-10-05
CN102209083B true CN102209083B (en) 2015-03-18

Family

ID=44697747

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201010137311.4A Active CN102209083B (en) 2010-03-31 2010-03-31 Method and server for synchronous update of user lexicon and input method system

Country Status (1)

Country Link
CN (1) CN102209083B (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103108012B (en) * 2011-11-15 2019-11-19 深圳市世纪光速信息技术有限公司 A kind of user thesaurus synchronous method and user thesaurus sync server
CN104049766B (en) * 2013-03-11 2017-05-31 百度国际科技(深圳)有限公司 Cloud server and its terminal for updating language model in cloud input method
CN103810157A (en) * 2014-02-28 2014-05-21 百度在线网络技术(北京)有限公司 Method and device for achieving input method
CN106557178B (en) * 2016-11-29 2021-03-09 百度国际科技(深圳)有限公司 Method and device for updating entries of input method
CN108304078B (en) * 2017-01-11 2024-01-30 北京搜狗科技发展有限公司 Input method and device and electronic equipment
CN109669550B (en) * 2017-10-17 2023-05-16 北京搜狗科技发展有限公司 Method and device for obtaining user word stock
CN108052529A (en) * 2017-11-09 2018-05-18 福建省天奕网络科技有限公司 A kind of filtering sensitive words method and terminal
CN110362686B (en) * 2018-04-02 2024-02-06 北京搜狗科技发展有限公司 Word stock generation method and device, terminal equipment and server
CN108874175A (en) * 2018-06-20 2018-11-23 北京百度网讯科技有限公司 A kind of data processing method, device, equipment and medium
CN109597498B (en) * 2018-11-29 2021-01-19 北京蓦然认知科技有限公司 Word stock maintenance management method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101030157A (en) * 2007-04-20 2007-09-05 北京搜狗科技发展有限公司 Method and system for updating user vocabulary synchronouslly
CN101079037A (en) * 2006-06-26 2007-11-28 腾讯科技(深圳)有限公司 Chinese character library updating method and system
CN101645087A (en) * 2009-09-01 2010-02-10 腾讯科技(深圳)有限公司 Classified word bank system and updating and maintaining method thereof and client side
CN101645093A (en) * 2009-09-02 2010-02-10 腾讯科技(深圳)有限公司 Method of realizing classified lexicon and input method client end

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101079037A (en) * 2006-06-26 2007-11-28 腾讯科技(深圳)有限公司 Chinese character library updating method and system
CN101030157A (en) * 2007-04-20 2007-09-05 北京搜狗科技发展有限公司 Method and system for updating user vocabulary synchronouslly
CN101645087A (en) * 2009-09-01 2010-02-10 腾讯科技(深圳)有限公司 Classified word bank system and updating and maintaining method thereof and client side
CN101645093A (en) * 2009-09-02 2010-02-10 腾讯科技(深圳)有限公司 Method of realizing classified lexicon and input method client end

Also Published As

Publication number Publication date
CN102209083A (en) 2011-10-05

Similar Documents

Publication Publication Date Title
CN102209083B (en) Method and server for synchronous update of user lexicon and input method system
CN102207957B (en) Partial item change tracking and synchronization
CN101573923B (en) Propagation method of digital synchronous conflict knowledge
CN1989762B (en) Method and device for rendering one or menus in user interface
CN101512498B (en) The access to the data file be distributed in multiple dissimilar subscriber equipment is provided to user
CN1517885B (en) Method and system for updating central cache by atomicity
CN103701913B (en) Data synchronization method and device
CN104573093B (en) A kind of method and apparatus for managing file directory
CN102857570A (en) Cloud synchronized method of files and cloud storage server
US20150032785A1 (en) Non-transitory computer-readable media storing file management program, file management apparatus, and file management method
CN103428264A (en) Data synchronization method, device and system
CN104838379A (en) Database synchronization
CN102646041A (en) Software installation method and system
CN103475721A (en) System for updating digital assets and method thereof
CN103037005A (en) File synchronization method and device of on-line storage service
CN102495739A (en) Data compatible method and system as well as inter-plate message method and system
CN102325367B (en) Data packet synchronizing device and method for client application
CN102163197A (en) Skin changing method, system and device
CN102186163A (en) Data synchronizing method of multi-account address book of smart phone
CN108111598B (en) Cloud disk data issuing method and device and storage medium
CN104581695A (en) Mobile terminal configuration method and system
CN103327480A (en) Intelligent mobile phone multiple-account contact information synchronizing method
CN103389910A (en) Virtual machine building method, virtual machine managing method and virtual machine managing device
CN111538520B (en) Updating method and device for super-converged cluster, terminal and storage medium
EP3975605B1 (en) Base station type replacement method, sdr network management system, base station type replacement apparatus and computer-readable storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant