CN101334774A - Character input method and input method system - Google Patents

Character input method and input method system Download PDF

Info

Publication number
CN101334774A
CN101334774A CN 200710118177 CN200710118177A CN101334774A CN 101334774 A CN101334774 A CN 101334774A CN 200710118177 CN200710118177 CN 200710118177 CN 200710118177 A CN200710118177 A CN 200710118177A CN 101334774 A CN101334774 A CN 101334774A
Authority
CN
China
Prior art keywords
dictionary
text data
interim dictionary
input
interim
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN 200710118177
Other languages
Chinese (zh)
Other versions
CN101334774B (en
Inventor
张智敏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Sogou Technology Development Co Ltd
Original Assignee
Beijing Sogou Technology Development Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Sogou Technology Development Co Ltd filed Critical Beijing Sogou Technology Development Co Ltd
Priority to CN 200710118177 priority Critical patent/CN101334774B/en
Publication of CN101334774A publication Critical patent/CN101334774A/en
Application granted granted Critical
Publication of CN101334774B publication Critical patent/CN101334774B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Machine Translation (AREA)

Abstract

The invention provides a character input method comprising the following steps: the text data associated with an application program in the present system environment is acquired; the text data is analyzed to generate a temporary word bank; the word banks existing in the input method and the temporary word bank are loaded; the input information of users is received; a search is carried out in the word banks existing in the input method and the temporary word bank according to the received input information to get the corresponding candidates; the selection information of users is received and the pointed candidates are displayed on a screen. With the invention, the input method can automatically learn about the text content corresponding to the environment when users adopt the input method so as to form a temporary word relation bank for the access of users. The user can get the best input experience in each new conversation by the means so as to basically solve the problems of strong comprehensiveness and poor individuality of the word bank of the existing input method.

Description

A kind of method and input method system of character input
Technical field
The present invention relates to computerized information input field, particularly relate to a kind of method and system of character input, a kind of generation method and system of interim dictionary, and a kind of method and system of optimizing the input method dictionary.
Background technology
Along with popularizing and development of computer technology and Internet technology, the user of different professional domains, different interest and use habit is more and more higher for the intelligent and personalized requirement of input method system.
In the prior art, input method system generally comprises system's dictionary, and described system dictionary is by (for example, traditional news, newspaper) analysis obtains more common word frequency and ordering, thereby guarantees the first-selected speech hit rate of input method system to numerous collection of document.But owing to generate the collection of document source that dictionary relied on all is sealing, specific, and information expands rapidly in people's life, vocabulary changes frequent situation so can not satisfy.
" a kind of generation method and system of the input-method word frequency base based on internet information " disclosed among the Chinese patent file CN1936893, its input method dictionary can obtain from the internet information of vastness in statistical study, thereby can satisfy the needs that information is propagated fast, improve the hit rate of user's first-selected speech, improved input speed and efficient.
But all there is a common problem in two kinds of top input method dictionaries, promptly are a fixing comprehensive dictionary in a period of time.That is to say that the existing input method dictionary is not considered: input method user needs different lexical sets in different applied environments, such as, the user is in chat, returns mail, writes document ... or the like.Input method user is in different applied environments, what face is some different words, such as input Pinyin " ciba ", possibility in computer realm " PowerWord " is bigger, and be that the possibility of " glutinous rice cake " is bigger when talking about food, employed vocabulary was inequality certainly when employed vocabulary was with chat when for another example, writing paper.
In a word, a technical matters that presses for those skilled in the art's solution is exactly: how to improve the existing input method dictionary, make its demand that can satisfy different application environment user, better realize user's personalization input.
Summary of the invention
Technical matters to be solved by this invention provides the method and the input method system of a kind of character input, can make the input method user can both the fastest best obtaining in different applied environments wish the words of importing.
Accordingly, one object of the present invention is, a kind of generation method and system of interim dictionary is provided, and a kind of method and system of optimizing the input method dictionary, be used to help to obtain best input method dictionary, thereby make the input method user in different applied environments, can both obtain extraordinary experience.
In order to address the above problem, according to embodiments of the invention, a kind of method of character input is disclosed, specifically can may further comprise the steps: obtain in the current system environments text data that application program is related; Described text data is analyzed, generated interim dictionary; Load existing dictionary of input method and described interim dictionary; Receive user's input information; According to the input information that is received, in existing dictionary of input method and described interim dictionary, retrieve, obtain corresponding candidate item; Receive user's selection information, with screen output on the candidate item of appointment.
Wherein, can obtain described text data in the following manner: when the videotex data, by preset function, the output content of capturing operation system Chinese version output function.
Perhaps, also can obtain described text data: the path that obtains file by following steps; The interface that provides by operating system reads the content of text of this document, perhaps directly reads the content of text of this document.Preferably, the path that obtains file path or obtain new reception file by monitoring application program by scan procedure.
Perhaps, also can obtain described text data in the following manner: the analoging reproduction operation; From the operating system buffer memory, obtain the related text data of application program.
Preferably, in this method, can generate an interim dictionary at an application document in the current system environments; Perhaps, also can generate an interim dictionary at an application program in the current system environments; Perhaps, also can generate an interim dictionary at a plurality of application programs in the current system environments.
Preferably, described interim dictionary comprises word order information, binary or n-tuple relation information.
Preferably, this method can also comprise: at each entry in the interim dictionary, the word frequency information according in word frequency information in the interim dictionary and the existing dictionary of input method obtains weight parameter, and described weight parameter is used for the candidate item ordering.
Preferably, this method can also comprise: store described interim dictionary, and; The described interim dictionary of finishing analysis obtains effective information.Wherein, described effective information can comprise neologisms; Perhaps, when interim dictionary recorded the corresponding input environment information of entry, described effective information also can comprise the analysis result at entry and input environment thereof.
Preferably, this method can also comprise: application programs is screened; And/or the text data of application programs screens.
According to another embodiment of the present invention, a kind of input method system is disclosed, specifically can comprise:
The text data acquisition module is used for obtaining current system environments, the text data that application program is related;
Interim dictionary generation module is used for described text data is analyzed, and generates interim dictionary;
Input interface module is used to receive user's input information;
The information translation module is used for according to the input information that is received, and retrieves in existing dictionary of input method and described interim dictionary, obtains corresponding candidate item;
Show output module, be used to show described candidate item, and the selection information that receives the user, with screen output on the candidate item of appointment.
Wherein, can obtain described text data in the following manner: when the videotex data, by preset function, the output content of capturing operation system Chinese version output function; Perhaps, obtain the path of file; The interface that provides by operating system reads the content of text of this document, perhaps directly reads the content of text of this document; Perhaps, analoging reproduction operation; From the operating system buffer memory, obtain the related text data of application program.
Preferably, described system can also comprise: the system monitoring module be used for the current state of supervisory system, and when meeting prerequisite, notice starts the text data acquisition module; And/or, be used to monitor the text data that is obtained, and when meeting prerequisite, notice starts interim dictionary generation module.
Preferably, described system can also comprise: the weight parameter generation module is used for each entry at interim dictionary, according to the word frequency information in word frequency information in the interim dictionary and the existing dictionary of input method, obtain weight parameter, described weight parameter is used for the candidate item ordering.
Preferably, described system can also comprise: temporary storage module is used to store described interim dictionary; Optimal module is used for the described interim dictionary of finishing analysis, obtains effective information.Wherein, described effective information can comprise neologisms; Perhaps, when interim dictionary recorded the corresponding input environment information of entry, described effective information also can comprise the analysis result at entry and input environment thereof.
According to another embodiment of the present invention, a kind of generation method of interim dictionary is disclosed, can comprise: obtain in the current system environments text data that application program is related; Described text data is analyzed, generated interim dictionary; Described interim dictionary is used for retrieval and obtains candidates of input method.
Wherein, can obtain described text data in the following manner: when the videotex data, by preset function, the output content of capturing operation system Chinese version output function; Perhaps, obtain the path of file; The interface that provides by operating system reads the content of text of this document, perhaps directly reads the content of text of this document; Perhaps, analoging reproduction operation; From the operating system buffer memory, obtain the related text data of application program.
According to another embodiment of the present invention, a kind of generation system of interim dictionary is disclosed, can comprise:
The text data acquiring unit is used for obtaining current system environments, the text data that application program is related;
Interim dictionary generation unit is used for described text data is analyzed, and generates interim dictionary; Described interim dictionary is used for the existing dictionary of input method together, and retrieval obtains candidates of input method.
Wherein, can obtain described text data in the following manner: when the videotex data, by preset function, the output content of capturing operation system Chinese version output function; Perhaps, obtain the path of file; The interface that provides by operating system reads the content of text of this document, perhaps directly reads the content of text of this document; Perhaps, analoging reproduction operation; From the operating system buffer memory, obtain the related text data of application program.
According to another embodiment of the present invention, a kind of method of optimizing the input method dictionary is disclosed, can comprise: obtain in the current system environments text data that application program is related; Described text data is analyzed, generated interim dictionary; The described interim dictionary of finishing analysis obtains effective information.Wherein, described effective information can comprise neologisms; Perhaps, when interim dictionary recorded the corresponding input environment information of entry, described effective information also can comprise the analysis result at entry and input environment thereof.
According to another embodiment of the present invention, a kind of system that optimizes the input method dictionary is disclosed, can comprise:
The text data acquiring unit is used for obtaining current system environments, the text data that application program is related;
Interim dictionary generation unit is used for described text data is analyzed, and generates interim dictionary;
Optimize the unit, be used for the described interim dictionary of finishing analysis, obtain effective information.
Wherein, described effective information can comprise neologisms; Perhaps, when interim dictionary recorded the corresponding input environment information of entry, described effective information also can comprise the analysis result at entry and input environment thereof.
Compared with prior art, the present invention has the following advantages:
When the user uses input method, often be accompanied by the operation of respective document, that is to say to have certain language context, such as, the user opens one piece of document, receives new message from IM, receives new mail ... or the like.Use the present invention, when the user uses input method in these applied environments, input method just can be learnt these content of text in the corresponding environment automatically, forms an interim word and concerns the storehouse, uses for the user.In this way, the user can well be imported experience in each new session, thereby can fundamentally solve comprehensive too strong, the personalized more weak problem of existing input method dictionary.
And the present invention can further analyze resulting interim dictionary, by the interim dictionary of continuous analysis user, extract effective information, for example, neologisms or the like, thereby further optimize input method, for example, can form a personalized dictionary at this user oneself.Further, preferred, described effective information can also comprise the analysis result at entry and input environment thereof, and then forms cell dictionary (at the special-purpose dictionary of a certain specific environment) or the like, optimizes input method from another angle.
Description of drawings
Fig. 1 is the flow chart of steps of a kind of method embodiment of character input;
Fig. 2 is the structured flowchart of a kind of embodiment of input method system;
Fig. 3 is the flow chart of steps of a kind of temporary word library generating method embodiment;
Fig. 4 is the structured flowchart of a kind of embodiment of interim dictionary generation system;
Fig. 5 is a kind of flow chart of steps of optimizing the method embodiment of input method dictionary.
Embodiment
For above-mentioned purpose of the present invention, feature and advantage can be become apparent more, the present invention is further detailed explanation below in conjunction with the drawings and specific embodiments.
The present invention goes for various language, for example, Chinese, Japanese, Korean, English etc. are because the application flow of the present invention in various spoken and written languages all is similar, so for convenience of description, only the present invention being applied in Chinese situation below describes.
The input mode that the present invention can adopt can comprise keyboard symbol, hand-written information and phonetic entry or the like, because the message switching mode in these input modes all belongs to known technology, has not just described in detail at this.
The personalized input process that the present invention realized can be used in numerous general or special purpose computingasystem environment or the configuration.For example: personal computer, server computer, handheld mobile device or portable set, plate equipment, multicomputer system, the system based on microprocessor, set top box, programmable consumer-elcetronics devices, network PC, small-size computer, mainframe computer, comprise distributed computing environment of above any system or equipment or the like.
With reference to Fig. 1, show a kind of method embodiment of character input, specifically can may further comprise the steps:
Step 101 is obtained in the current system environments text data that application program is related;
Described application program can comprise the various softwares that relate to content of text, for example, and Word, PDF, txt file, mail, immediate communication tool, web browser or the like.To describe in detail in the back of this instructions for concrete obtain manner.
Step 102 is analyzed described text data, generates interim dictionary;
General analytic process can comprise the participle statistic processes, for example, finds neologisms, statistics word frequency and statistics word relation information.General, interim dictionary can comprise entry, word order information and binary information.Because generally,, only add up binary information and get final product for the statistics of word relation.Described binary information is meant the annexation between the speech of expression text front and back, and (perhaps, Bigram), " binary " wherein refered in particular to the statistics of neighbouring relations in twos generally also can be called bigram statistics.For example, input information is " One is not a true man unless he comes to the Great Wall ", if we are with word during as the fractionation unit of minimum, we can split out " no " and " arrive " 7 individual characters in " length " " city " " non-" " good " " Chinese ", and binary wherein comprise " less than ", " to long ", " Great Wall ", " city is non-", " non-good ", " brave man ".Word frequency in the collected input information and binary combination relation can reflect that this user some vocabulary and language commonly used in daily input process uses style, thereby can satisfy this user's personalization input demand.
Need to prove that binary information also can be based on speech, perhaps between word and the speech also not only based on word.Certainly, in order to add up binary information, be to need a word-dividing mode based on speech.
Certainly, the present invention does not limit and only collects binary information, in fact says from effect, and the relation information that can collect n unit (n 〉=2) is better, just is limited to the computing power of present user terminal, and only obtaining binary information is the scheme of a comparative optimization.
Certainly, under the situation that computing power is allowed, described interim dictionary (for example can also comprise more information, probabilistic relation between word and the word, between word and the speech, words from application program or the like), its generative process also can adopt more accurate computing method, and the present invention does not need this to be limited.
Further, the generation of interim dictionary can also comprise some optimized Measures, for example, just can not need to join in the interim dictionary to some speech that does not meet prerequisite after the text data analysis, for example, the speech after ordering is leaned on very much, though perhaps the ordering in interim dictionary of this speech is forward, but it is forward also to sort in the existing dictionary of input method, and frequency is very high, then can get rid of outside interim dictionary.Described ordering generally is meant the ordering of the words that coded string is identical, for example, for spelling input method, then is exactly the ordering of the identical words of phonetic.
Step 103 loads existing dictionary of input method and described interim dictionary;
The existing dictionary of described input method can comprise the input method system dictionary, also can comprise input method user thesaurus or the like, and just being meant does not in a word need the interim dictionary that generates.
Described loading procedure can for: a dictionary merged in interim dictionary and existing dictionary, place buffer memory.The user can directly use according to the use-pattern of common dictionary in subsequent operation like this.Promptly when the user triggers certain interim dictionary, then should interim dictionary and existing dictionary merge, place buffer memory, the retrieval when being used for the user and importing.Certainly, can also the dictionary after merging in by adding mode such as mark, belong to interim dictionary or existing dictionary to distinguish certain entry.
Described loading procedure also can for: interim dictionary and existing dictionary are placed buffer memory as two independent dictionaries, and set dictionary priority according to presetting rule; Described priority is used for the demonstration ordering of candidate item.Generally speaking, the priority of interim dictionary is higher than existing dictionary.Promptly in loading procedure, interim dictionary is placed existing dictionary space specified in addition, and in the existing dictionary of retrieval, also retrieve interim dictionary.
Step 104, reception user's input information;
Step 105 according to the input information that is received, is retrieved in existing dictionary of input method and described interim dictionary, obtains corresponding candidate item;
General, the weighted value of interim dictionary is greater than the weighted value of existing dictionary, for example, the simplest a kind of situation, search strategy can directly be set at the ordering of speech in the interim dictionary all prior to the speech in the existing dictionary.Certainly, also the mode that can set by the user or the mode by automatic setting are directly exported the entry in the interim dictionary with the fixed position.
Preferably, in the present embodiment, the search strategy of employing can for: when certain candidate item only retrieves, then export this candidate item in existing dictionary, and serve as according to sorting with the word frequency in the existing dictionary or other information; When certain candidate item only retrieves in interim dictionary, then export this candidate item, and serve as according to sorting with the word order in the interim dictionary or other information; When certain candidate item all finds, then can be weighted correction, according to revised word frequency sort (certainly, can store, also can not store) for revised word frequency to the word frequency in the existing dictionary in two dictionaries.
Further, the present invention can also adopt other feasible search strategies, for example, when generating interim dictionary, for each entry, calculates a weighted value (perhaps being weight parameter), and described weighted value is relevant with the word frequency in the existing dictionary of input method; Then, when when output coupling, according to this weighted value to the output of sorting of each candidate item.For example, a kind of simple implementation multiply by this weighted value by the word frequency of the word frequency of interim dictionary or existing dictionary and obtains parameters sortnig at this candidate item.
Step 106, reception user's selection information will screen output on the candidate item of appointment.
Step 104 to the realization of step 106 can be adopted existing various input method implementation, is not described in detail in this.
Need to prove that the generation of the interim dictionary described in the present embodiment can generate an interim dictionary at a document in the current system environments, for example, the user has opened 5 word documents, then generates 5 corresponding interim dictionaries.
Interim dictionary also can generate an interim dictionary at an application program in the current system environments, for example, though the user has opened 5 notepads, the content of text overall treatment in these 5 notepads is generated an interim dictionary; A ppt document of opening at this user generates another interim dictionary then.
Interim dictionary also can generate an interim dictionary at a plurality of application programs in the current system environments, and for example, the user has opened 1 word document, 1 notepad, a ppt document, then these 3 content of text that application program is related of overall treatment generate an interim dictionary.
Above-mentioned various generating mode (perhaps claiming generation strategy) can be applied to various occasion, and those skilled in the art select for use as required and get final product, and the present invention does not need the concrete generation situation of interim dictionary is limited.If generated a plurality of interim dictionaries, then can distinguish by sign between each interim dictionary, to point to correct input environment.
When having generated a plurality of interim dictionary, concrete loading procedure may change to some extent.General, can only load the pairing interim dictionary of the current document of operating of user and get final product.Certainly, also can load whole interim dictionaries, whether operate, give each interim dictionary different weighted values according to the user is current.For example, give the highest weighted value of interim dictionary of current operation, only retrieve this interim dictionary and existing dictionary during retrieval and get final product; Perhaps, also can retrieve a plurality of interim dictionary that weighted value is higher than certain threshold value.
Need to prove, be not to immobilize for the ordinal relation between the above-mentioned a plurality of steps in the present embodiment, illustrates it only is for convenience successively at this.For example, input method starts, and loads existing dictionary, obtains text data then and generates interim dictionary, loads interim dictionary; Certainly, also can finish obtaining and the generation of dictionary temporarily of text data in advance, when input method system starts, finish loading and get final product.
Obtain for data, can in the whole process of user's operation, accumulate; And generate for dictionary, then in fact can finish at any time.
After interim dictionary generated, As time goes on, the related content of text of application program may change (for example, the user has newly imported the content of text of 3000 words to the word document in a period of time), and then interim dictionary just needs to upgrade.Can carry out for the process of obtaining content of text always, when meeting prerequisite, (for example, satisfy the time interval of presetting) then, generate new interim dictionary and new and old interim dictionary, and then the notice input method load new interim dictionary.Because the data volume of interim dictionary is less, so its renewal process generally can not influence user's input behavior.Certainly, if data volume is bigger, then can be undertaken by asynchronous mode.
Preferably, present embodiment can also may further comprise the steps: store described interim dictionary, and the described interim dictionary of finishing analysis, obtain effective information, be incorporated in the existing dictionary of described input method.For example, with the temporary word library storage on the subscriber's local hard disk, utilize the computational resource of local system free time, all interim dictionaries are carried out finishing analysis, extraction meets the information of prerequisite--and effective information (for example, word frequency and 2 yuan of relations are greater than certain threshold value) merges to the mode of these information according to merger in the existing dictionary, thereby to a certain degree enriching the existing dictionary of subscriber's local, satisfying this user's individual demand from another angle.Interim dictionary behind the finishing analysis can be deleted from this locality, to reduce taking local storage resources.
Further, various modes can be arranged, for example, each user's interim dictionary be compiled,, find neologisms thereby analyze by server by the network end that uploads onto the server for the analysis of interim dictionary.Again for example, compile each interim dictionary, analyze the relation of each entry and input environment thereof (for example application name or type etc.), thereby form a plurality of respectively at special-purpose dictionary--the cell dictionary of certain specific environment, these neologisms or cell dictionary can offer other users, are used for further optimizing input method system.Certainly,, need from interim dictionary, remove the vocabulary of non-user's input, because the vocabulary of non-user input may bring inappropriate influence to analysis result in order to obtain cell dictionary accurately.For the input vocabulary that how to obtain the user, multiple implementation can be arranged, for example, can obtain by screen vocabulary on the intercepting and capturing input method, also can be when opening an application program as the user the difference of the text data that obtains and the text data that obtained after after a while, obtain this user's input vocabulary.
Below several modes that specifically may adopt of obtaining text data in the step 101 are introduced, only be used to illustrate realization of the present invention, and should not be construed as limitation of the present invention, those skilled in the art can also adopt other feasible modes to realize obtaining of text data.The present invention can be applied in the existing various operating system, for example, and Windows, Linux, MacOS, FreeBSD, Unix, Solaris or the like, and the PalmOS that is used for portable terminal, Windows Mobile, Symbian or the like.Following explanation is that example describes with Windows operating system commonly used only.
Obtain manner 1:, realize intercepting and capturing the purpose of content of text by monitoring computing equipment screen display content.For example, can obtain described text data by following steps: in operating system, preset the API Hook Function; When the videotex data, intercept and capture the output content of text output function.
With Windows operating system is example, general screen display content all needs to finish by text output function (for example TextOut etc.), then the present invention can realize the intercepting of screen text to the mode that API Hook Function (Hook function) is hung by system, as having write a jmp statement in the beginning of literal output functions such as TextOut, by the Hook function, jump in the good function of predefined, obtain the text of wanting TextOut to draw.
In this way, can obtain various text datas by screen display.For example, even mail communication message of opening, word document or the like.
Obtain manner 2 obtains by the mode that directly reads the text data in the file.For example, can obtain described text data by following steps: the path that obtains file; The com interface that provides by operating system reads the content of text of this document, perhaps directly reads the content of text of this document.
Particularly, for text (for example txt file), can directly read its text data.And for non-document file, existing operating system generally all provides the OLE technology, and OLE is the abbreviation of ObjectLinking and Embedding, can be translated as object linking and embedding.OLE is transmission and one group of comprehensive standard of shared information between client applications, and it allows to create the hybrid document that has the link of pointing to application program so that the agreement that needn't switch between application program during user's modification.The present invention can utilize the OLE technology to read the text data in most of file on the existing operating system.
For example, in the later operating system of windows 2000, provide the com interface of an IFilter, allowed application program to register this interface as files such as Office, PDF.The file that every application program of having registered this interface produces, other application program can read its content of text by this interface, and such as for files such as Office, Adobe, PDF, the present invention can read its content of text in this way.General process can for: obtain the IFilter object of corresponding document according to file path, judge that whether this document is registered, if this document is registered, then obtains content of text by IFilter::GetValue.
Wherein, how obtaining the path of file, is technical issues that need to address.The present invention provides several feasible modes: a here, the mode by the scanning system process obtains file path; B, obtain file path (for example, can be applied to mail, instant messaging supervisor, realize monitoring) by resolving its interface by the mode of monitoring application program interface; C, by the content of text of monitoring application program by screen display, learn file path (for example, obtain immediate communication tool the new store path that receives file).For example: in the chat window of live messenger, occur " You have successfully receivedE: Documents My Received Files txt.txt from (C). ", illustrate that then its file receives, and the file path that can therefrom will be referred to takes out.
By obtain manner 2, can obtain the various data in the file, comprise text data, also can comprise those not data by screen display by screen display, for example, a word document has 30 pages, and this input method user has only browsed preceding 5 pages, and adds annotations and comments to the 5th page, then by obtain manner 1, only can obtain preceding 5 pages text data, and, then can obtain all text datas of the document by obtain manner 2.By obtain manner 2, can also obtain the text data that does not belong in the current operation document, for example, the user has opened two word documents, is operating a document, by obtain manner 2, can obtain the text data in another document; And the user received a file by msn, but also do not open, and just can obtain text data in this document by obtain manner 2.
Obtain manner 3, the mode by the read operation system cache realize, for example clipbook.
The most existing application program all supports text resolution (for example, word), to simulate " full choosing (Ctrl_A) " " duplicating (Ctrl_C) " message at certain application program.If application program is handled these message, data just can enter clipbook; Then and then can obtain required text data by reading in the clipbook mode of data.Certainly, simulation " full choosing (Ctrl_A) " " duplicating (Ctrl_C) " message only is one gives an example, in fact can be by various keyboard combination message of simulation or mouse information, to reach choosing arbitrary text data in certain application program.
Preferably, embodiment shown in Figure 1 can also comprise the screening step: application programs is screened, and/or the text data of application programs screens; Therefrom remove some non-text datas, perhaps remove some non-input environments.
For example, the application program that the user opens is an audio/video player, or paint program, then the related text data of this application program is not just needed to obtain, when the program of user operation meets prerequisite (described prerequisite can be the program name of predefined or attribute etc.), just begin to obtain the step of text data or just begin to carry out analysis text data.
Again for example, though what the user opened is the word document, but when the text data that is obtained by obtain manner 1 can comprise some invalid datas (as, file, edit, attempt, literal such as toolbar title such as insertion, form, instrument, form, window and help), and these text datas can not reflect the environment of importing the user, so these text datas should not enter analysis process, therefore, can these invalid datas be screened out by some strategies that presets.
With reference to Fig. 2, show the embodiment of a kind of input method system of the present invention, specifically can comprise:
Text data acquisition module 201 is used for obtaining current system environments, the text data that application program is related;
Interim dictionary generation module 202 is used for described text data is analyzed, and generates interim dictionary 206;
Input interface module 203 is used to receive user's input information; Wherein said user's input information can comprise keyboard symbol, hand-written information and phonetic entry or the like;
Information translation module 204 is used for according to the input information that is received, and retrieves in existing dictionary 207 of input method and described interim dictionary 206, obtains corresponding candidate item;
Show output module 205, be used to show described candidate item, and the selection information that receives the user, with screen output on the candidate item of appointment.
The embodiment of this input method system can generate corresponding interim dictionary at user's input environment, to satisfy the demand of user individual input.
Obtain in the described text data acquisition module 201 text data mode can for: in operating system, preset the API Hook Function; When the videotex data, intercept and capture the output content of text output function.Described text data acquisition module 201 also can obtain described text data in the following manner: the path that obtains file; The com interface that provides by operating system reads the content of text of this document, perhaps directly reads the content of text of this document.Wherein, the path that can obtain file path or obtain new reception file by monitoring application program by scan procedure.Certainly, also can operate by analoging reproduction; From the operating system buffer memory, obtain the related text data of application program.
Further, this input method embodiment can also comprise: the system monitoring module be used for the current state of supervisory system, and when meeting prerequisite, notice starts the text data acquisition module; And/or, be used to monitor the text data that is obtained, and when meeting prerequisite, notice starts interim dictionary generation module.Further avoid invalid data handling procedure, improve counting yield.
Conflict for fear of interim dictionary and existing dictionary, and the accuracy that further improves ordering, present embodiment can also comprise the weight parameter generation module, be used for each entry at interim dictionary, according to the word frequency information in word frequency information in the interim dictionary and the existing dictionary of input method, obtain weight parameter, described weight parameter is used for the candidate item ordering.
In order further to utilize the text data that is compiled, then present embodiment can also comprise: the temporary storage module that is used to store described interim dictionary; And, be used for the described interim dictionary of finishing analysis, obtain effective information, be incorporated into the optimal module in the existing dictionary of described input method.
The embodiment of above-mentioned input method system can be common input method system, as, finish whole input process by the subscriber's local computing equipment, comprise information input, information translation and show output.The embodiment of above-mentioned input method system also can be the input method in network system, as, finish the access of input information by the subscriber's local computing equipment, and the demonstration of candidate item output, the information translation process is then finished in another computing equipment.Certainly, if be applied to the input method in network system, then the present embodiment interim dictionary that also needs to generate is sent to another computing equipment, perhaps the text data that obtains is sent to another computing equipment.That is to say that the present invention does not need to limit the particular geographic location of each module among the input method system embodiment, as long as have function corresponding and corresponding annexation.
With reference to Fig. 3, show a kind of generation method embodiment of interim dictionary, specifically can comprise:
Step 301 is obtained in the current system environments text data that application program is related;
Step 302 is analyzed described text data, generates interim dictionary; Described interim dictionary is used for retrieval and obtains candidates of input method.For example, generate after the interim dictionary, by the mode of the interim dictionary of notice input method system start-up loading, to be implemented in the effect of the interim dictionary of performance in user's input process.The generation of described interim dictionary can generate at an application document, also can generate at an application program, also can generate at a plurality of application programs.
Wherein, can obtain described text data by following steps: in operating system, preset the API Hook Function; When the videotex data, intercept and capture the output content of text output function.Perhaps, also can obtain described text data: the path that obtains file by following steps; The com interface that provides by operating system reads the content of text of this document, perhaps directly reads the content of text of this document.Certainly, also can operate by analoging reproduction; From the operating system buffer memory, obtain the related text data of application program.
Preferably, present embodiment can also comprise step 303, stores described interim dictionary; And, step 304, the described interim dictionary of finishing analysis obtains effective information.Described effective information can be incorporated in the existing dictionary of described input method, also can be used to obtain neologisms or be used for cellulation dictionary or the like.The present invention does not need the concrete application mode of the particular content of effective information and effective information is limited.
With reference to Fig. 4, show a kind of generation system embodiment of interim dictionary, can comprise:
Text data acquiring unit 401 is used for obtaining current system environments, the text data that application program is related;
Interim dictionary generation unit 402 is used for described text data is analyzed, and generates interim dictionary 403; Described interim dictionary 403 is used for the existing dictionary of input method together, and retrieval obtains candidates of input method.
Interim dictionary can send it to input method system by present embodiment and load after generating, thereby helps to realize personalized input, perhaps by present embodiment with the temporary word library storage in appointed position, input method is directly called, to finish personalized input.
Wherein, can obtain described text data by following steps: in operating system, preset the API Hook Function; When the videotex data, intercept and capture the output content of text output function.Perhaps, also can obtain described text data: the path that obtains file by following steps; The com interface that provides by operating system reads the content of text of this document, perhaps directly reads the content of text of this document.
Because interim dictionary has write down much user personality input information accurately, so present embodiment can also will add to after these individual information analysis-by-synthesis in the existing dictionary of input method: the temporary storage cell 404 that is used to store described interim dictionary by with lower module; And, be used for the described interim dictionary 403 of finishing analysis, obtain effective information, be incorporated into the optimization unit 405 in the existing dictionary 406 of described input method.Certainly, described effective information can be incorporated in the existing dictionary of described input method, also can be used to obtain neologisms or be used for the cellulation dictionary.
With reference to Fig. 5, show a kind of method embodiment that optimizes the input method dictionary of the present invention, can comprise:
Step 501, obtain in the current system environments text data that application program is related;
Step 502, described text data is analyzed, generated interim dictionary;
Step 503, the described interim dictionary of finishing analysis obtain effective information.Described effective information can be incorporated in the existing dictionary of described input method, also can be used to obtain neologisms or be used for the cellulation dictionary.Step 503 can regularly be carried out, and for example, every certain time interval, perhaps the quantity of interim dictionary reaches predetermined threshold, and perhaps the data volume of interim dictionary reaches predetermined threshold or the like.
Need to prove that the interim dictionary that is generated can not offer input method and use, promptly present embodiment can be used as the embodiment of pure optimization input method dictionary.
Accordingly, can also there be a kind of system embodiment of optimizing the input method dictionary, specifically comprises:
The text data acquiring unit is used for obtaining current system environments, the text data that application program is related;
Interim dictionary generation unit is used for described text data is analyzed, and generates interim dictionary;
Optimize the unit, be used for the described interim dictionary of finishing analysis, obtain effective information.Described effective information can be incorporated in the existing dictionary of described input method, also can be used to obtain neologisms or be used for the cellulation dictionary.
Each embodiment in this instructions all adopts the mode of going forward one by one to describe, and identical similar part is mutually referring to getting final product between each embodiment, and each embodiment stresses all is difference with other embodiment.Especially, for system embodiment, because it is substantially similar in appearance to method embodiment, so description is fairly simple, relevant part gets final product referring to the part explanation of method embodiment.
More than to the method and system of a kind of character input provided by the present invention, a kind of generation method and system of interim dictionary, and a kind of method and system of optimizing the input method dictionary, be described in detail, used specific case herein principle of the present invention and embodiment are set forth, the explanation of above embodiment just is used for helping to understand method of the present invention and core concept thereof; Simultaneously, for one of ordinary skill in the art, according to thought of the present invention, the part that all can change in specific embodiments and applications, in sum, this description should not be construed as limitation of the present invention.

Claims (25)

1, a kind of method of character input is characterized in that, comprising:
Obtain in the current system environments text data that application program is related;
Described text data is analyzed, generated interim dictionary;
Load existing dictionary of input method and described interim dictionary;
Receive user's input information;
According to the input information that is received, in existing dictionary of input method and described interim dictionary, retrieve, obtain corresponding candidate item;
Receive user's selection information, with screen output on the candidate item of appointment.
2, the method for claim 1 is characterized in that, obtains described text data in the following manner:
When the videotex data, by preset function, the output content of capturing operation system Chinese version output function.
3, the method for claim 1 is characterized in that, obtains described text data by following steps:
Obtain the path of file;
The interface that provides by operating system reads the content of text of this document, perhaps directly reads the content of text of this document.
4, method as claimed in claim 3 is characterized in that, the path that obtains file path or obtain new reception file by monitoring application program by scan procedure.
5, the method for claim 1 is characterized in that, obtains described text data in the following manner:
The analoging reproduction operation;
From the operating system buffer memory, obtain the related text data of application program.
6, the method for claim 1 is characterized in that,
Generate an interim dictionary at an application document in the current system environments;
Perhaps, generate an interim dictionary at an application program in the current system environments;
Perhaps, generate an interim dictionary at a plurality of application programs in the current system environments.
7, the method for claim 1 is characterized in that, described interim dictionary comprises word order information, binary or n-tuple relation information.
8, the method for claim 1 is characterized in that, also comprises:
At each entry in the interim dictionary, the word frequency information according in word frequency information in the interim dictionary and the existing dictionary of input method obtains weight parameter, and described weight parameter is used for the candidate item ordering.
9, the method for claim 1 is characterized in that, also comprises:
Store described interim dictionary, and;
The described interim dictionary of finishing analysis obtains effective information.
10, method as claimed in claim 9 is characterized in that:
Described effective information comprises neologisms;
Perhaps, when interim dictionary recorded the corresponding input environment information of entry, described effective information comprised the analysis result at entry and input environment thereof.
11, the method for claim 1 is characterized in that, also comprises:
Application programs is screened;
And/or the text data of application programs screens.
12, a kind of input method system is characterized in that, comprising:
The text data acquisition module is used for obtaining current system environments, the text data that application program is related;
Interim dictionary generation module is used for described text data is analyzed, and generates interim dictionary;
Input interface module is used to receive user's input information;
The information translation module is used for according to the input information that is received, and retrieves in existing dictionary of input method and described interim dictionary, obtains corresponding candidate item;
Show output module, be used to show described candidate item, and the selection information that receives the user, with screen output on the candidate item of appointment.
13, system as claimed in claim 12 is characterized in that, obtains described text data in the following manner:
When the videotex data, by preset function, the output content of capturing operation system Chinese version output function;
Perhaps, obtain the path of file; The interface that provides by operating system reads the content of text of this document, perhaps directly reads the content of text of this document;
Perhaps, analoging reproduction operation; From the operating system buffer memory, obtain the related text data of application program.
14, system as claimed in claim 12 is characterized in that, also comprises:
The system monitoring module is used for the current state of supervisory system, and when meeting prerequisite, notice starts the text data acquisition module; And/or, be used to monitor the text data that is obtained, and when meeting prerequisite, notice starts interim dictionary generation module.
15, system as claimed in claim 12 is characterized in that, also comprises:
The weight parameter generation module is used for each entry at interim dictionary, and the word frequency information according in word frequency information in the interim dictionary and the existing dictionary of input method obtains weight parameter, and described weight parameter is used for the candidate item ordering.
16, system as claimed in claim 12 is characterized in that, also comprises:
Temporary storage module is used to store described interim dictionary;
Optimal module is used for the described interim dictionary of finishing analysis, obtains effective information.
17, system as claimed in claim 16 is characterized in that:
Described effective information comprises neologisms;
Perhaps, when interim dictionary recorded the corresponding input environment information of entry, described effective information comprised the analysis result at entry and input environment thereof.
18, a kind of generation method of interim dictionary is characterized in that, comprising:
Obtain in the current system environments text data that application program is related;
Described text data is analyzed, generated interim dictionary; Described interim dictionary is used for retrieval and obtains candidates of input method.
19, method as claimed in claim 18 is characterized in that, obtains described text data in the following manner:
When the videotex data, by preset function, the output content of capturing operation system Chinese version output function;
Perhaps, obtain the path of file; The interface that provides by operating system reads the content of text of this document, perhaps directly reads the content of text of this document;
Perhaps, analoging reproduction operation; From the operating system buffer memory, obtain the related text data of application program.
20, a kind of generation system of interim dictionary is characterized in that, comprising:
The text data acquiring unit is used for obtaining current system environments, the text data that application program is related;
Interim dictionary generation unit is used for described text data is analyzed, and generates interim dictionary; Described interim dictionary is used for the existing dictionary of input method together, and retrieval obtains candidates of input method.
21, system as claimed in claim 20 is characterized in that, obtains described text data in the following manner:
When the videotex data, by preset function, the output content of capturing operation system Chinese version output function;
Perhaps, obtain the path of file; The interface that provides by operating system reads the content of text of this document, perhaps directly reads the content of text of this document;
Perhaps, analoging reproduction operation; From the operating system buffer memory, obtain the related text data of application program.
22, a kind of method of optimizing the input method dictionary is characterized in that, comprising:
Obtain in the current system environments text data that application program is related;
Described text data is analyzed, generated interim dictionary;
The described interim dictionary of finishing analysis obtains effective information.
23, method as claimed in claim 22 is characterized in that:
Described effective information comprises neologisms;
Perhaps, when interim dictionary recorded the corresponding input environment information of entry, described effective information comprised the analysis result at entry and input environment thereof.
24, a kind of system that optimizes the input method dictionary is characterized in that, comprising:
The text data acquiring unit is used for obtaining current system environments, the text data that application program is related;
Interim dictionary generation unit is used for described text data is analyzed, and generates interim dictionary;
Optimize the unit, be used for the described interim dictionary of finishing analysis, obtain effective information.
25, system as claimed in claim 24 is characterized in that:
Described effective information comprises neologisms;
Perhaps, when interim dictionary recorded the corresponding input environment information of entry, described effective information comprised the analysis result at entry and input environment thereof.
CN 200710118177 2007-06-29 2007-06-29 Character input method and input method system Active CN101334774B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 200710118177 CN101334774B (en) 2007-06-29 2007-06-29 Character input method and input method system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 200710118177 CN101334774B (en) 2007-06-29 2007-06-29 Character input method and input method system

Publications (2)

Publication Number Publication Date
CN101334774A true CN101334774A (en) 2008-12-31
CN101334774B CN101334774B (en) 2013-08-14

Family

ID=40197377

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 200710118177 Active CN101334774B (en) 2007-06-29 2007-06-29 Character input method and input method system

Country Status (1)

Country Link
CN (1) CN101334774B (en)

Cited By (53)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102135953A (en) * 2011-03-29 2011-07-27 中国科学院自动化研究所 Text coherence editing method
CN102314334A (en) * 2010-06-30 2012-01-11 百度在线网络技术(北京)有限公司 Method for caching content input into application program by user and equipment
CN102314461A (en) * 2010-06-30 2012-01-11 北京搜狗科技发展有限公司 Navigation prompt method and system
CN102346561A (en) * 2010-07-30 2012-02-08 腾讯科技(深圳)有限公司 Method and device for adding user words in input method
CN102404107A (en) * 2010-09-13 2012-04-04 腾讯科技(深圳)有限公司 Method, device, transmitting end and receiving end all capable of guaranteeing safety of inputted content
CN102455845A (en) * 2010-10-14 2012-05-16 北京搜狗科技发展有限公司 Character entry method and device
CN102637166A (en) * 2012-03-15 2012-08-15 广东威创视讯科技股份有限公司 Method and device for optimizing word order of input method and system of input method
CN102722483A (en) * 2011-03-29 2012-10-10 百度在线网络技术(北京)有限公司 Method, apparatus and equipment for determining candidate-item sequence of input method
CN102810095A (en) * 2011-06-02 2012-12-05 北京搜狗科技发展有限公司 Word learning method and device
CN102982070A (en) * 2012-10-26 2013-03-20 北京百度网讯科技有限公司 Word bank updating method and system and cloud server used for input method application program
CN103019406A (en) * 2012-11-20 2013-04-03 张仁平 System realizing Chinese character output through handwritten codes and use method of system
CN103019401A (en) * 2011-09-26 2013-04-03 北京搜狗科技发展有限公司 Hybrid type sentence input method and device
CN103248551A (en) * 2012-02-03 2013-08-14 腾讯科技(深圳)有限公司 Information presentation method and system
CN103246703A (en) * 2013-04-03 2013-08-14 百度在线网络技术(北京)有限公司 Method and equipment for determining application word banks
CN103365833A (en) * 2012-03-28 2013-10-23 百度在线网络技术(北京)有限公司 Context scene based candidate word input prompt method and system for implementing same
CN103376909A (en) * 2012-04-19 2013-10-30 腾讯科技(深圳)有限公司 Method and system of adjusting sequence of candidate characters in use of input methods
CN103389800A (en) * 2012-05-11 2013-11-13 北京百度网讯科技有限公司 Entry generating method and device
CN103473313A (en) * 2013-09-11 2013-12-25 百度在线网络技术(北京)有限公司 Establishment method and device for name dictionary of input method
CN103493047A (en) * 2011-08-30 2014-01-01 宇龙计算机通信科技(深圳)有限公司 Dictionary database update device, input system, input method, and terminal
CN103500016A (en) * 2013-09-27 2014-01-08 北京邮电大学 Character input optimization method based on interaction
CN103631388A (en) * 2012-08-28 2014-03-12 华为终端有限公司 Method and device for optimizing handwriting input method
CN103678371A (en) * 2012-09-14 2014-03-26 富士通株式会社 Lexicon updating device, data integration device and method and electronic device
CN103886043A (en) * 2014-03-11 2014-06-25 北京搜狗科技发展有限公司 Method and device for showing candidate items
CN101639863B (en) * 2009-09-04 2014-07-16 腾讯科技(深圳)有限公司 Method system and equipment for loading city lexicon
CN104102720A (en) * 2014-07-18 2014-10-15 上海触乐信息科技有限公司 Efficient input prediction method and device
WO2014194450A1 (en) * 2013-06-03 2014-12-11 东莞宇龙通信科技有限公司 Association prompt input system, terminal and association prompt input method
CN104331393A (en) * 2014-05-06 2015-02-04 广州三星通信技术研究有限公司 Equipment and method for providing option by aiming at input operation of user
CN104423623A (en) * 2013-09-02 2015-03-18 联想(北京)有限公司 To-be-selected word processing method and electronic equipment
CN104699809A (en) * 2015-03-20 2015-06-10 广东睿江科技有限公司 Method and device for controlling optimized word bank
CN104850241A (en) * 2015-05-28 2015-08-19 北京奇点机智信息技术有限公司 Mobile terminal and text input method thereof
CN105045412A (en) * 2015-08-28 2015-11-11 百度在线网络技术(北京)有限公司 Method and system for generating candidate item of input method
CN105095377A (en) * 2015-06-30 2015-11-25 小米科技有限责任公司 Method and device for instant message processing
CN105718147A (en) * 2016-01-22 2016-06-29 百度在线网络技术(北京)有限公司 Input method panel enabling method and device and input method and input method system
CN105786202A (en) * 2014-12-23 2016-07-20 苏州精易会信息技术有限公司 Input method capable of configuring data source
CN106775794A (en) * 2015-11-24 2017-05-31 北京搜狗科技发展有限公司 A kind of input method client installation method and device
CN106844501A (en) * 2016-12-27 2017-06-13 北京五八信息技术有限公司 A kind of input method entry lookup method and device
CN106896937A (en) * 2017-02-28 2017-06-27 百度在线网络技术(北京)有限公司 Method and apparatus for being input into information
CN106933379A (en) * 2017-02-13 2017-07-07 北京奇虎科技有限公司 The generation method and device of a kind of dictionary
CN106951104A (en) * 2017-02-13 2017-07-14 北京奇虎科技有限公司 A kind of entry processing method and device based on dictionary
CN107104881A (en) * 2015-05-29 2017-08-29 北京搜狗科技发展有限公司 A kind of information processing method and device
CN107608532A (en) * 2016-07-11 2018-01-19 北京搜狗科技发展有限公司 A kind of association-feeding method, device and electronic equipment
CN107832035A (en) * 2017-11-13 2018-03-23 赵桂银 A kind of pronunciation inputting method of intelligent terminal
CN108037839A (en) * 2017-12-28 2018-05-15 广东欧珀移动通信有限公司 Character input method and related product
CN108073303A (en) * 2016-11-17 2018-05-25 北京搜狗科技发展有限公司 A kind of input method, device and electronic equipment
CN108197243A (en) * 2017-12-29 2018-06-22 北京奇虎科技有限公司 Method and device is recommended in a kind of input association based on user identity
CN108664141A (en) * 2017-03-31 2018-10-16 微软技术许可有限责任公司 Input method with document context self-learning function
CN108664142A (en) * 2017-03-31 2018-10-16 微软技术许可有限责任公司 Input method with self-learning function between document
CN109002183A (en) * 2017-06-07 2018-12-14 北京搜狗科技发展有限公司 A kind of method and device of information input
CN110009444A (en) * 2019-02-21 2019-07-12 厦门同城乐家电子商务有限公司 A kind of wisdom marketing method based on payment terminal data
CN110083253A (en) * 2018-01-25 2019-08-02 北京搜狗科技发展有限公司 A kind of input method and device
CN111354342A (en) * 2020-02-28 2020-06-30 科大讯飞股份有限公司 Method, device, equipment and storage medium for updating personalized word stock
CN113010665A (en) * 2019-12-20 2021-06-22 北京搜狗科技发展有限公司 Word processing method and related device
CN115577694A (en) * 2022-11-15 2023-01-06 南方电网科学研究院有限责任公司 Intelligent recommendation method for standard writing

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103488796B (en) * 2013-10-12 2018-12-07 惠州Tcl移动通信有限公司 Based on context the method and mobile terminal inputted
CN106774969B (en) * 2015-11-20 2021-12-14 北京搜狗科技发展有限公司 Input method and device
CN108227953B (en) * 2017-12-28 2022-01-11 Oppo广东移动通信有限公司 Character input method and related product

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1490701A (en) * 2002-10-15 2004-04-21 英业达股份有限公司 Inputting method system with dynamic adjustable lexicon and method thereof
CN100405371C (en) * 2006-07-25 2008-07-23 北京搜狗科技发展有限公司 Method and system for abstracting new word

Cited By (72)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101639863B (en) * 2009-09-04 2014-07-16 腾讯科技(深圳)有限公司 Method system and equipment for loading city lexicon
CN102314334A (en) * 2010-06-30 2012-01-11 百度在线网络技术(北京)有限公司 Method for caching content input into application program by user and equipment
CN102314461A (en) * 2010-06-30 2012-01-11 北京搜狗科技发展有限公司 Navigation prompt method and system
CN102314461B (en) * 2010-06-30 2015-03-11 北京搜狗科技发展有限公司 Navigation prompt method and system
CN102346561B (en) * 2010-07-30 2016-12-21 深圳市世纪光速信息技术有限公司 User's word adding method of input method and device
CN102346561A (en) * 2010-07-30 2012-02-08 腾讯科技(深圳)有限公司 Method and device for adding user words in input method
CN102404107A (en) * 2010-09-13 2012-04-04 腾讯科技(深圳)有限公司 Method, device, transmitting end and receiving end all capable of guaranteeing safety of inputted content
CN102404107B (en) * 2010-09-13 2016-06-01 腾讯科技(深圳)有限公司 A kind of ensure the method for input content safety, device, transmitting terminal and receiving terminal
CN102455845A (en) * 2010-10-14 2012-05-16 北京搜狗科技发展有限公司 Character entry method and device
CN102455845B (en) * 2010-10-14 2015-02-18 北京搜狗科技发展有限公司 Character entry method and device
CN102135953A (en) * 2011-03-29 2011-07-27 中国科学院自动化研究所 Text coherence editing method
CN102135953B (en) * 2011-03-29 2012-12-12 中国科学院自动化研究所 Text coherence editing method
CN102722483A (en) * 2011-03-29 2012-10-10 百度在线网络技术(北京)有限公司 Method, apparatus and equipment for determining candidate-item sequence of input method
CN102810095A (en) * 2011-06-02 2012-12-05 北京搜狗科技发展有限公司 Word learning method and device
CN103493047A (en) * 2011-08-30 2014-01-01 宇龙计算机通信科技(深圳)有限公司 Dictionary database update device, input system, input method, and terminal
CN103019401B (en) * 2011-09-26 2016-04-13 北京搜狗科技发展有限公司 A kind of mixed type input by sentence method and apparatus
CN103019401A (en) * 2011-09-26 2013-04-03 北京搜狗科技发展有限公司 Hybrid type sentence input method and device
CN103248551A (en) * 2012-02-03 2013-08-14 腾讯科技(深圳)有限公司 Information presentation method and system
CN102637166A (en) * 2012-03-15 2012-08-15 广东威创视讯科技股份有限公司 Method and device for optimizing word order of input method and system of input method
CN103365833A (en) * 2012-03-28 2013-10-23 百度在线网络技术(北京)有限公司 Context scene based candidate word input prompt method and system for implementing same
CN103365833B (en) * 2012-03-28 2016-06-08 百度在线网络技术(北京)有限公司 A kind of input candidate word reminding method based on context and system
CN103376909A (en) * 2012-04-19 2013-10-30 腾讯科技(深圳)有限公司 Method and system of adjusting sequence of candidate characters in use of input methods
CN103389800A (en) * 2012-05-11 2013-11-13 北京百度网讯科技有限公司 Entry generating method and device
CN103389800B (en) * 2012-05-11 2016-08-17 北京百度网讯科技有限公司 A kind of method and apparatus generating entry
CN103631388A (en) * 2012-08-28 2014-03-12 华为终端有限公司 Method and device for optimizing handwriting input method
CN103678371A (en) * 2012-09-14 2014-03-26 富士通株式会社 Lexicon updating device, data integration device and method and electronic device
CN102982070A (en) * 2012-10-26 2013-03-20 北京百度网讯科技有限公司 Word bank updating method and system and cloud server used for input method application program
CN103019406A (en) * 2012-11-20 2013-04-03 张仁平 System realizing Chinese character output through handwritten codes and use method of system
CN103246703B (en) * 2013-04-03 2017-09-15 百度在线网络技术(北京)有限公司 A kind of method and apparatus for being used to determine application dictionary
CN103246703A (en) * 2013-04-03 2013-08-14 百度在线网络技术(北京)有限公司 Method and equipment for determining application word banks
CN104854585A (en) * 2013-06-03 2015-08-19 东莞宇龙通信科技有限公司 Association prompt input system, terminal and association prompt input method
WO2014194450A1 (en) * 2013-06-03 2014-12-11 东莞宇龙通信科技有限公司 Association prompt input system, terminal and association prompt input method
CN104423623A (en) * 2013-09-02 2015-03-18 联想(北京)有限公司 To-be-selected word processing method and electronic equipment
CN103473313A (en) * 2013-09-11 2013-12-25 百度在线网络技术(北京)有限公司 Establishment method and device for name dictionary of input method
CN103473313B (en) * 2013-09-11 2017-01-18 百度在线网络技术(北京)有限公司 Establishment method and device for name dictionary of input method
CN103500016A (en) * 2013-09-27 2014-01-08 北京邮电大学 Character input optimization method based on interaction
CN103886043B (en) * 2014-03-11 2017-10-20 北京搜狗科技发展有限公司 A kind of method and device for showing candidate item
CN103886043A (en) * 2014-03-11 2014-06-25 北京搜狗科技发展有限公司 Method and device for showing candidate items
CN104331393A (en) * 2014-05-06 2015-02-04 广州三星通信技术研究有限公司 Equipment and method for providing option by aiming at input operation of user
CN104102720B (en) * 2014-07-18 2018-04-13 上海触乐信息科技有限公司 The Forecasting Methodology and device efficiently input
CN104102720A (en) * 2014-07-18 2014-10-15 上海触乐信息科技有限公司 Efficient input prediction method and device
CN105786202A (en) * 2014-12-23 2016-07-20 苏州精易会信息技术有限公司 Input method capable of configuring data source
CN104699809A (en) * 2015-03-20 2015-06-10 广东睿江科技有限公司 Method and device for controlling optimized word bank
CN104850241A (en) * 2015-05-28 2015-08-19 北京奇点机智信息技术有限公司 Mobile terminal and text input method thereof
CN107104881A (en) * 2015-05-29 2017-08-29 北京搜狗科技发展有限公司 A kind of information processing method and device
CN105095377A (en) * 2015-06-30 2015-11-25 小米科技有限责任公司 Method and device for instant message processing
CN105045412A (en) * 2015-08-28 2015-11-11 百度在线网络技术(北京)有限公司 Method and system for generating candidate item of input method
CN106775794A (en) * 2015-11-24 2017-05-31 北京搜狗科技发展有限公司 A kind of input method client installation method and device
CN105718147A (en) * 2016-01-22 2016-06-29 百度在线网络技术(北京)有限公司 Input method panel enabling method and device and input method and input method system
CN107608532A (en) * 2016-07-11 2018-01-19 北京搜狗科技发展有限公司 A kind of association-feeding method, device and electronic equipment
CN108073303B (en) * 2016-11-17 2021-11-30 北京搜狗科技发展有限公司 Input method and device and electronic equipment
CN108073303A (en) * 2016-11-17 2018-05-25 北京搜狗科技发展有限公司 A kind of input method, device and electronic equipment
CN106844501A (en) * 2016-12-27 2017-06-13 北京五八信息技术有限公司 A kind of input method entry lookup method and device
CN106933379A (en) * 2017-02-13 2017-07-07 北京奇虎科技有限公司 The generation method and device of a kind of dictionary
CN106951104A (en) * 2017-02-13 2017-07-14 北京奇虎科技有限公司 A kind of entry processing method and device based on dictionary
CN106896937A (en) * 2017-02-28 2017-06-27 百度在线网络技术(北京)有限公司 Method and apparatus for being input into information
CN108664141A (en) * 2017-03-31 2018-10-16 微软技术许可有限责任公司 Input method with document context self-learning function
CN108664142A (en) * 2017-03-31 2018-10-16 微软技术许可有限责任公司 Input method with self-learning function between document
CN108664142B (en) * 2017-03-31 2023-03-10 微软技术许可有限责任公司 Input method with inter-document self-learning function
CN108664141B (en) * 2017-03-31 2022-08-09 微软技术许可有限责任公司 Input method with document context self-learning function
CN109002183A (en) * 2017-06-07 2018-12-14 北京搜狗科技发展有限公司 A kind of method and device of information input
CN109002183B (en) * 2017-06-07 2022-11-29 北京搜狗科技发展有限公司 Information input method and device
CN107832035B (en) * 2017-11-13 2021-03-12 深圳市矽昊智能科技有限公司 Voice input method of intelligent terminal
CN107832035A (en) * 2017-11-13 2018-03-23 赵桂银 A kind of pronunciation inputting method of intelligent terminal
CN108037839A (en) * 2017-12-28 2018-05-15 广东欧珀移动通信有限公司 Character input method and related product
CN108037839B (en) * 2017-12-28 2022-01-11 Oppo广东移动通信有限公司 Character input method and related product
CN108197243A (en) * 2017-12-29 2018-06-22 北京奇虎科技有限公司 Method and device is recommended in a kind of input association based on user identity
CN110083253A (en) * 2018-01-25 2019-08-02 北京搜狗科技发展有限公司 A kind of input method and device
CN110009444A (en) * 2019-02-21 2019-07-12 厦门同城乐家电子商务有限公司 A kind of wisdom marketing method based on payment terminal data
CN113010665A (en) * 2019-12-20 2021-06-22 北京搜狗科技发展有限公司 Word processing method and related device
CN111354342A (en) * 2020-02-28 2020-06-30 科大讯飞股份有限公司 Method, device, equipment and storage medium for updating personalized word stock
CN115577694A (en) * 2022-11-15 2023-01-06 南方电网科学研究院有限责任公司 Intelligent recommendation method for standard writing

Also Published As

Publication number Publication date
CN101334774B (en) 2013-08-14

Similar Documents

Publication Publication Date Title
CN101334774B (en) Character input method and input method system
CN110287278B (en) Comment generation method, comment generation device, server and storage medium
CN101373468B (en) Method for loading word stock, method for inputting character and input method system
CN101388011B (en) Method and apparatus for recording information into user thesaurus
US20230015606A1 (en) Named entity recognition method and apparatus, device, and storage medium
CN101203849B (en) Predictive conversion of user input
US10423649B2 (en) Natural question generation from query data using natural language processing system
US20100100371A1 (en) Method, System, and Apparatus for Message Generation
Linhares Pontes et al. Impact of OCR quality on named entity linking
EP2570974A1 (en) Automatic crowd sourcing for machine learning in information extraction
CN108717853B (en) Man-machine voice interaction method, device and storage medium
CN108304375A (en) A kind of information identifying method and its equipment, storage medium, terminal
CN101520786A (en) Method for realizing input method dictionary and input method system
CN101556596B (en) Input method system and intelligent word making method
CN111178076B (en) Named entity recognition and linking method, device, equipment and readable storage medium
CN101566995A (en) Method and system for integral release of internet information
CN109766085B (en) Method and device for processing enumeration type codes
CN111143556A (en) Software function point automatic counting method, device, medium and electronic equipment
CN102737030A (en) Patent document data outputting method, terminal and system
CN110245334B (en) Method and device for outputting information
CN101470701A (en) Text analyzer supporting semantic rule based on finite state machine and method thereof
KR101651963B1 (en) Method of generating time and space associated data, time and space associated data generation server performing the same and storage medium storing the same
CN109885583A (en) Data query method, apparatus, equipment and storage medium based on block chain
CN116486812A (en) Automatic generation method and system for multi-field lip language recognition sample based on corpus relation
CN103020311A (en) Method and system for processing user search terms

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant