CN104317961B - Intelligent input prompt system for a professional system - Google Patents
- Publication number
- CN104317961B (application CN201410647575.2A)
- Authority
- CN
- China
- Prior art keywords
- word
- module
- Chinese character
- user
- input
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Landscapes
- Document Processing Apparatus (AREA)
Abstract
The invention provides an intelligent input prompt system for a professional system. It comprises a word and character segmentation module, a word and Chinese character index module, a word prediction module, and a user interaction module. The segmentation module, the index module, and the word prediction module are deployed on the server side, and the user interaction module is deployed on the client. The server side runs as an independent service, separate from the professional system, while the client is used by the professional system as an extension component. With this system, prompts that the user is likely to want can be offered during interaction with the professional system before the user has finished typing. As the user keeps interacting, the system continually updates its knowledge structure and offers more accurate prompts. It can also be integrated into other systems, which extends the versatility of other applications.
Description
Technical field
The invention belongs to the field of information processing and specifically relates to an intelligent input prompt system for a professional system.
Background
Most computer software systems involve a process in which the user enters text to interact with the system. In most of them the user types text directly into a text box and the system runs a query on that input. The query results often contain a large amount of redundant information that the user does not actually want, so it is hard to pinpoint the information the user is after. In addition, the system may require input in a specific format, and users often enter it incorrectly. There are two main causes. On the one hand, most users find it hard, while interacting with the system, to quickly turn their own idea into information the system understands, so their input is inaccurate. On the other hand, the system gives too little prompting about the input, which also leads to inaccurate input. An intelligent input prompting method for software systems is therefore needed, one that prompts the user during input and lets the user progressively clarify the idea.
The first prior-art method related to the invention provides a static sample text that gives the user some hints and guidance to help the user enter correct information.
This first approach has the following problems. First, a static sample text conveys very little information; if the user does not understand it, or wants to express something outside the scope of the example, the input still can hardly reach the form the system requires. Second, a static sample text is limited in length and therefore overly general, so users have difficulty understanding it, which makes the system harder to use.
The second prior-art method related to the invention customizes an advanced query or a dedicated input box for a specific input scenario and requires the user to enter data in a particular way.
This second approach has the following problems. First, a method designed for a specific scenario is inevitably hard to extend and adapts poorly. Second, a customized advanced query or input box necessarily makes interaction with the system more complex. Finally, this approach merely restricts what the user can enter without giving the user enough prompting, so it does not solve the underlying problem.
Summary of the invention
The technical problem to be solved by the invention is to provide an intelligent input prompt system for a professional system, addressing the problem that the prior art cannot give users sufficiently accurate prompts while they enter data into a professional system, which hinders interaction between the user and the system.
The technical solution adopted by the invention to solve this problem is an intelligent input prompt system for a professional system, characterized in that it comprises a word and character segmentation module, a word and Chinese character index module, a word prediction module, and a user interaction module, wherein the segmentation module, the index module, and the word prediction module are deployed on the server side and the user interaction module is deployed on the client; the server side is an independent service separate from the professional system, and the client is used by the professional system as an extension component of the professional system;
the word and character segmentation module segments the sample library collected from external data sources, splitting the text sample data into a data set whose units are English words and Chinese characters;
the word and Chinese character index module takes the data set of English words and Chinese characters, converts each English word into a letter sequence and each Chinese character into a pinyin sequence, and stores these sequences in a prefix tree (trie), in which every node indexes the full set of Chinese characters and English words that can be reached from it;
the word prediction module establishes a conditional probability model between the data set and the input samples, sorts all predicted Chinese characters or English words that meet the requirements by their probability of occurrence, and outputs the prediction results from the highest probability to the lowest;
the user interaction module provides a text box for user input; when the user enters a letter, word, or Chinese character in the text box, it receives the input and sends it to the word and Chinese character index module, then receives the prediction result of the word prediction module and displays it in the prompt view, and sends the content entered by the user to the segmentation module for updating.
According to the above scheme, the word and character segmentation module comprises a sample input module, a language classification module, and a splitting module;
the sample input module extracts text information from the sample library according to specified rules and organizes it into a data set; while extracting the text information, it attaches the dimension information of the text to the text as data source additional information;
the language classification module classifies the organized data set by language;
the splitting module segments the text according to the segmentation rules defined for each language, producing a data set whose units are English words and Chinese characters; each English word and Chinese character in the data set carries its data source additional information.
According to the above scheme, the word and Chinese character index module comprises an index tree construction module and an indexing module;
the index tree construction module converts the data set of Chinese characters and English words provided by the segmentation module into pinyin sequences and letter sequences and stores these sequences in a prefix tree; the prefix tree is first divided into classes according to the data source additional information, and within each class the branches of the tree are generated from the pinyin and/or letter sequences; when a Chinese character is polyphonic, it appears under each of its corresponding sequences;
the indexing module establishes the correspondence between the position of an input box and the data source additional information; using this correspondence, it first selects the class in the prefix tree, then uses the entered pinyin or letter sequence to quickly look up in the prefix tree all Chinese characters and English words that can be reached from it, and outputs them sorted by frequency of use; items with the same frequency are sorted alphabetically.
According to the above scheme, the word prediction module comprises a word matrix calculation module, a word order probability statistics module, and a prediction result module;
the word matrix calculation module generates, from the data set, a matrix expressing the pairwise relations between English words, between Chinese characters, and between English words and Chinese characters; the matrix is stored as an orthogonal list (cross-linked list);
the word order probability statistics module establishes the conditional probability model P(w2 | w1) for these pairwise relations, where w1 denotes one Chinese character or English word, w2 denotes another Chinese character or English word, and P(w2 | w1) is the probability that w2 occurs given that w1 has occurred; the resulting probabilities are filled into the matrix, and the probability is larger when w1 and w2 fall under the same data source additional information;
the prediction result module, on receiving an entered letter, obtains through the word and Chinese character index module the sorted Chinese characters or English words associated with that letter, then obtains from the word order probability statistics module the probability of each Chinese character or English word that may follow each of them, and finally sorts the results by probability from high to low.
According to the above scheme, the user interaction module comprises a prediction listener, a prompt view, and a sample library update module;
the prediction listener monitors the user's input behavior and obtains a prediction result from the word prediction module each time the user enters a letter;
the prompt view provides an interface that displays the sorted results;
the sample library update module records the user's input behavior, updates the sample library, and notifies the segmentation module to update the data set of English words and Chinese characters; the update cycle of this process is set by the user according to the operating conditions of the professional system, and a probe is also provided so that the system performs an update when the data source of the professional system changes.
The beneficial effects of the invention are as follows. With this system, prompts that the user is likely to want can be offered during interaction with the professional system before the user has finished typing. As the user keeps interacting, the system continually updates its knowledge structure and offers more accurate prompts. The system can also be integrated into other systems, which extends the versatility of other applications.
Brief description of the drawings
Fig. 1 is a structural diagram of one embodiment of the invention.
Embodiment
The invention is further described below with reference to a specific example and the accompanying drawing.
To give the user more accurate interactive prompts, the system needs to predict what the user may enter. It first has to segment the text sample data gathered from past experience or collected by the system designer, that is, split the sample library into a set whose units are words and Chinese characters. This step has to account for the differences between languages: in English, for example, two words can be separated by a space, whereas Chinese has to be split into individual characters.
Once segmentation has produced the set of words and characters that may be used in the system, input is made easier by letting the user type only a few letters of a word or Chinese character. This helps match the user's idea to the information in the system more quickly, and it requires an index from letters to words and Chinese characters.
Once the word and Chinese character index has been built, the user can quickly retrieve a group of matching words or Chinese characters from one or more letters. The system still has to determine which of these the user most likely needs and, after that word or character has been entered, which word or character is most likely to come next.
Based on the above principle, this embodiment provides an intelligent input prompt system for a professional system. As shown in Fig. 1, it comprises a word and character segmentation module, a word and Chinese character index module, a word prediction module, and a user interaction module. The segmentation module, the index module, and the word prediction module are deployed on the server side, and the user interaction module is deployed on the client. The server side is an independent service separate from the professional system, and the client is used by the professional system as an extension component. This arrangement simplifies integration and allows the system to be integrated into a professional system more easily.
The word and character segmentation module segments the sample library collected from external data sources, splitting the text sample data into a data set whose units are English words and Chinese characters.
The segmentation module comprises a sample input module, a language classification module, and a splitting module. The sample input module extracts text information from the sample library according to specified rules and organizes it into a data set; while extracting the text information, it attaches the dimension information of the text to the text as data source additional information. The language classification module classifies the organized data set by language. The splitting module segments the text according to the segmentation rules defined for each language, producing a data set whose units are English words and Chinese characters; each English word and Chinese character in the data set carries its data source additional information.
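The segmentation step can be sketched as follows. This is a minimal illustration, not the patented implementation: it assumes English words are runs of ASCII letters, that Chinese characters fall in the CJK Unified Ideographs block, and that the data source additional information is a plain string such as a field name. `Token`, `split_sample`, and the `source` label are hypothetical names.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Token:
    text: str    # an English word or a single Chinese character
    lang: str    # "en" or "zh", as assigned by the language classification step
    source: str  # data source additional information (assumed to be a label)


# One match per English word or per single Chinese character.
_TOKEN = re.compile(r"[A-Za-z]+|[\u4e00-\u9fff]")


def split_sample(text, source):
    """Split one sample into tokens that each carry their data source label."""
    tokens = []
    for piece in _TOKEN.findall(text):
        lang = "en" if piece.isascii() else "zh"
        tokens.append(Token(piece, lang, source))
    return tokens
```

For example, `split_sample("OTN 设备", "device_name")` yields one token for the word `OTN` and one token for each of the two Chinese characters, all tagged with `device_name`.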
The word and Chinese character index module takes the data set of English words and Chinese characters, converts each English word into a letter sequence and each Chinese character into a pinyin sequence, and stores these sequences in a prefix tree (trie), in which every node indexes the full set of Chinese characters and English words that can be reached from it.
The word and Chinese character index module comprises an index tree construction module and an indexing module. The index tree construction module converts the data set of Chinese characters and English words provided by the segmentation module into pinyin sequences and letter sequences and stores these sequences in a prefix tree. The prefix tree is first divided into classes according to the data source additional information, and within each class the branches of the tree are generated from the pinyin and/or letter sequences. When a Chinese character is polyphonic, it appears under each of its corresponding sequences. The indexing module establishes the correspondence between the position of an input box and the data source additional information. Using this correspondence, it first selects the class in the prefix tree, then uses the entered pinyin or letter sequence to quickly look up in the prefix tree all Chinese characters and English words that can be reached from it, and outputs them sorted by frequency of use; items with the same frequency are sorted alphabetically.
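The prefix-tree index described above can be sketched as follows, under several assumptions: the pinyin table `PINYIN` is a tiny hypothetical stand-in for a full dictionary, each data source class gets its own trie root, frequencies are simple counters, and ties are broken by plain string order as a stand-in for alphabetical order. `TrieNode`, `PrefixIndex`, `add`, and `lookup` are illustrative names.

```python
from collections import defaultdict

# Hypothetical pinyin table; the polyphonic character 行 has two readings.
PINYIN = {"光": ["guang"], "纤": ["xian"], "行": ["xing", "hang"]}


class TrieNode:
    def __init__(self):
        self.children = {}
        self.entries = set()  # every word/character reachable through this prefix


class PrefixIndex:
    def __init__(self):
        self.roots = defaultdict(TrieNode)  # one trie per data source class
        self.freq = defaultdict(int)        # frequency of use per token

    def add(self, token, source, count=1):
        self.freq[token] += count
        # A Chinese character is inserted under each of its pinyin readings;
        # an English word is inserted under its own letter sequence.
        for key in PINYIN.get(token, [token.lower()]):
            node = self.roots[source]
            for ch in key:
                node = node.children.setdefault(ch, TrieNode())
                node.entries.add(token)

    def lookup(self, prefix, source):
        """Select the class first, then walk the prefix and rank the matches."""
        node = self.roots.get(source)
        for ch in prefix.lower():
            if node is None:
                return []
            node = node.children.get(ch)
        if node is None:
            return []
        # Highest frequency first; equal frequencies fall back to string order.
        return sorted(node.entries, key=lambda t: (-self.freq[t], t))
```

Storing the reachable entries on every node trades memory for lookup speed: a query only walks as many nodes as the user has typed letters.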
The word prediction module establishes a conditional probability model between the data set and the input samples, sorts all predicted Chinese characters or English words that meet the requirements by their probability of occurrence, and outputs the prediction results from the highest probability to the lowest.
The word prediction module comprises a word matrix calculation module, a word order probability statistics module, and a prediction result module. The word matrix calculation module generates, from the data set, a matrix expressing the pairwise relations between English words, between Chinese characters, and between English words and Chinese characters; the matrix is stored as an orthogonal list (cross-linked list). The word order probability statistics module establishes the conditional probability model P(w2 | w1) for these pairwise relations, where w1 denotes one Chinese character or English word, w2 denotes another Chinese character or English word, and P(w2 | w1) is the probability that w2 occurs given that w1 has occurred. The resulting probabilities are filled into the matrix, and the probability is larger when w1 and w2 fall under the same data source additional information. The prediction result module, on receiving an entered letter, obtains through the word and Chinese character index module the sorted Chinese characters or English words associated with that letter, then obtains from the word order probability statistics module the probability of each Chinese character or English word that may follow each of them, and finally sorts the results by probability from high to low.
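The conditional probability model and the final ranking can be sketched as follows. The patent does not state how P(w2 | w1) is estimated or how the same-source preference is applied, so this sketch assumes a weighted bigram count normalized per w1, with adjacent pairs from the same data source counted `same_source_weight` times. The function names and the weight value are hypothetical.

```python
from collections import defaultdict


def conditional_probs(samples, same_source_weight=2.0):
    """Estimate P(w2 | w1) from token sequences.

    samples: list of sequences, each a list of (token, source) pairs.
    Adjacent pairs that share a data source get a larger weight (an assumption).
    """
    weights = defaultdict(lambda: defaultdict(float))
    for tokens in samples:
        for (w1, s1), (w2, s2) in zip(tokens, tokens[1:]):
            weights[w1][w2] += same_source_weight if s1 == s2 else 1.0
    probs = {}
    for w1, followers in weights.items():
        total = sum(followers.values())
        probs[w1] = {w2: wt / total for w2, wt in followers.items()}
    return probs


def predict_next(probs, candidates):
    """For each candidate returned by the index, rank its likely successors."""
    return {
        w1: sorted(probs.get(w1, {}).items(), key=lambda kv: (-kv[1], kv[0]))
        for w1 in candidates
    }
```

With one sample in which 纤 follows 光 inside the same source and one in which 缆 follows 光 across sources, the sketch gives P(纤 | 光) = 2/3 and P(缆 | 光) = 1/3, so 纤 is ranked first.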
The user interaction module provides a text box for user input. When the user enters a letter, word, or Chinese character in the text box, it receives the input and sends it to the word and Chinese character index module, then receives the prediction result of the word prediction module and displays it in the prompt view, and sends the content entered by the user to the segmentation module for updating.
The user interaction module comprises a prediction listener, a prompt view, and a sample library update module. The prediction listener monitors the user's input behavior and obtains a prediction result from the word prediction module each time the user enters a letter. The prompt view provides an interface that displays the sorted results. The sample library update module records the user's input behavior, updates the sample library, and notifies the segmentation module to update the data set of English words and Chinese characters. The update cycle of this process is set by the user according to the operating conditions of the professional system. A probe is also provided so that the system performs an update when the data source of the professional system changes (for example, when the number of newly generated words reaches a threshold set by the user).
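The threshold-driven probe can be sketched as follows. This is only one possible reading of the update mechanism: new tokens are collected until their count reaches a user-set threshold, at which point a rebuild callback is invoked. `SampleLibraryUpdater`, `record`, `flush`, and the `rebuild` callback are hypothetical names, and the periodic update cycle is left to an external timer that would call `flush`.

```python
class SampleLibraryUpdater:
    def __init__(self, known_tokens, threshold, rebuild):
        self.known = set(known_tokens)  # tokens already in the sample library
        self.pending = set()            # new tokens seen since the last update
        self.threshold = threshold      # user-set number of new tokens
        self.rebuild = rebuild          # callback that re-segments and re-indexes

    def record(self, tokens):
        """Record user input; return True if the probe triggered an update."""
        self.pending.update(t for t in tokens if t not in self.known)
        if len(self.pending) >= self.threshold:
            self.flush()
            return True
        return False

    def flush(self):
        """Apply pending tokens; may also be called on the user-set update cycle."""
        new = sorted(self.pending)
        self.known.update(new)
        self.pending.clear()
        if new:
            self.rebuild(new)
```

Batching updates this way keeps the server-side index from being rebuilt on every keystroke while still reacting once enough new vocabulary has accumulated.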
The above embodiment only illustrates the design concept and features of the invention. Its purpose is to enable those skilled in the art to understand the content of the invention and implement it accordingly; the scope of protection of the invention is not limited to this embodiment. All equivalent changes or modifications made according to the principles and design ideas disclosed herein fall within the scope of protection of the invention.
Claims (4)
1. An intelligent input prompt system for a professional system, characterized in that: the system is provided in a professional system and comprises a word and character segmentation module, a word and Chinese character index module, a word prediction module, and a user interaction module, wherein the segmentation module, the index module, and the word prediction module are deployed on the server side and the user interaction module is deployed on the client; the server side is an independent service separate from the professional system, and the client is used by the professional system as an extension component of the professional system;
the word and character segmentation module segments the sample library collected from external data sources, splitting the text sample data into a data set whose units are English words and Chinese characters;
the word and Chinese character index module takes the data set of English words and Chinese characters, converts each English word into a letter sequence and each Chinese character into a pinyin sequence, and stores these sequences in a prefix tree, in which every node indexes the full set of Chinese characters and English words that can be reached from it;
the word prediction module establishes a conditional probability model between the data set and the input samples, sorts all predicted Chinese characters or English words that meet the requirements by their probability of occurrence, and outputs the prediction results from the highest probability to the lowest;
the user interaction module provides a text box for user input; when the user enters a letter, word, or Chinese character in the text box, it receives the input and sends it to the word and Chinese character index module, then receives the prediction result of the word prediction module and displays it in the prompt view, and sends the content entered by the user to the segmentation module for updating;
the word and character segmentation module comprises a sample input module, a language classification module, and a splitting module;
the sample input module extracts text information from the sample library according to specified rules and organizes it into a data set; while extracting the text information, it attaches the dimension information of the text to the text as data source additional information;
the language classification module classifies the organized data set by language;
the splitting module segments the text according to the segmentation rules defined for each language, producing a data set whose units are English words and Chinese characters; each English word and Chinese character in the data set carries its data source additional information.
2. The intelligent input prompt system for a professional system according to claim 1, characterized in that: the word and Chinese character index module comprises an index tree construction module and an indexing module;
the index tree construction module converts the data set of Chinese characters and English words provided by the segmentation module into pinyin sequences and letter sequences and stores these sequences in a prefix tree; the prefix tree is first divided into classes according to the data source additional information, and within each class the branches of the tree are generated from the pinyin and/or letter sequences; when a Chinese character is polyphonic, it appears under each of its corresponding sequences;
the indexing module establishes the correspondence between the position of an input box and the data source additional information; using this correspondence, it first selects the class in the prefix tree, then uses the entered pinyin or letter sequence to quickly look up in the prefix tree all Chinese characters and English words that can be reached from it, and outputs them sorted by frequency of use; items with the same frequency are sorted alphabetically.
3. The intelligent input prompt system for a professional system according to claim 1, characterized in that: the word prediction module comprises a word matrix calculation module, a word order probability statistics module, and a prediction result module;
the word matrix calculation module generates, from the data set, a matrix expressing the pairwise relations between English words, between Chinese characters, and between English words and Chinese characters; the matrix is stored as an orthogonal list (cross-linked list);
the word order probability statistics module establishes the conditional probability model P(w2 | w1) for these pairwise relations, where w1 denotes one Chinese character or English word, w2 denotes another Chinese character or English word, and P(w2 | w1) is the probability that w2 occurs given that w1 has occurred; the resulting probabilities are filled into the matrix, and the probability is larger when w1 and w2 fall under the same data source additional information;
the prediction result module, on receiving an entered letter, obtains through the word and Chinese character index module the sorted Chinese characters or English words associated with that letter, then obtains from the word order probability statistics module the probability of each Chinese character or English word that may follow each of them, and finally sorts the results by probability from high to low.
4. The intelligent input prompt system for a professional system according to claim 1, characterized in that: the user interaction module comprises a prediction listener, a prompt view, and a sample library update module;
the prediction listener monitors the user's input behavior and obtains a prediction result from the word prediction module each time the user enters a letter;
the prompt view provides an interface that displays the sorted results;
the sample library update module records the user's input behavior, updates the sample library, and notifies the segmentation module to update the data set of English words and Chinese characters; the update cycle of this process is set by the user according to the operating conditions of the professional system, and a probe is also provided so that the system performs an update when the data source of the professional system changes.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410647575.2A CN104317961B (en) | 2014-11-14 | 2014-11-14 | A kind of professional system inputs intelligent prompt system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104317961A (en) | 2015-01-28 |
CN104317961B (en) | 2018-04-27 |
Family
ID=52373193
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410647575.2A Expired - Fee Related CN104317961B (en) | 2014-11-14 | 2014-11-14 | A kind of professional system inputs intelligent prompt system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104317961B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109062888B (en) * | 2018-06-04 | 2023-03-31 | 昆明理工大学 | Self-correcting method for input of wrong text |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101697109A (en) * | 2009-10-26 | 2010-04-21 | 北京搜狗科技发展有限公司 | Method and system for acquiring candidates of input method |
CN101727271A (en) * | 2008-10-22 | 2010-06-09 | 北京搜狗科技发展有限公司 | Method and device for providing error correcting prompt and input method system |
JP2011180941A (en) * | 2010-03-03 | 2011-09-15 | National Institute Of Information & Communication Technology | Phrase table generator and computer program therefor |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101436193B (en) * | 2007-11-14 | 2011-05-04 | 深圳富泰宏精密工业有限公司 | Text editor forecast input system and method |
Also Published As
Publication number | Publication date |
---|---|
CN104317961A (en) | 2015-01-28 |
Legal Events
Code | Title | Description |
---|---|---|
C06 | Publication | |
PB01 | Publication | |
C10 | Entry into substantive examination | |
SE01 | Entry into force of request for substantive examination | |
CB02 | Change of applicant information | Address after: 430074 Kanto science and Technology Industrial Zone, East Lake New Technology Development Zone, Hubei, Wuhan; Applicant after: Wuhan leading technology Co., Ltd.; Address before: Three road 430074 Hubei city of Wuhan province Wuchang Guanshan; Applicant before: Wuhan NEC Fiber Optic Communications Industry Co., Ltd. |
GR01 | Patent grant | |
CF01 | Termination of patent right due to non-payment of annual fee | Granted publication date: 20180427; Termination date: 20191114 |