CN104317961B - A kind of professional system inputs intelligent prompt system - Google Patents

A kind of professional system inputs intelligent prompt system Download PDF

Info

Publication number
CN104317961B
CN104317961B CN201410647575.2A CN201410647575A CN104317961B CN 104317961 B CN104317961 B CN 104317961B CN 201410647575 A CN201410647575 A CN 201410647575A CN 104317961 B CN104317961 B CN 104317961B
Authority
CN
China
Prior art keywords
word
module
chinese character
user
input
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201410647575.2A
Other languages
Chinese (zh)
Other versions
CN104317961A (en
Inventor
何双江
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan Leading Technology Co Ltd
Original Assignee
Wuhan Leading Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan Leading Technology Co Ltd filed Critical Wuhan Leading Technology Co Ltd
Priority to CN201410647575.2A priority Critical patent/CN104317961B/en
Publication of CN104317961A publication Critical patent/CN104317961A/en
Application granted granted Critical
Publication of CN104317961B publication Critical patent/CN104317961B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

The present invention provides a kind of professional system input intelligent prompt system, it divides word modules, word and Chinese character indexing module, Word prediction module, user interactive module including participle, word modules, word and Chinese character indexing module, Word prediction module is wherein divided to be arranged on server-side, user interactive module is arranged on client, server-side is separated as independent service with professional system, and client is used as the extension element of professional system by professional system.Can be in user and professional system interaction using present system, the prompt message that user may wish to is provided in the case where user does not make complete input also completely, and can be in the continuous interaction of user, system constantly updates its knowledge structure, prompt the more accurate information of user, the ability for being desirably integrated into other systems is provided at the same time, to extend the versatility of other application.

Description

A kind of professional system inputs intelligent prompt system
Technical field
The invention belongs to field of information processing, and in particular to a kind of professional system inputs intelligent prompt system.
Background technology
In most computers software systems, it can all be related to the mistake by user's inputting word information and system interaction Journey.Most software system interaction mode is all directly by text box inputting word information, then by system according to defeated by user Enter result to be inquired about, can be there are substantial amounts of redundancy in these query results, and non-user is really desired, it is difficult to accomplish It is accurately positioned the desired information of user.Another aspect system may may require that user inputs the data of specific format, Yong Huke Can the frequent input error of meeting.The reason for this problem occur mainly two aspects.On the one hand, most users are handed over system It is difficult that the idea of oneself is converted into system information to understand quickly during mutually, therefore inputs inaccurate.On the other hand, System gives prompting deficiency input by user, causes user to input inaccurate.It is then desired to propose a kind of software systems input intelligence Can reminding method, user's prompting is given in user's input process, allowing user, progressively clear and definite its idea solves the problems, such as this.
It is achieved in that used by the first method of the prior art related to the present invention and provides a static example Text, to some promptings of user and instructs, helps user to input correct information.
If the first processing mode will there are problems with using above-mentioned.First, expressed by static sample text Information content is seldom, if the information that user did not understood or thought expression is still extremely difficult to beyond example ranges, user's input Input form required by system.Finally, static sample text number of words is limited, causes generality too high, and user understands that static state is shown The difficulty of example text sheet is larger, so as to increase the difficulty that user uses system.
Prior art second method related to the present invention is to be directed to specific input scene, customize advanced inquiry or The input frame of customization, it is desirable to which user is inputted in a particular manner.
If second of processing mode will there are problems with using above-mentioned.First, necessarily resulted in for special scenes Method does not have autgmentability, bad adaptability.Secondly, the advanced inquiry of customization or the input frame of customization must add system friendship Mutual complexity.Finally, this mode simply definitely limits user's input, is but not given to user and prompts enough, i.e., Do not solve the problems, such as at all.
The content of the invention
The technical problem to be solved in the present invention is:A kind of professional system input intelligent prompt system is provided, solves existing skill User cannot provide more accurate prompting in the input process of professional system in art, cause user and system interaction to there is barrier The problem of hindering.
The technical solution taken by the invention to solve the above technical problem is:A kind of professional system input intelligent prompt system System, it is characterised in that:It divides word modules, word and Chinese character indexing module, Word prediction module, user mutual mould including participle Block, wherein dividing word modules, word and Chinese character indexing module, Word prediction module to be arranged on server-side, user interactive module is set In client, server-side is separated as independent service with professional system, and client is special as the extension element of professional system Industry system uses;
Divide word modules, the sample storehouse for being collected into from external data source is segmented, by samples of text data point The data acquisition system being cut into units of English word and Chinese character;
Word and Chinese character indexing module, for from English word and Chinese character in the data acquisition system of unit, by English word Alphabetical sequence is converted into, Chinese character is converted into pinyin sequence, then these sequences are stored with the data structure of prefix trees, is set In each node will index its all Chinese character and English word set that can be associated with;
Word prediction module, for establishing the conditional probability model between data acquisition system and input sample, meets all It is required that the Chinese character predicted or English word be ranked up according to the probability of generation, by probability from high to low export prediction knot Fruit;
User interactive module, is inputted for providing text box to user, when user inputs letter, list in text box When word or Chinese character, the input of user is received, is sent to word and Chinese character indexing module, finally receives the prediction of Word prediction module Content input by user is sent to participle and divides word modules to be updated by the results show into prompting view.
By such scheme, the participle divides word modules to include input sample module, language classification module and splits module;
Input sample module, for according to specified Rule Extraction text message, being organized into data acquisition system from sample storehouse, It is attached to while extracting text message using its latitude information as data source additional information in text message;
Language classification module, for the data acquisition system put in order to be classified by language;
Module is split, divides word for dividing word rule to carry out participle according to difference defined in different language, splits Into the data acquisition system in units of English word and Chinese character, the English word and Chinese character in data acquisition system are attached with data source Add information.
By such scheme, the word and Chinese character indexing module establish module and index module including index tree;
Index tree establishes module, for the data set in units of Chinese character and English word that word modules will be divided to provide Conjunction is converted into pinyin sequence and alphabetical sequence, then these sequences are stored with the data structure of prefix trees;Prefix trees according to Data source additional information is first classified, and the branch by phonetic and/or alphabetical sequence spanning tree is carried out in each class;Work as Chinese character For polyphone when, respectively appear in corresponding sequence;
Index module, for establishing the position of input frame and the correspondence of data source additional information, by defeated Enter the position of frame and the correspondence of data source additional information, the first selection sort from prefix trees, further according to input Pinyin and Alphabetical sequence quickly indexes its all Chinese character and English word that can be associated with from above-mentioned prefix trees, and presses Chinese character and English The frequency of usage sequence output of literary word, the frequency is identical, is sorted alphabetically.
By such scheme, the Word prediction module include word matrix computations module, word order probability statistical module and Prediction result module;
Word matrix computations module, for according to data acquisition system generate one to express English word and English word, The matrix of Chinese character and Chinese character, English word and Chinese character relation between any two, is stored by the way of orthogonal list;
Word order probability statistical module, for establishing the conditional probability model P (w2 | w1) of above-mentioned relation between any two, i.e. w1 Represent one of Chinese character or English word, w2 represents another Chinese character or English word, and w2 occurs under conditions of w1 generations Probability is P (w2 | w1), probability results is inserted in above-mentioned matrix, if wherein w1 and w2 is added in same data source Then its probability of happening bigger under information;
Prediction result module, in the case where receiving and have input a letter, passing through word and Chinese character indexing mould Block indexes the Chinese character being associated with it or English word ranking results, each obtained further according to word order probability statistical module The probability of the next Chinese character or English word of Chinese character or English word, is finally sorted from high to low by probability results.
By such scheme, the user interactive module includes prediction monitor, prompting view and sample storehouse update module;
Predict monitor,, will be pre- from word whenever user have input a letter for the input behavior of monitoring users Survey module and obtain prediction result;
View is prompted, for providing the interface of a display ranking results;
Sample storehouse update module, for recording the input behavior of user, and updates sample storehouse, and notice participle divides word modules more The newly data acquisition system in units of English word and Chinese character;This renewal process is set by user according to the operating condition of professional system Determine the update cycle, while a probe is provided, in the case where the data source of professional system changes, system is updated place Reason.
Beneficial effects of the present invention are:Using present system can in user and professional system interaction, with Family provides the prompt message that user may wish in the case of not making complete input completely also, and can not break off a friendship in user During mutually, system constantly updates its knowledge structure, prompts the more accurate information of user, while provide and be desirably integrated into The ability of other systems, to extend the versatility of other application.
Brief description of the drawings
Fig. 1 is the structure diagram of one embodiment of the invention.
Embodiment
With reference to instantiation and attached drawing, the present invention will be further described.
In order to give user one more accurately interactivity prompting, system needs to predict the content that user may input, first First need to be segmented from the samples of text data that previous experiences or system designer are collected into, i.e., split from sample storehouse Into the set of letters in units of word, Chinese character, this process needs to consider the difference between different language, such as:English can be with Two words are distinguished in space, and Chinese then needs to divide individual Chinese character.
After completing and dividing word to generate the word set that may be used in system from sample, use for convenience Family inputs, and user need to only input several letters of word or Chinese character.So do can play faster by user's idea with The purpose that information combines in system.This just needs to establish a letter to word and the index of Chinese character.
After word and Chinese character indexing has been established, user can retrieve one group by one or more letters quickly Matching word or Chinese character, but these words or Chinese character those should be only the words of user's most probable needs, it is and defeated Enter after completing this word or Chinese character, its next word or Chinese character most likely which.
According to above-mentioned principle, the present embodiment provides a kind of professional system to input intelligent prompt system, as shown in Figure 1, including Divide word modules, word and Chinese character indexing module, Word prediction module, user interactive module, wherein divide word modules, word and Chinese character indexing module, Word prediction module are arranged on server-side, and user interactive module is arranged on client, and server-side is as independent Service separated with professional system, client is used as the extension element of professional system by professional system.So facilitate The system integration, can enable the system to enough more easily be integrated into professional system.
Divide word modules, the sample storehouse for being collected into from external data source is segmented, by samples of text data point The data acquisition system being cut into units of English word and Chinese character.
Divide word modules to include input sample module, language classification module and split module;Input sample module, is used for From sample storehouse according to specified Rule Extraction text message, data acquisition system is organized into, by its latitude while text message is extracted Degree information is attached in text message as data source additional information;Language classification module, for the data set that will be put in order Conjunction is classified by language;Module is split, for dividing word rule to be segmented according to different defined in different language Point word, the data acquisition system being divided into units of English word and Chinese character, English word and Chinese character in data acquisition system carry Data source additional information.
Word and Chinese character indexing module, for from English word and Chinese character in the data acquisition system of unit, by English word Alphabetical sequence is converted into, Chinese character is converted into pinyin sequence, then these sequences are stored with the data structure of prefix trees, is set In each node will index its all Chinese character and English word set that can be associated with.
Word and Chinese character indexing module establish module and index module including index tree;Index tree establishes module, for inciting somebody to action The data acquisition system in units of Chinese character and English word that word modules provide is divided to be converted into pinyin sequence and alphabetical sequence, then These sequences are stored with the data structure of prefix trees;Prefix trees are first classified according to data source additional information, often The branch by phonetic and/or alphabetical sequence spanning tree is carried out in a class;When Chinese character is polyphone, corresponding sequence is respectively appeared in In row;Index module, for establishing the position of input frame and the correspondence of data source additional information, passes through input frame Position and data source additional information correspondence, first the selection sort from prefix trees, further according to input Pinyin and letter Sequence quickly indexes its all Chinese character and English word that can be associated with from above-mentioned prefix trees, and presses Chinese character and English list The frequency of usage sequence output of word, the frequency is identical, is sorted alphabetically.
Word prediction module, for establishing the conditional probability model between data acquisition system and input sample, meets all It is required that the Chinese character predicted or English word be ranked up according to the probability of generation, by probability from high to low export prediction knot Fruit.
Word prediction module includes word matrix computations module, word order probability statistical module and prediction result module;Word Matrix computations module, for generating one according to data acquisition system expressing English word and English word, Chinese character and Chinese character, English The matrix of literary word and Chinese character relation between any two, is stored by the way of orthogonal list;Word order probability statistical module, is used In the conditional probability model P (w2 | w1) for establishing above-mentioned relation between any two, i.e. w1 represents one of Chinese character or English word, W2 represents another Chinese character or English word, and the probability that w2 occurs under conditions of w1 occurs is P (w2 | w1), by probability results Insert in above-mentioned matrix, wherein w1 and w2 its probability of happening bigger if under same data source additional information;Prediction Object module, in the case where receiving and have input a letter, by word and Chinese character indexing module index under it The Chinese character or English word ranking results being associated with, each Chinese character obtained further according to word order probability statistical module or English are single The probability of the next Chinese character or English word of word, is finally sorted from high to low by probability results.
User interactive module, is inputted for providing text box to user, when user inputs letter, list in text box When word or Chinese character, the input of user is received, is sent to word and Chinese character indexing module, finally receives the prediction of Word prediction module Content input by user is sent to participle and divides word modules to be updated by the results show into prompting view.
User interactive module includes prediction monitor, prompting view and sample storehouse update module;Monitor is predicted, for supervising The input behavior of user is listened, whenever user have input a letter, prediction result will be obtained from Word prediction module;Prompting regards Figure, for providing the interface of a display ranking results;Sample storehouse update module, for recording the input behavior of user, and more New samples storehouse, notice divide word modules to update the data acquisition system in units of English word and Chinese character;This renewal process by User sets the update cycle according to the operating condition of professional system, while provides a probe, is sent out in the data source of professional system In the case of changing(Such as generate new word when the quantity of caused new word reached user setting threshold value when Wait), system is updated processing.
Above example is merely to illustrate the Computation schema and feature of the present invention, and its object is to make technology in the art Personnel can understand present disclosure and implement according to this, and protection scope of the present invention is not limited to above-described embodiment.So it is all according to The equivalent variations made according to disclosed principle, mentality of designing or modification, within protection scope of the present invention.

Claims (4)

1. a kind of professional system inputs intelligent prompt system, it is characterised in that:The system is arranged in professional system, the system bag Include participle and divide word modules, word and Chinese character indexing module, Word prediction module, user interactive module, wherein dividing word modules, word Server-side is arranged on Chinese character indexing module, Word prediction module, user interactive module is arranged on client, and server-side is as only Vertical service is separated with professional system, and client is used as the extension element of professional system by professional system;
Divide word modules, the sample storehouse for being collected into from external data source is segmented, and samples of text data are divided into Data acquisition system in units of English word and Chinese character;
Word and Chinese character indexing module, in the data acquisition system of unit, English word to be changed from English word and Chinese character Into alphabetical sequence, Chinese character is converted into pinyin sequence, then these sequences are stored with the data structure of prefix trees, it is every in tree One node will index its all Chinese character and English word set that can be associated with;
Word prediction module, for establishing the conditional probability model between data acquisition system and input sample, meets the requirements all The Chinese character predicted or English word be ranked up according to the probability of generation, export prediction result from high to low by probability;
User interactive module, is inputted for providing text box to user, when user inputted in text box letter, word or During Chinese character, the input of user is received, is sent to word and Chinese character indexing module, finally receives the prediction result of Word prediction module It is shown in prompting view, and content input by user is sent to participle and divides word modules to be updated;
The participle divides word modules to include input sample module, language classification module and splits module;
Input sample module, for, according to specified Rule Extraction text message, being organized into data acquisition system from sample storehouse, extracting It is attached to while text message using its latitude information as data source additional information in text message;
Language classification module, for the data acquisition system put in order to be classified by language;
Split module, for dividing word rule to carry out participle to divide word according to different defined in different language, be divided into English word and Chinese character are the data acquisition system of unit, and the English word and Chinese character in data acquisition system add letter with data source Breath.
A kind of 2. professional system input intelligent prompt system according to claim 1, it is characterised in that:The word and Chinese character indexing module establishes module and index module including index tree;
Index tree establishes module, for the data acquisition system in units of Chinese character and English word for dividing word modules to provide to be turned Change pinyin sequence and alphabetical sequence into, then these sequences are stored with the data structure of prefix trees;Prefix trees are according to data Source additional information is first classified, and the branch by phonetic and/or alphabetical sequence spanning tree is carried out in each class;When Chinese character is more During sound word, respectively appear in corresponding sequence;
Index module, for establishing the position of input frame and the correspondence of data source additional information, passes through input frame Position and data source additional information correspondence, first the selection sort from prefix trees, further according to input Pinyin and letter Sequence quickly indexes its all Chinese character and English word that can be associated with from above-mentioned prefix trees, and presses Chinese character and English list The frequency of usage sequence output of word, the frequency is identical, is sorted alphabetically.
A kind of 3. professional system input intelligent prompt system according to claim 1, it is characterised in that:The word is pre- Surveying module includes word matrix computations module, word order probability statistical module and prediction result module;
Word matrix computations module, for generating one according to data acquisition system expressing English word and English word, Chinese character With the matrix of Chinese character, English word and Chinese character relation between any two, stored by the way of orthogonal list;
Word order probability statistical module, for establishing the conditional probability model P (w2 | w1) of above-mentioned relation between any two, i.e. w1 is represented One of Chinese character or English word, w2 represent another Chinese character or English word, the probability that w2 occurs under conditions of w1 generations As P (w2 | w1), probability results are inserted in above-mentioned matrix, if wherein w1 and w2 is in same data source additional information Under then its probability of happening bigger;
Prediction result module, in the case where receiving and have input a letter, passing through word and Chinese character indexing module rope The Chinese character being associated with it or English word ranking results are guided to, each Chinese character obtained further according to word order probability statistical module Or the probability of the next Chinese character or English word of English word, finally sorted from high to low by probability results.
A kind of 4. professional system input intelligent prompt system according to claim 1, it is characterised in that:The user hands over Mutual module includes prediction monitor, prompting view and sample storehouse update module;
Predict monitor,, will be from Word prediction mould whenever user have input a letter for the input behavior of monitoring users Block obtains prediction result;
View is prompted, for providing the interface of a display ranking results;
Sample storehouse update module, for recording the input behavior of user, and updates sample storehouse, notice participle divide word modules renewal with English word and the data acquisition system that Chinese character is unit;This renewal process is set more by user according to the operating condition of professional system The new cycle, while a probe is provided, in the case where the data source of professional system changes, system is updated processing.
CN201410647575.2A 2014-11-14 2014-11-14 A kind of professional system inputs intelligent prompt system Expired - Fee Related CN104317961B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410647575.2A CN104317961B (en) 2014-11-14 2014-11-14 A kind of professional system inputs intelligent prompt system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410647575.2A CN104317961B (en) 2014-11-14 2014-11-14 A kind of professional system inputs intelligent prompt system

Publications (2)

Publication Number Publication Date
CN104317961A CN104317961A (en) 2015-01-28
CN104317961B true CN104317961B (en) 2018-04-27

Family

ID=52373193

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410647575.2A Expired - Fee Related CN104317961B (en) 2014-11-14 2014-11-14 A kind of professional system inputs intelligent prompt system

Country Status (1)

Country Link
CN (1) CN104317961B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109062888B (en) * 2018-06-04 2023-03-31 昆明理工大学 Self-correcting method for input of wrong text

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101697109A (en) * 2009-10-26 2010-04-21 北京搜狗科技发展有限公司 Method and system for acquiring candidates of input method
CN101727271A (en) * 2008-10-22 2010-06-09 北京搜狗科技发展有限公司 Method and device for providing error correcting prompt and input method system
JP2011180941A (en) * 2010-03-03 2011-09-15 National Institute Of Information & Communication Technology Phrase table generator and computer program therefor

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101436193B (en) * 2007-11-14 2011-05-04 深圳富泰宏精密工业有限公司 Text editor forecast input system and method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101727271A (en) * 2008-10-22 2010-06-09 北京搜狗科技发展有限公司 Method and device for providing error correcting prompt and input method system
CN101697109A (en) * 2009-10-26 2010-04-21 北京搜狗科技发展有限公司 Method and system for acquiring candidates of input method
JP2011180941A (en) * 2010-03-03 2011-09-15 National Institute Of Information & Communication Technology Phrase table generator and computer program therefor

Also Published As

Publication number Publication date
CN104317961A (en) 2015-01-28

Similar Documents

Publication Publication Date Title
CN112579707B (en) Log data knowledge graph construction method
KR102591421B1 (en) Intent recommendation method, apparatus, device and storage medium
CN109960726B (en) Text classification model construction method, device, terminal and storage medium
CN111984689B (en) Information retrieval method, device, equipment and storage medium
US11295071B2 (en) Graphical systems and methods for human-in-the-loop machine intelligence
JP6655788B2 (en) Bilingual corpus creation method, apparatus and program, and machine translation system
CN105094368B (en) A kind of control method and control device that frequency modulation sequence is carried out to candidates of input method
CN106202059A (en) Machine translation method and machine translation apparatus
CN107783973A (en) The methods, devices and systems being monitored based on domain knowledge spectrum data storehouse to the Internet media event
KR102491172B1 (en) Natural language question-answering system and learning method
CN104102626A (en) Method for computing semantic similarities among short texts
CN105022733B (en) DINFO OEC text analyzings method for digging and equipment
EP3425532A1 (en) Information mining method and system, electronic device and readable storage medium
CN104076944A (en) Chat emoticon input method and device
CN103870000A (en) Method and device for sorting candidate items generated by input method
KR101757900B1 (en) Method and device for knowledge base construction
CN111738014A (en) Drug classification method, device, equipment and storage medium
CN104484057A (en) Associative result providing method and device
CN110209781B (en) Text processing method and device and related equipment
US20180341646A1 (en) Translated-clause generating method, translated-clause generating apparatus, and recording medium
Saddam et al. Sentiment analysis of flood disaster management in Jakarta on Twitter using support vector machines
KR102146433B1 (en) Method for providing context based language learning service using associative memory
US20160203139A1 (en) Standardizing attributes and entities in a social networking system
WO2019163642A1 (en) Summary evaluation device, method, program, and storage medium
CN104317961B (en) A kind of professional system inputs intelligent prompt system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: 430074 Kanto science and Technology Industrial Zone, East Lake New Technology Development Zone, Hubei, Wuhan

Applicant after: Wuhan leading technology Co., Ltd.

Address before: Three road 430074 Hubei city of Wuhan province Wuchang Guanshan

Applicant before: Wuhan NEC Fiber Optic Communications Industry Co., Ltd.

CB02 Change of applicant information
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20180427

Termination date: 20191114

CF01 Termination of patent right due to non-payment of annual fee