WO2012152039A1 - Method and device for determining candidate character in handwriting input - Google Patents

Method and device for determining candidate character in handwriting input

Publication number
WO2012152039A1
WO2012152039A1 PCT/CN2011/084849 CN2011084849W WO2012152039A1 WO 2012152039 A1 WO2012152039 A1 WO 2012152039A1 CN 2011084849 W CN2011084849 W CN 2011084849W WO 2012152039 A1 WO2012152039 A1 WO 2012152039A1
Authority
WO
WIPO (PCT)
Prior art keywords
radical
input
handwriting
candidate word
candidate
Application number
PCT/CN2011/084849
Other languages
French (fr)
Chinese (zh)
Inventor
江桂凤
Original Assignee
中兴通讯股份有限公司
Application filed by 中兴通讯股份有限公司
Publication of WO2012152039A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048 Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0487 Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser
    • G06F3/0488 Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser using a touch-screen or digitiser, e.g. input of commands through traced gestures
    • G06F3/04883 Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser using a touch-screen or digitiser, e.g. input of commands through traced gestures for inputting data by handwriting, e.g. gesture or text
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/018 Input/output arrangements for oriental characters

Definitions

  • The present invention relates to the field of electronic device input technologies, and in particular to a method and apparatus for determining a candidate word in handwriting input.
  • Background
  • Handwriting input is more in line with Chinese writing habits than keyboard input, so various fast handwriting input methods have emerged.
  • Chinese characters are block characters with a distinctive structure.
  • From the highest level to the lowest, the structure of a Chinese character can be divided into the whole character, components, and strokes.
  • Accordingly, Chinese character handwriting input methods fall roughly into three types: methods based on whole-character recognition, methods based on component recognition, and methods based on stroke recognition.
  • A method based on whole-character recognition matches the complete character written by the user against samples or templates in the recognition system and takes the sample with the greatest similarity and smallest difference as the recognition result. Because many strokes must be input and recognition is complex, its recognition efficiency and accuracy are relatively low.
  • A method based on stroke sequences considers only the strokes themselves and the order between strokes, and ignores the spatial structure of the character. Although it places few demands on how neatly the user writes, its recognition efficiency is relatively low.
  • A method based on component recognition identifies the component written by the user and obtains the candidate words that contain it. This reduces the number of strokes the user must input, and its recognition efficiency and accuracy are higher than those of the other two approaches.
  • The prior art provides a fast Chinese character handwriting input method, implemented as follows: Chinese characters are encoded by radical and location to form a coded character library.
  • Each grid cell (田字格) in the writing area is divided into 16 radical locations.
  • The user inputs a radical, which yields a code string consisting of the radical code and the radical location code.
  • The code string is used as the search key.
  • All Chinese characters that satisfy the key are then looked up in the coded character library.
  • This method can improve the handwriting input speed of Chinese characters to a certain extent.
  • However, because it encodes characters by radical and location information, the coded character library is complex to implement and occupies a relatively large amount of storage, and the user must write the radical in the correct small grid cell in order to find the intended character.
  • For example, the code of the Chinese character "部" consists of three radical codes and the location codes of the positions of those three radicals. Suppose the user writes "立" in the upper left corner of the writing area. Using the code string formed from the radical code for "立" and the upper-left location code, the coded character library can be queried to obtain the sequence of candidate words that have "立" in the upper left corner, and "部" can then be found in it.
  • If the user instead writes "立" in the left half of the writing area, the code string formed from the radical code for "立" and the left-half location code cannot retrieve a candidate word sequence containing "部" from the coded character library.
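The dependence on writing position in this prior-art example can be illustrated with a minimal Python sketch. The library contents, the location names (`upper_left`, `left_half`), and the function `lookup` are hypothetical and chosen only to mirror the example; the actual codes used by the prior-art method are not given in the text.

```python
# Hypothetical coded character library: each key is a (radical, location)
# code string; each value lists the characters stored under that code.
CODED_LIBRARY = {
    ("立", "upper_left"): ["部", "剖"],
    ("立", "left_half"): ["站", "端"],
}


def lookup(radical: str, location: str) -> list[str]:
    """Return the characters stored under the (radical, location) key."""
    return CODED_LIBRARY.get((radical, location), [])


upper_left_hits = lookup("立", "upper_left")
left_half_hits = lookup("立", "left_half")
print("部" in upper_left_hits)  # True: the radical was written where the code expects it
print("部" in left_half_hits)   # False: same radical, different location, so "部" is missed
```

Because the location is part of the key, a correctly recognized radical written in the wrong cell still misses the intended character, which is the weakness the passage describes.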
  • The invention provides a method and a device for determining a candidate word in handwriting input, which address the input recognition difficulty of prior-art handwriting input terminals, where the writing area is small and it is hard to ensure that the user writes in a standard way.
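  • The present invention provides a method for determining a candidate word in handwriting input.
  • The method is applied to a smart device that includes a handwriting input device, and includes: determining radical information of a pre-input Chinese character according to the radical of that character input by the user; matching the radical handwriting in the radical information with a stored personal handwriting sample and, if the match succeeds, obtaining the radical index number corresponding to the personal handwriting sample, where the personal handwriting sample is determined by radical handwriting input by the user; and obtaining, according to the radical index number, a candidate word sequence containing the radical, and determining the candidate word of the pre-input Chinese character from that sequence.
  • The method further includes: obtaining the input coordinates of the radical from the radical information and determining the glyph structure of the pre-input Chinese character according to the input coordinates; according to the glyph structure, Chinese characters in the candidate word sequence whose structure matches are arranged first.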
  • The method further includes:
  • If the candidate word sequence includes N candidate words and each of the N candidate words corresponds to a usage frequency value, sorting the N candidate words according to their usage frequency values, such that the usage frequency value of the (M-1)th candidate word in the ranking is greater than or equal to that of the Mth candidate word, where M is an integer greater than 2 and less than or equal to N, to obtain an optimized candidate word sequence.
  • The method further includes: obtaining the first stroke of the remaining part of the pre-input Chinese character other than the radical and, according to that first remaining stroke, arranging first the Chinese characters in the candidate word sequence whose first remaining stroke is the same, to obtain an optimized candidate word sequence.
  • Determining the personal handwriting sample from the radical handwriting input by the user includes: after receiving the radical input by the user, determining whether the radical is contained in the stored personal handwriting samples; if it is, displaying a candidate word sequence containing the identified radical; if it is not, determining the input radical from a candidate radical sequence according to the user's input, and querying whether a radical handwriting sample already exists for the input radical; if one exists, updating it with the personal handwriting sample; if none exists, saving the personal handwriting sample as the radical handwriting sample.
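The match-or-learn flow just described can be sketched in Python as follows. This is only an illustration under stated assumptions: handwriting is modelled as a list of (x, y) points, similarity as the mean point-to-point distance, and the names `distance`, `match_sample`, `learn_sample`, the threshold `0.2`, and the index number `7` are all hypothetical. The source does not specify how handwriting is compared, and "update" is modelled here as replacing the stored sample.

```python
import math

Point = tuple[float, float]
Handwriting = list[Point]


def distance(a: Handwriting, b: Handwriting) -> float:
    """Mean point-to-point distance; infinite if the point counts differ."""
    if len(a) != len(b) or not a:
        return math.inf
    return sum(math.dist(p, q) for p, q in zip(a, b)) / len(a)


def match_sample(hw: Handwriting, samples: dict, threshold: float = 0.2):
    """Return the radical index number of the closest stored sample, or None."""
    best_index, best_dist = None, threshold
    for index, sample in samples.items():
        d = distance(hw, sample)
        if d <= best_dist:
            best_index, best_dist = index, d
    return best_index


def learn_sample(hw: Handwriting, chosen_index: int, samples: dict) -> str:
    """Store hw under the radical the user picked from the candidate radicals."""
    action = "updated" if chosen_index in samples else "saved"
    samples[chosen_index] = hw
    return action


samples: dict = {}                             # radical index number -> personal sample
stroke = [(0.0, 0.0), (0.5, 0.5), (1.0, 0.0)]
first_try = match_sample(stroke, samples)      # None: nothing is stored yet
action = learn_sample(stroke, 7, samples)      # "saved": user picked radical 7
second_try = match_sample(stroke, samples)     # 7: the same handwriting now matches
```

After the first miss the sample is stored, so the same handwriting is recognized directly the next time, which is the user-adaptive behaviour the text describes.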
  • The present invention also provides a device for determining a candidate word in handwriting input.
  • The device includes an input module, a radical determination module, and a candidate word determination module, wherein the input module is configured to determine radical information of the pre-input Chinese character according to the radical of that character input by the user, and to send the radical information to the radical determination module;
  • the radical determination module is configured to match the radical handwriting in the radical information sent by the input module with the stored personal handwriting sample and, if the match succeeds, to obtain the radical index number corresponding to the personal handwriting sample and send it to the candidate word determination module, where the personal handwriting sample is determined by radical handwriting input by the user;
  • the candidate word determination module is configured to obtain, according to the radical index number sent by the radical determination module, a candidate word sequence containing the radical, and to determine the candidate word of the pre-input Chinese character from that sequence.
  • The device further includes:
  • a glyph structure sorting module, configured to receive the candidate word sequence sent by the candidate word determination module, obtain the input coordinates of the radical from the radical information, determine the glyph structure of the pre-input Chinese character according to the input coordinates, and, according to the glyph structure, arrange first the Chinese characters in the candidate word sequence whose structure matches;
  • the candidate word determination module is further configured to send the candidate word sequence and the radical information to the glyph structure sorting module.
  • The device further includes: a usage frequency sorting module, configured to receive the sorted candidate word sequence sent by the glyph structure sorting module and, if the candidate word sequence includes N candidate words and each of the N candidate words corresponds to a usage frequency value, to sort the N candidate words according to their usage frequency values, such that the usage frequency value of the (M-1)th candidate word in the ranking is greater than or equal to that of the Mth candidate word, where M is an integer greater than 2 and less than or equal to N, and to send the optimized candidate word sequence to the candidate word determination module;
  • the glyph structure sorting module is further configured to send the sorted candidate word sequence to the usage frequency sorting module;
  • the candidate word determination module is further configured to receive the candidate word sequence sorted by the usage frequency sorting module.
  • The device further includes: a remaining stroke sorting module, configured to receive the candidate word sequence sent by the candidate word determination module and the first stroke of the remaining part of the pre-input Chinese character other than the radical, to arrange first, according to that first remaining stroke, the Chinese characters in the candidate word sequence whose first remaining stroke is the same, and to send the optimized candidate word sequence to the candidate word determination module;
  • the candidate word determination module is further configured to send the candidate word sequence to the remaining stroke sorting module and to receive the candidate word sequence optimized by the remaining stroke sorting module.
  • The device further includes: a personal handwriting sample determination module, configured to check, according to the radical handwriting sent by the radical determination module, whether the radical is contained in the existing personal handwriting samples; if it is, to send the radical index number corresponding to the personal handwriting sample to the radical determination module; if it is not, to query, according to the radical handwriting, whether a radical handwriting sample exists for the input radical; if one exists, to update it to the personal handwriting sample determined by the radical handwriting; and if none exists, to save the radical handwriting in the radical information as a personal handwriting sample and add the radical index number corresponding to that sample;
  • the radical determination module is specifically configured to send the radical handwriting to the personal handwriting sample determination module, receive the radical index number corresponding to the personal handwriting sample sent by that module, and send the radical index number to the candidate word determination module;
  • the candidate word determination module is configured to obtain, according to the radical index number sent by the radical determination module, a candidate word sequence containing the radical.
  • The method and device provided by the present invention store personal handwriting samples generated according to the user's own handwriting habits. The character library is simple to implement and occupies little storage, because the Chinese characters in the library need not be encoded; the library only needs to be indexed by radical for easy retrieval.
  • In addition, since each user's handwriting has stable and distinct personal characteristics, using the user's own radical handwriting as the matching sample reduces the number of matches, improves matching efficiency, improves the recognition efficiency and accuracy of the Chinese character handwriting input method, and reduces the storage space of the character library.
  • Furthermore, because a one-to-one correspondence is established between the user's radical handwriting and the Chinese character radicals, the user can write more freely and the method adapts to the user.
  • FIG. 1 is a flowchart of a method for determining a candidate word in handwriting input according to an embodiment of the present invention.
  • FIG. 2 is a flowchart of a method for implementing handwriting input by applying the method provided by the embodiment of the present invention.
  • FIG. 3 is a schematic diagram of a user inputting a radical according to an embodiment of the present invention.
  • FIG. 4 is a schematic structural diagram of an apparatus for determining a candidate word in handwriting input according to an embodiment of the present invention.
  • Detailed description
  • An embodiment of the present invention provides a method for determining a candidate word in handwriting input, the method comprising: determining radical information of a pre-input Chinese character according to the radical of that character input by the user; matching the radical handwriting in the radical information with a stored personal handwriting sample and, if the match succeeds, obtaining the radical index number corresponding to the personal handwriting sample, where the personal handwriting sample is determined by radical handwriting input by the user; and obtaining, according to the radical index number, a candidate word sequence containing the radical, and determining the candidate word of the pre-input Chinese character from that sequence.
  • An embodiment of the present invention provides a method for determining a candidate word in handwriting input. The method is applied to a smart device that includes a handwriting input device. Specific embodiments of the present invention are described in detail below with reference to the accompanying drawings:
  • Step 101: Determine the radical information of the pre-input Chinese character according to the radical of that character input by the user.
  • When inputting a Chinese character, the user first inputs its radical, and must write the radical at the position on the input device that corresponds to the radical's position in the glyph structure of the character.
  • The radical input for the Chinese character is saved as the radical information. For example, to enter the character "部", the user first enters "阝". In the glyph structure of "部", "阝" is in the right half, so when entering "部" the user writes "阝" in the right half of the input device.
  • The radical information includes the radical handwriting and the input coordinates.
  • Step 102: Match the radical handwriting in the radical information with the stored personal handwriting samples to obtain the radical index number corresponding to the matching personal handwriting sample.
  • The personal handwriting sample is determined by radical handwriting input by the user. The personal handwriting samples can store different ways of writing the same radical, which improves the system's recognition of the input handwriting.
  • The personal handwriting samples can follow changes in the user's writing habits and are updated immediately.
  • The radical index number is the index number under which the character library is searched by radical.
  • The radical index number corresponding to a personal handwriting sample is the number used in the character library indexed by radical, so the personal handwriting sample and the character library are placed in one-to-one correspondence through the radical index number.
  • Step 103: Obtain a candidate word sequence containing the radical according to the radical index number, and determine the candidate word of the pre-input Chinese character from the candidate word sequence.
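Steps 101 to 103 can be sketched end to end in Python. The sketch makes several assumptions that are not in the source: handwriting is represented as a tuple of stroke labels, matching is a plain dictionary lookup standing in for real handwriting matching, and `RadicalInfo`, `PERSONAL_SAMPLES`, `LIBRARY_BY_RADICAL`, the index number `1`, and the listed characters are illustrative only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RadicalInfo:
    """Step 101: what is recorded when the user writes a radical."""
    handwriting: tuple   # the strokes as captured (here: stroke labels)
    coordinates: tuple   # where on the input area the radical was written


# Hypothetical stores. The character library is indexed only by radical
# index number; the characters themselves carry no position code.
PERSONAL_SAMPLES = {("hook", "vertical"): 1}        # sample -> radical index number
LIBRARY_BY_RADICAL = {1: ["队", "邓", "郊", "陪", "部"]}


def radical_index(info: RadicalInfo):
    """Step 102: map the handwriting to a radical index number, or None."""
    return PERSONAL_SAMPLES.get(info.handwriting)


def candidate_sequence(info: RadicalInfo) -> list:
    """Step 103: fetch every character filed under the matched radical."""
    index = radical_index(info)
    return [] if index is None else list(LIBRARY_BY_RADICAL[index])


written = RadicalInfo(handwriting=("hook", "vertical"), coordinates=(0.8, 0.5))
print(candidate_sequence(written))
```

In this sketch the input coordinates play no part in the lookup itself, so writing the radical in a different place still retrieves the same candidates; the coordinates are kept only for later ordering.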
  • The candidate word sequence may be a sequence formed by sorting the candidate words according to sorting conditions.
  • The sorting conditions include the following:
  • Glyph structure: Chinese characters in the candidate word sequence whose glyph structure matches the structure determined from the input coordinates are arranged first.
  • Usage frequency: each user has some commonly used characters. If there are N candidate words and each has a usage frequency value, the N candidate words are sorted according to those values, such that the usage frequency value of the (M-1)th candidate word in the ranking is greater than or equal to that of the Mth candidate word, where M is an integer greater than 2 and less than or equal to N.
  • Remaining first stroke: Chinese characters whose first stroke other than the radical matches the first remaining stroke input by the user are arranged first.
  • One or two of these sorting conditions may be selected to sort the candidate words, or all three may be freely combined, to form the candidate word sequence.
  • The sorting method may be selected according to the following rules:
  • The Chinese character glyph structure is used to sort the candidate word sequence. If Chinese characters containing the radical are used frequently, the user's frequency record is used to sort the sequence. If, after inputting the radical, the user then inputs the first stroke of the remaining part of the character, that first remaining stroke is used to sort the candidate words.
  • A method for determining a candidate word in handwriting input includes:
  • Step 201: Receive the radical input by the user in the right half of the input device and record the radical information.
  • The radical information is the radical handwriting (as shown in FIG. 3) and the input coordinates.
  • Step 202: Match the radical handwriting in the radical information with the stored personal handwriting samples, and determine the radical index number corresponding to the matching personal handwriting sample.
  • Step 203: Obtain the candidate word sequence containing "阝" according to the radical index number.
  • the sequence of to-be-selected words obtained in this embodiment may include Xl ⁇ Team Deng's suburbs accompanying .
  • Because "阝" is input in the right half of the input device, the structure of the pre-input Chinese character can be further determined, so the method further includes:
  • Step 204: Obtain the input coordinates from the radical information and determine the glyph structure of the pre-input Chinese character according to the input coordinates; according to the glyph structure, the Chinese characters in the candidate word sequence whose structure matches are arranged first.
  • the order of the selected word sequences can be 1 ⁇ Deng Suburbs' team accompaniment... ⁇ .
  • Step 205: Arrange first the Chinese characters with a high usage frequency in the candidate word sequence, according to Chinese character usage frequency.
  • The user may further input the first stroke of the remaining part of the pre-input Chinese character other than the radical. If the user inputs the first remaining stroke "丶" after inputting "阝", the method further includes:
  • Step 206: According to the first remaining stroke, arrange first the candidate words whose first stroke other than the radical is the same.
  • sequence of the selected words can be arranged as 1 ⁇ suburban with Dengdu team... ⁇ .
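Step 204 in the example above, deriving the glyph structure from the input coordinates and moving matching characters to the front, can be sketched in Python. The bounding-box heuristic in `radical_side`, the assumption that y grows downward as in screen coordinates, and the per-character structure table are all hypothetical; the source does not say how the structure is derived from the coordinates.

```python
def radical_side(points, area_width, area_height):
    """Guess where the radical was written from its bounding box.

    Heuristic (an assumption): a radical that is taller than it is wide
    belongs to a left-right structure, otherwise to a top-bottom one.
    """
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    width = (max(xs) - min(xs)) / area_width
    height = (max(ys) - min(ys)) / area_height
    centre_x = (min(xs) + max(xs)) / 2 / area_width
    centre_y = (min(ys) + max(ys)) / 2 / area_height
    if height >= width:
        return "left" if centre_x < 0.5 else "right"
    return "top" if centre_y < 0.5 else "bottom"


def arrange_matching_first(chars, side_of, side):
    """Stable partition: characters whose radical sits on `side` come first."""
    return sorted(chars, key=lambda ch: side_of.get(ch) != side)


# Hypothetical structure table for characters containing "阝".
SIDE_OF = {"队": "left", "邓": "right", "郊": "right", "陪": "left"}

strokes = [(70, 10), (90, 20), (72, 90)]       # written in the right half
side = radical_side(strokes, area_width=100, area_height=100)
ordered = arrange_matching_first(["队", "邓", "郊", "陪"], SIDE_OF, side)
print(side, ordered)  # right ['邓', '郊', '队', '陪']
```

Because the partition is stable, characters within each group keep their previous relative order, so this pass can follow or precede a frequency sort without discarding it.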
  • An embodiment of the present invention further provides a device for determining a candidate word in handwriting input, comprising: an input module 401, a radical determination module 402, and a candidate word determination module 403;
  • the input module 401 is configured to determine radical information of the pre-input Chinese character according to the radical of that character input by the user, and to send the radical information to the radical determination module 402;
  • the radical determination module 402 is configured to match the radical handwriting in the radical information sent by the input module 401 with the stored personal handwriting sample and, if the match succeeds, to obtain the radical index number corresponding to the personal handwriting sample and send it to the candidate word determination module 403, where the personal handwriting sample is determined by radical handwriting input by the user and corresponds to the radical index number;
  • the candidate word determination module 403 is configured to obtain, according to the radical index number sent by the radical determination module 402, the candidate word sequence containing the radical, and to determine the candidate word of the pre-input Chinese character from that sequence.
  • The apparatus further includes: a glyph structure sorting module 404, configured to receive the candidate word sequence sent by the candidate word determination module 403, obtain the input coordinates of the radical from the radical information, determine the glyph structure of the pre-input Chinese character according to the input coordinates, and, according to the glyph structure, arrange first the Chinese characters in the candidate word sequence whose structure matches;
  • the candidate word determination module 403 is further configured to send the candidate word sequence and the radical information to the glyph structure sorting module 404.
  • The apparatus further includes: a remaining stroke sorting module 406, configured to receive the candidate word sequence sent by the candidate word determination module 403 and the first stroke of the remaining part of the pre-input Chinese character other than the radical, and to arrange first, according to that first remaining stroke, the Chinese characters in the candidate word sequence whose first remaining stroke is the same;
  • the candidate word determination module 403 is further configured to send the sorted candidate word sequence and the first remaining stroke of the pre-input Chinese character other than the radical to the remaining stroke sorting module 406;
  • the candidate word determination module 403 is further configured to receive the candidate word sequence optimized by the remaining stroke sorting module 406;
  • the remaining stroke sorting module 406 is further configured to send the optimized candidate word sequence to the candidate word determination module 403.
  • The usage frequency sorting module 405 is configured to receive the sorted candidate word sequence sent by the glyph structure sorting module 404 and, if the candidate word sequence includes N candidate words and each of the N candidate words corresponds to a usage frequency value, to sort the N candidate words according to their usage frequency values, such that the usage frequency value of the (M-1)th candidate word in the ranking is greater than or equal to that of the Mth candidate word, where M is an integer greater than 2 and less than or equal to N, and to feed the optimized candidate word sequence back to the candidate word determination module 403;
  • the glyph structure sorting module 404 is further configured to send the sorted candidate word sequence to the usage frequency sorting module 405, and the candidate word determination module 403 is further configured to receive the candidate word sequence sorted by the usage frequency sorting module 405.
  • The device provided by the embodiment of the present invention further includes: a personal handwriting sample determination module 407, configured to query, according to the radical handwriting sent by the radical determination module 402, whether the radical is contained in the existing personal handwriting samples; if it is, to send the radical index number corresponding to the personal handwriting sample to the radical determination module 402; if it is not, to query, according to the radical handwriting, whether a radical handwriting sample exists for the input radical; if one already exists, to update it to the personal handwriting sample determined by the radical handwriting; and if none exists, to save the radical handwriting in the radical information as a personal handwriting sample and add the corresponding radical index number;
  • the radical determination module 402 is specifically configured to send the radical handwriting to the personal handwriting sample determination module 407 and to receive the radical index number corresponding to the personal handwriting sample sent by the personal handwriting sample determination module 407.
  • The method and apparatus provided by the embodiments of the present invention first store personal handwriting samples generated according to the user's own handwriting habits. The character library is simple to implement and occupies little storage: the Chinese characters in the library need not be encoded, and the library only needs to be indexed by radical for easy retrieval.
  • In addition, since each user's handwriting has stable and distinct personal characteristics, using the user's own radical handwriting as the matching sample reduces the number of matches, improves matching efficiency, improves the recognition efficiency and accuracy of the Chinese character handwriting input method, and reduces the storage space of the character library. Furthermore, because a one-to-one correspondence is established between the user's radical handwriting and the Chinese character radicals, the user can write more freely and the method adapts to the user.
  • The method and apparatus provided by the embodiments of the present invention further apply the Chinese character glyph structure, the user's frequency record, and the first remaining stroke to optimize the ordering of candidate word sequences that contain the same radical, so that the sorted candidate words are more accurate and reflect the user's input habits.
  • Other embodiments obtained by a person skilled in the art in accordance with the technical solution of the present invention also fall within the scope of the technical innovation of the present invention.

Abstract

Disclosed are a method and a device for determining a candidate character in handwriting input, which are applied in the technical field of input to electronic apparatuses. The method comprises: determining, according to a radical, of a pre-input Chinese character, input by a user, radical information of the pre-input Chinese character; matching radical handwriting in the radical information with a stored personal handwriting sample, and if the matching is successful, acquiring a radical index number corresponding to the personal handwriting sample, the personal handwriting sample being determined according to handwriting of a radical input by the user and the personal handwriting sample being corresponding to a radical index number; and acquiring, according to the radical index number, a sequence of candidate characters comprising the radical, and determining a candidate character of the pre-input Chinese character from the candidate character sequence. In the method and the device of the present invention, by storing personal radical handwriting samples generated according to the user's personal handwriting habit, a character library is easily implemented and requires small storage space.

Description

一种手写输入中确定待选字的方法及装置  Method and device for determining candidate word in handwriting input
技术领域 本发明涉及电子设备输入技术领域, 尤其涉及一种手写输入中确定待 选字的方法及装置。 背景技术 TECHNICAL FIELD The present invention relates to the field of electronic device input technologies, and in particular, to a method and apparatus for determining a candidate word in handwriting input. Background technique
At present, data is entered into electronic terminals mainly by two methods: keyboard input and handwriting input. Handwriting input matches Chinese writing habits better than keyboard input, so various fast handwriting input methods have emerged.
Chinese characters are square-shaped characters with a distinctive structure. From the highest level to the lowest, the structure of a Chinese character can be divided into the whole character, components, and strokes. Accordingly, Chinese handwriting input methods fall roughly into three classes: those based on whole-character recognition, those based on component recognition, and those based on stroke recognition. A method based on whole-character recognition matches the complete character entered by the user against samples or templates in a recognition system and takes the sample with the greatest similarity and smallest difference as the recognition result; it requires many input strokes and complex recognition, so its recognition efficiency and accuracy are relatively low. A method based on stroke sequences considers only the strokes themselves and the order between them, ignoring the spatial structure of the character; although it does not demand careful writing from the user, its recognition efficiency is relatively low. A method based on component recognition identifies the component entered by the user and obtains the candidate characters that contain it; this reduces the number of strokes the user must enter and gives higher recognition efficiency and accuracy than the other two classes.
The prior art provides a fast Chinese handwriting input method implemented as follows: Chinese characters are encoded by radical and zone to form an encoded character library. Each 田-shaped grid cell in the writing area is divided into 16 radical zones. When the user enters a radical, a code string consisting of the radical code and the radical zone code is obtained, and this code string is used as the search key to find all matching characters in the encoded character library. This method can increase handwriting input speed to some extent. However, because it encodes characters by radical and zone information, the encoded character library is complex to implement and occupies considerable storage, and the user must write the radical in the correct small grid cell to find the character to be input.
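The prior-art lookup described above can be sketched roughly as follows. This is only an illustration under assumptions: the zone labels, the dictionary layout, and the entries are hypothetical stand-ins for the 16-zone encoding, which the source does not spell out.

```python
# Hypothetical encoded character library keyed by (radical, zone).
# The zone labels are illustrative stand-ins for the 16 radical zones.
ENCODED_LIBRARY = {
    ("立", "top-left"): ["部", "剖"],
    ("口", "bottom-left"): ["部"],
    ("阝", "right"): ["部", "都"],
}


def lookup(radical, zone):
    """Return the characters stored under the exact (radical, zone) key."""
    return ENCODED_LIBRARY.get((radical, zone), [])


hit = lookup("立", "top-left")   # the radical was written in the expected zone
miss = lookup("立", "left")      # same radical, different zone: nothing is found
```

Because the zone is part of the key, the same radical written in a neighbouring zone produces a different key and an empty result.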
For example, the code of the character "部" consists of three radical codes and the zone codes of the positions of those three radicals. Suppose the user writes "立" in the upper-left corner of the writing area: from the code string formed by the radical code of "立" and the upper-left zone code, the candidate sequence of characters with "立" in the upper-left position can be retrieved from the encoded character library, and "部" can then be found. If the user instead writes "立" in the left half of the writing area, the code string formed by the radical code of "立" and the left-half zone code does not retrieve a candidate sequence containing "部". This handwriting input method therefore places strict requirements on writing position: if the radical is written in the wrong position, the character to be input cannot be obtained. On handheld terminals, where the writing area is small, well-formed writing is hard to guarantee, so the method suffers from input recognition difficulties.
Summary of the Invention
The present invention provides a method and a device for determining candidate characters in handwriting input, addressing the prior-art problem that, on a handwriting input terminal, the small writing area makes well-formed writing hard to guarantee and handwriting recognition therefore difficult.
The present invention provides a method for determining candidate characters in handwriting input. The method is applied to a smart device that includes a handwriting input device, and comprises:
determining radical information of a character to be input according to the radical of that character entered by the user;
matching the radical handwriting in the radical information against stored personal handwriting samples and, if the match succeeds, obtaining the radical index number corresponding to the matched personal handwriting sample, wherein the personal handwriting samples are determined from radical handwriting entered by the user;
obtaining, by the radical index number, a candidate character sequence containing the radical, and determining candidate characters for the character to be input from the candidate character sequence.
In the above solution, after the candidate characters for the character to be input are determined from the candidate character sequence, the method further comprises:
obtaining the input coordinates of the radical from the radical information, and judging the glyph structure of the character to be input from those coordinates;
placing first, according to the glyph structure, those candidate characters that have the same glyph structure.
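The source does not say how the glyph structure is judged from the input coordinates. A minimal sketch of one possible heuristic follows; the bounding-box test, the writing-area dimensions, and the returned labels are all assumptions made for illustration, with y taken to grow downward as on a touch screen.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def infer_radical_position(points: List[Point], width: float, height: float) -> str:
    """Guess where the radical sits from the bounding box of its sampled points."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    box_w = max(xs) - min(xs)
    box_h = max(ys) - min(ys)
    center_x = (min(xs) + max(xs)) / 2
    center_y = (min(ys) + max(ys)) / 2
    # Assumption: a tall, narrow radical suggests a left-right character,
    # a wide, flat one suggests a top-bottom character.
    if box_h >= box_w:
        return "right" if center_x > width / 2 else "left"
    return "bottom" if center_y > height / 2 else "top"


# A tall radical written in the right half of a 100 x 100 writing area.
position = infer_radical_position([(70, 10), (90, 10), (70, 90)], 100, 100)
```

A real recognizer would likely need tolerance bands and more structure classes (enclosing, three-part, and so on); this sketch only separates four cases.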
In the above solution, after the candidate characters for the character to be input are determined from the candidate character sequence, the method further comprises:
if the candidate character sequence contains N candidate characters, each with a corresponding usage frequency value, sorting the N candidate characters by usage frequency value, such that the usage frequency value of the (M-1)-th candidate character in the ordering is greater than or equal to that of the M-th candidate character, where M is an integer greater than 2 and less than or equal to N, to obtain an optimized candidate character sequence.
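The frequency ordering amounts to a non-increasing sort on the usage frequency value. A minimal sketch, assuming the frequencies are held in a dictionary and that characters without a record count as zero (both are assumptions, not stated in the source):

```python
def sort_by_frequency(candidates, frequency):
    """Order candidates so that each usage frequency is >= that of the next one."""
    # sorted() is stable, so characters with equal frequency keep their input order.
    return sorted(candidates, key=lambda ch: frequency.get(ch, 0), reverse=True)


candidates = ["队", "邓", "阴", "郊", "部", "陪", "都"]
frequency = {"部": 50, "都": 80, "队": 50}  # illustrative values only
ordered = sort_by_frequency(candidates, frequency)
```

Stability matters here: it lets a frequency sort be applied on top of an earlier ordering without scrambling ties.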
In the above solution, after the candidate characters for the character to be input are determined from the candidate character sequence, if the first stroke of the remainder of the character to be input (the part other than the radical) is received, the method further comprises: placing first, according to that first stroke, those candidate characters whose remainder begins with the same first stroke, to obtain an optimized candidate character sequence.
In the above solution, the personal handwriting samples being determined from radical handwriting entered by the user comprises: after a radical entered by the user is received, determining whether the stored personal handwriting samples include the radical; if so, displaying the candidate character sequence containing the recognized radical; if not, determining the entered radical from a candidate radical sequence according to the user's input, and querying whether a radical handwriting sample already exists for the entered radical; if one exists, updating the radical handwriting sample with the personal handwriting sample; if none exists, saving the personal handwriting sample as the radical handwriting sample.
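The sample-maintenance flow above can be sketched as a small store keyed by radical index number. The class name, the `matches` callable (standing in for whatever handwriting matcher the device uses), and the one-sample-per-radical layout are assumptions made for illustration.

```python
from typing import Callable, Dict, Optional


class PersonalSampleStore:
    """Hypothetical store of personal handwriting samples, one per radical index."""

    def __init__(self, matches: Callable[[object, object], bool]):
        self._matches = matches            # assumed handwriting matcher
        self._samples: Dict[int, object] = {}

    def lookup(self, handwriting) -> Optional[int]:
        """Return the radical index of the first matching sample, if any."""
        for radical_index, sample in self._samples.items():
            if self._matches(handwriting, sample):
                return radical_index
        return None

    def learn(self, handwriting, radical_index: int) -> str:
        """Save a new sample, or update the existing one for this radical."""
        action = "updated" if radical_index in self._samples else "saved"
        self._samples[radical_index] = handwriting
        return action


store = PersonalSampleStore(matches=lambda a, b: a == b)  # toy matcher
first = store.learn("sample-a", 1)    # no sample yet for radical 1
second = store.learn("sample-b", 1)   # the user's writing habit changed
```

`learn` would be called only after `lookup` fails and the user has picked the intended radical from the candidate radical sequence.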
In accordance with the above method, the present invention also provides a device for determining candidate characters in handwriting input. The device comprises an input module, a radical determination module, and a candidate character determination module, wherein:
the input module is configured to determine radical information of the character to be input according to the radical of that character entered by the user, and to send the radical information to the radical determination module;
the radical determination module is configured to match the radical handwriting in the radical information sent by the input module against stored personal handwriting samples and, if the match succeeds, to obtain the radical index number corresponding to the matched personal handwriting sample and send it to the candidate character determination module, wherein the personal handwriting samples are determined from radical handwriting entered by the user;
the candidate character determination module is configured to obtain, by the radical index number sent by the radical determination module, a candidate character sequence containing the radical, and to determine candidate characters for the character to be input from that sequence.
In the above solution, the device further comprises:
a glyph structure sorting module, configured to receive the candidate character sequence sent by the candidate character determination module, obtain the input coordinates of the radical from the radical information, and judge the glyph structure of the character to be input from those coordinates; and, according to the glyph structure, to place first those characters in the candidate character sequence that have the same glyph structure;
correspondingly, the candidate character determination module is further configured to send the candidate character sequence and the radical information to the glyph structure sorting module.
In the above solution, the device further comprises: a usage frequency sorting module, configured to receive the sorted candidate character sequence sent by the glyph structure sorting module and, if the sequence contains N candidate characters, each with a corresponding usage frequency value, to sort the N candidate characters by usage frequency value, such that the usage frequency value of the (M-1)-th candidate character in the ordering is greater than or equal to that of the M-th candidate character, where M is an integer greater than 2 and less than or equal to N, and to send the optimized candidate character sequence to the candidate character determination module;
correspondingly, the glyph structure sorting module is further configured to send the sorted candidate character sequence to the usage frequency sorting module;
the candidate character determination module is further configured to receive the candidate character sequence optimized by the usage frequency sorting module.
In the above solution, the device further comprises: a remaining stroke sorting module, configured to receive the candidate character sequence sent by the candidate character determination module together with the first stroke of the remainder of the character to be input (the part other than the radical), to place first, according to that first stroke, those characters in the candidate character sequence whose remainder begins with the same first stroke, and to send the optimized candidate character sequence to the candidate character determination module;
correspondingly, the candidate character determination module is further configured to send the candidate character sequence to the remaining stroke sorting module and to receive the candidate character sequence optimized by the remaining stroke sorting module.
In the above solution, the device further comprises: a personal handwriting sample determination module, configured to check, according to the radical handwriting sent by the radical determination module, whether the stored personal handwriting samples include the radical; if so, to send the radical index number corresponding to the personal handwriting sample to the radical determination module; if not, to query, according to the radical handwriting, whether a radical handwriting sample already exists for the entered radical; if one exists, to update it to the personal handwriting sample determined from the radical handwriting; and if none exists, to save the radical handwriting in the radical information as a personal handwriting sample and add the radical number corresponding to that sample;
correspondingly, the radical determination module is specifically configured to send the radical handwriting to the personal handwriting sample determination module, receive the radical index number corresponding to the personal handwriting sample sent by that module, and send the radical index number to the candidate character determination module;
the candidate character determination module is configured to obtain, by the radical index number sent by the radical determination module, the candidate character sequence containing the radical.
One or two of the above technical solutions have at least the following technical effects. First, the method and device provided by the present invention store personal radical handwriting samples generated from the user's own handwriting habits; the character library is simple to implement and requires little storage, because the characters in it need not be encoded and the library only needs to be indexed by radical for easy retrieval. Second, because each user's handwriting has stable and distinctive personal characteristics, using the user's radical handwriting as the matching samples reduces the number of matching operations and improves matching efficiency, which improves the recognition efficiency and accuracy of the Chinese handwriting input method and reduces library storage. Furthermore, because a one-to-one correspondence is established between the user's radical handwriting and the radicals of Chinese characters, the user can write more freely, which makes the method user-adaptive.
Brief Description of the Drawings
FIG. 1 is a flowchart of a method for determining candidate characters in handwriting input according to an embodiment of the present invention;
FIG. 2 is a flowchart of a handwriting input procedure that applies the method provided by the embodiment of the present invention;
FIG. 3 is a schematic diagram of a radical entered by the user according to an embodiment of the present invention;
FIG. 4 is a schematic structural diagram of a device for determining candidate characters in handwriting input according to an embodiment of the present invention.
Detailed Description
An embodiment of the present invention provides a method for determining candidate characters in handwriting input. The method comprises: determining radical information of a character to be input according to the radical of that character entered by the user; matching the radical handwriting in the radical information against stored personal handwriting samples and, if the match succeeds, obtaining the radical index number corresponding to the matched personal handwriting sample, wherein the personal handwriting samples are determined from radical handwriting entered by the user; and obtaining, by the radical index number, a candidate character sequence containing the radical, and determining candidate characters for the character to be input from that sequence.
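The three steps of the method can be sketched end to end as below. The function name, the dictionary-based sample store and radical-indexed library, the index value 1, and the equality-based toy matcher are all hypothetical; the source does not specify the data layout or the matching algorithm.

```python
def candidates_for_radical(handwriting, samples, library_by_radical, matches):
    """Match the handwriting to a stored sample, then fetch characters by radical index."""
    for radical_index, sample in samples.items():
        if matches(handwriting, sample):
            # The library is indexed by radical, so no per-character code is needed.
            return list(library_by_radical.get(radical_index, []))
    return []  # no personal sample matched


# Hypothetical data: radical index 1 stands for "阝".
samples = {1: "user-stroke-data-for-阝"}
library_by_radical = {1: ["队", "邓", "阴", "郊", "部", "陪", "都"]}

found = candidates_for_radical(
    "user-stroke-data-for-阝", samples, library_by_radical, matches=lambda a, b: a == b
)
```

Note that the lookup key carries no position information, so where the radical was written affects only the later ordering of candidates, not whether they are found.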
As shown in FIG. 1, an embodiment of the present invention provides a method for determining candidate characters in handwriting input, applied to a smart device that includes a handwriting input device. Specific implementations of the present invention are described in detail below with reference to the accompanying drawings:
Step 101: Determine radical information of the character to be input according to the radical of that character entered by the user.
Specifically, in this embodiment, when inputting a Chinese character the user first enters the radical of the character to be input, as prompted by the system. When entering the radical, the user writes it at the position on the input device that corresponds to the glyph structure of the character to be input; the entered radical is saved as the radical information. For example, to input "部", the user first enters "阝"; because "阝" is in the right half of the glyph structure of "部", "阝" is written in the right half of the input device.
Here, the radical information comprises the radical handwriting and the input coordinates.
Step 102: Match the radical handwriting in the radical information against the stored personal handwriting samples, and obtain the radical index number corresponding to the matched personal handwriting sample.
Here, the personal handwriting samples are determined from radical handwriting entered by the user. The samples may store different ways of writing the same radical, which improves the system's ability to recognize the input handwriting. A personal handwriting sample can be updated immediately when the user's habit of writing a given radical changes.
The radical index number is the number of the radical in the radical-indexed character library.
After a radical entered by the user is received, the radical handwriting of that radical is determined and matched against the stored personal handwriting samples. If an identical personal handwriting sample is matched, the radical index number corresponding to that sample is returned. If none is matched, the candidate radical sequence is queried, according to the radical handwriting in the radical information, for a matching radical handwriting sample: if there is one, the existing radical handwriting sample is updated to the personal handwriting sample determined from the radical handwriting; if there is none, the radical handwriting in the radical information is saved as a personal handwriting sample, together with the radical index number corresponding to that sample.
The radical index number corresponding to a personal handwriting sample is the number of the radical in the radical-indexed character library; personal handwriting samples and the radicals in the library are placed in one-to-one correspondence through the radical number.
Step 103: Obtain, by the radical index number, a candidate character sequence containing the radical, and determine candidate characters for the character to be input from that sequence.
Here, the candidate character sequence may be a sequence formed by sorting the candidate characters according to sorting conditions;
the sorting conditions include the following:
(1) Obtain the input coordinates from the radical information and judge the glyph structure of the character to be input from them; according to the glyph structure, place first those candidate characters that have that glyph structure.
(2) Depending on usage, each user has some commonly used characters. Where there are N candidate characters, each with a corresponding usage frequency value, this embodiment also sorts the N candidate characters by usage frequency value, such that the usage frequency value of the (M-1)-th candidate character in the ordering is greater than or equal to that of the M-th candidate character, where M is an integer greater than 2 and less than or equal to N.
(3) If the first stroke of the remainder of the character to be input (the part other than the radical) is received from the user, place first those candidate characters whose remainder, excluding the radical, begins with the same first stroke.
In a specific application, any one or two of these sorting conditions may be used to sort the candidate characters, or all three may be freely combined, to form the candidate character sequence.
When forming the candidate character sequence according to the sorting conditions, the sorting method may be chosen as follows:
If the coordinates at which the user wrote the radical reflect the glyph structure of the character to be input well, sort the candidate character sequence by glyph structure. If characters containing the radical are used very frequently, sort it by the user's usage frequency record. If the user enters the first stroke of the remainder after finishing the radical, sort it by that first stroke.
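One way the three optional conditions might be combined is a single sort with a composite key, where any condition that is not available is simply left out. The priority order used here (structure, then first stroke, then frequency) is an assumption; the source leaves the combination open.

```python
def order_candidates(candidates, structure_match=None, first_stroke_match=None, frequency=None):
    """Sort candidates by whichever of the three conditions are supplied."""

    def key(ch):
        structure_rank = 0 if structure_match is not None and structure_match(ch) else 1
        stroke_rank = 0 if first_stroke_match is not None and first_stroke_match(ch) else 1
        frequency_rank = -frequency.get(ch, 0) if frequency is not None else 0
        return (structure_rank, stroke_rank, frequency_rank)

    # sorted() is stable, so untouched ties keep their library order.
    return sorted(candidates, key=key)


candidates = ["队", "邓", "阴", "郊", "部", "陪", "都"]
combined = order_candidates(
    candidates,
    structure_match=lambda ch: ch in {"邓", "郊", "部", "都"},  # "阝" on the right
    frequency={"都": 9, "部": 5},                              # illustrative values
)
```

Passing no conditions at all returns the sequence unchanged, which matches the case where the candidate sequence is used as retrieved.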
As shown in FIG. 2, when the character to be input is "部", the method for determining candidate characters in handwriting input provided by this embodiment of the present invention includes:
Step 201: Receive the radical "阝" entered by the user in the right half of the input device, and record the radical information.
The radical information comprises the radical handwriting (as shown in FIG. 3) and the input coordinates.
Step 202: Match the radical handwriting in the radical information against the stored personal handwriting samples, and determine the radical index number corresponding to the matched personal handwriting sample.
Step 203: Obtain, by the radical index number, the candidate character sequence containing "阝".
In this embodiment, the candidate character sequence obtained may include X1 {队 邓 阴 郊 部 陪 都 …}.
Because "部" has a clearly left-right glyph structure, entering "阝" in the right half of the input device further determines the structure of the character to be input. The method therefore further includes:
Step 204: Obtain the input coordinates from the radical information and judge the glyph structure of the character to be input from them; according to the glyph structure, place first those characters in the candidate character sequence that have that glyph structure.
After this further optimization, the candidate character sequence may be ordered as {邓 郊 部 都 队 阴 陪 …}.
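The reordering in step 204 behaves like a stable partition: characters with "阝" on the right move to the front, and the relative order within each group is kept. The sketch below reproduces the sequence given above; the side table is written out by hand for these seven characters and is an illustrative stand-in for whatever structure data the library holds.

```python
# Side on which "阝" appears in each candidate (hand-written illustrative table).
RADICAL_SIDE = {
    "队": "left", "邓": "right", "阴": "left", "郊": "right",
    "部": "right", "陪": "left", "都": "right",
}


def prioritize(candidates, predicate):
    """Stable partition: matching characters first, original order kept in each group."""
    first = [ch for ch in candidates if predicate(ch)]
    rest = [ch for ch in candidates if not predicate(ch)]
    return first + rest


x1 = ["队", "邓", "阴", "郊", "部", "陪", "都"]
by_structure = prioritize(x1, lambda ch: RADICAL_SIDE[ch] == "right")
```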
If characters containing the radical are used very frequently, the method further includes:
Step 205: According to character usage frequency, place first those characters in the candidate character sequence that are used most often.
If the character to be input needs to be located more precisely, the user may also enter the first stroke of the remainder of the character (the part other than the radical). If, after entering "阝", the user enters the remaining first stroke "丶", the method further includes:
Step 206: According to that first stroke, place first those candidate characters whose remainder, excluding the radical, begins with the same first stroke.
After this further optimization, the candidate character sequence may be ordered as {郊 部 陪 邓 都 队 阴 …}.
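Step 206 can be illustrated in the same way with a stable sort on a boolean key. The first-stroke table below is written by hand for the seven example characters (the first stroke of each character's non-radical part) and is only an illustration of the data such a step would need.

```python
# First stroke of the part of each character other than "阝" (illustrative table).
FIRST_REMAINING_STROKE = {
    "邓": "㇇",  # 又
    "郊": "丶",  # 交
    "部": "丶",  # 咅
    "都": "一",  # 者
    "队": "丿",  # 人
    "阴": "丿",  # 月
    "陪": "丶",  # 咅
}


def prioritize_by_first_stroke(candidates, stroke):
    """Put characters whose remainder starts with `stroke` first, keeping order otherwise."""
    # False sorts before True, and sorted() is stable.
    return sorted(candidates, key=lambda ch: FIRST_REMAINING_STROKE.get(ch) != stroke)


after_structure = ["邓", "郊", "部", "都", "队", "阴", "陪"]
by_stroke = prioritize_by_first_stroke(after_structure, "丶")
```

Starting from the structure-ordered sequence, this yields the sequence given above, with "部" now in second place.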
As shown in FIG. 4, corresponding to the method shown in FIG. 1, an embodiment of the present invention further provides a device for determining candidate characters in handwriting input. The device comprises an input module 401, a radical determination module 402, and a candidate character determination module 403, wherein:
the input module 401 is configured to determine radical information of the character to be input according to the radical of that character entered by the user, and to send the radical information to the radical determination module 402;
the radical determination module 402 is configured to match the radical handwriting in the radical information sent by the input module 401 against the stored personal handwriting samples and, if the match succeeds, to obtain the radical index number corresponding to the matched personal handwriting sample and send it to the candidate character determination module 403, wherein the personal handwriting samples are determined from radical handwriting entered by the user, and each personal handwriting sample corresponds to a radical index number;
the candidate character determination module 403 is configured to obtain, by the radical index number sent by the radical determination module 402, a candidate character sequence containing the radical, and to determine candidate characters for the character to be input from that sequence.
Because the user enters the radical according to the glyph structure of the character to be input, the device provided by this embodiment of the present invention further comprises: a glyph structure sorting module 404, configured to receive the candidate character sequence sent by the candidate character determination module 403 and the input coordinates in the radical information, to judge the glyph structure of the character to be input from those coordinates, and, according to the glyph structure, to place first those characters in the candidate character sequence that have the same glyph structure. Correspondingly, the candidate character determination module 403 is further configured to send the candidate character sequence and the radical information to the glyph structure sorting module 404.
In addition, if the user enters the first stroke of the remainder after finishing the radical, the candidate character sequence can be sorted by that first stroke. The device provided by this embodiment of the present invention further comprises: a remaining stroke sorting module 406, configured to receive the candidate character sequence sent by the candidate character determination module 403 together with the first stroke of the remainder of the character to be input (the part other than the radical), and, according to that first stroke, to place first those characters in the candidate character sequence whose remainder begins with the same first stroke. Correspondingly, the candidate character determination module 403 is further configured to send the sorted candidate character sequence and the first stroke of the remainder of the character to be input to the remaining stroke sorting module 406.
The candidate character determination module 403 is further configured to receive the candidate character sequence optimized by the remaining stroke sorting module 406; correspondingly, the remaining stroke sorting module 406 is further configured to send the optimized candidate character sequence to the candidate character determination module 403.
Because each user, depending on usage, has some commonly used characters, the device provided by this embodiment of the present invention further comprises:
a usage frequency sorting module 405, configured to receive the sorted candidate character sequence sent by the glyph structure sorting module 404 and, if the sequence contains N candidate characters, each with a corresponding usage frequency value, to sort the N candidate characters by usage frequency value, such that the usage frequency value of the (M-1)-th candidate character in the ordering is greater than or equal to that of the M-th candidate character, where M is an integer greater than 2 and less than or equal to N, and to feed the sorted candidate character sequence back to the candidate character determination module 403. Correspondingly, the glyph structure sorting module 404 is further configured to send the sorted candidate character sequence to the usage frequency sorting module 405, and the candidate character determination module 403 is further configured to receive the candidate character sequence optimized by the usage frequency sorting module 405.
Because each user's handwriting has stable and distinct personal features, using the user's radical handwriting as the matching sample reduces the number of matching operations and improves matching efficiency; the apparatus provided by the embodiment of the present invention therefore further includes:
a personal handwriting sample determining module 407, configured to query, according to the radical handwriting sent by the radical determining module 402, whether the stored personal handwriting samples include the radical. If they do, the module sends the radical index number corresponding to the personal handwriting sample to the radical determining module 402. If they do not, the module queries, according to the radical handwriting, whether a radical handwriting sample already exists for the input radical: if one exists, it is updated to the personal handwriting sample determined by the radical handwriting; if none exists, the radical handwriting in the radical information is saved as a personal handwriting sample and the corresponding radical number is added. Correspondingly, the radical determining module 402 is specifically configured to send the radical handwriting to the personal handwriting sample determining module 407 and to receive the radical index number corresponding to the personal handwriting sample sent by the personal handwriting sample determining module 407.
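The lookup-then-learn behaviour described for module 407 can be sketched as follows. This is only an illustration under assumptions: the class and method names, the representation of handwriting as a list of (x, y) points, the toy `close_enough` matcher with its tolerance, and the radical index number 85 are hypothetical; the embodiment does not prescribe a matching algorithm.

```python
def close_enough(a, b, tol=5.0):
    """Toy matcher (assumption): same point count, every point within `tol`."""
    if len(a) != len(b):
        return False
    return all(abs(ax - bx) <= tol and abs(ay - by) <= tol
               for (ax, ay), (bx, by) in zip(a, b))


class PersonalSampleStore:
    """Keeps one personal handwriting sample per radical index number."""

    def __init__(self, matcher):
        self._samples = {}       # radical index number -> stored handwriting
        self._matcher = matcher  # callable(handwriting, sample) -> bool

    def lookup(self, handwriting):
        """Return the radical index number of a matching sample, else None."""
        for index, sample in self._samples.items():
            if self._matcher(handwriting, sample):
                return index
        return None

    def learn(self, radical_index, handwriting):
        """Update the sample for a known radical, or add one for a new radical."""
        status = "updated" if radical_index in self._samples else "added"
        self._samples[radical_index] = handwriting
        return status


store = PersonalSampleStore(close_enough)
sample = [(0, 0), (10, 10), (20, 0)]
print(store.lookup(sample))                              # prints None
print(store.learn(85, sample))                           # prints added
print(store.lookup([(1, 1), (11, 9), (19, 2)]))          # prints 85
```

In this sketch `learn` would be called only after `lookup` fails and the radical has been identified by other means, for example by the user picking it from a list of candidate radicals.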
The one or more technical solutions above in the embodiments of the present application have at least the following technical effects:
The method and apparatus provided by the embodiments of the present invention first store personal radical handwriting samples generated from the user's own handwriting habits. The font library is simple to implement and occupies little storage: the Chinese characters in the library need not be encoded, and the library only has to be indexed by radical for convenient retrieval. In addition, because each user's handwriting has stable and distinct personal features, using the user's radical handwriting as the matching sample reduces the number of matching operations, improves matching efficiency, improves the recognition efficiency and accuracy of the Chinese handwriting input method, and reduces the storage space of the font library. Furthermore, because a one-to-one correspondence is established between the radical handwriting written by the user and the Chinese character radicals, the user can write more freely, and the scheme adapts to the user.
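A font library indexed by radical, as described above, can be sketched as a mapping from radical index number to the characters containing that radical. The function names and the index numbers 1 and 2 below are assumptions for illustration only; the embodiment does not define a concrete numbering or storage layout.

```python
from collections import defaultdict


def build_radical_index(radical_of):
    """Group characters by radical index number.

    `radical_of` maps each character to the (hypothetical) index number of
    its radical. Characters keep their insertion order within each group.
    """
    index = defaultdict(list)
    for char, radical_index in radical_of.items():
        index[radical_index].append(char)
    return dict(index)


def candidates_for(library, radical_index):
    """Return the candidate character sequence for a radical index number."""
    return list(library.get(radical_index, []))


# Illustrative data only: 1 stands for the water radical, 2 for the hand radical.
library = build_radical_index({"江": 1, "河": 1, "打": 2, "汉": 1, "找": 2})
print(candidates_for(library, 1))  # prints ['江', '河', '汉']
print(candidates_for(library, 3))  # prints []
```

Once the radical index number has been obtained from a matched personal handwriting sample, a single dictionary lookup yields the candidate character sequence, so the characters themselves need no separate encoding.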
In addition, the method and apparatus provided by the embodiments of the present invention also use the glyph structure of the Chinese character, the user's frequency records, and the first stroke of the remaining part of the character to optimize the ordering of the candidate character sequence containing the same radical, so that the sorted candidate characters are more accurate and reflect the user's input habits. Other embodiments derived by those skilled in the art from the technical solution of the present invention likewise fall within the scope of technical innovation of the present invention.
Obviously, those skilled in the art can make various changes and variations to the present invention without departing from its spirit and scope. Thus, if these modifications and variations fall within the scope of the claims of the present invention and their technical equivalents, the present invention is also intended to include them.

Claims

1. A method for determining candidate characters in handwriting input, the method being applied to a smart device that includes a handwriting input device, characterized in that the method comprises:
determining radical information of the Chinese character to be input according to the radical of that character input by the user;
matching the radical handwriting in the radical information against stored personal handwriting samples and, if the match succeeds, obtaining the radical index number corresponding to the matched personal handwriting sample; wherein the personal handwriting sample is determined by radical handwriting input by the user, and the personal handwriting sample corresponds to a radical index number;
obtaining, by the radical index number, a candidate character sequence containing the radical, and determining the candidate characters of the Chinese character to be input from the candidate character sequence.
2. The method according to claim 1, characterized in that, after the candidate characters of the Chinese character to be input are determined from the candidate character sequence, the method further comprises:
obtaining the input coordinates of the radical from the radical information, and determining the glyph structure of the Chinese character to be input according to the input coordinates;
according to the glyph structure, ranking first those candidate characters that have the same glyph structure.
3. The method according to claim 1, characterized in that, after the candidate characters of the Chinese character to be input are determined from the candidate character sequence, the method further comprises:
if the candidate character sequence includes N candidate characters and each of the N candidate characters has a corresponding usage frequency value, sorting the N candidate characters by their usage frequency values, wherein the usage frequency value of the (M-1)th candidate character in the ordering is greater than or equal to that of the Mth candidate character, M being an integer greater than 2 and less than or equal to N, to obtain an optimized and sorted candidate character sequence.
4. The method according to claim 1, characterized in that, after the candidate characters of the Chinese character to be input are determined from the candidate character sequence, if the first stroke of the remaining part of the Chinese character to be input, other than the radical, is received, the method further comprises:
according to that first remaining stroke, ranking first those candidate characters whose first remaining stroke is the same, to obtain an optimized and sorted candidate character sequence.
5. The method according to any one of claims 1 to 4, characterized in that the personal handwriting sample being determined by radical handwriting input by the user comprises:
after a radical input by the user is received, determining whether the stored personal handwriting samples include the radical; if so, displaying the candidate character sequence containing the recognized radical; if not, determining the input radical from a candidate radical sequence according to the user's input information, and querying whether a radical handwriting sample already exists for the input radical; if a radical handwriting sample already exists for the input radical, updating the radical handwriting sample with the personal handwriting sample; if no radical handwriting sample exists for the input radical, saving the personal handwriting sample as the radical handwriting sample.
6. An apparatus for determining candidate characters in handwriting input, characterized in that the apparatus comprises: an input module, a radical determining module, and a candidate character determining module; wherein
the input module is configured to determine radical information of the Chinese character to be input according to the radical of that character input by the user, and to send the radical information to the radical determining module;
the radical determining module is configured to match the radical handwriting in the radical information sent by the input module against stored personal handwriting samples and, if the match succeeds, to obtain the radical index number corresponding to the matched personal handwriting sample and send the radical index number to the candidate character determining module; wherein the personal handwriting sample is determined by radical handwriting input by the user, and the personal handwriting sample corresponds to a radical index number;
the candidate character determining module is configured to obtain, by the radical index number sent by the radical determining module, a candidate character sequence containing the radical, and to determine the candidate characters of the Chinese character to be input from the candidate character sequence.
7. The apparatus according to claim 6, characterized in that the apparatus further comprises:
a glyph structure sorting module, configured to receive the candidate character sequence and the radical information sent by the candidate character determining module, to obtain the input coordinates of the radical from the radical information, to determine the glyph structure of the Chinese character to be input according to the input coordinates, and, according to the glyph structure, to rank first those characters in the candidate character sequence that have the same glyph structure;
correspondingly, the candidate character determining module is further configured to send the candidate character sequence and the radical information to the glyph structure sorting module.
8. The apparatus according to claim 6, characterized in that the apparatus further comprises:
a usage frequency sorting module, configured to receive the sorted candidate character sequence sent by the glyph structure sorting module; if the candidate character sequence includes N candidate characters and each of the N candidate characters has a corresponding usage frequency value, to sort the N candidate characters by their usage frequency values, wherein the usage frequency value of the (M-1)th candidate character in the ordering is greater than or equal to that of the Mth candidate character, M being an integer greater than 2 and less than or equal to N; and to send the optimized and sorted candidate character sequence to the candidate character determining module;
correspondingly, the glyph structure sorting module is further configured to send the sorted candidate character sequence to the usage frequency sorting module;
the candidate character determining module is further configured to receive the candidate character sequence optimized and sorted by the usage frequency sorting module.
9. The apparatus according to claim 6, characterized in that the apparatus further comprises:
a remaining stroke sorting module, configured to receive the candidate character sequence sent by the candidate character determining module together with the first stroke of the remaining part of the Chinese character to be input, other than the radical; according to that first remaining stroke, to rank first those candidate characters whose first remaining stroke is the same; and to send the optimized and sorted candidate character sequence to the candidate character determining module;
correspondingly, the candidate character determining module is further configured to send the candidate character sequence to the remaining stroke sorting module and to receive the candidate character sequence optimized and sorted by the remaining stroke sorting module.
10. The apparatus according to any one of claims 6 to 9, characterized in that the apparatus further comprises:
a personal handwriting sample determining module, configured to check, according to the radical handwriting sent by the radical determining module, whether the stored personal handwriting samples include the radical; if they do, to send the radical index number corresponding to the personal handwriting sample to the radical determining module; if they do not, to query, according to the radical handwriting, whether a radical handwriting sample already exists for the input radical; if one exists, to update it to the personal handwriting sample determined by the radical handwriting; if none exists, to save the radical handwriting in the radical information as a personal handwriting sample and add the radical number corresponding to the personal handwriting sample;
correspondingly, the radical determining module is specifically configured to send the radical handwriting to the personal handwriting sample determining module, to receive the radical index number corresponding to the personal handwriting sample sent by the personal handwriting sample determining module, and to send the radical index number to the candidate character determining module;
the candidate character determining module is configured to obtain, by the radical index number sent by the radical determining module, the candidate character sequence containing the radical.
PCT/CN2011/084849 2011-09-29 2011-12-28 Method and device for determining candidate character in handwriting input WO2012152039A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201110298039.2A CN102360265B (en) 2011-09-29 2011-09-29 Method and device for determining candidate character in handwriting input
CN201110298039.2 2011-09-29

Publications (1)

Publication Number Publication Date
WO2012152039A1 true WO2012152039A1 (en) 2012-11-15

Family

ID=45585599

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2011/084849 WO2012152039A1 (en) 2011-09-29 2011-12-28 Method and device for determining candidate character in handwriting input

Country Status (2)

Country Link
CN (1) CN102360265B (en)
WO (1) WO2012152039A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104007836B (en) * 2014-05-07 2018-03-16 惠州Tcl移动通信有限公司 A kind of processing method and terminal device of handwritten word input
US10095673B2 (en) 2014-11-17 2018-10-09 Lenovo (Singapore) Pte. Ltd. Generating candidate logograms
CN104765837B (en) * 2015-04-16 2019-09-13 刘立德 The inspection of Chinese Character first row and information processing method
CN107870678A (en) * 2016-09-26 2018-04-03 中兴通讯股份有限公司 A kind of hand-written inputting method and device
CN114237484A (en) * 2020-09-09 2022-03-25 北京搜狗科技发展有限公司 Handwriting input recognition method and device, electronic equipment and medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1995008158A1 (en) * 1993-09-17 1995-03-23 Fficiency Software, Inc. Universal symbolic handwriting recognition system
US6041137A (en) * 1995-08-25 2000-03-21 Microsoft Corporation Radical definition and dictionary creation for a handwriting recognition system
CN101276249A (en) * 2007-03-30 2008-10-01 北京三星通信技术研究有限公司 Method and device for forecasting and discriminating hand-written characters
CN101354749A (en) * 2007-07-24 2009-01-28 夏普株式会社 Method for making dictionary, hand-written input method and apparatus

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101281449B (en) * 2007-04-03 2013-03-06 诺基亚(中国)投资有限公司 Hand-written character recognizing method and system

Also Published As

Publication number Publication date
CN102360265A (en) 2012-02-22
CN102360265B (en) 2017-11-03

Similar Documents

Publication Publication Date Title
CN108491433B (en) Chat response method, electronic device and storage medium
JP5860171B2 (en) Input processing method and apparatus
US20080294982A1 (en) Providing relevant text auto-completions
WO2021169718A1 (en) Information acquisition method and apparatus, electronic device, and computer-readable storage medium
CN112035730B (en) Semantic retrieval method and device and electronic equipment
JP2007317022A (en) Handwritten character processor and method for processing handwritten character
JP2009524852A5 (en)
CN110413764B (en) Long text enterprise name recognition method based on pre-built word stock
CN110427483B (en) Text abstract evaluation method, device, system and evaluation server
CN107992523B (en) Function option searching method of mobile application and terminal equipment
WO2012152039A1 (en) Method and device for determining candidate character in handwriting input
CN105209858B (en) Uncertainty disambiguation and matching of business location data
WO2021098794A1 (en) Text search method, device, server, and storage medium
CN108351876A (en) System and method for point of interest identification
CN111198936B (en) Voice search method and device, electronic equipment and storage medium
JP2012094117A (en) Method and system for marking arabic language text with diacritic
CN111259170A (en) Voice search method and device, electronic equipment and storage medium
CN101405693A (en) Personal synergic filtering of multimodal inputs
CN112988784B (en) Data query method, query statement generation method and device
CN100541522C (en) The method and apparatus that is used for recognition of handwritten patterns
CN111602129B (en) Smart search for notes and ink
CN112989011B (en) Data query method, data query device and electronic equipment
CN103761294A (en) Handwritten track and speech recognition based query method and device
US20130332824A1 (en) Embedded font processing method and device
CN109635075B (en) Method and device for marking word-dividing marks on text contents

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application. Ref document number: 11865418; Country of ref document: EP; Kind code of ref document: A1
NENP Non-entry into the national phase. Ref country code: DE
122 Ep: pct application non-entry in european phase. Ref document number: 11865418; Country of ref document: EP; Kind code of ref document: A1