CN102184034B - Chinese character input method based on retrieval elements - Google Patents

Chinese character input method based on retrieval elements Download PDF

Info

Publication number
CN102184034B
CN102184034B CN201110143478.6A CN201110143478A CN102184034B CN 102184034 B CN102184034 B CN 102184034B CN 201110143478 A CN201110143478 A CN 201110143478A CN 102184034 B CN102184034 B CN 102184034B
Authority
CN
China
Prior art keywords
code
word
current block
stroke
unit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201110143478.6A
Other languages
Chinese (zh)
Other versions
CN102184034A (en
Inventor
丁恩明
马居里
丁镭
丁威
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201110143478.6A priority Critical patent/CN102184034B/en
Publication of CN102184034A publication Critical patent/CN102184034A/en
Application granted granted Critical
Publication of CN102184034B publication Critical patent/CN102184034B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

The invention discloses a Chinese character input method based on retrieval elements, which has high choice utilization rate. Code elements according with the public character-finding habit are standardized; and mapping symbols are optimized, a retrieval sequence is optimized, and a simple and recyclable retrieval rule is formulated, therefore, the difficulty of single acoustic code input is solved. In the Chinese character input method, coding space is fully utilized, input code length is compressed, and hit rate of single-screen characters or words is increased. Because i, v and u are increased as new generalized acoustic codes, full-spelling syllable rimes are discarded, a single acoustic code input mode is adopted, coding space utilization rate and input efficiency are greatly increased; and single-screen dialogue is adopted for determining characters or words and avoiding page turning. According to the invention, more than 30 memory points and simple rules are only increased, as long as first-order words are held, tens of thousands of Chinese characters can be conveniently driven, and sequence retrieval and input of Chinese characters are integrated naturally.

Description

A kind of input method of Chinese character based on retrieval elements
Technical field
The present invention relates to a kind of input method of Chinese character.
Technical background
At present, intelligent phonetic letter is the popular main flow input method generally using.
Input mode taking syllable as base unit, need not learn although have, the advantage of just bringing for use, and the disappearance existing aspect as follows, needs to improve greatly and promote:
It is low that the head of 1 word, word shields hit rate, not only made us helpless, and affected input speed by page turning, and one of its reason is exactly that the inborn space encoder utilization rate of phonetic code is low.
The single, double finger input of 2 mobile phones, panel computer, has higher requirement to input code length, must shorten the tediously long input code string of phonetic word, to promote input efficiency.
3 high in the clouds data, for word storehouse provides great convenience, and are no more than the words table of 9 position index codes, are convenient to efficient, the storehouse, high speed high in the clouds of configuration ten million and even hundred million order of magnitude entries.
The difficult word that 4 masses use once in a while, the Chinese character that professional person uses and Japan and Korea S's Chinese character, the simplified complex form of Chinese characters, all must be realized by same input method.
Input mode taking single acoustic code as base unit, is the ideal technology path of bar, although go through trial for many years, fails all the time to achieve one's goal, and its main cause is exactly that micro-Chinese characters cannot crack.
Technical scheme
The present invention seeks to openly a kind of input method of Chinese character based on retrieval elements.The present invention increases i, v, u, as new broad sense acoustic code, gives up the simple or compound vowel of a Chinese syllable code of spelling, and the single acoustic code input mode that adopts input code to be no more than 9 is come typing word and word, has greatly promoted space encoder utilization rate and input efficiency; Take solely to shield dialog box and determine word or word, thoroughly do away with page turning;
Simple and clear, clear for what the present invention was narrated, do not produce ambiguity and misread, to term used herein, the special agreement as follows of doing:
1 Chinese characters: the figure by the stroke institute framework of determining is referred to as, containing single stroke, no matter pronunciation whether.
2 everyday characters: refer to the word in GB-2312 (80) character set.Be primary word (3755 word) and
The summation of secondary word (3008 word)
3 marking-ups, doubt word: target Chinese characters that wish is got by input method, do not know its pronunciation or without mark
The marking-up of accurate pronunciation.
4 standard pronunciations: the three kinds of GB pronunciations of China, Japan and Korea that refer to Chinese characters.
6 acoustic codes: 26 English alphabets are all the acoustic codes of broad sense.
8 hit word: the word that meets definition.
9 preconditions: the not breakdown of original stroke, crossing stroke is not taken apart, and the stroke of joining is detachable.
10 definition: fill a part necessary condition.
11 unloadings: deletion physically or meditate in conceal unloading element, all regard unloading as
12 auxiliary drawing: the auxiliary introducing physically or in meditating, all regard auxiliary drawing as.
13 current blocks: before unloading, the Chinese characters after unloading, is all called current block.
14 frameworks: structural framing.
15 connect a frame: by intersecting, joining, or handing-over coexist and seamless link up more than 1 stroke
And more than the continuous stroke framework frame of 1 angle, be called for short and connect a frame, wherein bag
Draw together the angle of folded pen picture itself.
16 most circles: company's frame of choosing maximum magnitude and uniting framework.
17 tangent lines: between two strokes that company's penholder structure frame of circle does not hold together to close to the greatest extent, follow plane geometry
The straight line of tangent rule in.
18 standard word: the trend of the Chinese characters stroke relating to herein and mutual position, without exception with
National standard Song typeface maximum number font is as the criterion
19 transverse axis, vertical pivot: the i.e. X-axis of plane right-angle coordinate, Y-axis.
20 associated intersecting: first stroke is with to enclose frame crossing, though second stroke is direct with to enclose frame crossing, because of with
First stroke intersect, second pen just by first pen with enclose frame formed associated crossing,
Multiple associated by that analogy.
21 weldering pictures: follow the rule of " stroke of intersection is not torn open without exception ", every itself participation
Corral, but because with to enclose frame stroke directly crossing, or directly add associated phase
Hand over and all must and enclose the stroke of frame consubstantiality unloading, all having the crosslinked weldering of enclosing frame made to order
Body stroke, is called for short weldering picture.
22 parts, containing word: herein alleged parts, containing word, be all the part in current block, its
Stroke number is less than total stroke number of current block, and not with remaining stroke phase
Hand over, join but can have.
Cardinal principle
Introduce texture, draw numerical value element, by the statistical data analysis of magnanimity, selected utilization factor is high, meets the code element of popular searching custom, by its standardization, pithy formula; Optimize sorted order; Optimize the mapping symbol without standard pronunciation code element, formulate search rule brief and that can be recycled, by sieve regular inspection;
Code element and mapping thereof
One full other parts element, is called for short full unit or full.
Definition A:
1 when radical part Epileptic, Rui, Bing, Fu, Jie, jin, lonely, si, Niu, Woo, chi, Cannibals, , the-Fan, yin, or insect without feet or legs, wherein any one, while structurally having taken the leftmost side of current block or the rightmost side, this radical is just decided to be the full unit of current block; If the leftmost side and the rightmost side of the separation of two radicals in above-listed current blocks, only select the full unit of the leftmost side;
2 full units respectively according to, b ← Epileptic, d ← Rui or Bing, e ← Fu, or Jie, i ← j ← Jin or lonely, l ← Si or n ← Niu, p ← Woo or q ← r ← or Chi, s ← Cannibals or , w ← The-Fan, x ← y ← z ← yin, or the corresponding relation of insect without feet or legs mapping
3 full first sequence numbers are 1, mnemonic(al) pithy formula: the full limit of radical structure; Refer to the full unit of accompanying drawing 1 example;
Two elements of rounding up and hunt, are called for short and enclose unit or enclose.
Definition B:
1 encloses to the greatest extent territory: connect a frame at the not closed framework of most circle and do not hold together to close between the two-stroke of end, auxiliaryly draw one
The not virtual tangent crossing with any stroke of this company's frame, the empty sealing of the most circle frame inner region of being mended Feng Suocheng by virtual tangent, is called most circle territory; Refer to the auxiliary dotted line example of drawing in accompanying drawing 1, auxiliaryly draw rear judgement and have no the example of omission, and each example of whether enclosing to the greatest extent;
2 sternly enclose: in current block, not only have a company's frame that encloses to the greatest extent Closed Architecture, be called for short and close to the greatest extent frame; And, after closing to the greatest extent frame stroke, or except after closing to the greatest extent frame stroke and associated weldering picture, also separately there is other stroke, and this all other strokes, have no the ground of omission and be all contained in Jin Bi frame circle; Close to the greatest extent the continuous stroke of frame, or close to the greatest extent frame continuous stroke and associated weldering picture be exactly current block sternly enclose unit;
Leakage is enclosed: sternly not enclosing in first current block, not only have a company's frame that encloses to the greatest extent not closed framework, be called for short most not frame; And, after most not frame stroke, or except after most not frame stroke and associated weldering picture, also separately there is other stroke, and this all other strokes, while having no to omit in the Jin Quan territory circle that is all contained in this most not frame; The most continuous stroke of frame not, or frame continuous stroke and associated weldering picture are not exactly that unit is enclosed in the leakage of current block to the greatest extent; Both enclose unit at general designation, and the stroke being enclosed is called hunts picture;
Sternly enclose unit or the leakage of 3 various frameworks are enclosed unit and are all mapped in the w key mapping of keyboard;
4 because expire unit and enclose unit and cannot exist simultaneously in current block, be listed as 1, mnemonic(al) pithy formula: frame is rounded up and hunt and entirely contained therefore enclose first sequence number; Referring to accompanying drawing 1 sternly encloses example, defines by virtual tangent whether milli exhaustively leaks and encloses example; And mouth in accompanying drawing 1, Shen, recessed, western, the sixth of the twelve Earthly Branches, each example was to only have frame or only have frame and weldering picture and containing remaining picture, therefore just do not exist and enclose unit;
Three left word parts elements, are called for short left unit or a left side.
Definition C:
The leftmost side of 1 current block is a word parts that have standard pronunciation, if while being all positioned at the right side of this word except all the other strokes of this word, this left word is just decided to be the left unit of current block; Comprising having the left unit of joining with all the other strokes;
2 left yuan be mapped in separately with the keyboard key-position of himself acoustic code with symbol on, shine upon from sound;
The sequence number of 3 left yuan is 2, mnemonic(al) pithy formula: all right existing more than a left side; Refer to 1 left yuan of example of accompanying drawing;
Four cloud parts elements, are called for short cloud unit or cloud.
Definition D:
If 1 parts Tou, Lv, Http, zhao, si in any one, occupy alone the top layer of current block, and when all the other strokes of this parts are all under top layer plane, these parts are exactly the cloud unit of current block;
2 cloud units respectively according to, a ← Tou or Lv, b ← Http, or e ← huo , f ← or Zhao, l ← or o ← or p ← , or q ← or s ← Si, x ← corresponding relation mapping
The sequence number of 3 cloud units is 3, mnemonic(al) pithy formula: the cloud layer exhibition of being left; Refer to accompanying drawing 1 cloud unit example;
Five parts elements not, are not called for short first or not.
Definition E:
In 1 current block, contain the order that independently writes containing word or parts, if it is not the left unit of current block, and, when the interior stroke without any other of these parts, should be exactly the not first of current block containing word or this parts; If both coexist, only select the former;
2 not first orders or , be all mapped in Europe of keyboard, in O key mapping;
3 not first sequence numbers are 4, mnemonic(al) pithy formula: order not left another code Europe, pronounces: the not left another code of order wood Europe; Refer to the not first example of accompanying drawing 1;
The six word parts elements that fall, are called for short to fall unit or fall, be i.e. vector font.
Definition F:
1 under precondition, and the unit that falls do not stop by other stroke, can vertical translation go out current block lower limb containing word; Fall unit be some shift out stroke in word minimum and be no less than 3 strokes have a standard pronunciation that containing word, comprising with separately have stroke to have to join containing word; When what if having, multiple strokes equated hits word, only choosing with the row leftmost side or the irregular vigour of style in writing the most close lower end containing word;
2 fall unit be mapped in separately with the keyboard key-position of himself acoustic code with symbol on, shine upon from sound
3 first sequence numbers that fall are 5;
Seven liters of word parts elements, are called for short and rise unit or rise i.e. vector font.
Definition G:
1 under precondition, and rising unit is not stopped by other stroke, can vertical translation go out current block coboundary containing word; Rise unit and be some shift out stroke in word minimum and be no less than 3 strokes have a standard pronunciation that containing word, comprising with the word that contains that separately has stroke to have to join; When what if having, multiple strokes equated hits word, only choosing with the row leftmost side or the irregular vigour of style in writing the most close upper end containing word;
2 liters of units be mapped in separately with the keyboard key-position of himself acoustic code with symbol on, shine upon from sound;
The sequence number of 3 liters of units is 6, falls, rises first mnemonic(al) pithy formula: fall to rising few three and draw peak Stroll, select in the vicinity with misarrangement peak; Refer to accompanying drawing 1 fall unit, rise unit example;
Certain element defined above, if the retrieval elements of current block, after code fetch, this element must be fulfiled unloading, therefore they also all have another name called as dynamic class element, after dynamic element is unloaded, must present the current block after element variation; The mapping code another name of dynamic class element is activity code, and activity code item indicates with small letter d in formula; And each element defining below all has another name called as static elements, the mapping code another name of static class element is quiet code;
Eight core Character tables, are called for short core unit or core.
Definition H:
1 fulfiled unloading, and not containing the current block of any dynamic element, if it is that while having the word of standard pronunciation, this word is just decided to be core unit;
2 core units be mapped in separately with the keyboard key-position of himself acoustic code with symbol on, shine upon from sound; Core code item indicates with small letter h in formula;
The sequence number of 3 core units is 7;
The texel of nine current blocks, is called for short line unit or line.
Definition I:
1 is not all decided to be dust line unit containing intersecting the various current block of stroke, is all decided to be excellent line unit and contain the various current block that intersects stroke, and both are referred to as line unit, therefore any current block, line unit can be empty; The line code item that perseverance has indicates with capitalization w in formula;
2 various dust lines units or excellent line unit are all mapped in respectively in the i key mapping or u key mapping of keyboard;
The sequence number of 3 line units is 8;
Three class stroke numerical value elements in ten current blocks, referred to as several units or number.
Definition J:
The stroke of Chinese character is divided three classes: be referred to as axle with transverse axis or the completely parallel stroke of vertical pivot and draw, wherein do not comprise the stroke of little time hook of tail end band; The folding picture being commonly called as, comprising the stroke of little time hook of tail end band; Do not belong to front two classes and remaining various stroke merger is called oblique picture; With tiltedly drawing successively, folding is drawn, axle is drawn single class total number, be called for short tiltedly, the value of folding, axle inputs the number of drawing of all kinds of strokes of current block, every class has value certainly;
Definition K:
If in 1 current block the total numerical value of a certain single class stroke be 0,1,2,3 or be greater than 3 several time, zero stroke, stroke, two strokes, three strokes or many divisions are not several units of its value;
2 several yuan respectively according to the corresponding relation mapping of, L ← zero stroke, Y ← stroke, E ← bis-stroke, S ← tri-stroke or D ← many strokes, and L is placeholder; Can not indicate with the s item of capitalization for three empty figure place units breviary in formula;
The sequence number of 3 several yuan is 9; The mnemonic(al) pithy formula of core, line, number: clean core two line bias axles;
The each example of accompanying drawing 1 is referred in core unit, line unit, several unit;
Exempt from the regulation of ambiguity: because core unit, line unit, tiltedly, folding, axle, be all the static elements without uninstall process, every quiet code with unit can only be inputted once;
Brief description of the drawings
Each dvielement example of Fig. 1 current block
Fig. 2 individual character input table
Implementation of the present invention is:
Definition L:
1 retrieval elements: the element of never getting code and sequence number minimum of current block, also code fetch element; Refer to retrieval unit, h item, W item, S item in the each example of accompanying drawing 2;
2 sieves: the cross pithy formula of screening current block retrieval elements, that is: completely enclose Zuoyun and do not fall to rising core line number;
3 by sieve regular inspection: follow sieve by unit's order, location current block retrieval elements; Search successively the whether full unit of current block? enclose unit? there is there left unit? by that analogy, and first find that there is and never got code element, be exactly the retrieval elements of current block;
4 when entering code: the mapping code of retrieval elements is decided to be the input code of current block, is called for short when entering code;
The present invention can realize by following input method
One input method of the single character
The general formula of individual character input code composition: Z=pdh+WS
A: first yard:
There is sound word: the acoustic code of input marking-up, i.e. the code of p item in formula, is shown in Fig. 2 p item; B continues;
Doubt word: the p item of formula is exempted from; By sieve regular inspection, see first retrieval unit of figure 2 Jing, Eol, defeated
Enter to feel uncertain this current block of word when entering code: if activity code, after input, unloading is got
Data code, must present element the current block changing has occurred, see figure 2 Jing,
First of Eol unloads rear current block; If there is no dynamic element in current block, just without unloading
The process of carrying, ought enter code can only be the W item code that perseverance has, as the line code u of Fig. 1 Quan;
B continues;
B: second code:
There is sound word: continue the item after formula p, by sieve regular inspection, see first retrieval unit of front 16 words of Fig. 2,
That inputs this current block of marking-up ought enter code; If activity code, see first in Fig. 2
Activity code, unloading code fetch element after input, must present element variation has occurred
Current block, is shown in that first in Fig. 2 unloads rear current block; If it is not moving in current block
State element, just without uninstall process, ought enter code can only be the W item code that perseverance has, and sees
Fig. 2 Yin, the u returning, i; C continues;
Doubt word: by sieve regular inspection, that inputs current block ought enter code; If activity code, unloading after input
Code fetch element, presents the current block after element variation, sees that the rank rear of figure 2 Eol unloads
Rear current block; If there is no dynamic element in current block, ought enter code is a quiet code,
See the core code j of figure 2 Jing, or the oblique code e of Fig. 1 Quan; C continues;
C: third yard:
There is sound word: by sieve regular inspection, that inputs current block ought enter code; If activity code is shown in Fig. 2
Rank rear activity code, unloading code fetch element, presents the current block after element variation, as
Fruit does not have dynamic element in current block, and ought enter code is a quiet code, see Fig. 2 month and
The h item code of Lin Jiang, or see that Fig. 2 sees the w item code of Huang Ziding, or see that Fig. 2 Yin returns
Oblique code y, the y of s item; D continues;
Doubt word: by sieve regular inspection, that inputs current block ought enter code; If activity code, unloading code fetch unit
Element, presents the current block after element variation; If it is not dynamically first in current block
Element, ought enter code is a quiet code, sees the line code i of figure 2 Eol, or Fig. 1 Quan
Folding code y; D continues;
D: the 4th yard:
There is sound word: by sieve regular inspection, input is when entering code; On software intelligence word selection, shield, according to screen prompting platform
Formula machine-,=,, [,]; , ',,. ,/ten options buttons and
A space acknowledgement key, or handset-selected mode is carried out the artificial end of choosing eventually;
Doubt word: by sieve regular inspection, input is when entering code; On software intelligence word selection, shield, according to screen prompting platform
Formula machine-,=,, [,]; , ',,. ,/ten options buttons and
A space acknowledgement key, or handset-selected mode is carried out the artificial end of choosing eventually;
Three code systems are taked in the application of GB 2312 character set, no longer continue, and need not page turning can satisfy the demand completely; GB 18030 professional versions are 4 code systems, can be used for larger character set;
Two word input methods
The word of two word to seven words: first input word acoustic code word for word, then input word tail word and the each marking-up of lead-in when entering code, i.e. the adjacent code of acoustic code in the first each word input code string of tail;
If P nfor the phonetic acoustic code of n word in word, V nfor n marking-up in word when entering code, in word in n marking-up input code string adjacent yard of acoustic code;
The general formula of word input code: CV=P 1p nv nv 1
2≤n≤7 wherein, 4≤CV code length≤9
That is: two-character word language=P 1p 2v 2v 1
Three-character words and phrases=P 1p 2p 3v 3v 1
Four-word phrase=P 1p 2p 3p 4v 4v 1
Five character word language=P 1p 2p 3p 4p 5v 5v 1
Six words language=P 1p 2p 3p 4p 5p 6v 6v 1
Seven words language=P 1p 2p 3p 4p 5p 6p 7v 7v 1
Refer to code for Chinese word and phrase example below;
Embodiment
Individual character:
Element standardization ten million order of magnitude coding
vee sbx bme zdr hrq quy wiy swn?lls jlj blh msm
Space words leads to into the poor efficiency courage thing of sleeping with one's head on a high pillow
kbg jrm zbz cyk tzy cgi drw xwa?gak wcb dvs sul
The suitable storehouse Chinese is enjoyed with sound electricity in Bei Shijianbu road
bra sul jeq bie dzf ywu yaf dul?xaz szk kgc hdy
Ground is falling to execute lion
dty biw dig sfp sqj
Word:
The general cost of words is inefficient high endures hardships to accomplish some ambition to get twice the result with half the effort he is got rid of
zcyb tywz cbdrg xlgaw wxcdvc sbgbru btsdii
Long-term unlike not knowing that Amur has a liking for food lion acoustic code and win phonetic
snmyjeu bbbzdzi sssssqf ysmspyaw
Computer cell phone is shared and is suitable for word storehouse, high in the clouds Chinese character index new world
dnsjgxau syvdcvkgz hzjsxtdtd
In each word tail, lead-in input code string, the adjacent code of acoustic code, sees above part single word code.
The present invention has cracked a difficult problem for single acoustic code input, has utilized fully 10 to compress input code length with interior space encoder, has promoted the hit rate of only screen word or word.The present invention only increases 30 several memory points and simple and easy rule, a GPRS primary word, just can control several ten thousand Chinese characters including Japan and Korea S's Chinese character, the micro-merit of thing is huge.
Coding of the present invention, makes sequence retrieval and the input of Chinese character, and perfect pair is like nature itself.

Claims (2)

1. the input method of Chinese character based on retrieval elements, its method is:
Full first sequence number is 1, when radical part Epileptic, Rui, Bing, Fu, Jie, Dao, Rolling, Jin, lonely, fork-like farm tool used in ancient China, Si, Niu, Woo, Yi, Quan, Ren, Chi, Cannibals, the-Fan, Xin, Yan, Chuo, Yin, or insect without feet or legs, wherein any one, while structurally having taken the leftmost side of current block or the rightmost side, this radical is just decided to be the full unit of current block; If the leftmost side and the rightmost side of the separation of two radicals in above-listed current blocks, only select the full unit of the leftmost side;
Enclose first sequence number and be listed as 1, enclose to the greatest extent territory: connect a frame at the not closed framework of most circle and do not hold together to close between the two-stroke of end, auxiliaryly draw not crossing with any stroke of this company frame virtual tangent, mended the empty sealing of the most circle frame inner region of Feng Suocheng by virtual tangent, be called most circle territory, Yan Wei: in current block, not only there is a company's frame that encloses to the greatest extent Closed Architecture, be called for short and close to the greatest extent frame; And, after closing to the greatest extent frame stroke, or except after closing to the greatest extent frame stroke and associated weldering picture, also separately there is other stroke, and this all other strokes, have no the ground of omission and be all contained in Jin Bi frame circle; Close to the greatest extent the continuous stroke of frame, or to close to the greatest extent frame continuous stroke and associated weldering picture be exactly sternly enclosing unit, leaking and enclose of current block: sternly not enclosing in first current block, not only have a company's frame that encloses to the greatest extent not closed framework, be called for short to the greatest extent not frame; And, after most not frame stroke, or except after most not frame stroke and associated weldering picture, also separately there is other stroke, and this all other strokes, have no to omit ground and be all contained in the Jin Quan territory circle of this most not frame; The most continuous stroke of frame not, or frame continuous stroke and associated weldering picture are not exactly that unit is enclosed in the leakage of current block to the greatest extent; Both enclose unit at general designation, and the stroke being enclosed claims to hunt picture;
The sequence number of left unit is 2, and the leftmost side of current block is a word that has standard pronunciation, if while being all positioned at the right side of this word except all the other strokes of this word, this word is just decided to be the left unit of current block; Comprising having the left unit of joining with all the other strokes;
The sequence number of Yun Yuan is 3, if parts Tou, Lv, Http, zhao, ji, any one in Si, Xi, occupies alone the top layer of current block, and when all the other strokes of this parts are all under top layer plane, these parts are exactly the cloud unit of current block;
Not first sequence number is 4, in current block, contain the order that independently writes containing word or and parts, if it is not the left unit of current block, and, in these parts, during without any other stroke, should be exactly the not first of current block containing word or this parts; If both coexist, only select the former;
First sequence number that falls is 5, and under precondition, the unit that falls do not stop by other stroke, can vertical translation go out current block lower limb containing word; Fall unit be some shift out stroke in word minimum and be no less than 3 strokes have a standard pronunciation that containing word, comprising with separately have stroke to have to join containing word; When what if having, multiple strokes equated hits word, only choosing with the row leftmost side or the irregular vigour of style in writing the most close lower end containing word;
Rising first sequence number is 6, and under precondition, rising unit is not stopped by other stroke, can vertical translation goes out the word that contains of current block coboundary; Rise unit and be some shift out stroke in word minimum and be no less than 3 strokes have a standard pronunciation that containing word, comprising with the word that contains that separately has stroke to have to join; When what if having, multiple strokes equated hits word, only choosing with the row leftmost side or the irregular vigour of style in writing the most close upper end containing word;
Certain above element, if the retrieval elements of current block, after code fetch, this element must be fulfiled unloading, therefore they also all have another name called as dynamic class element, after dynamic element is unloaded, must present the current block after element variation; And each element below all has another name called as static class element;
The sequence number of core unit is 7, fulfils unloading, and not containing the current block of any dynamic element, if it is that while having the word of standard pronunciation, this word is just decided to be core unit;
The sequence number of line unit is 8, is not all decided to be dust line unit containing intersecting the various current block of stroke, is all decided to be excellent line unit and contain the various current block that intersects stroke; Both are referred to as line unit;
The sequence number of several units is 9, and the stroke of Chinese character is divided three classes: be referred to as axle with transverse axis or the completely parallel stroke of vertical pivot and draw, wherein do not comprise the stroke of little time hook of tail end band; The folding picture being commonly called as, comprising the stroke of little time hook of tail end band; Do not belong to front two classes and remaining various stroke merger is called oblique picture; With tiltedly drawing successively, folding is drawn, axle is drawn all kinds of total numbers, is called for short tiltedly, the value of folding, axle inputs the number of drawing of all kinds of strokes of current block, every class has value certainly; If the total numerical value of a certain single class stroke be 0,1,2,3 or be greater than 3 several time, zero stroke, stroke, two strokes, three strokes or many divisions are not several units of its value;
The mnemonic(al) pithy formula of above 10 kinds of elements: the full limit of radical structure, frame is rounded up and hunt and entirely contained, all right existing more than a left side, the cloud layer exhibition of being left, order not left another code Europe, falls to rising few three and draws peak Stroll, selects clean core two line bias axles with misarrangement peak in the vicinity;
Described full unit respectively according to, b ← Epileptic, d ← Rui or Bing, e ← Fu, Dao or Jie, i ← Rolling, j ← Jin or lonely, 1 ← Si or or or Chi, s ← Cannibals or w ← The-Fan, x ← Xin, y ← Yan, z ← Chuo, Yin, or corresponding relation mapping, described various framework sternly enclose that unit or leakage enclose that unit is all mapped in the w key mapping of keyboard, described left unit be mapped in separately with the keyboard key-position of himself acoustic code with symbol on, from sound mapping, described cloud unit respectively according to, a ← Tou or Lv, or or, or or or or or corresponding relation mapping, described not first, all be mapped in the Europe of keyboard, be in O key mapping, the described unit that falls be mapped in separately with the keyboard key-position of himself acoustic code with symbol on, from sound mapping, described rise unit be mapped in separately with the keyboard key-position of himself acoustic code with symbol on, from the mapping code of sound mapping, above-mentioned dynamic element, have another name called as activity code, activity code item indicates with small letter d in formula, and static elements below mapping code, another name is quiet code, described core unit be mapped in separately with the keyboard key-position of himself acoustic code with symbol on, shine upon from sound, core code item indicates with small letter h in formula, described various dust line unit or excellent line unit are all mapped in respectively in the i key mapping or u key mapping of keyboard, the line code item that perseverance has indicates with the w of capitalization in formula, described several units respectively according to, L ← zero stroke, Y ← stroke, E ← bis-stroke, S ← tri-stroke, or the corresponding relation of D ← many strokes mapping, L is placeholder, can not indicate with the s item of capitalization for three empty figure place units breviary in formula,
The mapping code of each element above, if without the yu of initial consonant, when circuitous sound, replaces with V without exception: except statement, and all case insensitives of the letter of all mappings;
Retrieval elements: never got the element of code and sequence number minimum in current block, also code fetch element;
Sieve: the cross pithy formula of screening current block retrieval elements, that is: completely enclose Zuoyun and do not fall to rising core line number:
By sieve regular inspection: follow sieve by unit's order, location current block retrieval elements; Search successively the whether full unit of current block? enclose unit? there is there left unit? by that analogy, and first find that there is and never got code element, be exactly the retrieval elements of current block;
When entering code: the mapping code of retrieval elements is decided to be the input code of current block, be called for short when entering code;
Exempt from the regulation of ambiguity: because core unit, line unit, tiltedly, folding, axle, be all the static class element without uninstall process, every quiet code with unit can only be inputted once;
The general formula of individual character input code composition: Z=pdh+WS
A: first yard:
There is sound word: the acoustic code of input marking-up, the i.e. code of p item in formula; B continues;
Doubt word: the p item of formula is exempted from; By sieve regular inspection, that inputs doubt word ought enter code; If activity code, unloading code fetch element after input, must present element the current block changing has occurred, if there is no dynamic element in current block, just without uninstall process, ought enter code can only be the W item code that perseverance has; B continues;
B: second code:
There is sound word: continue the item after formula p, by sieve regular inspection, that inputs marking-up ought enter code; If activity code, after input, unloading code fetch element, presents the current block after element variation, if there is no dynamic element in current block, just without uninstall process, ought enter code can only be the W item code that perseverance has; C continues;
Doubt word: by sieve regular inspection, input is when entering code; If activity code, after input, unloading code fetch element, presents the current block after element variation, if there is no dynamic element in current block, ought enter code is a quiet code; C continues;
C: third yard:
There is sound word: by sieve regular inspection, input is when entering code; If activity code, after input, unloading code fetch element, presents the current block after element variation, if there is no dynamic element in current block, ought enter code is a quiet code; D continues;
Doubt word: by sieve regular inspection, input is when entering code; If activity code, after input, unloading code fetch element, presents the current block after element variation, if there is no dynamic element in current block, ought enter code is a quiet code; D continues;
D: the 4th yard:
There is sound word: by sieve regular inspection, input is when entering code; On software intelligence word selection, shield, according to screen prompting with desktop computer-,=,, [,]; , ',,. ,/ten options buttons and a space acknowledgement key, or handset-selected mode is carried out artificial choosing eventually and is finished;
Doubt word: by sieve regular inspection, input is when entering code; On software intelligence word selection, shield, according to screen prompting with desktop computer-,=,, [,]; , ',,. ,/ten options buttons and a space acknowledgement key, or handset-selected mode is carried out artificial choosing eventually and is finished;
Three code systems are taked in the application of GB2312 character set, GB18030 professional version be 4 code systems.
2. the input method of Chinese character based on retrieval elements as claimed in claim 1, its word input method is:
The word of two word to seven words: first input word acoustic code word for word, then input word tail word and the each marking-up of lead-in when entering code, i.e. the adjacent code of acoustic code in the first each word input code string of tail;
If P nfor the phonetic acoustic code of n word in word, V nfor n marking-up in word when entering code, in word in n marking-up input code string adjacent yard of acoustic code; The general formula of word input code: CV=P 1p nv nv 1
2≤n≤7 wherein, 4≤CV code length≤9
That is: two-character word language=P 1p 2v 2v 1
Three-character words and phrases=P 1p 2p 3v 3v 1
Four-word phrase=P 1p 2p 3p 4v 4v 1
Five character word language=P 1p 2p 3p 4p 5v 5v 1
Six words language=P 1p 2p 3p 4p 5p 6v 6v 1
Seven words language=P 1p 2p 3p 4p 5p 6p 7v 7v 1.
CN201110143478.6A 2011-05-31 2011-05-31 Chinese character input method based on retrieval elements Expired - Fee Related CN102184034B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201110143478.6A CN102184034B (en) 2011-05-31 2011-05-31 Chinese character input method based on retrieval elements

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201110143478.6A CN102184034B (en) 2011-05-31 2011-05-31 Chinese character input method based on retrieval elements

Publications (2)

Publication Number Publication Date
CN102184034A CN102184034A (en) 2011-09-14
CN102184034B true CN102184034B (en) 2014-10-22

Family

ID=44570217

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201110143478.6A Expired - Fee Related CN102184034B (en) 2011-05-31 2011-05-31 Chinese character input method based on retrieval elements

Country Status (1)

Country Link
CN (1) CN102184034B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104007833A (en) * 2013-06-14 2014-08-27 赵建民 Ternary basic code input method

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1180186A (en) * 1996-10-11 1998-04-29 潘玉琦 Chinese characters word font input method (Zhengyang code) and shuangyang code
CN1251438A (en) * 1999-11-24 2000-04-26 肖金卯 Chinese character digital coding input method based on Chinese character basic elements and normal parts
CN1384425A (en) * 2001-08-16 2002-12-11 项有建 Indefinite code Chinese character input method for computer and keyboard thereof
CN1527183A (en) * 2003-05-08 2004-09-08 丁恩明 Square-shaped character encoding and inputting method and keyboard

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1180186A (en) * 1996-10-11 1998-04-29 潘玉琦 Chinese characters word font input method (Zhengyang code) and shuangyang code
CN1251438A (en) * 1999-11-24 2000-04-26 肖金卯 Chinese character digital coding input method based on Chinese character basic elements and normal parts
CN1384425A (en) * 2001-08-16 2002-12-11 项有建 Indefinite code Chinese character input method for computer and keyboard thereof
CN1527183A (en) * 2003-05-08 2004-09-08 丁恩明 Square-shaped character encoding and inputting method and keyboard

Also Published As

Publication number Publication date
CN102184034A (en) 2011-09-14

Similar Documents

Publication Publication Date Title
CN107562824B (en) Text similarity detection method
CN105550170B (en) A kind of Chinese word cutting method and device
CN110377740A (en) Feeling polarities analysis method, device, electronic equipment and storage medium
CN105210055B (en) According to the hyphenation device across languages phrase table
CN110362820B (en) Bi-LSTM algorithm-based method for extracting bilingual parallel sentences in old and Chinese
CN106503101A (en) Electric business customer service automatically request-answering system sentence keyword extracting method
CN110134951A (en) A kind of method and system for analyzing the potential theme phrase of text data
CN102184034B (en) Chinese character input method based on retrieval elements
JPWO2008146583A1 (en) Dictionary registration system, dictionary registration method, and dictionary registration program
CN106156006B (en) Tibetan language word component analyzing method, Tibetan collation method and corresponding intrument
Pedersen Machine learning with lexical features: The duluth approach to senseval-2
Menai et al. Genetic algorithm for Arabic word sense disambiguation
CN101630309A (en) Word processing system with fault tolerance function and method
CN103744532A (en) 26 radical root Chinese and English harmonic inputting method
WO2023030266A1 (en) Input method lexicon updating method and apparatus, device and server
CN101488057B (en) Combined coding technique
CN101226430A (en) Character-checking typewriting idem code input method as well as input device and application thereof
CN103744533A (en) Thirty Chinese character component input method
CN102566904B (en) A kind of West Xia Dynasty's voice terminal based on West Xia Dynasty's literary composition holographic code exchange interface
CN107203625A (en) A kind of imperial palace dress ornament Text Clustering Method and device
CN104536590A (en) Embedded soft keyboard system based on Xixia character sound rhyme and root input method
CN105204657B (en) Combined type phonetic class major-minor code Chinese character, word coded input method and its keyboard
CN100361057C (en) Chinese character input method using small keyboard of computer keyboard
CN106325540B (en) A kind of simple stroke input method of Northeast Yunnan, China subdialect seedling text and its application
CN106201008B (en) A kind of entering method keyboard layout method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20141022

Termination date: 20170531