CN107066114A - Bipartite structure code - Google Patents
Bipartite structure code Download PDFInfo
- Publication number
- CN107066114A CN107066114A CN201710062351.9A CN201710062351A CN107066114A CN 107066114 A CN107066114 A CN 107066114A CN 201710062351 A CN201710062351 A CN 201710062351A CN 107066114 A CN107066114 A CN 107066114A
- Authority
- CN
- China
- Prior art keywords
- block
- word
- chinese character
- character
- coding
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/02—Input arrangements using manually operated switches, e.g. using keyboards or dials
- G06F3/023—Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
- G06F3/0233—Character input methods
Landscapes
- Engineering & Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Document Processing Apparatus (AREA)
Abstract
Bipartite structure code belongs to a kind of input method of Chinese character.Selection and distribution, Chinese character separating method and word coding method method including 12 keyboard coding parts.Bipartite structure code according to the Hanzi structure relation of its delimitation and it is corresponding split rule, Chinese character natural split into most two blocks according to the stroke sequential write of Chinese character, it is dividing by means of characters method clear concept, simple and clear.Bipartite structure code is encoded according to its coding rule.Individual character block word takes two yards, and two block words take trigram, and phrase takes four yards.The most frequently used word can all use one yard, two yards of brevity code typings." " one yard of one key typing of word.Phrase repeated code is few.
Description
Technical field
Bipartite structure code belongs to a kind of input method of Chinese character.Selection and distribution including keyboard coding part, two volumes of Chinese character
The coding method of the method for splitting of codeword block, individual character and phrase.
Background technology
At present, conventional input method has spelling input method, tone-form input method, shape phonetic input method, pure shape input method and stroke
Input method etc.;
Spelling input method existing defects:1st, single character code is long;2nd, phrase repeated code is more;3rd, when cacoepy, difficulty of reading, it can make
It is difficult into input;
Current existing shape phonetic input method, the defect of pure shape input method generally existing:1st, keyboard coding part is more, keyboard set up
Undesirable 2, divide by means of characters complicated, the connected stroke that will intersect having is split, what is had is not suitable by the natural writing of Chinese-character stroke
Sequence is split, and is mostly repeatedly split Chinese character, and specific, concept is not apparent for some dividing by means of characters methods;
Tone-form input method:Tone code and the advantage of shape code are taken into account, while also having passed on tone code and the shortcoming of shape code;
Stroke input method:Auxiliary input method can only be used as.
The content of the invention
Brief introduction
Chinese character natural is split into most two by bipartite structure code according to the Hanzi structure relation and the corresponding rule that splits of delimitation
For the block of coding, then encoded according to the coding rule of bipartite structure code;
12 radicals are chosen as keyboard device to be distributed on keyboard by stroke subregion and form;
Dividing by means of characters method clear concept, simple and clear, individual character block word takes two yards, and two block words take trigram, and phrase takes four yards, phrase weight
Code is few.
First, area code
25 English characters of keyboard are divided into five areas, area code corresponds to numeral 1,2,3,4,5 respectively, per region and five-position, equally, position
Numeral 1,2,3,4,5 number is also corresponded to respectively.Meanwhile, by five strokes " horizontal, vertical, left, flick, folding " of Chinese character also by numeral 1,2,3,
4th, 5 numbering, take two can be corresponding with the English character for setting area code.The corresponding area code difference of 25 English characters
For:One area G11, F12, D13, S14, A15, two area H21, J22, K23, L24, M25, three area T31, R32, E33, W34, Q35, four
Area Y41, U42, I43, O44, P45, five area B51, V52, C53, X54, Z55.Left point is considered as slash, right point and is considered as by bipartite structure code
Right-falling stroke, left vertical hook are considered as that perpendicular, starting writing is considered as horizontal stroke.Such as:The left point of " Mi, Http, sword, snow " etc. is considered as slash, " small Rolling Pin water cun " etc.
Left vertical hook is considered as perpendicular, and " starting writing for Bing Rui Rolling Niu " etc. is considered as horizontal stroke.
2nd, keyboard device
Totally ten two keyboard devices:
Q35 | W34 | The E33 months | R32 | T31 | Y41 | U42 Lv | I43 fire | O44 | P45 |
A15 | S14 wood | D13 | F12 | G11 | H21 mountains | J22 mouthfuls | K23 days Xin | L24 mesh door | ; |
Z55 | X54 Rolling | C53 | V52 | B51 | N | M25 | , | . | / |
Keyboard device characteristic distributions:" Lv, Rolling " take shape;If " Xin, door " is suitable by first intermediate and then both sides, first left and then right writing
Sequence, it for perpendicular, secondary pen is respectively slash and right-falling stroke that its is the first sum of, and their the first two strokes and area code is corresponding;The head of other keyboard devices
Pen is all corresponding with area code.
3rd, Chinese character separating principle
1st, Chinese character natural is split into most two by bipartite structure code according to the Hanzi structure relation and the corresponding rule that splits of delimitation
The individual block for being used to encode
The Chinese character that can not be split is individual character block Chinese character,
Under rare occasion, single can turn into coding block, such as:The last pen of words such as " skill, big, chi, unreal, arts ";
2nd, intersecting connected stroke can not split
It is intersecting:Stroke, which intersects, lifts one's head
It is connected:Stroke joins end to end;
3rd, the stroke of a block can not be separated by the stroke of another block
In addition, two coding blocks of bipartite structure code can form encirclement, semi-surrounding relation, the stroke energy of each block
By sequential write self-assembling formation one, block shape can be with changeable;
4th, non-tiled configuration Chinese character:Prefix single can not be split.
4th, Chinese character coding rule
1st, single is encoded:Area code is determined by single stroke, position number takes 1, and specific coding is:Horizontal G, perpendicular H, slash T, right-falling stroke Y, folding B;
2nd, single block is encoded:Encoded by single;
3rd, keyboard device is encoded:Keyboard location number coding where button disk component, such as:Keyboard location number where keyboard device " Lv "
For " 42 ", it is encoded to " U ";
4th, two combine coding:The first stroke determines area code, and specific numbering is:Horizontal stroke 1, perpendicular 2, slash 3, right-falling stroke 4, folding 5, second determination position
Number, specific numbering is:Horizontal stroke 1, perpendicular 2, slash 3, right-falling stroke 4, folding 5, two combine the area code for determining to encode, such as:" ten " word is by two knots
When compiling in collaboration with yard, correspondence position numbering is 12, and correspondence is encoded to F;
5th, many block codings:If block or block stem are keyboard device, keypad component coding, otherwise, by block
The first sum of and time pen combines coding, example by two:" journey " word is removable to be divided into block " standing grain " and " being in ", and " being in " presses " mouth " and be encoded to J,
" thinking " word is removable to be divided into block " phase " and " heart ", and " phase " presses " wood " and be encoded to S;
6th, two block words are encoded:
First yard:Encoded by first character block
Second code:Encoded by second block
Third yard:By the last pen of the last pen of first character block and second block coding is combined by two.
5th, right frame, lower frame, frame structure
The mount structure word that right frame " Contraband ", lower frame " Jiong ", square frame " mouth " are constituted, frame is allocated as first character block, inframe part
It is used as second block.Stem is encoded for the block of right mount structure by right frame.
6th, left, upper semi-surrounding structure
Left, the upper semi-surrounding block word being made up of radical " factory, wide, Epileptic, xu, corpse, family ", enclosure is allocated as first character block,
It is surrounded part and is used as second block.
7th, left, lower semi-surrounding structure
Left, lower semi-surrounding structure is split as surrounding part and the block of besieged part two, is split by the sequencing of writing.It is real
Example:" super, awkward, inscribe, return ".
8th, tiled configuration
The tiled configuration Chinese character write from left to right is split out naturally by stroke sequential write according to tiled configuration relation can not be again
The block of left and right fractionation is carried out as first character block, remaining part is used as second block.Tiled configuration word splits supplement
Rule:Single stroke is to left side block merger in the middle of block.Such as:" repairing " word.
9th, up-down structure
The up-down structure Chinese character write from top to bottom is by stroke sequential write according to up-down structure relation according on bipartite structure code
Lower block word split rule split into naturally above and below two blocks;
The priority that divides by means of characters is as follows:
1st, prefix is keyboard device, and keyboard device is used as second block as first character block, remainder;
2nd, by radical, " the up-down structure word that ' rain ' prefix, ' cave ' prefix, Http " are constituted, will " ' rain ' prefix, ' cave ' prefix, Http " works
For the first block, remaining part is used as the second block;
3rd, prefix can be split out three and more than three Chinese characters naturally by up-down structure relation, taken word principle to split out Chinese character by maximum and made
For first character block, remaining part is used as second block, example:" good fortune, it is true, more, reputation ";
4th, suffix can be split out two and more than two Chinese characters naturally by up-down structure relation, taken word principle to split out Chinese character by maximum and made
For second block, remaining part is used as first character block, example:" blue or green, sell, the spring, general, Securities, tool, soldier, tight ";
5th, prefix or suffix are frame-type, semi-surrounding, the up-down structure word of tiled configuration, by sequential write by frame-type, semi-surrounding, a left side
Right structure and remaining part are split as two blocks, example:" height, a widow, be preced with, cover ";
Supplementary notes:If the fractionation priority of up-down structure word appropriately adjusted, influence is little.
Tenth, miscellaneous body structure
1st, prefix can split out three and more than three Chinese characters naturally by stroke writing sequencing, take word principle to split out the Chinese by maximum
Word is used as second block, example as first character block, remainder:" deuterium, art ";
2nd, suffix can split out three and more than three Chinese characters naturally by stroke writing sequencing, take word principle to split out the Chinese by maximum
Word is used as first character block, example as second block, remainder:" person, rear, left ".
11, individual character block Chinese character
The Chinese character that can not be split by above-mentioned rule is individual character block Chinese character;
Keyboard coding part word is encoded:Two yards are taken by keyboard location number where it;
Other individual character block Chinese character coding rules:
First yard:By Chinese-character stroke sequential write, encoded by first and second by area code.Single word is encoded by single
Second code:By Chinese-character stroke sequential write, encoded by third and fourth pen by area code.Single word, two words take the first of word
Code, three words are encoded by the 3rd by single.
12, keyboard device homologous structure
Non- radical " Lv ", but stem is the block of " Lv " configuration, button disk component " Lv " coding;Include the individual character of " Lv " configuration
Block word " twenty is sweet ", button disk component word coded system coding;" Si " " mesh " is encoded together, and " saying " " day " encodes , " Pin together " together " wood "
Coding, " Mao " " moon " is encoded together.
13, phrase is encoded
Two-character phrase is encoded:First two yards of two words are taken successively, such as:" structure ZFSQ, principle DRMJ ";
Three word coding methods:The first two word takes prefix successively, and the 3rd word order takes two yards, such as:" constructive code ZSDZ ";
Four word coding methods:The prefix of each word is taken successively, such as:" happy life FPTO ";
Multiword Chinese word coding:First three word takes prefix successively, the 4th yard take the last character prefix.Such as:", Chinese people's republicanism
State MRWM ".
14, individual character auxiliaring coding
Individual character auxiliaring coding is independent coding, and discord individual character normal encoding produces repeated code, and phrase of also getting along well produces repeated code;
Individual character auxiliaring coding contributes to the words frequency modulation of normal encoding, improves words input efficiency, and non-commonly used word input is normal to compile
After code, if can not be shown in candidate's homepage, auxiliaring coding typing can be passed through;
1st, " " word one key typing coding:N,
2nd, two yards of all Chinese characters increase trigram auxiliaring coding:First two yards are Chinese character basic coding, and third yard is N,
3rd, all trigram Chinese characters increase by four yards of auxiliaring codings:Preceding trigram is Chinese character basic coding, and the 4th yard is N.
15, brevity code
In order to improve individual character input efficiency, the most frequently used word all can be set to one yard, two yards of brevity codes.
16, non-word radical coding
First yard:By bipartite structure code coding rule code fetch,
Second code:N.
Claims (5)
1. bipartite structure code belongs to a kind of input method of Chinese character, include selection and distribution, the Chinese character separating method of keyboard coding part
With word coding method method, method includes principle and rule, and Chinese character separating is first the block for coding, block by bipartite structure code
Coding takes one yard, then determines word coding method according to word coding method method, it is characterised in that:In Chinese character separating method, according to draw
Fixed Hanzi structure relation and it is corresponding split rule by Chinese character natural split into most two be used for encode block, above and below
Chinese character prefix, suffix are introduced when structure and the fractionation of miscellaneous body structure to take in word method for splitting, the method for Chinese character coding, block coding draws
Keypad component coding method when having entered block stem for keyboard device.
2. the Hanzi structure relation delimited according to claim 1 and corresponding fractionation rule, it is characterised in that:Right frame
The mount structure word that " Contraband ", lower frame " Jiong ", square frame " mouth " are constituted, frame is allocated as first character block, and inframe part is used as second
Individual block;Left, the upper semi-surrounding block word being made up of radical " factory, wide, Epileptic, xu, corpse, family ", enclosure is allocated as first character
Block, is surrounded part and is used as second block;Left, lower semi-surrounding structure is split as surrounding part and the word of besieged part two
Block, is split by the sequencing of writing;The tiled configuration Chinese character write from left to right is by stroke sequential write according to tiled configuration
Relation splits out the block that can not carry out left and right fractionation again as first character block naturally, and remaining part is used as second word
Block;The up-down structure Chinese character write from top to bottom by stroke sequential write according to up-down structure relation according to bipartite structure code above and below
Block word split rule split into naturally above and below two blocks, divide by means of characters priority be:Prefix is keyboard device, and keyboard device is made
For first character block, remainder is as second block, by the radical " knot up and down that ' rain ' prefix, ' cave ' prefix, Http " are constituted
Structure word, by ", as the first block, remaining part is as the second block, and prefix can be by knot up and down by ' rain ' prefix, ' cave ' prefix, Http "
Structure relation splits out three and more than three Chinese characters naturally, takes word principle to split out Chinese character as first character block, remaining portion by maximum
It is allocated as second block, suffix can be split out two and more than two Chinese characters naturally by up-down structure relation, takes word former by maximum
Chinese character is then split out as second block, remaining part is as first character block, and prefix or suffix are frame-type, semi-surrounding, left and right
The up-down structure word of structure, is split as two blocks, such as by sequential write by frame-type, semi-surrounding, tiled configuration and remaining part
Fruit appropriately adjusts the fractionation priority of up-down structure word, and influence is little;Miscellaneous body block word dividing by means of characters priority is:Prefix
Three and more than three Chinese characters can be split out naturally by stroke writing sequencing, take word principle to split out Chinese character as first by maximum
Individual block, remainder can split out three and more than three naturally as second block, suffix by stroke writing sequencing
Chinese character, takes word principle to split out Chinese character as second block, remainder is used as first character block by maximum;Can not be by above-mentioned rule
The Chinese character then split is individual character block Chinese character;In the case where following method for splitting described in claim 1, drawn described in claim 1
Fixed Hanzi structure relation and the appropriate change of fractionation rule work accordingly, influence little.
3. according to claim 1 bipartite structure code, it is characterised in that:12 keyboard devices are chosen, by stroke and form
Distribution:Moon E33, W34, wood S14, Rolling X54, Lv U42, fire I43, mountain H21, mouth J22, day K23 mesh L24, Xin K23, door L24.
4. according to claim 1 bipartite structure code, it is characterised in that:Two block word third yards are by the last pen of first and second block
Combine and encode by two.
5. according to claim 1 bipartite structure code, it is characterised in that:Individual character increases auxiliaring coding, including:" " one key of word
Typing encodes N;All two yards of Chinese characters increase trigram auxiliaring codings, first two yards are Chinese character basic coding, and third yard is N;It is all
Trigram Chinese character increase by four yards of auxiliaring codings, preceding trigram be Chinese character basic coding, the 4th yard be N.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710062351.9A CN107066114A (en) | 2017-02-02 | 2017-02-02 | Bipartite structure code |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710062351.9A CN107066114A (en) | 2017-02-02 | 2017-02-02 | Bipartite structure code |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107066114A true CN107066114A (en) | 2017-08-18 |
Family
ID=59598751
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710062351.9A Pending CN107066114A (en) | 2017-02-02 | 2017-02-02 | Bipartite structure code |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107066114A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111796692A (en) * | 2020-07-08 | 2020-10-20 | 合肥埃米特信息技术有限公司 | Chinese character input method |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1206137A (en) * | 1996-01-17 | 1999-01-27 | 邹鹏程 | Binary code Chinese character input system |
CN1269541A (en) * | 1999-04-05 | 2000-10-11 | 童立志 | Shiji code Chinese character input method (including configurational code and pictophonetic code) |
CN1482530A (en) * | 2003-07-04 | 2004-03-17 | 曾艾明 | Four-four square array coding method for input Chinese character and supporting keyboard |
CN1773432A (en) * | 2005-09-21 | 2006-05-17 | 魏华彬 | U Code Chinese character inputting method |
CN101078954A (en) * | 2007-07-30 | 2007-11-28 | 刘兆荣 | Coding technology for simplifying five-stroke shape-pronunciation code |
US20110115715A1 (en) * | 2008-03-31 | 2011-05-19 | Chunman Li | Four-corner cut square chinese character input method based on excellence code |
-
2017
- 2017-02-02 CN CN201710062351.9A patent/CN107066114A/en active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1206137A (en) * | 1996-01-17 | 1999-01-27 | 邹鹏程 | Binary code Chinese character input system |
CN1269541A (en) * | 1999-04-05 | 2000-10-11 | 童立志 | Shiji code Chinese character input method (including configurational code and pictophonetic code) |
CN1482530A (en) * | 2003-07-04 | 2004-03-17 | 曾艾明 | Four-four square array coding method for input Chinese character and supporting keyboard |
CN1773432A (en) * | 2005-09-21 | 2006-05-17 | 魏华彬 | U Code Chinese character inputting method |
CN101078954A (en) * | 2007-07-30 | 2007-11-28 | 刘兆荣 | Coding technology for simplifying five-stroke shape-pronunciation code |
US20110115715A1 (en) * | 2008-03-31 | 2011-05-19 | Chunman Li | Four-corner cut square chinese character input method based on excellence code |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111796692A (en) * | 2020-07-08 | 2020-10-20 | 合肥埃米特信息技术有限公司 | Chinese character input method |
CN111796692B (en) * | 2020-07-08 | 2023-12-22 | 合肥埃米特信息技术有限公司 | Chinese character input method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101694601B (en) | Zero-memory Chinese character coding input method | |
CN107066114A (en) | Bipartite structure code | |
CN106168858A (en) | 26 radical radical and stroke Chinese-character input methods | |
CN103744533A (en) | Thirty Chinese character component input method | |
CN103324299B9 (en) | Chinese character pictographic code computer input method based on Chinese character basic components | |
CN105302330A (en) | Combined phonetic and stroke type main and auxiliary code Chinese character and word and phrase coding input method and keyboard adopting method | |
CN103760989B (en) | He-Chinese horizontal stroke-vertical stroke-left descending stroke-right descending stroke font technology and input method | |
CN105912139A (en) | Corresponding recognition method for coding Chinese characters by using modular strokes | |
CN105278697B (en) | Combined double-spelling class major-minor code Chinese character, word coded input method and its keyboard | |
CN107066113A (en) | The code inputting method of 20 part individual character two | |
HALL | Language, dialect and ‘regional Italian’ | |
CN104063070B (en) | Amount property code Chinese character entering method | |
CN104133556B (en) | Double-stroke type main and auxiliary code letter type radical dictionary and sonic dictionary Chinese character coding input method and keyboard adopting method | |
CN1538276A (en) | Chinese charactor stroke and sound combined code input method | |
CN102520808A (en) | Head and tail dual stroke Chinese character input method | |
CN102609106B (en) | As the existing kanji code trinity input method of Comnputer Chinese character | |
CN1530805A (en) | Chinese character shape inputting system | |
CN101241403B (en) | Parts Chinese character coding input method and its corresponding keyboard | |
CN105320291B (en) | Combined type pronunciation and meaning class major-minor code Chinese character, word coded input method and its keyboard | |
CN105320290B (en) | Pronunciation and meaning class major-minor code word parent form radical dictionary, sonic system dictionary encoding of chinese characters input method and its keyboard | |
Steiner | Sky Rider: Park Van Tassel and the Rise of Ballooning in the West. By Gary B. Fogel | |
CN104750264B (en) | Computer Chinese-character associates fast code inputting method | |
CN1148201A (en) | Sound and stroke code for Chinese character input for computer | |
CN102929398A (en) | Zero-memory maximum sub Chinese character encoding input method | |
CN109725739A (en) | The Comnputer Chinese character input technology encoded with stroke and radical |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20170818 |
|
WD01 | Invention patent application deemed withdrawn after publication |