CN1400522A - Chinese character dynamic structure easily-coding technique - Google Patents

Chinese character dynamic structure easily-coding technique Download PDF

Info

Publication number
CN1400522A
CN1400522A CN 01124007 CN01124007A CN1400522A CN 1400522 A CN1400522 A CN 1400522A CN 01124007 CN01124007 CN 01124007 CN 01124007 A CN01124007 A CN 01124007A CN 1400522 A CN1400522 A CN 1400522A
Authority
CN
China
Prior art keywords
character
chinese
coding
chinese character
easy
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 01124007
Other languages
Chinese (zh)
Inventor
李晓明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN 01124007 priority Critical patent/CN1400522A/en
Publication of CN1400522A publication Critical patent/CN1400522A/en
Pending legal-status Critical Current

Links

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

The invention 'Chinese character dynamic-coding easy-code technique' refers to a Chinese character shape-code technique, which belongs to the Chinese character information processing field. It builds the concept as to Chinese character dynamic configuration and dynamic configuration cell collectino (stroke, the side of a Chinese character and ready character), adopts Chinese character and Chinese phonetic initial character called Chinese character dynamic configuration cell as the code character, forms the internal relation between the code character and the code object, and establishes the oneness of the four skills, the Chinese character 'identifying', 'writing', 'typing' and 'checking', which constructs out the easy-code technique project, which includes the 'easy-input' method applied to inputting Chinese characters in the information-processing equipments such as the computer and so on.

Description

Chinese character dynamic structure easy coding technology
The invention relates to a Chinese character coding technology, and belongs to the field of Chinese character information processing. Specifically, the invention establishes and constructs a standard, easy-to-learn, easy-to-use and efficient Chinese character dynamic structure easy-to-code system.
The Chinese characters have the advantages of inherent pictograph, ideation, conciseness and holography, which are incomparable with any Western languages. However, in modern information processing systems, the handling of kanji information presents unique difficulties compared to western languages. Thus, the problem of Chinese character encoding arises. Over the years, a plurality of expert scholars and inventors have conducted extensive research on the problem, and some relatively excellent coding schemes have played a good role in the field of Chinese character input of computers.
The existing technical scheme of Chinese character coding can be roughly divided into two categories: namely phonetic codes and font codes.
The phonetic codes take spelling information of whole Chinese character pronunciation as coding basis and Chinese phonetic alphabet as coding symbols. Its advantages are simple coding rule and high mastering power. The disadvantages are high duplication code rate and low input efficiency; because the phonetic code has a higher requirement on the pronunciation level of the user Mandarin, the user in non-northern areas is more difficult. In fact, the phonetic codes are not Chinese character codes in strict sense, but Chinese character patterns are converted by Chinese character pronunciation.
The font code uses the optimized number of form radicals to form a basic radical set, and uses this as coding basis, after classification and merging, uses English letters or numbers as coding symbols. Its advantages are low duplicate rate and high input efficiency. The disadvantages are that the coding rule is complex, the correlation degree of the code symbol and the font is low, so the learning difficulty is large, and the learning difficulty is difficult to be mastered by non-professional input personnel.
The font code is represented by a 'five-stroke font', and all existing font code schemes are almost the thought of the 'five-stroke font', and only the selection of a 'basic etymon set' and the distribution, or adjustment, or optimization of the 'basic etymon set' on a keyboard are different, even some of the scenarios are also degraded, and the scenarios are complicated. The former is like phonetic table character and shape codes, thus reducing the difficulty of mastering the character and shape code input method to a certain extent; the latter, like the English letter analog font code, can bring more confusion to correct cognition and use of Chinese characters.
The existing font coding is difficult to learn and master, and the fundamental reason is the inherent defect of the font coding schemes. This "congenital defect" is: the Chinese character recognition rules and the correlation between the Chinese character recognition rules and the coding technology are lack of deep analysis research and corresponding technical processing.
Most important, often the simplest; the simplest, often the most natural. The invention fundamentally solves the 'inherent defect' of the existing font coding technology.
Any Chinese character has a certain structural form and is composed of one or more than one structural unit (hereinafter referred to as structural element). The structural element of the Chinese character is a dynamic set, and the etymon in the prior coding technology is a static set which only reflects partial characteristics of the dynamic set of the structural element of the Chinese character and does not fully reveal the essence of the dynamic set. The 'dynamic structure' and 'dynamic structure element set' of Chinese characters are the important concepts of the invention.
The method is based on constructing a basic etymon set as a coding basis and is one of the 'inherent defects' of the existing font coding technology.
The Chinese character "Guo" is composed of Jiong structural elements, Yu and Yi, and the "recognition" and "writing" of the Chinese character both follow this structural relationship, while in the coding of the five-stroke character and other characters, the Chinese character "Guo" is a combination of the root of Chinese character, such as Kaikou, Wang and Xie-Shi. Because there is no "jade" in the "basic etymon", but there is "mouth". Thus, the structural relationship of the Chinese character 'nation' is distorted, and the 'recognition', 'writing' and 'typing (computer Chinese character input)' of the 'nation' characters become two things which are not coherent and contradictory. Therefore, the existing font code is really caused by the fact that the font code is difficult to learn and grasp. Many examples of such are given.
Therefore, any character configuration code and various flexible coding schemes constructed on the basis of the static radical set have to make some non-standard and complex coding rules at the expense of the cognitive specification of the Chinese characters so as to replace the lower coincident code rate. This situation inevitably produces obvious interference and confusion effects when people recognize, learn and understand Chinese characters, and thus causes difficulty in learning and mastering font codes.
The determination of the code symbol in an artificial way independent of the character itself is the second "inherent defect" of the existing character pattern coding technology.
The coding symbols used by the existing Chinese character coding technology comprise English characters, numbers and special symbols (such as "<", ">", "/", and the like). The code symbol used and the Chinese character 'etymon' have no internal relation or are the symbols of the strong guild. For example, the code symbols "G" and "F" in "the five-stroke font" correspond to "the fifth of the Wang-side blue head " and "the second Tu-Shi-Do ten-inch rain", and what reason is in the middle, none, you can only remember hard to carry; for another example, some character codes identify the Chinese character component '' as the English character 'P', identify the 'mouth' as 'O', and make people in cloud.
Each structural element in the Chinese character dynamic structural element set established by the invention has standard form information and standard sound state information.
The morphological information of the Chinese character dynamic structural element specification is characterized by three layers: strokes, components and finished words.
The phonetic state information of Chinese character dynamic structure element specification is characterized by Chinese phonetic first character corresponding to name.
Thus, the code symbol and Chinese character structure element related by the invention establish the internal relation, and the relation is the natural extension of the Chinese character knowledge of 'recognizing' and 'writing' for users. The problem of correspondence between the rote-remembered hard-backed code symbols and the etymons in the existing font codes is solved.
The dynamic structure of the Chinese characters is not only exquisite, but also unified and simple in structural rule. On the basis, the invention establishes the following specific scientific concept and easy-coding technical scheme.
Three-layer structure element for one-Chinese character and Chinese character
Any Chinese character is always composed of corresponding structural elements. For example, the "Li" structural elements are "wood" and "son" and "Chen" structural elements are "" and "east", the "Li" structural elements are "standing grain" and "vacuum control unit", and the "learning" structural elements are
Figure A0112400700051
Structural elements of the words "", "feather" and "force" are "learning" and "learning", respectivelyAnd, the structural elements of the "H" and "H" are "H" and "force", etc.
The structural elements of the Chinese characters are divided into three levels: 1. performing strokes; 2. a component; 3. and (5) forming characters.
Figure A0112400700053
And (iv) belonging to stroke structural elements;
"", "vacuum control unit" belongs to the side structure element;
the "wood" and "worker" belong to the word-forming structural elements.
Dynamic structure relationship of Chinese characters (generation sequence of dynamic structure elements)
1. Chinese characters are generated by structural elements thereof according to a standard sequence.
Such as "mu, zi, li", ", east, chen", is the correct and proper perception of the order of structural elements to generate chinese characters. No matter the characters are learned, written or typed, the order is conformed to the structural element specification, so that the operation is smooth, convenient and comfortable; violating the structural element specification sequence, it is difficult to be complicated. For example, the structure element and the standard generation sequence of the 'nation' word are 'Jiong, Yu, Yi and Guo', so that the recognition is clear, the writing is also beautiful, and the typing is simple, because the three are completely uniform.
2. Chinese characters are generated by dynamic structural elements according to a standard sequence.
As described above, the static "etymons" cannot completely and correctly reflect the structural relationship of the chinese characters themselves and the knowledge of the chinese characters by people. Because, any Chinese character is generated by the corresponding dynamic structural elements in the dynamic structural element set according to the standard sequence. In other words, the structural elements of a Chinese character are not stationary, fixed, but relative, floating.
Such as the "factory of structural elements
Figure A0112400700055
Vertical and horizontal stroke, left-falling stroke and sequence generation of Chinese characters; however, the good idea is not that the structure elements "A" and "factory, Horizontal, vertical, horizontal, vertical components are generated sequentially from the structural elements "catalyst", i.e., the so-called "catalyst", Cheng "sequences generate relationships.
For another example, the structural element I constitutes the Chinese character ten; the structural elements, the horizontal, the ground and the ground constitute Chinese characters, wood; the structural element 'wood and son' forms the Chinese character 'Li'.
It can be seen that the structural elements are dynamic and variable for different chinese characters. In other words, the external shape of a Chinese character is a dynamic structural relationship, rather than the concept of a static "radical" can cover.
Dynamic structure elements contain more information than static structures.
For example, the structural elements of the Chinese character "Nu" are obviously "woman" and "again". What the structural elements of the Chinese character ' Nu ' are, from the viewpoint of static ' etymon ', should be ' woman ', ' and ' power '; in the concept of dynamic structure, it should be "nu" and "force". Please note that the dynamic structural element "nu" contains the sound state information of "nu", i.e. the information of the sound side where "nu" is shown. In the static "radical" view observation, this tonal information is lost.
Three, Chinese character structure element title and code symbol
The pronunciation of the mandarin chinese character is the name of the chinese character. For example, "Li" reads "l ǐ," and "l ǐ" is the name of "Li" word, "Chen" reads "ch ren," and "ch ren" is the name of "Chen" word.
The stroke structural elements and the radical structural elements are the same as Chinese characters and are called by specification. For example, the strokes "horizontal" are termed "pi ě" i.e. "skimming", the strokes "フ" are termed "hengpi ě" i.e. "horizontal skimming", and the partial "vacuum control units" are termed "i im d ā o" i.e. "vertical knife".
The structural element designation may be expressed by pinyin.
The first character of the Chinese phonetic alphabet called by the structural element or the first character of the key word (aiming at the compound stroke and the component structural element) is the easy coding character of the invention naturally. The easy coding symbol is not related to tone, front nasal sound and back nasal sound, curled tongue sound and flat tongue sound; meanwhile, the Chinese character structure element is closely connected with the Chinese character structure element and the Chinese character. Therefore, the invention unifies the character learning, writing, typing and searching of the characters of people on the method and the technology, changes the recognition, writing, typing and searching of the characters of people into a matter, and can know the characters and understand the characters. More precisely, the easy coding technology naturally extends the existing knowledge and ability of 'recognition' and 'writing' of Chinese characters to the field of 'typing' and 'searching'.
The number of the first phonetic symbols is only 23, and the number of the corresponding easy-to-code symbols is only 23, namely
a,b,c,d,e,f,g,h,j,k,l,m,
n,o,p,q,r,s,t,w,x,y,z。
If the invention is applied to the Chinese character input of the computer, the key positions of the computer keyboard corresponding to the code symbols are as follows:
Q W E R T Y O P
A S D F G H J K L
Z X C B N M
four, stroke structure element easy coding
The stroke structural element is the most basic and smallest Chinese structural element.
The number of strokes of each Chinese character is slightly different but is generally consistent. From the technical application perspective, the invention divides the strokes of the Chinese characters into two types, namely simple strokes and composite strokes.
All strokes of a Chinese character are described by only one Chinese character, and are called as pure strokes of the Chinese character.
There are six kinds of simple strokes:
1. 'dian (dot)'; "one (cross)"; "I (vertical)"; -drawings (skimming); "ground (pressed)"; 6."
Figure A0112400700071
(mention) ".
It can be seen that the six pure strokes do not have any turning condition in any direction from beginning to end.
All strokes whose characteristics can be accurately described only by using two or more Chinese characters are called composite strokes of Chinese characters.
The composite strokes are of three types. Note that not three, but three types.
1. The above-mentioned classes, such as,(vertical lifting),(horizontal folding lifting);
2. hooks, e.g. Chinese character strokes (horizontal hook), strokes (vertical hook),
Figure A0112400700074
(horizontal folding hook);
3. folds, such as  (cross fold), "the formazan (vertical fold)," the angle (left fold); and so on.
Compound strokes are, in fact, interconnected compounds or extensions of simple strokes.
Because "The Chinese character stroke is a simple stroke and can form a composite stroke, so that the six simple strokes and the three composite strokes form eight basic strokes for people to recognize and write Chinese characters. The traditional calligraphy and the modern calligraphy of China roughly summarize the basic strokes of Chinese characters by using 'permanent' characters, so that the Chinese character calligraphy has the theory of 'permanent eight-character calligraphy'.
In the modern Chinese character retrieval field, people often merge eight basic strokes into five strokes, namely 'Chinese stroke (sitz merging in the stroke)', 'one (Chinese character)' (Chinese character)And wherein) "," -, "(all turning strokes are incorporated)".
It must be noted that the five basic strokes in the field of Chinese character retrieval cannot be directly used in the fields of character learning, writing and typing. The five basic strokes in the search field are specified only for the convenience of search, and have great difference with the original structural form of the Chinese characters. If we must not transplant, we must establish several rules as the cost, so as to establish the relationship between the search rule and the original structural form of Chinese characters. Of course, this inevitably deviates from the simplicity and is cumbersome to add. Imagine how "bobbing" and "ground stone" with inward contraction as "point" and outward spread as "right-hand", are different in cognition and writing practice.
The table below shows the result of the combing and the corresponding list of the code symbols, the corresponding title is in parentheses, and the corresponding example character is in square brackets.
In thirty stroke forms generated by eight basic strokes, the last character named by each stroke form (the last character named by a single stroke is the first character) reflects the most main characteristics (belonging to the stroke category) of the stroke form, and the characteristics are the characteristics of receiving strokes during writing and moving strokes. By grasping the characteristic, the stroke structural element is easy to encode and is clear at a glance.
For example, compound strokes ""it is called" horizontal folding hook ", the receiving pen is hook, belongs to the hook stroke, the easy code symbol is" g "certainly; for another example, compound stroke "フ" is called "horizontal left-falling stroke", and the first stroke is left-falling stroke, which belongs to the left-falling stroke, and the easy-coding symbol is "p" naturally.
Five, radical structure element easy coding
The radical structural element refers to a stroke combination structure which can not be used as an independent character, such as ' (vertical heart side)', 'single side' and the like, and the following table shows all the radical structural elements and corresponding easy coding characters related to the invention.
Figure A0112400700092
Figure A0112400700101
Six, word-forming structure element easy coding
The Chinese characters can exist independently. The Chinese character forming structural element is a Chinese character which can independently exist and serves as a structural element in another Chinese character. For example, "wood" and "son" are both character-forming, and in the Chinese character "plum", the "wood" and "son" are character-forming structural elements; both the "worker" and the "force" are character-forming, and in the Chinese character "Gong", the "worker" and the "force" serve as character-forming structural elements.
Obviously, the character-forming structural element can use the first character of the Chinese phonetic alphabet of the pronunciation thereof as an easy coding character, and the character-forming structural element completely corresponds to the easy coding character and is very natural.
The following table lists the pronunciations and corresponding easy-coding symbols of all Chinese characters.
Coding symbol Chinese character structure element (indistinguishable Chinese phonetic tone)
a a,ai,an,ang,ao,
b ba,bai,ban,bang,bao,bei,ben,beng,bi,bian,biao,bie,bin,bing,bo,bu,
c ca,cai,can,cang,cao,ce,cei,cen,ceng,cha,chai,chan,chang,chao,chen,cheng,chi,chong,chou,chu,chua,chuai,chuan,chuang,chui,chun,chuo,ci,cong,cou,cu,cuan,cui,cun,cuo
d da,dai,dan,dang,dao,de,dei,den,deng,di,dia,dian,diao,die,ding,diu,dong,dou,du,duan,dui,dun,duo
e e,ê,en,eng,er
f fa,fan,fang,fei,fen,feng,fiao,fo,fou,fu
Coding symbol Word-forming structural element (Indistinguishable Chinese phonetic tones
g ga,gai,gan,gang,gao,ge,gei,gen,geng,gong,gou,gu,gua,guai,guan,guang,gui,gun,guo
h ha,hai,han,hang,hao,he,hei,hen,heng,hm,hng,hong,hou,hu,hua,huai,huan,huang,hui,hun,huo
j ji,jia,jian,jiang,jiao,jie,jin,jing,jiong,jiu,ju,juan,jue,jun
k ka,kai,kan,kang,kao,ke,kei,ken,keng,kong,kou,ku,kua,kuai,kuan,kuang,kui,kun,kuo
l la,lai,lan,lang,lao,le,lei,leng,li,lia,lian,liang,liao,lie,lin,ling,liu,long,lou,lu,luan,lüe,lun,luo
m m,ma,mai,man,mang,mao,me,mei,men,meng,mi,mian,miao,mie,min,ming,miu,mo,mou,mu
n n,na,nai,nan,nang,nao,ne,nei,nen,neng,ng,ni,nian,niang,niao,nie,nin,ning,niu,nong,nou,nu,nü,nuan,nüe,nun,nuo
o o,ou
p pa,pai,pan,pang,pao,pei,pen,peng,pi,pian,piao,pie,pin,ping,po,pou,pu
q qi,qia,qian,qiang,qiao,qie,qin,qing,qiang,qiu,qu,quan,que,qun
r ran,rang,rao,re,ren,reng,ri,rong,rou,ru,ruan,rui,run,ruo
s sa,sai,san,sang,sao,se,sen,seng,sha,shai,shan,shang,shao,she,shei,shen,sheng,shi,shou,shu,shua,shuai,shuan,shuang,shui,shun,shuo,si,sāng,sou,su,suan,sui,sun,suo,
t ta,tai,tan,tang,tao,te,tei,teng,ti,tian,tiao,tie,ting,tong,tou,tu,tuan,tui,tun,tuo
w wa,wai,wan,wang,wei,wen,weng,wo,wu
x xi,xia,xian,xiang,xiao,xie,xin,xing,xiong,xiu,xu,xuan,xue,xun
y ya,yan,yang,yao,ye,yi,yin,ying,yo,yong,you,yu,yuan,yue,yun
z za,zai,zan,zang,zao,ze,zei,zen,zeng,zha,zhai,zhan,zhang,zhao,zhe,zhei,zhen,zheng,zhi,zhang,zhou,zhu,zhuai,zhuan,zhuang,zhui,zhun,zhuo,zi,zong,zou,zu,zuan,zui,zun,zuo
Theoretically, all Chinese characters can be used as character forming structural elements for easy coding of the Chinese characters. But in practice this may not be done completely. Why? Because some Chinese characters are very rare, the learners are few, and the character-forming ability is very low. If the Chinese characters are used as character forming structural elements, difficulties are inevitably caused to users. Then, what Chinese characters can be used as character-forming structural elements, what are not, how should the criteria be determined?
The "common word" is the basic standard of easy coding of word-forming structural elements. Then, shIs the standard of a commonly used word? The first-level Chinese characters in the national standard are common character sets. Generally speaking, all Chinese characters exceeding the national standard first-level Chinese character set belong to the non-used characters. After repeated research and test, the Chinese characters which can be used for forming character structural elements except for common characters are limited to the following table.
The unused word marked with the symbol in the table can still be used for degrading the code. For example, the unusual use of the word "" can be used as word-forming feature or degraded as stroke feature "and" as "ground". Taking the Chinese character "words" as an example, it can be regarded as the composition of the structural elements "" and "", and also as the combination of the structural elements "", "horizontal" and "ground".
Basic rule of easy coding for dynamic structure of seven, Chinese characters
Summarizing the technical points, the invention forms the following basic rules of easy coding:
for convenience of description, the four-code scheme is used as an example below.
According to this basic rule, it is obvious that the easy code of "wood, seed, plum" is "mzl"; ", east, Chen" corresponds to "edc"; "He, vacuum control unit", Li "corresponds to" hdl "; "Vertical and horizontal, force "corresponds to" gpl "; "Fang, Earth, Kun" corresponds to "fftk"; "death, mouth, month, shellfish, fan, win" corresponds to "wkyb", etc.
Easy coding structure for eight-character and Chinese character
1. Chinese character coding structure
Standard structure: [ structural element name first character + whole character initial character ].
Hereinafter, "structural element name first character" is abbreviated as "structural element"; the whole-word initial consonant character is abbreviated as a whole word or a word.
For example, "one" is "hy". Wherein,
"h" is an initial named "renghua" of the structural element "one", i.e. the abscissa;
"y" is the first character of the Chinese character "the Chinese Pinyin" y ī ".
2. Two-structure element Chinese character coding structure
Standard structure: [ first structural element + last structural element + whole word ].
For example, "a bougainvillea, a branch, a skill" is "szj"; "Kaa", a, o ".
The easy coding of most two-structure element Chinese characters belongs to standard structure.
The structure is simplified: [ first structural element + last structural element ]. Such as, for example,
"gunn" ═ eh "," old "═ ed", and so on.
Some simplified structure of Chinese character coding and standard structure exist at the same time,
such as ", east, old" ═ ed "═ edc".
And (3) expanding the structure: [ first structure element + last structure element + whole word + last bit identification code ].
Wherein, the last identification code is also called as duplication code identification code, and the meaning is: "the whole word end structure element contains the maximum structure element without stroke.
For example, the Chinese character "layer" corresponds to an easy coding structure of "corpse, cloud, layer, " ═ sys "; wherein "" is the "last digit identifier" which is the "largest structural element including the last stroke" in the "layer" word end structural element "cloud".
For another example, the Chinese character "material" has a corresponding coding structure of "rice, bucket, material, ten" ═ mdls "; wherein, the ten is the last identification code which is the maximum structural element containing the last stroke in the last structural element of the material character.
3. Three-structure element Chinese character coding structure
Standard structure: [ first structural element + second structural element + last structural element + whole word ].
For example, the Chinese character "inverse", the corresponding coding structure "phys", such as "phyf"; the Chinese character ' bi ' has a corresponding coding structure of ' wang, white, stone, bi ' ═ wbsb '.
The standard structure can also be simplified to be
[ primary structural element + secondary structural element + last structural element ] or [ primary structural element + secondary structural element ].
"wang, bai, shi" ═ wbs "═ bi"; ", ," ═ eb "═ pottery".
Some simplified structure of Chinese character coding and standard structure exist at the same time,
such as ", , fou, pottery" ═ ed "═ edf" ═ ebft ".
If the Chinese characters are encoded by degrading in potteries, the Chinese characters will be changed into four-structural-element Chinese characters.
4. Four-structure element and more than four-structure element Chinese character coding structure
Standard structure: [ primary structural element + secondary structural element + third structural element + fourth structural element ].
For example, the chinese character "pottery" corresponds to easy codes ", , wu, u, pottery" ═ ebwk "; the Chinese character "front", the correspondent easy code is ", one, month, vacuum control unit" ═ bhyd "; the Chinese character "boon" is the corresponding easy code "Yi, Shen, Yi, Shi-Shi, Xin, hui" ═ hshd ".
Some Chinese characters with four or more than four structural elements also have corresponding simplified coding structures. Such as, for example,
[ primary structural element + secondary structural element + third structural element ] or [ primary structural element + secondary structural element ].
", earth, eight" ═ etb "═ tomb"; ", one" ═ bh "═ before".
Easy coding structure for nine words
The words referred to herein include multiple words and phrases. The easy coding of words uniformly adopts a standard structure.
1. Easy coding structure for two-word
Standard structure: [ first character structural element + first character + last character structural element ]. Such as, for example,
"yingxin" ═ "," yingxin "," heart "," dian ", or" dzxd ";
"thank" ("salty", "kang", "thank", "xie", "emitting" ("xgxs");
"china" ═ mouth "," middle "," country "and" one "═ kzgh";
"forward" ═ "," forward "-" (bqjz);
"cannot" ═ two "," none "," fad "-" ewfq ";
"front" and "front" vacuum control unit "and" emqd "; and so on.
2. Three-word easy coding structure
Standard structure: [ first word + second word + last word last structural element ]. Such as, for example,
"actually" ═ real "," actual "," upper "," one "═ sjsh";
"turn between eyes" ═ turn "," eyes "," turn "and" day "-" zyjr ";
"attention" - "force" - "zylp" - "horizontal force" - "zylp" -;
"motto" ("motto" ")", "motto" "," right "", "and" zymm ";
"State Council" means "state", "affairs", "Hospital" and "finished" means "gwywyw"; and so on.
3. Four-word easy coding structure
Standard structure: first word + second word + third word + last word. Such as, for example,
"sailing" means "one", "sail", "wind" and "sail";
grin-through ″' grin "" mouth "";
"important head" ═ heavy "," important "," off "and" head "-" zygt ";
"middle and primary school students" ═ middle "," small "," school "and" students "-" zxxs ";
"straight", "no", and "center", may be an unblendable property;
"people daily report" ═ people "," day "," newspaper "═ zmrb";
"common center" ═ middle "," common "," middle ", and" central "═ zgzy"; and so on.
4. Easy coding structure for words with more than four characters
Standard structure: first word + second word + third word + fourth word. Such as, for example,
"the people's republic of China" ═ zhrm ";
"Hanxin Bingchang duo-good" ═ hxdb ";
"political economics" ═ zzjj ";
"central tv station" ═ zyds; and so on.
The structure of the abbreviation: the standard structure [ initials + secondary words + third words + fourth words ] is still adopted after the abbreviation. Such as, for example,
"China people Bank" ═ China people line "═ zgrh";
"central civil broadcasting station" ═ central station "═ zydt";
"China Industrial and commercial Bank" - "China industry" - "zggh"; and so on.
The simple and easy coding structure of words with more than four characters is generally used for self-defined coding of special words of users.
One of the application examples of the present invention: computer Chinese character and word input scheme-easy input method.
An input device: a computer standard keyboard.
The longest code length: 4 bits.
The words are compiled at the same time: the characters adopt an easily coded standard structure, a simplified structure and an extended structure, and the words adopt an easily coded standard structure; the word volume covers the modern Chinese dictionary and the Xinhua dictionary. The total code rate is 2%.
Wildcard querier: is there a . Wildcard character? "can be used at any coding bit so that fuzzy input and coding queries can be performed.
Some easy-to-encode examples are enumerated: 1. all one code word
Q W E R T Y U I O P
And is I am While Human being He has a main body Is provided with Zone(s) Pie
A S D F G H J K L
Love Is that Is/are as follows Hair-like device Worker's tool And then is turned on This is achieved by To master
Z X C V B N M
In that Lower part Go out Is not limited to You are People
2. Two code word example ab represents three forward progress of the BhdhchzzqqqqqqqqqbqzbqbqzbqzbqzbqzbqbzbqbzbqbqbzbzbzbqbzbzbqbzbzbqbzbzbzbzbqbzbzbzbzbqbzbzbzbqbzbzbzbzbzbzbqbzbzbzbzbzbzbBfyzbzbzbzbBfyzbzbzbqbzbzbzbzbzbqbzbzbzbzbzbzbzbzbzbzbzbBfyzbzbzbzbzbzbzbBfyzbzbzbzbzbzbzbBfyzbzbzbzbzbBfyzbzbBfyzbzbzbBfyzbzbzbzbBfyzbzbzbzbzbzbzbBfyzbBfyzbBfyzbBfyzbzbBfyzbzbzbBfyzbzbzbzbzbBfyzbBfyzbBfyzbzbzbzbzbx China special zgts socialism shzy marquey mlzy deep immeasurable sbkc eager diggingwell lkjj
8. Words with more than four characters
Mark Si Lin Ning Zhi mksl
Duncao theory dxpl
Hundred rule head further bcgt
Zhrm of the people's republic of China
Open fund kfsj
The second application example of the invention: dictionary and dictionary fast query scheme-easy retrieval method.
The easy retrieval method can be applied to the compilation of text dictionaries and can also be applied to the compilation of electronic dictionaries and electronic dictionaries.
Easy-to-retrieve methods may compile word dictionaries using a variety of schemes, such as:
1. easy coding equal length structure (complement code symbol o for short length) ordering (suitable for medium and large dictionary);
2. easy coding standard structure ordering (suitable for small-sized word dictionary); and the like.
The easy retrieval method can adopt word separation and easy coding sorting compilation, and also can adopt word mixing and easy coding uniform sorting compilation.
In the following, an example of a "word order compilation with an easily-coded equal-length (four-code) structure" scheme is given:
the so-called "babbb" -c.b.c.b. * cbc "-c.b.c.b. of the chelate ayah abbbr ayak toilet bowl bd is more than bbbg-b.b.b.b.b.b.b.b.b.b.b.b.b.b.b.b.b.b.b.b.b.b.b.b.b.b.b.b.b.b.b.b.b.b.b.b.b.b.c. cbbd bud cbbs.... pottery ebft strang ebmb arrange ebmz..... Limited egxn in egyo gunn ehgr, these ehgr deaf egkb deficient egko.. fath fbg squall fbbs poor fbpr.. 9. An fybb guava fybh.... an fzfn fzkk.. an. oagan storehouse gaaw guest gbbb target gbbg.. an. nange ngyd sqyd.g. an. galyd Hoogyd.g.. an. warm hayd gayb bb. Abortive hbbbThe selection of xzzn phobic xzzr choking xzzt.. the selection of amine yaan yaay.. the selection of chopsticks zzr trail zzzzzzzzzzs

Claims (9)

1. A Chinese character dynamic structure easy coding technology is characterized in that: the Chinese character dynamic structure relation and dynamic structure element set which accord with the cognition standard are taken as a coding object set, the dynamic structure element title or the Chinese pinyin first character of the title key word is taken as a coding character, and the Chinese character word is coded according to the easy coding rule and the easy coding structure.
2. The dynamic structure easy coding technique for Chinese characters according to claim 1, characterized in that: the number and title of the stroke structural elements and the radical structural elements in the Chinese character dynamic structural element set are determined, and the stroke structural elements and the radical structural elements have clear corresponding relation with the code symbols.
3. The dynamic structure easy coding technique for Chinese characters according to claim 1, characterized in that: the common character-forming structural elements belonging to the character-forming structural elements in the Chinese character dynamic structural element set have definite corresponding relation with the coding symbols.
4. The dynamic structure easy coding technique for Chinese characters according to claim 1, characterized in that: the number of the non-common character-forming structural elements belonging to the character-forming structural elements in the Chinese character dynamic structural element set is limited, and the non-common character-forming structural elements have clear corresponding relation with the code symbols.
5. The dynamic structure easy coding technique for Chinese characters according to claim 1, characterized in that: the easy coding of Chinese character and word follows the easy coding rule and conforms to the easy coding structure of Chinese character and the easy coding structure of word.
6. The dynamic structure easy coding technique for Chinese characters according to claim 1, characterized in that: the last bit identification code in the easy coding structure of the Chinese character is a coding symbol corresponding to the maximum structural element containing no stroke in the whole character last structural element.
7. The dynamic structure easy coding technique for Chinese characters according to claim 1, characterized in that: the easy coding technology is applied to Chinese character and word input of computer and other information processing equipment to form an easy input method.
8. The dynamic structure easy coding technique for Chinese characters according to claim 1, characterized in that: the easy coding technology is applied to lexicography to form an easy retrieval method.
9. The dynamic structure easy coding technique for Chinese characters according to claim 1, characterized in that: the easy coding technology can generate a plurality of Chinese character word coding schemes of 3-bit code symbols and code symbols above 3 bits.
CN 01124007 2001-08-06 2001-08-06 Chinese character dynamic structure easily-coding technique Pending CN1400522A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 01124007 CN1400522A (en) 2001-08-06 2001-08-06 Chinese character dynamic structure easily-coding technique

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 01124007 CN1400522A (en) 2001-08-06 2001-08-06 Chinese character dynamic structure easily-coding technique

Publications (1)

Publication Number Publication Date
CN1400522A true CN1400522A (en) 2003-03-05

Family

ID=4665435

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 01124007 Pending CN1400522A (en) 2001-08-06 2001-08-06 Chinese character dynamic structure easily-coding technique

Country Status (1)

Country Link
CN (1) CN1400522A (en)

Similar Documents

Publication Publication Date Title
CN101055588A (en) Method for catching limit word information, optimizing output and input method system
CN1389777A (en) Chinese input system
CN1400522A (en) Chinese character dynamic structure easily-coding technique
CN1186711C (en) Mongol input method
CN1227930A (en) Computer Chinese character shaohui code input method
CN1102768C (en) Chinese character sound-shape coding input method for electronic computer
CN100397303C (en) Chinese input system and method
CN1150442C (en) Standard writing sequence code for teenager
CN1121645C (en) Sound and shape word code Chinese character input method
CN106484133B (en) The method for carrying out input in Chinese using handwriting stenograph symbol
CN1129058C (en) Chinese character phonetic code and keyboard design
CN1573662A (en) Method and system for input and output of chinese character
CN1105962C (en) Grammar type phonetic Chinese input method and its dialogue keyboard
CN1303036A (en) Chinese-Roman phonetic letters decoding method and input keyboard with it
CN1818837A (en) Chinese character inputting method of normalizing applied Chinese phonetic alphabet scheme
CN1164689A (en) Computer input method for Chinese characters&#39; sound pattern meaning based on word and Chinese-Spanish compatible keyboard
CN1173253C (en) Ten-key four-code Chinese handset input method
CN1532676A (en) Keyboard and non-keyboard chinese character input method
CN1866176A (en) English computer input method
CN1975640A (en) Chinese inputting method for normalizing application of Chinese phonetic Alphabet scheme
CN1244680A (en) Alphabet inspection method utilizing pronunciation and wildcard
CN1186710C (en) Colour-fitting digit input method
CN1492303A (en) Two division Chinese character coding small keyboard input and its display method
CN1815421A (en) Chinese character polyphony coding input method
CN1564111A (en) Chinese character input method of using voice cocking code

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication