CN100501649C - Shape-pronunciation encoding input method of Chinese characters - Google Patents

Shape-pronunciation encoding input method of Chinese characters Download PDF

Info

Publication number
CN100501649C
CN100501649C CNB2007100658196A CN200710065819A CN100501649C CN 100501649 C CN100501649 C CN 100501649C CN B2007100658196 A CNB2007100658196 A CN B2007100658196A CN 200710065819 A CN200710065819 A CN 200710065819A CN 100501649 C CN100501649 C CN 100501649C
Authority
CN
China
Prior art keywords
stroke
chinese
word
letter
yard
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB2007100658196A
Other languages
Chinese (zh)
Other versions
CN101038517A (en
Inventor
施冰
段利华
李锟华
陈本辉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dali University
Original Assignee
Dali University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dali University filed Critical Dali University
Priority to CNB2007100658196A priority Critical patent/CN100501649C/en
Publication of CN101038517A publication Critical patent/CN101038517A/en
Application granted granted Critical
Publication of CN100501649C publication Critical patent/CN100501649C/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

The invention relates to a Chinese character pattern and sound encoding input method, which belongs to the computer letters information processing field. The Chinese character is classified into a single component character and double-component character, the primary stroke of the Chinese character is classified into three categories: the inclined, the horizontal and the vertical and which are correspondent to three lines of English letters keys respectively, the second stroke secondary stroke is classified into six categories: the horizontal, the vertical, the left-fallings, the right-fallings, the single-turnings and the double turnings and which are correspondent to the third to the eighth row of the English letters keys respectively, the relation between the primary stroke and the second stroke is classified into the 'crossing' and 'square' types and which are correspondent to the first and second row of the English letter keyboard, thereby a 3*8 two-dimensional coordinate encoding key system is formed. The first and the second stroke of the two-component word are obtained by the combination of the primary and secondary stroke, the third stroke is the primary letter of the pinyin of the word that forms by the first component with the maximum stokes, the fourth stroke is the initial letter of the pinyin of the word; the single-component word acquires the stroke pair by the writing sequence, the next stroke is the initial letter of the pinyin of the word, the encoding length of the word is four-word. The invention has simple, direct and standard disintegration method, unique key-encoding arrangement, small burden on memory, and short study period, the user can input the Chinese character after being familiar with the lines and rows of the primary and secondary stroke of the Chinese word, the pattern and the primary letter of the initial consonant of the Chinese word.

Description

A kind of shape-pronunciation encoding input method of Chinese characters
Technical field
The present invention relates to a kind of encode method for entering Chinese characters, is a kind of ideophone coded input method based on the Chinese character simple classification, belongs to the computer Chinese information process field.
Background technology
In recent years, though computer technology is constantly developed, its range of application is also constantly enlarging, operate but still rely on the encode Chinese characters for computer mode in large quantities in the computer Chinese-character input, the standard of Chinese-character input scheme in the computer Chinese information processing procedure, easily and input speed, accuracy etc. remain one of main bottleneck of restriction user job efficient.The present situation of Hanzi keyboard input is: though the situation of " ten thousand yards Pentium " has appearred in encode Chinese characters for computer, but compliant, easily, encoding scheme efficient, that be fit to conventional user learning can be counted on one's fingers, especially in the Hanzi keyboard of school input teaching except spelling input method, do not have better Chinese-character input scheme, directly influenced the raising of student's Chinese characters for keyboard inputting level.At present, the problem of encode Chinese characters for computer existence is mostly:
1, the font code scheme adopts the root coding method mostly, and the quantity of radical is many, learn radical by heart, also will remember the distribution of radical, and the disassembly principle of GPRS radical and coding rule make the root coding scheme difficult note that finds it difficult to learn.Therefore, the root coding scheme is easy to generate that coding is lack of standardization, cataloged procedure is complicated, memory capacitance is big, input method finds it difficult to learn, a period of time do not use and a series of problems such as will forget.
2, simple sound code plan is to unacquainted Chinese character or read inaccurate Chinese character and be difficult to typing, and because Chinese character has only more than 400 syllable, repetition rate of coding height (as spelling, Two bors d's oeuveres scheme).Therefore, input efficiency is low, can't import the Chinese character that can not read, can not adapt to that various level personnel use is the defective of phonetic coding scheme, and these problems can not fundamentally be resolved in phonetic coding scheme.
In fact, Chinese character is the graphical symbol that is made of " sound, shape, justice " three elements, and Hanzi keyboard input coding scheme all is to utilize " shape " and " sound " of Chinese character will usually encode for two kinds.In the Hanzi coding scheme design, extract Chinese character phonetic initial letters, part stroke and order of strokes observed in calligraphy information, can embody Hanzi features, simplified the information of Chinese character " shape " and " sound " again, both helped choosing of code element, be easy to user's grasp again.Make full use of the information of Chinese character " shape " and " sound ", can reduce the repetition rate of coding of coding naturally, scheme is easy to learn and use.
Along with progressively enlarging of computer application field and deepening continuously of level of application, the complicacy of computer Chinese input method and learnability have become one of principal element of restriction Chinese character processing technology development, therefore, be necessary to explore simple, easily, standard, Hanzi coding input method fast.
Summary of the invention
The object of the present invention is to provide a kind of shape-pronunciation encoding input method of Chinese characters, as long as be familiar with the row of Chinese character first-stroke place keyboard, the row of inferior stroke place keyboard, and Chinese character body and first phonetic letter, just can import Chinese character, be fit to the personnel's study and the use of any level, can not forget after the grasp.
The present invention realizes by following technical proposal: Chinese character is divided into two kinds of single character and two body words by natural structure, again the first sum of picture of each body of Chinese character is divided into tiltedly, horizontal, perpendicular three classes, corresponding with the triplex row of English letter keyboard respectively, inferior stroke is divided into horizontal stroke, perpendicular, cast aside, press down (point), single folding, roll over six classes again, the the 3rd to the 8th row with English letter keyboard are corresponding respectively, first, be divided between the inferior stroke and intersect and square frame two classes, respectively with first of English letter keyboard, the secondary series correspondence, form 3 * 8 two-dimensional coordinate key letter position, wherein tiltedly comprise left-falling stroke, press down, the point, carry four kinds of strokes, perpendicular comprise perpendicular and roll over two kinds of strokes, concrete corresponding relation is seen Fig. 1.
Two body words and single character are by following rule encoding, and the Chinese character maximum code length is four yards:
1, two body words:
About two body words comprise, about, inside and outside three kinds of structures, according to stroke order be divided into the 1st body word and the 2nd body word, its coding rule is:
First yard: the letter key that the head of the 1st body word, the corresponding English letter keyboard row, column of inferior stroke intersect;
Second yard: the letter key that the head of the 2nd body word, the corresponding English letter keyboard row, column of inferior stroke intersect;
Trigram: the 1st body becomes word, gets the Chinese Pin Yin initial that it becomes font;
The 1st body does not become word, the Chinese Pin Yin initial of the great achievement font of getting that several leading stroke is formed in this body;
Do not have great achievement font in the 1st body, get the Chinese Pin Yin initial (seeing Table 1) of this body radical;
Do not meet above listed situation, get the Chinese Pin Yin initial (seeing Table 2) of the first sum of picture of first body;
The 4th yard: the Chinese Pin Yin initial of the Chinese character of compiling;
2, single character:
By the Chinese-character stroke sequential write single character is divided into: one, two stroke words, three, four stroke words, five and above word, all kinds of type-words are pressed the row rule encoding:
(1) one, two stroke words:
First yard: the letter key that the corresponding English letter keyboard row, column of first, inferior stroke intersects;
Second yard; The Chinese Pin Yin initial of Chinese character;
Trigram: English alphabet O key;
(2) three, four stroke words:
First yard: the letter key that the corresponding English letter keyboard row, column of first, inferior stroke intersects;
Second yard: the letter key that the 3rd stroke, the corresponding English letter keyboard row, column of end stroke intersect;
Trigram: the Chinese Pin Yin initial of Chinese character;
The 4th yard: English alphabet O key;
(3) five strokes, the above word of five strokes:
First yard: the letter key that the corresponding English letter keyboard row, column of first, inferior stroke intersects;
Second yard: three, the crossing letter key of the corresponding English letter keyboard row, column of the 4th stroke;
Trigram: five, the crossing letter key of the corresponding English letter keyboard row, column of end stroke;
The 4th yard: the Chinese Pin Yin initial of Chinese character;
3, above by stroke to the process of carrying out code fetch in, intersect or preferential code fetch during square frame if right first stroke of stroke and second stroke constitute.
The specific coding method:
(1) two body word:
The coded sequence of two body words is (letter key that the corresponding English letter keyboard row, column of the head of the 1st body word, inferior stroke intersects) (letter key that the head of the 2nd body word, the corresponding English letter keyboard row, column of inferior stroke intersect) (the 1st body becomes the Chinese Pin Yin initial of word) Chinese Pin Yin initial of volume Chinese character (), that is:
1, first yard and second yard is respectively the letter key that the 1st body and the 2nd body head, the corresponding English letter keyboard row, column of inferior stroke intersect, wherein constitute to intersect and preferential code fetch during square frame, as:
" sign indicating number " is split as " stone; horse " two body words, the 1st body is " stone ", its the first sum of picture is " one ", inferior stroke is " Pie ", corresponding English letter keyboard is capable, the letter key that row intersect is " g " (seeing the 2nd row the 5th row among Fig. 1), the 2nd body is " horse ", its the first sum of picture Shi “ Ya " (single folding); inferior stroke is " ㄅ " (multiple folding); corresponding English letter keyboard is capable; the letter key that row intersect is " l " (seeing the 3rd row the 8th row among Fig. 1); trigram is the Chinese Pin Yin initial " s " of the 1st body " stone "; the 4th yard is the Chinese Pin Yin initial " m " of this Chinese character, and therefore coding is respectively " glsm ".
" dish " is split as " boat, ware ", and the 1st yard and the 2nd yard is respectively " t, x " (second body is a square frame).
" body " is split as " Ren, basis ", and the 1st yard and the 2nd yard is respectively " r, a " (second body is for intersecting).
2, trigram is determined coding in the following order:
(1) the 1st body when this word of composition becomes word, gets the Chinese Pin Yin initial that the 1st body becomes word, as:
The Chinese Pin Yin initial of the 1st body " stone " of " sign indicating number " is " s ";
" " the 1st body be " soil ", Chinese Pin Yin initial is " t ";
The 1st body of " dashing forward " is " cave ", and Chinese Pin Yin initial is " x ".
(2) do not become word when the 1st body, get in this body first letter of pinyin by the great achievement font of sequential write (several leading stroke), as:
The great achievement font of the 1st body is " Si " in " energy ", is encoded to " s ";
The great achievement font of the 1st body is " rice " in " breaking ", is encoded to " m ";
The great achievement font of the 1st body is " standing " in " firm ", is encoded to " l ";
(3) do not have great achievement font, get the Chinese Pin Yin initial (seeing Table 1) of the 1st body radical, as:
The 1st body is " Ren " in " generation ", is encoded to " r ";
The 1st body is "stripe of a tiger" in "tiger", is encoded to "h".
The 1st body is " Rui " in " ditch ", is encoded to " s ".
When (4) not meeting above listed situation, get the Chinese Pin Yin initial (seeing Table 2) of the first sum of picture of the 1st body, as:
The 1st body Wei “ Myeon of " stone " ", the Chinese Pin Yin initial of the first sum of picture is " h ";
The first sum of picture of first body is " Pie " in " system ", and its Chinese Pin Yin initial is " p ";
First body of " party " be " ", the first sum of picture is " Shu ", its Chinese Pin Yin initial is " s ".
3, the 4th yard is the Chinese Pin Yin initial of this Chinese character.
(2) single character:
Single character is divided into by the Chinese-character stroke sequential write: one, two stroke words, and three, four stroke words, three types of five and above words, by following rule encoding:
1, the coded sequence of one, two stroke words is:
First yard: the letter key that the corresponding English letter keyboard row, column of first, inferior stroke intersects;
Second yard: the Chinese Pin Yin initial of Chinese character;
Trigram: English alphabet O key.
2, the coded sequence of three, four stroke words is:
First yard: the letter key that the corresponding English letter keyboard row, column of first, inferior stroke intersects;
Second yard: the letter key that the 3rd stroke, the corresponding English letter keyboard row, column of end stroke intersect;
Trigram: the Chinese Pin Yin initial of Chinese character;
The 4th yard: English alphabet O key.
3, the coded sequence of five strokes, the above word of five strokes is:
First yard: the letter key that the corresponding English letter keyboard row, column of first, inferior stroke intersects;
Second yard: three, the crossing letter key of the corresponding English letter keyboard row, column of the 4th stroke;
Trigram: five, the crossing letter key of the corresponding English letter keyboard row, column of end stroke;
The 4th yard: the Chinese Pin Yin initial of Chinese character.
That is: single character is got 3 groups of strokes at most by the Chinese-character stroke sequential write combination is encoded, next code is the Chinese Pin Yin initial of Chinese character, when the stroke number of forming Chinese character less than 5 the time, finish coding in first phonetic letter coding back word adding mother " O " expression.As:
Being encoded to of " one " " dyo "; Being encoded to of " ten " " aso ";
Being encoded to of " soil " " adto "; Being encoded to of " king " " dcwo ";
Being encoded to of " jade " " dcyy "; Being encoded to of " life " " eads ";
Being encoded to of " first " " xdvj "; Being encoded to of " basis " " aydb ";
Being encoded to of " string " " xszc "; Being encoded to of " weight " " exdz " or " exdc ".
Description of drawings
Fig. 1 and forms the key letter bitmap of 3 * 8 two-dimensional coordinates for the relation (intersecting and square frame two classes) between the first sum of picture (oblique, horizontal, vertical three classes) of each body of Chinese character of the present invention, inferior stroke (horizontal, vertical, cast aside, press down (point), single folding, roll over six classes again), first, the inferior stroke is listed as correspondingly with triplex row, the 3rd to the 8th row, first and second of English letter keyboard respectively.The i.e. key letter position corresponding relation figure of a centering head, inferior stroke.
Annotate: the third line and the 8th is listed as because of symbol ", " key among Fig. 1, actual corresponding letter " L " key during coding.
Embodiment
The present invention embodiment that encodes sees Table 3:
The dissimilar encode Chinese characters for computer examples of table 3
Chinese character Classification Coding Chinese character Classification Coding
Single character (unicursal word) dyo Tongue Two body words (up-down structure) exqs
Ten Single character (two stroke words) aso High Two body words (up-down structure) exdg
The scholar Single character (three stroke words) adto Knit Two body words (left and right sides structure) mxrz
Five Single character (four stroke words) fcwo Converge Two body words (left and right sides structure) yjsh
This Single character (five stroke words) aydb And Two body words (external and internal compositions) xdsq
Really Single character (the above word of five strokes) xdhg Occupy Two body words (external and internal compositions) casj
Table 2 stroke first letter of pinyin coding schedule
Stroke The first letter of pinyin coding Stroke The first letter of pinyin coding Stroke The first letter of pinyin coding
One h Shu s Pie p
Dian d Ya (Yin 亅  ) z
Table 1 radical, radicals by which characters are arranged in traditional Chinese dictionaries first letter of pinyin coding schedule
Radical or radicals by which characters are arranged in traditional Chinese dictionaries The first letter of pinyin coding Radical or radicals by which characters are arranged in traditional Chinese dictionaries The first letter of pinyin coding Radical or radicals by which characters are arranged in traditional Chinese dictionaries The first letter of pinyin coding
Lv c Rolling s Rain (head) y
Dao d Stripe of a tiger h Si s
Foot z z The-Fan w
Cannibals s Jin j Quan q
Ren r Niu n Yi y
Bing d Epileptic b Yan y
Xin x Rui s Woo d
Chuo z Fu e Si r
Annotate: 1, this table uses 24 radicals to encode altogether, is the initial of corresponding radical pronunciation initial consonant, need not special memory.2, proportionately word or its first sum of picture code fetch of listed other radical in this table not.

Claims (2)

1, a kind of encode method for entering Chinese characters, it is characterized in that Chinese character is divided into two kinds of single character and two body words by natural structure, the first sum of picture of each font is divided into tiltedly, horizontal, perpendicular three classes, corresponding with the triplex row of English letter keyboard respectively, inferior stroke is divided into horizontal stroke, perpendicular, cast aside, press down or point, single folding, roll over six classes again, the the 3rd to the 8th row with English letter keyboard are corresponding respectively, first, be divided between the inferior stroke and intersect and square frame two classes, respectively with first of English letter keyboard, the secondary series correspondence, form 3 * 8 two-dimensional coordinate key letter position, wherein tiltedly comprise left-falling stroke, press down, the point, carry four kinds of strokes, perpendicular comprise perpendicular and roll over two kinds of strokes, concrete corresponding relation is as follows, wherein, and the actual corresponding alphabetical L key of the third line the 8th row:
Figure C200710065819C00021
Two body words and single character are by following rule encoding, and the Chinese character maximum code length is four yards:
(1) two body word:
About two body words comprise, about, inside and outside three kinds of structures, according to stroke order be divided into the 1st body word and the 2nd body word, its coding rule is:
First yard: the letter key that the head of the 1st body word, the corresponding English letter keyboard row, column of inferior stroke intersect;
Second yard: the letter key that the head of the 2nd body word, the corresponding English letter keyboard row, column of inferior stroke intersect;
Trigram: the 1st body becomes word, gets the Chinese Pin Yin initial that it becomes font;
The 1st body does not become word, the Chinese Pin Yin initial of the great achievement font of getting that several leading stroke is formed in this body;
Do not have great achievement font in the 1st body, get the Chinese Pin Yin initial of this body radical;
Do not meet above listed situation, get the Chinese Pin Yin initial of first body the first sum of picture;
The 4th yard: the Chinese Pin Yin initial of the Chinese character of compiling;
(2) single character:
By the Chinese-character stroke sequential write single character is divided into: one, two stroke words, three, four stroke words, five strokes and above word thereof, all kinds of type-words are pressed the row rule encoding:
One, two stroke words:
First yard: the letter key that the corresponding English letter keyboard row, column of first, inferior stroke intersects;
Second yard: the Chinese Pin Yin initial of Chinese character;
Trigram: English alphabet O key;
Three, four stroke words:
First yard: the letter key that the corresponding English letter keyboard row, column of first, inferior stroke intersects;
Second yard: the letter key that the 3rd stroke, the corresponding English letter keyboard row, column of end stroke intersect;
Trigram: the Chinese Pin Yin initial of Chinese character;
The 4th yard: English alphabet O key;
Five strokes, the above word of five strokes:
First yard: the letter key that the corresponding English letter keyboard row, column of first, inferior stroke intersects;
Second yard: three, the crossing letter key of the corresponding English letter keyboard row, column of the 4th stroke;
Trigram: five, the crossing letter key of the corresponding English letter keyboard row, column of end stroke;
The 4th yard: the Chinese Pin Yin initial of Chinese character.
2, encode method for entering Chinese characters as claimed in claim 1 is characterized in that in above process of carrying out code fetch by stroke, intersects or preferential code fetch during square frame if first stroke of stroke and second stroke constitute.
CNB2007100658196A 2007-04-18 2007-04-18 Shape-pronunciation encoding input method of Chinese characters Expired - Fee Related CN100501649C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2007100658196A CN100501649C (en) 2007-04-18 2007-04-18 Shape-pronunciation encoding input method of Chinese characters

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2007100658196A CN100501649C (en) 2007-04-18 2007-04-18 Shape-pronunciation encoding input method of Chinese characters

Publications (2)

Publication Number Publication Date
CN101038517A CN101038517A (en) 2007-09-19
CN100501649C true CN100501649C (en) 2009-06-17

Family

ID=38889453

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2007100658196A Expired - Fee Related CN100501649C (en) 2007-04-18 2007-04-18 Shape-pronunciation encoding input method of Chinese characters

Country Status (1)

Country Link
CN (1) CN100501649C (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107728805A (en) * 2016-08-11 2018-02-23 吴敬祖 Stroke and spelling input method
CN106708286B (en) * 2017-01-10 2022-10-18 厦门雅迅网络股份有限公司 Intelligent watch input method
CN107885338A (en) * 2017-10-17 2018-04-06 惠州Tcl移动通信有限公司 Stroke input processing method, computer-readable recording medium and terminal

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1374577A (en) * 2001-03-12 2002-10-16 肖湘茂 General Chinese character input method suitable for letter keyboard and digital keyboard in computer and its keyboard
CN1455322A (en) * 2003-04-22 2003-11-12 李建学 Position-shape-sound Chinese character coding and computer keyboard layout method
CN1538276A (en) * 2003-04-17 2004-10-20 吴宗继 Chinese charactor stroke and sound combined code input method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1374577A (en) * 2001-03-12 2002-10-16 肖湘茂 General Chinese character input method suitable for letter keyboard and digital keyboard in computer and its keyboard
CN1538276A (en) * 2003-04-17 2004-10-20 吴宗继 Chinese charactor stroke and sound combined code input method
CN1455322A (en) * 2003-04-22 2003-11-12 李建学 Position-shape-sound Chinese character coding and computer keyboard layout method

Also Published As

Publication number Publication date
CN101038517A (en) 2007-09-19

Similar Documents

Publication Publication Date Title
CN100501649C (en) Shape-pronunciation encoding input method of Chinese characters
CN103616960A (en) Six vowel binary syllabification input method
CN1027558C (en) Five-stroke and two-dimension encoding method and keyboard
CN103744532A (en) 26 radical root Chinese and English harmonic inputting method
CN101477408B (en) DongBa character primitive input method
KR101777545B1 (en) Chinese input keyboard
CN106168858A (en) 26 radical radical and stroke Chinese-character input methods
CN101976117B (en) Chinese character input method and chinese character input keyboard
CN100458668C (en) Input method for Chinese character of first pronunciation
CN105807947A (en) Method for correspondingly identifying modular stroke coded Chinese characters
CN101063905B (en) Sound and digital code Chinese-character input method
US20070160292A1 (en) Method of inputting chinese characters
CN104536590B (en) Embedded software keyboard system based on West Xia Dynasty's text sound character roots input method
CN103729068B (en) Coding input method for pinyin initial letters of Chinese characters and word roots
CN101782808B (en) Chinese character input method and platform
CN108459735A (en) Phonetic double-click touch screen method for inputting pinyin
CN106325540A (en) Simplified input method of northeast Yunnan sub-dialect Miao language and application of simplified input method
CN1409201A (en) Yi character input method for computer
CN214670487U (en) BPMF keyboard for computer
CN101957663B (en) Five-stroke chinese character input method
CN104267828A (en) Four-bit code input method and keyboard
CN101105721A (en) Chinese phonetic input method and its keyboard
CN1327313C (en) Computer Chinese <<10 large structures>> (symbol type) input method
CN104615270A (en) Logic Chinese character pattern spelling code and input keyboard thereof
CN100445935C (en) Chinese character input encoding method for computer

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20090617

Termination date: 20100418