CN103440047A - Universal code-fetching Chinese character input method - Google Patents

Universal code-fetching Chinese character input method

Info

Publication number
CN103440047A
Authority
CN
China
Prior art keywords
code
pen
alphabet
coding
group
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2013104114100A
Other languages
Chinese (zh)
Other versions
CN103440047B (en
Inventor
任振敏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual
Priority to CN201310411410.0A
Publication of CN103440047A
Application granted
Publication of CN103440047B
Legal status: Active
Anticipated expiration

Landscapes

  • Document Processing Apparatus (AREA)
  • Character Discrimination (AREA)

Abstract

The invention relates to a universal code-fetching Chinese character input method provided in three editions: a pure shape-code edition, a tone-shape mixed edition, and a digital holographic edition, the last being a Chinese character input method for the numeric keypad of a mobile phone. The method comprises the following steps: dividing the strokes of Chinese characters into five types, likewise dividing sets of continuously written same-name strokes within the same part into five types, calling these strokes and stroke sets coding pens (pen groups), and assigning each of them a digit; determining five stroke states and their digits according to the characteristics and number of the points a stroke shares with other strokes; forming a two-digit code from the digits of two consecutive coding pens, or from the digit of a stroke and the digit of its state; and determining in sequence the Latin letter corresponding to each two-digit code. The method is simple and easy to learn; no radicals need to be memorized; it conforms to the national standards for writing Chinese characters; it uses a standard keyboard; its duplicate-code rate is low and its input speed is high. The pure shape-code edition is independent of pinyin; the tone-shape mixed edition, because it relies on pinyin, is particularly suitable for elementary literacy and pinyin spelling teaching; the digital holographic edition is intended for mobile phone users.

Description

General code-fetching Chinese character input method
Technical field
The present invention relates to a general code-fetching Chinese character input method for entering Chinese characters on a computer.
Background art
Existing Chinese character coding methods often require radicals to be memorized, so that methods with fast input are hard to learn while methods that are easy to learn are slow. Some methods also violate the Chinese character writing standard, and some require the standard keyboard to be modified.
Summary of the invention
The technical problem to be solved by the present invention is to overcome the above defects of the prior art and to provide a general code-fetching Chinese character input method that is easy to learn, reinforces the teaching of Chinese character writing in primary and secondary schools, uses the standard keyboard directly, requires no radicals to be memorized, has a low duplicate-code rate, and offers high input speed.
The technical solution adopted by the present invention to solve this technical problem is as follows.
The present invention determines the kinds of Chinese character strokes and the kinds of same-name stroke groups written continuously within the same part, and assigns a digit to each of these strokes and stroke groups. It also determines the state of a stroke, and the digit of each state, according to the characteristics and number of the points the stroke shares with other strokes. It specifies how to determine the two-digit code of a pair of pens (pen groups) and how to determine the two-digit code of a stroke state. A "general code-fetching table" is designed, and it establishes the correspondence between two-digit codes and Latin letters.
The present invention takes the standard Song typeface as its coding object.
(1) All Chinese character strokes are divided into 5 types of coding pens, and each type is assigned a digit:
1) Horizontal class: includes the horizontal stroke and the rising stroke (ti); corresponding digit 1;
2) Vertical class: includes the vertical stroke; corresponding digit 2;
3) Left-falling class: includes the left-falling stroke (pie); corresponding digit 3;
4) Dot class: includes the dot and the right-falling stroke (na); corresponding digit 4;
5) Bend class: includes strokes whose names contain "fold", "bend" or "hook", and the "horizontal left-falling" stroke; corresponding digit 5.
(2) Same-name strokes written continuously within the same part (for the concept of a part, see the method for determining the parts of a Chinese character below) are combined into one group, called a coding pen group, and each coding pen group is assigned a digit:
1) Multi-horizontal: horizontal strokes written continuously within the same part; corresponding digit 2;
2) Multi-vertical: vertical strokes written continuously within the same part; corresponding digit 3;
3) Multi-left-falling: left-falling strokes written continuously within the same part whose starting points do not lie on opposite sides of another stroke; corresponding digit 4;
4) Multi-dot: dots written continuously within the same part; corresponding digit 5;
5) Multi-bend: continuously written bends that share a common point; corresponding digit 1.
The two-digit code corresponding to two continuously written coding pens or pen groups is determined as follows: the digit of the first coding pen or pen group is the first digit, and the digit of the second coding pen or pen group is the last digit.
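The digit assignments in items (1) and (2) and the pairing rule can be sketched as follows. This is a minimal illustration only: the key names (`horizontal`, `left_falling`, and so on) and the function names are hypothetical labels chosen for this sketch, not terms from the patent.

```python
# Digits for single coding pens, as listed in item (1).
STROKE_DIGIT = {"horizontal": 1, "vertical": 2, "left_falling": 3, "dot": 4, "bend": 5}

# Digits for coding pen groups, as listed in item (2).
GROUP_DIGIT = {"horizontal": 2, "vertical": 3, "left_falling": 4, "dot": 5, "bend": 1}


def pen_digit(kind: str, is_group: bool = False) -> int:
    """Return the digit of a single coding pen or of a coding pen group."""
    table = GROUP_DIGIT if is_group else STROKE_DIGIT
    return table[kind]


def pair_code(first: int, second: int) -> int:
    """Combine two pen digits: the first pen gives the first digit, the second the last digit."""
    for digit in (first, second):
        if not 1 <= digit <= 5:
            raise ValueError(f"pen digit out of range: {digit}")
    return first * 10 + second


# A horizontal stroke followed by a vertical stroke.
print(pair_code(pen_digit("horizontal"), pen_digit("vertical")))
# A multi-horizontal group followed by a bend stroke.
print(pair_code(pen_digit("horizontal", is_group=True), pen_digit("bend")))
```

Note that a group does not share the digit of its single-stroke class, so the same two-digit code can arise from different pen sequences.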
(3) The common points of a stroke and other strokes are divided into 3 classes:
1) Contact point: a point that is an end point of both strokes;
2) Through point: a point that is an end point of one stroke and a non-end point of the other stroke;
3) Intersection point: a point that is a non-end point of both strokes.
The state of a stroke, and the digit of that state, is determined by the characteristics and number of the points the stroke shares with other strokes:
1) First state: the stroke has no common point with other strokes, or only contact points; corresponding digit 1;
2) Second state: the stroke has only through points with other strokes; corresponding digit 2;
3) Third state: the stroke has both contact points and through points with other strokes, or has 1 intersection point; corresponding digit 3;
4) Fourth state: the stroke has 2 intersection points with other strokes; corresponding digit 4;
5) Fifth state: the stroke has 3 or more intersection points with other strokes; corresponding digit 5.
The two-digit code of a stroke state is determined as follows: the digit of the stroke is the first digit, and the digit of the stroke's state is the last digit.
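The five states can be sketched as a small classification function. The text does not say how to classify a stroke that has both an intersection point and other kinds of common points, nor does it treat exactly three intersections separately; this sketch assumes that intersection points take priority and that three or more intersections give the fifth state. The function and parameter names are hypothetical.

```python
def stroke_state(contacts: int, through_points: int, intersections: int) -> int:
    """Classify a stroke by the points it shares with other strokes.

    contacts:       points that are end points of both strokes
    through_points: points that are an end point of one stroke only
    intersections:  points that are end points of neither stroke
    Assumption: intersections are checked first; 3 or more gives state 5.
    """
    if intersections >= 3:
        return 5
    if intersections == 2:
        return 4
    if intersections == 1:
        return 3
    if contacts > 0 and through_points > 0:
        return 3
    if through_points > 0:
        return 2
    return 1  # no common point, or contact points only


def state_code(stroke_digit: int, state: int) -> int:
    """The stroke's digit is the first digit, the state's digit the last digit."""
    return stroke_digit * 10 + state


# A horizontal stroke (digit 1) crossed twice by other strokes.
print(state_code(1, stroke_state(contacts=0, through_points=0, intersections=2)))
```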
The present invention uses the alphabet "general code-fetching table" to determine the Latin letter corresponding to a two-digit code. The form of the alphabet "general code-fetching table" is:
[Figure: alphabet "general code-fetching table" (image not reproduced in this text)]
The specific rule for using the alphabet "general code-fetching table" to determine the Latin letter corresponding to a two-digit code is:
In the alphabet "general code-fetching table", the column whose column number is the first digit of the two-digit code and the row whose row number is the last digit of the two-digit code intersect in one cell; the Latin letter in that cell is the letter corresponding to the two-digit code.
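The column-and-row lookup can be sketched as below. The actual contents of the table are given only as a figure that is not reproduced in this text, so `PLACEHOLDER_TABLE` is an invented layout (letters A to Y filled column by column) that exists purely to make the lookup runnable; it must not be read as the patent's table.

```python
import string

# PLACEHOLDER layout, NOT the patent's table: A..Y filled column by column.
# Keys are (column number, row number), both running from 1 to 5.
PLACEHOLDER_TABLE = {
    (col, row): string.ascii_uppercase[(col - 1) * 5 + (row - 1)]
    for col in range(1, 6)
    for row in range(1, 6)
}


def lookup_letter(code: int, table=PLACEHOLDER_TABLE) -> str:
    """First digit selects the column, last digit selects the row."""
    first_digit, last_digit = divmod(code, 10)
    return table[(first_digit, last_digit)]


print(lookup_letter(12))  # column 1, row 2 of the placeholder layout
```

Passing the real table as the `table` argument would leave the lookup logic unchanged.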
The present invention determines the parts of a Chinese character as follows:
(1) For all radicals of a Chinese character and their combinations, the following conditions apply: if the character has a top-bottom structure, everything in one horizontal row is one part; if the character has a left-right structure, everything in one vertical column is one part; strokes that are joined to one another generally belong to the same part, except as indicated in items (2) and (3).
Examples: the "bridle" character: the whole formed by the three radicals at the top is one part, and "mouth" is one part. The "beach" character is divided into four parts. The "Good" character is also divided into four parts. In the "Cuan" character, the strokes of the "Mi" structure and of the portion above it are joined and inseparable, forming one part, and the strokes below "Mi" are divided into 3 parts.
(2) For semi-enclosed characters whose enclosing component is written in one continuous pass: the strokes of the enclosing component form one part, and the strokes of the enclosed component form one or two parts.
(3) For semi-enclosed and fully enclosed characters whose enclosing component is written in two passes: the strokes of the enclosing component written in the first pass form one part, the strokes of the enclosing component written in the second pass form another part, and the strokes of the enclosed component form 1 or 2 parts.
The present invention divides Chinese characters into 4 types, respectively:
1) Single characters.
2) Two-part characters: Chinese characters composed of two parts, except for the cases enumerated in item 3).
3) Three-part characters, of which there are three kinds: ① Chinese characters composed of three parts; ② Chinese characters composed of two parts in which one part has fewer than three coding pens or pen groups and the other part can be divided into two or more parts; ③ the semi-enclosed and fully enclosed characters mentioned above whose enclosing component is written in two passes.
The present invention encodes every Chinese character as four Latin letters. The position of the first Latin letter is called the first code position, the position of the second Latin letter the second code position, the position of the third Latin letter the third code position, and the position of the fourth Latin letter the fourth code position. Each Latin letter is determined from a two-digit code through the alphabet "general code-fetching table".
Based on the above principles, the present invention provides a pure shape-code edition of the general code-fetching Chinese character input method. The method of encoding Chinese characters with the pure shape-code edition is as follows.
For a single character, the encoding method is:
(1) Latin letter at the first code position: the first and second coding pens (or pen groups) of the single character form a two-digit code, called the first coding pen pair (or pen group pair); the letter at the first code position is obtained by looking this code up in the alphabet "general code-fetching table";
(2) Latin letter at the second code position: the third and fourth coding pens (or pen groups) of the single character form a two-digit code, called the second coding pen pair (or pen group pair); the letter at the second code position is obtained by looking this code up in the alphabet "general code-fetching table";
(3) Latin letter at the third code position: the fifth and sixth coding pens (or pen groups) of the single character form a two-digit code, called the third coding pen pair (or pen group pair); the letter at the third code position is obtained by looking this code up in the alphabet "general code-fetching table";
(4) Latin letter at the fourth code position: the second-to-last and last coding pens (or pen groups) of the single character form a two-digit code, called the fourth coding pen pair (or pen group pair); the letter at the fourth code position is obtained by looking this code up in the alphabet "general code-fetching table".
When a single character has too few coding pens or pen groups to form four coding pen pairs (or pen group pairs), or when some of the pairs are identical, the vacancy is first filled with the Latin letter that the alphabet "general code-fetching table" gives for the two-digit code obtained from the end stroke of the single character and the state of that end stroke. If there are then still fewer than four Latin letters: for a non-numeral single character, the vacancy is filled with the initial of the Hanyu Pinyin of the first sum of stroke title; for a numeral single character, the vacancies are filled in order with the Hanyu Pinyin of the numeral's own pronunciation, until there are 4 letters.
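The pair selection for single characters can be sketched as follows. The sketch only collects the two-digit codes of the pairs that actually exist; it assumes that "identical" pairs means pairs built from the same pen positions, and it leaves the fill-in steps (end stroke state, then pinyin letters) to the caller. The function name and the input format (a list of pen digits in writing order) are hypothetical.

```python
def single_character_pairs(pens: list[int]) -> list[int]:
    """Return the two-digit codes of the available coding pen pairs.

    Pairs are taken from positions (1st, 2nd), (3rd, 4th), (5th, 6th) and
    (second-to-last, last). Pairs that do not exist, or that repeat the
    positions of an earlier pair, are skipped (an assumption of this sketch).
    """
    n = len(pens)
    positions = [(0, 1), (2, 3), (4, 5), (n - 2, n - 1)]
    seen = set()
    codes = []
    for i, j in positions:
        if i < 0 or j >= n or (i, j) in seen:
            continue
        seen.add((i, j))
        codes.append(pens[i] * 10 + pens[j])
    return codes


# Eight pens: all four pairs exist.
print(single_character_pairs([1, 2, 3, 4, 5, 1, 2, 3]))
# Four pens: the last pair repeats the second, so two positions remain to be filled.
print(single_character_pairs([1, 2, 5, 1]))
```

The number of codes returned tells the caller how many of the four code positions still need a fill-in letter.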
For a Chinese character that can be divided into three parts, the encoding method is:
(1) Latin letter at the first code position: the first and second coding pens (or pen groups) of the character form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the first code position;
(2) Latin letter at the second code position: the second-to-last and last coding pens (or pen groups) of the first part form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the second code position;
If the first and second coding pens (or pen groups) of the character are identical, respectively, to the second-to-last and last coding pens (or pen groups) of the first part, the letter at the second code position is determined through the alphabet "general code-fetching table" from the first and second coding pens (or pen groups) of the second part;
(3) Latin letter at the third code position: the second-to-last and last coding pens (or pen groups) of the second part form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the third code position;
When the second part has only two strokes or pen groups, the letter at the third code position is determined through the alphabet "general code-fetching table" from the first and second coding pens (or pen groups) of the third part;
(4) Latin letter at the fourth code position: the second-to-last and last coding pens (or pen groups) of the character form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the fourth code position;
If the first and second coding pens (or pen groups) of the third part coincide, respectively, with the second-to-last and last coding pens (or pen groups) of the character, the letter at the fourth code position is the letter that the alphabet "general code-fetching table" gives for the two-digit code obtained from the end stroke and the state of the end stroke, thereby filling the vacancy.
For a Chinese character that can be divided into two parts (hereinafter the first part and the second part), the encoding method is:
(1) When two different Latin letters at two code positions can be determined from the first part of the character and two different Latin letters at the other two code positions can be determined from the second part, that is, when the first part and the second part each have three or more strokes or pen groups:
Latin letter at the first code position: the first and second coding pens (or pen groups) of the first part form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the first code position;
Latin letter at the second code position: the second-to-last and last coding pens (or pen groups) of the first part form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the second code position;
Latin letter at the third code position: the first and second coding pens (or pen groups) of the second part form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the third code position;
Latin letter at the fourth code position: the second-to-last and last coding pens (or pen groups) of the second part form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the fourth code position.
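Case (1) for two-part characters reduces to four fixed pairs, which can be sketched as follows. Each part is assumed to be given as a list of pen digits in writing order; the function name is hypothetical, and the sketch does not check that the two letters taken from one part actually differ.

```python
def two_part_codes(first_part: list[int], second_part: list[int]) -> list[int]:
    """Two-digit codes for the four code positions in case (1).

    Both parts must have three or more coding pens or pen groups.
    """
    if len(first_part) < 3 or len(second_part) < 3:
        raise ValueError("case (1) needs three or more pens in each part")

    def pair(a: int, b: int) -> int:
        return a * 10 + b

    return [
        pair(first_part[0], first_part[1]),      # first code position
        pair(first_part[-2], first_part[-1]),    # second code position
        pair(second_part[0], second_part[1]),    # third code position
        pair(second_part[-2], second_part[-1]),  # fourth code position
    ]


# Hypothetical pen digits for a left part with 3 pens and a right part with 4.
print(two_part_codes([1, 2, 3], [4, 5, 1, 2]))
```

Each resulting code would then be converted to a Latin letter through the alphabet table.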
(2) When the Latin letter at only one code position can be determined from the first part of the character, and the second part of the character can be divided into two sub-parts:
Latin letter at the first code position: the first and second coding pens (or pen groups) of the first part form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the first code position;
Latin letter at the second code position: the first and second coding pens (or pen groups) of the first sub-part of the second part form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the second code position;
Latin letter at the third code position: the second-to-last and last coding pens (or pen groups) of the first sub-part of the second part form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the third code position;
If the first and second coding pens (or pen groups) of the first sub-part of the second part are identical, respectively, to the second-to-last and last coding pens (or pen groups) of that first sub-part, the letter at the third code position is determined through the alphabet "general code-fetching table" from the first and second coding pens (or pen groups) of the second sub-part of the second part;
Latin letter at the fourth code position: the second-to-last and last coding pens (or pen groups) of the second sub-part of the second part form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the fourth code position;
When the letter at the third code position has been determined from the first and second coding pens (or pen groups) of the second sub-part of the second part, and these coincide, respectively, with the second-to-last and last coding pens (or pen groups) of that second sub-part, that is, when the second sub-part of the second part has only two strokes or pen groups, the letter at the fourth code position is the letter that the alphabet "general code-fetching table" gives for the two-digit code obtained from the end stroke of the character and the state of the end stroke, thereby filling the vacancy.
(3) When the Latin letter at only one code position can be determined from the first part of the character, and the second part of the character cannot be divided into two sub-parts:
Latin letter at the first code position: the first and second coding pens (or pen groups) of the first part form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the first code position;
Latin letter at the second code position: the first and second coding pens (or pen groups) of the second part form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the second code position;
Latin letter at the third code position: the third and fourth coding pens (or pen groups) of the second part form a two-digit code, which is looked up in the "general code-fetching table" to obtain the letter at the third code position;
If the second part has only two coding pens or pen groups, that is, if it has no third and fourth coding pen or pen group, the letter at the third code position is the letter that the "general code-fetching table" gives for the two-digit code obtained from the end stroke of the character and the state of the end stroke, thereby filling the vacancy; the letter at the fourth code position is filled with the initial of the Hanyu Pinyin of the first sum of stroke title of the character;
If the second part has only three coding pens or pen groups, the second-to-last and last coding pens (or pen groups) of the character form a two-digit code, which is looked up in the "general code-fetching table" to obtain the letter at the third code position; the letter at the fourth code position is the letter that the alphabet "general code-fetching table" gives for the two-digit code obtained from the end stroke of the character and the state of the end stroke, thereby filling the vacancy.
(4) When the first part of the character can be divided into two sub-parts (the first sub-part and the second sub-part), and the Latin letter at only one code position can be determined from the second part of the character:
Latin letter at the first code position: the first and second coding pens (or pen groups) of the first sub-part form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the first code position;
Latin letter at the second code position: the second-to-last and last coding pens (or pen groups) of the first sub-part form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the second code position;
If the first and second coding pens (or pen groups) of the first sub-part coincide, respectively, with the second-to-last and last coding pens (or pen groups) of the first sub-part, that is, if the first sub-part of the first part has only two coding pens or pen groups, the first and second coding pens (or pen groups) of the second sub-part of the first part form the two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the second code position;
Latin letter at the third code position: the second-to-last and last coding pens (or pen groups) of the second sub-part form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the third code position;
Latin letter at the fourth code position: the second-to-last and last coding pens (or pen groups) of the character form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the fourth code position;
If the first and second coding pens (or pen groups) of the first sub-part are identical, respectively, to the second-to-last and last coding pens (or pen groups) of the first sub-part, that is, if the first sub-part of the first part has only two coding pens or pen groups, and the second sub-part of the first part also has only two coding pens or pen groups, then the second-to-last and last coding pens (or pen groups) of the character form the two-digit code that is looked up in the alphabet "general code-fetching table" to obtain the letter at the third code position, and the letter at the fourth code position is the letter that the alphabet "general code-fetching table" gives for the two-digit code obtained from the end stroke of the character and the state of the end stroke, thereby filling the vacancy.
(5) When the first part of the character has only one coding pen or pen group:
Latin letter at the first code position: the first and second coding pens (or pen groups) of the character form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the first code position;
Latin letter at the second code position: the first and second coding pens (or pen groups) of the second part form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the second code position;
If the second part of the character has only two coding pens or pen groups, the letter at the third code position is the letter that the alphabet "general code-fetching table" gives for the two-digit code obtained from the end stroke of the character and the state of the end stroke, thereby filling the vacancy; the letter at the fourth code position is filled with the initial of the Hanyu Pinyin of the first sum of stroke title of the character;
If the second part of the character has only three coding pens or pen groups, the letter at the third code position is obtained by looking up in the alphabet "general code-fetching table" the two-digit code formed by the second-to-last and last coding pens (or pen groups) of the character; the letter at the fourth code position is the letter that the alphabet "general code-fetching table" gives for the two-digit code obtained from the end stroke of the character and the state of the end stroke, thereby filling the vacancy;
If the second part of the character has only four coding pens or pen groups, the letter at the third code position is obtained by looking up in the "general code-fetching table" the two-digit code formed by the second-to-last and last coding pens (or pen groups) of the second part; the letter at the fourth code position is the letter that the "general code-fetching table" gives for the two-digit code obtained from the end stroke of the character and the state of the end stroke, thereby filling the vacancy;
If the second part of the character has five or more coding pens or pen groups, the letter at the third code position is obtained by looking up in the "general code-fetching table" the two-digit code formed by the third and fourth coding pens (or pen groups) of the second part; the letter at the fourth code position is obtained by looking up in the "general code-fetching table" the two-digit code formed by the second-to-last and last coding pens (or pen groups) of the character.
(6) When the second part of the character has only one coding pen or pen group:
If the first part of the character has only one or two coding pens or pen groups, the character is treated as a single character;
If the first part of the character has only three or four coding pens or pen groups:
Latin letter at the first code position: the first and second coding pens (or pen groups) of the character form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the first code position;
Latin letter at the second code position: the second-to-last and last coding pens (or pen groups) of the first part form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the second code position;
Latin letter at the third code position: the second-to-last and last coding pens (or pen groups) of the character form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the third code position;
Latin letter at the fourth code position: the letter that the alphabet "general code-fetching table" gives for the two-digit code obtained from the end stroke of the character and the state of the end stroke, thereby filling the vacancy;
If the first part of the character has five or more coding pens or pen groups:
Latin letter at the first code position: the first and second coding pens (or pen groups) of the character form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the first code position;
Latin letter at the second code position: the third and fourth coding pens (or pen groups) of the character form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the second code position;
Latin letter at the third code position: the second-to-last and last coding pens (or pen groups) of the first part form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the third code position;
Latin letter at the fourth code position: the second-to-last and last coding pens (or pen groups) of the character form a two-digit code, which is looked up in the alphabet "general code-fetching table" to obtain the letter at the fourth code position.
The present invention is to every 3 parts Chinese character of (comprising 3 parts) that surpasses, from first, merge successively, after reaching 3, the contained coding pen of merged part (group) number ends to merge, before hang up, merged part is the first coding section, and remaining is the second coding section, then according to two partial words codings.For example: beach: first two section is associated with 4 (groups), merges into the first coding section, and other merges into the second coding section; Stand: first has had 3 (groups), ends to merge.
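The merging rule above can be illustrated with a minimal Python sketch. The function name `split_coding_sections` and the pen counts passed to it are hypothetical; the text gives only the totals "4" and "3" for its two examples, not the count of each individual part.

```python
def split_coding_sections(pens_per_part, threshold=3):
    """Merge parts from the first one until the merged coding pen (group)
    count reaches `threshold`; return the part indices of the two sections.

    `pens_per_part` lists the number of coding pens (groups) in each part,
    in writing order. The counts used in the examples are illustrative only.
    """
    if len(pens_per_part) < 3:
        raise ValueError("the merging rule applies to characters with 3 or more parts")
    total = 0
    cut = len(pens_per_part)  # fallback: every part merged (threshold never reached)
    for index, count in enumerate(pens_per_part):
        total += count
        if total >= threshold:
            cut = index + 1  # stop merging once the threshold is reached
            break
    first_section = list(range(cut))
    second_section = list(range(cut, len(pens_per_part)))
    return first_section, second_section


# Hypothetical four-part character whose first two parts total 4 pens (groups).
print(split_coding_sections([1, 3, 2, 4]))
# Hypothetical character whose first part already has 3 pens (groups).
print(split_coding_sections([3, 2, 2]))
```

The two resulting sections would then be coded with the two-part rules, with the first section playing the role of the first part.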
The present invention also provides a sound-shape mixed version of the general code fetch Chinese character input method.
The sound-shape mixed version refers to the phoneme symbols of the Chinese phonetic alphabet and the tone marks collectively as phonograms. The phoneme symbols are ordered by the natural order of the phonemes, and the tone mark is taken as the last phonogram. The digit assignment rule for phonograms is: the phoneme symbols B, P, M, F, A and the first (high level) tone mark all correspond to digit 1; the phoneme symbols D, T, N, L, O and the second (rising) tone mark all correspond to digit 2; the phoneme symbols G, K, H, J, Q, X, NG, E and the third tone mark all correspond to digit 3; the phoneme symbols ZH, CH, SH, R, I, Y and the fourth (falling) tone mark all correspond to digit 4; the phoneme symbols Z, C, S, U, V, the neutral tone mark (blank) and the entering tone mark (which has no symbol of its own but has a corresponding digit, used for special codes) all correspond to digit 5.
For example, the letter combinations YU and ER are each regarded as a combination of two phonograms.
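As a rough illustration of the assignment rule, the following Python sketch splits a pinyin spelling into phoneme symbols (treating NG, ZH, CH and SH as single phonemes) and maps each phonogram to its digit. The names `split_phonemes` and `phonogram_digits`, and the way tones are keyed (1 to 4, `"neutral"`, `"entering"`), are assumptions made for this example. Letters not listed in the rule, such as W, are not handled.

```python
# Digit assignment for phoneme symbols, transcribed from the rule above.
PHONOGRAM_GROUPS = {
    1: ("B", "P", "M", "F", "A"),
    2: ("D", "T", "N", "L", "O"),
    3: ("G", "K", "H", "J", "Q", "X", "NG", "E"),
    4: ("ZH", "CH", "SH", "R", "I", "Y"),
    5: ("Z", "C", "S", "U", "V"),
}
PHONEME_DIGIT = {
    symbol: digit
    for digit, symbols in PHONOGRAM_GROUPS.items()
    for symbol in symbols
}
# Tone keys are a convention of this sketch, not of the source text.
TONE_DIGIT = {1: 1, 2: 2, 3: 3, 4: 4, "neutral": 5, "entering": 5}
DIGRAPHS = ("NG", "ZH", "CH", "SH")


def split_phonemes(pinyin):
    """Split a toneless pinyin spelling into phoneme symbols."""
    letters = pinyin.upper()
    phonemes = []
    i = 0
    while i < len(letters):
        pair = letters[i:i + 2]
        if pair in DIGRAPHS:  # NG, ZH, CH, SH each count as one phoneme
            phonemes.append(pair)
            i += 2
        else:
            phonemes.append(letters[i])
            i += 1
    return phonemes


def phonogram_digits(pinyin, tone):
    """Return the digits of all phonograms: the phonemes, then the tone mark."""
    digits = [PHONEME_DIGIT[p] for p in split_phonemes(pinyin)]
    digits.append(TONE_DIGIT[tone])  # the tone mark is the last phonogram
    return digits


print(split_phonemes("zhuang"), phonogram_digits("zhuang", 1))
```

Under this reading, a syllable such as "zhuang" has four phoneme symbols plus a tone mark, that is, five phonograms.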
The method of coding a Chinese character with the sound-shape mixed version of the general code fetch Chinese character input method is as follows:
The Latin letter at the first code bit: a two-digit code is determined from the first and second coding pens or pen groups of the Chinese character, and the Latin letter at the first code bit is obtained by looking that code up in the alphabet "general code fetch table";
The Latin letter at the second code bit: a two-digit code is determined from the second-to-last and last coding pens or pen groups of the Chinese character, and the Latin letter at the second code bit is obtained by looking that code up in the "general code fetch table";
If the first and second coding pens or pen groups of the Chinese character coincide respectively with its second-to-last and last coding pens or pen groups, the first letter of the Chinese phonetic alphabet spelling of the name of the first stroke of the Chinese character is used as the Latin letter at the second code bit.
If the Chinese phonetic alphabet spelling of the Chinese character consists of 3 phonograms: the Latin letter at the third code bit: a two-digit code is determined from the first combination of the spelling, and the Latin letter at the third code bit is obtained by looking that code up in the alphabet "general code fetch table"; the Latin letter at the fourth code bit: a two-digit code is determined from the combination of the last phoneme symbol of the spelling and the tone mark (first tone, second tone, third tone, fourth tone, neutral tone or entering tone), and the Latin letter at the fourth code bit is obtained by looking that code up in the "general code fetch table".
The first combination of the Chinese phonetic alphabet spelling consists of its first and second phonemes, which are generally its first and second English letters, except that NG, ZH, CH and SH are each regarded as one phoneme.
The last phoneme of the Chinese phonetic alphabet spelling is generally its last English letter, except that NG, ZH, CH and SH are each regarded as one phoneme. The last phonogram, however, is always the tone mark.
If the Chinese phonetic alphabet spelling of the Chinese character consists of 4 phonograms: the Latin letter at the third code bit: a two-digit code is determined from the first combination of the spelling, and the Latin letter at the third code bit is obtained by looking that code up in the alphabet "general code fetch table"; the Latin letter at the fourth code bit: a two-digit code is determined from the combination of the third phoneme of the spelling and the last phonogram, i.e. the tone mark, and the Latin letter at the fourth code bit is obtained by looking that code up in the alphabet "general code fetch table". NG, ZH, CH and SH are each regarded as one phoneme.
If the Chinese phonetic alphabet spelling of the Chinese character consists of 5 phonograms: the Latin letter at the third code bit: a two-digit code is determined from the first combination of the spelling, and the Latin letter at the third code bit is obtained by looking that code up in the alphabet "general code fetch table"; the Latin letter at the fourth code bit: a two-digit code is determined from the combination of the third phoneme and the last phoneme symbol of the spelling, and the Latin letter at the fourth code bit is obtained by looking that code up in the alphabet "general code fetch table". NG, ZH, CH and SH are each regarded as one phoneme.
For a Chinese character whose Chinese phonetic alphabet spelling contains 5 phonograms, the last phonogram, i.e. the tone mark, does not take part in coding.
If the Chinese phonetic alphabet spelling of the Chinese character consists of only two phonograms: the Latin letter at the third code bit is determined through the "general code fetch table" from the ordered digits corresponding to the two phonograms, i.e. one phoneme symbol and one tone mark. The Latin letter at the fourth code bit is the first Latin letter of the pronunciation of the name of the first stroke of the Chinese character; for example, every "curved" stroke is represented by "w". No Chinese character has only one phonogram: every character has at least one phoneme and one tone, and therefore at least two phonograms.
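The selection of phonogram pairs for the third and fourth code bits in the 3-, 4- and 5-phonogram cases can be sketched in Python as follows. The function name `phonetic_codes` is hypothetical, and the inputs are assumed to be the digits already assigned to the phoneme symbols and to the tone mark. The two-phonogram case is left out because its fourth code bit does not come from a phonogram pair.

```python
def phonetic_codes(phoneme_digits, tone_digit):
    """Return the two-digit codes for the third and fourth code bits.

    `phoneme_digits` holds the digit of each phoneme symbol in order;
    the number of phonograms is the number of phonemes plus the tone mark.
    """
    count = len(phoneme_digits) + 1
    if count not in (3, 4, 5):
        raise ValueError("this sketch covers spellings with 3, 4 or 5 phonograms")
    # Third code bit: the "first combination" (first and second phonemes).
    third = f"{phoneme_digits[0]}{phoneme_digits[1]}"
    if count == 3:
        # Last phoneme symbol combined with the tone mark.
        fourth = f"{phoneme_digits[-1]}{tone_digit}"
    elif count == 4:
        # Third phoneme combined with the tone mark.
        fourth = f"{phoneme_digits[2]}{tone_digit}"
    else:
        # Third phoneme combined with the last phoneme; the tone mark is unused.
        fourth = f"{phoneme_digits[2]}{phoneme_digits[-1]}"
    return third, fourth


print(phonetic_codes([1, 1], 4))        # e.g. digits for M, A with the fourth tone
print(phonetic_codes([4, 5, 1, 3], 1))  # e.g. digits for ZH, U, A, NG with the first tone
```

Each returned two-digit code would then be looked up in the alphabet "general code fetch table" to obtain the letter for its code bit.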
The present invention also provides a digital holographic version of the general code fetch Chinese character input method, i.e. a Chinese character input method for the numeric keyboard of a mobile phone.
The numeric keyboard of a mobile phone has ten digits: 0, 1, 2, 3, 4, 5, 6, 7, 8, 9. A Chinese character is determined by six digits at six code bits:
The digits at the first, second, third and fourth code bits are determined by four corresponding two-digit codes. The four two-digit codes are determined as follows: (1) for a Chinese character with 5 or more strokes, the four two-digit codes are determined in the same way as in the pure shape code version of the general code fetch Chinese character input method, and the digits at the first, second, third and fourth code bits are then obtained by looking those codes up in the digital version "general code fetch table"; however, where the initial of the Chinese phonetic alphabet spelling of the name of the first stroke would be used to fill a vacancy, it is replaced by the digit corresponding to the first stroke of the Chinese character; (2) for a Chinese character with 4 strokes, the digits corresponding to the strokes are taken in order as the digits at the first, second, third and fourth code bits; (3) for a Chinese character with fewer than 4 strokes, the digits corresponding to the strokes are first taken in order as the digits at the code bits; where these are not enough for 4 code bits, the vacancies are filled in order with the digits corresponding to the states of the strokes; if there are still fewer than four digits, the remainder is filled with the digit 0;
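Cases (2) and (3) above, for characters with at most 4 strokes, can be sketched in Python. The function name `short_character_digits` is hypothetical, and the stroke and state digits in the examples are made up for illustration rather than taken from real characters.

```python
def short_character_digits(stroke_digits, state_digits):
    """Build the first four digits for a character with 1 to 4 strokes.

    Stroke digits come first, then state digits in stroke order until four
    code bits are filled, then zeros for any code bits still vacant.
    """
    n = len(stroke_digits)
    if not 1 <= n <= 4 or len(state_digits) != n:
        raise ValueError("expected 1 to 4 strokes with one state per stroke")
    digits = list(stroke_digits)
    for state in state_digits:
        if len(digits) == 4:
            break
        digits.append(state)  # fill vacancies with stroke states, in order
    digits.extend([0] * (4 - len(digits)))  # pad what is left with 0
    return "".join(str(d) for d in digits)


print(short_character_digits([1, 2, 3, 4], [1, 1, 1, 1]))  # four strokes
print(short_character_digits([1, 2], [1, 3]))              # two strokes
print(short_character_digits([1], [1]))                    # one stroke
```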
If the Chinese phonetic alphabet spelling of the Chinese character consists of 3 phonograms: the digit at the fifth code bit: the digit corresponding to the first phoneme symbol is taken directly as the digit at the fifth code bit; the digit at the sixth code bit: a two-digit code is determined from the combination of the last phoneme of the spelling and the tone (first tone, second tone, third tone, fourth tone, neutral tone or entering tone), and the digit at the sixth code bit is obtained by looking that code up in the digital version "general code fetch table". The last phoneme of the spelling is generally its last English letter, except that NG, ZH, CH and SH are each regarded as one phoneme.
If the Chinese phonetic alphabet spelling of the Chinese character consists of 4 phonograms, i.e. three phoneme symbols drawn from the non-zero initial and the head, body and ending of the final, plus the tone mark: the digit at the fifth code bit: a two-digit code is determined from the first combination of the spelling, and the digit at the fifth code bit is obtained by looking that code up in the digital version "general code fetch table"; the first combination consists of the first and second phonemes of the spelling, which are generally its first and second English letters, except that NG, ZH, CH and SH are each regarded as one phoneme; the digit at the sixth code bit: a two-digit code is determined from the third phoneme symbol of the spelling and the last phonogram, i.e. the tone mark, and the digit at the sixth code bit is obtained by looking that code up in the digital version "general code fetch table". NG, ZH, CH and SH are each regarded as one phoneme symbol.
If the Chinese phonetic alphabet spelling of the Chinese character consists of 5 phonograms: the digit at the fifth code bit: a two-digit code is determined from the first combination of the spelling, and the digit at the fifth code bit is obtained by looking that code up in the digital version "general code fetch table"; the digit at the sixth code bit: a two-digit code is determined from the combination of the third phoneme and the last phoneme symbol of the spelling, and the digit at the sixth code bit is obtained by looking that code up in the digital version "general code fetch table". NG, ZH, CH and SH are each regarded as one phoneme.
For a Chinese character whose Chinese phonetic alphabet spelling contains 5 phonograms, the last phonogram, i.e. the tone mark, does not take part in coding.
If the Chinese phonetic alphabet spelling of the Chinese character consists of only 2 phonograms: the digit at the fifth code bit: the digit corresponding to the phoneme is taken directly as the digit at the fifth code bit; the digit at the sixth code bit: the digit corresponding to the tone is taken directly as the digit at the sixth code bit.
Digital version " general code fetch table ":
[Table image: digital version "general code fetch table"; contents not reproduced in this text]
The present invention divides Chinese character strokes into 5 classes, and also divides into 5 classes the groups of same-named strokes written consecutively within the same part; these strokes and stroke groups are called coding pens (groups), and each is assigned a digital code. According to the nature and number of the points that a stroke shares with other strokes, its 5 states and their digital state codes are determined. A two-digit code is determined from the codes of two consecutive coding pens, or from the code of a stroke and the code of its state. The Latin letter corresponding to each two-digit code is determined from the "general code fetch table". For every type of Chinese character, the coding pen (group) pairs used for code fetching and their corresponding letters are determined, and each code bit of the character's code is determined according to stroke order. The present invention is easy to learn, requires no memorization of radicals, follows the nationally prescribed standard for writing Chinese characters, uses a universal keyboard, and has a low duplicate-code rate and a high input speed. The pure shape code version of the general code fetch Chinese character input method does not rely on pinyin; the sound-shape mixed version requires pinyin and is particularly suitable for teaching character recognition and pinyin in primary school; the digital holographic version is suitable for mobile phone users.
Embodiment
The invention is further described below with reference to embodiments.
Embodiment 1:
The present invention takes the standard Song typeface as the object of coding.
(1) All Chinese character strokes are divided into 5 classes and assigned digital codes:
1) horizontal class: includes the horizontal stroke and the rising stroke; corresponding digit 1;
2) vertical class: includes the vertical stroke; corresponding digit 2;
3) left-falling (skim) class: includes the left-falling stroke; corresponding digit 3;
4) dot class: includes the dot and the right-falling stroke; corresponding digit 4;
5) curved class: includes the strokes whose names contain "fold", "curved" or "hook", and the "horizontal left-falling" stroke; corresponding digit 5.
(2) Same-named strokes written consecutively within the same part (for the concept of a part, see the method of determining the parts of a Chinese character below) are combined into one group, called a coding pen group, and each coding pen group is assigned a digital code:
1) multi-horizontal: horizontal strokes written consecutively within the same part; corresponding digit 2;
2) multi-vertical: vertical strokes written consecutively within the same part; corresponding digit 3;
3) multi-left-falling: left-falling strokes written consecutively within the same part, whose starting points do not lie on opposite sides of another stroke; corresponding digit 4;
4) multi-dot: dots written consecutively within the same part; corresponding digit 5;
5) multi-curved: curved strokes written consecutively that share a common point; corresponding digit 1.
The two-digit code corresponding to two consecutively written coding pens or pen groups is determined as follows: the digit corresponding to the first coding pen or pen group is the first digit, and the digit corresponding to the second coding pen or pen group is the last digit.
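The digit assignments for single strokes and for stroke groups, and the way two consecutive coding pens form a two-digit code, can be written down as a small Python sketch. The class names used as dictionary keys and the function names `pen_digit` and `pair_code` are labels chosen for this example only.

```python
# Digits for single strokes, by stroke class.
STROKE_DIGIT = {
    "horizontal": 1,
    "vertical": 2,
    "left_falling": 3,
    "dot": 4,
    "curved": 5,
}
# Digits for groups of same-named strokes written consecutively in one part.
GROUP_DIGIT = {
    "horizontal": 2,
    "vertical": 3,
    "left_falling": 4,
    "dot": 5,
    "curved": 1,
}


def pen_digit(stroke_class, is_group=False):
    """Return the digit of a coding pen (single stroke) or pen group."""
    table = GROUP_DIGIT if is_group else STROKE_DIGIT
    return table[stroke_class]


def pair_code(first_digit, second_digit):
    """Two-digit code: the first pen's digit, then the second pen's digit."""
    return f"{first_digit}{second_digit}"


# A multi-dot group followed by a single vertical stroke.
print(pair_code(pen_digit("dot", is_group=True), pen_digit("vertical")))
```

Note that each group digit is the corresponding single-stroke digit shifted by one, with the curved group wrapping around to 1.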
(3) The common points of a stroke with other strokes are divided into 3 classes:
1) contact point: a point that is an end point of both strokes;
2) through point: a point that is an end point of one stroke and a non-end point of the other stroke;
3) intersection point: a point that is a non-end point of both strokes.
The state of a stroke and its corresponding digit are determined according to the nature and number of the common points of the stroke with other strokes:
1) first state: the stroke has no common point with other strokes, or has only contact points; corresponding digit 1;
2) second state: the stroke has through points, and only through points, with other strokes; corresponding digit 2;
3) third state: the stroke has both a contact point and a through point with other strokes, or has 1 intersection point; corresponding digit 3;
4) fourth state: the stroke has 2 intersection points with other strokes; corresponding digit 4;
5) fifth state: the stroke has 3 or more intersection points with other strokes; corresponding digit 5.
The two-digit code of a stroke and its state is determined as follows: the digit corresponding to the stroke is the first digit, and the digit corresponding to the state of the stroke is the last digit.
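The classification of stroke states can be sketched in Python as a function of how many common points of each kind a stroke has. The function names are hypothetical. Two points are assumptions of this sketch rather than statements of the text: intersection points take precedence when a stroke also has contact or through points, and the fifth state starts at exactly 3 intersection points.

```python
def stroke_state(contacts, through_points, intersections):
    """Return the state digit (1 to 5) of a stroke from its common points."""
    if min(contacts, through_points, intersections) < 0:
        raise ValueError("counts must be non-negative")
    # Assumption: intersection points are checked first when several kinds coexist.
    if intersections >= 3:
        return 5
    if intersections == 2:
        return 4
    if intersections == 1:
        return 3
    if contacts > 0 and through_points > 0:
        return 3  # both a contact point and a through point
    if through_points > 0:
        return 2  # through points only
    return 1      # no common point, or contact points only


def state_code(stroke_digit, state):
    """Two-digit code: the stroke's digit, then the digit of its state."""
    return f"{stroke_digit}{state}"


# A horizontal stroke (digit 1) crossed once by another stroke.
print(state_code(1, stroke_state(contacts=0, through_points=0, intersections=1)))
```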
The present invention uses the alphabet "general code fetch table" to determine the Latin letter corresponding to a two-digit code. The form of the alphabet "general code fetch table" is:
[Table image: alphabet "general code fetch table"; contents not reproduced in this text]
The specific rule for determining the Latin letter corresponding to a two-digit code with the alphabet "general code fetch table" is:
In the alphabet "general code fetch table", the column whose column number is the first digit of the two-digit code and the row whose row number is the last digit of the two-digit code intersect at one cell; the Latin letter in that cell is the Latin letter corresponding to the two-digit code.
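The lookup rule (first digit selects the column, last digit selects the row) can be sketched in Python. The actual letters of the alphabet "general code fetch table" are available only as an image and are not known here, so `PLACEHOLDER_TABLE` below is an invented 5 by 5 layout used purely to make the example run; it is not the table of the invention. The function name `lookup_letter` is likewise hypothetical.

```python
# Invented stand-in for the real table: 5 rows of 5 letters each.
PLACEHOLDER_TABLE = ["ABCDE", "FGHIJ", "KLMNO", "PQRST", "UVWXY"]


def lookup_letter(code, table=PLACEHOLDER_TABLE):
    """Return the letter for a two-digit code.

    The first digit is the column number and the last digit is the row
    number, both counted from 1.
    """
    if len(code) != 2 or not code.isdigit():
        raise ValueError("code must consist of exactly two digits")
    column, row = int(code[0]), int(code[1])
    if not (1 <= column <= 5 and 1 <= row <= 5):
        raise ValueError("each digit must be between 1 and 5")
    return table[row - 1][column - 1]


# With the placeholder layout, code "12" is column 1, row 2.
print(lookup_letter("12"))
```

Passing the real table as the `table` argument would be the only change needed to apply the same rule to the actual letters.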
The method by which the present invention determines the parts of a Chinese character is as follows:
(1) For all radicals of a Chinese character and their combinations, subject to the following conditions: 1) if the Chinese character has a top-bottom structure, everything occupying one horizontal row is one part; if the Chinese character has a left-right structure, everything occupying one vertical column is one part; strokes that are joined to one another generally belong to the same part, except as specified in items (2) and (3);
For example, in the character "bridle", the whole formed by the three radicals at the top is one part and "mouth" is one part. The character "beach" is divided into four parts. The character "good" is also divided into four parts. In the character "Cuan", the strokes of the "Mi" structure and of the portion above it are joined to one another and cannot be separated, so they form one part; the strokes below "Mi" are divided into 3 parts.
(2) For a semi-enclosed Chinese character whose enclosing component is written in one continuous pass: the strokes of the enclosing component together form one part, and the strokes of the enclosed component together form one or two parts.
(3) For a semi-enclosed or fully enclosed Chinese character whose enclosing component is written in two passes: the strokes of the enclosing component written in the first pass together form one part, the strokes of the enclosing component written in the second pass together form another part, and the strokes of the enclosed component together form 1 or 2 parts.
The present invention divides Chinese characters into 3 types:
1) single characters.
2) two-part characters: Chinese characters consisting of two parts, except for the third situation listed under 3).
3) three-part characters, of which there are three situations: ① Chinese characters consisting of three parts; ② Chinese characters consisting of two parts, in which one part has fewer than three coding pens (groups) and the other part can be divided into two or more parts; ③ the semi-enclosed and fully enclosed Chinese characters mentioned above whose enclosing component is written in two passes.
The present invention codes every Chinese character as four Latin letters. The position of the first Latin letter is called the first code bit, the position of the second Latin letter the second code bit, the position of the third Latin letter the third code bit, and the position of the fourth Latin letter the fourth code bit. Each Latin letter is determined from a two-digit code through the alphabet "general code fetch table".
On the above principles, the present invention provides a pure shape code version of the general code fetch Chinese character input method. The method of coding a Chinese character with this version is as follows:
For a single character, the coding method is:
(1) The Latin letter at the first code bit: a two-digit code is determined from the first and second coding pens or pen groups of the single character; this two-digit code is called the first coding pen pair or pen group pair, and the Latin letter at the first code bit is obtained by looking it up in the alphabet "general code fetch table";
(2) The Latin letter at the second code bit: a two-digit code is determined from the third and fourth coding pens or pen groups of the single character; this two-digit code is called the second coding pen pair or pen group pair, and the Latin letter at the second code bit is obtained by looking it up in the alphabet "general code fetch table";
(3) The Latin letter at the third code bit: a two-digit code is determined from the fifth and sixth coding pens or pen groups of the single character; this two-digit code is called the third coding pen pair or pen group pair, and the Latin letter at the third code bit is obtained by looking it up in the alphabet "general code fetch table";
(4) The Latin letter at the fourth code bit: a two-digit code is determined from the second-to-last and last coding pens or pen groups of the single character; this two-digit code is called the fourth coding pen pair or pen group pair, and the Latin letter at the fourth code bit is obtained by looking it up in the alphabet "general code fetch table".
For a single character whose coding pens or pen groups are too few to form four coding pen pairs or pen group pairs, or some of whose pairs are identical, the vacancy is first filled with the Latin letter determined, through the alphabet "general code fetch table", from the two-digit code formed by the last stroke of the single character and the state of that stroke. If there are then still fewer than four Latin letters, a non-numeral single character is filled with the initial of the Chinese phonetic alphabet spelling of the name of its first stroke, and a numeral single character is filled in order with the letters of the Chinese phonetic alphabet spelling of the numeral's own pronunciation, until there are 4 letters.
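The selection of coding pen pairs for a single character can be sketched in Python. The function name `single_character_codes` is hypothetical, and the sketch rests on one interpretation that the text does not settle: two pairs are treated as "identical" when they are built from the same pen positions. The filling steps (last stroke with its state, then pinyin letters) are left to the caller.

```python
def single_character_codes(pen_digits):
    """Return the two-digit codes of the available coding pen pairs.

    Pairs are taken from pens 1-2, 3-4, 5-6 and the last two pens. A pair
    is skipped when a pen is missing or when the same positions were
    already used, so fewer than four codes may be returned.
    """
    n = len(pen_digits)
    if n < 2:
        return []
    positions = [(0, 1), (2, 3), (4, 5), (n - 2, n - 1)]
    codes = []
    seen = set()
    for a, b in positions:
        if b >= n or (a, b) in seen:
            continue  # pair unavailable or duplicated: a vacancy to be filled
        seen.add((a, b))
        codes.append(f"{pen_digits[a]}{pen_digits[b]}")
    return codes


# Illustrative digit sequences, not taken from real characters.
print(single_character_codes([1, 2, 3, 4, 5, 1, 2, 3]))  # four distinct pairs
print(single_character_codes([1, 2, 3, 4, 5, 1]))        # last pair repeats the third
```

When fewer than four codes come back, the remaining code bits would be filled as described in the paragraph above.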
For a Chinese character that can be divided into three parts, the coding method is:
(1) The Latin letter at the first code bit: a two-digit code is determined from the first and second coding pens or pen groups of the Chinese character, and the Latin letter at the first code bit is obtained by looking that code up in the alphabet "general code fetch table";
(2) The Latin letter at the second code bit: a two-digit code is determined from the second-to-last and last coding pens or pen groups of the first part, and the Latin letter at the second code bit is obtained by looking that code up in the alphabet "general code fetch table";
If the first and second coding pens or pen groups of the Chinese character are respectively identical to the second-to-last and last coding pens or pen groups of the first part, the Latin letter at the second code bit is determined through the alphabet "general code fetch table" from the first and second coding pens or pen groups of the second part;
(3) The Latin letter at the third code bit: a two-digit code is determined from the second-to-last and last coding pens or pen groups of the second part, and the Latin letter at the third code bit is obtained by looking that code up in the alphabet "general code fetch table";
When the second part has only two strokes or pen groups, the Latin letter at the third code bit is determined through the alphabet "general code fetch table" from the first and second coding pens or pen groups of the third part;
(4) The Latin letter at the fourth code bit: a two-digit code is determined from the second-to-last and last coding pens or pen groups of the Chinese character, and the Latin letter at the fourth code bit is obtained by looking that code up in the alphabet "general code fetch table";
If the first and second coding pens or pen groups of the third part coincide respectively with the second-to-last and last coding pens or pen groups of the Chinese character, the Latin letter at the fourth code bit fills the vacancy: the two-digit code formed by the last stroke and the state of that stroke is looked up in the alphabet "general code fetch table" to determine the corresponding Latin letter.
For a Chinese character that can be divided into two parts (hereinafter the first part and the second part), the coding method is:
(1) When two different Latin letters at two code bits can be determined from the first part of the Chinese character and two different Latin letters at the other two code bits can be determined from the second part, i.e. when the first part and the second part each have three or more strokes or pen groups:
The Latin letter at the first code bit: a two-digit code is determined from the first and second coding pens or pen groups of the first part, and the Latin letter at the first code bit is obtained by looking that code up in the alphabet "general code fetch table";
The Latin letter at the second code bit: a two-digit code is determined from the second-to-last and last coding pens or pen groups of the first part, and the Latin letter at the second code bit is obtained by looking that code up in the alphabet "general code fetch table";
The Latin letter at the third code bit: a two-digit code is determined from the first and second coding pens or pen groups of the second part, and the Latin letter at the third code bit is obtained by looking that code up in the "general code fetch table";
The Latin letter at the fourth code bit: a two-digit code is determined from the second-to-last and last coding pens or pen groups of the second part, and the Latin letter at the fourth code bit is obtained by looking that code up in the alphabet "general code fetch table".
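Case (1) for two-part characters takes the first pair and the last pair of each part. A minimal Python sketch, with the hypothetical function name `two_part_codes` and made-up digit sequences, follows. Rejecting a part whose first and last pairs give the same code is this sketch's reading of the requirement that each part yield two different letters.

```python
def two_part_codes(first_part, second_part):
    """Return the four two-digit codes for case (1) of a two-part character.

    Each argument lists the digits of one part's coding pens or pen groups
    in writing order; each part needs at least three of them.
    """
    def head(part):
        return f"{part[0]}{part[1]}"    # first and second pens

    def tail(part):
        return f"{part[-2]}{part[-1]}"  # second-to-last and last pens

    for part in (first_part, second_part):
        if len(part) < 3:
            raise ValueError("case (1) needs three or more pens in each part")
        if head(part) == tail(part):
            # Assumption: identical codes cannot yield two different letters.
            raise ValueError("part does not yield two different codes")
    return [head(first_part), tail(first_part), head(second_part), tail(second_part)]


# Illustrative digit sequences, not taken from a real character.
print(two_part_codes([1, 2, 3], [4, 5, 1, 2]))
```

Characters that fail either check would fall under one of the other two-part cases described in the text.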
(2) When the Latin letter at only one code bit can be determined from the first part of the Chinese character, and the second part can be divided into two subparts:
The Latin letter at the first code bit: a two-digit code is determined from the first and second coding pens or pen groups of the first part, and the Latin letter at the first code bit is obtained by looking that code up in the alphabet "general code fetch table";
The Latin letter at the second code bit: a two-digit code is determined from the first and second coding pens or pen groups of the first subpart of the second part, and the Latin letter at the second code bit is obtained by looking that code up in the alphabet "general code fetch table";
The Latin letter at the third code bit: a two-digit code is determined from the second-to-last and last coding pens or pen groups of the first subpart of the second part, and the Latin letter at the third code bit is obtained by looking that code up in the alphabet "general code fetch table";
If the first and second coding pens or pen groups of the first subpart of the second part are respectively identical to the second-to-last and last coding pens or pen groups of that subpart, the Latin letter at the third code bit is determined through the "general code fetch table" from the first and second coding pens or pen groups of the second subpart of the second part;
The Latin letter at the fourth code bit: a two-digit code is determined from the second-to-last and last coding pens or pen groups of the second subpart of the second part, and the Latin letter at the fourth code bit is obtained by looking that code up in the alphabet "general code fetch table";
Where the Latin letter at the third code bit has been determined through the alphabet "general code fetch table" from the first and second coding pens or pen groups of the second subpart of the second part, and those coincide respectively with the second-to-last and last coding pens or pen groups of the second subpart, i.e. the second subpart of the second part has only two strokes or pen groups, the Latin letter at the fourth code bit fills the vacancy: the two-digit code formed by the last stroke of the Chinese character and the state of that stroke is looked up in the alphabet "general code fetch table" to determine the corresponding Latin letter.
(3) When the Latin letter at only one code bit can be determined from the first part of the Chinese character, and the second part cannot be divided into two subparts: this situation can be coded by the coding method for single characters. (Note: further wording is omitted here.)
(4) When the first part of the Chinese character can be divided into two subparts (the first subpart and the second subpart), and only one Latin letter at one code bit can be determined from the second part of the Chinese character:
The Latin letter at the first code bit: a two-digit code is determined from the first and second coding pens or pen groups of the first subpart, and the Latin letter at the first code bit is obtained by looking that code up in the alphabet "general code fetch table";
The Latin letter at the second code bit: a two-digit code is determined from the second-to-last and last coding pens or pen groups of the first subpart, and the Latin letter at the second code bit is obtained by looking that code up in the alphabet "general code fetch table";
If the first and second coding pens or pen groups of the first subpart coincide respectively with the second-to-last and last coding pens or pen groups of the first subpart, i.e. the first subpart of the first part has only two coding pens or pen groups, a two-digit code is determined from the first and second coding pens or pen groups of the second subpart of the first part, and the Latin letter at the second code bit is obtained by looking that code up in the alphabet "general code fetch table";
The Latin letter at the third code bit: a two-digit code is determined from the second-to-last and last coding pens or pen groups of the second subpart, and the Latin letter at the third code bit is obtained by looking that code up in the alphabet "general code fetch table";
The Latin letter at the fourth code bit: a two-digit code is determined from the second-to-last and last coding pens or pen groups of the Chinese character, and the Latin letter at the fourth code bit is obtained by looking that code up in the alphabet "general code fetch table";
If the first and second coding pens or pen groups of the first subpart are respectively identical to the second-to-last and last coding pens or pen groups of the first subpart, i.e. the first subpart of the first part has only two coding pens or pen groups, and the second subpart of the first part also has only two coding pens or pen groups, then a two-digit code is determined from the second-to-last and last coding pens or pen groups of the Chinese character and the Latin letter at the third code bit is obtained by looking that code up in the alphabet "general code fetch table"; the Latin letter at the fourth code bit fills the vacancy: the two-digit code formed by the last stroke of the Chinese character and the state of that stroke is looked up in the alphabet "general code fetch table" to determine the corresponding Latin letter.
(5) When the first part of the Chinese character has only one coding pen or pen group:
The Latin letter on the first code bit: the first and second coding pens or pen groups of the whole character determine a two-digit code, and looking up this code in the alphabet "general code fetch table" gives the Latin letter on the first code bit;
The Latin letter on the second code bit: the first and second coding pens or pen groups of the second part determine a two-digit code, and looking up this code in the alphabet "general code fetch table" gives the Latin letter on the second code bit;
If the second part of the character has only two coding pens or pen groups, the Latin letter on the third code bit is determined by the two-digit code formed from the end stroke of the character and the state of the end stroke, looked up in the alphabet "general code fetch table", which fills the vacant code bit; the fourth code bit is filled with the initial letter of the pinyin of the name of the character's first stroke;
If the second part of the character has only three coding pens or pen groups, the Latin letter on the third code bit: the second-to-last and last coding pens or pen groups of the whole character determine a two-digit code, and looking up this code in the alphabet "general code fetch table" gives the Latin letter on the third code bit; the Latin letter on the fourth code bit is determined by the two-digit code formed from the end stroke of the character and the state of the end stroke, looked up in the alphabet "general code fetch table", which fills the vacant code bit;
If the second part of the character has only four coding pens or pen groups, the Latin letter on the third code bit: the second-to-last and last coding pens or pen groups of the second part determine a two-digit code, and looking up this code in the alphabet "general code fetch table" gives the Latin letter on the third code bit; the Latin letter on the fourth code bit is determined by the two-digit code formed from the end stroke of the character and the state of the end stroke, looked up in the alphabet "general code fetch table", which fills the vacant code bit;
If the second part of the character has five or more coding pens or pen groups, the Latin letter on the third code bit: the third and fourth coding pens or pen groups of the second part determine a two-digit code, and looking up this code in the alphabet "general code fetch table" gives the Latin letter on the third code bit; the Latin letter on the fourth code bit: the second-to-last and last coding pens or pen groups of the whole character determine a two-digit code, and looking up this code in the alphabet "general code fetch table" gives the Latin letter on the fourth code bit.
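The case analysis in rule (5) can be summarised as a small selector. The Python sketch below is illustrative only: the function name and the string labels (`char`, `part2`, `end_stroke+state`, `initial_of_first_stroke_name`) are hypothetical and not part of the method. Indices are 1-based counts of coding pens or pen groups, and negative indices count from the end.

```python
def rule5_sources(n_second):
    """Say what fixes each of the four code bits when the first part has
    exactly one coding pen (or pen group) and the second part has
    n_second of them. Labels are illustrative, not from the patent."""
    if n_second < 2:
        raise ValueError("rule (5) needs at least two coding pens in the second part")
    pos1 = "char[1]+char[2]"      # first two coding pens of the whole character
    pos2 = "part2[1]+part2[2]"    # first two coding pens of the second part
    if n_second == 2:
        return [pos1, pos2, "end_stroke+state", "initial_of_first_stroke_name"]
    if n_second == 3:
        return [pos1, pos2, "char[-2]+char[-1]", "end_stroke+state"]
    if n_second == 4:
        return [pos1, pos2, "part2[-2]+part2[-1]", "end_stroke+state"]
    # five or more coding pens in the second part
    return [pos1, pos2, "part2[3]+part2[4]", "char[-2]+char[-1]"]
```

Every label that names a pair of coding pens, and `end_stroke+state`, stands for a two-digit code that is then looked up in the alphabet "general code fetch table".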
(6) When the second part of the Chinese character has only one coding pen or pen group:
If the first part of the character has only one or two coding pens or pen groups, the character is treated as a single character;
If the first part of the character has only three or four coding pens or pen groups: the Latin letter on the first code bit: the first and second coding pens or pen groups of the whole character determine a two-digit code, and looking up this code in the alphabet "general code fetch table" gives the Latin letter on the first code bit; the Latin letter on the second code bit: the second-to-last and last coding pens or pen groups of the first part determine a two-digit code, and looking up this code in the alphabet "general code fetch table" gives the Latin letter on the second code bit; the Latin letter on the third code bit: the second-to-last and last coding pens or pen groups of the whole character determine a two-digit code, and looking up this code in the alphabet "general code fetch table" gives the Latin letter on the third code bit; the Latin letter on the fourth code bit is determined by the two-digit code formed from the end stroke of the character and the state of the end stroke, looked up in the alphabet "general code fetch table", which fills the vacant code bit;
If the first part of the character has five or more coding pens or pen groups: the Latin letter on the first code bit: the first and second coding pens or pen groups of the whole character determine a two-digit code, and looking up this code in the alphabet "general code fetch table" gives the Latin letter on the first code bit; the Latin letter on the second code bit: the third and fourth coding pens or pen groups of the whole character determine a two-digit code, and looking up this code in the alphabet "general code fetch table" gives the Latin letter on the second code bit; the Latin letter on the third code bit: the second-to-last and last coding pens or pen groups of the first part determine a two-digit code, and looking up this code in the alphabet "general code fetch table" gives the Latin letter on the third code bit; the Latin letter on the fourth code bit: the second-to-last and last coding pens or pen groups of the whole character determine a two-digit code, and looking up this code in the alphabet "general code fetch table" gives the Latin letter on the fourth code bit.
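Rule (6) follows the same pattern, keyed on the number of coding pens or pen groups in the first part. The sketch below is a hedged illustration: the function name and labels are hypothetical, `None` is an arbitrary way of signalling "treat as a single character", and indices are 1-based with negative indices counting from the end.

```python
def rule6_sources(n_first):
    """Say what fixes each of the four code bits when the second part has
    exactly one coding pen (or pen group) and the first part has n_first.
    Returns None when the character is handled as a single character."""
    if n_first < 1:
        raise ValueError("the first part needs at least one coding pen")
    if n_first <= 2:
        return None  # one or two coding pens: single-character method applies
    if n_first <= 4:
        return ["char[1]+char[2]", "part1[-2]+part1[-1]",
                "char[-2]+char[-1]", "end_stroke+state"]
    # five or more coding pens in the first part
    return ["char[1]+char[2]", "char[3]+char[4]",
            "part1[-2]+part1[-1]", "char[-2]+char[-1]"]
```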
Example characters:
Single characters:
East: the first combination is "horizontal, multi-bend", whose two-digit code is 11, corresponding to the letter A; the second combination is "left-falling, dot", corresponding to the letter R; the end-stroke state code is 41, corresponding to the letter D; the initial letter of the pinyin of the first stroke's name is H; so "East" is encoded as ARDH.
One: there is only one stroke, so no stroke combination can be formed; the first stroke is also the end stroke, its state code is 11, and the corresponding letter is A; "One" is a numeral pronounced YI, so it is encoded as AYI, and the fourth code bit is a space.
Two: there is only one group, "multi-horizontal", so no pen (group) pair can be formed; the end-stroke state code is 11, corresponding to the letter A, and the character itself is pronounced ER, so it is encoded as AER, and the last code bit is a space.
Three: there is only one group; the end-stroke state is 1, corresponding to the letter A, and the character itself is pronounced SAN, so it is encoded as ASAN.
Narrow eyes into a slit: there is only one group, "multi-bend"; the end stroke is "bend", whose code is 5, and its state is "one intersection point", whose code is 3, so the two-digit code is 53, corresponding to the letter O; the first stroke's name is pronounced WAN, whose initial letter is W, so the character is encoded as OWAN.
First: the first combination is "vertical, bend", whose two-digit code is 25, corresponding to the letter V; the second combination is "multi-horizontal, vertical", whose two-digit code is 22, corresponding to the letter G; the end stroke's code is 2 and its state code is 4, corresponding to the letter Q; the initial letter of the pinyin of the first stroke's name is S; so the code is VGQS.
Shen: the end-stroke state code is 25; the rest is the same as for "First", so its code is VGVS.
By: because of the stroke order, the two horizontal strokes are not written consecutively and cannot be combined into a group. The code is VFBS.
?: the end stroke is "bend" and its state is 2; the character is encoded as EUJW.
Oneself: the end stroke is "bend" and its state is 1; the character is encoded as EUEW.
Two-part characters:
The Chinese: the left side is "multi-dot, rising", with the two-digit code 51, which gives the letter E; the right side is "bend, right-falling", with the code 54, which gives the letter T. The end-stroke state is "right-falling, 3", with the code 43, which gives the letter N. The first stroke is "dot", so the initial letter of its name is D. The pure shape code of this character is therefore ETND.
Family: the first combination of the left side is "dot, horizontal" and its last combination is "bend, left-falling"; the first combination of the right side is "left-falling, horizontal", and the last combination of the whole character is "left-falling, dot". These correspond to D, O, C and R respectively, so the pure shape code of this character is DOCR.
Return: the first combination of the first part (a semi-enclosure) is "multi-left-falling, bend" and its last combination is "bend, dot"; the first combination of the second part is "dot, bend", and the last combination of the whole character is "bend, right-falling". These correspond to the letters X, T, X and T respectively, so the pure shape code is XTXT.
Build: the second part has only two pens (groups), so the first part should supply three different stroke combinations where possible. However, the first part has only three pens (groups) and can yield only two code bits. Its first combination is "bend, multi-horizontal", corresponding to the letter J, and its last combination is "multi-horizontal, vertical", corresponding to the letter G. The second part has only one combination, "bend, right-falling", corresponding to the letter T. The end-stroke state "right-falling, 3" must therefore be used, and it corresponds to the letter N. The pure shape code of this character is JGTN.
Ring: the first combination of the first part, the last combination of the first part, the first combination of the second part and the last combination of the second part are, in order, "vertical, bend", "bend, horizontal", "left-falling, bend" and "bend, horizontal", corresponding in order to the letters V, E, W and E. The pure shape code of this character is VEWE.
Subtract: the first part of this character has only the single combination "dot, rising", and the second part cannot be divided into two subdivisions, so the character can be encoded by the single-character method. Its combinations are, in order, "dot, rising", "horizontal, left-falling", "horizontal, vertical" and "left-falling, dot", so its pure shape code is DKFR.
Benevolence: the first combination of the whole character is "left-falling, vertical" and the last combination is "vertical, multi-horizontal"; the end-stroke state is "horizontal, 1", and the initial letter of the pinyin of the first stroke's name is P, so the pure shape code is HGAP.
Mushroom: the first part is "horizontal, multi-vertical", and the second part can be divided into two subdivisions. The first and last combinations of the first subdivision are "bend, left-falling" and "left-falling, horizontal" respectively, and the last combination of the whole character is "bend, horizontal", so the pure shape code is KOCE.
Good: this character has four natural parts. "Scholar" contains three pens (groups) and is the first coding part; the remainder is the second coding part. The pure shape code of this character is FBVE.
Guilt: this character also has four natural parts, but "ten" has fewer than three pens (groups), so it is merged with "mouth" to form the first coding part. The remainder is the second coding part. The pure shape code of this character is FEDF.
Old: the first part naturally has only one stroke, and vertical strokes belonging to different parts cannot be combined into a "multi-vertical" group. The first combination of the whole character is "vertical, vertical", the first combination of the second part is "vertical, bend", and the last combination of the whole character is "bend, multi-horizontal". No other combinations remain, so the end-stroke state "horizontal, 1" is used, and the pure shape code is GVJA.
Peng: the right side has only the group "multi-left-falling". The upper-left component "scholar" contains three pens (groups); the last combination of the left side is "left-falling, horizontal", and the last combination of the whole character is "horizontal, multi-left-falling". The pure shape code of this character is FBCP.
Three-part characters:
: the first part has three coding pens and should determine two code bits; each of the other parts determines one code bit. The second part uses its first pen (group) pair, and the third part uses its last coding pen (group) pair. Its code is VEJE.
Proud: the first part has only two pens (groups) and can determine only one code bit. The second and third parts could each determine two code bits, but the second part comes first, so its first and last coding pen (group) pairs determine the second and third code bits respectively. The last coding pen (group) pair of the whole character determines the fourth code bit. Its code is HGOR.
Become: the first part has two pens (groups). The second part has only one pen (group), which cannot form a pen (group) pair on its own, so it is paired with the first pen (group) of the third part. The first pen (group) of the third part is also paired with the second pen (group) of the third part, and the last two pens (groups) determine the fourth code bit. The code is KYOR.
For every Chinese character with three or more parts, the present invention merges parts one after another, starting from the first part, and stops merging as soon as the merged parts contain three or more coding pens (groups). The merged parts form the first coding section and the remainder forms the second coding section, after which the code is fetched as for a two-part character. For example: Beach: the first two parts together have four coding pens (groups) and are merged into the first coding section, and the rest is merged into the second coding section; its code is ETHB. Stand: the first part already has three coding pens (groups), so merging stops there; its code is UETB.
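The merging rule can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: the function name, the `threshold` parameter and the per-part counts in the usage note are hypothetical. The text gives only the merged totals for "Beach" and "Stand", not the count for every part.

```python
def split_coding_sections(pens_per_part, threshold=3):
    """Merge parts from the front until the merged parts hold at least
    `threshold` coding pens (groups); return (first_section, second_section)
    as lists of per-part counts."""
    merged = 0
    for i, count in enumerate(pens_per_part):
        merged += count
        if merged >= threshold:
            return pens_per_part[:i + 1], pens_per_part[i + 1:]
    # Threshold never reached: everything ends up in the first section.
    return list(pens_per_part), []
```

With made-up counts `[2, 2, 3]`, the first two parts (four coding pens in total) are merged, as in the "Beach" example; with `[3, 2, 2]` the first part alone forms the first coding section, as in the "Stand" example.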
Embodiment 2:
This embodiment provides the sound-shape mixed version of the general code fetch Chinese character input method.
The sound-shape mixed version refers to the phoneme symbols and the tone mark of the Chinese phonetic alphabet (pinyin) collectively as phonograms. The order of the phoneme symbols follows the natural order of the phonemes, and the tone mark is taken as the last phonogram. The assignment rule for phonograms is: the phoneme symbols B, P, M, F, A and the high-level tone mark all correspond to the digit 1; the phoneme symbols D, T, N, L, O and the rising tone mark all correspond to the digit 2; the phoneme symbols G, K, H, J, Q, X, NG, E and the third tone mark all correspond to the digit 3; the phoneme symbols ZH, CH, SH, R, I, Y and the falling tone mark all correspond to the digit 4; the phoneme symbols Z, C, S, U, V, the neutral tone mark (blank) and the ancient entering tone mark (which has no mark of its own but has a corresponding digit, used for special codes) all correspond to the digit 5.
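The assignment rule above amounts to a lookup table. The following Python sketch shows one way to apply it to a single syllable. The function names and the tone keys (1 to 4 for the four tones, 0 for the neutral tone) are hypothetical conventions chosen for illustration. Symbols that the passage does not list, such as W, are deliberately left out and raise `KeyError`.

```python
# Digit assigned to each phoneme symbol, as listed in the passage above.
_ASSIGNMENT = {
    1: ["B", "P", "M", "F", "A"],
    2: ["D", "T", "N", "L", "O"],
    3: ["G", "K", "H", "J", "Q", "X", "NG", "E"],
    4: ["ZH", "CH", "SH", "R", "I", "Y"],
    5: ["Z", "C", "S", "U", "V"],
}
PHONEME_DIGIT = {sym: d for d, syms in _ASSIGNMENT.items() for sym in syms}

# Hypothetical tone keys: 1-4 for the four tones, 0 for the neutral tone.
# (The text also assigns 5 to the ancient entering tone.)
TONE_DIGIT = {1: 1, 2: 2, 3: 3, 4: 4, 0: 5}


def split_phonemes(pinyin):
    """Split one toneless syllable into phoneme symbols, treating
    ZH, CH, SH and NG as single phonemes."""
    s = pinyin.upper()
    phonemes, i = [], 0
    while i < len(s):
        step = 2 if s[i:i + 2] in ("ZH", "CH", "SH", "NG") else 1
        phonemes.append(s[i:i + step])
        i += step
    return phonemes


def phonogram_digits(pinyin, tone):
    """Digits of all phonograms of a syllable, with the tone digit last."""
    return [PHONEME_DIGIT[p] for p in split_phonemes(pinyin)] + [TONE_DIGIT[tone]]
```

For instance, `phonogram_digits("ba", 4)` gives `[1, 1, 4]`: B and A both map to 1 and the falling tone maps to 4.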
The method of encoding a Chinese character with the sound-shape mixed version of the general code fetch Chinese character input method is as follows:
The Latin letter on the first code bit: the first and second coding pens or pen groups of the character determine a two-digit code, and looking up this code in the "general code fetch table" gives the Latin letter on the first code bit;
The Latin letter on the second code bit: the second-to-last and last coding pens or pen groups of the character determine a two-digit code, and looking up this code in the "general code fetch table" gives the Latin letter on the second code bit;
If the first and second coding pens or pen groups of the character coincide with its second-to-last and last coding pens or pen groups, the initial letter of the pinyin of the name of the character's first stroke is used as the Latin letter on the second code bit.
If the pinyin of the character consists of three phonograms, that is, two phoneme symbols and the tone mark: the Latin letter on the third code bit: the first combination of the pinyin of the character determines a two-digit code, and looking up this code in the "general code fetch table" gives the Latin letter on the third code bit; the Latin letter on the fourth code bit: the combination of the last phoneme of the pinyin of the character and the tone (high-level tone, rising tone, third tone, falling tone, neutral tone or ancient entering tone) determines a two-digit code, and looking up this code in the "general code fetch table" gives the Latin letter on the fourth code bit.
The first combination of the pinyin of the character consists of the first and second phonemes of the pinyin, which are generally the first and second letters of the pinyin, except that NG, ZH, CH and SH are each regarded as one phoneme.
The last phoneme of the pinyin of the character is generally the last letter of the pinyin, except that NG, ZH, CH and SH are each regarded as one phoneme. YU and ER are each regarded as a combination of two phonograms.
If the pinyin of the character consists of four phonograms (three of the non-zero initial, the alliteration, the head vowel of the final and the ending of the final, together with the tone mark): the Latin letter on the third code bit: the first combination of the pinyin of the character determines a two-digit code, and looking up this code in the "general code fetch table" gives the Latin letter on the third code bit; the Latin letter on the fourth code bit: the combination of the third phoneme of the pinyin of the character and the last phonogram (the tone mark) determines a two-digit code, and looking up this code in the "general code fetch table" gives the Latin letter on the fourth code bit. NG, ZH, CH and SH are each regarded as one phoneme.
If the pinyin of the character consists of five phonograms: the Latin letter on the third code bit: the first combination of the pinyin of the character determines a two-digit code, and looking up this code in the alphabet "general code fetch table" gives the Latin letter on the third code bit; the Latin letter on the fourth code bit: the combination of the third phoneme and the last phoneme symbol of the pinyin of the character determines a two-digit code, and looking up this code in the alphabet "general code fetch table" gives the Latin letter on the fourth code bit. NG, ZH, CH and SH are each regarded as one phoneme.
For a character whose pinyin contains five phonograms, the last phonogram, the tone mark, does not take part in the coding.
If the pinyin of the character consists of only two phonograms: the Latin letter on the third code bit: the ordered pair of digits corresponding to the two phonograms, one phoneme symbol and one tone mark, is looked up in the "general code fetch table". The Latin letter on the fourth code bit: the initial letter of the pinyin of the name of the character's first stroke; every bend stroke is represented by "w". No Chinese character has only one phonogram: there is always at least one phoneme and one tone, and therefore at least two phonograms.
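The choice of phonogram pairs for the third and fourth code bits depends only on how many phonograms the syllable has. The sketch below is illustrative, with a hypothetical function name: it takes the phonogram digits in order (tone digit last) and returns the two-digit codes to be looked up. It returns `None` for the fourth code bit in the two-phonogram case, because that bit comes from the first stroke's name rather than from the pronunciation.

```python
def mixed_sound_codes(digits):
    """digits: phonogram digits in order, tone digit last (2 to 5 entries).
    Returns (third, fourth) as two-digit strings; fourth is None when the
    fourth code bit is taken from the first stroke's name instead."""
    n = len(digits)
    if not 2 <= n <= 5:
        raise ValueError("a syllable has between 2 and 5 phonograms")
    d = [str(x) for x in digits]
    third = d[0] + d[1]            # first combination (phoneme + tone if n == 2)
    if n == 2:
        return third, None
    if n == 3:
        return third, d[1] + d[2]  # last phoneme + tone
    # n == 4: third phoneme + tone; n == 5: third + last phoneme, tone unused
    return third, d[2] + d[3]
```

For example, the digits `[1, 1, 4]` (b, a, falling tone) give the codes 11 and 14, and `[3, 5, 1, 3, 2]` (h, u, a, ng, tone ignored) give 35 and 13.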
Two-phonogram characters:
Salt down: the first stroke combination is "left-falling, bend", corresponding to the letter W, and the last stroke combination is "multi-horizontal, bend", corresponding to the letter V; the phoneme symbol is a and the tone is the high-level tone, corresponding to the letter A. The first stroke's name, "left-falling", is pronounced with the initial letter P, so the sound-shape mixed code of this character is WVAP.
E: the first stroke combination is "left-falling, vertical", corresponding to the letter H, and the last stroke combination is "multi-horizontal, bend", corresponding to the letter V; the phoneme is e and the tone mark is the falling tone mark, so the phonogram combination corresponds to the letter R; the initial letter of the pinyin of the first stroke's name is P, and the mixed code of this character is HVRP.
Wo: the phoneme is o and the tone is the third tone, so the phonogram combination corresponds to the letter L; the initial letter of the pinyin of the first stroke's name is S, and the mixed code of this character is VTLS.
Three-phonogram characters:
: the phonemes are, in order, a and i, and the tone is the high-level tone; the first phonogram combination corresponds to the letter P and the last phonogram combination corresponds to the letter D; the mixed code of this character is VRPD.
Father: the phonemes are, in order, b and a, and the tone is the falling tone; the first phonogram combination corresponds to the letter A and the last phonogram combination corresponds to the letter P; the mixed code is RUAP.
District: the phonemes are, in order, q and u, and the tone is the high-level tone. The first phonogram combination corresponds to the letter W and the last phonogram combination corresponds to the letter E; the mixed code is KXWE.
Grace: the phonemes are, in order, e and n, and the tone is the high-level tone; the mixed code is VYHB.
Four-phonogram characters:
The child: the phonemes are, in order, h, a and i, and the tone is the rising tone. The first phonogram combination corresponds to the letter C and the last phonogram combination corresponds to the letter D. The mixed code is ARCD.
Press: the phonemes are, in order, q, i and n, and the tone is the falling tone. The letter on the third code bit is R and the letter on the fourth code bit is Q; the mixed code is PRRQ.
Alarm: the phonemes are, in order, s, o and ng, and the tone is the third tone. The letter on the third code bit is J and the letter on the fourth code bit is M; the mixed code is RHJM.
Arm spread: the phonemes are, in order, t, u and o, and the tone is the third tone. The letter on the third code bit is V and the letter on the fourth code bit is L; the mixed code is DRVL.
Five-phonogram characters:
Yellow: the phonemes are, in order, h, u, a and ng; the tone does not take part in the coding. The first phonogram combination corresponds to the letter W and the last coded phoneme combination corresponds to the letter K; the mixed code is KRWK.
Advise: the phonemes are, in order, q, u, a and n; the tone does not take part in the coding. The letter on the third code bit is W and the letter on the fourth code bit is F; the mixed code is TOWF.
Jump: the phonemes are, in order, t, i, a and o; the tone does not take part in the coding. The mixed code is VRQF.
Cuan: the phonemes are, in order, c, u, a and n; the tone does not take part in the coding. The mixed code is HRYF.
Hit: the phonemes are, in order, zh, u, a and ng; the tone does not take part in the coding. The mixed code is UGXK.
Embodiment 3:
The present invention also provides a Chinese character input method for the numeric keypad of a mobile phone, namely the digital holographic version of the "general code fetch input method of Chinese character".
The numeric keypad of a mobile phone has ten digits: 0, 1, 2, 3, 4, 5, 6, 7, 8, 9. A Chinese character is determined by six digits on six code bits.
The digits on the first, second, third and fourth code bits are determined as follows:
(1) For a character with five or more strokes, four two-digit codes are determined in the same way as in the pure shape code version of the general code fetch Chinese character input method, and the digits on the first to fourth code bits are then obtained by looking up these four codes in the digital version of the "general code fetch table". The only difference is that wherever the pure shape code version fills a code bit with the initial letter of the pinyin of the name of the character's first stroke, the digit corresponding to the first stroke is used instead;
(2) For a character with four strokes, the digits corresponding to the strokes, in order, are the digits on the first, second, third and fourth code bits;
(3) For a character with fewer than four strokes, the digits corresponding to the strokes, in order, are used first; since these do not fill four code bits, the digits corresponding to the states of the strokes, in order, fill the remaining code bits; if there are still fewer than four digits, the code is padded to four digits with 0.
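Cases (2) and (3) need no table lookup, so they are easy to sketch. The Python function below is an illustrative assumption about how the rule could be coded; its name and argument layout are hypothetical. It takes the digit of each stroke and the digit of each stroke's state, in writing order.

```python
def short_shape_digits(stroke_digits, state_digits):
    """First four code bits for a character with at most four strokes:
    stroke digits first, then state digits, then zero padding."""
    if not 1 <= len(stroke_digits) <= 4:
        raise ValueError("this rule covers characters with 1 to 4 strokes")
    if len(stroke_digits) != len(state_digits):
        raise ValueError("one state digit is expected per stroke")
    code = list(stroke_digits)
    if len(code) < 4:
        code += state_digits           # fill with stroke states, in order
    code = code[:4]                    # keep at most four digits
    code += [0] * (4 - len(code))      # pad with 0 if still short
    return "".join(str(d) for d in code)
```

A one-stroke character with stroke digit 1 and state digit 1 gives `"1100"`, and a two-stroke character with stroke digits 5, 5 and state digits 3, 3 gives `"5533"`, matching the first four digits of the worked examples for "One" and "Narrow eyes into a slit".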
If the pinyin of the character consists of three phonograms, that is, two phoneme symbols and the tone mark: the digit on the fifth code bit: the digit corresponding to the first phoneme symbol is used directly as the digit on the fifth code bit; the digit on the sixth code bit: the combination of the last phoneme of the pinyin of the character and the tone (high-level tone, rising tone, third tone, falling tone, neutral tone or ancient entering tone) determines a two-digit code, and looking up this code in the digital version of the "general code fetch table" gives the digit on the sixth code bit. The last phoneme of the pinyin of the character is generally the last letter of the pinyin, except that NG, ZH, CH and SH are each regarded as one phoneme.
If the pinyin of the character consists of four phonograms (three of the non-zero initial, the alliteration, the head vowel of the final and the ending of the final, together with the tone mark): the digit on the fifth code bit: the first combination of the pinyin of the character determines a two-digit code, and looking up this code in the digital version of the "general code fetch table" gives the digit on the fifth code bit; the first combination of the pinyin of the character consists of the first and second phonemes of the pinyin, which are generally the first and second letters of the pinyin, except that NG, ZH, CH and SH are each regarded as one phoneme; the digit on the sixth code bit: the third phoneme symbol of the pinyin of the character and the last phonogram, the tone mark, determine a two-digit code, and looking up this code in the digital version of the "general code fetch table" gives the digit on the sixth code bit. NG, ZH, CH and SH are each regarded as one phoneme symbol.
If the pinyin of the character consists of five phonograms: the digit on the fifth code bit: the first combination of the pinyin of the character determines a two-digit code, and looking up this code in the digital version of the "general code fetch table" gives the digit on the fifth code bit; the digit on the sixth code bit: the combination of the third phoneme and the last phoneme symbol of the pinyin of the character determines a two-digit code, and looking up this code in the digital version of the "general code fetch table" gives the digit on the sixth code bit. NG, ZH, CH and SH are each regarded as one phoneme.
For a character whose pinyin contains five phonograms, the last phonogram, the tone mark, does not take part in the coding.
If the pinyin of the character consists of only two phonograms: the digit on the fifth code bit: the digit corresponding to the phoneme is used directly as the digit on the fifth code bit; the digit on the sixth code bit: the digit corresponding to the tone is used directly as the digit on the sixth code bit.
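The fifth and sixth code bits differ from the mixed version mainly in when a phonogram digit is used directly and when a two-digit code must be looked up. The sketch below makes that distinction explicit. The function name and the `"digit"` / `"lookup"` tags are hypothetical, and the lookup itself is not performed, since it needs the digital version of the "general code fetch table".

```python
def holographic_sound_bits(digits):
    """digits: phonogram digits in order, tone digit last (2 to 5 entries).
    Returns (fifth, sixth); each is ("digit", d) for a digit used as is,
    or ("lookup", code) for a two-digit code to look up in the table."""
    n = len(digits)
    if not 2 <= n <= 5:
        raise ValueError("a syllable has between 2 and 5 phonograms")
    d = [str(x) for x in digits]
    if n == 2:
        return ("digit", d[0]), ("digit", d[1])         # phoneme, tone
    if n == 3:
        return ("digit", d[0]), ("lookup", d[1] + d[2])  # last phoneme + tone
    # n == 4: third phoneme + tone; n == 5: third + last phoneme, tone unused
    return ("lookup", d[0] + d[1]), ("lookup", d[2] + d[3])
```

For a syllable with phonogram digits `[4, 4, 1]` (y, i, high-level tone), the fifth code bit is the digit 4 directly and the sixth code bit comes from looking up the code 41.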
Digital version of the "general code fetch table": [table image, not reproduced in the text]
Narrow eyes into a slit: this character has only two strokes, fewer than five. The first and second strokes both correspond to the digit 5, and the states of both strokes correspond to the digit 3. The phonemes are, in order, m, i and e, and the tone is the high-level tone. The fifth and sixth code bits are determined by looking up the codes 14 and 31 in the digital version of the "general code fetch table", which gives 5 and 4 respectively. The digital holographic code of this character is 553354.
One: this character has only one stroke. The stroke corresponds to the digit 1 and its state also corresponds to the digit 1; the code is padded to four digits with 0. The phonemes are, in order, y and i, and the tone may be taken as the high-level tone. The fifth code bit is given directly by the value assigned to y, which is 4, and the sixth code bit is 5, so when "One" is read with the high-level tone its digital holographic code is 110045.
Two: each stroke of this character corresponds to the digit 1, and the state of each stroke also corresponds to the digit 1. The phonograms are, in order, e and r, and the tone is the falling tone. The fifth code bit corresponds to the digit 3 and the sixth code bit corresponds to the digit 8. The digital holographic code of this character is 111138.
Three: each stroke of this character and the state of each stroke all correspond to the digit 1. The phonemes are, in order, s, a and n, and the tone is the high-level tone. The fifth code bit corresponds to the digit 6 and the sixth code bit corresponds to the digit 3, so the holographic code is 111163.
Instrument: the first code bit is determined by "left-falling, vertical", with the two-digit code 32, which gives the digit 5. The second code bit is determined by "dot, left-falling", which gives the digit 7. The third code bit is determined by "left-falling, right-falling", which also gives the digit 7. The end-stroke state is "right-falling, 3", which again gives the digit 7. The phonograms are, in order, y and i, and the tone is the falling tone. The fifth code bit is the value assigned to y, which is 4, and the digit on the sixth code bit is 8. The holographic code is 577748.
Rather: the first code bit is determined by "multi-dot, bend", corresponding to the digit 0. The second code bit is determined by "horizontal, bend", corresponding to the digit 6. The end-stroke state is "bend, 2", corresponding to the digit 7, and the first stroke is a dot, corresponding to the digit 4. The phonograms are, in order, n, i and ng, and the tone may be the falling tone (the character is read with the falling tone when used as a surname). The digit on the fifth code bit is 6 and the digit on the sixth code bit is 7. The holographic code is 067467.
Star: the first to fourth code bits are determined by "vertical, bend", "bend, multi-horizontal", "left-falling, multi-horizontal" and "vertical, horizontal" respectively, corresponding to the digits 7, 7, 5 and 3. The phonograms (or combinations) are, in order, x, i and ng, and the tone is the high-level tone. The digits on the fifth and sixth code bits are 7 and 4 respectively, and the holographic code is 775374.
Credit: the first four code bits are determined, in order, by "vertical, dot", "left-falling, bend", "bend, dot" and "vertical, horizontal", corresponding to the digits 6, 8, 9 and 3. The phonemes are y, a and o, and the tone is the falling tone. The digit on the fifth code bit is 5 and the digit on the sixth code bit is 6, and the holographic code is 689356.

Claims (6)

1. A universal code-fetching Chinese character input method, characterized in that:
(1) all Chinese character strokes are divided into 5 classes of coding pens, and the digital code of each class is determined:
1) horizontal class: includes the horizontal stroke and the rising stroke; corresponding digit 1;
2) vertical class: includes the vertical stroke; corresponding digit 2;
3) left-falling class: includes the left-falling stroke; corresponding digit 3;
4) dot class: includes the dot and the right-falling stroke; corresponding digit 4;
5) curved class: includes the strokes whose names contain "fold", "curved" or "hook", and the "horizontal left-falling" stroke; corresponding digit 5;
(2) same-named strokes written consecutively within the same part are combined into one group, called a coding pen group, and the digital code of each coding pen group is determined:
1) multi-horizontal: horizontal strokes written consecutively within the same part; corresponding digit 2;
2) multi-vertical: vertical strokes written consecutively within the same part; corresponding digit 3;
3) multi-left-falling: left-falling strokes written consecutively within the same part, where the starting points of the strokes do not lie on opposite sides of another stroke; corresponding digit 4;
4) multi-dot: dots written consecutively within the same part; corresponding digit 5;
5) multi-curved: consecutively written curved strokes that share a common point; corresponding digit 1;
The two-digit code corresponding to two consecutively written coding pens or pen groups is determined as follows: the digit of the first coding pen or pen group is the first digit, and the digit of the second coding pen or pen group is the last digit;
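The digit assignments of steps (1) and (2) and the two-digit pairing rule can be sketched as follows. This is only an illustrative sketch: the dictionary and function names are assumptions introduced here, and the English class names stand in for the stroke classes of the claim.

```python
# Digits of the five coding-pen classes, as listed in step (1).
PEN_DIGIT = {"horizontal": 1, "vertical": 2, "left-falling": 3, "dot": 4, "curved": 5}

# Digits of the five coding pen groups, as listed in step (2).
GROUP_DIGIT = {"horizontal": 2, "vertical": 3, "left-falling": 4, "dot": 5, "curved": 1}


def unit_digit(stroke_class: str, is_group: bool = False) -> int:
    """Return the digit of a single coding pen or of a coding pen group."""
    table = GROUP_DIGIT if is_group else PEN_DIGIT
    return table[stroke_class]


def two_digit_code(first: tuple, second: tuple) -> int:
    """Combine two consecutive units, each given as (stroke_class, is_group).

    The first unit supplies the first digit and the second unit the last digit.
    """
    return 10 * unit_digit(*first) + unit_digit(*second)


# A left-falling stroke followed by a vertical stroke gives the code 32.
print(two_digit_code(("left-falling", False), ("vertical", False)))
```

The pair "left-falling, vertical" yields 32, which agrees with the two-digit code quoted for that pair in the worked examples.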
(3) common point of a stroke and other stroke is divided into to 3 classes:
1) contact: be simultaneously 2 strokes end points;
2) logical point: the end points that is a stroke is the non-end points of another stroke simultaneously;
3) intersection point: the non-end points that is simultaneously two strokes;
The state of determining stroke with feature and the quantity of other stroke common point according to a stroke and corresponding number;
1) the first state: there is no common point or only have contact between a stroke and other stroke, also corresponding number 1;
2) the second state: have and only have logical point between a stroke and other stroke, also corresponding number is 2;
3) third state: contact and logical point are arranged between a stroke and other stroke simultaneously, or 1 intersection point is arranged, also corresponding number 3;
4) the 4th state: 2 intersection points are arranged, also corresponding number 4 between a stroke and other stroke;
5) the 5th state: between a stroke and other stroke, the intersection point more than 3 is arranged, also corresponding number 5;
Determine the two digits code of stroke state, method is: the corresponding number of the stroke of take is the first, and the corresponding number of state of stroke of take is the position, end.
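A minimal sketch of the state rule in step (3), assuming the common points of a stroke have already been counted by class. The function names are hypothetical, and the order in which the conditions are tested (intersection points first) is an assumption for mixed cases that the claim does not spell out.

```python
def stroke_state(contacts: int, through_points: int, intersections: int) -> int:
    """Classify a stroke into states 1-5 from its counts of common points."""
    if intersections >= 3:
        return 5  # fifth state: 3 or more intersection points
    if intersections == 2:
        return 4  # fourth state: 2 intersection points
    if intersections == 1 or (contacts > 0 and through_points > 0):
        return 3  # third state: 1 intersection, or contact and through points together
    if through_points > 0:
        return 2  # second state: through points only
    return 1      # first state: no common point, or contact points only


def state_code(stroke_digit: int, state: int) -> int:
    """Two-digit state code: the stroke digit first, the state digit last."""
    return 10 * stroke_digit + state


# A right-falling stroke (dot class, digit 4) with one intersection point.
print(state_code(4, stroke_state(contacts=0, through_points=0, intersections=1)))
```

With these assumptions the example prints 43, the two-digit code of a dot-class stroke in the third state.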
2. The universal code-fetching Chinese character input method according to claim 1, characterized in that the alphabet "universal code-fetching table" is used to determine the Latin letter corresponding to a two-digit code, the form of the alphabet "universal code-fetching table" being:
The specific rule for using the alphabet "universal code-fetching table" to determine the Latin letter corresponding to a two-digit code is:
In the alphabet "universal code-fetching table", the Latin letter in the cell where the column numbered by the first digit of the two-digit code intersects the row numbered by the last digit of the two-digit code is the Latin letter corresponding to that two-digit code.
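The lookup rule (first digit selects the column, last digit selects the row) can be illustrated with a short sketch. The letter grid below is a purely hypothetical placeholder, since the actual table is not reproduced in this text; only the indexing logic is meant to reflect the rule, and the names are assumptions.

```python
# HYPOTHETICAL placeholder grid: the real alphabet table is not reproduced
# in this text, so these letters are NOT the patent's assignments.
PLACEHOLDER_TABLE = [
    "ABCDE",  # row 1
    "FGHIJ",  # row 2
    "KLMNO",  # row 3
    "PQRST",  # row 4
    "UVWXY",  # row 5
]


def lookup_letter(code: int, table=PLACEHOLDER_TABLE) -> str:
    """Return the letter at column = first digit, row = last digit."""
    first, last = divmod(code, 10)
    if not (1 <= first <= 5 and 1 <= last <= 5):
        raise ValueError(f"both digits of {code} must lie in 1..5")
    return table[last - 1][first - 1]


# Code 32: column 3, row 2 of the placeholder grid.
print(lookup_letter(32))
```

Swapping in the real table only requires replacing the five row strings; the column-then-row indexing stays the same.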
3. The universal code-fetching Chinese character input method according to claim 2, characterized in that the parts of a Chinese character are determined as follows: a part is a whole radical of the Chinese character or a combination of whole radicals, and satisfies the following conditions: (1) if the Chinese character has a top-bottom structure, everything that occupies one horizontal row is one part; if the Chinese character has a left-right structure, everything that occupies one vertical column is one part; strokes that are joined to one another generally belong to the same part, except as indicated in items (2) and (3);
(2) for a semi-enclosed Chinese character whose enclosing component is written in one continuous pass: the strokes of the enclosing component together form one part, and the strokes of the enclosed component together form one or two parts;
(3) for semi-enclosed and fully enclosed Chinese characters whose enclosing component is written in two passes: the strokes of the enclosing component written in the first pass together form one part, the strokes of the enclosing component written in the second pass together form another part, and the strokes of the enclosed component together form 1 or 2 parts;
The present invention divides Chinese characters into 4 types, respectively:
1) single characters;
2) two-part characters: Chinese characters formed of two parts, except for the cases listed under item 3);
3) three-part characters, of which there are three cases: ① Chinese characters formed of three parts; ② Chinese characters formed of two parts in which one part has fewer than three coding pens or pen groups and the other part can be divided into two or more parts; ③ the semi-enclosed and fully enclosed Chinese characters mentioned above whose enclosing component is written in two passes;
The present invention encodes every Chinese character as four Latin letters. The position of the first Latin letter is called the first code position, the position of the second Latin letter the second code position, the position of the third Latin letter the third code position, and the position of the fourth Latin letter the fourth code position; each Latin letter is determined from a two-digit code through the alphabet "universal code-fetching table".
4. The universal code-fetching Chinese character input method according to claim 3, characterized in that a pure shape-code version of the universal code-fetching Chinese character input method is provided;
For a single character, the encoding method is:
(1) the Latin letter at the first code position: the first coding pen or pen group and the second coding pen or pen group of the single character determine a two-digit code, called the first coding pen pair or pen group pair; the Latin letter at the first code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
(2) the Latin letter at the second code position: the third coding pen or pen group and the fourth coding pen or pen group of the single character determine a two-digit code, called the second coding pen pair or pen group pair; the Latin letter at the second code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
(3) the Latin letter at the third code position: the fifth coding pen or pen group and the sixth coding pen or pen group of the single character determine a two-digit code, called the third coding pen pair or pen group pair; the Latin letter at the third code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
(4) the Latin letter at the fourth code position: the second-to-last coding pen or pen group and the last coding pen or pen group of the single character determine a two-digit code, called the fourth coding pen pair or pen group pair; the Latin letter at the fourth code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
For a single character, when its coding pens or pen groups are not enough to form four coding pen pairs or pen group pairs, or when identical pairs occur, the vacancy is first filled with the Latin letter obtained by looking up, in the alphabet "universal code-fetching table", the two-digit code formed from the last stroke of the single character and the state of that last stroke. If there are still fewer than four Latin letters, a non-numeral single character is filled with the initial letter of the pinyin of the name of its first stroke, and a numeral single character is filled with the pinyin of the numeral's own pronunciation, until four letters are reached;
For a three-part Chinese character, the encoding method is:
(1) the Latin letter at the first code position: the first coding pen or pen group and the second coding pen or pen group of the Chinese character determine a two-digit code, and the Latin letter at the first code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
(2) the Latin letter at the second code position: the second-to-last coding pen or pen group and the last coding pen or pen group of the first part determine a two-digit code, and the Latin letter at the second code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
If the first and second coding pens or pen groups of the Chinese character are respectively identical to the second-to-last and last coding pens or pen groups of the first part, the Latin letter at the second code position is determined through the alphabet "universal code-fetching table" by the first and second coding pens or pen groups of the second part;
(3) the Latin letter at the third code position: the second-to-last coding pen or pen group and the last coding pen or pen group of the second part determine a two-digit code, and the Latin letter at the third code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
When the second part has only two strokes or pen groups, the Latin letter at the third code position is determined through the alphabet "universal code-fetching table" by the first and second coding pens or pen groups of the third part;
(4) the Latin letter at the fourth code position: the second-to-last coding pen or pen group and the last coding pen or pen group of the Chinese character determine a two-digit code, and the Latin letter at the fourth code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
If the first and second coding pens or pen groups of the third part coincide respectively with the second-to-last and last coding pens or pen groups of the Chinese character, the Latin letter at the fourth code position is determined through the alphabet "universal code-fetching table" from the two-digit code formed from the last stroke and the state of the last stroke, thereby filling the vacancy;
For a Chinese character divided into two parts, hereinafter called the first part and the second part, the encoding method is:
(1) when two different Latin letters at two code positions can be determined from the first part of the Chinese character and two different Latin letters at the other two code positions can be determined from the second part, that is, when the first part and the second part each have three or more strokes or pen groups:
The Latin letter at the first code position: the first and second coding pens or pen groups of the first part determine a two-digit code, and the Latin letter at the first code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
The Latin letter at the second code position: the second-to-last and last coding pens or pen groups of the first part determine a two-digit code, and the Latin letter at the second code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
The Latin letter at the third code position: the first and second coding pens or pen groups of the second part determine a two-digit code, and the Latin letter at the third code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
The Latin letter at the fourth code position: the second-to-last and last coding pens or pen groups of the second part determine a two-digit code, and the Latin letter at the fourth code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
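Case (1) for two-part characters can be sketched as a selection of four pairs. The sketch assumes each part is given as a list of the digits of its coding pens or pen groups in writing order; the function name is hypothetical, and the final table lookup that turns each two-digit code into a letter is left out.

```python
def two_part_case1_codes(part1: list, part2: list) -> list:
    """Return the four two-digit codes for case (1) of a two-part character.

    Each part is a list of coding-pen (or pen-group) digits in writing order
    and must contain at least three entries.
    """
    if len(part1) < 3 or len(part2) < 3:
        raise ValueError("case (1) needs three or more units in each part")

    def pair(a: int, b: int) -> int:
        return 10 * a + b

    return [
        pair(part1[0], part1[1]),    # first code position: first two units of part 1
        pair(part1[-2], part1[-1]),  # second code position: last two units of part 1
        pair(part2[0], part2[1]),    # third code position: first two units of part 2
        pair(part2[-2], part2[-1]),  # fourth code position: last two units of part 2
    ]


print(two_part_case1_codes([1, 2, 3], [4, 5, 1, 2]))
```

Each resulting code would then be converted to a Latin letter through the alphabet table; the other cases (2) to (6) change only which units feed each position.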
(2) when only one Latin letter at one code position can be determined from the first part of the Chinese character and the second part can be divided into two subparts:
The Latin letter at the first code position: the first and second coding pens or pen groups of the first part determine a two-digit code, and the Latin letter at the first code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
The Latin letter at the second code position: the first and second coding pens or pen groups of the first subpart of the second part determine a two-digit code, and the Latin letter at the second code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
The Latin letter at the third code position: the second-to-last and last coding pens or pen groups of the first subpart of the second part determine a two-digit code, and the Latin letter at the third code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
If the first and second coding pens or pen groups of the first subpart of the second part are respectively identical to the second-to-last and last coding pens or pen groups of that first subpart, the Latin letter at the third code position is determined through the alphabet "universal code-fetching table" by the first and second coding pens or pen groups of the second subpart of the second part;
The Latin letter at the fourth code position: the second-to-last and last coding pens or pen groups of the second subpart of the second part determine a two-digit code, and the Latin letter at the fourth code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
When the Latin letter at the third code position is determined through the alphabet "universal code-fetching table" by the first and second coding pens or pen groups of the second subpart of the second part, and the first and second coding pens or pen groups of that second subpart coincide respectively with its second-to-last and last coding pens or pen groups, that is, the second subpart of the second part has only two strokes or pen groups, the Latin letter at the fourth code position is determined through the alphabet "universal code-fetching table" from the two-digit code formed from the last stroke of the Chinese character and the state of that last stroke, thereby filling the vacancy;
(3) when only one Latin letter at one code position can be determined from the first part of the Chinese character and the second part cannot be divided into two subparts:
The Latin letter at the first code position: the first and second coding pens or pen groups of the first part determine a two-digit code, and the Latin letter at the first code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
The Latin letter at the second code position: the first and second coding pens or pen groups of the second part determine a two-digit code, and the Latin letter at the second code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
The Latin letter at the third code position: the third and fourth coding pens or pen groups of the second part determine a two-digit code, and the Latin letter at the third code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
If the second part has only two coding pens or pen groups, that is, the second part has no third and fourth coding pens or pen groups, the Latin letter at the third code position is determined through the alphabet "universal code-fetching table" from the two-digit code formed from the last stroke of the Chinese character and the state of that last stroke, thereby filling the vacancy; the Latin letter at the fourth code position is filled with the initial letter of the pinyin of the name of the first stroke of the Chinese character;
If the second part has only three coding pens or pen groups, the second-to-last and last coding pens or pen groups of the Chinese character determine a two-digit code, and the Latin letter at the third code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table"; the Latin letter at the fourth code position is determined through the alphabet "universal code-fetching table" from the two-digit code formed from the last stroke of the Chinese character and the state of that last stroke, thereby filling the vacancy;
(4) when the first part of the Chinese character can be divided into two subparts (the first subpart and the second subpart) and only one Latin letter at one code position can be determined from the second part of the Chinese character:
The Latin letter at the first code position: the first and second coding pens or pen groups of the first subpart determine a two-digit code, and the Latin letter at the first code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
The Latin letter at the second code position: the second-to-last and last coding pens or pen groups of the first subpart determine a two-digit code, and the Latin letter at the second code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
If the first and second coding pens or pen groups of the first subpart coincide respectively with the second-to-last and last coding pens or pen groups of the first subpart, that is, the first subpart of the first part has only two coding pens or pen groups, the first and second coding pens or pen groups of the second subpart of the first part determine a two-digit code, and the Latin letter at the second code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
The Latin letter at the third code position: the second-to-last and last coding pens or pen groups of the second subpart determine a two-digit code, and the Latin letter at the third code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
The Latin letter at the fourth code position: the second-to-last and last coding pens or pen groups of the Chinese character determine a two-digit code, and the Latin letter at the fourth code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
If the first and second coding pens or pen groups of the first subpart are respectively identical to the second-to-last and last coding pens or pen groups of the first subpart, that is, the first subpart of the first part has only two coding pens or pen groups, and the second subpart of the first part also has only two coding pens or pen groups, the second-to-last and last coding pens or pen groups of the Chinese character determine a two-digit code, and the Latin letter at the third code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table"; the Latin letter at the fourth code position is determined through the alphabet "universal code-fetching table" from the two-digit code formed from the last stroke of the Chinese character and the state of that last stroke, thereby filling the vacancy;
(5) when the first part of the Chinese character has only one coding pen or pen group:
The Latin letter at the first code position: the first and second coding pens or pen groups of the Chinese character determine a two-digit code, and the Latin letter at the first code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
The Latin letter at the second code position: the first and second coding pens or pen groups of the second part determine a two-digit code, and the Latin letter at the second code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
If the second part of the Chinese character has only two coding pens or pen groups, the Latin letter at the third code position is determined through the alphabet "universal code-fetching table" from the two-digit code formed from the last stroke of the Chinese character and the state of that last stroke, thereby filling the vacancy; the Latin letter at the fourth code position is filled with the initial letter of the pinyin of the name of the first stroke of the Chinese character;
If the second part of the Chinese character has only three coding pens or pen groups, the Latin letter at the third code position is obtained as follows: the second-to-last and last coding pens or pen groups of the Chinese character determine a two-digit code, which is looked up in the alphabet "universal code-fetching table"; the Latin letter at the fourth code position is determined through the alphabet "universal code-fetching table" from the two-digit code formed from the last stroke of the Chinese character and the state of that last stroke, thereby filling the vacancy;
If the second part of the Chinese character has only four coding pens or pen groups, the Latin letter at the third code position is obtained as follows: the second-to-last and last coding pens or pen groups of the second part determine a two-digit code, which is looked up in the alphabet "universal code-fetching table"; the Latin letter at the fourth code position is determined through the alphabet "universal code-fetching table" from the two-digit code formed from the last stroke of the Chinese character and the state of that last stroke, thereby filling the vacancy;
If the second part of the Chinese character has five or more coding pens or pen groups, the Latin letter at the third code position is obtained as follows: the third and fourth coding pens or pen groups of the second part determine a two-digit code, which is looked up in the alphabet "universal code-fetching table"; for the Latin letter at the fourth code position, the second-to-last and last coding pens or pen groups of the Chinese character determine a two-digit code, which is looked up in the alphabet "universal code-fetching table";
(6) when the second part of the Chinese character has only one coding pen or pen group:
If the first part of the Chinese character has only one or two coding pens or pen groups, the character is treated as a single character;
If the first part of the Chinese character has only three or four coding pens or pen groups:
the Latin letter at the first code position: the first and second coding pens or pen groups of the Chinese character determine a two-digit code, and the Latin letter at the first code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
the Latin letter at the second code position: the second-to-last and last coding pens or pen groups of the first part determine a two-digit code, and the Latin letter at the second code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
the Latin letter at the third code position: the second-to-last and last coding pens or pen groups of the Chinese character determine a two-digit code, and the Latin letter at the third code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
the Latin letter at the fourth code position is determined through the alphabet "universal code-fetching table" from the two-digit code formed from the last stroke of the Chinese character and the state of that last stroke, thereby filling the vacancy;
If the first part of the Chinese character has five or more coding pens or pen groups:
the Latin letter at the first code position: the first and second coding pens or pen groups of the Chinese character determine a two-digit code, and the Latin letter at the first code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
the Latin letter at the second code position: the third and fourth coding pens or pen groups of the Chinese character determine a two-digit code, and the Latin letter at the second code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
the Latin letter at the third code position: the second-to-last and last coding pens or pen groups of the first part determine a two-digit code, and the Latin letter at the third code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
the Latin letter at the fourth code position: the second-to-last and last coding pens or pen groups of the Chinese character determine a two-digit code, and the Latin letter at the fourth code position is obtained by looking up this two-digit code in the alphabet "universal code-fetching table";
For every Chinese character with more than 3 parts, the parts are merged successively starting from the first part; once the number of coding pens or coding pen groups contained in the merged part reaches 3, merging stops. The part merged up to that point is the first coding section and the remainder is the second coding section; the character is then encoded as a two-part character.
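The merging rule for characters with more than 3 parts can be sketched as below. Each part is assumed to be a list of coding-pen (or pen-group) digits in writing order, and the function name is hypothetical. Keeping the final part out of the merge, so that the second section is never empty, is an assumption added here and is not stated in the claim.

```python
def split_into_two_sections(parts: list) -> tuple:
    """Merge leading parts until the merged section holds at least 3 units.

    Returns (first_section, second_section) as flat lists of digits.
    """
    merged = []
    i = 0
    # Assumption: always leave the final part for the second section.
    while i < len(parts) - 1 and len(merged) < 3:
        merged.extend(parts[i])
        i += 1
    rest = [digit for part in parts[i:] for digit in part]
    return merged, rest


first, second = split_into_two_sections([[1], [2, 3], [4, 5], [1, 2]])
print(first, second)
```

The two returned sections can then be fed to the two-part encoding rules as the first part and the second part.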
5. general code fetch input method of Chinese character according to claim 3, is characterized in that, provides a kind of sound shape of general code fetch input method of Chinese character to mix version:
Phoneme symbol in the Chinese phonetic alphabet and circumflex are referred to as to phonogram, and circumflex is last phonogram;
Sound shape is mixed the assignment rule that version uses phonetic phoneme symbol and circumflex: phoneme symbol B, and P, M, F, A, the high and level tone symbol is corresponding number 1 all; Phoneme symbol D, T, N, L, O rising tone symbol is corresponding number 2 all; Phoneme symbol G, K.H, J, Q, X, NG, E, upper sound symbol is corresponding number 3 all; Phoneme symbol ZH, CH, SH, R, I, Y and falling tone symbol be corresponding number 4 all; Phoneme symbol Z, C, S, U, V and softly and ancient entering tone all corresponding digital 5;
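The assignment rule above can be sketched as a small lookup, together with a splitter that treats NG, ZH, CH and SH as single phonemes. The names, the English tone labels and the greedy two-letter matching are assumptions made for illustration; symbols that the rule does not list (for example W) are rejected rather than guessed.

```python
# Phoneme-symbol digits, transcribed from the assignment rule.
PHONEME_DIGIT = {}
for _digit, _symbols in {
    1: "B P M F A",
    2: "D T N L O",
    3: "G K H J Q X NG E",
    4: "ZH CH SH R I Y",
    5: "Z C S U V",
}.items():
    for _symbol in _symbols.split():
        PHONEME_DIGIT[_symbol] = _digit

# Tone digits; the English tone labels are naming assumptions.
TONE_DIGIT = {"high-level": 1, "rising": 2, "falling-rising": 3,
              "falling": 4, "neutral": 5, "entering": 5}


def split_phonemes(pinyin: str) -> list:
    """Split one toneless pinyin syllable; NG, ZH, CH, SH count as one phoneme."""
    s = pinyin.upper()
    out, i = [], 0
    while i < len(s):
        if s[i:i + 2] in ("NG", "ZH", "CH", "SH"):
            out.append(s[i:i + 2])
            i += 2
        else:
            out.append(s[i])
            i += 1
    return out


def phonetic_digits(pinyin: str, tone: str) -> list:
    """Digits of every phonetic symbol: the phonemes first, the tone mark last."""
    digits = []
    for phoneme in split_phonemes(pinyin):
        if phoneme not in PHONEME_DIGIT:
            raise ValueError(f"no digit is assigned to {phoneme!r} in the rule")
        digits.append(PHONEME_DIGIT[phoneme])
    digits.append(TONE_DIGIT[tone])
    return digits


print(phonetic_digits("mie", "high-level"))
```

For the syllable "mie" in the high level tone the digits are 1, 4, 3, 1, which pair up as 14 and 31, the same two numbers used in the first worked example.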
Use the sound shape of general code fetch input method of Chinese character to mix the method that version is encoded to Chinese character:
The Latin alphabet on the first code bit: the first coding pen or pen group by Chinese character are determined the two digits code with the second coding pen or pen group, by determined two digits code, by looking into alphabet " general code fetch table ", obtain the Latin alphabet on the first code bit;
The Latin alphabet on the second code bit: coding pen second from the bottom or pen group with Chinese character are determined the two digits code with coding pen last or pen group, by determined two digits code, by looking into alphabet " general code fetch table ", obtain the Latin alphabet on the second code bit;
If the first coding pen of Chinese character or pen group and the second coding pen or pen group are organized while duplicating with the coding pen second from the bottom of Chinese character or pen group and coding pen last or pen respectively, use the first letter of the Chinese phonetic alphabet of first stroke of described Chinese character as the Latin alphabet on the second code bit;
If the Chinese phonetic alphabet of Chinese character is comprised of 3 phonograms: the Latin alphabet on the third yard position, the first combination by the Chinese phonetic alphabet of described Chinese character, determine the two digits code, by determined two digits code, by looking into alphabet " general code fetch table ", obtain the Latin alphabet on the third yard position; The Latin alphabet on the 4th code bit: be high and level tone, rising tone, upper sound, falling tone, softly and the combination of ancient entering tone by last phoneme symbol of the Chinese phonetic alphabet of described Chinese character and circumflex, determine the two digits code, by determined two digits code, by looking into alphabet " general code fetch table ", obtain the Latin alphabet on the 4th code bit; The first combination of the Chinese phonetic alphabet of described Chinese character comprises first phoneme and second phoneme of the Chinese phonetic alphabet, generally refers to first English alphabet and second English alphabet of the Chinese phonetic alphabet, but NG, ZH, CH, SH all regard a phoneme as; Last phoneme of the Chinese phonetic alphabet of described Chinese character generally refers to last English alphabet of the Chinese phonetic alphabet, but NG, ZH, CH, SH all regard a phoneme as;
If the pinyin of the Chinese character consists of 4 phonograms: the Latin letter on the third code bit: the first combination of the character's pinyin determines a two-digit code, and the Latin letter on the third code bit is obtained by looking up that code in the letter version of the "general code fetch table"; the Latin letter on the fourth code bit: the combination of the third phoneme symbol of the character's pinyin and its last phonogram, namely the tone mark, determines a two-digit code, and the Latin letter on the fourth code bit is obtained by looking up that code in the letter version of the "general code fetch table"; NG, ZH, CH and SH each count as one phoneme;
If the pinyin of the Chinese character consists of 5 phonograms: the Latin letter on the third code bit: the first combination of the character's pinyin determines a two-digit code, and the Latin letter on the third code bit is obtained by looking up that code in the letter version of the "general code fetch table"; the Latin letter on the fourth code bit: the combination of the third phoneme and the last phoneme symbol of the character's pinyin determines a two-digit code, and the Latin letter on the fourth code bit is obtained by looking up that code in the letter version of the "general code fetch table"; NG, ZH, CH and SH each count as one phoneme;
If the pinyin of the Chinese character consists of only 2 phonograms: the Latin letter on the third code bit is determined by looking up, in the letter version of the "general code fetch table", the ordered pair of numbers corresponding to the two phonograms, namely one phoneme symbol and one tone mark; the Latin letter on the fourth code bit is the first Latin letter of the pronunciation of the character's first stroke;
No Chinese character has only one phonogram; every character has at least one phoneme and one tone.
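The pinyin rules of the sound-shape mixed version can be sketched in code as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the function names, the `tone1` to `tone6` labels standing in for the tone marks, and the `FIRST_STROKE_INITIAL` marker are all hypothetical. Because the "general code fetch table" is not reproduced in this text, the sketch stops at naming the pair of symbols from which each letter would be looked up.

```python
# Hypothetical sketch: which symbols feed the third and fourth code bits.
# NG, ZH, CH and SH each count as one phoneme, as the claim requires.
DIGRAPHS = ("zh", "ch", "sh", "ng")

# Marker meaning "use the first letter of the pronunciation of the first stroke".
FIRST_STROKE_INITIAL = "FIRST_STROKE_INITIAL"


def split_phonemes(pinyin):
    """Split one toneless pinyin syllable into phoneme symbols."""
    s = pinyin.lower()
    phonemes = []
    i = 0
    while i < len(s):
        if s[i:i + 2] in DIGRAPHS:
            phonemes.append(s[i:i + 2])
            i += 2
        else:
            phonemes.append(s[i])
            i += 1
    return phonemes


def sound_code_sources(pinyin, tone):
    """Return (third_bit_source, fourth_bit_source) for one syllable.

    `tone` is an integer label chosen for this sketch (an assumption).
    The phonogram count is the number of phonemes plus one tone mark.
    """
    phonemes = split_phonemes(pinyin)
    tone_mark = f"tone{tone}"
    count = len(phonemes) + 1
    if count == 2:
        # One phoneme and one tone mark; fourth bit comes from the first stroke.
        return (phonemes[0], tone_mark), FIRST_STROKE_INITIAL
    if count == 3:
        # First combination, then last phoneme combined with the tone mark.
        return (phonemes[0], phonemes[1]), (phonemes[-1], tone_mark)
    if count == 4:
        # First combination, then third phoneme combined with the tone mark.
        return (phonemes[0], phonemes[1]), (phonemes[2], tone_mark)
    if count == 5:
        # First combination, then third and last phonemes; tone mark unused.
        return (phonemes[0], phonemes[1]), (phonemes[2], phonemes[-1])
    raise ValueError(f"unsupported phonogram count: {count}")
```

For example, `sound_code_sources("shan", 1)` treats SH as one phoneme, so the syllable has four phonograms and the fourth code bit is derived from the third phoneme together with the tone mark.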
6. The general code fetch input method of Chinese characters according to claim 4, characterized in that a digital holographic version of the general code fetch input method of Chinese characters is provided, namely a Chinese character input method for the numeric keypad of a mobile phone,
The numeric keypad of a mobile phone has ten digits: 0, 1, 2, 3, 4, 5, 6, 7, 8, 9; a Chinese character is determined by six digits on six code bits:
The digits on the first, second, third and fourth code bits are determined by four corresponding two-digit codes. The four two-digit codes are determined as follows: (1) for a Chinese character with 5 or more strokes, the four two-digit codes are determined in the same way as in the pure shape code version of the general code fetch input method of Chinese characters, and the digits on the first, second, third and fourth code bits are then obtained by looking up those codes in the digital version of the "general code fetch table"; where the pure shape code version would fill a vacancy with the first letter of the pinyin of the name of the character's first stroke, that letter is replaced by the number corresponding to the name of the first stroke; (2) for a Chinese character with 4 strokes, the numbers corresponding to its strokes, in order, are the digits on the first, second, third and fourth code bits; (3) for a Chinese character with fewer than 4 strokes, the numbers corresponding to its strokes, in order, are first taken as the digits on the code bits; if these do not fill 4 code bits, the numbers corresponding to the states of the strokes, in order, fill the vacancies; if there are still fewer than four digits, the remainder is filled with the digit 0;
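Cases (2) and (3) above amount to a simple fill rule, which the following sketch illustrates. It is an assumption-laden example rather than the claimed implementation: the function name is hypothetical, and the stroke numbers and state numbers are taken as inputs because their definitions appear in earlier claims and are not reproduced here.

```python
def shape_digits_short(stroke_digits, state_digits):
    """Four shape digits for a character with at most 4 strokes.

    stroke_digits: the number of each stroke, in writing order.
    state_digits:  the number of each stroke's state, in the same order.
    Both sequences are supplied by the caller (hypothetical inputs).
    """
    if not 1 <= len(stroke_digits) <= 4:
        raise ValueError("this rule only covers characters with 1 to 4 strokes")
    digits = list(stroke_digits)
    # Fill vacancies with state numbers, in stroke order.
    for state in state_digits:
        if len(digits) == 4:
            break
        digits.append(state)
    # Pad whatever is still missing with the digit 0.
    digits.extend([0] * (4 - len(digits)))
    return digits
```

With made-up inputs, a one-stroke character whose stroke number is 5 and whose state number is 2 would yield `[5, 2, 0, 0]`: the stroke, then its state, then zero padding.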
If the pinyin of the Chinese character consists of 3 phonograms: the digit on the fifth code bit: the number corresponding to the first phoneme symbol is used directly as the digit on the fifth code bit; the digit on the sixth code bit: the combination of the last phoneme of the character's pinyin and its tone (high level tone, rising tone, falling-rising tone, falling tone, neutral tone or ancient entering tone) determines a two-digit code, and the digit on the sixth code bit is obtained by looking up that code in the digital version of the "general code fetch table"; the last phoneme of the pinyin is generally its last letter, except that NG, ZH, CH and SH each count as one phoneme;
If the pinyin of the Chinese character consists of 4 phonograms, that is, three of the non-zero initial, the medial, the main vowel and the ending of the final, plus the tone mark: the digit on the fifth code bit: the first combination of the character's pinyin determines a two-digit code, and the digit on the fifth code bit is obtained by looking up that code in the digital version of the "general code fetch table"; the first combination of the pinyin consists of its first and second phonemes, which are generally the first and second letters of the pinyin, except that NG, ZH, CH and SH each count as one phoneme; the digit on the sixth code bit: the third phoneme symbol of the character's pinyin and its last phonogram, namely the tone mark, determine a two-digit code, and the digit on the sixth code bit is obtained by looking up that code in the digital version of the "general code fetch table"; NG, ZH, CH and SH each count as one phoneme symbol;
If the pinyin of the Chinese character consists of 5 phonograms: the digit on the fifth code bit: the first combination of the character's pinyin determines a two-digit code, and the digit on the fifth code bit is obtained by looking up that code in the digital version of the "general code fetch table"; the digit on the sixth code bit: the combination of the third phoneme and the last phoneme symbol of the character's pinyin determines a two-digit code, and the digit on the sixth code bit is obtained by looking up that code in the digital version of the "general code fetch table"; NG, ZH, CH and SH each count as one phoneme;
For a Chinese character whose pinyin contains 5 phonograms, the last phonogram, namely the tone mark, does not take part in the coding;
If the pinyin of the Chinese character consists of only 2 phonograms: the digit on the fifth code bit: the number corresponding to the phoneme is used directly as the digit on the fifth code bit; the digit on the sixth code bit: the number corresponding to the tone is used directly as the digit on the sixth code bit;
Digital version of the "general code fetch table":
[Table supplied as an image in the original publication (Figure 2013104114100100001DEST_PATH_IMAGE004); not reproduced in this text.]
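The selection of the fifth and sixth digits in the digital holographic version can be sketched as below. This is only an illustration under stated assumptions: the function names and the `tone1` to `tone6` labels are hypothetical, and the two lookups (`number_of` for a single symbol's number, `pair_digit` for a table lookup on a pair of symbols) are passed in by the caller because the digital version of the table is not reproduced in this text.

```python
# NG, ZH, CH and SH each count as one phoneme, as the claim requires.
DIGRAPHS = ("zh", "ch", "sh", "ng")


def split_phonemes(pinyin):
    """Split one toneless pinyin syllable into phoneme symbols."""
    s = pinyin.lower()
    phonemes = []
    i = 0
    while i < len(s):
        step = 2 if s[i:i + 2] in DIGRAPHS else 1
        phonemes.append(s[i:i + step])
        i += step
    return phonemes


def fifth_and_sixth_digits(pinyin, tone, number_of, pair_digit):
    """Return (fifth, sixth) using caller-supplied lookups.

    number_of(symbol)  -> number of a single phoneme or tone (hypothetical).
    pair_digit(a, b)   -> digit found in the table for the pair (hypothetical).
    `tone` is an integer label chosen for this sketch (an assumption).
    """
    phonemes = split_phonemes(pinyin)
    tone_mark = f"tone{tone}"
    count = len(phonemes) + 1  # phonemes plus one tone mark
    if count == 2:
        # Both digits are taken directly from the symbols' own numbers.
        return number_of(phonemes[0]), number_of(tone_mark)
    if count == 3:
        # First phoneme directly; last phoneme combined with the tone.
        return number_of(phonemes[0]), pair_digit(phonemes[-1], tone_mark)
    if count == 4:
        # First combination; third phoneme combined with the tone mark.
        return pair_digit(phonemes[0], phonemes[1]), pair_digit(phonemes[2], tone_mark)
    if count == 5:
        # First combination; third and last phonemes. The tone mark is not coded.
        return pair_digit(phonemes[0], phonemes[1]), pair_digit(phonemes[2], phonemes[-1])
    raise ValueError(f"unsupported phonogram count: {count}")
```

Passing the lookups as parameters keeps the branching logic testable without inventing table contents; a real table, once transcribed, could be wrapped in two small functions and supplied here.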
CN201310411410.0A 2013-09-11 2013-09-11 General code fetch input method of Chinese character Active CN103440047B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310411410.0A CN103440047B (en) 2013-09-11 2013-09-11 General code fetch input method of Chinese character

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310411410.0A CN103440047B (en) 2013-09-11 2013-09-11 General code fetch input method of Chinese character

Publications (2)

Publication Number Publication Date
CN103440047A true CN103440047A (en) 2013-12-11
CN103440047B CN103440047B (en) 2016-04-13

Family

ID=49693741

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310411410.0A Active CN103440047B (en) 2013-09-11 2013-09-11 General code fetch input method of Chinese character

Country Status (1)

Country Link
CN (1) CN103440047B (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1080748A (en) * 1992-06-30 1994-01-12 吴桦 Simplified and traditional body sound shape characteristic code input method for Chinese character and keyboard thereof
CN1173661A (en) * 1997-05-28 1998-02-18 李忠良 Computer input method of Yuanma codes Chinese characters
CN1240957A (en) * 1998-07-08 2000-01-12 邱国权 Chinese-character typing method based on complement between radical-stroke order and use frequency-phonetic letter
CN1512305A (en) * 2002-12-31 2004-07-14 中国科学院测量与地球物理研究所 Sound-shape digital Chinese character input method
CN1645303A (en) * 2005-01-15 2005-07-27 李建学 Azimuth six-code Chinese inputting method
CN102073382A (en) * 2009-11-19 2011-05-25 王治阳 Stroke, main and auxiliary radical input method
WO2012081527A1 (en) * 2010-12-16 2012-06-21 Satake Yasuhiko Input method for chinese language electronic devices

Also Published As

Publication number Publication date
CN103440047B (en) 2016-04-13

Similar Documents

Publication Publication Date Title
US6292768B1 (en) Method for converting non-phonetic characters into surrogate words for inputting into a computer
CN102682022B (en) Implementation method for Chinese character holographic movable character library
US9965045B2 (en) Chinese input method using pinyin plus tones
CN105045410B (en) A kind of formalization phonetic and Chinese character is corresponding knows method for distinguishing
CN102830809A (en) Chinese character coding input method
CN103440047B (en) General code fetch input method of Chinese character
CN100533359C (en) Oracle spelling and component disintegration and input method
CN102053955B (en) Method and system for inputting symbols
CN101551711A (en) Chinese character coding input method based on structure and primitive
CN203825832U (en) Chinese pinyin spelling combination teaching aid for children
CN101952790B (en) Method for inputting chinese characters apapting for chinese teaching
Campbell William Stubbs (1825–1901)
CN105278697B (en) Combined double-spelling class major-minor code Chinese character, word coded input method and its keyboard
CN106325540A (en) Simplified input method of northeast Yunnan sub-dialect Miao language and application of simplified input method
CN102439648A (en) Method for learning chinese pronunciation using korean spelling and input device thereof
CN1542591A (en) Chinese spelling simulation input method
Lee The Korean alphabet: an optimal featural system with graphical ingenuity
CN1050206C (en) Regular Chinese phonetic alphabet Chinese character input method
빼니를 et al. HOW WERE THE GRAPHEMES OF HUNMIN CHŎNG'ŮM DESIGNED?
CN105892704B (en) The first sum of phonemic alphabet phonetic input method
Werner Indigenous Language Revitalization using Virtual Reality
CN104133556B (en) Double-stroke type main and auxiliary code letter type radical dictionary and sonic dictionary Chinese character coding input method and keyboard adopting method
Pournader Preliminary proposal to encode the Book Pahlavi script in the Unicode Standard
CN109407856A (en) The phase code mosaic method for describing Chinese character using sound and shape characteristic and the phase code inputting method based on it
CA2270956A1 (en) Method for converting non-phonetic characters into surrogate words for inputting into a computer

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant