CN1074296A - A kind of Chinese phonetic phoneme method of Chinese character coding - Google Patents

A kind of Chinese phonetic phoneme method of Chinese character coding Download PDF

Info

Publication number
CN1074296A
CN1074296A CN 92110056 CN92110056A CN1074296A CN 1074296 A CN1074296 A CN 1074296A CN 92110056 CN92110056 CN 92110056 CN 92110056 A CN92110056 A CN 92110056A CN 1074296 A CN1074296 A CN 1074296A
Authority
CN
China
Prior art keywords
sound
chinese character
chinese
phoneme
parts
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 92110056
Other languages
Chinese (zh)
Inventor
江荻
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN 92110056 priority Critical patent/CN1074296A/en
Publication of CN1074296A publication Critical patent/CN1074296A/en
Pending legal-status Critical Current

Links

Images

Abstract

The Chinese phonetic phoneme coding is used sound position and rhythm position as code symbols, be different structure phoneme forms with same sound position or the cutting of rhythm position again.Chinese character segmentation was got little mode in two minutes or two minutes by font and is carried out.The high frequency word adopts no repeated code, all-key mode to import, and needn't remember high frequency word kind under the dynamical fashion and implements touch system input.Two yards are hanged down code word with trigram and are not also had repeated code.Word is coded in and is the all-key mode in any case.

Description

A kind of Chinese phonetic phoneme method of Chinese character coding
The present invention relates to a kind of sound-form Chinese language words input technology and keyboard Designing thereof.
Existing sound sign indicating number design all with the voice of Chinese character by words or Hanzi component performance as code symbols, used voice representation is the phonetic plan of formulation in 1958.The core of phonetic plan is that method (" phonetic plan " first) represented in the phoneme phonetic alphabet, and its theoretical foundation is Western Modern Phonology theory." phonetic plan " absorbed Chinese traditional initial consonant, simple or compound vowel of a Chinese syllable and tone notion simultaneously, and set up the female representation (" phonetic plan " second and third part) of monogram formula sound with the phoneme letter.These two kinds of representations are exactly first representation of Chinese keyboard input Pinyin sound sign indicating number.Phonetic sound code plan adopts Latin alphabet symbol, and number is consistent with international modular keypad key-position symbol, has certain popular basis.Weak point is: to the west of phonetic aspect of a dialect potential theory lack han nationality cognitive psychological basis for the phoneme phonetic alphabet spelling or the symbol unit of pure phonetic code plan on basis, the phonetic unit that forms in its phonemic alphabet unit and the han nationality language performance in several thousand is inconsistent, and this causes the obstacle that is difficult to go beyond just for easy, easy-to-use, the easy note of scheme.Though and the initial consonant and simple or compound vowel of a Chinese syllable Two bors d's oeuveres, simplicity or the three phonetic code plans that constitute with the phonetic alphabet array configuration have certain practicality, lack supporting theory.Initial consonant and simple or compound vowel of a Chinese syllable even are not any one-level linguistic units in the Modern Linguistics.This gives from now on, and formulation, standard application and the theoretical work of national standard keyboard representation method all cause difficulty.
Analyze font and be excavation, but this encoding scheme is paid attention to more excavating as the people of main body to the psychological perception as the Chinese character object of object graphical symbol feature to the Chinese character objective attribute.Cognition of Chinese characters psychology has following characteristics:
A: ambiguity.Chinese and overseas scholars studies have shown that the consciousness of alphabetical figure and Chinese character image, has the cognitive difference of local feature and gestalt feature in the font identifying.People often discern according to the fuzzy impression of the cardinal principle profile (gestalt feature) that obtains from font.With regard to the Chinese character opinion, the gestalt feature of reflection profiles such as head and the tail stroke, housing, radical had obtained extracting before local feature (as intermediate member and meticulous stroke) is recognized clearly.In the reading, Chinese character is in the big character string sight of statement, and vision scanning rapidly all proves absolutely the fuzzy diagnosis process of people's psychology to font with psychological perception immediately.
B: bisectability.This character is seldom directly fully research always, but the correlative study achievement is very plentiful and substantial.The important foundation of bisectability is the polymerism notion, and Chinese character is based on phonogram, and according to the study, in 7000 Chinese characters of " contemporary Chinese common word table " that country issues, the word of ideophone structure is totally 5636 words, accounts for the last 80%.Phonogram is made of pictograph and symbol, and various pictographs or symbol have common feature, and pictograph and two polymeric type of symbol formation Chinese character in people's cognitive psychological close with two when therefore discerning Chinese character to a great extent and two minutes recognition mode carries out.In addition, dialectical sight of philosophical binary and bisectability are in logic all supported two fens ideas of Chinese character pattern.
C: habituation.Habituation is a kind of experience, is the reflection in practice of ambiguity and bisectability.As among the people debating of unisonance surname analysed: bow-length-open; Upright-early-chapter; Gu-the moon-Hu; Speech-the noon-permitted.The common saying source word have " people's words for the letter; Sheep is greatly beautiful; Shellfish is weary of demoting; Three people are many; Upright woman is the concubine " or the like.Jargon or black language have " soldier is the soldier " or the like.
The objective of the invention is to avoid above-mentioned weak point of the prior art and provide on a kind of general standard keyboard the method for input Chinese character by words satisfy people's note tone deaf lose into requirement.
The present invention sets up according to sound Phonology theory and method thereof, mainly is Chinese character by words voice (syllable) are divided into sound position and rhythm position (and positioning), in the coding sound position and rhythm position is used as code symbols.
One, supporting theory
Chinese phonetic phonology is based on to last with synchronic language system existing objective linguistic unit and proposes.The Chinese phonemic system must be based upon on the cognitive basis of han nationality to the linguistic unit conclusion, truly reflects objective reality, sentience and identifiability that phoneme has on the han nationality psychology of language.We have proposed three archiphoneme classes of Chinese for this reason: sound position, rhythm position and positioning.List sound position and rhythm position below, and represent (also available other sign format is represented) with the current Latin alphabet.
Sound bit sign (in the bracket is the International Phonetic Symbols)
b[p] p[p'] d[t] t[t'] g[k] k[k']
z[ts] c[ts'] zh[t
Figure 921100566_IMG3
s] ch[t
Figure 921100566_IMG4
s'] j[t
Figure 921100566_IMG5
] q[t ']
f[f] s[s] sh[
Figure 921100566_IMG7
s] r[z
Figure 921100566_IMG8
] x[
Figure 921100566_IMG9
] h[x]
m[m] n[n]
l[l]
Rhythm bit sign (in the bracket is the International Phonetic Symbols)
i[i] u[u] ü[y]
a[a] ia[ia] ua[ua]
(o[o]) uo[uo]
e[r] ie[iε] üe[y]
-i[
Figure 921100566_IMG10
]/[
Figure 921100566_IMG11
]
er[
Figure 921100566_IMG12
]
ai[ai] uai[uai]
ei[ei] uei[uei]
ao[au] iao[iau]
ou[ou] iou[iou]
an[an] ian[ian] uan[uan] üan[yan]
en[
Figure 921100566_IMG13
n] in[in] uen[u n] ün[yn]
ang[aη] iang[iaη] uang[uaη]
eng[
Figure 921100566_IMG15
η] ing[iη] ueng[u
Figure 921100566_IMG16
η]
(ong[uη] iong[yη]
Three, symbol design
The Chinese phonetic phoneme has specific syntagmatic, sees the following form:
Figure 921100566_IMG17
In order to increase the distinctiveness code element, discrete homonymous phenomena and balanced key position charge capacity can be divided into different structure phoneme forms with same sound position or rhythm position according to sound phoneme syntagmatic: first phoneme and the position and for phoneme of changing voice.The position employing of changing voice adds before and after first phoneme to be represented special meeting, and originally is coded in first phoneme front and back and adds "-" expression.
1. play all and u or u first reading sound form and ü or ü play the sound position of first reading sound form rhythm bit pattern, get change of voice position as code element, otherwise then get unit sound position as code element.
2. play all and non-u or u first reading sound form and non-ü or ü play zero sound position (syllable that rhythm position form is promptly only arranged) of first reading sound form rhythm bit pattern, get the initial Latin alphabet symbol in rhythm position as Dai Shengwei.
3. Latin alphabet W is got as Dai Shengwei in all zero sound positions of playing the bit pattern of first reading sound form rhythm with u or u.
4. Latin alphabet symbol y is got as Dai Shengwei in all zero sound positions of playing the bit pattern of first reading sound form rhythm with ü or ü.This rule that will satisfy article one simultaneously.
5. change rhythm position is got as code element in all i rhythm positions of making up with sibilus sound position (being z, c, s, zh, ch, sh, r, i, q, x).This coding is represented with-i form.
6. itself is got as changing voice a form code element in the sound position (as mandarin m, n, GuangZhou native language η etc.) of several special self-syllables.
Therefore, existing unit sound position, change of voice position and the Dai Shengwei as code element of this coding amounts to 48, and the combination rule of sound position and rhythm position sees the following form:
Figure 921100566_IMG18
Three, key position design
Consider the exist actually of the phonetic aspect of a dialect in the Chinese, this coding is designed to mandarin scheme, north and south general scheme and Guangdong dialect scheme and other dialect scheme by phonetic aspect of a dialect difference with the phoneme sign indicating number.Keyboard Designing is seen accompanying drawing, and Fig. 1 is a sound phoneme key position distribution plan (general-purpose version); Fig. 2 is sound phoneme key position distribution plan (north version).
In the mandarin scheme, the rhythm position that the present invention will have complementary relationship dexterously is arranged on the same key position, as ong and ueng, ia and ua, ve and uei etc., vision is accorded with the close rhythm position of shape to be arranged on the same key position, as vn and un, van and uan etc. are with close being placed on the same key position of pronunciation, as o and uo ,-m and-n etc.Also first of cerebral and first of corresponding non-cerebral are arranged on the same key position simultaneously, perhaps first of cerebral change of voice position and non-cerebral come on the same key position, both have been convenient to remember, and easily are connected with general-purpose version again, as sh and s-, ch and c-, zh and z-, s and sh-, c and ch-, z and zh-.In addition, this coding is also abided by the newest fruits of operator's keystroke rule research, and high frequency and low frequency key position are transferred to optimum condition.
Four, Hanzi component class
Analyze from font, all Chinese characters constitute by parts, and its structure is as follows:
Figure 921100566_IMG19
As parts, its unit type is independent body type Chinese character with stroke:
Title horizontal stroke (carrying) perpendicular (perpendicular colluding) is cast aside and is pressed down (point) folding
Form one
Figure 921100566_IMG20
Shu 亅 Pie
Figure 921100566_IMG21
Dian second
Figure 921100566_IMG22
Independent body type Chinese character as first stroke of a Chinese character parts, finishes stroke as an end parts with initial stroke.
Stroke and only at independent body type Chinese characters kind as parts.Parts "-" with ambiguity independent body type Chinese characters kind be decided to be parts "-" (horizontal stroke, hen), build Chinese characters kind not merely be decided to be "-" (one, yi).
But this coding with character formation component and non-word read component as the code fetch information source, but give the pronunciation except that indivedual particular component, other can not all not become the information source parts by read component.
Five, Chinese character segmentation principle
1. cardinal rule (two fens principles)
All Chinese characters all are cut into two parts by font.First stroke of a Chinese character stroke place parts are first stroke of a Chinese character parts, and an end stroke place parts are an end parts.
2. become word principle (or readable principle)
But each parts that is syncopated as will become word or one-tenth read component.
3. get little principle
But the parts that cut out are not if become word or read component, then cut out the next stage first stroke of a Chinese character or an end place parts are selected parts.
4. residue principle
Remove indivedual exceptions, but the cutting remainder also should become word or read component.
Six, single character code
Individual character with its first stroke of a Chinese character and end a parts and individual character itself as the code fetch information source, and with the sound position of the sound of parts sound and individual character as code element.Code fetch is in proper order: parts sound sound position+Chinese character sound position+Chinese character rhythm position, first stroke of a Chinese character parts sound sound position+end.
For example:
Chinese character first stroke of a Chinese character end first stroke of a Chinese character portion end portion's Chinese character Chinese character character code
Rhythm position, sound position, part sound position, parts components sound position
Sigh mouth K Y T an KYTJ again
Raise Rolling
Figure 921100566_IMG24
T Y Y ang TYYH
Green Wang Shi W S B i WSBI
Wide Dian Pie N P K-uang NPHM
Seven, word coding
As code element, the length of word is defined as the longest 4 Chinese characters to word with the sound position of the rising of the first Chinese character and end position Chinese character, an end parts sound.Code fetch is: a lead-in first stroke of a Chinese character parts sound sound position+parts sound sound position, a lead-in end+last word first stroke of a Chinese character parts sound sound position+parts sound sound position, last word end.
For example:
3 positions, 2 positions, 1 position, phrase parts 1 parts 4 positions of 2 parts, 3 parts, 4 speech sign indicating numbers
Technology is Lv second H H C Y HHCY one by one
The phonetic Rolling day T B L R TBLR of existing side by side
Beautify the big Ren seven Y D R B YDRB of sheep
Handle The-Fan and foretell W M W L WMWL in the king
Chinese Shu Shu Dian Dian S-S-N N UUNN
The present invention has following advantage compared to existing technology:
1. adopt Chinese character initial consonant phonology to set up encoding scheme, meet han nationality speech perception and cognition of Chinese characters psychology.
The Chinese character segmentation rule succinctly, clearly unified, needn't remember the code element of font parts correspondence especially, thereby dialectical font code element (the no matter less or many) memory problems that solved.
3. according to the structural linguistics method, under the prerequisite of respecting objective voice phenomenon rule, limited sound phoneme form quantity is expanded greatly, open up a new road for distinguishing repeated code, and the form of sound phoneme cuts apart and has strict rule, help learning and memory.
4. keyboard Designing is ingenious clear, and high keystroke rate key position and low keystroke key position equiblibrium mass distribution meet the ergonomics principle.
5. originally be encoded to " high frequency all-key " design (being that the high frequency word is imported in the all-key mode), for realizing that memoryless voice touch system has established solid foundation under the dynamical fashion.Its medium-high frequency word adopts no repeated code all-key mode to import, and two yards are not had repeated code with the trigram low-frequency word yet, and word is coded in and is the all-key mode in any case.

Claims (4)

1, a kind of Chinese phonetic phoneme method of Chinese character coding, its principal character is that Chinese-character word-phrase voice (syllable) are divided into sound position and rhythm position (and positioning), in the coding sound position and rhythm position are used as code symbols, with same sound position or the cutting of rhythm position is different structure phoneme forms: first phoneme, the position of changing voice, for phoneme, be but that the cutting under different condition of same phoneme is two or more form of changing voice, to a sound position and rhythm position (and positioning) form cutting of can further changing voice at many levels.
2, according to the described coding method of claim 1, it is characterized in that by phonetic aspect of a dialect difference with the phoneme sign indicating number be designed to that mandarin, north and south are general, Cantonise dialect and other dialect scheme.Keyboard Designing is as follows:
Sound phoneme key position distribution plan (general-purpose version)
Sound phoneme key position distribution plan (north version)
Figure 921100566_IMG2
3,, it is characterized in that the method for Chinese character segmentation is according to the described coding method of claim 1:
(1) all Chinese characters all are divided into the first stroke of a Chinese character, an end parts by font two, according to two minutes principles, readable principles, get little principle and residue principle cutting Chinese character.
(2) independent body type Chinese character is an initial part with initial stroke, and finishing stroke is an end parts.
4,, it is characterized in that the coded system of words is according to the described coding method of claim 1:
(1) single character code: parts sound sound position+Chinese character sound position+Chinese character rhythm position, first stroke of a Chinese character parts sound sound position+end.
(2) word coding: a lead-in first stroke of a Chinese character parts sound sound position+parts sound sound position, a lead-in end+last word first stroke of a Chinese character parts sound sound position+parts sound sound position, last word end.
CN 92110056 1992-08-27 1992-08-27 A kind of Chinese phonetic phoneme method of Chinese character coding Pending CN1074296A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 92110056 CN1074296A (en) 1992-08-27 1992-08-27 A kind of Chinese phonetic phoneme method of Chinese character coding

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 92110056 CN1074296A (en) 1992-08-27 1992-08-27 A kind of Chinese phonetic phoneme method of Chinese character coding

Publications (1)

Publication Number Publication Date
CN1074296A true CN1074296A (en) 1993-07-14

Family

ID=4944583

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 92110056 Pending CN1074296A (en) 1992-08-27 1992-08-27 A kind of Chinese phonetic phoneme method of Chinese character coding

Country Status (1)

Country Link
CN (1) CN1074296A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1099066C (en) * 1995-12-04 2003-01-15 陈文就 Chinese character consonant-vowel computer input method
CN106155347A (en) * 2016-04-29 2016-11-23 武道峰 Chinese phonetic alphabet input scheme and keyboard thereof

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1099066C (en) * 1995-12-04 2003-01-15 陈文就 Chinese character consonant-vowel computer input method
CN106155347A (en) * 2016-04-29 2016-11-23 武道峰 Chinese phonetic alphabet input scheme and keyboard thereof
CN106155347B (en) * 2016-04-29 2019-03-15 武道峰 Chinese Pinyin input keyboard

Similar Documents

Publication Publication Date Title
CN1120436C (en) Speech recognition method and system for identifying isolated non-relative Chinese character
CN1141633C (en) 24-radical sorting encode method for Chinese characters and its keyboard
CN1074296A (en) A kind of Chinese phonetic phoneme method of Chinese character coding
CN100458668C (en) Input method for Chinese character of first pronunciation
CN1475896A (en) Chinese language phonetic transcription simple and quick full spelling input method and its keyboare
CN1148637C (en) Precise alphabetic writing input method via common digit keyboard
CN1054219C (en) Substitution type Chinese phonetic character, word input coding method and keyboard thereof
CN1137432C (en) Kuaiyi code Chinese character input method and keyboard
CN1106146A (en) Computer input method by computer Chinese-character phonology-tone coding and its keyboard
CN1051161C (en) Chinese character inputting technology by numbering and shape codes
CN1207648C (en) '5-3 code' and its keyboard
CN1081773A (en) " many recursion associations " Chinese word encoding
CN1088210C (en) Easy-to-learn Chinese spelling key input scheme and easy-to-learn Chinese character input method
CN1419179A (en) Chinese characters input method according to stroke sequence and keyboard thereof
CN1060277C (en) Chinese characters coding and input method for computer using sentences as input unit
CN1332402A (en) Universal character, word and sentence combination Chinese character input method
CN100361053C (en) Chinese character pole number input method
CN1347024A (en) Natural reading code input method for both simplified and unsimplified Chinese characters
CN1108553C (en) Universal popular voice form Chinese character coding input method
CN1652069A (en) Phonetic digital code input method
CN1354416A (en) Syllable and phonetic code input method
CN86107214A (en) A kind of Chinese word input method and keyboard thereof
CN1112255A (en) First- and last-stroke, first phonetic letter Chinese-character input method and its keyboard
CN101093420A (en) Free mode input method
CN1098525A (en) Profile phonetic compound code

Legal Events

Date Code Title Description
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C06 Publication
PB01 Publication
C01 Deemed withdrawal of patent application (patent law 1993)
WD01 Invention patent application deemed withdrawn after publication