CN101004640A - Storage method for inputting Chinese word into computer by calculation functional encoding phoneme of word - Google Patents

Storage method for inputting Chinese word into computer by calculation functional encoding phoneme of word Download PDF

Info

Publication number
CN101004640A
CN101004640A CNA2007100003103A CN200710000310A CN101004640A CN 101004640 A CN101004640 A CN 101004640A CN A2007100003103 A CNA2007100003103 A CN A2007100003103A CN 200710000310 A CN200710000310 A CN 200710000310A CN 101004640 A CN101004640 A CN 101004640A
Authority
CN
China
Prior art keywords
chinese
words
chinese character
character
speech
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CNA2007100003103A
Other languages
Chinese (zh)
Other versions
CN100474219C (en
Inventor
史颖
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Han Code Rubik''s Cube Technology Co. Ltd.
Original Assignee
史颖
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 史颖 filed Critical 史颖
Priority to CNB2007100003103A priority Critical patent/CN100474219C/en
Priority to PCT/CN2007/000134 priority patent/WO2008086653A1/en
Publication of CN101004640A publication Critical patent/CN101004640A/en
Application granted granted Critical
Publication of CN100474219C publication Critical patent/CN100474219C/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/018Input/output arrangements for oriental characters

Abstract

A method for inputting Chinese character by computer in storage mode includes using fixed four positions of line, column, longitudinal and order English capital letters as unique corresponding identification of Chinese character; enabling to use 26 English capital letters and two ASCII code symbols to directly spell out modern Chinese characters and phrase in reciprocal-corresponding way as per standard stroke attribute and pronunciation attributes of initial consonant and vowel as well as four tones.

Description

But words phoneme calculation function calculation of coding machine Chinese character speech storage method for inputting
Technical field
The invention belongs to the computer Chinese-character speech storage method for inputting of a kind of Chinese mosaic, assembly Chinese word coding.
Background technology
At present, the 21st century world that faces will enter New Economy Era, and world economy, information integral development trend are irreversible, information-basedly implement and will develop into the important scale of weighing a modernization of the country degree.The development of current internet is rapidly unusual, each website spatial database interactive access such as the Internet bank, ecommerce, E-Government, shared data are handled, the polymorphic type digitizer interconnects, military IT-based warfare, global data accessible mutual, handle and storage becomes a megatrend.The breakthrough of Chinese information processing technology has thundering meaning to global character cultural circle more than 20 hundred million populations.Chinese character is the underlying carrier of Chinese information, and its accessible in computing machine and all kinds of digitizer software seems very important alternately.The great attention that the complexity that explosion type increases progressively, information is increasing, unified pattern, simple and safe, the Chinese meaning of one's words of information carrier in computing machine and digital device that how to adapt to the amount of social information from now on accurately understood and reliably problem such as control has caused social science and natural science circle.
Yet China's computer Chinese-character processing mode is but still continued to use the methods of the eighties at present.From computer Chinese-character input aspect, still continue to use the code table corresponded manner.As long as find a kind of code table corresponded manner can produce a kind of Chinese character input method, so that thousands of kinds of Chinese character input methods have been produced.Simultaneously, also produced and only need see word knowledge sign indicating number, need not see yard drawback of the two-layer skin of character learning.The Chinese phonetic alphabet and input method also are difficult to determine because of producing a large amount of repeated codes.Make Chinese character input finding, the content of thinking, being beaten inconsistent fully, be difficult to enter the touch system state, caused the input difficulty of computer Chinese-character.
From computer Chinese-character internal code aspect, directly require computer operating system byte in service must high-order set, add sign, Chinese internal code before and after double byte or the inseparable dislocation of nybble, the Chinese internal code as field or file processing and multiple Chinese internal code and deposit, between various computer operating systems and various database and increasing digital application equipment and software systems, can't confirm mutual and cause mess code, the solution Chinese character for computer is handled problems, the development of keeping pace with the times has been the problem that we must face.Simultaneously, then require information basic coding space to strengthen in order to contain the Chinese words collection, yet, even 16 whole uses of two bytes also have only 60,000 odd encoder spaces, require far apart with nearly 100,000 Chinese character independent identifications, if use 3 bytes or 4 bytes as the essential information unit of sign, not only increase the complicacy of Computer Processing and reduced safe reliability, and, use big space encoder like this to distinguish expression simultaneously and have only some letters waste greatly beyond doubt among 26 letter word collection.Because Chinese character itself exists with many meanings of word multitone state, and the international standard characters collection only provides unique coding, make that the accuracy and the accuracy of Chinese expression can not be satisfactory.The Scheme for the Chinese Phonetic Alphabet should further optimize, perfect, make it uniquely piece together Chinese character, speech uses as literal.Sign indicating number is known in Chinese character for computer input as seen word; See that the sign indicating number character learning is reciprocal and realize capable of touch typing.Chinese internal code two bytes, the whole independent method for expressing of nybble must be transformed the operating system Chinesizing at present.Because conflict with the expansion ASCII character, can not accessiblely enter the parallel processing of western language grid.The minimum addressing of computing machine, processing, storage cell are byte (as ASCII character).Letter word (ASCII character byte) but be the character type computing, two bytes, nybble of expression Chinese character are difficulty relatively, and can not split, misplace, otherwise can produce a string mess code (having potential safety hazard); Because Chinese Character Set is too big, Chinese character is not suitable for the whole independent expression of computing machine multibyte.Image says that just as the car that highway runs, dumb effect is bad certainly to be held together race with two or four.Polyphone is not distinguished, the disconnected speech storage mode of the word that do not break has greatly influenced computing machine the meaning of one's words of Chinese is understood and the digital quantization analyzing and processing.Computer Processing Chinese should only not use as typewriter from now on.Letter word can be combined into the Chinese phonetic alphabet, English, French, Spanish etc., in like manner also the digital quantization partition of Chinese character combination own can be combined into.Chinese character for computer ASCII character mosaic storage mode can be included Chinese information in global information integral system, and can allow computing machine a large amount of Chinese language processing work such as western language is analyzed, retrieved, ordering, computing, control as handling.Show at Chinese, corresponding GB 18030 ISNs of Typeset and Print link, with the Chinese character state of present use without any difference.Also can use at present popular any input method input Chinese character, internal code is changed on the backstage.
No. 96102085.7 the three-dimensional Chinese words mosaic scheme of the computer Chinese-character storage method for inputting patent of invention (CN1052313C) of mapping code has one by one solved above-mentioned problem from pattern, but the capacity of its word Ji Ku is obviously not enough.The present invention one by one on the computer Chinese-character storage method for inputting basis of mapping code, has increased the practicality of word collection storage capacity and use at above-mentioned three-dimensional Chinese words just by improvement.
Summary of the invention
But the purpose of this invention is to provide a kind of words phoneme calculation function calculation of coding machine Chinese character speech storage method for inputting,, realize the Chinese character digitalized problem of computing machine to solve Chinese language computer input code and storage transmission code complete unity.Make the English computer operating system of the accessible complete unique turnover of Chinese information, and participation character type computing, greatly improve security, the reliability of Computer Processing Chinese, and be not subjected to the restriction of operating system and data base management system (DBMS), and improved the capacity of word Ji Ku widely.
But words phoneme calculation function calculation of coding machine Chinese character speech storage method for inputting of the present invention uses four of fixed length---row, column, vertical, preface English capitalization are as the unique corresponding sign of Chinese character.Performing step is as follows:
1), behavior initial consonant wherein, use 23 western language capitalizations to represent, no initial consonant is represented first character that alphabetical IVU uses as punctuation mark and other literal code space as row, corresponding relation is as follows:
A:a?B:b?C:c,ch?D:d?E:e?F:f?G:g?H:h?J:j?K:k?L:l?M:m
N:n?O:o?P:p Q:q?R:r?S:s,sh?T:t?W:w?X:x?Y:y?Z:z,zh
2), wherein classify simple or compound vowel of a Chinese syllable as, use 26 western language capitalizations to represent that corresponding relation is as follows:
A:a?B:an?C:ang?D:ao?E:e?F:ei?G:en?H:eng?I:i?J:ia,ua
K:ian?L:iang,uang?M:iao?N:ie,uai?O:o,er?P:in?Q:ing
R:iong,ong?S:iu?T:ou?U:u,u?V:uan?W:ue,ui?X:un?Y:uo?Z:ai
3), wherein vertical is the four tones of standard Chinese pronunciation, use 26 western language capitalizations to represent, corresponding relation is as follows: ABC DEF represents high and level tone () tone in proper order, GHI JKL represents rising tone (two) tone in proper order, MNO PQRS represents sound (three) tone in proper order, TUV WXYZ represents falling tone (four tones of standard Chinese pronunciation) tone in proper order, wherein is included into S softly; Row, column, vertical three are defined as specific this tone word; Initial consonant is ch, sh, and zh and simple or compound vowel of a Chinese syllable u use DEF, JKL, PQRS, WXYZ tone letter.
4), wherein preface is this tone standard stroke sequence code; Use A-Z totally 26 capitalizations, cut apart with the space between the words;
But aforesaid a kind of words phoneme calculation function calculation of coding machine Chinese character speech storage method for inputting is represented everyday character that each brevity code correspondence four all-keys separately use for calculation process with one, two, three brevity codes.Row one bit representation that an above-mentioned brevity code is this word; Row, column two bit representations that two above-mentioned brevity codes are these words; The row, column that three above-mentioned brevity codes are these words, vertical three bit representations.
But aforesaid a kind of words phoneme calculation function calculation of coding machine Chinese character speech storage method for inputting connects with "-" symbol between each word of Chinese word; XX-XX, XX-XX-XX, XX-XX-XX-XX form are standard words, and they are whole corresponding expressions, not limited by corresponding two brevity code words; All the other speech all can use any brevity code, four represented words of all-key to carry out the combination in any speech.
But aforesaid a kind of words phoneme calculation function calculation of coding machine Chinese character speech storage method for inputting, the back asyllabia in the Chinese, speech use to be easy to the speech difference and to be convenient to reading comprehension and the computing machine meaning of one's words is understood " ' of handling " symbol is connected.
But aforesaid a kind of words phoneme calculation function calculation of coding machine Chinese character speech storage method for inputting, to the input of keyboard Chinese character, storage, Network Transmission is that the plug-in Chinese international 18030 internal code converse routines of using system periphery get final product when needs show and print Hanzi font by the decomposition and combination of the attribute that pronounces.
But aforesaid a kind of words phoneme calculation function calculation of coding machine Chinese character speech storage method for inputting, when the computing machine simultaneous interpretation, by Chinese character separately the front three of standalone feature (initial consonant, simple or compound vowel of a Chinese syllable, the four tones of standard Chinese pronunciation) ascii character Chinese mandarin Chinese character pronunciation is decomposed, computing machine is to front and back words calculation process, the requirement location of four all-key brevity codes of this words and the asyllabia, back and the speech note method meaning of one's words arranged form a complete sentence; The independent front three of Chinese character speech (initial consonant, simple or compound vowel of a Chinese syllable, the four tones of standard Chinese pronunciation) by the pronunciation of first initial consonant and second also the audio group of the 3rd four tones of standard Chinese pronunciation simple or compound vowel of a Chinese syllable be merged out the pronunciation of this word.
But aforesaid a kind of words phoneme calculation function calculation of coding machine Chinese character speech storage method for inputting, a brevity lists wherein: like A not B go out C the good H of D E part F worker G with regard to J mouth K you the N Europe flat P of O of L M to remove Q people R be that the little X of he I W of T of S has the sub-Z of Y.
But aforesaid a kind of words phoneme calculation function calculation of coding machine Chinese character speech storage method for inputting, in the embodiment that two brevity lists wherein can be seen below, the content of relevant two brevity lists.
But aforesaid a kind of words phoneme calculation function calculation of coding machine Chinese character speech storage method for inputting, three brevity codes and four all-keys all have the corresponding relation with GB Chinese Character Set GB18030, because length is limit, corresponding code table is omitted.
But aforesaid a kind of words phoneme calculation function calculation of coding machine Chinese character speech storage method for inputting, behavior I, V, U represent punctuation mark or non-Chinese character literal.
The present invention has advantage and good effect is as follows:
1, Chinese character is answered the letter word combination by pronunciation attribute initial consonant, simple or compound vowel of a Chinese syllable, tone and ordered pair.Thereby realized the fractionation of each Chinese character and the combination of digital quantization fully.Each link wherein all strictness is followed [Scheme for the Chinese Phonetic Alphabet], sound, sound preface and the corresponding GB18030 Chinese characters of the national standard coding of [Chinese character standard dictionary] middle Chinese character that country carries out.Having formed visible word knows sign indicating number, sees the reciprocal intermediary's Chinese information processing system of sign indicating number character learning.
2, successfully realized " civilian unisonance ".Meet the needs that modern society's voice messaging is unified, standard exchanges development.Because not having the pronunciation attribute, Chinese character itself do not cause the whole nation not phenomenon of unisonance of becoming literate, the pronunciation with corresponding Chinese character (the comprising the four tones of standard Chinese pronunciation) locking that Chinese mosaic coding is then directly perceived, complete.
When 3, using Chinese mosaic coding input storage, can word, speech be broken, present Chinese character can not occur, the speech ISN is crowded together the situation that computing machine can't be judged continuously as the Chinese phonetic alphabet.
For example: Wuhan City's Yangtze Bridge need be seen a doctor.WU-HB?SI-ZC?JL-DA-QM?XU-YD?KB-BQ.
He is our good model.T’S?W-M’D?H?BC-YC.
4, polyphone is represented respectively, improved the accuracy and the degree of accuracy of Chinese expression.Because at present national standard codes is only selected a coding of this font with polyphone, and polyphone can't be distinguished, very easily cause judge chaotic.
For example: also owe 2,000,000 yuan of Renminbi.HVGB?QKT?RG-MP-BI?200?WB-YV.
" going back " character pronunciation wherein is that " Huan " or " Hai " have then represented two kinds of diametrically opposite meanings, and it is unique reciprocal corresponding to advise that for this reason GB increase polyphone coding is encoded itself and the present invention, makes Chinese information processing system more perfect, accurate and accurate.Make more than 1000 polyphone that a perfect home to return to be arranged.
5, Chinese character is made of the letter of each pronunciation attribute, has been realized the digital quantization of Chinese character, can participate in the character type computing.And can be combined into the pronunciation of this word by the monogram of each attribute that pronounces, for decomposing compose operation, Chinese machine talk provides a kind of new means.Realized handling simultaneously the target of Chinese, made that Chinese information is accessible to enter global information integral development track at western language system platform and network data base.
For example: bright moon light before the bed, CLJA QKGI MQGB YWTA GLAA,
Be suspected to be frost on the ground.YIIP?SIXD?DITB?SCWB?SLDD.
Light wherein and white word mosaic coding second must consistently just can show and rhyme with the 3rd.This clearlys show that also each letter (ASCII) after the Chinese character partition has separately independently function and effect in the mosaic scheme in own relevant position, and can judge own relevant position function and act under the control of distinct program in addition character type computing independently separately, revise, displacement, statistics independently function and effect separately.Just as english dictionary can be to each forms alphabetical Automatic Program ordering in the word.
6, Chinese information realization card-coating, writer are read, importing automatically to handle for Chinese information provides possibility.Because the Chinese character of 100,000 orders of magnitude is if independent expression then can not realize card-coating, writer reads, can only be manually hand-written after again by the keyboarder with the Hanzi keyboard typing.Chinese mosaic coding is then read as convenient card-coating, the writer realized of western language.It can be widely used in a large amount of registrations, examination, investigation, generaI investigation, variously fill out occasion such as single and comprise that information card-coating, the writer of Chinese character read, and greatly improves information processing efficiency and accuracy rate.Has the very big market space.
Reliable degree when 7, greatly having improved Computer Processing Chinese.System's Chinesizing any special measures such as system can because of present Chinese double byte or nybble not can not be split, can not misplace, the high-order set of byte cause Chinese mess code even cause system deadlock or the serious consequence of system crash.
8, can conveniently between computing machine and all kinds of digitizer (ASCII character collection) and network, carry out Chinese character information transmission and control.
9, than the easier and reliable encryption of two bytes, nybble internal code of using the expression Chinese character.Should there be good application space in departments such as military affairs, bank, traffic, trade.
10, can by the front three of standalone feature (initial consonant, simple or compound vowel of a Chinese syllable, the four tones of standard Chinese pronunciation) ascii character separately to Chinese mandarin pronunciation decompose, Computing is handled and the combination of independent function of pronunciation.
11, easy to learn, only need a class can grasp the mapping relations of Chinese mosaic and phonetic during student's learning Chinese phonetic, can enter the touch system state rapidly.
Embodiment
The present invention is that the Chinese character Chinese word coding is corresponding with GB18030 character library Chinese character standard coding, use relevant 4 corresponding unique corresponding each Chinese characters of capitalization QWERTY keyboard western language character with Chinese character pronunciation, sound, accent, preface series arrangement according to the Chinese character standard dictionary, simultaneously, possess simplified Chinese character and compile the speech rule, it is synthetic that it carries out digital quantization with the pronunciation attribute of Chinese character, has the character code correlativity as the Chinese phonetic alphabet, and on the basis of the Chinese phonetic alphabet, carried out optimization process, make it piece together word.Like this, the present invention has just formed has identifiability, and comprises the complete intermediary's Chinese information processing system of ASCII character digital quantization of initial consonant, simple or compound vowel of a Chinese syllable, tone and preface in the Chinese character pronunciation attribute.This coding meets uniqueness, can recognize the property read, fully letter, character standard.And can accomplish to see that word knows sign indicating number, and see that the sign indicating number character learning is reciprocal, can be used as the direct touch system input of encode Chinese characters for computer Chinese, realize storage computing of machine Chinese and Network Transmission.Because do not need the machine operation system is carried out the Chinese Chinesizing, so can on all kinds of ASCII character digital devices, realize Chinese programming, control and application system.Only need in computing machine, to move a plug-in smart screen Chinese character demonstration in case of necessity and print converse routine and get final product.
The unique relevant method for expressing of this Chinese character character can make Chinese language computer input code and storage transmission code complete unity, truly realizes the Chinese character digitalized of computing machine.Make the English computer operating system of the accessible complete unique turnover of Chinese information, and participate in the character type computing, greatly improve security, the reliability of Computer Processing Chinese, and be not subjected to the restriction of operating system and data base management system (DBMS), and reduced system overhead, improved work efficiency.For the possible approach that provides is provided for the action control of Chinese language computer and all kinds of digitizer identification Chinese order and the Internet bank, global electronic commercial affairs, E-Government and the Digital Nervous System that are about to set up.
Coding rule of the present invention:
Row, column, vertical, four western language capitalizations of preface are represented each Chinese character:
Behavior initial consonant wherein.Use 23 western language capitalizations to represent, no initial consonant represents that as row alphabetical IVU is as specific use with first character.Corresponding relation is as follows:
A:a?B:b?C:c,ch?D:d?E:e?F:f?G:g?H:h?J:j?K:k?L:l?M:m
N:n?O:o?P:p?Q:q?R:r?S:s,sh?T:t?W:w?X:x?Y:y?Z:z,zh
Wherein classify simple or compound vowel of a Chinese syllable as.Use 26 western language capitalizations to represent that corresponding relation is as follows:
A:a?B:an?C:ang?D:ao?E:e?F:ei?G:en?H:eng?I:i?J:ia,ua
K:ian?L:iang,uang?M:iao?N:ie,uai?O:o,er?P:in?Q:ing
R:iong,ong?S:iu?T:ou?U:u,u?V:uan?W:ue,ui?X:un?Y:uo?Z:ai
Wherein vertical is the four tones of standard Chinese pronunciation.Use 26 western language capitalizations to represent, corresponding relation is as follows: ABC DEF represents high and level tone () tone in proper order, and GHI JKL represents rising tone (two) tone in proper order, and MNO PQRS represents sound (three) tone in proper order, TUV WXYZ represents falling tone (four tones of standard Chinese pronunciation) tone in proper order
Wherein be included into S softly.Row, column, vertical three are defined as specific this tone word.Initial consonant is ch, sh, and zh and simple or compound vowel of a Chinese syllable u use D, J, the rear portion tone letter that P, W begin.
Wherein preface is this tone sequence code.Speech connects with "-" symbol.Special speech can connect with special symbol.
Back asyllabia adds " ' " the symbol connection.All the word collection by " Chinese character standard dictionary " (ISBN7-5610-3502-0) sound preface index of Chinese Characters arrange, make things convenient for the student to learn and use.
The decomposition and combination function of Chinese character itself being pressed attribute not only is suitable for (the plug-in Chinese international 18030 internal code converse routines of using system periphery get final product when needs show and print Hanzi font, and effect Chinesizes identical with present system) to the input of keyboard Chinese character, storage, Network Transmission.Simultaneously can by Chinese character separately the front three of standalone feature (initial consonant, simple or compound vowel of a Chinese syllable, the four tones of standard Chinese pronunciation) ascii character to the audio frequency of Chinese character pronunciation decompose, computing machine is to front and back words calculation process, four all-key brevity codes that make this words and asyllabia, back and speech accurate location fast, realizes the computing machine simultaneous interpretation.On the contrary, the independent front three of Chinese character speech (initial consonant, simple or compound vowel of a Chinese syllable, the four tones of standard Chinese pronunciation) can be merged out the pronunciation of this word by the audio group of the pronunciation of first initial consonant and second and the 3rd four tones of standard Chinese pronunciation simple or compound vowel of a Chinese syllable, makes individual character independently pronounce to become and makes up, thereby reduced the pronunciation sample.
A brevity lists:
Like A not B go out C the good H of D E part F worker G with regard to J mouth K your N of L M
It is that the little X of he I W of T of S has the sub-Z of Y that O flat P in Europe removes Q people R
Two brevity lists:
The high AC of AA sound of sighing the AZ peace AB AD eight BA difficult to understand hundred BZ half BB BC of group protects BD north BF
This BG collapses BH and compiles BK table other BN guest BP of BM and the BU Ppun BX of BQ ripple BO portion than BI
Wealth CZ ginseng CB looks into the old CG of the long CC of CA super CD car CE and claims CH chi CI to take out the CU of CT place
Xu CJ carry CN pass CV wound CL blow CW spring CX from the big DA of the wrong CY of CR for the single DB of DZ
Get the flirtatious DJ electricity of DH ground DI DK accent father DM DN such as DF Den DG to DD moral DE as DC
Decide DQ and lose the DS east short DV of DR bucket DT degree DU many DY of DW ton DX volume EE grace EG
Eng EH two EO send out the anti-FB side FC of FA and fly FF to divide FG wind FH Fiao FM Buddhist FO deny FT
Multiple FU loud, high-pitched sound GA changes the dried high GD of the GB port GC GE of GZ and follows the more public GR of GH of GG to GF
The well-behaved GN of the ancient GU melon of structure GT GJ closes GV light GL rule GW and rolls HA sea, GX state GY Kazakhstan HZ
The black HF of capable HC HD of Chinese HB and HE is HT family HU China HJ behind the red HR of the horizontal HH of HG very
The yellow HL meeting of the bosom joyous HV of HN HW wedding HX HY meter alive JI valency JJ builds JK and says JL angle JM
Knot JN gold JP contributes the JV JX of the JW army card KA that determines through bright JR nine JU of JS office of JQ and opens KZ
See that the anti-KC of KB examines the empty KR mouth of KD gram KE G KF willing KG hole KH KT storehouse KU and overstates KJ
The fast wide KV condition of KN KL loses the tired KX expansion of KW KY and draws LA to come the old LD of the blue LB wave LC of LZ
The sharp LI LJ connection of the cold LH of happy LE class LF LK amount LL material LM row LN woods LP zero LQ
Six LS cough up LO dragon LR building LT road LU mountains in a range LV slightly the LW opinion LX LY horse MA that falls bury MZ
The busy MC of full MB emits the beautiful MF door of MD ME MG to cover the MH rice MI face MK MM second MN that goes out
That NA of the wrong MS mould of people MP name MQ MO MT order MU is NZ south NB capsule NC brain ND
The tender NG of NF can pinch the peaceful NQ of you NP of NN by NH Buddhist nun NI ma NK NL bird NM in the NE
The warm cruel NW Nun of the NV NX promise NY OO idol of ox NS farming NR weeding hoe NT woman NU OT is afraid of PA
Row PZ expects that the other PC of PB runs PD and joins PF basin friend PG PH and criticize PI sheet PK ticket PM and shoot a glance at PN
Product PP comments the broken PO of PQ to cut open the strong QL bridge of proper QJ thousand QK of general PU seven QI of PT QM and cuts QN
The full QV of the parent clear QQ of QP poor QR autumn QS district QU lacks the right RB of QW group QX allows RC disturb RD
Hot RE appoint RG still RH day RI hold RR meat RT such as the auspicious RW profit RX of the soft RV of RU if RY
The husky SA solarization of Three S's B SZ goes up the living SH hand of the few SE of the SD society body SG of SC ST and counts SU brush SJ
The two SL water SW of rate SN bolt SV along SX say its TA of SY four SI pine SR too TZ talk TB
Painful TI days TK bars of the TH body TM iron TN of the special TE of hall TC cover TD listens TQ with TR TT
The figure TU TV of group pushes away TW and gulps down outer WZ ten thousand WB of TY watt of WA of TX holder toward WC position WF literary composition WG
Father-in-law WH holds the new XP star of the little XM XN of association of the existing XK phase XL of XJ XQ under the XI of WO five WU west
Brother XR repaiies XS and is permitted XU a surname XV and learn XW ten days XX and press the tight YB sun of YA YC to want YD page or leaf YE
The silver-colored YP English of one YI YQ YO transports the assorted ZA of YX with the right YT language of the YR YU YV month YW of unit
Open ZR week ZT among the positive ZH of the true ZG of this ZE of the ZC million ZD ZI at ZZ thief ZF station ZB
Main ZU grabs Z J and drags the special ZV of ZN dress ZL and chase after the accurate ZX of ZW left side ZY
The example explanation:
1, Chinese character is answered the letter word combination by pronunciation attribute initial consonant, simple or compound vowel of a Chinese syllable, tone and ordered pair.Thereby realized the fractionation of each Chinese character and the combination of digital quantization fully.Each link wherein all strictness is followed [Scheme for the Chinese Phonetic Alphabet], sound, sound preface and the corresponding GB18030 Chinese characters of the national standard coding of [Chinese character standard dictionary] middle Chinese character that country carries out.Having formed visible word knows sign indicating number, sees the reciprocal intermediary's Chinese information processing system of sign indicating number character learning.
For example: scolding (MATD), ant (MATF) wherein first alphabetical M are initial consonant, and second letter A is a simple or compound vowel of a Chinese syllable, and the 3rd tee is the fourth sound, just the mandarin Received Pronunciation of this word have been locked with first three letter like this.The 4th letter then is the preface that unisonance, the people having the same aspiration and interest are arranged by the national standard stroke order.The Chinese character that this coding is corresponding then is a GB18030 Chinese characters of the national standard coding.
2, the no initial consonant of ABAD peace expression is represented first character as row.
3, V ... the expression symbol " ... " VAB represent symbol ", "; VAC represent symbol "." or the like.
4, long Lu LUPA of the long ZCPA of CCJA represents that respectively the similar shape polyphone needs independent separately expression and cerebral and simple or compound vowel of a Chinese syllable U to use DEF, JKL, PQRS, WXYZ rear portion tone letter.
5, MASA wherein the 3rd vertical be that S represents softly.
6, T (TAAB) he; TC (TCGD) hall; TCM (TCMG) lies and represents that respectively everyday character is provided with that one, two, three brevity codes are represented and each brevity code correspondence four all-keys separately.
7, BC-YC model; XP-HJ-SE Xinhua News Agency; The whole independent standard words that is not subjected to corresponding two brevity code words restriction is represented in HU-HE-HD-TE Huhehaote respectively.
8, W-M we; The DBAE-JJTD stretcher represents to use any brevity code, four represented words of all-key to carry out the combination in any speech.
9, T ' S he be; DA ' JJAS; DA-JJ represent the asyllabia, back usage and with the differentiation of speech, be convenient to the reading comprehension and the computing machine meaning of one's words and understand and handle.
10, by receiving mandarin pronunciation " mother ", the computing immediately of machine program is decomposed into " MAAMAA " each three phoneme codes as calculated, can draw definite Chinese mosaic sign indicating number of " MA-MA " again through computing: the pronunciation that can be combined into this word according to first initial consonant and second, third four tones of standard Chinese pronunciation simple or compound vowel of a Chinese syllable of three phoneme codes in like manner.
11, by above Chinese mosaic rule can form accurately, intermediary's Chinese information processing system accurately.Can be easily with the composition function of Chinese and Chinese character perform mathematical calculations, semantic understanding and control accurately.
Sentence example: Wuhan City's Yangtze Bridge.WU-HD-SI?CC-JL?DA-QM。
WU-HB?SI-ZC?JLA-DA-QM。
He is our good model.T’S?W-M’D?H?BC-YC。
Also owe 2,000,000 yuan of Renminbi.HVGB?QKT?RG-MP-BI?200?WB-YV。
HZG?QKT?RG-MP-BI?200?WB-YV。
Unconventional customization moulding.FFA’CC-GW?DQ-ZIWK?ZD-XQ。
The word that wherein should the break speech that breaks; Long (chang) and long polyphones such as (zhang) need be represented respectively; Wherein " go back " character pronunciation and be " Huan " or " Hai " then represented two kinds of diametrically opposite meanings; Wherein " unconventional " then needs the asyllabia, back to be distinguished.Because multitone ambiguity shape similar word is nearly more than 1000 in the Chinese character, and is mostly everyday character, so just brings difficulty for accurate semantic understanding.The solution that the present invention is then complete the problems referred to above, Chinese is expressed accurately.The ASCII character sequence can be handled the decomposition function of Chinese character pronunciation alone and the clog-free world's information integral system that enters separately simultaneously.

Claims (10)

  1. But 1, a kind of words phoneme calculation function calculation of coding machine Chinese character speech storage method for inputting is characterized in that its uses four of fixed length---row, column, vertical, preface English capitalization are as the unique corresponding sign of Chinese character.Performing step is as follows:
    1), behavior initial consonant wherein, use 23 western language capitalizations to represent, no initial consonant is represented first character that alphabetical IVU uses as punctuation mark and other literal code space as row, corresponding relation is as follows:
    A:a B:b C:c,ch D:d E:e F:f G:g H:h J:j K:k L:l M:mN:n O:o P:p Q:q R:r S:s,sh T:t W:w X:x Y:y Z:z,zh
    2), wherein classify simple or compound vowel of a Chinese syllable as, use 26 western language capitalizations to represent that corresponding relation is as follows:
    A:a B:an C:ang D:ao E:e F:ei G:en H:eng I:i J:ia,uaK:ian L:iang,uang M:iao N:ie,uai O:o,er P:in Q:ingR:iong,ong S:iu T:ou U:u,u V:uan W:ue,ui X:un Y:uo Z:ai
    3), wherein vertically be the four tones of standard Chinese pronunciation, use 26 western language capitalizations to represent, corresponding relation is as follows: ABC DEF represents the high and level tone tone in proper order, GHI JKL represents the rising tone tone in proper order, MNO PQRS represents several accent in proper order, and TUV WXYZ represents the falling tone tone in proper order, wherein is included into S softly; Row, column, vertical three are defined as specific this tone word; Initial consonant is ch, sh, and zh and simple or compound vowel of a Chinese syllable u use DEF, JKL, PQRS, WXYZ tone letter.
    4), wherein preface is this tone standard stroke sequence code; Use A-Z totally 26 capitalizations, cut apart with the space between the words.
  2. But 2, a kind of words phoneme calculation function calculation of coding machine Chinese character speech storage method for inputting as claimed in claim 1, it is characterized in that: everyday character is represented with one, two, three brevity codes each brevity code correspondence four all-keys separately use for calculation process.
  3. But 3, a kind of words phoneme calculation function calculation of coding machine Chinese character speech storage method for inputting as claimed in claim 1 or 2 is characterized in that: connect with "-" symbol between each word of Chinese word; XX-XX, XX-XX-XX, XX-XX-XX-XX form are standard words, and they are whole corresponding expressions, not limited by corresponding two brevity code words; All the other speech all can use any brevity code, four represented words of all-key to carry out the combination in any speech.
  4. But 4, a kind of words phoneme calculation function calculation of coding machine Chinese character speech storage method for inputting as claimed in claim 1, it is characterized in that: the back asyllabia in the Chinese, speech, use to be easy to the speech difference and to be convenient to reading comprehension and the computing machine meaning of one's words is understood " " of handling ' symbol is connected.
  5. But 5, as claim 1,2 or 4 described a kind of words phoneme calculation function calculation of coding machine Chinese character speech storage method for inputting, it is characterized in that: to the input of keyboard Chinese character, storage, Network Transmission is that the plug-in Chinese international 18030 internal code converse routines of using system periphery get final product when needs show and print Hanzi font by the decomposition and combination of the attribute that pronounces.
  6. But 6, as claim 1,2 or 4 described a kind of words phoneme calculation function calculation of coding machine Chinese character speech storage method for inputting, it is characterized in that: when the computing machine simultaneous interpretation, press the Chinese character front three of standalone feature separately, the ascii character that is initial consonant, simple or compound vowel of a Chinese syllable, the four tones of standard Chinese pronunciation decomposes Chinese mandarin Chinese character pronunciation, computing machine is to front and back words calculation process, the requirement location of four all-key brevity codes of this words and the asyllabia, back and the speech note method meaning of one's words arranged form a complete sentence; The independent front three of Chinese character speech, promptly initial consonant, simple or compound vowel of a Chinese syllable, the four tones of standard Chinese pronunciation are merged out the pronunciation of this word by the audio group of first initial consonant pronunciation and second and the 3rd four tones of standard Chinese pronunciation simple or compound vowel of a Chinese syllable.
  7. But 7, a kind of words phoneme calculation function calculation of coding machine Chinese character speech storage method for inputting as claimed in claim 2 is characterized in that being expressed as of a brevity lists: like A not B go out C the good H of D E part F worker G with regard to J mouth K you the N Europe flat P of O of L M to remove Q people R be that the little X of he I W of T of S has the sub-Z of Y.
  8. 8、2,:AA AZ AB AC AD BA BZ BB BC BD BFBG BH BI BK BM BN BP BQ BO BU BXCZ CB CA CC CD CE CG CH CI CT CUCJ CN CV CL CW CX CR CY DA DZ DBDC DD DE DF DG DH DI DJ DK DM DNDQ DS DR DT DU DV DW DX DY EE EGEH EO FA FB FC FF FG FH FM FO FTFU GA GZ GB GC GD GE GF GG GH GRGT GU GJ GN GV GL GW GX GY HA HZHB HC HD HE HF HG HH HR HT HU HJHN HV HL HW HX HY JI JJ JK JL JMJN JP JQ JR JS JU JV JW JX KA KZKB KC KD KE KF KG KH KR KT KU KJKN KV KL KW KX KY LA LZ LB LC LDLE LF LH LI LJ LK LL LM LN LP LQLS LO LR LT LU LV LW LX LY MA MZMB MC MD ME MF MG MH MI MK MM MNMP MQ MS MO MT MU NA NZ NB NC NDNE NF NG NH NI NK NL NM NN NP NQNS NR NT NU NV NW NX NY OO OT PAPZ PB PC PD PF PG PH PI PK PM PNPP PQ PO PT PU QI QJ QK QL QM QNQP QQ QR QS QU QV QW QX RB RC RDRE RG RH RI RR RT RU RV RW RX RYSB SA SZ SC SD SE SG SH ST SU SJSN SV SL SW SX SY SI SR TA TZ TBTC TD TE TH TI TK TM TN TQ TR TTTU TV TW TX TY WA WZ WB WC WF WGWH WO WU XI XJ XK XL XM XN XP XQXR XS XU XV XW XX YA YB YC YD YEYI YP YQ YO YR YT YU YV YW YX ZAZZ ZF ZB ZC ZD ZE ZG ZH ZI ZR ZTZU ZJ ZN ZV ZL ZW ZX ZY。
  9. But 9, a kind of words phoneme calculation function calculation of coding machine Chinese character speech storage method for inputting as claimed in claim 2 is characterized in that: three brevity codes and four all-keys all have the corresponding relation with GB Chinese Character Set GB18030.
  10. But 10, a kind of words phoneme calculation function calculation of coding machine Chinese character speech storage method for inputting as claimed in claim 1, it is characterized in that: behavior I, V, U represent punctuation mark or non-Chinese character literal.
CNB2007100003103A 2007-01-08 2007-01-08 Storage method for inputting Chinese word into computer by calculation functional encoding phoneme of word Expired - Fee Related CN100474219C (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CNB2007100003103A CN100474219C (en) 2007-01-08 2007-01-08 Storage method for inputting Chinese word into computer by calculation functional encoding phoneme of word
PCT/CN2007/000134 WO2008086653A1 (en) 2007-01-08 2007-01-12 Computer chinese character storing and inputting method of word phoneme calculable function coding

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2007100003103A CN100474219C (en) 2007-01-08 2007-01-08 Storage method for inputting Chinese word into computer by calculation functional encoding phoneme of word

Publications (2)

Publication Number Publication Date
CN101004640A true CN101004640A (en) 2007-07-25
CN100474219C CN100474219C (en) 2009-04-01

Family

ID=38703830

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2007100003103A Expired - Fee Related CN100474219C (en) 2007-01-08 2007-01-08 Storage method for inputting Chinese word into computer by calculation functional encoding phoneme of word

Country Status (2)

Country Link
CN (1) CN100474219C (en)
WO (1) WO2008086653A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104331173A (en) * 2012-04-16 2015-02-04 宗刚 Computer processing method and system for character information
CN110334527A (en) * 2019-05-31 2019-10-15 范玉明 A kind of Chinese character encrypting and decrypting method based on the Chinese phonetic alphabet
CN111833660A (en) * 2020-06-17 2020-10-27 胡屹 Chinese character learning implementation system, learning device and method
US11017170B2 (en) 2018-09-27 2021-05-25 At&T Intellectual Property I, L.P. Encoding and storing text using DNA sequences

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107066080B (en) * 2016-12-15 2023-02-10 郑远泾 Chinese character pronunciation, chinese character and symbol coding input method

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1122469A (en) * 1995-06-09 1996-05-15 广州师范学院 Spelling tone and first stroke code Chinese character input method
CN1052313C (en) * 1996-03-07 2000-05-10 史颖 Three-dimension Chinese words and characters one by one casting method and keyboard input
CN1219699A (en) * 1997-12-12 1999-06-16 余彦中 Chinese phonetic character input scheme

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104331173A (en) * 2012-04-16 2015-02-04 宗刚 Computer processing method and system for character information
US11017170B2 (en) 2018-09-27 2021-05-25 At&T Intellectual Property I, L.P. Encoding and storing text using DNA sequences
CN110334527A (en) * 2019-05-31 2019-10-15 范玉明 A kind of Chinese character encrypting and decrypting method based on the Chinese phonetic alphabet
CN111833660A (en) * 2020-06-17 2020-10-27 胡屹 Chinese character learning implementation system, learning device and method

Also Published As

Publication number Publication date
WO2008086653A1 (en) 2008-07-24
CN100474219C (en) 2009-04-01

Similar Documents

Publication Publication Date Title
CN101989256B (en) Typesetting method of document file and device
CN100474219C (en) Storage method for inputting Chinese word into computer by calculation functional encoding phoneme of word
CN103455608A (en) Management and inquiry system based on medicine coding
CN102883059B (en) The display method and apparatus of note, the method and apparatus of answer short message
CN104572615A (en) Method and system for on-line case investigation processing
CN108694214A (en) Generation method, generating means, readable medium and the electronic equipment of data sheet
CN106815366A (en) A kind of method and system of Mass production data
CN112115143A (en) Automatic data updating and synchronizing method and device, electronic equipment and storage medium
CN104915698B (en) Chemical producting safety information quick search and complete period tracking digital labelling system
CN105912723A (en) Storage method of custom field
CN103778110B (en) The conversion method of simplified and traditional Chinese characters and system
CN112085357A (en) System and method for recognizing and processing important points of conflict of plot planning conditions
CN104407839A (en) Complex calculation logic analytical method and device
CN104750380A (en) Information processing method and electronic equipment
CN111008013A (en) Analysis and visualization system for field fissure zone test data
CN112860653A (en) Government affair information resource catalog management method and system
CN109739923A (en) A kind of method and system that data import
CN103488616B (en) A kind of embedded font processing method and device
CN104598636A (en) Complex document separating and organizing method and complex document automatic generating method
CN107346338A (en) File directory sort method and device
CN100371866C (en) Fast and convenient inputting method with code number and pictograph
CN101968682B (en) A kind of Chinese character input method and system thereof
CN104794236A (en) Map making rule construction and structured organization method and system thereof
CN203217485U (en) Computer quick-recording device
CN113190668A (en) Man-machine interaction method, device and equipment based on multi-turn conversation and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20170126

Address after: 100027 Dongcheng District Wang Hutong, room 10, No. 230, room

Patentee after: Beijing Han Code Rubik''s Cube Technology Co. Ltd.

Address before: Room 8, unit 6, building 102208, Tongda garden, Beijing, Huilongguan, China

Patentee before: Shi Ying

CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20090401

Termination date: 20200108

CF01 Termination of patent right due to non-payment of annual fee