CN1523518A - Intelligent Chinese cultural dictionary system - Google Patents

Intelligent Chinese cultural dictionary system Download PDF

Info

Publication number
CN1523518A
CN1523518A CNA031040500A CN03104050A CN1523518A CN 1523518 A CN1523518 A CN 1523518A CN A031040500 A CNA031040500 A CN A031040500A CN 03104050 A CN03104050 A CN 03104050A CN 1523518 A CN1523518 A CN 1523518A
Authority
CN
China
Prior art keywords
character
word
tone
chinese
pronunciation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA031040500A
Other languages
Chinese (zh)
Inventor
郭慧民
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CNA031040500A priority Critical patent/CN1523518A/en
Publication of CN1523518A publication Critical patent/CN1523518A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

The invention belongs to information processing field. The invention merges each kind of culture books and records of china by using character, word, sentence and text as multielement knowledge points through setting netlike knowledge radiation system. It realizes screen full text hotspots, synchronous switch of horizontal and perpendicular typeface, complex character or simple character supported by coding and compressing, composing image and system integrating technology. The hotspot character and word can know phonetic alphabet tone, stroke and writing, explain, multivocal group, relative expressions, proverbs and allusions, rhesis and proverbs, antithetical couplet and so on. The book refers to Confucian classics, history, philosophy and literature, culture files, and so on; it provides various inquiring mode, code matching, text searching and classification selecting.

Description

Intelligent Chinese culture dictionary system
One. technical field
The invention belongs to field of information processing.System has set up netted study framework, reconfigure cultural dictionary, to knowledge type idiom material cross-searching, the information extraction data, the input and the study of Chinese character are combined together, the present invention can be applicable to fields such as microcomputer, PDA, notebook, panel computer, network communication, and occurs with forms such as unit product, networking products, digital library, education network sight, printed matter, audio-visual products.
Two. background technology
Development of computer has produced profound influence to the propagation of Chinese culture.The electronic solution of tradition dictionary has been brought into play the advantage in computer storage and the retrieval, as CD version " Modern Chinese dictionary ", OEM version " Kingsoft Powerword " and various PDA e-dictionary, but is still the electronic form of book.The common part of above dictionary on learning functionality is the next corresponding related content of selection by input text inquiry or tabulation subitem, comprises that the lexical or textual analysis of word is explained.Belong to a kind of unidirectional knowledge transmission course.
Digital library is presented on books miscellaneous in face of the people by network, and the e-book e-book becomes the fashion mode that people download.Complete convenience is fast to be its advantage, magnanimity search and the excessive use that is restricting people of download capability, and various vastness of content makes that the horizontal knowledge relationship of strengthening the learner is particularly important.
Web education and multimedia courseware are to be core with the classroom instruction, by modem technology,, integrate and utilize multiple information resources as campus network, and be synchronous knowledge-transference behavior.
Three. summary of the invention
1. deal with problems
Break the idea of " book ", cultural ancient books and records reference book is carried out scheme reform, make information reach comprehensive perforation.By the contact of knowledge point, become passive type study into guiding study, draw as netted " order " with learned side, as netted " guiding principle ", once the key link is grasped, every thing falls in to place with the knowledge framework.The present invention is non-to be confined to disposable knowledge learning, is intended to reallocating resources.All texts of screen all can be used as focus, under the structure net system of system, carry out the relevant knowledge link.As the ancient rhythm of the phonetic that can further understand this word by braille, numerous different simplified, stroke is write, word origin, lexical or textual analysis group speech, with contents such as anti-near synonym woods, related phrase, idiomatic allusion, well-known phrase proverb, distich word fan, two-part allegorical sayings.By multiple inquiry integrated mode, as encoding scheme, text input, category filter is searched the location required information.The setting of grade, frequency makes the level of learning of a words sentence piece of writing different because of need because of the people, sharing data resources, compressed storage space.
The present invention is to propagate traditional culture knowledge and universal education, can relate to the literature dictionary, Confucian classics, history, philosophy and literature, private school's reading matter, cultural resources such as opera calligraphy and painting, garden architecture, medicine culinary art, wushu folk custom, and various well-known phrase phrase vocabulary dictionaries, multi-functional dictionary etc., multidirectional means such as classification collocation, order of classification and senses of a dictionary entry selection are provided, realize intersecting and read.The knowledge chain is all linked with one another, from one to the other.Multiple classifition classification mechanism makes the traditional culture subject content of Chinese network realize regularly substituting automatically.
Quality-oriented education embodies the aim that people-oriented.The present invention provides the aid of equality to the academics and students, has the many prefaces of classification, vocabulary collocation mechanism, autonomous level setting and shielding are set, for grasping vocabulary, the user of different regions, different levels provides correspondingly linguistic context, at teaching practice, can increase and delete chapter vocabulary data, be autonomous exchange study approach.
2. technical scheme
Intelligent Chinese culture dictionary system is examined the single Chinese character of multiple information characteristics as Data Base, build on this basis is the Open architecture of the group certainly database of compression unit layer by layer with " position ", and attached with coding techniques, typesetting technique, compress technique, graph image technology and system integration technology, the application software modelling of carrying out with the idea of system development.
Native system has adopted the open type data structure, the division in classification word storehouse has improved the ratio of compression of two words, three words, multi-character words, the combination of concordance list and algorithm makes the location of magnanimity statement rapid, powerful multidirectional mark make punctuation mark, to cut aspects such as speech note, loose-leaf composing accurately practical.Under the support of this technology, speech recognition and speech synthesis technique also have ample scope for one's abilities.Constantly perfect by system promotes the standardization of data structure, shares laying the foundation for the data resource of realizing traditional culture.
All literal storages of system all are to be based upon on level of abstraction Chinese character meaning and pronunciation storehouse and the word compression basis, cutting storehouse, but not the Hanzi internal code of file structure formula.From the storage structure design, can realize the unlimited dilatation of Chinese character.When system transplantation and upgrade expanding, database need not be revised, and highlights data compression and shares advantage.
The study code inputting method that the dictionary system derives from is made up of words input magazine, Chinese character information storehouse, word note storehouse, classification words and phrases storehouse and auxiliary resources storehouse five part of module, and different and other input methods only have the unitary system of words input magazine.
The words input magazine realizes basic Chinese character input, is the main coherent literal input of thinking of finishing with phonetic, has individual character haracter pattern rule stroke mode simultaneously concurrently and mistake is known the other alternative means of debating of Chinese character, awkwardly recognizes and misdeems the Chinese character input of providing convenience.The Chinese character information storehouse provides Chinese character radicals and strokes, pronunciation font, collocations information.Word note storehouse is also included encyclopaedia clauses and subclauses commonly used in except that Chinese language senses of a dictionary entry clauses and subclauses.Classification words and phrases storehouse is a trunk with scape, thing, people, thing, reason, feelings, refinement word's kinds content.The auxiliary resources storehouse can be selected to articulate as required, enriches online vocabulary sign resources for online family provides.
Four. description of drawings
The operational flowchart of system is with reference to Figure of description one;
The operational flowchart of modern Chinese dictionary is with reference to Figure of description two;
The technical pattern framework of system is with reference to Figure of description three.
Five. embodiment
System development platform is based on the Visual C++ under the Windows operating system at present, can various in the same way operating systems, in Linux, WinCE and various LAN (Local Area Network), Hanzi internal code adopts GBK character set and Unicode sign indicating number two sets of plan, also can be applicable to various kanji codes be.
1. storage example
Compression dictionary and cutting dictionary are arranged on basic pronunciation and meaning character library, become the double byte or the interior pattern storage of nybble of original Chinese character, be abstract pronunciation and meaning sign indicating number of various ways and the storages of compression speech pattern such as individual character double byte, double word double byte, three word double bytes, multiword double byte, individual character nybble, structurally make the compressed storage of Chinese character and the dilatation of China and foreign countries' literal melt one altogether.The compression data file form combines with the embedding of basic pronunciation and meaning character library, compression cutting dictionary, make that the data file of text is relevant, note, inquiry, complicated and simple, set type, write, full spectrum information such as compression.
It is as follows that the Tang Dynasty poet opens the corresponding sentence formula unit compressed format of the four-line poem with seven characters to a line " autumn think of " of nationality:
Words storage and correspondence code are: (autumn wind) is seen by (Luoyang) (inside the city)
26155 30704 58 26611
(desire work) (letter from home) meaning (ten thousand weights)
41196 26053 162 40744
Multiple probably (hurriedly) says (not to the utmost)
176?263?28231?2880?30706
(pedestrian) faces and sends out (Kaifeng) again
26172?259 332?298 30707
Punctuation bit and punctuation mark are 4*16+3,4*16+5,5*16+3,5*16+5, i.e. first Chinese-character word-phrase storage accounts for nybble, punctuation mark be ", ", second Chinese-character word-phrase stores and accounts for nybble, punctuation mark be ".", by that analogy.
Its group speech shielding is 2 for number, and the shielding classification is a full-shield, and cutting dictionary mask off code is 32265 and 30707, and corresponding phrase is " writer " and " Kaifeng ".
Several 1 of word explanation part word explanation is orientated second the 3rd compression words as, and length is 2 compression words, i.e. and " meaning (ten thousand weights) ", note length is 12 bytes.Punctuation bit and punctuation mark are 2*16+0 (containing " " word), 2*16+5,0*16+0,0*16+0, storage of note words and correspondence code are: (express) (meaning) (a lot)
112?27343 29672 33210
Whole verse words storage accounts for the 18*2 byte, and the punctuate mark accounts for 4 bytes, and the shielding of group speech accounts for the 1+2*2 byte, and word explanation accounts for the 1+2+12 byte, amounts to 60 bytes.
2. coded query example
By the coded combination of sound, rhythm, shape, justice and asterisk wildcard and all kinds of conditions, finish any search of words and phrases, realize required information location, have the statistical function of autonomous limited range simultaneously.
Up and down the sentence locating query:
It repaiies Endless Way far
Can piece together lmmqxyx or spelling mode lumanmanqixiuyuanxi by contracting;
The phrase inquiry:
Detect the verse that all contain " spring breeze "
*chunfeng *
The harmonious sounds inquiry:
Detecting first Chinese character is the following verse of secondary of high and level tone chun or high and level tone qiu
chunl|qiul *:2
Detecting simple or compound vowel of a Chinese syllable is the secondary verse of ou
*(ou):20
Detecting the new rhythm rhyme of China is the verse of rising tone " trace " rhythm or rising tone " heptan " rhythm
*(215|217)
Detect the verse of " level and oblique tone level and oblique tone is narrow flat "
(!@!@@@!)
Detecting thirteen rhyme schemes rhyme is the verses of high and level tone " Huailai " rhythm one or two words for " bright moon "
mingyue *(226)
Detecting the civilian rhyme rhyme of wearing is the verse of last flat " trembling with fear "
*<114>
Detecting the positive rhythm rhyme of speech woods is the words and phrases of three ones of last sound
*[303]
Detect the bent sentence that the bent rhythm rhyme in Central Region is an even tone " Huan is joyous "
*{109}
The inquiry of font WITH statement:
Detect the verse that 3,4 words contain grass-character-head
??609609 *
Detecting Chinese character is the poem with five characters to a line sentence of left and right sides structure
+00+00+00+00+00
Detect three the poem with seven characters to a line sentences that one or two of Chinese characters are stroke anyhow in end
*121212=7
Detecting first Chinese character five-stroke, to be depicted as " anyhow cast aside press down press down " stroke number be eight verses of drawing, as " woods ", " maple "
12344&08 *
Sound shape limits vocabulary inquiry:
Detecting the first word radicals by which characters are arranged in traditional Chinese dictionaries pieces together for " Rolling " contracts and is two words of yh
yh/s
yh/ss
The inquiry of subclass WITH statement:
The word that detection Ouyang repaiies is the ancient speech of butterfly love flower
dlh\poyx
Detect the Tang poetry four-line poem with seven characters to a line
*\s04
Detect li po's friendship poem
*\p606n20
Detect women author's poem
*\p000
Front and back intersect to be inquired about:
Detect eight prose masters of the Tang-Song period personage's (another name attitude)
tsbdj
Detect the another name (another name attitude) of Pai Chu-yi
-bjy
Detect two-part allegorical saying preceding half section " mix small onions with beancurd "
xcbdf
Detect the two-part allegorical saying second half section " perfectly clear "
-yqeb
3. learn the code inputting method example
Remove common phoneticizing type input, the study code inputting method also has information attitude, note attitude, classification attitude.
1. the information attitude is " I " pattern, adds the corresponding coding of individual character, and radicals by which characters are arranged in traditional Chinese dictionaries, stroke, pronunciation, the senses of a dictionary entry, collocations, the sentence-making example information of Chinese character is provided.
" Ah "'s information attitude is encoded to iaa, and testing result is as follows:
The left Fu of Ah " portion, seven " [a1]<Wu dialect〉prefix.◎ is used in the front of seniority among brothers and sisters, pet name or surname, and parent's meaning is arranged.The ※ eldest | A Bao ◎ is used in the front of some relatives' title.Granny ※ | elder brother ◎ is used in verb or adjective front, makes query tone auxiliary word.※ Ah going? | does Ah recognize? [a] interjection.With " " [a], modern general writing.◇ joins IABE.
2. the note attitude is " U " pattern, adds phonological encoding, both can be at the individual character multitone, again can be at the multiword word, the former correspondence " I " pattern-coding, the latter is except that Chinese language senses of a dictionary entry clauses and subclauses, also include encyclopaedia clauses and subclauses commonly used in, and corresponding classification attitude " V " pattern-coding is provided.
Syllable is that the testing result of a is as follows: ua
Ah ◇ joins IAA
◇ joins IAB
Breathe out ◇ ginseng IAC
A word used for translation ◇ joins IAD
Actinium ◇ joins IAE
The ◇ that salts down joins IAF
Sha ◇ joins IAG
Syllable is that the testing result of abl is as follows: uabl
Apollo in Apollo [A1-Bo1-Luo2] ◎ Greek mythology.Main refreshing Zeus's son.With Artemis be heterosexual twins.Be responsible for light, youth, medicine, herding, music, poem.And represent Zeus to declare refreshing purport.◇ joins VDIAF
" UI " pattern adds an individual character preface.Horizontal 1, perpendicular 2, cast aside 3, press down 4, folding 5.
" aunt " testing result is as follows: ui53112251
Aunt ◇ joins IGUA
3. the attitude of classifying is " V " pattern, add the classification coding, classification is that the A universe earth, B new word thing, C human body, D human society, E food live that row, F affective behavior, G agricultural, H industrial technology medicine, J communications and transportation, K economy and trade merchant duty, L politics and laws, M military affairs, N historical geography, O culture and arts, P philosophical education, the legend of Q religious belief, R material object, S thing situation, T measure, U other, each sport has the subitem of different numbers, and subitem is shown related word or well-known phrase.
The result is as follows for the love classification and Detection: VDF
◆ love/first love ◆ A
First love | admire fondly | emotionally | harbour the amorous thoughts of spring | think the spring | the lover | seek a spouse | pay court to | express love | yearning between lovers | unrequited love | the innocent childhood friend | be innocent playmates | first awakening of love | one-sided wish
◆ a love/secret meeting of lovers ◆ B
A secret meeting of lovers | a secret meeting of lovers | appointment
It is pretty for quiet woman, waits me in the corner of a city wall; Like and lose that the Chu that paces up and downs scratches one's head." the quiet woman of Book of Songs ⊙ "
Dawn is towards cloud, and be row rain dusk, and every morning and evening is under the balcony.The ⊙ Song Yu of the Warring states " Gaotang tax preface "
……
◆ love/affectionate ◆ C
Embrace | nestle | attachment | kiss | kiss | kiss | throw the bosom and go into to embrace | bill and coo
Lovely and innocently not frighteningly guess and the clothing bosom of falling people of sleeping.The refined treasure of Song Zhu ⊙ " peaceful pleasure "
◆ love/sweet heart ◆ D
The sweet heart | the lover | lovers | lover | object | friend | the lover | the lover
A lover sees a Xishi in his beloved | and cuckold looks mung bean---to last eye.
◆ love/love ◆ E
Love | love | deep love | loved | true love | conjugal love | be deeply in love | susceptible | unreasoning passion | touching | deeply attached to each other | sentimentally attached with a sudden impulse | find each other congenial | lingering sentiments | be passionately devoted | sincere | deep love justice is sincere | quietly send the message of love | full of tenderness | flash amorous glances | stealthily give the glad eye | eyebrow becomes tacitly consent to | flirtatious | on very close terms | like the shadow following the person | show endearments | like honey mixed with oil | exceedingly sentimental | bill and coo | deeply attached to each other | throw glue like lacquer | well-matched pair | gifted scholar and beautiful woman
The lovely lady, the gentlemen's good mate." Book of Songs ⊙ closes Ju "
Must become more dead, be willing to that doing mandarin duck does not admire celestial being than what diction of order.Lu ⊙ of Tang is according to adjacent " the ancient meaning in Chang'an "
The west of sunrise in the east rain, the road is that mercilessness is in love.The ⊙ Liu Yu of Tang tin " ancient folk songs with love as their main theme two head "
Be willing to do pair of love birds in the sky, be willing to be two trees with branches interlocked on ground.The ⊙ of Tang poses as easily " long song resolutely "
Body does not have the color phoenix round trip flight wing, and hearts which beat in unison are linked.Merchant Lee ⊙ of Tang conceals " untitled "
……
◆ love/departure ◆ F
Be reluctant to part | be distressed at parting | reluctant to part with | loath to part from each other | separate forever
The dead contract of giving birth to is wealthy, with sub-accepted theory, holds the hand of son, lives together to a ripe old age with son." Book of Songs ⊙ beats a drum "
The every trade overline is capable, gives birth to monarch and takes leave of.Chinese ⊙ anonymous person " ancient poetry 19 first ⊙ every trade overlines are capable "
Difficult Hard To Say Goodbye when meeting each other, the unable all sorts of flowers of east wind are residual.Merchant Lee ⊙ of Tang conceals " untitled "
Susceptible from ancient times the wound parted, and more that may, be treated coldly the clear autumn air.Song ⊙ Liu Yong " rain continuous heavy rain bell "
Deeply attached, wedding day such as dream bear turning round and look at the Magpie Bridge return road.Song ⊙ Qin Guan " Magpie Bridge celestial being "
……
◆ love/yearning between lovers ◆ G
Yearning between lovers | miss | care for | lovesickness
Yearn day and night | toss about in bed miss | keep gazing with great anxiety | look forward to sth. with great eagerness | one day seems like a year | and one day apart seems like three years | lie awake all night | feel deep anxiety about | feel very depressed at the prospect | seeing the thing one thinks of the person | red beans that inspirit the memory of the love
Free from restraint, toss about." Book of Songs ⊙ closes Ju "
Do not see gentleman, Distressed is as transferring famine." Book of Songs ⊙ Ru Fen "
So-called she is water one side." Book of Songs ⊙ In the Center of Water "
Think monarch such as full moon, subtract clear and bright light every night.The ⊙ of Tang opened for nine ages " tax derives from going out of monarch "
If yearning between lovers does not have day and night, and is vast and mighty the stream ripple.The ⊙ li po of Tang " posting 12 head far away " its six
Hate the letter that do not had a tidal wave of mutually, the yearning between lovers beginning feels that the sea is non-dark.The ⊙ of Tang Pai Chu-yi " wave is washed the sand "
……
◆ love/fickle ◆ H
Fickle | inconstant in love | mercilessness | faithless | the heartless lover | the heartless man
Abandon the old for the new | changeable | shift one's love to another person | fickle few justice | love and newly forget old friends | sympathize with newly abandon old | despise the poor and curry favour with the rich | desolate youth stranger | make a clean break
Generally speaking, the advantage of intelligent Chinese culture dictionary system is to take " bridge " for learner shop " road ", becomes objective passive type study and is subjective guiding study.The present invention can be applied to the cultural dictionary of the various in the world pictograph family of languageies or the alphabetic writing family of languages and integrate.

Claims (10)

1. the guiding of cultural ancient books and records is learnt solution, the netted radiation system in polynary knowledge point that it is characterized in that word, speech, sentence, a piece of writing, various cultural ancient books and records cross-linked, various query composition with unique code means, because of the realization mechanism of people because of the different study level setting of need, this focus of screen full text, system uses open data structure, form omnibearing knowledge association, the key element that constitutes intelligent Chinese culture dictionary system is basic pronunciation and meaning character library, compression cutting dictionary, compression data file form and coded query mode.
2. the solution of an input method of Chinese character that derives from from intelligent Chinese culture dictionary system coding inquiry mode, it is characterized in that forming China's study code system by words input magazine, Chinese character information storehouse, word note storehouse and classification words and phrases storehouse four part of module, be different from other input methods and have only single words input magazine module
The words input magazine realizes basic Chinese character input, is the main literal input of finishing with phonetic, has individual character haracter pattern rule stroke mode concurrently, and mistake is known the other alternative means of debating of Chinese character, and difficulty is recognized and misdeemed that Chinese character can use the Chinese character input; The Chinese character information storehouse provides Chinese character radicals and strokes, pronunciation word sound, collocations information; Word note storehouse is included encyclopaedia clauses and subclauses commonly used in except that Chinese language senses of a dictionary entry clauses and subclauses; Classification words and phrases storehouse is a trunk with scape, thing, people, thing, reason, feelings, refinement word's kinds content.
3. cultural ancient books and records guiding study solution according to claim 1, it is characterized in that in the machine of standard the basic pronunciation and meaning character library of abstract one deck on GB, BIG5 sign indicating number, GBK sign indicating number and the UNICODE sign indicating number basis, the numerous different letter, the harmonious sounds shape justice information that contain Chinese character, the sealing language material is accomplished to determine the uniqueness of Chinese-character shape-pronunciation Yi Tezheng, be different from usually open language material is determined the employed statistical probability likelihood method based on natural language understanding of Chinese character
Double byte is held 65536 codings, the Chinese character abstract code preface of pronunciation and meaning character library adopts double byte sign indicating number position 1-26000, be single word information, wherein 25900 are line feed information, and 25901-25999 is the byte alphabetical information, 26000 is superwood information, promptly when the Chinese character abstract code was 26000, two bytes were multi-character words information for appending China and foreign countries' pictograph information greater than 26000 subsequently, pronunciation and meaning character library is to be the standard recording data structure of measurement unit with " bit position ", wherein:
The JJNM integer type is simplified internal code;
The FJNM integer type is numerous allosome internal code;
PY1 byte PY1 PY2 PY3 determine the status flag of phonetic notation 1 phonetic notation 2 jointly, contain the syllable tone, often read,
The PY2 byte reads not only, oldly read, old but read, information such as dialect, when a certain Chinese-character pronunciation surpass more than two or
When PY3 byte abnormity Chinese character is above above two, appends a record and deposit its information, and in the BZW weighting;
The V1 byte is the pronunciation file sequence number of corresponding phonetic notation 1;
The V2 byte is the pronunciation file sequence number of corresponding phonetic notation 2;
The HZJG byte is the character structure information of simplified and traditional allosome, as single character, about, up and down, combinde rqdical character etc.;
The JBH byte is the stroke number of simplified Chinese character;
The FBH byte is the stroke number of numerous variant Chinese character;
The JHZDZ nybble is that the order of strokes observed in calligraphy of simplified Chinese character is write word address;
The FHZDZ nybble is that the order of strokes observed in calligraphy of numerous variant Chinese character is write word address;
The JWBH byte is five stroke code prefaces of simplified Chinese character, and horizontal 1 perpendicular 2 casts aside 3 presses down 4 foldings 5;
The FWBH byte is five stroke code prefaces of numerous variant Chinese character, and horizontal 1 perpendicular 2 casts aside 3 presses down 4 foldings 5;
The JPP1 byte is simplified Chinese character radical 1 an index sequence number;
The JPP2 byte is simplified Chinese character radical 2 index sequence numbers;
The JPP3 byte is simplified Chinese character radical 1 a diaphone sequence index number;
The JPP4 byte is simplified Chinese character radical 2 diaphone sequence indexs number;
The FPP1 byte is numerous variant Chinese character radical 1 index sequence number;
The FPP2 byte is numerous variant Chinese character radical 2 index sequence numbers;
The FPP3 byte is numerous variant Chinese character radical 1 diaphone sequence index number;
The FPP4 byte is numerous variant Chinese character radical 2 diaphone sequence indexs number;
The ZY1 byte is Chinese character harmonious sounds information, i.e. individual character corresponding final;
The ZY2 double byte is the corresponding new rhythm information of China;
The ZY3 double byte is corresponding thirteen rhyme schemes information;
The ZY4 double byte is corresponding pendant literary composition rhyme information;
The ZY5 double byte is the corresponding positive rhythm information of speech woods;
The ZY6 double byte is the corresponding bent rhythm information in middle continent;
The ZY7 double byte is corresponding ancient rhythm information;
The BZW byte is the Chinese character zone bit, have next Chinese character for this record append the Chinese character sign, the word source indicator is arranged,
The writing sign, group speech sign, collocation sign etc.;
The corresponding origin of Chinese characters module's address of ZYDZ nybble;
The corresponding Chinese character copybook of ZJDZ nybble module's address;
The corresponding Chinese character group of ZCDZ nybble speech module's address;
The corresponding Chinese character collocation of DPDZ nybble module's address;
Above structure length is 56 bytes, and each writes down a corresponding Chinese character information,
At cultural ancient books and records idiom material characteristic, realize screen this focus in full by the composing compress technique, as the precondition of extending the knowledge point, all literal storages of system all are to be based upon on level of abstraction Chinese character meaning and pronunciation storehouse and the word compression basis, storehouse, but not the Hanzi internal code of file structure formula, during system transplantation, according to the pronunciation and meaning character library table of comparisons, it is the corresponding relation of internal code, substitute original pronunciation and meaning character library, other data file need not be revised, and the upgrading and the dilatation of directly serving system version shared in the compression of data file.
4. according to claim 1 or the described cultural ancient books and records guiding study solution of claim 3, it is characterized in that on basic pronunciation and meaning character library, having compression dictionary and cutting dictionary, Chinese character becomes pattern storage in original double byte or the nybble, be abstract pronunciation and meaning sign indicating number of various ways and the storages of compression speech pattern such as individual character double byte, double word double byte, three word double bytes, multiword double byte, individual character nybble, structurally make the compressed storage of Chinese character and the dilatation of China and foreign countries' literal melt one altogether
The addressing space of compression dictionary is 26001-65536, and wherein 26001-42000 is the compressions of two words, and 42001-52000 is the compressions of three words, and 52001-60000 is the compressions of four words, and 60001-65536 is the multi-character words compression,
Two words compression storehouse, each speech takies 6 bytes, be respectively pronunciation and meaning character code, the syllable tone of first word, the pronunciation and meaning character code of second word, syllable tone, 26001-36000 is two words index commonly used, 36001-42000 is archaic Chinese and the shared addressing space of scientific and technological term two words, can distinguish the two according to word senses of a dictionary entry sign in the compression data file
Three words compression storehouse, each speech takies 9 bytes, be respectively pronunciation and meaning character code, the syllable tone of first word, the pronunciation and meaning character code of second word, syllable tone, the pronunciation and meaning character code of the 3rd word, syllable tone, 42001-48000 is three words index commonly used, and 48001-52000 is archaic Chinese and the shared addressing space of scientific and technological term three words
Four words compression storehouse, each speech takies 12 bytes, be respectively pronunciation and meaning character code, the syllable tone of first word, the pronunciation and meaning character code of second word, syllable tone, the pronunciation and meaning character code of the 3rd word, syllable tone, the pronunciation and meaning character code of the 4th word, syllable tone, 52001-57000 is four words index commonly used, 57001-60000 is archaic Chinese and the shared addressing space of scientific and technological term four words
Multi-character words compression storehouse, each speech takies byte and determines according to the difference of prefix addresses position, storehouse, the prefix addresses length of each multi-character words correspondence is three bytes, behind the prefix addresses table of storehouse, be multi-character words information, be respectively the pronunciation and meaning character code of first word, syllable tone, the pronunciation and meaning character code of second word, syllable tone, the pronunciation and meaning character code of the 3rd word, syllable tone, ... the pronunciation and meaning character code of last word, syllable tone, 60001-63000 is multi-character words index commonly used, 63001-65536 is archaic Chinese and the shared addressing space of scientific and technological term multi-character words
65536 addressing spaces have been formed in basic pronunciation and meaning character library and the combination of compression dictionary, constitute the basic compressed storage mode of intelligent Chinese culture dictionary;
The cutting dictionary is the basis that braille is cut speech, wherein two, three, four, the addressing space of multi-character words is three bytes, the Senior Three position is a cutting dictionary classification, the back is a 1-2097152 addressing sequence number, the word explanation address of each segmenting word correspondence is a nybble, the cutting dictionary also is the addressing basis of dictionary mode, comprises various ways dictionaries such as Chinese idiom, allusion, Chinese language, encyclopaedia, specialty, classification, has common vocabulary SEQ.XFER.
5. according to claim 1 or claim 3 or the described cultural ancient books and records guiding study solution of claim 4, it is characterized in that the compression data file form is as one of key element that constitutes intelligent Chinese culture dictionary system, with basic pronunciation and meaning character library, the embedding combination of compression cutting dictionary, make that the data file of text is relevant, note, inquiry, complicated and simple, set type, write, full spectrum informations such as compression, the compression data file form is divided into vocabulary formula senses of a dictionary entry form, sentence formula unit compressed format, the matched combined form, kinds such as individual character format write
Vocabulary formula senses of a dictionary entry form is mainly determined vocabulary note address by three byte addressing spaces, the not synonymity of vocabulary and the study grade of this speech and usage frequency hook, grade, frequency set up 8 grades separately, corresponding different senses of a dictionary entry contents, the study grade is set at student's degree of understanding, usage frequency is grasped the Chinese complexity at foreign friend and is set
Lexical or textual analysis is divided into the outer lexical or textual analysis of Ci hai lexical or textual analysis, encyclopaedia lexical or textual analysis, professional lexical or textual analysis, Chinese language lexical or textual analysis and the Chinese, in Chinese lexical or textual analysis, words and phrases content punctuation mark is cut the speech storage by compression, outside the Chinese, in the lexical or textual analysis foreign language such as English, method, moral, west, Russian are stored with Huffman compressed format
In the vocabulary ordering, switch on forms such as sound preface, backward, shape preface, class preface synchronously according to concordance list, the sound preface is pressed whole syllable word-building orderings of vocabulary, backward is pressed the reverse syllable word-building ordering of vocabulary, the shape preface is pressed the radical stroke word-building ordering of vocabulary, and the class preface is pressed the categorical attribute word-building ordering of vocabulary
Provide relevant online network address for proper noun, realize that content is interconnected on the net;
Sentence formula unit compressed format is divided into words storage in punctuation mark mark, the sentence, the shielding of group speech and word explanation four parts, and each record contains maximum four of sentence, and minimum is one,
The punctuation mark mark accounts for 4 bytes, is every punctuation bit and symbol code, and symbol code is: in 0 nothing or the Modern Chinese " " 1,2; 3,4? 5.6!7:8:“9。”10?”11!”12“......,13......。”14:‘15。', the group speech shields the relevant group of speech that listed phrase requirement stops this character segmentation dictionary, avoids producing ambiguity,
The writings in the vernacular note of word in the sentence of word explanation location,
Sentence formula unit compressed format both can be at simple sentence, parallelism sentence formula type such as common saying, proverb, distich, crossword puzzle etc., again can be at length formula article type, and as poem Qu Wen, the latter must increase the management in table of contents index library,
Table of contents index library content is exercise question address 3 bytes, author address 2 bytes, address 4 bytes are explained in enjoyment, vernacular translation address 4 bytes, English translation address 4 bytes, sentence formula packed record start bit and sentence formula packed record length 4 bytes, article type 1 byte, comprise Music Bureau, old style, regulated verse, the poem of four lines, five speeches, seven speeches etc. about poetic prose, article content defines 1 byte, comprises singing of history poem, frontier fortress's poem, boudoir repinings poem, chants thing poem, discipline You Shi etc., article specific item and grade mark 2 bytes about poetic prose, historical dynasty sign 1 byte etc.
The combination of the segmentation project in sentence formula unit's compressed format and table of contents index library thereof and catalogue, coded query mode is finished the polymorphic combination of chapter and sentence and is searched;
The click incision of the corresponding words of matched combined form, by clicking the collocation state that individual character and cutting word enter vocabulary, not synonymity according to the words lexical or textual analysis, corresponding different collocation forms, the content increase and decrease is variable, be mainly items such as the relevant synonym of this words, near synonym, antonym, the discrimination of close word, similar vocabulary, related vocabulary, word-building, phrase application, a group sentence example, Chinese idiom, allusion, well-known phrase, common saying proverb, distich, two-part allegorical saying, maxim, riddle
The compressed storage of collocation form is close with formula unit's compressed format, except that showing matched combined, also can click the classification state that enters a related words words and phrases piece of writing,
Every collocation has autonomous level setting and shielding is set, and provides correspondingly linguistic context for the user of different regions, different levels grasps vocabulary, at teaching practice, can increase all kinds of collocation vocabulary data;
The corresponding Chinese-character writing of individual character format write, adopt curve fitting algorithm to determine the presentation direction of stroke, and carry out AND operation with the vector font library type matrix, finish writing of Chinese character by pen, make the corresponding one by one animation order of strokes observed in calligraphy of Chinese character more than 20,000, compressed the data-storing space, write font and contain the Song typeface, regular script, black matrix, lishu
That the format write parameter index relates to is common, the sign of horizontal line, vertical line, three sections lines is set, the differentiation of the first stroke of a Chinese character, son pen, line segment initial, stop coordinate, line segment width, vigour of style in writing state etc.
6. according to claim 1 or claim 3 or claim 4 or the described cultural ancient books and records guiding study solution of claim 5, it is characterized in that the coded combination of coded query mode by sound, rhythm, shape, justice and asterisk wildcard and all kinds of conditions, finish any search of words and phrases, realize required information location, the statistical function that has autonomous limited range simultaneously, " intelligence " characteristic of embodiment system;
Press cultural ancient books and records content such as poem, speech, bent, literary composition, distich, proverb, common saying, Chinese idiom, formal classifications such as vocabulary are with sentence speech sound preface storage coding, the code database prefix is a sound sequence index table, first sound preface with preceding double word syllable in the sentence is AB, AC, AD ... mode, pass through difference, location immediate addressing scope, behind sound sequence index table, be row's sentence storage structure, the encode Chinese characters for computer capacity is 15 words in the maximum sentence, exceed and block, each record length is 28 bytes, 15 Chinese character syllable sequence numbers take 15 bytes, the position takies 5 bytes in three words in pronunciation and the sentence again, and every attribute comprises grade, mood, the features such as position of corresponding formula unit's compressed format take 2 bytes, sentence formula unit compression address, storehouse takies 3 bytes, and address, table of contents index library takies 3 bytes;
By sentence formula unit compression address, storehouse, and embedded pronunciation and meaning character library and compression dictionary, at addressing of quick sound preface or gamut addressing, can generate 15 word information temporary files immediately, include sound, rhythm, shape characteristic information, according to the coding collocation, finish universal query, the coding collocation of its middle pitch, rhythm, shape is a unit with the word, and various combination is unrestricted;
Various sounds, rhythm, shape characteristic information remove the correspondence storage sequence number in the pronunciation and meaning character library, also have arrangement code table storehouse separately, serve a Chinese word coding inquiry mode;
The coded query scheme is: 26 English alphabet keys, 10 numerical keys, little, in, the braces key,?, * ,-,+,/,, ^,! , @, #, ﹠amp; , |:, ' etc. symbolic key;
Asterisk wildcard? be positioned individual character, * is positioned any position;
26 English alphabet keys are combined as an interior phonetic transcriptions of Chinese characters syllable, spelling, the assembly of contracting all can, contract to piece together at zh, ch, sh and also can adopt z, c, s form, " ' " number as the syllable space character, at the Taiwan with can not use the crowd of phonetic, code table is provided, the contrast relationship of the Chinese phonetic script and pinyin syllable is arranged, the Chinese phonetic script focus input shown in can pressing;
Can add tone information behind syllable, for " ^ " adds 0 to 5 numerical key, 0 for softly, 1 be high and level tone, 2 for rising tone, 3 for last sound, 4 is falling tone, also can omit in the operation " ^ ";
The search operaqtion of rhythm in the relevant sentence is divided into common user's simple or compound vowel of a Chinese syllable retrieval, the new rhythm of the China of free verse written in the venacular, thirteen rhyme schemes retrieval, and the pendant literary composition rhyme retrieval of ancient poetry, the positive rhythm retrieval of the speech woods of ancient speech, the bent rhythm retrieval in Central Region of Yuan songs,
The simple or compound vowel of a Chinese syllable retrieval format be asterisk wildcard+" ("+simple or compound vowel of a Chinese syllable+tone+... + simple or compound vowel of a Chinese syllable+tone+... + ") "+asterisk wildcard, tone can omit, and can use " | " or operational character between the simple or compound vowel of a Chinese syllable,
The new rhythm retrieval format of China be asterisk wildcard+ " ("+level and oblique tone symbol or rhythm portion+...+level and oblique tone symbol or tone rhythm portion+...+") "+asterisk wildcard, the level and oblique tone sign convention be "! " for flat, " " is narrow, " # " is for can narrowly putting down, and tone rhythm section forms by three, and first is tone, and 1 is high and level tone; 2 is rising tone, and 3 is upper sound, and 4 is falling tone, and 0 for softly, and 6 for entering the moon, and 7 for entering sun; 8 on entering, and 9 for entering, and 5 is light for entering, and rhythm section is 18 kinds by the word cent, 01 fiber crops, 02 ripple; 03 song, 04 all, and 05,06 youngster, 07 is neat, and 08 is little; 09 opens, 10 aunts, and 11 fishes, 12 marquis, 13 persons of outstanding talent, 14 is cold; 15 traces, 16 Tang, in 17 heptan, " | " or operator can be used between level and oblique tone symbol or the rhythm section in 18 east.
Thirteen rhyme schemes retrieval format be asterisk wildcard+" ("+level and oblique tone symbol or rhythm portion+... + level and oblique tone symbol or tone rhythm portion+... + ") "+asterisk wildcard, level and oblique tone symbol and tone regulation are the same, and rhythm portion is 13 kinds by the word cent, 21 grow dim, 22 spindle waves, and 23 squint, 24 clothing phases, 25 aunts Soviet Union, 26 Huailai, 27 dust heaps, 28 distant, 29 oil are asked, before 30 speeches, 31 people's occasion, 32 river sun, " | " or operational character can be used between level and oblique tone symbol or the rhythm portion in 33 Middle East
The civilian rhyme retrieval format of wearing be asterisk wildcard+"<"+level and oblique tone symbol or rhythm portion+... + level and oblique tone symbol or tone rhythm portion+... + ">"+asterisk wildcard, the level and oblique tone sign convention is the same, tone 1 is last flat, 2 for flat down, 3 is last sound, and 4 is falling tone, and 5 are entering tone, rhythm portion is divided into 100 Lu Yun by the level and oblique tone four tones of standard Chinese pronunciation
Last rhymes in the even tone portion is 01 east, 02 winter, and 03 river, 04,05 is little, 06 fish, 07 anxiety, 08 is neat, and 09 is good, 10 ashes, 11 is true, 12 literary compositions, 13 yuan, 14 is cold, and 15 delete,
Following rhymes in the even tone portion is 01 earlier, and 02 is desolate, 03 meat and fish dishes, and 04 milli, 05 song, 06 fiber crops, 07 sun, 08 heptan, 09 green grass or young crops, 10 steam, and 11 is outstanding, and 12 invade, 13 Tans, 14 salt, 15 is salty,
Last sound portion is 01 Dong, and 02 is swollen, and 03 says, 04 paper, and 05 tail, 06 language, 07 Yu, 08 Chestnut, 09 crab, 10 bribe, 11 cross boards at the rear of an ancient carriage, 12 kisses, 13 Ruan, 14 droughts, 15 in tears, and 16 mill, 17 Xiao, 18 is skilful, and 19 is white, 20 approves, 21 horses, 22 support, 23 stalks, 24 is widely different, and 25 have, and 26 get into bed, 27 senses, 28 a kind of jades, 29 Half-grown-beans,
Falling tone rhythm portion 01 send, 02 Song, and 03 is deep red, 04 Set, 05 is not, and 06 drives, and 07 meets 08 cease raining or snowing, 09 Thailand, 10 divinatory symbol, 11 teams, 12 shakes, 13 ask, 14 are willing to, 15 writing brushes, 16 remonstrate with, 17 graupels, 18 make a whistling sound, and 19 imitate, and No. 20,21 A, 22 Ma, 23 ripple, and 24 respect, 25 footpaths, 26 excuses, 27 ooze, and 28 survey, and 29 is gorgeous, and 30 fall into
Going into sound portion is 01 room, and 02 is fertile, and 03 feels, 04 matter, and 05 thing, 06 month, 07 how, and 08 is crafty, 09 bits, 10 medicines, 11 footpaths between fields, 12 tin, 13 duties, 14 seize, and 15 close, 16 Leaf, 17 are in harmony,
Can use " | " or operational character between level and oblique tone symbol or the rhythm portion,
The positive rhythm retrieval format of speech woods be asterisk wildcard+" ["+level and oblique tone symbol or rhythm portion+... + level and oblique tone symbol or tone rhythm portion+... + "] "+asterisk wildcard, the level and oblique tone sign convention is the same, and tone 1 is an even tone, 3 is last sound, and 4 is falling tone, and 5 are entering tone, 6 go into to do even tone, and 8 go into to do to go up sound, and 9 go into to do falling tone, rhythm portion is even tone, goes up 14 ones of sound, falling tone time-divisions, corresponding 01-14 numbering, rhythm portion is divided into 5 ones, corresponding 15-19 numbering during for entering tone, can use " | " or operational character between level and oblique tone symbol or the rhythm portion
The bent rhythm retrieval format in Central Region be asterisk wildcard+" { "+level and oblique tone symbol or rhythm portion+... + level and oblique tone symbol or tone rhythm portion+... + " } "+asterisk wildcard, the level and oblique tone sign convention is the same, and tone 1 is an even tone, and 3 is last sound, 4 is falling tone, and 6 go into to do even tone, and 8 go into to do to go up sound, 9 go into to do falling tone, make even tone on a, and b removes to do even tone, rhythm portion is 19 kinds by the word cent, 01 eastern clock, 02 river sun, 03 think of, 04 is little together, 05 fish mould, 06 all comes, and 07 is very civilian, 08 Han Shan, 09 Huan is joyous, and 10 is congenital, 11 Xiao Hao, 12 song dagger-axes, 13 family fiber crops, 14 cars hide, 15 heptan green grass or young crops, 16 outstanding marquis, 17 seek and invading, 18 prisons are salty, and 19 honest and clean fibres can use " | " or operational character between level and oblique tone symbol or the rhythm portion:
One of form retrieval mode of word press the first sum of preface of Chinese character radicals corresponding to the horizontal first stroke of a Chinese character 6 for the radicals by which characters are arranged in traditional Chinese dictionaries retrieval in the sentence speech, holds up pen 7, casts aside the first stroke of a Chinese character 8, presses down the first stroke of a Chinese character 9, turns up pen 0, according to the first stroke of a Chinese character of " Ci hai " radicals by which characters are arranged in traditional Chinese dictionaries and three preface sign indicating numbers of picture number concordance list composition radicals by which characters are arranged in traditional Chinese dictionaries,
The horizontal first stroke of a Chinese character: 601 1,6,020,603 factories, 604 Na, 605 Contraband, 606 do, 607 workers, 608 soil, scholar, 609 Lv, European-allies, 610 is big, and 611 is towering, 612 In-particular, 613 Rolling, 614 cun, 615 shoot a retrievable arrow, 616 , 617 days, 618 Weis, 619 is old, 620 twenty,
Figure A031040500007C1
621 wood, 622,623 are not, 624 dogs, 625 is bad, 626 cars, Trucks, 627 dagger-axes, 628 ratios, 629 Ji, 630 teeth, 631 watts, 632 633 jade, 634 show, and 635 go, and 636 is sweet, 637 stones, 638 penta, 639 dragons, Long, 640 fork-like farm tools used in ancient China, 641 642 ears, 643 Asias, Asia, 644 ministers, 645
Figure A031040500007C4
The west, 646 647 and, 648 pages, Page, 649 to, 650 wheats, Wheat, 651 Zhang, Long, 652 walk, 653 is red, 654 the bundle, 655 beans, 656 tenth of the twelve Earthly Branches, 657 occasion, 658 pigs, 659 green grass or young crops, 660
Figure A031040500007C6
661 rain, 662 leather, 663,664 665 is drooping, and 666 separate, 667 Huangs, and 668 drums,
Hold up pen: 701|, 702 foretell, 703 Dao, 704 Jiong, 705 , 706 mouthfuls, 707 mouthfuls, 708 towel, 709 mountains, 710 end, and 711,712 days, 713 days, 714 In 715,716 shellfishes, Tony 717 see, See, 718 industries, 719
Figure A031040500007C9
720 orders, 721 Shens, 722 fields, 723 by, 724 Si, 725 wares, 726 , 727 light, 728 worms, 729 meat, 730 Lu, Halogen, 731 li, 732 foots, , 733 Mian, Strider, 734 is non-, 735 Chi, Tooth, 736 tigers, 737 fragrant-flowered garlic, the 738th, 739 bones, 740 needleworks, 741 ancient cooking vessels, 742 is black
Cast aside the first stroke of a Chinese character: 801 Pie, 802 , 803 Ren, 804
Figure A031040500007C10
805 8,806 people, go into, 807 Qe, 808 , 809 Bao, 810 an ancient type of spoons, 811 youngsters, more than 812,813 Chi, 814 San, 815 Quan, 816 sunset, 817 For-additional, 818 balls, 8l9 Cannibals,
Figure A031040500007C11
Food, 820 Ns, Niu, 821 hands, 822 maos, 823 gas, 824 days, 825 The-Fan, 826,827 jin, 828 pawls, Zhao, 829 fathers, 830 months, 831Shi, 832 owe, 833 Feng, Wind, and 834 an ancient weapon made of bamboos, 835 Jin, gold, 836 give birth to, and 837 vow, 838 standing grain, 839 is white, 840 melons, 841 birds, Ukraine, 842 narrow-necked earthen jars, 843 tongues, 844 bamboos, , 845 mortars, 846 certainly, 847,848 blood, 849 boats, 850 looks, 851 bodies, 852 adopt, 853 paddy, 854 insect without feet or legs, 855 tortoises,
Figure A031040500008C1
856 jiaos, 857 ovum, 858 Cui, 859 Yu, Fish, 860 perfume (or spice), 861 ghosts, 862 broomcorn millets, 863 mouse, 864 noses, 865 a unit of measure used in ancient Chinas are pressed down the first stroke of a Chinese character: 901 Dian, 902 Tou, 903 Bing, 904 Ha, 905 Mi, 906 Yan, speech, 907 Zhuang, slit bamboo or chopped wood, 908 is wide, 909 die, 910 Men, Door, 911 Rui, 912 Xin, 913 Http, 914 Chuo, 915 literary compositions, 916 sides, 917 fire, 918 buckets, 919 Xiangxi, 920 families, 921 Woo, 922 hearts, 923 Epileptic, 924 is upright, and 925 is profound, 926 also, 927 Qi, Chestnut, 928 clothing, 929 sheep,
Figure A031040500008C3
930
Figure A031040500008C4
931 meters, 932 sufferings, 933
Figure A031040500008C5
934 sounds, 935 head, 936 height, 937 938 fiber crops, 939 deer,
Turn up pen: 001 left folding, 002 right folding, 003 Qian, 004 Jie, 005 left Fu, 006 right Fu, 007 cutter, 008 power, 009 010 Si, 011 again, 012 Yin, 013 Ji,
Figure A031040500008C8
014 corpse, 015 oneself, the sixth of the twelve Earthly Branches, 016 bow, 017 Cao, 018 woman, 019 is little, 020 son, lonely, 021 horse, horse, 022 Si, yarn, 023 Chuan, 024 Nie,
Figure A031040500008C9
Then, 025 chi, 026 slit bamboo or chopped wood, 027 Mother, 028 water, 029 people, 030 Shu, 031 skin, 032 Bo, 033 lance, 034 is blunt, 035 plumage, 036 is subordinate to:
The form retrieval mode of word two is five strokes in the sentence speech, and corresponding to 1 horizontal stroke, 2 is perpendicular by the first sum of preface, and 3 cast aside, and 4 press down, 5 foldings, and 0 complement code, individual character gets one, two, three, four, end pen sign indicating number, and the every word of multiword is got preceding two yards;
Three of the form retrieval mode of word are sound shape combination in the sentence speech, add first and end stroke sign indicating number formation by sound first letter of word;
The form retrieval mode of word four be stroke number in the sentence speech, press whole stroke number two-bit digital codes of word and import, between the loigature so that " ' " cuts apart;
The form retrieval mode of word five is the word structure in the sentence speech, number is sign with "+", 00 single character structure, 10 left and right sides structures, about 11 two modular constructions, 12 left, center, rights, three modular constructions, 20 up-down structures, two modular constructions about in the of 21,22 upper, middle and lowers, three modular constructions, 30 package structures, 31 upper left package structures, 32 upper right package structures, 33 lower-left package structures, 34 bottom right package structures, package structure on 35,36 times package structures, 37 right package structures, 38 interior package structures;
The form retrieval mode of word six be the complement code state in the sentence speech, behind sentence speech sound sign indicating number, number is sign with "/", adds the radicals by which characters are arranged in traditional Chinese dictionaries sound mother letter that condenses according to the order of sequence, and the alphabetical number of corresponding Chinese character radicals can be less to one;
No matter sound sign indicating number or font code can use " Yu ﹠amp between the word in the sentence speech; " operational character, " or | " operational character;
The sentence word and search has study grade setting simultaneously, number be sign with ": ", divides the 1-8 level, after to add " 0 " at the corresponding levels for only detecting, not having " 0 " is below the detection corresponding levels;
The sentence word and search can carry out number of words and limit, and number is sign with "=", adds the word number;
The sentence retrieval has the subclass setting, with " " number be sign, divide classifications such as " t " dynasty, " p " author, " s " poetic prose type, " c " word, " q " name of tune, " z " Za Ju, " n " content, " f " mood, after connect the corresponding inquiry of combination of numbers code table.
7. according to claim 1 or claim 3 or claim 4 or the described cultural ancient books and records guiding study solution of claim 5, the Real Time Compression that it is characterized in that graphic image is handled, promptly set up basic unit graphic image primitive mechanism, by personage, animal, spirit, plant, food, articles for use, equipment, building, traffic, military affairs, nature, universe, society, culture, abstract, other etc. the classification storage, provide static vectorial combination illustration and motion vector combination figure form, the online art editor who realizes property of participation and non-repeatability.
8. China according to claim 2 study code inputting method solution is characterized in that dictionary, dictionary function are incorporated in the input method,
The input attitude is common phoneticizing type input, has the assembly of contracting function;
The information attitude is " I " pattern, adds individual character keynote coding, and radicals by which characters are arranged in traditional Chinese dictionaries, stroke, pronunciation, the senses of a dictionary entry, collocations, the sentence-making example information of Chinese character is provided;
The note attitude is " U " pattern, adds phonological encoding, both can be at the individual character multitone, again can be at the multiword word, the former correspondence " I " pattern-coding, the latter is except that Chinese language senses of a dictionary entry clauses and subclauses, also include encyclopaedia clauses and subclauses commonly used in, and corresponding classification attitude " V " pattern-coding is provided;
The classification attitude is " V " pattern, add the classification coding, classification is that the A universe earth, B biology, C human body, D human society, E food live that row, F affective behavior, G agricultural, H industrial technology medicine, J communications and transportation, K economy and trade merchant duty, L politics and laws, M military affairs, N historical geography, O culture and arts, P philosophical education, the legend of Q religious belief, R material object, S thing situation, T measure, U other, each sport has the subitem of different numbers;
Chinese character stores corresponding level of abstraction Chinese character meaning and pronunciation storehouse, can complicated and simple synchronous switching.
9. learn solution, China's study code inputting method solution according to claim 1 or the described cultural ancient books and records guiding of claim 2, the pardon that it is characterized in that cultural ancient books and records and dictionary scope, under the unified standard of basic pronunciation and meaning character library, compression cutting dictionary, compression data file form and coded query mode, delimitation according to classification, grade, subclass, omnibearing priori association, particular content relates to the literature dictionary, Confucian classics, history, philosophy and literature, private school's reading matter, cultural resources such as opera calligraphy and painting, garden architecture, medicine culinary art, wushu folk custom.
10. learn solution, China's study code inputting method solution according to claim 1 or the described cultural ancient books and records guiding of claim 2, it is characterized in that the present invention can be applicable to fields such as microcomputer, PDA, notebook, panel computer, network communication, and occur with forms such as unit product, networking products, digital library, education network sight, printed matter, audio-visual products.
CNA031040500A 2003-02-17 2003-02-17 Intelligent Chinese cultural dictionary system Pending CN1523518A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA031040500A CN1523518A (en) 2003-02-17 2003-02-17 Intelligent Chinese cultural dictionary system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNA031040500A CN1523518A (en) 2003-02-17 2003-02-17 Intelligent Chinese cultural dictionary system

Publications (1)

Publication Number Publication Date
CN1523518A true CN1523518A (en) 2004-08-25

Family

ID=34282163

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA031040500A Pending CN1523518A (en) 2003-02-17 2003-02-17 Intelligent Chinese cultural dictionary system

Country Status (1)

Country Link
CN (1) CN1523518A (en)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100421110C (en) * 2005-08-31 2008-09-24 北京金山软件有限公司 Search method of dictionary data
CN101401087B (en) * 2006-03-15 2011-05-18 微软公司 Efficient encoding of alternative graphic sets
TWI417749B (en) * 2009-06-09 2013-12-01
CN103955523A (en) * 2014-05-09 2014-07-30 袁长宝 Retrieving method for level and oblique tones of Chinese characters of poems
CN104516866A (en) * 2013-09-26 2015-04-15 北大方正集团有限公司 Text along-line typesetting method
CN104645610A (en) * 2014-09-22 2015-05-27 北京乐动卓越科技有限公司 Game object identification code coding method and system
CN104866607A (en) * 2015-06-04 2015-08-26 北京信息科技大学 Dongba character interpretation database building method
CN105653659A (en) * 2015-12-29 2016-06-08 韩宏华 Method for recording and popularizing Wushu proverb in APP (Application) form
CN105892700A (en) * 2014-10-17 2016-08-24 朱庆祥 Chinese character component stroke coordinate Chinese character input method with Chinese character dictionary function
CN106708485A (en) * 2015-11-13 2017-05-24 北大方正集团有限公司 Electronic copybook temperature managing method and system
CN107451114A (en) * 2017-06-28 2017-12-08 广州尚恩科技股份有限公司 A kind of archaic Chinese semantic analysis and its system
CN107491543A (en) * 2017-08-24 2017-12-19 中国传媒大学 A kind of client-based calligraphy auxiliary exercise method and system
CN109766978A (en) * 2019-01-17 2019-05-17 北京悦时网络科技发展有限公司 A kind of generation method of word code, recognition methods, device, storage medium
CN116821271A (en) * 2023-08-30 2023-09-29 安徽商信政通信息技术股份有限公司 Address recognition and normalization method and system based on voice-shape code

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100421110C (en) * 2005-08-31 2008-09-24 北京金山软件有限公司 Search method of dictionary data
CN101401087B (en) * 2006-03-15 2011-05-18 微软公司 Efficient encoding of alternative graphic sets
TWI417749B (en) * 2009-06-09 2013-12-01
CN104516866A (en) * 2013-09-26 2015-04-15 北大方正集团有限公司 Text along-line typesetting method
CN104516866B (en) * 2013-09-26 2017-10-20 北大方正集团有限公司 The method of typesetting along word
CN103955523A (en) * 2014-05-09 2014-07-30 袁长宝 Retrieving method for level and oblique tones of Chinese characters of poems
CN104645610A (en) * 2014-09-22 2015-05-27 北京乐动卓越科技有限公司 Game object identification code coding method and system
CN105892700A (en) * 2014-10-17 2016-08-24 朱庆祥 Chinese character component stroke coordinate Chinese character input method with Chinese character dictionary function
CN104866607A (en) * 2015-06-04 2015-08-26 北京信息科技大学 Dongba character interpretation database building method
CN104866607B (en) * 2015-06-04 2018-01-12 北京信息科技大学 A kind of Dongba character textual research and explain database building method
CN106708485A (en) * 2015-11-13 2017-05-24 北大方正集团有限公司 Electronic copybook temperature managing method and system
CN106708485B (en) * 2015-11-13 2020-07-14 北大方正集团有限公司 Electronic copybook heat management method and system
CN105653659A (en) * 2015-12-29 2016-06-08 韩宏华 Method for recording and popularizing Wushu proverb in APP (Application) form
CN107451114A (en) * 2017-06-28 2017-12-08 广州尚恩科技股份有限公司 A kind of archaic Chinese semantic analysis and its system
CN107491543A (en) * 2017-08-24 2017-12-19 中国传媒大学 A kind of client-based calligraphy auxiliary exercise method and system
CN109766978A (en) * 2019-01-17 2019-05-17 北京悦时网络科技发展有限公司 A kind of generation method of word code, recognition methods, device, storage medium
CN116821271A (en) * 2023-08-30 2023-09-29 安徽商信政通信息技术股份有限公司 Address recognition and normalization method and system based on voice-shape code
CN116821271B (en) * 2023-08-30 2023-11-24 安徽商信政通信息技术股份有限公司 Address recognition and normalization method and system based on voice-shape code

Similar Documents

Publication Publication Date Title
Tymoczko Translation in a postcolonial context: Early Irish literature in English translation
Fortson IV Indo-European language and culture: An introduction
Pound Confucius: the Great digest, the Unwobbling pivot, and the Analects
Teng A reference grammar of Puyuma, an Austronesian language of Taiwan
Gigante Life: Organic form and romanticism
Grene et al. The philosophy of biology: an episodic history
Clark A medieval book of beasts: the second-family bestiary: commentary, art, text and translation
CN1523518A (en) Intelligent Chinese cultural dictionary system
Hobart The great rift
Zhongshu Patchwork: Seven Essays on Art and Literature
CN113268581B (en) Topic generation method and device
Blakney A Course in the Analysis of Chinese Characters...
Yıldırım Ottoman plants, nature studies, and the attentiveness of translational labor
Lan et al. A cognitive approach to the conceptual metaphors in Shi Jing (The Book of Poetry)
Jiang et al. A General Theory of Ancient Chinese
Hsieh Gold and jade filled halls: A cognitive linguistic study of financial and economic expressions in Chinese and German
McNaughton Reading & Writing Chinese Simplified Character Edition:(HSK Levels 1-4)
Lin Reduplicant vowels in Truku reduplication
CN112328095B (en) Four-purpose phonetic and shape code Chinese character input method and input platform without using number keys
Steeds Project Earth and Art’s Exposability
CN1086481C (en) New Chinese character code and Chinese character input keyboard thereof
Ornelas E HOʻOkohukohu I NĀ Limu O HAWAIʻI: Developing Methodologies for Limu Identification and Socio-Cultural Knowledge Enshrouded in the NŪPepa HAWAIʻI
Duoduo A Repertoire of Dongba Pictographs: Challenges and Solutions
CN1086480C (en) Real code coding method for Chinese characters and using keyboard thereof
Balemberg Mossop's revision parameters as analytical categories in the analysis of A song of ice and fire in Brazil

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication