CN107451105A - A kind of bright braille converting system based on new Chinese character holographic coding rule - Google Patents

A kind of bright braille converting system based on new Chinese character holographic coding rule Download PDF

Info

Publication number
CN107451105A
CN107451105A CN201710517639.0A CN201710517639A CN107451105A CN 107451105 A CN107451105 A CN 107451105A CN 201710517639 A CN201710517639 A CN 201710517639A CN 107451105 A CN107451105 A CN 107451105A
Authority
CN
China
Prior art keywords
chinese
holographic
character
pronunciation
code
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710517639.0A
Other languages
Chinese (zh)
Other versions
CN107451105B (en
Inventor
富明慧
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sun Yat Sen University
Original Assignee
Sun Yat Sen University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sun Yat Sen University filed Critical Sun Yat Sen University
Priority to CN201710517639.0A priority Critical patent/CN107451105B/en
Publication of CN107451105A publication Critical patent/CN107451105A/en
Application granted granted Critical
Publication of CN107451105B publication Critical patent/CN107451105B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/126Character encoding
    • G06F40/129Handling non-Latin characters, e.g. kana-to-kanji conversion
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Printers Characterized By Their Purpose (AREA)
  • Document Processing Apparatus (AREA)

Abstract

The invention provides a kind of bright braille converting system based on new Chinese character holographic coding rule, including:Text collection module, for obtaining Chinese-character text from outside;Pronunciation data storehouse, for storing the pronunciation of Chinese character;Pretreatment module is segmented, for from the Chinese-character text of outside acquisition, automatic or manual to insert the mark of word segmentation to text collection module;Holographic code for Chinese characters precompile module, for the Chinese-character text to be compiled into the coded format of holographic code for Chinese characters, and store into Chinese-character holographic file storage module;Chinese-character holographic file storage module, for storing the file of holographic code for Chinese characters form.Whether the present invention uses new holographic code for Chinese characters form stored as a file, it is determined that while Chinese character pattern, has also uniquely determined its pronunciation, it is also expressly that segmented with Chinese character below, contain full detail required during bright braille conversion.Using the present invention, the problems such as fundamentally overcoming " obscure " of generally existing, " misunderstanding " in current Chinese character Braille reading.

Description

A kind of bright braille converting system based on new Chinese character holographic coding rule
Technical field
The present invention relates to encoding of chinese characters and word processing field, and in particular to one kind is based on new Chinese character holographic coding rule Bright braille converting system.
Background technology
Chinese character is unique word in the world, and each word has " sound ", " shape ", " meaning " three key elements, " sound " OK In " meaning ", " meaning " accumulates in " shape ", and three is inseparable, indispensable.But the braille of current Chinese character, really a kind of phonetic side Case, because the phenomenon of unisonance multiword, a word multitone largely be present in Chinese, therefore after Chinese character changes into braille, can generally existing only Can not uniquely determine word meaning, with pronunciation so as to cause blind person's obscure, situation about even misreading when reading, this be also China promote and The biggest problem that popularization braille is faced.
With the popularization of the development of information technology, especially computer and braille display (hereinafter referred to as putting aobvious device) and general And create advantage thoroughly to solve the above problems.
The content of the invention
In view of this, it is necessary to for problems of the prior art, there is provided one kind is based on new Chinese character holographic coding Rule bright braille converting system, Chinese character is changed and stored using special form, by " sound " of Chinese character, " shape ", " meaning " is merged in same set of coding rule, and accuracy is expressed to improve the implication of bright braille conversion.
To achieve the above object, the present invention uses following technical scheme:
A kind of bright braille converting system based on new Chinese character holographic coding rule, including:
Text collection module, for obtaining Chinese-character text from outside;
Pronunciation data storehouse, for storing the pronunciation of Chinese character;Wherein, multiple different pronunciations of each polyphone are by according to certain Order is numbered, and one of pronunciation is set to give tacit consent to pronunciation;
Pretreatment module is segmented, for from the Chinese-character text of outside acquisition, automatic or manual to be inserted to text collection module Enter the mark of word segmentation;
Holographic code for Chinese characters precompile module, for combining the acquiescence pronunciation set in pronunciation data storehouse and participle pretreatment The mark of word segmentation inserted in module, the Chinese-character text is compiled into the coded format of holographic code for Chinese characters, and stored complete to Chinese character Cease in file storage module;
Chinese-character holographic file storage module, for storing the file of holographic code for Chinese characters form;
Wherein, the coded format of the holographic code for Chinese characters is:
The corresponding Chinese character of one holographic code for Chinese characters;
Preceding 2 byte of holographic code for Chinese characters is the ISN of the Chinese character;
Wherein one of the byte of holographic code for Chinese characters the 3rd is defined as segmenting identification code, to segment the different numerical value marks of identification code Know whether the Chinese character segments with next Chinese character composition;
4th byte of holographic code for Chinese characters is defined as pronunciation identification code, and the Chinese character is identified with the numerical values recited of pronunciation identification code Numbering corresponding to right pronunciation within a context;
The system also includes:
Text editing module is right for reading the file of holographic code for Chinese characters form from Chinese-character holographic file storage module Chinese character information and participle information in holographic code for Chinese characters are interpreted, and show corresponding Chinese-character text and the mark of word segmentation, for Family is reviewed and changed;When user modifies to Chinese-character text or the mark of word segmentation, synchronous vacations Chinese-character holographic file is deposited The holographic code for Chinese characters stored in storage module;
Phonetic notation editor module, it is right for reading the file of holographic code for Chinese characters form from Chinese-character holographic file storage module Chinese character information and pronunciation information in holographic code for Chinese characters are interpreted, and show the pronunciation letter of corresponding Chinese-character text and polyphone Breath, with reference to pronunciation data storehouse, checks for user and corrects the right pronunciation of polyphone;When user is carried out more to the pronunciation of polyphone When changing, the holographic code for Chinese characters that is stored in synchronous vacations Chinese-character holographic file storage module;
Braille modular converter, it is right for reading the file of holographic code for Chinese characters form from Chinese-character holographic file storage module Participle information and pronunciation information in holographic code for Chinese characters are interpreted, and combine the pronunciation that pronunciation data storehouse determines each Chinese character, with Chinese character information in holographic code for Chinese characters is converted into braille to check and change for user;When user modifies to braille, together The holographic code for Chinese characters stored in step modification Chinese-character holographic file storage module.
Further, in pretreatment module is segmented, it is by combining an outside or system to be automatically inserted into the mark of word segmentation Built-in participle database realizing, conventional participle is stored with the participle database, the pretreatment module that segments is by text This acquisition module is compared from the Chinese-character text that outside obtains with the participle in participle database, with automatic in Chinese-character text Insert the mark of word segmentation.
Further, in addition to:
Read through model is listened, for reading the file of holographic code for Chinese characters form from Chinese-character holographic file storage module, to Chinese character Participle information and pronunciation information in holographic code are interpreted, and combine the pronunciation that pronunciation data storehouse determines each Chinese character, to use Computerized speech is read aloud;Wherein, the stall position read aloud determines according to the position of punctuation mark and the mark of word segmentation.
Further, in addition to:
Lexical or textual analysis module, for reading the file of holographic code for Chinese characters form from Chinese-character holographic file storage module, to Chinese character Chinese character information, participle information and pronunciation information in holographic code are interpreted, and determine the font, pronunciation and participle shape of each Chinese character State, inquired about with providing the correct implication of each Chinese character or phrase within a context for user.
Further, in addition to the aobvious device of point, for by text editing module, phonetic notation editor module, braille modular converter and The content of lexical or textual analysis module is shown in the form of braille.
Further, the coded format of the holographic code for Chinese characters also includes:
Wherein one of the byte of holographic code for Chinese characters the 3rd is defined as giving tacit consent to pronunciation identification code, to give tacit consent to pronunciation identification code not Whether the pronunciation used within a context with the numerical identity Chinese character is to give tacit consent to pronunciation;When the reading that the Chinese character uses within a context When sound is gives tacit consent to pronunciation, the 4th byte of holographic code for Chinese characters is omitted.
Further, in the holographic code for Chinese characters, the information in the 3rd byte only uses last position and penultimate position;
Last position in 3rd byte is acquiescence pronunciation identification code, and the Chinese character is somebody's turn to do using acquiescence pronunciation when taking 1 when the position takes 0 The pronunciation of Chinese character is specified by the 4th byte;
Penultimate position in 3rd byte is participle identification code, the position represented when taking 0 the Chinese character not with next Chinese character group composition Word, take the 1 expression Chinese character to be formed with next Chinese character and segment.
Further, the coded format of the holographic code for Chinese characters also includes:
When the Chinese character is monosyllabic word, the 4th byte of holographic code for Chinese characters is omitted.
Further, the coded format of the holographic code for Chinese characters also includes:
When the 4th byte of the holographic code for Chinese characters of the Chinese character is omitted, and the Chinese character does not form participle with next Chinese character, the Chinese 3rd byte of word holographic code is omitted.
Further, in pronunciation data storehouse, multiple different pronunciations of polyphone are according to frequency of use from high to low suitable Sequence sorts and is numbered, and wherein frequency of use highest pronunciation is set as giving tacit consent to pronunciation.
By above technical scheme, the present invention uses new holographic code for Chinese characters form stored as a file, it is determined that the Chinese While word font, its pronunciation is also uniquely determined, it is also expressly that whether segmented with Chinese character below, when containing bright braille conversion Required full detail.Using it is provided by the invention it is a kind of based on new Chinese character holographic coding rule bright braille converting system, The problems such as " obscure " of generally existing, " misunderstanding " in current Chinese character Braille reading can fundamentally be overcome.In addition, Publishing branch exists During books printed in braille made of paper being made using the present invention for blind person, the holographic code for Chinese characters form that is synchronously generated as " byproduct " File, can be greatly reduced blind person reading is listened on computer or mobile phone, in touching reading on braille display when misunderstanding rate.Ensureing to believe While breath passes on accuracy, realize and achieve many things at one stroke.
Brief description of the drawings
Fig. 1 is a kind of function mould of bright braille converting system based on new Chinese character holographic coding rule provided by the invention Block schematic diagram.
Embodiment
Technical scheme is described in detail below in conjunction with accompanying drawing and specific embodiment.
The embodiments of the invention provide a kind of bright braille converting system based on new Chinese character holographic coding rule, in system In, introduce a kind of new Chinese character holographic coding rule, i.e. holographic code for Chinese characters;It is intended to " sound " of Chinese character, " shape ", " meaning " Fusion expresses accuracy in same set of coding rule, to improve the implication of bright braille conversion.
Specifically, the technological core as the present invention, the coded format of the holographic code for Chinese characters are as follows:
The corresponding Chinese character of one holographic code for Chinese characters;
Preceding 2 byte of holographic code for Chinese characters is the ISN of the Chinese character;
Wherein one of the byte of holographic code for Chinese characters the 3rd is defined as segmenting identification code, to segment the different numerical value marks of identification code Know whether the Chinese character segments with next Chinese character composition;
Wherein one of the byte of holographic code for Chinese characters the 3rd is defined as segmenting identification code, to segment the different numerical value marks of identification code Know whether the Chinese character segments with next Chinese character composition;3rd byte separately has one to be defined as giving tacit consent to pronunciation identification code, is read with giving tacit consent to Whether the pronunciation that the different numerical identities of the sound identification code Chinese character uses within a context is for acquiescence pronunciation;
4th byte of holographic code for Chinese characters is defined as pronunciation identification code, and the Chinese character is identified with the numerical values recited of pronunciation identification code Numbering corresponding to right pronunciation within a context.
Further, in the holographic code for Chinese characters, the information in the 3rd byte only uses last position and penultimate position;
Last position in 3rd byte is acquiescence pronunciation identification code, and the Chinese character is somebody's turn to do using acquiescence pronunciation when taking 1 when the position takes 0 The pronunciation of Chinese character is specified by the 4th byte;Wherein, due to monosyllabic word, one and only one gives tacit consent to pronunciation, therefore the Chinese character of monosyllabic word The byte last position of holographic code the 3rd is necessarily 0, and it 0 is also likely to be 1 that the byte last position of holographic code for Chinese characters the 3rd of polyphone, which is probably,;
Penultimate position in 3rd byte is participle identification code, the position represented when taking 0 the Chinese character not with next Chinese character group composition Word, take the 1 expression Chinese character to be formed with next Chinese character and segment.
According to defined above, because the information in the 3rd byte only uses last position and penultimate position, corresponding to them only It is the control character that is of little use in 4 ASCII characters, so conventional ASCII character character does not have occupied, when them and Chinese character mixing Shi Buhui causes ambiguity, improves computing and the storage efficiency of computer.
As an improvement, appropriate province can also be carried out to the 3rd byte of holographic code for Chinese characters and the 4th byte in the following ways Slightly:
When the pronunciation that the Chinese character uses within a context is acquiescence pronunciation, the 4th byte of holographic code for Chinese characters is omitted;When this When Chinese character is monosyllabic word, the 4th byte of holographic code for Chinese characters is omitted;I.e. when the last position of the 3rd byte is 0, the 4th byte is omitted;
When the 4th byte of the holographic code for Chinese characters of the Chinese character is omitted, and the Chinese character does not form participle with next Chinese character, the Chinese 3rd byte of word holographic code is omitted;I.e. when two, the end of the 3rd byte is simultaneously 0, the 3rd byte is also omitted, the Chinese character of the Chinese character Holographic code only takes preceding 2 byte.
Rule, is suitably omitted to the byte not comprising essential information, can greatly reduce storage information more than Data bits used, to reduce memory space.
Start to be discussed in detail below provided in an embodiment of the present invention a kind of based on the bright blind of new Chinese character holographic coding rule Literary converting system, as shown in figure 1, the system specifically includes:
Text collection module, for obtaining Chinese-character text from outside;
Pronunciation data storehouse, for storing the pronunciation of Chinese character;Wherein, multiple different pronunciations of each polyphone are by according to certain Order is numbered, and one of pronunciation is set to give tacit consent to pronunciation.In the present embodiment, multiple different pronunciations of polyphone Sort and be numbered according to the order of frequency of use from high to low, wherein frequency of use highest pronunciation is set as that acquiescence is read Sound.It should be noted that what is stored in pronunciation data storehouse is not only the pronunciation of the pronunciation, also monosyllabic word of polyphone, simply The pronunciation of monosyllabic word is unique and is acquiescence pronunciation, and the numbering of its pronunciation also only has one.
Pretreatment module is segmented, for from the Chinese-character text of outside acquisition, automatic or manual to be inserted to text collection module Enter the mark of word segmentation.The mark of word segmentation inserted in participle pretreatment module is mainly used in when Chinese-character text is converted into holographic code for Chinese characters Basic participle information reference is provided, the position of the mark of word segmentation need not entirely accurate;Therefore, it is manually inserted into participle mark to remove from Extensive work caused by note, the form of the automated intelligent insertion mark of word segmentation can also be used.Specifically, it is automatically inserted into the mark of word segmentation Function need to be stored with conventional point in the participle database with reference to the participle database realizing built in outside one or system Word, the participle pretreatment module enter text collection module from the Chinese-character text that outside obtains and the participle in participle database Row compares, to be automatically inserted into the mark of word segmentation in Chinese-character text.
Holographic code for Chinese characters precompile module, for combining the acquiescence pronunciation set in pronunciation data storehouse and participle pretreatment The mark of word segmentation inserted in module, the Chinese-character text is compiled into the coded format of holographic code for Chinese characters, and stored complete to Chinese character Cease in file storage module.
Chinese-character holographic file storage module, for storing the file of holographic code for Chinese characters form, i.e. holographic code for Chinese characters file.Base Contain Chinese character information simultaneously in the definition of the coded format of the holographic code for Chinese characters above provided, holographic code for Chinese characters file, divide Word information and pronunciation information.Specifically, Chinese character information is determined by 2 bytes before holographic code for Chinese characters, and participle information is by Chinese-character holographic The participle identification code of the 3rd byte of code determines that pronunciation information is then by the acquiescence pronunciation identification code of the byte of holographic code for Chinese characters the 3rd and the 4 byte combination pronunciation data storehouses determine.
Text editing module is right for reading the file of holographic code for Chinese characters form from Chinese-character holographic file storage module Chinese character information and participle information in holographic code for Chinese characters are interpreted, and show corresponding Chinese-character text and the mark of word segmentation, for Family is reviewed and changed.In the module, Chinese character can be shown in text window, now can be as handling conventional plain text text Part carries out the operation such as Chinese character addition, change and deletion like that, can also change the position of the mark of word segmentation;In the present embodiment, marking Participle ending beyond point symbol, using TAB keys as the mark of word segmentation.When user modifies to Chinese-character text or the mark of word segmentation When, the module understands the holographic code for Chinese characters stored in synchronous vacations Chinese-character holographic file storage module.
Phonetic notation editor module, it is right for reading the file of holographic code for Chinese characters form from Chinese-character holographic file storage module Chinese character information and pronunciation information in holographic code for Chinese characters are interpreted, and show the pronunciation letter of corresponding Chinese-character text and polyphone Breath, with reference to pronunciation data storehouse, checks for user and corrects the right pronunciation of polyphone.In the module, what text window was shown It is Chinese-character text and symbol, before cursor moves to polyphone, meeting one drop-down menu of automatic spring, can be selected by upper and lower cursor The right pronunciation of settled preceding Chinese character.Before cursor moves to non-polyphone, phonetic notation menu can close automatically.When user is to polyphone When pronunciation is modified, the module understands the holographic code for Chinese characters stored in synchronous vacations Chinese-character holographic file storage module.
Due to the holographic code for Chinese characters conversion carried out in holographic code for Chinese characters precompile module, located in advance based on more rough participle The acquiescence pronunciation of reason and default;Although the Intelligent Recognition work(of pretreatment module and pronunciation data storehouse can be segmented by improveing The accuracy of information matches can be improved, but can not make the Chinese-character holographic initially stored in Chinese-character holographic file storage module all the time Express to code file entirely accurate the participle information and pronunciation information of Chinese character.But by by text editing module and phonetic notation Editor module, the Chinese character information in the presence of mistake on a small quantity, participle information and pronunciation information can be adjusted, further improve the Chinese The accuracy of word holographic code file.Herein on basis, then it can also increase various functions module, using in holographic code for Chinese characters Comprising Chinese character information, participle information and pronunciation information be user service.
Specifically, as an improvement, present invention additionally comprises following functions module:
Braille modular converter, it is right for reading the file of holographic code for Chinese characters form from Chinese-character holographic file storage module Participle information and pronunciation information in holographic code for Chinese characters are interpreted, and combine the pronunciation that pronunciation data storehouse determines each Chinese character, with Chinese character information in holographic code for Chinese characters is converted into braille to check and change for user.In the module, it can show in text window Show braille, user the check and correction editing such as can also be increased, deleted except browsing braille to braille, and modification is irrational The mark of word segmentation.When user modifies to braille, stored in module meeting synchronous vacations Chinese-character holographic file storage module Holographic code for Chinese characters.
Read through model is listened, for reading the file of holographic code for Chinese characters form from Chinese-character holographic file storage module, to Chinese character Participle information and pronunciation information in holographic code are interpreted, and combine the pronunciation that pronunciation data storehouse determines each Chinese character, to use Computerized speech is read aloud;Wherein, the stall position read aloud determines according to the position of punctuation mark and the mark of word segmentation.The module In, the holographic code for Chinese characters after parsing can be read aloud with screen software is read, due to believing in holographic code for Chinese characters containing pronunciation simultaneously Breath and participle information, polyphone can correctly be read aloud by reading screen software, can be had and more reasonably be paused, therefore not only avoid because more Sound word misreads caused error message, and also has and more comfortably listen reading effect, and this is that conventional text can not when listening reading Realize.
Lexical or textual analysis module, for reading the file of holographic code for Chinese characters form from Chinese-character holographic file storage module, to Chinese character Chinese character information, participle information and pronunciation information in holographic code are interpreted, and determine the font, pronunciation and participle shape of each Chinese character State, inquired about with providing the correct implication of each Chinese character or phrase within a context for user.Because the braille of Chinese character is a kind of expression The word of pronunciation, influenceed by Chinese character unisonance discrepancy, a word multitone, often occur in traditional braille dictionary software one it is blind Literary individual character or phrase correspond to the situation of multiple different Chinese characters or phrase, can not confirm that the reality that Chinese character original text is intended by contains Justice, and stored using holographic code for Chinese characters provided by the invention, then one with Chinese character original text can be realized after braille is read One correspondence, it can accurately realize lexical or textual analysis function.Specifically, the lexical or textual analysis module is it is determined that font, pronunciation and the participle of each Chinese character After state, matching inquiry is carried out from a paraphrasing data storehouse, the words implication inquired is showed into user.Wherein, it is described to release Adopted database can integrate internal database in systems or the network dictionary from external reference, dictionary Deng external data base.
The aobvious device of point, for by the content of text editing module, phonetic notation editor module, braille modular converter and lexical or textual analysis module with The form of braille is shown.Wherein, text editing module, phonetic notation editor module, braille editor module export on the aobvious device of point Be all ASCII character that identical current word is expert at, the TAB keys for being intended only as the mark of word segmentation are shown as half-angle sky Lattice;Coordinating text editing module and braille editor module in use, the aobvious device of point can show the content that current word is expert at, can be with Proofreaded by touching reading cooperation, content additions and deletions and participle operate;Coordinating phonetic notation editor module in use, when in computer screen During the phonetic annotation of Chinese characters menu ejection of display, current pronunciation can be shown by putting aobvious device, and switching points of engagement by upper and lower cursor shows device, can be with Phonetic notation is completed to select;Coordinate lexical or textual analysis module in use, allowing the current word of display to be explained and organized word operation, press phase Shortcut is answered, the explanation of Chinese character or group word information can be shown in a little on aobvious device.
It should be evident that the display device of the present invention is not limited solely to a little aobvious device, LCDs etc. can also be connected Other display equipment, the content of text editing module, phonetic notation editor module, braille modular converter and lexical or textual analysis module is exported Display.
By above technical scheme, the present invention uses holographic code for Chinese characters form stored as a file, it is determined that Chinese character pattern While, also uniquely determine its pronunciation, it is also expressly that whether segment, contain required during the bright blind conversion of Chinese character with Chinese character below Full detail.Using the holographic code for Chinese characters form stored as a file in the present invention, it can fundamentally overcome current Chinese character blind During text is read the problems such as " obscure " of generally existing, " misunderstanding ".
Several specific examples will be lifted below, to illustrate the transfer process of holographic code for Chinese characters and technical advantage.
Specifically, for monosyllabic word, or acquiescence pronunciation (refering in particular to frequency of use highest pronunciation in the present embodiment) is read Polyphone, its 4th yard is OX1 (16 system), now can be default.
Example one:
(" big " of size) greatly, is polyphone, there is two pronunciations, and da4 and dai4, da4 are the 1st pronunciation, therefore it is holographic Kanji code=big ISN adds OX1+OX1, wherein 16 system number OX1 of the 3rd byte are liaison and polyphone keying, because of it most Last position is " 1 ", and expression is polyphone, and pronunciation will be specified by the 4th byte;4th byte is OX1, corresponding to the 1 of 10 systems, is represented The word reads the 1st pronunciation, that is, frequency highest pronunciation da4.Because the 3rd byte OX1 penultimate position is zero, represent not with the Chinese below Word composition participle.
In addition, size is big, because pronunciation is the 1st pronunciation, the 4th byte OX1 of its holographic kanji code can be default;Because not Participle is formed with word below and the 4th byte is default, therefore the 3rd byte can also be omitted.The holographic kanji code of so big (size big) It can be reduced to:Big ISN.
For another example:(" big " of doctor) greatly:It is the 2nd pronunciation of polyphone " big ", the ISN of therefore its holographic kanji code=big+ OX1+OX2;
Example two:
It is rich:It is monosyllabic word, an only pronunciation fu4, therefore the ISN+OX1+OX1 of its complete holographic kanji code=richness.
Because being monosyllabic word, can also be abbreviated as:Rich ISN+OX1;
When not forming word with word below, its 3rd byte is OX1, now can also continue to be reduced to:Rich holographic kanji code The ISN of=richness.
The Chinese-character holographic kanji code seen below under phrase state:
Hobby:Love is monosyllabic word, forms and segments with word below;It is polyphone well, the 1st pronunciation is " hao3 ", the 2nd pronunciation For " hao4 ".
(equivalent to binary one 0, last position zero, expression is single-tone to the ISN+OX2 of holographic kanji code=love of hobby Word, penultimate position are 1, represent to segment with the composition of word below;Because being monosyllabic word, the 4th byte is omitted)+good ISN+OX1 (most ends Position is 1, represents polyphone, and penultimate position is zero, represents not segment with the composition of word below)+OX2 (read the 2nd and read by the 2 of 10 systems, expression Sound).
Example three:
Jilin Province:Lucky, woods is that monosyllabic word province is polyphone, but reads the 1st pronunciation (sheng3).
Therefore, the ISN of ISN+OX2 (monosyllabic word and rear word composition segment)+woods of the holographic kanji code=Ji in Jilin Province ISN+the OX1+OX1 that+OX2+ is saved, it is clear that rear 2 byte of province can be omitted.
Example four:
Love ease and hate work:First word is polyphone, reads the 2nd pronunciation;3rd word is also polyphone (e4, wu4), reads the 2nd pronunciation, Therefore the holographic kanji code of the word is:
ISN+OX2 (monosyllabic word and rear word group of ISN+OX3 well (polyphone and rear word composition segment)+OX2+ ease Into participle) ISN of+ISN+OX3 (polyphone, with rear word group word)+OX2 (the 2nd pronunciation of evil)+labor for disliking (the 3rd, 4 bytes save Slightly)).
Default in holographic kanji code, which will not cause, to be obscured.Because Chinese character all takes the 1st in most cases Pronunciation (including unique pronunciation), and in an article word of more than half not with rear word form segment, therefore it is default can be significantly Save memory space.
By using holographic code for Chinese characters form stored as a file, the present invention can both avoid Chinese character to braille change when, it is more Puzzlement in the selection of sound word;Occurs sound synonymous different mistake when can also avoid braille from being changed to Chinese character.By coordinating voice software The text after pronunciation editor is played, blind person can more accurately, more easily content be listened in understanding, when being avoided that conventional text listens reading The problem of polyphone of appearance is misread, phrase mistake is taken;Blind person may be used also when braille display touching reading runs into strange or difficult word ISN is called current word to be explained or provided conventional group word by computer operation, this is traditional braille conversion method institute nothing The technical advantage that method provides.
Embodiment described above only expresses the several embodiments of the present invention, and its description is more specific and detailed, but simultaneously Therefore the limitation to the scope of the claims of the present invention can not be interpreted as.It should be pointed out that for one of ordinary skill in the art For, without departing from the inventive concept of the premise, various modifications and improvements can be made, these belong to the guarantor of the present invention Protect scope.Therefore, the protection domain of patent of the present invention should be determined by the appended claims.

Claims (10)

  1. A kind of 1. bright braille converting system based on new Chinese character holographic coding rule, it is characterised in that including:
    Text collection module, for obtaining Chinese-character text from outside;
    Pronunciation data storehouse, for storing the pronunciation of Chinese character;Wherein, multiple different pronunciations of each polyphone are by according to certain order It is numbered, and one of pronunciation is set to give tacit consent to pronunciation;
    Pretreatment module is segmented, for from the Chinese-character text of outside acquisition, automatic or manual insertion to divide to text collection module Word marks;
    Holographic code for Chinese characters precompile module, for combining the acquiescence pronunciation set in pronunciation data storehouse and participle pretreatment module The mark of word segmentation of middle insertion, the Chinese-character text is compiled into the coded format of holographic code for Chinese characters, and stored to Chinese-character holographic text In part memory module;
    Chinese-character holographic file storage module, for storing the file of holographic code for Chinese characters form;
    Wherein, the coded format of the holographic code for Chinese characters is:
    The corresponding Chinese character of one holographic code for Chinese characters;
    Preceding 2 byte of holographic code for Chinese characters is the ISN of the Chinese character;
    Wherein one of the byte of holographic code for Chinese characters the 3rd is defined as segmenting identification code, should to segment the different numerical identities of identification code Whether Chinese character forms with next Chinese character segments;
    4th byte of holographic code for Chinese characters is defined as pronunciation identification code, and the Chinese character is identified upper with the numerical values recited of pronunciation identification code The hereinafter numbering corresponding to right pronunciation;
    The system also includes:
    Text editing module, for reading the file of holographic code for Chinese characters form from Chinese-character holographic file storage module, to Chinese character Chinese character information and participle information in holographic code are interpreted, and are shown corresponding Chinese-character text and the mark of word segmentation, are entered for user Row is checked and changed;When user modifies to Chinese-character text or the mark of word segmentation, synchronous vacations Chinese-character holographic file storage mould The holographic code for Chinese characters stored in block;
    Phonetic notation editor module, for reading the file of holographic code for Chinese characters form from Chinese-character holographic file storage module, to Chinese character Chinese character information and pronunciation information in holographic code are interpreted, and show the pronunciation information of corresponding Chinese-character text and polyphone, With reference to pronunciation data storehouse, checked for user and correct the right pronunciation of polyphone;When user is modified to the pronunciation of polyphone When, the holographic code for Chinese characters that stores in synchronous vacations Chinese-character holographic file storage module;
    Braille modular converter, for reading the file of holographic code for Chinese characters form from Chinese-character holographic file storage module, to Chinese character Participle information and pronunciation information in holographic code are interpreted, and combine the pronunciation that pronunciation data storehouse determines each Chinese character, by the Chinese Chinese character information in word holographic code is converted to braille and checks and change for user;When user modifies to braille, synchronously repair Change the holographic code for Chinese characters stored in Chinese-character holographic file storage module.
  2. 2. the bright braille converting system according to claim 1 based on new Chinese character holographic coding rule, it is characterised in that In pretreatment module is segmented, it is real by the participle database with reference to built in an outside or system to be automatically inserted into the mark of word segmentation Existing, conventional participle is stored with the participle database, the participle pretreatment module obtains text collection module from outside The Chinese-character text taken is compared with the participle in participle database, to be automatically inserted into the mark of word segmentation in Chinese-character text.
  3. 3. the bright braille converting system according to claim 1 based on new Chinese character holographic coding rule, it is characterised in that Also include:
    Read through model is listened, for reading the file of holographic code for Chinese characters form from Chinese-character holographic file storage module, to Chinese-character holographic Participle information and pronunciation information in code are interpreted, and combine the pronunciation that pronunciation data storehouse determines each Chinese character, to use computer Voice is read aloud;Wherein, the stall position read aloud determines according to the position of punctuation mark and the mark of word segmentation.
  4. 4. the bright braille converting system according to claim 3 based on new Chinese character holographic coding rule, it is characterised in that Also include:
    Lexical or textual analysis module, for reading the file of holographic code for Chinese characters form from Chinese-character holographic file storage module, to Chinese-character holographic Chinese character information, participle information and pronunciation information in code are interpreted, and determine the font, pronunciation and participle state of each Chinese character, with The correct implication of each Chinese character or phrase within a context is provided to inquire about for user.
  5. 5. the bright braille converting system according to claim 4 based on new Chinese character holographic coding rule, it is characterised in that Also include the aobvious device of point, for by the content of text editing module, phonetic notation editor module, braille modular converter and lexical or textual analysis module with blind The form of text is shown.
  6. 6. the bright braille converting system according to claim 1 based on new Chinese character holographic coding rule, it is characterised in that The coded format of the holographic code for Chinese characters also includes:
    Wherein one of the byte of holographic code for Chinese characters the 3rd is defined as giving tacit consent to pronunciation identification code, to give tacit consent to the different numbers of pronunciation identification code Whether the pronunciation that the value mark Chinese character uses within a context is for acquiescence pronunciation;When the pronunciation that the Chinese character uses within a context for When giving tacit consent to pronunciation, the 4th byte of holographic code for Chinese characters is omitted.
  7. 7. the bright braille converting system according to claim 6 based on new Chinese character holographic coding rule, it is characterised in that In the holographic code for Chinese characters, the information in the 3rd byte only uses last position and penultimate position;
    Last position in 3rd byte is acquiescence pronunciation identification code, and the Chinese character is using acquiescence pronunciation when the position takes 0, Chinese character when taking 1 Pronunciation specified by the 4th byte;
    Penultimate position in 3rd byte is participle identification code, and the position represents that the Chinese character does not form with next Chinese character and segmented when taking 0, Take the 1 expression Chinese character to form with next Chinese character to segment.
  8. 8. the bright braille converting system according to claim 1 based on new Chinese character holographic coding rule, it is characterised in that The coded format of the holographic code for Chinese characters also includes:
    When the Chinese character is monosyllabic word, the 4th byte of holographic code for Chinese characters is omitted.
  9. 9. the bright braille converting system based on new Chinese character holographic coding rule according to claim 6 or 8, its feature exist In the coded format of the holographic code for Chinese characters also includes:
    When the 4th byte of the holographic code for Chinese characters of the Chinese character is omitted, and the Chinese character does not form participle with next Chinese character, Chinese character is complete The 3rd byte for ceasing code is omitted.
  10. 10. the method according to claim 1 or 6, it is characterised in that in pronunciation data storehouse, multiple differences of polyphone Pronunciation sorts and is numbered according to the order of frequency of use from high to low, and wherein frequency of use highest pronunciation is set as giving tacit consent to Pronunciation.
CN201710517639.0A 2017-06-29 2017-06-29 Bright braille conversion system based on novel Chinese character holographic coding rule Expired - Fee Related CN107451105B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710517639.0A CN107451105B (en) 2017-06-29 2017-06-29 Bright braille conversion system based on novel Chinese character holographic coding rule

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710517639.0A CN107451105B (en) 2017-06-29 2017-06-29 Bright braille conversion system based on novel Chinese character holographic coding rule

Publications (2)

Publication Number Publication Date
CN107451105A true CN107451105A (en) 2017-12-08
CN107451105B CN107451105B (en) 2020-04-07

Family

ID=60488117

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710517639.0A Expired - Fee Related CN107451105B (en) 2017-06-29 2017-06-29 Bright braille conversion system based on novel Chinese character holographic coding rule

Country Status (1)

Country Link
CN (1) CN107451105B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108415899A (en) * 2018-01-31 2018-08-17 北京联合大学 A kind of braille participle amending method and system

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1591414A (en) * 2004-06-03 2005-03-09 华建电子有限责任公司 Automatic translating converting method for Chinese language to braille
CN1661526A (en) * 2004-02-24 2005-08-31 商荣杰 Set symbol computer keyboard and design of encoding signal input system
CN1848049A (en) * 2006-03-27 2006-10-18 富明慧 Half square braille digital coding Chinese character inputting method
JP2006302149A (en) * 2005-04-22 2006-11-02 Chiba Univ Japanese input device
CN101408803A (en) * 2008-11-04 2009-04-15 中兴通讯股份有限公司 Method for inputting Braille to terminal equipment and terminal equipment thereof
CN103870008A (en) * 2014-04-03 2014-06-18 可牛网络技术(北京)有限公司 Method and device for output and input of Braille characters on touch screen
CN103995600A (en) * 2014-03-20 2014-08-20 江苏科技大学 Braille and Chinese character converting device and method
GB2532770A (en) * 2014-11-27 2016-06-01 Stuart Wainwright Michael Apparatus for use in teaching a language

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1661526A (en) * 2004-02-24 2005-08-31 商荣杰 Set symbol computer keyboard and design of encoding signal input system
CN1591414A (en) * 2004-06-03 2005-03-09 华建电子有限责任公司 Automatic translating converting method for Chinese language to braille
JP2006302149A (en) * 2005-04-22 2006-11-02 Chiba Univ Japanese input device
CN1848049A (en) * 2006-03-27 2006-10-18 富明慧 Half square braille digital coding Chinese character inputting method
CN101408803A (en) * 2008-11-04 2009-04-15 中兴通讯股份有限公司 Method for inputting Braille to terminal equipment and terminal equipment thereof
CN103995600A (en) * 2014-03-20 2014-08-20 江苏科技大学 Braille and Chinese character converting device and method
CN103870008A (en) * 2014-04-03 2014-06-18 可牛网络技术(北京)有限公司 Method and device for output and input of Braille characters on touch screen
GB2532770A (en) * 2014-11-27 2016-06-01 Stuart Wainwright Michael Apparatus for use in teaching a language

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108415899A (en) * 2018-01-31 2018-08-17 北京联合大学 A kind of braille participle amending method and system
CN108415899B (en) * 2018-01-31 2021-09-17 北京联合大学 Braille word segmentation modification method and system

Also Published As

Publication number Publication date
CN107451105B (en) 2020-04-07

Similar Documents

Publication Publication Date Title
CN101950285A (en) Utilize native language pronunciation string converting system and the method thereof of statistical method to Chinese character
CN107368474A (en) A kind of automatical and efficient translation conversion method of Chinese to braille
CN102915122B (en) Based on the intelligent family moving platform spelling input method of language model
CN107133198A (en) Method for typesetting and format conversion of document
CN106980620A (en) A kind of method and device matched to Chinese character string
CN100462901C (en) GB phoneticize input method
CN101520693A (en) Method and system for rapidly inputting bulk information
CN103136453B (en) The automatic volume group method of document function topic and automatic marking method
CN100432903C (en) Half square braille digital coding Chinese character inputting method
CN102023854A (en) Template-based semantic variable extraction method
CN107451105A (en) A kind of bright braille converting system based on new Chinese character holographic coding rule
CN101452368B (en) Hand-written character input method
CN101604308A (en) Mongolian coding technology adopting alphabetic variant forms
CN107145478B (en) Method for converting Chinese sentence into braille
CN110888976A (en) Text abstract generation method and device
CN85100094A (en) Phonetic transcriptions of Chinese characters association coding and spelling keyboard
CN101587382A (en) Character input method suitable for Uighur, Kazakh and Khalkhas
CN1269542A (en) Association Chinese character input system
CN102622098B (en) New pictophonetic code Chinese character input method
CN1310371B (en) Method and apparatus for inputting characters
CN102110082B (en) Method and system for outputting complementary word of galley proof file
CN110210014B (en) Intelligent form system
TW460804B (en) Data processing apparatus and method for converting the sequence and arrangement of strokes of Chinese characters into the composition of binary data codes
CN1333325C (en) Pictographic character direct-viewing coding input method
CN1252555A (en) Three-Three phonetic code and Three-Three digital code

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20200407

CF01 Termination of patent right due to non-payment of annual fee