CN1121015C - Intelligent Chinese computer system for the blind - Google Patents
- Publication number
- CN1121015C (application CN 01129619)
- Authority
- CN
- China
- Prior art keywords
- chinese
- braille
- chinese character
- character
- order
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Landscapes
- Machine Translation (AREA)
Abstract
The present invention relates to an intelligent Chinese computer system for blind users and belongs to the field of pattern recognition and artificial intelligence. The system consists mainly of a personal computer host connected to the Internet, hardware and software modules installed in the host, and related peripheral hardware: a microphone, a loudspeaker or earphone, a scanner, a printer, and a braille display, each connected to an interface of the host. The invention lets blind users make full use of their hearing, speech, and touch when using the computer, and lets them choose the most natural and convenient way to operate it, making the interaction more humane and intelligent. The invention gives blind users a tool for processing documents and communicating with sighted people, and gives teachers in schools for the blind a teaching tool.
Description
Technical field
The invention belongs to the field of pattern recognition and artificial intelligence. It relates in particular to the design of an intelligent computer system for Chinese blind users.
Background technology
Blind people use braille (tactile symbols read by touch) for schooling and for exchanging information. Some developed countries have produced fairly good computers and operating platforms for blind users. Britain has developed a computer for the blind whose keys differ in size, shape, and texture, and each key provides multimedia feedback through a sound mechanism. Microsoft has stated that it plans to cooperate with Pause Data International, a New Zealand manufacturer of technology for the visually impaired, to develop an electronic book reader usable by blind and visually impaired people. Corresponding braille computers (mainly computers equipped with a braille display) have also come onto the market in Taiwan, Hong Kong, and elsewhere. Prices are all very high: a braille display costs 4000 to 5000 US dollars, which ordinary Chinese blind people cannot afford. In China, some work has been done in recent years to let blind people use computers and read ordinary text. For example, with funding from the China Disabled Federation and the China Blind Person Association, the Chinese braille bookstore developed a braille word-segmentation and joined-writing system; the National Library of China studied a reading machine for the blind under the DOS operating system, which scans printed Chinese text into the computer, recognizes it, and converts the recognized characters into speech output so that blind people can listen to ordinary text; and the Department of Automation of Tsinghua University studied an input method for the blind that uses sound to assist character selection, as well as Chinese character-to-braille conversion under DOS.
In addition, Chinese character recognition, speech recognition, and speech synthesis technologies for sighted users have reached or are approaching a practical level. However, no intelligent Chinese computer system for blind users yet exists anywhere in the world.
Summary of the invention
The object of the invention is to overcome the shortcomings of the above technology by proposing an intelligent Chinese computer system for blind users that lets them make full use of their hearing, speech, and touch when using a computer, and choose the most natural and convenient way to operate it. Compared with traditional means of human-computer interaction, the system adopts a multimodal interaction style. The user can use traditional interaction devices such as the keyboard, braille display, and monitor, and can at the same time use newer interaction techniques such as speech and OCR (optical character recognition). This makes the interaction more humane and intelligent. The system gives blind users a tool for document processing and for communicating with sighted people, and gives teachers in schools for the blind a teaching tool.
The intelligent Chinese computer system for blind users proposed by the invention mainly comprises: an ordinary personal computer with Internet access; a braille display output device that converts the Chinese characters or braille shown by the computer into standard braille ASCII codes and outputs them; an optical character recognition device that receives images from a scanner and converts them into computer text by recognition; a keyboard input and editor for blind users, for entering Chinese characters and braille; a voice input device that passes voice information or control commands into the computer; a printer output device that outputs the content the computer is to print; and a speech synthesis output device that turns sentences, phrases, words, or syllables into sound waveforms and outputs them. The system is characterized in that it further comprises, all arranged in said computer: a Chinese-braille converter that automatically converts Chinese braille to Chinese characters and Chinese characters to Chinese braille; a comprehensive knowledge base that manages the various knowledge bases needed for Chinese-braille intertranslation; a speech recognition device that converts the voice input stream into computer text or identifies the keywords in the voice input stream; a natural language generation device that produces Chinese sentences with intonation; a unified input interface formed by three input channels, namely said optical character recognition device, keyboard input and editor for blind users, and voice input device; three output channels formed by the braille display output device, the printer output device, and the speech synthesis output device; a controller that exercises overall control over said three input channels and three output channels; and an inference system device that analyzes the content of the human-computer interaction, predicts the topic in time, estimates the user's next action, analyzes the user's needs, and formulates and carries out different interaction strategies. Said Chinese-braille converter comprises an automatic converter from Chinese braille to Chinese characters and an automatic converter from Chinese characters to Chinese braille. The braille-to-character converter recognizes braille after a braille book has been scanned, or takes braille entered from the keyboard, and converts the braille into Chinese characters by way of pinyin. At each stage of said pinyin-to-character conversion, the Chinese-braille comprehensive knowledge base is used: in a pinyin-to-character conversion search graph weighted with transition probabilities, a state-transition-path search obtains the N best paths in order, thereby realizing the automatic conversion from braille to Chinese characters. Said character-to-braille converter first applies word segmentation and joined writing to the Chinese text according to the Chinese braille segmentation and joined-writing rules, and then converts the words to braille. Said word segmentation means writing words separately, one by one. Said joined writing means that, in keeping with the characteristics of braille and following the logic and conventions of Chinese grammar and speech and the principle of suitable syllable length, certain words are written together so that the syllable structure does not become too fragmented and the text is easy to read by touch.
The function and implementation of each module of the invention are described below:
OCR (optical character recognition) device: uses the prior art. Chinese text printed on paper for sighted readers, or braille text embossed on paper, is fed into the computer through an optical scanner and processed automatically. This includes automatic recognition of printed Chinese characters, handwritten Chinese characters, and braille. The image file is converted into electronic text, which provides the necessary precondition for reading (reading aloud), editing, and other processing.
Voice input device: the user speaks into the microphone and the voice information (control commands and so on) is sent into the computer. Uses the prior art.
Braille display output device: converts the Chinese characters or braille shown by the computer into standard braille ASCII codes and outputs them to the braille display used by the blind user. The user reads by touch, learns what the computer is currently displaying, and can thus use the computer and interact with it.
Table 1: Correspondence between braille ASCII codes and braille cell codes (a braille cell consists of two columns of three dots each; the left column holds dots 1, 2, 3 from top to bottom and the right column holds dots 4, 5, 6 from top to bottom; these are called dot positions. The bits of the binary braille code correspond, from left to right, to dot positions 1 through 6.)
ASCII code (hex) | ASCII code (decimal) | Control character | Symbol | Braille code (binary) | Braille code (hex)
20 | 32 | | Space | |
21 | 33 | | Exclamation mark | 011101 | 1D
22 | 34 | | Double quotation mark | 000010 | 02
23 | 35 | | Number sign | 001111 | 0F
24 | 36 | | Dollar sign | 110101 | 35
25 | 37 | | Percent sign | 100101 | 25
26 | 38 | | Ampersand | 111101 | 3D
27 | 39 | | Single quotation mark | 001000 | 08
28 | 40 | | Open parenthesis | 111011 | 3B
29 | 41 | | Close parenthesis | 011111 | 1F
2A | 42 | | Asterisk | 100001 | 21
2B | 43 | | Plus sign | 001101 | 0D
2C | 44 | | Comma | 000001 | 01
2D | 45 | | Minus sign | 001001 | 09
2E | 46 | | Period | 000101 | 05
2F | 47 | | Slash | 001100 | 0C
30 | 48 | | 0 | 001011 | 0B
31 | 49 | | 1 | 010000 | 10
32 | 50 | | 2 | 011000 | 18
33 | 51 | | 3 | 010010 | 12
34 | 52 | | 4 | 010011 | 13
35 | 53 | | 5 | 010001 | 11
36 | 54 | | 6 | 011010 | 1A
37 | 55 | | 7 | 011011 | 1B
38 | 56 | | 8 | 011001 | 19
39 | 57 | | 9 | 001010 | 0A
3A | 58 | | Colon | 100011 | 23
3B | 59 | | Semicolon | 000011 | 03
3C | 60 | | Less-than sign | 110001 | 31
3D | 61 | | Equal sign | 111111 | 3F
3E | 62 | | Greater-than sign | 001110 | 0E
3F | 63 | | Question mark | 100111 | 27
40 | 64 | NUL (null) | At sign | 000100 | 04
41 | 65 | SOH (start of header) | A | 100000 | 20
42 | 66 | STX (start of text) | B | 110000 | 30
43 | 67 | ETX (end of text) | C | 100100 | 24
44 | 68 | EOT (end of transmission) | D | 100110 | 26
45 | 69 | ENQ (enquiry) | E | 100010 | 22
46 | 70 | ACK (acknowledge) | F | 110100 | 34
47 | 71 | BEL (bell) | G | 110110 | 36
48 | 72 | BS (backspace) | H | 110010 | 32
49 | 73 | HT (horizontal tab) | I | 010100 | 14
4A | 74 | LF (line feed) | J | 010110 | 16
4B | 75 | VT (vertical tab) | K | 101000 | 28
4C | 76 | FF (form feed) | L | 111000 | 38
4D | 77 | CR (carriage return) | M | 101100 | 2C
4E | 78 | SO (shift out) | N | 101110 | 2E
4F | 79 | SI (shift in) | O | 101010 | 2A
50 | 80 | DLE (data link escape) | P | 111100 | 3C
51 | 81 | DC1 (device control 1) | Q | 111110 | 3E
52 | 82 | DC2 (device control 2) | R | 111010 | 3A
53 | 83 | DC3 (device control 3) | S | 011100 | 1C
54 | 84 | DC4 (device control 4) | T | 011110 | 1E
55 | 85 | NAK (negative acknowledge) | U | 101001 | 29
56 | 86 | SYN (synchronous idle) | V | 111001 | 39
57 | 87 | ETB (end of transmission block) | W | 010111 | 17
58 | 88 | CAN (cancel) | X | 101101 | 2D
59 | 89 | EM (end of medium) | Y | 101111 | 2F
5A | 90 | SUB (substitute) | Z | 101011 | 2B
5B | 91 | ESC (escape) | Open bracket | 010101 | 15
5C | 92 | FS (file separator) | Backslash | 110011 | 33
5D | 93 | GS (group separator) | Close bracket | 110111 | 37
5E | 94 | RS (record separator) | Caret | 000110 | 06
5F | 95 | US (unit separator) | Underscore | 000111 | 07
60 | 96 | | Grave accent | 000100 | 04
61 | 97 | | a | 100000 | 20
62 | 98 | | b | 110000 | 30
63 | 99 | | c | 100100 | 24
64 | 100 | | d | 100110 | 26
65 | 101 | | e | 100010 | 22
66 | 102 | | f | 110100 | 34
67 | 103 | | g | 110110 | 36
68 | 104 | | h | 110010 | 32
69 | 105 | | i | 010100 | 14
6A | 106 | | j | 010110 | 16
6B | 107 | | k | 101000 | 28
6C | 108 | | l | 111000 | 38
6D | 109 | | m | 101100 | 2C
6E | 110 | | n | 101110 | 2E
6F | 111 | | o | 101010 | 2A
70 | 112 | | p | 111100 | 3C
71 | 113 | | q | 111110 | 3E
72 | 114 | | r | 111010 | 3A
73 | 115 | | s | 011100 | 1C
74 | 116 | | t | 011110 | 1E
75 | 117 | | u | 101001 | 29
76 | 118 | | v | 111001 | 39
77 | 119 | | w | 010111 | 17
78 | 120 | | x | 101101 | 2D
79 | 121 | | y | 101111 | 2F
7A | 122 | | z | 101011 | 2B
7B | 123 | | Open brace | 010101 | 15
7C | 124 | | Vertical bar | 110011 | 33
7D | 125 | | Close brace | 110111 | 37
7E | 126 | | Tilde | 000110 | 06
7F | 127 | | Delete | 000111 | 07
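The bit convention of the table above (six bits, read left to right as dot positions 1 to 6) can be checked with a short sketch. The Python below is illustrative only and is not part of the patent: the function names `cell_hex` and `cell_glyph` are hypothetical, and the mapping onto the Unicode braille patterns block (U+2800 plus one bit per dot, with dot 1 as the lowest bit) is an assumption added here so that a cell can be displayed on screen.

```python
def _check(bits: str) -> None:
    # A cell code in the table is exactly six characters, each '0' or '1'.
    if len(bits) != 6 or set(bits) - {"0", "1"}:
        raise ValueError(f"expected six binary digits, got {bits!r}")


def cell_hex(bits: str) -> str:
    """Read the six-bit string as a binary number, as in the hex column."""
    _check(bits)
    return format(int(bits, 2), "02X")


def cell_glyph(bits: str) -> str:
    """Map dot positions 1..6 (left to right in `bits`) to a Unicode cell.

    Assumption: Unicode encodes dot n as bit n-1 above U+2800.
    """
    _check(bits)
    mask = sum(1 << i for i, bit in enumerate(bits) if bit == "1")
    return chr(0x2800 + mask)


if __name__ == "__main__":
    for bits in ("100000", "110000", "011101"):
        print(bits, cell_hex(bits), cell_glyph(bits))
```

Note that the two functions read the same string in opposite bit orders: the table's hex column treats dot 1 as the most significant bit, whereas Unicode treats dot 1 as the least significant bit.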
Printer output device: outputs the content the computer is to print to a braille printer or an ordinary Chinese-character printer. Uses the prior art.
Speech synthesis output device: turns sentences, phrases, words, or syllables into sound waveforms, played through a loudspeaker or earphone. Uses the prior art.
Chinese-braille converter: comprises automatic conversion from Chinese braille to Chinese characters and from Chinese characters to Chinese braille. Braille-to-character conversion is implemented as follows: braille is recognized after a braille book is scanned, or is entered from the keyboard, and is then converted into Chinese characters by way of pinyin. At each stage of said pinyin-to-character conversion, the Chinese-braille comprehensive knowledge base is used: in a pinyin-to-character conversion search graph weighted with transition probabilities, the Viterbi search method (a dynamic programming algorithm, used here to search state transition paths) obtains the N best paths in order, thereby realizing the automatic conversion from braille to Chinese characters.
Said Chinese-braille comprehensive knowledge base comprises an electronic dictionary, a rule base, and a statistical information base (an adjacent-word co-occurrence probability base obtained from statistics over a large-scale real corpus).
The above automatic conversion from Chinese braille to Chinese characters comprises the following steps:
1) Read in all the consecutive non-braille symbols at the head of the text not yet converted;
2) Determine whether the current input braille symbol has a non-Chinese-character meaning. If it represents a Chinese character, go to step 4). If it represents a non-Chinese character, search the N-best paths in the Viterbi search graph and select the best path to obtain the conversion result, and insert the non-braille symbols read in at the start at the corresponding position;
3) Record the conversion result of this sentence, record the conversion result of the input braille symbol that has a non-Chinese-character meaning, clear the Viterbi search graph, and go to step 5);
4) Look up all Chinese word candidates that the current input braille symbols can match, and construct the corresponding nodes in the Viterbi search graph;
5) Determine whether all conversion is finished. If so, output the converted Chinese characters; if not, go to step 1).
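The search over the pinyin-to-character graph in the steps above can be illustrated with a minimal Viterbi sketch. This is not the patent's implementation: the candidate table, the probabilities, and the names `CANDIDATES`, `UNIGRAM`, `BIGRAM`, and `viterbi` are all hypothetical toy values, and the sketch keeps only the single best path, whereas the text describes an N-best search over word candidates.

```python
import math

# Hypothetical toy knowledge base: character candidates for each pinyin
# syllable, start probabilities, and adjacent-character transition
# probabilities. Real values would come from corpus statistics.
CANDIDATES = {"ni": ["你", "尼"], "hao": ["好", "号"]}
UNIGRAM = {"你": 0.6, "尼": 0.4}
BIGRAM = {("你", "好"): 0.9, ("你", "号"): 0.1,
          ("尼", "好"): 0.2, ("尼", "号"): 0.8}
FLOOR = 1e-6  # probability assumed for transitions missing from BIGRAM


def viterbi(syllables):
    """Return the most probable character string for a pinyin sequence."""
    # best[c] = (log-probability, path) of the best path ending in c.
    best = {c: (math.log(UNIGRAM.get(c, FLOOR)), [c])
            for c in CANDIDATES[syllables[0]]}
    for syllable in syllables[1:]:
        nxt = {}
        for cand in CANDIDATES[syllable]:
            # Extend every surviving path by `cand` and keep the best one.
            score, path = max(
                (prev_score + math.log(BIGRAM.get((prev, cand), FLOOR)),
                 prev_path)
                for prev, (prev_score, prev_path) in best.items()
            )
            nxt[cand] = (score, path + [cand])
        best = nxt
    return "".join(max(best.values())[1])


if __name__ == "__main__":
    print(viterbi(["ni", "hao"]))
```

Working in log-probabilities avoids numerical underflow on long sentences; an N-best variant would keep a short ranked list per node instead of a single entry.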
Chinese character-to-braille automatic conversion is implemented as follows:
First, word segmentation and joined writing are applied to the Chinese text according to the Chinese braille segmentation and joined-writing rules, and the words are then converted to braille. Said word segmentation means writing words separately, one by one. Said joined writing means that, in keeping with the characteristics of braille and following the logic and conventions of Chinese grammar and speech and the principle of suitable syllable length, certain words are written together so that the syllable structure does not become too fragmented and the text is easy to read by touch.
The above automatic conversion from Chinese characters to Chinese braille may comprise the following steps:
1) First pre-split the text at non-Chinese symbols, read in one continuous Chinese character string, and segment it against the vocabulary using the MM method (forward maximum matching) and the RMM method (reverse maximum matching) separately;
2) Compare the MM and RMM segmentation results. If they are identical, record the segmentation result and go to step 4);
3) If the MM and RMM segmentation results differ, construct the ambiguity tree of the ambiguous field, search for the optimal segmentation, record the segmentation result, and go to step 4);
4) Determine whether segmentation of the text is finished. If so, modify the segmentation result according to the braille segmentation and joined-writing rules and generate the braille corresponding to the segmentation result; if not, go to step 1).
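Steps 1) to 3) compare forward and reverse maximum matching and treat any disagreement as an ambiguous field. A minimal sketch of that comparison follows; the vocabulary, the `max_len` limit, and the function names are assumptions chosen for illustration, and the ambiguity-tree search of step 3) is not shown.

```python
def forward_mm(text, vocab, max_len=4):
    """MM: scan left to right, always taking the longest vocabulary word."""
    words, i = [], 0
    while i < len(text):
        for n in range(min(max_len, len(text) - i), 0, -1):
            # A single character is accepted even if it is not in vocab.
            if n == 1 or text[i:i + n] in vocab:
                words.append(text[i:i + n])
                i += n
                break
    return words


def reverse_mm(text, vocab, max_len=4):
    """RMM: scan right to left, always taking the longest vocabulary word."""
    words, j = [], len(text)
    while j > 0:
        for n in range(min(max_len, j), 0, -1):
            if n == 1 or text[j - n:j] in vocab:
                words.append(text[j - n:j])
                j -= n
                break
    return words[::-1]


if __name__ == "__main__":
    vocab = {"研究", "研究生", "生命", "起源"}  # hypothetical toy vocabulary
    sentence = "研究生命起源"
    fwd, rev = forward_mm(sentence, vocab), reverse_mm(sentence, vocab)
    print(fwd, rev, "ambiguous" if fwd != rev else "agreed")
```

On this toy sentence the two scans disagree, which is the case the text hands over to the ambiguity tree.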
Comprehensive knowledge base: the various knowledge bases needed for Chinese-braille intertranslation, comprising:
(1) Electronic braille dictionaries: a Chinese character-to-braille electronic dictionary (60,000 words), a braille-to-Chinese character electronic dictionary (60,000 words), a Chinese word segmentation dictionary, and so on.
(2) Rule base: Chinese braille segmentation and joined-writing rules, morphological rules, phrase rules, syntactic rules, and so on.
(3) Statistical information base: to reflect Chinese context relations, an adjacent-word co-occurrence probability base for Chinese obtained from statistics over a real corpus of several hundred million characters, statistical knowledge of dependencies between parts of speech, and so on.
Ordinary-keyboard input interface module for blind users: the blind user enters Chinese characters or braille on an ordinary computer keyboard through a Chinese character input method for the blind and a braille input method. Uses the prior art.
Speech recognition device: there are two modes.
(1) Speaker-independent continuous speech recognition. This is prior art.
(2) Keyword speech recognition: the keywords in the input voice stream are recognized, which makes it easier to understand the speaker's meaning. It is used to distinguish computer commands that the blind user may phrase in various ways. This is prior art.
Natural language generation device: according to the interaction content under control, for example when a voice prompt or query is needed, produces Chinese sentences that have intonation and that blind users understand easily. At present, pre-recorded speech is selected according to the content and played.
Controller: exercises overall control over the several input and output channels above. The dialogue management layer is the core of the system. It organizes the whole dialogue process according to a given dialogue strategy, handles communication between the modules, and produces the system's reactions according to the corresponding decision rules, so that the human-computer interaction proceeds normally and with the expected effect.
The controller consists of a state analyzer, a dialogue manager, and a state store.
The states held in the state store comprise the system state and the dialogue state. The system state uses a data structure to describe how the current application module was started and is running (for example: program name, operation in progress, operation requirements), and also includes the input and output modes in use. The dialogue state reflects the current situation of the human-computer interaction. Because the form of the system's operation commands is restricted, the dialogue state is represented by a case frame (case grammar) of the form (env; act; obj; condition).
The dialogue manager analyzes the dialogue state and takes the corresponding dialogue action: it either completes a system action or issues the corresponding system prompt. The dialogue manager uses the widely used slot-filling algorithm to implement the corresponding dialogue strategy, in order to manage and schedule the dialogue process. This is prior art.
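A slot-filling dialogue step over the (env; act; obj; condition) case frame might look like the sketch below. Only the four slot names come from the text; the class `DialogueState`, the `PROMPTS` wording, the choice of `act` and `obj` as required slots, and the function `next_action` are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class DialogueState:
    """Case frame with the four slots named in the text."""
    env: Optional[str] = None
    act: Optional[str] = None
    obj: Optional[str] = None
    condition: Optional[str] = None


# Hypothetical prompts played when a required slot is still empty.
PROMPTS = {
    "act": "What would you like to do?",
    "obj": "Which file or object?",
}


def next_action(state: DialogueState,
                required: Tuple[str, ...] = ("act", "obj")) -> Tuple[str, str]:
    """Prompt for the first empty required slot, else run the command."""
    for slot in required:
        if getattr(state, slot) is None:
            return ("prompt", PROMPTS[slot])
    return ("execute", f"{state.act} {state.obj}")


if __name__ == "__main__":
    state = DialogueState(env="editor", act="open")
    print(next_action(state))   # the obj slot is still empty
    state.obj = "a1.txt"
    print(next_action(state))   # all required slots are filled
```

The design choice here is that the manager never guesses a missing slot: it either has a complete command to execute or it asks the user for exactly one missing piece.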
The state analyzer accepts the system's multimodal input and, according to the current system state, selects one of the following actions: start the dialogue manager; start the corresponding application module; send a message to an application module; or hand I/O control over to an application module. Finally, the state analyzer converts the input into a canonical form and places it in the system state store. This is prior art.
Inference system device
This device uses a semantic description system based on case grammar and an analysis algorithm based on robust pattern matching. The content of an utterance is represented semantically in the form of case frames. A case frame comprises a concept and its related attributes (slots), and recursive transition networks (RTN) describe the possible linguistic forms of these slots. A Viterbi beam search (Viterbi search with pruning, an optimization used here to speed up the state-transition-path search) is then carried out, and the final analysis result is obtained according to an evaluation score.
The invention can realize the following functions:
1. The invention has three input channels: ordinary keyboard input, OCR input, and voice input; and three output channels: voice output, printer output, and braille display output. Input and output can be combined in the following modes:
1) Braille input, Chinese-character text output (braille display output, display output, printer output). With the help of voice prompts, a document on braille paper is converted into an electronic document by the OCR converter, or a braille document is entered and edited from the keyboard, and it is then converted into a Chinese-character document by the braille-to-Chinese translation function. Output goes to an ordinary printer or the display. The devices used, in order, are: OCR converter, input interface, Chinese-braille converter (braille to Chinese characters), natural language understanding device, speech recognition device, natural language generation device, speech synthesis output device, comprehensive knowledge base, controller, printer output device (ordinary printer). (For interaction between blind and sighted users.)
2) Braille input, braille text output (printer output). With the help of voice prompts, a document on braille paper is converted into an electronic document by the OCR converter, or a braille document is entered and edited from the keyboard. Output goes directly to a braille printer or the braille display. The devices used, in order, are: OCR converter, input interface, controller, natural language understanding device, speech recognition device, natural language generation device, speech synthesis output device, braille display output device, printer output device (braille printer). (For interaction between blind users.)
3) Chinese-character text input, braille output (braille display output, printer output). With the help of voice prompts, a document on Chinese-character paper is converted into an electronic document by the OCR converter, or a Chinese-character document is entered and edited from the keyboard, and it is then converted into a braille document by the Chinese-to-braille translation function. Output goes to a braille printer or the braille display. The devices used, in order, are: OCR converter, input interface, controller, Chinese-braille converter (Chinese characters to braille), natural language understanding device, speech recognition device, natural language generation device, speech synthesis output device, braille display output device, comprehensive knowledge base, printer output device (braille printer). (Note: for teaching blind students and for braille publishing.)
4) Chinese-character text input, Chinese-character output (display, printer output). With the help of voice prompts, a document on Chinese-character paper is converted into an electronic document by the OCR converter, or a Chinese-character document is entered and edited from the keyboard. Output goes directly to an ordinary printer or the display. The modules used, in order, are: OCR converter, input interface, controller, natural language understanding device, speech recognition device, natural language generation device, speech synthesis output device, printer output device (ordinary printer). (For interaction between blind and sighted users.)
2. Automatic braille-to-Chinese character conversion: a braille document is converted automatically into a Chinese-character document. Devices used: Chinese-braille converter (braille to Chinese characters), comprehensive knowledge base.
3. Automatic Chinese character-to-braille conversion: a Chinese-character document is converted automatically into a braille document. Devices used: Chinese-braille converter (Chinese characters to braille), comprehensive knowledge base.
4. Listening to Chinese-character text (novels, magazines, newspapers, Chinese-character mail). The devices used, in order, are: OCR converter, controller, speech recognition device, natural language generation device, speech synthesis output device, natural language understanding device.
5. E-mail manager for blind users: the blind user can send and receive e-mail and have received and written mail read aloud. This involves voice navigation for the blind, braille or Chinese character input, Chinese-character text output, and reading documents aloud. Devices used: input interface, speech recognition device, natural language understanding device, inference system device, controller, natural language generation device, speech synthesis output device, Chinese-braille converter, comprehensive knowledge base, braille display output device, printer output device.
6. Browser for blind users: the blind user browses the various information on the network. This uses voice navigation for the blind, computer input for the blind, Chinese-character text output, and the function of reading Chinese-character text aloud to the blind user. Devices used: input interface, speech recognition device, natural language understanding device, inference system device, control interface, natural language generation device, speech synthesis output device, Chinese-braille converter, comprehensive knowledge base, braille display output device, printer output device.
7. Braille file manager: uses a command bar to help manage and query braille files. Devices used: input interface, inference system device, controller, natural language generation device, speech synthesis device, Chinese-braille converter, comprehensive knowledge base, natural language understanding device.
8. Voice navigation for the blind: the blind user can use the computer and the network freely. Every controllable menu command and hot key can be replaced by a spoken command, and spoken commands such as closing the mouse or closing voice input are also available. The devices used, in order, are: speech recognition device, natural language understanding device, inference system device, controller, natural language generation device, speech synthesis output device.
The invention is characterized by multiple interaction modes: the hardware configuration can be chosen according to each user's economic circumstances and needs, and blind users can make full use of their hearing, speech, and touch when using a computer. Blind users can choose the most natural and convenient way to operate the computer. Compared with traditional means of human-computer interaction, the system adopts a multimodal interaction style. The user can use traditional interaction devices such as the keyboard, mouse, braille display, and monitor, and can at the same time use newer interaction techniques such as speech and OCR. This makes the interaction more humane and intelligent. The system gives blind users a tool for document processing and for communicating with sighted people, and gives teachers in schools for the blind a teaching tool.
Description of drawings
Fig. 1 is a schematic diagram of the structure of the computer system for blind users according to the invention.
Fig. 2 is a schematic diagram of braille input in an embodiment of the invention.
Fig. 3 is a schematic diagram of the editor for blind users in an embodiment of the invention.
Fig. 4 is a schematic diagram of the e-mail manager for blind users in an embodiment of the invention.
Fig. 5 is a schematic diagram of the browser for blind users in an embodiment of the invention.
Embodiment
The intelligent Chinese computer system for blind users proposed by the invention is described below with reference to the drawings and embodiments:
One embodiment proposed by the invention is a minimum system for blind users. Its hardware comprises an ordinary personal computer with Internet access. Basic hardware requirements: a CPU equivalent to an Intel Pentium II 400 or better, at least 128 MB of memory, at least 4 GB of hard disk, a sound card, a microphone, a loudspeaker or earphone, and the basic configuration any ordinary computer needs. Basic software: the Microsoft Windows 9x or Windows 2000 operating system.
The composition and the course of work of present embodiment each several part are described in detail as follows:
1. Keyboard input: two modes are available
(1) Braille input: an international-standard braille keyboard layout is used, in which the six keys F, D, S and J, K, L correspond to the six dots of a braille cell, ordered left to right and top to bottom: first the three dots of the left column from top to bottom, then the three dots of the right column from top to bottom. Voice prompts are given during input so that the user knows which key was pressed and which sound it represents.
(2) Chinese pinyin input: the user can choose Western-language input, full pinyin, double pinyin, and so on. Voice prompts are given during input so that the user knows which key was pressed and which sound it represents. Candidates for polyphonic characters can be selected by voice prompt.
Opening or creating a braille file allows braille to be entered. Opening or creating a Chinese-character file allows ordinary Chinese characters to be entered.
Features: in addition to a voice prompt or response for every operation, the corresponding Chinese-character conversion is produced while braille is being entered, as shown in Fig. 2. This makes it easy for a sighted person (for example, a teacher) to proofread a braille manuscript and supports written communication between blind and sighted people.
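The six-key braille input described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the key-to-dot assignment (F, D, S for dots 1 to 3 and J, K, L for dots 4 to 6) follows the order given in the text, while the function names and the use of the Unicode braille block as the output encoding are assumptions made for the example.

```python
# Hypothetical mapping of the six home-row keys to braille dot numbers,
# following the order in the text: left column top to bottom (F, D, S),
# then right column top to bottom (J, K, L).
KEY_TO_DOT = {"F": 1, "D": 2, "S": 3, "J": 4, "K": 5, "L": 6}

BRAILLE_BASE = 0x2800  # Unicode code point of the blank braille pattern


def chord_to_dots(keys):
    """Return the sorted dot numbers for a set of keys pressed together."""
    return sorted(KEY_TO_DOT[k.upper()] for k in set(keys))


def dots_to_cell(dots):
    """Encode dot numbers 1..6 as a Unicode braille pattern character."""
    bits = 0
    for dot in dots:
        bits |= 1 << (dot - 1)  # dot n sets bit n-1 in the Unicode block
    return chr(BRAILLE_BASE + bits)


def describe_chord(keys):
    """Text that a speech synthesizer could speak as the input prompt."""
    return "dots " + " ".join(str(d) for d in chord_to_dots(keys))


if __name__ == "__main__":
    # Pressing F, D and J together produces dots 1, 2 and 4.
    print(dots_to_cell(chord_to_dots("FDJ")), describe_chord("FDJ"))
```

In a real system the string returned by `describe_chord` would be passed to the speech synthesizer, and the resulting cell would feed the braille-to-Chinese conversion shown in Fig. 2.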
2. Reading Chinese text aloud
An existing electronic Chinese-character document is read aloud by speech synthesis. Open the Chinese-character file and choose the menu item "Voice reading" to start reading the text aloud; choosing the menu item again stops the reading. In addition, the menu item at the current cursor position in the menu bar can be read aloud automatically.
Features: the blind user can not only listen to electronic Chinese documents but can also, through OCR conversion, read Chinese documents stored in other forms, such as CDs and books.
3. Voice navigation
The system uses keyword recognition to implement voice navigation. Commands can therefore be given with a variety of similar, loosely worded phrasings. For example, a user who wants to open the file a1.txt might say:
1) Open file a1.txt
2) File a1.txt, open it
3) Open a1.txt
4) a1.txt, open it
These four phrasings have the same meaning but differ greatly as speech input signals. What they have in common is a verb ("open") and a logical object (the "file name"). The same issue arises for commands such as copy and delete. The system finds the key verb in the speech input together with its object and the object's important attributes, and thereby recognizes and confirms the user's command. Keyword recognition systems are generally used for speaker-independent, continuous speech. The system adopts a keyword recognition method based on the HMM framework (hidden Markov model, a model that describes state-transition relationships), whose principle is as follows:
First, the input speech stream is segmented, each segment corresponding to a sentence or to a stretch of speech of about sentence length. Each segment is then searched to determine whether it contains a keyword and, if so, which one. The system input consists of keywords and of speech outside the keyword list; the latter is called garbage and comprises three parts: non-keyword words, non-speech sounds (lip smacks, breathing, and so on), and background noise. The system builds a set of HMMs for each keyword and likewise builds several sets of HMMs for garbage. For the feature-vector sequence of any input speech segment, the Viterbi algorithm yields the corresponding state sequence; if any state on that path belongs to a keyword model, the corresponding keyword is detected.
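The keyword-plus-garbage decoding described above can be illustrated with a toy Viterbi decoder. This sketch makes several simplifying assumptions that are not in the patent: observations are discrete symbols rather than acoustic feature vectors, there is a single keyword ("open", modeled by two states) and a single garbage state, and every probability is invented for the example.

```python
import math

NEG_INF = float("-inf")


def to_log(table):
    """Convert a nested dict of probabilities into log-probabilities."""
    return {k: {x: math.log(p) for x, p in row.items()} for k, row in table.items()}


# Toy model: one garbage state and a two-state keyword model for "open".
# All numbers are made up for illustration.
STATES = ["garbage", "open:1", "open:2"]
LOG_START = {"garbage": math.log(0.8), "open:1": math.log(0.2)}
LOG_TRANS = to_log({
    "garbage": {"garbage": 0.7, "open:1": 0.3},
    "open:1": {"open:1": 0.3, "open:2": 0.7},
    "open:2": {"open:2": 0.3, "garbage": 0.7},
})
LOG_EMIT = to_log({
    "garbage": {"noise": 0.4, "um": 0.4, "da": 0.1, "kai": 0.1},
    "open:1": {"da": 0.9, "noise": 0.1},
    "open:2": {"kai": 0.9, "noise": 0.1},
})


def viterbi(obs):
    """Return the most likely state sequence for the observation symbols."""
    score = {s: LOG_START.get(s, NEG_INF) + LOG_EMIT[s].get(obs[0], NEG_INF)
             for s in STATES}
    backpointers = []
    for symbol in obs[1:]:
        new_score, pointers = {}, {}
        for s in STATES:
            # Best predecessor of state s at this time step.
            prev = max(STATES, key=lambda p: score[p] + LOG_TRANS[p].get(s, NEG_INF))
            new_score[s] = (score[prev] + LOG_TRANS[prev].get(s, NEG_INF)
                            + LOG_EMIT[s].get(symbol, NEG_INF))
            pointers[s] = prev
        score = new_score
        backpointers.append(pointers)
    state = max(STATES, key=lambda s: score[s])
    path = [state]
    for pointers in reversed(backpointers):
        state = pointers[state]
        path.append(state)
    return path[::-1]


def spot_keywords(path):
    """List the keywords whose states appear on the decoded path."""
    found = []
    for state in path:
        word = state.split(":")[0]
        if state != "garbage" and word not in found:
            found.append(word)
    return found


if __name__ == "__main__":
    print(spot_keywords(viterbi(["um", "da", "kai", "noise"])))
```

A practical recognizer would score continuous feature vectors against trained acoustic models, but the decision rule is the same as in the text: a keyword is reported when the best path passes through that keyword's states.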
4. Voice system control. Its distinguishing feature is that several dedicated subsystems for the blind are integrated through voice navigation. The system determines its current working state and analyzes each spoken command in that working context in order to carry out the appropriate operation. Every command available through a menu or hot key can be replaced by a spoken command. Spoken commands can also be used to turn off the mouse, turn off speech input, and so on. (To avoid noise, the start of speech input is controlled by a specific key.)
Voice navigation not only integrates multiple subsystems for the blind but also gives this software a convenient, friendly mode of interaction, so that blind users can use modern technologies such as computers and networks more freely and take part in the information society.
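The idea of interpreting a spoken command according to the current working state can be sketched as a small dispatcher. The subsystem names, action names, and filler-word list below are hypothetical, and English words stand in for the recognized Chinese keywords; the sketch only shows how a verb and its object might be extracted and resolved per state.

```python
# Hypothetical command tables: which verbs are valid in which subsystem,
# and which action each verb triggers there.
COMMANDS = {
    "editor": {"open": "editor.open_file", "read": "editor.read_aloud",
               "delete": "editor.delete_text"},
    "mail": {"open": "mail.open_message", "read": "mail.read_message",
             "send": "mail.send_message"},
    "browser": {"open": "browser.open_url", "read": "browser.read_page"},
}

# Words that carry no command information in this toy vocabulary.
FILLERS = {"the", "file", "it", "please", "mail", "page"}


def interpret(words, state):
    """Map recognized words to (action, object) for the current state.

    Returns None when no verb valid in this state is found, so that the
    caller can prompt the user instead of executing anything.
    """
    table = COMMANDS.get(state, {})
    verb = next((w for w in words if w in table), None)
    if verb is None:
        return None
    obj = next((w for w in words if w != verb and w not in FILLERS), None)
    return table[verb], obj


if __name__ == "__main__":
    print(interpret("open the file a1.txt".split(), "editor"))
    print(interpret("a1.txt open it".split(), "editor"))
    print(interpret(["send"], "editor"))
```

Because the lookup is keyed by state, the same verb can trigger different actions in the editor, the e-mail manager, and the browser, and a verb that is not valid in the current state is rejected rather than executed.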
5. Editor for the blind and braille print output
The editor for the blind is an editor designed for convenient use by blind users. It must have the basic functions of an ordinary editor and provide appropriate voice interaction. The editor is keyboard-driven: the user controls the current working state from the keyboard. The braille input method and the Chinese-character input method described above are indispensable components of the editor. The editor is designed as follows. During input, after input ends, or after an electronic document is opened, the user can learn the current cursor position (which line and which column) from a voice prompt. With voice assistance the user can listen to, edit, and revise the document, for example by deleting, adding, or copying text and paragraphs. When a homophone cannot be identified, the user can invoke the explanation function, which distinguishes the character by naming a phrase that contains it. For example, for the character meaning "red" (红), pronounced hong, the computer tells the user by voice: "hong, the hong of red flag; the hong of the color red". Likewise, for the character meaning "flood" (洪), also pronounced hong, the user is told that it is the hong of "flood". If needed, the English translation of a word can be obtained from a Chinese-English dictionary; if the text is English, it can be read aloud and, if needed, its Chinese meaning can be explained using an English-Chinese dictionary. Finally, the user can choose continuous reading to listen to the full text. The core functions of the editor are therefore: state control, in which the available functions differ by state (for example, arbitrary deletion is not allowed in the file-management state, and when the cursor leaves the file operation area the user is prompted or helped to return); reading aloud; and keyboard or voice control, in which keyboard commands or spoken commands help the user complete the tasks of listening to, understanding, and writing documents, as shown in Fig. 3.
1) State control:
Monitor the current cursor position to prevent the system from performing illegal operations.
2) Reading aloud:
Report the current cursor position (line and column); read the letter or Chinese character to the left of the cursor or to its right; explain a Chinese character when necessary, and explain or translate Chinese and English words; read continuously from the current position; stop.
3) Automatic reading aloud:
Read automatically when the cursor moves left or right; when the cursor moves up or down, automatically read the character to its right or the character to its left.
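The cursor reporting and automatic reading functions listed above can be sketched with a small text-buffer class. The class name, method names, and the exact wording of the spoken message are assumptions made for this example; the returned strings stand in for what would be sent to the speech synthesizer.

```python
class SpokenCursor:
    """Toy text buffer whose cursor movements produce spoken descriptions."""

    # Spoken names for characters that cannot be read out directly.
    NAMES = {"\n": "new line", " ": "space"}

    def __init__(self, text, offset=0):
        self.text = text
        self.offset = offset  # cursor sits just before text[offset]

    def position(self):
        """Return the 1-based (line, column) of the cursor."""
        before = self.text[:self.offset]
        line = before.count("\n") + 1
        column = len(before) - (before.rfind("\n") + 1) + 1
        return line, column

    def left_char(self):
        return self.text[self.offset - 1] if self.offset > 0 else None

    def right_char(self):
        return self.text[self.offset] if self.offset < len(self.text) else None

    def _name(self, ch):
        if ch is None:
            return "nothing"
        return self.NAMES.get(ch, ch)

    def announce(self):
        """Build the message reporting position and neighbouring characters."""
        line, column = self.position()
        return (f"line {line}, column {column}, "
                f"left {self._name(self.left_char())}, "
                f"right {self._name(self.right_char())}")

    def move(self, delta):
        """Move the cursor (clamped to the buffer) and announce automatically."""
        self.offset = max(0, min(len(self.text), self.offset + delta))
        return self.announce()


if __name__ == "__main__":
    cursor = SpokenCursor("ab\ncd", offset=4)
    print(cursor.announce())
    print(cursor.move(-1))
```

Calling `announce` from every cursor movement is one simple way to obtain the automatic reading behavior; a fuller editor would also refuse moves that leave the permitted editing area, as the state-control function requires.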
The system can be connected directly to a braille printer, such as the INDEX BRAILLE series of products from the Index Embossers company of Sweden, to print the current braille electronic document in braille.
6. E-mail manager for the blind, as shown in Fig. 4.
Under voice control, voice navigation, voice reading, and braille-to-Chinese and Chinese-to-braille conversion are added to an ordinary e-mail manager. The e-mail manager for the blind provides the basic functions of a common e-mail manager, such as writing, sending, and receiving mail and maintaining an address book. It can read received and composed mail aloud to the user. Blind users send and receive e-mail by interacting with the system through speech. For example, the computer announces "You have new mail"; asks "Do you want to receive mail?" or "Do you want the mail read aloud?"; and prompts "Please enter the recipient's address", "Please enter the e-mail subject", and so on. Every operation is followed by a corresponding voice response or voice prompt.
7. Browser for the blind, as shown in Fig. 5.
Under voice control, voice navigation, voice reading, OCR conversion, and braille-to-Chinese and Chinese-to-braille conversion are added to an ordinary browser. The browser for the blind provides the basic functions of a common browser, such as searching and reading. It can read web page content aloud to the user. The user enters keywords and web addresses, runs queries, and performs similar operations by interacting with the system through speech. For example, the computer announces "You have arrived at such-and-such web page"; asks "What do you want to search for?"; and prompts "Please enter a keyword", "Please enter a web address", and so on. After the specified page is reached, its content is read aloud in the order in which the page is laid out. Reading can be skipped or ended from the keyboard.
8. Intelligent inference function
This is a medium-scale human-computer interaction system for the blind whose application domain is common computer operations and simple information queries. Because it is used mainly by blind people, it must respond well in real time and be very easy to use. With these factors in mind, the system adopts a semantic description based on case grammar and an analysis algorithm based on robust pattern matching. Utterance content is represented semantically in the form of case frames. A case frame comprises a concept and its related attributes (slots), and the possible linguistic forms of these slots are described with recursive transition networks (RTN). During analysis, a top-down RTN chart parsing algorithm matches the sentence. Words in the input sentence that are outside the system dictionary are discarded without analysis, and ungrammatical constituents in the input are skipped, while the search looks for fragments (meaningful phrases) that can form concepts. A Viterbi beam search is carried out, and the final analysis result is obtained according to an evaluation score. The phrases found are mapped to semantic frames, which yields one or more case frames representing the sentence. The interaction content is analyzed dynamically and the system state is updated in real time.
At the same time, the system makes timely topic predictions, estimates the user's next action, and optimizes the knowledge-base search algorithm. It analyzes the user's needs and formulates and carries out different interaction strategies. It predicts and guides on the basis of the user's past and current behavior, speeds up the system's search functions, and prevents the system from entering an endless loop because a spoken command was misrecognized or misinterpreted.
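The robust case-frame analysis described above can be illustrated with a heavily simplified slot filler. This sketch is not the RTN chart parser or Viterbi beam search named in the text: it replaces the recursive transition networks with flat sets of allowed slot values and replaces the evaluation score with a count of filled slots. The lexicon, frame names, and the rule that a token containing a dot is a file name are all hypothetical.

```python
# Hypothetical lexicon: word -> (slot, value).
LEXICON = {
    "open": ("action", "open"),
    "delete": ("action", "delete"),
    "read": ("action", "read"),
    "file": ("object_type", "file"),
    "mail": ("object_type", "mail"),
}

# Hypothetical case frames: concept -> allowed values for each slot.
FRAMES = {
    "FileOperation": {"action": {"open", "delete"}, "object_type": {"file"}},
    "ReadAloud": {"action": {"read"}, "object_type": {"file", "mail"}},
}


def looks_like_name(token):
    """Toy rule: treat any token containing a dot as a file name."""
    return "." in token


def parse(tokens):
    """Return (concept, slots) for the best-scoring case frame, or None."""
    best = None
    for concept, allowed in FRAMES.items():
        slots = {}
        for tok in tokens:
            if looks_like_name(tok):
                slots.setdefault("name", tok)
            elif tok in LEXICON:
                slot, value = LEXICON[tok]
                if value in allowed.get(slot, ()):
                    slots.setdefault(slot, value)
            # Any other token is out of vocabulary and is skipped.
        if "action" not in slots:
            continue  # a frame without its action cannot form a concept
        score = len(slots)  # stand-in for the evaluation score
        if best is None or score > best[0]:
            best = (score, concept, slots)
    return None if best is None else (best[1], best[2])


if __name__ == "__main__":
    print(parse("um please open the file a1.txt".split()))
    print(parse("read mail".split()))
```

Even in this reduced form the two robustness properties from the text are visible: out-of-vocabulary words such as "um" and "please" are dropped, and the sentence is interpreted from whatever meaningful fragments remain.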
The invention establishes the tens of comprehensive knowledge bases required for Chinese-braille intertranslation and, for the first time, applies the theory of Chinese natural language understanding to automatic braille processing. It realizes an intelligent computer system for Chinese blind users that performs automatic braille-to-Chinese and Chinese-to-braille conversion and integrates braille and Chinese input and editing, e-mail sending, receiving, and management for the blind, and voice system control. Theories of artificial-intelligence knowledge representation and reasoning, topic prediction, and content analysis are applied to system state analysis, giving the system a degree of spoken human-computer interaction. Through human-computer dialogue the system prompts and guides the user, which makes it easy to use.
Claims (1)
1. An intelligent Chinese computer system for the blind, mainly comprising: an ordinary Internet-capable personal computer; a braille display output device for converting Chinese characters or braille displayed by the computer into standard braille ASCII codes and outputting them; an optical character recognition device for receiving an image obtained by a scanner and converting it into computer text by recognition; a keyboard input device and editor for the blind for entering Chinese characters and braille; a voice input device for conveying voice information or control commands to the computer; a printer output device for outputting the content that the computer is to output; and a speech synthesis output device for converting sentences, phrases, words, or syllables into sound waveforms and outputting them; characterized in that the system further comprises: a Chinese-braille converter arranged in said computer for automatic conversion from Chinese braille to Chinese characters and from Chinese characters to Chinese braille; a comprehensive knowledge base arranged in said computer for managing the various knowledge bases required for Chinese-braille intertranslation; a speech recognizer arranged in said computer for converting said voice input stream into computer text or for identifying the keywords in that voice input stream; a natural language generation device arranged in said computer for generating Chinese sentences with intonation; a unified input interface formed by three input channels, namely said optical character recognition device, said keyboard input device and editor for the blind, and said voice input device; three output channels formed by the braille display output device, the printer output device, and the speech synthesis output device; a master controller arranged in said computer for overall control among said three input channels and three output channels; and an inference system device arranged in said computer for analyzing the content of the human-computer interaction, making timely topic predictions, estimating the user's next action, analyzing user needs, and formulating and carrying out different interaction strategies; wherein said Chinese-braille converter comprises an automatic converter from Chinese braille to Chinese characters and an automatic converter from Chinese characters to Chinese braille; the automatic converter from Chinese braille to Chinese characters recognizes the braille after a braille book has been scanned, or after braille has been entered from the keyboard, and converts the braille into Chinese characters by way of pinyin; in each stage of said pinyin-to-Chinese-character conversion, the Chinese braille comprehensive knowledge base is used and, in a pinyin-to-Chinese-character conversion search graph weighted with transition probabilities, a state-transition path search is used to obtain the N best results, thereby realizing automatic conversion from braille to Chinese characters; said automatic converter from Chinese characters to Chinese braille first applies word segmentation and word joining to the Chinese text according to the Chinese braille word segmentation and joining rules and then converts the words into braille; said word segmentation writes the words separately, one by one; and said word joining follows the characteristics of braille, writing certain words together according to the logic of Chinese grammar, the habits of speech, and a suitable syllable length, so that the syllable structure is not too scattered and the text is easy to read by touch.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 01129619 CN1121015C (en) | 2001-06-22 | 2001-06-22 | Intelligent Chinese computer system for the blind |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 01129619 CN1121015C (en) | 2001-06-22 | 2001-06-22 | Intelligent Chinese computer system for the blind |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1323003A CN1323003A (en) | 2001-11-21 |
CN1121015C true CN1121015C (en) | 2003-09-10 |
Family
ID=4669316
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 01129619 Expired - Fee Related CN1121015C (en) | 2001-06-22 | 2001-06-22 | Intelligent Chinese computer system for the blind |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN1121015C (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103838358A (en) * | 2012-11-23 | 2014-06-04 | 英业达科技有限公司 | Braille electronic device and Braille reading and voice-playing method |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN100337232C (en) * | 2004-08-04 | 2007-09-12 | 华建电子有限责任公司 | Braille-Chinese contrapositive editing/typesetting system and editing/typesetting method |
US7715570B2 (en) * | 2005-12-12 | 2010-05-11 | International Business Machines Corporation | Method and system for providing audio-guided deployment of data processing systems |
CN102799433A (en) * | 2012-07-04 | 2012-11-28 | 桂林电子科技大学 | Implementing method of software capable of being used by disabled people |
CN105404621B (en) * | 2015-09-25 | 2018-07-10 | 中国科学院计算技术研究所 | A kind of method and system that Chinese character is read for blind person |
CN106356057A (en) * | 2016-08-24 | 2017-01-25 | 安徽咪鼠科技有限公司 | Speech recognition system based on semantic understanding of computer application scenario |
CN107093353A (en) * | 2017-06-28 | 2017-08-25 | 西安电子科技大学 | Blindmen intelligent terminal interaction accessory system |
CN111833872B (en) * | 2020-07-08 | 2021-04-30 | 北京声智科技有限公司 | Voice control method, device, equipment, system and medium for elevator |
Also Published As
Publication number | Publication date |
---|---|
CN1323003A (en) | 2001-11-21 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C06 | Publication | ||
PB01 | Publication | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C19 | Lapse of patent right due to non-payment of the annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |