CN1101577C - Speech input memorandum - Google Patents

Speech input memorandum Download PDF

Info

Publication number
CN1101577C
CN1101577C CN 98117693 CN98117693A CN1101577C CN 1101577 C CN1101577 C CN 1101577C CN 98117693 CN98117693 CN 98117693 CN 98117693 A CN98117693 A CN 98117693A CN 1101577 C CN1101577 C CN 1101577C
Authority
CN
China
Prior art keywords
speech
microprocessor
input
converter
recognition device
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN 98117693
Other languages
Chinese (zh)
Other versions
CN1247347A (en
Inventor
刘迎建
马梁
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hanwang Science and Technology Co., Ltd., Beijing
Original Assignee
ZHONGZI HANGWANG SCIENCE AND TECHNOLOGY Co BEIJING
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZHONGZI HANGWANG SCIENCE AND TECHNOLOGY Co BEIJING filed Critical ZHONGZI HANGWANG SCIENCE AND TECHNOLOGY Co BEIJING
Priority to CN 98117693 priority Critical patent/CN1101577C/en
Publication of CN1247347A publication Critical patent/CN1247347A/en
Application granted granted Critical
Publication of CN1101577C publication Critical patent/CN1101577C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Landscapes

  • Calculators And Similar Devices (AREA)
  • Document Processing Apparatus (AREA)

Abstract

The present invention relates to a speech input memorandum which belongs to a personal digital assistant, namely a called electronic memorandum, PDA for short. The present invention is a PDA product based on speech, and can record a name in the function of an ordinary PDA name card in a speech mode. In an inquiry process, as long as a human name is spoken, corresponding phone numbers can be displayed; the speech is used for reading the human name and company/address information, wherein the phone numbers can also be inputted by the speech. The present invention can also store 30 minutes of digital records in a segmented mode; the records can be played in preset time. The present invention can complete functions identical with those of a traditional PDA stroke and a traditional PDA memorandum in the speech mode.

Description

Speech input memorandum
The invention belongs to " personal digital assistant " (Personal Digital Assistant), promptly usually said electronic notebook is called for short " PDA ".
At present, the PDA product can be divided into two big classes:
With the low-grade PDA headed by homemade " Wenquxing " and Hong Kong " electricity is translated logical ", be characterized in keyboard input, adopt encoding schemes such as phonetic or five, convert the GB Chinese character to, store or inquire about.Be characterized in simple in structure, memory consumption is few, and CPU requires low.Thereby price is very low.But because keyboard is little, operation inconvenience.Add that the viewing area is little, the candidate of phonetic is many, and the phonetic input is had any problem.And many people can five or spelling input method, can't use.
Also have with the Taiwan instant translator, translate logically well, the high-grade PDA of the artificial representative of Hong Kong name adopts handwriting input, a lot of specific functions is arranged, as infrared interface, digital recording, beeper reception etc.Because carry out handwriting input, these products all are furnished with bigger LCD, pressure type touch-screen, the RAM that 128K is above, and 1 to 4,000,000 ROM.Also very high to the requirement of CPU simultaneously, processing power all meets or exceeds the level of 286-16.Therefore their price is all very high.The input method of handwriting recognition is more humane, but CPU processing power and storage space is required very high, and the system requirements of for example intelligent pen are: Pentium is more than 75,8,000,000 internal memories.Because the restriction of PDA processing power, the process of writing are still not too convenient, require carefully and neatly to write, and the discrimination of its handwriting input, adaptive faculty and speed, all also far short of what is expected than the identification software of PC version.A lot of people feel and are inconvenient.
At present, in PDA, also do not possess the Chinese speech input function,, can only be used in the mainframe computer though the phonetic entry of English and numeral thereof is abroad arranged.
The objective of the invention is to designed PDA and have following characteristics:
1. when " name " in the business card function imported, read name, the feature with sound identification module analysis and extraction voice stores in the cog region of storer.Simultaneously with compression module with compress speech, store in the recording zone of storer.
2. when retrieval, extract the sound characteristic of the name of reading with sound identification module, and with storer in the feature comparison, find corresponding clauses and subclauses.
3. input digit and English adopt the little vocabulary speech recognition of specific people, compare with the eigenwert of extracting in advance with the eigenwert of sound identification module with the voice of institute's reading word, letter, identify corresponding digital or letter.
4. when consulting content, with the broadcast of contracing of the voice decompress(ion) of name, address in the storer recording zone.But not common demonstration Chinese character.
5. when consulting content, telephone number is except that character display, and also available massage voice reading comes out.
Purpose of the present invention realizes in the following manner:
As shown in Figure 1, microphone is connected with the input amplifilter; The input amplifilter is connected with high-precision a/d converter; A/D converter is connected with speech recognition device and compress speech/decompression machine simultaneously; Speech recognition device links to each other with main control microprocessor by data bus and control signal wire.Speech compressor links to each other with main control microprocessor by data bus and control signal wire.Microprocessor is connected with storer, and wherein memory inside is divided into characteristic storage district and recording memory block.
The effect of this circuit is with the simulating signal of phonetic entry, is converted to digital code stream, and the characteristic extracting module of speech recognition device is extracted phonetic feature from speech data, and passes to microprocessor by data bus.Speech compressor is also passed to microprocessor by data bus after speech data is compressed.After processor obtains speech data and phonetic feature, they are deposited in respectively in the zones of different of storer use for identification and playback.
Microprocessor also is connected with decompressor, and decompressor is connected with D/A converter, and D/A converter is connected with the output amplifilter, and the output amplifilter is connected with loudspeaker.
The effect of this circuit is with the reduction of the speech data after the compression and plays.
Microprocessor also links to each other with the feature comparison module of speech recognition device by data bus and control signal wire.Speech recognition device directly links to each other with storer again.And microprocessor also is connected with display.
The effect of this circuit is feature comparison module, the eigenwert in the usefulness storer and this value comparison that speech recognition device passed to phonetic feature by microprocessor.And the result passed to microprocessor, to search information such as telephone number corresponding, phone is shown by display, and by the loudspeaker plays relevant information.
Outward appearance of the present invention is as shown in Figure 2: this notepad can be held in the palm, one-handed performance.And circuit output plughole 1 is housed, acknowledgement key 2, cancel key 3, loudspeaker 4, display screen 5, knob 6 and microphone 7.
The present invention compares with existing PDA.Have the following advantages:
Speech input memorandum does not need speech conversion is become literal, and is low to system requirements, do not miss the knowledge problem, and input easily.Ten numerals adopt the speech recognition input, and because of vocabulary is little, discrimination is very high, and speed is fast.Adopt the method for specific people's study, do not have the restriction of accent, language.Therefore phonetic entry is the input method of present most convenient.All inputs and inquiry, reading process all realize that by voice all operations all has voice suggestion, and blind person or the people who has defective vision also can easily use, and technology maturation, the low-grade PDA of close prices.
Description of drawings:
Fig. 1 is the speech input memorandum system diagram
Fig. 2 is the speech input memorandum outside drawing:
1-circuit output plughole; 2-acknowledgement key; 3-cancel key; 4-loudspeaker; 5-show
Screen; 6-knob; 7-microphone.
Fig. 3 is the voice output process flow diagram
Fig. 4 is the speech retrieval process flow diagram
Fig. 5 is the phonetic entry process flow diagram of numeral
Fig. 6 is the phonetic entry process flow diagram
Embodiment;
The present invention can use 2 joint 3A batteries, can use more than 30 days.Product appearance as shown in Figure 2, shell is streamlined, elegant in appearance, small and exquisite and easy to operate.The 16*80 lattice type LCD is adopted in positive demonstration, and 16 more eye-catching icons are arranged.
Knob and affirmation, cancel key are housed respectively in the left and right sides of product, and the upper end is circuit delivery outlet and microphone, and the lower end is provided with loudspeaker.
Adjusting knob upwards transfers increment to, is downwards decrement, can select digital 0-9 easily, alphabetical A-Z and each menu option.
Affirmation/cancel key: the outward appearance long strip type, respectively there is a switch at inner two ends up and down.This key is confirmed by top, is cancellation by the bottom.
The recorded content of this notepad:
Can write down 200 business cards or 30 minutes voice, after business card increased, record length can shorten.
Each business card has maximum 2 seconds name recording and personal information recording in maximum 5 seconds.Maximum 5 telephone numbers: phone 1, phone 2, fax, home telephone, BP.
Digital recording can be divided into multistage, amounts to maximum 30 minutes, all can set reproduction time for every section, then begins automatically to play, as appointment/stroke reminding.
Circuit output and microphone interface:
Use this interface, the user can connect earphone and the microphone of oneself.Also can computing machine be transcribed in recording and carry out speech recognition by circuit output.
The course of work of this notepad is as follows:
One, phonetic entry: the analog voice signal of microphone input, entering the input amplifilter becomes analog electrical signal, is converted to digital code stream by A/D converter again, passes to speech recognition device and speech compressor simultaneously.The characteristic extracting module of speech recognition device is extracted phonetic feature from speech data, and passes to microprocessor by data bus.Speech compressor is also passed to microprocessor by data bus after speech data is compressed.After processor obtains speech data and phonetic feature, they are deposited in respectively in the zones of different of storer.
During retrieval, sound identification module, the eigenwert in the speech recognition device usefulness storer and this value comparison of speech recognition device passed to phonetic feature by processor.And the result passed to microprocessor.
Two, voice output: as shown in Figure 3, behind the playback, microprocessor control voice decompressor and D/A converter are started working, microprocessor takes out the compressed voice data from storer simultaneously, pass to the voice decompressor, the data stream after decompressor will decompress is exported to D/A converter, is converted to simulating signal, after the amplification of output amplifilter, export by loudspeaker.
Three, speech retrieval: as shown in Figure 4, microprocessor control display display reminding information, and control A/D converter, speech compressor and speech recognition device are started working.The voice signal that microphone is gathered through amplification, filtering, carries out the A/D conversion with A/D converter to voice signal, obtains data stream.The characteristic extracting module of speech recognition device is extracted phonetic feature, and passes to microprocessor, and microprocessor controlling features comparison module compares the phonetic feature of the name stored in this phonetic feature and the storer, and the result is passed to microprocessor.Microprocessor shows corresponding recorded information according to the result, and enters editing mode.
Operate as follows: 1. primary option menu:
At first enter master menu after the start, turning knob, cursor switches between options such as " retrieval business card ", " browse business card ", " consulting recording ", " input business card ", " recording ", " default ", " internal memory situation ", " time set ", " voice training ", " recreation ", and dot matrix LCD goes up the icon that shows each function.Cursor is along with selecting button to move, and after stopping 1 second, the term pronunciation goes out this function title.Enter function by acknowledgement key, return previous menu by " cancellation ".2. business card input
Enter " business card input " function, cursor moves between " name input ", " personal information ", " input of name phonetic ", " phone input ".
1) name input service process as shown in Figure 4.
Operate as followsly, earlier cursor is moved on on " name ", one second attonity, then voice suggestion " please be read name ", pins " affirmation ", microprocesser initialization A/D converter, speech compressor and speech recognition device.Begin recording then, acknowledgement key unclamps the back to be finished, and surpasses 2 seconds, overtime warning.Repeat to record and then cover last content.The voice signal that microphone is gathered through amplification, filtering, carries out the A/D conversion with A/D converter to voice signal.Speech recognition device extracts phonetic feature, and deposits characteristic in storer by control chip.Simultaneously, speech compressor deposits data compression in storer by control chip.
2) personal information input: earlier cursor is moved on on " personal information ", one second attonity, then voice suggestion " personal information " is pinned " affirmation " and is begun recording, and finish the back of loosing one's grip, and surpasses 5 seconds, overtime warning.Repeat to record and to cover last content.In the Recording Process,, and extract phonetic feature without recognizer only with compression storage after the voice digitization.
3) telephone number input:
Available knob or phonetic entry, digital speech input service process as shown in Figure 5.
Operate as followsly, with selecting button that cursor is moved on to: any in phone 1, phone 2, home telephone, fax, the beeper enters by " affirmation ".Rotate to select button can be in phonetic entry, digital 0-9, space, return and select between deleting, finishing.During display digit 0-9, import and enter next numeral by " affirmation ", choosing " is returned and is deleted " and then deletes a numeral, pins back and deletes then full scale clearance in 2 seconds.Wherein, phonetic entry, return to delete and be icon, when choosing phonetic entry, pin " affirmation ", read a numeral, unclamp back identification, microprocessor initialization A/D of elder generation and speech recognition device, the voice signal that microphone is gathered through amplification, filtering, carries out the A/D conversion with A/D converter to voice signal.Identification module extracts phonetic feature, and with storer in the phonetic feature of 0-9 compare, and the result is passed to microprocessor.The microprocessor control display screen shows corresponding digital 0-9, enters the next one then automatically.Then store and get back to the upper level option by " cancellation " key.
4) by " cancellation " key, whether prompting stores, and returns the retrieval of master menu 3. business cards after the user selects:
Browse and the speech retrieval dual mode.
1) speech retrieval:
The retrieval course of work as shown in Figure 6.
Operate as followsly, select business card retrieval under the master menu, begin recording with LCD and voice suggestion user.The user pins " affirmation " key and reads name simultaneously and unclamp then.Microprocessor control A/D, speech recognition device are started working.The voice signal that microphone is gathered through amplification, filtering, carries out the A/D conversion with A/D converter to voice signal.Identification module extracts phonetic feature, and with storer in the phonetic feature of the name stored compare, and the result is passed to microprocessor.Microprocessor shows corresponding recorded information according to the result.Call over name, address, phone simultaneously.Stop and entering editing mode by " affirmation ".When the close candidate of a plurality of pronunciations was arranged, if rotary knob then can switch between each candidate's business card, display showed sequence number and first telephone number.Stopped 1 second, and then called over name, personal information and phone.Stop and entering editing mode by " affirmation " key.Return master menu by " cancellation " key.
2) browse:
After entering, cursor is moved between name, remarks, phone and " deletion business card " icon, when moving on to name, address, stop to read after one second with knob.
Enter respective selection by " affirmation ", make amendment, then return upper level by " cancellation ".4. numeral and English alphabet recording:
Adopt the little vocabulary speech recognition of specific people, specific people is meant: recognizer is trained at specific user's sound pronunciation.To unbred user, the low or not identification of discrimination.Little vocabulary is meant: identified range is confined to (be generally less than 1000) in the limited vocabulary, rather than instructs arbitrarily or statement.Identifying is exactly the voice phrase of input and the voice phrase of setting are in advance compared and to mate, and finds out the most close result.
1) recording
Operate as followsly, select sound-recording function, pin acknowledgement key, begin recording.
Whether the inquiry of recording back regularly broadcasts, and selects "Yes" then to continue to require setting-up time.Then can broadcast.
2) retrieval
Can only check in proper order.Show record length when checking, length and playback duration are set.Began after stopping 1 second to play.By after " affirmations " enter, can revise timing reproduction time or deletion.5. system's setting
Identification parameter, volume and voice suggestion can be set opens or closes.
Each operation all has Chinese character or figure and voice suggestion, and wherein voice suggestion can be closed.6.0-9 the study of numeral
Each numeral reads twice, to voice gather and digitizing after, extract feature and be kept in the storer with recognizer.Behind the learning success with regard to available phonetic entry numeral.

Claims (2)

1. one kind by microphone, the speech input memorandum that input amplifilter, A/D converter, D/A converter, speech recognition device, speech compressor, voice decompressor, data bus, control signal wire, microprocessor, output amplifilter, loudspeaker, storer, button, knob, display screen constitute is characterized in that: microphone is connected with the input amplifilter; The input amplifilter is connected with high-precision a/d converter; A/D converter is connected with speech recognition device and compress speech/decompression machine simultaneously; Speech recognition device links to each other with microprocessor by data bus and control signal wire; Speech compressor links to each other with microprocessor by data bus and control signal wire; Microprocessor is connected with storer, and wherein memory inside is divided into characteristic storage district and recording memory block; Microprocessor also is connected with the voice decompressor, and the voice decompressor is connected with D/A converter, and D/A converter is connected with the output amplifilter, and the output amplifilter is connected with loudspeaker; Microprocessor also links to each other with the voice comparison module of speech recognition device by data bus and control signal wire; Speech recognition device directly links to each other with storer again; Microprocessor also is connected with display.
2. the described speech input memorandum of claim 1 is characterized in that the shell of notepad is equipped with microphone, and acknowledgement key is set, cancel key, knob, display screen and loudspeaker.
CN 98117693 1998-09-09 1998-09-09 Speech input memorandum Expired - Fee Related CN1101577C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 98117693 CN1101577C (en) 1998-09-09 1998-09-09 Speech input memorandum

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 98117693 CN1101577C (en) 1998-09-09 1998-09-09 Speech input memorandum

Publications (2)

Publication Number Publication Date
CN1247347A CN1247347A (en) 2000-03-15
CN1101577C true CN1101577C (en) 2003-02-12

Family

ID=5225645

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 98117693 Expired - Fee Related CN1101577C (en) 1998-09-09 1998-09-09 Speech input memorandum

Country Status (1)

Country Link
CN (1) CN1101577C (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101874246B (en) * 2007-09-27 2017-05-10 西门子通讯公司 Method and apparatus for secure electronic business card exchange

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8219146B2 (en) 2009-11-06 2012-07-10 Sony Corporation Audio-only user interface mobile phone pairing
CN102207845A (en) * 2011-05-25 2011-10-05 盛乐信息技术(上海)有限公司 Device and method for managing schedule in voice manner
CN108682332A (en) * 2018-05-21 2018-10-19 南通市第人民医院 A kind of special patient's identification card of hemodialysis and deck

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101874246B (en) * 2007-09-27 2017-05-10 西门子通讯公司 Method and apparatus for secure electronic business card exchange

Also Published As

Publication number Publication date
CN1247347A (en) 2000-03-15

Similar Documents

Publication Publication Date Title
US6864809B2 (en) Korean language predictive mechanism for text entry by a user
KR101586890B1 (en) Input processing method and apparatus
CN1301026C (en) Mobile terminal voice telephone directory system
CN1333501A (en) Dynamic Chinese speech synthesizing method
CN101825953A (en) Chinese character input product with combined voice input and Chinese phonetic alphabet input functions
CN1101577C (en) Speech input memorandum
CN1224889C (en) Chinese character input method and system by using said method
CN201251767Y (en) Intelligent electronic dictionary
CN1755589A (en) Letter inputting method and apparatus based on press-key and speech recognition
CN101035154A (en) Blind portable electronic address list
CN1979468A (en) Sound-controlled document inquiring system and method thereof
CN1510554B (en) Embedded applied Chinese character inputting method
CN1854997A (en) Numbers and alphabets inputting method
CN1384426A (en) Dian code Chinese character input method for computer
CN1218212A (en) Method and apparatus for operation by hand written alphabets and symbols
CN1472626A (en) Intelligent embedded character inputting method and device
CN1105985C (en) Device and method for Chinese input by hand writing and speech sound
CN2424513Y (en) Multifunctional speech identifying notebook and large capacity digital recording integrated machine
CN1198198C (en) Character input method and unit
CN2845026Y (en) Voice controlled file searching system
CN1269542A (en) Association Chinese character input system
CN1501223A (en) Press key input system and method
CN1232336A (en) Voice command system for automatic dialing
CN1752900A (en) Chinese character input method combined with speech sound identification
CN101034403A (en) Intelligent search method of small keyboard electronic dictionary

Legal Events

Date Code Title Description
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C06 Publication
PB01 Publication
C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: BEIJING HANWANG SCIENCE AND TECHNOLOGY CO., LTD.

Free format text: FORMER OWNER: ZHONGZI HANGWANG SCIENCE AND TECHNOLOGY CO., BEIJING

Effective date: 20030418

C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20030418

Patentee after: Hanwang Science and Technology Co., Ltd., Beijing

Patentee before: Zhongzi Hangwang Science and Technology Co., Beijing

C19 Lapse of patent right due to non-payment of the annual fee
CF01 Termination of patent right due to non-payment of annual fee