CN1247347A - Speech input memorandum - Google Patents

Speech input memorandum Download PDF

Info

Publication number
CN1247347A
CN1247347A CN 98117693 CN98117693A CN1247347A CN 1247347 A CN1247347 A CN 1247347A CN 98117693 CN98117693 CN 98117693 CN 98117693 A CN98117693 A CN 98117693A CN 1247347 A CN1247347 A CN 1247347A
Authority
CN
China
Prior art keywords
speech
microprocessor
converter
recognition device
speech recognition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN 98117693
Other languages
Chinese (zh)
Other versions
CN1101577C (en
Inventor
刘迎建
马梁
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hanwang Science and Technology Co., Ltd., Beijing
Original Assignee
ZHONGZI HANGWANG SCIENCE AND TECHNOLOGY Co BEIJING
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZHONGZI HANGWANG SCIENCE AND TECHNOLOGY Co BEIJING filed Critical ZHONGZI HANGWANG SCIENCE AND TECHNOLOGY Co BEIJING
Priority to CN 98117693 priority Critical patent/CN1101577C/en
Publication of CN1247347A publication Critical patent/CN1247347A/en
Application granted granted Critical
Publication of CN1101577C publication Critical patent/CN1101577C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Landscapes

  • Document Processing Apparatus (AREA)
  • Calculators And Similar Devices (AREA)

Abstract

A speech input memorandum, belonging to a personal digital assitant (PDA), is a PDA based on speech, and features that the name and telephone number can be recorded by speech, and on inquiring, when a name is talked to it, a telephone number is directly displayed and the name and address/orgnization can be said out. It can also store 30-minute digitalized sound segmentally and then reproduce it at predefined time. The "journey" and "event record" functions can be performed in speech mode.

Description

Speech input memorandum
The invention belongs to " personal digital assistant " (Personal Digital Assistant), both usually said electronic notebook is called for short " PDA ".
At present, the PDA product can be divided into two big classes:
With the low-grade PDA headed by homemade " Wenquxing " and Hong Kong " electricity is translated logical ", be characterized in keyboard input, adopt encoding schemes such as phonetic or five, convert the GB Chinese character to, store or inquire about.Be characterized in simple in structure, memory consumption is few, and CPU requires low.Thereby price is very low.But because keyboard is little, operation inconvenience.Add that the viewing area is little, the candidate of phonetic is many, and the phonetic input is had any problem.And many people can five or spelling input method, can't use.
Also have with the Taiwan instant translator, translate logically well, the high-grade PDA of the artificial representative of Hong Kong name adopts handwriting input, a lot of specific functions is arranged, as infrared interface, digital recording, beeper reception etc.Because carry out handwriting input, these products all are furnished with bigger LCD, pressure type touch-screen, the RAM that 128K is above, and 1 to 4,000,000 ROM.Also very high to the requirement of CPU simultaneously, processing power all meets or exceeds the level of 286-16.Therefore their price is all very high.The input method of handwriting recognition is more humane, but CUP processing power and storage space is required very high, and the system requirements of for example intelligent pen are: Pentium is more than 75,8,000,000 internal memories.Because the restriction of PDA processing power, the process of writing are still not too convenient, require carefully and neatly to write, and the discrimination of its handwriting input, adaptive faculty and speed, all also far short of what is expected than the identification software of PC version.A lot of people feel and are inconvenient.
The objective of the invention is to designed PDA and have following characteristics:
1. when " name " in the business card function imported, read name, the feature with sound identification module analysis and extraction voice stores in the cog region of storer.Simultaneously with compression module with compress speech, store in the recording zone of storer.
2. when retrieval, extract the sound characteristic of the name of reading with sound identification module, and with storer in the feature comparison, find corresponding clauses and subclauses.
3. input digit and English adopt the little vocabulary speech recognition of specific people, compare with the eigenwert of extracting in advance with the eigenwert of sound identification module with the voice of institute's reading word, letter, identify corresponding digital or letter.
4. when consulting content, with the broadcast of contracing of the voice decompress(ion) of name, address in the storer recording zone.But not common demonstration Chinese character.
5. when consulting content, telephone number is except that character display, and also available massage voice reading comes out.Purpose of the present invention realizes in the following manner:
Shown in figure one, microphone is connected with amplifilter; Amplifilter is connected with high-precision a/d converter; A/D converter is connected with speech recognition device and compress speech element simultaneously; Speech recognition device links to each other with main control microprocessor by data bus and control signal wire.Speech compressor links to each other with main control microprocessor by data bus and control signal wire.Microprocessor is connected with storer.
The effect of this circuit is with the simulating signal of phonetic entry, is converted to digital code stream, and the characteristic extracting module of speech recognition device is extracted phonetic feature from speech data, and passes to microprocessor by data bus.Speech compressor is also passed to microprocessor by data bus after speech data is compressed.After processor obtains speech data and phonetic feature, they are deposited in respectively in the zones of different of storer use for identification and playback.
Microprocessor also is connected with the decompression element, and the decompression element is connected with D/A converter, and D/A converter is connected with amplifier, and amplifier is connected with loudspeaker.
The effect of this circuit is with the reduction of the speech data after the compression and plays.
Microprocessor also links to each other with the identification module of speech recognition device by data bus and control signal wire.Speech recognition device directly links to each other with storer again.
The effect of this circuit is sound identification module, the eigenwert in the speech recognition device usefulness storer and this value comparison that speech recognition device passed to phonetic feature by microprocessor.And the result passed to microprocessor.
Outward appearance of the present invention is as shown in Figure 2: this notepad can be held in the palm, one-handed performance.And microphone is housed, acknowledgement key is set, cancel key, knob, display screen and loudspeaker.
The present invention compares with existing PDA, has the following advantages:
Voice memo does not originally need speech conversion is become literal, and is low to system requirements, do not miss the knowledge problem.Ten numerals adopt the speech recognition input, and because of vocabulary is little, discrimination is very high, and speed is fast.Adopt the method for specific people's study, do not have the restriction of accent, language.Therefore phonetic entry is the input method of present most convenient.All inputs and inquiry, reading process all realize that by voice all operations all has voice suggestion, and blind person or the people who has defective vision also can easily use, and technology maturation, the low-grade PDA of close prices.
Description of drawings:
Fig. 1 is the speech input memorandum system diagram
Fig. 2 is the speech input memorandum outside drawing:
1-circuit output plughole; The 2-acknowledgement key; The 3-cancel key; The 4-loudspeaker; 5-
Display screen; The 6-knob; The 7-microphone.
Fig. 3 is the speech retrieval process flow diagram
Fig. 4 is the phonetic entry process flow diagram
Fig. 5 is the phonetic entry process flow diagram of numeral
Fig. 6 is the voice output process flow diagram
Embodiment:
The present invention can use 2 joint 3A batteries, can use more than 30 days.Product appearance as shown in Figure 2, shell is streamlined, elegant in appearance, small and exquisite and easy to operate.Positive demonstration adopts 16 *80 lattice type LCDs, and 16 more eye-catching icons are arranged.
Knob and affirmation, cancel key are housed respectively in the left and right sides of product, and the upper end is circuit delivery outlet and microphone, and the lower end is provided with loudspeaker.
Adjusting knob upwards transfers increment to, is downwards decrement, can select digital 0-9 easily, alphabetical A-Z and each menu option.
Affirmation/cancel key: the outward appearance long strip type, respectively there is a switch at inner two ends up and down.This key is confirmed by top, is cancellation by the bottom.
The recorded content of this notepad:
Can write down 200 business cards or 30 minutes voice.After business card increased, record length can shorten.
Each business card has maximum 2 seconds name recording and personal information recording in maximum 5 seconds.Maximum 5 telephone numbers: phone 1, phone 2, fax, home telephone, BP.
Digital recording can be divided into multistage, amounts to maximum 30 minutes, all can set reproduction time for every section, then begins automatically to play, as appointment/stroke reminding.
Circuit output and microphone interface:
Use this interface, the user can connect earphone and the microphone of oneself.Also can computing machine be transcribed in recording and carry out speech recognition by circuit output.Function and operation:
The course of work of this notepad following (as shown in Figure 1):
Phonetic entry: the analog voice signal of microphone input, entering amplifilter becomes analog electrical signal, is converted to digital code stream by A/D converter again, passes to speech recognition device and speech compressor simultaneously.The characteristic extracting module of speech recognition device is extracted phonetic feature from speech data, and passes to microprocessor by data bus.Speech compressor is also passed to microprocessor by data bus after speech data is compressed.After processor obtains speech data and phonetic feature, they are deposited in respectively in the zones of different of storer.
During retrieval, sound identification module, the eigenwert in the speech recognition device usefulness storer and this value comparison of speech recognition device passed to phonetic feature by processor.And the result passed to microprocessor.
Voice output: as shown in Figure 6, processor takes out the compressed voice data from storer, pass to the voice decompressor, data stream after decompressor will decompress is exported to D/A converter, be converted to simulating signal, after amplifying by amplifilter, export by loudspeaker.
1. primary option menu:
At first enter master menu after the start, turning knob, cursor is in " retrieval business card ", " browse business card ", " consulting recording ", " input business card ", " recording ", " default ", switch between options such as " internal memory situation ", " time set ", " voice training ", " recreation ", dot matrix LCD goes up the icon that shows each function.Cursor is along with selecting button to move, and after stopping 1 second, the term pronunciation goes out this function title.Enter function by acknowledgement key, return previous menu by " cancellation ".
2. business card input
Enter " business card input " function, cursor moves between " phone input " in " name input ", " personal information ", " input of name phonetic ".
1) the name input process as shown in Figure 3.
Earlier cursor is moved on on " name ", one second attonity, then voice suggestion " please be read name ", pins " affirmation ", microprocesser initialization A/D converter, speech compressor and speech recognition device.Begin recording then, acknowledgement key unclamps the back to be finished, and surpasses 2 seconds, overtime warning.Repeat to record and then cover last content.The voice signal that microphone is gathered through amplification, filtering, carries out the A/D conversion with A/D converter to voice signal.Speech recognition device extracts phonetic feature, and deposits characteristic in storer by control chip.Simultaneously, speech compressor deposits data compression in storer by control chip.
2) personal information input: earlier cursor is moved on on " personal information ", one second attonity, then voice suggestion " personal information " is pinned " affirmation " and is begun recording, and finish the back of loosing one's grip, and surpasses 5 seconds, overtime warning.Repeat to record and to cover last content.In the Recording Process,, and extract phonetic feature without recognizer only with compression storage after the voice digitization.
3) telephone number input:
Available knob or phonetic entry, the phonetic entry process as shown in Figure 5.
With selecting button that cursor is moved on to: any among phone 1, phone 2, home telephone, fax, the BP enters by " affirmation ".Rotating the selection button can be at phonetic entry, digital 0-9, and the space is returned to delete between end and selected.During display digit 0-9, import and enter next numeral by " affirmation ", choosing " is returned and is deleted " and then deletes a numeral, pins back and deletes then full scale clearance in 2 seconds.Wherein, phonetic entry, return to delete and be icon, when choosing phonetic entry, pin " affirmation ", read a numeral, unclamp back identification, microprocessor initialization A/D of elder generation and recognition component, the voice signal that microphone is gathered through amplification, filtering, carries out the A/D conversion with A/D converter to voice signal.Identification module extracts phonetic feature, and with storer in the phonetic feature of 0-9 compare, and the result is passed to microprocessor.The microprocessor control display screen shows corresponding digital 0-9, enters the next one then automatically.Then store and get back to the upper level option by " cancellation " key.
4) by " cancellation " key, whether prompting stores, and the user returns master menu after selecting.
3. business card retrieval:
Browse and the speech retrieval dual mode.
1) speech retrieval:
Retrieving as shown in Figure 4.
Select the business card retrieval under the master menu, begin recording with LCD and voice suggestion user.The user pins " affirmation " key and reads name simultaneously and unclamp then.Microprocessor control A/D, speech recognition device are started working.The voice signal that microphone is gathered through amplification, filtering, carries out the A/D conversion with A/D converter to voice signal.Identification module extracts phonetic feature, and with storer in the phonetic feature of the name stored compare, and the result is passed to microprocessor.Microprocessor shows corresponding recorded information according to the result.Call over name, address, phone simultaneously.Stop and entering editing mode by " affirmation ".When the close candidate of a plurality of pronunciations was arranged, if rotary knob then can switch between each candidate's business card, display showed sequence number and first telephone number.Stopped 1 second, and then called over name, personal information and phone.Stop and entering editing mode by " affirmation "." cancellation " returns master menu.
After entering, cursor is moved between name, remarks, phone and " deletion business card " icon, when moving on to name, address, stop to read after one second with knob.
Enter respective selection by " affirmation ", make amendment, " cancellation " returns upper level.
4. digital recording
1) recording
Select sound-recording function, pin acknowledgement key, begin recording.
Whether the back inquiry of recording regularly broadcasts, when selecting "Yes" then to continue the requirement setting
Between.Then can broadcast.
2) retrieval
Can only check in proper order.Show record length when checking, length and playback duration are set.
Began after stopping 1 second to play.By after " affirmations " enter, can revise regularly broadcast
Time or deletion.
5. system's setting
Identification parameter, volume and voice suggestion can be set opens or closes.
Each operation all has Chinese character or figure and voice suggestion, and wherein voice suggestion can be closed.
6. the study of 0-9 numeral
Each numeral reads twice, to voice gather and digitizing after, extract feature and be kept in the storer with recognizer.Behind the learning success with regard to available phonetic entry numeral.

Claims (2)

1. one kind by microphone, the speech input memorandum that amplifilter, A/D converter, D/A converter, speech recognition device, speech compressor, voice decompressor, data bus, control signal wire, main control microprocessor, amplifier, loudspeaker, storer, button, knob, display screen constitute, it is characterized in that: microphone is connected with amplifilter; Amplifilter is connected with high-precision a/d converter; A/D converter is connected with speech recognition device and compress speech element simultaneously; Speech recognition device links to each other with main control microprocessor by data bus and control signal wire, and speech compressor links to each other with main control microprocessor by data bus and control signal wire; Microprocessor is connected with storer; Microprocessor also is connected with voice decompression element, and voice decompression element is connected with D/A converter, and D/A converter is connected with amplifier, and amplifier is connected with loudspeaker; Microprocessor also links to each other with the identification module of speech recognition device by data bus and control signal wire, and speech recognition device directly links to each other with storer again.
2. according to the described speech input memorandum of claim 1, it is characterized in that the shell of notepad is equipped with microphone, acknowledgement key is set, cancel key, knob, display screen and loudspeaker.
CN 98117693 1998-09-09 1998-09-09 Speech input memorandum Expired - Fee Related CN1101577C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 98117693 CN1101577C (en) 1998-09-09 1998-09-09 Speech input memorandum

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 98117693 CN1101577C (en) 1998-09-09 1998-09-09 Speech input memorandum

Publications (2)

Publication Number Publication Date
CN1247347A true CN1247347A (en) 2000-03-15
CN1101577C CN1101577C (en) 2003-02-12

Family

ID=5225645

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 98117693 Expired - Fee Related CN1101577C (en) 1998-09-09 1998-09-09 Speech input memorandum

Country Status (1)

Country Link
CN (1) CN1101577C (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102207845A (en) * 2011-05-25 2011-10-05 盛乐信息技术(上海)有限公司 Device and method for managing schedule in voice manner
US8219146B2 (en) 2009-11-06 2012-07-10 Sony Corporation Audio-only user interface mobile phone pairing
US9031614B2 (en) 2007-09-27 2015-05-12 Unify, Inc. Method and apparatus for secure electronic business card exchange
CN108682332A (en) * 2018-05-21 2018-10-19 南通市第人民医院 A kind of special patient's identification card of hemodialysis and deck

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9031614B2 (en) 2007-09-27 2015-05-12 Unify, Inc. Method and apparatus for secure electronic business card exchange
US8219146B2 (en) 2009-11-06 2012-07-10 Sony Corporation Audio-only user interface mobile phone pairing
CN102207845A (en) * 2011-05-25 2011-10-05 盛乐信息技术(上海)有限公司 Device and method for managing schedule in voice manner
CN108682332A (en) * 2018-05-21 2018-10-19 南通市第人民医院 A kind of special patient's identification card of hemodialysis and deck

Also Published As

Publication number Publication date
CN1101577C (en) 2003-02-12

Similar Documents

Publication Publication Date Title
US6864809B2 (en) Korean language predictive mechanism for text entry by a user
KR101586890B1 (en) Input processing method and apparatus
CN1301026C (en) Mobile terminal voice telephone directory system
CN102033928B (en) Handwritten original handwriting recording and querying system and electronic device
CN104468959A (en) Method, device and mobile terminal displaying image in communication process of mobile terminal
CN1581898A (en) Managing an address book in portable terminal having a camera
CN1333501A (en) Dynamic Chinese speech synthesizing method
CN1101577C (en) Speech input memorandum
CN1371043A (en) Numeral operation system
CN1224889C (en) Chinese character input method and system by using said method
CN201251767Y (en) Intelligent electronic dictionary
CN114913745B (en) Braille reader, braille coding method and system
CN101035154A (en) Blind portable electronic address list
CN1979468A (en) Sound-controlled document inquiring system and method thereof
CN1755589A (en) Letter inputting method and apparatus based on press-key and speech recognition
CN1734467A (en) Mobile phone capable of inputting text by taking pictures and inputting method thereof
CN2424513Y (en) Multifunctional speech identifying notebook and large capacity digital recording integrated machine
CN2845026Y (en) Voice controlled file searching system
CN1218212A (en) Method and apparatus for operation by hand written alphabets and symbols
CN1198198C (en) Character input method and unit
CN1384426A (en) Dian code Chinese character input method for computer
CN1472626A (en) Intelligent embedded character inputting method and device
CN1897606A (en) Method for realizing written dial
CN1501223A (en) Press key input system and method
CN1232225A (en) Electronic memo and sound-controlled Chinese character identification method

Legal Events

Date Code Title Description
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C06 Publication
PB01 Publication
C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: BEIJING HANWANG SCIENCE AND TECHNOLOGY CO., LTD.

Free format text: FORMER OWNER: ZHONGZI HANGWANG SCIENCE AND TECHNOLOGY CO., BEIJING

Effective date: 20030418

C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20030418

Patentee after: Hanwang Science and Technology Co., Ltd., Beijing

Patentee before: Zhongzi Hangwang Science and Technology Co., Beijing

C19 Lapse of patent right due to non-payment of the annual fee
CF01 Termination of patent right due to non-payment of annual fee