US20030072013A1 - Document creation through embedded speech recognition - Google Patents

Document creation through embedded speech recognition Download PDF

Info

Publication number
US20030072013A1
US20030072013A1 US09/976,534 US97653401A US2003072013A1 US 20030072013 A1 US20030072013 A1 US 20030072013A1 US 97653401 A US97653401 A US 97653401A US 2003072013 A1 US2003072013 A1 US 2003072013A1
Authority
US
United States
Prior art keywords
text
audio data
data
selection criteria
electronic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
US09/976,534
Other versions
US7006968B2 (en
Inventor
Corey Norris
Steve Wiley
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hewlett Packard Development Co LP
Original Assignee
Hewlett Packard Co
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hewlett Packard Co filed Critical Hewlett Packard Co
Priority to US09/976,534 priority Critical patent/US7006968B2/en
Assigned to HEWLETT-PACKARD COMPANY reassignment HEWLETT-PACKARD COMPANY ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: NORRIS, COREY J., WILEY, STEVE R.
Publication of US20030072013A1 publication Critical patent/US20030072013A1/en
Assigned to HEWLETT-PACKARD DEVELOPMENT COMPANY L.P. reassignment HEWLETT-PACKARD DEVELOPMENT COMPANY L.P. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: HEWLETT-PACKARD COMPANY
Application granted granted Critical
Publication of US7006968B2 publication Critical patent/US7006968B2/en
Adjusted expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems

Definitions

  • This invention relates in general to voice recognition technology and, more particularly, to voice recognition technology in a printing device.
  • Speech recognition software has been developed for recognizing words of human speech.
  • words are recognized and reproduced as electronic text in a computer word processor.
  • the electronic text is then available for editing, manipulation, and printing using the functions of the word processor.
  • this application of speech recognition software requires a user to manually intervene in order to obtain a printed record of the electronic text.
  • audio data is captured, then filtered to extract text data and print commands.
  • the text data is converted to electronic text.
  • the electronic text is accumulated in a text buffer. Audio data is captured and filtered until a print command is extracted from the audio data. All of the extracted text data is converted to electronic text, which is accumulated in the text buffer.
  • the electronic text is translated into a rendered document. The electronic text is either translated alone or with a form template. The rendered document is printed and the text buffer is purged.
  • the audio data is captured by first sensing the audio data.
  • An analog audio data signal is generated in response to the sensed audio data.
  • the analog audio data signal is digitizing into captured audio data.
  • the captured audio data is filtered by first comparing the captured audio data with text data selection criteria and print command selection criteria.
  • the captured audio data meeting the print command selection criteria is extracted as a print command.
  • the captured audio data meeting the text data selection criteria is extracted as text data,
  • the text data is converted to electronic text by inputting the text data into a speech recognition module and receiving electronic text from the speech recognition module.
  • FIG. 1 is a block diagram representing one embodiment of the system of the present invention for printing text.
  • FIG. 2 is a block diagram representing an alternate embodiment of the system of the present invention for printing text.
  • FIG. 3 is a flow chart illustrating one embodiment of the method of the present invention for printing text.
  • System 2 includes a remote audio collector 4 and a printer 6 .
  • Remote audio collector 4 includes an audio digitizer 8 , an audio filter 10 , a speech recognition module 12 , a text buffer 14 , a rendering subsystem 16 , a processor 18 , a storage system 20 , and a communication interface 22 . Additionally, remote audio collector 4 optionally includes a form template 24 and a buffer cleaner 26 .
  • Audio digitizer 8 is any combination of hardware and executable code configured to capture audio data.
  • Executable code is any code capable of execution by a processor or processing system. Examples of executable code include software, firmware, and hardwired code.
  • audio digitizer 8 includes a microphone 28 and an analog to digital (A/D) converter 30 .
  • Microphone 28 is any electronic device configured to sense audio data and generate an analog audio data signal in response to the sensed audio data.
  • A/D converter 30 is any combination of hardware and executable code configured to digitize the analog audio data signal into captured audio data.
  • A/D converter 30 is a sound card for a computer system.
  • Audio filter 10 is any combination of hardware and executable code configured to filter captured audio data to extract text data and print commands.
  • audio filter 10 includes a selection criteria database 32 , a comparator 34 , a print command extractor 36 , and a text data extractor 38 .
  • Selection criteria database 32 is any form of database configured to hold print command selection criteria 40 and text data selection criteria 42 .
  • Print command selection criteria 40 is any criteria configured to select a print command from the audio data. Examples of print command selection criteria 40 include amplitude, frequency, and duration criteria indicative of a print command. The amplitude, frequency, and duration criteria may be user configurable to suit a user.
  • Text data selection criteria 42 is any criteria for selecting a text data from the audio data.
  • Examples of text data selection criteria 42 include amplitude, frequency, and duration criteria indicative of a text data.
  • the amplitude, frequency, and duration criteria may be user configurable to suit a user.
  • Comparator 34 is any combination of hardware and executable code configured to compare captured audio data with print command selection criteria 40 and text data selection criteria 42 .
  • comparator 34 is executable code performed by processor 18 to compare captured audio data with print command selection criteria 40 and text data selection criteria 42 .
  • Print command extractor 36 is any combination of hardware and executable code configured to extract, as a print command, captured audio data meeting the print command selection criteria 40 .
  • print command extractor 36 is executable code performed by processor 18 to extract, as a print command, captured audio data meeting the print command selection criteria 40 .
  • Text data extractor 38 is any combination of hardware and executable code configured to extract, as text data, captured audio data meeting the text data selection criteria 42 .
  • text data extractor 38 is executable code performed by processor 18 to extract, as text data, captured audio data meeting the text data selection criteria 42 .
  • Speech recognition module 12 is any combination of hardware and executable code configured to convert text data to electronic text.
  • speech recognition module 12 is executable code performed by processor 18 to convert text data to electronic text.
  • Text buffer 14 is any combination of hardware and executable code configured to accumulate electronic text.
  • text buffer 14 is a portion of storage system 20 designated to accumulate the electronic text.
  • Rendering subsystem 16 is any combination of hardware and executable code configured to translate electronic text accumulated in text buffer 14 into a rendered document.
  • rendering subsystem 16 is executable code performed by processor 18 to translate electronic text accumulated in text buffer 14 into a rendered document.
  • Processor 18 is any device or system configured to process executable code. Processor 18 controls remote audio collector 4 .
  • Storage system 20 is any system configured to store data or executable code.
  • Storage system 20 may also be a program storage system tangibly embodying a program, applet, or instructions executable by processor 18 for performing the method steps of the present invention executable by processor 18 .
  • Storage system 20 may be any type of storage media such as magnetic, optical, or electronic storage media.
  • Storage system 20 is illustrated in FIG. 1 as a single device. Alternatively, storage system 20 may include a plurality of devices. Furthermore, each device of storage system 20 may be embodied in a different media type. For example, one device of storage system 20 may be a magnetic storage media while another device of storage system 20 is an electronic storage media.
  • Communication interface 22 is any device or system configured to allow remote audio collector 4 to communicate with printer 6 .
  • Examples of communication interface 22 include a direct connection interface and a network interface.
  • Form template 24 is any template saved in any suitable fashion for use by rendering subsystem 16 with the text data.
  • form template 24 includes empty fields into which text data is inserted before rendering the text data with form template 24 .
  • Buffer cleaner 26 is any combination of hardware and executable code configured to purge text buffer 14 .
  • buffer cleaner 26 is executable code performed by processor 18 to purging text buffer 14 after the rendered document is printed.
  • Printer 6 includes a communication interface 44 , an imaging subsystem 46 , and a processor 48 .
  • Imaging subsystem 46 is any combination of hardware and executable code configured to print a rendered document.
  • Processor 48 is any device or system configured to process executable code. Processor 48 controls printer 6 .
  • Communication interface 44 is any device or system configured to allow printer 6 to communicate with remote audio collector 4 .
  • Examples of communication interface 44 include a direct connection interface and a network interface.
  • Communication interface 44 is of a type suitable for use with communication interface 22 of remote audio collector 4 .
  • FIG. 2 Illustrated in FIG. 2 is an alternate embodiment of system 2 , wherein components of the remote audio collector illustrated in FIG. 1 are embedded in printer 6 .
  • Components in FIG. 2 identically numbered to components in FIG. 1 are identical in function to the components in FIG. 1, but may differ in actual embodiment.
  • Printer 2 includes audio digitizer 8 , audio filter 10 , speech recognition module 12 , text buffer 14 , rendering subsystem 16 , storage system 20 , imaging subsystem 46 , and processor 48 . Additionally, printer 6 optionally includes form template 24 and buffer cleaner 26 . Audio digitizer 8 , audio filter 10 , speech recognition module 12 , text buffer 14 , rendering subsystem 16 , storage system 20 , imaging subsystem 46 , processor 48 , form template 24 , and buffer cleaner 26 are as previously described.
  • FIG. 3 is a flow chart representing steps of one embodiment of the present invention. Although the steps represented in FIG. 3 are presented in a specific order, the present invention encompasses variations in the order of steps. Furthermore, additional steps may be executed between the steps illustrated in FIG. 3 without departing from the scope of the present invention.
  • Audio data is captured 50 .
  • the audio data is captured by sensing the audio data and generating an analog audio data signal in response to the sensed audio data.
  • the analog audio data signal is digitized into captured audio data.
  • the captured audio data is filtered 52 to extract text data and print commands.
  • captured audio data is filtered 52 by comparing the captured audio data with text data selection criteria and print command selection criteria.
  • the captured audio data meeting the print command selection criteria is extracted as a print command.
  • the captured audio data meeting the text data selection criteria is extracted as text data.
  • the text data is converted 54 to electronic text.
  • the text data is extracted by inputting the text data into speech recognition module 12 and receiving electronic text from speech recognition module 12 .
  • the electronic text is accumulated 56 in text buffer 14 .
  • the electronic text is accumulated 56 in text buffer 14 by appending the electronic text to contents of text buffer 14 .
  • the process is repeated until a print command is extracted from the audio data.
  • the electronic text accumulated in the text buffer is translated 58 into a rendered document.
  • Translating 58 the electronic text into a rendered document includes translating the electronic text into a form ready for printing.
  • the electronic text is combined with form template 24 before translating 58 .
  • the electronic text with form template 24 are translated into the rendered document.
  • the rendered document is printed 60 .
  • Printing 60 the rendered document includes reproducing the rendered document onto print media.
  • the text buffer is purged 62 .

Abstract

Audio data is captured, then filtered to extract text data and print commands. The audio data is captured by sensing the audio data, generating an analog audio data signal in response to the sensed audio data, and digitizing the analog audio data signal into captured audio data. The text data is converted to electronic text. The electronic text is accumulated in a text buffer. Audio data is captured and filtered to extract text data and print comments until a print command is extracted from the audio data. When a print command is extracted from the audio data, the accumulated electronic text is translated into a rendered document. The electronic text is either translated alone or with a form template. The rendered document is printed and the text buffer is purged.

Description

    FIELD OF THE INVENTION
  • This invention relates in general to voice recognition technology and, more particularly, to voice recognition technology in a printing device. [0001]
  • BACKGROUND OF THE INVENTION
  • Often there is a need to transcribe spoken words to printed text. Traditionally, transcription has been accomplished by transcriptionists who listen to spoken words and transcribe the spoken words to readable documents. The spoken words are either spoken concurrently with the transcription or previously recorded and played back for the transcriptionist. Manual transcription by a transcriptionist is time consuming, costly, and error prone. [0002]
  • Speech recognition software has been developed for recognizing words of human speech. In one conventional application of the speech recognition software, words are recognized and reproduced as electronic text in a computer word processor. The electronic text is then available for editing, manipulation, and printing using the functions of the word processor. While convenient, this application of speech recognition software requires a user to manually intervene in order to obtain a printed record of the electronic text. [0003]
  • SUMMARY OF THE INVENTION
  • According to principles of the present invention, audio data is captured, then filtered to extract text data and print commands. The text data is converted to electronic text. The electronic text is accumulated in a text buffer. Audio data is captured and filtered until a print command is extracted from the audio data. All of the extracted text data is converted to electronic text, which is accumulated in the text buffer. When a print command is extracted from the audio data, the electronic text is translated into a rendered document. The electronic text is either translated alone or with a form template. The rendered document is printed and the text buffer is purged. [0004]
  • According to further principles of the present invention, the audio data is captured by first sensing the audio data. An analog audio data signal is generated in response to the sensed audio data. The analog audio data signal is digitizing into captured audio data. [0005]
  • According to further principles of the present invention, the captured audio data is filtered by first comparing the captured audio data with text data selection criteria and print command selection criteria. The captured audio data meeting the print command selection criteria is extracted as a print command. The captured audio data meeting the text data selection criteria is extracted as text data, [0006]
  • According to further principles of the present invention, the text data is converted to electronic text by inputting the text data into a speech recognition module and receiving electronic text from the speech recognition module.[0007]
  • DESCRIPTION OF THE DRAWINGS
  • FIG. 1 is a block diagram representing one embodiment of the system of the present invention for printing text. [0008]
  • FIG. 2 is a block diagram representing an alternate embodiment of the system of the present invention for printing text. [0009]
  • FIG. 3 is a flow chart illustrating one embodiment of the method of the present invention for printing text.[0010]
  • DETAILED DESCRIPTION OF THE INVENTION
  • Illustrated in FIG. 1 is a system [0011] 2 for printing text. System 2 includes a remote audio collector 4 and a printer 6. Remote audio collector 4 includes an audio digitizer 8, an audio filter 10, a speech recognition module 12, a text buffer 14, a rendering subsystem 16, a processor 18, a storage system 20, and a communication interface 22. Additionally, remote audio collector 4 optionally includes a form template 24 and a buffer cleaner 26.
  • [0012] Audio digitizer 8 is any combination of hardware and executable code configured to capture audio data. Executable code is any code capable of execution by a processor or processing system. Examples of executable code include software, firmware, and hardwired code.
  • In one embodiment, [0013] audio digitizer 8 includes a microphone 28 and an analog to digital (A/D) converter 30. Microphone 28 is any electronic device configured to sense audio data and generate an analog audio data signal in response to the sensed audio data. A/D converter 30 is any combination of hardware and executable code configured to digitize the analog audio data signal into captured audio data. In one embodiment, A/D converter 30 is a sound card for a computer system.
  • [0014] Audio filter 10 is any combination of hardware and executable code configured to filter captured audio data to extract text data and print commands. In one embodiment, audio filter 10 includes a selection criteria database 32, a comparator 34, a print command extractor 36, and a text data extractor 38.
  • [0015] Selection criteria database 32 is any form of database configured to hold print command selection criteria 40 and text data selection criteria 42. Print command selection criteria 40 is any criteria configured to select a print command from the audio data. Examples of print command selection criteria 40 include amplitude, frequency, and duration criteria indicative of a print command. The amplitude, frequency, and duration criteria may be user configurable to suit a user.
  • Text [0016] data selection criteria 42 is any criteria for selecting a text data from the audio data. Examples of text data selection criteria 42 include amplitude, frequency, and duration criteria indicative of a text data. The amplitude, frequency, and duration criteria may be user configurable to suit a user.
  • [0017] Comparator 34 is any combination of hardware and executable code configured to compare captured audio data with print command selection criteria 40 and text data selection criteria 42. In one embodiment, comparator 34 is executable code performed by processor 18 to compare captured audio data with print command selection criteria 40 and text data selection criteria 42.
  • [0018] Print command extractor 36 is any combination of hardware and executable code configured to extract, as a print command, captured audio data meeting the print command selection criteria 40. In one embodiment, print command extractor 36 is executable code performed by processor 18 to extract, as a print command, captured audio data meeting the print command selection criteria 40.
  • [0019] Text data extractor 38 is any combination of hardware and executable code configured to extract, as text data, captured audio data meeting the text data selection criteria 42. In one embodiment, text data extractor 38 is executable code performed by processor 18 to extract, as text data, captured audio data meeting the text data selection criteria 42.
  • [0020] Speech recognition module 12 is any combination of hardware and executable code configured to convert text data to electronic text. In one embodiment, speech recognition module 12 is executable code performed by processor 18 to convert text data to electronic text.
  • [0021] Text buffer 14 is any combination of hardware and executable code configured to accumulate electronic text. In one embodiment, text buffer 14 is a portion of storage system 20 designated to accumulate the electronic text.
  • Rendering [0022] subsystem 16 is any combination of hardware and executable code configured to translate electronic text accumulated in text buffer 14 into a rendered document. In one embodiment, rendering subsystem 16 is executable code performed by processor 18 to translate electronic text accumulated in text buffer 14 into a rendered document.
  • [0023] Processor 18 is any device or system configured to process executable code. Processor 18 controls remote audio collector 4.
  • [0024] Storage system 20 is any system configured to store data or executable code. Storage system 20 may also be a program storage system tangibly embodying a program, applet, or instructions executable by processor 18 for performing the method steps of the present invention executable by processor 18. Storage system 20 may be any type of storage media such as magnetic, optical, or electronic storage media. Storage system 20 is illustrated in FIG. 1 as a single device. Alternatively, storage system 20 may include a plurality of devices. Furthermore, each device of storage system 20 may be embodied in a different media type. For example, one device of storage system 20 may be a magnetic storage media while another device of storage system 20 is an electronic storage media.
  • [0025] Communication interface 22 is any device or system configured to allow remote audio collector 4 to communicate with printer 6. Examples of communication interface 22 include a direct connection interface and a network interface.
  • [0026] Form template 24 is any template saved in any suitable fashion for use by rendering subsystem 16 with the text data. In one embodiment, form template 24 includes empty fields into which text data is inserted before rendering the text data with form template 24.
  • [0027] Buffer cleaner 26 is any combination of hardware and executable code configured to purge text buffer 14. In one embodiment, buffer cleaner 26 is executable code performed by processor 18 to purging text buffer 14 after the rendered document is printed.
  • [0028] Printer 6 includes a communication interface 44, an imaging subsystem 46, and a processor 48. Imaging subsystem 46 is any combination of hardware and executable code configured to print a rendered document. Processor 48 is any device or system configured to process executable code. Processor 48 controls printer 6.
  • [0029] Communication interface 44 is any device or system configured to allow printer 6 to communicate with remote audio collector 4. Examples of communication interface 44 include a direct connection interface and a network interface. Communication interface 44 is of a type suitable for use with communication interface 22 of remote audio collector 4.
  • Illustrated in FIG. 2 is an alternate embodiment of system [0030] 2, wherein components of the remote audio collector illustrated in FIG. 1 are embedded in printer 6. Components in FIG. 2 identically numbered to components in FIG. 1 are identical in function to the components in FIG. 1, but may differ in actual embodiment.
  • Printer [0031] 2 includes audio digitizer 8, audio filter 10, speech recognition module 12, text buffer 14, rendering subsystem 16, storage system 20, imaging subsystem 46, and processor 48. Additionally, printer 6 optionally includes form template 24 and buffer cleaner 26. Audio digitizer 8, audio filter 10, speech recognition module 12, text buffer 14, rendering subsystem 16, storage system 20, imaging subsystem 46, processor 48, form template 24, and buffer cleaner 26 are as previously described.
  • FIG. 3 is a flow chart representing steps of one embodiment of the present invention. Although the steps represented in FIG. 3 are presented in a specific order, the present invention encompasses variations in the order of steps. Furthermore, additional steps may be executed between the steps illustrated in FIG. 3 without departing from the scope of the present invention. [0032]
  • Audio data is captured [0033] 50. In one embodiment, the audio data is captured by sensing the audio data and generating an analog audio data signal in response to the sensed audio data. The analog audio data signal is digitized into captured audio data.
  • The captured audio data is filtered [0034] 52 to extract text data and print commands. In one embodiment, captured audio data is filtered 52 by comparing the captured audio data with text data selection criteria and print command selection criteria. The captured audio data meeting the print command selection criteria is extracted as a print command. The captured audio data meeting the text data selection criteria is extracted as text data.
  • If a print command is not extracted from the audio data and instead text data is extracted from the audio data, the text data is converted [0035] 54 to electronic text. In one embodiment, the text data is extracted by inputting the text data into speech recognition module 12 and receiving electronic text from speech recognition module 12.
  • The electronic text is accumulated [0036] 56 in text buffer 14. In one embodiment, the electronic text is accumulated 56 in text buffer 14 by appending the electronic text to contents of text buffer 14.
  • The process is repeated until a print command is extracted from the audio data. The electronic text accumulated in the text buffer is translated [0037] 58 into a rendered document. Translating 58 the electronic text into a rendered document includes translating the electronic text into a form ready for printing. In one embodiment, the electronic text is combined with form template 24 before translating 58. The electronic text with form template 24 are translated into the rendered document.
  • The rendered document is printed [0038] 60. Printing 60 the rendered document includes reproducing the rendered document onto print media. In one embodiment, after the rendered document is printed 60, the text buffer is purged 62.
  • The foregoing description is only illustrative of the invention. Various alternatives and modifications can be devised by those skilled in the art without departing from the invention. Accordingly, the present invention embraces all such alternatives, modifications, and variances that fall within the scope of the appended claims. [0039]

Claims (20)

What is claimed is:
1. A method for printing text, the method comprising:
(a) capturing audio data;
(b) filtering the captured audio data to extract text data and print commands;
(c) converting the text data to electronic text;
(d) accumulating the electronic text in a text buffer;
(e) repeating steps (a) through (d) until a print command is extracted from the audio data;
(f) translating the electronic text accumulated in the text buffer into a rendered document; and,
(g) printing the rendered document.
2. The method of claim 1 wherein capturing the audio data includes:
(a) sensing the audio data;
(b) generating an analog audio data signal in response to the sensed audio data; and,
(c) digitizing the analog audio data signal into captured audio data.
3. The method of claim 1 wherein filtering the captured audio data includes:
(a) comparing the captured audio data with text data selection criteria and print command selection criteria;
(b) extracting, as a print command, the captured audio data meeting the print command selection criteria; and,
(c) extracting, as text data, the captured audio data meeting the text data selection criteria.
4. The method of claim 1 wherein converting the text data to electronic text includes:
(a) inputting the text data into a speech recognition module; and,
(b) receiving electronic text from the speech recognition module.
5. The method of claim 1 wherein accumulating the electronic text includes appending the electronic text to contents of the text buffer.
6. The method of claim 1 wherein translating the electronic text includes:
(a) combining the electronic text accumulated in the text buffer with a form template; and,
(b) translating the electronic text with the form template into the rendered document.
7. The method of claim 1 wherein translating the electronic text includes translating the electronic text into a form ready for printing.
8 The method of claim 1 wherein printing the rendered document includes reproducing the rendered document onto print media.
9 The method of claim 1 further including purging the text buffer after printing the rendered document.
10. A system for printing text, the system comprising:
(a) an audio digitizer configured to capture audio data;
(b) an audio filter configured to filter captured audio data to extract text data and print commands;
(c) a speech recognition module configured to convert text data to electronic text;
(d) a text buffer configured to accumulate the electronic text;
(e) a rendering subsystem configured to translate the electronic text accumulated in the text buffer into a rendered document; and,
(f) an imaging subsystem configured to print the rendered document.
11. The system of claim 10 wherein the audio digitizer includes:
(a) a microphone configured to sense audio data and generate an analog audio data signal in response to the sensed audio data; and,
(b) an analog to digital converter configured to digitize the analog audio data signal into captured audio data.
12. The system of claim 10 wherein the audio filter includes:
(a) a selection criteria database including text data selection criteria and print command selection criteria;
(b) a comparator configured to compare captured audio data with the text data selection criteria and the print command selection criteria;
(c) a print command extractor configured to extract, as a print command, captured audio data meeting the print command selection criteria; and,
(d) a text data extractor configured to extract, as text data, captured audio data meeting the text data selection criteria.
13. The system of claim 10 further including a buffer cleaner configured to purge the text buffer after printing the rendered document.
14. A program storage system readable by a computer, tangibly embodying a program, applet, or instructions executable by the computer to perform method steps for printing text, the method steps comprising:
(a) capturing audio data;
(b) filtering the captured audio data to extract text data and print commands;
(c) converting the text data to electronic text;
(d) accumulating the electronic text in a text buffer;
(e) repeating steps (a) through (d) until a print command is extracted from the audio data;
(f) translating the electronic text accumulated in the text buffer into a rendered document; and,
(g) printing the rendered document.
15. The program storage system of claim 14 wherein the method step of capturing the audio data includes:
(a) sensing the audio data;
(b) generating an analog audio data signal in response to the sensed audio data; and,
(c) digitizing the analog audio data signal into captured audio data.
16. The program storage system of claim 14 wherein the method step of filtering the captured audio data includes:
(a) comparing the captured audio data with text data selection criteria and print command selection criteria;
(b) extracting, as a print command, the captured audio data meeting the print command selection criteria; and,
(c) extracting, as text data, the captured audio data meeting the text data selection criteria.
17. The program storage system of claim 14 wherein the method step of converting the text data to electronic text includes:
(a) inputting the text data into a speech recognition module; and,
(b) receiving electronic text from the speech recognition module.
18. The program storage system of claim 14 wherein the method step of translating the electronic text includes:
(a) combining the electronic text accumulated in the text buffer with a form template; and,
(b) translating the electronic text with the form template into the rendered document.
19. The program storage system of claim 14 wherein the method step of printing the rendered document includes reproducing the rendered document onto print media.
20. The program storage system of claim 14 wherein the method steps further include purging the text buffer after printing the rendered document.
US09/976,534 2001-10-11 2001-10-11 Document creation through embedded speech recognition Expired - Lifetime US7006968B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US09/976,534 US7006968B2 (en) 2001-10-11 2001-10-11 Document creation through embedded speech recognition

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US09/976,534 US7006968B2 (en) 2001-10-11 2001-10-11 Document creation through embedded speech recognition

Publications (2)

Publication Number Publication Date
US20030072013A1 true US20030072013A1 (en) 2003-04-17
US7006968B2 US7006968B2 (en) 2006-02-28

Family

ID=25524196

Family Applications (1)

Application Number Title Priority Date Filing Date
US09/976,534 Expired - Lifetime US7006968B2 (en) 2001-10-11 2001-10-11 Document creation through embedded speech recognition

Country Status (1)

Country Link
US (1) US7006968B2 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040111265A1 (en) * 2002-12-06 2004-06-10 Forbes Joseph S Method and system for sequential insertion of speech recognition results to facilitate deferred transcription services
US20050012955A1 (en) * 2003-07-14 2005-01-20 Edwards Mark Joseph Method and apparatus for recording sound information and playing sound information back using an all-in-one printer
US20050096910A1 (en) * 2002-12-06 2005-05-05 Watson Kirk L. Formed document templates and related methods and systems for automated sequential insertion of speech recognition results
US7774694B2 (en) 2002-12-06 2010-08-10 3M Innovation Properties Company Method and system for server-based sequential insertion processing of speech recognition results
WO2019212459A1 (en) 2018-04-30 2019-11-07 Hewlett-Packard Development Company, L.P Navigation printing with voice tagging

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050272415A1 (en) * 2002-10-01 2005-12-08 Mcconnell Christopher F System and method for wireless audio communication with a computer
US20050180464A1 (en) * 2002-10-01 2005-08-18 Adondo Corporation Audio communication with a computer
US8868670B2 (en) * 2004-04-27 2014-10-21 Avaya Inc. Method and apparatus for summarizing one or more text messages using indicative summaries
US8462367B2 (en) * 2005-02-10 2013-06-11 Hewlett-Packard Development Company, L.P. Printer voice portal
US8335691B2 (en) 2008-12-31 2012-12-18 International Business Machines Corporation Attaching audio generated scripts to graphical representations of applications

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5051924A (en) * 1988-03-31 1991-09-24 Bergeron Larry E Method and apparatus for the generation of reports
US5168548A (en) * 1990-05-17 1992-12-01 Kurzweil Applied Intelligence, Inc. Integrated voice controlled report generating and communicating system
US5231670A (en) * 1987-06-01 1993-07-27 Kurzweil Applied Intelligence, Inc. Voice controlled system and method for generating text from a voice controlled input
US5950167A (en) * 1998-01-26 1999-09-07 Lucent Technologies Inc. Screen-less remote voice or tone-controlled computer program operations via telephone set
US5995936A (en) * 1997-02-04 1999-11-30 Brais; Louis Report generation system and method for capturing prose, audio, and video by voice command and automatically linking sound and image to formatted text locations
US6006183A (en) * 1997-12-16 1999-12-21 International Business Machines Corp. Speech recognition confidence level display
US6243682B1 (en) * 1998-11-09 2001-06-05 Pitney Bowes Inc. Universal access photocopier
US6332120B1 (en) * 1999-04-20 2001-12-18 Solana Technology Development Corporation Broadcast speech recognition system for keyword monitoring
US6839669B1 (en) * 1998-11-05 2005-01-04 Scansoft, Inc. Performing actions identified in recognized speech

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5231670A (en) * 1987-06-01 1993-07-27 Kurzweil Applied Intelligence, Inc. Voice controlled system and method for generating text from a voice controlled input
US5051924A (en) * 1988-03-31 1991-09-24 Bergeron Larry E Method and apparatus for the generation of reports
US5168548A (en) * 1990-05-17 1992-12-01 Kurzweil Applied Intelligence, Inc. Integrated voice controlled report generating and communicating system
US5995936A (en) * 1997-02-04 1999-11-30 Brais; Louis Report generation system and method for capturing prose, audio, and video by voice command and automatically linking sound and image to formatted text locations
US6006183A (en) * 1997-12-16 1999-12-21 International Business Machines Corp. Speech recognition confidence level display
US5950167A (en) * 1998-01-26 1999-09-07 Lucent Technologies Inc. Screen-less remote voice or tone-controlled computer program operations via telephone set
US6839669B1 (en) * 1998-11-05 2005-01-04 Scansoft, Inc. Performing actions identified in recognized speech
US6243682B1 (en) * 1998-11-09 2001-06-05 Pitney Bowes Inc. Universal access photocopier
US6332120B1 (en) * 1999-04-20 2001-12-18 Solana Technology Development Corporation Broadcast speech recognition system for keyword monitoring

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040111265A1 (en) * 2002-12-06 2004-06-10 Forbes Joseph S Method and system for sequential insertion of speech recognition results to facilitate deferred transcription services
US20050096910A1 (en) * 2002-12-06 2005-05-05 Watson Kirk L. Formed document templates and related methods and systems for automated sequential insertion of speech recognition results
US7444285B2 (en) 2002-12-06 2008-10-28 3M Innovative Properties Company Method and system for sequential insertion of speech recognition results to facilitate deferred transcription services
US7774694B2 (en) 2002-12-06 2010-08-10 3M Innovation Properties Company Method and system for server-based sequential insertion processing of speech recognition results
US20050012955A1 (en) * 2003-07-14 2005-01-20 Edwards Mark Joseph Method and apparatus for recording sound information and playing sound information back using an all-in-one printer
US7359085B2 (en) * 2003-07-14 2008-04-15 Lexmark International, Inc. Method and apparatus for recording sound information and playing sound information back using an all-in-one printer
WO2019212459A1 (en) 2018-04-30 2019-11-07 Hewlett-Packard Development Company, L.P Navigation printing with voice tagging
US11354071B2 (en) 2018-04-30 2022-06-07 Hewlett-Packard Development Company, L.P. Navigation printing with voice tagging

Also Published As

Publication number Publication date
US7006968B2 (en) 2006-02-28

Similar Documents

Publication Publication Date Title
EP0887788B1 (en) Voice recognition apparatus for converting voice data present on a recording medium into text data
JP4466564B2 (en) Document creation / viewing device, document creation / viewing robot, and document creation / viewing program
US7496510B2 (en) Method and apparatus for the automatic separating and indexing of multi-speaker conversations
US7006968B2 (en) Document creation through embedded speech recognition
JPWO2005069171A1 (en) Document association apparatus and document association method
JPH07191690A (en) Minutes generation device and multispot minutes generation system
JP6095381B2 (en) Data processing apparatus, data processing method, and program
US20060149547A1 (en) Recording apparatus and voice recorder program
JP2016029468A (en) Speech information control method and terminal apparatus
KR20070118038A (en) Information processing apparatus, information processing method, and computer program
CN101025860A (en) Digital media adaptor with voice control function and its voice control method
JP4405418B2 (en) Information processing apparatus and method
KR101756836B1 (en) Method and system for generating document using speech data, and Image forming apparatus having it
US7421394B2 (en) Information processing apparatus, information processing method and recording medium, and program
KR101727587B1 (en) A Method Generating Transcripts Of Digital Recording File
JP2009288567A (en) Device, method, program and system for preparing minutes
US6212499B1 (en) Audible language recognition by successive vocabulary reduction
CN116472705A (en) Conference content display method, conference system and conference equipment
KR101781353B1 (en) A Method Generating Digital Recording File Having Integrity
CN111709253A (en) AI translation method and system for automatically converting dialect into caption
CN110853622A (en) Method and system for sentence segmentation by voice
CN210606618U (en) System for realizing voice and character recording
JP3698635B2 (en) Speech recognition processor
JP4462658B2 (en) Speech synthesizer and storage medium
JP2005024736A (en) Time series information control system and method therefor, and time series information control program

Legal Events

Date Code Title Description
AS Assignment

Owner name: HEWLETT-PACKARD COMPANY, COLORADO

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:NORRIS, COREY J.;WILEY, STEVE R.;REEL/FRAME:012486/0148

Effective date: 20011009

AS Assignment

Owner name: HEWLETT-PACKARD DEVELOPMENT COMPANY L.P., TEXAS

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:HEWLETT-PACKARD COMPANY;REEL/FRAME:014061/0492

Effective date: 20030926

Owner name: HEWLETT-PACKARD DEVELOPMENT COMPANY L.P.,TEXAS

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:HEWLETT-PACKARD COMPANY;REEL/FRAME:014061/0492

Effective date: 20030926

STCF Information on status: patent grant

Free format text: PATENTED CASE

CC Certificate of correction
FPAY Fee payment

Year of fee payment: 4

FPAY Fee payment

Year of fee payment: 8

FPAY Fee payment

Year of fee payment: 12