CA1169969A - Dictation system and method - Google Patents

Dictation system and method

Info

Publication number
CA1169969A
CA1169969A CA000384228A CA384228A CA1169969A CA 1169969 A CA1169969 A CA 1169969A CA 000384228 A CA000384228 A CA 000384228A CA 384228 A CA384228 A CA 384228A CA 1169969 A CA1169969 A CA 1169969A
Authority
CA
Canada
Prior art keywords
words
word
signals
dictation
speech
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired
Application number
CA000384228A
Other languages
French (fr)
Inventor
Gregor N. Neff
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Application granted granted Critical
Publication of CA1169969A publication Critical patent/CA1169969A/en
Expired legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M11/00Telephonic communication systems specially adapted for combination with other electrical systems
    • H04M11/10Telephonic communication systems specially adapted for combination with other electrical systems with dictation recording and playback systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/64Automatic arrangements for answering calls; Automatic arrangements for recording messages for absent subscribers; Arrangements for recording conversations
    • H04M1/65Recording arrangements for recording a message from the calling party
    • H04M1/652Means for playing back the recorded messages by remote control over a telephone line

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Machine Translation (AREA)

Abstract

ABSTRACT OF THE DISCLOSURE

The spoken word is converted directly into the written word by the use of an automatic speech recognition device or "translator", which translates the spoken word into corresponding coded signals, and a word processing system.
To compensate for the limited vocabulary and other shortcomings of the translator device, special control means are provided for allowing the dictator to verbally spell out each word or expression which the translator is incapable of translating.
In this mode of operation, the machine automatically assembles the letters together to form words, and assembles the words so formed with other words to form sentences. The spoken words and letters appear on the video display screen of the word processor. The dictator views the text, makes any necessary corrections, and either stores the text codes in an electrical storage device for later transmission to a printer, or directly transmits the text codes to a printer which prints the text as relatively high speed. The keyboard of the word processor also can be used to input special characters or words, if desired. Also described is the improvement whereby, if the translator is too slow to keep up with the dictator, dictation is recorded first in a recorder from which it is taken by the translator at a rate at which the translator is capable of landling it. This makes use of the pauses which normally occur m dictation to give the translator time in which to operate. In an alternative system, dictation from a remote location to a central transcribing station is facilitated by locating a video monitor and a dictating device at the dictator's location, and by providing a communication link, such as by telephone lines or wireless, from the video monitor and the dictation device to the transcription station.

The transcriber transcribes the dictation and transmits it to the video monitor at the dictator's station, where the dictator can dictate corrections into his dictation machine, or can enter the corrections directly by means of the word processing system.
When the corrections have been made, the document is typed in final form. Thus, the preparation and transmittal of a written draft have been avoided, and early correction of the text shortly after dictation has been facilitated.

Description

~3~ 9 This invention relates to de~ices and methods for converting the spoken word into the written word; more specifically, ~chis invention relates to devices and methods for dictating, transcribing and recordi~g dictation in written form without human intervention between the dictation and the recording ~teps .
It long has been desired to provide a machine which will convert the ~poken word direc~ly into the written word.
To this end, a substan~ial amount of work has been done on 10 automatic speech recognition devices or "translators". Such devices convert spoken words or ~haracters in~o coded electrical signals, which ~hen can be displayed, printed or otherwise utilized. If such translators were perfect, it would be a relatively simple matter to utilize them in the au~om~tic printing or typing of speech. However, sueh devices are quite far from perfect.
: :One drawback of the transla~ors presently available ~ ~ commercially is that they have a relatively limited vocabulary.
:~ Most such translators have vocabularies of from thirty to ~hree hundred w~rds, and the most sophistieated m~chines claim to : have vocabul:aries of from one thousand to two thousand words.
; This, of cour6e, is unsatisfactory for use in mos~ dictation since a dict~ating machine should be capable of handling :
virtually any word, charac~er, ~ymbol~ or numeral in the ~5~ ~language used by the dictator.
: Another problem with~translators presently available is that~ they are incapable of satisfactorily handling the problems caused by h~monyms; that is, words which sound the :same but have different spellings (e.g., "see" and "sea" are : 30 homonyms, as are 'ibear" and "bare"). The translator is not - 1 .
:: ~, I ~ 699~

capable of discerning the proper spelling of the word from the text, and ~hus may not spell ~he word correctly when translating it.
A similar problem with translators presently av~ilable i~ that they usually require the uneconomieal use of programming and memory capacity to handle the translation of the proper names of persons or places.
Another problem with the available transla~ors is the cost. The cost usually is directly proportional to the size of the vocabulary of the machine, as w211 as its speed of opera~ion. It is believed, therefore, that the cost of a translator with the size of vocabulary required for reasonably complete dictation capabilities would be prohibitive. .-An addi~ional problem with many translators is tha~
the translator must be programmed to recognize words spoken by a particular individual. In doing the programming, usually the indi~idual must speak each individual word into ~he machine ~everal times in order for the machine ~o properly record a word recogniti~n pattern a~ainst which the machine can compare ~ words spoken~later:by the indi~idual. Of course, the larger : the ~ocabulary of the machine, the longer the time it ~akes to properly program it.
~ One drawback with most conventional.dictating systems : is~ ~hat the dict~tor hlmself does not promp~ly see a visible
2~ representation of the dictation and thus cannot review, edit and ~correct the text until later, after it has been transcribed.
: Accordingly, it is an objec~ of the present invention to provide an automatic ~r semi-automatic device and method for converting sp2ech dir~ectly into written words; that is, to : -2-..

~ 96~

convert the spoken word into printed or typed form. More partieularly, it is ~n object of the in~en~ion to provide such a device and method in which ~he problems and shortcomings mentioned above have been alleviated or eliminated.
It is another object of the invention o provide such a device and method in whieh eommerci~lly available eranslators with relatively limited vocabularies can be used effectively.
It is a further object of the inven~ion ~o provide such a device and method in which homonyms and errors of the translator can be corrected rela~ively qu~ckly and e~sily, either in advance, or shortly after dicta~ion, withou~ additional handling or personnel.
It is a further object ~o provide such a devire and method in which proper names can be handled relatively efficiently and accurately.
It is yet another object to provide cueh a device and method in which the individual programming time for each individual using the equipment is ~inimized.
Another obj ect of ~he invention is to provide such a device and method which is rel2tively simple and low in cost.
: In accordance with the:present invention, the foregoing objects are satisfied by the provision of a dictation : writing device using a translator for translating the spoken word into electrically co~e~ form, and a printer responsive ~o the coded signal6 for printing the words. At the dictator's option, the translator can be specially adapted to enable the operator to orally spell words which are incapable of bein~
translated by the translator mechanism. The machine then correctly assembles th2 word or words formed by the spelling :

~ 996~ 33-392 technique together with translated words in order to form sentences. The printer then prints the words so formed.
Preferably, the translator is used t~gether with a word processing sys~em having a video display ~erminal with a 5 kçyboard. The dictator ~ees the worts he has dictated almost immediately as they appear on the Ecreen of the terminal. If the m2chine has made an error in translatio~, or if the dictator finds that the word is untranslatable, or if he knows in advance that the word is untranslatable, he merely switches ~he machine into ~he "spell" mode of operation and spells out each word until the problem has been solved.
~ Preferably, the dictator can make correc~ions in the text as he sees it on the screen, and then, when the text is correct, can transfer it to a storage device, such ~s a magne~lc - 15 ~disc storage unit, for later transmission to a relatively high-speed typewri~er, printer, photocomposing or other recording : device. Such a recording device can, for example, produce finished, ~ addressed letters typed ~utom~ically without human intervention : between the dictation and the typing process.
20 ~ The invention thus provides an opportunity for ~ery bstantial~savlngs in labor costs in the production of typed ;and printed~m~tter, and frees office personnel from typing :
duties and makes them available for more productive tasks.
Purthermore, the system and method enables the dic~ator to 25: ;improve:~the quality ~f the written word he produces because he IS able to make~ehe corrections virtually immediately after ~dictati~on instead of later, when he has fvrgotten certain :matters which ~i~ht require eorrection.
One of the potential problems with devices such as 30 those described above~is ehat man~ ~ranslators which are commercially .

available at the present time are relatively slow in operation.
Some of these machines cannot keep up with the dictator; that is, the translators cannot translate the words as fast as the dictator can dictate them. This is very unsatisfactory because it is inefficient, and may distract the dictator and cause him to lose his train of thought.
Accordingly, one of the additional objects of the present invention is to solve the foregoing problem and ~o provide a dictation system of the abo~e described type in which the dictator need not wait for the translator to complete its translation ~efore dictating the next word.
This object is met, in accordance with the present invention, by the use of an audio recorder to record ~he dictation, and control means for causing the recorded dictation be read out to the translator at a rate at which the translator can process it. Preferably the recorder also records tone signals which are used to control the translator and word processor. It is preferred to use a recorder of the type in which dictation can be recorded at the same time that earlier dictation is being reproduced, and in which the dictation and reproduction can proceed simultaneously at two different rates.
Preferably, the reproducer is stopped when the pause ~; ;between words is detected, and is re-started when the translator is ready.
This feature of ~he invention has an additional benefit in that the separation of the words from one another is more disti~ct. ~ This helps the translator to distinguish ; ~ the words from one another and improves the accuracy of the translation. If the translator falls behind the dictator by any significant amount of ~ime, the dictator can take the ~3~9~9 opportunity to re-record portions of his dictation, if he wishes to change it.
A problem with all known dictating sys~ems is th~t the preparation and handling of drafts is relatively inefficient.
The time required in delivering the draft to the dictator, hand-correcting the draft, and retrieving the draft from the dictator would be better spent on other tasks. Moreover, when the dictator is very remote from the transcribing station, it often takes a very long time for the dictator to receive a draft of his dictation, correct it, and send it back for final typing. In fact, if the draft must be sent by mail, this can take a ~at~er of days or weeks.
Accordingly, it is another object of ~he invention to provide a dictation and transcribing system in which a draft 15 (or the final text) of the dictation is made available very quickly for the dictator to review, withou~ the need for physical delivery of a copy of the text, and without regard to the actual distance of the dictator from the ~ranscription station. If is a further object of the invention to provide : :20 Such a system and method in which corrections can be made very rapidly and easily, without writing them by hand. It is yet another object of the invention to provide such a system and method in which the dictation and/or corrections are transcribed automatically by an automatic speech recognition device or ~, ~
~; ~ 25 "transIator".
These objects are met~by the provision of a dictation system and method in which a dictation device and a visual ~; display device are Iocated at a dictation station. A transcriber and means for converting the transcribed words into coded electrical signals capable of being displayed on the visual ~ -6-display device are provided at a transcribing station.
Preferably, the devices at the dictation and transcription stations are linked for communication by means of either a directly-wired connection, or a telephone line, or a radio line, or by other communication means.
The dictator dictates into the dicta~ing device at the dictation station, and his dictation is transmitted to the transcription station where it is transcribed and converted into coded signals which then are transmitted to the visual display device at the dictation station.
The dictator then reviews the text on the visual display device, makes any necessary corrections, either by dictating them into his dictation device, or by making them electronically on his visual display device, and then transmits the corrections to the transcription s~ation.
The corrections then are made a~ the transcription station, and the text is typed in final orm.
The dictator also can review the final text by means of the visual display device, and can make any further changes which may be necessary.
The transcription can be done either by an operator on a word-processing machine or system, or by an automatic transcriptlon device sùch as the device described above. In either case, the final text is printed automatically ` 25 by the printer of the word processing system.
` By means of the foregoing system and method, the dictator can review a draft of his dictation much more quickly than if a typed or printed draft were prepared and carried into his office. In fact, he can : :

~ , ~

5~9 review the text whilP it is still being transcribed.
This facilitates more efficient dictation, since the dictator has less chance to forget important corrections which are to be made. What is more, the dictator can make corrections either by dictation or electronically, instead of by hand. In most instances, either dictation or electronic correction is faster than correction by hand. This invention is extremely advantageous when the dictator is remote from the transcription station, because it provides a means for the dictator to review the dictation minutes, hours or even days earlier than if he had to wait for a typed draft to be delivered to him.
The foregoing and other objects and advan~ages of the invention will be set forth in or apparent from the following description and drawings. In t'ne drawings:
Figure 1 is a perspective view of a preferred dictation terminal as it sits on the desk of a dictator;
~ Figure 2 is a sche~matic circuit diagram of a system :~: 20 o~ which the dictation terminal of Fiugre 1 is a part;
~: : Figure 3 is a schematic diagram of an alternative ; ~ embodiment of the invention;
~ Figure 4 is an elevation view of a hand microphone : and control unit of the dictation te~nînal shown in Figures 1 and 2.
: Figure 5 is a side elevation view of the microphone shown in Figure 4;
Figure 6 is an elevation view of an alternative hand microphone of the type shown in Figures 4 and 5;
Figure 7 is a perspective schematic view of a 1~69~

complete automatic dictation typing system which might be used in the dictator's office;
Figure 8 is a schematic circuit diagram illustrating a system utilizing a portion of the apparatus shown in Figure 7 with several different dictation stations;
Figure 9 is a schematic circuit diagram showing the detailed interconnections of a portion of the system shown in Figure 2;
Figure 10 is a schematic circuit diagram of another system constructed in accordance with ~he present invention;
Figure 11 is a schema~ic circuit diagram of yet another embodiment of the system of the present invention;
Figure 12 is a par~ially schematic view of a visual display device utilized in the system of Figure 11; and Figure 13 is a schematic representation of controls on the operation panel of the unit shown in Figure 12.
GENEP~L DESCRIPTION
-: Figure 1 shows a dictation terminal 10 resting on the top of a desk 14. The dictation terminal 10 has a hand microphone 12, a video screen 16 for displaying ~: written words which have been dictated, and a keyboard 18.
Preferably, the unit 10 is of ~he type used in word : ; processing systems, with the addition of the hand microphone 12, or a desk-top microphone and a foot pedal (not shown), ~: : 25 :if desired.

g_ : ~ :

~ ~ ~9~69 33-392 , Figure 2 is a schematic diagram sh~wing how ehe dictation terminal 10 i~ onnected ~4 the microphone 12 ~nd o~her equipment in the dictation writing syseem. The unit 10 is one of four different rem~te unit~, eaeh of which c~n be l'ocated in a different office or area, either in ehe same or another place of business.
Each of the units 10 i~ connected to one channel of a central translator unit 30 which translates the worts dictated into the microphone 12 into binary digital da~a, and returns that data to the unit 10 which displays it on its video screen 16.
The video screen 16 preferably is capable of storing a full page of written text. The dictator can see each word as it is dictated and appears a~most immediately on a screen 16.
When the text on the screen is sati~factory~ the dic~ator uses the keyboard to transfer the data for the page appearing vn the screen 16 into a central disc file 32 which is of the type used with typical word processing 6ystems. The unit 32 is a mul~i-disc magnetic disc stora~e unitO
: 20 The dictaeor then proceeds eo dictate another ~creen full of wri.tten information, and transfers it to the disc file 32.
When the dooument ~eing dictated is complete, the operator can operate ~he unle 10 to transmit the information stor~d in tXe disc file 32 to a printer 34, 36 or 38.
THE WORD PROCESSOR
: ~arious w~rd processing systems are suitable for use with the present invention. One such system is the "Dual Di~play" word processor sold by Dictaphone Corporation, Rye, New York. Another ls the "CPT 8000" word processor ~old by CPT Corporation, Minneapolis, Minnesot~. Such systems _10-~lS~9~9 include video tenminals with keyboards, disc files and prin~ers, as it is well known. The above-identified word processsrs actu-ally may have more features than would be necessary to make them suitable for use in this invention. Therefore, even ~impler machines can be used, if desired.
THE PRINTER
Although many different types of computer data printers can be used, for general office work, it is preferred that the printer be one like tha~ which is used in most word processors, 1~ namely, a "daisy-wheel" printer or the equivalent, which produces typewritten m~tex on sheets of paper. Such sheets ~f paper can be letterhead paper or the like, ~o that the result of the print ing operation is a typed letter, ready to be reviewed, signed and mailed. Additional printers 36 and 38 optionally can be provided 1~ with different types or sizes of paper so as to facilitate auto-matic typing on a varie~y of different media, merely by selec~ing the printer to be used. Alternati~ely, the addi~ional printers can be used so as to enable ~wo or more printing jobs to proceed : simultaneously, thus increasing the production rate of the system.
The typewriter can be 8 standard typewTiter or one of the proportional spacing type.
Alternatively, the unit to which the informa~ion is delivered can be a photocomposing machine which produces photo-graphic film or paper upon which the written matter is recorded in order to be used in making printing plates.
: If desired, the matter stored in each terminal 10 for printing can be delivered directly to the printer 34 instead of to the disc file 32.

.

. . ~. , , ' .

~ ~ 699~g 33-392 THE TR~NSLATOR
The ~ranslator 30 preferably is a commercially-a~ailable uni~ which is used for converting spoken words into coded electrical signals. Preferably, the device 30 ~hould have a relatively large vocabulary. The device al~o 6hould have the capability of handling inputs from several different sources simultaneously, if sever~l dictators are to use ~he 6ystem.
There are several devices which meet these requirements. ~or example, one such dcvice is the Model YDES automatic speech recognition system sold by Interstate Electronics, Inc., Anaheim, California. This device has a vocabulary of up to 800 words, and can handle inputs from four different users si~ultaneously.
It is believed that, in actual tests which have been performed, trained operators have achieved a recognition accuracy of over 99%.
i$ Where only one dictator will use the system, and where a ~maller vocabulary is acceptable, a lower cost ~ranslator can be used. For example, it may be possible to use the translator ~ unit sold by Heuristics, Inc. of Sunnyvale, California. It is : .called ~he Model H~000 "Speechliné" Automa~ic Speech Recognition System.
A number of other co~mer~ial systems are available : ~ which are believed to be satisfactory for use in the present invention. For their various capabilities and cost, see the article entitled "Words Into Action: I" by Gadi Kaplan, "IEEEE
:25~ Spectrum", June 1980, pages 22-26~
: With most of the available translator devices, the speaker mus~ pause briefly between successive words. This is because the machine is not capable of differentiating between words unless there is a certain minimum amoun~ of time between 30 them. However, some devices, suc~ as the DP-100 device : '. ' '.:

manufactured by Nippon Electric Co. Ltd., Tokyo, Japan, are capable of the limited recognition of "connected speech", that is, words spoken withou-t pauses between them, such as normally is done in ordinary speech. If the capability of recognizing connected speech is important, then such a machine should be selected. Another machine which is reportedly capable of detecting and recognizing connected speech is called the "Quiktalk" High-speed Speech Recognition System sold by Threshold Technology, Inc., Delran, New Jersey.
"Speaker independent'~ devices, that is, devices which recognize speech without programming for each individual user, are available. For example, such a device is sold by Dialog Systems, Inc. of Belmont, Massachusetts. The use of such devices will reduce programming time requirements, but may give reduced recognition accuracy, reduced vocabulary, and may be more costly than other systems.
Most translators must be programmed for operation by a~part1cular individual. The individual speaks each word ~; of ~the machine's vocabulary a plurality of times, The machine then derives a pattern for the average of the signals received when the~word is spoken, and stores this pattern in memory.
Then~, whe~n the same person speaks that word during operation of~the machine, the machlne compares the incoming speech patterns with~that stored~and issues a code representative of the~correct word.

:

~ .

.. ~, .
L~.
'` : ' . , . , :, ` ' ~ 1 699~

Since the translator recognizes words strictly by their sounds, it cannot differentiate between homonyms. For example, if the dictator were to dictate the word "see", the translator coult give the code representing "see", or the word "~ea", or the letter "C". Which of these altern~tive~ it would select depends upon ~ow lt is programmed. However, two of the three choices would be erroneous. A h-man operator usually can determine the proper spelling because of the meaning of the word as used in the sentence. Even then, it may be necessary for the dictator to give special instructio~s to avoid errors. It is believed that, at the present time, available ~ranslators are not capable of automatically differentiating between different spellings of the same sound.
Althou~h a translator can be programmed tG recognize proper names, names of cities, and towns and countries, etc., vrdinarily it is impractical to program it to recognize more than~a few frequently-used names because of the memory require-ments and th~ programming time required. Although this m~y not provide an impediment for most of the current commercial uses of such translat~rs, such as in quality control, etc., it creates a substantial impediment to the use of the translator in a dicta-tion system.
A further problem of such translator devices is thPir ~relatively limited vocabularies. The largest vocabulary claimed ZS for;~ny of~the commercial devices presently ~vailable is somewhat over 1,000 words. Were the dictator able to use only words in ~uch a ~ocabulary; the machine probably would be of extremely limlted usefulness and probably would be of little commercial interest as a dictating machine.

:: :

' ' -1 3 ~9~6~

An additional problem is created when the vocabulary of ~he machine i5 made very large. Since the translator usually must be programmed to the specific voice of a particular pPrson, every person w~o uses the machine must repeat every w~rd to ~e ~ored in the vocabulary during programming several times over.
Therefore, the larger the vocabulary of ~he m~chine, the longer the operator must spend in initially programming ~he machine.
Of course, the larger vocabulary makes the machine considerably more expensive, too.
THE "SPELL" M~DE
In accordance with one aspect of the present invention, the foregoing problems are solved or alleviated by providing means whereby the dictator can switch the machine from its normal mode into a "spell" mode in which each word can be spelled-out orally. The translator will recognize each character uttered by the dictator, and will assemble the characters together to form a word. The machine also assembles ~hat word ~ogether wi~h :other words previously or subsequently dictated, in order to form : ~entences. Further, if, for any reason, the use of the "spell"
option is undesirable or unsatisactory, ~he machine can be oper-ated in a "type" mode in which the words can be typed on the keyboard 18 of the ~nit ~0, as in the normal operation of any : word processor. How~ver, it is believed ~hat there will be : little or no necessity for entering text in the "type" mode, 2:5 with the result that virtually all of the text is entered orally, rather than manually.
THE MICROPHONE
Figure 4 is ~n ele~ation view of the hand microphone 12 used with the dictation unit 10 of Figures 1 and 2. The mierophone 12 includes a body or housing 42 having a grill 40 ~ 1 699~9 protecting a microphone inside the housing, and a cord 44 ~o transmit signals ~o and from the microphone.
A plurality of function keys is located on the microphone body. One such key is a "Dictatet' key 48 which is pressed when the dictator wishes to have ~he word translated by the translator device.
As it is shown in Figure 9, depression of the "Dicta~e"
key 48 sends a signal over a line 98 to ~he unit 10. ThP unit 10 is adapted so that when it receives a signal on line 98, it 10 switches from operation with input from ~he keyboard 18, to operation with signals coming from the translator 3~. In other words, normal operation of the word processor is inhibited and it is adapted to receive and process codes from the ~ranslator.
Also provided is a "Spell" button 50. On depression of the button 50, as it i5 shown in Figure 9, a signal is sent over a line 100 to the unit 10. This signal also sdapts ~he word processor to receive input signals from the translator, as in the "Dictate" mode. Also, ~he automatic word spacing provided in the "Dictate" mode is altered so that the charact rs ~are~written without spacing between them. Thus, the characters are assembled to form words.
Preferably, the uni~ 10 is adapted to reco~nize the rece$pt on line 100 of a positive-going`electrical pulse as an in truction to crea~e an interword space in the character train being recorded. If desired, a flip-flop circui~ 102 can be ~on-nected in the manner shown so as to produce a positive-going pulse upon the release of the "Spell" button 5~ so as to create an inter-character space. Thus, the release of the "Spell" button ~at the end of each word which has been spelled will produce a space ; 30 between that word and the next one.

'':

A third bucton on the microphone 12 shown in Figure 4 is a "Type" buttorl 52 which is depressed ~o change the system into the chird mode of operation, namely, one in which inpuc is from the keyboard of the word processor A number of other control but~ons appear on the handset 12. These inolute a "Backspace" button 54 to backspace by one characte~ space; a "Word Back" button 56 to go back one word (that is, to the prec~ding interword space); and a but~o~ 58 to backspace by one entire line.
~lso psovided ar~ buttons 60 to space in ~he forward direction by one character space; a button 62 to space one line forward, and a button 64 to space one word forward ~that is, to the next interword space, when reviewing existing text).
Also provided a~e a button 66 to ~elete a character;
lS a button 68 to delete an entire word, and a but~on 70 to delete an entire line, all for the purposes of making corrections.
These buttons actuate the correction mechanisms of the word processor to make the corrections in a known manner.
It can be seen from Figure 5 that the tops of the buttons are at different elevations from one another so as to make them easier to tvuch without interference with adjaeent buttons.
; ~ A modified hand microphone unit 12 is shown in Figure 6. The unit ~hown in Figure 6 has 8 "~ictate" button ~ 2~ 4g and a "Spell" bu~on 50, as in Figure 4, and has several :~ o~ber butcons which also appear in the device of Figure 4, and;which are given cor~esponding reference numerals. The "Type"
button 52 of Figure ~ has been omitted because it is not necessary.
When the microphone of Figure 6 is used, the machine automatically is in the "Type" mode at a~l times, unless one of the control buttor~is pressed tD change the mode of operation.

' ' .. . . . .

9 ~ g Additional buttons whieh are provided in the device of Figure 6 include a "Paragraph" but~on 72, which causes the text to automatically shift to a new line and indent to start a new paragraph. Additionally, a button 74 is provided to capitalize words or characters being dictated.
Also provided in the device of Figure 6 are a "Store Word" button 76 and a "Recall" word 78. When a particular word, numeral or expression has been spelled out, it may be desirable to store ~he word in memory for a later recall so ~hat ~he same word will not have to be spelled out several different times while dictating a single text. The depression of but~on 78 causes the display of all the words stored in this manner on the screen of the unit 10, and allows selection of-the desired word by opera-tion of the keyboard 18. It is desirable to locate as many of the function keys of the keyboard 18 on the microphone hand se~
as is practical.
EXAMPLE
As an exa~ple of the operation of the foregoing system, the dictation steps which would be required for dictat-ing the following letter will be explained. The le~ter is datedApril 20, 1980, and is addressed to Mr. Joseph Jones, at Jones :~ Men's Fashions, 4939 Hillside Avenue, Milwaukee, Wisconsin 53202.
Dear Mr. Jones:
We:have received your letter of April 19th and your Purchase Order 4259 for three gross of men's sheepskin caps at $10.00 each for a total of $4,320.00.
Please be advised that the price on this item now is: $11.50 each. If this priee is acceptable to you, please ~. ~ B~ 33-392 si~n a copy of this letter and return it to us to confirm your order at the new price.
Sincerely yours, Stanley A. Penn Sales Manager The dictator st~rts by pressing the "Space Forward"
button 6Q (Figure 4) to properly locate the date. Then he presses the dictate button 4S and dictates the date in its entirety. The translator is programmed to correctly recognize ~he months of the year, ~he year 1980, and all numbers from O to at least 31, and it is programmed to cause a com~a eo be printed when the dictator dictates the word "comma".
Next, the dictator presses the "Line-Forward" button 62 to prepare for the dictation of the address of ~he le~ter.
Since the name and address of the addressee are not easily programmable in the automatic dictation mode of the machine, :: the operator now depresses ~he "Spell" button 50 to enable him :
: :to spell the name and address of the adressee. He then oralIy ; pell the~addressee's name. Capital letters prefe~ably are 2~0 ~ formed by pressing:~he "CAP" button 74 (Figure 6). However, they :also can be formed by programming the machine to capitalize the next character when a code word i6 spoken. The dictator then p~resses the "Line-Forward" button S2 and orally spells ~he address of the:addressee. He then presses the "Line-Forward" button ~sga~in, reIeases the "Speil" button, depresses the "Dictate"
:~ button to dictate the whole words: "Dear Mr.". (The machine is pro~ra~med to ~pell l'Mr." when it~hears "mister"). Then the ~ dictator reverts to the "SpelI" mode to spell "Jones", and dictates ::: :~:: : :: : :

~ ~ 6~9~9 33-392 the word "c~lon" t~ produce the colon. The dictator then proceeds to dictate the text of the letter.
The machine has sufficient vocabulary to translate the first sentence of the text up to the number "4259". Sinee the n~mbers 4259 normally would be printed with spa~e~ between th~m, the operator depresses the "5pell" button 50 and dictates ~he numbers "4259". The machine is not ~ble ~o correctly translate "for", sinee it has homonyms. For example, the same sound could mean the numeral "4" or "four" as well as the word "for". There-fore, the dictator shifts into the "Spell" mode and spells out the word "for". He then shif~s back into the dic~ate mode by depress-ing the button 48 until he reaches the expression "men's sheepskin caps". Since these words are ~ot found in the vocabulary of the machine, the dictator switches to the "Spell" mode and spells these words orally, releasin~ the "Spell" button 50 at the end of eaeh word in order ~o space the words from one another.
~ Similarly, when the word "for" and the sum "$4,320.00" are ;~ reached, these words and numbers also are spelled out.
: The operator then depresses the paragraph key 72 (Figure 6), i~ such a key is provided, which automatically ~ spaces forward one line and indents ~o the start of a line.
:; : Alternatively, he can say "paragraph", or give another oral eommand, and the machine, when specially programmed to do so, will~space and indent automatically.
The re~ainder of the letter, except for the $11.50 price, can ~e translated by the translator unit 30, 60 that the "Spell" mode need be used only once more durin~ the dictation of the letter. Although the name of the writer, Stanley Penn, is a proper name, it is stor~d in memory so that the translator can recognize ~t, because it will be used repeatedly by Mr. Penn and this makes it worthwhile to store the name.

g~

The depression of the ~'Capital" button, either on the keyboard 18 or on the hand microphone 12, ~utomatically capitalizes only the irst letter of the word being capitalized.
If the capitalization of every letter of the w~rd i~ desired, S this can be accomplished by holding the "CPpital" butt~n down ~hile continuing dictation.
AN INDIVIDUAL SYSTEM
Figure 7 shows a complete individual dicta~ion system 80 which might be used in a one-man office, or by a person in a larger cffice desiring to have all the equipmen~ nearby. The system 80 includes a separate translator unit 82 wi~h a hand microphone 12. The translator ~2 is connected to the ~ideo keyboard unit 10, which is connected to ~he disc ~ile 32 and the printer 34. Paper can be fed into the printer 34, and the printed or typed text can be taken out of the prin~er by the dictator himself.
Figure 8 shows a multiple-terminal dictation system using a plurality of de~ices 10 and individual translators 82 at different work station~ 86, 88, 90 and 92, but usin~ a single 20: ~ disc-fil~ 32 and printer 34.
: At each s~ation, the outpu~ of the microphone 12 is delivered to the translator 82, which delivers its output to a ~: ~switching device 94 which Bwitches the unit 10 between the typing mode, the "Spell" mode, and the "Dictate" mode, and ;operation is substantially as described above. The output of ~eaeh station is transmitted by means vf a multiplexer circuit 96 to the disc file unit 32. The multiplexer un~t 96 may contain a buffer storage device to store data received from one of the dictators until later when it can be recorded in the disc file.
This ena~:les ai~ultaneous operation of the various work stations without ~ntQrfe~ence between them.

' ~

~3-39~

PAP~R HANDLING
It is pcssible ~hat ~he system of ei~her Figure 2 or Figure 8 can be used with a single printer whirh is tended by an opera~or ~ho puts in the desired sizes of paper, takes ~t the typed produc~s, puts them i~ envelopes, ~nd mails them.
However, if the 6ystem of Figure 7 is used, i~ is desirable ~hat the paper be fed fr~m a r~ o that the repeated insertion of sheets ~y the dictaeor is not required. If letterhead paper is desired, the letterhead can be prin~d at regular intervals along the sheet. These headings are printed at dis~ances from one another which are excessive for normal use, and the excess is trimmed off by a knife or cutter (not shown). If a particular page is to be unadorned or blank, the hPading simply can be cut off of ~he top of the next shee~, e~c.
-~ 15~ If desired, an automatic sheet feeding device can be provided as an input to the printer 34 in order to input 8-1/2" by ll"
~ letterhead, plain bond, or larger sizes of paper, as desired.
:: : Alternatively, each of the separate printers 34, 36 and 38 can be et up ~o print on a specific type of roll fed paper9 with the 20~ paper being cut exactly to the desired length by a knife in the machine. Selection of ~he printer determines the type of paper used. This can be done electrically by the dic~ator.
desired, the speration of the machine can be se~ up i~ different formats by the use of conventional computerized format control. Thus, by simple selection of the desired format by~operatlon ~f the keyboard 18, the machine can be adapted auto-matically to set up for the prepar tion of let~ers, or reports, : etc. This can ~e done in aceordance with techniques well known in ~he art.

' ~' :

~ ~ 6~9S~

REMOTE COMMUNICATIONS EMBODIMENT
Figure 3 shows an al~ernative form of the in~ention in ~hich the input to the device is by means of a telephone handset 106. The handset 106 is connected through ehe telephone lines 107 remotely to a receiver 109 at the location of the ~ictation wlitin~ deviee. The translator 82 is pre-programmed to recognize only a specific caller's voice. When it does so recognize his VQiCe, it converts the words used by the sp~aker into digi~al form, and sends them to a speech reproduction device 106 which transmits audible reproductions of the words over the telephone lines back to the diotator in order to allow him to check the correctness of the translation. After initial identification, ~he dictator can dirtate remotely and have his words stored in the disc-iile, and ~hen he can cause the trans-mission from the disc-file of the diceation to a printer 34.
Thus, a remote dictation feature has been provided for the invention; one which requires no separate hand-held code sending device for remote actuation.
If preferred, the telephone handset and lines can be ~0 replaced by a radio transceiver, or by other types of remote communication devices.
All function instructions such as "spell" and "cap.", etc. can be spoken and need not be input by means of pushbuttons, ~; in this e~bodiment of the inventivn. This is accomplished by : :25 programming.
: ~ : Similarly, the translator 82 can be used for remote : : identîfication of a subscriber or owner of a telephone answering .
:~ machine 108. When the caller has been properly identified, the telephone answering machine 108 will automatically read the messages stored in the machine out ~o the call~r and allow him ' $~5J

~o gi~e the machine new instructions. Thus, the invention provides for the remote retrieval ~f information from an automatic telephone answering mæchine, without the usual hand-held coding device.
One use envisioned for the present inYention is in 5 or~dinary offices. Another is for use as an inpu~c device to type composing mac}lines,.particularly phototypesetting m~chines.
Such a system can be used, for example, in composing a daily newspaper. Each repor~er can dictate his story at a terminal of the type shown in Figure 1, and he can review and edit the column before it actually is composed by the pho~ocomposing machine.
One of the advantages of the inven~ion is that the dictator sees the product of his dictation on the video screen 16 virtually immediately after he has dictated it. This gives him an opportunity to edit or correct the text while the 6ubject matter of the dictation is fresh in his memory. Further-more, since the letter or other document is typed virtually immediately after dic~ation is complete, the dictator can see a copy of the typed or printed text shortly after it has been ~ dic~ated, thus avoiding the ofeen substantial delay in transcrip-tion of dictation by usual means~
It is believed that the foregoing factors may be enough to improve the overall dictation efficiency of the dictator, compared with his efficiency when using other dic~a-tion equipment. When this is coupled with ~he ability to mailletters more promp~ly and to otherwise complete ~asks at an earlier date, the overall improvement should substantially outweight any increase in the ~ime of dictati~n required to spell selected portions of the dictation. In this regard, it should be noted that in normal dictation it often is required that the dic~ator ~9~9 spell certain unusual words, names, addresses, towns, etc.
Therefore, the increased amount of ~ime required ~o spell additional words not capable of being translated correc~ly is not as great as it otherwise might be.
As translator devices improve with further developmen~, at is probable that the ~ize of the ~vailable vocabulary will increase without a corresponding i~crease in eos~, and the programming time will decrease, 60 that progressively fewer words and terms must be spelled.
Another time-saving feature of the invention is provided by the fact that the dictator need not operate a rewind mechanism and hunt for previously dictated material because this material nor~ally will be in full view. Thus, if he forgets what he said previously, he merely need ~o refer to the screen quickly, without operating any buttons, to regain his train of thought. If the material he is looking for is on a previous page, most word processing machines have the capability of recalling the previous page or an earlier psge of text rapidly and easily.
~2~: In A preferred embodiment of the invention, it is preferred that all characters, punctuation marks and numbers ~ : b~e dictated only in the "Spell" mode, and that cer~ain homonyms :: of:those items can ~e programmed to be retrieved during the "Dicta~te" mode. For example, during operation in the "Dictate"
:mode, dictation of the ~ound for the letter "r" would be ~rans-lated as "are`'. However, during the "Spell" mode, ~he same ::sound will be translated as the letter "r" (or "R"). Similarly, :
during the "Dictate'~ mode, the sound for the numeral "2" would : be translated 8S "too" (or "two", if preferred). However, during the "Spell" mode the same sound would be translated as the numeral "2".

~ ~ ~99~9 Of course, all punctuation marks either must be dictated or inserted by means of keys on the microphone 12 or the keyboard 18. During the "Dictate" mode, a "." punctua-tion mark would be translated as the word "period". However, during the "Spell" mode, the same sound is translated as ".".
Capital letters also can be handled either by the use of a key on the microphone 12, or on the keyboard 18, or by dictation of the word "Capital" preceding each letter to be capitalized. The machine can be programmed to automatically capitalize the first letter of the first word of each new sentencP. Similarly, it should be programmed to automatically provide for two inter-word spaces following each period punctuation mark.
By means of the foregoing separation of letters and numbers into the "Spell" mode only, a form of au~omatic - treatment of certain homonyms has been accomplished.
ACCELERATED DICTATION SYSTEM AND METHOD
Figure 10 shows a dic~ation and transcription ~system llO which is substantially the same as the system ; 20 o~ Figure 1, except that the sys~em 110 includes a recorder/
reproducer unit 16 which provides an improvement in the operation of the system.
The system of Figure 1 comprises a plurality of dictation;devices, each of which includes a visual display 25~ device 120 and a microphone 112. The dictation spoken~into the microphone 112 is delivered to a central translator device l18 which converts the spoken words into electrical ~: slgnals represen~ing ~he corresponding written words, and ;` those~signals are delivered to the visual display device 120 ~ 30 ~ where the words are displayed on the screen for thP dictator : ~: : :
~ 26-~, ~: .

to see.
The dictator can correct the text which he sees, and then deliver it to a disc file 122 or other storage device for storage, and thence to a printer or a photocomposer 124 or 126 to prepare a printed text.
There are two dictation sta~ions shown at Figure lO.
Each dictation station is in a different office or location in one or more buildings, and the ~ranslator unit 118, disc file 122 and the printer 124 or 126 can be located at a central location within the same building or a different building. Although only two dictation stations are shown in Figure 10, it should be understood that this has been done solely to simplify the drawings, and that more dictation stations can be used, if desired.
As it is explained in greater detail above, special control means are provided to enable the dictator to selectively spell words which the translator is incapable of correctly translating, so as to avoid the need for a very large vocabulary for the transla~or, and to overcome other shortcomings of that device.
; ~ Pushbuttons 114 on the microphone 114 are used to create signals indicating the selection of the spelling mode, as well as to delete, backspace, etc., in order to make corrections in the text appearing on the screen of the CRT 120.
In accordance with the present invention, :
a sound recorder/reproducer device 116 is interposed ~ ~between the microphone 112 and the remainder of the system.
: :

~ 1 ~g969 Preferably, the recorder/reproducer 116 is of the ~ype in which recording and reproduction can take place simultaneously and at different rates. One device which has the capability, for example, is an endless-tape random-storage recorder/reproducer sold under the trademark "Thought Tank" by Diotaphone Corporation, Rye, New York. That device has a storage housing 128.
In the housing is an endless magnetic tape 136 which is "jumble-stored" (allowed to pile up randomly in the housing), a recording unit 130 and a reproducing unit 134.
A separate capstan 132 driven by its own motor moves the tape 136 past the recording head of the unit 130, and another capstan and motor moves the tape past the reproducing head of the unit 134. With such a device, dictation can be reproduced within a few seconds after it has been dictated, and reproduction can take place simultaneously with and independently from dictation.
Moreover, the reproduction and dictation rates can be quite different. If the transcription lags behind the dictation, the tape bearing the dictation will accumulate in the housing 128 until it can be transcribed.

:

- , 33-396 g 6 ~

Voice signals are delivered over a linell3 to the recording unit ~0 which records them on the tape ~6. Signals indicating the selection of the "spell" mode of operation of the translator also are delivered over the linell3 and r~corded on the tape. These signals preferably are audio-frequency tone-~oded signals developed by tone generators operated by the push-buttonsll4, in the nature of "Touch--Tone" telephone pushbuttons. Other signals to be used in the operation of the transl tor similarly are recorded on the tape. Decoding circuitry is provided in ~he translator to decode ~he tone coded signals and in`struct the translator in ~ts operations when the tones are reproduced by the reproducing unitl34.
Also transmitted over the linell3 are signals which are used to stop, start and control the recording unit ~0 as in the normal operation of the recorder/reproducerll6.
Other signals developed by operation of ~he pushbutton~
114 in order to control the operatisn of the visual display unit12`0 for corrections, etc.,are delivered directly to the unitl20 over a line 115.
At the start o operations/ a ST~RT signal is delivered through an OR gatei41 to the reproducing unit'134 to start it. Preferably, the START signal is developed by : the operation of a key on the keyboard of the display unit 120 Connected to the reproducing unitl34 is a detector 25 circuit 13-8 which detects the gaps between words or function signals (pauses of at least 0.1 second duration which ~re required betwee~ successive words in the dictation or proper operation of the translator) and disables the reproduction device 134 ; and stops movement of the tape past the recording head. This condition persists until another detector circuitl~0 detects a .

~ 33-396 a signal sent over a linell7 upon the deli~ery to the visual display unitl20 of the coded signals representin~ a word which has been translated by ~he translatorll8 and sends a signal through the OR gatel41 which start~ the reproducing unitl34 again. The circuit140 also sends a reset signal to the detector~38 to reset it and ready it for the detection of the next inter-word gap. The detector~38 also sends a signal o~er a lineIl9 when it detects a gap so as to indicate the end of each word more positively than if the gap were detected solely by the ~ransla~or.
Thus, by means of the foregoing construction, the recorded words are reproduced one-at-a-time, at a rate at which the translator is eapable of transla~ing them. If the translator is finished with the translation of a wosd before the nex~ inter-word gap is detected, the reset signal from the detectorl4o will prevent the circuitl38 from stopping the tape, and the next word will be reproduced without stopping the reprcdueîng uni~.
. In addition to, or instead of, the recorder/reproducer h 6, a digital buffPr storage unit can be used ~o store voice and translator func~ion signals.
Circu~ts for detecting a pause in voice signals : ~ and turning a device on or off in response to such a detection ~ are known and used, for example, in automatic telephone ; 25 answering machines, and they will not be described in detail ; herein.
The above-described system and method al-low ~he d~ctator to dictate at his own pace, without regard to whether the translator can keep up with him. The ~ranslator proceeds at its own pace t u~lizing the recorded dictation as fast a~ it 11 ~ 6~9~

can. It is believed that this system and method take advantage of the pauses which a dictator normally has in his dictation. That is, if the translator lags behind, during the pauses which normally occur in the dictation, 5 the translator continues to translate stored dictation, thus ,enabling it to utilize the time which otherwise would be was~ed to help to match the speed of the translator to that of ~he dictator.
It also is believed that the stop-start operation 10 of the reproducer helps to enhance the correct detection of the gaps between words. This is because the repro~ucer actually stops between words. This is believed to improve the correct translation of the words being dictated.
REMOTE DICTATION SYST~M AND METHOD
Fi~ure ll describes a remote dictation sys~e~ including two dictation stations 142 andl44, and a transcription station 14~.
The dictation'stations142 andl44 are in two different offices in one building146, while the transcription sta~ion 14~ is in another building. Of course, the areasl42, 14~ and 14~ also can represent different areas within the ofices of a single business establishment. Moreover, although only ~wo dictation stations 142and 144are disclosed, it should be understood that : the system can include more dictation stations, if desired.
: At each dictation station 142 or 144 there is a dictation 25~ device 1780r 180and a visual display device 170 or 172, The :~ dictation devicel78 is shown as a telephone hand-set type of : ~ input~ device for a "Thought Tank" remote dictation system such : as the on described above.
The "Thought Tank" di.ctation syst~m includes a recorder/reproducer device 164at the transcription station, ~996~

as well as a head-set 168 for the operator ~o use in listening - to the dictation. Dictation is trans~itted over a line162 ~o the remote recorder/repr~ducer 16~.
The line 162 is shown schematically. It can represent either a wire extending between the offices of the~dictators rand the transcription station, or i~ can be a telephone line, or it can represent a radis, video, or other communication link suikable for transmitting dictation ~o the transcription station.
The visu~l display device 170 orl72preferably is a CRT deYice of the type used in word processing systems. The visual display devices 170and..'172are connected to the transcription station1,48 by means of the same line 162.
Also located at the transcripti~n station 148is a word processing system generally indicated at 150. The word processing device l50includes a CRT display unitl52 with a keyboard ~, a disc file~54, and a reLatively high-speed printer or composer 1S6. The coded signals which form the output of the word processor are delivered over a line "A" or '~B", etc., through the line 162to the ~isual display device 1700r 172 ~: which has the corresponding letter next to i~
In the case in which the line!l~2 is a ~elephone line~
the word processor output is coupled to the line l62 by ~eans of ~ a modem 16Q,which converts the digital signals ~rom the word : 25 processor into audio signals sui~able for sending over telephone lines. Similarly, each of the visual display devices 170 and l72 ~: ~is connected to the line 162through its own modem 176 or 174, respectiveLy. In this case, the input units 17~ and 180 are : ~telephone hand set~ wh~ch are coupled to the telephone lin~ 162 by means of normal ~elephone coupling ~ev~cesl8~ At the transcription stations 14S,a unit 1.66is used to couple th~

recorder/reproducer 164 to the telephone line 162. The uni~
166 i~ a standard uni~ which is used to couple the recorder/
reproducer 164 for remote dictation, and will not be described in detail herein.
S An automatic word recognition device or translator . 169 is shown in dashed outline at the transcription s~ation in Figure 11. It is shown in dashed outline to indicate that it can be used instead of an operator to au~omatîcally translate and thus transcribe dictation, in the manner described above.
The system shown in Figure llopera~es as follo~s.
The dictator dictates into the devicc 17~or 18~ and this dictation is transmitted to the recorder/repr~ducer 164 where it is recorded. Shortly thereafter, or at a later time which is convenien~, the operator reproduces the dic~ation and listens to i~ by means of the headset 168 (or the translator 169 translates the dictation).
In transcribing the dictation, the operator uses the keyboard 158to produce t~e dictation and display it on the screen of the CRT display device 152~,or the translator 169 performs the same function. Simultaneously, signals are ~ ~ transmitted over the line 162to the visual display devicel70.
; The words there are formed on the screen of the visual display ~ device so that the dictator can review the dictation for making corrections or other purposes.
When making corrections, the dictator can dictate them into the input device 178so that they are recorded in the recorder/reproducer. Alternatively, corrections can be made electronically, in the same manner in which they are made in word processor systems.

I ~ 699~9 If the corrections are not ma'de electronically, the operator listens to the correc~ions and makes ~hem. In either case, after corrections ha~e been made, the text is sent to the disc file and/or the printex 156to produce the final printer copy. If desired, the dicta~or can again re~iew the rprinted text after the corrections have been ~ade and before thP printed copy has been prepared.
Figure 12 is a schematic elevation view of one of the identical visual display devices 170 and 172. Th~ unit 170 includes a CRT screen 184, and a control panel186. The CRT
screen 184has a ~ertical array ~8 of line ~arkings in the left hand margin to faeilitate oral reference to specific lines of the text by the dictator when dictating corrections.
The most basic components of the control panel are enclosed in Figure 13 in dashed outlinel90. These controls - include an on/off swit¢h 192, a "p~ge forward" switchi94 to change the di~play on the screen to the next page of text material, and a "page back" swit~h 196, to change the page of material back one page.
Al~o included in the controls 190 is an indicator light 198 indicatin~ that transcription îs in progress, and a second indicator ligh~ 200 indicating that transcription is complete and ready for review. The indicator 19~ is lighted ; whenever transcription is being taken from the recorder~reproducer ~5. The indicator 200 is lighted by the operator or the translator deviee when a transcription job is complete, and can be extinguished by the dictator. Thus, the dictator can ~ete~mine when transcription is in progress and when lt is eomplete so that he can decide when to turn on the unit 170 to review the text.

9~9 Also shown in Figure ~ are optional controls These include a keypad 202 including keys 204 for entering each of the numerals from 0 to 9, as well as keys 206 for moving a cursor up, downl to the left and to ~he right on ~he CRT screen 184. This cursor is for the purpose of indicating t~e location rat which a particular correction is to be made. A~so included is an "Enter" switch 108 which enters the number selected on the keypad 202. Means are provided for transmitting the location of the cursor to the transcription station in response to operation of the Enter switch.
Additional keys include a "Display Document" key 2~0 which is used to display a particular document ~hich has ~een stored in the word processor system, regardless of the order in which the documents there are stored. This key is used in coniunction with the keypad 202, which is used to identify the document desired by its identification number, The button 210 is pressed to cause the selected document to be displayed.
A 'IDocument Forward" switch 12 and a "Document Back"
switch 214 also are provided. These switches will allow the dictator to review one document stored immediately preceeding the document bein~ displayed, or a document ollowin~ th~
do:cument being displayed. The circuitry and programming necessary to perform the functions and operations controlled by these switches are well known in the art and will no~ be described in detail herein.
As an alternative to oral identification and cursor location of corrections, a "light pen 210 and control circuit ~1 can be used to quickly identify the location of the correction ~ and transmit that location to the transcribing station.
The system shown in Figures llthroughl3 has the .

I ~ 69969 33-396 advantages described abov~. It is possible for the dictator to see a draft of his dictation (or the final copy~ if desired) on the visual display device very shortly after he has dictated ~he dicta~ion. This enables him to review and correct it before he has forgotten what is is all aboutO This tends to ~make for greater ~ictation efficiency.
The draft is in front of the dictator more rapidly than it would be if it were a typed or printed draft. This is because the draft is delivered by elec~rical means rather than manually.
Another advantage is ~hat paper for the draf~ is not wasted. The only paper which is used is that for the final copy. Another ad~antage is that corrections can be dictated, if desired, thus making the correction pr~cess potentially faster than if corrections were made by hand.
The system of Figure llmakes it possible to provide a centralized transcription service for a relatively large number of dictation stations which can be located on the same floor in the same building, or in separate buildings within an industrial complex, or in separate buildings within a city or municipal area, or even in differe~t citiesO This permits the provision of transcription services for even the most remote outpos ts, where such services no~mally would be totally impractical to provide. Moreover, it facilitates a transcription service operation in which dictation is transcribed by ~he independent agency operating at the transcription station, and subscribers are ccnnected by telephone or other communication links to the transcription service so that fast transcription can be provided without the physical delivery of typed copies to and from the transcription service offices.

.... :

I 1 69~9 The above description of the invention is intended to be illustrative and not limiting. Various changes or modifications in the embodiments described may occur to those skilled in the art and these can be made without departing from the spirit or scope of the invention.
t : ::

.

Claims (34)

THE EMBODIMENTS OF THE INVENTION IN WHICH AN EXCLUSIVE
PROPERTY OR PRIVILEGE IS CLAIMED AXE DEFINED AS FOLLOWS:
1. A device for converting speech into corresponding written words, said device comprising, in combination, speech recognition means for translating spoken word and character sounds into corresponding coded electrical translated word and character signals, spelling means for selectively assembling said translated character signals to form spelled word signals and assembling said spelled word signals with said translated word signals to form word sequence signals representing a written word sequence, and visual display means for visually displaying words corresponding to said word sequence signals.
2. A device as claimed in Claim 1, including transducer means for converting spoken sounds into electrical speech signals, storage means for storing said electrical speech signals prior to being delivered to said speech recognition means, and means for delivering said speech signals from said storage means to said speech recognition means at a rate at which said speech recognition means is capable of translating them.
3. A device as claimed in Claim 1, including speech transducer means for converting spoken sounds into electrical signals prior to being delivered to said speech recognition means, said transducer means and said visual display means being located adjacent one another so that one speaking into said transducer means can see said visual display means.
4. A device as claimed in claim 1, in which said spelling means includes switching means for changing the operation of said device between a dictate mode in which all words and characters are displayed separately, as spoken, and a spelling mode in which said spelling means is enabled.
5. A device as claimed in Claim 4, in which selected homonyms of certain sounds are assigned to one of said dictate and spelling modes, and other homonyms for those sounds are assigned to the other mode, so that selection between said modes also is used to select between homonyms of words capable of being translated by said speech recognition means.
6. A device as claimed in claim 1, including correction means connected for deleting and correcting words and characters displayed by said visual means.
7. A device as claimed in Claim 6, in which said correction means includes said speech recognition means for developing replacement words and characters.
8. A device as claimed in claim 1, claim 2 or claim 3 including a keyboard and means for permitting said translated character signals to be input to said visual display means from either said speech recognition means or said keyboard.
9. A device as claimed in claim 1, claim 2 or claim 3 in which said visual display means includes a cathode ray tube screen for displaying a plurality of lines of said words.
10. A device as claimed in claim 1 in which said visual display means includes graphic means for recording said words and sentences on sheet material.
11. A device as claimed in Claim 10, in which said graphic means is a printer.
12. A device as claimed in Claim 10, in which said graphic means is a photographic type composing machine.
13. A device as claimed in Claims 10, 11 or 12, including storage means for storing said word sequence signals for later recording on sheet material.
14. A device as claimed in claim 1, including manual control means for switching said spelling means between a word-forming mode and a character-forming mode, said spelling means being adapted, in the last-named mode, to form said translated character signals into said spelled word signals by selectively locating the characters adjacent one another until a spacing command is received.
15. A device as claimed in Claim 14, including transducer means as claimed in Claim 2 or Claim 3, in which said transducer means is a microphone, said manual control means being mounted integrally with said microphone.
16. A device as claimed in Claim 14, including spacing means for automatically spacing each of said spelled words from any preceding words upon the operation of said manual control means, and providing proper spacing of each of said spelled words from the next following word.
17. A device as claimed in claim 1, including mode switching means operable for selecting among a plurality of different modes of operation, said mode switching means being adapted to cause said speech recognition means to produce word signals representing a predetermined homonym when operated to select a predetermined mode of operation.
18. A device as claimed in Claim 17, in which punctuation marks are assigned to one of said modes, and the homonyms of said punctuation marks are assigned to another of said modes.
19. A device as claimed in Claim 17, in which numerals are assigned to one of said modes, and the homonyms of said numerals are assigned to another of said modes.
20. A device as claimed in Claim 2, in which said electrical speech signal-storage means comprises a voice signal recorder device capable of recording speech and simultaneously r producing and delivering electrical signals representative of previously stored speech, the rates of recording and reproducing being independent of one another.
21. A device as claimed in Claim 20, in which said recorder device is a random-storage endless magnetic tape recorder-reproducer.
22. A device as claimed in Claim 2, including sound reproducing means for reproducing dictation, said delivering means comprising means for stopping and starting said reproducing means as necessary to allow said speech recognition means to complete an operation.
23. A device as claimed in Claim 22, including means for detecting pauses between words and causing said reproducing means to stop in response to the detection of said pauses.
24. A device as claimed in claim 1, claim 2 or claim 3 and in which said display means includes a cathode ray tube.
25. A method of converting speech into written form utilizing a speech recognition device which translates spoken words and character sounds into corresponding coded electrical signals, said method comprising the steps of;
speaking into said device those words which are translatable by said device, speaking into said device characters which spell words which are not translatable by said device, selectively assembling the resulting character signals to form word signals, and visibly displaying words corresponding to said word signals.
26. A method as claimed in Claim 25, in which said displaying step comprises displaying said words on a video screen, and selectively correcting said words electronically.
27. A method as claimed in Claim 25 including recording said words on sheet material, said recording step being selected from the group comprising printing and photocomposing.
28. A method as claim in Claim 27, including storing said word signals in memory prior to recording them.
29. A method as claimed in claim 25, claim 26 or claim 27 including the step of assigning one spelling of homonyms to one mode of operation of said device, and another spelling to the "spelling" mode of operation, whereby a predetermined differentiation between said spellings is made.
30. A method as claimed in Claim 25, which includes storing speech signals prior to delivering them to said speech recognition device, and reading said speech signals out of storage at a rate corresponding to the rate at which the signals can be translated by said speech recognition device.
31. A method as claimed in Claim 30, in which said speech signal-storing step is performed by recording said signals in a recorder/reproducer device, and stopping and starting the reproduction function of said recorder/reproducer device when necessary to allow said speech recognition device to complete the translation of words in progress.
32. A method as claimed in Claim 31, in which said stopping step comprises detecting pauses between words and stopping the reproduction function in response to such detection.
33. A device as claimed in Claim 1, including a receiver for receiving remote voice signals, said speech recognition means being programmed to receive and translate the speech of a specific individual, said receiver being connected to deliver remote voice signals to said speech recognition device.
34. A device as claimed in Claim 33, including means for transmitting back to the sender an audible reproduction of the output of said speech recognition means.
CA000384228A 1980-08-20 1981-08-19 Dictation system and method Expired CA1169969A (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US17980880A 1980-08-20 1980-08-20
US22010880A 1980-12-24 1980-12-24
US179,808 1988-04-11
US220,108 1988-06-24

Publications (1)

Publication Number Publication Date
CA1169969A true CA1169969A (en) 1984-06-26

Family

ID=26875701

Family Applications (1)

Application Number Title Priority Date Filing Date
CA000384228A Expired CA1169969A (en) 1980-08-20 1981-08-19 Dictation system and method

Country Status (2)

Country Link
CA (1) CA1169969A (en)
GB (1) GB2082820B (en)

Families Citing this family (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4627001A (en) * 1982-11-03 1986-12-02 Wang Laboratories, Inc. Editing voice data
US4779209A (en) * 1982-11-03 1988-10-18 Wang Laboratories, Inc. Editing voice data
FR2542952B1 (en) * 1983-03-18 1985-10-04 Lecomte Daniel DEVICE FOR AUTOMATIC ANSWERING AND DIVERSION OF DELAYED CALLS FOR TELEPHONE SET
US4914704A (en) * 1984-10-30 1990-04-03 International Business Machines Corporation Text editor for speech input
FR2581469B1 (en) * 1985-05-06 1987-10-30 Matra Communication VOICE INPUT-OUTPUT DEVICE AND SPEECH RECOGNITION OR SYNTHESIS SYSTEM WITH APPLICATION
NL8502959A (en) * 1985-08-26 1987-03-16 Lely Nv C Van Der ELECTRONIC DEVICE RESPONDING TO SOUND.
WO1987007803A1 (en) * 1986-06-13 1987-12-17 Edwin Kellenberger System for text processing
DE3927234A1 (en) * 1988-03-10 1991-02-21 Grundig Emv Computer with speech I=O unit and command converter - can be operated like dictation machine without special skills
DE3807851A1 (en) * 1988-03-10 1989-09-21 Grundig Emv COMPUTER, ESPECIALLY PERSONNEL COMPUTER, WITH A VOICE INPUT AND A VOICE OUTPUT SYSTEM
GB8818170D0 (en) * 1988-07-29 1988-09-01 Edwards J A Voice type computer
AT391035B (en) * 1988-12-07 1990-08-10 Philips Nv VOICE RECOGNITION SYSTEM
WO1993018506A1 (en) * 1992-03-06 1993-09-16 Dragon Systems, Inc. Speech recognition system for languages with compound words
DE19616029A1 (en) * 1996-04-23 1997-11-06 Ingo Prof Demske Interactive data transmission arrangement
GB2323693B (en) * 1997-03-27 2001-09-26 Forum Technology Ltd Speech to text conversion
US6173259B1 (en) * 1997-03-27 2001-01-09 Speech Machines Plc Speech to text conversion
GB2323694B (en) * 1997-03-27 2001-07-18 Forum Technology Ltd Adaptation in speech to text conversion
US6163768A (en) 1998-06-15 2000-12-19 Dragon Systems, Inc. Non-interactive enrollment in speech recognition
US6434526B1 (en) * 1998-06-29 2002-08-13 International Business Machines Corporation Network application software services containing a speech recognition capability
FR2783334B1 (en) * 1998-09-11 2001-07-27 Denis Moura VOICE MESSAGE RECORDING DEVICE FOR VOICE DICTATION SOFTWARE
US6535848B1 (en) * 1999-06-08 2003-03-18 International Business Machines Corporation Method and apparatus for transcribing multiple files into a single document
JP3969908B2 (en) * 1999-09-14 2007-09-05 キヤノン株式会社 Voice input terminal, voice recognition device, voice communication system, and voice communication method
WO2002005537A1 (en) * 2000-06-29 2002-01-17 Koninklijke Philips Electronics N.V. Speech quality estimation for off-line speech recognition
DE20100105U1 (en) * 2001-01-04 2001-03-15 Dosch & Amand Gmbh & Co Kg Dictation device
JP2003143256A (en) * 2001-10-30 2003-05-16 Nec Corp Terminal and communication control method
US7519042B2 (en) 2003-09-12 2009-04-14 Motorola, Inc. Apparatus and method for mixed-media call formatting
US20080031475A1 (en) 2006-07-08 2008-02-07 Personics Holdings Inc. Personal audio assistant device and method
JP6166831B1 (en) * 2016-10-21 2017-07-19 犬養 俊輔 Word learning support device, word learning support program, and word learning support method
CN111292721A (en) * 2020-02-20 2020-06-16 深圳壹账通智能科技有限公司 Code compiling method and device and computer equipment

Also Published As

Publication number Publication date
GB2082820A (en) 1982-03-10
GB2082820B (en) 1984-03-28

Similar Documents

Publication Publication Date Title
CA1169969A (en) Dictation system and method
US3648249A (en) Audio-responsive visual display system incorporating audio and digital information segmentation and coordination
US6298326B1 (en) Off-site data entry system
US4969136A (en) Communications network and method with appointment information communication capabilities
CN1140871C (en) Method and system for realizing voice frequency signal replay of multisource document
CA1334869C (en) Method and apparatus for the generation of reports
US4424575A (en) Text processing system including means to associate commentary with text
Gould et al. Human factors challenges in creating a principal support office system—The speech filing system approach
US4509138A (en) Word processor
EP0716381A2 (en) Character information processor for printing characters
GB2195863A (en) Communication network
US4462085A (en) Word processor for controlling an external dictating machine
US3328764A (en) Copy editor processing device
US5805297A (en) Multifunctional apparatus with selectable functions and a common printer for facsimile reception
JP2765656B2 (en) Komi Unity Network
CN101753666B (en) Mobile phone for blind user
US20060167685A1 (en) Method and device for the rapid, pattern-recognition-supported transcription of spoken and written utterances
CA1173974A (en) Portable word processor
JPH0572968A (en) Intention transmission device for physically handicapped person
GB2195864A (en) Communications network and method with visible message transmission capabilities
Mellor Technical innovations in braille reading, writing, and production
JPS60251466A (en) Chinese braille point voice word processor
CN115268663A (en) Intelligent wrong-word early warning method for blind typing of visually impaired people
van Nes Design and evaluation of applications with speech interfaces-experimental results and practical guidelines
Allen Composition and editing of spoken letters

Legal Events

Date Code Title Description
MKEX Expiry