WO1999046762A1 - Automatic speech translator - Google Patents
Automatic speech translator
- Publication number
- WO1999046762A1 (application PCT/US1999/005058)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- speech
- language
- translator
- signals
- text
- Prior art date
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
Definitions
- the invention relates generally to automatic speech translation.
- a speech translator includes a microphone and a processor configured to receive speech signals sensed by the microphone where the speech signals represent speech in a first language.
- the processor is further configured to convert the speech signals to a first text file in the first language and to convert the first text file to a second text file in a second language.
- the translator includes a speech synthesizer which converts the second text file to speech signals in the second language.
- An amplifier receives and amplifies the speech signals in the second language.
- the translator can be a stand-alone portable unit or can be implemented using, for example, a personal computer.
- the translator can translate speech between two or more languages so that persons speaking different languages can converse without the need for a human translator.
- the translator also can be used, for example, for other purposes, such as phonic and pronunciation training in a single language or in different languages.
- the invention also includes a method of performing speech translation.
- Exemplary implementations of the translator and method are discussed in greater detail below.
- Various implementations include one or more of the following advantages.
- the translator can be compact, making it particularly convenient for use during travel abroad, for example, at business meetings.
- the translator can be incorporated as part of a headphone set to facilitate its use in theaters or in a lecture hall.
- the translator can be versatile, allowing the user to select from a menu the languages in which the parties will converse.
- Some implementations permit bi-directional translation.
- FIG. 1 is a block diagram illustrating one implementation of a speech translator according to the invention.
- FIG. 2 illustrates a portable speech translator according to the invention.
- an automatic speech translator 2 is designed to allow persons speaking different languages, such as English and French, or English and Japanese, to communicate with one another even if one person, for example, does not speak or understand the other person's language.
- the translator 2 can be implemented, for example, using a laptop computer.
- the speech translator 2 includes a computer or other processor 4, at least one microphone 6, an amplifier such as a speaker 8, and a monitor 16.
- the computer 4 can be a personal computer configured with the appropriate software, or a microprocessor programmed with dedicated software.
- a Pentium 300 device is suitable for some implementations.
- Although the microphone 6 and the speaker 8 are illustrated as discrete components of the translator 2, in some implementations the microphone and the speaker can be integrated with the translator as part of a single integrated unit.
- An operating and management software module 15 controls the overall operation of the computer 4 and the interaction between the various components of the translator 2.
- the translator 2 is designed to translate speech from a first language, such as English, to a second language, such as French.
- In some implementations, the first and second languages are selected, for example, at the time of manufacture, and the translator 2 is configured to convert speech only from the first language to the second language.
- In other implementations, the translator 2 can include a switch or menu which allows the user to select either the first language, the second language, or both, from among two or more options.
- the computer 4 includes a speech-to-text conversion unit 10, a text-to-text translation unit 12, and a text-to-speech conversion unit 14.
- the units 10, 12, 14 can be implemented in hardware, software, or a combination of both hardware and software.
- speech signals received by the microphone 6 are directed to the speech-to-text unit 10, for example, through a sound card associated with the computer 4.
- the unit 10 converts the received signals into a first digital text file in the same language as the received speech.
- the speech-to-text unit 10 includes a continuous speech recognition software module, such as the NaturallySpeaking™ system available from Dragon Systems, Inc.
- the text can be displayed on the display 16 to permit the person speaking to confirm that the speech-to-text translation is accurate.
- the translator 2 can re-convert the text output from the speech-to-text unit 10 to audio signals to allow the speaker to confirm the accuracy of the translation from speech to text.
- the computer 4 further is configured to convert the text from the first language to the second language using the text-to-text translation unit 12.
- the translation unit 12 can include a software module such as the Power Translator™ available from Globalink. In general, proper names, including names of persons or places, should be transliterated rather than translated.
- the text-to-text translation unit 12 forms a second text file which represents the received speech in the second language. In some implementations, the translated text in the second-language is displayed on the display.
- the second text file then is converted to speech in the second language by the text-to-speech conversion unit 14.
- the text-to-speech unit 14 includes a speech synthesizer, and, in one particular implementation, can include a software module such as MacinTalk™ available from Macintosh or TrueVoice™ available from Lernout & Hauspie. Other suitable text-to-speech software modules are available, for example, from International Business Machines Corporation.
- the signals generated by the text-to-speech conversion unit 14 are sent to the speaker 8 which amplifies the received signals so that they can be heard by persons in the vicinity of the translator 2. To illustrate the operation of the translator 2, it is assumed that the translator 2 is configured to translate speech from English to French. In some cases, prior to using the translator 2, it may be desirable or necessary for a person to speak sample words or phrases to train the translator and to allow it to recognize a particular person's accent.
- a first person would speak in English into the microphone 6.
- the person might say, for example, "Where is the bus?"
- the speech-to-text unit 10 would receive digital signals representing the speech and would form a first text file corresponding to that speech.
- the text "Where is the bus?" would appear on the display 16.
- the text-to-text translation unit 12 would convert the first text file to a second text file in French, and the text "Où est l'autobus?" would appear on the display 16.
- the text-to-speech unit 14 would generate sounds corresponding to the sentence "Où est l'autobus?" which would be amplified by the speaker 8 so that one or more other persons in the vicinity of the translator 2 would hear the translated sentence in French.
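The English-to-French walk-through above can be sketched as a three-stage pipeline. This is only an illustrative sketch: the function names, the lookup tables, and the byte-string stand-in for audio are assumptions made here, not part of the patent, and each stub marks the place where a real recognizer, translation engine, or synthesizer would be called.

```python
from dataclasses import dataclass

# Toy lookup tables standing in for real engines (assumptions for illustration).
RECOGNIZED = {b"<audio: where is the bus>": "Where is the bus?"}
EN_TO_FR = {"Where is the bus?": "Où est l'autobus?"}


def speech_to_text(signal: bytes) -> str:
    """Stub for unit 10: map a captured speech signal to first-language text."""
    return RECOGNIZED[signal]


def text_to_text(text: str) -> str:
    """Stub for unit 12: translate first-language text into the second language."""
    return EN_TO_FR[text]


def text_to_speech(text: str) -> bytes:
    """Stub for unit 14: 'synthesize' speech by wrapping the text as fake audio."""
    return b"<audio: " + text.encode("utf-8") + b">"


@dataclass
class Result:
    first_text: str   # first text file (shown on the display for confirmation)
    second_text: str  # second text file (translated text, also displayable)
    audio: bytes      # signal that would be sent to the speaker


def translate_speech(signal: bytes) -> Result:
    """Chain the three units in the order the description gives."""
    first = speech_to_text(signal)
    second = text_to_text(first)
    return Result(first, second, text_to_speech(second))


result = translate_speech(b"<audio: where is the bus>")
print(result.first_text)
print(result.second_text)
```

Keeping both text files in the result mirrors the description, where the recognized text and the translated text can each be shown on the display before the synthesized speech is played.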
- In some implementations, the translator 2 is programmed to perform the translation word by word, whereas in other implementations the translation is performed, for example, sentence by sentence.
- the translator 2 can be used for converting speech from the second language (e.g., French) to the first language (e.g., English) as well.
- the various units 10, 12, 14 are configured to handle the conversions and translations from the first language to the second language as well as from the second language to the first language.
- the translator 2 is programmed to be in one of two modes.
- In the first mode, the translator 2 assumes that the received speech signals correspond to speech in the first language for translation to the second language, whereas in the second mode, the translator 2 assumes that the received speech signals correspond to speech in the second language for translation to the first language.
- a manual switch or button 18 can be provided on the exterior of the translator 2 to switch between the first and second modes.
- a two-microphone sound card can be provided which allows the translator automatically to identify each language and perform the translation without the need to press the switch 18.
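The two-mode behavior described above can be modeled as a small state toggle. The sketch below is hypothetical: the `Mode` and `ModeSwitch` names are invented for illustration, and it covers only the manual switch 18, not the automatic language identification mentioned for the two-microphone sound card.

```python
from enum import Enum
from typing import Tuple


class Mode(Enum):
    FIRST_TO_SECOND = 1  # received speech is assumed to be in the first language
    SECOND_TO_FIRST = 2  # received speech is assumed to be in the second language


class ModeSwitch:
    """Hypothetical model of switch 18: each press flips the direction."""

    def __init__(self, first: str, second: str) -> None:
        self.first = first
        self.second = second
        self.mode = Mode.FIRST_TO_SECOND

    def press(self) -> None:
        """Toggle between the first and second modes."""
        if self.mode is Mode.FIRST_TO_SECOND:
            self.mode = Mode.SECOND_TO_FIRST
        else:
            self.mode = Mode.FIRST_TO_SECOND

    def direction(self) -> Tuple[str, str]:
        """Return (source language, target language) for the current mode."""
        if self.mode is Mode.FIRST_TO_SECOND:
            return (self.first, self.second)
        return (self.second, self.first)


switch = ModeSwitch("English", "French")
before = switch.direction()
switch.press()
after = switch.direction()
print(before, after)
```

The direction pair returned here is what the conversion and translation units would consult to decide which language to recognize and which to synthesize.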
- FIG. 2 illustrates additional features of an automatic speech translator 20.
- the translator 20 is configured and programmed to translate speech between selected languages in a manner similar to the translator 2 described above.
- the translator 20 is a portable unit which can be held, for example, in a person's hand.
- the translator 20 includes a power switch 22 for turning the unit on and off, and a volume switch or knob 24 for controlling the volume of generated speech.
- the translator 20 also includes a menu display 38 from which one of several options, including at least either the first or second language, can be selected by using a selection button or knob 26.
- the selection button or knob 26 allows one to scroll through the available options and to select one of the options from the menu.
- the translator 20 is capable of translating speech between any two of the following languages: English, Japanese, French, German, Italian, Russian or Chinese. Of course, in other implementations, additional or different languages are available.
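As a quick check of what "any two of the following languages" implies for the menu, the snippet below counts the selectable language pairs and the translation directions for the seven listed languages. Whether each direction needs its own translation resources is an assumption of this sketch; the description does not say.

```python
from itertools import combinations, permutations

LANGUAGES = ["English", "Japanese", "French", "German",
             "Italian", "Russian", "Chinese"]

# Unordered pairs: which two languages the parties converse in.
language_pairs = list(combinations(LANGUAGES, 2))

# Ordered pairs: each (source, target) translation direction.
directions = list(permutations(LANGUAGES, 2))

print(len(language_pairs), len(directions))  # 21 42
```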
- the menu display 38 can be a liquid crystal display (LCD).
- the translator 20 includes multiple input ports 28A, 28B for connection to respective microphones. For example, one person can use a microphone connected to the input port 28A. A second person can use a microphone connected to the input port 28B.
- the translator 20 also has a speaker 36 through which the translated speech can be heard and a liquid crystal display (LCD) 34 on which the text of the original and translated speech can be displayed.
- the display 34 can be folded onto the top of the translator 20 when the unit is not being used to provide additional compactness.
- a switch or button 32 can be provided on the exterior of the translator 20 to switch between different modes which perform translations respectively from a first language to a second language and vice-versa.
- the translator 20 includes multiple output ports 30A, 30B for connection to respective headphone sets.
- the person using a microphone connected to the input port 28A can use a headphone set connected to the output port 30A.
- the person using a microphone connected to the input port 28B can use a headphone set connected to the output port 30B.
- When headphone sets are connected to the output ports 30A, 30B, the translated speech signals are heard only by the persons wearing the headphone sets. This feature can provide additional privacy to the parties carrying on the conversation and can lessen disturbances to the parties from other noise in the vicinity.
- the translator can include a port 40 for connection to other input/output devices, such as a keyboard, hand-writing recognition devices, or a printer.
- the translator 2 saves the entire dialogue which subsequently can be printed out.
- the translator 20 also can include holders for the microphone and headphone sets when they are not in use.
- the holders can be attached to the side of the translator 20.
- the translator 20 can include a shoulder strap 42 for carrying the translator.
- the translators 2 and 20 also can be used for phonic and pronunciation training.
- the translator can be configured so that the selected first and second languages are the same.
- the translator 2 can be used to translate speech from television, radio, telephone or other speech sources as well as directly from another person.
- the translator 2 (or 20) also can be connected to a telephone line to permit conversations to take place, for example, across the Internet.
- a compact disc (CD) drive can be included as part of the translator.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
Abstract
The invention relates to a speech translator (2) comprising a microphone (6) and a speech-to-text converter (10) configured to receive speech signals in a first language and to convert the speech signals into a first text file in the first language. The first text file is converted into a second text file in a second language (12). The speech translator includes a speech synthesizer (14) which converts the second text file into speech signals in the second language. An amplifier receives and amplifies the speech signals in the second language.
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US7726398P | 1998-03-09 | 1998-03-09 | |
US60/077,263 | 1998-03-09 | ||
US9721698A | 1998-06-12 | 1998-06-12 | |
US09/097,216 | 1998-06-12 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO1999046762A1 (fr) | 1999-09-16 |
Family
ID=26759087
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US1999/005058 | Automatic speech translator | 1998-03-09 | 1999-03-09 |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO1999046762A1 (fr) |
-
1999
- 1999-03-09 WO PCT/US1999/005058 patent/WO1999046762A1/fr active Application Filing
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5384701A (en) * | 1986-10-03 | 1995-01-24 | British Telecommunications Public Limited Company | Language translation system |
US5526259A (en) * | 1990-01-30 | 1996-06-11 | Hitachi, Ltd. | Method and apparatus for inputting text |
US5768603A (en) * | 1991-07-25 | 1998-06-16 | International Business Machines Corporation | Method and system for natural language translation |
US5544050A (en) * | 1992-09-03 | 1996-08-06 | Hitachi, Ltd. | Sign language learning system and method |
US5546500A (en) * | 1993-05-10 | 1996-08-13 | Telia Ab | Arrangement for increasing the comprehension of speech when translating speech from a first language to a second language |
US5724526A (en) * | 1994-12-27 | 1998-03-03 | Sharp Kabushiki Kaisha | Electronic interpreting machine |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2001039036A1 (fr) * | 1999-11-23 | 2001-05-31 | Qualcomm Incorporated | Method and apparatus for a voice controlled foreign language translation device |
US6438524B1 (en) | 1999-11-23 | 2002-08-20 | Qualcomm, Incorporated | Method and apparatus for a voice controlled foreign language translation device |
WO2002043360A2 (fr) * | 2000-11-01 | 2002-05-30 | Lps Associates, Llc | Multimedia telephone system with Internet meeting interface |
WO2002043360A3 (fr) * | 2000-11-01 | 2003-01-30 | Lps Associates Llc | Multimedia telephone system with Internet meeting interface |
WO2002048907A1 (fr) * | 2000-12-13 | 2002-06-20 | Metso Automation Networks Oy | Method of and information search system for searching for information in process control environment |
US8103508B2 (en) | 2002-02-21 | 2012-01-24 | Mitel Networks Corporation | Voice activated language translation |
US20220030343A1 (en) * | 2018-10-11 | 2022-01-27 | Vivek Dahiya | An automated microphone system and method of adjustment thereof |
US11601740B2 (en) * | 2018-10-11 | 2023-03-07 | Vivek Dahiya | Automated microphone system and method of adjustment thereof |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7593842B2 (en) | | Device and method for translating language |
US20030115059A1 (en) | | Real time translator and method of performing real time translation of a plurality of spoken languages |
WO2007124109A3 (fr) | | Interactive conversational speech communicator method and system |
NZ335767A (en) | | Methods and apparatus for translating between languages |
US20050192811A1 (en) | | Portable translation device |
KR20030044899A (ko) | | Method and apparatus for a voice-controlled foreign language translator |
WO2003052624A1 (fr) | | Real time translator and method of performing real time translation of a plurality of spoken languages |
JP2667408B2 (ja) | | Translation communication system |
JP3473204B2 (ja) | | Translation device and portable terminal device |
US6574598B1 (en) | | Transmitter and receiver, apparatus and method, all for delivery of information |
WO1999046762A1 (fr) | | Automatic speech translator |
JPH08278972A (ja) | | Voice input translation device |
US20090055167A1 (en) | | Method for translation service using the cellular phone |
JP2020113150A (ja) | | Speech translation dialogue system |
KR19990037776A (ko) | | Speech recognition automatic translation and interpretation device |
JP2002027039A (ja) | | Communication interpretation system |
JPH10224520A (ja) | | Multimedia public telephone system |
GB2342202A (en) | | Simultaneous translation |
JPS58142479A (ja) | | Electronic translation device |
JPH0512246A (ja) | | Voice document creation device |
JP2010164921A (ja) | | Conversation assistance device for persons with disabilities |
JP2002323969A (ja) | | Communication support method, and system and device using the method |
JP2007272260A (ja) | | Automatic translation device |
JP3136038B2 (ja) | | Interpretation device |
JP2000184077A (ja) | | Door phone system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| | AK | Designated states | Kind code of ref document: A1; Designated state(s): CA IL JP |
| | AL | Designated countries for regional patents | Kind code of ref document: A1; Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE |
| | 121 | Ep: the epo has been informed by wipo that ep was designated in this application | |
| | 122 | Ep: pct application non-entry in european phase | |