WO2011107420A1 - System for translating spoken language into sign language for the deaf - Google Patents

Info

Publication number
WO2011107420A1
Authority
WO
WIPO (PCT)
Prior art keywords
video sequences
computer
language
video
sign language
Prior art date
Application number
PCT/EP2011/052894
Other languages
French (fr)
Inventor
Klaus Illgner-Fehns
Original Assignee
Institut für Rundfunktechnik GmbH
Priority date
2010-03-01
Filing date
2011-02-28
Publication date
2011-09-09
Application filed by Institut für Rundfunktechnik GmbH
Priority to US13/581,993 (US20130204605A1)
Priority to EP11704994A (EP2543030A1)
Priority to JP2012555378A (JP2013521523A)
Priority to CN2011800117965A (CN102893313A)
Priority to KR1020127025846A (KR20130029055A)
Publication of WO2011107420A1

Classifications

    • G PHYSICS
    • G09 EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09B EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B21/00 Teaching, or communicating with, the blind, deaf or mute
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/40 Processing or translation of natural language
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00 Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/10 Services
    • G06Q50/20 Education
    • G PHYSICS
    • G09 EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09B EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B21/00 Teaching, or communicating with, the blind, deaf or mute
    • G09B21/009 Teaching or communicating with deaf persons

Abstract

To automatise the translation of spoken language into sign language and to manage without human interpreter services, a system is proposed which comprises the following features: a database (10), in which text data of words and syntax of the spoken language as well as sequences of video data with the corresponding meanings in the sign language are stored, and a computer (20), which communicates with the database (10) in order to translate fed text data of a spoken language into corresponding video sequences of the sign language, wherein, further, video sequences of initial hand states for definition of transition positions between individual grammatical structures of the sign language are stored in the database (10) as metadata, which are inserted by the computer (20) between the video sequences of the grammatical structures of the sign language during the translation.

Description

SYSTEM FOR TRANSLATING SPOKEN LANGUAGE INTO SIGN LANGUAGE FOR THE DEAF
DESCRIPTION
The invention relates to a system for translating spoken language into sign language for the deaf.
Sign language is the name given to visually perceivable gestures, which are primarily formed using the hands in connection with facial expression, mouth expression, and posture. Sign languages have their own grammatical structures, because sign languages cannot be converted into spoken language word by word. In particular, multiple pieces of information may be transmitted simultaneously using a sign language, whereas a spoken language consists of consecutive pieces of information, i.e. sounds and words.
The translation of spoken language into a sign language is performed by sign language interpreters, who - comparable to foreign language interpreters - are trained in a full-time study program. For audio-visual media, in particular film and television, deaf people have a large demand for translation of film and television sound into sign language, which, however, can only be met inadequately owing to the lack of a sufficient number of sign language interpreters.
The technical problem of the invention is to automatise the translation of spoken language into sign language in order to manage without human interpreter services.
According to the invention, this technical problem is solved by the features in the characterizing portion of the patent claim 1.
Advantageous embodiments and developments of the system according to the invention follow from the dependent claims.
The invention is based on the idea of storing in a database, on the one hand, text data of words and syntax of a spoken language, for example of the German standard language, and, on the other hand, sequences of video data of the corresponding meaning in the sign language. As a result, the database comprises an audio-visual language dictionary, in which, for words and/or terms of the spoken language, the corresponding images or video sequences of the sign language are available. For the translation of spoken language into sign language, a computer communicates with the database, wherein textual information, which in particular may also consist of speech components of an audio-visual signal converted into text, is fed into the computer. For spoken texts, the pitch (prosody) and the volume of the speech components are analyzed insofar as this is required for the detection of the semantics. The video sequences corresponding to the fed text data are read out by the computer from the database and joined into a complete video sequence. This may be reproduced on its own (for example for radio programs, podcasts or the like) or, for example, fed into an image overlay, which overlays the video sequences on the original audio-visual signal as a "picture in picture". Both image signals may be synchronized to each other by means of a dynamic adjustment of the playback speed. Hence, a larger time delay between spoken language and sign language may be reduced in the "on-line" mode and largely avoided in the "off-line" mode.
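The look-up and joining step described above can be illustrated with a minimal Python sketch. It is not the implementation disclosed in the application: the `Clip` record, the `SIGN_DICTIONARY` contents and the `translate` function are hypothetical names introduced here, an in-memory dict stands in for the database, and the sketch looks terms up one by one, ignoring the syntax data and the prosody analysis that the description also mentions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Clip:
    """A stored sign-language video sequence (hypothetical record layout)."""
    gloss: str      # label of the sign shown in the clip
    frames: tuple   # placeholder for the decoded video frames


# Hypothetical stand-in for the database: spoken-language term -> clip.
SIGN_DICTIONARY = {
    "good": Clip("GOOD", ("g1", "g2")),
    "evening": Clip("EVENING", ("e1", "e2", "e3")),
}


def translate(text, dictionary):
    """Look up each term and join the clip frames into one sequence.

    Returns the joined frames and the list of terms that have no entry.
    """
    frames = []
    missing = []
    for term in text.lower().split():
        clip = dictionary.get(term)
        if clip is None:
            missing.append(term)        # no sign stored for this term
        else:
            frames.extend(clip.frames)  # append the clip to the output
    return frames, missing


frames, missing = translate("Good evening viewers", SIGN_DICTIONARY)
```

Returning the unmatched terms separately is a design choice of this sketch; the source does not say how terms without a dictionary entry are handled.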
Because the initial hand states between the individual grammatical structures must be recognisable for understanding of the sign language, further, video sequences of initial hand states are stored in the form of metadata in the database, wherein the video sequences of the initial hand states are inserted between the grammatical structures of the sign language during the translation. Apart from the initial hand states, the transitions between the individual segments play an important role for obtaining a fluent "visual" speech impression. For this purpose, corresponding crossfades may be computed by means of the stored metadata regarding the initial hand states and the hand states at the transitions so that the hand positions follow seamlessly at the transition from one segment to the next segment.
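One way to picture the transition computation is sketched below in Python. The source only states that crossfades may be computed from the stored hand-state metadata; the representation of a hand state as an (x, y) coordinate pair, the function name `transition_positions` and the use of linear interpolation (rather than a pixel-level crossfade) are assumptions made for illustration.

```python
def transition_positions(end_state, start_state, steps):
    """Return `steps` hand positions strictly between two hand states.

    end_state:   (x, y) of the hand at the end of the previous segment
    start_state: (x, y) of the hand at the start of the next segment
    Both the coordinate layout and the linear interpolation are
    assumptions of this sketch, not taken from the source.
    """
    (x0, y0), (x1, y1) = end_state, start_state
    positions = []
    for i in range(1, steps + 1):
        t = i / (steps + 1)  # fraction of the way to the next segment
        positions.append((x0 + (x1 - x0) * t, y0 + (y1 - y0) * t))
    return positions


path = transition_positions((0.0, 0.0), (4.0, 8.0), 3)
```

With three intermediate steps the hand moves a quarter of the distance per step, so the next segment starts exactly where the transition ends.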
The invention is described in more detail by means of the embodiments in the drawings.
Fig. 1 shows a schematic block diagram of a system for translating spoken language into a sign language for the deaf in the form of video sequences;
Fig. 2 shows a schematic block diagram of a first embodiment for the processing of the video sequences generated using the system according to Fig. 1, and
Fig. 3 shows a schematic block diagram of a second embodiment for the processing of the video sequences generated using the system according to Fig. 1.
In Fig. 1, the reference sign 10 designates a database, which is constructed as an audio-visual language dictionary, in which, for words and/or terms of a spoken language, the corresponding images of a sign language are stored in the form of video sequences (clips). Via a data bus 11, the database 10 communicates with a computer 20, which addresses the database 10 with text data of words and/or terms of the spoken language and reads out the corresponding video sequences of the sign language stored therein onto its output line 21. Further and preferably, metadata for initial hand states of the sign language may be stored in the database 10, which define transition positions of the individual gestures and, in the form of transition sequences, are inserted between consecutive video sequences of the individual gestures. In the following, the generated video and transition sequences are referred to only as "video sequences".
In a first embodiment shown in Fig. 2, for the processing of the generated video sequences, the video sequences read out by the computer 20 onto the output line 21 are fed to an image overlay 120 either directly or, after intermediate storage in a video memory ("sequence memory") 130, via its output 131. Additionally, the video sequences stored in the video memory 130 may be displayed on a display 180 via the output 132 of the memory 130. The output of the stored video sequences onto the outputs 131 and 132 is controlled by a control 140, which is connected to the memory 130 via an output 141. Further, an analogue television signal from a television signal converter 110, which converts an audio-visual signal into a standardized analogue television signal at its output 111, is fed into the image overlay 120. The image overlay 120 inserts the read-out video sequences into the analogue television signal, for example as "picture in picture" (abbreviated as "PIP"). The "PIP" television signal so generated at the output 121 of the image overlay 120 is transmitted according to Fig. 2 from a television signal transmitter 150 via an analogue transmission path 151 to a receiver 160. During the reproduction of the received television signal 50 on a reproduction apparatus 170 (display), the image component of the audio-visual signal and, separated therefrom, the gestures of a sign language interpreter may be observed simultaneously.
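The "picture in picture" insertion can be pictured with a minimal Python sketch. It works on already digitised frames held as nested lists of pixel values, which is an assumption of this sketch; the embodiment above operates on an analogue television signal, and the function name `overlay_pip` and its parameters are hypothetical.

```python
def overlay_pip(main_frame, inset_frame, top, left):
    """Return a copy of main_frame with inset_frame pasted at (top, left).

    Frames are lists of rows, each row a list of pixel values
    (a simplification chosen for this sketch).
    """
    rows, cols = len(main_frame), len(main_frame[0])
    inset_rows, inset_cols = len(inset_frame), len(inset_frame[0])
    if top < 0 or left < 0 or top + inset_rows > rows or left + inset_cols > cols:
        raise ValueError("inset does not fit inside the main frame")
    result = [row[:] for row in main_frame]  # leave the input untouched
    for r in range(inset_rows):
        # Replace the covered pixels of this row with the inset pixels.
        result[top + r][left:left + inset_cols] = inset_frame[r]
    return result


programme = [[0, 0, 0, 0] for _ in range(3)]  # 3x4 programme frame
gesture = [[9, 9], [9, 9]]                    # 2x2 gesture frame
combined = overlay_pip(programme, gesture, top=1, left=2)
```

Placing the inset in the lower right corner here is arbitrary; the source does not specify where the gesture window sits.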
In a second embodiment shown in Fig. 3, for the processing of the generated video sequences, the video sequences read out by the computer 20 onto the output line 21 are fed to a multiplexer 220 either directly or, after intermediate storage in a video memory ("sequence memory") 130, via its output 131. Further, a digital television signal comprising a separate data channel, into which the multiplexer 220 inserts the video sequences, is fed into the multiplexer 220 from the output 112 of the television signal converter 110. The digital television signal so processed at the output 221 of the multiplexer 220 is in turn transmitted by a television transmitter 150 via a digital transmission path 151 to a receiver 160. During reproduction of the received digital television signal 50 on a reproduction apparatus 170 (display), the image component of the audio-visual signal and, separated therefrom, the gestures of a sign language interpreter may be observed simultaneously.
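The idea of carrying the gesture video in a separate data channel of the digital signal can be sketched in Python as follows. The packet layout (a channel tag plus a payload), the channel names and the simple alternating schedule are hypothetical; the source does not specify a container format or how the multiplexer 220 schedules packets.

```python
from itertools import zip_longest

_MISSING = object()  # fill value for the shorter stream


def multiplex(programme_packets, sign_packets):
    """Interleave two packet streams, tagging each packet with its channel."""
    stream = []
    pairs = zip_longest(programme_packets, sign_packets, fillvalue=_MISSING)
    for programme, sign in pairs:
        if programme is not _MISSING:
            stream.append(("programme", programme))
        if sign is not _MISSING:
            stream.append(("sign", sign))
    return stream


def demultiplex(stream, channel):
    """Recover the payloads of one channel in their original order."""
    return [payload for tag, payload in stream if tag == channel]


stream = multiplex(["p1", "p2", "p3"], ["s1"])
gesture_packets = demultiplex(stream, "sign")
```

Because the gesture packets travel in their own channel, a receiver that ignores that channel still reproduces the programme unchanged, which is what distinguishes this embodiment from the overlay of Fig. 2.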
As shown in Fig. 3, the video sequences 21 may further be transmitted to a user from the memory 130 (or directly from the computer 20) via an independent second transmission path 190 (for example via the internet). In this case, no insertion of the video sequences in the digital television signal by a multiplexer 220 takes place. Rather, the video sequences and transition sequences received by the user via the independent second transmission path 190 may be inserted on user demand and via an image overlay 200 in the digital television signal received by the receiver 160 and the gestures may be reproduced on the display 170 as picture in picture.
Another alternative shown in Fig. 3 is that the generated video sequences 21 are played individually via the second transmission path 190 (broadcast or streaming) or are offered for a retrieval (for example for an audio book 210) via an output 133 of the video memory 130.
Depending on the form in which the audio-visual signal is generated or derived, Fig. 1 shows, as an example, an offline version and an online version for the feeding of the text data into the computer 20. In the online version, the audio-visual signal is generated in a television or film studio 60 by means of a camera 61 and a speech microphone 62. Via a sound output 64 of the speech microphone 62, the speech component of the audio-visual signal is fed into a text converter 70, which converts the spoken language into text data comprising words and/or terms of the spoken language and thus generates an intermediate format. Then, the text data is transmitted to the computer 20 via a text data line 71, where it addresses the corresponding data of the sign language in the database 10.
Where what is referred to as a "teleprompter" 90 is used in the studio 60, from whose monitor a speaker reads the text to be spoken, the text data of the teleprompter 90 is fed via the line 91 into the text converter 70 or (not shown) directly into the computer 20.
In the offline version, the speech component of the audio-visual signal is, for example, scanned at the audio output 81 of a film scanner 80, which converts a film into a television sound signal. Instead of a film scanner 80, a disc storage medium (for example DVD) may also be provided for the audio-visual signal. The speech component of the scanned audio-visual signal in turn is fed into the text converter 70 (or another, not explicitly shown text converter), which, for the computer 20, converts the spoken language into text data comprising words and/or terms of the spoken language.
The audio-visual signals from the studio 60 or the film scanner 80 may further preferably be stored on a signal memory 50 via their outputs 65 or 82. Via its output 51, the signal memory 50 feeds the stored audio-visual signal into the television signal converter 110, which generates an analogue or digital television signal from the fed audio-visual signal. Naturally, it is also possible to feed the audio-visual signals from the studio 60 or the film scanner 80 directly into the television signal converter 110.
In the case of radio signals, the above remarks apply analogously, except that no video signal exists in parallel to the audio signal. In the online mode, the audio signal is recorded directly via the microphone 62 and fed into the text converter 70 via the sound output 64. In the offline mode, the audio signal of an audio file, which may be present in any format, is fed into the text converter. For optimizing the synchronisation of the video sequences with the gestures and the parallel video sequence, a logic 100 (for example a frame rate converter) may optionally be connected, which, by means of the time information from the original audio signal and the video signal (time stamp of the camera 61 at the camera output 63), dynamically varies (accelerates or decelerates) the playback speed both of the gesture video sequence from the computer 20 and of the original audio-visual signal from the signal memory 50. For this purpose, the control output 101 of the logic 100 is connected both with the computer 20 and with the signal memory 50. By means of this synchronisation, a larger time delay between the spoken language and the sign language may be reduced in the "on-line" mode and may largely be avoided in the "off-line" mode.
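How time information could drive the playback speed is sketched below in Python. The source only states that the logic 100 accelerates or decelerates playback on the basis of time stamps; the proportional rule, the `window` and `max_change` parameters and the function name `playback_rate` are assumptions chosen for illustration.

```python
def playback_rate(programme_time, gesture_time, window=4.0, max_change=0.5):
    """Return a speed factor for the gesture video.

    A value above 1.0 speeds the gestures up, below 1.0 slows them down.
    The lag is spread over `window` seconds and the correction is limited
    to +/- `max_change` (both hypothetical tuning parameters).
    """
    if window <= 0:
        raise ValueError("window must be positive")
    lag = programme_time - gesture_time  # positive: gestures are behind
    rate = 1.0 + lag / window
    return max(1.0 - max_change, min(1.0 + max_change, rate))


rate = playback_rate(programme_time=12.0, gesture_time=11.0)
```

The same factor, inverted, could be applied to the original audio-visual signal so that both streams meet in the middle; whether the logic 100 splits the correction this way is not stated in the source.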

Claims

1. System for translating spoken language into a sign language for the deaf, characterized by the following features:
a database (10), in which text data of words and syntax of the spoken language as well as sequences of video data with the corresponding meanings in the sign language are stored, and
a computer (20), which communicates with the database (10) in order to translate fed text data of a spoken language into corresponding video sequences of the sign language,
wherein, further, video sequences of initial hand states for definition of transition positions between individual grammatical structures of the sign language are stored in the database (10) as metadata, which are inserted by the computer (20) between the video sequences of the grammatical structures of the sign language during the translation.
2. System according to claim 1, characterized by a device (120; 220) for inserting the video sequences translated by the computer (20) in an audio-visual signal.
3. System according to claim 1 or 2, characterized by a converter (70) for converting the sound signal component of an audio-visual signal into text data and for feeding the text data into the computer (20).
4. System according to one of the claims 1 to 3, characterized in that a logic device (100) is provided, which feeds a time information deduced from the audio-visual signal into the computer (20), wherein the fed time information dynamically varies both the playback speed of the video sequence from the computer (20) and of the original audiovisual signal.
5. System according to one of the claims 1 to 4, wherein the audio-visual signal is transmitted to a receiver (160) as digital signal via a television signal transmitter (150), characterized in that an independent second transmission path (190) (for example via the internet) is provided for the video sequences (21), via which the video sequences (21) are transmitted to a user from a video memory (130) or directly from the computer (20) and that an image overlay (200) is connected with the receiver (160) in order to insert the video sequences (21) transmitted to the user via the independent second transmission path (190) in the digital television signal received by the receiver (160) as picture in picture.
6. System according to one of the claims 1 to 4, characterized in that an independent second transmission path (190) (for example via the internet) is provided for the video sequences (21), via which the video sequences (21) are played from a video memory (130) or directly from the computer (20) for broadcast or streaming applications or offered for a retrieval (for example for an audio book (210)).
7. Receiver for a digital audio-visual signal, characterized by an image overlay (200) connected with the receiver (160) in order to insert the video sequences (21) transmitted via an independent second transmission path (190) in the digital television signal received by the receiver (160) as picture in picture.
PCT/EP2011/052894 2010-03-01 2011-02-28 System for translating spoken language into sign language for the deaf WO2011107420A1 (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
US13/581,993 US20130204605A1 (en) 2010-03-01 2011-02-28 System for translating spoken language into sign language for the deaf
EP11704994A EP2543030A1 (en) 2010-03-01 2011-02-28 System for translating spoken language into sign language for the deaf
JP2012555378A JP2013521523A (en) 2010-03-01 2011-02-28 A system for translating spoken language into sign language for the hearing impaired
CN2011800117965A CN102893313A (en) 2010-03-01 2011-02-28 System for translating spoken language into sign language for the deaf
KR1020127025846A KR20130029055A (en) 2010-03-01 2011-02-28 System for translating spoken language into sign language for the deaf

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
DE102010009738.1 2010-03-01
DE102010009738A DE102010009738A1 (en) 2010-03-01 2010-03-01 Arrangement for translating spoken language into a sign language for the deaf

Publications (1)

Publication Number Publication Date
WO2011107420A1 (en) 2011-09-09

Family

ID=43983702

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2011/052894 WO2011107420A1 (en) 2010-03-01 2011-02-28 System for translating spoken language into sign language for the deaf

Country Status (8)

Country Link
US (1) US20130204605A1 (en)
EP (1) EP2543030A1 (en)
JP (1) JP2013521523A (en)
KR (1) KR20130029055A (en)
CN (1) CN102893313A (en)
DE (1) DE102010009738A1 (en)
TW (1) TWI470588B (en)
WO (1) WO2011107420A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102723019A (en) * 2012-05-23 2012-10-10 苏州奇可思信息科技有限公司 Sign language teaching system
US10176366B1 (en) 2017-11-01 2019-01-08 Sorenson Ip Holdings Llc Video relay service, communication system, and related methods for performing artificial intelligence sign language translation services in a video relay service environment

Families Citing this family (38)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9282377B2 (en) 2007-05-31 2016-03-08 iCommunicator LLC Apparatuses, methods and systems to provide translations of information into sign language or other formats
EP2760002A3 (en) * 2013-01-29 2014-08-27 Social IT Pty Ltd Methods and systems for converting text to video
WO2015061248A1 (en) * 2013-10-21 2015-04-30 iCommunicator LLC Apparatuses, methods and systems to provide translations of information into sign language or other formats
US10248856B2 (en) 2014-01-14 2019-04-02 Toyota Motor Engineering & Manufacturing North America, Inc. Smart necklace with stereo vision and onboard processing
US10024679B2 (en) 2014-01-14 2018-07-17 Toyota Motor Engineering & Manufacturing North America, Inc. Smart necklace with stereo vision and onboard processing
US9915545B2 (en) 2014-01-14 2018-03-13 Toyota Motor Engineering & Manufacturing North America, Inc. Smart necklace with stereo vision and onboard processing
US10360907B2 (en) 2014-01-14 2019-07-23 Toyota Motor Engineering & Manufacturing North America, Inc. Smart necklace with stereo vision and onboard processing
WO2015116014A1 (en) * 2014-02-03 2015-08-06 IPEKKAN, Ahmet Ziyaeddin A method of managing the presentation of sign language by an animated character
US11875700B2 (en) 2014-05-20 2024-01-16 Jessica Robinson Systems and methods for providing communication services
US10460407B2 (en) * 2014-05-20 2019-10-29 Jessica Robinson Systems and methods for providing communication services
US10146318B2 (en) 2014-06-13 2018-12-04 Thomas Malzbender Techniques for using gesture recognition to effectuate character selection
US10024667B2 (en) 2014-08-01 2018-07-17 Toyota Motor Engineering & Manufacturing North America, Inc. Wearable earpiece for providing social and environmental awareness
US10024678B2 (en) 2014-09-17 2018-07-17 Toyota Motor Engineering & Manufacturing North America, Inc. Wearable clip for providing social and environmental awareness
US9922236B2 (en) 2014-09-17 2018-03-20 Toyota Motor Engineering & Manufacturing North America, Inc. Wearable eyeglasses for providing social and environmental awareness
US10490102B2 (en) 2015-02-10 2019-11-26 Toyota Motor Engineering & Manufacturing North America, Inc. System and method for braille assistance
US9586318B2 (en) 2015-02-27 2017-03-07 Toyota Motor Engineering & Manufacturing North America, Inc. Modular robot with smart device
US9972216B2 (en) 2015-03-20 2018-05-15 Toyota Motor Engineering & Manufacturing North America, Inc. System and method for storing and playback of information for blind users
US10395555B2 (en) * 2015-03-30 2019-08-27 Toyota Motor Engineering & Manufacturing North America, Inc. System and method for providing optimal braille output based on spoken and sign language
US9898039B2 (en) 2015-08-03 2018-02-20 Toyota Motor Engineering & Manufacturing North America, Inc. Modular smart necklace
CZ306519B6 (en) * 2015-09-15 2017-02-22 Západočeská Univerzita V Plzni A method of providing translation of television broadcasts in sign language, and a device for performing this method
DE102015016494B4 (en) 2015-12-18 2018-05-24 Audi Ag Motor vehicle with output device and method for issuing instructions
KR102450803B1 (en) 2016-02-11 2022-10-05 한국전자통신연구원 Duplex sign language translation apparatus and the apparatus for performing the duplex sign language translation method
US10024680B2 (en) 2016-03-11 2018-07-17 Toyota Motor Engineering & Manufacturing North America, Inc. Step based guidance system
US9958275B2 (en) 2016-05-31 2018-05-01 Toyota Motor Engineering & Manufacturing North America, Inc. System and method for wearable smart device communications
US10561519B2 (en) 2016-07-20 2020-02-18 Toyota Motor Engineering & Manufacturing North America, Inc. Wearable computing device having a curved back to reduce pressure on vertebrae
US10432851B2 (en) 2016-10-28 2019-10-01 Toyota Motor Engineering & Manufacturing North America, Inc. Wearable computing device for detecting photography
USD827143S1 (en) 2016-11-07 2018-08-28 Toyota Motor Engineering & Manufacturing North America, Inc. Blind aid device
US10012505B2 (en) 2016-11-11 2018-07-03 Toyota Motor Engineering & Manufacturing North America, Inc. Wearable system for providing walking directions
US10521669B2 (en) 2016-11-14 2019-12-31 Toyota Motor Engineering & Manufacturing North America, Inc. System and method for providing guidance or feedback to a user
US10008128B1 (en) 2016-12-02 2018-06-26 Imam Abdulrahman Bin Faisal University Systems and methodologies for assisting communications
US10855888B2 (en) * 2018-12-28 2020-12-01 Signglasses, Llc Sound syncing sign-language interpretation system
CN111385612A (en) * 2018-12-28 2020-07-07 深圳Tcl数字技术有限公司 Television playing method based on hearing-impaired people, smart television and storage medium
WO2021014189A1 (en) * 2019-07-20 2021-01-28 Dalili Oujan Two-way translator for deaf people
US11610356B2 (en) 2020-07-28 2023-03-21 Samsung Electronics Co., Ltd. Method and electronic device for providing sign language
CN114639158A (en) * 2020-11-30 2022-06-17 伊姆西Ip控股有限责任公司 Computer interaction method, apparatus and program product
US20220327309A1 (en) * 2021-04-09 2022-10-13 Sorenson Ip Holdings, Llc METHODS, SYSTEMS, and MACHINE-READABLE MEDIA FOR TRANSLATING SIGN LANGUAGE CONTENT INTO WORD CONTENT and VICE VERSA
IL283626A (en) * 2021-06-01 2022-12-01 Yaakov Livne Nimrod A sign language translation method and system thereof
WO2023195603A1 (en) * 2022-04-04 2023-10-12 Samsung Electronics Co., Ltd. System and method for bidirectional automatic sign language translation and production

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040066914A1 (en) * 2002-10-03 2004-04-08 David Crosson Systems and methods for providing a user-friendly computing environment for the hearing impaired
US20060134585A1 (en) * 2004-09-01 2006-06-22 Nicoletta Adamo-Villani Interactive animation system for sign language
US20060174315A1 (en) * 2005-01-31 2006-08-03 Samsung Electronics Co., Ltd System and method for providing sign language video data in a broadcasting-communication convergence system
US20090012788A1 (en) * 2007-07-03 2009-01-08 Jason Andre Gilbert Sign language translation system

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5982853A (en) * 1995-03-01 1999-11-09 Liebermann; Raanan Telephone for the deaf and method of using same
DE69526871T2 (en) * 1995-08-30 2002-12-12 Hitachi Ltd SIGNALING TELEPHONE SYSTEM FOR COMMUNICATION BETWEEN HEARING AND NON-HEARING
DE19723678A1 (en) * 1997-06-05 1998-12-10 Siemens Ag Data communication method with reduced content based on sign language
JP2000149042A (en) * 1998-11-18 2000-05-30 Fujitsu Ltd Method, device for converting word into sign language video and recording medium in which its program is recorded
JP2001186430A (en) * 1999-12-22 2001-07-06 Mitsubishi Electric Corp Digital broadcast receiver
US7774194B2 (en) * 2002-08-14 2010-08-10 Raanan Liebermann Method and apparatus for seamless transition of voice and/or text into sign language
TW200405988A (en) * 2002-09-17 2004-04-16 Ginganet Corp System and method for sign language translation
TWI250476B (en) * 2003-08-11 2006-03-01 Univ Nat Cheng Kung Method for generating and serially connecting sign language images
CA2592508C (en) * 2005-01-11 2017-05-02 Yakkov Merlin Method and apparatus for facilitating toggling between internet and tv broadcasts
CN200969635Y (en) * 2006-08-30 2007-10-31 康佳集团股份有限公司 Television set with cued speech commenting function
JP2008134686A (en) * 2006-11-27 2008-06-12 Matsushita Electric Works Ltd Drawing program, programmable display, and display system
US8345827B2 (en) * 2006-12-18 2013-01-01 Joshua Elan Liebermann Sign language public addressing and emergency system
TWI372371B (en) * 2008-08-27 2012-09-11 Inventec Appliances Corp Sign language recognition system and method

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102723019A (en) * 2012-05-23 2012-10-10 苏州奇可思信息科技有限公司 Sign language teaching system
US10176366B1 (en) 2017-11-01 2019-01-08 Sorenson Ip Holdings Llc Video relay service, communication system, and related methods for performing artificial intelligence sign language translation services in a video relay service environment
US10885318B2 (en) 2017-11-01 2021-01-05 Sorenson Ip Holdings Llc Performing artificial intelligence sign language translation services in a video relay service environment

Also Published As

Publication number Publication date
CN102893313A (en) 2013-01-23
TW201135684A (en) 2011-10-16
EP2543030A1 (en) 2013-01-09
JP2013521523A (en) 2013-06-10
DE102010009738A1 (en) 2011-09-01
KR20130029055A (en) 2013-03-21
US20130204605A1 (en) 2013-08-08
TWI470588B (en) 2015-01-21

Similar Documents

Publication Publication Date Title
US20130204605A1 (en) System for translating spoken language into sign language for the deaf
EP2356654B1 (en) Method and process for text-based assistive program descriptions for television
US20160066055A1 (en) Method and system for automatically adding subtitles to streaming media content
US20120105719A1 (en) Speech substitution of a real-time multimedia presentation
US20060285654A1 (en) System and method for performing automatic dubbing on an audio-visual stream
US20080195386A1 (en) Method and a Device For Performing an Automatic Dubbing on a Multimedia Signal
US20060272000A1 (en) Apparatus and method for providing additional information using extension subtitles file
US9767825B2 (en) Automatic rate control based on user identities
US9940947B2 (en) Automatic rate control for improved audio time scaling
ES2370218B1 (en) PROCEDURE AND DEVICE FOR SYNCHRONIZING SUBTITLES WITH AUDIO IN DIRECT SUBTITLE.
US20130151251A1 (en) Automatic dialog replacement by real-time analytic processing
JP2007324872A (en) Delay controller and delay control program for video signal with closed caption
KR101618777B1 (en) A server and method for extracting text after uploading a file to synchronize between video and audio
US11665392B2 (en) Methods and systems for selective playback and attenuation of audio based on user preference
JP2004336606A (en) Caption production system
KR100202223B1 (en) Words caption input apparatus
WO2009083832A1 (en) Device and method for converting multimedia content using a text-to-speech engine
JP2007053549A (en) Device and method for processing information signal
WO2008113064A1 (en) Methods and systems for converting video content and information to a sequenced media delivery format
JP2002007396A (en) Device for making audio into multiple languages and medium with program for making audio into multiple languages recorded thereon
Televisió de Catalunya et al. D6. 1–Pilot-D Progress report
Looms Access

Legal Events

Date Code Title Description
WWE WIPO information: entry into national phase. Ref document number: 201180011796.5; Country of ref document: CN
121 EP: the EPO has been informed by WIPO that EP was designated in this application. Ref document number: 11704994; Country of ref document: EP; Kind code of ref document: A1
WWE WIPO information: entry into national phase. Ref document number: 2012555378; Country of ref document: JP
NENP Non-entry into the national phase. Ref country code: DE
WWE WIPO information: entry into national phase. Ref document number: 8226/CHENP/2012; Country of ref document: IN
WWE WIPO information: entry into national phase. Ref document number: 13581993; Country of ref document: US
REEP Request for entry into the European phase. Ref document number: 2011704994; Country of ref document: EP
WWE WIPO information: entry into national phase. Ref document number: 2011704994; Country of ref document: EP
ENP Entry into the national phase. Ref document number: 20127025846; Country of ref document: KR; Kind code of ref document: A