EP1576803A2 - Verfahren und gerät zur wiedergabe mit auswählbarer geschwindigkeit ohne sprachverzerrung - Google Patents

Verfahren und gerät zur wiedergabe mit auswählbarer geschwindigkeit ohne sprachverzerrung

Info

Publication number
EP1576803A2
EP1576803A2 EP03813262A EP03813262A EP1576803A2 EP 1576803 A2 EP1576803 A2 EP 1576803A2 EP 03813262 A EP03813262 A EP 03813262A EP 03813262 A EP03813262 A EP 03813262A EP 1576803 A2 EP1576803 A2 EP 1576803A2
Authority
EP
European Patent Office
Prior art keywords
playback
rate
content
video
audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP03813262A
Other languages
English (en)
French (fr)
Inventor
Srinivas Gutta
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Koninklijke Philips NV
Original Assignee
Koninklijke Philips Electronics NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics NV filed Critical Koninklijke Philips Electronics NV
Publication of EP1576803A2 publication Critical patent/EP1576803A2/de
Withdrawn legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/76Television signal recording
    • H04N5/78Television signal recording using magnetic recording
    • H04N5/782Television signal recording using magnetic recording on tape
    • H04N5/783Adaptations for reproducing at a rate different from the recording rate
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/76Television signal recording
    • H04N5/91Television signal processing therefor
    • H04N5/92Transformation of the television signal for recording, e.g. modulation, frequency changing; Inverse transformation for playback
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/005Reproducing at a different information rate from the information rate of recording
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/34Indicating arrangements 
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/04Synchronising
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/76Television signal recording
    • H04N5/91Television signal processing therefor
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B2220/00Record carriers by type
    • G11B2220/20Disc-shaped record carriers
    • G11B2220/25Disc-shaped record carriers characterised in that the disc is based on a specific recording technology
    • G11B2220/2525Magneto-optical [MO] discs
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B2220/00Record carriers by type
    • G11B2220/20Disc-shaped record carriers
    • G11B2220/25Disc-shaped record carriers characterised in that the disc is based on a specific recording technology
    • G11B2220/2537Optical discs
    • G11B2220/2562DVDs [digital versatile discs]; Digital video discs; MMCDs; HDCDs
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B2220/00Record carriers by type
    • G11B2220/90Tape-like record carriers
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/102Programmed access in sequence to addressed parts of tracks of operating record carriers
    • G11B27/105Programmed access in sequence to addressed parts of tracks of operating record carriers of operating discs
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/102Programmed access in sequence to addressed parts of tracks of operating record carriers
    • G11B27/107Programmed access in sequence to addressed parts of tracks of operating record carriers of operating tapes
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/76Television signal recording
    • H04N5/84Television signal recording using optical recording
    • H04N5/85Television signal recording using optical recording on discs or drums

Definitions

  • the present invention relates generally to the field of television. More specifically, the present invention relates to an apparatus and method for selectable rate playback of television programs without distorting the audio portion of the programs.
  • Selectable rate playback of the video content from various storage mediums such as video cassette recorders (VCR) is known.
  • An audio portion of the playback content may be suppressed during selectable rate playback, to avoid distortion of the audio portion.
  • disortion of the audio portion of the playback content means a lack of fidelity in reception or reproduction due to a change in a rate of playback of the audio portion of the playback content compared to the rate of storing the audio portion of the playback content.
  • the present invention provides a method for playback of playback content at selectable rates, comprising: selecting a first portion of separately stored video and audio playback content, wherein the playback content has been stored at a storing rate, wherein the video and audio are synchronized as stored, and wherein the separately stored synchronized video and audio content are retrievable for synchronized playback; selecting a rate of playback of the playback content from the selectable rate, wherein the selected playback rate is different from the storing rate; tagging speech in the selected first portion of the playback content; recognizing an at least one phrase in the tagged speech; and playing said first portion of playback content at said rate of playback, wherein said playing synchronously retrieves the tagged speech, and wherein playing at said rate does not result in distortion of speech in the playback content even though said rate is different than the storing rate, and wherein the video and audio are synchronized at said rate of playback during said playing.
  • a second embodiment of the present invention discloses an apparatus for selectable rate playback of playback content, comprising: a separately stored video and audio playback content, wherein the playback content has been stored at a storing rate; a selected first portion of the separately stored video and audio playback content in a storage medium, wherein the selected first portion of video and audio content are synchronized and a speech portion of the audio content is tagged; a speech recognition device for tagging the speech portion of the audio content; a phrase recognition device for determining valid words for phrases from the tagged speech, wherein the valid words are joined into said phrases; a playback device for playback of the selected first portion of the playback content at a rate selected from the selectable rate, wherein the selected rate is different from the storing rate, wherein playback at the selected rate synchronously retrieves the Tagged Speech portion of the audio content, wherein playback at the selected rate does not result in distortion of speech in the playback content even though the selected rate is different from the storing rate, and wherein the video and audio content are
  • the present invention advantageously provides undistorted presentation of the audio portion of the playback content during selectable rate playback.
  • FIG. 1 depicts a functionality and logic of an apparatus for starting selectable rate playback of playback content or normal viewing, in accordance with embodiments of the present invention
  • FIG. 2 depicts a functionality and logic of an apparatus for selectable rate playback of playback content, in accordance with embodiments of the present invention
  • FIG. 3 depicts a playback list, for selecting a first portion of Separately Stored Synchronized Video and Audio Playback Content
  • FIG. 4 depicts a graphical user interface (GUI), for selecting a first portion of GUI
  • FIG. 5 depicts a method for selectable rate playback of playback content, in accordance with embodiments of the present invention.
  • the present invention relates generally to the field of television. More specifically, the present invention relates to an apparatus and method for selectable rate playback of a selected video and audio playback content, without distortion of the speech due to the selectable rate playback of the playback content.
  • FIG. 1 is a flowchart illustrating a functionality and a logic description of an apparatus 10 for Selectable Rate Playback of Playback Content, in accordance with embodiments of the present invention and in accordance with a method for selectable rate playback of playback content, as depicted by a flow chart 70 in FIG. 5 and described herein.
  • FIG. 1 illustrates that a user may cause a "start" of selectable rate playback in step 65 or a continuing of normal viewing 61, such as viewing independent of the apparatus 10.
  • "Starting" Selectable Rate Playback 65 of Playback Content depends on three inputs: a "Stop” Selectable Rate Playback 64 input; and a "Pause” Selectable Rate Playback 61 input; and a "Selected Rate” 49 input.
  • a user may choose to provide inputs 64, 67, and 49 from a programmable logic controller (PLC), or alternatively from a central processing unit (CPU), equipped with appropriate software.
  • PLC programmable logic controller
  • CPU central processing unit
  • a user may start selectable rate playback in step 65 by providing a "Selected Rate” 49 input, if decision step 55 determines that playback has not been paused and decision step 50 determines that playback has not been stopped.
  • the "Selected Rate” 49 input may be a slower rate or a faster rate of playback than was used to store the playback content.
  • the "Selectaed Rate” 49 was a range from about 50% to about 150% of the rate used to store the playback content or for any other reason.
  • a user may select any appropriate "Selected Rate” 49 that results in a playback of the Selected Separately Stored Synchronized Video and Audio Playback Content 1 that is more clear or understandable to a viewer or listener of the playback content.
  • selectable speed or “selectable rate” means increasing or decreasing a speed or rate of playback of the Selected Separately Stored Synchronized Video and Audio Playback Content 1, compared to the speed or rate of storing the Selected Separately Stored Synchronized Video and Audio Playback Content 1 without causing distortion of speech in the playback content, as depicted in FIG. 2 and described infra.
  • Playback may be paused by providing a "pause" input 67 to the decision step 55.
  • Playback may be stopped by providing a "stop" input 64 to the decision step 50.
  • an audio and video device such as, for example, a television
  • playback is paused by providing the "pause” input 67 for greater than "x" minutes or when playback is stopped by providing a "stop” input 64
  • Normal Viewing 61 on the audio and video device may result.
  • Normal Viewing 61 means, for example television operation or operation of any appropriate audio and video viewing device independent of the selectable rate playback apparatus or method of the present invention.
  • the "pause” input 67 is provided to decision step 53, resulting in normal viewing 61.
  • the "pause” input 67 loops back to decision step 55, and then again to decision step 53 until the "pause” input 67 is removed.
  • the apparatus 10 goes to the "start" Selectable Rate Playback step 65.
  • "x" is less than two (2) minutes.
  • "x" may be a time interval less than five (5) minutes.
  • the value of "x” may be any positive real number that represents a number of minutes a user desires to wait for automatic return to the normal viewing 61 step after the "Pause" input 67 has been provided to the apparatus 10.
  • FIG. 2 depicts an extension of the apparatus 10 of FIG. 1, after adding: a Selecting and Tagging Portion 9; a Phrase and Tokens Recognizing portion 2; and a Selectable Rate Playback portion 4, in accordance with embodiments of the present invention including in accordance with a method for selectable rate playback of playback content, as depicted by a flow chart 70 in FIG. 5 and described infra.
  • the Selecting and Tagging Portion 9 includes: a Selecting Engine 13, wherein the
  • the Selecting Engine 13 may receive inputs from Separately Stored Synchronized Video and Audio Content 1, a Playback List 109, and a Graphical User Interface 16. During retrieval, the Selecting Engine 13 passes the audio content synchronized with the visual content to a speech recognition and tagging system 12 so that the parts of the content 1 that are speech and the parts that are noise are tagged and provided to Tagged Speech 7 storage, and Noise 23 storage.
  • the speech recognition and tagging system 12 also inputs individual words or tokens into Tagged Speech 7.
  • a "token” is any successive group of non-delimiter characters appearing in a string preceded by a delimiter (or appearing at the beginning of the string), wherein a delimiter may be a space, for example, between words or a form of punctuation such as a comma.
  • "synchronization" of speech or written words or phrases with visual content means words are uttered or written with corresponding visual content when said visual content is displayed. Audio content synchronized with the visual content is available because the Synchronized Video and Audio Content 1 is stored separately, and the Separately Stored Synchronized Video and Audio Content 1 is retrievable for synchronized playback.
  • the Phrase and Tokens Recognizing portion 2 of the apparatus 10 includes: a decision step 29 for determining Valid Words for Phrases, wherein the decision is based on a Test Acceptable Words For Validity 21 input and a Phrase Database 42 input.
  • words or “speech” mean written or spoken English language, or any other language.
  • the decision 29 provides an output Join Words Into Phrases step 31.
  • the Test Acceptable Words For Validity 21 may receive an Input Pronunciation Rules 39.
  • the Test Acceptable Words For Validity 21 may use pronunciation rules to cause the valid words to be pronounced correctly on playback.
  • pronouncing correctly means correcting speech for pronunciation error due to accents or mispronunciations.
  • Consecutive successive valid words and a Phrase Database 42 are input into the decision step 29, resulting in a determination whether the successive valid words are valid words for phrases. If yes, the consecutive successive valid words for phrases are input to the Join Words Into Phrases 31 step. If no, the consecutive successive valid words for phrases are input into Buffer of stored Playback Content 37 as words not valid for phrases.
  • the decision step 29 may apply a process that may include comparison of consecutive successive valid words with a database of phrases 42.
  • Valid words that are present in the Phrase Database 42 as phrases may be joined in the Join Words Into Phrases 31 step. Dictionaries, or Lexicons and the like are examples of the Phrase Database 42. Some examples of phrases include phrases such as "good morning,” whose component words often go together.
  • the words of the phrases need to be uttered together, then the words of the phrases are uttered together when the corresponding visual content of the Separately Stored Synchronized Video and Audio Content 1 is played back.
  • the user could also be given the option to input additional words or rules into the Test Acceptable Words For Validity 21 step so th-tt other words not part of an established language could also be j oined together in phrases in the step 31.
  • the Selectable Rate Playback portion 4 comprises: a Buffer of Stored Playback Content 37; a Selectable Rate Playback Engine 67 and a Selectable Rate Playback Viewing73.
  • Phrases may be passed into Buffer of Stored Playback Content 37 from the Join Words Into Phrases step 31.
  • valid words may be provided to the Buffer of Stored playback Content 37 if they are determined by decision step 29 to not be valid words for phrases.
  • Noise 23 may be passed to the Buffer of Stored Playback Content 37.
  • the Selectable Rate Playback Engine 67 provides the Buffer of Stored Playback Content 37 to the Selectable Rate Playback Engine 67.
  • the Selectable Rate Playback Engine 67 provides input to the Selectable Rate Playback Viewing step 73 for Selectable Rate Playback Viewing 73 of the Selected
  • the Selectable Rate Playback Viewing 73 relates to the user not having understood what was uttered or a scene content in a video program was not being clear.
  • the Test Acceptable Word Validity 21 may use a pronunciator device that inputs pronunciation rules 39 and then utters the words or phrases correctly. Thus, words incorrectly spoken by an actor may be correctly pronounced by the pronunciator. The user could be given the option whether the valid words should employ a pronunciator for utterance or if the utterance should be as they are spoken by actors in, for example, the video program.
  • FIG. 3 depicts an example of a List 110 of playback content from the Playback List
  • the Playback List includes a playback "y" minutes list item 120, wherein y represents a time from when the Separately Stored Synchronized Video and Audio Content 1 (see FIG. 2) was stored.
  • the time from when the Separately Stored Synchronized Video and Audio Content 1 was stored depends on a storage capacity of the Buffer of Stored playback Content 37, as depicted in FIG. 2, and described herein.
  • the storage capacity of the Buffer of Stored playback Content 37 may be any appropriate capacity needed to accommodate the Separately Stored Synchronized Video and Audio Content 1. In one embodiment the storage capacity of the Buffer of Stored Playback Content 37 is less than 2 minutes. Alternatively, the storage capacity of the Buffer of Stored Playback Content 37 may be less than 5 minutes. Alternatively, the storage capacity of the Buffer of Stored Playback Content 37 may be the capacity required to store the Separately Stored Synchronized Video and Audio Content 1 of the movie or video program, wherein the video program may be a television program.
  • the Playback List 109 includes a Keywords or Phrases List Item 130 that may be created by a user based on keywords or phrases that the user remembers from listening or viewing the program or movie, that is included in the Separately Stored Synchronized Video and Audio Content 1.
  • the Playback List 109 includes a Key Frames List Item 140, wherein each entry of the Key Frames List Item 140 may be selected by subtracting an intensity "z" of each of two consecutive successive frames and if the difference " ⁇ z" in the intensity "z" between the consecutive successive frames is greater than a threshold "t" then the frame having the higher intensity is selected as the Key Frame.
  • a user can select list items 120, 130 or 140 manually or via a remote selection device. Selection of the list items 120, 130 or 140 provides an input to the Selecting Engine 13.
  • FIG. 4 depicts a List of playback content from a Graphical User Interface (GUI) 16, wherein the List includes a playback "y" minutes list item 160, a Keywords or Phrases List Item 170 and a Key Frames List Item 180 created in like manner as the corresponding list items 120, 130, and 140 depicted in FIG. 3 and described supra.
  • the List of Playback Content from the GUI 16 includes a scroll bar 190 that can be used to scroll to 160, 170 or 180.
  • a user can select list items 160, 170 or 180 manually or via a remote selection device. Selection of the list items 160, 170 or 180 provides an input to the Selecting Engine 13 from the GUI 16 (see FIG. 2).
  • the graphical user interface 16 may be provided with a list of key video frames using key frame extraction.
  • key frame extraction means the key frames having a higher intensity than a threshold intensity are selected into the List of Playback Content from the GUI 16.
  • FIG. 5 depicts a method 70 for Selectable Rate Playback of Playback Content, comprising steps 75, 85, 90, 95 and 97.
  • a television program or alternatively, a movie may be stored on a personal video cassette recorder, a DVD or on any appropriate storage medium such as an optical medium, or a magneto optical medium.
  • the program or movie must be Separately Stored Synchronized Video and Audio Content 1 (see FIG. 2), wherein the video and audio are synchronized as stored, and wherein the Separately Stored Synchronized Video and Audio Content 1 are retrievable for synchronized playback.
  • a user may encounter a portion of the program that may not be satisfactorily understandable such as because either the video portion is unclear or the audio portion is not understandable.
  • the user first stops the playback.
  • a user selects a first portion 44 of the Separately Stored Synchronized Video and Audio Playback Content 1 for "Selected Rate" 49 of playback, wherein the selected first portion 44 corresponds to an list item 120, 130, or 140 from the Playback List 109 of FIG. 3, or a list item 160, 170, or 180 from the GUI 16 of FIG. 4.
  • the playback content 1 has been stored at a storing rate, wherein the storing rate may be any recording rate for a commercial personal video cassette recorder, a DVD or for any appropriate storage medium such as an optical medium, or a magneto optical medium, and wherein the storing rate is different from the "Selected Rate” 49.
  • the "Selected Rate” 49 may be slower or faster than the storing rate for the the playback content 1 without causing distortion of the speech portion of the audio content of the playback content 1.
  • step 85 speech included in the selected first portion 44 of the Separately Stored Synchronized Video and Audio Playback Content 1 (see FIG. 2) corresponding to the selected list item from the Playback Content from Playback List 109 or the Graphical User Interface 16 is tagged by the Speech Recognition and Tagging System 12.
  • acceptable words 7 are recognized by the speech recognition and tagging system 12 (see FIG. 2).
  • step 95 at least one phrase in the Tagged Speech 7 is recognized by the
  • the selected first portion 44 of the Separately Stored Synchronized Video and Audio Content 1 may be retrieved for synchronized playback by the Selecting and Tagging Engine 65 (see FIG. 1), since the video and audio content are synchronized and stored separately, wherein Tagged Speech 7 and corresponding video is presented serially, such that selecting the first portion 44 of the Separately Stored
  • Synchronized Video and Audio Content 1 (see FIG. 2) for playing selects a corresponding Tagged Speech 7 for playing.
  • Speech may be tagged by the Speech Recognition and Tagging System 12, as depicted in FIG. 2, and described in associated text supra.
  • An at least one phrase in the Tagged Speech 7 may be recognized using, for example, the Speech Recognition System and Tagging System 12, as depicted in FIG. 2, and described in associated text supra.
  • the Speech Recognition and Tagging System 12 may use stemming to remove morphological and inflexional endings from words in English from the playback content 1.
  • stemming may be accomplished by the Porter stemming apparatus (or 'Porter stemmer') that is a process for removing the commoner morphological and inflexional endings from words in English. Its main use is as part of a term normalization process that is usually done when setting up Information Retrieval systems.
  • morphological endings for words in English are verb tenses, such as past, present or future, and "inflexional” endings for words in English are endings of nouns or verbs such as “s”, “es”, or “ing”, or endings such as “er”, “ier”, “iest” for comparative and superlative forms of adjectives.
  • the selected first portion 44 of the Separately Stored Synchronized Video and Audio Playback Content 1 (see FIG. 2) corresponding to the selected list item from the
  • Playback Content from Playback List 109 or the Graphical User Interface 16 may be played at selectable rate, wherein said playing synchronously retrieves Tagged Speech 7 such as acceptable words. Playing the selected first portion 44 of the Separately Stored Synchronized Video and Audio Playback Content 1 corresponding to the selected list item from the Playback Content from Playback List 109 or the Graphical User Interface 16 at the selectable rate does not result in distortion of speech in the Playback Content 1 (see FIG. 2).
  • the video and audio are to be synchronized at the selectable rate, in accordance with embodiments of the present invention and in accordance with a method, as depicted by the flow chart 70 in FIG. 5 and described supra.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Television Signal Processing For Recording (AREA)
  • Television Receiver Circuits (AREA)
EP03813262A 2002-12-16 2003-12-12 Verfahren und gerät zur wiedergabe mit auswählbarer geschwindigkeit ohne sprachverzerrung Withdrawn EP1576803A2 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US43372202P 2002-12-16 2002-12-16
US433722P 2002-12-16
PCT/IB2003/005912 WO2004056086A2 (en) 2002-12-16 2003-12-12 Method and apparatus for selectable rate playback without speech distortion

Publications (1)

Publication Number Publication Date
EP1576803A2 true EP1576803A2 (de) 2005-09-21

Family

ID=32595227

Family Applications (1)

Application Number Title Priority Date Filing Date
EP03813262A Withdrawn EP1576803A2 (de) 2002-12-16 2003-12-12 Verfahren und gerät zur wiedergabe mit auswählbarer geschwindigkeit ohne sprachverzerrung

Country Status (6)

Country Link
EP (1) EP1576803A2 (de)
JP (1) JP2006510304A (de)
KR (1) KR20050090398A (de)
CN (1) CN1726707A (de)
AU (1) AU2003303005A1 (de)
WO (1) WO2004056086A2 (de)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7660715B1 (en) 2004-01-12 2010-02-09 Avaya Inc. Transparent monitoring and intervention to improve automatic adaptation of speech models
US7653543B1 (en) 2006-03-24 2010-01-26 Avaya Inc. Automatic signal adjustment based on intelligibility
US7925508B1 (en) 2006-08-22 2011-04-12 Avaya Inc. Detection of extreme hypoglycemia or hyperglycemia based on automatic analysis of speech patterns
US7962342B1 (en) 2006-08-22 2011-06-14 Avaya Inc. Dynamic user interface for the temporarily impaired based on automatic analysis for speech patterns
US8041344B1 (en) 2007-06-26 2011-10-18 Avaya Inc. Cooling off period prior to sending dependent on user's state

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE4404932C2 (de) * 1993-02-16 1995-06-22 Gold Star Co Kurzdarstellungs-Playbackvorrichtung und -verfahren für einen Video-Cassettenrecorder
US5583652A (en) * 1994-04-28 1996-12-10 International Business Machines Corporation Synchronized, variable-speed playback of digitally recorded audio and video
US6625387B1 (en) * 2002-03-01 2003-09-23 Thomson Licensing S.A. Gated silence removal during video trick modes

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO2004056086A2 *

Also Published As

Publication number Publication date
WO2004056086A3 (en) 2004-11-11
KR20050090398A (ko) 2005-09-13
WO2004056086A2 (en) 2004-07-01
AU2003303005A1 (en) 2004-07-09
CN1726707A (zh) 2006-01-25
AU2003303005A8 (en) 2004-07-09
JP2006510304A (ja) 2006-03-23

Similar Documents

Publication Publication Date Title
US10002612B2 (en) Systems, computer-implemented methods, and tangible computer-readable storage media for transcription alignment
US6505153B1 (en) Efficient method for producing off-line closed captions
US5649060A (en) Automatic indexing and aligning of audio and text using speech recognition
US6415257B1 (en) System for identifying and adapting a TV-user profile by means of speech technology
US6172675B1 (en) Indirect manipulation of data using temporally related data, with particular application to manipulation of audio or audiovisual data
US8311832B2 (en) Hybrid-captioning system
EP1295482B1 (de) Erzeugung von untertiteln für bewegte bilder
US20060136226A1 (en) System and method for creating artificial TV news programs
JP4127668B2 (ja) 情報処理装置、情報処理方法、およびプログラム
US20080195386A1 (en) Method and a Device For Performing an Automatic Dubbing on a Multimedia Signal
JP2007519987A (ja) 内部及び外部オーディオビジュアルデータの統合解析システム及び方法
JP2001103402A (ja) 記録されたテレビジョン放送についての情報を記憶するための機構
JP4192703B2 (ja) コンテンツ処理装置、コンテンツ処理方法及びプログラム
JP2004343488A (ja) 字幕挿入方法、字幕挿入システム、および字幕挿入プログラム
EP1576803A2 (de) Verfahren und gerät zur wiedergabe mit auswählbarer geschwindigkeit ohne sprachverzerrung
CN100538696C (zh) 用于本征与非本征视听数据的综合分析的系统和方法
JP2007519321A (ja) 視聴覚データストリームのマルチメディア要約を作成する方法及び回路
JP3838775B2 (ja) マルチメディア処理装置、記録媒体
JP2005341138A (ja) 映像要約方法及びプログラム及びそのプログラムを格納した記憶媒体
JP3903738B2 (ja) 情報記録・検索装置、方法、プログラム、および記録媒体
Parsodkar et al. Movie Captioning For Differently Abled People
Robert-Ribes On the use of automatic speech recognition for TV captioning.
JP2002324071A (ja) コンテンツ検索システム、コンテンツ検索方法
Dubinsky SyncWords: A platform for semi-automated closed captioning and subtitles
Friedland et al. Narrative theme navigation for sitcoms supported by fan-generated scripts

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20050718

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LI LU MC NL PT RO SE SI SK TR

AX Request for extension of the european patent

Extension state: AL LT LV MK

RIC1 Information provided on ipc code assigned before grant

Ipc: H04N 5/783 20060101AFI20060117BHEP

DAX Request for extension of the european patent (deleted)
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN WITHDRAWN

18W Application withdrawn

Effective date: 20071121