WO2004109549A3 - System and method for performing media content augmentation on an audio signal

Publication number
WO2004109549A3
Authority
WO
WIPO (PCT)
Prior art keywords
audio signal
media content
performing media
speech
content augmentation
Application number
PCT/IB2004/050822
Other languages
French (fr)
Other versions
WO2004109549A2 (en)
Inventor
Martin Franciscus Mckinney
Jan Alexis Daniel Nesvadba
Dirk Jeroen Breebaart
Original Assignee
Koninkl Philips Electronics Nv
Martin Franciscus Mckinney
Jan Alexis Daniel Nesvadba
Dirk Jeroen Breebaart
Priority date
2003-06-05
Filing date
2004-06-02
Publication date
2005-02-17
Application filed by Koninkl Philips Electronics Nv, Martin Franciscus Mckinney, Jan Alexis Daniel Nesvadba, Dirk Jeroen Breebaart
Publication of WO2004109549A2
Publication of WO2004109549A3

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60 Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/685 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using automatically derived transcript of audio data, e.g. lyrics
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/3331 Query processing
    • G06F16/3332 Query translation
    • G06F16/3334 Selection or weighting of terms from queries, including natural language queries
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/1815 Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning

Landscapes

  • Engineering & Computer Science
  • Theoretical Computer Science
  • Physics & Mathematics
  • Multimedia
  • Library & Information Science
  • Data Mining & Analysis
  • Databases & Information Systems
  • General Engineering & Computer Science
  • General Physics & Mathematics
  • Artificial Intelligence
  • Audiology, Speech & Language Pathology
  • General Health & Medical Sciences
  • Health & Medical Sciences
  • Computational Linguistics
  • Information Retrieval, Db Structures And Fs Structures Therefor

Abstract

The invention describes a system (1) for performing media content augmentation on an audio signal (2). The system comprises a speech identifier (3) for identifying speech content in the audio signal (2); a speech-to-text converter (5) for converting the speech content into a digital text format (6); a key phrase identifier (7) for identifying key phrases (19) in the digital text (6); a search engine (8) for searching a source of information (9) for material relating to the key phrases (19); and a search result compiler (10) for providing a user with the results of the search (11). The invention further describes a corresponding method for performing media content augmentation on an audio signal (2).
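
The data flow through the five components named in the abstract can be sketched as a toy pipeline. This is only an illustration under loud assumptions, not the patented implementation: the `Segment` type, the "speech"/"music" labels, the pre-supplied transcripts standing in for real speech recognition, the stopword list, and the frequency-based key phrase heuristic are all hypothetical. The reference numerals in the comments map each function back to the abstract.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class Segment:
    """A slice of the audio signal (2). Both fields are hypothetical stand-ins."""
    kind: str        # assumed label: "speech" or "music"
    transcript: str  # pre-supplied text standing in for real recognition


# Assumed stopword list; a real system would use a fuller one.
STOPWORDS = {"the", "a", "an", "of", "in", "and", "to", "is", "was", "on", "at"}


def identify_speech(signal):
    """Speech identifier (3): keep only segments labelled as speech."""
    return [seg for seg in signal if seg.kind == "speech"]


def speech_to_text(speech_segments):
    """Speech-to-text converter (5): here it only joins the stand-in transcripts."""
    return " ".join(seg.transcript for seg in speech_segments)


def identify_key_phrases(text, top_n=2):
    """Key phrase identifier (7): most frequent non-stopword tokens (assumed heuristic)."""
    words = [w.strip(".,").lower() for w in text.split()]
    counts = Counter(w for w in words if w and w not in STOPWORDS)
    return [word for word, _ in counts.most_common(top_n)]


def search(source, key_phrases):
    """Search engine (8): find documents in the source (9) that mention a key phrase."""
    return {kp: [doc for doc in source if kp in doc.lower()] for kp in key_phrases}


def compile_results(hits):
    """Search result compiler (10): flatten the hits into user-facing lines (11)."""
    return [f"{kp}: {doc}" for kp, docs in hits.items() for doc in docs]


def augment(signal, source):
    """Run the five stages in the order given in the abstract."""
    text = speech_to_text(identify_speech(signal))
    return compile_results(search(source, identify_key_phrases(text)))
```

Calling `augment(signal, source)` with a list of `Segment` objects and a list of document strings returns one line per matching document. Each stage is a plain function, so any of the stand-ins can be swapped for a real component without changing the others.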

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP03101655 2003-06-05
EP03101655.3 2003-06-05

Publications (2)

Publication Number Publication Date
WO2004109549A2 (en) 2004-12-16
WO2004109549A3 (en) 2005-02-17

Family

ID=33495629

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2004/050822 WO2004109549A2 (en) 2003-06-05 2004-06-02 System and method for performing media content augmentation on an audio signal

Country Status (1)

Country Link
WO (1) WO2004109549A2 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2459308A (en) * 2008-04-18 2009-10-21 Univ Montfort Creating a metadata enriched digital media file
WO2015094158A1 (en) 2013-12-16 2015-06-25 Hewlett-Packard Development Company, L.P. Determining preferred communication explanations using record-relevancy tiers

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1094406A2 (en) * 1999-08-26 2001-04-25 Matsushita Electric Industrial Co., Ltd. System and method for accessing TV-related information over the internet
US20020194004A1 (en) * 2001-06-14 2002-12-19 Glinski Stephen C. Methods and systems for enabling speech-based internet searches

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
CODEN A.R. ET AL.: "Speech transcript analysis for automatic search", PROC. 34TH. ANNUAL HAWAII INTERNAT. CONF. ON SYSTEM SCIENCES, 3 January 2001 (2001-01-03), LOS ALAMITOS, CA, USA, pages 1 - 9, XP002310679 *

Also Published As

Publication number Publication date
WO2004109549A2 (en) 2004-12-16

Similar Documents

Publication Publication Date Title
Furui et al. Speech-to-text and speech-to-speech summarization of spontaneous speech
AU2002211438A1 (en) Language independent voice-based search system
WO2004090866A3 (en) Phonetically based speech recognition system and method
EP2453436A3 (en) Automatic language model update
AU2003288521A1 (en) Automatic digital music library builder
GB2397406A (en) Index structure of metadata, method for providing indices of metadata, and metadata searching method and apparatus using the indices of metadata
EP2428950A3 (en) Presenting supplemental content for digital media using a multimodal application
WO2004097791A3 (en) Methods and systems for creating a second generation session file
EP1168298A3 (en) Method of assembling messages for speech synthesis
AU2003299312A1 (en) Text-to-speech method and system, computer program product therefor
GB2397405A (en) Index structure of metadata, method for providing indices of metadata, and metadata searching method and apparatus using the indices of metadata
WO2000054168A3 (en) Database annotation and retrieval
WO2005070019A3 (en) Contextual searching
Watanabe et al. Transformation of spectral envelope for voice conversion based on radial basis function networks
WO2006086053A3 (en) System and method for automatic enrichment of documents
WO2008142836A1 (en) Voice tone converting device and voice tone converting method
DE60211197D1 (en) METHOD AND DEVICE FOR THE CONVERSION OF SPANISHED TEXTS AND CORRECTION OF THE KNOWN TEXTS
WO2002050662A3 (en) Apparatus and method of video program classification based on syntax of transcript information
US20080065368A1 (en) Spoken Translation System Using Meta Information Strings
WO2004095419A3 (en) System and method for text-to-speech processing in a portable device
MXPA02005387A (en) Method and device for speech recognition with disjoint language models.
DE602004006641D1 (en) AUDIO DIALOG SYSTEM AND LANGUAGE-CONTROLLED BROWSING PROCEDURE
CN101123089A (en) Voice mixing method for Chinese voice code
Van Bael et al. Automatic phonetic transcription of large speech corpora
WO2004109549A3 (en) System and method for performing media content augmentation on an audio signal

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 EP: The EPO has been informed by WIPO that EP was designated in this application
122 EP: PCT application non-entry in European phase