WO2004006222A3 - Method and apparatus for classifying sound signals - Google Patents

Method and apparatus for classifying sound signals

Info

Publication number
WO2004006222A3
Authority
WO
WIPO (PCT)
Prior art keywords
sound signal
frequency
temporal segments
sound
extracting
Prior art date
Application number
PCT/FR2003/002116
Other languages
English (en)
Other versions
WO2004006222A2 (fr)
Inventor
Hadi Harb
Liming Chen
Original Assignee
Lyon Ecole Centrale
Hadi Harb
Liming Chen
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lyon Ecole Centrale, Hadi Harb, Liming Chen
Priority to US10/518,539 (US20050228649A1, en)
Priority to JP2004518885A (JP2005532582A, ja)
Priority to EP03762744A (EP1535276A2, fr)
Priority to AU2003263270A (AU2003263270A1, en)
Priority to CA002491036A (CA2491036A1, fr)
Publication of WO2004006222A2
Publication of WO2004006222A3

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Auxiliary Devices For Music (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Measurement Of Velocity Or Position Using Acoustic Or Ultrasonic Waves (AREA)

Abstract

The invention relates to a method for assigning at least one sound class to a sound signal, characterized in that it comprises the following steps: dividing the sound signal into temporal segments of a determined duration; extracting the frequency parameters of the sound signal in each of the temporal segments by determining a series of values of the frequency spectrum in a frequency range between a minimum frequency and a maximum frequency; grouping the frequency parameters into time windows of a determined duration longer than the duration of the temporal segments; extracting characteristic components from each time window; and, on the basis of the extracted characteristic components and using a classifier, identifying the sound class of the time windows of the sound signal.
PCT/FR2003/002116 2002-07-08 2003-07-08 Method and apparatus for classifying sound signals WO2004006222A2 (fr)
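The steps listed in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the implementation claimed in the application: the abstract does not say which characteristic components are extracted or which classifier is used, so the per-bin mean and variance and the nearest-centroid classifier below are stand-ins. All function names, the segment and window lengths, and the frequency limits are hypothetical.

```python
import cmath
import math

def split_segments(signal, seg_len):
    """Divide the signal into consecutive, non-overlapping segments of seg_len samples."""
    return [signal[i:i + seg_len] for i in range(0, len(signal) - seg_len + 1, seg_len)]

def band_spectrum(segment, sample_rate, f_min, f_max):
    """Magnitude spectrum of one segment, kept only for bins in [f_min, f_max] (naive DFT)."""
    n = len(segment)
    mags = []
    for k in range(n // 2 + 1):
        freq = k * sample_rate / n
        if f_min <= freq <= f_max:
            coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(segment))
            mags.append(abs(coeff) / n)
    return mags

def window_features(spectra):
    """Characteristic components of one window: per-bin mean and variance (assumed choice)."""
    count, bins = len(spectra), len(spectra[0])
    means = [sum(s[b] for s in spectra) / count for b in range(bins)]
    variances = [sum((s[b] - means[b]) ** 2 for s in spectra) / count for b in range(bins)]
    return means + variances

def extract_features(signal, sample_rate, seg_len, segs_per_window, f_min, f_max):
    """Segment the signal, compute band spectra, group them into windows, extract features."""
    spectra = [band_spectrum(s, sample_rate, f_min, f_max) for s in split_segments(signal, seg_len)]
    return [window_features(spectra[i:i + segs_per_window])
            for i in range(0, len(spectra) - segs_per_window + 1, segs_per_window)]

def classify(features, centroids):
    """Nearest-centroid stand-in for the classifier (the abstract does not name one)."""
    def dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(centroids, key=lambda label: dist(features, centroids[label]))

def tone(freq, n, rate, amp=1.0):
    """Synthetic test signal: a pure sine tone of n samples."""
    return [amp * math.sin(2 * math.pi * freq * t / rate) for t in range(n)]

# Illustrative values: 8 kHz sampling, 10 ms segments, 100 ms windows, 100 Hz to 2 kHz band.
RATE, SEG, WIN = 8000, 80, 10
params = (RATE, SEG, WIN, 100, 2000)
centroids = {
    "low": extract_features(tone(500, 800, RATE), *params)[0],
    "high": extract_features(tone(2000, 800, RATE), *params)[0],
}
labels = [classify(f, centroids)
          for f in extract_features(tone(500, 1600, RATE, amp=0.8), *params)]
print(labels)
```

Each window is labelled independently, which matches the abstract's wording that the sound class is identified per time window rather than for the signal as a whole. In practice an FFT routine and a trained classifier would replace the naive DFT and the two hand-built centroids.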

Priority Applications (5)

Application Number Priority Date Filing Date Title
US10/518,539 US20050228649A1 (en) 2002-07-08 2003-07-08 Method and apparatus for classifying sound signals
JP2004518885A JP2005532582A (ja) 2002-07-08 2003-07-08 Method and apparatus for assigning a sound class to a sound signal
EP03762744A EP1535276A2 (fr) 2002-07-08 2003-07-08 Method and apparatus for classifying sound signals
AU2003263270A AU2003263270A1 (en) 2002-07-08 2003-07-08 Method and apparatus for classifying sound signals
CA002491036A CA2491036A1 (fr) 2002-07-08 2003-07-08 Method and apparatus for classifying sound signals

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
FR02/08548 2002-07-08
FR0208548A FR2842014B1 (fr) 2002-07-08 2002-07-08 Method and apparatus for assigning a sound class to a sound signal

Publications (2)

Publication Number Publication Date
WO2004006222A2 (fr) 2004-01-15
WO2004006222A3 (fr) 2004-04-08

Family

ID=29725263

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/FR2003/002116 WO2004006222A2 (fr) 2002-07-08 2003-07-08 Method and apparatus for classifying sound signals

Country Status (8)

Country Link
US (1) US20050228649A1 (fr)
EP (1) EP1535276A2 (fr)
JP (1) JP2005532582A (fr)
CN (1) CN1666252A (fr)
AU (1) AU2003263270A1 (fr)
CA (1) CA2491036A1 (fr)
FR (1) FR2842014B1 (fr)
WO (1) WO2004006222A2 (fr)

Families Citing this family (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4348970B2 (ja) * 2003-03-06 2009-10-21 ソニー株式会社 Information detection apparatus and method, and program
DE10313875B3 (de) * 2003-03-21 2004-10-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Device and method for analyzing an information signal
US20050091066A1 (en) * 2003-10-28 2005-04-28 Manoj Singhal Classification of speech and music using zero crossing
GB2413745A (en) * 2004-04-30 2005-11-02 Axeon Ltd Classifying audio content by musical style/genre and generating an identification signal accordingly to adjust parameters of an audio system
DE102004047069A1 (de) * 2004-09-28 2006-04-06 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Device and method for changing the segmentation of an audio piece
US7377233B2 (en) * 2005-01-11 2008-05-27 Pariff Llc Method and apparatus for the automatic identification of birds by their vocalizations
US7707485B2 (en) * 2005-09-28 2010-04-27 Vixs Systems, Inc. System and method for dynamic transrating based on content
US20070083365A1 (en) * 2005-10-06 2007-04-12 Dts, Inc. Neural network classifier for separating audio sources from a monophonic audio signal
US20080033583A1 (en) * 2006-08-03 2008-02-07 Broadcom Corporation Robust Speech/Music Classification for Audio Signals
US8015000B2 (en) * 2006-08-03 2011-09-06 Broadcom Corporation Classification-based frame loss concealment for audio signals
CN101165779B (zh) * 2006-10-20 2010-06-02 索尼株式会社 Information processing apparatus and method, program, and recording medium
US7856351B2 (en) * 2007-01-19 2010-12-21 Microsoft Corporation Integrated speech recognition and semantic classification
GB0709044D0 (en) 2007-05-11 2007-06-20 Teradyne Diagnostic Solutions Signal detection
US8422859B2 (en) * 2010-03-23 2013-04-16 Vixs Systems Inc. Audio-based chapter detection in multimedia stream
US9110817B2 (en) * 2011-03-24 2015-08-18 Sony Corporation Method for creating a markov process that generates sequences
WO2013008956A1 (fr) * 2011-07-14 2013-01-17 日本電気株式会社 Sound processing method, sound processing system, video content processing method, video content processing system, sound processing device, and control method and program for said device
CN102682766A (zh) * 2012-05-12 2012-09-19 黄莹 Self-learning voice-swapping machine for couples
CN103456301B (zh) * 2012-05-28 2019-02-12 中兴通讯股份有限公司 Scene recognition method and device based on ambient sound, and mobile terminal
US9263060B2 (en) 2012-08-21 2016-02-16 Marian Mason Publishing Company, Llc Artificial neural network based system for classification of the emotional content of digital music
CN107093991B (zh) 2013-03-26 2020-10-09 杜比实验室特许公司 Loudness normalization method and device based on target loudness
WO2017001611A1 (fr) 2015-06-30 2017-01-05 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Method and device for assigning and analyzing noises
US10490209B2 (en) * 2016-05-02 2019-11-26 Google Llc Automatic determination of timing windows for speech captions in an audio stream
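JP6749874B2 (ja) * 2017-09-08 2020-09-02 Kddi株式会社 Program, system, device, and method for determining the sound wave type from a sound wave signal
JP6812381B2 (ja) * 2018-02-08 2021-01-13 日本電信電話株式会社 Device for estimating causes of speech recognition accuracy degradation, method for estimating causes of speech recognition accuracy degradation, and program
CN109841216B (zh) * 2018-12-26 2020-12-15 珠海格力电器股份有限公司 Voice data processing method and device, and smart terminal
CN112397090B (zh) * 2020-11-09 2022-11-15 电子科技大学 FPGA-based real-time sound classification method and system
CN112270933B (zh) * 2020-11-12 2024-03-12 北京猿力未来科技有限公司 Audio recognition method and device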
US11514927B2 (en) * 2021-04-16 2022-11-29 Ubtech North America Research And Development Center Corp System and method for multichannel speech detection

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6714909B1 (en) * 1998-08-13 2004-03-30 At&T Corp. System and method for automated multimedia content indexing and retrieval
US6801895B1 (en) * 1998-12-07 2004-10-05 At&T Corp. Method and apparatus for segmenting a multi-media program based upon audio events
US6901362B1 (en) * 2000-04-19 2005-05-31 Microsoft Corporation Audio segmentation and classification
US6542869B1 (en) * 2000-05-11 2003-04-01 Fuji Xerox Co., Ltd. Method for automatic analysis of audio including music and speech
US6973256B1 (en) * 2000-10-30 2005-12-06 Koninklijke Philips Electronics N.V. System and method for detecting highlights in a video program using audio properties
US7058889B2 (en) * 2001-03-23 2006-06-06 Koninklijke Philips Electronics N.V. Synchronizing text/visual information with audio playback
US7295977B2 (en) * 2001-08-27 2007-11-13 Nec Laboratories America, Inc. Extracting classifying data in music from an audio bitstream
US20030236663A1 (en) * 2002-06-19 2003-12-25 Koninklijke Philips Electronics N.V. Mega speaker identification (ID) system and corresponding methods therefor
US7082394B2 (en) * 2002-06-25 2006-07-25 Microsoft Corporation Noise-robust feature extraction using multi-layer principal component analysis

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
HADI HARB, LIMING CHEN: "Video Scene Description: An Audio Based Approach", PROCEEDINGS OF THE FIRST MEDIANET CONFERENCE MEDIANET2002, June 2002 (2002-06-01), Souss, Tunisia, pages 243 - 254, XP002263716 *
LEFEVRE S ET AL: "3 classes segmentation for analysis of football audio sequences", 2002 14TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS. DSP 2002 (CAT. NO.02TH8628), 1 July 2002 (2002-07-01) - 3 July 2002 (2002-07-03), SANTORINI, GREECE, Piscataway, NJ, USA, IEEE, USA, pages 975 - 978 vol.2, XP002230889, ISBN: 0-7803-7503-3 *
QUELAVOINE R ET AL: "TRANSIENTS RECOGNITION IN UNDERWATER ACOUSTIC WITH MULTILAYER NEURAL NETWORKS", ENGINEERING BENEFITS FROM NEURAL NETWORKS. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE EANN, XX, XX, 1998, pages 330 - 333, XP000974500 *
ZHU LIU ET AL: "AUDIO FEATURE EXTRACTION AND ANALYSIS FOR SCENE SEGMENTATION AND CLASSIFICATION", JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL, IMAGE, AND VIDEO TECHNOLOGY, KLUWER ACADEMIC PUBLISHERS, DORDRECHT, NL, vol. 20, no. 1/2, 1 October 1998 (1998-10-01), pages 61 - 78, XP000786728, ISSN: 0922-5773 *

Also Published As

Publication number Publication date
AU2003263270A1 (en) 2004-01-23
JP2005532582A (ja) 2005-10-27
CA2491036A1 (fr) 2004-01-15
AU2003263270A8 (en) 2004-01-23
EP1535276A2 (fr) 2005-06-01
CN1666252A (zh) 2005-09-07
US20050228649A1 (en) 2005-10-13
FR2842014B1 (fr) 2006-05-05
WO2004006222A2 (fr) 2004-01-15
FR2842014A1 (fr) 2004-01-09

Similar Documents

Publication Publication Date Title
WO2004006222A3 (fr) Method and apparatus for classifying sound signals
EP3317879B1 (fr) Method and device for assigning and analyzing noises
WO2006041735A3 (fr) Spurious echo suppression
EP1640973A3 (fr) Audio signal processing method and apparatus
WO2007014271A3 (fr) Candidate selection
WO2005001667A3 (fr) Method and apparatus for data analysis
WO2006019556A3 (fr) Low-complexity music detection system and algorithm
ATE339001T1 (de) Device and method for analyzing an audio information signal
WO2001020965A3 (fr) Method for determining a current acoustic environment situation, use of the method, and hearing aid
WO2006127129A3 (fr) Systems and methods for image edge detection
WO2004075255A3 (fr) Detection of the end of a time-multiplexed etching process
WO2005022318A3 (fr) Method and system for generating acoustic fingerprints
WO2007008012A3 (fr) Device and method for processing an audio signal
WO2006023575A3 (fr) System and method for monitoring and enforcing a restricted wireless zone
WO2004030022A3 (fr) Automated optimization of the LC tuning electronics of an asymmetric waveform generator
BR0116002A (pt) Method and apparatus for robust speech classification
WO2004021926A3 (fr) Filter design for managing emboli
CA2445703A1 (fr) Microseismic event monitoring
CA2188369A1 (fr) Method and device for classifying speech signals
WO2004003572A3 (fr) Methods for improving a test process and apparatus therefor
WO2006124309A3 (fr) Method and device for source separation
ATE319160T1 (de) Method for noise-robust classification in speech coding
AU2003232066A1 (en) System and method for quality performance evaluation and reporting
WO1999001942A3 (fr) Method for reducing noise in speech signals and apparatus for applying the method
FR2854483A1 (fr) Method for identifying specific sounds

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SC SD SE SG SK SL TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 EP: the EPO has been informed by WIPO that EP was designated in this application
WWE WIPO information: entry into national phase

Ref document number: 2491036

Country of ref document: CA

WWE WIPO information: entry into national phase

Ref document number: 20038162059

Country of ref document: CN

Ref document number: 2004518885

Country of ref document: JP

WWE WIPO information: entry into national phase

Ref document number: 2003762744

Country of ref document: EP

WWE WIPO information: entry into national phase

Ref document number: 10518539

Country of ref document: US

WWP WIPO information: published in national office

Ref document number: 2003762744

Country of ref document: EP

WWW WIPO information: withdrawn in national office

Ref document number: 2003762744

Country of ref document: EP