WO2004006222A3 - Procede et appareil pour la classification de signaux sonores - Google Patents
Procede et appareil pour la classification de signaux sonores Download PDFInfo
- Publication number
- WO2004006222A3 WO2004006222A3 PCT/FR2003/002116 FR0302116W WO2004006222A3 WO 2004006222 A3 WO2004006222 A3 WO 2004006222A3 FR 0302116 W FR0302116 W FR 0302116W WO 2004006222 A3 WO2004006222 A3 WO 2004006222A3
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sound signal
- frequency
- temporal segments
- sound
- extracting
- Prior art date
Links
- 230000005236 sound signal Effects 0.000 title abstract 5
- 238000000034 method Methods 0.000 title abstract 2
- 230000002123 temporal effect Effects 0.000 abstract 3
- 238000001228 spectrum Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Auxiliary Devices For Music (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
- Measurement Of Velocity Or Position Using Acoustic Or Ultrasonic Waves (AREA)
Abstract
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/518,539 US20050228649A1 (en) | 2002-07-08 | 2003-07-08 | Method and apparatus for classifying sound signals |
JP2004518885A JP2005532582A (ja) | 2002-07-08 | 2003-07-08 | 音響信号に音響クラスを割り当てる方法及び装置 |
EP03762744A EP1535276A2 (fr) | 2002-07-08 | 2003-07-08 | Procede et appareil pour la classification de signaux sonores |
AU2003263270A AU2003263270A1 (en) | 2002-07-08 | 2003-07-08 | Method and apparatus for classifying sound signals |
CA002491036A CA2491036A1 (fr) | 2002-07-08 | 2003-07-08 | Procede et appareil pour la classification de signaux sonores |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR02/08548 | 2002-07-08 | ||
FR0208548A FR2842014B1 (fr) | 2002-07-08 | 2002-07-08 | Procede et appareil pour affecter une classe sonore a un signal sonore |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2004006222A2 WO2004006222A2 (fr) | 2004-01-15 |
WO2004006222A3 true WO2004006222A3 (fr) | 2004-04-08 |
Family
ID=29725263
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/FR2003/002116 WO2004006222A2 (fr) | 2002-07-08 | 2003-07-08 | Procede et appareil pour la classification de signaux sonores |
Country Status (8)
Country | Link |
---|---|
US (1) | US20050228649A1 (fr) |
EP (1) | EP1535276A2 (fr) |
JP (1) | JP2005532582A (fr) |
CN (1) | CN1666252A (fr) |
AU (1) | AU2003263270A1 (fr) |
CA (1) | CA2491036A1 (fr) |
FR (1) | FR2842014B1 (fr) |
WO (1) | WO2004006222A2 (fr) |
Families Citing this family (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4348970B2 (ja) * | 2003-03-06 | 2009-10-21 | ソニー株式会社 | 情報検出装置及び方法、並びにプログラム |
DE10313875B3 (de) * | 2003-03-21 | 2004-10-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Vorrichtung und Verfahren zum Analysieren eines Informationssignals |
US20050091066A1 (en) * | 2003-10-28 | 2005-04-28 | Manoj Singhal | Classification of speech and music using zero crossing |
GB2413745A (en) * | 2004-04-30 | 2005-11-02 | Axeon Ltd | Classifying audio content by musical style/genre and generating an identification signal accordingly to adjust parameters of an audio system |
DE102004047069A1 (de) * | 2004-09-28 | 2006-04-06 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Vorrichtung und Verfahren zum Ändern einer Segmentierung eines Audiostücks |
US7377233B2 (en) * | 2005-01-11 | 2008-05-27 | Pariff Llc | Method and apparatus for the automatic identification of birds by their vocalizations |
US7707485B2 (en) * | 2005-09-28 | 2010-04-27 | Vixs Systems, Inc. | System and method for dynamic transrating based on content |
US20070083365A1 (en) * | 2005-10-06 | 2007-04-12 | Dts, Inc. | Neural network classifier for separating audio sources from a monophonic audio signal |
US20080033583A1 (en) * | 2006-08-03 | 2008-02-07 | Broadcom Corporation | Robust Speech/Music Classification for Audio Signals |
US8015000B2 (en) * | 2006-08-03 | 2011-09-06 | Broadcom Corporation | Classification-based frame loss concealment for audio signals |
CN101165779B (zh) * | 2006-10-20 | 2010-06-02 | 索尼株式会社 | 信息处理装置和方法、程序及记录介质 |
US7856351B2 (en) * | 2007-01-19 | 2010-12-21 | Microsoft Corporation | Integrated speech recognition and semantic classification |
GB0709044D0 (en) | 2007-05-11 | 2007-06-20 | Teradyne Diagnostic Solutions | Signal detection |
US8422859B2 (en) * | 2010-03-23 | 2013-04-16 | Vixs Systems Inc. | Audio-based chapter detection in multimedia stream |
US9110817B2 (en) * | 2011-03-24 | 2015-08-18 | Sony Corporation | Method for creating a markov process that generates sequences |
WO2013008956A1 (fr) * | 2011-07-14 | 2013-01-17 | 日本電気株式会社 | Procédé de traitement de son, système de traitement de son, procédé de traitement de contenu vidéo, système de traitement de contenu vidéo, dispositif de traitement de son et procédé et programme de commande dudit dispositif |
CN102682766A (zh) * | 2012-05-12 | 2012-09-19 | 黄莹 | 可自学习的情侣声音对换机 |
CN103456301B (zh) * | 2012-05-28 | 2019-02-12 | 中兴通讯股份有限公司 | 一种基于环境声音的场景识别方法及装置及移动终端 |
US9263060B2 (en) | 2012-08-21 | 2016-02-16 | Marian Mason Publishing Company, Llc | Artificial neural network based system for classification of the emotional content of digital music |
CN107093991B (zh) | 2013-03-26 | 2020-10-09 | 杜比实验室特许公司 | 基于目标响度的响度归一化方法和设备 |
WO2017001611A1 (fr) | 2015-06-30 | 2017-01-05 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Procédé et dispositif pour affecter des bruits et les analyser |
US10490209B2 (en) * | 2016-05-02 | 2019-11-26 | Google Llc | Automatic determination of timing windows for speech captions in an audio stream |
JP6749874B2 (ja) * | 2017-09-08 | 2020-09-02 | Kddi株式会社 | 音波信号から音波種別を判定するプログラム、システム、装置及び方法 |
JP6812381B2 (ja) * | 2018-02-08 | 2021-01-13 | 日本電信電話株式会社 | 音声認識精度劣化要因推定装置、音声認識精度劣化要因推定方法、プログラム |
CN109841216B (zh) * | 2018-12-26 | 2020-12-15 | 珠海格力电器股份有限公司 | 语音数据的处理方法、装置和智能终端 |
CN112397090B (zh) * | 2020-11-09 | 2022-11-15 | 电子科技大学 | 一种基于fpga的实时声音分类方法及系统 |
CN112270933B (zh) * | 2020-11-12 | 2024-03-12 | 北京猿力未来科技有限公司 | 一种音频识别方法和装置 |
US11514927B2 (en) * | 2021-04-16 | 2022-11-29 | Ubtech North America Research And Development Center Corp | System and method for multichannel speech detection |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6714909B1 (en) * | 1998-08-13 | 2004-03-30 | At&T Corp. | System and method for automated multimedia content indexing and retrieval |
US6801895B1 (en) * | 1998-12-07 | 2004-10-05 | At&T Corp. | Method and apparatus for segmenting a multi-media program based upon audio events |
US6901362B1 (en) * | 2000-04-19 | 2005-05-31 | Microsoft Corporation | Audio segmentation and classification |
US6542869B1 (en) * | 2000-05-11 | 2003-04-01 | Fuji Xerox Co., Ltd. | Method for automatic analysis of audio including music and speech |
US6973256B1 (en) * | 2000-10-30 | 2005-12-06 | Koninklijke Philips Electronics N.V. | System and method for detecting highlights in a video program using audio properties |
US7058889B2 (en) * | 2001-03-23 | 2006-06-06 | Koninklijke Philips Electronics N.V. | Synchronizing text/visual information with audio playback |
US7295977B2 (en) * | 2001-08-27 | 2007-11-13 | Nec Laboratories America, Inc. | Extracting classifying data in music from an audio bitstream |
US20030236663A1 (en) * | 2002-06-19 | 2003-12-25 | Koninklijke Philips Electronics N.V. | Mega speaker identification (ID) system and corresponding methods therefor |
US7082394B2 (en) * | 2002-06-25 | 2006-07-25 | Microsoft Corporation | Noise-robust feature extraction using multi-layer principal component analysis |
-
2002
- 2002-07-08 FR FR0208548A patent/FR2842014B1/fr not_active Expired - Fee Related
-
2003
- 2003-07-08 WO PCT/FR2003/002116 patent/WO2004006222A2/fr not_active Application Discontinuation
- 2003-07-08 EP EP03762744A patent/EP1535276A2/fr not_active Withdrawn
- 2003-07-08 JP JP2004518885A patent/JP2005532582A/ja active Pending
- 2003-07-08 AU AU2003263270A patent/AU2003263270A1/en not_active Abandoned
- 2003-07-08 CA CA002491036A patent/CA2491036A1/fr not_active Abandoned
- 2003-07-08 US US10/518,539 patent/US20050228649A1/en not_active Abandoned
- 2003-07-08 CN CN038162059A patent/CN1666252A/zh active Pending
Non-Patent Citations (4)
Title |
---|
HADI HARB, LIMING CHEN: "Video Scene Description: An Audio Based Approach", PROCEEDINGS OF THE FIRST MEDIANET CONFERENCE MEDIANET2002, June 2002 (2002-06-01), Souss, Tunisia, pages 243 - 254, XP002263716 * |
LEFEVRE S ET AL: "3 classes segmentation for analysis of football audio sequences", 2002 14TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS. DSP 2002 (CAT. NO.02TH8628), 1 July 2002 (2002-07-01) - 3 July 2002 (2002-07-03), SANTORINI, GREECE, Piscataway, NJ, USA, IEEE, USA, pages 975 - 978 vol.2, XP002230889, ISBN: 0-7803-7503-3 * |
QUELAVOINE R ET AL: "TRANSIENTS RECOGNITION IN UNDERWATER ACOUSTIC WITH MULTILAYER NEURAL NETWORKS", ENGINEERING BENEFITS FROM NEURAL NETWORKS. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE EANN, XX, XX, 1998, pages 330 - 333, XP000974500 * |
ZHU LIU ET AL: "AUDIO FEATURE EXTRACTION AND ANALYSIS FOR SCENE SEGMENTATION AND CLASSIFICATION", JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL. IMAGE, AND VIDEO TECHNOLOGY, KLUWER ACADEMIC PUBLISHERS, DORDRECHT, NL, vol. 20, no. 1/2, 1 October 1998 (1998-10-01), pages 61 - 78, XP000786728, ISSN: 0922-5773 * |
Also Published As
Publication number | Publication date |
---|---|
AU2003263270A1 (en) | 2004-01-23 |
JP2005532582A (ja) | 2005-10-27 |
CA2491036A1 (fr) | 2004-01-15 |
AU2003263270A8 (en) | 2004-01-23 |
EP1535276A2 (fr) | 2005-06-01 |
CN1666252A (zh) | 2005-09-07 |
US20050228649A1 (en) | 2005-10-13 |
FR2842014B1 (fr) | 2006-05-05 |
WO2004006222A2 (fr) | 2004-01-15 |
FR2842014A1 (fr) | 2004-01-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2004006222A3 (fr) | Procede et appareil pour la classification de signaux sonores | |
EP3317879B1 (fr) | Procédé et dispositif pour affecter des bruits et les analyser | |
WO2006041735A3 (fr) | Suppression d'échos parasites | |
EP1640973A3 (fr) | Méthode et appareil de traitement de signal audio | |
WO2007014271A3 (fr) | Selection de candidats | |
WO2005001667A3 (fr) | Procede et appareil pour l'analyse de donnees | |
WO2006019556A3 (fr) | Systeme et algorithme de detection de musique a faible complexite | |
ATE339001T1 (de) | Vorrichtung und verfahren zum analysieren eines audio-informationssignals | |
WO2001020965A3 (fr) | Procede de determination d'une situation d'environnement acoustique momentanee, utilisation de ce procede, et prothese auditive | |
WO2006127129A3 (fr) | Systemes et procedes de detection des bordures d'images | |
WO2004075255A3 (fr) | Detection de la fin d'un processus de gravure a multiplexage dans le temps | |
WO2005022318A3 (fr) | Procede et systeme de generation d'empreintes acoustiques | |
WO2007008012A3 (fr) | Dispositif et procédé de traitement d'un signal audio | |
WO2006023575A3 (fr) | Systeme et procede de monde surveillance et de renfort de zone sans fil reglementee | |
WO2004030022A3 (fr) | Optimisation automatisee de l'electronique de syntonisation lc du generateur de forme d'onde asymetrique | |
BR0116002A (pt) | método e equipamento para classificação de fala robusta | |
WO2004021926A3 (fr) | Conception de filtre pour la gestion d'embolies | |
CA2445703A1 (fr) | Surveillance d'un evenement microsismique | |
CA2188369A1 (fr) | Methode et dispositif de classification de signaux vocaux | |
WO2004003572A3 (fr) | Procedes permettant d'ameliorer un processus d'essai et appareil a cet effet | |
WO2006124309A3 (fr) | Procede et dispositif destines a la separation de sources | |
ATE319160T1 (de) | Verfahren zur rauschrobusten klassifikation in der sprachkodierung | |
AU2003232066A1 (en) | System and method for quality performance evaluation and reporting | |
WO1999001942A3 (fr) | Procede de reduction de bruit dans des signaux vocaux et appareil d'application du procede | |
FR2854483A1 (fr) | Procede d'identification de sons specifiques |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SC SD SE SG SK SL TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2491036 Country of ref document: CA |
|
WWE | Wipo information: entry into national phase |
Ref document number: 20038162059 Country of ref document: CN Ref document number: 2004518885 Country of ref document: JP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2003762744 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 10518539 Country of ref document: US |
|
WWP | Wipo information: published in national office |
Ref document number: 2003762744 Country of ref document: EP |
|
WWW | Wipo information: withdrawn in national office |
Ref document number: 2003762744 Country of ref document: EP |