DE112009005215T8 - Verfahren und Vorrichtung zur Audiosignalklassifizierung - Google Patents

Verfahren und Vorrichtung zur Audiosignalklassifizierung Download PDF

Info

Publication number
DE112009005215T8
DE112009005215T8 DE112009005215T DE112009005215T DE112009005215T8 DE 112009005215 T8 DE112009005215 T8 DE 112009005215T8 DE 112009005215 T DE112009005215 T DE 112009005215T DE 112009005215 T DE112009005215 T DE 112009005215T DE 112009005215 T8 DE112009005215 T8 DE 112009005215T8
Authority
DE
Germany
Prior art keywords
audio signal
signal classification
classification
audio
signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
DE112009005215T
Other languages
English (en)
Other versions
DE112009005215T5 (de
Inventor
Juka Vesa Tapani Rauhala
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
WSOU Investments LLC
Original Assignee
Nokia Oyj
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nokia Oyj filed Critical Nokia Oyj
Publication of DE112009005215T5 publication Critical patent/DE112009005215T5/de
Application granted granted Critical
Publication of DE112009005215T8 publication Critical patent/DE112009005215T8/de
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R29/00Monitoring arrangements; Testing arrangements
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2210/00Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
    • G10H2210/031Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal
    • G10H2210/046Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal for differentiation between music and non-music signals, based on the identification of musical parameters, e.g. based on tempo detection
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2240/00Data organisation or data communication aspects, specifically adapted for electrophonic musical tools or instruments
    • G10H2240/171Transmission of musical instrument data, control or status information; Transmission, remote access or control of music data for electrophonic musical instruments
    • G10H2240/201Physical layer or hardware aspects of transmission to or from an electrophonic musical instrument, e.g. voltage levels, bit streams, code words or symbols over a physical link connecting network nodes or instruments
    • G10H2240/241Telephone transmission, i.e. using twisted pair telephone lines or any type of telephone network
    • G10H2240/251Mobile telephone transmission, i.e. transmitting, accessing or controlling music data wirelessly via a wireless or mobile telephone receiver, analog or digital, e.g. DECT GSM, UMTS

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • General Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • Telephone Function (AREA)
  • Signal Processing For Digital Recording And Reproducing (AREA)
  • Circuit For Audible Band Transducer (AREA)
DE112009005215T 2009-08-04 2009-08-04 Verfahren und Vorrichtung zur Audiosignalklassifizierung Expired - Fee Related DE112009005215T8 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/EP2009/060122 WO2011015237A1 (en) 2009-08-04 2009-08-04 Method and apparatus for audio signal classification

Publications (2)

Publication Number Publication Date
DE112009005215T5 DE112009005215T5 (de) 2012-10-04
DE112009005215T8 true DE112009005215T8 (de) 2013-01-03

Family

ID=42025767

Family Applications (1)

Application Number Title Priority Date Filing Date
DE112009005215T Expired - Fee Related DE112009005215T8 (de) 2009-08-04 2009-08-04 Verfahren und Vorrichtung zur Audiosignalklassifizierung

Country Status (4)

Country Link
US (1) US9215538B2 (de)
CN (1) CN102498514B (de)
DE (1) DE112009005215T8 (de)
WO (1) WO2011015237A1 (de)

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9215538B2 (en) * 2009-08-04 2015-12-15 Nokia Technologies Oy Method and apparatus for audio signal classification
WO2013150340A1 (en) 2012-04-05 2013-10-10 Nokia Corporation Adaptive audio signal filtering
US9646626B2 (en) * 2013-11-22 2017-05-09 At&T Intellectual Property I, L.P. System and method for network bandwidth management for adjusting audio quality
US9564128B2 (en) * 2013-12-09 2017-02-07 Qualcomm Incorporated Controlling a speech recognition process of a computing device
ES2941782T3 (es) 2013-12-19 2023-05-25 Ericsson Telefon Ab L M Estimación de ruido de fondo en señales de audio
CN104732970B (zh) * 2013-12-20 2018-12-04 中国科学院声学研究所 一种基于综合特征的舰船辐射噪声识别方法
WO2015104447A1 (en) * 2014-01-13 2015-07-16 Nokia Technologies Oy Multi-channel audio signal classifier
CN103854646B (zh) * 2014-03-27 2018-01-30 成都康赛信息技术有限公司 一种实现数字音频自动分类的方法
US10678828B2 (en) 2016-01-03 2020-06-09 Gracenote, Inc. Model-based media classification service using sensed media noise characteristics
CN107146631B (zh) * 2016-02-29 2020-11-10 北京搜狗科技发展有限公司 音乐识别方法、音符识别模型建立方法、装置及电子设备
US10146371B2 (en) * 2016-03-29 2018-12-04 Microchip Technology Incorporated Water robustness and detection on capacitive buttons
US9749733B1 (en) * 2016-04-07 2017-08-29 Harman Intenational Industries, Incorporated Approach for detecting alert signals in changing environments
CN105872899A (zh) * 2016-04-20 2016-08-17 乐视控股(北京)有限公司 音频播放方法、装置和终端设备
CN109147770B (zh) 2017-06-16 2023-07-28 阿里巴巴集团控股有限公司 声音识别特征的优化、动态注册方法、客户端和服务器
US12016098B1 (en) 2019-09-12 2024-06-18 Renesas Electronics America System and method for user presence detection based on audio events
US11889288B2 (en) * 2020-07-30 2024-01-30 Sony Group Corporation Using entertainment system remote commander for audio system calibration
CN112162041B (zh) * 2020-09-30 2024-06-14 陕西师范大学 一种基于幅度均方根值的高斯分布识别金属材料的方法

Family Cites Families (57)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4133976A (en) * 1978-04-07 1979-01-09 Bell Telephone Laboratories, Incorporated Predictive speech signal coding with reduced noise effects
DE3102385A1 (de) * 1981-01-24 1982-09-02 Blaupunkt-Werke Gmbh, 3200 Hildesheim Schaltungsanordnung zur selbstaetigen aenderung der einstellung von tonwiedergabegeraeten, insbesondere rundfunkempfaengern
JPH02110658A (ja) * 1988-10-19 1990-04-23 Hitachi Ltd 文書編集装置
US5208864A (en) * 1989-03-10 1993-05-04 Nippon Telegraph & Telephone Corporation Method of detecting acoustic signal
KR940001861B1 (ko) * 1991-04-12 1994-03-09 삼성전자 주식회사 오디오 대역신호의 음성/음악 판별장치
BE1007355A3 (nl) * 1993-07-26 1995-05-23 Philips Electronics Nv Spraaksignaaldiscriminatieschakeling alsmede een audio-inrichting voorzien van een dergelijke schakeling.
JP3484757B2 (ja) * 1994-05-13 2004-01-06 ソニー株式会社 音声信号の雑音低減方法及び雑音区間検出方法
JP3591068B2 (ja) * 1995-06-30 2004-11-17 ソニー株式会社 音声信号の雑音低減方法
US5918223A (en) * 1996-07-22 1999-06-29 Muscle Fish Method and article of manufacture for content-based analysis, storage, retrieval, and segmentation of audio information
US6570991B1 (en) * 1996-12-18 2003-05-27 Interval Research Corporation Multi-feature speech/music discrimination system
US6819863B2 (en) * 1998-01-13 2004-11-16 Koninklijke Philips Electronics N.V. System and method for locating program boundaries and commercial boundaries using audio categories
US6801895B1 (en) * 1998-12-07 2004-10-05 At&T Corp. Method and apparatus for segmenting a multi-media program based upon audio events
US6714909B1 (en) * 1998-08-13 2004-03-30 At&T Corp. System and method for automated multimedia content indexing and retrieval
FI118359B (fi) * 1999-01-18 2007-10-15 Nokia Corp Menetelmä puheentunnistuksessa ja puheentunnistuslaite ja langaton viestin
US6556967B1 (en) * 1999-03-12 2003-04-29 The United States Of America As Represented By The National Security Agency Voice activity detector
US6901362B1 (en) * 2000-04-19 2005-05-31 Microsoft Corporation Audio segmentation and classification
US7065416B2 (en) * 2001-08-29 2006-06-20 Microsoft Corporation System and methods for providing automatic classification of media entities according to melodic movement properties
US7242421B2 (en) * 2000-11-10 2007-07-10 Perceptive Network Technologies, Inc. Methods of establishing a communications link using perceptual sensing of a user's presence
US6694293B2 (en) * 2001-02-13 2004-02-17 Mindspeed Technologies, Inc. Speech coding system with a music classifier
JP3574123B2 (ja) * 2001-03-28 2004-10-06 三菱電機株式会社 雑音抑圧装置
US7328153B2 (en) * 2001-07-20 2008-02-05 Gracenote, Inc. Automatic identification of sound recordings
US6785645B2 (en) * 2001-11-29 2004-08-31 Microsoft Corporation Real-time speech and music classifier
SE524162C2 (sv) * 2002-08-23 2004-07-06 Rickard Berg Förfarande för att behandla signaler
US7454331B2 (en) * 2002-08-30 2008-11-18 Dolby Laboratories Licensing Corporation Controlling loudness of speech in signals that contain speech and other types of audio material
JP3984526B2 (ja) * 2002-10-21 2007-10-03 富士通株式会社 音声対話システム及び方法
US20040167767A1 (en) * 2003-02-25 2004-08-26 Ziyou Xiong Method and system for extracting sports highlights from audio signals
JP4348970B2 (ja) * 2003-03-06 2009-10-21 ソニー株式会社 情報検出装置及び方法、並びにプログラム
WO2004095315A1 (en) * 2003-04-24 2004-11-04 Koninklijke Philips Electronics N.V. Parameterized temporal feature analysis
US20060229878A1 (en) * 2003-05-27 2006-10-12 Eric Scheirer Waveform recognition method and apparatus
MXPA05012785A (es) * 2003-05-28 2006-02-22 Dolby Lab Licensing Corp Metodo, aparato y programa de computadora para el calculo y ajuste de la sonoridad percibida de una senal de audio.
US20050091066A1 (en) * 2003-10-28 2005-04-28 Manoj Singhal Classification of speech and music using zero crossing
US20050096898A1 (en) * 2003-10-29 2005-05-05 Manoj Singhal Classification of speech and music using sub-band energy
EP1531458B1 (de) * 2003-11-12 2008-04-16 Sony Deutschland GmbH Vorrichtung und Verfahren zur automatischen Extraktion von wichtigen Ereignissen in Audiosignalen
FR2863080B1 (fr) * 2003-11-27 2006-02-24 Advestigo Procede d'indexation et d'identification de documents multimedias
US7179980B2 (en) * 2003-12-12 2007-02-20 Nokia Corporation Automatic extraction of musical portions of an audio stream
US7120576B2 (en) * 2004-07-16 2006-10-10 Mindspeed Technologies, Inc. Low-complexity music detection algorithm and system
US7454333B2 (en) * 2004-09-13 2008-11-18 Mitsubishi Electric Research Lab, Inc. Separating multiple audio signals recorded as a single mixed signal
BRPI0518278B1 (pt) * 2004-10-26 2018-04-24 Dolby Laboratories Licensing Corporation Método e aparelho para controlar uma característica de sonoridade particular de um sinal de áudio
US20060122834A1 (en) * 2004-12-03 2006-06-08 Bennett Ian M Emotion detection device & method for use in distributed systems
US8214214B2 (en) * 2004-12-03 2012-07-03 Phoenix Solutions, Inc. Emotion detection device and method for use in distributed systems
US8126706B2 (en) 2005-12-09 2012-02-28 Acoustic Technologies, Inc. Music detector for echo cancellation and noise reduction
KR101200615B1 (ko) * 2006-04-27 2012-11-12 돌비 레버러토리즈 라이쎈싱 코오포레이션 청각 이벤트 검출에 기반한 비-라우드니스를 이용한 자동 이득 제어
TWI312982B (en) 2006-05-22 2009-08-01 Nat Cheng Kung Universit Audio signal segmentation algorithm
US7873511B2 (en) * 2006-06-30 2011-01-18 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio encoder, audio decoder and audio processor having a dynamically variable warping characteristic
US20080033583A1 (en) * 2006-08-03 2008-02-07 Broadcom Corporation Robust Speech/Music Classification for Audio Signals
US8948428B2 (en) * 2006-09-05 2015-02-03 Gn Resound A/S Hearing aid with histogram based sound environment classification
US8046218B2 (en) * 2006-09-19 2011-10-25 The Board Of Trustees Of The University Of Illinois Speech and method for identifying perceptual features
WO2008078232A1 (en) * 2006-12-21 2008-07-03 Koninklijke Philips Electronics N.V. A system for processing audio data
ES2391228T3 (es) * 2007-02-26 2012-11-22 Dolby Laboratories Licensing Corporation Realce de voz en audio de entretenimiento
CA2690433C (en) * 2007-06-22 2016-01-19 Voiceage Corporation Method and device for sound activity detection and sound signal classification
EP2191467B1 (de) * 2007-09-12 2011-06-22 Dolby Laboratories Licensing Corporation Spracherweiterung
JP4327886B1 (ja) * 2008-05-30 2009-09-09 株式会社東芝 音質補正装置、音質補正方法及び音質補正用プログラム
US9344051B2 (en) * 2009-06-29 2016-05-17 Nokia Technologies Oy Apparatus, method and storage medium for performing adaptive audio equalization
US9215538B2 (en) * 2009-08-04 2015-12-15 Nokia Technologies Oy Method and apparatus for audio signal classification
CN102044244B (zh) * 2009-10-15 2011-11-16 华为技术有限公司 信号分类方法和装置
WO2011076288A1 (en) * 2009-12-24 2011-06-30 Nokia Corporation Loudspeaker protection apparatus and method thereof
US9998081B2 (en) * 2010-05-12 2018-06-12 Nokia Technologies Oy Method and apparatus for processing an audio signal based on an estimated loudness

Also Published As

Publication number Publication date
CN102498514A (zh) 2012-06-13
CN102498514B (zh) 2014-06-18
US9215538B2 (en) 2015-12-15
US20130103398A1 (en) 2013-04-25
DE112009005215T5 (de) 2012-10-04
WO2011015237A1 (en) 2011-02-10

Similar Documents

Publication Publication Date Title
DE112009005215T8 (de) Verfahren und Vorrichtung zur Audiosignalklassifizierung
ATE526662T1 (de) Vorrichtung und verfahren zur änderung eines audiosignals
DE112009001896A5 (de) Verfahren und Vorrichtung zur automatischen Fahrtrichtungsanzeige
HK1217384A1 (zh) 對音頻信號處理的裝置和對時域音頻信號進行處理的方法
DE602007008194D1 (de) Vorrichtung und Verfahren zur Audiosignalverarbeitung und Abbildungsvorrichtung
EP2259253A4 (de) Verfahren und vorrichtung zur verarbeitung von tonsignalen
DE112010000091A5 (de) Verfahren und vorrichtung zur fahrspurerkennung
EP2259254A4 (de) Verfahren und vorrichtung zur verarbeitung eines tonsignals
DE102008012669B8 (de) Vorrichtung und Verfahren zur visuellen Stimulation
DE602007010523D1 (de) Vorrichtung und Verfahren zur Personenidentifizierung
EP2446642A4 (de) Verfahren und vorrichtung zum verarbeiten von audiosignalen
DE602007000729D1 (de) Verfahren und Vorrichtung zur Authentifizierung
ATE540492T1 (de) Verfahren und vorrichtung zur frequenzbanderkennung
EP2383731A4 (de) Signalverarbeitungsverfahren und -vorrichtung
DE102009039685A8 (de) Verfahren und Vorrichtung zur Detektion von Defekten in einem Objekt
DE602007001927D1 (de) Verfahren und Vorrichtung zur Tonsignalkorrektur und Computerprogramm
DE602009000549D1 (de) Vorrichtung und Verfahren zur Bildverarbeitung
AT10759U2 (de) Verfahren und vorrichtung zur verifizierung eines automatisierungssystems
EP2525357A4 (de) Verfahren und vorrichtung zur verarbeitung eines tonsignals
DE102011122602A8 (de) Vorrichtung und Verfahren zur endoskopischen Fluoreszenzdetektion
DE502007001541D1 (de) Verfahren und vorrichtung zur fälschungssicherung von produkten
EP2522016A4 (de) Vorrichtung zur verarbeitung eines audiosignals und verfahren dafür
DE102010038830A8 (de) Vorrichtung und Verfahren zur wegmessenden Gewindeprüfung
EP2476115A4 (de) Verfahren und vorrichtung zur verarbeitung von tonsignalen
DE602006012831D1 (de) Verfahren und Vorrichtung zur Signalverarbeitung

Legal Events

Date Code Title Description
R163 Identified publications notified
R012 Request for examination validly filed
R079 Amendment of ipc main class

Free format text: PREVIOUS MAIN CLASS: G10L0011020000

Ipc: G10L0025000000

R079 Amendment of ipc main class

Free format text: PREVIOUS MAIN CLASS: G10L0011020000

Ipc: G10L0025000000

Effective date: 20140225

R081 Change of applicant/patentee

Owner name: NOKIA TECHNOLOGIES OY, FI

Free format text: FORMER OWNER: NOKIA CORP., 02610 ESPOO, FI

R082 Change of representative

Representative=s name: COHAUSZ & FLORACK PATENT- UND RECHTSANWAELTE P, DE

R119 Application deemed withdrawn, or ip right lapsed, due to non-payment of renewal fee
R081 Change of applicant/patentee

Owner name: WSOU INVESTMENTS, LLC, LOS ANGELES, US

Free format text: FORMER OWNER: NOKIA TECHNOLOGIES OY, ESPOO, FI

R082 Change of representative

Representative=s name: BARKHOFF REIMANN VOSSIUS, DE