HK1158804A1 - Method and discriminator for classifying different segments of a signal - Google Patents

Method and discriminator for classifying different segments of a signal

Info

Publication number
HK1158804A1
HK1158804A1 HK11112970.6A HK11112970A HK1158804A1 HK 1158804 A1 HK1158804 A1 HK 1158804A1 HK 11112970 A HK11112970 A HK 11112970A HK 1158804 A1 HK1158804 A1 HK 1158804A1
Authority
HK
Hong Kong
Prior art keywords
signal
term
short
long
type
Prior art date
Application number
HK11112970.6A
Other languages
English (en)
Inventor
Guillaume Fuchs
Stefan Bayer
Frederik Nagel
Jurgen Herre
Nikolaus Rettelbach
Stefan Wabnik
Yoshikazu Yokotani
Jens Hirschfeld
Jeremie Lecomte
Original Assignee
Fraunhofer Ges Forschung
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fraunhofer Ges Forschung filed Critical Fraunhofer Ges Forschung
Publication of HK1158804A1 publication Critical patent/HK1158804A1/xx

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/81Detection of presence or absence of voice signals for discriminating voice from music
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/20Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/22Mode decision, i.e. based on audio signal content versus external parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L2025/783Detection of presence or absence of voice signals based on threshold decision
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Image Analysis (AREA)
HK11112970.6A 2008-07-11 2011-11-30 Method and discriminator for classifying different segments of a signal HK1158804A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US7987508P 2008-07-11 2008-07-11
PCT/EP2009/004339 WO2010003521A1 (en) 2008-07-11 2009-06-16 Method and discriminator for classifying different segments of a signal

Publications (1)

Publication Number Publication Date
HK1158804A1 true HK1158804A1 (en) 2012-07-20

Family

ID=40851974

Family Applications (1)

Application Number Title Priority Date Filing Date
HK11112970.6A HK1158804A1 (en) 2008-07-11 2011-11-30 Method and discriminator for classifying different segments of a signal

Country Status (20)

Country Link
US (1) US8571858B2 (xx)
EP (1) EP2301011B1 (xx)
JP (1) JP5325292B2 (xx)
KR (2) KR101281661B1 (xx)
CN (1) CN102089803B (xx)
AR (1) AR072863A1 (xx)
AU (1) AU2009267507B2 (xx)
BR (1) BRPI0910793B8 (xx)
CA (1) CA2730196C (xx)
CO (1) CO6341505A2 (xx)
ES (1) ES2684297T3 (xx)
HK (1) HK1158804A1 (xx)
MX (1) MX2011000364A (xx)
MY (1) MY153562A (xx)
PL (1) PL2301011T3 (xx)
PT (1) PT2301011T (xx)
RU (1) RU2507609C2 (xx)
TW (1) TWI441166B (xx)
WO (1) WO2010003521A1 (xx)
ZA (1) ZA201100088B (xx)

Families Citing this family (47)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5551695B2 (ja) * 2008-07-11 2014-07-16 フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ 音声符号器、音声復号器、音声符号化方法、音声復号化方法およびコンピュータプログラム
CN101847412B (zh) * 2009-03-27 2012-02-15 华为技术有限公司 音频信号的分类方法及装置
KR101666521B1 (ko) * 2010-01-08 2016-10-14 삼성전자 주식회사 입력 신호의 피치 주기 검출 방법 및 그 장치
SG189277A1 (en) * 2010-10-06 2013-05-31 Fraunhofer Ges Forschung Apparatus and method for processing an audio signal and for providing a higher temporal granularity for a combined unified speech and audio codec (usac)
US8521541B2 (en) * 2010-11-02 2013-08-27 Google Inc. Adaptive audio transcoding
CN103000172A (zh) * 2011-09-09 2013-03-27 中兴通讯股份有限公司 信号分类方法和装置
US20130090926A1 (en) * 2011-09-16 2013-04-11 Qualcomm Incorporated Mobile device context information using speech detection
EP2772914A4 (en) * 2011-10-28 2015-07-15 Panasonic Corp DECODER FOR HYBRID SOUND SIGNALS, COORDINATORS FOR HYBRID SOUND SIGNALS, DECODING PROCEDURE FOR SOUND SIGNALS AND CODING SIGNALING PROCESSES
CN103139930B (zh) 2011-11-22 2015-07-08 华为技术有限公司 连接建立方法和用户设备
US9111531B2 (en) 2012-01-13 2015-08-18 Qualcomm Incorporated Multiple coding mode signal classification
EP2702776B1 (en) * 2012-02-17 2015-09-23 Huawei Technologies Co., Ltd. Parametric encoder for encoding a multi-channel audio signal
US20130317821A1 (en) * 2012-05-24 2013-11-28 Qualcomm Incorporated Sparse signal detection with mismatched models
EP3301676A1 (en) 2012-08-31 2018-04-04 Telefonaktiebolaget LM Ericsson (publ) Method and device for voice activity detection
US9589570B2 (en) 2012-09-18 2017-03-07 Huawei Technologies Co., Ltd. Audio classification based on perceptual quality for low or medium bit rates
SG10201706626XA (en) * 2012-11-13 2017-09-28 Samsung Electronics Co Ltd Method and apparatus for determining encoding mode, method and apparatus for encoding audio signals, and method and apparatus for decoding audio signals
EP2954635B1 (en) * 2013-02-19 2021-07-28 Huawei Technologies Co., Ltd. Frame structure for filter bank multi-carrier (fbmc) waveforms
PT2959482T (pt) 2013-02-20 2019-08-02 Fraunhofer Ges Forschung Aparelho e método para codificar ou descodificar um sinal de áudio usando uma sobreposição dependente da localização de transiente
CN104347067B (zh) 2013-08-06 2017-04-12 华为技术有限公司 一种音频信号分类方法和装置
US9666202B2 (en) * 2013-09-10 2017-05-30 Huawei Technologies Co., Ltd. Adaptive bandwidth extension and apparatus for the same
KR101498113B1 (ko) * 2013-10-23 2015-03-04 광주과학기술원 사운드 신호의 대역폭 확장 장치 및 방법
EP3109861B1 (en) * 2014-02-24 2018-12-12 Samsung Electronics Co., Ltd. Signal classifying method and device, and audio encoding method and device using same
CN107452391B (zh) 2014-04-29 2020-08-25 华为技术有限公司 音频编码方法及相关装置
KR20180095123A (ko) 2014-05-15 2018-08-24 텔레폰악티에볼라겟엘엠에릭슨(펍) 오디오 신호 분류 및 코딩
CN107424622B (zh) 2014-06-24 2020-12-25 华为技术有限公司 音频编码方法和装置
US9886963B2 (en) * 2015-04-05 2018-02-06 Qualcomm Incorporated Encoder selection
ES2829413T3 (es) * 2015-05-20 2021-05-31 Ericsson Telefon Ab L M Codificación de señales de audio de múltiples canales
US10706873B2 (en) * 2015-09-18 2020-07-07 Sri International Real-time speaker state analytics platform
WO2017196422A1 (en) * 2016-05-12 2017-11-16 Nuance Communications, Inc. Voice activity detection feature based on modulation-phase differences
US10699538B2 (en) * 2016-07-27 2020-06-30 Neosensory, Inc. Method and system for determining and providing sensory experiences
WO2018048907A1 (en) 2016-09-06 2018-03-15 Neosensory, Inc. C/O Tmc+260 Method and system for providing adjunct sensory information to a user
CN107895580B (zh) * 2016-09-30 2021-06-01 华为技术有限公司 一种音频信号的重建方法和装置
US10744058B2 (en) 2017-04-20 2020-08-18 Neosensory, Inc. Method and system for providing information to a user
US10325588B2 (en) 2017-09-28 2019-06-18 International Business Machines Corporation Acoustic feature extractor selected according to status flag of frame of acoustic signal
KR20210102899A (ko) * 2018-12-13 2021-08-20 돌비 레버러토리즈 라이쎈싱 코오포레이션 이중 종단 미디어 인텔리전스
RU2761940C1 (ru) 2018-12-18 2021-12-14 Общество С Ограниченной Ответственностью "Яндекс" Способы и электронные устройства для идентификации пользовательского высказывания по цифровому аудиосигналу
WO2020214541A1 (en) 2019-04-18 2020-10-22 Dolby Laboratories Licensing Corporation A dialog detector
CN110288983B (zh) * 2019-06-26 2021-10-01 上海电机学院 一种基于机器学习的语音处理方法
WO2021062276A1 (en) 2019-09-25 2021-04-01 Neosensory, Inc. System and method for haptic stimulation
US11467668B2 (en) 2019-10-21 2022-10-11 Neosensory, Inc. System and method for representing virtual object information with haptic stimulation
US11079854B2 (en) 2020-01-07 2021-08-03 Neosensory, Inc. Method and system for haptic stimulation
US12062381B2 (en) * 2020-04-16 2024-08-13 Voiceage Corporation Method and device for speech/music classification and core encoder selection in a sound codec
US11497675B2 (en) 2020-10-23 2022-11-15 Neosensory, Inc. Method and system for multimodal stimulation
CN117178322A (zh) * 2021-01-08 2023-12-05 沃伊斯亚吉公司 用于声音信号的统一时域/频域编码的方法和装置
US11862147B2 (en) 2021-08-13 2024-01-02 Neosensory, Inc. Method and system for enhancing the intelligibility of information for a user
US20230147185A1 (en) * 2021-11-08 2023-05-11 Lemon Inc. Controllable music generation
US11995240B2 (en) 2021-11-16 2024-05-28 Neosensory, Inc. Method and system for conveying digital texture information to a user
CN116070174A (zh) * 2023-03-23 2023-05-05 长沙融创智胜电子科技有限公司 一种多类别目标识别方法及系统

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
IT1232084B (it) * 1989-05-03 1992-01-23 Cselt Centro Studi Lab Telecom Sistema di codifica per segnali audio a banda allargata
JPH0490600A (ja) * 1990-08-03 1992-03-24 Sony Corp 音声認識装置
JPH04342298A (ja) * 1991-05-20 1992-11-27 Nippon Telegr & Teleph Corp <Ntt> 瞬時ピッチ分析方法及び有声・無声判定方法
RU2049456C1 (ru) * 1993-06-22 1995-12-10 Вячеслав Алексеевич Сапрыкин Способ передачи речевых сигналов
US6134518A (en) 1997-03-04 2000-10-17 International Business Machines Corporation Digital audio signal coding using a CELP coder and a transform coder
JP3700890B2 (ja) * 1997-07-09 2005-09-28 ソニー株式会社 信号識別装置及び信号識別方法
RU2132593C1 (ru) * 1998-05-13 1999-06-27 Академия управления МВД России Многоканальное устройство для передачи речевых сигналов
SE0004187D0 (sv) 2000-11-15 2000-11-15 Coding Technologies Sweden Ab Enhancing the performance of coding systems that use high frequency reconstruction methods
US6785645B2 (en) * 2001-11-29 2004-08-31 Microsoft Corporation Real-time speech and music classifier
CN1279512C (zh) 2001-11-29 2006-10-11 编码技术股份公司 用于改善高频重建的方法和装置
AUPS270902A0 (en) * 2002-05-31 2002-06-20 Canon Kabushiki Kaisha Robust detection and classification of objects in audio using limited training data
JP4348970B2 (ja) * 2003-03-06 2009-10-21 ソニー株式会社 情報検出装置及び方法、並びにプログラム
JP2004354589A (ja) * 2003-05-28 2004-12-16 Nippon Telegr & Teleph Corp <Ntt> 音響信号判別方法、音響信号判別装置、音響信号判別プログラム
EP1758274A4 (en) * 2004-06-01 2012-03-14 Nec Corp SYSTEM, METHOD AND PROGRAM PROVIDING INFORMATION
US7130795B2 (en) * 2004-07-16 2006-10-31 Mindspeed Technologies, Inc. Music detection with low-complexity pitch correlation algorithm
JP4587916B2 (ja) * 2005-09-08 2010-11-24 シャープ株式会社 音声信号判別装置、音質調整装置、コンテンツ表示装置、プログラム、及び記録媒体
ES2343862T3 (es) 2006-09-13 2010-08-11 Telefonaktiebolaget Lm Ericsson (Publ) Metodos y disposiciones para un emisor y receptor de conversacion/audio.
CN1920947B (zh) * 2006-09-15 2011-05-11 清华大学 用于低比特率音频编码的语音/音乐检测器
US9583117B2 (en) * 2006-10-10 2017-02-28 Qualcomm Incorporated Method and apparatus for encoding and decoding audio signals
MX2009006201A (es) * 2006-12-12 2009-06-22 Fraunhofer Ges Forschung Codificador, decodificador y metodos para codificar y decodificar segmentos de datos que representan una corriente de datos del dominio temporal.
KR100964402B1 (ko) * 2006-12-14 2010-06-17 삼성전자주식회사 오디오 신호의 부호화 모드 결정 방법 및 장치와 이를 이용한 오디오 신호의 부호화/복호화 방법 및 장치
KR100883656B1 (ko) * 2006-12-28 2009-02-18 삼성전자주식회사 오디오 신호의 분류 방법 및 장치와 이를 이용한 오디오신호의 부호화/복호화 방법 및 장치
US8428949B2 (en) * 2008-06-30 2013-04-23 Waves Audio Ltd. Apparatus and method for classification and segmentation of audio content, based on the audio signal

Also Published As

Publication number Publication date
KR20130036358A (ko) 2013-04-11
US20110202337A1 (en) 2011-08-18
RU2011104001A (ru) 2012-08-20
AR072863A1 (es) 2010-09-29
KR101281661B1 (ko) 2013-07-03
MX2011000364A (es) 2011-02-25
RU2507609C2 (ru) 2014-02-20
CA2730196A1 (en) 2010-01-14
EP2301011B1 (en) 2018-07-25
BRPI0910793B1 (pt) 2020-11-24
PT2301011T (pt) 2018-10-26
CA2730196C (en) 2014-10-21
CN102089803A (zh) 2011-06-08
KR101380297B1 (ko) 2014-04-02
ZA201100088B (en) 2011-08-31
PL2301011T3 (pl) 2019-03-29
TWI441166B (zh) 2014-06-11
CO6341505A2 (es) 2011-11-21
KR20110039254A (ko) 2011-04-15
ES2684297T3 (es) 2018-10-02
BRPI0910793A2 (pt) 2016-08-02
JP2011527445A (ja) 2011-10-27
JP5325292B2 (ja) 2013-10-23
MY153562A (en) 2015-02-27
US8571858B2 (en) 2013-10-29
EP2301011A1 (en) 2011-03-30
TW201009813A (en) 2010-03-01
AU2009267507A1 (en) 2010-01-14
AU2009267507B2 (en) 2012-08-02
CN102089803B (zh) 2013-02-27
WO2010003521A1 (en) 2010-01-14
BRPI0910793B8 (pt) 2021-08-24

Similar Documents

Publication Publication Date Title
HK1158804A1 (en) Method and discriminator for classifying different segments of a signal
WO2012100066A3 (en) Sentiment analysis
GB2526929A (en) Captioning using socially derived acoustic profiles
WO2011027004A3 (en) Method for operating a hearing device and a hearing device
PH12017502232A1 (en) High-band signal generation
EP2137726A4 (en) METHOD AND DEVICE FOR PROCESSING AN AUDIO SIGNAL
WO2006091551A3 (en) Audio signal de-identification
HK1149842A1 (en) Device and method for calculating a fingerprint of an audio signal, device and method for synchronizing and device and method for characterizing a test audio signal
WO2016028628A3 (en) System and method for speech validation
EP3767620A3 (en) Speech endpointing based on word comparisons
EP2186090A4 (en) TRANSIENT DETECTOR AND METHOD FOR SUPPORTING CODING OF AUDIO SIGNAL
GB2464049A (en) System for identifying content of digital data
WO2010041131A8 (en) Associating source information with phonetic indices
WO2008139203A3 (en) Data processing apparatus
IN2013MU02149A (xx)
WO2012027595A3 (en) Techniques for object based operations
AR079998A1 (es) Aparato y metodo para extraer una senal directa/de ambiente de una senal de mezcla descendente e informacion parametrica espacial
WO2010096193A3 (en) Identifying a document by performing spectral analysis on the contents of the document
SG171546A1 (en) Audio system with portable audio enhancement device
EP3748631A3 (en) Low power integrated circuit to analyze a digitized audio stream
EP3182409A3 (en) Determining the inter-channel time difference of a multi-channel audio signal
WO2010117712A3 (en) Systems and methods for measuring speech intelligibility
WO2009081212A3 (en) Data normalisation for investigative data mining
WO2014004567A3 (en) Identifying media on a mobile device
WO2006082868A3 (en) Method and system for identifying speech sound and non-speech sound in an environment