RU2587652C2 - Способ и устройство для оценки структуры в сигнале - Google Patents

Способ и устройство для оценки структуры в сигнале Download PDF

Info

Publication number
RU2587652C2
RU2587652C2 RU2013126409/08A RU2013126409A RU2587652C2 RU 2587652 C2 RU2587652 C2 RU 2587652C2 RU 2013126409/08 A RU2013126409/08 A RU 2013126409/08A RU 2013126409 A RU2013126409 A RU 2013126409A RU 2587652 C2 RU2587652 C2 RU 2587652C2
Authority
RU
Russia
Prior art keywords
signal
spectrum
combined
correlation
time domain
Prior art date
Application number
RU2013126409/08A
Other languages
English (en)
Russian (ru)
Other versions
RU2013126409A (ru
Inventor
Эркан Ферит ГИГИ
Original Assignee
Конинклейке Филипс Электроникс Н.В.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Конинклейке Филипс Электроникс Н.В. filed Critical Конинклейке Филипс Электроникс Н.В.
Publication of RU2013126409A publication Critical patent/RU2013126409A/ru
Application granted granted Critical
Publication of RU2587652C2 publication Critical patent/RU2587652C2/ru

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90Pitch determination of speech signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Auxiliary Devices For Music (AREA)
  • Measurement Of Resistance Or Impedance (AREA)
  • Radar Systems Or Details Thereof (AREA)
RU2013126409/08A 2010-11-10 2011-11-07 Способ и устройство для оценки структуры в сигнале RU2587652C2 (ru)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP10190709.5 2010-11-10
EP10190709 2010-11-10
PCT/IB2011/054951 WO2012063185A1 (en) 2010-11-10 2011-11-07 Method and device for estimating a pattern in a signal

Publications (2)

Publication Number Publication Date
RU2013126409A RU2013126409A (ru) 2014-12-20
RU2587652C2 true RU2587652C2 (ru) 2016-06-20

Family

ID=44999842

Family Applications (1)

Application Number Title Priority Date Filing Date
RU2013126409/08A RU2587652C2 (ru) 2010-11-10 2011-11-07 Способ и устройство для оценки структуры в сигнале

Country Status (7)

Country Link
US (1) US9208799B2 (enExample)
EP (1) EP2638541A1 (enExample)
JP (1) JP5992427B2 (enExample)
CN (1) CN103189916B (enExample)
BR (1) BR112013011312A2 (enExample)
RU (1) RU2587652C2 (enExample)
WO (1) WO2012063185A1 (enExample)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102783034B (zh) * 2011-02-01 2014-12-17 华为技术有限公司 用于提供信号处理系数的方法和设备
JP6114053B2 (ja) * 2013-02-15 2017-04-12 日本電信電話株式会社 音源分離装置、音源分離方法、およびプログラム
EP3537439B1 (en) 2014-05-01 2020-05-13 Nippon Telegraph and Telephone Corporation Periodic-combined-envelope-sequence generation device, periodic-combined-envelope-sequence generation method, periodic-combined-envelope-sequence generation program and recording medium
EP3121814A1 (en) * 2015-07-24 2017-01-25 Sound object techology S.A. in organization A method and a system for decomposition of acoustic signal into sound objects, a sound object and its use
US9717424B2 (en) 2015-10-19 2017-08-01 Garmin Switzerland Gmbh System and method for generating a PPG signal
CN109410980A (zh) * 2016-01-22 2019-03-01 大连民族大学 一种基频估计算法在各类具有谐波结构的信号的基频估计中的应用
EP3396670B1 (en) * 2017-04-28 2020-11-25 Nxp B.V. Speech signal processing
KR101944429B1 (ko) * 2018-11-15 2019-01-30 엘아이지넥스원 주식회사 주파수 분석 방법 및 이를 지원하는 장치
CN110197666B (zh) * 2019-05-30 2022-05-10 广东工业大学 一种基于神经网络的语音识别方法、装置
WO2020261497A1 (ja) * 2019-06-27 2020-12-30 ローランド株式会社 楽音信号のパワーの平坦化方法及び装置、並びに、楽曲のビートタイミング検出方法及び装置
EP3888542A1 (en) 2020-04-01 2021-10-06 Koninklijke Philips N.V. Inductive sensing system and method
CN115067916A (zh) * 2022-06-15 2022-09-20 南京邮电大学 一种基于毫米波雷达的生命体征监测方法
US12336797B2 (en) 2022-10-26 2025-06-24 Garmin International, Inc. Wrist-worn electronic device with optical cardiac monitor
CN116206000B (zh) * 2022-12-12 2025-09-09 中国电子科技集团公司第七研究所 一种基于lab颜色空间映射的幅度相位时频图表达方法
CN118689612B (zh) * 2024-08-23 2024-11-12 卓望数码技术(深圳)有限公司 安全防护任务调度方法、装置、计算机设备及存储介质

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
RU2234746C2 (ru) * 2002-10-30 2004-08-20 Пермский государственный университет Способ дикторонезависимого распознавания звуков речи
EP2137725A1 (en) * 2007-04-26 2009-12-30 Dolby Sweden AB Apparatus and method for synthesizing an output signal
RU2009103010A (ru) * 2006-06-30 2010-08-10 Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. (De) Аудиокодер, аудиодекодер и аудиопроцессор, имеющий динамически изменяющуюся характеристику перекоса

Family Cites Families (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3617636A (en) 1968-09-24 1971-11-02 Nippon Electric Co Pitch detection apparatus
US3622966A (en) * 1970-07-17 1971-11-23 Atlantic Richfield Co Wavelet standardization
US4720802A (en) * 1983-07-26 1988-01-19 Lear Siegler Noise compensation arrangement
NL8400552A (nl) 1984-02-22 1985-09-16 Philips Nv Systeem voor het analyseren van menselijke spraak.
GB2165654B (en) * 1984-10-12 1988-05-25 Yue Lin Thomas Hong Method and apparatus for evaluating auditory distortions of an audio system
US5781880A (en) 1994-11-21 1998-07-14 Rockwell International Corporation Pitch lag estimation using frequency-domain lowpass filtering of the linear predictive coding (LPC) residual
WO1997027578A1 (en) 1996-01-26 1997-07-31 Motorola Inc. Very low bit rate time domain speech analyzer for voice messaging
US5864795A (en) * 1996-02-20 1999-01-26 Advanced Micro Devices, Inc. System and method for error correction in a correlation-based pitch estimator
US5946650A (en) * 1997-06-19 1999-08-31 Tritech Microelectronics, Ltd. Efficient pitch estimation method
WO1999003097A2 (en) * 1997-07-11 1999-01-21 Koninklijke Philips Electronics N.V. Transmitter with an improved speech encoder and decoder
KR100269216B1 (ko) * 1998-04-16 2000-10-16 윤종용 스펙트로-템포럴 자기상관을 사용한 피치결정시스템 및 방법
US6459914B1 (en) * 1998-05-27 2002-10-01 Telefonaktiebolaget Lm Ericsson (Publ) Signal noise reduction by spectral subtraction using spectrum dependent exponential gain function averaging
US6067511A (en) * 1998-07-13 2000-05-23 Lockheed Martin Corp. LPC speech synthesis using harmonic excitation generator with phase modulator for voiced speech
US6470311B1 (en) * 1999-10-15 2002-10-22 Fonix Corporation Method and apparatus for determining pitch synchronous frames
US7337107B2 (en) * 2000-10-02 2008-02-26 The Regents Of The University Of California Perceptual harmonic cepstral coefficients as the front-end for speech recognition
US7272551B2 (en) * 2003-02-24 2007-09-18 International Business Machines Corporation Computational effectiveness enhancement of frequency domain pitch estimators
EP1671316B1 (en) * 2003-09-29 2007-08-01 Koninklijke Philips Electronics N.V. Encoding audio signals
KR100653643B1 (ko) * 2006-01-26 2006-12-05 삼성전자주식회사 하모닉과 비하모닉의 비율을 이용한 피치 검출 방법 및피치 검출 장치
US20090018824A1 (en) * 2006-01-31 2009-01-15 Matsushita Electric Industrial Co., Ltd. Audio encoding device, audio decoding device, audio encoding system, audio encoding method, and audio decoding method
US7778831B2 (en) * 2006-02-21 2010-08-17 Sony Computer Entertainment Inc. Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch
CN100541609C (zh) * 2006-09-18 2009-09-16 华为技术有限公司 一种实现开环基音搜索的方法和装置
US8560328B2 (en) * 2006-12-15 2013-10-15 Panasonic Corporation Encoding device, decoding device, and method thereof
EP1944754B1 (en) * 2007-01-12 2016-08-31 Nuance Communications, Inc. Speech fundamental frequency estimator and method for estimating a speech fundamental frequency
CN101599272B (zh) * 2008-12-30 2011-06-08 华为技术有限公司 基音搜索方法及装置
US20100223061A1 (en) * 2009-02-27 2010-09-02 Nokia Corporation Method and Apparatus for Audio Coding
CN101853240B (zh) * 2009-03-31 2012-07-04 华为技术有限公司 一种信号周期的估计方法和装置
EP2249333B1 (en) * 2009-05-06 2014-08-27 Nuance Communications, Inc. Method and apparatus for estimating a fundamental frequency of a speech signal

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
RU2234746C2 (ru) * 2002-10-30 2004-08-20 Пермский государственный университет Способ дикторонезависимого распознавания звуков речи
RU2009103010A (ru) * 2006-06-30 2010-08-10 Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. (De) Аудиокодер, аудиодекодер и аудиопроцессор, имеющий динамически изменяющуюся характеристику перекоса
EP2137725A1 (en) * 2007-04-26 2009-12-30 Dolby Sweden AB Apparatus and method for synthesizing an output signal

Also Published As

Publication number Publication date
EP2638541A1 (en) 2013-09-18
JP2013542469A (ja) 2013-11-21
CN103189916B (zh) 2015-11-25
US9208799B2 (en) 2015-12-08
US20130231926A1 (en) 2013-09-05
CN103189916A (zh) 2013-07-03
BR112013011312A2 (pt) 2019-09-24
JP5992427B2 (ja) 2016-09-14
RU2013126409A (ru) 2014-12-20
WO2012063185A1 (en) 2012-05-18

Similar Documents

Publication Publication Date Title
RU2587652C2 (ru) Способ и устройство для оценки структуры в сигнале
Kim et al. Feature extraction for robust speech recognition based on maximizing the sharpness of the power distribution and on power flooring
US10510363B2 (en) Pitch detection algorithm based on PWVT
CN111128213B (zh) 一种分频段进行处理的噪声抑制方法及其系统
EP2178082A1 (en) Cyclic signal processing method, cyclic signal conversion method, cyclic signal processing device, and cyclic signal analysis method
Ganapathy et al. Feature extraction using 2-d autoregressive models for speaker recognition.
KR20130057668A (ko) 켑스트럼 특징벡터에 기반한 음성인식 장치 및 방법
Krishnamoorthy et al. Enhancement of noisy speech by temporal and spectral processing
Sebastian et al. An analysis of the high resolution property of group delay function with applications to audio signal processing
BRPI0208584B1 (pt) método para formação de parâmetros de reconhecimento de fala
JP2020076907A (ja) 信号処理装置、信号処理プログラム及び信号処理方法
CN118314919B (zh) 语音修复方法、装置、音频设备及存储介质
JP7461192B2 (ja) 基本周波数推定装置、アクティブノイズコントロール装置、基本周波数の推定方法及び基本周波数の推定プログラム
Rao et al. A comparative study of various pitch detection algorithms
JP2880683B2 (ja) 雑音抑制装置
CN110189765B (zh) 基于频谱形状的语音特征估计方法
Bonifaco et al. Comparative analysis of filipino-based rhinolalia aperta speech using mel frequency cepstral analysis and Perceptual Linear Prediction
Rahman et al. Pitch determination using autocorrelation function in spectral domain.
Cui Pitch extraction based on weighted autocorrelation function in speech signal processing
Dörfler et al. Adaptive Gabor frames by projection onto time-frequency subspaces
JP5495858B2 (ja) 音楽音響信号のピッチ推定装置及び方法
Ahmed Active voice detection using ridgelet transform
Vích et al. Speech spectrum envelope modeling
Wiriyarattanakul et al. Accuracy Improvement of MFCC Based Speech Recognition by Preventing DFT Leakage Using Pitch Segmentation
Reju et al. A computationally efficient noise estimation algorithm for speech enhancement

Legal Events

Date Code Title Description
MM4A The patent is invalid due to non-payment of fees

Effective date: 20171108