JP5992427B2 - 信号におけるピッチおよび/または基本周波数に関するパターンを推定する方法および装置 - Google Patents

信号におけるピッチおよび/または基本周波数に関するパターンを推定する方法および装置 Download PDF

Info

Publication number
JP5992427B2
JP5992427B2 JP2013538309A JP2013538309A JP5992427B2 JP 5992427 B2 JP5992427 B2 JP 5992427B2 JP 2013538309 A JP2013538309 A JP 2013538309A JP 2013538309 A JP2013538309 A JP 2013538309A JP 5992427 B2 JP5992427 B2 JP 5992427B2
Authority
JP
Japan
Prior art keywords
signal
spectrum
pitch
zero phase
estimating
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2013538309A
Other languages
English (en)
Japanese (ja)
Other versions
JP2013542469A (ja
JP2013542469A5 (enExample
Inventor
エルキャン フェリット ギギ
エルキャン フェリット ギギ
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Koninklijke Philips NV
Original Assignee
Koninklijke Philips NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips NV filed Critical Koninklijke Philips NV
Publication of JP2013542469A publication Critical patent/JP2013542469A/ja
Publication of JP2013542469A5 publication Critical patent/JP2013542469A5/ja
Application granted granted Critical
Publication of JP5992427B2 publication Critical patent/JP5992427B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90Pitch determination of speech signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Auxiliary Devices For Music (AREA)
  • Measurement Of Resistance Or Impedance (AREA)
  • Radar Systems Or Details Thereof (AREA)
JP2013538309A 2010-11-10 2011-11-07 信号におけるピッチおよび/または基本周波数に関するパターンを推定する方法および装置 Active JP5992427B2 (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP10190709.5 2010-11-10
EP10190709 2010-11-10
PCT/IB2011/054951 WO2012063185A1 (en) 2010-11-10 2011-11-07 Method and device for estimating a pattern in a signal

Publications (3)

Publication Number Publication Date
JP2013542469A JP2013542469A (ja) 2013-11-21
JP2013542469A5 JP2013542469A5 (enExample) 2014-12-25
JP5992427B2 true JP5992427B2 (ja) 2016-09-14

Family

ID=44999842

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2013538309A Active JP5992427B2 (ja) 2010-11-10 2011-11-07 信号におけるピッチおよび/または基本周波数に関するパターンを推定する方法および装置

Country Status (7)

Country Link
US (1) US9208799B2 (enExample)
EP (1) EP2638541A1 (enExample)
JP (1) JP5992427B2 (enExample)
CN (1) CN103189916B (enExample)
BR (1) BR112013011312A2 (enExample)
RU (1) RU2587652C2 (enExample)
WO (1) WO2012063185A1 (enExample)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102783034B (zh) * 2011-02-01 2014-12-17 华为技术有限公司 用于提供信号处理系数的方法和设备
JP6114053B2 (ja) * 2013-02-15 2017-04-12 日本電信電話株式会社 音源分離装置、音源分離方法、およびプログラム
EP3537439B1 (en) 2014-05-01 2020-05-13 Nippon Telegraph and Telephone Corporation Periodic-combined-envelope-sequence generation device, periodic-combined-envelope-sequence generation method, periodic-combined-envelope-sequence generation program and recording medium
EP3121814A1 (en) * 2015-07-24 2017-01-25 Sound object techology S.A. in organization A method and a system for decomposition of acoustic signal into sound objects, a sound object and its use
US9717424B2 (en) 2015-10-19 2017-08-01 Garmin Switzerland Gmbh System and method for generating a PPG signal
CN109410980A (zh) * 2016-01-22 2019-03-01 大连民族大学 一种基频估计算法在各类具有谐波结构的信号的基频估计中的应用
EP3396670B1 (en) * 2017-04-28 2020-11-25 Nxp B.V. Speech signal processing
KR101944429B1 (ko) * 2018-11-15 2019-01-30 엘아이지넥스원 주식회사 주파수 분석 방법 및 이를 지원하는 장치
CN110197666B (zh) * 2019-05-30 2022-05-10 广东工业大学 一种基于神经网络的语音识别方法、装置
WO2020261497A1 (ja) * 2019-06-27 2020-12-30 ローランド株式会社 楽音信号のパワーの平坦化方法及び装置、並びに、楽曲のビートタイミング検出方法及び装置
EP3888542A1 (en) 2020-04-01 2021-10-06 Koninklijke Philips N.V. Inductive sensing system and method
CN115067916A (zh) * 2022-06-15 2022-09-20 南京邮电大学 一种基于毫米波雷达的生命体征监测方法
US12336797B2 (en) 2022-10-26 2025-06-24 Garmin International, Inc. Wrist-worn electronic device with optical cardiac monitor
CN116206000B (zh) * 2022-12-12 2025-09-09 中国电子科技集团公司第七研究所 一种基于lab颜色空间映射的幅度相位时频图表达方法
CN118689612B (zh) * 2024-08-23 2024-11-12 卓望数码技术(深圳)有限公司 安全防护任务调度方法、装置、计算机设备及存储介质

Family Cites Families (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3617636A (en) 1968-09-24 1971-11-02 Nippon Electric Co Pitch detection apparatus
US3622966A (en) * 1970-07-17 1971-11-23 Atlantic Richfield Co Wavelet standardization
US4720802A (en) * 1983-07-26 1988-01-19 Lear Siegler Noise compensation arrangement
NL8400552A (nl) 1984-02-22 1985-09-16 Philips Nv Systeem voor het analyseren van menselijke spraak.
GB2165654B (en) * 1984-10-12 1988-05-25 Yue Lin Thomas Hong Method and apparatus for evaluating auditory distortions of an audio system
US5781880A (en) 1994-11-21 1998-07-14 Rockwell International Corporation Pitch lag estimation using frequency-domain lowpass filtering of the linear predictive coding (LPC) residual
WO1997027578A1 (en) 1996-01-26 1997-07-31 Motorola Inc. Very low bit rate time domain speech analyzer for voice messaging
US5864795A (en) * 1996-02-20 1999-01-26 Advanced Micro Devices, Inc. System and method for error correction in a correlation-based pitch estimator
US5946650A (en) * 1997-06-19 1999-08-31 Tritech Microelectronics, Ltd. Efficient pitch estimation method
WO1999003097A2 (en) * 1997-07-11 1999-01-21 Koninklijke Philips Electronics N.V. Transmitter with an improved speech encoder and decoder
KR100269216B1 (ko) * 1998-04-16 2000-10-16 윤종용 스펙트로-템포럴 자기상관을 사용한 피치결정시스템 및 방법
US6459914B1 (en) * 1998-05-27 2002-10-01 Telefonaktiebolaget Lm Ericsson (Publ) Signal noise reduction by spectral subtraction using spectrum dependent exponential gain function averaging
US6067511A (en) * 1998-07-13 2000-05-23 Lockheed Martin Corp. LPC speech synthesis using harmonic excitation generator with phase modulator for voiced speech
US6470311B1 (en) * 1999-10-15 2002-10-22 Fonix Corporation Method and apparatus for determining pitch synchronous frames
US7337107B2 (en) * 2000-10-02 2008-02-26 The Regents Of The University Of California Perceptual harmonic cepstral coefficients as the front-end for speech recognition
RU2234746C2 (ru) * 2002-10-30 2004-08-20 Пермский государственный университет Способ дикторонезависимого распознавания звуков речи
US7272551B2 (en) * 2003-02-24 2007-09-18 International Business Machines Corporation Computational effectiveness enhancement of frequency domain pitch estimators
EP1671316B1 (en) * 2003-09-29 2007-08-01 Koninklijke Philips Electronics N.V. Encoding audio signals
KR100653643B1 (ko) * 2006-01-26 2006-12-05 삼성전자주식회사 하모닉과 비하모닉의 비율을 이용한 피치 검출 방법 및피치 검출 장치
US20090018824A1 (en) * 2006-01-31 2009-01-15 Matsushita Electric Industrial Co., Ltd. Audio encoding device, audio decoding device, audio encoding system, audio encoding method, and audio decoding method
US7778831B2 (en) * 2006-02-21 2010-08-17 Sony Computer Entertainment Inc. Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch
MX2008016163A (es) * 2006-06-30 2009-02-04 Fraunhofer Ges Forschung Codificador de audio, decodificador de audio y procesador de audio con caracteristicas de warping variable de manera dinamica.
CN100541609C (zh) * 2006-09-18 2009-09-16 华为技术有限公司 一种实现开环基音搜索的方法和装置
US8560328B2 (en) * 2006-12-15 2013-10-15 Panasonic Corporation Encoding device, decoding device, and method thereof
EP1944754B1 (en) * 2007-01-12 2016-08-31 Nuance Communications, Inc. Speech fundamental frequency estimator and method for estimating a speech fundamental frequency
US8515759B2 (en) * 2007-04-26 2013-08-20 Dolby International Ab Apparatus and method for synthesizing an output signal
CN101599272B (zh) * 2008-12-30 2011-06-08 华为技术有限公司 基音搜索方法及装置
US20100223061A1 (en) * 2009-02-27 2010-09-02 Nokia Corporation Method and Apparatus for Audio Coding
CN101853240B (zh) * 2009-03-31 2012-07-04 华为技术有限公司 一种信号周期的估计方法和装置
EP2249333B1 (en) * 2009-05-06 2014-08-27 Nuance Communications, Inc. Method and apparatus for estimating a fundamental frequency of a speech signal

Also Published As

Publication number Publication date
EP2638541A1 (en) 2013-09-18
JP2013542469A (ja) 2013-11-21
CN103189916B (zh) 2015-11-25
US9208799B2 (en) 2015-12-08
US20130231926A1 (en) 2013-09-05
CN103189916A (zh) 2013-07-03
BR112013011312A2 (pt) 2019-09-24
RU2587652C2 (ru) 2016-06-20
RU2013126409A (ru) 2014-12-20
WO2012063185A1 (en) 2012-05-18

Similar Documents

Publication Publication Date Title
JP5992427B2 (ja) 信号におけるピッチおよび/または基本周波数に関するパターンを推定する方法および装置
US10510363B2 (en) Pitch detection algorithm based on PWVT
CN103854662B (zh) 基于多域联合估计的自适应语音检测方法
Azarov et al. Instantaneous pitch estimation based on RAPT framework
KR101110141B1 (ko) 주기 신호 처리 방법, 주기 신호 변환 방법, 주기 신호 처리 장치, 및 주기 신호의 분석 방법
Sukhostat et al. A comparative analysis of pitch detection methods under the influence of different noise conditions
Magron et al. Phase reconstruction of spectrograms with linear unwrapping: application to audio signal restoration
CN103154932A (zh) 用于分析信号、提供瞬时频率和短时傅里叶变换的方法以及用于分析信号的设备
Sebastian et al. An analysis of the high resolution property of group delay function with applications to audio signal processing
Krishnamoorthy et al. Enhancement of noisy speech by temporal and spectral processing
JP5325130B2 (ja) Lpc分析装置、lpc分析方法、音声分析合成装置、音声分析合成方法及びプログラム
JP2009211021A (ja) 残響時間推定装置及び残響時間推定方法
CN103839544B (zh) 语音激活检测方法和装置
JP7461192B2 (ja) 基本周波数推定装置、アクティブノイズコントロール装置、基本周波数の推定方法及び基本周波数の推定プログラム
Rao et al. A comparative study of various pitch detection algorithms
JP5495858B2 (ja) 音楽音響信号のピッチ推定装置及び方法
JP2006215228A (ja) 音声信号分析方法およびこの分析方法を実施する装置、この音声信号分析装置を用いた音声認識装置、この分析方法を実行するプログラムおよびその記憶媒体
Rahman et al. Pitch determination using autocorrelation function in spectral domain.
Llerena et al. Pitch detection in pathological voices driven by three tailored classical pitch detection algorithms
Nechifor et al. COMPARISON OF ALGORITHMS FOR FUNDAMENTAL FREQUENCY DETECTION IN THE CONTEXT OF AUDIO PLUG-INS
Chowdhury et al. Improving the harmonic structure of speech spectrum for robust pitch estimation
Benetos Pitched instrument onset detection based on auditory spectra
Hamid et al. A Collelogram based Pitch and Voiced/Unvoiced Classification Method for Real-Time Speech Analysis in Noisy Environment
Rabiner et al. 5 Homomorphic speech analysis.

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20141104

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20141104

RD02 Notification of acceptance of power of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7422

Effective date: 20141104

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20151120

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20151208

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20160209

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20160719

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20160817

R150 Certificate of patent or registration of utility model

Ref document number: 5992427

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250