JP5992427B2 - 信号におけるピッチおよび/または基本周波数に関するパターンを推定する方法および装置 - Google Patents
信号におけるピッチおよび/または基本周波数に関するパターンを推定する方法および装置 Download PDFInfo
- Publication number
- JP5992427B2 JP5992427B2 JP2013538309A JP2013538309A JP5992427B2 JP 5992427 B2 JP5992427 B2 JP 5992427B2 JP 2013538309 A JP2013538309 A JP 2013538309A JP 2013538309 A JP2013538309 A JP 2013538309A JP 5992427 B2 JP5992427 B2 JP 5992427B2
- Authority
- JP
- Japan
- Prior art keywords
- signal
- spectrum
- pitch
- zero phase
- estimating
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/90—Pitch determination of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Signal Processing (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Auxiliary Devices For Music (AREA)
- Measurement Of Resistance Or Impedance (AREA)
- Radar Systems Or Details Thereof (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP10190709.5 | 2010-11-10 | ||
| EP10190709 | 2010-11-10 | ||
| PCT/IB2011/054951 WO2012063185A1 (en) | 2010-11-10 | 2011-11-07 | Method and device for estimating a pattern in a signal |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2013542469A JP2013542469A (ja) | 2013-11-21 |
| JP2013542469A5 JP2013542469A5 (enExample) | 2014-12-25 |
| JP5992427B2 true JP5992427B2 (ja) | 2016-09-14 |
Family
ID=44999842
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2013538309A Active JP5992427B2 (ja) | 2010-11-10 | 2011-11-07 | 信号におけるピッチおよび/または基本周波数に関するパターンを推定する方法および装置 |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US9208799B2 (enExample) |
| EP (1) | EP2638541A1 (enExample) |
| JP (1) | JP5992427B2 (enExample) |
| CN (1) | CN103189916B (enExample) |
| BR (1) | BR112013011312A2 (enExample) |
| RU (1) | RU2587652C2 (enExample) |
| WO (1) | WO2012063185A1 (enExample) |
Families Citing this family (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN102783034B (zh) * | 2011-02-01 | 2014-12-17 | 华为技术有限公司 | 用于提供信号处理系数的方法和设备 |
| JP6114053B2 (ja) * | 2013-02-15 | 2017-04-12 | 日本電信電話株式会社 | 音源分離装置、音源分離方法、およびプログラム |
| EP3537439B1 (en) | 2014-05-01 | 2020-05-13 | Nippon Telegraph and Telephone Corporation | Periodic-combined-envelope-sequence generation device, periodic-combined-envelope-sequence generation method, periodic-combined-envelope-sequence generation program and recording medium |
| EP3121814A1 (en) * | 2015-07-24 | 2017-01-25 | Sound object techology S.A. in organization | A method and a system for decomposition of acoustic signal into sound objects, a sound object and its use |
| US9717424B2 (en) | 2015-10-19 | 2017-08-01 | Garmin Switzerland Gmbh | System and method for generating a PPG signal |
| CN109410980A (zh) * | 2016-01-22 | 2019-03-01 | 大连民族大学 | 一种基频估计算法在各类具有谐波结构的信号的基频估计中的应用 |
| EP3396670B1 (en) * | 2017-04-28 | 2020-11-25 | Nxp B.V. | Speech signal processing |
| KR101944429B1 (ko) * | 2018-11-15 | 2019-01-30 | 엘아이지넥스원 주식회사 | 주파수 분석 방법 및 이를 지원하는 장치 |
| CN110197666B (zh) * | 2019-05-30 | 2022-05-10 | 广东工业大学 | 一种基于神经网络的语音识别方法、装置 |
| WO2020261497A1 (ja) * | 2019-06-27 | 2020-12-30 | ローランド株式会社 | 楽音信号のパワーの平坦化方法及び装置、並びに、楽曲のビートタイミング検出方法及び装置 |
| EP3888542A1 (en) | 2020-04-01 | 2021-10-06 | Koninklijke Philips N.V. | Inductive sensing system and method |
| CN115067916A (zh) * | 2022-06-15 | 2022-09-20 | 南京邮电大学 | 一种基于毫米波雷达的生命体征监测方法 |
| US12336797B2 (en) | 2022-10-26 | 2025-06-24 | Garmin International, Inc. | Wrist-worn electronic device with optical cardiac monitor |
| CN116206000B (zh) * | 2022-12-12 | 2025-09-09 | 中国电子科技集团公司第七研究所 | 一种基于lab颜色空间映射的幅度相位时频图表达方法 |
| CN118689612B (zh) * | 2024-08-23 | 2024-11-12 | 卓望数码技术(深圳)有限公司 | 安全防护任务调度方法、装置、计算机设备及存储介质 |
Family Cites Families (30)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US3617636A (en) | 1968-09-24 | 1971-11-02 | Nippon Electric Co | Pitch detection apparatus |
| US3622966A (en) * | 1970-07-17 | 1971-11-23 | Atlantic Richfield Co | Wavelet standardization |
| US4720802A (en) * | 1983-07-26 | 1988-01-19 | Lear Siegler | Noise compensation arrangement |
| NL8400552A (nl) | 1984-02-22 | 1985-09-16 | Philips Nv | Systeem voor het analyseren van menselijke spraak. |
| GB2165654B (en) * | 1984-10-12 | 1988-05-25 | Yue Lin Thomas Hong | Method and apparatus for evaluating auditory distortions of an audio system |
| US5781880A (en) | 1994-11-21 | 1998-07-14 | Rockwell International Corporation | Pitch lag estimation using frequency-domain lowpass filtering of the linear predictive coding (LPC) residual |
| WO1997027578A1 (en) | 1996-01-26 | 1997-07-31 | Motorola Inc. | Very low bit rate time domain speech analyzer for voice messaging |
| US5864795A (en) * | 1996-02-20 | 1999-01-26 | Advanced Micro Devices, Inc. | System and method for error correction in a correlation-based pitch estimator |
| US5946650A (en) * | 1997-06-19 | 1999-08-31 | Tritech Microelectronics, Ltd. | Efficient pitch estimation method |
| WO1999003097A2 (en) * | 1997-07-11 | 1999-01-21 | Koninklijke Philips Electronics N.V. | Transmitter with an improved speech encoder and decoder |
| KR100269216B1 (ko) * | 1998-04-16 | 2000-10-16 | 윤종용 | 스펙트로-템포럴 자기상관을 사용한 피치결정시스템 및 방법 |
| US6459914B1 (en) * | 1998-05-27 | 2002-10-01 | Telefonaktiebolaget Lm Ericsson (Publ) | Signal noise reduction by spectral subtraction using spectrum dependent exponential gain function averaging |
| US6067511A (en) * | 1998-07-13 | 2000-05-23 | Lockheed Martin Corp. | LPC speech synthesis using harmonic excitation generator with phase modulator for voiced speech |
| US6470311B1 (en) * | 1999-10-15 | 2002-10-22 | Fonix Corporation | Method and apparatus for determining pitch synchronous frames |
| US7337107B2 (en) * | 2000-10-02 | 2008-02-26 | The Regents Of The University Of California | Perceptual harmonic cepstral coefficients as the front-end for speech recognition |
| RU2234746C2 (ru) * | 2002-10-30 | 2004-08-20 | Пермский государственный университет | Способ дикторонезависимого распознавания звуков речи |
| US7272551B2 (en) * | 2003-02-24 | 2007-09-18 | International Business Machines Corporation | Computational effectiveness enhancement of frequency domain pitch estimators |
| EP1671316B1 (en) * | 2003-09-29 | 2007-08-01 | Koninklijke Philips Electronics N.V. | Encoding audio signals |
| KR100653643B1 (ko) * | 2006-01-26 | 2006-12-05 | 삼성전자주식회사 | 하모닉과 비하모닉의 비율을 이용한 피치 검출 방법 및피치 검출 장치 |
| US20090018824A1 (en) * | 2006-01-31 | 2009-01-15 | Matsushita Electric Industrial Co., Ltd. | Audio encoding device, audio decoding device, audio encoding system, audio encoding method, and audio decoding method |
| US7778831B2 (en) * | 2006-02-21 | 2010-08-17 | Sony Computer Entertainment Inc. | Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch |
| MX2008016163A (es) * | 2006-06-30 | 2009-02-04 | Fraunhofer Ges Forschung | Codificador de audio, decodificador de audio y procesador de audio con caracteristicas de warping variable de manera dinamica. |
| CN100541609C (zh) * | 2006-09-18 | 2009-09-16 | 华为技术有限公司 | 一种实现开环基音搜索的方法和装置 |
| US8560328B2 (en) * | 2006-12-15 | 2013-10-15 | Panasonic Corporation | Encoding device, decoding device, and method thereof |
| EP1944754B1 (en) * | 2007-01-12 | 2016-08-31 | Nuance Communications, Inc. | Speech fundamental frequency estimator and method for estimating a speech fundamental frequency |
| US8515759B2 (en) * | 2007-04-26 | 2013-08-20 | Dolby International Ab | Apparatus and method for synthesizing an output signal |
| CN101599272B (zh) * | 2008-12-30 | 2011-06-08 | 华为技术有限公司 | 基音搜索方法及装置 |
| US20100223061A1 (en) * | 2009-02-27 | 2010-09-02 | Nokia Corporation | Method and Apparatus for Audio Coding |
| CN101853240B (zh) * | 2009-03-31 | 2012-07-04 | 华为技术有限公司 | 一种信号周期的估计方法和装置 |
| EP2249333B1 (en) * | 2009-05-06 | 2014-08-27 | Nuance Communications, Inc. | Method and apparatus for estimating a fundamental frequency of a speech signal |
-
2011
- 2011-11-07 US US13/883,647 patent/US9208799B2/en active Active
- 2011-11-07 WO PCT/IB2011/054951 patent/WO2012063185A1/en not_active Ceased
- 2011-11-07 RU RU2013126409/08A patent/RU2587652C2/ru not_active IP Right Cessation
- 2011-11-07 BR BR112013011312A patent/BR112013011312A2/pt not_active IP Right Cessation
- 2011-11-07 EP EP11785135.2A patent/EP2638541A1/en not_active Withdrawn
- 2011-11-07 JP JP2013538309A patent/JP5992427B2/ja active Active
- 2011-11-07 CN CN201180054354.9A patent/CN103189916B/zh active Active
Also Published As
| Publication number | Publication date |
|---|---|
| EP2638541A1 (en) | 2013-09-18 |
| JP2013542469A (ja) | 2013-11-21 |
| CN103189916B (zh) | 2015-11-25 |
| US9208799B2 (en) | 2015-12-08 |
| US20130231926A1 (en) | 2013-09-05 |
| CN103189916A (zh) | 2013-07-03 |
| BR112013011312A2 (pt) | 2019-09-24 |
| RU2587652C2 (ru) | 2016-06-20 |
| RU2013126409A (ru) | 2014-12-20 |
| WO2012063185A1 (en) | 2012-05-18 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP5992427B2 (ja) | 信号におけるピッチおよび/または基本周波数に関するパターンを推定する方法および装置 | |
| US10510363B2 (en) | Pitch detection algorithm based on PWVT | |
| CN103854662B (zh) | 基于多域联合估计的自适应语音检测方法 | |
| Azarov et al. | Instantaneous pitch estimation based on RAPT framework | |
| KR101110141B1 (ko) | 주기 신호 처리 방법, 주기 신호 변환 방법, 주기 신호 처리 장치, 및 주기 신호의 분석 방법 | |
| Sukhostat et al. | A comparative analysis of pitch detection methods under the influence of different noise conditions | |
| Magron et al. | Phase reconstruction of spectrograms with linear unwrapping: application to audio signal restoration | |
| CN103154932A (zh) | 用于分析信号、提供瞬时频率和短时傅里叶变换的方法以及用于分析信号的设备 | |
| Sebastian et al. | An analysis of the high resolution property of group delay function with applications to audio signal processing | |
| Krishnamoorthy et al. | Enhancement of noisy speech by temporal and spectral processing | |
| JP5325130B2 (ja) | Lpc分析装置、lpc分析方法、音声分析合成装置、音声分析合成方法及びプログラム | |
| JP2009211021A (ja) | 残響時間推定装置及び残響時間推定方法 | |
| CN103839544B (zh) | 语音激活检测方法和装置 | |
| JP7461192B2 (ja) | 基本周波数推定装置、アクティブノイズコントロール装置、基本周波数の推定方法及び基本周波数の推定プログラム | |
| Rao et al. | A comparative study of various pitch detection algorithms | |
| JP5495858B2 (ja) | 音楽音響信号のピッチ推定装置及び方法 | |
| JP2006215228A (ja) | 音声信号分析方法およびこの分析方法を実施する装置、この音声信号分析装置を用いた音声認識装置、この分析方法を実行するプログラムおよびその記憶媒体 | |
| Rahman et al. | Pitch determination using autocorrelation function in spectral domain. | |
| Llerena et al. | Pitch detection in pathological voices driven by three tailored classical pitch detection algorithms | |
| Nechifor et al. | COMPARISON OF ALGORITHMS FOR FUNDAMENTAL FREQUENCY DETECTION IN THE CONTEXT OF AUDIO PLUG-INS | |
| Chowdhury et al. | Improving the harmonic structure of speech spectrum for robust pitch estimation | |
| Benetos | Pitched instrument onset detection based on auditory spectra | |
| Hamid et al. | A Collelogram based Pitch and Voiced/Unvoiced Classification Method for Real-Time Speech Analysis in Noisy Environment | |
| Rabiner et al. | 5 Homomorphic speech analysis. |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20141104 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20141104 |
|
| RD02 | Notification of acceptance of power of attorney |
Free format text: JAPANESE INTERMEDIATE CODE: A7422 Effective date: 20141104 |
|
| A977 | Report on retrieval |
Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20151120 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20151208 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20160209 |
|
| TRDD | Decision of grant or rejection written | ||
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20160719 |
|
| A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20160817 |
|
| R150 | Certificate of patent or registration of utility model |
Ref document number: 5992427 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |