CN1265351C - 用于估计语音信号的音调频率的方法和装置 - Google Patents
用于估计语音信号的音调频率的方法和装置 Download PDFInfo
- Publication number
- CN1265351C CN1265351C CNB2004100059406A CN200410005940A CN1265351C CN 1265351 C CN1265351 C CN 1265351C CN B2004100059406 A CNB2004100059406 A CN B2004100059406A CN 200410005940 A CN200410005940 A CN 200410005940A CN 1265351 C CN1265351 C CN 1265351C
- Authority
- CN
- China
- Prior art keywords
- function
- frequency
- pitch frequency
- initial
- calculating
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000034 method Methods 0.000 title claims description 72
- 230000003595 spectral effect Effects 0.000 claims abstract description 76
- 238000001228 spectrum Methods 0.000 claims abstract description 62
- 230000006870 function Effects 0.000 claims description 229
- 238000012886 linear function Methods 0.000 claims description 16
- 238000013459 approach Methods 0.000 claims description 15
- 230000000737 periodic effect Effects 0.000 claims description 15
- 230000007704 transition Effects 0.000 claims description 9
- 230000004044 response Effects 0.000 claims description 5
- 239000000203 mixture Substances 0.000 description 22
- 238000010586 diagram Methods 0.000 description 20
- 230000008569 process Effects 0.000 description 19
- 238000012545 processing Methods 0.000 description 9
- 238000004458 analytical method Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 230000008859 change Effects 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 238000010606 normalization Methods 0.000 description 3
- 238000005070 sampling Methods 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 230000010363 phase shift Effects 0.000 description 2
- 230000005236 sound signal Effects 0.000 description 2
- 230000002238 attenuated effect Effects 0.000 description 1
- 230000008602 contraction Effects 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000009795 derivation Methods 0.000 description 1
- 238000012854 evaluation process Methods 0.000 description 1
- 238000007667 floating Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 125000006850 spacer group Chemical group 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 210000001260 vocal cord Anatomy 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/90—Pitch determination of speech signals
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
- Electrophonic Musical Instruments (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims (30)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/373,260 | 2003-02-24 | ||
US10/373,260 US7272551B2 (en) | 2003-02-24 | 2003-02-24 | Computational effectiveness enhancement of frequency domain pitch estimators |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1525435A CN1525435A (zh) | 2004-09-01 |
CN1265351C true CN1265351C (zh) | 2006-07-19 |
Family
ID=32868672
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB2004100059406A Expired - Fee Related CN1265351C (zh) | 2003-02-24 | 2004-02-23 | 用于估计语音信号的音调频率的方法和装置 |
Country Status (3)
Country | Link |
---|---|
US (1) | US7272551B2 (zh) |
CN (1) | CN1265351C (zh) |
TW (1) | TWI282972B (zh) |
Families Citing this family (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7598447B2 (en) * | 2004-10-29 | 2009-10-06 | Zenph Studios, Inc. | Methods, systems and computer program products for detecting musical notes in an audio signal |
US8093484B2 (en) * | 2004-10-29 | 2012-01-10 | Zenph Sound Innovations, Inc. | Methods, systems and computer program products for regenerating audio performances |
JP4418390B2 (ja) * | 2005-03-22 | 2010-02-17 | 三菱重工業株式会社 | 3次元形状処理装置及び曲面生成プログラム並びに方法 |
US7783488B2 (en) * | 2005-12-19 | 2010-08-24 | Nuance Communications, Inc. | Remote tracing and debugging of automatic speech recognition servers by speech reconstruction from cepstra and pitch information |
US20090018824A1 (en) * | 2006-01-31 | 2009-01-15 | Matsushita Electric Industrial Co., Ltd. | Audio encoding device, audio decoding device, audio encoding system, audio encoding method, and audio decoding method |
JP4757158B2 (ja) * | 2006-09-20 | 2011-08-24 | 富士通株式会社 | 音信号処理方法、音信号処理装置及びコンピュータプログラム |
FR2911228A1 (fr) * | 2007-01-05 | 2008-07-11 | France Telecom | Codage par transformee, utilisant des fenetres de ponderation et a faible retard. |
CN102610222B (zh) * | 2007-02-01 | 2014-08-20 | 缪斯亚米有限公司 | 音乐转录的方法,系统和装置 |
WO2008101130A2 (en) * | 2007-02-14 | 2008-08-21 | Museami, Inc. | Music-based search engine |
JP4882899B2 (ja) * | 2007-07-25 | 2012-02-22 | ソニー株式会社 | 音声解析装置、および音声解析方法、並びにコンピュータ・プログラム |
US8494257B2 (en) | 2008-02-13 | 2013-07-23 | Museami, Inc. | Music score deconstruction |
CN101556795B (zh) * | 2008-04-09 | 2012-07-18 | 展讯通信(上海)有限公司 | 计算语音基音频率的方法及设备 |
CN101727902B (zh) * | 2008-10-29 | 2011-08-10 | 中国科学院自动化研究所 | 一种对语调进行评估的方法 |
US8176067B1 (en) | 2010-02-24 | 2012-05-08 | A9.Com, Inc. | Fixed phrase detection for search |
WO2012063185A1 (en) * | 2010-11-10 | 2012-05-18 | Koninklijke Philips Electronics N.V. | Method and device for estimating a pattern in a signal |
CN102655000B (zh) * | 2011-03-04 | 2014-02-19 | 华为技术有限公司 | 一种清浊音分类方法和装置 |
CN102915728B (zh) * | 2011-08-01 | 2014-08-27 | 佳能株式会社 | 声音分段设备和方法以及说话者识别系统 |
CN103258552B (zh) * | 2012-02-20 | 2015-12-16 | 扬智科技股份有限公司 | 调整播放速度的方法 |
CN102779526B (zh) * | 2012-08-07 | 2014-04-16 | 无锡成电科大科技发展有限公司 | 语音信号中基音提取及修正方法 |
US9263061B2 (en) * | 2013-05-21 | 2016-02-16 | Google Inc. | Detection of chopped speech |
US9548067B2 (en) | 2014-09-30 | 2017-01-17 | Knuedge Incorporated | Estimating pitch using symmetry characteristics |
US9396740B1 (en) * | 2014-09-30 | 2016-07-19 | Knuedge Incorporated | Systems and methods for estimating pitch in audio signals based on symmetry characteristics independent of harmonic amplitudes |
US9870785B2 (en) | 2015-02-06 | 2018-01-16 | Knuedge Incorporated | Determining features of harmonic signals |
US9842611B2 (en) | 2015-02-06 | 2017-12-12 | Knuedge Incorporated | Estimating pitch using peak-to-peak distances |
CN107430850A (zh) * | 2015-02-06 | 2017-12-01 | 弩锋股份有限公司 | 确定谐波信号的特征 |
US9922668B2 (en) | 2015-02-06 | 2018-03-20 | Knuedge Incorporated | Estimating fractional chirp rate with multiple frequency representations |
PT3443557T (pt) * | 2016-04-12 | 2020-08-27 | Fraunhofer Ges Forschung | Codificador de áudio para codificar um sinal de áudio, método para codificar um sinal de áudio e programa de computador sob consideração de uma região espectral de pico detetada numa banda de frequência superior |
EP3382704A1 (en) * | 2017-03-31 | 2018-10-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for determining a predetermined characteristic related to a spectral enhancement processing of an audio signal |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
NL177950C (nl) * | 1978-12-14 | 1986-07-16 | Philips Nv | Spraakanalysesysteem voor het bepalen van de toonhoogte in menselijke spraak. |
NL8400552A (nl) * | 1984-02-22 | 1985-09-16 | Philips Nv | Systeem voor het analyseren van menselijke spraak. |
US6876953B1 (en) * | 2000-04-20 | 2005-04-05 | The United States Of America As Represented By The Secretary Of The Navy | Narrowband signal processor |
US6587816B1 (en) | 2000-07-14 | 2003-07-01 | International Business Machines Corporation | Fast frequency-domain pitch estimation |
TW589618B (en) * | 2001-12-14 | 2004-06-01 | Ind Tech Res Inst | Method for determining the pitch mark of speech |
CN1430204A (zh) * | 2001-12-31 | 2003-07-16 | 佳能株式会社 | 波形信号分析、基音探测以及句子探测的方法和设备 |
-
2003
- 2003-02-24 US US10/373,260 patent/US7272551B2/en active Active
-
2004
- 2004-02-19 TW TW093104139A patent/TWI282972B/zh not_active IP Right Cessation
- 2004-02-23 CN CNB2004100059406A patent/CN1265351C/zh not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
US20040167775A1 (en) | 2004-08-26 |
TW200508581A (en) | 2005-03-01 |
TWI282972B (en) | 2007-06-21 |
CN1525435A (zh) | 2004-09-01 |
US7272551B2 (en) | 2007-09-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1265351C (zh) | 用于估计语音信号的音调频率的方法和装置 | |
CN1248190C (zh) | 快速频域音调估计方法和装置 | |
US7181390B2 (en) | Noise reduction using correction vectors based on dynamic aspects of speech and noise normalization | |
EP2659481B1 (en) | Scene change detection around a set of seed points in media data | |
Lee | Noise robust pitch tracking by subband autocorrelation classification | |
Gerhard | Pitch extraction and fundamental frequency: History and current techniques | |
EP2791935B1 (en) | Low complexity repetition detection in media data | |
AU746342B2 (en) | Method and apparatus for pitch estimation using perception based analysis by synthesis | |
US10255903B2 (en) | Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system | |
CN1910651A (zh) | 特定音响信号含有区间检测系统及其方法以及程序 | |
US7835905B2 (en) | Apparatus and method for detecting degree of voicing of speech signal | |
US7680657B2 (en) | Auto segmentation based partitioning and clustering approach to robust endpointing | |
WO2017061985A1 (en) | Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system | |
De Mulder et al. | An auditory model based transcriber of vocal queries | |
US7012186B2 (en) | 2-phase pitch detection method and apparatus | |
CN1214362C (zh) | 用于确定信号间相关系数和信号音高的设备和方法 | |
Jamaludin et al. | An improved time domain pitch detection algorithm for pathological voice | |
de León et al. | A complex wavelet based fundamental frequency estimator in singlechannel polyphonic signals | |
Messaoud et al. | Formant tracking linear prediction model using HMMs for noisy speech processing | |
Ben Messaoud et al. | An efficient method for fundamental frequency determination of noisy speech | |
KR101140737B1 (ko) | 기본 주파수 추출 장치, 보컬 멜로디 추출 장치 및 방법 | |
Trohidis et al. | Tempo induction from music recordings using ensemble empirical mode decomposition analysis | |
Ashouri et al. | Automatic and accurate pitch marking of speech signal using an expert system based on logical combinations of different algorithms outputs |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: NEW ANST COMMUNICATION CO.,LTD. Free format text: FORMER OWNER: INTERNATIONAL BUSINESS MACHINE CORP. Effective date: 20090911 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20090911 Address after: Massachusetts, USA Patentee after: Nuance Communications Inc Address before: American New York Patentee before: International Business Machines Corp. |
|
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20060719 Termination date: 20170223 |
|
CF01 | Termination of patent right due to non-payment of annual fee |