EP1143412A1 - Estimating the pitch of a speech signal using an intermediate binary signal - Google Patents
- Publication number
- EP1143412A1 (application number EP00610034A)
- Authority
- EP
- European Patent Office
- Prior art keywords
- signal
- pitch
- speech
- speech signal
- autocorrelation
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/90—Pitch determination of speech signals
Definitions
- the intermediate signal may be provided by calculating the autocorrelation of a signal derived from the speech signal by filtering the speech signal through a filter based on a set of filter parameters estimated by means of linear predictive analysis (LPA).
- LPA linear predictive analysis
- the best estimate is achieved when the device is adapted to select the sample having the maximum amplitude of said conformity function as the estimate of the pitch.
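The LPA-based filtering mentioned in the definitions above can be sketched as follows. This is a minimal illustration, not the patent's implementation: the autocorrelation method with a Levinson-Durbin recursion, the predictor order, the absence of windowing, and the function names `autocorr`, `lpc_coefficients` and `inverse_filter` are all assumptions made for the example. The residual of the inverse filter is one possible intermediate signal.

```python
def autocorr(x, max_lag):
    """Autocorrelation r[k] = sum_n x[n] * x[n + k] for k = 0..max_lag."""
    n = len(x)
    return [sum(x[i] * x[i + k] for i in range(n - k))
            for k in range(max_lag + 1)]


def lpc_coefficients(x, order):
    """Levinson-Durbin recursion (assumed method).

    Returns [1, a1, ..., ap], the coefficients of the inverse filter A(z).
    """
    r = autocorr(x, order)
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:  # silent or perfectly predicted segment: stop early
            break
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err  # reflection coefficient
        prev = a[:]
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a


def inverse_filter(x, a):
    """Residual e[n] = sum_j a[j] * x[n - j], taking x[n] = 0 for n < 0."""
    return [sum(a[j] * x[n - j] for j in range(len(a)) if n - j >= 0)
            for n in range(len(x))]


# Synthetic "voiced" segment (illustrative data only): one excitation pulse
# every 40 samples, shaped by a one-pole filter with coefficient 0.9.
x, state = [], 0.0
for n in range(160):
    state = 0.9 * state + (1.0 if n % 40 == 0 else 0.0)
    x.append(state)

a = lpc_coefficients(x, order=1)
residual = inverse_filter(x, a)
```

In this toy case the residual is close to the original pulse train, which is the property that makes it convenient to threshold afterwards.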
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Mobile Radio Communication Systems (AREA)
Claims (11)
- A method of estimating the pitch of a speech signal (2), said method comprising the steps of:
  - sampling the speech signal to obtain a series of samples,
  - dividing the series of samples into segments, each segment having a fixed number of consecutive samples,
  - calculating for each segment a conformity function for the signal, and
  - detecting peaks in the conformity function,
  - providing an intermediate signal derived from the speech signal,
  - converting said intermediate signal to a binary signal, said binary signal being set to logical "1" where the intermediate signal exceeds a pre-selected threshold and to logical "0" where the intermediate signal does not exceed the pre-selected threshold,
  - calculating the autocorrelation of the binary signal, and
  - using the distance between peaks in the autocorrelation of the binary signal as an estimate of the pitch.
- A method according to claim 1, characterized in that the intermediate signal is provided by filtering the speech signal through a filter (4) based on a set of filter parameters estimated by means of linear predictive analysis (LPA).
- A method according to claim 1, characterized in that the intermediate signal is provided by calculating the autocorrelation of a signal derived from the speech signal by filtering the speech signal through a filter (4) based on a set of filter parameters estimated by means of linear predictive analysis (LPA).
- A method according to any one of claims 1 to 3, characterized in that it further comprises the step of:
  - selecting, if the peak corresponding to the distance between the peaks is represented by a number of samples, the sample having the maximum amplitude of said conformity function as the estimate of the pitch.
- Use of the method according to any one of claims 1 to 4 in a mobile telephone.
- A device adapted to estimate the pitch of a speech signal, and comprising:
  - means (3) for sampling the speech signal to obtain a series of samples,
  - means for dividing the series of samples into segments, each segment having a fixed number of consecutive samples,
  - means (5) for calculating for each segment a conformity function for the signal, and
  - means (6) for detecting peaks in the conformity function,
  - means for providing an intermediate signal derived from the speech signal,
  - means (8) for converting said intermediate signal to a binary signal, said binary signal being set to logical "1" where the intermediate signal exceeds a pre-selected threshold and to logical "0" where the intermediate signal does not exceed the pre-selected threshold,
  - means (5) for calculating the autocorrelation of the binary signal, and
  - means for using the distance between peaks in the autocorrelation of the binary signal as an estimate of the pitch.
- A device according to claim 6, characterized in that the device is adapted to provide the intermediate signal by filtering the speech signal through a filter (4) based on a set of filter parameters estimated by means of linear predictive analysis (LPA).
- A device according to claim 6, characterized in that the device is adapted to provide the intermediate signal by calculating the autocorrelation of a signal derived from the speech signal by filtering the speech signal through a filter (4) based on a set of filter parameters estimated by means of linear predictive analysis (LPA).
- A device according to any one of claims 6 to 8, characterized in that it is further adapted to select, if the peak corresponding to the distance between the peaks is represented by a number of samples, the sample having the maximum amplitude of said conformity function as the estimate of the pitch.
- A device according to any one of claims 6 to 9, characterized in that the device is a mobile telephone.
- A device according to any one of claims 6 to 9, characterized in that the device is an integrated circuit.
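The binary-signal steps of the method claim (thresholding, autocorrelation of the binary signal, peak distance) can be sketched roughly as below. This is an illustration under stated assumptions rather than the claimed implementation: the function names, the threshold value, the synthetic test segment, the 8 kHz sampling rate, and the choice to measure the distance from the zero-lag peak to the first following local maximum are all hypothetical choices made for the example.

```python
import math


def binarize(signal, threshold):
    """Set 1 where the sample exceeds the threshold, 0 otherwise."""
    return [1 if s > threshold else 0 for s in signal]


def autocorrelation(seq, max_lag):
    """Unnormalised autocorrelation of seq for lags 0..max_lag."""
    n = len(seq)
    return [sum(seq[i] * seq[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def first_peak_lag(acf):
    """Lag of the first local maximum after lag 0, or None if none exists.

    The zero-lag value is always a maximum of an autocorrelation, so this lag
    equals the distance between the first two peaks (an assumed reading of
    "distance between peaks").
    """
    for lag in range(1, len(acf) - 1):
        if acf[lag] > acf[lag - 1] and acf[lag] >= acf[lag + 1]:
            return lag
    return None


def estimate_pitch_lag(segment, threshold, max_lag):
    """Pitch period in samples for one segment, or None if no peak is found."""
    bits = binarize(segment, threshold)
    return first_peak_lag(autocorrelation(bits, max_lag))


# Illustrative segment: low-level background plus a pulse every 40 samples.
segment = [0.1 * math.sin(0.3 * n) for n in range(160)]
for n in range(0, 160, 40):
    segment[n] = 1.0

lag = estimate_pitch_lag(segment, threshold=0.5, max_lag=100)
sample_rate = 8000  # assumed sampling rate in Hz
pitch_hz = sample_rate / lag if lag else None
```

Because the binary signal contains only 0s and 1s, each product in the autocorrelation reduces to a logical AND, which is presumably why the approach suits low-power devices such as the mobile telephone named in the claims.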
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP00610034A EP1143412A1 (en) | 2000-04-06 | 2000-04-06 | Estimating the pitch of a speech signal using an intermediate binary signal |
PCT/EP2001/003493 WO2001077635A1 (en) | 2000-04-06 | 2001-03-27 | Estimating the pitch of a speech signal using a binary signal |
CN018076890A CN1216361C (en) | 2000-04-06 | 2001-03-27 | Estimating the pitch of a speech signal using a binary signal |
AU2001273904A AU2001273904A1 (en) | 2000-04-06 | 2001-03-27 | Estimating the pitch of a speech signal using a binary signal |
US09/826,729 US6954726B2 (en) | 2000-04-06 | 2001-04-05 | Method and device for estimating the pitch of a speech signal using a binary signal |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP00610034A EP1143412A1 (en) | 2000-04-06 | 2000-04-06 | Estimating the pitch of a speech signal using an intermediate binary signal |
Publications (1)
Publication Number | Publication Date |
---|---|
EP1143412A1 (en) | 2001-10-10 |
Family ID: 8174382
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP00610034A Withdrawn EP1143412A1 (en) | 2000-04-06 | 2000-04-06 | Estimating the pitch of a speech signal using an intermediate binary signal |
Country Status (1)
Country | Link |
---|---|
EP (1) | EP1143412A1 (en) |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5970441A (en) * | 1997-08-25 | 1999-10-19 | Telefonaktiebolaget Lm Ericsson | Detection of periodicity information from an audio signal |
Non-Patent Citations (1)
Title |
---|
ALKULAIBI A ET AL: "Fast 3-level binary higher order statistics for simultaneous voiced/unvoiced and pitch detection of a speech signal", SIGNAL PROCESSING. EUROPEAN JOURNAL DEVOTED TO THE METHODS AND APPLICATIONS OF SIGNAL PROCESSING, NL, ELSEVIER SCIENCE PUBLISHERS B.V. AMSTERDAM, vol. 63, no. 2, 1 December 1997 (1997-12-01), pages 133 - 140, XP004102257, ISSN: 0165-1684 * |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CA1301339C (en) | Parallel processing pitch detector | |
EP0677202B1 (en) | Discriminating between stationary and non-stationary signals | |
US6865529B2 (en) | Method of estimating the pitch of a speech signal using an average distance between peaks, use of the method, and a device adapted therefor | |
JP2738534B2 (en) | Digital speech coder with different types of excitation information. | |
US20030191638A1 (en) | Method of noise reduction using correction vectors based on dynamic aspects of speech and noise normalization | |
US6954726B2 (en) | Method and device for estimating the pitch of a speech signal using a binary signal | |
KR20050039454A (en) | Pitch detection method and apparatus | |
EP0653091B1 (en) | Discriminating between stationary and non-stationary signals | |
EP0634041B1 (en) | Method and apparatus for encoding/decoding of background sounds | |
JPH08254994A (en) | Reconfiguration of arrangement of sound coded parameter by list (inventory) of sorting and outline | |
US20010029447A1 (en) | Method of estimating the pitch of a speech signal using previous estimates, use of the method, and a device adapted therefor | |
EP0474496B1 (en) | Speech recognition apparatus | |
Ney | An optimization algorithm for determining the endpoints of isolated utterances | |
KR100463657B1 (en) | Apparatus and method of voice region detection | |
JP2002258881A (en) | Device and program for detecting voice | |
EP1143412A1 (en) | Estimating the pitch of a speech signal using an intermediate binary signal | |
EP1143414A1 (en) | Estimating the pitch of a speech signal using previous estimates | |
EP1143413A1 (en) | Estimating the pitch of a speech signal using an average distance between peaks | |
IL108401A (en) | Method and apparatus for indicating the emotional state of a person | |
Ajgou et al. | Novel detection algorithm of speech activity and the impact of speech codecs on remote speaker recognition system | |
JP3571448B2 (en) | Method and apparatus for detecting pitch of audio signal | |
JPH0114599B2 (en) | ||
Ghaemmaghami et al. | Speech endpoint detection using gradient based edge detection techniques | |
JPH0477798A (en) | Feature amount extracting method for frequency envelop component | |
CN116229988A (en) | Voiceprint recognition and authentication method, system and device for personnel of power dispatching system |
Legal Events
Code | Title | Description |
---|---|---|
PUAI | Public reference made under Article 153(3) EPC to a published international application that has entered the European phase | Free format text: ORIGINAL CODE: 0009012 |
AK | Designated contracting states | Kind code of ref document: A1; Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE |
AX | Request for extension of the European patent | Free format text: AL;LT;LV;MK;RO;SI |
17P | Request for examination filed | Effective date: 20020320 |
AKX | Designation fees paid | Free format text: AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE |
RAP1 | Party data changed (applicant data changed or rights of an application transferred) | Owner name: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL) |
17Q | First examination report despatched | Effective date: 20050118 |
GRAP | Despatch of communication of intention to grant a patent | Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
STAA | Information on the status of an EP patent application or granted EP patent | Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
18D | Application deemed to be withdrawn | Effective date: 20060127 |