GB9002852D0 - Methods and apparatus for spectral analysis - Google Patents
Methods and apparatus for spectral analysisInfo
- Publication number
- GB9002852D0 GB9002852D0 GB9002852A GB9002852A GB9002852D0 GB 9002852 D0 GB9002852 D0 GB 9002852D0 GB 9002852 A GB9002852 A GB 9002852A GB 9002852 A GB9002852 A GB 9002852A GB 9002852 D0 GB9002852 D0 GB 9002852D0
- Authority
- GB
- United Kingdom
- Prior art keywords
- formants
- band
- power
- centroids
- measured
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 238000010183 spectrum analysis Methods 0.000 title abstract 2
- 238000000034 method Methods 0.000 title 1
- 238000009826 distribution Methods 0.000 abstract 1
- 238000001228 spectrum Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2225/00—Details of deaf aids covered by H04R25/00, not provided for in any of its subgroups
- H04R2225/43—Signal processing in hearing aids to enhance the speech intelligibility
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
- Measuring Frequencies, Analyzing Spectra (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
In automatic speech recognition it is usual to make a spectral analysis of the incoming speech signal and it can be useful to detect the frequencies and intensities of the formants. However although the formants are mostly quite well defined during vowel sounds there are frequent occasions when this is not so and it is not so during a high proportion of consonant sounds. The present invention determines the frequencies at which the centroids of respective frequency versus power distributions occur in a plurality of frequency bands of a signal representing speech (approximately corresponding to the ranges of individual formants). The centroids have most of the desirable properties of formants but also carry significant information for those sounds for which the conventional definition of formants does not seem appropriate. Preferably the powers in the bands in which the centroids are measured are also determined. The incoming signal is filtered (2) into separate frequency bands and the power in each band is measured (4). The output signal in each band is weighted by 3 dB per octave (5) and then the power in that band is measured (6). The power ratio obtained (7) for a band from the power after 3 dB weighting divided by the power before weighting gives an indication of the position of the centroid of that band in the frequency spectrum. <IMAGE>
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB9002852A GB2240867A (en) | 1990-02-08 | 1990-02-08 | Speech analysis |
EP19910301034 EP0441642A3 (en) | 1990-02-08 | 1991-02-08 | Methods and apparatus for spectral analysis |
JP1791791A JPH05143098A (en) | 1990-02-08 | 1991-02-08 | Method and apparatus for spectrum analysis |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB9002852A GB2240867A (en) | 1990-02-08 | 1990-02-08 | Speech analysis |
Publications (2)
Publication Number | Publication Date |
---|---|
GB9002852D0 true GB9002852D0 (en) | 1990-04-04 |
GB2240867A GB2240867A (en) | 1991-08-14 |
Family
ID=10670649
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
GB9002852A Withdrawn GB2240867A (en) | 1990-02-08 | 1990-02-08 | Speech analysis |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP0441642A3 (en) |
JP (1) | JPH05143098A (en) |
GB (1) | GB2240867A (en) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5623609A (en) * | 1993-06-14 | 1997-04-22 | Hal Trust, L.L.C. | Computer system and computer-implemented process for phonology-based automatic speech recognition |
GB9323991D0 (en) * | 1993-11-22 | 1994-01-12 | Holmes John N | Method and apparatus for spectral analysis |
CA2161045A1 (en) * | 1994-11-15 | 1996-05-16 | Michael L. Wells | Error detector apparatus with digital coordinate transformation |
FR2762180B1 (en) * | 1997-04-15 | 1999-05-07 | Roland Roger Carrat | METHOD AND DEVICE FOR AMPLIFYING AND ENCODING THE VOICE SIGNAL FOR IMPROVING INTELLIGIBILITY IN A NOISE ENVIRONMENT AND FOR CORRECTING DEAFNESSES |
EP1692912A1 (en) * | 2003-12-01 | 2006-08-23 | Koninklijke Philips Electronics N.V. | Selective audio signal enhancement |
JP4568826B2 (en) * | 2005-09-08 | 2010-10-27 | 株式会社国際電気通信基礎技術研究所 | Glottal closure segment detection device and glottal closure segment detection program |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3649765A (en) * | 1969-10-29 | 1972-03-14 | Bell Telephone Labor Inc | Speech analyzer-synthesizer system employing improved formant extractor |
DE2313141A1 (en) * | 1973-03-16 | 1974-09-19 | Philips Patentverwaltung | PROCEDURE AND ARRANGEMENT FOR REAL-TIME DETERMINATION OF THE TRANSMISSION FUNCTIONS OF SYSTEMS |
DE2448909B2 (en) * | 1974-10-15 | 1978-12-07 | Olympia Werke Ag, 2940 Wilhelmshaven | Electrical circuit arrangement for a device for speech recognition |
NL8203520A (en) * | 1982-09-10 | 1984-04-02 | Philips Nv | DIGITAL FILTER DEVICE. |
-
1990
- 1990-02-08 GB GB9002852A patent/GB2240867A/en not_active Withdrawn
-
1991
- 1991-02-08 JP JP1791791A patent/JPH05143098A/en active Pending
- 1991-02-08 EP EP19910301034 patent/EP0441642A3/en not_active Withdrawn
Also Published As
Publication number | Publication date |
---|---|
GB2240867A (en) | 1991-08-14 |
EP0441642A2 (en) | 1991-08-14 |
JPH05143098A (en) | 1993-06-11 |
EP0441642A3 (en) | 1993-03-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Delgutte | Speech coding in the auditory nerve: II. Processing schemes for vowel‐like sounds | |
Heinz et al. | On the properties of voiceless fricative consonants | |
Bladon et al. | Towards an auditory theory of speaker normalization | |
US5884260A (en) | Method and system for detecting and generating transient conditions in auditory signals | |
US6691083B1 (en) | Wideband speech synthesis from a narrowband speech signal | |
Miller et al. | Formant tuning in a professional baritone | |
AR001928A1 (en) | Filter for signal enhancement and modification | |
Smith et al. | Increasing the intelligibility of sung vowels | |
Fujisaki et al. | Analysis, recognition, and perception of voiceless fricative consonants in Japanese | |
Kewley‐Port et al. | Fundamental frequency effects on thresholds for vowel formant discrimination | |
GB9002852D0 (en) | Methods and apparatus for spectral analysis | |
Mitani et al. | Voiceless affricate/fricative distinction by frication duration and amplitude rise slope | |
Bloothooft et al. | Spectral analysis of sung vowels. II. The effect of fundamental frequency on vowel spectra | |
Karnickaya et al. | Auditory processing of steady-state vowels | |
Borst et al. | Speech research devices based on a channel vocoder | |
Majumder et al. | Some studies on acoustic features of human speech in relation to Hindi speech sounds | |
Gerstman | Noise duration as a cue for distinguishing among fricative, affricate, and stop consonants | |
Itahashi et al. | Automatic formant extraction utilizing mel scale and equal loudness contour | |
Brown et al. | An acoustic study of the intelligible utterances of hearing-impaired speakers. | |
Kiukaanniemi et al. | Long-term speech spectra: A computerized method of measurement and a comparative study of Finnish and English data | |
Ramig et al. | Acoustic correlates of aging | |
Gray Jr et al. | Cosh measure for speech processing | |
Velez et al. | Segmentation and classification of nasal phonemes based on their time-frequency representation | |
Mermelstein | On detecting nasals in continuous speech | |
Elie et al. | Glottal opening and strategies of production of fricatives |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
732 | Registration of transactions, instruments or events in the register (sect. 32/1977) | ||
WAP | Application withdrawn, taken to be withdrawn or refused ** after publication under section 16(1) |