CA1147071A - Method of and apparatus for detecting speech in a voice channel signal - Google Patents
Method of and apparatus for detecting speech in a voice channel signalInfo
- Publication number
- CA1147071A CA1147071A CA000359968A CA359968A CA1147071A CA 1147071 A CA1147071 A CA 1147071A CA 000359968 A CA000359968 A CA 000359968A CA 359968 A CA359968 A CA 359968A CA 1147071 A CA1147071 A CA 1147071A
- Authority
- CA
- Canada
- Prior art keywords
- sample
- magnitude
- signal
- speech
- respect
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired
Links
- 238000000034 method Methods 0.000 title claims abstract 14
- 238000012935 Averaging Methods 0.000 claims abstract 3
- 230000001419 dependent effect Effects 0.000 claims 3
- 230000003247 decreasing effect Effects 0.000 claims 2
- 230000004044 response Effects 0.000 claims 2
- 230000003111 delayed effect Effects 0.000 claims 1
- 230000015654 memory Effects 0.000 claims 1
- 206010019133 Hangover Diseases 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Time-Division Multiplex Systems (AREA)
- Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
Abstract
METHOD OF AND APPARATUS FOR DETECTING
SPEECH IN A VOICE CHANNEL SIGNAL
Abstract of the Disclosure In a digital speech detector, a first detector produces an output signal if any sample of a voice channel signal has a magnitude exceeding a fixed threshold above noise levels. A
second detector produces a second threshold which is adaptively adjusted to noise levels on the channel by being set to a level above the current sample magnitude each time that this does not exceed the preceding sample's magnitude. When the sample magnitude rises above the second threshold, and for each subsequent sample of successively increasing magnitude, the second detector produces an output signal. A speech decision for the voice channel is reached if either detector produces its output signal. The output signals of the first and second detectors are maintained for fixed and variable, respectively, hangover periods to maintain the speech decision during intersyllabic pauses in speech. The samples are supplied to the speech detector by an offset remover and averaging circuit.
- i -
SPEECH IN A VOICE CHANNEL SIGNAL
Abstract of the Disclosure In a digital speech detector, a first detector produces an output signal if any sample of a voice channel signal has a magnitude exceeding a fixed threshold above noise levels. A
second detector produces a second threshold which is adaptively adjusted to noise levels on the channel by being set to a level above the current sample magnitude each time that this does not exceed the preceding sample's magnitude. When the sample magnitude rises above the second threshold, and for each subsequent sample of successively increasing magnitude, the second detector produces an output signal. A speech decision for the voice channel is reached if either detector produces its output signal. The output signals of the first and second detectors are maintained for fixed and variable, respectively, hangover periods to maintain the speech decision during intersyllabic pauses in speech. The samples are supplied to the speech detector by an offset remover and averaging circuit.
- i -
Claims (13)
AN EXCLUSIVE PROPERTY OR PRIVILEGE IS CLAIMED ARE DEFINED
AS FOLLOWS:-
1. A method of detecting the presence of speech in a sampled voice channel signal, comprising the steps of:-setting a threshold, to a level which is greater than and is dependent upon the magnitude of the current sample, whenever the magnitude of the current sample is not greater than that of the preceding sample;
and providing an indication of the presence of speech whenever the magnitude of the current sample is greater than that of the preceding sample and exceeds said threshold level.
and providing an indication of the presence of speech whenever the magnitude of the current sample is greater than that of the preceding sample and exceeds said threshold level.
2. A method as claimed in claim 1 and including maintaining said indication in respect of a number of samples following each sample whose magnitude is greater than that of the preceding sample.
3. A method as claimed in claim 2 and including the step of determining the number of samples in respect of which said indication is maintained in dependence upon previous sample magnitudes, said number being increased, up to a maximum number, for each sample whose magnitude is greater than that of the preceding sample and being decreased, down to a minimum number, for each sample whose magnitude is not greater than that of the preceding sample.
4. A method as claimed in claim 1, 2, or 3 and including the step of providing an indication of the presence of speech in respect of each sample whose magnitude exceeds a fixed threshold level.
5. A method of detecting the presence of speech signals in a sampled voice channel signal, comprising the steps of:-producing a first signal state whenever the magnitude of a signal sample exceeds a first threshold level;
comparing the magnitude of each sample with that of the preceding sample;
whenever the magnitude of a sample is not greater than that of the preceding sample, setting a second threshold to a level which is greater than and is dependent upon the magnitude of the current sample;
whenever the magnitude of a sample is greater than that of the preceding sample, producing a second signal state if the magnitude of the current sample exceeds the second threshold level; and in response to each of the first and the second signal states, producing a signal, representing the presence of speech, at least in respect of the current sample.
comparing the magnitude of each sample with that of the preceding sample;
whenever the magnitude of a sample is not greater than that of the preceding sample, setting a second threshold to a level which is greater than and is dependent upon the magnitude of the current sample;
whenever the magnitude of a sample is greater than that of the preceding sample, producing a second signal state if the magnitude of the current sample exceeds the second threshold level; and in response to each of the first and the second signal states, producing a signal, representing the presence of speech, at least in respect of the current sample.
6. A method as claimed in claim 5 and including the steps of:-whenever the magnitude of a sample does not exceed the first threshold level and the first signal state was produced in respect of the preceding sample, producing a third signal state in respect of a first predetermined number of consecutive samples commencing with the current sample;
whenever the magnitude of a sample is not greater than that of the preceding sample and the second signal state was produced in respect of said preceding sample, producing a fourth signal state in respect of a second number of consecutive samples commencing with the current sample; and producing the signal representing the presence of speech in response to each of the third and fourth signal states.
whenever the magnitude of a sample is not greater than that of the preceding sample and the second signal state was produced in respect of said preceding sample, producing a fourth signal state in respect of a second number of consecutive samples commencing with the current sample; and producing the signal representing the presence of speech in response to each of the third and fourth signal states.
7. A method as claimed in claim 6 and including the step of determining said second number in dependence upon previous sample magnitudes, said second number being increased by a predetermined amount, up to a maximum number, for each sample in respect of which the second signal state is produced, and being decreased, down to a minimum number, for each sample whose magnitude is not greater than the magnitude of the preceding sample.
8. A method as claimed in claim 6 or 7 and including the steps of:-whenever the magnitude of a sample exceeds that of the preceding sample, and in respect of said preceding sample the fourth signal state was produced but the second signal state was not produced, producing the second signal state in respect of the current sample if its magnitude exceeds a third threshold level; and setting the third threshold level equal to the magnitude of the preceding sample whenever the second signal state was produced in respect of said preceding sample and the magnitude of the current sample is not greater than the magnitude of said preceding sample.
9. A method as claimed in claim 5 wherein, each time that the second threshold level is set, it is set to be greater than the magnitude of the current sample by a predetermined amount.
10. A method as claimed in claim 5, 6, or 7 wherein each signal sample is constituted by an average of a plurality of individual samples of the voice channel signal, the method further comprising the step of producing each signal sample by removing d.c. offsets from and averaging a plurality of individual samples of the voice channel signal.
11. A speech detector comprising one or more read-only memories programmed and arranged to carry out the method of claim 1, 2, or 5.
12. A speech detector for detecting the presence of speech signals in a sampled voice channel signal, comprising:-means for producing a first signal state whenever the magnitude of a signal sample exceeds a first threshold level;
means for generating a second threshold;
means for delaying each sample until the next sample;
means for comparing the magnitude of each sample with that of the preceding sample delayed by said delaying means;
means, responsive to said comparing means determining that the magnitude of a sample is not greater than that of the preceding sample, for setting the second threshold to a level which is greater than and is dependent upon the magnitude of the current sample;
means, responsive to said comparing means determining that the magnitude of a sample is greater than that of the preceding sample, for producing a second signal state if the magnitude of the current sample exceeds the second threshold level; and means responsive to each of the first and second signal states for producing a signal, representing the presence of speech, at least in respect of the current sample.
means for generating a second threshold;
means for delaying each sample until the next sample;
means for comparing the magnitude of each sample with that of the preceding sample delayed by said delaying means;
means, responsive to said comparing means determining that the magnitude of a sample is not greater than that of the preceding sample, for setting the second threshold to a level which is greater than and is dependent upon the magnitude of the current sample;
means, responsive to said comparing means determining that the magnitude of a sample is greater than that of the preceding sample, for producing a second signal state if the magnitude of the current sample exceeds the second threshold level; and means responsive to each of the first and second signal states for producing a signal, representing the presence of speech, at least in respect of the current sample.
13. A speech detector as claimed in claim 12 and including means for producing each signal sample by removing d.c. offsets from and averaging a plurality of individual samples of the voice channel signal.
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA000359968A CA1147071A (en) | 1980-09-09 | 1980-09-09 | Method of and apparatus for detecting speech in a voice channel signal |
DE8181303695T DE3164171D1 (en) | 1980-09-09 | 1981-08-13 | Method and apparatus for detecting speech in a voice channel signal |
EP81303695A EP0047589B1 (en) | 1980-09-09 | 1981-08-13 | Method and apparatus for detecting speech in a voice channel signal |
JP56140449A JPS5781733A (en) | 1980-09-09 | 1981-09-08 | Method and means for detecting voice in voice channel signal |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA000359968A CA1147071A (en) | 1980-09-09 | 1980-09-09 | Method of and apparatus for detecting speech in a voice channel signal |
Publications (1)
Publication Number | Publication Date |
---|---|
CA1147071A true CA1147071A (en) | 1983-05-24 |
Family
ID=4117844
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA000359968A Expired CA1147071A (en) | 1980-09-09 | 1980-09-09 | Method of and apparatus for detecting speech in a voice channel signal |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP0047589B1 (en) |
JP (1) | JPS5781733A (en) |
CA (1) | CA1147071A (en) |
DE (1) | DE3164171D1 (en) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA1137240A (en) * | 1980-09-09 | 1982-12-07 | Northern Telecom Limited | Method of and apparatus for echo detection in voice channel signals |
DE3276731D1 (en) * | 1982-04-27 | 1987-08-13 | Philips Nv | Speech analysis system |
DE3243231A1 (en) * | 1982-11-23 | 1984-05-24 | Philips Kommunikations Industrie AG, 8500 Nürnberg | METHOD FOR DETECTING VOICE BREAKS |
DE3473373D1 (en) * | 1983-10-13 | 1988-09-15 | Texas Instruments Inc | Speech analysis/synthesis with energy normalization |
JPS619700A (en) * | 1984-06-25 | 1986-01-17 | シャープ株式会社 | Voice feature extraction system |
GB2379148A (en) * | 2001-08-21 | 2003-02-26 | Mitel Knowledge Corp | Voice activity detection |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3712959A (en) * | 1969-07-14 | 1973-01-23 | Communications Satellite Corp | Method and apparatus for detecting speech signals in the presence of noise |
US4052568A (en) * | 1976-04-23 | 1977-10-04 | Communications Satellite Corporation | Digital voice switch |
-
1980
- 1980-09-09 CA CA000359968A patent/CA1147071A/en not_active Expired
-
1981
- 1981-08-13 DE DE8181303695T patent/DE3164171D1/en not_active Expired
- 1981-08-13 EP EP81303695A patent/EP0047589B1/en not_active Expired
- 1981-09-08 JP JP56140449A patent/JPS5781733A/en active Granted
Also Published As
Publication number | Publication date |
---|---|
JPS5781733A (en) | 1982-05-21 |
EP0047589A1 (en) | 1982-03-17 |
JPH0311139B2 (en) | 1991-02-15 |
EP0047589B1 (en) | 1984-06-13 |
DE3164171D1 (en) | 1984-07-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US4357491A (en) | Method of and apparatus for detecting speech in a voice channel signal | |
FI92113C (en) | Speech processor and cellular radio terminal | |
US4028496A (en) | Digital speech detector | |
US4052568A (en) | Digital voice switch | |
US5617508A (en) | Speech detection device for the detection of speech end points based on variance of frequency band limited energy | |
KR890702356A (en) | Noise suppression system | |
US4410763A (en) | Speech detector | |
CA1335003C (en) | Voice activity detection | |
US4455676A (en) | Speech processing system including an amplitude level control circuit for digital processing | |
NO146521B (en) | PROCEDURE AND DEVICE FOR DETECTING PRESENCE OR REQUIREMENTS OF A VOICE SIGNAL ON A VOICE CHANNEL | |
RU96102854A (en) | METHOD AND DEVICE FOR NOISE REDUCTION IN SPEECH SIGNAL | |
KR20000023823A (en) | Speech detection system employing multiple determinants | |
KR930007298B1 (en) | Circuit for detecting and suppressing pulse shaped interferences | |
EP0957660A3 (en) | Audio signal processing and reproducing | |
CA1147071A (en) | Method of and apparatus for detecting speech in a voice channel signal | |
CA1137240A (en) | Method of and apparatus for echo detection in voice channel signals | |
DE69132081D1 (en) | Distinguishing between information and noise in a communication signal | |
US4314100A (en) | Data detection circuit for a TASI system | |
JPS6314529B2 (en) | ||
SU1674226A1 (en) | Method and apparatus for detecting speech signals and their boundaries | |
JP2807457B2 (en) | Voice section detection method | |
JPS6058707A (en) | Automatic gain control circuit | |
SU1075440A1 (en) | Noise suppressor | |
SU1494228A1 (en) | Device for eevaluation of signal-to-noise ratio | |
SU1597898A2 (en) | Method and apparatus for detecting ends of words |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
MKEX | Expiry |