TR202021840A1 - Method enabling the detection of the speech signal activity regions - Google Patents
Method enabling the detection of the speech signal activity regions
- Publication number
- TR202021840A1
- Authority
- TR
- Turkey
- Prior art keywords
- speech signal
- signal activity
- determining speech
- activity zones
- enables
- Prior art date
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L2025/783—Detection of presence or absence of voice signals based on threshold decision
- G10L2025/786—Adaptive threshold
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/09—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being zero crossing rates
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/21—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/84—Detection of presence or absence of voice signals for discriminating voice from noise
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
Abstract
The invention relates to a newly proposed method for detecting the activity regions of a speech signal. In particular, the invention relates to a method for encoding signals that, for different input noise signal levels, yields a speech activity region (KAB, from the Turkish "konuşma aktivite bölgesi") detection that is minimally affected by increasing variance and preserves the maximum average energy levels.
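The abstract does not spell out the algorithm, so the following Python sketch is not the claimed method. It only illustrates the general idea named in the classifications above (power information combined with an adaptive threshold): each frame's mean energy is compared with a multiple of a running noise-floor estimate. The function names, the 160-sample frame length, the factor of 3, the smoothing constant and the synthetic test signal are all assumptions chosen for illustration.

```python
import math
import random


def frame_energies(samples, frame_len):
    """Mean energy of each non-overlapping frame of `frame_len` samples."""
    energies = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        energies.append(sum(x * x for x in frame) / frame_len)
    return energies


def detect_activity(samples, frame_len=160, noise_frames=5, factor=3.0, alpha=0.95):
    """Return one boolean per frame: True where the frame is flagged as active.

    Assumption: the first `noise_frames` frames contain background noise only
    and are used to initialise the noise-floor estimate.
    """
    energies = frame_energies(samples, frame_len)
    if not energies:
        return []
    init = energies[:noise_frames]
    noise_floor = sum(init) / len(init)
    flags = []
    for energy in energies:
        active = energy > factor * noise_floor
        if not active:
            # Adapt the threshold: track the noise floor on inactive frames only.
            noise_floor = alpha * noise_floor + (1 - alpha) * energy
        flags.append(active)
    return flags


def demo_signal(sample_rate=8000, seed=0):
    """Hypothetical test signal: 0.2 s noise, 0.2 s of a 200 Hz tone, 0.2 s noise."""
    rng = random.Random(seed)
    n = sample_rate // 5
    noise_a = [rng.uniform(-0.01, 0.01) for _ in range(n)]
    tone = [0.5 * math.sin(2 * math.pi * 200 * t / sample_rate) for t in range(n)]
    noise_b = [rng.uniform(-0.01, 0.01) for _ in range(n)]
    return noise_a + tone + noise_b


if __name__ == "__main__":
    flags = detect_activity(demo_signal())
    print("".join("1" if f else "0" for f in flags))
```

Updating the noise floor only on inactive frames keeps loud speech from inflating the threshold. A fixed multiple of the noise floor is a simple choice and will misbehave on perfectly silent input, where the estimate is zero.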
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TR2020/21840A TR202021840A1 (tr) | 2020-12-26 | 2020-12-26 | Method enabling the detection of the speech signal activity regions |
US18/017,385 US20240013803A1 (en) | 2020-12-26 | 2021-11-09 | Method enabling the detection of the speech signal activity regions |
PCT/TR2021/051163 WO2022139730A1 (en) | 2020-12-26 | 2021-11-09 | Method enabling the detection of the speech signal activity regions |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TR2020/21840A TR202021840A1 (tr) | 2020-12-26 | 2020-12-26 | Method enabling the detection of the speech signal activity regions |
Publications (1)
Publication Number | Publication Date |
---|---|
TR202021840A1 (tr) | 2022-07-21 |
Family
ID=82160037
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TR2020/21840A TR202021840A1 (tr) | 2020-12-26 | 2020-12-26 | Method enabling the detection of the speech signal activity regions |
Country Status (3)
Country | Link |
---|---|
US (1) | US20240013803A1 (tr) |
TR (1) | TR202021840A1 (tr) |
WO (1) | WO2022139730A1 (tr) |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20140026229A (ko) * | 2010-04-22 | 2014-03-05 | Qualcomm Incorporated | Voice activity detection |
US9953661B2 (en) * | 2014-09-26 | 2018-04-24 | Cirrus Logic Inc. | Neural network voice activity detection employing running range normalization |
US20200074997A1 (en) * | 2018-08-31 | 2020-03-05 | CloudMinds Technology, Inc. | Method and system for detecting voice activity in noisy conditions |
2020
- 2020-12-26 TR TR2020/21840A, published as TR202021840A1, status unknown
2021
- 2021-11-09 US US18/017,385, published as US20240013803A1, active, pending
- 2021-11-09 WO PCT/TR2021/051163, published as WO2022139730A1, application filing
Also Published As
Publication number | Publication date |
---|---|
WO2022139730A1 (en) | 2022-06-30 |
US20240013803A1 (en) | 2024-01-11 |
Similar Documents
Publication | Title |
---|---|
US11361784B2 (en) | Detector and method for voice activity detection |
TR202021840A1 (tr) | Method enabling the detection of the speech signal activity regions |
US20120215536A1 (en) | Methods and Voice Activity Detectors for Speech Encoders |
ATE540398T1 (de) | Voice activity detection device and method |
RU2012141463A (ru) | Method and system for scaling the suppression of a weaker signal by a stronger one in speech-related channels of a multichannel audio signal |
RU2012145972A (ru) | Spatial audio processor and method for providing spatial parameters based on an acoustic input signal |
DE602006021347D1 (de) | Improved method for signal shaping in multi-channel audio reconstruction |
EP3118852B1 (en) | Method and device for detecting audio signal |
ATE434846T1 (de) | Multistage fiber amplifier and method for adapting the pump power of a multistage fiber amplifier |
ATE503420T1 (de) | Method for generating output data |
DE602005017520D1 (de) | Detection method for ACK/NACK signals and detector therefor |
DE602005027819D1 (de) | Method for noise reduction in an audio device and hearing aid with means for noise reduction |
WO2013030345A3 (en) | A method and a system for noise suppressing an audio signal |
FI3405950T3 (fi) | Stereo audio coding with ILD-based normalisation prior to mid/side decision |
ATE485580T1 (de) | System and method for babble noise detection |
CN1666571A (zh) | Audio processing |
EP1163662A4 (en) | METHOD FOR DETERMINING THE PROBABILITY THAT A VOICE SIGNAL IS TRUE |
Ding et al. | Objective measures for quality assessment of noise-suppressed speech |
RU100865U1 (ru) | Adaptive interference canceller |
Watkins et al. | An investigation of audibility effects on cochlear implant speech perception prediction |
Fischenich et al. | Parametric measurement of the effects of relative loudness on the relative weights |
Lukas | METHODOLOGY OF THE COMMENTARY |
KR102661025B1 (ko) | Noise cancellation system and method |
Jeeva et al. | Formant filters-based multi-band speech enhancement algorithm for intelligibility improvement |
EP3669556B1 (en) | Audio processing |