TR202021840A1 - Konuşma sinyali aktivite bölgelerinin belirlenmesini sağlayan yöntem. - Google Patents

Konuşma sinyali aktivite bölgelerinin belirlenmesini sağlayan yöntem.

Info

Publication number
TR202021840A1
TR202021840A1 TR2020/21840A TR202021840A TR202021840A1 TR 202021840 A1 TR202021840 A1 TR 202021840A1 TR 2020/21840 A TR2020/21840 A TR 2020/21840A TR 202021840 A TR202021840 A TR 202021840A TR 202021840 A1 TR202021840 A1 TR 202021840A1
Authority
TR
Turkey
Prior art keywords
speech signal
signal activity
determining speech
activity zones
enables
Prior art date
Application number
TR2020/21840A
Other languages
English (en)
Inventor
Özaydin Selma
Original Assignee
Cankaya Ueniversitesi
Çankaya Üni̇versi̇tesi̇
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Cankaya Ueniversitesi, Çankaya Üni̇versi̇tesi̇ filed Critical Cankaya Ueniversitesi
Priority to TR2020/21840A priority Critical patent/TR202021840A1/tr
Priority to US18/017,385 priority patent/US20240013803A1/en
Priority to PCT/TR2021/051163 priority patent/WO2022139730A1/en
Publication of TR202021840A1 publication Critical patent/TR202021840A1/tr

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L2025/783Detection of presence or absence of voice signals based on threshold decision
    • G10L2025/786Adaptive threshold
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/09Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being zero crossing rates
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/84Detection of presence or absence of voice signals for discriminating voice from noise
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93Discriminating between voiced and unvoiced parts of speech signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)

Abstract

Buluş, yeni bir metot önerisi ile konuşma sinyali aktivite bölgelerinin belirlenmesini sağlayan yöntem ile ilgilidir. Buluş özellikle, değişik giriş gürültü sinyal seviyeleri için, artan varyans miktarından en az şekilde etkilenen ve maksimum ortalama enerji seviyelerinin korunduğu bir konuşma aktivite bölgesi (KAB) tespitinin elde edilmesini sağlayan sinyallerin kodlanmasını sağlayan bir yöntem ile ilgilidir.
TR2020/21840A 2020-12-26 2020-12-26 Konuşma sinyali aktivite bölgelerinin belirlenmesini sağlayan yöntem. TR202021840A1 (tr)

Priority Applications (3)

Application Number Priority Date Filing Date Title
TR2020/21840A TR202021840A1 (tr) 2020-12-26 2020-12-26 Konuşma sinyali aktivite bölgelerinin belirlenmesini sağlayan yöntem.
US18/017,385 US20240013803A1 (en) 2020-12-26 2021-11-09 Method enabling the detection of the speech signal activity regions
PCT/TR2021/051163 WO2022139730A1 (en) 2020-12-26 2021-11-09 Method enabling the detection of the speech signal activity regions

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TR2020/21840A TR202021840A1 (tr) 2020-12-26 2020-12-26 Konuşma sinyali aktivite bölgelerinin belirlenmesini sağlayan yöntem.

Publications (1)

Publication Number Publication Date
TR202021840A1 true TR202021840A1 (tr) 2022-07-21

Family

ID=82160037

Family Applications (1)

Application Number Title Priority Date Filing Date
TR2020/21840A TR202021840A1 (tr) 2020-12-26 2020-12-26 Konuşma sinyali aktivite bölgelerinin belirlenmesini sağlayan yöntem.

Country Status (3)

Country Link
US (1) US20240013803A1 (tr)
TR (1) TR202021840A1 (tr)
WO (1) WO2022139730A1 (tr)

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20140026229A (ko) * 2010-04-22 2014-03-05 퀄컴 인코포레이티드 음성 액티비티 검출
US9953661B2 (en) * 2014-09-26 2018-04-24 Cirrus Logic Inc. Neural network voice activity detection employing running range normalization
US20200074997A1 (en) * 2018-08-31 2020-03-05 CloudMinds Technology, Inc. Method and system for detecting voice activity in noisy conditions

Also Published As

Publication number Publication date
WO2022139730A1 (en) 2022-06-30
US20240013803A1 (en) 2024-01-11

Similar Documents

Publication Publication Date Title
US11361784B2 (en) Detector and method for voice activity detection
TR202021840A1 (tr) Konuşma sinyali aktivite bölgelerinin belirlenmesini sağlayan yöntem.
US20120215536A1 (en) Methods and Voice Activity Detectors for Speech Encoders
ATE540398T1 (de) Sprachaktivitätsdetektionseinrichtung und verfahren
RU2012141463A (ru) Способ и система для масштабирования подавления слабого сигнала более сильным в относящихся к речи каналах многоканального звукового сигнала
RU2012145972A (ru) Пространственный аудиопроцессор и способ обеспечения пространственных параметров на основе акустического входного сигнала
DE602006021347D1 (de) Verbessertes verfahren zur signalformung bei der mehrkanal-audiorekonstruktion
EP3118852B1 (en) Method and device for detecting audio signal
ATE434846T1 (de) Mehrstufiger faserverstärker und verfahren zur anpassung einer pumpleistung eines mehrstufigen faserverstärkers
ATE503420T1 (de) Verfahren zur erzeugung von ausgabedaten
DE602005017520D1 (de) Detektionsverfahren für ack/nack-signale und detektor dafür
DE602005027819D1 (de) Verfahren zur rauschverminderung in einer audiovorrichtung und hörgerät mit mitteln zur rauschverminderung
WO2013030345A3 (en) A method and a system for noise suppressing an audio signal
FI3405950T3 (fi) Stereoaudiokoodaus ILD-pohjaisella normalisoinnilla ennen keski/sivupäätöstä
ATE485580T1 (de) System und verfahren zur plapper- geräuschdetektion
CN1666571A (zh) 音频处理
EP1163662A4 (en) METHOD FOR DETERMINING THE PROBABILITY THAT A VOICE SIGNAL IS TRUE
Ding et al. Objective measures for quality assessment of noise-suppressed speech
RU100865U1 (ru) Адаптивный компенсатор помех
Watkins et al. An investigation of audibility effects on cochlear implant speech perception prediction
Fischenich et al. Parametric measurement of the effects of relative loudness on the relative weights
Lukas METHODOLOGY OF THE COMMENTARY
KR102661025B1 (ko) 소음제거 시스템 및 방법
Jeeva et al. Formant filters-based multi-band speech enhancement algorithm for intelligibility improvement
EP3669556B1 (en) Audio processing