KR20160102061A - 누락 데이터에 대한 사인곡선 보간 - Google Patents

누락 데이터에 대한 사인곡선 보간 Download PDF

Info

Publication number
KR20160102061A
KR20160102061A KR1020167020196A KR20167020196A KR20160102061A KR 20160102061 A KR20160102061 A KR 20160102061A KR 1020167020196 A KR1020167020196 A KR 1020167020196A KR 20167020196 A KR20167020196 A KR 20167020196A KR 20160102061 A KR20160102061 A KR 20160102061A
Authority
KR
South Korea
Prior art keywords
objects
peaks
segment
spectrum
interpolation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
KR1020167020196A
Other languages
English (en)
Korean (ko)
Inventor
윌리엄 바스티안 클레인
투라즈 자키자데흐 샤베스타리
Original Assignee
구글 인코포레이티드
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 구글 인코포레이티드 filed Critical 구글 인코포레이티드
Publication of KR20160102061A publication Critical patent/KR20160102061A/ko
Ceased legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/22Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/26Functional testing
    • G06F11/27Built-in tests
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/005Correction of errors induced by the transmission channel, if related to the coding algorithm
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/162Interface to dedicated audio devices, e.g. audio drivers, interface to CODECs
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2210/00Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
    • G10H2210/031Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal
    • G10H2210/051Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal for extraction or detection of onsets of musical sounds or notes, i.e. note attack timings
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2210/00Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
    • G10H2210/031Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal
    • G10H2210/061Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal for extraction of musical phrases, isolation of musically relevant segments, e.g. musical thumbnail generation, or for temporal structure analysis of a musical piece, e.g. determination of the movement sequence of a musical work
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2250/00Aspects of algorithms or signal processing methods without intrinsic musical character, yet specifically adapted for or used in electrophonic musical processing
    • G10H2250/131Mathematical functions for musical analysis, processing, synthesis or composition
    • G10H2250/215Transforms, i.e. mathematical transforms into domains appropriate for musical signal processing, coding or compression
    • G10H2250/235Fourier transform; Discrete Fourier Transform [DFT]; Fast Fourier Transform [FFT]
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2250/00Aspects of algorithms or signal processing methods without intrinsic musical character, yet specifically adapted for or used in electrophonic musical processing
    • G10H2250/541Details of musical waveform synthesis, i.e. audio waveshape processing from individual wavetable samples, independently of their origin or of the sound they represent
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • General Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • Computer Hardware Design (AREA)
  • General Health & Medical Sciences (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Complex Calculations (AREA)
  • Noise Elimination (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Signal Processing For Digital Recording And Reproducing (AREA)
KR1020167020196A 2014-02-28 2015-02-27 누락 데이터에 대한 사인곡선 보간 Ceased KR20160102061A (ko)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US14/194,192 2014-02-28
US14/194,192 US9672833B2 (en) 2014-02-28 2014-02-28 Sinusoidal interpolation across missing data
PCT/US2015/017992 WO2015131040A1 (en) 2014-02-28 2015-02-27 Sinusoidal interpolation across missing data

Related Child Applications (1)

Application Number Title Priority Date Filing Date
KR1020187011991A Division KR102188620B1 (ko) 2014-02-28 2015-02-27 누락 데이터에 대한 사인곡선 보간

Publications (1)

Publication Number Publication Date
KR20160102061A true KR20160102061A (ko) 2016-08-26

Family

ID=52686491

Family Applications (2)

Application Number Title Priority Date Filing Date
KR1020167020196A Ceased KR20160102061A (ko) 2014-02-28 2015-02-27 누락 데이터에 대한 사인곡선 보간
KR1020187011991A Active KR102188620B1 (ko) 2014-02-28 2015-02-27 누락 데이터에 대한 사인곡선 보간

Family Applications After (1)

Application Number Title Priority Date Filing Date
KR1020187011991A Active KR102188620B1 (ko) 2014-02-28 2015-02-27 누락 데이터에 대한 사인곡선 보간

Country Status (8)

Country Link
US (1) US9672833B2 (https=)
EP (1) EP3111444B1 (https=)
JP (1) JP6306718B2 (https=)
KR (2) KR20160102061A (https=)
CN (1) CN105940380B (https=)
AU (1) AU2015222922B2 (https=)
BR (1) BR112016015557B1 (https=)
WO (1) WO2015131040A1 (https=)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3107097B1 (en) * 2015-06-17 2017-11-15 Nxp B.V. Improved speech intelligilibility
US9984701B2 (en) * 2016-06-10 2018-05-29 Apple Inc. Noise detection and removal systems, and related methods
CN108922551B (zh) * 2017-05-16 2021-02-05 博通集成电路(上海)股份有限公司 用于补偿丢失帧的电路及方法
CN111640442B (zh) * 2020-06-01 2023-05-23 北京猿力未来科技有限公司 处理音频丢包的方法、训练神经网络的方法及各自的装置

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3649765A (en) * 1969-10-29 1972-03-14 Bell Telephone Labor Inc Speech analyzer-synthesizer system employing improved formant extractor
JPH06130998A (ja) * 1992-10-22 1994-05-13 Oki Electric Ind Co Ltd 圧縮音声復号化装置
US5517595A (en) * 1994-02-08 1996-05-14 At&T Corp. Decomposition in noise and periodic signal waveforms in waveform interpolation
DE69633164T2 (de) * 1995-05-22 2005-08-11 Ntt Mobile Communications Network Inc. Tondekoder
US6377915B1 (en) * 1999-03-17 2002-04-23 Yrp Advanced Mobile Communication Systems Research Laboratories Co., Ltd. Speech decoding using mix ratio table
US20040054525A1 (en) * 2001-01-22 2004-03-18 Hiroshi Sekiguchi Encoding method and decoding method for digital voice data
US7143032B2 (en) * 2001-08-17 2006-11-28 Broadcom Corporation Method and system for an overlap-add technique for predictive decoding based on extrapolation of speech and ringinig waveform
US6747581B2 (en) * 2002-02-01 2004-06-08 Octiv, Inc. Techniques for variable sample rate conversion
US20040002856A1 (en) * 2002-03-08 2004-01-01 Udaya Bhaskar Multi-rate frequency domain interpolative speech CODEC system
JP2006510938A (ja) * 2002-12-19 2006-03-30 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ 音声符号化における正弦波の選択
JP2006510937A (ja) * 2002-12-19 2006-03-30 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ オーディオ符号化における正弦波選択
US7519535B2 (en) * 2005-01-31 2009-04-14 Qualcomm Incorporated Frame erasure concealment in voice communications
WO2006121101A1 (ja) * 2005-05-13 2006-11-16 Matsushita Electric Industrial Co., Ltd. 音声符号化装置およびスペクトル変形方法
US9208821B2 (en) * 2007-08-06 2015-12-08 Apple Inc. Method and system to process digital audio data
CN101437009B (zh) * 2007-11-15 2011-02-02 华为技术有限公司 丢包隐藏的方法及其系统

Also Published As

Publication number Publication date
JP6306718B2 (ja) 2018-04-04
US9672833B2 (en) 2017-06-06
EP3111444A1 (en) 2017-01-04
BR112016015557A2 (https=) 2017-10-03
JP2017509006A (ja) 2017-03-30
WO2015131040A1 (en) 2015-09-03
AU2015222922A1 (en) 2016-06-23
AU2015222922B2 (en) 2017-12-07
CN105940380B (zh) 2019-03-15
KR20180049182A (ko) 2018-05-10
BR112016015557B1 (pt) 2022-11-29
EP3111444B1 (en) 2020-09-02
CN105940380A (zh) 2016-09-14
KR102188620B1 (ko) 2020-12-08
US20150248893A1 (en) 2015-09-03

Similar Documents

Publication Publication Date Title
US8010350B2 (en) Decimated bisectional pitch refinement
US8515085B2 (en) Signal processing apparatus
Halperin et al. Dynamic temporal alignment of speech to lips
WO2008066265A1 (en) Frame error concealment method and apparatus and error concealment scheme construction method and apparatus
US5774836A (en) System and method for performing pitch estimation and error checking on low estimated pitch values in a correlation based pitch estimator
US20080033584A1 (en) Scaled Window Overlap Add for Mixed Signals
US20130208903A1 (en) Reverberation estimator
EP2788980A1 (en) Harmonicity-based single-channel speech quality estimation
ES2994065T3 (en) Pitch lag estimation
KR102188620B1 (ko) 누락 데이터에 대한 사인곡선 보간
WO2021093808A1 (zh) 一种有效语音信号的检测方法、装置及设备
WO1997031366A1 (en) System and method for error correction in a correlation-based pitch estimator
CN106356076B (zh) 基于人工智能的语音活动性检测方法和装置
US9608889B1 (en) Audio click removal using packet loss concealment
JP2001222289A (ja) 音響信号分析方法及び装置並びに音声信号処理方法及び装置
US10636438B2 (en) Method, information processing apparatus for processing speech, and non-transitory computer-readable storage medium
JP6618885B2 (ja) 音声区間検出装置、音声区間検出方法、プログラム
JP2009063700A (ja) 音声信号区間推定装置、方法、プログラムおよびこれを記録した記録媒体
EP4372739B1 (en) Sound signal downmixing method, sound signal encoding method, sound signal downmixing device, sound signal encoding device, and program
JP4413175B2 (ja) 非定常雑音判別方法、その装置、そのプログラム及びその記録媒体
US11004463B2 (en) Speech processing method, apparatus, and non-transitory computer-readable storage medium for storing a computer program for pitch frequency detection based upon a learned value
Prawda et al. Cropping room impulse responses using unimodal regression of their covariance
Kleijn et al. Sinusoidal interpolation across missing data
EP2452293A1 (fr) Localisation de sources
Chelloug et al. An efficient VAD algorithm based on constant False Acceptance rate for highly noisy environments

Legal Events

Date Code Title Description
A201 Request for examination
PA0105 International application

Patent event date: 20160722

Patent event code: PA01051R01D

Comment text: International Patent Application

PA0201 Request for examination
PG1501 Laying open of application
E902 Notification of reason for refusal
PE0902 Notice of grounds for rejection

Comment text: Notification of reason for refusal

Patent event date: 20170711

Patent event code: PE09021S01D

E601 Decision to refuse application
PE0601 Decision on rejection of patent

Patent event date: 20180326

Comment text: Decision to Refuse Application

Patent event code: PE06012S01D

Patent event date: 20170711

Comment text: Notification of reason for refusal

Patent event code: PE06011S01I

A107 Divisional application of patent
J201 Request for trial against refusal decision
PA0104 Divisional application for international application

Comment text: Divisional Application for International Patent

Patent event code: PA01041R01D

Patent event date: 20180426

PJ0201 Trial against decision of rejection

Patent event date: 20180426

Comment text: Request for Trial against Decision on Refusal

Patent event code: PJ02012R01D

Patent event date: 20180326

Comment text: Decision to Refuse Application

Patent event code: PJ02011S01I

Appeal kind category: Appeal against decision to decline refusal

Decision date: 20191115

Appeal identifier: 2018101001813

Request date: 20180426

J301 Trial decision

Free format text: TRIAL NUMBER: 2018101001813; TRIAL DECISION FOR APPEAL AGAINST DECISION TO DECLINE REFUSAL REQUESTED 20180426

Effective date: 20191115

PJ1301 Trial decision

Patent event code: PJ13011S01D

Patent event date: 20191115

Comment text: Trial Decision on Objection to Decision on Refusal

Appeal kind category: Appeal against decision to decline refusal

Request date: 20180426

Decision date: 20191115

Appeal identifier: 2018101001813

J121 Written withdrawal of request for trial
PC1202 Submission of document of withdrawal before decision of registration

Comment text: [Withdrawal of Procedure relating to Patent, etc.] Withdrawal (Abandonment)

Patent event code: PC12021R01D

Patent event date: 20191227

PJ1201 Withdrawal of trial

Patent event code: PJ12011R01D

Patent event date: 20191227

Comment text: Written Withdrawal of Request for Trial

Appeal identifier: 2018101001813

Request date: 20180426

Appeal kind category: Appeal against decision to decline refusal

Decision date: 20191115

WITB Written withdrawal of application