SG11201908390UA - Non-harmonic speech detection and bandwidth extension in a multi-source environment - Google Patents

Non-harmonic speech detection and bandwidth extension in a multi-source environment

Info

Publication number
SG11201908390UA
SG11201908390UA SG11201908390UA SG11201908390UA SG 11201908390U A SG11201908390U A SG 11201908390UA SG 11201908390U A SG11201908390U A SG 11201908390UA SG 11201908390U A SG11201908390U A SG 11201908390UA
Authority
SG
Singapore
Prior art keywords
bitstream
channel
flag
band
signal
Prior art date
Application number
Inventor
Venkata Subrahmanyam Chandra Sekhar Chebiyyam
Venkatraman Atti
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Inc filed Critical Qualcomm Inc
Publication of SG11201908390UA publication Critical patent/SG11201908390UA/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
    • G10L21/0388Details of processing therefor
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques

Abstract

FirstLoudspeaker Second Output Channel 128 Second Loudspeaker 144 First Audio Channel 130 (\"Reference Channel\") Second Audio Channel 132 (Target Channel\") Second Microphone 148 First Microphone 146 Modified Non Harmonic HB Flag (y) 920 Down-Mix Bitstream 216 ICBWE Bitstream 242 High-Band Mid Channel Bitstream 244 Low-Band Bitstream 246 Modified Non Harmonic FIB Flag (y) 920 Down-Mix Bitstream 216 ICBWE Bitstream 242 High-Band Mid Channel Bitstream 244 Low-Band Bitstream 246 100 First Output Channel 126 First Device 104 Memory 153 Instructions 191 Transmitter 110 Input I nterface(s) 112 Encoder 200 Inter-Channel BWE Encoder 204 Non Harmonic HB Flag (x) 910 Modified Non Harmonic HB Flag (y) 920 ( 106 Second Device Decoder 300 I ICBWE Decoder 306 I Modified Non Harmonic HB Flag (y) 920 (12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) (19) World Intellectual Property Organization International Bureau (43) International Publication Date 25 October 2018 (25.10.2018) WIP0 I PCT ill IIIIIl °million °nolo mu imm Eno iflo oimIE (10) International Publication Number WO 2018/195299 Al (51) International Patent Classification: G1OL 19/008 (2013.01) G1OL 21/038 (2013.01) G1OL 19/02 (2013.01) (21) International Application Number: PCT/US2018/028338 (22) International Filing Date: 19 April 2018 (19.04.2018) (25) Filing Language: English (26) Publication Language: English (30) Priority Data: 62/488,654 21 April 2017 (21.04.2017) US 15/956,645 18 April 2018 (18.04.2018) US (71) Applicant: QUALCOMM INCORPORATED [US/US]; ATTN: International IP Administration, 5775 Morehouse Drive, San Diego, California 92121-1714 (US). (72) Inventors: CHEBIYYAM, Venkata Subrahmanyam Chandra Sekhar; 590 Mill Creek Lane, #205, Santa Clara, California 95054 (US). ATTI, Venkatraman; 5775 More- house Drive, San Diego, California 92121-1714 (US). (74) Agent: TOLER LAW GROUP, PC et al.; 8500 Bluffstone Cove, Suite A201, Austin, Texas 78759 (US). (81) Designated States (unless otherwise indicated, for every kind of national protection available): AE, AG, AL, AM, AO, AT, AU, AZ, BA, BB, BG, BH, BN, BR, BW, BY, BZ, CA, CH, CL, CN, CO, CR, CU, CZ, DE, DJ, DK, DM, DO, DZ, EC, EE, EG, ES, FI, GB, GD, GE, GH, GM, GT, HN, HR, HU, ID, IL, IN, IR, IS, JO, JP, KE, KG, KH, KN, KP, KR, KW, KZ, LA, LC, LK, LR, LS, LU, LY, MA, MD, ME, MG, MK, MN, MW, MX, MY, MZ, NA, NG, NI, NO, NZ, OM, PA, PE, PG, PH, PL, PT, QA, RO, RS, RU, RW, SA, SC, SD, SE, SG, SK, SL, SM, ST, SV, SY, TH, TJ, TM, TN, TR, TT, TZ, UA, UG, US, UZ, VC, VN, ZA, ZM, ZW. (54) Title: NON-HARMONIC SPEECH DETECTION AND BANDWIDTH EXTENSION IN A MULTI-SOURCE ENVIRON- = MENT FIG. 1 M Sound Source152 (57) : A device includes a multi-channel encoder configured to receive a first audio signal and a second audio signal, to perform a downmix operation on the first audio signal and the second audio signal to generate a mid signal, to generate a low-band mid signal and a high-band mid signal based on the mid signal, and to determine, based at least partially on a low band voicing value corresponding to the low band signal and a gain value corresponding to the high-band mid signal, a value of a multi-source flag that 00 flag associated with the high-band mid signal. The multi-channel encoder is configured to generate a high-band mid excitation signal O based on the multi-source flag and to generate a bitstream based on the high-band mid excitation signal. The device also includes a transmitter configured to transmit the bitstream and the multi-source flag to a second device. C [Continued on next page] WO 2018/195299 Al OII (84) Designated States (unless otherwise indicated, for every kind of regional protection available): ARIPO (BW, GH, GM, KE, LR, LS, MW, MZ, NA, RW, SD, SL, ST, SZ, TZ, UG, ZM, ZW), Eurasian (AM, AZ, BY, KG, KZ, RU, TJ, TM), European (AL, AT, BE, BG, CH, CY, CZ, DE, DK, EE, ES, FI, FR, GB, GR, HR, HU, IE, IS, IT, LT, LU, LV, MC, MK, MT, NL, NO, PL, PT, RO, RS, SE, SI, SK, SM, TR), OAPI (BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, KM, ML, MR, NE, SN, TD, TG). Published: — with international search report (Art. 21(3))
SG11201908390U 2017-04-21 2018-04-19 Non-harmonic speech detection and bandwidth extension in a multi-source environment SG11201908390UA (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201762488654P 2017-04-21 2017-04-21
US15/956,645 US10825467B2 (en) 2017-04-21 2018-04-18 Non-harmonic speech detection and bandwidth extension in a multi-source environment
PCT/US2018/028338 WO2018195299A1 (en) 2017-04-21 2018-04-19 Non-harmonic speech detection and bandwidth extension in a multi-source environment

Publications (1)

Publication Number Publication Date
SG11201908390UA true SG11201908390UA (en) 2019-11-28

Family

ID=63852843

Family Applications (1)

Application Number Title Priority Date Filing Date
SG11201908390U SG11201908390UA (en) 2017-04-21 2018-04-19 Non-harmonic speech detection and bandwidth extension in a multi-source environment

Country Status (9)

Country Link
US (1) US10825467B2 (en)
EP (1) EP3613042B1 (en)
KR (1) KR102308966B1 (en)
CN (1) CN110537222B (en)
AU (1) AU2018256414B2 (en)
BR (1) BR112019021903A2 (en)
SG (1) SG11201908390UA (en)
TW (1) TWI775838B (en)
WO (1) WO2018195299A1 (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10957331B2 (en) 2018-12-17 2021-03-23 Microsoft Technology Licensing, Llc Phase reconstruction in a speech decoder
US10847172B2 (en) * 2018-12-17 2020-11-24 Microsoft Technology Licensing, Llc Phase quantization in a speech encoder
KR102570480B1 (en) * 2019-01-04 2023-08-25 삼성전자주식회사 Processing Method of Audio signal and electronic device supporting the same
JP2022543292A (en) * 2019-08-05 2022-10-11 シュアー アクイジッション ホールディングス インコーポレイテッド transmit antenna diversity wireless audio system
US10978083B1 (en) 2019-11-13 2021-04-13 Shure Acquisition Holdings, Inc. Time domain spectral bandwidth replication
KR20210073975A (en) * 2019-12-11 2021-06-21 삼성전자주식회사 Speaker authentication method, learning method for speaker authentication and devices thereof
CN112562686B (en) * 2020-12-10 2022-07-15 青海民族大学 Zero-sample voice conversion corpus preprocessing method using neural network
CN113763980B (en) * 2021-10-30 2023-05-12 成都启英泰伦科技有限公司 Echo cancellation method

Family Cites Families (51)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7330814B2 (en) * 2000-05-22 2008-02-12 Texas Instruments Incorporated Wideband speech coding with modulated noise highband excitation system and method
SE519976C2 (en) * 2000-09-15 2003-05-06 Ericsson Telefon Ab L M Coding and decoding of signals from multiple channels
SE0004163D0 (en) * 2000-11-14 2000-11-14 Coding Technologies Sweden Ab Enhancing perceptual performance or high frequency reconstruction coding methods by adaptive filtering
ATE331280T1 (en) * 2001-11-23 2006-07-15 Koninkl Philips Electronics Nv BANDWIDTH EXTENSION FOR AUDIO SIGNALS
BRPI0517780A2 (en) * 2004-11-05 2011-04-19 Matsushita Electric Ind Co Ltd scalable decoding device and scalable coding device
KR100707174B1 (en) * 2004-12-31 2007-04-13 삼성전자주식회사 High band Speech coding and decoding apparatus in the wide-band speech coding/decoding system, and method thereof
MX2007012187A (en) * 2005-04-01 2007-12-11 Qualcomm Inc Systems, methods, and apparatus for highband time warping.
ES2358125T3 (en) * 2005-04-01 2011-05-05 Qualcomm Incorporated PROCEDURE AND APPLIANCE FOR AN ANTIDISPERSION FILTER OF AN EXTENDED SIGNAL FOR EXCESSING THE BAND WIDTH SPEED EXCITATION.
TWI324336B (en) * 2005-04-22 2010-05-01 Qualcomm Inc Method of signal processing and apparatus for gain factor smoothing
JP5100380B2 (en) * 2005-06-29 2012-12-19 パナソニック株式会社 Scalable decoding apparatus and lost data interpolation method
CN101273404B (en) * 2005-09-30 2012-07-04 松下电器产业株式会社 Audio encoding device and audio encoding method
US8135047B2 (en) * 2006-07-31 2012-03-13 Qualcomm Incorporated Systems and methods for including an identifier with a packet associated with a speech signal
US8005678B2 (en) * 2006-08-15 2011-08-23 Broadcom Corporation Re-phasing of decoder states after packet loss
CN101548318B (en) * 2006-12-15 2012-07-18 松下电器产业株式会社 Encoding device, decoding device, and method thereof
KR101355376B1 (en) * 2007-04-30 2014-01-23 삼성전자주식회사 Method and apparatus for encoding and decoding high frequency band
KR100970446B1 (en) * 2007-11-21 2010-07-16 한국전자통신연구원 Apparatus and method for deciding adaptive noise level for frequency extension
RU2443028C2 (en) * 2008-07-11 2012-02-20 Фраунхофер-Гезелльшафт цур Фёрдерунг дер ангевандтен Apparatus and method for calculating bandwidth extension data using a spectral tilt controlled framing
ES2539304T3 (en) * 2008-07-11 2015-06-29 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. An apparatus and a method to generate output data by bandwidth extension
EP2144230A1 (en) * 2008-07-11 2010-01-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Low bitrate audio encoding/decoding scheme having cascaded switches
EP2144231A1 (en) * 2008-07-11 2010-01-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Low bitrate audio encoding/decoding scheme with common preprocessing
US9037474B2 (en) * 2008-09-06 2015-05-19 Huawei Technologies Co., Ltd. Method for classifying audio signal into fast signal or slow signal
US8532998B2 (en) * 2008-09-06 2013-09-10 Huawei Technologies Co., Ltd. Selective bandwidth extension for encoding/decoding audio/speech signal
CN101763856B (en) * 2008-12-23 2011-11-02 华为技术有限公司 Signal classifying method, classifying device and coding system
CO6440537A2 (en) * 2009-04-09 2012-05-15 Fraunhofer Ges Forschung APPARATUS AND METHOD TO GENERATE A SYNTHESIS AUDIO SIGNAL AND TO CODIFY AN AUDIO SIGNAL
TWI556227B (en) * 2009-05-27 2016-11-01 杜比國際公司 Systems and methods for generating a high frequency component of a signal from a low frequency component of the signal, a set-top box, a computer program product and storage medium thereof
MX2012010415A (en) * 2010-03-09 2012-10-03 Fraunhofer Ges Forschung Apparatus and method for processing an input audio signal using cascaded filterbanks.
US20120029926A1 (en) * 2010-07-30 2012-02-02 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for dependent-mode coding of audio signals
KR20120016709A (en) * 2010-08-17 2012-02-27 삼성전자주식회사 Apparatus and method for improving the voice quality in portable communication system
WO2012040897A1 (en) * 2010-09-28 2012-04-05 Huawei Technologies Co., Ltd. Device and method for postprocessing decoded multi-channel audio signal or decoded stereo signal
CN102737636B (en) * 2011-04-13 2014-06-04 华为技术有限公司 Audio coding method and device thereof
WO2013035257A1 (en) * 2011-09-09 2013-03-14 パナソニック株式会社 Encoding device, decoding device, encoding method and decoding method
JP5817499B2 (en) * 2011-12-15 2015-11-18 富士通株式会社 Decoding device, encoding device, encoding / decoding system, decoding method, encoding method, decoding program, and encoding program
US9129600B2 (en) * 2012-09-26 2015-09-08 Google Technology Holdings LLC Method and apparatus for encoding an audio signal
ES2753228T3 (en) * 2012-11-05 2020-04-07 Panasonic Ip Corp America Voice Audio Coding Device, Voice Audio Decoding Device, Voice Audio Coding Procedure and Voice Audio Decoding Procedure
CN105976830B (en) * 2013-01-11 2019-09-20 华为技术有限公司 Audio-frequency signal coding and coding/decoding method, audio-frequency signal coding and decoding apparatus
EP2950308B1 (en) * 2013-01-22 2020-02-19 Panasonic Corporation Bandwidth expansion parameter-generator, encoder, decoder, bandwidth expansion parameter-generating method, encoding method, and decoding method
BR112015017632B1 (en) * 2013-01-29 2022-06-07 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e. V. Apparatus and method for generating a frequency-enhanced signal using subband temporal smoothing
KR101732059B1 (en) * 2013-05-15 2017-05-04 삼성전자주식회사 Method and device for encoding and decoding audio signal
FR3007563A1 (en) * 2013-06-25 2014-12-26 France Telecom ENHANCED FREQUENCY BAND EXTENSION IN AUDIO FREQUENCY SIGNAL DECODER
FR3008533A1 (en) * 2013-07-12 2015-01-16 Orange OPTIMIZED SCALE FACTOR FOR FREQUENCY BAND EXTENSION IN AUDIO FREQUENCY SIGNAL DECODER
US9620134B2 (en) * 2013-10-10 2017-04-11 Qualcomm Incorporated Gain shape estimation for improved tracking of high-band temporal characteristics
US10083708B2 (en) * 2013-10-11 2018-09-25 Qualcomm Incorporated Estimation of mixing factors to generate high-band excitation signal
CN105765655A (en) * 2013-11-22 2016-07-13 高通股份有限公司 Selective phase compensation in high band coding
US10163447B2 (en) * 2013-12-16 2018-12-25 Qualcomm Incorporated High-band signal modeling
US9564141B2 (en) * 2014-02-13 2017-02-07 Qualcomm Incorporated Harmonic bandwidth extension of audio signals
US9542955B2 (en) * 2014-03-31 2017-01-10 Qualcomm Incorporated High-band signal coding using multiple sub-bands
US9583115B2 (en) * 2014-06-26 2017-02-28 Qualcomm Incorporated Temporal gain adjustment based on high-band signal characteristic
US9984699B2 (en) * 2014-06-26 2018-05-29 Qualcomm Incorporated High-band signal coding using mismatched frequency ranges
US9886963B2 (en) * 2015-04-05 2018-02-06 Qualcomm Incorporated Encoder selection
US10341770B2 (en) * 2015-09-30 2019-07-02 Apple Inc. Encoded audio metadata-based loudness equalization and dynamic equalization during DRC
US10109284B2 (en) 2016-02-12 2018-10-23 Qualcomm Incorporated Inter-channel encoding and decoding of multiple high-band audio signals

Also Published As

Publication number Publication date
WO2018195299A1 (en) 2018-10-25
KR20190139872A (en) 2019-12-18
EP3613042B1 (en) 2022-09-21
TWI775838B (en) 2022-09-01
US10825467B2 (en) 2020-11-03
KR102308966B1 (en) 2021-10-05
EP3613042A1 (en) 2020-02-26
TW201842494A (en) 2018-12-01
AU2018256414A1 (en) 2019-10-03
BR112019021903A2 (en) 2020-05-26
CN110537222B (en) 2023-07-28
US20180308505A1 (en) 2018-10-25
AU2018256414B2 (en) 2022-05-19
CN110537222A (en) 2019-12-03

Similar Documents

Publication Publication Date Title
SG11201908390UA (en) Non-harmonic speech detection and bandwidth extension in a multi-source environment
SG11201903130WA (en) Sequence to sequence transformations for speech synthesis via recurrent neural networks
SG11201907753TA (en) Bispecific binding molecules that are capable of binding cd137 and tumor antigens, and uses thereof
SG11201803050PA (en) Electronic device generating notification based on context data in response to speech phrase from user
SG11201805906WA (en) Diagnostic and prognostic methods for cardiovascular diseases and events
CA3015496A1 (en) Voice control of a media playback system
SG11201908549RA (en) Automated quality control and spectral error correction for sample analysis instruments
SG11201807334SA (en) Methods, compositions, and devices for information storage
SG11201903771XA (en) Binding molecules specific for asct2 and uses thereof
SG11201900201YA (en) Methods for quantitating individual antibodies from a mixture
SG11202000287RA (en) Concept for generating an enhanced sound-field description or a modified sound field description using a depth-extended dirac technique or other techniques
SG11201809913PA (en) Methods for detecting target nucleic acids in a sample
SG11201810003UA (en) Using programmable dna binding proteins to enhance targeted genome modification
SG11201805950UA (en) Self-assembled nanostructures and separation membranes comprising aquaporin water channels and methods of making and using them
SG11201909348QA (en) Stereo parameters for stereo decoding
SG11201811604UA (en) System and method for real-time transcription of an audio signal into texts
SG11201906370TA (en) Backward-compatible integration of harmonic transposer for high frequency reconstruction of audio signals
SG11201904752QA (en) Coding of multiple audio signals
SG11201407800SA (en) Selective binding of biological targets to solid phase ureides
SG11201907927SA (en) Binding molecules that specifically bind to tau
SG11201806256SA (en) Apparatus and method for mdct m/s stereo with global ild with improved mid/side decision
SG11201406767VA (en) Microfilter and apparatus for separating a biological entity from a sample volume
SG11201908744PA (en) Anti-c5a antibodies and uses thereof
SG11201903154YA (en) Antibodies that bind zika virus envelope protein and uses thereof
SG11201907208XA (en) Radiolabeled anti-lag3 antibodies for immuno-pet imaging