SG11201908390UA - Non-harmonic speech detection and bandwidth extension in a multi-source environment - Google Patents
Non-harmonic speech detection and bandwidth extension in a multi-source environmentInfo
- Publication number
- SG11201908390UA SG11201908390UA SG11201908390UA SG11201908390UA SG 11201908390U A SG11201908390U A SG 11201908390UA SG 11201908390U A SG11201908390U A SG 11201908390UA SG 11201908390U A SG11201908390U A SG 11201908390UA
- Authority
- SG
- Singapore
- Prior art keywords
- bitstream
- channel
- flag
- band
- signal
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
- G10L21/0388—Details of processing therefor
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
Abstract
FirstLoudspeaker Second Output Channel 128 Second Loudspeaker 144 First Audio Channel 130 (\"Reference Channel\") Second Audio Channel 132 (Target Channel\") Second Microphone 148 First Microphone 146 Modified Non Harmonic HB Flag (y) 920 Down-Mix Bitstream 216 ICBWE Bitstream 242 High-Band Mid Channel Bitstream 244 Low-Band Bitstream 246 Modified Non Harmonic FIB Flag (y) 920 Down-Mix Bitstream 216 ICBWE Bitstream 242 High-Band Mid Channel Bitstream 244 Low-Band Bitstream 246 100 First Output Channel 126 First Device 104 Memory 153 Instructions 191 Transmitter 110 Input I nterface(s) 112 Encoder 200 Inter-Channel BWE Encoder 204 Non Harmonic HB Flag (x) 910 Modified Non Harmonic HB Flag (y) 920 ( 106 Second Device Decoder 300 I ICBWE Decoder 306 I Modified Non Harmonic HB Flag (y) 920 (12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) (19) World Intellectual Property Organization International Bureau (43) International Publication Date 25 October 2018 (25.10.2018) WIP0 I PCT ill IIIIIl °million °nolo mu imm Eno iflo oimIE (10) International Publication Number WO 2018/195299 Al (51) International Patent Classification: G1OL 19/008 (2013.01) G1OL 21/038 (2013.01) G1OL 19/02 (2013.01) (21) International Application Number: PCT/US2018/028338 (22) International Filing Date: 19 April 2018 (19.04.2018) (25) Filing Language: English (26) Publication Language: English (30) Priority Data: 62/488,654 21 April 2017 (21.04.2017) US 15/956,645 18 April 2018 (18.04.2018) US (71) Applicant: QUALCOMM INCORPORATED [US/US]; ATTN: International IP Administration, 5775 Morehouse Drive, San Diego, California 92121-1714 (US). (72) Inventors: CHEBIYYAM, Venkata Subrahmanyam Chandra Sekhar; 590 Mill Creek Lane, #205, Santa Clara, California 95054 (US). ATTI, Venkatraman; 5775 More- house Drive, San Diego, California 92121-1714 (US). (74) Agent: TOLER LAW GROUP, PC et al.; 8500 Bluffstone Cove, Suite A201, Austin, Texas 78759 (US). (81) Designated States (unless otherwise indicated, for every kind of national protection available): AE, AG, AL, AM, AO, AT, AU, AZ, BA, BB, BG, BH, BN, BR, BW, BY, BZ, CA, CH, CL, CN, CO, CR, CU, CZ, DE, DJ, DK, DM, DO, DZ, EC, EE, EG, ES, FI, GB, GD, GE, GH, GM, GT, HN, HR, HU, ID, IL, IN, IR, IS, JO, JP, KE, KG, KH, KN, KP, KR, KW, KZ, LA, LC, LK, LR, LS, LU, LY, MA, MD, ME, MG, MK, MN, MW, MX, MY, MZ, NA, NG, NI, NO, NZ, OM, PA, PE, PG, PH, PL, PT, QA, RO, RS, RU, RW, SA, SC, SD, SE, SG, SK, SL, SM, ST, SV, SY, TH, TJ, TM, TN, TR, TT, TZ, UA, UG, US, UZ, VC, VN, ZA, ZM, ZW. (54) Title: NON-HARMONIC SPEECH DETECTION AND BANDWIDTH EXTENSION IN A MULTI-SOURCE ENVIRON- = MENT FIG. 1 M Sound Source152 (57) : A device includes a multi-channel encoder configured to receive a first audio signal and a second audio signal, to perform a downmix operation on the first audio signal and the second audio signal to generate a mid signal, to generate a low-band mid signal and a high-band mid signal based on the mid signal, and to determine, based at least partially on a low band voicing value corresponding to the low band signal and a gain value corresponding to the high-band mid signal, a value of a multi-source flag that 00 flag associated with the high-band mid signal. The multi-channel encoder is configured to generate a high-band mid excitation signal O based on the multi-source flag and to generate a bitstream based on the high-band mid excitation signal. The device also includes a transmitter configured to transmit the bitstream and the multi-source flag to a second device. C [Continued on next page] WO 2018/195299 Al OII (84) Designated States (unless otherwise indicated, for every kind of regional protection available): ARIPO (BW, GH, GM, KE, LR, LS, MW, MZ, NA, RW, SD, SL, ST, SZ, TZ, UG, ZM, ZW), Eurasian (AM, AZ, BY, KG, KZ, RU, TJ, TM), European (AL, AT, BE, BG, CH, CY, CZ, DE, DK, EE, ES, FI, FR, GB, GR, HR, HU, IE, IS, IT, LT, LU, LV, MC, MK, MT, NL, NO, PL, PT, RO, RS, SE, SI, SK, SM, TR), OAPI (BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, KM, ML, MR, NE, SN, TD, TG). Published: — with international search report (Art. 21(3))
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201762488654P | 2017-04-21 | 2017-04-21 | |
US15/956,645 US10825467B2 (en) | 2017-04-21 | 2018-04-18 | Non-harmonic speech detection and bandwidth extension in a multi-source environment |
PCT/US2018/028338 WO2018195299A1 (en) | 2017-04-21 | 2018-04-19 | Non-harmonic speech detection and bandwidth extension in a multi-source environment |
Publications (1)
Publication Number | Publication Date |
---|---|
SG11201908390UA true SG11201908390UA (en) | 2019-11-28 |
Family
ID=63852843
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
SG11201908390U SG11201908390UA (en) | 2017-04-21 | 2018-04-19 | Non-harmonic speech detection and bandwidth extension in a multi-source environment |
Country Status (9)
Country | Link |
---|---|
US (1) | US10825467B2 (en) |
EP (1) | EP3613042B1 (en) |
KR (1) | KR102308966B1 (en) |
CN (1) | CN110537222B (en) |
AU (1) | AU2018256414B2 (en) |
BR (1) | BR112019021903A2 (en) |
SG (1) | SG11201908390UA (en) |
TW (1) | TWI775838B (en) |
WO (1) | WO2018195299A1 (en) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10957331B2 (en) | 2018-12-17 | 2021-03-23 | Microsoft Technology Licensing, Llc | Phase reconstruction in a speech decoder |
US10847172B2 (en) * | 2018-12-17 | 2020-11-24 | Microsoft Technology Licensing, Llc | Phase quantization in a speech encoder |
KR102570480B1 (en) * | 2019-01-04 | 2023-08-25 | 삼성전자주식회사 | Processing Method of Audio signal and electronic device supporting the same |
JP2022543292A (en) * | 2019-08-05 | 2022-10-11 | シュアー アクイジッション ホールディングス インコーポレイテッド | transmit antenna diversity wireless audio system |
US10978083B1 (en) | 2019-11-13 | 2021-04-13 | Shure Acquisition Holdings, Inc. | Time domain spectral bandwidth replication |
KR20210073975A (en) * | 2019-12-11 | 2021-06-21 | 삼성전자주식회사 | Speaker authentication method, learning method for speaker authentication and devices thereof |
CN112562686B (en) * | 2020-12-10 | 2022-07-15 | 青海民族大学 | Zero-sample voice conversion corpus preprocessing method using neural network |
CN113763980B (en) * | 2021-10-30 | 2023-05-12 | 成都启英泰伦科技有限公司 | Echo cancellation method |
Family Cites Families (51)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7330814B2 (en) * | 2000-05-22 | 2008-02-12 | Texas Instruments Incorporated | Wideband speech coding with modulated noise highband excitation system and method |
SE519976C2 (en) * | 2000-09-15 | 2003-05-06 | Ericsson Telefon Ab L M | Coding and decoding of signals from multiple channels |
SE0004163D0 (en) * | 2000-11-14 | 2000-11-14 | Coding Technologies Sweden Ab | Enhancing perceptual performance or high frequency reconstruction coding methods by adaptive filtering |
ATE331280T1 (en) * | 2001-11-23 | 2006-07-15 | Koninkl Philips Electronics Nv | BANDWIDTH EXTENSION FOR AUDIO SIGNALS |
BRPI0517780A2 (en) * | 2004-11-05 | 2011-04-19 | Matsushita Electric Ind Co Ltd | scalable decoding device and scalable coding device |
KR100707174B1 (en) * | 2004-12-31 | 2007-04-13 | 삼성전자주식회사 | High band Speech coding and decoding apparatus in the wide-band speech coding/decoding system, and method thereof |
MX2007012187A (en) * | 2005-04-01 | 2007-12-11 | Qualcomm Inc | Systems, methods, and apparatus for highband time warping. |
ES2358125T3 (en) * | 2005-04-01 | 2011-05-05 | Qualcomm Incorporated | PROCEDURE AND APPLIANCE FOR AN ANTIDISPERSION FILTER OF AN EXTENDED SIGNAL FOR EXCESSING THE BAND WIDTH SPEED EXCITATION. |
TWI324336B (en) * | 2005-04-22 | 2010-05-01 | Qualcomm Inc | Method of signal processing and apparatus for gain factor smoothing |
JP5100380B2 (en) * | 2005-06-29 | 2012-12-19 | パナソニック株式会社 | Scalable decoding apparatus and lost data interpolation method |
CN101273404B (en) * | 2005-09-30 | 2012-07-04 | 松下电器产业株式会社 | Audio encoding device and audio encoding method |
US8135047B2 (en) * | 2006-07-31 | 2012-03-13 | Qualcomm Incorporated | Systems and methods for including an identifier with a packet associated with a speech signal |
US8005678B2 (en) * | 2006-08-15 | 2011-08-23 | Broadcom Corporation | Re-phasing of decoder states after packet loss |
CN101548318B (en) * | 2006-12-15 | 2012-07-18 | 松下电器产业株式会社 | Encoding device, decoding device, and method thereof |
KR101355376B1 (en) * | 2007-04-30 | 2014-01-23 | 삼성전자주식회사 | Method and apparatus for encoding and decoding high frequency band |
KR100970446B1 (en) * | 2007-11-21 | 2010-07-16 | 한국전자통신연구원 | Apparatus and method for deciding adaptive noise level for frequency extension |
RU2443028C2 (en) * | 2008-07-11 | 2012-02-20 | Фраунхофер-Гезелльшафт цур Фёрдерунг дер ангевандтен | Apparatus and method for calculating bandwidth extension data using a spectral tilt controlled framing |
ES2539304T3 (en) * | 2008-07-11 | 2015-06-29 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | An apparatus and a method to generate output data by bandwidth extension |
EP2144230A1 (en) * | 2008-07-11 | 2010-01-13 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Low bitrate audio encoding/decoding scheme having cascaded switches |
EP2144231A1 (en) * | 2008-07-11 | 2010-01-13 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Low bitrate audio encoding/decoding scheme with common preprocessing |
US9037474B2 (en) * | 2008-09-06 | 2015-05-19 | Huawei Technologies Co., Ltd. | Method for classifying audio signal into fast signal or slow signal |
US8532998B2 (en) * | 2008-09-06 | 2013-09-10 | Huawei Technologies Co., Ltd. | Selective bandwidth extension for encoding/decoding audio/speech signal |
CN101763856B (en) * | 2008-12-23 | 2011-11-02 | 华为技术有限公司 | Signal classifying method, classifying device and coding system |
CO6440537A2 (en) * | 2009-04-09 | 2012-05-15 | Fraunhofer Ges Forschung | APPARATUS AND METHOD TO GENERATE A SYNTHESIS AUDIO SIGNAL AND TO CODIFY AN AUDIO SIGNAL |
TWI556227B (en) * | 2009-05-27 | 2016-11-01 | 杜比國際公司 | Systems and methods for generating a high frequency component of a signal from a low frequency component of the signal, a set-top box, a computer program product and storage medium thereof |
MX2012010415A (en) * | 2010-03-09 | 2012-10-03 | Fraunhofer Ges Forschung | Apparatus and method for processing an input audio signal using cascaded filterbanks. |
US20120029926A1 (en) * | 2010-07-30 | 2012-02-02 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for dependent-mode coding of audio signals |
KR20120016709A (en) * | 2010-08-17 | 2012-02-27 | 삼성전자주식회사 | Apparatus and method for improving the voice quality in portable communication system |
WO2012040897A1 (en) * | 2010-09-28 | 2012-04-05 | Huawei Technologies Co., Ltd. | Device and method for postprocessing decoded multi-channel audio signal or decoded stereo signal |
CN102737636B (en) * | 2011-04-13 | 2014-06-04 | 华为技术有限公司 | Audio coding method and device thereof |
WO2013035257A1 (en) * | 2011-09-09 | 2013-03-14 | パナソニック株式会社 | Encoding device, decoding device, encoding method and decoding method |
JP5817499B2 (en) * | 2011-12-15 | 2015-11-18 | 富士通株式会社 | Decoding device, encoding device, encoding / decoding system, decoding method, encoding method, decoding program, and encoding program |
US9129600B2 (en) * | 2012-09-26 | 2015-09-08 | Google Technology Holdings LLC | Method and apparatus for encoding an audio signal |
ES2753228T3 (en) * | 2012-11-05 | 2020-04-07 | Panasonic Ip Corp America | Voice Audio Coding Device, Voice Audio Decoding Device, Voice Audio Coding Procedure and Voice Audio Decoding Procedure |
CN105976830B (en) * | 2013-01-11 | 2019-09-20 | 华为技术有限公司 | Audio-frequency signal coding and coding/decoding method, audio-frequency signal coding and decoding apparatus |
EP2950308B1 (en) * | 2013-01-22 | 2020-02-19 | Panasonic Corporation | Bandwidth expansion parameter-generator, encoder, decoder, bandwidth expansion parameter-generating method, encoding method, and decoding method |
BR112015017632B1 (en) * | 2013-01-29 | 2022-06-07 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e. V. | Apparatus and method for generating a frequency-enhanced signal using subband temporal smoothing |
KR101732059B1 (en) * | 2013-05-15 | 2017-05-04 | 삼성전자주식회사 | Method and device for encoding and decoding audio signal |
FR3007563A1 (en) * | 2013-06-25 | 2014-12-26 | France Telecom | ENHANCED FREQUENCY BAND EXTENSION IN AUDIO FREQUENCY SIGNAL DECODER |
FR3008533A1 (en) * | 2013-07-12 | 2015-01-16 | Orange | OPTIMIZED SCALE FACTOR FOR FREQUENCY BAND EXTENSION IN AUDIO FREQUENCY SIGNAL DECODER |
US9620134B2 (en) * | 2013-10-10 | 2017-04-11 | Qualcomm Incorporated | Gain shape estimation for improved tracking of high-band temporal characteristics |
US10083708B2 (en) * | 2013-10-11 | 2018-09-25 | Qualcomm Incorporated | Estimation of mixing factors to generate high-band excitation signal |
CN105765655A (en) * | 2013-11-22 | 2016-07-13 | 高通股份有限公司 | Selective phase compensation in high band coding |
US10163447B2 (en) * | 2013-12-16 | 2018-12-25 | Qualcomm Incorporated | High-band signal modeling |
US9564141B2 (en) * | 2014-02-13 | 2017-02-07 | Qualcomm Incorporated | Harmonic bandwidth extension of audio signals |
US9542955B2 (en) * | 2014-03-31 | 2017-01-10 | Qualcomm Incorporated | High-band signal coding using multiple sub-bands |
US9583115B2 (en) * | 2014-06-26 | 2017-02-28 | Qualcomm Incorporated | Temporal gain adjustment based on high-band signal characteristic |
US9984699B2 (en) * | 2014-06-26 | 2018-05-29 | Qualcomm Incorporated | High-band signal coding using mismatched frequency ranges |
US9886963B2 (en) * | 2015-04-05 | 2018-02-06 | Qualcomm Incorporated | Encoder selection |
US10341770B2 (en) * | 2015-09-30 | 2019-07-02 | Apple Inc. | Encoded audio metadata-based loudness equalization and dynamic equalization during DRC |
US10109284B2 (en) | 2016-02-12 | 2018-10-23 | Qualcomm Incorporated | Inter-channel encoding and decoding of multiple high-band audio signals |
-
2018
- 2018-04-18 US US15/956,645 patent/US10825467B2/en active Active
- 2018-04-19 KR KR1020197030409A patent/KR102308966B1/en active IP Right Grant
- 2018-04-19 CN CN201880026185.XA patent/CN110537222B/en active Active
- 2018-04-19 AU AU2018256414A patent/AU2018256414B2/en active Active
- 2018-04-19 WO PCT/US2018/028338 patent/WO2018195299A1/en active Application Filing
- 2018-04-19 SG SG11201908390U patent/SG11201908390UA/en unknown
- 2018-04-19 EP EP18724649.1A patent/EP3613042B1/en active Active
- 2018-04-19 BR BR112019021903-0A patent/BR112019021903A2/en unknown
- 2018-04-20 TW TW107113473A patent/TWI775838B/en active
Also Published As
Publication number | Publication date |
---|---|
WO2018195299A1 (en) | 2018-10-25 |
KR20190139872A (en) | 2019-12-18 |
EP3613042B1 (en) | 2022-09-21 |
TWI775838B (en) | 2022-09-01 |
US10825467B2 (en) | 2020-11-03 |
KR102308966B1 (en) | 2021-10-05 |
EP3613042A1 (en) | 2020-02-26 |
TW201842494A (en) | 2018-12-01 |
AU2018256414A1 (en) | 2019-10-03 |
BR112019021903A2 (en) | 2020-05-26 |
CN110537222B (en) | 2023-07-28 |
US20180308505A1 (en) | 2018-10-25 |
AU2018256414B2 (en) | 2022-05-19 |
CN110537222A (en) | 2019-12-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
SG11201908390UA (en) | Non-harmonic speech detection and bandwidth extension in a multi-source environment | |
SG11201903130WA (en) | Sequence to sequence transformations for speech synthesis via recurrent neural networks | |
SG11201907753TA (en) | Bispecific binding molecules that are capable of binding cd137 and tumor antigens, and uses thereof | |
SG11201803050PA (en) | Electronic device generating notification based on context data in response to speech phrase from user | |
SG11201805906WA (en) | Diagnostic and prognostic methods for cardiovascular diseases and events | |
CA3015496A1 (en) | Voice control of a media playback system | |
SG11201908549RA (en) | Automated quality control and spectral error correction for sample analysis instruments | |
SG11201807334SA (en) | Methods, compositions, and devices for information storage | |
SG11201903771XA (en) | Binding molecules specific for asct2 and uses thereof | |
SG11201900201YA (en) | Methods for quantitating individual antibodies from a mixture | |
SG11202000287RA (en) | Concept for generating an enhanced sound-field description or a modified sound field description using a depth-extended dirac technique or other techniques | |
SG11201809913PA (en) | Methods for detecting target nucleic acids in a sample | |
SG11201810003UA (en) | Using programmable dna binding proteins to enhance targeted genome modification | |
SG11201805950UA (en) | Self-assembled nanostructures and separation membranes comprising aquaporin water channels and methods of making and using them | |
SG11201909348QA (en) | Stereo parameters for stereo decoding | |
SG11201811604UA (en) | System and method for real-time transcription of an audio signal into texts | |
SG11201906370TA (en) | Backward-compatible integration of harmonic transposer for high frequency reconstruction of audio signals | |
SG11201904752QA (en) | Coding of multiple audio signals | |
SG11201407800SA (en) | Selective binding of biological targets to solid phase ureides | |
SG11201907927SA (en) | Binding molecules that specifically bind to tau | |
SG11201806256SA (en) | Apparatus and method for mdct m/s stereo with global ild with improved mid/side decision | |
SG11201406767VA (en) | Microfilter and apparatus for separating a biological entity from a sample volume | |
SG11201908744PA (en) | Anti-c5a antibodies and uses thereof | |
SG11201903154YA (en) | Antibodies that bind zika virus envelope protein and uses thereof | |
SG11201907208XA (en) | Radiolabeled anti-lag3 antibodies for immuno-pet imaging |