EP2388780A1 - Apparatus and method for extending or compressing time sections of an audio signal - Google Patents

Apparatus and method for extending or compressing time sections of an audio signal

Info

Publication number
EP2388780A1
EP2388780A1 EP11155349A EP11155349A EP2388780A1 EP 2388780 A1 EP2388780 A1 EP 2388780A1 EP 11155349 A EP11155349 A EP 11155349A EP 11155349 A EP11155349 A EP 11155349A EP 2388780 A1 EP2388780 A1 EP 2388780A1
Authority
EP
European Patent Office
Prior art keywords
time
audio signal
information content
measure
section
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP11155349A
Other languages
German (de)
English (en)
Inventor
Frederik Nagel
Stefan Geyersberger
Sascha Disch
Max Neuendorf
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fraunhofer Gesellschaft zur Förderung der Angewandten Forschung eV
Original Assignee
Fraunhofer Gesellschaft zur Förderung der Angewandten Forschung eV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2010-05-19
Filing date
2011-02-22
Publication date
2011-11-23
Application filed by Fraunhofer Gesellschaft zur Förderung der Angewandten Forschung eV
Priority to ARP110101554A (AR081014A1)
Priority to TW100116130A (TW201207847A)
Priority to PCT/EP2011/057979 (WO2011144617A1)
Publication of EP2388780A1
Legal status: Withdrawn

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/04 Time compression or expansion
EP11155349A 2010-05-19 2011-02-22 Apparatus and method for extending or compressing time sections of an audio signal Withdrawn EP2388780A1 (fr)

Priority Applications (3)

Application Number Priority Date Filing Date Title
ARP110101554A AR081014A1 (es) 2010-05-19 2011-05-05 Apparatus and method for temporally extending or compressing time sections of an audio signal
TW100116130A TW201207847A (en) 2010-05-19 2011-05-09 Apparatus and method for temporarily extending or compressing time sections of an audio signal
PCT/EP2011/057979 WO2011144617A1 (fr) 2010-05-19 2011-05-17 Apparatus and method for extending or compressing time sections of an audio signal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US34612410P 2010-05-19 2010-05-19

Publications (1)

Publication Number Publication Date
EP2388780A1 (fr) 2011-11-23

Family

ID=44263126

Family Applications (1)

Application Number Title Priority Date Filing Date
EP11155349A Withdrawn EP2388780A1 (fr) 2010-05-19 2011-02-22 Apparatus and method for extending or compressing time sections of an audio signal

Country Status (4)

Country Link
EP (1) EP2388780A1 (fr)
AR (1) AR081014A1 (fr)
TW (1) TW201207847A (fr)
WO (1) WO2011144617A1 (fr)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2538527A (en) * 2015-05-19 2016-11-23 Thales Holdings Uk Plc Signal processing device for processing an audio waveform for playback through a speaker
EP3244408A1 (fr) * 2016-05-09 2017-11-15 Sony Mobile Communications, Inc Method and electronic unit for adjusting the playback speed of media files
WO2020026268A1 (fr) * 2018-08-03 2020-02-06 Sling Media Pvt. Ltd. Systems and methods for intelligent playback
CN108419096B (zh) * 2018-02-26 2020-07-03 浙江创课教育科技有限公司 Intelligent voice playback method and system
CN114040030A (zh) * 2021-11-18 2022-02-11 深圳智慧林网络科技有限公司 Data compression method, apparatus, device and medium based on preset rules

Families Citing this family (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9697871B2 (en) 2011-03-23 2017-07-04 Audible, Inc. Synchronizing recorded audio content and companion content
US9760920B2 (en) 2011-03-23 2017-09-12 Audible, Inc. Synchronizing digital content
US8948892B2 (en) 2011-03-23 2015-02-03 Audible, Inc. Managing playback of synchronized content
US8855797B2 (en) 2011-03-23 2014-10-07 Audible, Inc. Managing playback of synchronized content
US9734153B2 (en) 2011-03-23 2017-08-15 Audible, Inc. Managing related digital content
US9703781B2 (en) 2011-03-23 2017-07-11 Audible, Inc. Managing related digital content
US9706247B2 (en) 2011-03-23 2017-07-11 Audible, Inc. Synchronized digital content samples
US8862255B2 (en) 2011-03-23 2014-10-14 Audible, Inc. Managing playback of synchronized content
US8849676B2 (en) 2012-03-29 2014-09-30 Audible, Inc. Content customization
US9037956B2 (en) 2012-03-29 2015-05-19 Audible, Inc. Content customization
US9075760B2 (en) 2012-05-07 2015-07-07 Audible, Inc. Narration settings distribution for content customization
US9317500B2 (en) 2012-05-30 2016-04-19 Audible, Inc. Synchronizing translated digital content
US8972265B1 (en) 2012-06-18 2015-03-03 Audible, Inc. Multiple voices in audio content
US9141257B1 (en) 2012-06-18 2015-09-22 Audible, Inc. Selecting and conveying supplemental content
US9536439B1 (en) 2012-06-27 2017-01-03 Audible, Inc. Conveying questions with content
US9679608B2 (en) * 2012-06-28 2017-06-13 Audible, Inc. Pacing content
US10109278B2 (en) 2012-08-02 2018-10-23 Audible, Inc. Aligning body matter across content formats
US9367196B1 (en) 2012-09-26 2016-06-14 Audible, Inc. Conveying branched content
US9632647B1 (en) 2012-10-09 2017-04-25 Audible, Inc. Selecting presentation positions in dynamic content
US9223830B1 (en) 2012-10-26 2015-12-29 Audible, Inc. Content presentation analysis
US9280906B2 (en) 2013-02-04 2016-03-08 Audible, Inc. Prompting a user for input during a synchronous presentation of audio content and textual content
US9472113B1 (en) 2013-02-05 2016-10-18 Audible, Inc. Synchronizing playback of digital content with physical content
US9978395B2 (en) * 2013-03-15 2018-05-22 Vocollect, Inc. Method and system for mitigating delay in receiving audio stream during production of sound from audio stream
WO2014178860A1 (fr) 2013-05-01 2014-11-06 Thomson Licensing Call initiation by voice command
US9317486B1 (en) 2013-06-07 2016-04-19 Audible, Inc. Synchronizing playback of digital content with captured physical content
US9489360B2 (en) 2013-09-05 2016-11-08 Audible, Inc. Identifying extra material in companion content
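CN107211062B (zh) 2015-02-03 2020-11-03 Dolby Laboratories Licensing Corporation Audio playback scheduling in a virtual acoustic space
CN110770819B (zh) 2017-06-15 2023-05-12 Beijing Didi Infinity Technology and Development Co., Ltd. Speech recognition system and method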

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5828994A (en) * 1996-06-05 1998-10-27 Interval Research Corporation Non-uniform time scale modification of recorded audio
US6549884B1 (en) 1999-09-21 2003-04-15 Creative Technology Ltd. Phase-vocoder pitch-shifting
EP1770688A1 (fr) * 2004-07-21 2007-04-04 Fujitsu Limited Speed converter, speed conversion method and program
DE102008015702A1 (de) 2008-01-31 2009-08-06 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for bandwidth extension of an audio signal

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
A. RÖBEL: "New Approach to Transient Processing in the Phase Vocoder", PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DIGITAL AUDIO EFFECTS OF DAFX-03, 8 September 2003 (2003-09-08), pages DAFX-1 - DAFX-6
JEAN LAROCHE; MARK DOLSON: "New Phase-Vocoder Techniques for Pitch-Shifting, Harmonizing and Other Exotic Effects", PROCEEDINGS 1999 IEEE, WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 17 October 1999 (1999-10-17), pages 91 - 94, XP010365068, DOI: doi:10.1109/ASPAA.1999.810857
MARK DOLSON: "The Phase-Vocoder: A Tutorial", COMPUTER MUSIC JOURNAL, vol. 10, no. 4, 1986, pages 14 - 27, XP009029676
MILLER PUCKETTE: "Phase-locked Vocoder", PROCEEDINGS 1995 IEEE, ASSP, CONFERENCE ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTIC NOISE
T. KARRER; E. LEE; J. BORCHERS: "PhaVoRIT: A Phase Vocoder for Real Time Interactive Time-Stretching", PROC. ICMC, November 2006 (2006-11-01), pages 708 - 715

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2538527A (en) * 2015-05-19 2016-11-23 Thales Holdings Uk Plc Signal processing device for processing an audio waveform for playback through a speaker
GB2538527B (en) * 2015-05-19 2018-12-26 Thales Holdings Uk Plc Signal processing device for processing an audio waveform for playback through a speaker
EP3244408A1 (fr) * 2016-05-09 2017-11-15 Sony Mobile Communications, Inc Method and electronic unit for adjusting the playback speed of media files
CN108419096B (zh) * 2018-02-26 2020-07-03 浙江创课教育科技有限公司 Intelligent voice playback method and system
WO2020026268A1 (fr) * 2018-08-03 2020-02-06 Sling Media Pvt. Ltd. Systems and methods for intelligent playback
US11282534B2 (en) 2018-08-03 2022-03-22 Sling Media Pvt Ltd Systems and methods for intelligent playback
US11972770B2 (en) 2018-08-03 2024-04-30 Dish Network Technologies India Private Limited Systems and methods for intelligent playback
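CN114040030A (zh) * 2021-11-18 2022-02-11 深圳智慧林网络科技有限公司 Data compression method, apparatus, device and medium based on preset rules
CN114040030B (zh) * 2021-11-18 2023-11-24 深圳智慧林网络科技有限公司 Data compression method, apparatus, device and medium based on preset rules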

Also Published As

Publication number Publication date
TW201207847A (en) 2012-02-16
WO2011144617A1 (fr) 2011-11-24
AR081014A1 (es) 2012-05-30

Similar Documents

Publication Publication Date Title
EP2388780A1 (fr) Apparatus and method for extending or compressing time sections of an audio signal
CA2257298C (fr) Non-uniform time scale modification of recorded audio signals
US8484035B2 (en) Modification of voice waveforms to change social signaling
Owren et al. Measuring emotion-related vocal acoustics
Grofit et al. Time-scale modification of audio signals using enhanced WSOLA with management of transients
WO2016165334A1 (fr) Voice processing method and apparatus, and terminal device
US20100217584A1 (en) Speech analysis device, speech analysis and synthesis device, correction rule information generation device, speech analysis system, speech analysis method, correction rule information generation method, and program
Rudresh et al. Epoch-synchronous overlap-add (ESOLA) for time-and pitch-scale modification of speech signals
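CN110663080A (zh) Method and apparatus for dynamically modifying voice timbre by frequency shifting of spectral envelope formants
JP2015068897A (ja) Utterance evaluation method and apparatus, and computer program for evaluating utterances
JP2010014913A (ja) Voice-quality-converted speech generation device and voice-quality-converted speech generation system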
Vegesna et al. Prosody modification for speech recognition in emotionally mismatched conditions
CA2483607C (fr) Syllabic nucleus extraction device and associated program product
Dutoit Corpus-based speech synthesis
Möller et al. Comparison of approaches for instrumentally predicting the quality of text-to-speech systems
JP3576800B2 (ja) Speech analysis method and program recording medium
Akanksh et al. Interconversion of emotions in speech using td-psola
Donnellan et al. Speech-adaptive time-scale modification for computer assisted language-learning
Govind et al. Improving the flexibility of dynamic prosody modification using instants of significant excitation
Sarma et al. Consonant-vowel unit recognition using dominant aperiodic and transition region detection
WO2004077381A1 (fr) Voice playback system
EP3327723A1 (fr) Method for slowing down speech in an input media content
JP3853923B2 (ja) Speech synthesis device
Jang et al. Speech emotion recognition for affective human-robot interaction
Kain et al. Spectral control in concatenative speech synthesis

Legal Events

Date Code Title Description
AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

AX Request for extension of the european patent

Extension state: BA ME

PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

RIN1 Information on inventor provided before grant (corrected)

Inventor name: NEUENDORF, MAX

Inventor name: DISCH, SASCHA

Inventor name: GEYERSBERGER, STEFAN

Inventor name: NAGEL, FREDERIK

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20120524