DE602004008776D1 - Vorrichtung und verfahren zur konfiguration von sprachlesern unter verwendung semantischer analyse - Google Patents

Vorrichtung und verfahren zur konfiguration von sprachlesern unter verwendung semantischer analyse

Info

Publication number
DE602004008776D1
DE602004008776D1 DE602004008776T DE602004008776T DE602004008776D1 DE 602004008776 D1 DE602004008776 D1 DE 602004008776D1 DE 602004008776 T DE602004008776 T DE 602004008776T DE 602004008776 T DE602004008776 T DE 602004008776T DE 602004008776 D1 DE602004008776 D1 DE 602004008776D1
Authority
DE
Germany
Prior art keywords
semantic
voice
text block
text
identifier
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
DE602004008776T
Other languages
English (en)
Other versions
DE602004008776T2 (de
Inventor
Steven Edward Atkin
Janani Janakiraman
David Bruce Kumhyr
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Publication of DE602004008776D1 publication Critical patent/DE602004008776D1/de
Application granted granted Critical
Publication of DE602004008776T2 publication Critical patent/DE602004008776T2/de
Anticipated expiration legal-status Critical
Active legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/08Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04Details of speech synthesis systems, e.g. synthesiser structure or memory management

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • User Interface Of Digital Computer (AREA)
  • Document Processing Apparatus (AREA)
DE602004008776T 2003-06-19 2004-06-11 Vorrichtung und verfahren zur konfiguration von sprachlesern unter verwendung semantischer analyse Active DE602004008776T2 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US464881 2003-06-19
US10/464,881 US20040260551A1 (en) 2003-06-19 2003-06-19 System and method for configuring voice readers using semantic analysis
PCT/EP2004/051010 WO2004111997A1 (en) 2003-06-19 2004-06-11 System and method for configuring voice readers using semantic analysis

Publications (2)

Publication Number Publication Date
DE602004008776D1 true DE602004008776D1 (de) 2007-10-18
DE602004008776T2 DE602004008776T2 (de) 2008-06-12

Family

ID=33517358

Family Applications (1)

Application Number Title Priority Date Filing Date
DE602004008776T Active DE602004008776T2 (de) 2003-06-19 2004-06-11 Vorrichtung und verfahren zur konfiguration von sprachlesern unter verwendung semantischer analyse

Country Status (8)

Country Link
US (2) US20040260551A1 (de)
EP (1) EP1636790B1 (de)
KR (1) KR100745443B1 (de)
CN (1) CN1788305B (de)
AT (1) ATE372572T1 (de)
DE (1) DE602004008776T2 (de)
IL (1) IL172518A (de)
WO (1) WO2004111997A1 (de)

Families Citing this family (43)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050096909A1 (en) * 2003-10-29 2005-05-05 Raimo Bakis Systems and methods for expressive text-to-speech
US20050125236A1 (en) * 2003-12-08 2005-06-09 International Business Machines Corporation Automatic capture of intonation cues in audio segments for speech applications
US7672436B1 (en) 2004-01-23 2010-03-02 Sprint Spectrum L.P. Voice rendering of E-mail with tags for improved user experience
US9236043B2 (en) * 2004-04-02 2016-01-12 Knfb Reader, Llc Document mode processing for portable reading machine enabling document navigation
KR100669241B1 (ko) * 2004-12-15 2007-01-15 한국전자통신연구원 화행 정보를 이용한 대화체 음성합성 시스템 및 방법
US20080086490A1 (en) * 2006-10-04 2008-04-10 Sap Ag Discovery of services matching a service request
CN101226523B (zh) * 2007-01-17 2012-09-05 国际商业机器公司 数据概况分析方法和系统
US20090164387A1 (en) * 2007-04-17 2009-06-25 Semandex Networks Inc. Systems and methods for providing semantically enhanced financial information
US20090204243A1 (en) * 2008-01-09 2009-08-13 8 Figure, Llc Method and apparatus for creating customized text-to-speech podcasts and videos incorporating associated media
US20090282042A1 (en) * 2008-05-12 2009-11-12 Expressor Software Method and system for managing the development of data integration projects to facilitate project development and analysis thereof
DE102008060301B4 (de) * 2008-12-03 2012-05-03 Grenzebach Maschinenbau Gmbh Verfahren und Vorrichtung zum kraftschlüssigen Verbinden von glasartigen Bauteilen mit Metallen sowie Computerprogramm und maschinenlesbarer Träger zur Durchführung des Verfahrens
US8903847B2 (en) * 2010-03-05 2014-12-02 International Business Machines Corporation Digital media voice tags in social networks
US8645141B2 (en) * 2010-09-14 2014-02-04 Sony Corporation Method and system for text to speech conversion
US9734637B2 (en) * 2010-12-06 2017-08-15 Microsoft Technology Licensing, Llc Semantic rigging of avatars
CN102543068A (zh) * 2010-12-31 2012-07-04 北大方正集团有限公司 语音播放文本信息的方法和装置
US9286886B2 (en) * 2011-01-24 2016-03-15 Nuance Communications, Inc. Methods and apparatus for predicting prosody in speech synthesis
US8688090B2 (en) 2011-03-21 2014-04-01 International Business Machines Corporation Data session preferences
US20120244842A1 (en) 2011-03-21 2012-09-27 International Business Machines Corporation Data Session Synchronization With Phone Numbers
US20120246238A1 (en) 2011-03-21 2012-09-27 International Business Machines Corporation Asynchronous messaging tags
CN102752019B (zh) * 2011-04-20 2015-01-28 深圳盒子支付信息技术有限公司 基于耳机插孔的数据发送、接收、传输方法及系统
US9159313B2 (en) * 2012-04-03 2015-10-13 Sony Corporation Playback control apparatus, playback control method, and medium for playing a program including segments generated using speech synthesis and segments not generated using speech synthesis
US9158760B2 (en) 2012-12-21 2015-10-13 The Nielsen Company (Us), Llc Audio decoding with supplemental semantic audio recognition and report generation
US9195649B2 (en) 2012-12-21 2015-11-24 The Nielsen Company (Us), Llc Audio processing techniques for semantic audio recognition and report generation
US9183849B2 (en) * 2012-12-21 2015-11-10 The Nielsen Company (Us), Llc Audio matching with semantic audio recognition and report generation
CN104281566A (zh) * 2014-10-13 2015-01-14 安徽华贞信息科技有限公司 一种语义化文本描述方法及系统
CN104978961B (zh) * 2015-05-25 2019-10-15 广州酷狗计算机科技有限公司 一种音频处理方法、装置及终端
CN105096932A (zh) * 2015-07-14 2015-11-25 百度在线网络技术(北京)有限公司 有声读物的语音合成方法和装置
US10235989B2 (en) * 2016-03-24 2019-03-19 Oracle International Corporation Sonification of words and phrases by text mining based on frequency of occurrence
CN105741829A (zh) * 2016-04-28 2016-07-06 玉环看知信息科技有限公司 数据转换方法及装置
CN106384586A (zh) * 2016-09-07 2017-02-08 北京小米移动软件有限公司 朗读文本信息的方法及装置
CN107886939B (zh) * 2016-09-30 2021-03-30 北京京东尚科信息技术有限公司 一种在客户端的中止-接续式文本语音播放方法和装置
US10347247B2 (en) 2016-12-30 2019-07-09 Google Llc Modulation of packetized audio signals
US11295738B2 (en) 2016-12-30 2022-04-05 Google, Llc Modulation of packetized audio signals
CN108305611B (zh) * 2017-06-27 2022-02-11 腾讯科技(深圳)有限公司 文本转语音的方法、装置、存储介质和计算机设备
CN108962219B (zh) * 2018-06-29 2019-12-13 百度在线网络技术(北京)有限公司 用于处理文本的方法和装置
US11145289B1 (en) * 2018-09-28 2021-10-12 United Services Automobile Association (Usaa) System and method for providing audible explanation of documents upon request
WO2020256475A1 (ko) * 2019-06-21 2020-12-24 주식회사 머니브레인 텍스트를 이용한 발화 동영상 생성 방법 및 장치
KR102360840B1 (ko) * 2019-06-21 2022-02-09 주식회사 딥브레인에이아이 텍스트를 이용한 발화 동영상 생성 방법 및 장치
CN111291572B (zh) * 2020-01-20 2023-06-09 Oppo广东移动通信有限公司 一种文字排版方法、装置及计算机可读存储介质
CN111667815B (zh) * 2020-06-04 2023-09-01 上海肇观电子科技有限公司 用于文本到语音转换的方法、设备、芯片电路和介质
US11356792B2 (en) * 2020-06-24 2022-06-07 International Business Machines Corporation Selecting a primary source of text to speech based on posture
US20220222437A1 (en) * 2021-01-08 2022-07-14 Nice Ltd. Systems and methods for structured phrase embedding and use thereof
US11907324B2 (en) * 2022-04-29 2024-02-20 Docusign, Inc. Guided form generation in a document management system

Family Cites Families (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5029214A (en) * 1986-08-11 1991-07-02 Hollander James F Electronic speech control apparatus and methods
US4839853A (en) * 1988-09-15 1989-06-13 Bell Communications Research, Inc. Computer information retrieval using latent semantic structure
US5761640A (en) * 1995-12-18 1998-06-02 Nynex Science & Technology, Inc. Name and address processor
JPH10153998A (ja) * 1996-09-24 1998-06-09 Nippon Telegr & Teleph Corp <Ntt> 補助情報利用型音声合成方法、この方法を実施する手順を記録した記録媒体、およびこの方法を実施する装置
US6226614B1 (en) * 1997-05-21 2001-05-01 Nippon Telegraph And Telephone Corporation Method and apparatus for editing/creating synthetic speech message and recording medium with the method recorded thereon
US6108627A (en) * 1997-10-31 2000-08-22 Nortel Networks Corporation Automatic transcription tool
US6119086A (en) * 1998-04-28 2000-09-12 International Business Machines Corporation Speech coding via speech recognition and synthesis based on pre-enrolled phonetic tokens
JPH11327870A (ja) * 1998-05-15 1999-11-30 Fujitsu Ltd ドキュメント読み上げ装置、読み上げ制御方法及び記 録媒体
JP3180764B2 (ja) * 1998-06-05 2001-06-25 日本電気株式会社 音声合成装置
US6446040B1 (en) * 1998-06-17 2002-09-03 Yahoo! Inc. Intelligent text-to-speech synthesis
JP2000105595A (ja) * 1998-09-30 2000-04-11 Victor Co Of Japan Ltd 歌唱装置及び記録媒体
US6587822B2 (en) * 1998-10-06 2003-07-01 Lucent Technologies Inc. Web-based platform for interactive voice response (IVR)
US6405199B1 (en) * 1998-10-30 2002-06-11 Novell, Inc. Method and apparatus for semantic token generation based on marked phrases in a content stream
JP2000206982A (ja) * 1999-01-12 2000-07-28 Toshiba Corp 音声合成装置及び文音声変換プログラムを記録した機械読み取り可能な記録媒体
JP2001014306A (ja) * 1999-06-30 2001-01-19 Sony Corp 電子文書処理方法及び電子文書処理装置並びに電子文書処理プログラムが記録された記録媒体
US6993476B1 (en) * 1999-08-26 2006-01-31 International Business Machines Corporation System and method for incorporating semantic characteristics into the format-driven syntactic document transcoding framework
US6725190B1 (en) * 1999-11-02 2004-04-20 International Business Machines Corporation Method and system for speech reconstruction from speech recognition features, pitch and voicing with resampled basis functions providing reconstruction of the spectral envelope
JP3515039B2 (ja) * 2000-03-03 2004-04-05 沖電気工業株式会社 テキスト音声変換装置におけるピッチパタン制御方法
US7010489B1 (en) * 2000-03-09 2006-03-07 International Business Mahcines Corporation Method for guiding text-to-speech output timing using speech recognition markers
US6856958B2 (en) * 2000-09-05 2005-02-15 Lucent Technologies Inc. Methods and apparatus for text to speech processing using language independent prosody markup
US20040054973A1 (en) * 2000-10-02 2004-03-18 Akio Yamamoto Method and apparatus for transforming contents on the web
GB0029576D0 (en) * 2000-12-02 2001-01-17 Hewlett Packard Co Voice site personality setting
JP2002333895A (ja) * 2001-05-10 2002-11-22 Sony Corp 情報処理装置および情報処理方法、記録媒体、並びにプログラム
GB0113570D0 (en) * 2001-06-04 2001-07-25 Hewlett Packard Co Audio-form presentation of text messages
JP4680429B2 (ja) * 2001-06-26 2011-05-11 Okiセミコンダクタ株式会社 テキスト音声変換装置における高速読上げ制御方法
US20030125929A1 (en) * 2001-12-10 2003-07-03 Thomas Bergstraesser Services for context-sensitive flagging of information in natural language text and central management of metadata relating that information over a computer network
WO2003067471A1 (fr) * 2002-02-04 2003-08-14 Celestar Lexico-Sciences, Inc. Appareil et procede permettant de traiter des connaissances dans des documents
US7096183B2 (en) * 2002-02-27 2006-08-22 Matsushita Electric Industrial Co., Ltd. Customizing the speaking style of a speech synthesizer based on semantic analysis
JP4150198B2 (ja) * 2002-03-15 2008-09-17 ソニー株式会社 音声合成方法、音声合成装置、プログラム及び記録媒体、並びにロボット装置
JP2004226711A (ja) * 2003-01-23 2004-08-12 Xanavi Informatics Corp 音声出力装置及びナビゲーション装置

Also Published As

Publication number Publication date
US20070276667A1 (en) 2007-11-29
ATE372572T1 (de) 2007-09-15
CN1788305A (zh) 2006-06-14
KR100745443B1 (ko) 2007-08-03
CN1788305B (zh) 2011-05-04
IL172518A (en) 2011-04-28
IL172518A0 (en) 2006-04-10
KR20060020632A (ko) 2006-03-06
DE602004008776T2 (de) 2008-06-12
EP1636790B1 (de) 2007-09-05
EP1636790A1 (de) 2006-03-22
WO2004111997A1 (en) 2004-12-23
US20040260551A1 (en) 2004-12-23

Similar Documents

Publication Publication Date Title
DE602004008776D1 (de) Vorrichtung und verfahren zur konfiguration von sprachlesern unter verwendung semantischer analyse
ATE413751T1 (de) Verfahren und vorrichtung zur zweistufigen paketklassifikation unter verwendung einer spezifischen filteranpassung und gemeinsamen benutzung auf transportebene
CN1478269B (zh) 根据吠声的特征分析判断狗的情绪的设备及其方法
ATE233935T1 (de) Vorrichtung und verfahren zur unterscheidung von ähnlich klingenden wörtern in der spracherkennung
ATE220473T1 (de) System, verfahren und programmdatenträger zur darstellung komplexer informationen als klang
DE602005025103D1 (de) Vorrichtung und Verfahren zur zweistufigen Paketklassifikation unter Verwendung von höchst spezifischer Filteranpassung und Transport-Ebenen-Sharing
Greenberg et al. Listening to speech: an auditory perspective
ATE352071T1 (de) Verfahren und vorrichtung zur wahlweisen einstellung des zugangs zu anwendungsmerkmalen
DE60223296D1 (de) Verfahren zur Erzeugung von Passwörtern aus biometrischen Daten
ATE455332T1 (de) Verfahren und computersystem zur abfrageverarbeitung
DE50307634D1 (de) Vorrichtung zur Herstellung von Klebebindungen von Blocks und Broschüren insbesondere für Kleinauflagen
ATE367036T1 (de) Verfahren und vorrichtung zur bereitstellung elektronischer post an ein mobiles gerät
ATE556371T1 (de) System zur automatischen bearbeitung von bestandteilen einer vorrichtung
ATE386989T1 (de) Verfahren und vorrichtung zum dekodieren handschriftlicher zeichen
DE60128270D1 (de) Verfahren und System zur Erzeugung von Sprechererkennungsdaten, und Verfahren und System zur Sprechererkennung
ATE362395T1 (de) Vorrichtung und verfahren zur herstellung von partikeln
DE60109956D1 (de) Vorrichtung und verfahren zur telefonie-basierten spracherkennung für das bereitstellen von informationen zum sortieren von poststücken und paketen.
ATE292302T1 (de) Vorrichtung und verfahren in einer büroapplikation zur bereitstellung von inhaltsabhängiger hilfeinformation
DE60224763D1 (de) Verfahren und Gerät zur Dateisuche, und Verfahren und Vorrichtung zur Erzeugung von Indexdateien
DE60214850D1 (de) Für eine benutzergruppe spezifisches musterverarbeitungssystem
DE60327020D1 (de) Vorrichtung, Verfahren und computerlesbares Aufzeichnungsmedium zur Erkennung von Schlüsselwörtern in spontaner Sprache
DE60327400D1 (de) Verfahren und Vorrichtung zur Erzeugung von Entscheidungsbaumfragen für die Sprachverarbeitung
DE59902143D1 (de) Verfahren und vorrichtung zur ausgabe von informationen und/oder meldungen per sprache
DE602005006612D1 (de) Verfahren zur benutzeridentifizierung mittels veränderter biometrischer eigenschaften und datenbank zur ausführung dieses verfahrens
ATE305825T1 (de) Verfahren und vorrichtung zur bearbeitung von postsendungen

Legal Events

Date Code Title Description
8381 Inventor (new situation)

Inventor name: JANAKIRAMAN, JANANI, AUSTIN, TEXAS, US

Inventor name: ATKIN, STEVEN EDWARD, WINCHESTER HAMPSHIRE, GB

Inventor name: KUMHYR, DAVID BRUCE, AUSTIN, TEXAS, US

8364 No opposition during term of opposition
8320 Willingness to grant licences declared (paragraph 23)