ATE372572T1 - APPARATUS AND METHOD FOR CONFIGURATION OF VOICE READERS USING SEMANTIC ANALYSIS

APPARATUS AND METHOD FOR CONFIGURATION OF VOICE READERS USING SEMANTIC ANALYSIS

Info

Publication number
ATE372572T1
ATE372572T1 AT04741720T AT04741720T ATE372572T1 AT E372572 T1 ATE372572 T1 AT E372572T1 AT 04741720 T AT04741720 T AT 04741720T AT 04741720 T AT04741720 T AT 04741720T AT E372572 T1 ATE372572 T1 AT E372572T1
Authority
AT
Austria
Prior art keywords
semantic
voice
text block
text
identifier
Application number
AT04741720T
Other languages
German (de)
Inventor
Steven Atkin
Janani Janakiraman
David Kumhyr
Original Assignee
IBM
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2003-06-19
Filing date
2004-06-11
Publication date
2007-09-15
Application filed by IBM

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/08 Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/02 Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04 Details of speech synthesis systems, e.g. synthesiser structure or memory management

Abstract

A system and method for using semantic analysis to configure a voice reader is presented. A text file includes a plurality of text blocks, such as paragraphs. Processing performs semantic analysis on each text block in order to match the text block's semantic content with a semantic identifier. Once processing matches a semantic identifier with the text block, processing retrieves the voice attributes that correspond to the semantic identifier (i.e., a pitch value, a loudness value, and a pace value) and provides them to a voice reader. The voice reader then uses the text block to produce a synthesized voice signal whose properties correspond to the voice attributes. The text block may also include semantic tags, in which case processing performs latent semantic indexing on the semantic tags in order to match semantic identifiers to them.
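The flow described in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patented implementation: the keyword sets, the identifier names ("urgent", "calm", "neutral"), the attribute scales, and the function names are all hypothetical, and simple keyword overlap stands in for the semantic analysis (the abstract mentions latent semantic indexing, which is not implemented here).

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class VoiceAttributes:
    """Pitch, loudness, and pace values handed to a voice reader.

    The numeric scale (1.0 = default voice) is an assumption for this sketch.
    """
    pitch: float
    loudness: float
    pace: float


# Hypothetical semantic identifiers and the keywords that suggest them.
SEMANTIC_KEYWORDS = {
    "urgent": {"warning", "danger", "immediately", "alert"},
    "calm": {"relax", "gentle", "quiet", "peaceful"},
}

# Hypothetical lookup table: semantic identifier -> voice attributes.
VOICE_TABLE = {
    "urgent": VoiceAttributes(pitch=1.2, loudness=1.5, pace=1.3),
    "calm": VoiceAttributes(pitch=0.9, loudness=0.8, pace=0.8),
    "neutral": VoiceAttributes(pitch=1.0, loudness=1.0, pace=1.0),
}


def match_semantic_identifier(text_block: str) -> str:
    """Pick the identifier whose keywords overlap the block the most.

    Keyword overlap is a stand-in for real semantic analysis; blocks with
    no overlap at all fall back to "neutral".
    """
    words = set(re.findall(r"[a-z']+", text_block.lower()))
    best_id, best_score = "neutral", 0
    for identifier, keywords in SEMANTIC_KEYWORDS.items():
        score = len(words & keywords)
        if score > best_score:
            best_id, best_score = identifier, score
    return best_id


def configure_reader(text_file: str) -> list[tuple[str, VoiceAttributes]]:
    """Split a text file into paragraph blocks and attach voice attributes."""
    blocks = [b.strip() for b in re.split(r"\n\s*\n", text_file) if b.strip()]
    return [(b, VOICE_TABLE[match_semantic_identifier(b)]) for b in blocks]


if __name__ == "__main__":
    sample = (
        "Warning: leave immediately.\n\n"
        "Relax in the quiet garden.\n\n"
        "The meeting is at noon."
    )
    for block, attrs in configure_reader(sample):
        print(attrs, "<-", block)
```

Each (text block, attributes) pair would then be passed to whatever speech synthesizer is in use; that hand-off is left out because the record does not specify a synthesizer interface.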
AT04741720T 2003-06-19 2004-06-11 APPARATUS AND METHOD FOR CONFIGURATION OF VOICE READERS USING SEMANTIC ANALYSIS ATE372572T1 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US10/464,881 US20040260551A1 (en) 2003-06-19 2003-06-19 System and method for configuring voice readers using semantic analysis

Publications (1)

Publication Number Publication Date
ATE372572T1 (en) 2007-09-15

Family

ID=33517358

Family Applications (1)

Application Number Title Priority Date Filing Date
AT04741720T ATE372572T1 (en) 2003-06-19 2004-06-11 APPARATUS AND METHOD FOR CONFIGURATION OF VOICE READERS USING SEMANTIC ANALYSIS

Country Status (8)

Country Link
US (2) US20040260551A1 (en)
EP (1) EP1636790B1 (en)
KR (1) KR100745443B1 (en)
CN (1) CN1788305B (en)
AT (1) ATE372572T1 (en)
DE (1) DE602004008776T2 (en)
IL (1) IL172518A (en)
WO (1) WO2004111997A1 (en)

Families Citing this family (43)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050096909A1 (en) * 2003-10-29 2005-05-05 Raimo Bakis Systems and methods for expressive text-to-speech
US20050125236A1 (en) * 2003-12-08 2005-06-09 International Business Machines Corporation Automatic capture of intonation cues in audio segments for speech applications
US7672436B1 (en) * 2004-01-23 2010-03-02 Sprint Spectrum L.P. Voice rendering of E-mail with tags for improved user experience
US9236043B2 (en) * 2004-04-02 2016-01-12 Knfb Reader, Llc Document mode processing for portable reading machine enabling document navigation
KR100669241B1 (en) * 2004-12-15 2007-01-15 한국전자통신연구원 System and method of synthesizing dialog-style speech using speech-act information
US20080086490A1 (en) * 2006-10-04 2008-04-10 Sap Ag Discovery of services matching a service request
CN101226523B (en) * 2007-01-17 2012-09-05 国际商业机器公司 Method and system for analyzing data general condition
US20090164387A1 (en) * 2007-04-17 2009-06-25 Semandex Networks Inc. Systems and methods for providing semantically enhanced financial information
US20090204402A1 (en) * 2008-01-09 2009-08-13 8 Figure, Llc Method and apparatus for creating customized podcasts with multiple text-to-speech voices
US8112742B2 (en) * 2008-05-12 2012-02-07 Expressor Software Method and system for debugging data integration applications with reusable synthetic data values
DE102008060301B4 (en) * 2008-12-03 2012-05-03 Grenzebach Maschinenbau Gmbh Method and device for non-positive connection of vitreous components with metals and computer program and machine-readable carrier for carrying out the method
US8903847B2 (en) * 2010-03-05 2014-12-02 International Business Machines Corporation Digital media voice tags in social networks
US8645141B2 (en) * 2010-09-14 2014-02-04 Sony Corporation Method and system for text to speech conversion
US9734637B2 (en) * 2010-12-06 2017-08-15 Microsoft Technology Licensing, Llc Semantic rigging of avatars
CN102543068A (en) * 2010-12-31 2012-07-04 北大方正集团有限公司 Method and device for speech broadcast of text information
US9286886B2 (en) * 2011-01-24 2016-03-15 Nuance Communications, Inc. Methods and apparatus for predicting prosody in speech synthesis
US20120244842A1 (en) 2011-03-21 2012-09-27 International Business Machines Corporation Data Session Synchronization With Phone Numbers
US20120246238A1 (en) 2011-03-21 2012-09-27 International Business Machines Corporation Asynchronous messaging tags
US8688090B2 (en) 2011-03-21 2014-04-01 International Business Machines Corporation Data session preferences
CN102752019B (en) * 2011-04-20 2015-01-28 深圳盒子支付信息技术有限公司 Data sending, receiving and transmitting method and system based on headset jack
US9159313B2 (en) * 2012-04-03 2015-10-13 Sony Corporation Playback control apparatus, playback control method, and medium for playing a program including segments generated using speech synthesis and segments not generated using speech synthesis
US9183849B2 (en) 2012-12-21 2015-11-10 The Nielsen Company (Us), Llc Audio matching with semantic audio recognition and report generation
US9195649B2 (en) 2012-12-21 2015-11-24 The Nielsen Company (Us), Llc Audio processing techniques for semantic audio recognition and report generation
US9158760B2 (en) 2012-12-21 2015-10-13 The Nielsen Company (Us), Llc Audio decoding with supplemental semantic audio recognition and report generation
CN104281566A (en) * 2014-10-13 2015-01-14 安徽华贞信息科技有限公司 Semantic text description method and semantic text description system
CN104978961B (en) * 2015-05-25 2019-10-15 广州酷狗计算机科技有限公司 A kind of audio-frequency processing method, device and terminal
CN105096932A (en) * 2015-07-14 2015-11-25 百度在线网络技术(北京)有限公司 Voice synthesis method and apparatus of talking book
US10235989B2 (en) * 2016-03-24 2019-03-19 Oracle International Corporation Sonification of words and phrases by text mining based on frequency of occurrence
CN105741829A (en) * 2016-04-28 2016-07-06 玉环看知信息科技有限公司 Data conversion method and data conversion device
CN106384586A (en) * 2016-09-07 2017-02-08 北京小米移动软件有限公司 Method and device for reading text information
CN107886939B (en) * 2016-09-30 2021-03-30 北京京东尚科信息技术有限公司 Pause-continue type text voice playing method and device at client
US11295738B2 (en) 2016-12-30 2022-04-05 Google, Llc Modulation of packetized audio signals
US10347247B2 (en) 2016-12-30 2019-07-09 Google Llc Modulation of packetized audio signals
CN108305611B (en) * 2017-06-27 2022-02-11 腾讯科技(深圳)有限公司 Text-to-speech method, device, storage medium and computer equipment
CN108962219B (en) * 2018-06-29 2019-12-13 百度在线网络技术(北京)有限公司 Method and device for processing text
US11145289B1 (en) * 2018-09-28 2021-10-12 United Services Automobile Association (Usaa) System and method for providing audible explanation of documents upon request
KR102360840B1 (en) * 2019-06-21 2022-02-09 주식회사 딥브레인에이아이 Method and apparatus for generating speech video of using a text
WO2020256475A1 (en) * 2019-06-21 2020-12-24 주식회사 머니브레인 Method and device for generating speech video by using text
CN111291572B (en) * 2020-01-20 2023-06-09 Oppo广东移动通信有限公司 Text typesetting method and device and computer readable storage medium
CN111667815B (en) * 2020-06-04 2023-09-01 上海肇观电子科技有限公司 Method, apparatus, chip circuit and medium for text-to-speech conversion
US11356792B2 (en) * 2020-06-24 2022-06-07 International Business Machines Corporation Selecting a primary source of text to speech based on posture
US20220222437A1 (en) * 2021-01-08 2022-07-14 Nice Ltd. Systems and methods for structured phrase embedding and use thereof
US11907324B2 (en) * 2022-04-29 2024-02-20 Docusign, Inc. Guided form generation in a document management system

Family Cites Families (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5029214A (en) * 1986-08-11 1991-07-02 Hollander James F Electronic speech control apparatus and methods
US4839853A (en) * 1988-09-15 1989-06-13 Bell Communications Research, Inc. Computer information retrieval using latent semantic structure
US5761640A (en) * 1995-12-18 1998-06-02 Nynex Science & Technology, Inc. Name and address processor
JPH10153998A (en) * 1996-09-24 1998-06-09 Nippon Telegr & Teleph Corp <Ntt> Auxiliary information utilizing type voice synthesizing method, recording medium recording procedure performing this method, and device performing this method
US6226614B1 (en) * 1997-05-21 2001-05-01 Nippon Telegraph And Telephone Corporation Method and apparatus for editing/creating synthetic speech message and recording medium with the method recorded thereon
US6108627A (en) * 1997-10-31 2000-08-22 Nortel Networks Corporation Automatic transcription tool
US6119086A (en) * 1998-04-28 2000-09-12 International Business Machines Corporation Speech coding via speech recognition and synthesis based on pre-enrolled phonetic tokens
JPH11327870A (en) * 1998-05-15 1999-11-30 Fujitsu Ltd Device for reading-aloud document, reading-aloud control method and recording medium
JP3180764B2 (en) * 1998-06-05 2001-06-25 日本電気株式会社 Speech synthesizer
US6446040B1 (en) 1998-06-17 2002-09-03 Yahoo! Inc. Intelligent text-to-speech synthesis
JP2000105595A (en) * 1998-09-30 2000-04-11 Victor Co Of Japan Ltd Singing device and recording medium
US6587822B2 (en) * 1998-10-06 2003-07-01 Lucent Technologies Inc. Web-based platform for interactive voice response (IVR)
US6405199B1 (en) * 1998-10-30 2002-06-11 Novell, Inc. Method and apparatus for semantic token generation based on marked phrases in a content stream
JP2000206982A (en) * 1999-01-12 2000-07-28 Toshiba Corp Speech synthesizer and machine readable recording medium which records sentence to speech converting program
JP2001014306A (en) * 1999-06-30 2001-01-19 Sony Corp Method and device for electronic document processing, and recording medium where electronic document processing program is recorded
US6993476B1 (en) * 1999-08-26 2006-01-31 International Business Machines Corporation System and method for incorporating semantic characteristics into the format-driven syntactic document transcoding framework
US6725190B1 (en) * 1999-11-02 2004-04-20 International Business Machines Corporation Method and system for speech reconstruction from speech recognition features, pitch and voicing with resampled basis functions providing reconstruction of the spectral envelope
JP3515039B2 (en) * 2000-03-03 2004-04-05 沖電気工業株式会社 Pitch pattern control method in text-to-speech converter
US7010489B1 (en) * 2000-03-09 2006-03-07 International Business Machines Corporation Method for guiding text-to-speech output timing using speech recognition markers
US6856958B2 (en) * 2000-09-05 2005-02-15 Lucent Technologies Inc. Methods and apparatus for text to speech processing using language independent prosody markup
US20040054973A1 (en) * 2000-10-02 2004-03-18 Akio Yamamoto Method and apparatus for transforming contents on the web
GB0029576D0 (en) * 2000-12-02 2001-01-17 Hewlett Packard Co Voice site personality setting
JP2002333895A (en) * 2001-05-10 2002-11-22 Sony Corp Information processor and information processing method, recording medium and program
GB0113570D0 (en) * 2001-06-04 2001-07-25 Hewlett Packard Co Audio-form presentation of text messages
JP4680429B2 (en) * 2001-06-26 2011-05-11 Okiセミコンダクタ株式会社 High speed reading control method in text-to-speech converter
US20030125929A1 (en) * 2001-12-10 2003-07-03 Thomas Bergstraesser Services for context-sensitive flagging of information in natural language text and central management of metadata relating that information over a computer network
EP1473639A1 (en) * 2002-02-04 2004-11-03 Celestar Lexico-Sciences, Inc. Document knowledge management apparatus and method
US7096183B2 (en) * 2002-02-27 2006-08-22 Matsushita Electric Industrial Co., Ltd. Customizing the speaking style of a speech synthesizer based on semantic analysis
JP4150198B2 (en) * 2002-03-15 2008-09-17 ソニー株式会社 Speech synthesis method, speech synthesis apparatus, program and recording medium, and robot apparatus
JP2004226711A (en) * 2003-01-23 2004-08-12 Xanavi Informatics Corp Voice output device and navigation device

Also Published As

Publication number Publication date
CN1788305B (en) 2011-05-04
EP1636790A1 (en) 2006-03-22
DE602004008776D1 (en) 2007-10-18
WO2004111997A1 (en) 2004-12-23
US20070276667A1 (en) 2007-11-29
US20040260551A1 (en) 2004-12-23
CN1788305A (en) 2006-06-14
KR20060020632A (en) 2006-03-06
DE602004008776T2 (en) 2008-06-12
KR100745443B1 (en) 2007-08-03
IL172518A0 (en) 2006-04-10
IL172518A (en) 2011-04-28
EP1636790B1 (en) 2007-09-05

Similar Documents

Publication Publication Date Title
ATE372572T1 (en) APPARATUS AND METHOD FOR CONFIGURATION OF VOICE READERS USING SEMANTIC ANALYSIS
ATE413751T1 (en) METHOD AND APPARATUS FOR TWO-LEVEL PACKET CLASSIFICATION USING SPECIFIC FILTER ADAPTATION AND SHARING AT THE TRANSPORT LEVEL
ATE551656T1 (en) METHOD AND DEVICE FOR IDENTIFYING NEW MEDIA CONTENT
DE60223296D1 (en) Method for generating passwords from biometric data
ATE233935T1 (en) DEVICE AND METHOD FOR DISTINGUISHING SIMILAR SOUNDING WORDS IN SPEECH RECOGNITION
DE60043746D1 (en) SYSTEM FOR IDENTIFICATION OF DISTRIBUTED CONTENTS
ATE352071T1 (en) METHOD AND DEVICE FOR SELECTIVELY SETTING ACCESS TO APPLICATION FEATURES
DE60130430D1 (en) METHOD AND DEVICE FOR INFORMATION PROCESSING
DE60330955D1 (en) Method and computer system for query processing
WO2004029755A3 (en) Automated report building system
DE69806492D1 (en) SYSTEM, METHOD AND PROGRAM DATA CARRIER FOR THE DISPLAY OF COMPLEX INFORMATION AS SOUND
ATE325376T1 METHOD AND APPARATUS FOR PRODUCING PHYSICAL SECURITY OF A USER ACCOUNT AND PROVIDING ACCESS TO A USER'S ENVIRONMENT AND PREFERENCES
EP1349123A3 (en) Secure identity and privilege system
DE60225170D1 METHOD AND DEVICE FOR DECODING HANDWRITTEN CHARACTERS
ATE556371T1 (en) SYSTEM FOR AUTOMATICALLY PROCESSING COMPONENTS OF A DEVICE
ATE362395T1 (en) DEVICE AND METHOD FOR PRODUCING PARTICLES
ATE292302T1 (en) APPARATUS AND METHOD IN AN OFFICE APPLICATION FOR PROVIDING CONTENT-DEPENDENT HELP INFORMATION
DE60128270D1 (en) Method and system for generating speaker recognition data, and method and system for speaker recognition
DE60214850D1 (en) FOR A USER GROUP, SPECIFIC PATTERN PROCESSING SYSTEM
DE60327020D1 (en) Apparatus, method and computer readable recording medium for recognizing keywords in spontaneous speech
DE60327400D1 (en) Method and apparatus for generating decision tree questions for speech processing
ATE218728T1 (en) METHOD FOR PRODUCING CARD-SHAPED DATA CARRIERS
DE602005006612D1 (en) METHOD FOR USER IDENTIFICATION USING CHANGED BIOMETRIC PROPERTIES AND DATABASE FOR CARRYING OUT THIS METHOD
ATE305825T1 (en) METHOD AND DEVICE FOR PROCESSING MAIL
DE59603224D1 METHOD AND DEVICE FOR PRODUCING HIGH VISCOSITY OR HIGHLY STABILIZED, REACTION-STABLE POLYAMIDES AND FOR CONTINUOUSLY DEMONOMERIZING POLYAMIDES

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties