ATE372572T1 - APPARATUS AND METHOD FOR CONFIGURATION OF VOICE READERS USING SEMANTIC ANALYSIS - Google Patents
- Publication number
- ATE372572T1
- Authority
- AT
- Austria
- Prior art keywords
- semantic
- voice
- text block
- text
- identifier
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/04—Details of speech synthesis systems, e.g. synthesiser structure or memory management
Abstract
A system and method for using semantic analysis to configure a voice reader are presented. A text file includes a plurality of text blocks, such as paragraphs. Processing performs semantic analysis on each text block in order to match the text block's semantic content with a semantic identifier. Once processing matches a semantic identifier with the text block, processing retrieves the voice attributes (i.e., a pitch value, a loudness value, and a pace value) that correspond to the semantic identifier and provides them to a voice reader. The voice reader uses the text block to produce a synthesized voice signal with properties that correspond to the voice attributes. The text block may include semantic tags, in which case processing performs latent semantic indexing on the semantic tags in order to match semantic identifiers to them.
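The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the semantic identifiers (`urgent`, `calm`, `neutral`), the keyword profiles, the attribute values, and every function name are hypothetical, and a simple bag-of-words cosine similarity stands in for the semantic analysis (it is not latent semantic indexing). Text blocks are assumed to be paragraphs separated by blank lines.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class VoiceAttributes:
    """Relative voice settings handed to a voice reader (1.0 = default)."""
    pitch: float
    loudness: float
    pace: float


# Hypothetical semantic identifiers, each described by a few keywords.
SEMANTIC_PROFILES = {
    "urgent": "warning danger immediately alert emergency evacuate",
    "calm": "quiet gentle slowly peaceful rest evening",
}

# Hypothetical lookup table: semantic identifier -> voice attributes.
VOICE_TABLE = {
    "urgent": VoiceAttributes(pitch=1.3, loudness=1.4, pace=1.25),
    "calm": VoiceAttributes(pitch=0.9, loudness=0.8, pace=0.85),
    "neutral": VoiceAttributes(pitch=1.0, loudness=1.0, pace=1.0),
}


def tokenize(text: str) -> Counter:
    """Lowercase the text and count its alphabetic tokens."""
    return Counter(re.findall(r"[a-z']+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two token-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def match_semantic_identifier(block: str, threshold: float = 0.1) -> str:
    """Return the best-matching identifier, or "neutral" below the threshold."""
    block_vec = tokenize(block)
    best_id, best_score = "neutral", threshold
    for identifier, profile in SEMANTIC_PROFILES.items():
        score = cosine(block_vec, tokenize(profile))
        if score > best_score:
            best_id, best_score = identifier, score
    return best_id


def configure_reader(text_file: str) -> list[tuple[str, VoiceAttributes]]:
    """Pair each text block with the voice attributes of its identifier."""
    blocks = [b.strip() for b in text_file.split("\n\n") if b.strip()]
    return [(b, VOICE_TABLE[match_semantic_identifier(b)]) for b in blocks]
```

A voice reader would then iterate over the result of `configure_reader` and apply each block's pitch, loudness, and pace before synthesizing that block. Replacing `match_semantic_identifier` with a latent-semantic-indexing matcher over semantic tags would leave the rest of the flow unchanged.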
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/464,881 US20040260551A1 (en) | 2003-06-19 | 2003-06-19 | System and method for configuring voice readers using semantic analysis |
Publications (1)
Publication Number | Publication Date |
---|---|
ATE372572T1 (en) | 2007-09-15 |
Family
ID=33517358
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AT04741720T ATE372572T1 (en) | 2003-06-19 | 2004-06-11 | APPARATUS AND METHOD FOR CONFIGURATION OF VOICE READERS USING SEMANTIC ANALYSIS |
Country Status (8)
Country | Link |
---|---|
US (2) | US20040260551A1 (en) |
EP (1) | EP1636790B1 (en) |
KR (1) | KR100745443B1 (en) |
CN (1) | CN1788305B (en) |
AT (1) | ATE372572T1 (en) |
DE (1) | DE602004008776T2 (en) |
IL (1) | IL172518A (en) |
WO (1) | WO2004111997A1 (en) |
Families Citing this family (43)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050096909A1 (en) * | 2003-10-29 | 2005-05-05 | Raimo Bakis | Systems and methods for expressive text-to-speech |
US20050125236A1 (en) * | 2003-12-08 | 2005-06-09 | International Business Machines Corporation | Automatic capture of intonation cues in audio segments for speech applications |
US7672436B1 (en) * | 2004-01-23 | 2010-03-02 | Sprint Spectrum L.P. | Voice rendering of E-mail with tags for improved user experience |
US9236043B2 (en) * | 2004-04-02 | 2016-01-12 | Knfb Reader, Llc | Document mode processing for portable reading machine enabling document navigation |
KR100669241B1 (en) * | 2004-12-15 | 2007-01-15 | 한국전자통신연구원 | System and method of synthesizing dialog-style speech using speech-act information |
US20080086490A1 (en) * | 2006-10-04 | 2008-04-10 | Sap Ag | Discovery of services matching a service request |
CN101226523B (en) * | 2007-01-17 | 2012-09-05 | 国际商业机器公司 | Method and system for analyzing data general condition |
US20090164387A1 (en) * | 2007-04-17 | 2009-06-25 | Semandex Networks Inc. | Systems and methods for providing semantically enhanced financial information |
US20090204402A1 (en) * | 2008-01-09 | 2009-08-13 | 8 Figure, Llc | Method and apparatus for creating customized podcasts with multiple text-to-speech voices |
US8112742B2 (en) * | 2008-05-12 | 2012-02-07 | Expressor Software | Method and system for debugging data integration applications with reusable synthetic data values |
DE102008060301B4 (en) * | 2008-12-03 | 2012-05-03 | Grenzebach Maschinenbau Gmbh | Method and device for non-positive connection of vitreous components with metals and computer program and machine-readable carrier for carrying out the method |
US8903847B2 (en) * | 2010-03-05 | 2014-12-02 | International Business Machines Corporation | Digital media voice tags in social networks |
US8645141B2 (en) * | 2010-09-14 | 2014-02-04 | Sony Corporation | Method and system for text to speech conversion |
US9734637B2 (en) * | 2010-12-06 | 2017-08-15 | Microsoft Technology Licensing, Llc | Semantic rigging of avatars |
CN102543068A (en) * | 2010-12-31 | 2012-07-04 | 北大方正集团有限公司 | Method and device for speech broadcast of text information |
US9286886B2 (en) * | 2011-01-24 | 2016-03-15 | Nuance Communications, Inc. | Methods and apparatus for predicting prosody in speech synthesis |
US20120244842A1 (en) | 2011-03-21 | 2012-09-27 | International Business Machines Corporation | Data Session Synchronization With Phone Numbers |
US20120246238A1 (en) | 2011-03-21 | 2012-09-27 | International Business Machines Corporation | Asynchronous messaging tags |
US8688090B2 (en) | 2011-03-21 | 2014-04-01 | International Business Machines Corporation | Data session preferences |
CN102752019B (en) * | 2011-04-20 | 2015-01-28 | 深圳盒子支付信息技术有限公司 | Data sending, receiving and transmitting method and system based on headset jack |
US9159313B2 (en) * | 2012-04-03 | 2015-10-13 | Sony Corporation | Playback control apparatus, playback control method, and medium for playing a program including segments generated using speech synthesis and segments not generated using speech synthesis |
US9183849B2 (en) | 2012-12-21 | 2015-11-10 | The Nielsen Company (Us), Llc | Audio matching with semantic audio recognition and report generation |
US9195649B2 (en) | 2012-12-21 | 2015-11-24 | The Nielsen Company (Us), Llc | Audio processing techniques for semantic audio recognition and report generation |
US9158760B2 (en) | 2012-12-21 | 2015-10-13 | The Nielsen Company (Us), Llc | Audio decoding with supplemental semantic audio recognition and report generation |
CN104281566A (en) * | 2014-10-13 | 2015-01-14 | 安徽华贞信息科技有限公司 | Semantic text description method and semantic text description system |
CN104978961B (en) * | 2015-05-25 | 2019-10-15 | 广州酷狗计算机科技有限公司 | A kind of audio-frequency processing method, device and terminal |
CN105096932A (en) * | 2015-07-14 | 2015-11-25 | 百度在线网络技术(北京)有限公司 | Voice synthesis method and apparatus of talking book |
US10235989B2 (en) * | 2016-03-24 | 2019-03-19 | Oracle International Corporation | Sonification of words and phrases by text mining based on frequency of occurrence |
CN105741829A (en) * | 2016-04-28 | 2016-07-06 | 玉环看知信息科技有限公司 | Data conversion method and data conversion device |
CN106384586A (en) * | 2016-09-07 | 2017-02-08 | 北京小米移动软件有限公司 | Method and device for reading text information |
CN107886939B (en) * | 2016-09-30 | 2021-03-30 | 北京京东尚科信息技术有限公司 | Pause-continue type text voice playing method and device at client |
US11295738B2 (en) | 2016-12-30 | 2022-04-05 | Google, Llc | Modulation of packetized audio signals |
US10347247B2 (en) | 2016-12-30 | 2019-07-09 | Google Llc | Modulation of packetized audio signals |
CN108305611B (en) * | 2017-06-27 | 2022-02-11 | 腾讯科技(深圳)有限公司 | Text-to-speech method, device, storage medium and computer equipment |
CN108962219B (en) * | 2018-06-29 | 2019-12-13 | 百度在线网络技术(北京)有限公司 | Method and device for processing text |
US11145289B1 (en) * | 2018-09-28 | 2021-10-12 | United Services Automobile Association (Usaa) | System and method for providing audible explanation of documents upon request |
KR102360840B1 (en) * | 2019-06-21 | 2022-02-09 | 주식회사 딥브레인에이아이 | Method and apparatus for generating speech video of using a text |
WO2020256475A1 (en) * | 2019-06-21 | 2020-12-24 | 주식회사 머니브레인 | Method and device for generating speech video by using text |
CN111291572B (en) * | 2020-01-20 | 2023-06-09 | Oppo广东移动通信有限公司 | Text typesetting method and device and computer readable storage medium |
CN111667815B (en) * | 2020-06-04 | 2023-09-01 | 上海肇观电子科技有限公司 | Method, apparatus, chip circuit and medium for text-to-speech conversion |
US11356792B2 (en) * | 2020-06-24 | 2022-06-07 | International Business Machines Corporation | Selecting a primary source of text to speech based on posture |
US20220222437A1 (en) * | 2021-01-08 | 2022-07-14 | Nice Ltd. | Systems and methods for structured phrase embedding and use thereof |
US11907324B2 (en) * | 2022-04-29 | 2024-02-20 | Docusign, Inc. | Guided form generation in a document management system |
Family Cites Families (30)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5029214A (en) * | 1986-08-11 | 1991-07-02 | Hollander James F | Electronic speech control apparatus and methods |
US4839853A (en) * | 1988-09-15 | 1989-06-13 | Bell Communications Research, Inc. | Computer information retrieval using latent semantic structure |
US5761640A (en) * | 1995-12-18 | 1998-06-02 | Nynex Science & Technology, Inc. | Name and address processor |
JPH10153998A (en) * | 1996-09-24 | 1998-06-09 | Nippon Telegr & Teleph Corp <Ntt> | Auxiliary information utilizing type voice synthesizing method, recording medium recording procedure performing this method, and device performing this method |
US6226614B1 (en) * | 1997-05-21 | 2001-05-01 | Nippon Telegraph And Telephone Corporation | Method and apparatus for editing/creating synthetic speech message and recording medium with the method recorded thereon |
US6108627A (en) * | 1997-10-31 | 2000-08-22 | Nortel Networks Corporation | Automatic transcription tool |
US6119086A (en) * | 1998-04-28 | 2000-09-12 | International Business Machines Corporation | Speech coding via speech recognition and synthesis based on pre-enrolled phonetic tokens |
JPH11327870A (en) * | 1998-05-15 | 1999-11-30 | Fujitsu Ltd | Device for reading-aloud document, reading-aloud control method and recording medium |
JP3180764B2 (en) * | 1998-06-05 | 2001-06-25 | 日本電気株式会社 | Speech synthesizer |
US6446040B1 (en) | 1998-06-17 | 2002-09-03 | Yahoo! Inc. | Intelligent text-to-speech synthesis |
JP2000105595A (en) * | 1998-09-30 | 2000-04-11 | Victor Co Of Japan Ltd | Singing device and recording medium |
US6587822B2 (en) * | 1998-10-06 | 2003-07-01 | Lucent Technologies Inc. | Web-based platform for interactive voice response (IVR) |
US6405199B1 (en) * | 1998-10-30 | 2002-06-11 | Novell, Inc. | Method and apparatus for semantic token generation based on marked phrases in a content stream |
JP2000206982A (en) * | 1999-01-12 | 2000-07-28 | Toshiba Corp | Speech synthesizer and machine readable recording medium which records sentence to speech converting program |
JP2001014306A (en) * | 1999-06-30 | 2001-01-19 | Sony Corp | Method and device for electronic document processing, and recording medium where electronic document processing program is recorded |
US6993476B1 (en) * | 1999-08-26 | 2006-01-31 | International Business Machines Corporation | System and method for incorporating semantic characteristics into the format-driven syntactic document transcoding framework |
US6725190B1 (en) * | 1999-11-02 | 2004-04-20 | International Business Machines Corporation | Method and system for speech reconstruction from speech recognition features, pitch and voicing with resampled basis functions providing reconstruction of the spectral envelope |
JP3515039B2 (en) * | 2000-03-03 | 2004-04-05 | 沖電気工業株式会社 | Pitch pattern control method in text-to-speech converter |
US7010489B1 (en) * | 2000-03-09 | 2006-03-07 | International Business Machines Corporation | Method for guiding text-to-speech output timing using speech recognition markers |
US6856958B2 (en) * | 2000-09-05 | 2005-02-15 | Lucent Technologies Inc. | Methods and apparatus for text to speech processing using language independent prosody markup |
US20040054973A1 (en) * | 2000-10-02 | 2004-03-18 | Akio Yamamoto | Method and apparatus for transforming contents on the web |
GB0029576D0 (en) * | 2000-12-02 | 2001-01-17 | Hewlett Packard Co | Voice site personality setting |
JP2002333895A (en) * | 2001-05-10 | 2002-11-22 | Sony Corp | Information processor and information processing method, recording medium and program |
GB0113570D0 (en) * | 2001-06-04 | 2001-07-25 | Hewlett Packard Co | Audio-form presentation of text messages |
JP4680429B2 (en) * | 2001-06-26 | 2011-05-11 | Okiセミコンダクタ株式会社 | High speed reading control method in text-to-speech converter |
US20030125929A1 (en) * | 2001-12-10 | 2003-07-03 | Thomas Bergstraesser | Services for context-sensitive flagging of information in natural language text and central management of metadata relating that information over a computer network |
EP1473639A1 (en) * | 2002-02-04 | 2004-11-03 | Celestar Lexico-Sciences, Inc. | Document knowledge management apparatus and method |
US7096183B2 (en) * | 2002-02-27 | 2006-08-22 | Matsushita Electric Industrial Co., Ltd. | Customizing the speaking style of a speech synthesizer based on semantic analysis |
JP4150198B2 (en) * | 2002-03-15 | 2008-09-17 | ソニー株式会社 | Speech synthesis method, speech synthesis apparatus, program and recording medium, and robot apparatus |
JP2004226711A (en) * | 2003-01-23 | 2004-08-12 | Xanavi Informatics Corp | Voice output device and navigation device |
2003
- 2003-06-19: US application US10/464,881, published as US20040260551A1; status: not active (Abandoned)
2004
- 2004-06-11: AT application AT04741720T, published as ATE372572T1; status: not active (IP Right Cessation)
- 2004-06-11: CN application CN2004800128989A, published as CN1788305B; status: not active (Expired - Fee Related)
- 2004-06-11: WO application PCT/EP2004/051010, published as WO2004111997A1; status: active (IP Right Grant)
- 2004-06-11: EP application EP04741720A, published as EP1636790B1; status: not active (Not-in-force)
- 2004-06-11: KR application KR1020057022069A, published as KR100745443B1; status: not active (IP Right Cessation)
- 2004-06-11: DE application DE602004008776T, published as DE602004008776T2; status: active (Active)
2005
- 2005-12-12: IL application IL172518A, published as IL172518A; status: not active (IP Right Cessation)
2007
- 2007-08-10: US application US11/836,890, published as US20070276667A1; status: not active (Abandoned)
Also Published As
Publication number | Publication date |
---|---|
CN1788305B (en) | 2011-05-04 |
EP1636790A1 (en) | 2006-03-22 |
DE602004008776D1 (en) | 2007-10-18 |
WO2004111997A1 (en) | 2004-12-23 |
US20070276667A1 (en) | 2007-11-29 |
US20040260551A1 (en) | 2004-12-23 |
CN1788305A (en) | 2006-06-14 |
KR20060020632A (en) | 2006-03-06 |
DE602004008776T2 (en) | 2008-06-12 |
KR100745443B1 (en) | 2007-08-03 |
IL172518A0 (en) | 2006-04-10 |
IL172518A (en) | 2011-04-28 |
EP1636790B1 (en) | 2007-09-05 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
ATE372572T1 (en) | APPARATUS AND METHOD FOR CONFIGURATION OF VOICE READERS USING SEMANTIC ANALYSIS | |
ATE413751T1 (en) | METHOD AND APPARATUS FOR TWO-LEVEL PACKET CLASSIFICATION USING SPECIFIC FILTER ADAPTATION AND SHARING AT THE TRANSPORT LEVEL | |
ATE551656T1 (en) | METHOD AND DEVICE FOR IDENTIFYING NEW MEDIA CONTENT | |
DE60223296D1 (en) | Method for generating passwords from biometric data | |
ATE233935T1 (en) | DEVICE AND METHOD FOR DISTINGUISHING SIMILAR SOUNDING WORDS IN SPEECH RECOGNITION | |
DE60043746D1 (en) | SYSTEM FOR IDENTIFICATION OF DISTRIBUTED CONTENTS | |
ATE352071T1 (en) | METHOD AND DEVICE FOR SELECTIVELY SETTING ACCESS TO APPLICATION FEATURES | |
DE60130430D1 (en) | METHOD AND DEVICE FOR INFORMATION PROCESSING | |
DE60330955D1 (en) | Method and computer system for query processing | |
WO2004029755A3 (en) | Automated report building system | |
DE69806492D1 (en) | SYSTEM, METHOD AND PROGRAM DATA CARRIER FOR THE DISPLAY OF COMPLEX INFORMATION AS SOUND | |
ATE325376T1 (en) | METHOD AND APPARATUS FOR PRODUCING PHYSICAL SECURITY OF A USER ACCOUNT AND PROVIDING ACCESS TO A USER'S ENVIRONMENT AND PREFERENCES | |
EP1349123A3 (en) | Secure identity and privilege system | |
DE60225170D1 (en) | METHOD AND DEVICE FOR DECODING HANDWRITTEN SIGNS | |
ATE556371T1 (en) | SYSTEM FOR AUTOMATICALLY PROCESSING COMPONENTS OF A DEVICE | |
ATE362395T1 (en) | DEVICE AND METHOD FOR PRODUCING PARTICLES | |
ATE292302T1 (en) | APPARATUS AND METHOD IN AN OFFICE APPLICATION FOR PROVIDING CONTENT-DEPENDENT HELP INFORMATION | |
DE60128270D1 (en) | Method and system for generating speaker recognition data, and method and system for speaker recognition | |
DE60214850D1 (en) | FOR A USER GROUP, SPECIFIC PATTERN PROCESSING SYSTEM | |
DE60327020D1 (en) | Apparatus, method and computer readable recording medium for recognizing keywords in spontaneous speech | |
DE60327400D1 (en) | Method and apparatus for generating decision tree questions for speech processing | |
ATE218728T1 (en) | METHOD FOR PRODUCING CARD-SHAPED DATA CARRIERS | |
DE602005006612D1 (en) | METHOD FOR USER IDENTIFICATION USING CHANGED BIOMETRIC PROPERTIES AND DATABASE FOR CARRYING OUT THIS METHOD | |
ATE305825T1 (en) | METHOD AND DEVICE FOR PROCESSING MAIL | |
DE59603224D1 (en) | METHOD AND DEVICE FOR PRODUCING HIGH VISCOSITY OR HIGHLY STABILIZED, REACTION-STABLE POLYAMIDES AND FOR CONTINUOUSLY DEMONOMERIZING POLYAMIDES |
Legal Events
Code | Title |
---|---|
RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |