AU2048001A - System and method of templating specific human voices - Google Patents

System and method of templating specific human voices Download PDF

Info

Publication number
AU2048001A
AU2048001A AU20480/01A AU2048001A AU2048001A AU 2048001 A AU2048001 A AU 2048001A AU 20480/01 A AU20480/01 A AU 20480/01A AU 2048001 A AU2048001 A AU 2048001A AU 2048001 A AU2048001 A AU 2048001A
Authority
AU
Australia
Prior art keywords
voice
data
captured
template
specific
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
AU20480/01A
Other languages
English (en)
Inventor
Katherine Axia Keough
Steven J. Keough
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Publication of AU2048001A publication Critical patent/AU2048001A/en
Abandoned legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/033Voice editing, e.g. manipulating the voice of the synthesiser
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/54Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for retrieval
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003Changing voice quality, e.g. pitch or formants
    • G10L21/007Changing voice quality, e.g. pitch or formants characterised by the process used
    • G10L21/013Adapting to target pitch
    • G10L2021/0135Voice conversion or morphing

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Machine Translation (AREA)
  • User Interface Of Digital Computer (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
AU20480/01A 1999-11-23 2000-11-23 System and method of templating specific human voices Abandoned AU2048001A (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US16716899P 1999-11-23 1999-11-23
US60167168 1999-11-23
PCT/US2000/032328 WO2001039180A1 (en) 1999-11-23 2000-11-23 System and method of templating specific human voices

Publications (1)

Publication Number Publication Date
AU2048001A true AU2048001A (en) 2001-06-04

Family

ID=22606225

Family Applications (1)

Application Number Title Priority Date Filing Date
AU20480/01A Abandoned AU2048001A (en) 1999-11-23 2000-11-23 System and method of templating specific human voices

Country Status (13)

Country Link
EP (1) EP1252620A1 (zh)
JP (1) JP2003515768A (zh)
KR (1) KR20020060975A (zh)
CN (1) CN1391690A (zh)
AP (1) AP2002002524A0 (zh)
AU (1) AU2048001A (zh)
BR (1) BR0015773A (zh)
CA (1) CA2392436A1 (zh)
EA (1) EA004079B1 (zh)
IL (1) IL149813A0 (zh)
NO (1) NO20022406L (zh)
WO (1) WO2001039180A1 (zh)
ZA (1) ZA200204036B (zh)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7697827B2 (en) 2005-10-17 2010-04-13 Konicek Jeffrey C User-friendlier interfaces for a camera
WO2008149547A1 (ja) * 2007-06-06 2008-12-11 Panasonic Corporation 声質編集装置および声質編集方法
US9240182B2 (en) * 2013-09-17 2016-01-19 Qualcomm Incorporated Method and apparatus for adjusting detection threshold for activating voice assistant function
US9552810B2 (en) 2015-03-31 2017-01-24 International Business Machines Corporation Customizable and individualized speech recognition settings interface for users with language accents
RU2617918C2 (ru) * 2015-06-19 2017-04-28 Иосиф Исаакович Лившиц Способ формирования образа человека с учетом характеристик его психологического портрета, полученных под контролем полиграфа
KR101963195B1 (ko) * 2017-06-21 2019-03-28 구동하 사용자 음성을 이용한 생리 주기 결정 방법 및 이를 실행하는 서버
US11099540B2 (en) 2017-09-15 2021-08-24 Kohler Co. User identity in household appliances
US10887125B2 (en) 2017-09-15 2021-01-05 Kohler Co. Bathroom speaker
US10448762B2 (en) 2017-09-15 2019-10-22 Kohler Co. Mirror
US11093554B2 (en) 2017-09-15 2021-08-17 Kohler Co. Feedback for water consuming appliance
US11314214B2 (en) 2017-09-15 2022-04-26 Kohler Co. Geographic analysis of water conditions
CN109298642B (zh) * 2018-09-20 2021-08-27 三星电子(中国)研发中心 采用智能音箱进行监控的方法及装置
KR102466736B1 (ko) * 2021-06-18 2022-11-14 주식회사 한글과컴퓨터 사용자에 의해 입력된 음성을 기초로 본인 인증을 수행하는 음성 기반의 사용자 인증 서버 및 그 동작 방법

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5007081A (en) * 1989-01-05 1991-04-09 Origin Technology, Inc. Speech activated telephone
US5594789A (en) * 1994-10-13 1997-01-14 Bell Atlantic Network Services, Inc. Transaction implementation in video dial tone network
US5717828A (en) * 1995-03-15 1998-02-10 Syracuse Language Systems Speech recognition apparatus and method for learning
US5774841A (en) * 1995-09-20 1998-06-30 The United States Of America As Represented By The Adminstrator Of The National Aeronautics And Space Administration Real-time reconfigurable adaptive speech recognition command and control apparatus and method

Also Published As

Publication number Publication date
IL149813A0 (en) 2002-11-10
ZA200204036B (en) 2003-08-21
BR0015773A (pt) 2002-08-06
KR20020060975A (ko) 2002-07-19
WO2001039180A1 (en) 2001-05-31
EA200200587A1 (ru) 2002-10-31
AP2002002524A0 (en) 2002-06-30
EP1252620A1 (en) 2002-10-30
NO20022406L (no) 2002-07-12
NO20022406D0 (no) 2002-05-21
JP2003515768A (ja) 2003-05-07
CA2392436A1 (en) 2001-05-31
CN1391690A (zh) 2003-01-15
EA004079B1 (ru) 2003-12-25

Similar Documents

Publication Publication Date Title
US20020072900A1 (en) System and method of templating specific human voices
US10381016B2 (en) Methods and apparatus for altering audio output signals
JP6876752B2 (ja) 応答方法及び装置
Gold et al. Speech and audio signal processing: processing and perception of speech and music
Rachman et al. DAVID: An open-source platform for real-time transformation of infra-segmental emotional cues in running speech
CN102682769B (zh) 对数字网络进行基于自然语言的控制
US8364488B2 (en) Voice models for document narration
US20050108011A1 (en) System and method of templating specific human voices
JP2023501074A (ja) ユーザ用の音声モデルを生成すること
CN107516511A (zh) 意图识别和情绪的文本到语音学习系统
CN109272984A (zh) 用于语音交互的方法和装置
US20100318362A1 (en) Systems and Methods for Multiple Voice Document Narration
WO2022184055A1 (zh) 文章的语音播放方法、装置、设备、存储介质及程序产品
CN106847258A (zh) 用于共享调适语音简档的方法和设备
CN112164379A (zh) 音频文件生成方法、装置、设备及计算机可读存储介质
AU2048001A (en) System and method of templating specific human voices
Kato et al. Modeling of Rakugo speech and its limitations: Toward speech synthesis that entertains audiences
Ramati Algorithmic Ventriloquism: The Contested State of Voice in AI Speech Generators
CN112885326A (zh) 个性化语音合成模型创建、语音合成和测试方法及装置
JP2024533345A (ja) バーチャルコンサートの処理方法、処理装置、電子機器およびコンピュータプログラム
WO2004008295A2 (en) System and method for voice characteristic medical analysis
CN114283781A (zh) 语音合成方法及相关装置、电子设备和存储介质
Lee et al. The Sound of Hallucinations: Toward a more convincing emulation of internalized voices
CN115132204B (zh) 一种语音处理方法、设备、存储介质及计算机程序产品
Own et al. The Individual Perception in Synthetic Speech