CA2392436A1 - System and method of templating specific human voices

Publication number
CA2392436A1
Authority
CA
Canada
Prior art keywords
voice
data
captured
template
specific
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
CA002392436A
Other languages
English (en)
French (fr)
Inventor
Steven J. Keough
Katherine Axia Keough
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1999-11-23
Filing date
2000-11-23
Publication date
2001-05-31
Application filed by Individual
Publication of CA2392436A1
Abandoned (current legal status)

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/02 Methods for producing synthetic speech; Speech synthesisers
    • G10L13/033 Voice editing, e.g. manipulating the voice of the synthesiser
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/54 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for retrieval
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003 Changing voice quality, e.g. pitch or formants
    • G10L21/007 Changing voice quality, e.g. pitch or formants characterised by the process used
    • G10L21/013 Adapting to target pitch
    • G10L2021/0135 Voice conversion or morphing

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Machine Translation (AREA)
  • User Interface Of Digital Computer (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
CA002392436A 1999-11-23 2000-11-23 System and method of templating specific human voices Abandoned CA2392436A1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US16716899P 1999-11-23 1999-11-23
US60/167,168 1999-11-23
PCT/US2000/032328 WO2001039180A1 (en) 1999-11-23 2000-11-23 System and method of templating specific human voices

Publications (1)

Publication Number Publication Date
CA2392436A1 (en) 2001-05-31

Family

ID=22606225

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002392436A Abandoned CA2392436A1 (en) 1999-11-23 2000-11-23 System and method of templating specific human voices

Country Status (13)

Country Link
EP (1) EP1252620A1 (ru)
JP (1) JP2003515768A (ru)
KR (1) KR20020060975A (ru)
CN (1) CN1391690A (ru)
AP (1) AP2002002524A0 (ru)
AU (1) AU2048001A (ru)
BR (1) BR0015773A (ru)
CA (1) CA2392436A1 (ru)
EA (1) EA004079B1 (ru)
IL (1) IL149813A0 (ru)
NO (1) NO20022406L (ru)
WO (1) WO2001039180A1 (ru)
ZA (1) ZA200204036B (ru)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7697827B2 (en) 2005-10-17 2010-04-13 Konicek Jeffrey C User-friendlier interfaces for a camera
WO2008149547A1 (ja) * 2007-06-06 2008-12-11 Panasonic Corporation Voice quality editing device and voice quality editing method
US9240182B2 (en) * 2013-09-17 2016-01-19 Qualcomm Incorporated Method and apparatus for adjusting detection threshold for activating voice assistant function
US9552810B2 (en) 2015-03-31 2017-01-24 International Business Machines Corporation Customizable and individualized speech recognition settings interface for users with language accents
RU2617918C2 (ru) * 2015-06-19 2017-04-28 Иосиф Исаакович Лившиц Method for forming an image of a person taking into account characteristics of their psychological portrait obtained under polygraph control
KR101963195B1 (ko) * 2017-06-21 2019-03-28 구동하 Method for determining a menstrual cycle using a user's voice and server executing the same
US11099540B2 (en) 2017-09-15 2021-08-24 Kohler Co. User identity in household appliances
US11093554B2 (en) 2017-09-15 2021-08-17 Kohler Co. Feedback for water consuming appliance
US11314215B2 (en) 2017-09-15 2022-04-26 Kohler Co. Apparatus controlling bathroom appliance lighting based on user identity
US10448762B2 (en) 2017-09-15 2019-10-22 Kohler Co. Mirror
US10887125B2 (en) 2017-09-15 2021-01-05 Kohler Co. Bathroom speaker
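CN109298642B (zh) * 2018-09-20 2021-08-27 三星电子(中国)研发中心 Method and device for monitoring using a smart speaker
KR102466736B1 (ko) * 2021-06-18 2022-11-14 주식회사 한글과컴퓨터 Voice-based user authentication server that performs identity authentication based on voice input by a user, and operating method thereof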

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5007081A (en) * 1989-01-05 1991-04-09 Origin Technology, Inc. Speech activated telephone
US5594789A (en) * 1994-10-13 1997-01-14 Bell Atlantic Network Services, Inc. Transaction implementation in video dial tone network
US5717828A (en) * 1995-03-15 1998-02-10 Syracuse Language Systems Speech recognition apparatus and method for learning
US5774841A (en) * 1995-09-20 1998-06-30 The United States Of America As Represented By The Adminstrator Of The National Aeronautics And Space Administration Real-time reconfigurable adaptive speech recognition command and control apparatus and method

Also Published As

Publication number Publication date
ZA200204036B (en) 2003-08-21
NO20022406L (no) 2002-07-12
WO2001039180A1 (en) 2001-05-31
EA200200587A1 (ru) 2002-10-31
AP2002002524A0 (en) 2002-06-30
EP1252620A1 (en) 2002-10-30
NO20022406D0 (no) 2002-05-21
IL149813A0 (en) 2002-11-10
KR20020060975A (ko) 2002-07-19
CN1391690A (zh) 2003-01-15
BR0015773A (pt) 2002-08-06
EA004079B1 (ru) 2003-12-25
JP2003515768A (ja) 2003-05-07
AU2048001A (en) 2001-06-04

Similar Documents

Publication Publication Date Title
US20020072900A1 (en) System and method of templating specific human voices
US20240054118A1 (en) Artificial intelligence platform with improved conversational ability and personality development
JP6876752B2 (ja) Response method and device
Gold et al. Speech and audio signal processing: processing and perception of speech and music
US10088976B2 (en) Systems and methods for multiple voice document narration
US8364488B2 (en) Voice models for document narration
Rachman et al. DAVID: An open-source platform for real-time transformation of infra-segmental emotional cues in running speech
US20050108011A1 (en) System and method of templating specific human voices
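CN107516511A (zh) Text-to-speech learning system for intent recognition and emotion
JP2023501074A (ja) Generating a voice model for a user
CN111667812A (zh) Speech synthesis method, apparatus, device, and storage medium
CN113010138B (zh) Method, apparatus, and device for voice playback of articles, and computer-readable storage medium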
US20110219940A1 (en) System and method for generating custom songs
CA2392436A1 (en) System and method of templating specific human voices
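CN112164379A (zh) Audio file generation method, apparatus, device, and computer-readable storage medium
JPH09171396A (ja) Voice generation system
CN114048299A (zh) Dialogue method, apparatus, device, computer-readable storage medium, and program product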
Wu et al. Exemplar-based emotive speech synthesis
CN112885326A (zh) Method and apparatus for personalized speech synthesis model creation, speech synthesis, and testing
US20220383850A1 (en) System and method for posthumous dynamic speech synthesis using neural networks and deep learning
WO2004008295A2 (en) System and method for voice characteristic medical analysis
Ramati Algorithmic Ventriloquism: The Contested State of Voice in AI Speech Generators
Lee et al. The Sound of Hallucinations: Toward a more convincing emulation of internalized voices
CN115132204B (zh) Speech processing method, device, storage medium, and computer program product
Midtlyng et al. Voice adaptation by color-encoded frame matching as a multi-objective optimization problem for future games

Legal Events

Date Code Title Description
FZDE Discontinued