SG10201912562SA - A training method, a readable storage medium and a voice cloning method for a voice cloning model - Google Patents

A training method, a readable storage medium and a voice cloning method for a voice cloning model

Info

Publication number
SG10201912562SA
SG10201912562SA SG10201912562SA SG10201912562SA SG10201912562SA SG 10201912562S A SG10201912562S A SG 10201912562SA SG 10201912562S A SG10201912562S A SG 10201912562SA SG 10201912562S A SG10201912562S A SG 10201912562SA SG 10201912562S A SG10201912562S A SG 10201912562SA
Authority
SG
Singapore
Prior art keywords
voice cloning
storage medium
readable storage
voice
model
Prior art date
Application number
SG10201912562SA
Inventor
Zining Zhang
Xiaoyan Yang
Zhenjie Zhang
Original Assignee
Yitu Pte Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yitu Pte Ltd filed Critical Yitu Pte Ltd
Priority to SG10201912562SA priority Critical patent/SG10201912562SA/en
Priority to CN202010476440.XA priority patent/CN111696521B/en
Publication of SG10201912562SA publication Critical patent/SG10201912562SA/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/06Elementary speech units used in speech synthesisers; Concatenation rules
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04Details of speech synthesis systems, e.g. synthesiser structure or memory management
    • G10L13/047Architecture of speech synthesisers
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • G10L25/30Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00Road transport of goods or passengers
    • Y02T10/10Internal combustion engine [ICE] based vehicles
    • Y02T10/40Engine management systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Signal Processing (AREA)
  • Machine Translation (AREA)
SG10201912562SA 2019-12-18 2019-12-18 A training method, a readable storage medium and a voice cloning method for a voice cloning model SG10201912562SA (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
SG10201912562SA SG10201912562SA (en) 2019-12-18 2019-12-18 A training method, a readable storage medium and a voice cloning method for a voice cloning model
CN202010476440.XA CN111696521B (en) 2019-12-18 2020-05-29 Training method of voice cloning model, readable storage medium and voice cloning method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
SG10201912562SA SG10201912562SA (en) 2019-12-18 2019-12-18 A training method, a readable storage medium and a voice cloning method for a voice cloning model

Publications (1)

Publication Number Publication Date
SG10201912562SA true SG10201912562SA (en) 2021-07-29

Family

ID=72478905

Family Applications (1)

Application Number Title Priority Date Filing Date
SG10201912562SA SG10201912562SA (en) 2019-12-18 2019-12-18 A training method, a readable storage medium and a voice cloning method for a voice cloning model

Country Status (2)

Country Link
CN (1) CN111696521B (en)
SG (1) SG10201912562SA (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112233646B (en) * 2020-10-20 2024-05-31 携程计算机技术(上海)有限公司 Voice cloning method, system, equipment and storage medium based on neural network
CN112185340B (en) * 2020-10-30 2024-03-15 网易(杭州)网络有限公司 Speech synthesis method, speech synthesis device, storage medium and electronic equipment
CN112652291B (en) * 2020-12-15 2024-04-05 携程旅游网络技术(上海)有限公司 Speech synthesis method, system, equipment and storage medium based on neural network
CN112992117B (en) * 2021-02-26 2023-05-26 平安科技(深圳)有限公司 Multi-language voice model generation method, device, computer equipment and storage medium
CN113488057B (en) * 2021-08-18 2023-11-14 山东新一代信息产业技术研究院有限公司 Conversation realization method and system for health care

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11238843B2 (en) * 2018-02-09 2022-02-01 Baidu Usa Llc Systems and methods for neural voice cloning with a few samples
CN108630190B (en) * 2018-05-18 2019-12-10 百度在线网络技术(北京)有限公司 Method and apparatus for generating speech synthesis model
CN110136687B (en) * 2019-05-20 2021-06-15 深圳市数字星河科技有限公司 Voice training based cloned accent and rhyme method
CN110288973B (en) * 2019-05-20 2024-03-29 平安科技(深圳)有限公司 Speech synthesis method, device, equipment and computer readable storage medium

Also Published As

Publication number Publication date
CN111696521A (en) 2020-09-22
CN111696521B (en) 2023-08-08

Similar Documents

Publication Publication Date Title
SG10201912562SA (en) A training method, a readable storage medium and a voice cloning method for a voice cloning model
EP3971772A4 (en) Model training method and apparatus, and terminal and storage medium
EP3709226A4 (en) Model training system and method, and storage medium
EP3862893A4 (en) Recommendation model training method, recommendation method, device, and computer-readable medium
EP3739572A4 (en) Text-to-speech synthesis method and apparatus using machine learning, and computer-readable storage medium
EP3937165A4 (en) Speech synthesis method and apparatus, and computer-readable storage medium
EP3805988A4 (en) Training method for model, storage medium and computer device
EP3792789A4 (en) Translation model training method, sentence translation method and apparatus, and storage medium
EP4068280A4 (en) Speech recognition error correction method, related devices, and readable storage medium
EP3933754A4 (en) Image fusion method, model training method, and related device
EP3690768A4 (en) User behavior prediction method and apparatus, and behavior prediction model training method and apparatus
SG11202105466QA (en) Method and device for generating neural network model, and computer-readable storage medium
ZA202206486B (en) Method and apparatus for detecting fault, method and apparatus for training model, and device and storage medium
EP3968243A4 (en) Method and apparatus for realizing model training, and computer storage medium
EP3989109A4 (en) Image identification method and device, identification model training method and device, and storage medium
EP4024261A4 (en) Model training method, apparatus, and system
EP3937073A4 (en) Method for video classification, method and device for model training, and storage medium
EP3989104A4 (en) Facial feature extraction model training method and apparatus, facial feature extraction method and apparatus, device, and storage medium
EP3992975A4 (en) Compound property analysis method and apparatus, compound property analysis model training method, and storage medium
EP3270239A4 (en) Device characteristic model learning device, device characteristic model learning method, and storage medium
EP3951702A4 (en) Method for training image processing model, image processing method, network device, and storage medium
SG11202104492QA (en) Model training methods, apparatuses, and systems
EP4044175A4 (en) Voice recognition method and apparatus, and computer-readale storage medium
EP3866068A4 (en) Image description model training method and device, and storage medium
EP4181026A4 (en) Recommendation model training method and apparatus, recommendation method and apparatus, and computer-readable medium