WO2023211386A3 - 一种音乐生成方法、装置、系统及存储介质 - Google Patents

一种音乐生成方法、装置、系统及存储介质 Download PDF

Info

Publication number
WO2023211386A3
WO2023211386A3 PCT/SG2023/050290 SG2023050290W WO2023211386A3 WO 2023211386 A3 WO2023211386 A3 WO 2023211386A3 SG 2023050290 W SG2023050290 W SG 2023050290W WO 2023211386 A3 WO2023211386 A3 WO 2023211386A3
Authority
WO
WIPO (PCT)
Prior art keywords
music
music generation
user
storage medium
generation method
Prior art date
Application number
PCT/SG2023/050290
Other languages
English (en)
French (fr)
Other versions
WO2023211386A2 (zh
Inventor
薛愉凡
郑强
牛栋
徐良钦
王晓婵
陈纪同
李博琛
李乃寒
Original Assignee
脸萌有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 脸萌有限公司 filed Critical 脸萌有限公司
Publication of WO2023211386A2 publication Critical patent/WO2023211386A2/zh
Publication of WO2023211386A3 publication Critical patent/WO2023211386A3/zh

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H1/00Details of electrophonic musical instruments
    • G10H1/0008Associated control or indicating means
    • G10H1/0025Automatic or semi-automatic music composition, e.g. producing random music, applying rules from music theory or modifying a musical piece
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/685Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using automatically derived transcript of audio data, e.g. lyrics
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/686Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, title or artist information, time, location or usage information, user ratings
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/167Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H1/00Details of electrophonic musical instruments
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/08Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2210/00Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
    • G10H2210/101Music Composition or musical creation; Tools or processes therefor

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Library & Information Science (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Reverberation, Karaoke And Other Acoustics (AREA)

Abstract

摘要本公开实施例涉及一种音乐生成方法、装置、系统及存储介质。本公开的至少一些实施例中,通过响应用户触发音乐生成控件的操作,展示包括文本输入框、音乐生成控件和音乐配置项的音乐生成界面,以便用户在文本输入框中输入自定义文本和通过音乐配置项配置音乐旋律,进而响应用户触发音乐生成控件的操作,可以基于用户输入的自定义文本生成语音,并基于生成的语音和用户配置的音乐旋律,生成包括自定义文本对应语音的音乐。
PCT/SG2023/050290 2022-04-29 2023-04-27 一种音乐生成方法、装置、系统及存储介质 WO2023211386A2 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210475367.3 2022-04-29
CN202210475367.3A CN117012170A (zh) 2022-04-29 2022-04-29 一种音乐生成方法、装置、系统及存储介质

Publications (2)

Publication Number Publication Date
WO2023211386A2 WO2023211386A2 (zh) 2023-11-02
WO2023211386A3 true WO2023211386A3 (zh) 2023-12-21

Family

ID=88519962

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/SG2023/050290 WO2023211386A2 (zh) 2022-04-29 2023-04-27 一种音乐生成方法、装置、系统及存储介质

Country Status (2)

Country Link
CN (1) CN117012170A (zh)
WO (1) WO2023211386A2 (zh)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20030036265A (ko) * 2003-01-25 2003-05-09 김성태 자동음악생성프로그램
CN1435816A (zh) * 2002-01-09 2003-08-13 雅马哈株式会社 声音旋律乐曲生成装置及使用该装置的便携终端装置
JP2011133882A (ja) * 2009-11-27 2011-07-07 Media Flats Co Ltd 音声付映像合成システム及び音声付映像合成方法
JP2016157086A (ja) * 2015-02-26 2016-09-01 パイオニア株式会社 歌詞音声出力装置、歌詞音声出力方法、及び、プログラム
CN110189741A (zh) * 2018-07-05 2019-08-30 腾讯数码(天津)有限公司 音频合成方法、装置、存储介质和计算机设备

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1435816A (zh) * 2002-01-09 2003-08-13 雅马哈株式会社 声音旋律乐曲生成装置及使用该装置的便携终端装置
KR20030036265A (ko) * 2003-01-25 2003-05-09 김성태 자동음악생성프로그램
JP2011133882A (ja) * 2009-11-27 2011-07-07 Media Flats Co Ltd 音声付映像合成システム及び音声付映像合成方法
JP2016157086A (ja) * 2015-02-26 2016-09-01 パイオニア株式会社 歌詞音声出力装置、歌詞音声出力方法、及び、プログラム
CN110189741A (zh) * 2018-07-05 2019-08-30 腾讯数码(天津)有限公司 音频合成方法、装置、存储介质和计算机设备

Also Published As

Publication number Publication date
WO2023211386A2 (zh) 2023-11-02
CN117012170A (zh) 2023-11-07

Similar Documents

Publication Publication Date Title
US20190340895A1 (en) Methods and apparatus for outputting a haptic signal to a haptic transducer
WO2018183650A3 (en) End-to-end text-to-speech conversion
EP4383218A3 (en) Electronic device and method for providing conversational service
WO2004095422A3 (en) Operator performed voicemall transcription
WO2004100638A3 (en) Source-dependent text-to-speech system
EP3852389A3 (en) Systems and methods for providing nature sounds
JP2019101094A5 (ja) 音声合成方法、音声合成システムおよびプログラム
JP2020095719A5 (zh)
JPWO2019026361A1 (ja) 情報処理装置、情報処理方法、およびプログラム
CN110018809A (zh) 一种电子设备和控制方法
WO2022072936A3 (en) Text-to-speech using duration prediction
JP2020076844A5 (ja) 音響処理方法、音響処理システムおよびプログラム
WO2023211386A3 (zh) 一种音乐生成方法、装置、系统及存储介质
MX2023006192A (es) Sistema electronico de suministro de aerosol.
JP2019168542A (ja) 情報処理方法および情報処理装置
TW200620240A (en) System and method for transforming text to speech
CA3236473A1 (en) System for dynamically generating recommendations to purchase sustainable items
CN104468949A (zh) 信息处理方法及电子设备
KR20050080671A (ko) 티티에스 시스템의 이모티콘 처리 방법
GB2616765A (en) AR (augmented reality) based selective sound inclusion from the surrounding while executing any voice command
JP2010032599A (ja) 音声処理装置およびプログラム
Khan et al. Reader: Speech Synthesizer and Speech Recognizer
Rigas Empirically derived multimedia design guidelines for browsing large volumes of e-mail data
ATE512436T1 (de) Sprachgesteurtes aufforderungs-system und - verfahren
JP2008009693A (ja) 聞き起こしシステム、そのサーバ及びサーバ用プログラム

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23796955

Country of ref document: EP

Kind code of ref document: A2