WO2023211386A3 - 一种音乐生成方法、装置、系统及存储介质 - Google Patents
一种音乐生成方法、装置、系统及存储介质 Download PDFInfo
- Publication number
- WO2023211386A3 WO2023211386A3 PCT/SG2023/050290 SG2023050290W WO2023211386A3 WO 2023211386 A3 WO2023211386 A3 WO 2023211386A3 SG 2023050290 W SG2023050290 W SG 2023050290W WO 2023211386 A3 WO2023211386 A3 WO 2023211386A3
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- music
- music generation
- user
- storage medium
- generation method
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H1/00—Details of electrophonic musical instruments
- G10H1/0008—Associated control or indicating means
- G10H1/0025—Automatic or semi-automatic music composition, e.g. producing random music, applying rules from music theory or modifying a musical piece
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/68—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/68—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/683—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/68—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/683—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
- G06F16/685—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using automatically derived transcript of audio data, e.g. lyrics
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/68—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/686—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, title or artist information, time, location or usage information, user ratings
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H1/00—Details of electrophonic musical instruments
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2210/00—Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
- G10H2210/101—Music Composition or musical creation; Tools or processes therefor
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Library & Information Science (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- General Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Reverberation, Karaoke And Other Acoustics (AREA)
Abstract
摘要本公开实施例涉及一种音乐生成方法、装置、系统及存储介质。本公开的至少一些实施例中,通过响应用户触发音乐生成控件的操作,展示包括文本输入框、音乐生成控件和音乐配置项的音乐生成界面,以便用户在文本输入框中输入自定义文本和通过音乐配置项配置音乐旋律,进而响应用户触发音乐生成控件的操作,可以基于用户输入的自定义文本生成语音,并基于生成的语音和用户配置的音乐旋律,生成包括自定义文本对应语音的音乐。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210475367.3 | 2022-04-29 | ||
CN202210475367.3A CN117012170A (zh) | 2022-04-29 | 2022-04-29 | 一种音乐生成方法、装置、系统及存储介质 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2023211386A2 WO2023211386A2 (zh) | 2023-11-02 |
WO2023211386A3 true WO2023211386A3 (zh) | 2023-12-21 |
Family
ID=88519962
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/SG2023/050290 WO2023211386A2 (zh) | 2022-04-29 | 2023-04-27 | 一种音乐生成方法、装置、系统及存储介质 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN117012170A (zh) |
WO (1) | WO2023211386A2 (zh) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20030036265A (ko) * | 2003-01-25 | 2003-05-09 | 김성태 | 자동음악생성프로그램 |
CN1435816A (zh) * | 2002-01-09 | 2003-08-13 | 雅马哈株式会社 | 声音旋律乐曲生成装置及使用该装置的便携终端装置 |
JP2011133882A (ja) * | 2009-11-27 | 2011-07-07 | Media Flats Co Ltd | 音声付映像合成システム及び音声付映像合成方法 |
JP2016157086A (ja) * | 2015-02-26 | 2016-09-01 | パイオニア株式会社 | 歌詞音声出力装置、歌詞音声出力方法、及び、プログラム |
CN110189741A (zh) * | 2018-07-05 | 2019-08-30 | 腾讯数码(天津)有限公司 | 音频合成方法、装置、存储介质和计算机设备 |
-
2022
- 2022-04-29 CN CN202210475367.3A patent/CN117012170A/zh active Pending
-
2023
- 2023-04-27 WO PCT/SG2023/050290 patent/WO2023211386A2/zh unknown
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1435816A (zh) * | 2002-01-09 | 2003-08-13 | 雅马哈株式会社 | 声音旋律乐曲生成装置及使用该装置的便携终端装置 |
KR20030036265A (ko) * | 2003-01-25 | 2003-05-09 | 김성태 | 자동음악생성프로그램 |
JP2011133882A (ja) * | 2009-11-27 | 2011-07-07 | Media Flats Co Ltd | 音声付映像合成システム及び音声付映像合成方法 |
JP2016157086A (ja) * | 2015-02-26 | 2016-09-01 | パイオニア株式会社 | 歌詞音声出力装置、歌詞音声出力方法、及び、プログラム |
CN110189741A (zh) * | 2018-07-05 | 2019-08-30 | 腾讯数码(天津)有限公司 | 音频合成方法、装置、存储介质和计算机设备 |
Also Published As
Publication number | Publication date |
---|---|
WO2023211386A2 (zh) | 2023-11-02 |
CN117012170A (zh) | 2023-11-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20190340895A1 (en) | Methods and apparatus for outputting a haptic signal to a haptic transducer | |
WO2018183650A3 (en) | End-to-end text-to-speech conversion | |
EP4383218A3 (en) | Electronic device and method for providing conversational service | |
WO2004095422A3 (en) | Operator performed voicemall transcription | |
WO2004100638A3 (en) | Source-dependent text-to-speech system | |
EP3852389A3 (en) | Systems and methods for providing nature sounds | |
JP2019101094A5 (ja) | 音声合成方法、音声合成システムおよびプログラム | |
JP2020095719A5 (zh) | ||
JPWO2019026361A1 (ja) | 情報処理装置、情報処理方法、およびプログラム | |
CN110018809A (zh) | 一种电子设备和控制方法 | |
WO2022072936A3 (en) | Text-to-speech using duration prediction | |
JP2020076844A5 (ja) | 音響処理方法、音響処理システムおよびプログラム | |
WO2023211386A3 (zh) | 一种音乐生成方法、装置、系统及存储介质 | |
MX2023006192A (es) | Sistema electronico de suministro de aerosol. | |
JP2019168542A (ja) | 情報処理方法および情報処理装置 | |
TW200620240A (en) | System and method for transforming text to speech | |
CA3236473A1 (en) | System for dynamically generating recommendations to purchase sustainable items | |
CN104468949A (zh) | 信息处理方法及电子设备 | |
KR20050080671A (ko) | 티티에스 시스템의 이모티콘 처리 방법 | |
GB2616765A (en) | AR (augmented reality) based selective sound inclusion from the surrounding while executing any voice command | |
JP2010032599A (ja) | 音声処理装置およびプログラム | |
Khan et al. | Reader: Speech Synthesizer and Speech Recognizer | |
Rigas | Empirically derived multimedia design guidelines for browsing large volumes of e-mail data | |
ATE512436T1 (de) | Sprachgesteurtes aufforderungs-system und - verfahren | |
JP2008009693A (ja) | 聞き起こしシステム、そのサーバ及びサーバ用プログラム |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23796955 Country of ref document: EP Kind code of ref document: A2 |