KR100363027B1 - 음성 합성 또는 음색 변환을 이용한 노래 합성 방법 - Google Patents
음성 합성 또는 음색 변환을 이용한 노래 합성 방법 Download PDFInfo
- Publication number
- KR100363027B1 KR100363027B1 KR1020000039942A KR20000039942A KR100363027B1 KR 100363027 B1 KR100363027 B1 KR 100363027B1 KR 1020000039942 A KR1020000039942 A KR 1020000039942A KR 20000039942 A KR20000039942 A KR 20000039942A KR 100363027 B1 KR100363027 B1 KR 100363027B1
- Authority
- KR
- South Korea
- Prior art keywords
- song
- tone
- voice
- specific person
- person
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 29
- 238000006243 chemical reaction Methods 0.000 title claims abstract description 16
- 238000001308 synthesis method Methods 0.000 claims abstract description 21
- 238000013507 mapping Methods 0.000 claims abstract description 12
- 230000009466 transformation Effects 0.000 claims abstract description 3
- 238000002372 labelling Methods 0.000 claims description 4
- 239000000284 extract Substances 0.000 abstract description 6
- 230000002194 synthesizing effect Effects 0.000 abstract description 6
- 230000015572 biosynthetic process Effects 0.000 description 8
- 238000003786 synthesis reaction Methods 0.000 description 8
- 238000010586 diagram Methods 0.000 description 4
- 239000011295 pitch Substances 0.000 description 4
- 230000033764 rhythmic process Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000000694 effects Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/027—Concept to speech synthesisers; Generation of natural phrases from machine-based concepts
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B3/00—Recording by mechanical cutting, deforming or pressing, e.g. of grooves or pits; Reproducing by mechanical sensing; Record carriers therefor
- G11B3/68—Record carriers
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (4)
- 삭제
- 녹음된 특정인의 노래를 준비하고;상기 특정인의 음색과 비슷한 음색을 가진 제3자의 노래를 녹음하고;녹음된 상기 특정인의 노래와 상기 제3자의 노래에서 각자의 음색을 분석하여 음색 변형에 필요한 맵핑 펑션(mapping function)을 추출하고;상기 제3자에게 새로 작사 및 작곡된 노래를 부르게 하여 녹음하고; 그리고상기 맵핑 펑션을 이용하여 녹음된 새로운 노래에 담긴 제3자의 음색을 특정인의 음색으로 변환시키는;단계로 이루어지는 것을 특징으로 하는 음색 변환 방식을 이용한 노래 합성 방법.
- 특정인의 음성이 담긴 녹음 데이터베이스에서 추출된 음성 트랙의 음성 데이터를 음소 또는 음절 등의 작은 단위로 분할한 후 피치(pitch) 또는 온셋(onset) 정보 등을 분석하고 라벨링하여 데이터베이스를 구성하고;제3자의 음성을 녹음하고;상기 분석 및 라벨링된 정보로 구성된 데이터베이스를 이루는 특정인의 음색과 상기 제3자의 녹음된 음색을 비교하여 맵핑 펑션을 추출하고; 그리고상기 맵핑 펑션을 이용하여 상기 제3자의 음색을 상기 특정인의 음색과 동일하도록 변환시켜 상기 특정인의 음성으로 변조시키는;단계로 이루어지는 것을 특징으로 하는 음성 변조 방법.
- 제2항에 따른 방법으로 합성된 노래가 수록된 음반.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020000039942A KR100363027B1 (ko) | 2000-07-12 | 2000-07-12 | 음성 합성 또는 음색 변환을 이용한 노래 합성 방법 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020000039942A KR100363027B1 (ko) | 2000-07-12 | 2000-07-12 | 음성 합성 또는 음색 변환을 이용한 노래 합성 방법 |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20000063438A KR20000063438A (ko) | 2000-11-06 |
KR100363027B1 true KR100363027B1 (ko) | 2002-12-05 |
Family
ID=19677629
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020000039942A KR100363027B1 (ko) | 2000-07-12 | 2000-07-12 | 음성 합성 또는 음색 변환을 이용한 노래 합성 방법 |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR100363027B1 (ko) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9607594B2 (en) | 2013-12-20 | 2017-03-28 | Samsung Electronics Co., Ltd. | Multimedia apparatus, music composing method thereof, and song correcting method thereof |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111540341B (zh) * | 2019-01-22 | 2024-08-06 | 北京搜狗科技发展有限公司 | 一种数据处理方法、装置和用于数据处理的装置 |
KR102168529B1 (ko) * | 2020-05-29 | 2020-10-22 | 주식회사 수퍼톤 | 인공신경망을 이용한 가창음성 합성 방법 및 장치 |
CN113823281B (zh) * | 2020-11-24 | 2024-04-05 | 北京沃东天骏信息技术有限公司 | 语音信号处理方法、装置、介质及电子设备 |
CN113035169B (zh) * | 2021-03-12 | 2021-12-07 | 北京帝派智能科技有限公司 | 一种可在线训练个性化音色库的语音合成方法和系统 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR19980702608A (ko) * | 1995-03-07 | 1998-08-05 | 에버쉐드마이클 | 음성 합성기 |
JPH113096A (ja) * | 1997-06-12 | 1999-01-06 | Baazu Joho Kagaku Kenkyusho:Kk | 音声合成方法及び音声合成システム |
JPH11338480A (ja) * | 1998-05-22 | 1999-12-10 | Yamaha Corp | カラオケ装置 |
KR20010035173A (ko) * | 2001-01-10 | 2001-05-07 | 백종관 | 음성 합성 훈련 툴킷을 이용한 개인용 음성 합성기 및 그방법 |
-
2000
- 2000-07-12 KR KR1020000039942A patent/KR100363027B1/ko active IP Right Grant
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR19980702608A (ko) * | 1995-03-07 | 1998-08-05 | 에버쉐드마이클 | 음성 합성기 |
JPH113096A (ja) * | 1997-06-12 | 1999-01-06 | Baazu Joho Kagaku Kenkyusho:Kk | 音声合成方法及び音声合成システム |
JPH11338480A (ja) * | 1998-05-22 | 1999-12-10 | Yamaha Corp | カラオケ装置 |
KR20010035173A (ko) * | 2001-01-10 | 2001-05-07 | 백종관 | 음성 합성 훈련 툴킷을 이용한 개인용 음성 합성기 및 그방법 |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9607594B2 (en) | 2013-12-20 | 2017-03-28 | Samsung Electronics Co., Ltd. | Multimedia apparatus, music composing method thereof, and song correcting method thereof |
Also Published As
Publication number | Publication date |
---|---|
KR20000063438A (ko) | 2000-11-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7601904B2 (en) | Interactive tool and appertaining method for creating a graphical music display | |
KR100582154B1 (ko) | 시퀀스 데이터의 데이터 교환 포맷, 음성 재생 장치 및서버 장치 | |
Tellman et al. | Timbre morphing of sounds with unequal numbers of features | |
Sallis et al. | Live-Electronic Music | |
CN113836344A (zh) | 个性化歌曲文件生成方法和装置、音乐演唱设备 | |
Morrison | Encoding Post-Spectral Sound: Kaija Saariaho’s Early Electronic Works at IRCAM, 1982–87 | |
KR100363027B1 (ko) | 음성 합성 또는 음색 변환을 이용한 노래 합성 방법 | |
Kroher et al. | Computational ethnomusicology: a study of flamenco and Arab-Andalusian vocal music | |
Mazzola et al. | Basic Music Technology | |
von Coler et al. | CMMSD: A data set for note-level segmentation of monophonic music | |
Modegi et al. | Proposals of MIDI coding and its application for audio authoring | |
CN115331648A (zh) | 音频数据处理方法、装置、设备、存储介质及产品 | |
JP4268322B2 (ja) | 再生用符号化データ作成方法 | |
Fremerey | SyncPlayer–a Framework for Content-Based Music Navigation | |
O’Callaghan | Mediated Mimesis: Transcription as Processing | |
Faris | " That Chicago Sound": Playing with (Local) Identity in Underground Rock | |
Modegi | Very low bit-rate audio coding technique using MIDI representation | |
Müller et al. | Freischutz digital: A multimodal scenario for informed music processing | |
Modegi | MIDI encoding method based on variable frame-length analysis and its evaluation of coding precision | |
Morrison | On the Horizon of Digital Technics in Kaija Saariaho’s «IO» and «Nymphéa» | |
Freed | Harmonic Data: AI Music, EDM Transcription, & Minimalist Jazz | |
Blakeley | Genre and Influence: Tracing the Lineage of Timbre and Form in Steven Wilson's Progressive Rock | |
Ludovico | IEEE 1599 and Sound Synthesis | |
de Ulhôa | Triple-time Modinha: Between Dances and Serenades | |
Prätzlich | Freischütz Digital: Processing Audio Signals in Complex Music Scenarios |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A201 | Request for examination | ||
E902 | Notification of reason for refusal | ||
E701 | Decision to grant or registration of patent right | ||
GRNT | Written decision to grant | ||
FPAY | Annual fee payment |
Payment date: 20121029 Year of fee payment: 11 |
|
FPAY | Annual fee payment |
Payment date: 20131031 Year of fee payment: 12 |
|
FPAY | Annual fee payment |
Payment date: 20141030 Year of fee payment: 13 |
|
FPAY | Annual fee payment |
Payment date: 20151102 Year of fee payment: 14 |
|
FPAY | Annual fee payment |
Payment date: 20161019 Year of fee payment: 15 |
|
FPAY | Annual fee payment |
Payment date: 20171023 Year of fee payment: 16 |