CN108269560A - A kind of speech synthesizing method and system - Google Patents

A kind of speech synthesizing method and system Download PDF

Info

Publication number
CN108269560A
CN108269560A CN201710004830.5A CN201710004830A CN108269560A CN 108269560 A CN108269560 A CN 108269560A CN 201710004830 A CN201710004830 A CN 201710004830A CN 108269560 A CN108269560 A CN 108269560A
Authority
CN
China
Prior art keywords
lyrics
file
song
audio file
voice
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710004830.5A
Other languages
Chinese (zh)
Inventor
石会超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Kuwo Technology Co Ltd
Original Assignee
Beijing Kuwo Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Kuwo Technology Co Ltd filed Critical Beijing Kuwo Technology Co Ltd
Priority to CN201710004830.5A priority Critical patent/CN108269560A/en
Publication of CN108269560A publication Critical patent/CN108269560A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H1/00Details of electrophonic musical instruments
    • G10H1/36Accompaniment arrangements
    • G10H1/361Recording/reproducing of accompaniment for use with an external source, e.g. karaoke systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2250/00Aspects of algorithms or signal processing methods without intrinsic musical character, yet specifically adapted for or used in electrophonic musical processing
    • G10H2250/471General musical sound synthesis principles, i.e. sound category-independent synthesis methods

Abstract

The present embodiments relate to a kind of speech synthesizing method and system, this method includes:The lyrics file of song is analyzed, obtains the segment informations of the lyrics in the song, wherein segment information includes the singer of every lyrics and for identifying the temporal information that the lyrics show the time;According to the analysis result of lyrics file, the voice track in audio file is handled, generates the accompaniment audio file of band part original singer;Store the voice file that user sings;Voice file and accompaniment audio file are synthesized into song files, and be uploaded to server.The vocal sections of any one singer in choral song can be erased, retain the original singer part of other singers in accompaniment, user only needs to sing the corresponding lyrics of the vocal sections erased and can be realized and star's chorus, realize the diversity of user's performance, new K song playing methods are provided to the user, the sense of accomplishment of user can be improved, and then improves user's retention ratio.

Description

A kind of speech synthesizing method and system
Technical field
The present invention relates to mobile terminal K song technical fields more particularly to a kind of speech synthesizing method and systems.
Background technology
As the improvement of people's living standards, people’s lives are just gradually moved towards in K song applications.And with electronic product not Disconnected development, people are constantly inclined to carries out K songs using mobile terminal.
Mobile terminal K sings the combination that system is music player and recording software at present, can both play original singer, can also The song of user is recorded, and the song of recording mix audio mixing to music file with accompaniment, user can also be by above-mentioned sound Music file is uploaded to network, and more people is allowed to hear the song of oneself.But more and more people are unsatisfactory for current independent K song moulds Formula, it is desirable to realize in a song and chorus with other people.
For chorusing with other users and star's chorus can more excite the chorus desire of user, the present invention is by certain Algorithm, the member-retaining portion original singer in accompaniment, user, which only needs to sing, oneself needs the lyrics por-tion sung to can be realized and star Chorus has provided new K song playing methods to the user.
Invention content
The present invention allows user's perception of accomplishment, improves user's retention ratio, provide preferably to realize and the function of star's chorus A kind of speech synthesizing method and system.
On the one hand, a kind of speech synthesizing method is provided, including:The lyrics file of song is analyzed, is obtained in the song The segment information of the lyrics, wherein the segment information include every lyrics singer and for identify the lyrics show the time when Between information;
According to the analysis result of lyrics file, the voice track in the audio file is handled, generates band part The accompaniment audio file of original singer;
The voice file and the accompaniment audio file that user is sung synthesize song files, and be uploaded to server.
Preferably, the lyrics file of the analysis song obtains the segment information of the lyrics in the song, including:According to " man ", " female " and " conjunction " key word analysis marked in the lyrics of the lyrics file goes out the segmentation letter of the lyrics in the song Breath.
Preferably, the analysis result according to lyrics file handles the voice track in the audio file, The accompaniment audio file of band part original singer is generated, including:Voice track in the audio file is handled, according to song Temporal information in word file analysis result, original singer's voice of period is corresponded in voice track of erasing, and generation is former with part The accompaniment audio file sung.
Preferably, the song is two people's choral songs.
On the other hand, a kind of sound synthetic system is provided, including:
Analysis module for analyzing the lyrics file of song, obtains the segment information of the lyrics in the song, wherein described Segment information includes singer and the temporal information of every lyrics;
Processing module according to the analysis result of lyrics file, is handled the voice track in the audio file, raw Into the accompaniment audio file of band part original singer;
Synthesis module, voice file and the accompaniment audio file for user to be sung synthesize song files;
Sending module, for the song files to be uploaded to server.
Preferably, the analysis module is specifically used for, according to marked in the lyrics of the lyrics file " man ", " female " with And " conjunction " key word analysis goes out the segment information of the lyrics in the song.
Preferably, the processing module is specifically used for, and the voice track in the audio file is handled, according to song Temporal information in word file analysis result, original singer's voice of period is corresponded in voice track of erasing, and generation is former with part The accompaniment audio file sung.
Preferably, the song is two people's choral songs.
A kind of speech synthesizing method provided in an embodiment of the present invention by analyzing the lyrics, is then tied according to analysis The temporal information of the lyrics handles the voice track in audio file in fruit, erases the original that the period is corresponded in voice track Voice is sung, obtains the accompaniment audio file of band part original singer;User sings the lyrics por-tion for being erased voice according to lyrics prompting, The voice file of user is generated, then synthesizes song files with the accompaniment audio file with part original singer, that is, is obtained complete Works.This method can erase the original singer vocal sections of any one singer in choral song, provide to the user new K song playing methods, realize the function that is synthesized with star well in K sings software, the sense of accomplishment of user, Jin Erti can be improved For the retention ratio of user.
Description of the drawings
To describe the technical solutions in the embodiments of the present invention more clearly, make required in being described below to embodiment Attached drawing is briefly described.It should be evident that the accompanying drawings in the following description is only some embodiments of the present invention.
Fig. 1 is a kind of flow chart of speech synthesizing method provided in an embodiment of the present invention;
Fig. 2 is a kind of structure chart of sound synthetic system provided in an embodiment of the present invention.
Specific embodiment
Purpose, technical scheme and advantage to make the embodiment of the present invention are clearer, below in conjunction with the embodiment of the present invention In drawings and examples, the technical solution in the embodiment of the present invention is explicitly described.
Fig. 1 is a kind of flow chart of the method for sound rendering provided in an embodiment of the present invention, and this method can sing software by K Mobile client perform, as shown in Figure 1, this method includes:
Step 110, the lyrics file of song is analyzed, obtains the segment information of the lyrics in the song.
Specifically, original singer's audio file of song and corresponding lyrics file are obtained, wherein, the lyrics file packet Include keywords such as " men " that is marked in song lyrics, the lyrics, " female " and " conjunction " and for representing that the lyrics show the time of time Information.
" man ", " female " and " conjunction " key word analysis according to being marked in the lyrics of the lyrics file goes out in the song The segment information of the lyrics, wherein, the segment information includes:The singer of every lyrics and show the time for identifying the lyrics Temporal information, i.e. every lyrics are that who sings and every lyrics sing required time.
It should be noted that song described herein, specifically two people's choral songs.
Step 120, according to the analysis result of lyrics file, the voice track in audio file is handled, generates band The accompaniment audio file of part original singer.
Specifically, the voice track in the audio file is handled, according to lyrics file analysis knot in step 110 Temporal information in fruit corresponds to original singer's voice of period, audio accompaniment of the generation with part original singer in voice track of erasing File.
According to the temporal information of the lyrics, it can erase any one in original singer's audio file according to user wish and sing The original singer vocal sections of person have provided new K songs playing method to the user, have realized the function of being synthesized with star well, can be with The sense of accomplishment of user is provided, and then improves user's retention ratio.
It is exemplified below:
User wants to give song recitals《The love of boat tracker》, it can be according to the analysis result of lyrics file, it, will to audio file processing All " man " sings the vocal sections that the lyrics were corresponded in the period and erases, and generation only carries the companion of the vocal sections of " female " sound original singer Play audio file;Whole " female " the performance lyrics can also be corresponded to the vocal sections in the period and erased by user, and generation only carries The accompaniment audio file of the vocal sections of " man " sound original singer.That is, according to the temporal information of the lyrics, can according to the wish of user, Erase the original singer vocal sections of any one singer in original singer's audio file, audio accompaniment text of the generation with part original singer Part.It in other words, that is, the temporal information according to the lyrics, can be by any one in choral song according to the wish of user The original singer vocal sections of singer erase, accompaniment audio file of the generation with remaining original singer vocal sections.
Step 130, the voice file and accompaniment audio file user sung synthesize song files.
Specifically, the accompaniment audio file with part original singer generated in user's download step 120, and carried according to the lyrics Show, sing the lyrics there is no original singer vocal sections, that is, it is proposed according to the lyrics, sings and erased the corresponding lyrics of vocal sections, Generate new voice file, then by the voice file of generation and download accompaniment audio file synthesize song files to get New complete works have been arrived, have completed the chorus with star.
Step 140, song files are uploaded to server.
Song files can be uploaded to server by user, for oneself and other users appreciation.
A kind of speech synthesizing method provided in an embodiment of the present invention by analyzing the lyrics, is then tied according to analysis The temporal information of the lyrics handles the voice track in audio file in fruit, erases the original that the period is corresponded in voice track Voice is sung, obtains the accompaniment audio file of band part original singer;User sings the lyrics por-tion for being erased voice according to lyrics prompting, The voice file of user is generated, is then synthesized with the accompaniment audio file with part original singer, song files is generated, that is, has obtained Whole works.This method can erase the original singer vocal sections of any one singer in choral song, provide to the user new K song playing methods, realize the function of synthesize with star well, improve the sense of accomplishment of user, and then the retention ratio of offer user.
Corresponding with above method embodiment, the embodiment of the present invention additionally provides a kind of sound synthetic system, specific such as Fig. 2 Shown, which includes:Analysis module 201, processing module 202, synthesis module 203 and sending module 204.
Analysis module 201 for analyzing the lyrics file of song, obtains the segment information of the lyrics in the song, wherein The segment information includes singer and the temporal information of every lyrics;
Processing module 202, according to the analysis result of lyrics file, at the voice track in the audio file Reason generates the accompaniment audio file of band part original singer;
Synthesis module 203, voice file and the accompaniment audio file for user to be sung synthesize song files;
Sending module 204, for the song files to be uploaded to server.
Specifically, analysis module 201 is used for, according to marked in the lyrics of the lyrics file " man ", " female " and " conjunction " key word analysis goes out the segment information of the lyrics in the song.Wherein, the song is choral song, and specifically two people close It sings bent.
Specifically, processing module 202 is used for, the voice track in the audio file is handled, according to lyrics text Temporal information in part analysis result, correspond to original singer's voice of period in voice track of erasing, and generation is with part original singer Accompaniment audio file.
Function in the above sound synthesis system provided in an embodiment of the present invention performed by each section is in above-mentioned reality It applies and is discussed in detail in a kind of speech synthesizing method of example offer, which is not described herein again.
A kind of sound synthetic system provided in an embodiment of the present invention by analyzing the lyrics, is then tied according to analysis The temporal information of the lyrics handles the voice track in audio file in fruit, erases the original that the period is corresponded in voice track Voice is sung, obtains the accompaniment audio file of band part original singer;User sings the lyrics por-tion for being erased voice according to lyrics prompting, The voice file of user is generated, then synthesizes song files with the accompaniment audio file with part original singer, that is, is obtained complete Works.The original singer vocal sections of any one singer in choral song can be erased using this system, are provided to the user New K song playing methods, in K sings software realize the function of being synthesized with star, user are allowed to have sense of accomplishment, provides use well The retention ratio at family.
Professional should further appreciate that, be described with reference to the embodiments described herein each exemplary Unit and algorithm steps can be realized with the combination of electronic hardware, computer software or the two, hard in order to clearly demonstrate The interchangeability of part and software generally describes each exemplary composition and step according to function in the above description. These functions are performed actually with hardware or software mode, specific application and design constraint depending on technical solution. Professional technician specifically can realize described function to each using distinct methods, but this realization is not It is considered as beyond the scope of this invention.
One of ordinary skill in the art will appreciate that all or part of the steps of the method in the foregoing embodiments are can be with By program come instruction processing unit complete, program can be stored in computer readable storage medium, storage medium is non-short Temporary property (non-transitory) medium, such as random access memory, read-only memory, flash memory, hard disk, solid-state are hard Disk, tape (magnetic tape), floppy disk (floppy disk), CD (optical disc) and its arbitrary combination.More than, It is merely preferred embodiments of the present invention, but protection scope of the present invention is not limited thereto.

Claims (8)

1. a kind of speech synthesizing method, which is characterized in that including:
The lyrics file of song is analyzed, obtains the segment information of the lyrics in the song, wherein the segment information includes every The singer of the lyrics and for identify the lyrics show the time temporal information;
According to the analysis result of lyrics file, the voice track in the audio file is handled, generates band part original singer Accompaniment audio file;
The voice file and the accompaniment audio file that user is sung synthesize song files, and be uploaded to server.
2. speech synthesizing method according to claim 1, which is characterized in that the lyrics file of the analysis song obtains The segment information of the lyrics in the song, including:
" man ", " female " and " conjunction " key word analysis according to being marked in the lyrics of the lyrics file goes out the lyrics in the song Segment information.
3. speech synthesizing method according to claim 1, which is characterized in that the analysis result according to lyrics file, Voice track in the audio file is handled, generates the accompaniment audio file of band part original singer, including:
Voice track in the audio file is handled, according to the temporal information in lyrics file analysis result, is erased Original singer's voice of period, accompaniment audio file of the generation with part original singer are corresponded in voice track.
4. speech synthesizing method according to claim 1, which is characterized in that the song is two people's choral songs.
5. a kind of sound synthetic system, which is characterized in that including:
Analysis module for analyzing the lyrics file of song, obtains the segment information of the lyrics in the song, wherein the segmentation Information includes singer and the temporal information of every lyrics;
Processing module according to the analysis result of lyrics file, is handled the voice track in the audio file, generates band The accompaniment audio file of part original singer;
Synthesis module, voice file and the accompaniment audio file for user to be sung synthesize song files;
Sending module, for the song files to be uploaded to server.
6. sound synthetic system according to claim 5, which is characterized in that the analysis module is specifically used for, according to institute State " man " that is marked in the lyrics of lyrics file, " female " and " conjunction " key word analysis go out the segmentation letters of the lyrics in the song Breath.
7. sound synthetic system according to claim 5, which is characterized in that the processing module is specifically used for, to described Voice track in audio file is handled, according to the temporal information in lyrics file analysis result, in voice track of erasing Original singer's voice of corresponding period, accompaniment audio file of the generation with part original singer.
8. sound synthetic system according to claim 5, which is characterized in that the song is two people's choral songs.
CN201710004830.5A 2017-01-04 2017-01-04 A kind of speech synthesizing method and system Pending CN108269560A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710004830.5A CN108269560A (en) 2017-01-04 2017-01-04 A kind of speech synthesizing method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710004830.5A CN108269560A (en) 2017-01-04 2017-01-04 A kind of speech synthesizing method and system

Publications (1)

Publication Number Publication Date
CN108269560A true CN108269560A (en) 2018-07-10

Family

ID=62771678

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710004830.5A Pending CN108269560A (en) 2017-01-04 2017-01-04 A kind of speech synthesizing method and system

Country Status (1)

Country Link
CN (1) CN108269560A (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109300459A (en) * 2018-09-07 2019-02-01 传线网络科技(上海)有限公司 Song chorus method and device
CN110264986A (en) * 2019-03-29 2019-09-20 深圳市即构科技有限公司 Online K song device, method and computer readable storage medium
CN110390925A (en) * 2019-08-02 2019-10-29 湖南国声声学科技股份有限公司深圳分公司 Voice and accompaniment synchronous method, terminal, bluetooth equipment and storage medium
CN110660376A (en) * 2019-09-30 2020-01-07 腾讯音乐娱乐科技(深圳)有限公司 Audio processing method, device and storage medium
CN111091800A (en) * 2019-12-25 2020-05-01 北京百度网讯科技有限公司 Song generation method and device
CN111599328A (en) * 2020-05-22 2020-08-28 广州酷狗计算机科技有限公司 Song synthesis method, device, equipment and storage medium
CN112581924A (en) * 2019-09-30 2021-03-30 广州艾美网络科技有限公司 Audio processing method and device based on point-to-sing equipment, storage medium and equipment
CN113192486A (en) * 2021-04-27 2021-07-30 腾讯音乐娱乐科技(深圳)有限公司 Method, equipment and storage medium for processing chorus audio
CN113270081A (en) * 2020-02-14 2021-08-17 原相科技股份有限公司 Method for adjusting accompaniment sound of song and electronic device for adjusting accompaniment sound of song
CN113380248A (en) * 2021-06-11 2021-09-10 北京声智科技有限公司 Voice control method, device, equipment and storage medium
CN113691841A (en) * 2020-05-18 2021-11-23 聚好看科技股份有限公司 Singing label adding method, rapid audition method and display device

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104966527A (en) * 2015-05-27 2015-10-07 腾讯科技(深圳)有限公司 Karaoke processing method, apparatus, and system
CN105006234A (en) * 2015-05-27 2015-10-28 腾讯科技(深圳)有限公司 Karaoke processing method and apparatus
CN105023559A (en) * 2015-05-27 2015-11-04 腾讯科技(深圳)有限公司 Karaoke processing method and system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104966527A (en) * 2015-05-27 2015-10-07 腾讯科技(深圳)有限公司 Karaoke processing method, apparatus, and system
CN105006234A (en) * 2015-05-27 2015-10-28 腾讯科技(深圳)有限公司 Karaoke processing method and apparatus
CN105023559A (en) * 2015-05-27 2015-11-04 腾讯科技(深圳)有限公司 Karaoke processing method and system

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109300459A (en) * 2018-09-07 2019-02-01 传线网络科技(上海)有限公司 Song chorus method and device
CN109300459B (en) * 2018-09-07 2022-03-15 阿里巴巴(中国)有限公司 Song chorusing method and device
CN110264986A (en) * 2019-03-29 2019-09-20 深圳市即构科技有限公司 Online K song device, method and computer readable storage medium
CN110390925B (en) * 2019-08-02 2021-08-10 湖南国声声学科技股份有限公司深圳分公司 Method for synchronizing voice and accompaniment, terminal, Bluetooth device and storage medium
CN110390925A (en) * 2019-08-02 2019-10-29 湖南国声声学科技股份有限公司深圳分公司 Voice and accompaniment synchronous method, terminal, bluetooth equipment and storage medium
CN110660376B (en) * 2019-09-30 2022-11-29 腾讯音乐娱乐科技(深圳)有限公司 Audio processing method, device and storage medium
CN112581924A (en) * 2019-09-30 2021-03-30 广州艾美网络科技有限公司 Audio processing method and device based on point-to-sing equipment, storage medium and equipment
CN110660376A (en) * 2019-09-30 2020-01-07 腾讯音乐娱乐科技(深圳)有限公司 Audio processing method, device and storage medium
CN111091800A (en) * 2019-12-25 2020-05-01 北京百度网讯科技有限公司 Song generation method and device
CN113270081A (en) * 2020-02-14 2021-08-17 原相科技股份有限公司 Method for adjusting accompaniment sound of song and electronic device for adjusting accompaniment sound of song
CN113691841A (en) * 2020-05-18 2021-11-23 聚好看科技股份有限公司 Singing label adding method, rapid audition method and display device
CN113691841B (en) * 2020-05-18 2022-08-30 聚好看科技股份有限公司 Singing label adding method, rapid audition method and display device
CN111599328A (en) * 2020-05-22 2020-08-28 广州酷狗计算机科技有限公司 Song synthesis method, device, equipment and storage medium
CN111599328B (en) * 2020-05-22 2024-04-09 广州酷狗计算机科技有限公司 Song synthesis method, device, equipment and storage medium
CN113192486A (en) * 2021-04-27 2021-07-30 腾讯音乐娱乐科技(深圳)有限公司 Method, equipment and storage medium for processing chorus audio
CN113192486B (en) * 2021-04-27 2024-01-09 腾讯音乐娱乐科技(深圳)有限公司 Chorus audio processing method, chorus audio processing equipment and storage medium
CN113380248A (en) * 2021-06-11 2021-09-10 北京声智科技有限公司 Voice control method, device, equipment and storage medium

Similar Documents

Publication Publication Date Title
CN108269560A (en) A kind of speech synthesizing method and system
JP5318095B2 (en) System and method for automatically beat-mixing a plurality of songs using an electronic device
CN101042752B (en) Method and sytem used for email administration
US7831432B2 (en) Audio menus describing media contents of media players
CN101014994A (en) Content creating device and content creating method
CN103597543A (en) Semantic audio track mixer
US10971125B2 (en) Music synthesis method, system, terminal and computer-readable storage medium
CN110211556A (en) Processing method, device, terminal and the storage medium of music file
d'Escrivan Music technology
CN108269561A (en) A kind of speech synthesizing method and system
US7612279B1 (en) Methods and apparatus for structuring audio data
Fröjd et al. Sound texture synthesis using an overlap–add/granular synthesis approach
Filimowicz Foundations in Sound Design for Linear Media: A Multidisciplinary Approach
Avarese Post sound design: the art and craft of audio post production for the moving image
Loscos Spectral processing of the singing voice.
KR101975193B1 (en) Automatic composition apparatus and computer-executable automatic composition method
US20110077756A1 (en) Method for identifying and playing back an audio recording
KR100826659B1 (en) Method for listening specific performance part which is erased or selected from music file
WO2023010949A1 (en) Method and apparatus for processing audio data
Moffat Evaluation of Synthesised Sound Effects
Harrison et al. A statistical-learning model of harmony perception
JPH11167388A (en) Music player device
Thulin Composing Places: Practices and Potentials of Sound Mapping and Locative Audio
Lamb J. Lynch The Tender Appropriation
Maz Music Technology Essentials: A Home Studio Guide

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180710

RJ01 Rejection of invention patent application after publication