CN108269560A

CN108269560A - A kind of speech synthesizing method and system

Info

Publication number: CN108269560A
Application number: CN201710004830.5A
Authority: CN
Inventors: 石会超
Original assignee: Beijing Kuwo Technology Co Ltd
Current assignee: Beijing Kuwo Technology Co Ltd
Priority date: 2017-01-04
Filing date: 2017-01-04
Publication date: 2018-07-10

Abstract

The present embodiments relate to a kind of speech synthesizing method and system, this method includes：The lyrics file of song is analyzed, obtains the segment informations of the lyrics in the song, wherein segment information includes the singer of every lyrics and for identifying the temporal information that the lyrics show the time；According to the analysis result of lyrics file, the voice track in audio file is handled, generates the accompaniment audio file of band part original singer；Store the voice file that user sings；Voice file and accompaniment audio file are synthesized into song files, and be uploaded to server.The vocal sections of any one singer in choral song can be erased, retain the original singer part of other singers in accompaniment, user only needs to sing the corresponding lyrics of the vocal sections erased and can be realized and star's chorus, realize the diversity of user's performance, new K song playing methods are provided to the user, the sense of accomplishment of user can be improved, and then improves user's retention ratio.

Description

A kind of speech synthesizing method and system

Technical field

The present invention relates to mobile terminal K song technical fields more particularly to a kind of speech synthesizing method and systems.

Background technology

As the improvement of people's living standards, people’s lives are just gradually moved towards in K song applications.And with electronic product not Disconnected development, people are constantly inclined to carries out K songs using mobile terminal.

Mobile terminal K sings the combination that system is music player and recording software at present, can both play original singer, can also The song of user is recorded, and the song of recording mix audio mixing to music file with accompaniment, user can also be by above-mentioned sound Music file is uploaded to network, and more people is allowed to hear the song of oneself.But more and more people are unsatisfactory for current independent K song moulds Formula, it is desirable to realize in a song and chorus with other people.

For chorusing with other users and star's chorus can more excite the chorus desire of user, the present invention is by certain Algorithm, the member-retaining portion original singer in accompaniment, user, which only needs to sing, oneself needs the lyrics por-tion sung to can be realized and star Chorus has provided new K song playing methods to the user.

Invention content

The present invention allows user's perception of accomplishment, improves user's retention ratio, provide preferably to realize and the function of star's chorus A kind of speech synthesizing method and system.

On the one hand, a kind of speech synthesizing method is provided, including：The lyrics file of song is analyzed, is obtained in the song The segment information of the lyrics, wherein the segment information include every lyrics singer and for identify the lyrics show the time when Between information；

According to the analysis result of lyrics file, the voice track in the audio file is handled, generates band part The accompaniment audio file of original singer；

The voice file and the accompaniment audio file that user is sung synthesize song files, and be uploaded to server.

Preferably, the lyrics file of the analysis song obtains the segment information of the lyrics in the song, including：According to " man ", " female " and " conjunction " key word analysis marked in the lyrics of the lyrics file goes out the segmentation letter of the lyrics in the song Breath.

Preferably, the analysis result according to lyrics file handles the voice track in the audio file, The accompaniment audio file of band part original singer is generated, including：Voice track in the audio file is handled, according to song Temporal information in word file analysis result, original singer's voice of period is corresponded in voice track of erasing, and generation is former with part The accompaniment audio file sung.

Preferably, the song is two people's choral songs.

On the other hand, a kind of sound synthetic system is provided, including：

Analysis module for analyzing the lyrics file of song, obtains the segment information of the lyrics in the song, wherein described Segment information includes singer and the temporal information of every lyrics；

Processing module according to the analysis result of lyrics file, is handled the voice track in the audio file, raw Into the accompaniment audio file of band part original singer；

Synthesis module, voice file and the accompaniment audio file for user to be sung synthesize song files；

Sending module, for the song files to be uploaded to server.

Preferably, the analysis module is specifically used for, according to marked in the lyrics of the lyrics file " man ", " female " with And " conjunction " key word analysis goes out the segment information of the lyrics in the song.

Preferably, the processing module is specifically used for, and the voice track in the audio file is handled, according to song Temporal information in word file analysis result, original singer's voice of period is corresponded in voice track of erasing, and generation is former with part The accompaniment audio file sung.

Preferably, the song is two people's choral songs.

A kind of speech synthesizing method provided in an embodiment of the present invention by analyzing the lyrics, is then tied according to analysis The temporal information of the lyrics handles the voice track in audio file in fruit, erases the original that the period is corresponded in voice track Voice is sung, obtains the accompaniment audio file of band part original singer；User sings the lyrics por-tion for being erased voice according to lyrics prompting, The voice file of user is generated, then synthesizes song files with the accompaniment audio file with part original singer, that is, is obtained complete Works.This method can erase the original singer vocal sections of any one singer in choral song, provide to the user new K song playing methods, realize the function that is synthesized with star well in K sings software, the sense of accomplishment of user, Jin Erti can be improved For the retention ratio of user.

Description of the drawings

To describe the technical solutions in the embodiments of the present invention more clearly, make required in being described below to embodiment Attached drawing is briefly described.It should be evident that the accompanying drawings in the following description is only some embodiments of the present invention.

Fig. 1 is a kind of flow chart of speech synthesizing method provided in an embodiment of the present invention；

Fig. 2 is a kind of structure chart of sound synthetic system provided in an embodiment of the present invention.

Specific embodiment

Purpose, technical scheme and advantage to make the embodiment of the present invention are clearer, below in conjunction with the embodiment of the present invention In drawings and examples, the technical solution in the embodiment of the present invention is explicitly described.

Fig. 1 is a kind of flow chart of the method for sound rendering provided in an embodiment of the present invention, and this method can sing software by K Mobile client perform, as shown in Figure 1, this method includes：

Step 110, the lyrics file of song is analyzed, obtains the segment information of the lyrics in the song.

Specifically, original singer's audio file of song and corresponding lyrics file are obtained, wherein, the lyrics file packet Include keywords such as " men " that is marked in song lyrics, the lyrics, " female " and " conjunction " and for representing that the lyrics show the time of time Information.

" man ", " female " and " conjunction " key word analysis according to being marked in the lyrics of the lyrics file goes out in the song The segment information of the lyrics, wherein, the segment information includes：The singer of every lyrics and show the time for identifying the lyrics Temporal information, i.e. every lyrics are that who sings and every lyrics sing required time.

It should be noted that song described herein, specifically two people's choral songs.

Step 120, according to the analysis result of lyrics file, the voice track in audio file is handled, generates band The accompaniment audio file of part original singer.

Specifically, the voice track in the audio file is handled, according to lyrics file analysis knot in step 110 Temporal information in fruit corresponds to original singer's voice of period, audio accompaniment of the generation with part original singer in voice track of erasing File.

According to the temporal information of the lyrics, it can erase any one in original singer's audio file according to user wish and sing The original singer vocal sections of person have provided new K songs playing method to the user, have realized the function of being synthesized with star well, can be with The sense of accomplishment of user is provided, and then improves user's retention ratio.

It is exemplified below：

User wants to give song recitals《The love of boat tracker》, it can be according to the analysis result of lyrics file, it, will to audio file processing All " man " sings the vocal sections that the lyrics were corresponded in the period and erases, and generation only carries the companion of the vocal sections of " female " sound original singer Play audio file；Whole " female " the performance lyrics can also be corresponded to the vocal sections in the period and erased by user, and generation only carries The accompaniment audio file of the vocal sections of " man " sound original singer.That is, according to the temporal information of the lyrics, can according to the wish of user, Erase the original singer vocal sections of any one singer in original singer's audio file, audio accompaniment text of the generation with part original singer Part.It in other words, that is, the temporal information according to the lyrics, can be by any one in choral song according to the wish of user The original singer vocal sections of singer erase, accompaniment audio file of the generation with remaining original singer vocal sections.

Step 130, the voice file and accompaniment audio file user sung synthesize song files.

Specifically, the accompaniment audio file with part original singer generated in user's download step 120, and carried according to the lyrics Show, sing the lyrics there is no original singer vocal sections, that is, it is proposed according to the lyrics, sings and erased the corresponding lyrics of vocal sections, Generate new voice file, then by the voice file of generation and download accompaniment audio file synthesize song files to get New complete works have been arrived, have completed the chorus with star.

Step 140, song files are uploaded to server.

Song files can be uploaded to server by user, for oneself and other users appreciation.

A kind of speech synthesizing method provided in an embodiment of the present invention by analyzing the lyrics, is then tied according to analysis The temporal information of the lyrics handles the voice track in audio file in fruit, erases the original that the period is corresponded in voice track Voice is sung, obtains the accompaniment audio file of band part original singer；User sings the lyrics por-tion for being erased voice according to lyrics prompting, The voice file of user is generated, is then synthesized with the accompaniment audio file with part original singer, song files is generated, that is, has obtained Whole works.This method can erase the original singer vocal sections of any one singer in choral song, provide to the user new K song playing methods, realize the function of synthesize with star well, improve the sense of accomplishment of user, and then the retention ratio of offer user.

Corresponding with above method embodiment, the embodiment of the present invention additionally provides a kind of sound synthetic system, specific such as Fig. 2 Shown, which includes：Analysis module 201, processing module 202, synthesis module 203 and sending module 204.

Analysis module 201 for analyzing the lyrics file of song, obtains the segment information of the lyrics in the song, wherein The segment information includes singer and the temporal information of every lyrics；

Processing module 202, according to the analysis result of lyrics file, at the voice track in the audio file Reason generates the accompaniment audio file of band part original singer；

Synthesis module 203, voice file and the accompaniment audio file for user to be sung synthesize song files；

Sending module 204, for the song files to be uploaded to server.

Specifically, analysis module 201 is used for, according to marked in the lyrics of the lyrics file " man ", " female " and " conjunction " key word analysis goes out the segment information of the lyrics in the song.Wherein, the song is choral song, and specifically two people close It sings bent.

Specifically, processing module 202 is used for, the voice track in the audio file is handled, according to lyrics text Temporal information in part analysis result, correspond to original singer's voice of period in voice track of erasing, and generation is with part original singer Accompaniment audio file.

Function in the above sound synthesis system provided in an embodiment of the present invention performed by each section is in above-mentioned reality It applies and is discussed in detail in a kind of speech synthesizing method of example offer, which is not described herein again.

A kind of sound synthetic system provided in an embodiment of the present invention by analyzing the lyrics, is then tied according to analysis The temporal information of the lyrics handles the voice track in audio file in fruit, erases the original that the period is corresponded in voice track Voice is sung, obtains the accompaniment audio file of band part original singer；User sings the lyrics por-tion for being erased voice according to lyrics prompting, The voice file of user is generated, then synthesizes song files with the accompaniment audio file with part original singer, that is, is obtained complete Works.The original singer vocal sections of any one singer in choral song can be erased using this system, are provided to the user New K song playing methods, in K sings software realize the function of being synthesized with star, user are allowed to have sense of accomplishment, provides use well The retention ratio at family.

Professional should further appreciate that, be described with reference to the embodiments described herein each exemplary Unit and algorithm steps can be realized with the combination of electronic hardware, computer software or the two, hard in order to clearly demonstrate The interchangeability of part and software generally describes each exemplary composition and step according to function in the above description. These functions are performed actually with hardware or software mode, specific application and design constraint depending on technical solution. Professional technician specifically can realize described function to each using distinct methods, but this realization is not It is considered as beyond the scope of this invention.

One of ordinary skill in the art will appreciate that all or part of the steps of the method in the foregoing embodiments are can be with By program come instruction processing unit complete, program can be stored in computer readable storage medium, storage medium is non-short Temporary property (non-transitory) medium, such as random access memory, read-only memory, flash memory, hard disk, solid-state are hard Disk, tape (magnetic tape), floppy disk (floppy disk), CD (optical disc) and its arbitrary combination.More than, It is merely preferred embodiments of the present invention, but protection scope of the present invention is not limited thereto.

Claims

1. a kind of speech synthesizing method, which is characterized in that including：

The lyrics file of song is analyzed, obtains the segment information of the lyrics in the song, wherein the segment information includes every The singer of the lyrics and for identify the lyrics show the time temporal information；

According to the analysis result of lyrics file, the voice track in the audio file is handled, generates band part original singer Accompaniment audio file；

2. speech synthesizing method according to claim 1, which is characterized in that the lyrics file of the analysis song obtains The segment information of the lyrics in the song, including：

" man ", " female " and " conjunction " key word analysis according to being marked in the lyrics of the lyrics file goes out the lyrics in the song Segment information.

3. speech synthesizing method according to claim 1, which is characterized in that the analysis result according to lyrics file, Voice track in the audio file is handled, generates the accompaniment audio file of band part original singer, including：

Voice track in the audio file is handled, according to the temporal information in lyrics file analysis result, is erased Original singer's voice of period, accompaniment audio file of the generation with part original singer are corresponded in voice track.

4. speech synthesizing method according to claim 1, which is characterized in that the song is two people's choral songs.

5. a kind of sound synthetic system, which is characterized in that including：

Analysis module for analyzing the lyrics file of song, obtains the segment information of the lyrics in the song, wherein the segmentation Information includes singer and the temporal information of every lyrics；

Processing module according to the analysis result of lyrics file, is handled the voice track in the audio file, generates band The accompaniment audio file of part original singer；

Sending module, for the song files to be uploaded to server.

6. sound synthetic system according to claim 5, which is characterized in that the analysis module is specifically used for, according to institute State " man " that is marked in the lyrics of lyrics file, " female " and " conjunction " key word analysis go out the segmentation letters of the lyrics in the song Breath.

7. sound synthetic system according to claim 5, which is characterized in that the processing module is specifically used for, to described Voice track in audio file is handled, according to the temporal information in lyrics file analysis result, in voice track of erasing Original singer's voice of corresponding period, accompaniment audio file of the generation with part original singer.

8. sound synthetic system according to claim 5, which is characterized in that the song is two people's choral songs.