CN107122493B

CN107122493B - Song playing method and device

Info

Publication number: CN107122493B
Application number: CN201710359613.8A
Authority: CN
Inventors: 张学华; 刘宇翔; 雷超然; 季雨晴
Original assignee: Beijing Kingsoft Internet Security Software Co Ltd
Current assignee: Beijing Kingsoft Internet Security Software Co Ltd
Priority date: 2017-05-19
Filing date: 2017-05-19
Publication date: 2020-04-28
Anticipated expiration: 2037-05-19
Also published as: CN107122493A

Abstract

The invention discloses a song playing method and a song playing device, wherein the method comprises the following steps: detecting a score and a pitch of a reference song; determining a target duration and a target tone of each character pronunciation in the target lyrics according to the music score and the tone of the reference song; and playing the target lyrics according to the target duration and the target tone of each character pronunciation. Therefore, the method and the device realize that any character and music score are used for playing the song, improve the convenience and the interestingness of artificial intelligence and improve the user experience.

Description

Song playing method and device

Technical Field

The invention relates to the field of artificial intelligence, in particular to a song playing method and device.

Background

Artificial Intelligence (Artificial Intelligence), abbreviated in english as AI. The method is a new technical science for researching and developing theories, methods, technologies and application systems for simulating, extending and expanding human intelligence. Artificial intelligence is a branch of computer science that attempts to understand the essence of intelligence and produce a new intelligent machine that can react in a manner similar to human intelligence, a field of research that includes robotics, speech recognition, image recognition, natural language processing, and expert systems.

At present, a fixed song can be sung through a fixed music score and a fixed vocalizing word, and the mode is single and not convenient for a user to use.

Disclosure of Invention

The present invention has been made to solve at least one of the technical problems of the related art to some extent.

Therefore, one objective of the present invention is to provide a song playing method, which is used for solving the problem that a fixed song can only be sung through a fixed music score and fixed vocalized words in the prior art, so as to play the song by using any character and music score, improve the convenience and interest of artificial intelligence, and improve the user experience.

A second object of the present invention is to provide a song playback apparatus.

A third object of the invention is to propose a computer device.

A fourth object of the invention is to propose a non-transitory computer-readable storage medium.

A fifth object of the invention is to propose a computer program product.

In order to achieve the above object, an embodiment of a first aspect of the present invention provides a song playing method, including: detecting a score and a pitch of a reference song; determining the target duration and the target tone of each character pronunciation in the target lyrics according to the music score and the tone of the reference song; and playing the target lyrics according to the target duration and the target tone of each character pronunciation.

The song playing method of the embodiment of the invention determines the target time length and the target tone of each character pronunciation in the target lyrics by detecting the music score and the tone of the reference song and according to the music score and the tone of the reference song, and finally plays the target lyrics according to the target time length and the target tone of each character pronunciation. Therefore, the method and the device realize that any character and music score are used for playing the song, improve the convenience and the interestingness of artificial intelligence and improve the user experience.

In addition, the song playing method of the embodiment of the invention also has the following additional technical characteristics:

optionally, the method further includes: determining a target tone used for playing the target lyrics; the playing the target lyrics according to the target duration and the target tone of each word pronunciation comprises the following steps: and playing the target lyrics according to the target duration and the target tone of each character pronunciation by adopting the target tone.

Optionally, the determining a target duration and a target pitch of each word pronunciation in the target lyric according to the score and the pitch of the reference song comprises: determining a word count difference of the lyrics of the reference song and the target lyrics; and determining the target duration and the target tone of each character pronunciation in the target lyrics according to the music score and the tone of the reference song and the number difference of the characters.

Optionally, the playing the target lyric according to the target duration and the target tone of each word pronunciation includes: detecting the original time length and the original tone of each character pronunciation in the target lyrics; comparing the target time length and the target tone of each character pronunciation with the original time length and the original tone of each character pronunciation to obtain the time length difference and the tone difference of each character pronunciation; and adjusting the original time length and the original tone of each character pronunciation according to the time length difference and the tone difference of each character pronunciation.

Optionally, the detecting an original duration and an original tone of each pronunciation of the word in the target lyric includes: detecting the original time length and the original tone of each character pronunciation of the target lyrics in the reading process; or detecting the original time length and the original tone of each character pronunciation of the target lyrics in the singing process.

To achieve the above object, a second embodiment of the present invention provides a song playing apparatus, including: a detection module for detecting the score and the pitch of the reference song; the first determining module is used for determining the target duration and the target tone of pronunciation of each word in the target lyrics according to the music score and the tone of the reference song; and the playing module is used for playing the target lyrics according to the target duration and the target tone of each character pronunciation.

The song playing device of the embodiment of the invention determines the target time length and the target tone of each character pronunciation in the target lyrics by detecting the music score and the tone of the reference song according to the music score and the tone of the reference song, and finally plays the target lyrics according to the target time length and the target tone of each character pronunciation. Therefore, the method and the device realize that any character and music score are used for playing the song, improve the convenience and the interestingness of artificial intelligence and improve the user experience.

In addition, the song playing device of the embodiment of the invention also has the following additional technical characteristics:

optionally, the apparatus further includes: the second determination module is used for determining a target tone for playing the target lyrics; the playing module is used for: and playing the target lyrics according to the target duration and the target tone of each character pronunciation by adopting the target tone.

Optionally, the first determining module is configured to: determining a word count difference of the lyrics of the reference song and the target lyrics; and determining the target duration and the target tone of each character pronunciation in the target lyrics according to the music score and the tone of the reference song and the number difference of the characters.

Optionally, the playing module includes: the detection unit is used for detecting the original time length and the original tone of each character pronunciation in the target lyrics; the acquisition unit is used for comparing the target time length and the target tone of each character pronunciation with the original time length and the original tone of each character pronunciation to acquire the time length difference and the tone difference of each character pronunciation; and the adjusting unit is used for adjusting the original time length and the original tone of each character pronunciation according to the time length difference and the tone difference of each character pronunciation.

Optionally, the detection unit is configured to: detecting the original time length and the original tone of each character pronunciation of the target lyrics in the reading process; or detecting the original time length and the original tone of each character pronunciation of the target lyrics in the singing process.

To achieve the above object, a third embodiment of the present invention provides a computer device, including: comprising a memory, a processor, and a computer program stored on the memory and executable on the processor, wherein the processor, when executing the program, enables performing a song playback method, the method comprising: detecting a score and a pitch of a reference song; determining the target duration and the target tone of each character pronunciation in the target lyrics according to the music score and the tone of the reference song; and playing the target lyrics according to the target duration and the target tone of each character pronunciation.

In order to achieve the above object, a fourth aspect of the present invention provides a non-transitory computer-readable storage medium, in which instructions are executed by a processor to enable a song playback method to be performed, the method including: detecting a score and a pitch of a reference song; determining the target duration and the target tone of each character pronunciation in the target lyrics according to the music score and the tone of the reference song; and playing the target lyrics according to the target duration and the target tone of each character pronunciation.

In order to achieve the above object, a fifth aspect of the present invention provides a computer program product, wherein when executed by an instruction processor of the computer program product, a song playing method is performed, and the method includes: detecting a score and a pitch of a reference song; determining the target duration and the target tone of each character pronunciation in the target lyrics according to the music score and the tone of the reference song; and playing the target lyrics according to the target duration and the target tone of each character pronunciation.

Additional aspects and advantages of the invention will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the invention.

Drawings

Fig. 1 is a flowchart illustrating a song playing method according to an embodiment of the present invention;

fig. 2 is a flowchart illustrating a song playing method according to another embodiment of the present invention;

fig. 3 is a flowchart illustrating a song playing method according to another embodiment of the present invention;

fig. 4 is a schematic structural diagram of a song playback apparatus according to an embodiment of the present invention;

fig. 5 is a schematic structural diagram of a song playback apparatus according to another embodiment of the present invention;

fig. 6 is a schematic structural diagram of a song playback apparatus according to yet another embodiment of the present invention;

fig. 7 is a schematic structural diagram of a song playback apparatus according to still another embodiment of the present invention.

Detailed Description

Reference will now be made in detail to embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to the same or similar elements or elements having the same or similar function throughout. The embodiments described below with reference to the drawings are illustrative and intended to be illustrative of the invention and are not to be construed as limiting the invention.

A song playback method and apparatus according to an embodiment of the present invention are described below with reference to the accompanying drawings.

Fig. 1 is a flowchart of a song playing method according to an embodiment of the present invention.

It should be noted that the song playing method in the embodiment of the present application may be applied to devices (such as a mobile phone, a tablet, and a computer) of systems such as Android, IOS, and PC.

As shown in fig. 1, the song playing method includes the following steps:

step 101, the score and the pitch of the reference song are detected.

In practical applications, the scores and tones of different songs are different, and the reference song can be selected and the corresponding score and tone can be detected according to requirements. There are many ways to detect the score and pitch of a reference song, for example as follows:

in a first example, the corresponding score and pitch are matched by entering the title of the reference song into the relational database.

In a second example, a corresponding score and pitch are obtained by playing a reference song and performing a correlation algorithm process.

Step 102, determining the target duration and the target pitch of each character pronunciation in the target lyric according to the music score and the pitch of the reference song.

Specifically, after detecting the score and the pitch of the reference song, the target time length and the target pitch of each character pronunciation in the target lyric can be determined according to the score and the pitch of the reference song, and it can be understood that there are many ways to determine the target time length and the target pitch of each character pronunciation in the target lyric, and the setting can be selected according to the practical application needs, for example, as follows:

as an example, the word number difference between the lyrics of the reference song and the target lyrics is determined, and the target time length and the target pitch of each word in the target lyrics is determined according to the score and pitch of the reference song and the word number difference.

And 103, playing the target lyrics according to the target duration and the target tone of each character pronunciation.

Specifically, after the target time length and the target tone of each character pronunciation in the target lyric are determined, the target lyric can be played according to the target time length and the target tone of each character pronunciation. It is understood that there are many ways to play the target lyrics, and the setting can be selected according to the actual application requirement, for example, as follows:

as an example, the original time length and the original tone of each character pronunciation in the target lyric are detected, the target time length and the target tone of each character pronunciation and the original time length and the original tone of each character pronunciation are compared, the time length difference and the tone difference of each character pronunciation are obtained, and the original time length and the original tone of each character pronunciation are adjusted according to the time length difference and the tone difference of each character pronunciation.

Fig. 2 is a flowchart of a song playing method according to another embodiment of the present invention. As shown in fig. 1, the song playing method includes the following steps:

in step 201, the score and pitch of the reference song are detected.

It should be noted that the description of step S201 corresponds to step S101, and thus the description of step S201 refers to the description of step S101, and is not repeated herein.

In step 202, the word count difference between the lyrics of the reference song and the target lyrics is determined.

Step 203, determining the target duration and the target pitch of each character pronunciation in the target lyric according to the music score and the pitch of the reference song and the number difference of the characters.

Specifically, the lyrics of the reference song and the number of the lyrics may be obtained, and the target lyrics may be input by the user in a manner of reading aloud through a related voice device, that is, the number of words corresponding to the target lyrics may be obtained, so that the word number difference between the lyrics of the reference song and the target lyrics may be determined.

Further, determining a target time length and a target pitch of each character pronunciation in the target lyric according to the music score and the pitch of the reference song and the character number difference, for example, when the character number difference is 0, determining that the target time length and the target pitch of each character pronunciation in the target lyric are consistent with the time length and the pitch of each character pronunciation in the corresponding lyric in the reference song; when the number difference is 0, the pronunciation time length and the pronunciation pitch of each word in the corresponding lyrics in the reference song are adjusted and then determined as the target pronunciation time length and the target pronunciation pitch of each word in the target lyrics.

Step 204, determine the target tone for playing the target lyrics.

And step 205, playing the target lyrics according to the target duration and the target tone of each character pronunciation by adopting the target tone.

Specifically, the timbre of the target lyric can be acquired when the user inputs the target lyric by voice, so that the target lyric can be played by adopting the target timbre according to the target duration and the target tone of each character pronunciation. For example, the voice of the Chinese woman is used to read a target lyric, and a music score of the snail is matched, so that the finally obtained song is performed by using the tone of the Chinese woman and the target lyric is sheathed with the melody of the snail.

Thus, using the timbre of any user, together with the lyrics and the score, it is possible to sing a given song with the timbre of this user.

Based on the above embodiments, in order to more clearly describe the specific process of how to play the target lyrics according to the target duration and the target pitch of each word pronunciation, the following description is made in detail with reference to fig. 3.

Fig. 3 is a flowchart of a song playing method according to another embodiment of the present invention. As shown in fig. 3, step 103 includes:

step 301, detecting the original time length and the original tone of each character pronunciation in the target lyric.

Step 302, comparing the target time length and the target tone of each character pronunciation with the original time length and the original tone of each character pronunciation to obtain the time length difference and the tone difference of each character pronunciation.

Step 303, adjusting the original time length and the original tone of each character pronunciation according to the time length difference and the tone difference of each character pronunciation.

Specifically, there are many ways to detect the original duration and the original pitch of each character pronunciation in the target lyric, and the detection can be selectively set according to the actual application requirements, for example, as follows:

in the first example, the original time length and the original tone of each word pronunciation of the target lyrics in the reading process are detected.

As a second example, the original duration and pitch of each word utterance of the target lyrics during singing is detected.

And further, comparing the target time length and the target tone of each character pronunciation with the original time length and the original tone of each character pronunciation, acquiring the time length difference and the tone difference of each character pronunciation, and finally adjusting the original time length and the original tone of each character pronunciation according to the time length difference and the tone difference of each character pronunciation.

The method is realized as a scene, a section of characters are converted into voice through a voice reading system, then an original tone A required to be sung by target lyrics is found according to a music score of a reference song, an actual target tone B of the currently read target lyrics is found through a detection mode, a tone difference C is calculated, and then the read target tone B is adjusted to the original tone A, so that the actually emitted tone is not the target tone B but the original tone A, and tone matching is carried out.

Meanwhile, the original time length b needing singing currently and the target time length a of the target lyrics to be read aloud currently are obtained according to the music score, the time length difference c is calculated, the target time length actually read aloud is lengthened or shortened to the original time length needing singing according to an algorithm, and after all the target lyrics are finished, the singing audio is obtained, and the reading audio is replaced for playing.

Therefore, the method and the device realize that any character and music score are used for playing the song, improve the convenience and the interestingness of artificial intelligence and improve the user experience.

In order to implement the above embodiments, the present invention provides a song playback apparatus.

Fig. 4 is a schematic structural diagram of a song playback apparatus according to an embodiment of the present invention.

As shown in fig. 4, the song playback apparatus includes: a detection module 11, a first determination module 12 and a play module 13.

Wherein the detecting module 11 is configured to detect a score and a pitch of the reference song.

And a first determining module 12 for determining a target duration and a target pitch of pronunciation of each word in the target lyrics according to the score and the pitch of the reference song.

And the playing module 13 is used for playing the target lyrics according to the target duration and the target tone of each character pronunciation.

Further, as shown in fig. 5, on the basis of fig. 4, the method further includes: a second determination module 14.

Wherein, the second determining module 14 is configured to determine a target tone for playing the target lyric.

And the playing module 13 is used for playing the target lyrics according to the target duration and the target tone of each character pronunciation by adopting the target tone.

Further, a first determining module 12 for determining a word number difference between the lyrics of the reference song and the target lyrics; and determining the target duration and the target tone of each character pronunciation in the target lyrics according to the music score and the tone of the reference song and the number difference of the characters.

Further, as shown in fig. 6, the playing module includes: a detection unit 131, an acquisition unit 132, and an adjustment unit 133.

The detecting unit 131 is configured to detect an original time length and an original pitch of each character pronunciation in the target lyric.

An obtaining unit 132, configured to compare the target duration and the target pitch of each character pronunciation with the original duration and the original pitch of each character pronunciation, and obtain a duration difference and a pitch difference of each character pronunciation.

The adjusting unit 133 is configured to adjust the original time length and the original pitch of each character pronunciation according to the time length difference and the pitch difference of each character pronunciation.

Further, the detecting unit 131 is configured to detect an original duration and an original tone of each character pronunciation of the target lyric in the reading process; or detecting the original time length and the original tone of each character pronunciation of the target lyrics in the singing process.

It should be noted that the foregoing explanation on the embodiment of the song playing method is also applicable to the song playing apparatus of this embodiment, and is not repeated here.

In summary, the song playing apparatus according to the embodiment of the present invention determines the target time length and the target pitch of each character in the target lyrics by detecting the score and the pitch of the reference song, and finally plays the target lyrics according to the target time length and the target pitch of each character. Therefore, the method and the device realize that any character and music score are used for playing the song, improve the convenience and the interestingness of artificial intelligence and improve the user experience.

Fig. 7 is a schematic structural diagram of an information providing apparatus based on picture content according to still another embodiment of the present invention. As shown in fig. 7, the picture content-based information providing apparatus includes:

a memory 21, a processor 22 and a computer program stored on the memory 21 and executable on the processor 22.

The processor 22, when executing the program, implements the song playback method provided in the above-described embodiment.

Further, the song playback apparatus further includes:

a communication interface 23 for communication between the memory 21 and the processor 22.

A memory 21 for storing a computer program operable on the processor 22.

The memory 21 may comprise a high-speed RAM memory, and may further include a non-volatile memory (non-volatile memory), such as at least one disk memory.

And a processor 22, configured to implement the song playing method according to the foregoing embodiment when executing the program.

If the memory 21, the processor 22 and the communication interface 23 are implemented independently, the communication interface 21, the memory 21 and the processor 22 may be connected to each other through a bus and perform communication with each other. The bus may be an Industry Standard Architecture (ISA) bus, a Peripheral Component Interconnect (PCI) bus, an Extended ISA (enhanced Industry Standard Architecture) bus, or the like. The bus may be divided into an address bus, a data bus, a control bus, etc. For ease of illustration, only one thick line is shown in FIG. 7, but this is not intended to represent only one bus or type of bus.

Optionally, in a specific implementation, if the memory 21, the processor 22 and the communication interface 23 are integrated on a chip, the memory 21, the processor 22 and the communication interface 23 may complete mutual communication through an internal interface.

The processor 22 may be a Central Processing Unit (CPU), or an Application Specific Integrated Circuit (ASIC), or one or more Integrated circuits configured to implement embodiments of the present invention.

In order to achieve the above embodiments, the present invention further provides a computer device, which is characterized by comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor executes the program to enable a song playing to be performed, and the method comprises: detecting a score and a pitch of a reference song; determining a target duration and a target tone of each character pronunciation in the target lyrics according to the music score and the tone of the reference song; and playing the target lyrics according to the target duration and the target tone of each character pronunciation.

To achieve the above embodiments, the present invention also proposes a non-transitory computer-readable storage medium in which instructions, when executed by a processor, enable execution of a song playback, the method comprising: detecting a score and a pitch of a reference song; determining a target duration and a target tone of each character pronunciation in the target lyrics according to the music score and the tone of the reference song; and playing the target lyrics according to the target duration and the target tone of each character pronunciation.

To achieve the above embodiments, the present invention further provides a computer program product, which when executed by an instruction processor enables a song playing to be performed, the method comprising: detecting a score and a pitch of a reference song; determining a target duration and a target tone of each character pronunciation in the target lyrics according to the music score and the tone of the reference song; and playing the target lyrics according to the target duration and the target tone of each character pronunciation.

Furthermore, the terms "first", "second" and "first" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defined as "first" or "second" may explicitly or implicitly include at least one such feature. In the description of the present invention, "a plurality" means at least two, e.g., two, three, etc., unless specifically limited otherwise.

In the description herein, references to the description of the term "one embodiment," "some embodiments," "an example," "a specific example," or "some examples," etc., mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above are not necessarily intended to refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. Furthermore, various embodiments or examples and features of different embodiments or examples described in this specification can be combined and combined by one skilled in the art without contradiction.

Any process or method descriptions in flow charts or otherwise described herein may be understood as representing modules, segments, or portions of code which include one or more executable instructions for implementing steps of a custom logic function or process, and alternate implementations are included within the scope of the preferred embodiment of the present invention in which functions may be executed out of order from that shown or discussed, including substantially concurrently or in reverse order, depending on the functionality involved, as would be understood by those reasonably skilled in the art of the present invention.

The logic and/or steps represented in the flowcharts or otherwise described herein, e.g., an ordered listing of executable instructions that can be considered to implement logical functions, can be embodied in any computer-readable medium for use by or in connection with an instruction execution system, apparatus, or device, such as a computer-based system, processor-containing system, or other system that can fetch the instructions from the instruction execution system, apparatus, or device and execute the instructions. For the purposes of this description, a "computer-readable medium" can be any means that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device. More specific examples (a non-exhaustive list) of the computer-readable medium would include the following: an electrical connection (electronic device) having one or more wires, a portable computer diskette (magnetic device), a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber device, and a portable compact disc read-only memory (CDROM). Additionally, the computer-readable medium could even be paper or another suitable medium upon which the program is printed, as the program can be electronically captured, via for instance optical scanning of the paper or other medium, then compiled, interpreted or otherwise processed in a suitable manner if necessary, and then stored in a computer memory.

It should be understood that portions of the present invention may be implemented in hardware, software, firmware, or a combination thereof. In the above embodiments, the various steps or methods may be implemented in software or firmware stored in memory and executed by a suitable instruction execution system. If implemented in hardware, as in another embodiment, any one or combination of the following techniques, which are known in the art, may be used: a discrete logic circuit having a logic gate circuit for implementing a logic function on a data signal, an application specific integrated circuit having an appropriate combinational logic gate circuit, a Programmable Gate Array (PGA), a Field Programmable Gate Array (FPGA), or the like.

It will be understood by those skilled in the art that all or part of the steps carried by the method for implementing the above embodiments may be implemented by hardware related to instructions of a program, which may be stored in a computer readable storage medium, and when the program is executed, the program includes one or a combination of the steps of the method embodiments.

In addition, functional units in the embodiments of the present invention may be integrated into one processing module, or each unit may exist alone physically, or two or more units are integrated into one module. The integrated module can be realized in a hardware mode, and can also be realized in a software functional module mode. The integrated module, if implemented in the form of a software functional module and sold or used as a stand-alone product, may also be stored in a computer readable storage medium.

The storage medium mentioned above may be a read-only memory, a magnetic or optical disk, etc. Although embodiments of the present invention have been shown and described above, it is understood that the above embodiments are exemplary and should not be construed as limiting the present invention, and that variations, modifications, substitutions and alterations can be made to the above embodiments by those of ordinary skill in the art within the scope of the present invention.

Claims

1. A song playback method, comprising the steps of:

detecting a score and a pitch of a reference song; wherein, the reference song is selected according to the requirement;

determining the target duration and the target tone of each character pronunciation in the target lyrics according to the music score and the tone of the reference song;

playing the target lyrics according to the target duration and the target tone of each character pronunciation;

determining a target tone used for playing the target lyrics; acquiring the target tone when the target lyrics are input according to the voice of a user;

the playing the target lyrics according to the target duration and the target tone of each word pronunciation comprises the following steps:

playing the target lyrics according to the target duration and the target tone of each character pronunciation by adopting the target tone;

wherein, the determining the target duration and the target pitch of each character pronunciation in the target lyrics according to the music score and the pitch of the reference song comprises:

determining a word count difference of the lyrics of the reference song and the target lyrics;

and determining the target duration and the target tone of each character pronunciation in the target lyrics according to the music score and the tone of the reference song and the number difference of the characters.

2. The method of claim 1, wherein playing the target lyrics according to the target duration and target pitch of each word utterance comprises:

detecting the original time length and the original tone of each character pronunciation in the target lyrics;

comparing the target time length and the target tone of each character pronunciation with the original time length and the original tone of each character pronunciation to obtain the time length difference and the tone difference of each character pronunciation;

and adjusting the original time length and the original tone of each character pronunciation according to the time length difference and the tone difference of each character pronunciation.

3. The method of claim 2, wherein the detecting the original duration and the original pitch of the pronunciation of each word in the target lyrics comprises:

detecting the original time length and the original tone of each character pronunciation of the target lyrics in the reading process; alternatively, the first and second electrodes may be,

and detecting the original time length and the original tone of each character pronunciation of the target lyrics in the singing process.

4. A song playback apparatus, comprising:

a detection module for detecting the score and the pitch of the reference song; wherein, the reference song is selected according to the requirement;

the first determining module is used for determining the target duration and the target tone of pronunciation of each word in the target lyrics according to the music score and the tone of the reference song;

the playing module is used for playing the target lyrics according to the target duration and the target tone of each character pronunciation;

the second determination module is used for determining a target tone for playing the target lyrics; acquiring the target tone when the target lyrics are input according to the voice of a user;

the playing module is used for:

wherein the first determination module is to:

5. The apparatus of claim 4, wherein the play module comprises:

the detection unit is used for detecting the original time length and the original tone of each character pronunciation in the target lyrics;

the acquisition unit is used for comparing the target time length and the target tone of each character pronunciation with the original time length and the original tone of each character pronunciation to acquire the time length difference and the tone difference of each character pronunciation;

and the adjusting unit is used for adjusting the original time length and the original tone of each character pronunciation according to the time length difference and the tone difference of each character pronunciation.

6. The apparatus of claim 5, wherein the detection unit is to:

7. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor when executing the program implements the method of any one of claims 1-3.

8. A non-transitory computer-readable storage medium, on which a computer program is stored, which, when executed by a processor, implements the method according to any one of claims 1-3.

9. A computer program product in which instructions, when executed by a processor, perform the method of any one of claims 1-3.