CN110428798B

CN110428798B - Method for synchronizing voice and accompaniment, Bluetooth device, terminal and storage medium

Info

Publication number: CN110428798B
Application number: CN201910712728.XA
Authority: CN
Inventors: 夏波; 李天边; 詹昌寿
Original assignee: Hunan Voc Acoustic Technology Co ltd; Hunan Guosheng Acoustics Technology Co ltd Shenzhen Branch
Current assignee: Hunan Voc Acoustic Technology Co ltd; Hunan Guosheng Acoustics Technology Co ltd Shenzhen Branch
Priority date: 2019-08-02
Filing date: 2019-08-02
Publication date: 2021-08-10
Anticipated expiration: 2039-08-02
Also published as: CN110428798A

Abstract

The embodiments of the invention disclose a method for synchronizing voice and accompaniment, a Bluetooth device, a terminal and a storage medium, and relate to the technical field of audio processing. The method comprises: receiving accompaniment audio sent by a terminal, wherein the accompaniment audio comprises characteristic audio and accompaniment music, and the characteristic audio is spliced at the head of the accompaniment music; decoding the accompaniment audio, identifying and filtering out the characteristic audio, and obtaining the position of the first sampling point of the accompaniment music in the accompaniment audio; playing the accompaniment audio from the position of the first sampling point of the accompaniment music while synchronously triggering voice capture, to obtain the voice audio sung by the user along with the accompaniment audio; and compressing the voice audio and uploading it to the terminal, so that the terminal decompresses the voice audio and mixes it with the accompaniment music stored locally on the terminal to obtain mixed audio. The embodiments of the invention can achieve complete synchronization of voice and accompaniment and improve the mixing result.

Description

Method for synchronizing voice and accompaniment, Bluetooth device, terminal and storage medium

Technical Field

The embodiments of the invention relate to the technical field of audio processing, and in particular to a method for synchronizing voice and accompaniment, a Bluetooth device, a terminal and a storage medium.

Background

With the development of terminal technology, recording karaoke songs on terminals such as mobile phones has become a very common form of entertainment. At present, karaoke on a terminal generally uses a Bluetooth headset as the device that plays the accompaniment and captures the voice.

During karaoke, the terminal transmits the accompaniment audio to the Bluetooth headset over the Bluetooth accompaniment channel, and the audio output part of the headset plays the received accompaniment audio. The user sings while listening to the accompaniment, and the audio capture part of the headset captures the voice audio of the user's singing. The headset then mixes the voice audio and plays the mixed result to the user. Meanwhile, the headset can also transmit the voice audio to the terminal over the Bluetooth voice channel; on receiving the voice audio, the terminal can mix it with the local accompaniment audio and store the resulting song audio locally or upload it to network storage. However, owing to the characteristics of the accompaniment channel itself, the accompaniment audio is considerably delayed in transit from the terminal to the Bluetooth headset. The recorded voice is therefore out of sync with the accompaniment, and the accompaniment audio and the voice audio in the song audio obtained by subsequent mixing are out of sync as well. The prior art generally adopts one of two solutions:

One is the estimated-delay synchronization method, in which a delay time is estimated in advance and the voice audio and the local accompaniment audio are then synchronized according to that estimated delay during mixing. However, the estimated delay time is inaccurate, so complete synchronization cannot be achieved.

The other is the timestamp synchronization method: a timestamp is added to each frame of voice audio data when it is captured at the Bluetooth headset, a timestamp is added to each frame of local accompaniment data at the terminal, and the voice audio and the local accompaniment audio are then synchronized according to the timestamps during mixing. However, the clock of the Bluetooth headset and the clock of the terminal cannot be completely synchronized, so the delay between the accompaniment and the voice calculated from the timestamps is inaccurate, and complete synchronization of voice and accompaniment cannot be guaranteed.

It can be seen that both existing synchronization schemes share the problem that the delay between the voice and the accompaniment cannot be accurately calculated, so the voice and the accompaniment cannot be completely synchronized.
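To give a sense of scale for the clock problem in the timestamp approach, the following back-of-the-envelope sketch computes how far two free-running clocks could drift apart over one song. The 20 ppm relative clock error, the 4-minute song length and the 44.1 kHz sampling rate are illustrative assumptions, not figures taken from this document.

```python
def drift_in_samples(ppm_error: float, duration_s: float, sample_rate_hz: int) -> float:
    """Misalignment, in samples, accumulated by a relative clock error."""
    drift_seconds = duration_s * ppm_error * 1e-6
    return drift_seconds * sample_rate_hz


# Assumed values: 20 ppm relative error, 240 s song, 44.1 kHz audio.
drift = drift_in_samples(ppm_error=20.0, duration_s=240.0, sample_rate_hz=44100)
print(f"{drift:.1f} samples of drift by the end of the song")
```

Under these assumed numbers the two timelines end up a couple of hundred samples apart, which is why a delay value derived from timestamps alone cannot be exact.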

Disclosure of Invention

In view of this, the embodiments of the present invention provide a method for synchronizing voice and accompaniment, a Bluetooth device, a terminal and a storage medium, to solve the problem that existing voice and accompaniment synchronization schemes cannot accurately calculate the delay between the voice and the accompaniment and therefore cannot achieve complete synchronization.

The technical solutions adopted by the embodiments of the invention to solve the above technical problem are as follows:

According to a first aspect of the embodiments of the present invention, there is provided a method for synchronizing voice and accompaniment, applied to a Bluetooth device, the method comprising:

receiving accompaniment audio sent by a terminal, wherein the accompaniment audio comprises characteristic audio and accompaniment music, and the characteristic audio is spliced at the head of the accompaniment music; decoding the accompaniment audio, identifying and filtering out the characteristic audio, and obtaining the position of the first sampling point of the accompaniment music in the accompaniment audio;

playing the accompaniment audio from the position of the first sampling point of the accompaniment music while synchronously triggering voice capture, and obtaining the voice audio sung by the user along with the accompaniment audio;

and compressing the voice audio and uploading it to the terminal, so that the terminal decompresses the voice audio and mixes the decompressed voice audio with the accompaniment music stored locally on the terminal to obtain mixed audio.

Wherein decoding the accompaniment audio, identifying and filtering out the characteristic audio, and obtaining the position of the first sampling point of the accompaniment music in the accompaniment audio comprises:

decoding the accompaniment audio, identifying and filtering out the characteristic audio according to prestored signal characteristics of the characteristic audio, and obtaining the position of the first sampling point of the accompaniment music in the accompaniment audio; or,

decoding the accompaniment audio, identifying and filtering out the characteristic audio according to signal characteristics of the characteristic audio obtained from the terminal, and obtaining the position of the first sampling point of the accompaniment music in the accompaniment audio.

Wherein the signal characteristics of the characteristic audio comprise length information and waveform characteristics of the characteristic audio, and the waveform characteristics comprise a waveform shape and a waveform frequency of the characteristic audio.

Wherein the step of playing the accompaniment audio from the position of the first sampling point of the accompaniment music, synchronously triggering voice capture, and obtaining the voice audio sung by the user along with the accompaniment audio further comprises:

mixing the voice audio with the accompaniment music decoded from the accompaniment audio, and playing the mixed result to the user.

According to a second aspect of the embodiments of the present invention, there is provided a method for synchronizing a voice and an accompaniment, applied to a terminal having a bluetooth communication function, the method comprising:

when a karaoke instruction input by a user is received, obtaining the accompaniment music of the song selected by the user, inserting characteristic audio at the head of the accompaniment music to generate accompaniment audio, and sending the accompaniment audio to a Bluetooth device; wherein the characteristic audio is used to enable the Bluetooth device to identify the position of the first sampling point of the accompaniment music in the accompaniment audio, play the accompaniment audio from the position of the first sampling point while synchronously triggering voice capture, and obtain the voice audio sung by the user along with the accompaniment audio;

and receiving the voice audio uploaded by the Bluetooth device, decoding the voice audio, and mixing the decoded voice audio with the locally stored accompaniment music to obtain mixed audio.

Wherein, after obtaining the accompaniment music of the song selected by the user when the karaoke instruction input by the user is received, inserting the characteristic audio at the head of the accompaniment music to generate the accompaniment audio, and sending the accompaniment audio to the Bluetooth device, the method further comprises:

sending the signal characteristics of the characteristic audio to the Bluetooth device, so that the Bluetooth device identifies the position of the first sampling point of the accompaniment music in the accompaniment audio according to the signal characteristics of the characteristic audio; wherein the signal characteristics of the characteristic audio comprise length information and waveform characteristics of the characteristic audio, and the waveform characteristics comprise the waveform shape and the waveform frequency of the characteristic audio.

According to a third aspect of the embodiments of the present invention, there is provided a bluetooth device, comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the computer program, when executed by the processor, implements the steps of the method for synchronizing a human voice and an accompaniment according to any one of the first aspect.

According to a fourth aspect of the embodiments of the present invention, there is provided a storage medium having stored thereon a computer program which, when executed by a processor, implements the steps of the vocal and accompaniment synchronization method according to any one of the above first aspects.

According to a fifth aspect of the embodiments of the present invention, there is provided a terminal, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein when the computer program is executed by the processor, the method for synchronizing human voice and accompaniment according to any one of the second aspect is implemented.

According to a sixth aspect of embodiments of the present invention, there is provided a storage medium having stored thereon a computer program which, when executed by a processor, implements the steps of the vocal and accompaniment synchronization method according to any one of the above second aspects.

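Existing voice and accompaniment synchronization schemes cannot accurately calculate the delay between the voice and the accompaniment and therefore cannot achieve complete synchronization. In contrast, in the method for synchronizing voice and accompaniment, the Bluetooth device, the terminal and the storage medium provided by the embodiments of the invention, characteristic audio is inserted at the head of the accompaniment music, and the Bluetooth device identifies the position of the first sampling point of the accompaniment music from the characteristic audio and triggers voice capture at that position, so that the captured voice audio stays accurately aligned with the accompaniment music. No subsequent comparison or delay calculation is needed; mixing aligned directly from the first sampling point achieves complete synchronization of voice and accompaniment.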

Drawings

To illustrate the technical solutions in the embodiments of the present invention more clearly, the drawings needed for the description of the embodiments or the prior art are briefly introduced below. Obviously, the drawings described below show only some embodiments of the present invention, and those skilled in the art can obtain other drawings from them without inventive effort.

Fig. 1 is an architecture diagram of a voice and accompaniment synchronization system according to an embodiment of the present invention;

Fig. 2 is a schematic flowchart of a method for synchronizing voice and accompaniment according to a first embodiment of the present invention;

Fig. 3 is a schematic diagram of the structure of the accompaniment audio in a preferred embodiment of the method for synchronizing voice and accompaniment provided by the embodiments of the present invention;

Fig. 4 is a schematic flowchart of a method for synchronizing voice and accompaniment according to a second embodiment of the present invention;

Fig. 5 is a schematic structural diagram of a Bluetooth device according to a third embodiment of the present invention;

Fig. 6 is a schematic structural diagram of a terminal according to a fifth embodiment of the present invention.

Detailed Description

To make the technical problems to be solved, the technical solutions and the beneficial effects of the present invention clearer, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein merely illustrate the invention and are not intended to limit it.

Fig. 1 is an architecture diagram of a voice and accompaniment synchronization system according to an embodiment of the present invention. Referring to Fig. 1, the voice and accompaniment synchronization system includes a terminal 100 and a Bluetooth device 200. The terminal 100 has a Bluetooth communication function and establishes a Bluetooth communication connection with the Bluetooth device 200. The terminal 100 includes, but is not limited to, a mobile phone, a computer or a tablet with a Bluetooth communication function. The Bluetooth device 200 has an audio capture device and an audio output device, and includes, but is not limited to, a Bluetooth headset.

Based on the above architecture diagram of the vocal and accompaniment synchronization system, the following embodiments of the present invention are proposed.

Example one

Fig. 2 is a flowchart illustrating an implementation of a method for synchronizing a vocal sound and an accompaniment according to an embodiment of the present invention, where the method is executed by a bluetooth device 200 in the system shown in fig. 1. Referring to fig. 2, the method for synchronizing human voice with accompaniment provided by the present embodiment includes:

Step S201, receiving accompaniment audio sent by the terminal 100, wherein the accompaniment audio comprises characteristic audio and accompaniment music, and the characteristic audio is spliced at the head of the accompaniment music; and decoding the accompaniment audio, identifying and filtering out the characteristic audio, and obtaining the position of the first sampling point of the accompaniment music in the accompaniment audio.

The Bluetooth device 200 receives, through the accompaniment channel, the accompaniment audio transmitted by the terminal 100. The accompaniment audio comprises two parts, characteristic audio and accompaniment music, and the characteristic audio is spliced seamlessly in front of the accompaniment music. For example, Fig. 3 is a schematic waveform diagram of the accompaniment audio in a preferred embodiment. The characteristic audio is a user-defined audio signal that serves only to let the Bluetooth device 200 identify the position of the first sampling point of the accompaniment music in the accompaniment audio; the characteristic audio itself is not played.

Wherein decoding the accompaniment audio, identifying and filtering out the characteristic audio, and obtaining the position of the first sampling point of the accompaniment music in the accompaniment audio comprises:

decoding the accompaniment audio, identifying and filtering out the characteristic audio spliced at the head of the accompaniment music according to prestored signal characteristics of the characteristic audio, and obtaining the position of the first sampling point of the accompaniment music in the accompaniment audio; or,

decoding the accompaniment audio, identifying and filtering out the characteristic audio spliced at the head of the accompaniment music according to signal characteristics of the characteristic audio obtained from the terminal 100, and obtaining the position of the first sampling point of the accompaniment music in the accompaniment audio.

In this embodiment, the Bluetooth device 200 or the terminal 100 stores the signal characteristics of the characteristic audio. The signal characteristics include, but are not limited to, length information and waveform characteristics of the characteristic audio, and the waveform characteristics include, but are not limited to, the waveform shape and the waveform frequency of the characteristic audio.
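A minimal sketch of how such signal characteristics might be represented and turned into samples. The class name `FeatureSignature`, its field names, the 16 kHz sampling rate and the choice of a sine tone are all assumptions made for illustration; the document specifies only that the characteristics include length information, waveform shape and waveform frequency.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class FeatureSignature:
    """Hypothetical container for the signal characteristics of the characteristic audio."""
    length_samples: int          # length information
    waveform_shape: str          # waveform shape, e.g. "sine" (assumed value)
    frequency_hz: float          # waveform frequency
    sample_rate_hz: int = 16000  # assumed sampling rate

    def render(self, amplitude: int = 8000) -> list:
        """Generate 16-bit PCM samples matching this signature."""
        if self.waveform_shape != "sine":
            raise ValueError("only the sine shape is sketched here")
        step = 2 * math.pi * self.frequency_hz / self.sample_rate_hz
        return [round(amplitude * math.sin(step * n)) for n in range(self.length_samples)]


signature = FeatureSignature(length_samples=64, waveform_shape="sine", frequency_hz=1000.0)
feature_audio = signature.render()
```

Either side could hold such a record: the Bluetooth device as a prestored constant, or the terminal, which would then pass it to the device.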

In this embodiment, the accompaniment audio received by the Bluetooth device 200 is compressed. Therefore, on receiving the accompaniment audio, the Bluetooth device 200 first decodes it to obtain the signal characteristics of the decoded accompaniment audio, and then compares them with the signal characteristics of the prestored characteristic audio, or with the signal characteristics of the characteristic audio obtained from the terminal 100 in real time, to identify the position of the first sampling point of the accompaniment music in the accompaniment audio.
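The comparison described above could be sketched as a template match over the decoded samples. The document does not specify a matching algorithm; normalized cross-correlation, the function name `locate_first_music_sample` and the 0.9 threshold are assumptions chosen for illustration.

```python
import math


def locate_first_music_sample(decoded, template, threshold=0.9):
    """Return the index of the first accompaniment-music sample, or None
    if the characteristic audio is not found in the decoded stream."""
    n = len(template)
    template_norm = math.sqrt(sum(t * t for t in template))
    best_score, best_start = 0.0, None
    for start in range(len(decoded) - n + 1):
        window = decoded[start:start + n]
        window_norm = math.sqrt(sum(w * w for w in window))
        if window_norm == 0 or template_norm == 0:
            continue  # silent window or empty template: nothing to compare
        score = sum(w * t for w, t in zip(window, template)) / (window_norm * template_norm)
        if score > best_score:
            best_score, best_start = score, start
    if best_start is None or best_score < threshold:
        return None
    return best_start + n  # the music begins right after the characteristic audio


# Made-up data: a 32-sample sine marker followed by five music samples.
template = [round(8000 * math.sin(math.pi * k / 4)) for k in range(32)]
music = [100, -200, 300, -400, 500]
decoded = template + music
first = locate_first_music_sample(decoded, template)
music_only = decoded[first:]  # characteristic audio filtered out
```

Because the characteristic audio is spliced at the head, a real implementation would probably restrict the search to the first few frames rather than scan the whole stream.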

Further, in this embodiment, before step S201, the method may further include:

capturing a karaoke instruction input by the user by voice, and uploading the karaoke instruction to the terminal 100, so that the terminal 100 searches for the corresponding accompaniment music according to the karaoke instruction and splices the characteristic audio onto the head of the accompaniment music to generate the accompaniment audio.

In this embodiment, the Bluetooth device 200 includes an audio capture device. The Bluetooth device 200 captures, through the audio capture device, a karaoke instruction that the user inputs by voice, and then uploads the karaoke instruction to the terminal 100 through the Bluetooth serial port. The karaoke instruction includes at least the name of the song to be sung. After receiving the karaoke instruction, the terminal 100 searches a local song library or a network song library for accompaniment music corresponding to that song name. When the search result contains only one piece of accompaniment music, the characteristic audio is spliced directly onto the head of that accompaniment music to generate the accompaniment audio; when the search result contains several pieces of accompaniment music, they are displayed for the user to choose from, and the characteristic audio is spliced onto the head of the accompaniment music the user selects to generate the accompaniment audio. Preferably, to improve the efficiency of matching the accompaniment music, the karaoke instruction may include, in addition to the song name, information such as the singer of the song. Of course, when the user has forgotten the song name, a lyric or singer search mode may be selected on the Bluetooth device 200; in that mode the karaoke instruction may contain only lyrics or a singer name, and the terminal 100 can likewise search for the corresponding accompaniment music according to the lyrics or singer name in the karaoke instruction.
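The search-and-select logic described above can be sketched as follows. The library layout (dictionaries with `song`, `singer` and `lyrics` keys), the function names and the sample entries are all hypothetical; the document does not define a data model for the song library.

```python
def search_library(library, instruction):
    """Match on whichever fields the karaoke instruction carries
    (song name, singer, lyrics); every supplied field must match."""
    return [
        track for track in library
        if all(value.lower() in track.get(key, "").lower()
               for key, value in instruction.items())
    ]


def choose_accompaniment(results, ask_user):
    """One result is used directly; several results are shown to the user."""
    if not results:
        return None
    if len(results) == 1:
        return results[0]
    return results[ask_user(results)]  # ask_user returns the chosen index


# Made-up library entries for illustration only.
library = [
    {"song": "Blue Sky", "singer": "A", "lyrics": "the sky is blue"},
    {"song": "Blue Sky", "singer": "B", "lyrics": "the sky is blue"},
    {"song": "Red River", "singer": "A", "lyrics": "down by the river"},
]
single = choose_accompaniment(search_library(library, {"song": "Red River"}), lambda r: 0)
picked = choose_accompaniment(search_library(library, {"song": "Blue Sky"}), lambda r: 1)
narrowed = search_library(library, {"song": "Blue Sky", "singer": "B"})
by_lyrics = search_library(library, {"lyrics": "river"})
```

Adding the singer to the instruction narrows two candidates to one, which is the matching-efficiency benefit the paragraph mentions; the lyrics-only query corresponds to the lyric search mode.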

Step S202, playing the accompaniment audio from the position of the first sampling point of the accompaniment music while synchronously triggering voice capture, and obtaining the voice audio sung by the user along with the accompaniment audio.

In this embodiment, after obtaining the position of the first sampling point of the accompaniment music in the accompaniment audio, the Bluetooth device 200 starts playing the accompaniment audio from that position and triggers voice capture at the same moment, so that the first sampling point of the captured voice is aligned with the first sampling point of the accompaniment music. This ensures that the captured voice audio and the accompaniment music are completely synchronized.
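The alignment property described above can be illustrated with a small simulation in which playback and capture advance on one shared sample counter. The callbacks `play_sample` and `read_mic_sample` are hypothetical stand-ins for the audio output and audio capture devices; how a real device couples the two paths is not described in this document.

```python
def play_and_capture(accompaniment_audio, first_music_index, read_mic_sample, play_sample):
    """Play from the first music sample and capture one voice sample per
    played sample, so voice sample i lines up with music sample i."""
    voice_audio = []
    for music_sample in accompaniment_audio[first_music_index:]:
        play_sample(music_sample)              # output one accompaniment-music sample
        voice_audio.append(read_mic_sample())  # capture one voice sample in the same tick
    return voice_audio


# Made-up data: three marker samples, then three music samples.
accompaniment_audio = [9, 9, 9, 10, 20, 30]
microphone = iter([1, 2, 3])
played = []
voice_audio = play_and_capture(
    accompaniment_audio, 3, lambda: next(microphone), played.append
)
```

Only the music samples reach the output, and the captured voice has exactly one sample per played music sample, so no offset needs to be computed later.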

Step S203, compressing the voice audio and uploading it to the terminal 100, so that the terminal 100 decompresses the voice audio and then mixes the decompressed voice audio with the accompaniment music stored locally on the terminal 100 to obtain mixed audio.

In this embodiment, after capturing voice audio that is completely synchronized with the accompaniment music, the Bluetooth device 200 compresses the voice audio and then uploads the compressed voice audio to the terminal 100 through the Bluetooth serial port. After receiving the compressed voice audio, the terminal 100 decompresses it. Because the decompressed voice audio is completely synchronized with the accompaniment music, the terminal 100 can directly mix the decompressed voice audio with the locally stored accompaniment music to obtain mixed audio in which voice and accompaniment are completely synchronized.
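A minimal sketch of the compress, upload, decompress and mix sequence. The document does not name the compression scheme, so `zlib` over 16-bit PCM is used here purely as a stand-in; the function names and the clip-to-16-bit mixing rule are likewise assumptions.

```python
import struct
import zlib


def compress_voice(samples):
    """Bluetooth-device side: pack 16-bit PCM and compress (stand-in codec)."""
    raw = struct.pack(f"<{len(samples)}h", *samples)
    return zlib.compress(raw)


def decompress_voice(payload):
    """Terminal side: undo compress_voice."""
    raw = zlib.decompress(payload)
    return list(struct.unpack(f"<{len(raw) // 2}h", raw))


def mix(voice, music):
    """Add voice sample i to music sample i, clipping to the 16-bit range.
    No delay compensation is applied because both start at sample 0."""
    mixed = []
    for i in range(max(len(voice), len(music))):
        v = voice[i] if i < len(voice) else 0
        m = music[i] if i < len(music) else 0
        mixed.append(max(-32768, min(32767, v + m)))
    return mixed


voice = [1000, -2000, 30000]
local_music = [500, 500, 5000, 7]
payload = compress_voice(voice)  # what would travel over the Bluetooth serial port
mixed = mix(decompress_voice(payload), local_music)
```

The mixing loop indexes both streams from zero, which is only valid because capture was triggered at the first sampling point of the accompaniment music.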

Preferably, in this embodiment, after step S203, the method may further include:

mixing the voice audio with the accompaniment music decoded from the accompaniment audio, and playing the mixed result to the user.

In this embodiment, after capturing the voice audio, the Bluetooth device 200 mixes the voice audio with the accompaniment music in the accompaniment audio and plays the mixed audio through the audio output device of the Bluetooth device 200. This lets the user hear the singing result promptly and further improves the user experience.

As can be seen from the above, in the method for synchronizing voice and accompaniment provided by this embodiment, characteristic audio is first inserted at the head of the accompaniment music, so that the Bluetooth device 200 identifies the position of the first sampling point of the accompaniment music in the accompaniment audio from the characteristic audio and then triggers voice capture at the position of that first sampling point. This ensures that the captured voice audio stays accurately aligned with the accompaniment. No subsequent comparison or delay calculation is needed; mixing aligned directly from the first sampling point achieves complete synchronization of voice and accompaniment.

Example two

Fig. 4 is a flowchart illustrating an implementation of a method for synchronizing vocal sounds and accompaniment according to a second embodiment of the present invention, where an execution main body of the method is the terminal 100 in the system shown in fig. 1. Referring to fig. 4, the method for synchronizing human voice with accompaniment provided by the present embodiment includes:

Step S401, when a karaoke instruction input by a user is received, obtaining the accompaniment music of the song selected by the user, inserting characteristic audio at the head of the accompaniment music to generate accompaniment audio, and sending the accompaniment audio to the Bluetooth device 200; the characteristic audio is used to enable the Bluetooth device 200 to identify the position of the first sampling point of the accompaniment music in the accompaniment audio, play the accompaniment audio from the position of the first sampling point while synchronously triggering voice capture, and obtain the voice audio sung by the user along with the accompaniment audio.

Wherein receiving the karaoke instruction input by the user comprises: receiving a voice karaoke control instruction issued by the user through the Bluetooth device 200; or receiving a karaoke instruction input by the user through a key on the terminal 100; or receiving a karaoke instruction input by the user through a touch screen of the terminal 100.

The karaoke instruction includes at least the name of the song to be sung. After receiving the karaoke instruction, the terminal 100 searches a local song library or a network song library for accompaniment music corresponding to that song name. When the search result contains only one piece of accompaniment music, the characteristic audio is spliced directly onto the head of that accompaniment music to generate the accompaniment audio; when the search result contains several pieces of accompaniment music, they are displayed for the user to choose from, and the characteristic audio is spliced onto the head of the accompaniment music the user selects to generate the accompaniment audio. Preferably, to improve the efficiency of matching the accompaniment music, the karaoke instruction may include, in addition to the song name, information such as the singer of the song. Of course, when the user has forgotten the song name, a lyric or singer search mode may be selected; in that mode the karaoke instruction may contain only lyrics or a singer name, and the terminal 100 can likewise search for the corresponding accompaniment music according to the lyrics or singer name in the karaoke instruction.

After generating the accompaniment audio, the terminal 100 transmits it to the Bluetooth device 200 through the accompaniment channel. The accompaniment audio comprises two parts, characteristic audio and accompaniment music, and the characteristic audio is spliced seamlessly in front of the accompaniment music. For example, Fig. 3 is a schematic waveform diagram of the accompaniment audio in a preferred embodiment. The characteristic audio is a user-defined audio signal that serves only to let the Bluetooth device 200 identify the position of the first sampling point of the accompaniment music in the accompaniment audio; the characteristic audio itself is not played.
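The two-part layout of the accompaniment audio can be sketched on the terminal side as a simple concatenation of sample lists. The function name `build_accompaniment_audio` and the sample values are made up for illustration, and encoding for transmission is left out.

```python
def build_accompaniment_audio(feature_audio, accompaniment_music):
    """Splice the characteristic audio directly in front of the music, with
    no gap, and report where the first music sample ends up."""
    accompaniment_audio = list(feature_audio) + list(accompaniment_music)
    first_music_index = len(feature_audio)
    return accompaniment_audio, first_music_index


feature_audio = [0, 8000, 0, -8000] * 4  # made-up 16-sample marker
accompaniment_music = [12, -7, 33, 5]    # made-up music samples
audio, first = build_accompaniment_audio(feature_audio, accompaniment_music)
```

The locally stored accompaniment music is left untouched, which is what allows the terminal to mix the returned voice audio against it from sample zero.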

The Bluetooth device 200 or the terminal 100 stores the signal characteristics of the characteristic audio. On receiving the accompaniment audio, the Bluetooth device 200 identifies and filters out the characteristic audio in the accompaniment audio according to the signal characteristics that it stores itself or obtains from the terminal 100, and thereby obtains the position of the first sampling point of the accompaniment music. Preferably, in a specific implementation in which the Bluetooth device 200 does not store the signal characteristics of the characteristic audio, the method for synchronizing voice and accompaniment further comprises:

sending the signal characteristics of the characteristic audio to the Bluetooth device 200, so that the Bluetooth device 200 identifies the position of the first sampling point of the accompaniment music in the accompaniment audio according to the signal characteristics of the characteristic audio; wherein the signal characteristics of the characteristic audio comprise length information and waveform characteristics of the characteristic audio, and the waveform characteristics comprise the waveform shape and the waveform frequency of the characteristic audio.
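One way the signal characteristics might be packaged for sending is sketched below. The JSON message layout, the field names and both function names are assumptions; the document does not define a wire format for this exchange.

```python
import json


def encode_signal_characteristics(length_samples, waveform_shape, frequency_hz):
    """Terminal side: serialize the characteristics into a byte payload."""
    message = {
        "length_samples": length_samples,  # length information
        "waveform": {"shape": waveform_shape, "frequency_hz": frequency_hz},
    }
    return json.dumps(message, separators=(",", ":")).encode("utf-8")


def decode_signal_characteristics(payload):
    """Bluetooth-device side: recover the characteristics from the payload."""
    message = json.loads(payload.decode("utf-8"))
    waveform = message["waveform"]
    return message["length_samples"], waveform["shape"], waveform["frequency_hz"]


payload = encode_signal_characteristics(64, "sine", 1000.0)
received = decode_signal_characteristics(payload)
```

A resource-constrained headset might prefer a fixed binary layout over JSON; the point here is only that length and waveform information travel together.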

Step S402, receiving the voice audio uploaded by the bluetooth device 200, decoding the voice audio, and performing audio mixing processing on the decoded voice audio and locally stored accompaniment music to obtain mixed audio.

In this embodiment, after capturing voice audio that is completely synchronized with the accompaniment music, the Bluetooth device 200 compresses the voice audio and uploads the compressed voice audio to the terminal 100 through the Bluetooth serial port. After receiving the voice audio, the terminal 100 decodes it to obtain the decoded voice audio. Because the decoded voice audio is completely synchronized with the accompaniment music, it can be mixed directly with the locally stored accompaniment music, and voice and accompaniment are completely synchronized in the resulting mixed audio.

Preferably, in this embodiment, after step S402, the method may further include:

storing the mixed audio locally; or uploading the mixed audio to a network for storage.

In this embodiment, as the mixed audio is stored locally or uploaded to the network for storage, the user can play back the music or share the music sung by the user with others conveniently, and the user experience can be further improved.
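Local storage of the mixed audio could look like the following sketch, which writes 16-bit mono PCM to a WAV file with Python's standard `wave` module. The WAV container, the 16 kHz sampling rate, the file name and the function name are assumptions; the document does not specify a storage format.

```python
import os
import struct
import tempfile
import wave


def save_mixed_audio(path, samples, sample_rate_hz=16000):
    """Write 16-bit mono PCM samples to a WAV file."""
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)   # mono
        wav.setsampwidth(2)   # 2 bytes per sample (16-bit)
        wav.setframerate(sample_rate_hz)
        wav.writeframes(struct.pack(f"<{len(samples)}h", *samples))


# Made-up samples written to a temporary directory for illustration.
path = os.path.join(tempfile.mkdtemp(), "mixed.wav")
save_mixed_audio(path, [0, 1000, -1000, 0])
with wave.open(path, "rb") as wav:
    frames_written = wav.getnframes()
```

The same file could then be uploaded to network storage for sharing.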

As can be seen from the above, in the method for synchronizing voice and accompaniment provided by this embodiment, characteristic audio is likewise inserted in front of the accompaniment music, so that the Bluetooth device 200 identifies the position of the first sampling point of the accompaniment music from the characteristic audio and triggers voice capture at that position. As a result, when the user sings karaoke, the first sampling point of the captured voice audio stays accurately aligned with the first sampling point of the accompaniment music. No subsequent comparison or delay calculation is needed; mixing aligned directly from the first sampling point achieves complete synchronization of voice and accompaniment.

Example three

Fig. 5 is a schematic structural diagram of a bluetooth device 200 according to a third embodiment of the present invention. Only the portions related to the present embodiment are shown for convenience of explanation.

Referring to fig. 5, the bluetooth apparatus 200 provided in this embodiment includes an audio acquisition device 201 and an audio output device 202, the bluetooth apparatus 200 further includes a memory 203, a processor 204, and a computer program 205 stored in the memory 203 and executable on the processor 204, the audio acquisition device 201 and the audio output device 202 are both electrically connected to the processor 204, and when the computer program 205 is executed by the processor 204, the steps of the human voice and accompaniment synchronization method according to the first embodiment are implemented.

The Bluetooth device 200 of this embodiment is based on the same concept as the voice and accompaniment synchronization method described in the first embodiment. Its specific implementation is detailed in the corresponding method embodiment, and the technical features of the method embodiment apply correspondingly to this device embodiment; they are not repeated here.

Example four

A fourth embodiment of the present invention provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the method for synchronizing a vocal sound and an accompaniment according to the first embodiment of the present invention is implemented.

The computer-readable storage medium of this embodiment is based on the same concept as the voice and accompaniment synchronization method described in the first embodiment. Its specific implementation is detailed in the corresponding method embodiment, and the technical features of the method embodiment apply correspondingly to this storage medium embodiment; they are not repeated here.

Example five

A fifth embodiment of the present invention provides a terminal 100. The terminal 100 includes a memory 101, a processor 102, and a computer program 103 stored in the memory 101 and executable on the processor 102. When the computer program 103 is executed by the processor 102, the steps of the voice and accompaniment synchronization method according to the second embodiment of the present invention are implemented.

The terminal 100 of this embodiment is based on the same concept as the voice and accompaniment synchronization method described in the second embodiment. Its specific implementation is detailed in the corresponding method embodiment, and the technical features of the method embodiment apply correspondingly to this device embodiment; they are not repeated here.

Example six

A sixth embodiment of the present invention provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the method for synchronizing a vocal sound and an accompaniment according to the second embodiment of the present invention is implemented.

The computer-readable storage medium of this embodiment is based on the same concept as the voice and accompaniment synchronization method described in the second embodiment. Its specific implementation is detailed in the corresponding method embodiment, and the technical features of the method embodiment apply correspondingly to this storage medium embodiment; they are not repeated here.

It will be understood by those of ordinary skill in the art that all or some of the steps of the methods, systems, functional modules/units in the devices disclosed above may be implemented as software, firmware, hardware, and suitable combinations thereof.

In a hardware implementation, the division between functional modules/units mentioned in the above description does not necessarily correspond to the division of physical components; for example, one physical component may have multiple functions, or one function or step may be performed by several physical components in cooperation. Some or all of the physical components may be implemented as software executed by a processor, such as a central processing unit, digital signal processor, or microprocessor, or as hardware, or as an integrated circuit, such as an application specific integrated circuit. Such software may be distributed on computer readable media, which may include computer storage media (or non-transitory media) and communication media (or transitory media). The term computer storage media includes volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information such as computer readable instructions, data structures, program modules or other data, as is well known to those of ordinary skill in the art. Computer storage media include, but are not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, Digital Versatile Disks (DVD) or other optical disk storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store the desired information and which can be accessed by a computer. In addition, communication media typically embody computer readable instructions, data structures, program modules or other data in a modulated data signal such as a carrier wave or other transport mechanism, and include any information delivery media, as is known to those skilled in the art.

The preferred embodiments of the present invention have been described above with reference to the accompanying drawings, and are not to be construed as limiting the scope of the invention. Any modifications, equivalents and improvements which may occur to those skilled in the art without departing from the scope and spirit of the present invention are intended to be within the scope of the claims.

Claims

1. A voice and accompaniment synchronization method is applied to Bluetooth equipment and is characterized by comprising the following steps:

2. The method for synchronizing vocal sounds with accompaniment according to claim 1, wherein said decoding the accompaniment audio, identifying and filtering said characteristic audio, and obtaining the position of the first sample point of the accompaniment music in said accompaniment audio comprises:

3. The human voice and accompaniment synchronization method according to claim 2, wherein the signal characteristics of said characteristic audio include length information and waveform characteristics of said characteristic audio, said waveform characteristics including waveform shape and waveform frequency of said characteristic audio.

4. The method for synchronizing vocal sounds with accompaniment according to claim 1, wherein, after playing the accompaniment audio from the position of the first sampling point of the accompaniment music, synchronously triggering voice collection, and obtaining the voice audio sung by the user along with the accompaniment audio, the method further comprises the following step:

performing sound mixing on the human voice audio and the accompaniment music decoded from the accompaniment audio sent by the terminal, and playing the mixed sound to the user.

5. A method for synchronizing voice and accompaniment is applied to a terminal with a Bluetooth communication function, and is characterized by comprising the following steps:

6. The method for synchronizing human voice with accompaniment according to claim 5, wherein, after acquiring the accompaniment music of the song selected by the user upon receiving a karaoke command input by the user, inserting the characteristic audio at the head of the accompaniment music to generate the accompaniment audio, and sending the accompaniment audio to the Bluetooth device, the method further comprises the following steps:

7. A Bluetooth device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the computer program, when executed by the processor, performing the steps of the method of synchronizing a vocal sound and an accompaniment according to any one of claims 1 to 4.

8. A storage medium having stored thereon a computer program which, when executed by a processor, carries out the steps of the vocal and accompaniment synchronization method according to any one of claims 1 to 4.

9. A terminal, characterized in that it comprises a memory, a processor and a computer program stored on said memory and executable on said processor, said computer program, when executed by said processor, implementing the steps of the vocal and accompaniment synchronization method according to any one of claims 5 to 6.

10. A storage medium having stored thereon a computer program which, when executed by a processor, carries out the steps of the vocal and accompaniment synchronization method according to any one of claims 5 to 6.
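The signal characteristics named in claim 3 (length information and waveform frequency) can be illustrated with a small sketch. It assumes the characteristic audio is a single-frequency tone of known length and checks a candidate block with the Goertzel recurrence; the function names, the 0.5 threshold and the choice of Goertzel are assumptions made for this example and are not taken from the claims. Waveform shape is not checked in this sketch.

```python
import math


def goertzel_power(samples, freq, rate):
    """Squared magnitude of the signal component at `freq` (Hz),
    computed with the Goertzel recurrence."""
    coeff = 2.0 * math.cos(2.0 * math.pi * freq / rate)
    s_prev = 0.0
    s_prev2 = 0.0
    for x in samples:
        s = x + coeff * s_prev - s_prev2
        s_prev2, s_prev = s_prev, s
    return s_prev ** 2 + s_prev2 ** 2 - coeff * s_prev * s_prev2


def matches_marker(block, freq, rate, expected_len, ratio=0.5):
    """True if `block` has the expected length and most of its energy
    sits at `freq`. The threshold `ratio` is an assumed tuning value."""
    if len(block) != expected_len:
        return False
    energy = sum(x * x for x in block)
    if energy == 0:
        return False
    # For a pure tone spanning whole cycles, the Goertzel power is
    # approximately energy * N / 2, so compare against a fraction of that.
    return goertzel_power(block, freq, rate) >= ratio * energy * len(block) / 2.0


if __name__ == "__main__":
    rate, n = 16000, 160
    tone = [int(round(10000 * math.sin(2 * math.pi * 1000 * i / rate)))
            for i in range(n)]
    print(matches_marker(tone, 1000, rate, n))
```

A frequency-domain check of this kind tolerates small amplitude changes introduced by decoding better than an exact sample comparison, which is why it is used here for illustration.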