CN110428798B - Method for synchronizing voice and accompaniment, Bluetooth device, terminal and storage medium - Google Patents

Method for synchronizing voice and accompaniment, Bluetooth device, terminal and storage medium Download PDF

Info

Publication number
CN110428798B
CN110428798B CN201910712728.XA CN201910712728A CN110428798B CN 110428798 B CN110428798 B CN 110428798B CN 201910712728 A CN201910712728 A CN 201910712728A CN 110428798 B CN110428798 B CN 110428798B
Authority
CN
China
Prior art keywords
audio
accompaniment
voice
characteristic
music
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910712728.XA
Other languages
Chinese (zh)
Other versions
CN110428798A (en
Inventor
夏波
李天边
詹昌寿
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hunan Voc Acoustic Technology Co ltd
Hunan Guosheng Acoustics Technology Co ltd Shenzhen Branch
Original Assignee
Hunan Voc Acoustic Technology Co ltd
Hunan Guosheng Acoustics Technology Co ltd Shenzhen Branch
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hunan Voc Acoustic Technology Co ltd, Hunan Guosheng Acoustics Technology Co ltd Shenzhen Branch filed Critical Hunan Voc Acoustic Technology Co ltd
Priority to CN201910712728.XA priority Critical patent/CN110428798B/en
Publication of CN110428798A publication Critical patent/CN110428798A/en
Application granted granted Critical
Publication of CN110428798B publication Critical patent/CN110428798B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H1/00Details of electrophonic musical instruments
    • G10H1/36Accompaniment arrangements
    • G10H1/361Recording/reproducing of accompaniment for use with an external source, e.g. karaoke systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W4/00Services specially adapted for wireless communication networks; Facilities therefor
    • H04W4/80Services using short range communication, e.g. near-field communication [NFC], radio-frequency identification [RFID] or low energy communication

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Reverberation, Karaoke And Other Acoustics (AREA)

Abstract

The embodiment of the invention discloses a voice and accompaniment synchronization method, Bluetooth equipment, a terminal and a storage medium, and relates to the technical field of audio processing. The method comprises the following steps: receiving accompaniment audio sent by a terminal, wherein the accompaniment audio comprises characteristic audio and accompaniment music, and the characteristic audio is spliced at the head position of the accompaniment music; decoding the accompaniment audio, identifying and filtering the characteristic audio, and acquiring the position of a first sampling point of the accompaniment music in the accompaniment audio; playing accompaniment audio from the position of a first sampling point of the accompaniment music, synchronously triggering voice acquisition, and acquiring voice audio singing by a user according to the accompaniment audio; and compressing the human voice audio and then uploading the compressed human voice audio to the terminal, decompressing the human voice audio by the terminal, and mixing the decompressed human voice audio and the accompaniment music locally stored in the terminal to obtain mixed audio. The embodiment of the invention can realize the complete synchronization of the voice and the accompaniment and improve the sound mixing effect.

Description

Method for synchronizing voice and accompaniment, Bluetooth device, terminal and storage medium
Technical Field
The embodiment of the invention relates to the technical field of audio processing, in particular to a voice and accompaniment synchronization method, Bluetooth equipment, a terminal and a storage medium.
Background
Along with the development of terminal technology, it has become a very common amusement mode to record K song through terminals such as cell-phones, and at present terminal K song generally adopts bluetooth headset as the equipment of broadcast accompaniment, collection voice.
When K sings, the accompaniment audio is transmitted to the Bluetooth earphone through the accompaniment channel of the Bluetooth by the terminal, the received accompaniment audio is played by the audio output part of the Bluetooth earphone, the user sings while listening to the accompaniment audio, the vocal audio of the user singing is collected by the audio collection part of the Bluetooth earphone, then the vocal audio is mixed by the Bluetooth earphone, the sound mixing effect is played for the user, meanwhile, the Bluetooth earphone can also transmit the vocal audio to the terminal through the vocal channel of the Bluetooth, and the terminal can mix the vocal audio and the local accompaniment audio when receiving the vocal audio, so that the song audio is stored to the local part or uploaded to the network storage. However, because the characteristics of accompaniment channel self, the accompaniment audio frequency is transmitted to bluetooth headset from the terminal, makes the accompaniment audio frequency delay more serious, leads to the recorded voice to be asynchronous with the accompaniment, can lead to in the song audio frequency that follow-up audio mixing obtained like this that the accompaniment audio frequency is asynchronous with the voice audio frequency. The solutions adopted in the prior art generally have two types:
one is a pre-estimated delay synchronization method, which pre-estimates a delay time and then synchronizes the audio of the human voice and the audio of the local accompaniment according to the pre-estimated delay time during the audio mixing. However, the synchronization method has the problem that the estimated delay time is inaccurate, so that complete synchronization cannot be achieved.
The other method is a time stamp synchronization method, wherein a time stamp is added to each frame of data of human voice audio when the human voice audio is collected at a Bluetooth headset end, the time stamp is added to each frame of data of local accompaniment at a terminal, and then the human voice audio and the local accompaniment audio are synchronously processed according to the time stamp during sound mixing. However, the clock of the bluetooth headset and the clock of the terminal cannot be completely synchronized, so that the delay value between the accompaniment and the voice calculated according to the timestamp is inaccurate, and the complete synchronization of the voice and the accompaniment cannot be ensured.
It can be seen that, the two existing voice and accompaniment synchronous processing schemes both have the problem that the delay time between the voice and the accompaniment cannot be accurately calculated, so that the voice and the accompaniment cannot be completely synchronized.
Disclosure of Invention
In view of the above, an embodiment of the present invention provides a method for synchronizing a voice and an accompaniment, a bluetooth device, a terminal and a storage medium, so as to solve the problem that the prior synchronization processing scheme for the voice and the accompaniment cannot accurately calculate the delay time between the voice and the accompaniment, so that complete synchronization of the voice and the accompaniment cannot be achieved.
The technical scheme adopted by the embodiment of the invention for solving the technical problems is as follows:
according to a first aspect of the embodiments of the present invention, there is provided a voice and accompaniment synchronization method applied to a bluetooth device, the voice and accompaniment synchronization method including:
receiving accompaniment audio sent by a terminal, wherein the accompaniment audio comprises characteristic audio and accompaniment music, and the characteristic audio is spliced at the head position of the accompaniment music; decoding the accompaniment audio, identifying and filtering the characteristic audio, and acquiring the position of a first sampling point of the accompaniment music in the accompaniment audio;
playing the accompaniment audio from the position of a first sampling point of the accompaniment music, synchronously triggering voice acquisition, and acquiring the voice audio singed by the user according to the accompaniment audio;
and uploading the compressed voice audio to the terminal, so that the terminal decompresses the voice audio and mixes the decompressed voice audio and the accompaniment music locally stored in the terminal to obtain mixed audio.
Wherein, it decodes to be right the accompaniment audio frequency, discerns and filters the characteristic audio frequency, acquires the position of the first sampling point of the accompaniment music in the accompaniment audio frequency includes:
decoding the accompaniment audio, identifying and filtering the characteristic audio according to the signal characteristics of the prestored characteristic audio, and acquiring the position of a first sampling point of the accompaniment music in the accompaniment audio; alternatively, the first and second electrodes may be,
and decoding the accompaniment audio, identifying and filtering the characteristic audio according to the signal characteristic of the characteristic audio acquired from the terminal, and acquiring the position of a first sampling point of the accompaniment music in the accompaniment audio.
Wherein the signal characteristics of the characteristic audio comprise length information and waveform characteristics of the characteristic audio, and the waveform characteristics comprise a waveform shape and a waveform frequency of the characteristic audio.
Wherein, begin to broadcast the audio frequency of accompaniment from the position of the first sampling point of the music of the accompaniment to trigger the voice to gather synchronously, obtain the voice audio frequency that the user sings according to the audio frequency of accompaniment still includes:
and carrying out sound mixing processing on the human voice audio and the accompaniment obtained after decoding from the accompaniment audio, and playing a sound mixing effect for the user.
According to a second aspect of the embodiments of the present invention, there is provided a method for synchronizing a voice and an accompaniment, applied to a terminal having a bluetooth communication function, the method comprising:
when a karaoke instruction input by a user is received, acquiring accompaniment music of a song selected by the user, inserting characteristic audio into the head of the accompaniment music to generate accompaniment audio, and sending the accompaniment audio to Bluetooth equipment; the characteristic audio is used for enabling the Bluetooth equipment to identify the position of a first sampling point of accompaniment music in the accompaniment audio, playing the accompaniment audio from the position of the first sampling point, synchronously triggering voice collection, and acquiring voice audio singing by a user according to the accompaniment audio;
and receiving the voice audio uploaded by the Bluetooth equipment, decoding the voice audio, and performing sound mixing processing on the decoded voice audio and locally stored accompaniment music to obtain mixed sound audio.
Wherein, when receiving the k song instruction of user input, acquire the accompaniment music of user selection song the head of accompaniment music inserts characteristic audio and generates the accompaniment audio, will still include after accompaniment audio sends to bluetooth equipment:
sending the signal characteristics of the characteristic audio to the Bluetooth equipment, so that the Bluetooth equipment identifies the position of a first sampling point of accompaniment music in the accompaniment audio according to the signal characteristics of the characteristic audio; wherein the signal characteristics of the characteristic audio comprise length information and waveform characteristics of the characteristic audio, and the waveform characteristics comprise a waveform shape and a waveform frequency of the characteristic audio.
According to a third aspect of the embodiments of the present invention, there is provided a bluetooth device, comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the computer program, when executed by the processor, implements the steps of the method for synchronizing a human voice and an accompaniment according to any one of the first aspect.
According to a fourth aspect of the embodiments of the present invention, there is provided a storage medium having stored thereon a computer program which, when executed by a processor, implements the steps of the vocal and accompaniment synchronization method according to any one of the above first aspects.
According to a fifth aspect of the embodiments of the present invention, there is provided a terminal, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein when the computer program is executed by the processor, the method for synchronizing human voice and accompaniment according to any one of the second aspect is implemented.
According to a sixth aspect of embodiments of the present invention, there is provided a storage medium having stored thereon a computer program which, when executed by a processor, implements the steps of the vocal and accompaniment synchronization method according to any one of the above second aspects.
Compared with the prior synchronous processing scheme of the voice and the accompaniment, the voice and the accompaniment synchronous processing method, the Bluetooth device, the terminal and the storage medium provided by the embodiment of the invention have the advantages that the delay time between the voice and the accompaniment can not be accurately calculated, and the complete synchronization of the voice and the accompaniment can not be realized.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings needed to be used in the embodiments or the prior art descriptions will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without inventive exercise.
FIG. 1 is an architecture diagram of a vocal and accompaniment synchronization system according to an embodiment of the present invention;
fig. 2 is a schematic flowchart illustrating an implementation of a method for synchronizing vocal sounds and accompaniment according to an embodiment of the present invention;
FIG. 3 is a schematic diagram of the structure of accompaniment audio in an embodiment of a method for synchronizing human voice and accompaniment provided by the embodiment of the present invention
Fig. 4 is a flowchart illustrating a specific implementation of a method for synchronizing vocal sounds and accompaniment according to a second embodiment of the present invention;
fig. 5 is a schematic structural diagram of a bluetooth device according to a third embodiment of the present invention;
fig. 6 is a schematic structural diagram of a terminal according to a fifth embodiment of the present invention.
Detailed Description
In order to make the technical problems, technical solutions and advantageous effects to be solved by the present invention clearer and clearer, the present invention is further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
Fig. 1 is an architecture diagram of a vocal and accompaniment synchronization system according to an embodiment of the present invention. Referring to fig. 1, the voice and accompaniment synchronization system includes a terminal 100 and a bluetooth device 200, wherein the terminal 100 has a bluetooth communication function, and a bluetooth communication connection is established with the bluetooth device 200. The terminal 100 includes, but is not limited to, a mobile phone, a computer, and a tablet with a bluetooth communication function. The bluetooth device 200 has an audio acquisition device and an audio output device, including but not limited to bluetooth headsets and the like.
Based on the above architecture diagram of the vocal and accompaniment synchronization system, the following embodiments of the present invention are proposed.
Example one
Fig. 2 is a flowchart illustrating an implementation of a method for synchronizing a vocal sound and an accompaniment according to an embodiment of the present invention, where the method is executed by a bluetooth device 200 in the system shown in fig. 1. Referring to fig. 2, the method for synchronizing human voice with accompaniment provided by the present embodiment includes:
step S201, receiving an accompaniment audio sent by the terminal 100, wherein the accompaniment audio comprises a characteristic audio and accompaniment music, and the characteristic audio is spliced at the head position of the accompaniment music; and decoding the accompaniment audio, identifying and filtering the characteristic audio, and acquiring the position of a first sampling point of the accompaniment music in the accompaniment audio.
Wherein, the bluetooth device 200 receives the accompaniment audio transmitted by the terminal 100 through the accompaniment channel. The accompaniment audio comprises two parts of characteristic audio and accompaniment music, and the characteristic audio is seamlessly spliced in front of the accompaniment music, such as: fig. 3 is a schematic diagram illustrating waveform of accompaniment audio in a preferred embodiment. The characteristic audio is an audio signal defined by a user, and is only used for enabling the bluetooth device 200 to identify the position of the first sampling point of the accompaniment music in the accompaniment audio according to the characteristic audio, and the accompaniment audio is not played.
Wherein, the pair the accompaniment audio is decoded, the characteristic audio is identified and filtered, and the position of the first sampling point of the accompaniment music in the accompaniment audio is obtained by the following steps:
decoding the accompaniment audio, identifying and filtering the characteristic audio spliced at the head position of the accompaniment music according to the signal characteristics of the prestored characteristic audio, and acquiring the position of a first sampling point of the accompaniment music in the accompaniment audio; alternatively, the first and second electrodes may be,
decoding the accompaniment audio, identifying and filtering the characteristic audio spliced at the head position of the accompaniment music according to the signal characteristic of the characteristic audio acquired from the terminal 100, and acquiring the position of a first sampling point of the accompaniment music in the accompaniment audio.
In this embodiment, the bluetooth device 200 or the terminal 100 stores therein signal characteristics of the characteristic audio, including but not limited to length information and waveform characteristics of the characteristic audio, including but not limited to waveform shape and waveform frequency of the characteristic audio.
In this embodiment, the accompaniment audio received by the bluetooth device 200 is compressed, and therefore when the accompaniment audio is received, the accompaniment audio needs to be decoded to obtain the signal characteristics of the decoded accompaniment audio, and then the signal characteristics of the decoded accompaniment audio is compared with the signal characteristics of the pre-stored characteristic audio or the signal characteristics of the characteristic audio obtained from the terminal 100 in real time to identify the position of the first sampling point of the accompaniment music in the accompaniment audio.
Further, in this embodiment, before step S201, the method may further include:
the method comprises the steps of collecting a K song instruction input by a user in a voice mode, uploading the K song instruction to the terminal 100, enabling the terminal 100 to search corresponding accompaniment music according to the K song instruction, splicing the characteristic audio to the head of the accompaniment music, and generating the accompaniment audio.
In this embodiment, the bluetooth device 200 includes an audio acquisition device, and acquires a karaoke instruction input by a user through a voice mode through the audio acquisition device, and then uploads the karaoke instruction to the terminal 100 through a bluetooth serial port. Wherein the Karaoke instruction at least comprises a singing song name. After receiving the K song instruction, the terminal 100 searches accompaniment music corresponding to the singing song name from a local song library or a network song library according to the singing song name in the K song instruction, and when only one piece of accompaniment music exists in a search result, the characteristic audio is directly spliced to the head position of the accompaniment music to generate accompaniment audio; when a plurality of pieces of accompaniment music exist in the search result, the plurality of pieces of accompaniment music are displayed for the user to select, and the feature audio is spliced at the head position of the accompaniment music selected by the user after the user selects, so that the accompaniment audio is generated. Preferably, in order to improve the matching efficiency of the accompaniment music, the karaoke instruction may further include information such as a song singer besides the name of the song to be sung. Of course, when the user forgets to sing the song title, the bluetooth device 200 may select the lyric or singer search mode, and in the lyric or singer search mode, the K song command may only include the lyric or singer name, and the terminal 100 may also search for the corresponding accompaniment music according to the lyric or singer name in the K song command.
Step S202, the accompaniment audio is played from the position of the first sampling point of the accompaniment music, voice collection is synchronously triggered, and the voice audio singing by the user according to the accompaniment audio is obtained.
In this embodiment, after acquiring the position of the first sampling point of the accompaniment music in the accompaniment audio, the bluetooth device 200 starts to play the accompaniment audio from the position of the first sampling point of the accompaniment music, and simultaneously triggers voice acquisition synchronously, so that the first sampling point of the voice acquisition is aligned with the first sampling point of the accompaniment music, thereby ensuring that the acquired voice audio and the accompaniment music are completely synchronous.
And step S203, uploading the compressed voice audio to the terminal 100, so that the terminal 100 decompresses the voice audio, and then mixing the decompressed voice audio and the accompaniment audio locally stored in the terminal 100 to obtain mixed audio.
In this embodiment, bluetooth equipment 200 is right after the collection obtains the voice audio frequency with accompaniment music complete synchronization the voice audio frequency compresses, then arrives the voice audio frequency after the compression through the bluetooth serial ports terminal 100, terminal 100 is right after receiving the voice audio frequency after the compression the voice audio frequency decompresses, because the voice audio frequency after decompressing is complete synchronization with accompaniment music, consequently terminal 100 can directly carry out the audio mixing to the voice audio frequency after decompressing and the accompaniment music of local storage, can obtain the audio mixing audio frequency of voice and accompaniment complete synchronization.
Preferably, in this embodiment, after step S203, the method may further include:
and carrying out sound mixing processing on the human voice audio and the accompaniment music obtained after decoding from the accompaniment audio, and playing a sound mixing effect for a user.
In this embodiment, bluetooth equipment 200 is after gathering the human voice audio frequency, and the accompaniment music in with human voice audio frequency and accompaniment audio frequency carries out the audio mixing and handles to audio frequency after audio mixing processing is played to audio output device through bluetooth equipment 200, can make the user in time learn the effect of singing like this, further promotes user experience.
Above can see, the voice and accompaniment synchronization method that this embodiment provided, because insert the characteristic audio at first at the head of accompaniment audio, make bluetooth equipment 200 according to the position of the first sampling point of accompaniment music in the characteristic audio discernment accompaniment audio, then trigger the voice collection in the position department of a sampling point of accompaniment, thereby can guarantee that the voice audio and the accompaniment that gather keep accurate alignment, follow-up need not to do actions such as comparison again, delay calculation, direct alignment from the first sampling point carries out the audio mixing, can realize the complete synchronization of voice and accompaniment.
Example two
Fig. 4 is a flowchart illustrating an implementation of a method for synchronizing vocal sounds and accompaniment according to a second embodiment of the present invention, where an execution main body of the method is the terminal 100 in the system shown in fig. 1. Referring to fig. 4, the method for synchronizing human voice with accompaniment provided by the present embodiment includes:
step S401, when a karaoke instruction input by a user is received, acquiring accompaniment music of a song selected by the user, inserting a characteristic audio into the head of the accompaniment music to generate an accompaniment audio, and sending the accompaniment audio to the Bluetooth equipment 200; the characteristic audio is used for enabling the bluetooth device 200 to identify the position of a first sampling point of the accompaniment music in the accompaniment audio, play the accompaniment audio from the position of the first sampling point, synchronously trigger voice collection, and acquire the voice audio singing by the user according to the accompaniment audio.
Wherein, the receiving of the karaoke instruction input by the user comprises: receiving a language karaoke control instruction issued by a user through the Bluetooth device 200; or, receiving a karaoke instruction input by a user through a key on the terminal 100; or, receiving a karaoke instruction input by a user through a touch screen of the terminal 100.
Wherein the Karaoke instruction at least comprises a singing song name. After receiving the K song instruction, the terminal 100 searches accompaniment music corresponding to the singing song name from a local song library or a network song library according to the singing song name in the K song instruction, and when only one piece of accompaniment music exists in a search result, the characteristic audio is directly spliced at the head position of the accompaniment music to generate accompaniment audio; when a plurality of pieces of accompaniment music exist in the search result, the plurality of pieces of accompaniment music are displayed for the user to select, and the feature audio is spliced at the head position of the accompaniment music selected by the user after the user selects, so that the accompaniment audio is generated. Preferably, in order to improve the matching efficiency of the accompaniment audio, the karaoke instruction may further include information such as a song singer besides the name of the song to be sung. Certainly, when the user forgets to sing the song name, the lyric or singer search mode may be selected, and in the lyric or singer search mode, the K song command may only include the lyric or the singer name, and the terminal 100 may also search for the corresponding accompaniment music according to the lyric or singer name in the K song command.
Wherein, the terminal 100 generates the accompaniment audio and then transmits the accompaniment audio to the bluetooth device 200 through an accompaniment channel. The accompaniment audio comprises two parts of characteristic audio and accompaniment music, and the characteristic audio is seamlessly spliced in front of the accompaniment music, such as: fig. 3 is a schematic diagram illustrating waveform of accompaniment audio in a preferred embodiment. The characteristic audio is an audio signal defined by a user, and is only used for enabling the bluetooth device 200 to identify the position of the first sampling point of the accompaniment music in the accompaniment audio according to the characteristic audio, and the accompaniment audio is not played.
The bluetooth device 200 or the terminal 100 stores the signal characteristics of the characteristic audio, and when receiving the accompaniment audio, the bluetooth device 200 identifies and filters the characteristic audio in the accompaniment audio according to the signal characteristics of the characteristic audio stored by itself or acquired from the terminal 100 to obtain the position of the first sampling point of the accompaniment music. Preferably, in a specific implementation example, the bluetooth device 200 does not store the signal characteristics of the characteristic audio, and the method for synchronizing the voice and the accompaniment further includes:
sending the signal characteristics of the characteristic audio to the bluetooth device 200, so that the bluetooth device 200 identifies the position of a first sampling point of accompaniment music in the accompaniment audio according to the signal characteristics of the characteristic audio; wherein the signal characteristics of the characteristic audio comprise length information and waveform characteristics of the characteristic audio, and the waveform characteristics comprise a waveform shape and a waveform frequency of the characteristic audio.
Step S402, receiving the voice audio uploaded by the bluetooth device 200, decoding the voice audio, and performing audio mixing processing on the decoded voice audio and locally stored accompaniment music to obtain mixed audio.
In this embodiment, bluetooth equipment 200 can be right after gathering the voice audio frequency with accompaniment music complete synchronization the voice audio frequency compresses, then arrives the voice audio frequency after the compression through the bluetooth serial ports terminal 100, terminal 100 is receiving behind the voice audio frequency, it is right the voice audio frequency is decoded, acquires the voice audio frequency after decoding, because the voice audio frequency after decoding is complete synchronization with accompaniment music, consequently can directly be right this moment the voice audio frequency carries out the audio mixing with the accompaniment music of local storage and handles, and voice and accompaniment are complete synchronization promptly in the audio mixing audio frequency that obtains like this.
Preferably, in this embodiment, after step S402, the method may further include:
storing the mixed audio locally; or uploading the mixed audio to a network for storage.
In this embodiment, as the mixed audio is stored locally or uploaded to the network for storage, the user can play back the music or share the music sung by the user with others conveniently, and the user experience can be further improved.
Above can see, the voice and accompaniment synchronization method that this embodiment provided, because insert the characteristic audio in the front at the accompaniment music equally, make bluetooth equipment 200 according to the position of the first sampling point of characteristic audio identification accompaniment music, and trigger the voice collection at the position department of the first sampling point of accompaniment music, thereby can make user k song, the first sampling point of the voice audio of gathering keeps accurate alignment with the first sampling point of accompaniment music, follow-up need not to compare again, action such as time delay calculation, directly align from the first sampling point and carry out the audio mixing, can realize the complete synchronization of voice and accompaniment promptly.
EXAMPLE III
Fig. 5 is a schematic structural diagram of a bluetooth device 200 according to a third embodiment of the present invention. Only the portions related to the present embodiment are shown for convenience of explanation.
Referring to fig. 5, the bluetooth apparatus 200 provided in this embodiment includes an audio acquisition device 201 and an audio output device 202, the bluetooth apparatus 200 further includes a memory 203, a processor 204, and a computer program 205 stored in the memory 203 and executable on the processor 204, the audio acquisition device 201 and the audio output device 202 are both electrically connected to the processor 204, and when the computer program 205 is executed by the processor 204, the steps of the human voice and accompaniment synchronization method according to the first embodiment are implemented.
The bluetooth device 200 of this embodiment and the method for synchronizing vocal sounds and accompaniment described in the first embodiment belong to the same concept, and specific implementation processes thereof are detailed in the corresponding method embodiments, and technical features in the method embodiments are correspondingly applicable in the device embodiments, which are not described herein again.
Example four
A fourth embodiment of the present invention provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the method for synchronizing a vocal sound and an accompaniment according to the first embodiment of the present invention is implemented.
The computer-readable storage medium of this embodiment and the method for synchronizing vocal sounds and accompaniment described in the first embodiment belong to the same concept, and specific implementation processes thereof are detailed in the corresponding method embodiments, and technical features in the method embodiments are correspondingly applicable in the present apparatus embodiment, which is not described herein again.
EXAMPLE five
Fifth embodiment of the present invention provides a terminal 100, where the terminal 100 includes a memory 101, a processor 102, and a computer program 103 stored in the memory 101 and capable of running on the processor 102, and when the computer program 103 is executed by the processor 102, the steps of the human voice and accompaniment synchronization method according to the first embodiment of the present invention are implemented.
The terminal 100 of this embodiment and the method for synchronizing vocal sounds and accompaniment described in the second embodiment belong to the same concept, and specific implementation processes thereof are detailed in the corresponding method embodiments, and technical features in the method embodiments are correspondingly applicable in the present device embodiment, which is not described herein again.
EXAMPLE six
A sixth embodiment of the present invention provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the method for synchronizing a vocal sound and an accompaniment according to the second embodiment of the present invention is implemented.
The computer-readable storage medium of this embodiment and the method for synchronizing vocal sounds and accompaniment described in the second embodiment belong to the same concept, and specific implementation processes thereof are detailed in the corresponding method embodiments, and technical features in the method embodiments are correspondingly applicable in the present apparatus embodiment, which is not described herein again.
It will be understood by those of ordinary skill in the art that all or some of the steps of the methods, systems, functional modules/units in the devices disclosed above may be implemented as software, firmware, hardware, and suitable combinations thereof.
In a hardware implementation, the division between functional modules/units mentioned in the above description does not necessarily correspond to the division of physical components; for example, one physical component may have multiple functions, or one function or step may be performed by several physical components in cooperation. Some or all of the physical components may be implemented as software executed by a processor, such as a central processing unit, digital signal processor, or microprocessor, or as hardware, or as an integrated circuit, such as an application specific integrated circuit. Such software may be distributed on computer readable media, which may include computer storage media (or non-transitory media) and communication media (or transitory media). The term computer storage media includes volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information such as computer readable instructions, data structures, program modules or other data, as is well known to those of ordinary skill in the art. Computer storage media includes, but is not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, Digital Versatile Disks (DVD) or other optical disk storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store the desired information and which can accessed by a computer. In addition, communication media typically embodies computer readable instructions, data structures, program modules or other data in a modulated data signal such as a carrier wave or other transport mechanism and includes any information delivery media as known to those skilled in the art.
The preferred embodiments of the present invention have been described above with reference to the accompanying drawings, and are not to be construed as limiting the scope of the invention. Any modifications, equivalents and improvements which may occur to those skilled in the art without departing from the scope and spirit of the present invention are intended to be within the scope of the claims.

Claims (10)

1. A voice and accompaniment synchronization method is applied to Bluetooth equipment and is characterized by comprising the following steps:
receiving accompaniment audio sent by a terminal, wherein the accompaniment audio comprises characteristic audio and accompaniment music, and the characteristic audio is spliced at the head position of the accompaniment music; decoding the accompaniment audio, identifying and filtering the characteristic audio, and acquiring the position of a first sampling point of the accompaniment music in the accompaniment audio;
playing the accompaniment audio from the position of a first sampling point of the accompaniment music, synchronously triggering voice acquisition, and acquiring the voice audio singed by the user according to the accompaniment audio;
and uploading the compressed voice audio to the terminal, so that the terminal decompresses the voice audio and mixes the decompressed voice audio and the accompaniment music locally stored in the terminal to obtain mixed audio.
2. The method for synchronizing vocal sounds with accompaniment according to claim 1, wherein said decoding the accompaniment audio, identifying and filtering said characteristic audio, and obtaining the position of the first sample point of the accompaniment music in said accompaniment audio comprises:
decoding the accompaniment audio, identifying and filtering the characteristic audio according to the signal characteristics of the prestored characteristic audio, and acquiring the position of a first sampling point of the accompaniment music in the accompaniment audio; alternatively, the first and second electrodes may be,
and decoding the accompaniment audio, identifying and filtering the characteristic audio according to the signal characteristic of the characteristic audio acquired from the terminal, and acquiring the position of a first sampling point of the accompaniment music in the accompaniment audio.
3. The human voice and accompaniment synchronization method according to claim 2, wherein the signal characteristics of said characteristic audio include length information and waveform characteristics of said characteristic audio, said waveform characteristics including waveform shape and waveform frequency of said characteristic audio.
4. The method for synchronizing vocal sounds with accompaniment according to claim 1, wherein said accompaniment audio is played from the position of the first sampling point of the accompaniment music and the vocal collection is triggered synchronously, and further comprising the following steps after the vocal audio singing by the user according to the accompaniment audio is obtained:
and carrying out sound mixing processing on the human voice audio and the accompaniment music obtained after decoding from the accompaniment audio sent by the terminal, and playing a sound mixing effect for the user.
5. A method for synchronizing voice and accompaniment is applied to a terminal with a Bluetooth communication function, and is characterized by comprising the following steps:
when a karaoke instruction input by a user is received, acquiring accompaniment music of a song selected by the user, inserting characteristic audio into the head of the accompaniment music to generate accompaniment audio, and sending the accompaniment audio to Bluetooth equipment; the characteristic audio is used for enabling the Bluetooth equipment to identify the position of a first sampling point of accompaniment music in the accompaniment audio, playing the accompaniment audio from the position of the first sampling point, synchronously triggering voice collection, and acquiring voice audio singing by a user according to the accompaniment audio;
and receiving the voice audio uploaded by the Bluetooth equipment, decoding the voice audio, and performing sound mixing processing on the decoded voice audio and locally stored accompaniment music to obtain mixed sound audio.
6. The method for synchronizing human voice with accompaniment according to claim 5, wherein the method for synchronizing human voice with accompaniment comprises the steps of acquiring accompaniment music of a song selected by a user when a karaoke command input by the user is received, inserting characteristic audio into the head of the accompaniment music to generate accompaniment audio, and sending the accompaniment audio to a Bluetooth device, and further comprising the following steps:
sending the signal characteristics of the characteristic audio to the Bluetooth equipment, so that the Bluetooth equipment identifies the position of a first sampling point of accompaniment music in the accompaniment audio according to the signal characteristics of the characteristic audio; wherein the signal characteristics of the characteristic audio comprise length information and waveform characteristics of the characteristic audio, and the waveform characteristics comprise a waveform shape and a waveform frequency of the characteristic audio.
7. A Bluetooth device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the computer program, when executed by the processor, performing the steps of the method of synchronizing a vocal sound and an accompaniment according to any one of claims 1 to 4.
8. A storage medium having stored thereon a computer program which, when executed by a processor, carries out the steps of the vocal and accompaniment synchronization method according to any one of claims 1 to 4.
9. A terminal, characterized in that it comprises a memory, a processor and a computer program stored on said memory and executable on said processor, said computer program, when executed by said processor, implementing the steps of the vocal and accompaniment synchronization method according to any one of claims 5 to 6.
10. A storage medium having stored thereon a computer program which, when executed by a processor, carries out the steps of the vocal and accompaniment synchronization method according to any one of claims 5 to 6.
CN201910712728.XA 2019-08-02 2019-08-02 Method for synchronizing voice and accompaniment, Bluetooth device, terminal and storage medium Active CN110428798B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910712728.XA CN110428798B (en) 2019-08-02 2019-08-02 Method for synchronizing voice and accompaniment, Bluetooth device, terminal and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910712728.XA CN110428798B (en) 2019-08-02 2019-08-02 Method for synchronizing voice and accompaniment, Bluetooth device, terminal and storage medium

Publications (2)

Publication Number Publication Date
CN110428798A CN110428798A (en) 2019-11-08
CN110428798B true CN110428798B (en) 2021-08-10

Family

ID=68413986

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910712728.XA Active CN110428798B (en) 2019-08-02 2019-08-02 Method for synchronizing voice and accompaniment, Bluetooth device, terminal and storage medium

Country Status (1)

Country Link
CN (1) CN110428798B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112825245B (en) * 2019-11-20 2023-04-28 北京声智科技有限公司 Real-time sound repairing method and device and electronic equipment
CN112216259B (en) * 2020-11-17 2024-03-08 北京达佳互联信息技术有限公司 Method and device for aligning vocal accompaniment
CN112910508B (en) * 2020-12-30 2022-08-02 重庆百瑞互联电子技术有限公司 Method, device and server for realizing stereo call on ESCO link

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5189237A (en) * 1989-12-18 1993-02-23 Casio Computer Co., Ltd. Apparatus and method for performing auto-playing in synchronism with reproduction of audio data
JP2003101958A (en) * 2001-09-20 2003-04-04 Toshiba Corp Method and device for synchronous reproduction
JP2005043709A (en) * 2003-07-23 2005-02-17 Casio Comput Co Ltd Musical sound generator and program for musical sound generating processing
CN101652810A (en) * 2006-09-29 2010-02-17 Lg电子株式会社 Apparatus for processing mix signal and method thereof
CN105788582A (en) * 2016-05-06 2016-07-20 深圳芯智汇科技有限公司 Portable karaoke sound box and karaoke method thereof
CN106251890A (en) * 2016-08-31 2016-12-21 广州酷狗计算机科技有限公司 A kind of methods, devices and systems of recording song audio frequency
CN106409282A (en) * 2016-08-31 2017-02-15 得理电子(上海)有限公司 Audio frequency synthesis system and method, electronic device therefor and cloud server therefor
CN108538302A (en) * 2018-03-16 2018-09-14 广州酷狗计算机科技有限公司 The method and apparatus of Composite tone
CN109151987A (en) * 2018-07-03 2019-01-04 珠海全志科技股份有限公司 A kind of method that more room audio groups are played simultaneously in WLAN

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140000440A1 (en) * 2003-01-07 2014-01-02 Alaine Georges Systems and methods for creating, modifying, interacting with and playing musical compositions
GB2539875B (en) * 2015-06-22 2017-09-20 Time Machine Capital Ltd Music Context System, Audio Track Structure and method of Real-Time Synchronization of Musical Content

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5189237A (en) * 1989-12-18 1993-02-23 Casio Computer Co., Ltd. Apparatus and method for performing auto-playing in synchronism with reproduction of audio data
JP2003101958A (en) * 2001-09-20 2003-04-04 Toshiba Corp Method and device for synchronous reproduction
JP2005043709A (en) * 2003-07-23 2005-02-17 Casio Comput Co Ltd Musical sound generator and program for musical sound generating processing
CN101652810A (en) * 2006-09-29 2010-02-17 Lg电子株式会社 Apparatus for processing mix signal and method thereof
CN105788582A (en) * 2016-05-06 2016-07-20 深圳芯智汇科技有限公司 Portable karaoke sound box and karaoke method thereof
CN106251890A (en) * 2016-08-31 2016-12-21 广州酷狗计算机科技有限公司 A kind of methods, devices and systems of recording song audio frequency
CN106409282A (en) * 2016-08-31 2017-02-15 得理电子(上海)有限公司 Audio frequency synthesis system and method, electronic device therefor and cloud server therefor
CN108538302A (en) * 2018-03-16 2018-09-14 广州酷狗计算机科技有限公司 The method and apparatus of Composite tone
CN109151987A (en) * 2018-07-03 2019-01-04 珠海全志科技股份有限公司 A kind of method that more room audio groups are played simultaneously in WLAN

Also Published As

Publication number Publication date
CN110428798A (en) 2019-11-08

Similar Documents

Publication Publication Date Title
CN110390925B (en) Method for synchronizing voice and accompaniment, terminal, Bluetooth device and storage medium
CN110428798B (en) Method for synchronizing voice and accompaniment, Bluetooth device, terminal and storage medium
EP3522151B1 (en) Method and device for processing dual-source audio data
CN102577360B (en) Synchronized playback of media players
WO2016188322A1 (en) Karaoke processing method, apparatus and system
CN110267081A (en) Method for stream processing, device, system, electronic equipment and storage medium is broadcast live
CN109257499B (en) Method and device for dynamically displaying lyrics
US20070260634A1 (en) Apparatus, system, method, and computer program product for synchronizing the presentation of media content
CN111524494A (en) Remote real-time chorus method and device and storage medium
WO2016188211A1 (en) Audio processing method, apparatus and system
JP2006195385A (en) Device and program for music reproduction
WO2013044872A1 (en) Method and system for audio processing
CN107785037A (en) Use the method, system and medium of audio time code synchronized multimedia content
WO2014067269A1 (en) Sent message playing method, system and related device
US11843921B2 (en) In-sync digital waveform comparison to determine pass/fail results of a device under test (DUT)
CN110808062B (en) Mixed voice separation method and device
EP3203468B1 (en) Acoustic system, communication device, and program
JP2013160890A (en) Information processing program, information processing apparatus, lyrics display method, and communication system
JP4327165B2 (en) Music playback device
KR101230746B1 (en) Method for generating synchronized image data for synchronous outputting music data and for play synchronous output
KR101946471B1 (en) Apparatus and method for synchronizing video and audio
CN115243087A (en) Audio and video co-shooting processing method and device, terminal equipment and storage medium
JP2013122561A (en) Information processing program, communication system, information processing device, and method for drawing lyric telop
CN112927666A (en) Audio processing method and device, electronic equipment and storage medium
JP7117228B2 (en) karaoke system, karaoke machine

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant