CN103886881A - Method and system for expanding song selecting library - Google Patents

Method and system for expanding song selecting library Download PDF

Info

Publication number
CN103886881A
CN103886881A CN201410147686.7A CN201410147686A CN103886881A CN 103886881 A CN103886881 A CN 103886881A CN 201410147686 A CN201410147686 A CN 201410147686A CN 103886881 A CN103886881 A CN 103886881A
Authority
CN
China
Prior art keywords
audio
song
accompaniment
video
mobile terminal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410147686.7A
Other languages
Chinese (zh)
Other versions
CN103886881B (en
Inventor
陈节省
林剑宇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujian Kaimi Network Science & Technology Co Ltd
Original Assignee
Fujian Star Net eVideo Information Systems Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujian Star Net eVideo Information Systems Co Ltd filed Critical Fujian Star Net eVideo Information Systems Co Ltd
Priority to CN201410147686.7A priority Critical patent/CN103886881B/en
Publication of CN103886881A publication Critical patent/CN103886881A/en
Application granted granted Critical
Publication of CN103886881B publication Critical patent/CN103886881B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention provides a method for expanding a song selecting library. The method comprises the steps of obtaining song files from a mobile terminal or the Internet, extracting the accompaniment audios from the song files, enabling the accompaniment audios and videos or slide files to be synthesized into audio and video files, obtaining lyric subtitles synchronous with the audios from the mobile terminal or the Internet, embedding the lyric subtitles into the audio and video files, storing the audio and video files into a local song selecting library, synchronizing the mobile terminal and an audio and video terminal through a wireless transmission mode, playing the audio and video files, receiving sounds from the mobile terminal or the audio and video terminal, and processing and playing the sounds. The invention further provides a corresponding system for expanding the song selecting library. The method and system have the advantages that a large number of songs obtained by the Internet and the mobile terminal are fully utilized for expanding the local song selecting library, and the accompaniment audio and video files capable of fully meeting user requirements can be made.

Description

Method and the system thereof in a kind of extension point song storehouse
Technical field
The present invention relates to the audiovisual control and management field of public entertainment or private public place of entertainment, particularly method and the system thereof in a kind of extension point song storehouse.
Background technology
No matter place of public amusement is purchased and buys as the main source of the song in KTV or private home audiovisual system is to copyright at present, due to fund limitation, flow process complexity and mistiming, abundant not, the new song of the common song in song storehouse in these entertainment systems upgrades also not prompt enough, is often difficult to meet user's demand.
Summary of the invention
The technical solution used in the present invention is:
The method in extension point song storehouse, comprises step:
Obtain song files from mobile terminal or internet;
From song files, extract audio accompaniment;
Audio accompaniment and video or slide files are synthesized to audio video document.
Further, the method in described extension point song storehouse also comprises step: obtain the lyrics captions of synchronizeing with song files from mobile terminal or internet, and lyrics captions are embedded to described audio video document.
Further, in the method in described extension point song storehouse, described song files comprises song audio frequency or multimedia file; Described from song files, extract audio accompaniment comprise from song audio frequency, remove original singer or promote vocal accompaniment, from multimedia file, extract song audio frequency and from song audio frequency, remove original singer or promote vocal accompaniment.
Further, in the method in described extension point song storehouse, after audio accompaniment is synthesized to audio video document with video or slide files, also comprise that mirror image is synchronizeed to perform in a radio or TV programme step, specifically comprise:
By wireless transmission method synchronous mobile terminal and audiovisual terminals;
Playback of audio-visual file;
Receive sound from mobile terminal or audiovisual terminals;
Process and play described sound.
Further, in the method in described extension point song storehouse, described wireless transmission method comprises Airplay technology, DLNA technology, Miracast technology or WIDI technology.
Further, in the method in described extension point song storehouse, obtain song files from mobile terminal or internet and specifically comprise: obtain song files by wired connection or wireless connections mode from mobile terminal or internet.
Further, in the method in described extension point song storehouse, obtaining song files from mobile terminal or internet is audio video document or pure audio file;
In the time obtaining song files from mobile terminal or internet and be audio video document, audio accompaniment and video or slide files are synthesized to audio video document and comprise step: the video in audio accompaniment and former audio video document is synthesized to audio video document;
In the time obtaining song files from mobile terminal or internet and be pure audio file, audio accompaniment and video or slide files are synthesized to audio video document and comprise step: by web search and the suitable video file of song files, and the video in audio accompaniment and video file is synthesized to audio video document, or audio accompaniment and default video or magic lantern are synthesized to new audio video document.
The system in extension point song storehouse, comprises audiovisual terminals, and server or mobile terminal;
Audiovisual terminals comprises accompaniment extraction unit, audiovisual synthesis unit, the first processing unit, the first communication unit and storage unit;
Server comprises public server or privately owned server;
The first processing unit calls the first communication unit and obtains song files from mobile terminal or server, call accompaniment extraction unit and extract audio accompaniment from described song files, call audiovisual synthesis unit audio accompaniment and video or slide files are synthesized to audio video document.
Further, in the system in described extension point song storehouse, audiovisual terminals also comprises captions embedded unit;
The first processing unit calls the first communication unit and obtains the lyrics captions of synchronizeing with song files from mobile terminal or server, and calls captions embedded unit described lyrics captions are embedded in audio video document.
Further, in the system in described extension point song storehouse, described song files comprises song audio frequency or multimedia file;
Accompaniment extraction unit is removed original singer or is promoted vocal accompaniment from song audio frequency, or processing unit extracts song audio frequency from multimedia file and accompaniment extraction unit is removed original singer or promoted vocal accompaniment from song audio frequency.
Further, in the system in described extension point song storehouse, audiovisual terminals comprises sound processing unit, the first recording unit, plays driver element and broadcast unit; Mobile terminal comprises the second recording unit, the second processing unit and the second communication unit;
After audio accompaniment and video or slide files are synthesized audio video document by audiovisual synthesis unit, the first communication unit and the second communication unit between mobile terminal and audiovisual terminals by wireless transmission method audio-video synchronization file;
The first processing unit calls plays driver element in order to control broadcast unit playback of audio-visual file;
The first processing unit is transferred the first recording unit and is received acoustic information, or the second processing unit is transferred the second recording unit reception acoustic information and by the second communication unit, described acoustic information transferred to audiovisual terminals;
The first processing unit calls sound processing unit and processes described acoustic information, and calls broadcasting driver element control broadcast unit and play treated sound.
Further, in the system in described extension point song storehouse, described wireless transmission method comprises Airplay technology, DLNA technology, Miracast technology or WIDI technology.
Further, in the system in described extension point song storehouse, the first communication unit comprises wired communication module or wireless communication module; The first communication unit obtains song files from mobile terminal or internet and specifically comprises: obtain song files by wired connection or wireless connections mode from mobile terminal or internet.
Further, in the system in described extension point song storehouse, obtaining song files from mobile terminal or server is audio video document or pure audio file;
In the time obtaining song files from mobile terminal or server and be audio video document, the video in audio accompaniment and former audio video document is synthesized audio video document by audiovisual synthesis unit;
In the time obtaining song files from mobile terminal or server and be pure audio file, the first processing unit is by web search and the suitable video file of song files, and the video in audio accompaniment and video file is synthesized audio video document by audiovisual synthesis unit; Or audio accompaniment and default video or magic lantern are synthesized new audio video document by audiovisual synthesis unit.
By taking technique scheme, the beneficial effect that the present invention obtains is: make full use of internet and the powerful storage capacity of intelligent mobile terminal and the computing ability to multimedia file, produce the accompaniment audio video document for playing Karaoka of meeting consumers' demand to the full extent, fully improve the ability that expands Qu Ku in public or private multimedia entertainment system.
Brief description of the drawings
Fig. 1 is a kind of high-level schematic functional block diagram for the system in song storehouse of expanding in an embodiment of the present invention;
Fig. 2 is a kind of process flow diagram for the method in song storehouse of expanding in an embodiment of the present invention.
Label declaration:
11-audiovisual terminals
The 111-extraction unit of accompanying
112-audiovisual synthesis unit
113-the first processing unit
114-the first communication unit
115-storage unit
116-captions embedded unit
117-first unit of recording
118-sound processing unit
119-plays driver element
1110-broadcast unit
12-mobile terminal
121-second unit of recording
122-the second processing unit
123-the second communication unit
13-server
131-public server
The privately owned server of 132-
Embodiment
By describing technology contents of the present invention, structural attitude in detail, being realized object and effect, below in conjunction with embodiment and coordinate accompanying drawing to be explained in detail.
Referring to Fig. 1, is the high-level schematic functional block diagram of the system in a kind of extension point song storehouse in an embodiment of the present invention; In figure dotted line represent dotted line connect two functional modules between there is communication relation.
The system in the extension point song storehouse described in present embodiment comprises audiovisual terminals 11 and server 13.Wherein, audiovisual terminals 11 comprises the first processing unit 113, the first communication unit 114, storage unit 115, accompaniment extraction unit 111 and audiovisual synthesis unit 112.The first processing unit 113 obtains song files for calling the first communication unit 114 from server 13.In another embodiment, described system also comprises mobile terminal 12, and in this embodiment, the first processing unit 113 calls the first communication unit 114 and obtains song files from mobile terminal 12.
Described song files can be only that song audio frequency is that form is the file of the pure audio forms such as mp3, wav, wma, ape, flac, acc, apple lossless, can be also the multimedia file that simultaneously comprises audio-frequency information and video information.Described mobile terminal 12 can comprise the intelligent terminals such as smart mobile phone, panel computer, PDA, ipod, MP3 player, MP4 player.The source that in prior art is song in place of public amusement or the Kara OK songs storehouse of family expenses audio-video system is conventionally all more limited, if the some song storehouse in KTV may be mainly to depend on to copyright to be purchased and to buy song, the bent storehouse of family expenses audio-video system is more limited, can only lean on user oneself to buy the bent acquiescence Qu Ku that collects or depend on audio-video system of accompaniment, in so undoubtedly Qu Ku, the degree of enriching of song is quite short of, and is particularly difficult to meet the demand of the song that user issues recently to some.The technical scheme that the present invention proposes is exactly public server 131 or the privately owned server 132 from internet, or obtains song in user's mobile terminal 12, through one series of processes being made into into accompaniment audio video document.Chant music resource on internet is vast as the open sea, in all demands that meet user almost without any difficulty; On the mobile terminal 12 that user holds, conventionally there is the song that user likes best and the most often listens, also just means that user wants the own possibility of singing these songs in audiovisual entertaining system larger.So, the expansion of the extensive expansion of Qu Ku and targeted and purpose all becomes possibility.
Further, the first communication unit 114 also comprises wired communication module and/or wireless communication module.The first communication unit 114 obtains song files from mobile terminal 12 or server 13 and specifically comprises by wired connection or wireless connections mode and obtain song files from mobile terminal 12 or Internet Server 13.Wireless connections mode wherein can comprise the wireless communication techniques such as GSM, LTE, 3G, 4G, WLAN, can certainly be any other wireless telecommunications new technology that can reach equally communication object occurring along with development in science and technology.
Received after song files by the first communication unit 114 in audiovisual terminals 11, the first processing unit 113 calls accompaniment extraction unit 111 and extract audio accompaniment from described song files.In the time that described song files is song audio frequency, accompaniment extraction unit 111 directly extracts audio accompaniment from song audio frequency.When described song files is while comprising the multimedia file of Voice & Video simultaneously, first the first processing unit 113 extracts song audio frequency from described multimedia file, and then calls accompaniment extraction unit 111 and extract audio accompaniment from described song audio frequency.Because obviously, the song that the overwhelming majority can be easily obtained from server 13 or mobile terminal 12 all has original singer, and the song audio frequency that has original singer is the demand that is difficult to meet Karaoke user, must do on its basis further processing, audio accompaniment extracted.The extraction of the audio accompaniment that accompaniment extraction unit 111 does specifically comprises removes original singer and promotes vocal accompaniment from song audio frequency.What removal original singer's technology was mainly utilized is the feature such as position distribution feature, the frequecy characteristic of original singer voice of original singer's voice in sound field, after catching and having determined these features, can to a great extent original singer's voice be separated and be removed from former bent audio frequency, reach the desirable effect that is suitable for karaoke accompaniment.In addition, in extracting, accompaniment sometimes also needs the processing of accompanying and promoting, likely because karaoke person is greater than singer to the demand of vocal accompaniment, also likely because accompaniment signal is also had to certain loss in the process of removing original singer's voice, so the extraction unit 111 of accompanying is where necessary also by the operation of accompanying and promoting.
After accompaniment extraction unit 111 has completed accompaniment extraction, audiovisual terminals 11 has been obtained desirable audio accompaniment, next the first processing unit 113 calls audiovisual synthesis unit 112 audio accompaniment and image video or image/video is synthesized to audio video document, and is stored in the some song storehouse of storage unit 115 or is stored in server 13 and with the form storage of online Streaming Media or use.
Because in this amusement process of sing karaoke, if only have audio frequency played, and on display screen without any image or video, will be very uninteresting and be short of recreational experience to user, also greatly weakened multimedia entertainment system superiority originally.So it is very necessary mixing video image for audio accompaniment, that is to say and need to synthesize a new accompaniment audio video document, audio-frequency information is wherein audio accompaniment but not original singer's song audio frequency.
When the original song files obtaining from mobile terminal 12 or server 13 is while comprising the multimedia file of video information, in new accompaniment audio video document, the source of video is the video information in this multimedia file; In the time that the original song files obtaining from mobile terminal 12 or server 13 is only song audio frequency, the MV(Music Video of the source of video this song that can be audiovisual terminals 11 obtain from mobile terminal 12 or server 13 in new accompaniment audio video document) or concert part or the corresponding vidclip of film primary sound of this song, the simple image video generating after its former attached audio-frequency information is peeled off.In the building-up process combining with audio accompaniment at video, also must, by the music progress reasonable butt-joint of audio accompaniment and song video so that the two coincide, avoid the mouth type situation different from the actual lyrics sentence that should occur in accompaniment music that occurs that in picture, singer sings.If cannot obtain MV or concert associated video to a song, also can directly call the video or the lantern slide that prestore and substitute.In certain embodiments, according to the classification difference in each district, call respectively different classes of video or lantern slide adaptive more to agree with the atmosphere in each district with it, ensure to play and result of use.For example adopting the acute paragraph of length is suitable, style is similar film and television, can also be even that scenic film paragraph, FLASH or the image/video that is made up of image are as lantern slide.
Further, audiovisual terminals 11 also comprises captions embedded unit 116; When the original song files obtaining from mobile terminal 12 or server 13 is, while comprising that the multimedia file of Audio and Video and this video have comprised caption information, not need to carry out captions embedding operation again; And in the time not comprising caption information in the video of original multimedia file or be song audio frequency from the original song files that mobile terminal 12 or server 13 obtain, need to carry out captions embedding operation, described captions are to call the first communication unit 114 from mobile terminal 12 or server 13 lyrics captions of synchronizeing with song files that obtain by the first processing unit 113.Because in karaoke process, user is difficult to ensure the lyrics of a first song are remembered tally in every detail, is many times the prompting that needs lyrics captions, is also necessary so embed captions in audio video document.
Obtain after captions, captions embedded unit 116 is embedded into described lyrics captions in audio video document.Certainly,, in the time embedding, also must ensure that the opportunity that every lyrics occur on video is all consistent with the opportunity that it occurs in former song, the accuracy of guarantee to the suggesting effect that user rose like this.Complete the audio video document of captions after embedding also by the some song storehouse being stored in storage unit 115 or be stored in server 13 and with form storage or the use of online Streaming Media.
Further, audiovisual terminals 11 also comprises sound processing unit 118, the first recording unit 117, plays driver element 119 and broadcast unit 1110.Mobile terminal 12 also comprises the second recording unit 121, the second processing unit 122 and the second communication unit 123.Complete after the making of audio video document, in the actual performance process of user, between mobile terminal 12 and audiovisual terminals 11, complete the mirror image of audio video document and sound by the information interaction of the first communication unit 114 and the second communication unit 123 synchronous; The first processing unit 113 call play driver element 119 in order to control broadcast unit 1110 on screen and sound equipment in playback of audio-visual file.When user starts to sing, the first processing unit 113 is transferred the first recording unit 117 and is received acoustic information, or the second processing unit 122 is transferred the second recording unit 121 and received acoustic information and by the second communication unit 123, described acoustic information is synchronized to audiovisual terminals 11.That is to say, user's Speech input can complete by first in audiovisual terminals 11 recording unit 117 as microphone, also can complete as microphone by the unit 121 of recording of second on mobile terminal 12.Particularly audiovisual terminals 11 number of microphone are limited, user has many people to participate under the demand of chorus simultaneously, and the microphone that utilizes every mobile terminal 12 all to have is the good solution to this problem.Subsequently, the first processing unit 113 calls sound processing unit 118 acoustic information receiving is processed, and call and play driver element 119 and control broadcast unit 1110 and play treated sound, user's song can be played back by sound equipment with together with sound in audio video document.
In present embodiment, mobile terminal 12 and audiovisual terminals 11 complete mirror image synchronously the Radio Transmission Technology based between the first communication unit 114 and the second communication unit 123 realize, described Radio Transmission Technology can comprise the main flow wireless data transmission technologys such as Airplay technology, DLNA technology, Miracast technology, WIDI technology, also can comprise other Radio Transmission Technologys.In fact, the mirror image of mobile terminal 12 and audiovisual terminals 11 synchronously can certainly be realized by wired connection, certainly wired connection will be short of than wireless connections in comfort level to some extent due to the existence of data line, while realization, the number of connectivity port is also had to requirement.
Referring to Fig. 2, is a kind of process flow diagram for the method in song storehouse of expanding in an embodiment of the present invention.Described method comprises step:
S20, obtain song files from mobile terminal or internet.
Described song files can be only that song audio frequency is that form is the file of the pure audio forms such as mp3, wav, wma, ape, flac, acc, apple lossless, can be also the multimedia file that simultaneously comprises audio-frequency information and video information.The source that in prior art is song in place of public amusement or the Kara OK songs storehouse of family expenses audio-video system is conventionally all more limited, if the some song storehouse in KTV may be mainly to depend on to copyright to be purchased and to buy song, the bent storehouse of family expenses audio-video system is more limited, can only lean on user oneself to buy the bent acquiescence Qu Ku that collects or depend on audio-video system of accompaniment, in so undoubtedly Qu Ku, the degree of enriching of song is quite short of, and is particularly difficult to meet the demand of the song that user issues recently to some.The technical scheme that the present invention proposes is exactly to obtain song from Internet Server 13 or user's mobile terminal 12, through one series of processes being made into into accompaniment audio video document.Chant music resource on internet is vast as the open sea, in all demands that meet user almost without any difficulty; On the mobile terminal 12 that user holds, conventionally there is the song that user likes best and the most often listens, also just means that user wants the own possibility of singing these songs in audiovisual entertaining system larger.So, the expansion of the extensive expansion of Qu Ku and targeted and purpose all becomes possibility.
Obtaining song files from mobile terminal 12 or server 13 specifically comprises by wired connection or wireless connections mode and obtains song files from mobile terminal 12 or Internet Server 13.Wireless connections mode wherein can comprise the wireless communication techniques such as GSM, LTE, 3G, 4G, WLAN, can certainly be any other wireless telecommunications new technology that can reach equally communication object occurring along with development in science and technology.
S21, from song files, extract audio accompaniment.
In certain embodiments, song files is the audiovisual multimedia file that comprises audio frequency, such as MV, film primary sound fragment etc., and in these embodiment, this step is isolated audio accompaniment track and original singer's voice track from the audio stream of audio video document; In other embodiment, song files is only pure audio file, and this step is isolated audio accompaniment and original singer's voice track from the audio stream of audio file.Concrete detachment process is as described below:
The first communication unit 114 has received after song files by the first communication unit 114 in audiovisual terminals 11, and the first processing unit 113 calls accompaniment extraction unit 111 and extract audio accompaniment from described song files.In the time that described song files is song audio frequency, accompaniment extraction unit 111 directly extracts audio accompaniment from song audio frequency.When described song files is while comprising the multimedia file of Voice & Video simultaneously, first the first processing unit 113 extracts song audio frequency from described multimedia file, and then calls accompaniment extraction unit 111 and extract audio accompaniment from described song audio frequency.Because obviously, the song that the overwhelming majority can be easily obtained from server 13 or mobile terminal 12 all has original singer, and the song audio frequency that has original singer is the demand that is difficult to meet Karaoke user, must do on its basis further processing, audio accompaniment extracted.The extraction of the audio accompaniment that accompaniment extraction unit 111 does specifically comprises removes original singer and promotes vocal accompaniment from song audio frequency.What removal original singer's technology was mainly utilized is the feature such as position distribution feature, the frequecy characteristic of original singer voice of original singer's voice in sound field, after catching and having determined these features, can to a great extent original singer's voice be separated and be removed from former bent audio frequency, reach the desirable effect that is suitable for karaoke accompaniment.In addition, in extracting, accompaniment sometimes also needs the processing of accompanying and promoting, likely because karaoke person is greater than singer to the demand of vocal accompaniment, also likely because accompaniment signal is also had to certain loss in the process of removing original singer's voice, so the extraction unit 111 of accompanying is where necessary also by the operation of accompanying and promoting.
The audio accompaniment extracting can form a new track; The original singer's voice separating can form another new track.In certain embodiments, the original singer's voice separating is abandoned, and only retains the track of audio accompaniment.In other embodiment, can retain this two tracks, for determining whether to play original singer's voice according to different requirements simultaneously.
S22, audio accompaniment and video or slide files are synthesized to audio video document.
After accompaniment extraction unit 111 has completed accompaniment extraction, audiovisual terminals 11 has been obtained desirable audio accompaniment, next the first processing unit 113 calls audiovisual synthesis unit 112 audio accompaniment and image video or image/video is synthesized to audio video document, and is stored in the some song storehouse of storage unit 115 or is stored in server 13 and with the form storage of online Streaming Media or use.
Because in this amusement process of sing karaoke, if only have audio frequency played, and on display screen without any image or video, will be very uninteresting and be short of recreational experience to user, also greatly weakened multimedia entertainment system superiority originally.So it is very necessary mixing video image for audio accompaniment, that is to say and need to synthesize a new accompaniment audio video document, audio-frequency information is wherein audio accompaniment but not original singer's song audio frequency.
When the original song files obtaining from mobile terminal 12 or server 13 is while comprising the multimedia file of video information, in new accompaniment audio video document, the source of video is the video information in this multimedia file; In this step, the video flowing of isolated audio accompaniment and original is synthetic; In the time that the original song files obtaining from mobile terminal 12 or server 13 is only song audio frequency, the MV(Music Video of the source of video this song that can be audiovisual terminals 11 obtain from mobile terminal 12 or server 13 in new accompaniment audio video document) or concert part or the corresponding vidclip of film primary sound of this song, the simple image video generating after its former attached audio-frequency information is peeled off.In the building-up process combining with audio accompaniment at video, also must, by the music progress reasonable butt-joint of audio accompaniment and song video so that the two coincide, avoid the mouth type situation different from the actual lyrics sentence that should occur in accompaniment music that occurs that in picture, singer sings.If cannot obtain MV or concert associated video to a song, also can directly call the video or the lantern slide that prestore and substitute.For example adopting the acute paragraph of length is suitable, style is similar film and television, can also be even that scenic film paragraph, FLASH or the image/video that is made up of image are as lantern slide.Adopting in the embodiment of lantern slide, can adopt independently slide files, also can adopt corresponding picture composition lantern slide in picture library to play.In certain embodiments, for different types of songs, in the picture library of different directories, choose picture, or in addition identification label of the picture in picture library, according to the difference of types of songs, select corresponding picture composition lantern slide with label.
Corresponding to the different embodiment that whether retain original singer's voice track, this step also can be taked different synthesis modes, if the original singer's voice separating is abandoned, only retains the track of audio accompaniment, and that only synthesizes new audio video document by audio accompaniment and video; If retain original singer's voice track and accompaniment track simultaneously, can original singer's voice track, accompaniment track and video be synthesized to new audio video document simultaneously, in these embodiment, while playing synthetic audio video document, can select according to demand only to play audio accompaniment track, or broadcast after will accompany in the time playing according to the instruction of receiving track and original singer's track audio mixing.
S23, obtain the lyrics captions of synchronizeing with song files from mobile terminal or internet, and lyrics captions are embedded to described audio video document.
In different embodiment, obtain lyrics captions and step that lyrics captions are embedded can be adjusted, for example in certain embodiments can be when audio stream and video flowing be synthesized to audio video document embedding lyrics captions.
S24, audio video document is stored in to local some song storehouse.
When the original song files obtaining from mobile terminal 12 or server 13 is, while comprising that the multimedia file of Audio and Video and this video have comprised caption information, not need to carry out captions embedding operation again; And in the time not comprising caption information in the video of original multimedia file or be song audio frequency from the original song files that mobile terminal 12 or server 13 obtain, need to carry out captions embedding operation to audio video document, described captions are to call the first communication unit 114 from mobile terminal 12 or server 13 lyrics captions of synchronizeing with song files that obtain by the first processing unit 113.Because in karaoke process, user is difficult to ensure the lyrics of a first song are remembered tally in every detail, is many times the prompting that needs lyrics captions, is also necessary so embed captions in audio video document.Obtain after captions, captions embedded unit 116 is embedded into described lyrics captions in audio video document.Certainly,, in the time embedding, also must ensure that the opportunity that every lyrics occur on video is all consistent with the opportunity that it occurs in former song, the accuracy of guarantee to the suggesting effect that user rose like this.In the present embodiment, complete captions and embed audio video document afterwards also by the some song storehouse being stored in storage unit 115; In other embodiments, complete that the audio video document of captions after embedding can also be stored in server 13 and with form storage or the use of online Streaming Media.
In certain embodiments, can adopt the embodiment different from step S24, for example, audio video document need not be stored, and synthetic audio video document is transmitted and play in the mode of Streaming Media.
S25, by wireless transmission method synchronous mobile terminal and audiovisual terminals.In certain embodiments, after audio video document after synthetic is carried out synchronously in mobile terminal and audiovisual terminals, in audio video document stores synchronized in mobile terminal box audiovisual terminals, then play, and in further embodiments, do not need the synchronous of file entirety, only in the time playing, carry out the synchronous of video pictures or audio accompaniment.In addition in certain embodiments, can not need synchronous mobile terminal box audiovisual terminals, only utilize audiovisual terminals to carry out playback of audio-visual file, therebetween in some embodiment, mobile terminal is as one of Mike source, recording audio, and mobile phone recording is transferred to audiovisual terminals processes.
S26, playback of audio-visual file.
As described in step S22, the synthetic audio video document of step S22 only has accompaniment track, and while broadcasting so, audio-frequency unit is only play accompaniment.If the audio video document providing in certain embodiments has accompaniment track and original singer's voice track simultaneously, can, according to the difference of real needs in different embodiment, carry out different broadcasting schemes, do not play original singer's voice track as only play accompaniment track; Or in other embodiment, due to singer's being unfamiliar with song, may in playing accompaniment track, play original singer's voice track and sing tune with prompting, so just can adjust original singer's voice track and the intensity of sound of accompaniment track respectively according to user instruction, to meet different user volume requirement with original singer's voice for accompaniment.
S27, receive sound from mobile terminal or audiovisual terminals.
S28, processing are also play described sound.
Complete after the making of audio video document, in the actual performance process of user, between mobile terminal 12 and audiovisual terminals 11, complete the mirror image of audio video document and sound by the information interaction of the first communication unit 114 and the second communication unit 123 synchronous; The first processing unit 113 call play driver element 119 in order to control broadcast unit 1110 on screen and sound equipment in playback of audio-visual file.When user starts to sing, the first processing unit 113 is transferred the first recording unit 117 and is received acoustic information, or the second processing unit 122 is transferred the second recording unit 121 and received acoustic information and by the second communication unit 123, described acoustic information is synchronized to audiovisual terminals 11.That is to say, user's Speech input can complete by first in audiovisual terminals 11 recording unit 117 as microphone, also can complete as microphone by the unit 121 of recording of second on mobile terminal 12.Particularly audiovisual terminals 11 number of microphone are limited, user has many people to participate under the demand of chorus simultaneously, and the microphone that utilizes every mobile terminal 12 all to have is the good solution to this problem.Subsequently, the first processing unit 113 calls sound processing unit 118 acoustic information receiving is processed, and call and play driver element 119 and control broadcast unit 1110 and play treated sound, user's song can be played back by sound equipment with together with sound in audio video document.
In present embodiment, mobile terminal 12 and audiovisual terminals 11 complete mirror image synchronously the Radio Transmission Technology based between the first communication unit 114 and the second communication unit 123 realize, described Radio Transmission Technology can comprise the main flow wireless data transmission technologys such as Airplay technology, DLNA technology, Miracast technology, WIDI technology, also can comprise other Radio Transmission Technologys.In fact, the mirror image of mobile terminal 12 and audiovisual terminals 11 synchronously can certainly be realized by wired connection, certainly wired connection will be short of than wireless connections in comfort level to some extent due to the existence of data line, while realization, the number of connectivity port is also had to requirement.
The foregoing is only embodiments of the invention; not thereby limit the scope of the claims of the present invention; every equivalent structure or conversion of equivalent flow process that utilizes instructions of the present invention and accompanying drawing content to do; or be directly or indirectly used in other relevant technical fields, be all in like manner included in scope of patent protection of the present invention.

Claims (14)

1. the method in extension point song storehouse, is characterized in that, comprises step:
Obtain song files from mobile terminal or internet;
From song files, extract audio accompaniment;
Audio accompaniment and video or slide files are synthesized to audio video document.
2. the method in extension point song as claimed in claim 1 storehouse, is characterized in that, also comprises step: obtain the lyrics captions of synchronizeing with song files from mobile terminal or internet, and lyrics captions are embedded to described audio video document.
3. the method in extension point song as claimed in claim 1 or 2 storehouse, is characterized in that, described song files comprises song audio frequency or multimedia file; Described from song files, extract audio accompaniment comprise from song audio frequency, remove original singer or promote vocal accompaniment, from multimedia file, extract song audio frequency and from song audio frequency, remove original singer or promote vocal accompaniment.
4. the method in extension point song as claimed in claim 1 or 2 storehouse, is characterized in that, also comprises that mirror image is synchronizeed to perform in a radio or TV programme step after audio accompaniment is synthesized to audio video document with video or slide files, specifically comprises:
By wireless transmission method synchronous mobile terminal and audiovisual terminals;
Playback of audio-visual file;
Receive sound from mobile terminal or audiovisual terminals;
Process and play described sound.
5. the method in extension point song as claimed in claim 4 storehouse, is characterized in that, described wireless transmission method comprises Airplay technology, DLNA technology, Miracast technology or WIDI technology.
6. the method in extension point song as claimed in claim 1 or 2 storehouse, is characterized in that, obtains song files and specifically comprises: obtain song files by wired connection or wireless connections mode from mobile terminal or internet from mobile terminal or internet.
7. the method in extension point song as claimed in claim 1 or 2 storehouse, is characterized in that, obtaining song files from mobile terminal or internet is audio video document or pure audio file;
In the time obtaining song files from mobile terminal or internet and be audio video document, audio accompaniment and video or slide files are synthesized to audio video document and comprise step: the video in audio accompaniment and former audio video document is synthesized to audio video document;
In the time obtaining song files from mobile terminal or internet and be pure audio file, audio accompaniment and video or slide files are synthesized to audio video document and comprise step: by web search and the suitable video file of song files, and the video in audio accompaniment and video file is synthesized to audio video document, or audio accompaniment and default video or magic lantern are synthesized to new audio video document.
8. the system in extension point song storehouse, is characterized in that, comprises audiovisual terminals, and server or mobile terminal;
Audiovisual terminals comprises accompaniment extraction unit, audiovisual synthesis unit, the first processing unit, the first communication unit and storage unit;
Server comprises public server or privately owned server;
The first processing unit calls the first communication unit and obtains song files from mobile terminal or server, call accompaniment extraction unit and extract audio accompaniment from described song files, call audiovisual synthesis unit audio accompaniment and video or slide files are synthesized to audio video document.
9. the system in extension point song as claimed in claim 8 storehouse, is characterized in that, audiovisual terminals also comprises captions embedded unit;
The first processing unit calls the first communication unit and obtains the lyrics captions of synchronizeing with song files from mobile terminal or server, and calls captions embedded unit described lyrics captions are embedded in audio video document.
10. the system in extension point song storehouse as claimed in claim 8 or 9, is characterized in that, described song files comprises song audio frequency or multimedia file;
Accompaniment extraction unit is removed original singer or is promoted vocal accompaniment from song audio frequency, or processing unit extracts song audio frequency from multimedia file and accompaniment extraction unit is removed original singer or promoted vocal accompaniment from song audio frequency.
11. systems in extension point song storehouse as claimed in claim 8 or 9, is characterized in that, audiovisual terminals comprises sound processing unit, the first recording unit, plays driver element and broadcast unit; Mobile terminal comprises the second recording unit, the second processing unit and the second communication unit;
After audio accompaniment and video or slide files are synthesized audio video document by audiovisual synthesis unit, the first communication unit and the second communication unit between mobile terminal and audiovisual terminals by wireless transmission method audio-video synchronization file;
The first processing unit calls plays driver element in order to control broadcast unit playback of audio-visual file;
The first processing unit is transferred the first recording unit and is received acoustic information, or the second processing unit is transferred the second recording unit reception acoustic information and by the second communication unit, described acoustic information transferred to audiovisual terminals;
The first processing unit calls sound processing unit and processes described acoustic information, and calls broadcasting driver element control broadcast unit and play treated sound.
The system in 12. extension point song as claimed in claim 11 storehouses, is characterized in that: described wireless transmission method comprises Airplay technology, DLNA technology, Miracast technology or WIDI technology.
13. systems in extension point song storehouse as claimed in claim 8 or 9, is characterized in that: the first communication unit comprises wired communication module or wireless communication module; The first communication unit obtains song files from mobile terminal or internet and specifically comprises: obtain song files by wired connection or wireless connections mode from mobile terminal or internet.
14. systems in extension point song storehouse as claimed in claim 8 or 9, is characterized in that, obtaining song files from mobile terminal or server is audio video document or pure audio file;
In the time obtaining song files from mobile terminal or server and be audio video document, the video in audio accompaniment and former audio video document is synthesized audio video document by audiovisual synthesis unit;
In the time obtaining song files from mobile terminal or server and be pure audio file, the first processing unit is by web search and the suitable video file of song files, and the video in audio accompaniment and video file is synthesized audio video document by audiovisual synthesis unit; Or audio accompaniment and default video or magic lantern are synthesized new audio video document by audiovisual synthesis unit.
CN201410147686.7A 2014-04-14 2014-04-14 A kind of method and its system of extension point library Active CN103886881B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410147686.7A CN103886881B (en) 2014-04-14 2014-04-14 A kind of method and its system of extension point library

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410147686.7A CN103886881B (en) 2014-04-14 2014-04-14 A kind of method and its system of extension point library

Publications (2)

Publication Number Publication Date
CN103886881A true CN103886881A (en) 2014-06-25
CN103886881B CN103886881B (en) 2018-10-02

Family

ID=50955736

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410147686.7A Active CN103886881B (en) 2014-04-14 2014-04-14 A kind of method and its system of extension point library

Country Status (1)

Country Link
CN (1) CN103886881B (en)

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104485122A (en) * 2014-11-27 2015-04-01 广东欧珀移动通信有限公司 Communication information export method and device and terminal equipment
CN104683853A (en) * 2015-02-04 2015-06-03 广州酷狗计算机科技有限公司 Multimedia file acquisition device and terminal
CN104715773A (en) * 2015-03-30 2015-06-17 福建星网视易信息系统有限公司 Self-service song adding method and device used in digital audio-video place
CN104754049A (en) * 2015-03-30 2015-07-01 福建星网视易信息系统有限公司 Method and device for self-help song adding through cloud server
CN104882151A (en) * 2015-06-05 2015-09-02 福建星网视易信息系统有限公司 Method, device and system for displaying multimedia resources in song singing
CN105025307A (en) * 2014-12-09 2015-11-04 北京歌华有线数字媒体有限公司 Audio and video acquisition synthesis system based on linkage of cable television and intelligent mobile device
CN106488328A (en) * 2015-08-25 2017-03-08 尊博科技股份有限公司 MTV vocal accompaniment order programme
CN107580783A (en) * 2015-07-20 2018-01-12 谷歌有限责任公司 Audio content is synchronized to Voice & Video device
CN107750013A (en) * 2017-09-01 2018-03-02 北京雷石天地电子技术有限公司 MV making, player method and device applied to Karaoke
CN108039184A (en) * 2017-12-28 2018-05-15 腾讯音乐娱乐科技(深圳)有限公司 Lyrics adding method and device
CN108109609A (en) * 2017-11-21 2018-06-01 北京小唱科技有限公司 The method for recording and device of audio and video
CN108322830A (en) * 2017-01-16 2018-07-24 重庆特斯联智慧科技股份有限公司 Intelligent navigation audio and video control system and its method
CN110390925A (en) * 2019-08-02 2019-10-29 湖南国声声学科技股份有限公司深圳分公司 Voice and accompaniment synchronous method, terminal, bluetooth equipment and storage medium
WO2020034227A1 (en) * 2018-08-17 2020-02-20 华为技术有限公司 Multimedia content synchronization method and electronic device
CN113641329A (en) * 2021-08-10 2021-11-12 广州艾美网络科技有限公司 Sound effect configuration method and device, intelligent sound box, computer equipment and storage medium
CN113836344A (en) * 2021-09-30 2021-12-24 广州艾美网络科技有限公司 Personalized song file generation method and device and music singing equipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101609667A (en) * 2009-07-22 2009-12-23 福州瑞芯微电子有限公司 Realize the method for Kara OK function in the PMP player
CN101901595A (en) * 2010-05-05 2010-12-01 北京中星微电子有限公司 Method and system for generating animation according to audio music
CN103020173A (en) * 2012-11-27 2013-04-03 北京百度网讯科技有限公司 Video image information searching method and system for mobile terminal and mobile terminal

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20010077384A (en) * 2000-02-02 2001-08-17 윤종용 Method and apparatus for upgrading new song in video-song accompaniment equipment and method for supporting new song upgrade in management system
CN102231272A (en) * 2011-01-21 2011-11-02 辜进荣 Method and device for synthesizing network videos and audios

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101609667A (en) * 2009-07-22 2009-12-23 福州瑞芯微电子有限公司 Realize the method for Kara OK function in the PMP player
CN101901595A (en) * 2010-05-05 2010-12-01 北京中星微电子有限公司 Method and system for generating animation according to audio music
CN103020173A (en) * 2012-11-27 2013-04-03 北京百度网讯科技有限公司 Video image information searching method and system for mobile terminal and mobile terminal

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104485122A (en) * 2014-11-27 2015-04-01 广东欧珀移动通信有限公司 Communication information export method and device and terminal equipment
CN105025307A (en) * 2014-12-09 2015-11-04 北京歌华有线数字媒体有限公司 Audio and video acquisition synthesis system based on linkage of cable television and intelligent mobile device
CN105025307B (en) * 2014-12-09 2017-02-22 北京歌华有线数字媒体有限公司 Audio and video acquisition synthesis system based on linkage of cable television and intelligent mobile device
CN104683853A (en) * 2015-02-04 2015-06-03 广州酷狗计算机科技有限公司 Multimedia file acquisition device and terminal
CN104754049A (en) * 2015-03-30 2015-07-01 福建星网视易信息系统有限公司 Method and device for self-help song adding through cloud server
CN104715773A (en) * 2015-03-30 2015-06-17 福建星网视易信息系统有限公司 Self-service song adding method and device used in digital audio-video place
CN104715773B (en) * 2015-03-30 2018-06-15 福建凯米网络科技有限公司 A kind of self-service method and device for adding song in digital audio-video place
CN104882151A (en) * 2015-06-05 2015-09-02 福建星网视易信息系统有限公司 Method, device and system for displaying multimedia resources in song singing
CN104882151B (en) * 2015-06-05 2018-08-03 福建凯米网络科技有限公司 The method, apparatus and system of multimedia resource are shown in singing songs
CN107580783A (en) * 2015-07-20 2018-01-12 谷歌有限责任公司 Audio content is synchronized to Voice & Video device
CN106488328A (en) * 2015-08-25 2017-03-08 尊博科技股份有限公司 MTV vocal accompaniment order programme
CN108322830A (en) * 2017-01-16 2018-07-24 重庆特斯联智慧科技股份有限公司 Intelligent navigation audio and video control system and its method
CN107750013A (en) * 2017-09-01 2018-03-02 北京雷石天地电子技术有限公司 MV making, player method and device applied to Karaoke
CN108109609A (en) * 2017-11-21 2018-06-01 北京小唱科技有限公司 The method for recording and device of audio and video
CN108039184A (en) * 2017-12-28 2018-05-15 腾讯音乐娱乐科技(深圳)有限公司 Lyrics adding method and device
WO2020034227A1 (en) * 2018-08-17 2020-02-20 华为技术有限公司 Multimedia content synchronization method and electronic device
CN110390925A (en) * 2019-08-02 2019-10-29 湖南国声声学科技股份有限公司深圳分公司 Voice and accompaniment synchronous method, terminal, bluetooth equipment and storage medium
CN110390925B (en) * 2019-08-02 2021-08-10 湖南国声声学科技股份有限公司深圳分公司 Method for synchronizing voice and accompaniment, terminal, Bluetooth device and storage medium
CN113641329A (en) * 2021-08-10 2021-11-12 广州艾美网络科技有限公司 Sound effect configuration method and device, intelligent sound box, computer equipment and storage medium
CN113836344A (en) * 2021-09-30 2021-12-24 广州艾美网络科技有限公司 Personalized song file generation method and device and music singing equipment

Also Published As

Publication number Publication date
CN103886881B (en) 2018-10-02

Similar Documents

Publication Publication Date Title
CN103886881A (en) Method and system for expanding song selecting library
US10943574B2 (en) Non-linear media segment capture and edit platform
CN110692252B (en) Audio-visual collaboration method with delay management for wide area broadcast
CN101770772B (en) Embedded Internet kara OK entertainment device and method for controlling sound and images thereof
CN107027050B (en) Audio and video processing method and device for assisting live broadcast
US11178457B2 (en) Interactive music creation and playback method and system
US20080184870A1 (en) System, method, device, and computer program product providing for a multiple-lyric karaoke system
US20130301392A1 (en) Methods and apparatuses for communication of audio tokens
US9412390B1 (en) Automatic estimation of latency for synchronization of recordings in vocal capture applications
JP2006195385A (en) Device and program for music reproduction
US11146901B2 (en) Crowd-sourced device latency estimation for synchronization of recordings in vocal capture applications
CN102419998A (en) Voice frequency processing method and system
TW201627988A (en) Synchronous visual effect system and method for processing synchronous visual effect
US10284985B1 (en) Crowd-sourced device latency estimation for synchronization of recordings in vocal capture applications
CN101751967A (en) Multimedia file producing and playing method, multimedia file producing device and player
CN113039573A (en) Audio-visual collaboration system and method with seed/join mechanism
CN109327731A (en) A kind of real-time synthetic method of DIY video and system based on Karaoke
Hirabayashi et al. Sense of space: the audience participation music performance with high-frequency sound id.
KR20150018194A (en) Evaluation Methods and System for mimicking song
CN107393566A (en) The audio-frequency decoding method and device of a kind of Intelligent story device
CN104219556A (en) Use method of four-soundtrack karaoke identification playing system
WO2019051689A1 (en) Sound control method and apparatus for intelligent terminal
CN104754049A (en) Method and device for self-help song adding through cloud server
CN111345044B (en) Audiovisual effects system for enhancing a performance based on content of the performance captured
JP4853639B2 (en) Music storage device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C41 Transfer of patent application or patent right or utility model
TA01 Transfer of patent application right

Effective date of registration: 20151102

Address after: 350018 Fujian city of Fuzhou province Nanjiang gate town of Cangshan District West Coast Road No. 198 Fuzhou Strait International Convention and Exhibition Center basement East Office Center No. A-029 (FTA test area)

Applicant after: FUJIAN KAIMI NETWORK SCIENCE & TECHNOLOGY CO., LTD.

Address before: Cangshan District of Fuzhou City, Fujian province 350008 Jinshan Road No. 618 juyuanzhou Industrial Zone Ruijie Science Park building 20, four floor

Applicant before: Fujian Starnet e-Video Information System Co., Ltd.

GR01 Patent grant
GR01 Patent grant