CN107809666A - Voice data merging method, device storage medium and processor - Google Patents

Voice data merging method, device, storage medium and processor

Publication number
CN107809666A
Authority
CN
China
Prior art keywords
client
data
role
dubbed
file
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711018182.5A
Other languages
Chinese (zh)
Inventor
费非
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual
Priority to CN201711018182.5A
Publication of CN107809666A
Legal status: Pending

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439 Processing of audio elementary streams
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00 Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/40 Support for services or applications
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00 Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60 Network streaming of media packets
    • H04L65/75 Media network packet handling
    • H04L65/764 Media network packet handling at the destination
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/435 Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream

Abstract

The invention provides a voice data merging method, a device, a storage medium and a processor. The method includes: a first client obtains first position information that corresponds, within the lines data of a file to be dubbed, to the first lines data of a first role in that file; the first client acquires first audio data according to the first position information; the first client obtains second audio data sent by a second client; and the first client merges the first audio data and the second audio data according to the first position information and second position information. The invention solves the problem of low efficiency when dubbing the roles in a file to be dubbed, and improves the efficiency of such dubbing.

Description

Voice data merging method, device, storage medium and processor
Technical field
The present invention relates to the field of communications, and in particular to a voice data merging method, a device, a storage medium and a processor.
Background
At present, if users want several people to dub a work together, the people playing the different roles must gather in one place, assign the roles manually, record them one by one, and finally synthesize a single audio file. Remote participants therefore cannot take part in the dubbing. Roles also have to be assigned manually, and each time one role has been recorded the microphone has to be handed over to record the next role, so the whole process is rather cumbersome.
No effective solution has yet been proposed for the problem in the related art of low efficiency when dubbing the roles in a file to be dubbed.
Summary of the invention
Embodiments of the present invention provide a voice data merging method, a device, a storage medium and a processor, to at least solve the problem in the related art of low efficiency when dubbing the roles in a file to be dubbed.
According to one embodiment of the present invention, a voice data merging method is provided, including: a first client obtains first position information that corresponds, within the lines data of a file to be dubbed, to the first lines data of a first role in the file to be dubbed, wherein a first user logged in to the first client with a first account dubs the first role, and the first position information indicates the start time and end time of the first lines data of the first role within the lines data of the file to be dubbed; the first client acquires first audio data according to the first position information; the first client obtains second audio data sent by a second client, wherein a second user logged in to the second client with a second account dubs a second role in the file to be dubbed, the second lines data of the second role corresponds to second position information within the lines data of the file to be dubbed, and the second position information indicates the start time and end time of the second lines data of the second role within the lines data of the file to be dubbed; and the first client merges the first audio data and the second audio data according to the first position information and the second position information.
Optionally, the first client obtaining the first position information that corresponds, within the lines data of the file to be dubbed, to the first lines data of the first role includes: the first client obtains the first role selected by the first account on the first client; the first client determines, according to a preset correspondence between roles and lines data, the line numbers at which the first lines data of the first role is located in the lines data of the file to be dubbed, wherein the preset correspondence between roles and lines data indicates the lines data corresponding to each role; and the first client obtains the first position information according to the line numbers.
Optionally, before the first client obtains the second audio data sent by the second client, the method further includes: the first client controls the second client to acquire the second audio data; and the first client controls the second client to send the second audio data.
Optionally, the first client controlling the second client to acquire the second audio data includes: the first client obtains the second role selected by the second account on the second client; the first client sends the second lines data of the second role, together with the second position information corresponding to the second lines data, to the second client; and the first client controls the second client to acquire the second audio data according to the second position information.
Optionally, the first client obtaining the second audio data sent by the second client includes: the first client obtains the second audio data that the second client has converted into the base64 encoding format and sent using the Extensible Messaging and Presence Protocol (XMPP).
Optionally, the first client merging the first audio data and the second audio data according to the first position information and the second position information includes: the first client restores the second audio data in the base64 encoding format to second audio data in an audio file format; and the first client merges the first audio data and the second audio data in the audio file format according to the first position information and the second position information, using the multimedia video processing tool FFmpeg.
According to another embodiment of the present invention, a voice data merging device is provided, applied to a first client and including: a first acquisition module, configured to obtain first position information that corresponds, within the lines data of a file to be dubbed, to the first lines data of a first role in the file to be dubbed, wherein a first user logged in to the first client with a first account dubs the first role, and the first position information indicates the start time and end time of the first lines data of the first role within the lines data of the file to be dubbed; a second acquisition module, configured to acquire first audio data according to the first position information; a third acquisition module, configured to obtain second audio data sent by a second client, wherein a second user logged in to the second client with a second account dubs a second role in the file to be dubbed, the second lines data of the second role corresponds to second position information within the lines data of the file to be dubbed, and the second position information indicates the start time and end time of the second lines data of the second role within the lines data of the file to be dubbed; and a merging module, configured to merge the first audio data and the second audio data according to the first position information and the second position information.
Optionally, the first acquisition module includes: a first acquisition unit, configured to obtain the first role selected by the first account on the first client; a determining unit, configured to determine, according to a preset correspondence between roles and lines data, the line numbers at which the first lines data of the first role is located in the lines data of the file to be dubbed, wherein the preset correspondence between roles and lines data indicates the lines data corresponding to each role; and a second acquisition unit, configured to obtain the first position information according to the line numbers.
Optionally, the device further includes: a first control module, configured to control the second client to acquire the second audio data; and a second control module, configured to control the second client to send the second audio data.
According to still another embodiment of the present invention, a storage medium is also provided. The storage medium includes a stored program, wherein the program, when run, performs the method described in any of the above.
According to still another embodiment of the present invention, a processor is also provided. The processor is configured to run a program, wherein the program, when run, performs the method described in any of the above.
With the present invention, the first client acquires the first audio data according to the first position information of the first lines data of the first role to be dubbed, obtains the second audio data sent by the second client, and merges the first audio data and the second audio data according to the first position information and the second position information corresponding to the second audio data. The people doing the dubbing therefore do not need to gather in one place, and remote dubbing becomes possible. The first client and the second client each record their own roles, and the resulting audio data is then merged into one complete audio file, which saves dubbing time and makes dubbing more convenient and faster. This solves the problem of low efficiency when dubbing the roles in a file to be dubbed and improves the efficiency of such dubbing.
Brief description of the drawings
The accompanying drawings described here provide a further understanding of the present invention and form a part of this application. The illustrative embodiments of the present invention and their descriptions serve to explain the invention and do not unduly limit it. In the drawings:
Fig. 1 is a hardware block diagram of a mobile terminal for a voice data merging method according to an embodiment of the present invention;
Fig. 2 is a flowchart of the voice data merging method according to an embodiment of the present invention;
Fig. 3 is a schematic diagram of a method of obtaining the first role according to an optional embodiment of the present invention;
Fig. 4 is a schematic diagram of dubbing control according to an optional embodiment of the present invention;
Fig. 5 is structural block diagram one of the voice data merging device according to an embodiment of the present invention;
Fig. 6 is structural block diagram two of the voice data merging device according to an embodiment of the present invention;
Fig. 7 is structural block diagram three of the voice data merging device according to an embodiment of the present invention.
Detailed description
The present invention is described in detail below with reference to the accompanying drawings and in conjunction with the embodiments. It should be noted that, where no conflict arises, the embodiments in this application and the features in the embodiments may be combined with one another.
It should be noted that the terms "first", "second" and the like in the description, the claims and the above drawings are used to distinguish similar objects, and are not used to describe a specific order or sequence.
Embodiment 1
The method embodiment provided in Embodiment 1 of this application can be executed on a mobile terminal, a computer terminal, or a similar computing device. Taking execution on a mobile terminal as an example, Fig. 1 is a hardware block diagram of a mobile terminal for a voice data merging method according to an embodiment of the present invention. As shown in Fig. 1, the mobile terminal 10 may include one or more processors 102 (only one is shown in the figure; the processor 102 may include, but is not limited to, a processing device such as a microprocessor (MCU) or a programmable logic device (FPGA)), a memory 104 for storing data, and a transmission device 106 for communication functions. A person of ordinary skill in the art will understand that the structure shown in Fig. 1 is only illustrative and does not limit the structure of the above electronic device. For example, the mobile terminal 10 may include more or fewer components than shown in Fig. 1, or have a configuration different from that shown in Fig. 1.
The memory 104 can be used to store the software programs and modules of application software, such as the program instructions/modules corresponding to the voice data merging method in the embodiments of the present invention. The processor 102 runs the software programs and modules stored in the memory 104 to perform various functional applications and data processing, that is, to implement the above method. The memory 104 may include high-speed random access memory, and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some examples, the memory 104 may further include memory located remotely from the processor 102, and such remote memory may be connected to the mobile terminal 10 through a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The transmission device 106 is used to receive or send data via a network. A specific example of such a network is a wireless network provided by the communication provider of the mobile terminal 10. In one example, the transmission device 106 includes a network adapter (Network Interface Controller, NIC), which can be connected to other network devices through a base station so as to communicate with the Internet. In another example, the transmission device 106 may be a radio frequency (RF) module, which is used to communicate with the Internet wirelessly.
This embodiment provides a voice data merging method. Fig. 2 is a flowchart of the voice data merging method according to an embodiment of the present invention. As shown in Fig. 2, the flow includes the following steps:
Step S202: the first client obtains first position information that corresponds, within the lines data of the file to be dubbed, to the first lines data of the first role in the file to be dubbed, wherein the first user logged in to the first client with the first account dubs the first role, and the first position information indicates the start time and end time of the first lines data of the first role within the lines data of the file to be dubbed;
Step S204: the first client acquires first audio data according to the first position information;
Step S206: the first client obtains second audio data sent by the second client, wherein the second user logged in to the second client with the second account dubs the second role in the file to be dubbed, the second lines data of the second role corresponds to second position information within the lines data of the file to be dubbed, and the second position information indicates the start time and end time of the second lines data of the second role within the lines data of the file to be dubbed;
Step S208: the first client merges the first audio data and the second audio data according to the first position information and the second position information.
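As a rough illustration of what steps S202 to S208 combine, the following Python sketch places the clips recorded by two clients on one timeline ordered by start time. The `Clip` type, its field names, and the sample times are assumptions made for illustration only; the patent does not specify any data structure.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Clip:
    """One recorded line; all field names are hypothetical."""
    role: str
    start_s: float   # start time taken from the position information, in seconds
    end_s: float     # end time taken from the position information, in seconds
    origin: str      # which client recorded the clip


def merge_timeline(first_clips, second_clips):
    """Combine the clips of both clients into one list ordered by start time."""
    return sorted([*first_clips, *second_clips], key=lambda clip: clip.start_s)


# Illustrative values only.
first_clips = [Clip("Mother", 12.0, 14.5, "first client")]
second_clips = [
    Clip("Pete", 3.0, 5.0, "second client"),
    Clip("Pete", 20.0, 22.0, "second client"),
]
timeline = merge_timeline(first_clips, second_clips)
for clip in timeline:
    print(f"{clip.start_s:6.2f}-{clip.end_s:6.2f}  {clip.role} ({clip.origin})")
```

The sketch only orders metadata; mixing the actual audio samples is left to a tool such as FFmpeg, which the description names for the merging step.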
Optionally, the above voice data merging method may be applied, without limitation, to scenarios in which a user dubs a file to be dubbed, for example a voice actor dubbing a film, or a student dubbing a course video.
Optionally, the above voice data merging method may be applied, without limitation, to a terminal device on which the above first client is installed. The terminal device may be, without limitation, a mobile phone, a notebook computer, a tablet computer, a desktop computer, a smart wearable device, a smart home appliance, and so on.
Optionally, in this embodiment, the file to be dubbed may include, without limitation: a film video file, a video course file, a video clip file, a song file, and so on.
Optionally, in this embodiment, the first position information may be used, without limitation, to indicate the start time and end time of the first lines data of the first role within the lines data of the file to be dubbed. For example, suppose the first lines data of the first role contains three lines, A, B and C, and the first position information indicates the start time T1 and end time T2 of each of them. The first position information may then indicate: line A, T1 = 4′35″25, T2 = 4′36″73; line B, T1 = 7′28″14, T2 = 7′35″48; line C, T1 = 15′04″37, T2 = 15′16″64.
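The three-line example above can be written down as data. The sketch below is one possible representation, not the patent's format: it assumes the third field of each timestamp is hundredths of a second, and the names `Segment` and `to_centiseconds` are hypothetical. Times are kept as integer centiseconds to avoid floating-point rounding.

```python
import re
from typing import NamedTuple


class Segment(NamedTuple):
    """Start and end of one line, in centiseconds (hypothetical layout)."""
    label: str
    start_cs: int
    end_cs: int


# Accepts both typographic primes (4′35″25) and plain quotes (4'35"25).
_TIMESTAMP = re.compile(r"""(\d+)\s*['′]\s*(\d+)\s*["″]\s*(\d+)""")


def to_centiseconds(text: str) -> int:
    """Parse minutes′seconds″hundredths into an integer number of centiseconds.

    Treating the last field as hundredths of a second is an assumption.
    """
    match = _TIMESTAMP.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognized timestamp: {text!r}")
    minutes, seconds, hundredths = (int(part) for part in match.groups())
    return (minutes * 60 + seconds) * 100 + hundredths


FIRST_POSITION_INFO = [
    Segment("A", to_centiseconds("4′35″25"), to_centiseconds("4′36″73")),
    Segment("B", to_centiseconds("7′28″14"), to_centiseconds("7′35″48")),
    Segment("C", to_centiseconds("15′04″37"), to_centiseconds("15′16″64")),
]
```

Under that assumption, line A spans 27525 to 27673 centiseconds, that is, about 1.48 seconds of recording.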
Optionally, in this embodiment, the second client may be, without limitation, one or more clients. The file to be dubbed may be divided, without limitation, into multiple roles, which are dubbed by the accounts logged in to the multiple clients, and one client may be responsible for dubbing one or more roles. Each client acquires its own audio data and then sends that audio data to every other client; each client merges all the audio data into one complete dubbing file, thereby completing the dubbing of the file to be dubbed.
Optionally, in this embodiment, the first role may be, without limitation, one or more roles. The second role may likewise be, without limitation, one or more roles.
Optionally, in this embodiment, the file to be dubbed may include, without limitation, video data and audio data, where the audio data is audio that has not been dubbed, for example background music, sound effects, or ambient sound. While acquiring the first audio data according to the first position information, the first client may, without limitation, simultaneously play the video data and/or the audio data of the file to be dubbed.
Through the above steps, the first client acquires the first audio data according to the first position information of the first lines data of the first role to be dubbed, obtains the second audio data sent by the second client, and merges the first audio data and the second audio data according to the first position information and the second position information corresponding to the second audio data. The people doing the dubbing therefore do not need to gather in one place, and remote dubbing becomes possible. The first client and the second client each record their own roles, and the resulting audio data is then merged into one complete audio file, which saves dubbing time and makes dubbing more convenient and faster. This solves the problem of low efficiency when dubbing the roles in a file to be dubbed and improves the efficiency of such dubbing.
Optionally, the correspondence between roles and lines data can be preset according to the line numbers at which each role's lines are located; the line numbers of a role's lines data are then determined from this correspondence, and the position information of the lines data is obtained from the line numbers. For example, in the above step S202, the first client obtains the first role selected by the first account on the first client, determines, according to the preset correspondence between roles and lines data, the line numbers at which the first lines data of the first role is located in the lines data of the file to be dubbed (the preset correspondence indicating the lines data corresponding to each role), and then obtains the first position information according to the line numbers.
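The lookup described above (role, then line numbers, then position information) can be sketched with two dictionaries. Both tables, the function name `position_info_for`, and all values are hypothetical; the patent only states that a preset correspondence exists.

```python
# Preset correspondence between roles and the line numbers of their lines
# (hypothetical values).
ROLE_TO_LINE_NUMBERS = {"Mother": [1, 3], "Pete": [2]}

# Start and end time of each line number, in seconds (hypothetical values).
LINE_TIMES = {1: (4.0, 6.5), 2: (7.0, 9.0), 3: (10.0, 12.5)}


def position_info_for(role, role_to_lines=ROLE_TO_LINE_NUMBERS, line_times=LINE_TIMES):
    """Return the (start, end) pairs of every line spoken by the given role."""
    return [line_times[number] for number in role_to_lines[role]]


print(position_info_for("Mother"))
```

Keeping the line numbers as the join key means the timing table can be replaced (for example by a different subtitle track) without touching the role assignment.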
Optionally, the role information of the file to be dubbed can be displayed on the first client for the first user to choose from. The selection may be made, without limitation, by tapping on the screen, or by entering voice or text; the first client recognizes the information entered by the first user and obtains the first role. For example, the first client may display the role information of the file to be dubbed, obtain the input information of the first account, identify the role information corresponding to the input information, and thereby obtain the first role.
In an optional embodiment, Fig. 3 is a schematic diagram of a method of obtaining the first role according to an optional embodiment of the present invention. As shown in Fig. 3, the file to be dubbed is a film, and the first client displays the film's roles: Mother and Pete. The user can select a role from a drop-down menu and can click the room members button to see the other users who are dubbing the film. After selecting a role, the user can click the confirm button to confirm and start dubbing, click the change role button to select a different role, or click the cancel button to cancel the role selection.
Optionally, the first client can assign roles automatically, and with a single mouse click the user can take on a role that has not yet been assigned.
Optionally, the first client can turn the recording function of the terminal on which it runs on or off as indicated by the first position information, in order to acquire the first audio data. For example, in the above step S204, the first client starts the recording function at the start time of the first lines data indicated by the first position information, receives audio data, and stops the recording function at the end time of the first lines data indicated by the first position information, thereby obtaining the first audio data.
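One way to picture the start/stop behaviour in step S204 is as a schedule of recorder events derived from the position information. The sketch below is a simplification under stated assumptions: segments are (start, end) pairs in seconds, they do not overlap, and the function names are hypothetical. It does not touch any real audio API.

```python
def recording_events(segments):
    """Turn (start, end) pairs into a time-ordered list of start/stop events."""
    events = []
    for start, end in sorted(segments):
        events.append((start, "start"))
        events.append((end, "stop"))
    return events


def is_recording(playback_time, segments):
    """Report whether the recorder should be on at the given playback time."""
    return any(start <= playback_time < end for start, end in segments)


segments = [(10.0, 12.0), (3.0, 5.0)]   # illustrative values
print(recording_events(segments))
print(is_recording(4.0, segments), is_recording(6.0, segments))
```

A real client would drive these events from the playback clock of the file to be dubbed, so that recording stays aligned with the picture.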
Optionally, each recording session can be controlled by one client. For example, before the above step S204, the first client can control the second client to acquire the second audio data and control the second client to send the second audio data.
In an optional embodiment, Fig. 4 is a schematic diagram of dubbing control according to an optional embodiment of the present invention. As shown in Fig. 4, for a dubbing task, a discussion group can be created through the first client, and the other clients taking part in the task, that is, one or more second clients, can join the discussion group. Every client can display this interface, and the first client can give the discussion group a name, which is shown in the group name position. The column on the right can display the identifier of each account taking part in the dubbing, and users can enter text or voice in the input box to discuss the dubbing. The first account logged in on the first client can start the dubbing task by clicking the start dubbing button, and clicking the button again (after the first click, the button at that position may change to show stop dubbing) stops the dubbing. After every role has finished dubbing, the synthesize dubbing button can be clicked to synthesize the acquired audio data.
Optionally, the first client may control the second client to acquire the second audio data in, without limitation, the following way: the first client obtains the second role selected by the second account on the second client, sends the second lines data of the second role and the second position information corresponding to the second lines data to the second client, and controls the second client to acquire the second audio data according to the second position information.
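The patent does not define the format of the message that carries the second lines data and second position information. Purely as an illustration, the sketch below serializes such a message as JSON; the message type `dub_request` and every field name are assumptions.

```python
import json


def build_dub_request(role, lines, positions):
    """Serialize a role's lines and their (start, end) times into one JSON message.

    The message layout is hypothetical, not taken from the patent.
    """
    if len(lines) != len(positions):
        raise ValueError("each line needs exactly one (start, end) pair")
    segments = [
        {"text": text, "start_s": start, "end_s": end}
        for text, (start, end) in zip(lines, positions)
    ]
    return json.dumps({"type": "dub_request", "role": role, "segments": segments},
                      ensure_ascii=False)


message = build_dub_request("Pete", ["Hello"], [(3.0, 5.0)])
print(message)
```

On receipt, the second client would parse the message and use the start and end times to schedule its own recording.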
Optionally, data can be transmitted using the Extensible Messaging and Presence Protocol (XMPP), and the data to be transmitted can be converted into the base64 encoding format so that it can be transmitted more effectively. For example, in the above step S206, the first client can obtain the second audio data that the second client has converted into the base64 encoding format and sent using XMPP.
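The base64 conversion itself can be sketched with Python's standard library. This example only covers encoding and decoding; it makes no assumption about any particular XMPP library, and the function names are hypothetical.

```python
import base64


def encode_audio(raw: bytes) -> str:
    """Encode raw audio bytes as ASCII base64 text suitable for a text-based message."""
    return base64.b64encode(raw).decode("ascii")


def decode_audio(text: str) -> bytes:
    """Restore the original audio bytes, rejecting characters outside the base64 alphabet."""
    return base64.b64decode(text, validate=True)


sample = b"RIFF"   # the first four bytes of a WAV file, used as stand-in data
encoded = encode_audio(sample)
print(encoded)
print(decode_audio(encoded) == sample)
```

Because XMPP stanzas are XML text, binary audio has to be represented as text in some way; base64 does this at the cost of roughly one third more bytes.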
Optionally, the multimedia video processing tool FFmpeg can be used to merge the data. For example, in the above step S208, the first client can restore the second audio data in the base64 encoding format to second audio data in an audio file format, and merge the first audio data and the second audio data in the audio file format according to the first position information and the second position information, using FFmpeg.
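The patent does not say which FFmpeg options are used. One plausible approach, shown here only as a sketch, delays each clip to its start time with the `adelay` filter and mixes the delayed clips with `amix`. The function below only builds the argument list; the file names are hypothetical and nothing is executed.

```python
def build_merge_command(clips, output_path):
    """Build an ffmpeg argument list that mixes clips at their start offsets.

    clips: list of (path, start_ms) pairs, where start_ms comes from the
    position information. Choosing adelay plus amix is an assumption.
    """
    command = ["ffmpeg", "-y"]
    for path, _ in clips:
        command += ["-i", path]

    filters, labels = [], []
    for index, (_, start_ms) in enumerate(clips):
        # Two delay values cover stereo input; extra values are ignored for mono.
        filters.append(f"[{index}:a]adelay={start_ms}|{start_ms}[a{index}]")
        labels.append(f"[a{index}]")
    filters.append(f"{''.join(labels)}amix=inputs={len(clips)}:duration=longest[out]")

    command += ["-filter_complex", ";".join(filters), "-map", "[out]", output_path]
    return command


command = build_merge_command([("first.wav", 1000), ("second.wav", 0)], "merged.wav")
print(" ".join(command))
```

The resulting list could be passed to `subprocess.run` on a machine where FFmpeg is installed. Note that `amix` adjusts input volumes by default, so a production pipeline may need additional gain handling.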
In an optional embodiment, for a target video that needs to be dubbed, the first client can, for each role in the video, collect the line numbers of all of that role's dialogue in the captions. For example, if the role Xiao Ming speaks caption lines 1, 3 and 5, those caption line numbers are marked for that role. By analogy, recording the caption line numbers of every role's dialogue in the whole video yields the correspondence between roles and lines data. When dubbing starts, all roles can be listed and the user can select the role to be played. After the selection, the line numbers of all captions of that role are read out, and the dubbing recording function is triggered according to the start time and end time of each corresponding caption line.
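Building the correspondence from a caption list can be sketched as a simple grouping step. The caption tuples below are illustrative: Xiao Ming and his lines 1, 3 and 5 follow the example in the text, while the second role, Xiao Hong, and the function name are assumptions.

```python
from collections import defaultdict


def build_role_index(captions):
    """Group caption line numbers by the role that speaks them.

    captions: iterable of (line_number, role) pairs in caption order.
    """
    index = defaultdict(list)
    for line_number, role in captions:
        index[role].append(line_number)
    return dict(index)


captions = [
    (1, "Xiao Ming"),
    (2, "Xiao Hong"),   # hypothetical second role
    (3, "Xiao Ming"),
    (4, "Xiao Hong"),
    (5, "Xiao Ming"),
]
role_index = build_role_index(captions)
print(sorted(role_index))        # roles to list when dubbing starts
print(role_index["Xiao Ming"])   # line numbers read out after selection
```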
Optionally, taking multimedia foreign language learning software as an example, a remote dubbing function for several people or for one person can be implemented, with the multiple pieces of dubbing data transmitted to each client and merged there into one complete dubbing audio file.
From the description of the above embodiments, those skilled in the art can clearly understand that the method according to the above embodiments can be implemented by software plus the necessary general-purpose hardware platform, or by hardware, although in many cases the former is the better implementation. Based on this understanding, the technical solution of the present invention, in essence or in the part that contributes over the prior art, can be embodied in the form of a software product. The computer software product is stored in a storage medium (such as ROM/RAM, a magnetic disk, or an optical disc) and includes instructions that cause a terminal device (which may be a mobile phone, a computer, a server, a network device, or the like) to perform the methods described in the embodiments of the present invention.
Embodiment 2
This embodiment also provides a voice data merging device, applied to the first client. The device is used to implement the above embodiments and preferred implementations; what has already been described is not repeated. As used below, the term "module" may be a combination of software and/or hardware that implements a predetermined function. Although the device described in the following embodiments is preferably implemented in software, implementation in hardware, or in a combination of software and hardware, is also possible and contemplated.
Fig. 5 is structural block diagram one of the voice data merging device according to an embodiment of the present invention. As shown in Fig. 5, the device includes:
a first acquisition module 52, configured to obtain first position information that corresponds, within the lines data of the file to be dubbed, to the first lines data of the first role in the file to be dubbed, wherein the first user logged in to the first client with the first account dubs the first role, and the first position information indicates the start time and end time of the first lines data of the first role within the lines data of the file to be dubbed;
a second acquisition module 54, coupled to the first acquisition module 52 and configured to acquire first audio data according to the first position information;
a third acquisition module 56, coupled to the second acquisition module 54 and configured to obtain second audio data sent by the second client, wherein the second user logged in to the second client with the second account dubs the second role in the file to be dubbed, the second lines data of the second role corresponds to second position information within the lines data of the file to be dubbed, and the second position information indicates the start time and end time of the second lines data of the second role within the lines data of the file to be dubbed;
a merging module 58, coupled to the third acquisition module 56 and configured to merge the first audio data and the second audio data according to the first position information and the second position information.
Optionally, the above audio data merging device may be applied to, but is not limited to, scenarios in which users dub a file to be dubbed, for example a voice actor dubbing a film or a student dubbing a course video.
Optionally, the above audio data merging device may be applied to, but is not limited to, a terminal device on which the above first client is installed. The terminal device may be, but is not limited to, a mobile phone, a notebook computer, a tablet computer, a desktop computer, a smart wearable device, a smart home appliance, or the like.
Optionally, in this embodiment, the file to be dubbed may include, but is not limited to, a film video file, a video course file, a video clip file, a song file, and the like.
Optionally, in this embodiment, the first position information may be used, without limitation, to indicate the start time and end time of the first lines data corresponding to the first role within the lines data of the file to be dubbed. For example, the first lines data corresponding to the first role includes three lines, A, B and C, and the first position information indicates the start time T1 and end time T2 of each of them: for line A, T1 is 4'35"25 and T2 is 4'36"73; for line B, T1 is 7'28"14 and T2 is 7'35"48; for line C, T1 is 15'04"37 and T2 is 15'16"64.
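As a non-authoritative sketch of how the first position information in the example above might be represented, the following Python snippet parses the three pairs of timestamps into seconds. It assumes that the notation means minutes, seconds, and hundredths of a second, which the embodiment does not state; the names to_seconds, LineSpan and first_position_info are hypothetical.

```python
import re
from dataclasses import dataclass

# Assumed reading of the notation: minutes ' seconds " hundredths of a second.
_STAMP = re.compile(r"""(\d+)'(\d+)"(\d+)""")


def to_seconds(stamp: str) -> float:
    """Convert a stamp such as 4'35"25 to seconds (spaces are ignored)."""
    match = _STAMP.fullmatch(stamp.replace(" ", ""))
    if match is None:
        raise ValueError(f"unrecognized timestamp: {stamp!r}")
    minutes, seconds, hundredths = (int(group) for group in match.groups())
    return minutes * 60 + seconds + hundredths / 100


@dataclass(frozen=True)
class LineSpan:
    """One line of a role: its label plus start time T1 and end time T2."""
    label: str
    start: float
    end: float


# Hypothetical container for the first position information of lines A, B, C.
first_position_info = [
    LineSpan("A", to_seconds("4'35\"25"), to_seconds("4'36\"73")),
    LineSpan("B", to_seconds("7'28\"14"), to_seconds("7'35\"48")),
    LineSpan("C", to_seconds("15'04\"37"), to_seconds("15'16\"64")),
]
```

Storing the times as plain seconds keeps later steps, such as deciding when to record and where to place a clip when merging, independent of the display notation.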
Optionally, in this embodiment, the second client may be, but is not limited to, one or more clients. The file to be dubbed may be divided into multiple roles, which are dubbed separately by the accounts logged in to multiple clients; one client may be responsible for dubbing one or more roles. Each client obtains its own audio data and sends it to every other client, and each client merges all the audio data into one complete dubbed file, thereby completing the dubbing of the file to be dubbed.
Optionally, in this embodiment, the first role may be, but is not limited to, one or more roles. The second role may likewise be, but is not limited to, one or more roles.
Optionally, in this embodiment, the file to be dubbed may include, but is not limited to, video data and audio data, where the audio data is audio that is not to be dubbed, such as background music, sound effects, or ambient sound. While obtaining the first audio data according to the first position information, the first client may simultaneously play the video data and/or the audio data of the file to be dubbed.
With the above device, the first client obtains the first audio data according to the first position information of the first lines data corresponding to the first role to be dubbed, obtains the second audio data sent by the second client, and merges the first audio data and the second audio data according to the first position information and the second position information corresponding to the second audio data. The people doing the dubbing therefore do not need to gather in one place, and remote dubbing becomes possible. Because the first client and the second client each record their own roles and the resulting audio data is then merged into one complete audio file, dubbing time is saved and dubbing becomes more convenient and faster. This addresses the problem of low efficiency when dubbing the roles in a file and improves dubbing efficiency.
Optionally, the correspondence between roles and lines data may be preset according to the line numbers at which each role's lines are located. The line number of a role's lines data is determined from this correspondence, and the position information of the lines data is obtained from the line number. Fig. 6 is structural block diagram 2 of an audio data merging device according to an embodiment of the present invention. As shown in Fig. 6, the first acquisition module 52 optionally includes:
A first acquisition unit 62, configured to obtain the first role selected by the first account on the first client;
A determining unit 64, coupled to the first acquisition unit 62 and configured to determine, according to a preset correspondence between roles and lines data, the line number at which the first lines data corresponding to the first role is located in the lines data of the file to be dubbed, wherein the preset correspondence between roles and lines data indicates the lines data corresponding to each role;
A second acquisition unit 66, coupled to the determining unit 64 and configured to obtain the first position information according to the line number.
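The lookup performed by the determining unit and the second acquisition unit can be sketched as two table lookups: role to line numbers, then line number to start and end time. The snippet below is only an illustration; the role names, the line numbers, the times, and the names ROLE_TO_LINE_NUMBERS, LINE_TIMES and position_info_for are all hypothetical and not taken from the embodiment.

```python
# Hypothetical preset correspondence: role -> line numbers in the lines data.
ROLE_TO_LINE_NUMBERS = {"Mother": [1, 3], "Pete": [2]}

# Hypothetical lines data: line number -> (start_seconds, end_seconds).
LINE_TIMES = {
    1: (275.25, 276.73),
    2: (448.14, 455.48),
    3: (904.37, 916.64),
}


def position_info_for(role):
    """Return [(line_number, start, end), ...] for the selected role."""
    if role not in ROLE_TO_LINE_NUMBERS:
        raise KeyError(f"no lines preset for role {role!r}")
    # First lookup: role -> line numbers; second lookup: line number -> times.
    return [(n, *LINE_TIMES[n]) for n in ROLE_TO_LINE_NUMBERS[role]]
```

Keeping the two tables separate mirrors the description: the correspondence is preset per role, while the times belong to the lines data of the file to be dubbed.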
Optionally, the role information of the file to be dubbed may be displayed on the first client for the first user to choose from. The selection may be made by, but is not limited to, tapping on the screen, or by entering voice or text; the first client recognizes the information entered by the first user to obtain the first role.
Optionally, the first acquisition unit 62 is configured to: display the role information of the file to be dubbed; obtain the input information of the first account; and identify the role information corresponding to the input information to obtain the first role.
In an optional implementation, Fig. 3 is a schematic diagram of a method for obtaining the first role according to an optional implementation of the present invention. As shown in Fig. 3, the file to be dubbed is a film, and the first client displays the film's roles: Mother and Pete. The user can select a role from a drop-down menu and can click the room members button to see the other users dubbing the film. After selecting a role, the user can click the confirm button to confirm and start dubbing, click the change role button to reselect a role, or click the cancel button to cancel the role selection.
Optionally, the first client may assign roles automatically, and a user may click to play a role that has not yet been assigned.
Optionally, the first client may turn the recording function of the terminal on which it runs on or off as indicated by the first position information, in order to obtain the first audio data.
Optionally, the second acquisition module 54 is configured to: start the recording function at the start time of the first lines data indicated by the first position information and receive audio data; and stop the recording function at the end time of the first lines data indicated by the first position information, thereby obtaining the first audio data.
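The start and stop behaviour described above can be pictured as a schedule of recorder events derived from the position information. The following sketch does not touch any audio hardware; it only computes when a recorder would be switched on and off. The function names and the (start, end) tuple layout are assumptions made for illustration.

```python
def recorder_events(spans):
    """Turn (start, end) spans in seconds into an ordered start/stop schedule."""
    events = []
    for start, end in sorted(spans):
        if end <= start:
            raise ValueError(f"end time {end} is not after start time {start}")
        events.append((start, "start"))  # switch recording on at T1
        events.append((end, "stop"))     # switch recording off at T2
    return events


def is_recording(playback_time, spans):
    """True if the recorder should be on at the given playback time."""
    return any(start <= playback_time < end for start, end in spans)
```

A client could poll is_recording against the playback clock of the file to be dubbed, or register timers from recorder_events; the embodiment leaves that choice open.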
Optionally, each recording session may be controlled by one client. Fig. 7 is structural block diagram 3 of an audio data merging device according to an embodiment of the present invention. As shown in Fig. 7, the device optionally further includes:
A first control module 72, coupled to the first acquisition module 52 and configured to control the second client to obtain the second audio data;
A second control module 74, coupled between the first control module 72 and the second acquisition module 54 and configured to control the second client to send the second audio data.
In an optional implementation, Fig. 4 is a schematic diagram of dubbing control according to an optional implementation of the present invention. As shown in Fig. 4, for a dubbing task, the first client can create a discussion group, and the other clients taking part in the task, that is, one or more second clients, can join it. Every client can display this interface, and the first client can give the discussion group a name, which is shown in the group name position. The column on the right can show the identifier of each account taking part in the dubbing. Users can enter text or voice in the input box to discuss the dubbing. The first account, logged in on the first client, can start the dubbing task by clicking the start dubbing button, and can stop it by clicking the same position again (after the first click, the button at that position may change to a stop dubbing button). After every role has been dubbed, the synthesize dialogue button can be clicked to synthesize the obtained audio data.
Optionally, the first control module may control the second client to obtain the second audio data as follows: obtain the second role selected by the second account on the second client; send the second lines data corresponding to the second role, together with the second position information corresponding to the second lines data, to the second client; and control the second client to obtain the second audio data according to the second position information.
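One way to picture this control step is as a small message that the first client sends to the second client, carrying the second role, its lines, and their position information. The sketch below uses JSON purely for illustration; the message type, the field names, and both function names are hypothetical, and the embodiment does not specify a message format.

```python
import json


def build_record_request(role, lines):
    """Build a control message; lines is [(text, start_seconds, end_seconds)]."""
    message = {
        "type": "record_request",  # hypothetical message type
        "role": role,
        "lines": [
            {"text": text, "start": start, "end": end}
            for text, start, end in lines
        ],
    }
    return json.dumps(message, ensure_ascii=False)


def parse_record_request(payload):
    """On the second client: recover the role and its (start, end) spans."""
    message = json.loads(payload)
    if message.get("type") != "record_request":
        raise ValueError("not a record request")
    spans = [(line["start"], line["end"]) for line in message["lines"]]
    return message["role"], spans
```

The second client could then feed the recovered spans to its own recording logic, so that both clients record against the same timeline.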
Optionally, data may be transmitted using the Extensible Messaging and Presence Protocol (XMPP), and the data to be transmitted may be converted to base64 encoding so that it can be transmitted more efficiently. For example, the third acquisition module 56 is configured to obtain the second audio data that the second client sends over XMPP after converting it to base64 encoding.
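The base64 conversion itself can be sketched with Python's standard base64 module. The XMPP transport is deliberately left out here; the snippet only shows the encoding applied by the sender and the decoding applied by the receiver, and the function names are hypothetical.

```python
import base64


def encode_audio(raw: bytes) -> str:
    """Sender side: turn raw audio bytes into base64 text for a message body."""
    return base64.b64encode(raw).decode("ascii")


def decode_audio(text: str) -> bytes:
    """Receiver side: restore the raw audio bytes, rejecting malformed input."""
    return base64.b64decode(text, validate=True)
```

Note that base64 text is about one third larger than the raw bytes; its practical benefit is that binary audio can travel inside XMPP's text-based XML stanzas.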
Optionally, the multimedia processing tool FFmpeg may be used to merge the data. For example, the merging module 58 is configured to: restore the base64-encoded second audio data to second audio data in an audio file format; and merge the first audio data and the second audio data in the audio file format, according to the first position information and the second position information, using FFmpeg.
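The embodiment does not say which FFmpeg options are used. One plausible approach, shown here only as a sketch, is to delay each clip to its start time with the adelay filter and mix the delayed clips with the amix filter. The snippet builds the command line without running it; the file names and the function name build_ffmpeg_merge are hypothetical, and each clip is assumed to be a separate audio file with a known start time in seconds.

```python
def build_ffmpeg_merge(clips, output):
    """Build an ffmpeg argument list; clips is [(path, start_seconds), ...]."""
    args = ["ffmpeg"]
    chains = []
    labels = []
    for index, (path, start) in enumerate(clips):
        args += ["-i", path]
        delay_ms = round(start * 1000)  # adelay takes milliseconds
        # Delay both channels of a stereo clip by the same amount.
        chains.append(f"[{index}:a]adelay={delay_ms}|{delay_ms}[a{index}]")
        labels.append(f"[a{index}]")
    # Mix all delayed clips onto one timeline, as long as the longest input.
    chains.append(
        f"{''.join(labels)}amix=inputs={len(clips)}:duration=longest[out]"
    )
    args += ["-filter_complex", ";".join(chains), "-map", "[out]", output]
    return args
```

The resulting list could be passed to subprocess.run on a machine with FFmpeg installed. By default amix adjusts input volumes when mixing, so a real implementation would need to check whether that behaviour is acceptable.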
It should be noted that each of the above modules can be implemented in software or in hardware. For the latter, this can be done in, but is not limited to, the following ways: the modules are all located in the same processor; or the modules are located in different processors in any combination.
Embodiment 3
An embodiment of the present invention further provides a storage medium that includes a stored program, wherein the program, when run, performs any one of the methods described above.
Optionally, in this embodiment, the storage medium may be configured to store program code for performing the following steps:
S1, the first client obtains first position information that corresponds, within the lines data of the file to be dubbed, to the first lines data corresponding to a first role in the file to be dubbed, wherein a first user who logs in to the first client with a first account dubs the first role, and the first position information indicates the start time and end time of the first lines data corresponding to the first role within the lines data of the file to be dubbed;
S2, the first client obtains first audio data according to the first position information;
S3, the first client obtains second audio data sent by a second client, wherein a second user who logs in to the second client with a second account dubs a second role in the file to be dubbed, the second lines data corresponding to the second role corresponds to second position information within the lines data of the file to be dubbed, and the second position information indicates the start time and end time of the second lines data corresponding to the second role within the lines data of the file to be dubbed;
S4, the first client merges the first audio data and the second audio data according to the first position information and the second position information.
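Step S4 can also be illustrated without any external tool by placing each recorded clip on a shared timeline at the offset given by its position information. The snippet below is a simplified sketch that treats audio as lists of integer samples; the function name merge_tracks, the sample representation, and the choice to add overlapping samples are assumptions, not details from the embodiment.

```python
def merge_tracks(clips, sample_rate, total_seconds):
    """Place clips on one timeline; clips is [(start_seconds, samples), ...]."""
    timeline = [0] * int(total_seconds * sample_rate)
    for start, samples in clips:
        offset = int(round(start * sample_rate))  # start time -> sample index
        for i, sample in enumerate(samples):
            position = offset + i
            if position < len(timeline):
                # Overlapping clips are added sample by sample.
                timeline[position] += sample
    return timeline
```

For example, with a sample rate of 4 samples per second, a two-sample clip starting at 0.5 s lands on samples 2 and 3, and a clip starting at 1.0 s lands on samples 4 and 5.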
Optionally, in this embodiment, the storage medium may include, but is not limited to, various media that can store program code, such as a USB flash drive, read-only memory (ROM), random access memory (RAM), a removable hard disk, a magnetic disk, or an optical disc.
An embodiment of the present invention further provides a processor configured to run a program, wherein the program, when run, performs the steps of any one of the methods described above.
Optionally, in this embodiment, the program is used to perform the following steps:
S1, the first client obtains first position information that corresponds, within the lines data of the file to be dubbed, to the first lines data corresponding to a first role in the file to be dubbed, wherein a first user who logs in to the first client with a first account dubs the first role, and the first position information indicates the start time and end time of the first lines data corresponding to the first role within the lines data of the file to be dubbed;
S2, the first client obtains first audio data according to the first position information;
S3, the first client obtains second audio data sent by a second client, wherein a second user who logs in to the second client with a second account dubs a second role in the file to be dubbed, the second lines data corresponding to the second role corresponds to second position information within the lines data of the file to be dubbed, and the second position information indicates the start time and end time of the second lines data corresponding to the second role within the lines data of the file to be dubbed;
S4, the first client merges the first audio data and the second audio data according to the first position information and the second position information.
Optionally, for specific examples in this embodiment, refer to the examples described in the above embodiments and optional implementations; they are not repeated here.
Obviously, those skilled in the art should understand that the modules or steps of the present invention described above can be implemented with general-purpose computing devices. They can be concentrated on a single computing device or distributed over a network of multiple computing devices. Optionally, they can be implemented as program code executable by a computing device, so that they can be stored in a storage device and executed by a computing device; in some cases, the steps shown or described can be performed in an order different from the one given here. Alternatively, they can each be made into a separate integrated circuit module, or several of the modules or steps can be made into a single integrated circuit module. Thus, the present invention is not limited to any specific combination of hardware and software.
The above are only preferred embodiments of the present invention and are not intended to limit it. For those skilled in the art, the present invention may have various modifications and variations. Any modification, equivalent replacement, improvement, or the like made within the principle of the present invention shall fall within its scope of protection.

Claims (11)

  1. An audio data merging method, characterized by comprising:
    a first client obtaining first position information that corresponds, within the lines data of a file to be dubbed, to first lines data corresponding to a first role in the file to be dubbed, wherein a first user who logs in to the first client with a first account dubs the first role, and the first position information indicates the start time and end time of the first lines data corresponding to the first role within the lines data of the file to be dubbed;
    the first client obtaining first audio data according to the first position information;
    the first client obtaining second audio data sent by a second client, wherein a second user who logs in to the second client with a second account dubs a second role in the file to be dubbed, second lines data corresponding to the second role corresponds to second position information within the lines data of the file to be dubbed, and the second position information indicates the start time and end time of the second lines data corresponding to the second role within the lines data of the file to be dubbed;
    the first client merging the first audio data and the second audio data according to the first position information and the second position information.
  2. The method according to claim 1, characterized in that the first client obtaining the first position information that corresponds, within the lines data of the file to be dubbed, to the first lines data corresponding to the first role in the file to be dubbed comprises:
    the first client obtaining the first role selected by the first account on the first client;
    the first client determining, according to a preset correspondence between roles and lines data, the line number at which the first lines data corresponding to the first role is located in the lines data of the file to be dubbed, wherein the preset correspondence between roles and lines data indicates the lines data corresponding to each role;
    the first client obtaining the first position information according to the line number.
  3. The method according to claim 1, characterized in that, before the first client obtains the second audio data sent by the second client, the method further comprises:
    the first client controlling the second client to obtain the second audio data;
    the first client controlling the second client to send the second audio data.
  4. The method according to claim 3, characterized in that the first client controlling the second client to obtain the second audio data comprises:
    the first client obtaining the second role selected by the second account on the second client;
    the first client sending the second lines data corresponding to the second role and the second position information corresponding to the second lines data to the second client;
    the first client controlling the second client to obtain the second audio data according to the second position information.
  5. The method according to any one of claims 1 to 4, characterized in that the first client obtaining the second audio data sent by the second client comprises:
    the first client obtaining the second audio data that the second client has converted to base64 encoding and sent using the Extensible Messaging and Presence Protocol (XMPP).
  6. The method according to claim 5, characterized in that the first client merging the first audio data and the second audio data according to the first position information and the second position information comprises:
    the first client restoring the base64-encoded second audio data to the second audio data in an audio file format;
    the first client merging the first audio data and the second audio data in the audio file format, according to the first position information and the second position information, using the multimedia processing tool FFmpeg.
  7. An audio data merging device, applied to a first client, characterized by comprising:
    a first acquisition module, configured to obtain first position information that corresponds, within the lines data of a file to be dubbed, to first lines data corresponding to a first role in the file to be dubbed, wherein a first user who logs in to the first client with a first account dubs the first role, and the first position information indicates the start time and end time of the first lines data corresponding to the first role within the lines data of the file to be dubbed;
    a second acquisition module, configured to obtain first audio data according to the first position information;
    a third acquisition module, configured to obtain second audio data sent by a second client, wherein a second user who logs in to the second client with a second account dubs a second role in the file to be dubbed, second lines data corresponding to the second role corresponds to second position information within the lines data of the file to be dubbed, and the second position information indicates the start time and end time of the second lines data corresponding to the second role within the lines data of the file to be dubbed;
    a merging module, configured to merge the first audio data and the second audio data according to the first position information and the second position information.
  8. The device according to claim 7, characterized in that the first acquisition module comprises:
    a first acquisition unit, configured to obtain the first role selected by the first account on the first client;
    a determining unit, configured to determine, according to a preset correspondence between roles and lines data, the line number at which the first lines data corresponding to the first role is located in the lines data of the file to be dubbed, wherein the preset correspondence between roles and lines data indicates the lines data corresponding to each role;
    a second acquisition unit, configured to obtain the first position information according to the line number.
  9. The device according to claim 7, characterized in that the device further comprises:
    a first control module, configured to control the second client to obtain the second audio data;
    a second control module, configured to control the second client to send the second audio data.
  10. A storage medium, characterized in that the storage medium includes a stored program, wherein the program, when run, performs the method according to any one of claims 1 to 6.
  11. A processor, characterized in that the processor is configured to run a program, wherein the program, when run, performs the method according to any one of claims 1 to 6.
CN201711018182.5A 2017-10-26 2017-10-26 Voice data merging method, device storage medium and processor Pending CN107809666A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711018182.5A CN107809666A (en) 2017-10-26 2017-10-26 Voice data merging method, device storage medium and processor

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711018182.5A CN107809666A (en) 2017-10-26 2017-10-26 Voice data merging method, device storage medium and processor

Publications (1)

Publication Number Publication Date
CN107809666A true CN107809666A (en) 2018-03-16

Family

ID=61582284

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711018182.5A Pending CN107809666A (en) 2017-10-26 2017-10-26 Voice data merging method, device storage medium and processor

Country Status (1)

Country Link
CN (1) CN107809666A (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5888157B2 (en) * 2012-07-11 2016-03-16 株式会社バッファロー Recorder system and recorder device
CN105709416A (en) * 2016-03-14 2016-06-29 上海科睿展览展示工程科技有限公司 Personalized dubbing method and system for multi-user operating game
CN106060424A (en) * 2016-06-14 2016-10-26 徐文波 Video dubbing method and device
CN106293347A (en) * 2016-08-16 2017-01-04 广东小天才科技有限公司 The learning method of a kind of man-machine interaction and device, user terminal
JP2017079343A (en) * 2015-10-19 2017-04-27 船井電機株式会社 Content distribution method, and content distribution server
CN106792013A (en) * 2016-11-29 2017-05-31 青岛海尔多媒体有限公司 A kind of method, the TV interactive for television broadcast sounds
CN106911900A (en) * 2017-04-06 2017-06-30 腾讯科技(深圳)有限公司 Video dubbing method and device

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110166818A (en) * 2018-11-30 2019-08-23 腾讯科技(深圳)有限公司 Wait match generation method, computer equipment and the storage medium of audio-video
CN110650366A (en) * 2019-10-29 2020-01-03 成都超有爱科技有限公司 Interactive dubbing method and device, electronic equipment and readable storage medium
CN110650366B (en) * 2019-10-29 2021-09-24 成都超有爱科技有限公司 Interactive dubbing method and device, electronic equipment and readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180316
