CN103237259A - Audio-channel processing device and audio-channel processing method for video - Google Patents

Audio-channel processing device and audio-channel processing method for video Download PDF

Info

Publication number
CN103237259A
CN103237259A CN 201310119021 CN201310119021A CN103237259A CN 103237259 A CN103237259 A CN 103237259A CN 201310119021 CN201310119021 CN 201310119021 CN 201310119021 A CN201310119021 A CN 201310119021A CN 103237259 A CN103237259 A CN 103237259A
Authority
CN
China
Prior art keywords
audio
audio frame
frame sequence
video
format
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 201310119021
Other languages
Chinese (zh)
Inventor
武悦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
TVMining Beijing Media Technology Co Ltd
Original Assignee
TVMining Beijing Media Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by TVMining Beijing Media Technology Co Ltd filed Critical TVMining Beijing Media Technology Co Ltd
Priority to CN 201310119021 priority Critical patent/CN103237259A/en
Publication of CN103237259A publication Critical patent/CN103237259A/en
Pending legal-status Critical Current

Links

Images

Abstract

The invention provides an audio-channel processing device for a video. The device comprises a de-capsulation device, an audio data decoding device, an audio-channel data transforming device, a coding device and an encapsulation device. The de-capsulation device is used for performing de-capsulation on an audio-video file so as to resolve the audio-video file into a corresponding audio frame sequence and a corresponding video frame sequence. The audio data decoding device is used for decoding the audio frame sequence and restoring audio frames in the audio frame sequence into PCM-format (pulse code modulation format) audio data. The audio-channel data transforming device is used for transforming single-channel PCM-format audio data into double-channel PCM-format audio data. The coding device is used for coding the double-channel PCM-format audio data to acquire the audio frame sequence after recoding. The encapsulation device is used for performing encapsulation on the video frame sequence and the audio frame sequence after recoding. The invention further provides an audio-channel processing method for the video.

Description

A kind of video sound channel processing unit and method
Technical field:
The present invention relates to sound channel processing unit and method in a kind of multimedia video frequency file, relate in particular to a kind of processing unit and method of the monophony in the multimedia video frequency file being carried out the multichannel conversion.
Background technology:
For video file, the result of broadcast of stereo double channel is better than monaural effect, but occupation space will increase in media file, under the situation of pursuing result of broadcast, how quickly and easily monophony (being generally L channel) to be converted to binaural signal, be a lot of users' demand.
At the problems referred to above, Chinese patent application No.201210217416.X discloses a kind of method that monophony is converted to dual track.In the method, it is by being converted to digital signal with simulated audio signal, export above-mentioned digital signal to first output channels by an input sound channel then, and above-mentioned digital signal is carried out exporting through second output channels behind the buffer memory again, realize converting monophony to dual track with this.Yet in the disclosed method of this Chinese patent application No.201210217416.X, owing to above-mentioned digital signal need be carried out buffer memory, this can cause the nonsynchronous problem of left and right acoustic channels, and this method to implement also be more complicated.
In addition, when a monophony video and a dual track video are merged into a video file, if directly transfer monophonic audio to multichannel audio, former monophony track is left in a plurality of tracks of multichannel, and other tracks of multichannel are set to blank.Like this, after handling through such merging, though the audio-video document of acquisition has been the multichannel audio frequency and video from form, still having only a track in original monaural fragment is sounding.When audio frequency and video were play, the effect that originally monaural fragment plays back remained monaural, that is to say to have only a sound horn.Therefore the impression of user's audition is subjected to very big influence, especially in the mosaic of original multichannel fragment and monophony fragment, becomes single sound horn suddenly by the sound field of a plurality of loudspeaker manufacturings and can produce very big contrast.
Summary of the invention:
In order to solve the problems of the technologies described above, the invention provides a kind of sound channel processing unit, comprising: de-encapsulating devices is used for audio-video document is carried out decapsulation, thereby described audio-video document is resolved to corresponding audio frame sequence and sequence of frames of video; Audio data decoding apparatus is used for described audio frame sequence is decoded, and the audio frame in the described audio frame sequence is reduced to the voice data of PCM form; The channel number conversion equipment is for the voice data that the voice data of monaural PCM form is converted to the PCM form of dual track; Code device is encoded by the voice data to the PCM form of described dual track, obtains the audio frame sequence behind recompile; Packaging system is used for described sequence of frames of video and described audio frame sequence behind recompile are encapsulated.
Preferably, described channel number conversion equipment is that the unit copies to the voice data of described monaural PCM form by certain byte length, thereby obtains the voice data of the PCM form of described dual track.
Preferably, described channel number conversion equipment is that the unit copies to the voice data of described monaural PCM form by the 2B byte length.
Preferably, described code device adopts the coded format identical with the coded format of the preceding audio frame sequence of decoding that the voice data of the PCM form of described dual track is encoded; Described packaging system adopts the encapsulation format identical with the encapsulation format of the preceding audio-video document of decapsulation that described sequence of frames of video and described audio frame sequence behind recompile are encapsulated.
Preferably, described code device adopts user's specified coding form that the voice data of the PCM form of described dual track is encoded; The encapsulation format that described packaging system adopts user's appointment encapsulates described sequence of frames of video and described audio frame sequence behind recompile.
Preferably, described sound channel processing unit also comprises the storage device for the storage data.
Preferably, described de-encapsulating devices has the decapsulation unit of audio-video document form judging unit, decapsulation selected cell and several different-formats; Wherein, described audio-video document form judging unit is used for the form of the described audio-video document of want decapsulation is judged; Described decapsulation selected cell is selected the described decapsulation unit of corresponding format to the judged result of the form of described audio-video document according to described audio-video document form judging unit; Described decapsulation unit is used for the audio-video document of corresponding format is resolved to described audio frame sequence and described sequence of frames of video.
Preferably, described audio data decoding apparatus has the decoder of coded format judging unit, decoder selected cell and several different-formats; Wherein, described coded format judging unit is for the coded format of judging described audio frame sequence; Described decoder selected cell based on the judged result of described coded format, selects corresponding described decoder that described audio frame sequence is decoded; Described decoder, the audio frame that is used for having the described audio frame sequence of corresponding coded format is reduced to the voice data of PCM form.
Preferably, described code device has the encoder of encoder selected cell and several different-formats; Wherein, described encoder selected cell is selected the described encoder with corresponding coded format according to the data of the relevant coded format of described memory device stores; Described encoder is encoded to the voice data of the PCM form of described dual track with corresponding coded format, obtains described audio frame sequence behind recompile.
Preferably, described packaging system comprises the encapsulation unit of encapsulation format selected cell and several different-formats; Wherein, described encapsulation format selected cell is selected to have the described encapsulation unit of corresponding encapsulation format based on the data of the relevant encapsulation format of described memory device stores; Described encapsulation unit encapsulates described sequence of frames of video and described audio frame sequence behind recompile with described corresponding encapsulation format.
The present invention also provides a kind of sound channel processing method, comprising: the decapsulation step, and audio-video document is carried out decapsulation, thereby described audio-video document is resolved to corresponding audio frame sequence and sequence of frames of video; The voice data decoding step is decoded to described audio frame sequence, and the audio frame in the described audio frame sequence is reduced to the voice data of PCM form; The channel number switch process converts the voice data of monaural PCM form to the voice data of the PCM form of dual track; Coding step is encoded by the voice data to the PCM form of described dual track, obtains the audio frame sequence behind recompile; Encapsulation step encapsulates described sequence of frames of video and described audio frame sequence behind recompile.
Preferably, in described channel number switch process, be that the unit copies by the voice data to described monaural PCM form by certain byte length, thereby obtain the voice data of the PCM form of described dual track.
Preferably, at described channel number switch process, be that the unit copies by the voice data to described monaural PCM form by the 2B byte length.
Preferably, in described coding step, adopt the coded format identical with the coded format of the preceding audio frame sequence of decoding that the voice data of the PCM form of described dual track is encoded; In described encapsulation step, adopt the encapsulation format identical with the encapsulation format of the preceding audio-video document of decapsulation that described sequence of frames of video and described audio frame sequence behind recompile are encapsulated.
Preferably, in described coding step, adopt user's specified coding form that the voice data of the PCM form of described dual track is encoded; In described encapsulation step, the encapsulation format that adopts user's appointment encapsulates described sequence of frames of video and described audio frame sequence behind recompile.
Preferably, described decapsulation step comprises: the step that the form of the described audio-video document of decapsulation is judged; Select the step of the decapsulation unit of corresponding format according to the judged result of the form of described audio-video document; The audio-video document of corresponding format is resolved to the step of described audio frame sequence and described sequence of frames of video.
Preferably, described voice data decoding step comprises: the step of judging the coded format of described audio frame sequence; Based on the judged result of described coded format, the step of selecting corresponding decoder that described audio frame sequence is decoded; To have the step that audio frame in the described audio frame sequence of corresponding coded format is reduced to the voice data of PCM form.
Preferably, described coding step comprises: the step of selecting to have the encoder of corresponding coded format according to the data of the relevant coded format of memory device stores; With corresponding coded format the voice data of the PCM form of dual track is encoded, obtain the step of described audio frame sequence behind recompile.
Preferably, described encapsulation step comprises: the step of selecting to have the encapsulation unit of corresponding encapsulation format based on the data of the relevant encapsulation format of described memory device stores; The step that described sequence of frames of video and described audio frame sequence behind recompile is encapsulated with described corresponding encapsulation format.
Technique scheme of the present invention can solution never increase the problem that when taking up room in the media file monophony video file is converted to the dual track video file.Simultaneously, technical scheme of the present invention has been simplified the operation to the channel number conversion, and can carry out this channel number conversion process in large quantity, thereby can improve the efficient that sound channel is handled and reduce corresponding processing cost.
Description of drawings:
The structured flowchart of the processing unit of the sound channel conversion that Fig. 1 relates to for embodiment of the present invention;
The structured flowchart of the decapsulation module of the processing unit of the sound channel conversion that Fig. 2 relates to for embodiment of the present invention;
The structured flowchart of the voice data decoder module of the processing unit of the sound channel conversion that Fig. 3 relates to for embodiment of the present invention;
The structured flowchart of the coding module of the processing unit of the sound channel conversion that Fig. 4 relates to for embodiment of the present invention;
The structured flowchart of the package module of the processing unit of the sound channel conversion that Fig. 5 relates to for embodiment of the present invention;
The process chart that Fig. 6 changes for the sound channel that embodiment of the present invention relates to;
The flow chart of the step S12 that Fig. 7 relates to for embodiment of the present invention;
The flow chart of the step S13 that Fig. 8 relates to for embodiment of the present invention;
The flow chart of the step S14 that Fig. 9 relates to for embodiment of the present invention;
Figure 10 A is the PCM packet structure schematic diagram before carrying out the sound channel processing that embodiment of the present invention relates to;
Figure 10 B is the PCM packet structure schematic diagram after carrying out the sound channel processing that embodiment of the present invention relates to;
The flow chart of the step S15 that Figure 11 embodiment of the present invention relates to;
The flow chart of the step S16 that Figure 12 relates to for embodiment of the present invention.
Embodiment:
Illustrated embodiment is set forth this invention with reference to the accompanying drawings below.The related video volume customization regulating system of embodiment of the present invention can be adjusted the volume of each audio-video document according to user's requirement when a plurality of audio-video documents to different file formats merge.
Fig. 1 has shown the structured flowchart of the sound channel conversion processing unit that embodiment of the present invention relates to.As shown in Figure 1, above-mentioned video sound volume regulating system comprises audio, video data receiver module 1, decapsulation module 2, voice data decoder module 3, coding module 4, package module 5, memory module 6, sound channel modular converter 7 and data/address bus 8.Above-mentioned audio, video data receiver module 1, decapsulation module 2, voice data decoder module 3, coding module 4, package module 5, memory module 6 and sound channel modular converter 7 are connected by above-mentioned data/address bus 8.
Above-mentioned audio, video data receiver module 1 is used for receiving the audio, video data with different file formats (for example adopting the audio-video document of forms such as AVI, WMV, RM, RMVB, MOV, TS/PS, MKV respectively) that need merge, and the above-mentioned audio, video data that receives is delivered to above-mentioned decapsulation module 2 carries out corresponding decapsulation.After above-mentioned decapsulation module 2 receives above-mentioned audio, video data from above-mentioned audio, video data receiver module 1, carry out decapsulation according to the corresponding document form of the above-mentioned audio, video data of receiving.In above-mentioned decapsulation process, above-mentioned decapsulation module 2 is interpreted as corresponding sequence of frames of video and audio frame sequence with audio-video document.Then, above-mentioned decapsulation module 2 is stored in the above-mentioned sequence of frames of video that obtains after the decapsulation and the relevant parameter of above-mentioned audio-video document encapsulation format in the above-mentioned memory module 6, above-mentioned audio frame sequence is delivered to above-mentioned voice data decoder module 3 decodes accordingly.Above-mentioned voice data decoder module 3 is decoded according to the corresponding encoded form of the audio frame sequence of receiving after receiving above-mentioned audio frame sequence from above-mentioned decapsulation module 2.In above-mentioned decode procedure, 3 pairs of above-mentioned audio frame sequence of above-mentioned voice data decoder module are decoded, audio frame in the above-mentioned audio frame sequence is reduced to the voice data of PCM form, and the voice data of above-mentioned PCM form is stored in the above-mentioned memory module 6, until the decoding of finishing all audio frame sequence with the relevant parameter of above-mentioned audio frame sequence coded format.Above-mentioned sound channel modular converter 7 obtains the voice data of the PCM form of storing from above-mentioned memory module 6, and to the audio sample point data by certain byte length (being generally 2 bytes) for copying the unit, thereby obtain the voice data of new PCM form, and the voice data of new PCM form is stored in the above-mentioned memory module 6.Above-mentioned coding module 4 obtains the voice data and the relevant parameter of above-mentioned audio frame sequence coded format of above-mentioned new PCM form from above-mentioned memory module 6, and select corresponding encoder that the voice data of above-mentioned new PCM form is encoded, compressed according to the relevant parameter of above-mentioned audio frame sequence coded format, form new audio frame sequence and be stored in the memory module 6.Above-mentioned package module 5 obtains the relevant parameter of above-mentioned audio-video document encapsulation format, above-mentioned sequence of frames of video and new audio frame sequence from above-mentioned memory module 6, the relevant parameter according to above-mentioned audio-video document encapsulation format is packaged into new audio-video document with above-mentioned sequence of frames of video and above-mentioned new audio frame sequence then.
The structured flowchart of the decapsulation module 2 of the sound channel conversion processing unit that Fig. 2 relates to for embodiment of the present invention.As shown in Figure 2, above-mentioned decapsulation module 2 comprises audio-video document form judging unit 21, decapsulation selected cell 22 and several decapsulation unit 23,24,25 ...Wherein, above-mentioned several decapsulation unit 23,24,25 ... have different forms, can carry out decapsulation corresponding to different file formats.Above-mentioned audio-video document form judging unit 21 can be judged the file format of the audio frequency and video that receive from above-mentioned audio, video data receiver module 1 and obtain the relevant parameter of the file format of above-mentioned audio frequency and video, and the relevant parameter of the file format of above-mentioned audio frequency and video stored in the above-mentioned memory module 6, above-mentioned decapsulation selected cell 22 can be according to the judged result (being the form of audio-video document) of above-mentioned audio-video document form judging unit 21 from decapsulation unit 23,24,25 ... the corresponding decapsulation unit of middle selection is carried out decapsulation to above-mentioned audio-video document, above-mentioned corresponding decapsulation unit (is above-mentioned decapsulation unit 23,24,25 ... one of in) be corresponding sequence of frames of video and audio frame sequence with received above-mentioned audio-video document deblocking, and above-mentioned sequence of frames of video and audio frame sequence are stored in the above-mentioned memory module 6.For example, when the form of audio-video document is respectively the AVI form, the form of 21 pairs of audio-video documents that receive of above-mentioned audio-video document form judging unit judges that above-mentioned decapsulation selected cell 22 selects the decapsulation unit (such as decapsulation unit 23) corresponding with the AVI form to carry out decapsulation based on above-mentioned judged result; If above-mentioned audio-video document form judging unit 21 judges that when the form of the audio-video document that receives was the RM form, above-mentioned decapsulation selected cell 22 selected the decapsulation unit (such as decapsulation unit 24) corresponding with the RM form to carry out decapsulation based on above-mentioned judged result; If above-mentioned audio-video document form judging unit 21 judges that when the form of the audio-video document that receives was the WMV form, above-mentioned decapsulation selected cell 22 selected the decapsulation unit (such as decapsulation unit 25) corresponding with the WMV form to carry out decapsulation based on above-mentioned judged result.
The structured flowchart of the voice data decoder module 3 of the sound channel conversion processing unit that Fig. 3 relates to for embodiment of the present invention.As shown in Figure 3, above-mentioned voice data decoder module 3 comprises coded format judging unit 31, decoder selected cell 32 and several decoder 33,34,35 ...Wherein, above-mentioned several decoder 33,34,35 ... can decode corresponding to different decoding requests.Above-mentioned coded format judging unit 31 can judge from the coded format of the audio frame sequence of 2 deblockings of above-mentioned decapsulation module and obtain the relevant parameter of coded format of the audio frame sequence of institute's deblocking, and the relevant parameter of the coded format of above-mentioned audio frame sequence is stored in the above-mentioned memory module 6.Above-mentioned decoder selected cell 32 can be according to the judged result (being the coded format of above-mentioned audio frame sequence) of above-mentioned coded format judging unit 31 from decoder 33,34,35 ... the corresponding decoder of middle selection is decoded to above-mentioned audio-video document, above-mentioned corresponding decoder (is above-mentioned decoder 33,34,35 ... one of in) the above-mentioned audio frame sequence of receiving is decoded, audio frame in the above-mentioned audio frame sequence is reduced to the PCM formatted data, and above-mentioned PCM formatted data is temporarily stored in the memory module 6.For example, if above-mentioned coded format judging unit 31 is judged when the coded format of above-mentioned audio frame sequence is the MP3 coded format, above-mentioned decoder selected cell 32 selects the decoder (such as decoder 33) corresponding with the MP3 coded format to decode, in the temporary memory module 6 of decoded PCM formatted data; If above-mentioned coded format judging unit 31 is judged when the coded format of above-mentioned audio frame sequence is the ACC coded format, above-mentioned decoder selected cell 32 selects the decoder (such as decoder 34) corresponding with the ACC coded format to decode, in the temporary memory module 6 of decoded PCM formatted data; If above-mentioned coded format judging unit 31 is judged when the coded format of above-mentioned audio frame sequence is the AC-3 coded format, above-mentioned decoder selected cell 32 selects the decoder (such as decoder 35) corresponding with the AC-3 coded format to decode, in the temporary memory module 6 of decoded PCM formatted data.
The structured flowchart of the coding module 4 of the sound channel conversion processing unit that Fig. 4 relates to for embodiment of the present invention.As shown in Figure 4, above-mentioned coding module 4 comprises encoder selected cell 41, several encoders 42,43,44 ...Above-mentioned encoder selected cell 41 can obtain new PCM formatted data and the coded format parameter of the former audio frame sequence of storage wherein after above-mentioned sound channel modular converter 7 is handled from above-mentioned memory module 6.It (is above-mentioned encoder 42,43,44 that above-mentioned encoder selected cell 41 is selected corresponding encoder according to the coded format parameter of above-mentioned former audio frame sequence ... one of in) above-mentioned new PCM formatted data is carried out recompile, form new audio frame sequence, and new audio frame sequence is stored in the above-mentioned memory module 6.
The structured flowchart of the package module 5 of the sound channel conversion processing unit that Fig. 5 relates to for embodiment of the present invention.As shown in Figure 5, above-mentioned package module 5 comprises encapsulation format selected cell 51 and several encapsulation units 52,53,54 ...Wherein, above-mentioned several encapsulation units 52,53,54 ... have different encapsulation format, can encapsulate corresponding to the requirement of different encapsulation format.
Above-mentioned encapsulation format selected cell 51 obtains the relevant parameter of above-mentioned audio-video document encapsulation format, above-mentioned sequence of frames of video, new audio frame sequence and (is encapsulation unit 52,53,54 according to the corresponding encapsulation unit of the related parameter choosing of above-mentioned audio-video document encapsulation format from above-mentioned memory module 6 ... one of in) above-mentioned sequence of frames of video and new audio frame sequence are encapsulated the audio-video document after obtaining to regulate.
In the above-described embodiment, above-mentioned coding module 4 and above-mentioned package module 5 can also be selected corresponding coded format and encapsulation format according to user's demand.
The flow chart of the sound channel conversion process that Fig. 6 relates to for embodiment of the present invention.Below, the processing procedure of the sound channel conversion process that relates in present embodiment with reference to Fig. 6 explanation.
At first, the input audio-video document, above-mentioned audio, video data receiver module 1 receives above-mentioned audio-video document data and it is delivered to above-mentioned decapsulation module 2(step S11).2 pairs of above-mentioned audio-video document data of above-mentioned decapsulation module are carried out decapsulation, above-mentioned audio-video document data are resolved to sequence of frames of video and audio frame sequence, and the relevant parameter of the encapsulation format of above-mentioned sequence of frames of video and audio frame sequence and former audio-video document is stored in the above-mentioned memory module 6 (step S12).
Above-mentioned voice data decoder module 3 obtains above-mentioned audio frame sequence from above-mentioned memory module 6, audio frame in the above-mentioned audio frame sequence is reduced to the PCM formatted data, and the relevant parameter of the coded format of above-mentioned PCM formatted data and former audio frame sequence is stored in the above-mentioned memory module 6 (step S13).
Above-mentioned sound channel modular converter 7 obtains above-mentioned PCM formatted data from above-mentioned memory module 6, and above-mentioned PCM formatted data carried out the channel number conversion process, thereby obtain the voice data of new PCM form, and the voice data of new PCM form is stored in the above-mentioned memory module 6 (step S14).Particularly, be that the unit copies to the audio sample point data of above-mentioned monaural PCM formatted data by certain byte length, thereby obtain the voice data (voice data of new PCM form) of the PCM form of dual track.
Above-mentioned coding module 4 obtains the relevant parameter of the coded format of the PCM formatted data of above-mentioned dual track and former audio frame sequence from above-mentioned memory module 6, and according to the relevant parameter of the coded format of former audio frame sequence new PCM formatted data is encoded, form new audio frame sequence and store in the above-mentioned memory module 6 (step S15).
Above-mentioned package module 5 obtains the relevant parameter of the encapsulation format of sequence of frames of video, new audio frame sequence and former audio-video document from above-mentioned memory module 6, and encapsulate according to the relevant parameter of the encapsulation format of former audio-video document, form new audio-video document (step S16).
The flow chart of the step S12 of the sound channel conversion process that Fig. 7 relates to for embodiment of the present invention.Below, the audio-video document that explanation relates in present embodiment with reference to Fig. 7 carries out the decapsulation processing procedure.
Audio file formats judging unit 21 is judged the encapsulation format of the audio-video document that receives based on the audio-video document data that receive, judged result is transported to decapsulation selected cell 22(step 121), the relevant parameter with the encapsulation format of former audio-video document stores in the above-mentioned memory module 6 (step 122) simultaneously.Above-mentioned decapsulation selected cell 22 is selected corresponding decapsulation unit (step 123) based on above-mentioned judged result.For example, above-mentioned audio-video document data can be encapsulation format such as AVI, WMV, RM, RMVB, MOV, TS/PS, MKV, and above-mentioned corresponding decapsulation unit can be carried out decapsulation to the audio-video document data of above-mentioned corresponding encapsulation format.Above-mentioned decapsulation unit (is decapsulation unit 23,24,25 ... one of in) above-mentioned audio-video document data are carried out decapsulation, above-mentioned audio-video document data are resolved to sequence of frames of video and audio frame sequence (step 124), and sequence of frames of video and audio frame sequence are stored in the above-mentioned memory module 6 (step 125).
The flow chart of the step S13 of the sound channel conversion process that Fig. 8 relates to for embodiment of the present invention.Below, explanation is carried out decoding process what present embodiment related to voice data with reference to Fig. 8.
The coded format of the audio frame sequence after 31 pairs of decapsulations of coded format judging unit (for example MP3, AAC, AC-3, MpegAudio etc.) is judged, and judged result is transported to decoder selected cell 32(step 131), the relevant parameter with the coded format of audio frame sequence stores in the above-mentioned memory module 6 (step 132) simultaneously.Above-mentioned decoder selected cell 32 is selected based on above-mentioned judged result and the corresponding decoder of above-mentioned coded format is decoded (step 133), above-mentioned decoder (is decoder 33,34,35 ... one of in) audio frame in the above-mentioned audio frame sequence is reduced to PCM formatted data (step 134), and above-mentioned PCM formatted data is stored in the memory module 6 (step 135).
The flow chart of the step S14 of the sound channel conversion process that Fig. 9 relates to for embodiment of the present invention; Figure 10 A carries out sound channel to handle PCM packet structure schematic diagram before; Figure 10 B has carried out sound channel to handle PCM packet structure schematic diagram afterwards.Below, the processing procedure that sound channel is changed that relates in present embodiment with reference to Fig. 9, Figure 10 A and Figure 10 B explanation.
At first, above-mentioned sound channel modular converter 7 obtains above-mentioned PCM formatted data (step S140) from above-mentioned memory module 6, and determines whether above-mentioned PCM formatted data is monaural voice data (step S141).When above-mentioned PCM formatted data is confirmed as monaural voice data (for example shown in Figure 10 A) (step S141: be), above-mentioned sound channel modular converter 7 is that the unit copies (step S142) with the audio sample point data of above-mentioned PCM formatted data by certain byte length.Judge whether all audio sample point data are copied and finish (step S143), if do not finish (the step S143: not) that copies to all audio sample point data, getting back to step S142 continues to copy, if finished copy (the step S143: be) to all audio sample point data, obtain the voice data (being the voice data of the PCM form of dual track) (for example shown in Figure 10 B) (step S144) of new PCM form, export the voice data of new PCM form then and be stored in the above-mentioned memory module 6 (step S145).In step S141, if (step S141: not), then directly enter step S145 output PCM formatted data when above-mentioned sound channel modular converter 7 is determined voice data that above-mentioned PCM formatted datas are dual track.
In the present embodiment, above-mentioned sound channel modular converter 7 is that the unit copies with the audio sample point data of PCM formatted data by the 2B byte length.For example, the monaural PCM packet shown in Figure 10 A is the PCM packet (the wherein copy of the previous double byte of double-byte representation of italic) that becomes the dual track shown in Figure 10 B after the unit copies by the 2B byte length through above-mentioned sound channel modular converter 7.Be that the unit copies monaural PCM data by above-mentioned sound channel modular converter 7 according to per 2 bytes, the digital signal of audio frequency is after the process digital-to-analogue conversion, two loud speakers send identical sound about will driving, thereby realize that monophony is to the conversion of dual track.
The flow chart of the step S15 that PCM formatted data after the sound channel conversion process that Figure 11 relates to for embodiment of the present invention is encoded.Below, the PCM formatted data to after the sound channel conversion process that relates in present embodiment with reference to Figure 11 explanation processing procedure of encoding.
Encoder selected cell 41 from above-mentioned memory module 6, obtain above-mentioned after the sound channel conversion process the PCM formatted data and the relevant parameter (step S151) of the coded format of above-mentioned former audio frame sequence.Encoder selected cell 41 is based on the corresponding encoder of the related parameter choosing of the coded format of above-mentioned former audio frame sequence (step S152).Above-mentioned chosen encoder is encoded to above-mentioned PCM formatted data after the sound channel conversion process, obtaining new audio frame sequence (step S153), and the new audio frame sequence that obtains is stored in (step S154) in the memory module 6.
The flow chart of the step S16 that sequence of frames of video and new audio frame sequence are encapsulated that Figure 12 relates to for embodiment of the present invention.Below, explanation is carried out the encapsulation process process what present embodiment related to sequence of frames of video and new audio frame sequence with reference to Figure 11.
Above-mentioned encapsulation format selected cell 51 obtains the relevant parameter (step S161) of the encapsulation format of sequence of frames of video, new audio frame sequence and former audio-video document from above-mentioned memory module 6.Above-mentioned encapsulation format selected cell 51 is based on the related parameter choosing of the encapsulation format of former audio-video document and the corresponding encapsulation unit of encapsulation format (step S162) of former audio-video document, above-mentioned encapsulation unit encapsulates again to above-mentioned sequence of frames of video and above-mentioned new audio frame sequence, formation has the new audio-video document (step S163) identical with the encapsulation format of former audio-video document, and exports new audio-video document (step S164).
In the present embodiment, in step 15, adopt the coded format identical with the coded format of former audio frame sequence that the PCM formatted data after the sound channel conversion process is encoded.But the present invention is not limited to this, also can adopt user's specified coding form that the PCM formatted data after the sound channel conversion process is encoded in step 15.
In the present embodiment, in step 16, adopt the encapsulation format identical with the encapsulation format of former audio-video document that sequence of frames of video and new audio frame sequence are encapsulated.But the present invention is not limited to this, also can adopt the encapsulation format of user's appointment that sequence of frames of video and new audio frame sequence are encapsulated in step 15.
Should understand embodiment described in the above specification and embodiment only is used for explanation the present invention and is not used in and limits the scope of the invention.After having read the present invention, those skilled in the art all fall within the application's claims institute restricted portion to the modification of various equivalents of the present invention.

Claims (19)

1. video sound channel processing unit comprises:
De-encapsulating devices is used for audio-video document is carried out decapsulation, thereby described audio-video document is resolved to corresponding audio frame sequence and sequence of frames of video;
Audio data decoding apparatus is used for described audio frame sequence is decoded, and the audio frame in the described audio frame sequence is reduced to the voice data of PCM form;
The channel number conversion equipment is for the voice data that the voice data of monaural PCM form is converted to the PCM form of dual track;
Code device is encoded by the voice data to the PCM form of described dual track, obtains the audio frame sequence behind recompile;
Packaging system is used for described sequence of frames of video and described audio frame sequence behind recompile are encapsulated.
2. sound channel processing unit according to claim 1 is characterized in that:
Described channel number conversion equipment is that the unit copies to the voice data of described monaural PCM form by certain byte length, thereby obtains the voice data of the PCM form of described dual track.
3. sound channel processing unit according to claim 2 is characterized in that:
Described channel number conversion equipment is that the unit copies to the voice data of described monaural PCM form by the 2B byte length.
4. sound channel processing unit according to claim 3 is characterized in that:
Described code device adopts the coded format identical with the coded format of the preceding audio frame sequence of decoding that the voice data of the PCM form of described dual track is encoded;
Described packaging system adopts the encapsulation format identical with the encapsulation format of the preceding audio-video document of decapsulation that described sequence of frames of video and described audio frame sequence behind recompile are encapsulated.
5. sound channel processing unit according to claim 3 is characterized in that:
Described code device adopts user's specified coding form that the voice data of the PCM form of described dual track is encoded;
The encapsulation format that described packaging system adopts user's appointment encapsulates described sequence of frames of video and described audio frame sequence behind recompile.
6. according to the arbitrary described sound channel processing unit of claim 1~5, also comprise:
The storage device that is used for the storage data.
7. sound channel processing unit according to claim 6 is characterized in that:
Described de-encapsulating devices has the decapsulation unit of audio-video document form judging unit, decapsulation selected cell and several different-formats; Wherein,
Described audio-video document form judging unit is used for the form of the described audio-video document of want decapsulation is judged;
Described decapsulation selected cell is selected the described decapsulation unit of corresponding format to the judged result of the form of described audio-video document according to described audio-video document form judging unit;
Described decapsulation unit is used for the audio-video document of corresponding format is resolved to described audio frame sequence and described sequence of frames of video.
8. sound channel processing unit according to claim 7 is characterized in that:
Described audio data decoding apparatus has the decoder of coded format judging unit, decoder selected cell and several different-formats; Wherein,
Described coded format judging unit is for the coded format of judging described audio frame sequence;
Described decoder selected cell based on the judged result of described coded format, selects corresponding described decoder that described audio frame sequence is decoded;
Described decoder, the audio frame that is used for having the described audio frame sequence of corresponding coded format is reduced to the voice data of PCM form.
9. sound channel processing unit according to claim 8 is characterized in that:
Described code device has the encoder of encoder selected cell and several different-formats; Wherein,
Described encoder selected cell is selected the described encoder with corresponding coded format according to the data of the relevant coded format of described memory device stores;
Described encoder is encoded to the voice data of the PCM form of described dual track with corresponding coded format, obtains described audio frame sequence behind recompile.
10. sound channel processing unit according to claim 9 is characterized in that:
Described packaging system comprises the encapsulation unit of encapsulation format selected cell and several different-formats; Wherein,
Described encapsulation format selected cell is selected to have the described encapsulation unit of corresponding encapsulation format based on the data of the relevant encapsulation format of described memory device stores;
Described encapsulation unit encapsulates described sequence of frames of video and described audio frame sequence behind recompile with described corresponding encapsulation format.
11. a video sound channel processing method comprises:
The decapsulation step is carried out decapsulation to audio-video document, thereby described audio-video document is resolved to corresponding audio frame sequence and sequence of frames of video;
The voice data decoding step is decoded to described audio frame sequence, and the audio frame in the described audio frame sequence is reduced to the voice data of PCM form;
The channel number switch process converts the voice data of monaural PCM form to the voice data of the PCM form of dual track;
Coding step is encoded by the voice data to the PCM form of described dual track, obtains the audio frame sequence behind recompile;
Encapsulation step encapsulates described sequence of frames of video and described audio frame sequence behind recompile.
12. sound channel processing method according to claim 11 is characterized in that:
In described channel number switch process, be that the unit copies by the voice data to described monaural PCM form by certain byte length, thereby obtain the voice data of the PCM form of described dual track.
13. sound channel processing method according to claim 12 is characterized in that:
At described channel number switch process, be that the unit copies by the voice data to described monaural PCM form by the 2B byte length.
14. method according to claim 13 is characterized in that:
In described coding step, adopt the coded format identical with the coded format of the preceding audio frame sequence of decoding that the voice data of the PCM form of described dual track is encoded;
In described encapsulation step, adopt the encapsulation format identical with the encapsulation format of the preceding audio-video document of decapsulation that described sequence of frames of video and described audio frame sequence behind recompile are encapsulated.
15. method according to claim 13 is characterized in that:
In described coding step, adopt user's specified coding form that the voice data of the PCM form of described dual track is encoded;
In described encapsulation step, the encapsulation format that adopts user's appointment encapsulates described sequence of frames of video and described audio frame sequence behind recompile.
16. according to the arbitrary described method of claim 11~15, it is characterized in that:
Described decapsulation step comprises:
The step that the form of the described audio-video document of want decapsulation is judged;
Select the step of the decapsulation unit of corresponding format according to the judged result of the form of described audio-video document;
The audio-video document of corresponding format is resolved to the step of described audio frame sequence and described sequence of frames of video.
17. method according to claim 16 is characterized in that:
Described voice data decoding step comprises:
Judge the step of the coded format of described audio frame sequence;
Based on the judged result of described coded format, the step of selecting corresponding decoder that described audio frame sequence is decoded;
To have the step that audio frame in the described audio frame sequence of corresponding coded format is reduced to the voice data of PCM form.
18. method according to claim 17 is characterized in that:
Described coding step comprises:
Select to have the step of the encoder of corresponding coded format according to the data of the relevant coded format of memory device stores;
With corresponding coded format the voice data of the PCM form of dual track is encoded, obtain the step of described audio frame sequence behind recompile.
19. method according to claim 18 is characterized in that:
Described encapsulation step comprises:
Select to have the step of the encapsulation unit of corresponding encapsulation format based on the data of the relevant encapsulation format of described memory device stores;
The step that described sequence of frames of video and described audio frame sequence behind recompile is encapsulated with described corresponding encapsulation format.
CN 201310119021 2013-03-29 2013-04-08 Audio-channel processing device and audio-channel processing method for video Pending CN103237259A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 201310119021 CN103237259A (en) 2013-03-29 2013-04-08 Audio-channel processing device and audio-channel processing method for video

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
CN201310107027 2013-03-29
CN201310107027.6 2013-03-29
CN 201310119021 CN103237259A (en) 2013-03-29 2013-04-08 Audio-channel processing device and audio-channel processing method for video

Publications (1)

Publication Number Publication Date
CN103237259A true CN103237259A (en) 2013-08-07

Family

ID=48885270

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 201310119021 Pending CN103237259A (en) 2013-03-29 2013-04-08 Audio-channel processing device and audio-channel processing method for video

Country Status (1)

Country Link
CN (1) CN103237259A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105635748A (en) * 2015-12-30 2016-06-01 矽力杰半导体技术(杭州)有限公司 Audio-video data sending method and receiving method and audio-video data transmission system
CN106412687A (en) * 2015-07-27 2017-02-15 腾讯科技(深圳)有限公司 Interception method and device of audio and video clips
CN106504759A (en) * 2016-11-04 2017-03-15 维沃移动通信有限公司 A kind of mixed audio processing method and terminal device
CN107293303A (en) * 2017-06-16 2017-10-24 苏州蜗牛数字科技股份有限公司 A kind of multichannel voice lost packet compensation method
CN109618139A (en) * 2019-01-10 2019-04-12 深圳市华金盾信息科技有限公司 A kind of intelligent video monitoring system and method for view-based access control model routing
CN110083328A (en) * 2019-03-29 2019-08-02 北京奇艺世纪科技有限公司 A kind of audio frequency playing method, device, electronic equipment and storage medium
WO2021004048A1 (en) * 2019-07-09 2021-01-14 海信视像科技股份有限公司 Display device and audio data transmission method

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106412687A (en) * 2015-07-27 2017-02-15 腾讯科技(深圳)有限公司 Interception method and device of audio and video clips
CN106412687B (en) * 2015-07-27 2020-06-05 腾讯科技(深圳)有限公司 Method and device for intercepting audio and video clips
CN105635748A (en) * 2015-12-30 2016-06-01 矽力杰半导体技术(杭州)有限公司 Audio-video data sending method and receiving method and audio-video data transmission system
CN105635748B (en) * 2015-12-30 2019-02-01 上海芃矽半导体技术有限公司 Sending method, the Transmission system of method of reseptance and audio-visual data of audio-visual data
CN106504759A (en) * 2016-11-04 2017-03-15 维沃移动通信有限公司 A kind of mixed audio processing method and terminal device
CN106504759B (en) * 2016-11-04 2019-07-26 维沃移动通信有限公司 A kind of mixed audio processing method and terminal device
CN107293303A (en) * 2017-06-16 2017-10-24 苏州蜗牛数字科技股份有限公司 A kind of multichannel voice lost packet compensation method
CN109618139A (en) * 2019-01-10 2019-04-12 深圳市华金盾信息科技有限公司 A kind of intelligent video monitoring system and method for view-based access control model routing
CN110083328A (en) * 2019-03-29 2019-08-02 北京奇艺世纪科技有限公司 A kind of audio frequency playing method, device, electronic equipment and storage medium
WO2021004048A1 (en) * 2019-07-09 2021-01-14 海信视像科技股份有限公司 Display device and audio data transmission method
WO2021004047A1 (en) * 2019-07-09 2021-01-14 海信视像科技股份有限公司 Display device and audio playing method

Similar Documents

Publication Publication Date Title
US11463831B2 (en) Apparatus and method for efficient object metadata coding
CN103237259A (en) Audio-channel processing device and audio-channel processing method for video
CN110675882B (en) Method, encoder and decoder for decoding and encoding downmix matrix
KR101751228B1 (en) Efficient coding of audio scenes comprising audio objects
RU2630754C2 (en) Effective coding of sound scenes containing sound objects
US10068577B2 (en) Audio segmentation based on spatial metadata
JP5541928B2 (en) Audio signal processing method and apparatus
US7672743B2 (en) Digital audio processing
CN103200425B (en) A kind of multimedia processing apparatus and method
CN105659319B (en) Rendering of multi-channel audio using interpolated matrices
WO2006126857A2 (en) Method of encoding and decoding an audio signal
WO2015150384A1 (en) Efficient coding of audio scenes comprising audio objects
KR20210027236A (en) Method and device for generating or decoding a bitstream containing an immersive audio signal
US11488614B2 (en) Method and apparatus for decoding a bitstream including encoded Higher Order Ambisonics representations
CN106375778B (en) Method for transmitting three-dimensional audio program code stream conforming to digital movie specification
US10176813B2 (en) Audio encoding and rendering with discontinuity compensation
KR20140047509A (en) Audio coding/decoding apparatus using reverberation signal of object audio signal
CN105376641A (en) Device and method for streaming audio and video file
TW202242852A (en) Adaptive gain control
JP7182751B1 (en) System, method, and apparatus for conversion of channel-based audio to object-based audio
CN103200371A (en) System and method for video volume custom adjustment

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20130807

RJ01 Rejection of invention patent application after publication