CN103237258A

CN103237258A - System and method for automatically adjusting video volume

Info

Publication number: CN103237258A
Application number: CN2013101189406A
Authority: CN
Inventors: 武悦
Original assignee: TVMining Beijing Media Technology Co Ltd
Current assignee: TVMining Beijing Media Technology Co Ltd
Priority date: 2013-03-29
Filing date: 2013-04-08
Publication date: 2013-08-07

Abstract

The invention provides a system for automatically adjusting audio-video volume. The system comprises a de-capsulation device, an audio data decoding device, a volume calculating device, a volume adjusting device, a coding device and an encapsulation device. The de-capsulation device is used for resolving a plurality of audio-video files into corresponding audio frame sequences and video frame sequences. The audio data decoding device is used for restoring the audio frame sequences into PCM-format (pulse code modulation format) audio data. The volume calculating device is used for calculating volume coefficients of the PCM-format audio data and volume adjusting coefficients of the PCM-format audio data. The volume adjusting device is used for performing multiplying on the PCM-format audio data with the volume adjusting parameters to acquire the PCM-format audio data after the volume adjustment. The coding device is used for coding the PCM-format audio data to acquire audio frame sequences after recoding. And, the encapsulation device is used for performing encapsulation to the video frame sequences and the recoded audio frame sequences. The invention further provides a method for automatically adjusting audio-video volume.

Description

The self-regulating system and method for a kind of video volume

Technical field:

The present invention relates to the self-regulating system and method for a kind of audio frequency and video volume, especially be applied to volume automatic regulating system and method thereof when the different audio-video document of a plurality of volumes merged.

Background technology:

When making the video source file, two or more video files need be merged into a video file, because each video file volume difference separately, this will cause the user when seeing TV or playing video file, run into sound situation fluctuated sometimes, What is more, intercuts some advertisements sometimes in the TV play, and the sound of advertisement can suddenly uprise.

In order to address the above problem, a kind of method of automatic control channel volume and the digital TV terminal of use said method are disclosed among the Chinese patent application No.200910077304.7.Yet, though this Chinese patent application No.200910077304.7 has solved digital TV terminal program sound situation fluctuated when switching program, but it is just implemented at unique user terminal number word television terminal, therefore can't solve sound that all users run into problem fluctuated.Unify video volume control problem when just having the video file that how in making video source file process, to merge two or more different volumes thus.

Summary of the invention:

In order to solve the problems of the technologies described above, the invention provides the self-regulating system of a kind of audio frequency and video volume, comprising: de-encapsulating devices is used for several audio-video documents are resolved to corresponding audio frame sequence and sequence of frames of video; Audio data decoding apparatus is for the voice data that described audio frame sequence is reduced to the PCM form; The volume calculation element is for the volume adjustment factor of the voice data that calculates described PCM form; Volume adjustment device to the processing of doubling of the voice data of described PCM form, obtains the voice data of the PCM form after volume is regulated with described volume adjustment factor; Code device by the voice data of described PCM form is encoded, obtains the audio frame sequence behind recompile; Packaging system is used for described sequence of frames of video and described audio frame sequence behind recompile are encapsulated.

Preferably, described volume calculation element is the volume mean value of each frame in the voice data of the unit PCM form that calculates each audio-video document with certain byte length, calculate the volume mean value of all frames in the voice data of PCM form of each audio-video document then, with the volume mean value of all frames in the voice data of described PCM form as described volume coefficient.

Preferably, described volume calculation element is at the volume mean value of all frames in the voice data of one in described several audio-video documents designated PCM form that calculates appointed audio-video document as the audio-video document that calculates the reference volume coefficient, with it as the reference volume coefficient, calculate the volume mean value of all frames in the voice data of current PCM form to be processed simultaneously and with its volume coefficient as the voice data of current PCM form to be processed, with the volume coefficient of described reference volume coefficient divided by the voice data of current PCM form to be processed, obtain described volume adjustment factor then.

Preferably, described volume calculation element calculates the volume mean value of all frames in the voice data of PCM form of described several audio-video documents at described several audio-video documents, with it as the reference volume coefficient, calculate the volume mean value of all frames in the voice data of current PCM form to be processed simultaneously and with its volume coefficient as the voice data of current PCM form to be processed, with the volume coefficient of described reference volume coefficient divided by the voice data of current PCM form to be processed, obtain described volume adjustment factor then.

Preferably, the computational methods of described volume mean value are as follows: P=(V ₁+ V ₂+ ... + V _n)/n, wherein, P is the volume mean value of each frame, and V is the volume value of each unit, and n is the units of each frame.

Preferably, the computational methods of the volume mean value of all frames are as follows: A=(P ₁+ P ₂+ ... + P _f)/f, wherein, A is the volume mean value of all frames, and P is the volume mean value of each frame, and f is the frame number in the voice data of PCM form.

Preferably, described certain byte length is 2 bytes.

Preferably, described volume adjustment device doubles with described volume adjustment factor to each frame in the voice data of described PCM form.

The present invention also provides a kind of audio frequency and video volume self-regulating method, comprising: the decapsulation step resolves to corresponding audio frame sequence and sequence of frames of video with several audio-video documents; The voice data decoding step is reduced to described audio frame sequence the voice data of PCM form; The volume calculation procedure is calculated the volume adjustment factor of the voice data of described PCM form; The volume regulating step to the processing of doubling of the voice data of described PCM form, obtains the voice data of the PCM form after volume is regulated with described volume adjustment factor; Coding step by the voice data of described PCM form is encoded, obtains the audio frame sequence behind recompile; Encapsulation step encapsulates described sequence of frames of video and described audio frame sequence behind recompile.

Preferably, in described volume calculation procedure, be the volume mean value of each frame in the voice data of the unit PCM form that calculates each audio-video document with certain byte length, calculate the volume mean value of all frames in the voice data of PCM form of each audio-video document then, with the volume mean value of all frames in the voice data of described PCM form as described volume coefficient.

Preferably, in described volume calculation procedure, when the audio-video document of reference volume coefficient is calculated in one in described several audio-video documents designated conduct, calculate the volume mean value of all frames in the voice data of PCM form of appointed audio-video document, with it as the reference volume coefficient, calculate the volume mean value of all frames in the voice data of current PCM form to be processed simultaneously and with its volume coefficient as the voice data of current PCM form to be processed, with the volume coefficient of described reference volume coefficient divided by the voice data of current PCM form to be processed, obtain described volume adjustment factor then.

Preferably, in described volume calculation procedure, when the audio-video document of reference volume coefficient is calculated in the designated conduct of all described several audio-video documents, calculate the volume mean value of all frames in the voice data of PCM form of described several audio-video documents, with it as the reference volume coefficient, calculate the volume mean value of all frames in the voice data of current PCM form to be processed simultaneously and with its volume coefficient as the voice data of current PCM form to be processed, with the volume coefficient of described reference volume coefficient divided by the voice data of current PCM form to be processed, obtain described volume adjustment factor then.

Preferably, in described volume regulating step, each frame in the voice data of described PCM form is doubled with described volume adjustment factor.

Technique scheme of the present invention can the user run into sound problem fluctuated when playing video file from solve prior art.Simultaneously, technical scheme of the present invention has been simplified the operation that the video volume is regulated, and can carry out this video volume in large quantity and regulate processing, thereby can improve the video volume is regulated the efficient of handling and has been reduced corresponding processing cost.

Description of drawings:

The structured flowchart of the audio frequency and video volume automatic regulating system that Fig. 1 relates to for embodiment of the present invention;

The structured flowchart of the decapsulation module of the audio frequency and video volume automatic regulating system that Fig. 2 relates to for embodiment of the present invention;

The structured flowchart of the decoder module of the audio frequency and video volume automatic regulating system that Fig. 3 relates to for embodiment of the present invention;

The structured flowchart of the audio volume computing module of the audio frequency and video volume automatic regulating system that Fig. 4 relates to for embodiment of the present invention;

The schematic diagram of the volume coefficient calculation unit of the audio frequency and video volume automatic regulating system that Fig. 5 relates to for embodiment of the present invention;

The schematic diagram of the reference volume coefficient calculation unit of the audio frequency and video volume automatic regulating system that Fig. 6 relates to for embodiment of the present invention;

The schematic diagram of the volume adjustment factor computing unit of the audio frequency and video volume automatic regulating system that Fig. 7 relates to for embodiment of the present invention;

The self-regulating flow chart of audio frequency and video volume that Fig. 8 relates to for embodiment of the present invention;

The flow chart of the automatic regulating step S12 of audio frequency and video volume that Fig. 9 relates to for embodiment of the present invention;

The flow chart of the automatic regulating step S13 of audio frequency and video volume that Figure 10 relates to for embodiment of the present invention;

The flow chart of the automatic regulating step S14 of audio frequency and video volume that Figure 11 relates to for embodiment of the present invention;

The flow chart of the automatic regulating step S16 of audio frequency and video volume that Figure 12 relates to for embodiment of the present invention;

The flow chart of the automatic regulating step S17 of audio frequency and video volume that Figure 13 relates to for embodiment of the present invention.

Embodiment:

Illustrated embodiment is set forth this invention with reference to the accompanying drawings below.The related multimedia processing apparatus of embodiment of the present invention can merge according to user's the requirement a plurality of audio-video documents with different volumes, forms the audio-video document with consistent volume.

Fig. 1 has shown the structured flowchart of the audio frequency and video volume automatic regulating system that embodiment of the present invention relates to.As shown in Figure 1, above-mentioned multimedia processing apparatus comprises audio, video data receiver module 1, decapsulation module 2, voice data decoder module 3, audio volume computing module 4, audio volume adjustment module 5, coding module 6, package module 7, memory module 8 and data/address bus 9.Wherein, audio, video data receiver module 1, decapsulation module 2, voice data decoder module 3, audio volume computing module 4, audio volume adjustment module 5, coding module 6, package module 7 and memory module 8 are connected by data/address bus 9.

Above-mentioned audio, video data receiver module 1 is used for receiving the audio, video data with different volumes (for example audio-video document 1, audio-video document 2 and audio-video document 3) that need merge, and the above-mentioned audio, video data that receives is delivered to above-mentioned decapsulation module 2 carries out corresponding decapsulation.Above-mentioned decapsulation module 2 is carried out decapsulation with the above-mentioned audio, video data that above-mentioned audio, video data receiver module 1 receives, in above-mentioned decapsulation process, above-mentioned decapsulation module 2 is descapsulated into sequence of frames of video and audio frame sequence with audio-video document, and above-mentioned sequence of frames of video and audio frame sequence are stored in the above-mentioned memory module 8.Then, above-mentioned voice data decoder module 3 is decoded the above-mentioned audio frame sequence that above-mentioned decapsulation module 2 obtains.In above-mentioned decode procedure, above-mentioned voice data decoder module 3 is reduced to the voice data of PCM form with above-mentioned audio frame sequence and is stored in the above-mentioned memory module 8.Above-mentioned audio volume computing module 4 calculate respectively each above-mentioned PCM form that above-mentioned decoder module 3 obtains voice data the volume coefficient and be stored in the above-mentioned memory module 8.Calculate the volume coefficient of voice data of all above-mentioned PCM forms when audio volume computing module 4 after, again according to user's setting, calculate the reference volume coefficient and be stored in the above-mentioned memory module 8.Then, the volume adjustment factor of audio volume computing module 4 voice data that calculates all described PCM forms respectively based on volume coefficient and the reference volume coefficient of the voice data of above-mentioned PCM form and being stored in the above-mentioned memory module 8.The processing of doubling of the voice data of 5 pairs of above-mentioned PCM forms of above-mentioned audio volume adjustment module, the multiplication constant that above-mentioned multiplication is handled are the corresponding volume adjustment factor of voice data of the above-mentioned PCM form that calculates of above-mentioned audio computer module 4.The voice data of the described PCM form after above-mentioned audio volume adjustment modules 5 multiplications of 6 pairs of above-mentioned coding modules are handled is encoded, and obtains the audio frame sequence the same with original coding and is stored in the above-mentioned memory module 8.Above-mentioned package module 7 is readjusted the timestamp of the above-mentioned audio frame sequence that above-mentioned sequence of frames of video that above-mentioned decapsulation module 2 obtains and above-mentioned coding module 6 obtain, respectively sequence of frames of video and the audio frame sequence of having adjusted timestamp merged then, at last the sequence of frames of video after the above-mentioned merging and audio frame sequence are encapsulated, export new video file.

The structured flowchart of the decapsulation module 2 of the audio frequency and video volume automatic regulating system that Fig. 2 relates to for embodiment of the present invention.As shown in Figure 2, above-mentioned decapsulation module 2 comprises audio-video document form judging unit 21, decapsulation selected cell 22 and

several decapsulation unit

23,24,25 ...Wherein, above-mentioned

several decapsulation unit

23,24,25 ... have different forms, can carry out decapsulation corresponding to different file formats.Above-mentioned audio-video document form judging unit 21 can be judged the file format of the audio frequency and video that receive from above-mentioned audio, video data receiver module 1, above-mentioned decapsulation selected cell 22 can be according to the judged result (being the form of audio-video document) of above-mentioned audio-video document form judging unit 21 from

decapsulation unit

23,24,25 ... the corresponding decapsulation unit of middle selection is carried out decapsulation to above-mentioned audio-video document, above-mentioned corresponding decapsulation unit (is above-mentioned

decapsulation unit

23,24,25 ... one of in) be corresponding sequence of frames of video and audio frame sequence with received above-mentioned audio-video document deblocking, they are stored in the above-mentioned memory module 8.

The structured flowchart of the voice data decoder module 3 of the audio frequency and video volume automatic regulating system that Fig. 3 relates to for embodiment of the present invention.As shown in Figure 3, above-mentioned voice data decoder module 3 comprises coded format judging unit 31, decoder selected cell 32 and

several decoder

33,34,35 ...Wherein, above-mentioned

several decoder

33,34,35 ... have different codec formats, can decode corresponding to different decoding requests.Above-mentioned coded format judging unit 31 can obtain above-mentioned audio frame sequence and judge the coded format of above-mentioned audio frame sequence from above-mentioned memory module 8.Above-mentioned decoder selected cell 32 can be according to the judged result (being the coded format of above-mentioned audio frame sequence) of above-mentioned coded format judging unit 31 from

decoder

33,34,35 ... the corresponding decoder of middle selection, above-mentioned corresponding decoder is decoded to the above-mentioned audio frame sequence of receiving, audio frame in the above-mentioned audio frame sequence is reduced to the voice data of PCM form, and the voice data of above-mentioned PCM form is stored in the above-mentioned memory module 8.

The structured flowchart of the audio volume computing module 4 of the audio frequency and video volume automatic regulating system that Fig. 4 relates to for embodiment of the present invention.As shown in Figure 4, above-mentioned audio volume computing module 4 comprises command reception unit 41, volume coefficient calculation unit 42, reference volume coefficient calculation unit 43 and volume adjustment factor computing unit 44.Above-mentioned command reception unit 41 can receive user's setting instruction, and instruction stored in the above-mentioned memory module 8, it is the reference volume coefficient that above-mentioned user instruction can be specified the volume coefficient of a certain audio-video document, and the mean value that also can specify the volume coefficient of all audio-video documents is the reference volume coefficient.Above-mentioned volume coefficient calculation unit 42 receives the voice data of the above-mentioned PCM form that above-mentioned voice data decoder module 3 obtains, the volume coefficient of the voice data by calculating above-mentioned PCM form, and above-mentioned volume coefficient storage in above-mentioned memory module 8.Said reference volume coefficient calculation unit 43 is at first taken out above-mentioned user instruction from above-mentioned memory module 8, from above-mentioned memory module 8, take out the volume coefficient that needs again by judging, calculate the reference volume coefficient at last and said reference volume coefficient storage in above-mentioned memory module 8.Above-mentioned volume adjustment factor computing unit 44 takes out the volume coefficient of said reference volume coefficient and each above-mentioned PCM format audio data from above-mentioned memory module 8, the volume adjustment factor of the voice data by calculating each PCM form, and the volume adjustment factor of the voice data of above-mentioned each PCM form stored in the above-mentioned memory module 8.

After above-mentioned audio volume adjustment module 5 receives the voice data of above-mentioned PCM form, from above-mentioned memory module 8, read earlier the audio frequency adjustment factor of corresponding above-mentioned PCM format audio data, again to the processing of doubling of above-mentioned PCM format audio data, the voice data of the PCM form after obtaining handling.

The schematic diagram of the volume coefficient calculation unit 42 of the audio frequency and video volume automatic regulating system that Fig. 5 relates to for embodiment of the present invention.As shown in Figure 5, above-mentioned volume coefficient calculation unit 42 is that a unit asks volume mean value to each frame in the voice data of the above-mentioned PCM form of each audio-video document with 2 byte lengths at first, and computing formula is as follows:

(P=(V ₁+V ₂+……+V _n)/n （1）

(wherein, P be frame mean value, V for the value of each unit, n be that the byte number of each frame is divided by 2 for the units of each frame).

Then, above-mentioned volume coefficient calculation unit 42 is asked volume mean value to the above-mentioned PCM format frame sequence (and whole frames) of each audio-video document again, and computing formula is as follows:

A=(P ₁+P ₂+……+P _f)/f （2）

(wherein, A is the volume coefficient, and P is frame mean value, and f is the frame number of PCM frame sequence).

At last, above-mentioned volume coefficient calculation unit 42 draws described volume coefficient, and above-mentioned volume coefficient storage in above-mentioned memory module 8.

The schematic diagram of the reference volume coefficient calculation unit 43 of the audio frequency and video volume automatic regulating system that Fig. 6 relates to for embodiment of the present invention.As shown in Figure 6, said reference volume coefficient calculation unit 43 is at first obtained user instruction from above-mentioned memory module 8, selects which kind of mode to calculate said reference volume coefficient according to above-mentioned user instruction judgement again.If when the user specifies the volume coefficient of a certain audio-video document to be the reference volume coefficient, calculate with following computing formula:

T=A （3）

(wherein, T is the reference volume coefficient, and A is the volume coefficient of user's designated tone video).

That is, said reference volume coefficient calculation unit 43 reads the volume coefficient of voice data of PCM form of a certain audio-video document of above-mentioned appointment, and with this volume coefficient as the reference volume coefficient, and with the reference volume coefficient storage in above-mentioned memory module 8.

If when the user specifies the mean value of the volume coefficient of all audio-video documents to be the reference volume coefficient, calculate with following computing formula:

T=(A ₁+A ₂+……A _m)/m （4）

(wherein, T is the reference volume coefficient, and A is the volume coefficient of each audio frequency and video, and m is the number of PCM format audio data).

Wherein, said reference volume coefficient calculation unit 43 reads the volume coefficient of voice data of the PCM form of whole audio-video documents, and calculates the reference volume coefficient according to above-mentioned formula (4), the reference volume coefficient storage in above-mentioned memory module 8.

The schematic diagram of the volume adjustment factor computing unit 44 of the audio frequency and video volume automatic regulating system that Fig. 7 relates to for embodiment of the present invention.As shown in Figure 7, above-mentioned volume is adjusted coefficient calculation unit 44 reads earlier the voice data of said reference volume coefficient and above-mentioned PCM form from above-mentioned memory module 8 volume coefficient, calculate the volume adjustment factor of PCM format audio data again, above-mentioned volume adjustment factor calculates with following account form (formula 5), and above-mentioned volume adjustment factor is saved in the above-mentioned memory module 8.

ε=T/A （5）

(wherein, ε is the volume adjustment factor, and T is the reference volume coefficient, A by the described volume coefficient of voice data of the described PCM form of processing).

The video volume that Fig. 8 relates to for embodiment of the present invention customizes the structured flowchart of the coding module 6 of regulating system.As shown in Figure 8, above-mentioned coding module 6 comprises encoder selected cell 61,

several encoders

62,63,64 ...Above-mentioned encoder selected cell 61 can obtain the voice data that carries out the new PCM form after volume is regulated through above-mentioned audio volume adjustment module 5 from above-mentioned memory module 8.It (is above-mentioned

encoder

62,63,64 that above-mentioned encoder selected cell 61 can be selected corresponding encoder according to the coded format parameter of former audio frame sequence or user's specified coding form ... one of in) voice data of above-mentioned PCM form after volume is regulated is carried out recompile, form new audio frame sequence, and new audio frame sequence is stored in the above-mentioned memory module 8.

The video volume that Fig. 9 relates to for embodiment of the present invention customizes the structured flowchart of the package module 7 of regulating system.As shown in Figure 9, above-mentioned package module 7 comprises encapsulation format selected cell 71 and

several encapsulation units

72,73,74 ...Wherein, above-mentioned

several encapsulation units

72,73,74 ... have different encapsulation format, can encapsulate corresponding to the requirement of different encapsulation format.

It (is

encapsulation unit

72,73,74 that above-mentioned encapsulation format selected cell 71 can be selected corresponding encapsulation unit according to the file encapsulation format of the relevant parameter of former audio-video document encapsulation format or user's appointment ... one of in).Above-mentioned encapsulation unit encapsulates above-mentioned sequence of frames of video and new audio frame sequence by adjusting the timestamp of each sequence of frames of video and audio frame sequence, the audio-video document after obtaining to regulate.

The self-regulating flow chart of audio frequency and video volume that Figure 10 relates to for embodiment of the present invention.Below, the self-regulating processing procedure of video volume that explanation relates in present embodiment with reference to Figure 10.

At first, the input audio-video document, above-mentioned audio, video data receiver module 1 receives above-mentioned audio-video document data and it is delivered to above-mentioned decapsulation module 2(step S11).2 pairs of above-mentioned audio-video document data of above-mentioned decapsulation module are carried out decapsulation, and above-mentioned audio-video document data are resolved to sequence of frames of video and audio frame sequence, and above-mentioned sequence of frames of video and audio frame sequence are stored in the above-mentioned memory module 8 (step S12).

Above-mentioned voice data decoder module 3 obtains above-mentioned audio frame sequence from above-mentioned memory module 8, audio frame in the above-mentioned audio frame sequence is reduced to the voice data of PCM form, and the voice data of above-mentioned PCM form is stored in the above-mentioned memory module 8 (step S13).

Above-mentioned audio volume computing module 4 calculates the volume coefficient of the voice data of each PCM form that obtains among the step S13 respectively, calculate the volume coefficient of voice data of all PCM forms when above-mentioned audio volume computing module 4 after, setting according to the user calculates the reference volume coefficient again, calculate the volume adjustment factor of the voice data of all PCM forms at last respectively, and above-mentioned volume adjustment factor is stored in the above-mentioned memory module 8 (step S14).

The above-mentioned volume adjustment factor that above-mentioned audio volume adjustment module 5 obtains based on above-mentioned audio volume computing module 4 is to the processing of doubling of the voice data of above-mentioned PCM form, the voice data of above-mentioned PCM form after multiplication is handled is stored in the above-mentioned memory module 8 (step S15), and wherein the multiplication constant handled of above-mentioned multiplication is the corresponding volume adjustment factor of voice data of the above-mentioned PCM form that calculates among the step S14.

Above-mentioned coding module 6 obtains the voice data of the PCM form after volume is regulated from above-mentioned memory module 8, and according to the coded format of former audio frame sequence or user's specified coding form the voice data of the PCM form after regulating through volume is encoded, form new audio frame sequence and store in the above-mentioned memory module 8 (step S16).

Above-mentioned package module 7 obtains sequence of frames of video and new audio frame sequence from above-mentioned memory module 8, regulate the timestamp of each sequence of frames of video and new audio frame sequence, respectively each sequence of frames of video of having adjusted timestamp and new audio frame sequence are merged then, encapsulate according to the encapsulation format of former audio-video document or the encapsulation format of user's appointment at last, form new audio-video document (step S17).

The flow chart of the self-regulating step S12 of video volume that Figure 11 relates to for embodiment of the present invention.Below, the audio-video document that explanation relates in present embodiment with reference to Figure 11 carries out the decapsulation processing procedure.

Audio file formats judging unit 21 judge based on the audio-video document data that receive the encapsulation format of the audio-video document that receives judged result is transported to decapsulation selected cell 22(step S121).Above-mentioned decapsulation selected cell 22 is selected corresponding decapsulation unit (step S122) based on above-mentioned judged result.Above-mentioned decapsulation unit (is

decapsulation unit

23,24,25 ... one of in) above-mentioned audio-video document data are carried out decapsulation, above-mentioned audio-video document data are resolved to sequence of frames of video and audio frame sequence (step S123), and sequence of frames of video and audio frame sequence are stored in the above-mentioned memory module 8 (step S124).

The flow chart of the self-regulating step S13 of video volume that Figure 12 relates to for embodiment of the present invention.Below, explanation is carried out decoding process what present embodiment related to voice data with reference to Figure 12.

The coded format of the audio frame sequence after 31 pairs of decapsulations of coded format judging unit (for example MP3, AAC, AC-3 etc.) is judged, and judged result is transported to decoder selected cell 32(step S131).Above-mentioned decoder selected cell 32 is selected based on above-mentioned judged result and the corresponding decoder of above-mentioned coded format is decoded (step S132), above-mentioned decoder (is

decoder

33,34,35 ... one of in) audio frame in the above-mentioned audio frame sequence is reduced to the voice data (step S133) of PCM form, and the voice data of above-mentioned PCM form is stored to (step S134) in the memory module 8.

The flow chart of the self-regulating step S14 of video volume that Figure 13 relates to for embodiment of the present invention.Below, the processing procedure that the audio volume coefficient is calculated that relates in present embodiment with reference to Figure 13 explanation.

Above-mentioned volume coefficient calculation unit 42 is volume average value P (step S141) in unit and the audio data frame of obtaining the PCM form based on above-mentioned formula (1) to each frame in the voice data of above-mentioned PCM form with 2 byte lengths at first, the volume mean value A(that obtains the audio data frame of all PCM forms in the current PC M frame sequence according to above-mentioned formula (2) is the volume coefficient then), and it is stored to (step S142) in the memory module 8.Said reference volume coefficient calculation unit 43 judges whether the user imports instruction (step S143).If do not receive instruction (the step S143: not), wait for user input instruction that the user imports.If receive the instruction (step S143: be) that the user imports, judge that whether the user specifies the volume coefficient of a certain audio-video document is reference volume coefficient (step S144).If it is reference volume coefficient (step S144: be) that the user specifies the volume coefficient of a certain audio-video document, said reference volume coefficient calculation unit 43 reads the volume coefficient A of the audio-video document of above-mentioned appointment, and calculates reference volume coefficient (step S145) based on above-mentioned formula (3).If it is reference volume coefficient T (step S144: not) that the user specifies the mean value of the volume coefficient of all audio-video documents, said reference volume coefficient calculation unit 43 reads the volume coefficient A of the voice data of all PCM forms, and calculates reference volume coefficient T (step S146) based on above-mentioned formula (4).The reference volume coefficient T of above-mentioned acquisition is stored to (step S147) in the memory module 8.

Above-mentioned volume is adjusted coefficient calculation unit 44 is obtained the voice data of said reference volume coefficient T and current PC M form from memory module 8 volume coefficient A, and calculate the volume adjustment factor ε of the voice data of current PC M form based on above-mentioned formula (5), and it is stored to (step S148) in the memory module 8.

Above-mentioned audio volume adjustment module 5 is obtained above-mentioned volume adjustment factor ε from memory module 8, the volume of the voice data of current PC M form is regulated, and namely the volume of the voice data of current PC M form is trained to increase processing (step S15).

The flow chart of the step S16 that Figure 14 encodes for the PCM formatted data to regulating after the volume that embodiment of the present invention relates to.Below, with reference to Figure 14 explanation present embodiment relate to the PCM formatted data processing procedure of encoding.

Encoder selected cell 61 obtains above-mentioned PCM formatted data (step S161) after regulating volume from above-mentioned memory module 8.Encoder selected cell 41 is selected corresponding encoder (step S162) based on coded format or user's specified coding form of former audio frame sequence.Above-mentioned chosen encoder is encoded to above-mentioned PCM formatted data after regulating volume, obtaining new audio frame sequence (step S163), and the new audio frame sequence that obtains is stored in (step S164) in the memory module 8.

The flow chart of the step S17 that sequence of frames of video and new audio frame sequence are encapsulated that Figure 15 relates to for embodiment of the present invention.Below, explanation is carried out the encapsulation process process what present embodiment related to sequence of frames of video and new audio frame sequence with reference to Figure 15.

Above-mentioned encapsulation format selected cell 71 obtains sequence of frames of video and new audio frame sequence (step S171) from above-mentioned memory module 8.Above-mentioned encapsulation format selected cell 71 is selected corresponding encapsulation unit (step S172) based on the encapsulation format of former audio-video document or the encapsulation format of user's appointment, above-mentioned encapsulation unit encapsulates above-mentioned sequence of frames of video and above-mentioned new audio frame sequence after the adjustment of above-mentioned sequence of frames of video and above-mentioned new audio frame sequence being carried out timestamp again, form new audio-video document (step S173), and export new audio-video document (step S174).

Should understand embodiment described in the above specification and embodiment only is used for explanation the present invention and is not used in and limits the scope of the invention.After having read the present invention, those skilled in the art all fall within the application's claims institute restricted portion to the modification of various equivalents of the present invention.

Claims

1. self-regulating system of audio frequency and video volume comprises:

De-encapsulating devices is used for several audio-video documents are resolved to corresponding audio frame sequence and sequence of frames of video;

Audio data decoding apparatus is for the voice data that described audio frame sequence is reduced to the PCM form;

The volume calculation element is used for the volume adjustment factor of the voice data of described PCM form;

Volume adjustment device to the processing of doubling of the voice data of described PCM form, obtains the voice data of the PCM form after volume is regulated with described volume adjustment factor;

Code device by the voice data of described PCM form is encoded, obtains the audio frame sequence behind recompile;

Packaging system is used for described sequence of frames of video and described audio frame sequence behind recompile are encapsulated.

2. system according to claim 1, it is characterized in that: described volume calculation element is the volume mean value of each frame in the voice data of the unit PCM form that calculates each audio-video document with certain byte length, calculate the volume mean value of all frames in the voice data of PCM form of each audio-video document then, with the volume mean value of all frames in the voice data of described PCM form as the volume coefficient.

3. system according to claim 2, it is characterized in that: described volume calculation element is at the volume mean value of all frames in the voice data of one in described several audio-video documents designated PCM form that calculates appointed audio-video document as the audio-video document that calculates the reference volume coefficient, with it as the reference volume coefficient, calculate the volume mean value of all frames in the voice data of current PCM form to be processed simultaneously and with its volume coefficient as the voice data of current PCM form to be processed, with the volume coefficient of described reference volume coefficient divided by the voice data of current PCM form to be processed, obtain described volume adjustment factor then.

4. system according to claim 2, it is characterized in that: described volume calculation element calculates the volume mean value of all frames in the voice data of PCM form of described several audio-video documents at described several audio-video documents, with it as the reference volume coefficient, calculate the volume mean value of all frames in the voice data of current PCM form to be processed simultaneously and with its volume coefficient as the voice data of current PCM form to be processed, with the volume coefficient of described reference volume coefficient divided by the voice data of current PCM form to be processed, obtain described volume adjustment factor then.

5. according to any described system in the claim 2～4, it is characterized in that:

The computational methods of described volume mean value are as follows:

P=(V ₁+V ₂+……+V _n)/n，

Wherein, P is the volume mean value of each frame, and V is the volume value of each unit, and n is the units of each frame;

The computational methods of the volume mean value of all frames are as follows:

A=(P ₁+P ₂+……+P _f)/f，

Wherein, A is the volume mean value of all frames, and P is the volume mean value of each frame, and f is the frame number in the voice data of PCM form.

6. system according to claim 5, it is characterized in that: described certain byte length is 2 bytes.

7. according to any described system in the claim 1～4, it is characterized in that:

Described volume adjustment device doubles with described volume adjustment factor to each frame in the voice data of described PCM form.

8. self-regulating method of audio frequency and video volume comprises:

The decapsulation step resolves to corresponding audio frame sequence and sequence of frames of video with several audio-video documents;

The voice data decoding step is reduced to described audio frame sequence the voice data of PCM form;

The volume calculation procedure is calculated the volume adjustment factor of the voice data of described PCM form;

The volume regulating step to the processing of doubling of the voice data of described PCM form, obtains the voice data of the PCM form after volume is regulated with described volume adjustment factor;

Coding step by the voice data of described PCM form is encoded, obtains the audio frame sequence behind recompile;

Encapsulation step encapsulates described sequence of frames of video and described audio frame sequence behind recompile.

9. method according to claim 8, it is characterized in that: in described volume calculation procedure, be the volume mean value of each frame in the voice data of the unit PCM form that calculates each audio-video document with certain byte length, calculate the volume mean value of all frames in the voice data of PCM form of each audio-video document then, with the volume mean value of all frames in the voice data of described PCM form as described volume coefficient.

10. method according to claim 9, it is characterized in that: in described volume calculation procedure, when the audio-video document of reference volume coefficient is calculated in one in described several audio-video documents designated conduct, calculate the volume mean value of all frames in the voice data of PCM form of appointed audio-video document, with it as the reference volume coefficient, calculate the volume mean value of all frames in the voice data of current PCM form to be processed simultaneously and with its volume coefficient as the voice data of current PCM form to be processed, with the volume coefficient of described reference volume coefficient divided by the voice data of current PCM form to be processed, obtain described volume adjustment factor then.

11. method according to claim 9, it is characterized in that: in described volume calculation procedure, when the audio-video document of reference volume coefficient is calculated in the designated conduct of all described several audio-video documents, calculate the volume mean value of all frames in the voice data of PCM form of described several audio-video documents, with it as the reference volume coefficient, calculate the volume mean value of all frames in the voice data of current PCM form to be processed simultaneously and with its volume coefficient as the voice data of current PCM form to be processed, with the volume coefficient of described reference volume coefficient divided by the voice data of current PCM form to be processed, obtain described volume adjustment factor then.

12. according to any described method in the claim 9～11, it is characterized in that:

The computational methods of described volume mean value are as follows:

P=(V ₁+V ₂+……+V _n)/n，

A=(P ₁+P ₂+……+P _f)/f，

13. method according to claim 12 is characterized in that: described certain byte length is 2 bytes.

14. any described method according to Claim 8～11 is characterized in that:

In described volume regulating step, each frame in the voice data of described PCM form is doubled with described volume adjustment factor.