CN103796006A - System and method for labial sound synchronization test - Google Patents

System and method for labial sound synchronization test Download PDF

Info

Publication number
CN103796006A
CN103796006A CN201210424020.2A CN201210424020A CN103796006A CN 103796006 A CN103796006 A CN 103796006A CN 201210424020 A CN201210424020 A CN 201210424020A CN 103796006 A CN103796006 A CN 103796006A
Authority
CN
China
Prior art keywords
sound
image
saltus step
moment
energy
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201210424020.2A
Other languages
Chinese (zh)
Inventor
李立世
张成华
梅卫星
张学英
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ZTE Corp
Original Assignee
ZTE Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZTE Corp filed Critical ZTE Corp
Priority to CN201210424020.2A priority Critical patent/CN103796006A/en
Priority to PCT/CN2013/083838 priority patent/WO2014067364A1/en
Publication of CN103796006A publication Critical patent/CN103796006A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • H04N21/4394Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/4302Content synchronisation processes, e.g. decoder synchronisation
    • H04N21/4307Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen
    • H04N21/43072Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen of multiple content streams on the same device
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
    • H04N21/44008Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream

Abstract

The invention discloses a labial sound synchronization test system. The system comprises a playing device used for playing a multimedia file with image and sound which have energy jump and are synchronous; a tested device which are connected with the playing device and used for receiving a first image signal and a first sound signal which are output when the playing device plays the multimedia file and outputting a second image signal corresponding the first image signal and a second sound signal corresponding to the first sound signal; and a test device which is connected with the tested device and used for receiving the second image signal and the second sound signal and outputting a second image energy variation diagram corresponding to the second image signal and a second sound energy variation diagram corresponding to the second sound signal. The test device is also used for detecting whether an image energy hop exists in the second image energy variation diagram and detecting whether a sound energy hop exists in the second sound energy variation diagram.

Description

A kind of System and method for of labial synchronization test
Technical field
The present invention relates to technical field of measurement and test, relate in particular to a kind of System and method for of labial synchronization test.
Background technology
Along with the development of network technology and the communication technology, increasing application all relates to the transmission of multi-medium data, as: during by QQ Video chat, just the multi-medium data that comprises vision signal and voice signal need to be transferred to another or multiple client terminal from a client terminal; In the time carrying out video conference, just the multi-medium data of the vision signal that comprises a participant and voice signal need to be given to another conference terminal of another participant from a meeting terminal transmission.
Due to various reasons, will cause in the prior art the nonsynchronous situation of image and sound, as: due to Internet Transmission, cause voice signal to be first received, after picture signal, be received, and then cause the asynchronous of sound and image.
In above-mentioned various reasons, wherein a kind of important reason is, because Equipment causes, as: this just exists recipient's TV conference system and postpones in the time processing, and then causes image asynchronous with sound.So, in use, TV conference system is tested, determine whether it is normally very important.
For this reason, the method that prior art adopts is conventionally as follows:
Adopt the mode of artificial subjective judgement to carry out, that is: combine by microphone and camera, image and sound are gathered, people, in hearing sound, goes to observe image change by eyes, so subjectively judge this sound and image whether synchronous.
Realizing in the process of technical scheme in the embodiment of the present application, inventor finds that in prior art, at least there are the following problems:
1, due in the prior art, that employing manual type removes to control microphone and camera goes sound and image to gather, and in practice, deactivation microphone and camera simultaneously, so existence can not guarantee image and sound to carry out the technical problem of synchronous acquisition;
2, due in the prior art, by two of microphone and cameras independently equipment sound and image are gathered, so, in the time being input to tested equipment, just there is the technical problem that can not guarantee at synchronization, picture signal and voice signal to be input to tested equipment;
3, due in the prior art, in the time judging, be to adopt artificial mode to judge, subjectivity and randomness are larger, so, there is the technical problem that judged result accuracy is very low.
Summary of the invention
The embodiment of the present application is by a kind of system and method for labial synchronization test is provided, in order to solve the very low technical problem of judged result accuracy existing in prior art.
For solving the problems of the technologies described above, the embodiment of the present application provides a kind of labial synchronization test macro on the one hand, comprising:
Playback equipment, all has energy saltus step and synchronous multimedia file for playing image and sound;
Tested equipment, be connected with described playback equipment, the first picture signal and the first sound tone signal in the time playing described multimedia file, exported for receiving described playback equipment, and output second picture signal corresponding with described the first picture signal, and the second sound signal corresponding with described first sound tone signal;
Testing equipment, with described tested equipment connection, be used for receiving described the second picture signal and described second sound signal, and the output second image energy variation diagram corresponding with described the second picture signal, and the second sound energy changing figure corresponding with described second sound signal;
Wherein, whether described testing equipment also has image energy saltus step for detection of described the second image energy variation diagram, and in the time having described image energy saltus step, is recorded in the first moment while there is described image energy saltus step; Detect described the second sound energy changing figure and whether have acoustic energy saltus step, and in the time having described acoustic energy saltus step, be recorded in the second moment while there is described acoustic energy saltus step, then, based on described the first moment and described the second moment, obtain the labial synchronization amount of delay of described tested equipment.
Further, described playback equipment comprises:
Play chip, for playing described multimedia file, export described the first picture signal and described first sound tone signal;
The first image output interface, is connected with described broadcasting chip, for exporting described the first picture signal;
The first sound output interface, is connected with described broadcasting chip, for exporting described first sound tone signal.
Further, described tested equipment comprises:
The first image input interface, is connected with described the first image output interface, for receiving described the first picture signal;
The first sound input interface, is connected with described the first sound output interface, for receiving described first sound tone signal;
Processing module, be connected with described the first image input interface and described the first sound input interface, and after described the first picture signal receiving is processed, export described the second picture signal, to after receiving described first sound tone signal and processing, export described second sound signal;
The second image output interface, is connected with described processing module, for exporting described the second picture signal;
The second sound output interface, is connected with described processing module, for exporting described second sound signal.
Further, described testing equipment comprises:
The second image input interface, is connected with described the second image output interface, for receiving described the second picture signal;
The second sound input interface, is connected with described the second sound output interface, for receiving described second sound signal;
Display unit, is connected with described the second image input interface and described the second sound input interface, for exporting the second image energy variation diagram corresponding with described the second picture signal, and the second sound energy changing figure corresponding with described second sound signal;
Test module, is connected with described display unit, whether has image energy saltus step for detection of described the second image energy variation diagram, and in the time having described image energy saltus step, is recorded in the first moment while there is described image energy saltus step; Detect described the second sound energy changing figure and whether have acoustic energy saltus step, and in the time having described acoustic energy saltus step, be recorded in the second moment while there is described acoustic energy saltus step, then, based on described the first moment and described the second moment, obtain the labial synchronization amount of delay of described tested equipment.
Further, be connected with described the first image output interface at described the second image input interface, when described the second sound input interface is connected with described the first sound output interface, described display unit, also for exporting the first image energy variation diagram corresponding with described the first picture signal, and the first sound energy changing figure corresponding with described first sound tone signal;
Whether described test module, also have image energy saltus step for detection of described the first image energy variation diagram, and in the time having described image energy saltus step, be recorded in the 3rd moment while there is described image energy saltus step; Detect described the first sound energy changing figure and whether have acoustic energy saltus step, and in the time having described acoustic energy saltus step, be recorded in the 4th moment while there is described acoustic energy saltus step, then, based on described the 3rd moment and described the 4th moment, obtain for characterizing the time error between described multimedia file image and sound; And based on described the first moment, described the second moment and described time error, obtain described labial synchronization amount of delay.
Further, described tested equipment is specially: TV conference system, video chat system or video monitoring system.
On the other hand, in the embodiment of the present application, also provide a kind of labial synchronization method of testing, be applied in labial synchronization test macro, described system comprises: playback equipment, and tested equipment, is connected with described playback equipment, testing equipment, with described tested equipment connection, described method comprises:
Play an image and sound all has energy saltus step and synchronous multimedia file by described playback equipment;
Receive by described tested equipment the first picture signal and the first sound tone signal that described playback equipment is exported in the time playing described multimedia file, and output second picture signal corresponding with described the first picture signal, and the second sound signal corresponding with described first sound tone signal;
Receive described the second picture signal and described second sound signal by described testing equipment, and the output second image energy variation diagram corresponding with described the second picture signal, and the second sound energy changing figure corresponding with described second sound signal;
Whether have image energy saltus step, and in the time having described image energy saltus step, be recorded in the first moment while there is described image energy saltus step if detecting described the second image energy variation diagram by described testing equipment; Whether have acoustic energy saltus step, and in the time having described acoustic energy saltus step, be recorded in the second moment while there is described acoustic energy saltus step if detecting described the second sound energy changing figure;
Based on described the first moment and described the second moment, obtain the labial synchronization amount of delay of described tested equipment by described testing equipment.
Further, described by described testing equipment based on described the first moment and described the second moment, before obtaining the labial synchronization amount of delay of described tested equipment, described method also comprises:
In the time that described testing equipment is directly connected with described playback equipment, by the described testing equipment output first image energy variation diagram corresponding with described the first picture signal, and the first sound energy changing figure corresponding with described first sound tone signal;
Whether have image energy saltus step, and in the time having described image energy saltus step, be recorded in the 3rd moment while there is described image energy saltus step if detecting the first image energy variation diagram corresponding with described the first picture signal by described testing equipment; Detect described the first sound energy changing figure and whether have acoustic energy saltus step, and in the time having described acoustic energy saltus step, be recorded in the 4th moment while there is described acoustic energy saltus step, then, based on described the 3rd moment and described the 4th moment, obtain for characterizing the time error between described multimedia file image and sound.
Further, described by described testing equipment based on described the first moment and described the second moment, obtain the labial synchronization amount of delay of described tested equipment, be specially:
By described testing equipment, based on described the first moment, described the second moment and described time error, obtain described labial synchronization amount of delay.
Further, described tested equipment is specially: TV conference system, video chat system or video monitoring system.
The application, by the one or more technical schemes that provide above, at least has following beneficial effect or advantage:
1, due to picture signal with voice signal comes from an image and sound all has energy saltus step and synchronous multimedia file, so, efficiently solve in prior art and to have the technical problem that can not guarantee image and sound to carry out synchronous acquisition, realized the synchronous generation of picture signal and voice signal and the technique effect of acquisition;
2, because employing playback equipment is play an image and sound all has energy saltus step and synchronous multimedia file, so, can export synchronous picture signal and voice signal simultaneously, and be input to tested equipment, and then effectively solve in the time being input to tested equipment, existence can not guarantee at synchronization, picture signal and voice signal to be input to the technical problem of tested equipment, so, realize the synchronous technique effect of inputting tested equipment of picture signal and voice signal;
3, because whether the picture signal and the voice signal that adopt testing equipment automatically to detect tested equipment output have energy saltus step, and the energy saltus step relative time of computed image signal and voice signal is poor in the time having energy saltus step, determine labial amount of delay with this, efficiently solve in prior art and have the technical problem that judged result accuracy is very low, realized the technique effect of the labial synchronization delay data of accurate quantification.
Accompanying drawing explanation
Fig. 1 is the structure chart of realizing labial synchronization test macro in the embodiment of the present application one;
Fig. 2 is the structure chart of playback equipment in system in the embodiment of the present application one;
Fig. 3 is the structure chart of equipment under test in system in the embodiment of the present application one;
Fig. 4 is the structure chart of testing equipment in system in the embodiment of the present application one;
Fig. 5 is image and the acoustic energy saltus step figure that in the embodiment of the present application one, in system, testing equipment is exported;
Fig. 6 is the flow chart of realizing labial synchronization method of testing in the embodiment of the present application two.
Embodiment
The embodiment of the present application is by a kind of system and method for labial synchronization test is provided, in order to solve the very low technical problem of judged result accuracy existing in prior art.
Technical scheme in the embodiment of the present application is for solving the problems of the technologies described above, and general thought is as follows:
All there are energy saltus step and synchronous multimedia file by playback equipment playing image and sound;
By tested equipment, be connected with described playback equipment, the first picture signal and the first sound tone signal in the time playing described multimedia file, exported for receiving described playback equipment, and output second picture signal corresponding with described the first picture signal, and the second sound signal corresponding with described first sound tone signal;
Pass through testing equipment, with described tested equipment connection, be used for receiving described the second picture signal and described second sound signal, and the output second image energy variation diagram corresponding with described the second picture signal, and the second sound energy changing figure corresponding with described second sound signal;
Wherein, whether described testing equipment also has image energy saltus step for detection of described the second image energy variation diagram, and in the time having described image energy saltus step, is recorded in the first moment while there is described image energy saltus step; Detect described the second sound energy changing figure and whether have acoustic energy saltus step, and in the time having described acoustic energy saltus step, be recorded in the second moment while there is described acoustic energy saltus step, then, based on described the first moment and described the second moment, obtain the labial synchronization amount of delay of described tested equipment.
Visible, because whether the picture signal and the voice signal that adopt testing equipment automatically to detect tested equipment output have energy saltus step, and the energy saltus step relative time of computed image and sound is poor in the time having energy saltus step, determine labial amount of delay with this, efficiently solve in prior art and have the technical problem that judged result accuracy is very low, realized the technique effect of the labial synchronization delay data of accurate quantification.
In order better to understand technique scheme, below in conjunction with Figure of description and concrete execution mode, technique scheme is described in detail.
Embodiment mono-:
In embodiment mono-, a kind of labial synchronization test macro is provided, please refer to Fig. 1, the test macro in the embodiment of the present application comprises:
Playback equipment 101, all there are energy saltus step and synchronous multimedia file for playing image and sound, specifically, described multimedia file is specifically as follows: one section of film video, one section of music MV, or the special video of making, but no matter be what video, all need image and sound wherein all to have energy saltus step, and need image and synchronous sound.Existence form for multimedia file can be: CD, or be stored in USB memory device.
Tested equipment 102, be connected with described playback equipment 101, the first picture signal and the first sound tone signal in the time playing described multimedia file, exported for receiving described playback equipment 101, and output second picture signal corresponding with described the first picture signal, and the second sound signal corresponding with described first sound tone signal, specifically, described tested equipment is specifically as follows: TV conference system, video chat system or video monitoring system, all tested equipment all relate to the encoding and decoding to picture signal and voice signal.
Testing equipment 103, be connected with described tested equipment 102, be used for receiving described the second picture signal and described second sound signal, and the output second image energy variation diagram corresponding with described the second picture signal, and the second sound energy changing figure corresponding with described second sound signal, specifically, described testing equipment is specifically as follows: oscilloscope etc. have the energy acquisition equipment of at least two passages, and can display image signals and moment of voice signal energy saltus step.
Wherein, whether described testing equipment 103 also has image energy saltus step for detection of described the second image energy variation diagram, and in the time having described image energy saltus step, is recorded in the first moment while there is described image energy saltus step; Detect described the second sound energy changing figure and whether have acoustic energy saltus step, and in the time having described acoustic energy saltus step, be recorded in the second moment while there is described acoustic energy saltus step, then, based on described the first moment and described the second moment, obtain the labial synchronization amount of delay of described tested equipment.
Below, in connection with Fig. 2, Fig. 3 Fig. 4 and Fig. 5, be described for concrete structure and the operation principle of every part in the test macro in the embodiment of the present application:
Please refer to Fig. 2, in the embodiment of the present application, described playback equipment 101 is specifically as follows common DVD player, can be also Blu-ray player, can be also one to have the computer of playing function etc., just differs one to have given an example in this embodiment of the present application.Specifically, the structure of playback equipment 101, specifically can comprise:
Play chip 1011, for playing described multimedia file, export described the first picture signal and described first sound tone signal;
The first image output interface 1012, is connected with described broadcasting chip 1011, for exporting described the first picture signal;
The first sound output interface 1013, is connected with described broadcasting chip 1011, for exporting described first sound tone signal.
When described playback equipment 101 is specially DVD player, when the existence form of described multimedia file is CD, the specific works process of described playback equipment 101 is as follows: the CD-ROM drive of first described CD being put into described DVD player; Then, described DVD player will detect described CD in CD-ROM drive; Next, the broadcasting chip 1011 in DVD player will carry out read operation to described CD, obtains described multimedia file, and described multimedia file is carried out to play operation; Finally, the first image signal output of exporting by broadcasting, to described the first image output interface 1012, is outputed to described the first sound output interface 1013 by the first sound tone signal of exporting by broadcasting.In actual applications, the first image output interface 1012 and the first sound output interface 1013 can be two physically separated interfaces; Also can be the interface integrating.Be specially which kind of interface, the embodiment of the present application, at this, is not restricted.
Tested equipment 102, in the embodiment of the present application, is specifically as follows TV conference system, video chat system or video monitoring system.Please refer to Fig. 3, tested equipment 102 specifically can comprise:
The first image input interface 1021, is connected with described the first image output interface 1012, for receiving described the first picture signal.
The first sound input interface 1022, is connected 1013 with described the first sound output interface, for receiving described first sound tone signal.
Processing module 1023, with described the first image input interface 1021, and described the first sound input interface 1022 connects, and after described the first picture signal receiving is processed, export described the second picture signal, to after receiving described first sound tone signal and processing, export described second sound signal.In the time that tested equipment 102 is specially TV conference system, processing module 1023 is specially video conferencing functional module; When tested equipment 102 is specially video chat system, during as QQ chat system, processing module 1023 is specially QQ chat feature module.
The second image output interface 1024, is connected with described processing module 1023, for exporting described the second picture signal;
The second sound output interface 1025, is connected with described processing module 1023, for exporting described second sound signal.
In the embodiment of the present application, known in conjunction with Fig. 1, tested equipment 102, to be connected with described playback equipment 101, the first picture signal and the first sound tone signal in the time playing described multimedia file, exported for receiving described playback equipment 101, and output second picture signal corresponding with described the first picture signal, and the second sound signal corresponding with described first sound tone signal.
Specifically, in the present embodiment, in the time that described tested equipment 102 is specially TV conference system, described tested equipment 102 courses of work are as follows: the first image input interface 1021 of described TV conference system, the first sound input interface 1022, respectively with described playback equipment 101 in the first image output interface 1012, the first sound output interface 1013 connects, for receiving picture signal and the voice signal that described playback equipment 101 is exported, process operation through described processing module 1023, finally by second image signal output corresponding with the first picture signal after treatment to the second image output interface 1024, the second sound signal corresponding with first sound tone signal after treatment outputed to the second sound output interface 1025.
In the embodiment of the present application, testing equipment 103, be specifically as follows: the oscilloscope with at least two passages, this oscilloscope automatically detected image signal and voice signal energy saltus step and record moment of saltus step, specifically, please refer to Fig. 4, the testing equipment 103 in the embodiment of the present application comprises:
Detection module 1033, for detection of described the second picture signal, and whether described second sound signal energy changing figure has energy saltus step;
The second image input interface 1031, is connected with described detection module 1033, for inputting the second picture signal;
The second sound input interface 1032, is connected with described detection module 1033, for inputting second sound signal;
Display unit 1034, is connected with described detection module 1033, for showing described the second detected image signal energy variation diagram, and described detected second sound signal energy changing figure.
In the embodiment of the present application, known with reference to figure 1, described testing equipment 103 is connected with described tested equipment 102, be used for receiving described the second picture signal and described second sound signal, and the output second image energy variation diagram corresponding with described the second picture signal, and the second sound energy changing figure corresponding with described second sound signal;
Wherein, whether described testing equipment 103 also has image energy saltus step for detection of described the second image energy variation diagram, and in the time having described image energy saltus step, is recorded in the first moment while there is described image energy saltus step; Detect described the second sound energy changing figure and whether have acoustic energy saltus step, and in the time having described acoustic energy saltus step, be recorded in the second moment while there is described acoustic energy saltus step, then, based on described the first moment and described the second moment, obtain the labial synchronization amount of delay of described tested equipment 102.
In the embodiment of the present application, in order to detect more accurately equipment 102 to be detected, also need described testing equipment 103 to be directly connected with described playback equipment 101, now, just can export the first image energy variation diagram corresponding with described the first picture signal by described testing equipment 103, and the first sound energy changing figure corresponding with described first sound tone signal;
Then, more whether there is image energy saltus step by described testing equipment 103 detection the first image energy variation diagram corresponding with described the first picture signal, and in the time having described image energy saltus step, be recorded in the 3rd moment while there is described image energy saltus step; Detect described the first sound energy changing figure and whether have acoustic energy saltus step, and in the time having described acoustic energy saltus step, be recorded in the 4th moment while there is described acoustic energy saltus step, then, based on described the 3rd moment and described the 4th moment, obtain for characterizing the time error between described multimedia file image and sound.
Then, then by described testing equipment 103 based on described the first moment, described the second moment and described time error, obtain described labial synchronization amount of delay.
In the present embodiment, in the time that described testing equipment 103 is specially oscilloscope, described testing equipment 103 courses of work are specific as follows: described oscilloscope is connected with described tested equipment 102, the second image input interface 1031 and the second sound input interface 1032 are respectively used to receive described the second picture signal and described second sound signal, after detection module 1033 detects, display unit 1034 shows the energy changing figure of described the second picture signal and described second sound signal;
In the time that detection module 1033 detects that described the second picture signal has energy saltus step, display unit 1034 shows the energy saltus step figure of described the second picture signal, and record the first moment of saltus step T1, detect when described second sound signal has energy saltus step, display unit 1034 shows the energy saltus step figure of described second sound signal, and record the second moment T2, and then obtain the labial synchronization amount of delay (T2-T1) of described equipment under test 102, please refer to Fig. 5.
In the time that described oscilloscope 103 is directly connected with described playback equipment 101, first described oscilloscope 103 display units 1034 show the first image energy variation diagram corresponding with the first picture signal, and with described first sound tone signal to deserved first sound sound energy changing figure, then, in the time that detection module 1033 detects that the first picture signal has energy saltus step, display unit 1034 shows described the first image signal energy saltus step figure, and record the first moment of saltus step T3, detect when described second sound signal has energy saltus step, display unit 1034 shows the energy saltus step figure of described second sound signal, and record the second moment T4, then obtain for characterizing the time error (T4-T3) between described multimedia file image and sound.By described oscilloscope 103, based on the first moment T1, described the second moment T2 and described time error (T4-T3) obtain the labial synchronization amount of delay (T2-T1) of equipment under test 102-(T4-T3).
Technical scheme in above-mentioned the embodiment of the present application, at least has following technique effect or advantage:
1, due to picture signal with voice signal comes from an image and sound all has energy saltus step and synchronous multimedia file, so, efficiently solve in prior art and to have the technical problem that can not guarantee image and sound to carry out synchronous acquisition, realized the synchronous generation of picture signal and voice signal and the technique effect of acquisition;
2, because employing playback equipment is play an image and sound all has energy saltus step and synchronous multimedia file, so, can export synchronous picture signal and voice signal simultaneously, and be input to tested equipment, and then effectively solve in the time being input to tested equipment, existence can not guarantee at synchronization, picture signal and voice signal to be input to the technical problem of tested equipment, so, realize the synchronous technique effect of inputting tested equipment of picture signal and voice signal;
3, because whether the picture signal and the voice signal that adopt testing equipment automatically to detect tested equipment output have energy saltus step, and the energy saltus step relative time of computed image and sound is poor in the time having energy saltus step, determine labial amount of delay with this, efficiently solve in prior art and have the technical problem that judged result accuracy is very low, realized the technique effect of the labial synchronization delay data of accurate quantification.
Embodiment 2
The embodiment of the present application also provides a kind of labial synchronization method of testing, its method flow diagram please refer to Fig. 6, be applied in labial synchronization test macro, described system comprises: playback equipment 101, tested equipment 102, is connected testing equipment 103 with described playback equipment 101, be connected with described tested equipment 102, described method comprises:
S501 plays an image by described playback equipment 101 and sound all has energy saltus step and synchronous multimedia file;
Specifically, in the present embodiment, S501 is specially: described playback equipment 101 is play described multimedia file by the broadcasting chip 1011 of described playback equipment after, the first image output interface 1012 by described playback equipment is exported the first picture signal, and the first sound output interface 1013 is exported first sound tone signal.
S502 receives by described tested equipment 102 the first picture signal and the first sound tone signal that described playback equipment 101 is exported in the time playing described multimedia file, and output second picture signal corresponding with described the first picture signal, and the second sound signal corresponding with described first sound tone signal;
Specifically, in the present embodiment, S502 is specially: described tested equipment 102 receives by the first image input interface 1021 the first picture signal that described playback equipment the first image output interface 1012 is exported, described tested equipment 102 receives by the first sound input interface 1022 the first sound tone signal that described playback equipment 101 first sound output interfaces 1013 are exported, after processing via described tested device processes module 1023, export second picture signal corresponding with the first picture signal by the second image output interface 1024, and export the second sound signal corresponding with first sound tone signal by the second sound output interface 1025.
S503 receives described the second picture signal and described second sound signal by described testing equipment, and the output second image energy variation diagram corresponding with described the second picture signal, and the second sound energy changing figure corresponding with described second sound signal;
Specifically, in the present embodiment, S503 is specially: described testing equipment 103 receives by the second image input interface 1031 the second picture signal that described tested equipment the second image output interface 1024 is exported, and described testing equipment receives by the second sound input interface 1032 second sound signal that described tested equipment the second sound output interface 1025 is exported, after detecting via described testing equipment detection module 1033, the display unit 1034 output second image energy variation diagram corresponding with the second picture signal of described testing equipment, and the second sound energy changing figure corresponding with described second sound signal.
Whether S504 detects described the second image energy variation diagram by described testing equipment 103 image energy saltus step, and in the time having described image energy saltus step, is recorded in the first moment while there is described image energy saltus step; Whether have acoustic energy saltus step, and in the time having described acoustic energy saltus step, be recorded in the second moment while there is described acoustic energy saltus step if detecting described the second sound energy changing figure;
Based on described the first moment and described the second moment, obtain the labial synchronization amount of delay of described tested equipment by described testing equipment.
Specifically, in the present embodiment, S504 is specially: whether the detection module 1033 of described testing equipment detects the second picture signal energy saltus step, show the second image signal energy variation diagram at the display unit 1034 of described testing equipment, and in the time having energy saltus step, the first moment T1 while simultaneously recording described the second image signal energy saltus step; Whether the detection module 1033 of described testing equipment detects second sound signal energy saltus step, show second sound signal energy changing figure at the display unit 1034 of described testing equipment, and in the time having energy saltus step, the second moment T2 while simultaneously recording described second sound signal energy saltus step, based on described the first moment and described the second moment, obtain the labial synchronization amount of delay (T2-T1) of described tested equipment.
S505 is in the time being directly connected described testing equipment 103 with described playback equipment 101, by the described testing equipment 103 output first image energy variation diagram corresponding with described the first picture signal, and the first sound energy changing figure corresponding with described first sound tone signal;
Whether have image energy saltus step, and in the time having described image energy saltus step, be recorded in the 3rd moment while there is described image energy saltus step if detecting the first image energy variation diagram corresponding with described the first picture signal by described testing equipment 103; Detect described the first sound energy changing figure and whether have acoustic energy saltus step, and in the time having described acoustic energy saltus step, be recorded in the 4th moment while there is described acoustic energy saltus step, then, based on described the 3rd moment and described the 4th moment, obtain for characterizing the time error between described multimedia file image and sound.
Specifically, in the present embodiment, S505 is specially: the second image input interface 1031 of described testing equipment is connected with the first image output interface 1012 of described playback equipment, the second sound input interface 1032 of described testing equipment is connected with the first sound output interface 1013 of described playback equipment, after the detection module 1033 of described testing equipment detects, the display unit 1034 output first image energy variation diagram corresponding with described the first picture signal of described testing equipment, and the first sound energy changing figure corresponding with described first sound tone signal;
Whether have energy saltus step, and in the time of described image energy saltus step, record the 3rd moment T3 that described testing equipment display unit 1034 shows saltus step if detecting described the first image energy variation diagram by described testing equipment detection module 1033; Whether described testing equipment detection module 1033 detects described the first sound energy changing figure energy saltus step, and in the time of described acoustic energy saltus step, record the 4th moment T4 that described testing equipment display unit 1034 shows saltus step, then, based on described the 3rd moment T3 and described the 4th moment T4, obtain for characterizing the time error (T4-T3) between described multimedia file image and sound.
S506 is by described testing equipment based on described the first moment T1, and described the second moment and described time error T2, obtain described labial synchronization amount of delay.
Specifically, in the present embodiment, based on described the first moment T1, described the second moment T2 and described time error (T4-T3), obtain described labial synchronization amount of delay and be: (T2-T1)-and (T4-T3).
Technical scheme in above-mentioned the embodiment of the present application, at least has following technique effect or advantage:
1, owing to adopting image and sound to be produced by same multimedia file, solved the problem that available technology adopting Artificial Control microphone and camera can not synchronously start, so, realize the synchronous input of image and sound;
2, because employing playback equipment is play an image and sound all has energy saltus step and synchronous multimedia file, so, can export synchronous picture signal and voice signal simultaneously, and be input to tested equipment, and then effectively solve in the time being input to tested equipment, existence can not guarantee at synchronization, picture signal and voice signal to be input to the technical problem of tested equipment, so, realize the synchronous technique effect of inputting tested equipment of picture signal and voice signal;
3, because whether the picture signal and the voice signal that adopt testing equipment automatically to detect tested equipment output have energy saltus step, and the energy saltus step relative time of computed image and sound is poor in the time having energy saltus step, determine labial amount of delay with this, efficiently solve in prior art and have the technical problem that judged result accuracy is very low, realized the technique effect of the labial synchronization delay data of accurate quantification.
Although described the preferred embodiments of the present invention, once those skilled in the art obtain the basic creative concept of cicada, can make other change and modification to these embodiment.So claims are intended to be interpreted as comprising preferred embodiment and fall into all changes and the modification of the scope of the invention.
Obviously, those skilled in the art can carry out various changes and modification and not depart from the spirit and scope of the present invention the present invention.Like this, if within of the present invention these are revised and modification belongs to the scope of the claims in the present invention and equivalent technologies thereof, the present invention is also intended to comprise these changes and modification interior.

Claims (10)

1. a labial synchronization test macro, is characterized in that, comprising:
Playback equipment, all has energy saltus step and synchronous multimedia file for playing image and sound;
Tested equipment, be connected with described playback equipment, the first picture signal and the first sound tone signal in the time playing described multimedia file, exported for receiving described playback equipment, and output second picture signal corresponding with described the first picture signal, and the second sound signal corresponding with described first sound tone signal;
Testing equipment, with described tested equipment connection, be used for receiving described the second picture signal and described second sound signal, and the output second image energy variation diagram corresponding with described the second picture signal, and the second sound energy changing figure corresponding with described second sound signal;
Wherein, whether described testing equipment also has image energy saltus step for detection of described the second image energy variation diagram, and in the time having described image energy saltus step, is recorded in the first moment while there is described image energy saltus step; Detect described the second sound energy changing figure and whether have acoustic energy saltus step, and in the time having described acoustic energy saltus step, be recorded in the second moment while there is described acoustic energy saltus step, then, based on described the first moment and described the second moment, obtain the labial synchronization amount of delay of described tested equipment.
2. the system as claimed in claim 1, is characterized in that, described playback equipment comprises:
Play chip, for playing described multimedia file, export described the first picture signal and described first sound tone signal;
The first image output interface, is connected with described broadcasting chip, for exporting described the first picture signal;
The first sound output interface, is connected with described broadcasting chip, for exporting described first sound tone signal.
3. system as claimed in claim 2, is characterized in that, described tested equipment comprises:
The first image input interface, is connected with described the first image output interface, for receiving described the first picture signal;
The first sound input interface, is connected with described the first sound output interface, for receiving described first sound tone signal;
Processing module, be connected with described the first image input interface and described the first sound input interface, and after described the first picture signal receiving is processed, export described the second picture signal, to after receiving described first sound tone signal and processing, export described second sound signal;
The second image output interface, is connected with described processing module, for exporting described the second picture signal;
The second sound output interface, is connected with described processing module, for exporting described second sound signal.
4. system as claimed in claim 3, is characterized in that, described testing equipment comprises:
The second image input interface, is connected with described the second image output interface, for receiving described the second picture signal;
The second sound input interface, is connected with described the second sound output interface, for receiving described second sound signal;
Display unit, is connected with described the second image input interface and described the second sound input interface, for exporting the second image energy variation diagram corresponding with described the second picture signal, and the second sound energy changing figure corresponding with described second sound signal;
Test module, is connected with described display unit, whether has image energy saltus step for detection of described the second image energy variation diagram, and in the time having described image energy saltus step, is recorded in the first moment while there is described image energy saltus step; Detect described the second sound energy changing figure and whether have acoustic energy saltus step, and in the time having described acoustic energy saltus step, be recorded in the second moment while there is described acoustic energy saltus step, then, based on described the first moment and described the second moment, obtain the labial synchronization amount of delay of described tested equipment.
5. system as claimed in claim 4, it is characterized in that, be connected with described the first image output interface at described the second image input interface, when described the second sound input interface is connected with described the first sound output interface, described display unit, also for exporting the first image energy variation diagram corresponding with described the first picture signal, and the first sound energy changing figure corresponding with described first sound tone signal;
Whether described test module, also have image energy saltus step for detection of described the first image energy variation diagram, and in the time having described image energy saltus step, be recorded in the 3rd moment while there is described image energy saltus step; Detect described the first sound energy changing figure and whether have acoustic energy saltus step, and in the time having described acoustic energy saltus step, be recorded in the 4th moment while there is described acoustic energy saltus step, then, based on described the 3rd moment and described the 4th moment, obtain for characterizing the time error between described multimedia file image and sound; And based on described the first moment, described the second moment and described time error, obtain described labial synchronization amount of delay.
6. the system as described in arbitrary claim in claim 1-5, is characterized in that, described tested equipment is specially: TV conference system, video chat system or video monitoring system.
7. a labial synchronization method of testing, is applied in labial synchronization test macro, it is characterized in that, described system comprises: playback equipment, and tested equipment, is connected with described playback equipment, testing equipment, with described tested equipment connection, described method comprises:
Play an image and sound all has energy saltus step and synchronous multimedia file by described playback equipment;
Receive by described tested equipment the first picture signal and the first sound tone signal that described playback equipment is exported in the time playing described multimedia file, and output second picture signal corresponding with described the first picture signal, and the second sound signal corresponding with described first sound tone signal;
Receive described the second picture signal and described second sound signal by described testing equipment, and the output second image energy variation diagram corresponding with described the second picture signal, and the second sound energy changing figure corresponding with described second sound signal;
Whether have image energy saltus step, and in the time having described image energy saltus step, be recorded in the first moment while there is described image energy saltus step if detecting described the second image energy variation diagram by described testing equipment; Whether have acoustic energy saltus step, and in the time having described acoustic energy saltus step, be recorded in the second moment while there is described acoustic energy saltus step if detecting described the second sound energy changing figure;
Based on described the first moment and described the second moment, obtain the labial synchronization amount of delay of described tested equipment by described testing equipment.
8. method as claimed in claim 7, is characterized in that, described by described testing equipment based on described the first moment and described the second moment, before obtaining the labial synchronization amount of delay of described tested equipment, described method also comprises:
In the time that described testing equipment is directly connected with described playback equipment, by the described testing equipment output first image energy variation diagram corresponding with described the first picture signal, and the first sound energy changing figure corresponding with described first sound tone signal;
Whether have image energy saltus step, and in the time having described image energy saltus step, be recorded in the 3rd moment while there is described image energy saltus step if detecting the first image energy variation diagram corresponding with described the first picture signal by described testing equipment; Detect described the first sound energy changing figure and whether have acoustic energy saltus step, and in the time having described acoustic energy saltus step, be recorded in the 4th moment while there is described acoustic energy saltus step, then, based on described the 3rd moment and described the 4th moment, obtain for characterizing the time error between described multimedia file image and sound.
9. method as claimed in claim 8, is characterized in that, described by described testing equipment based on described the first moment and described the second moment, obtain the labial synchronization amount of delay of described tested equipment, be specially:
By described testing equipment, based on described the first moment, described the second moment and described time error, obtain described labial synchronization amount of delay.
10. the method as described in arbitrary claim in claim 7-9, is characterized in that, described tested equipment is specially: TV conference system, video chat system or video monitoring system.
CN201210424020.2A 2012-10-30 2012-10-30 System and method for labial sound synchronization test Pending CN103796006A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201210424020.2A CN103796006A (en) 2012-10-30 2012-10-30 System and method for labial sound synchronization test
PCT/CN2013/083838 WO2014067364A1 (en) 2012-10-30 2013-09-18 Lip synchronization test system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210424020.2A CN103796006A (en) 2012-10-30 2012-10-30 System and method for labial sound synchronization test

Publications (1)

Publication Number Publication Date
CN103796006A true CN103796006A (en) 2014-05-14

Family

ID=50626448

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210424020.2A Pending CN103796006A (en) 2012-10-30 2012-10-30 System and method for labial sound synchronization test

Country Status (2)

Country Link
CN (1) CN103796006A (en)
WO (1) WO2014067364A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106101694A (en) * 2016-06-12 2016-11-09 深圳创维-Rgb电子有限公司 A kind of methods, devices and systems of automatic calibration labial synchronization
CN107172419A (en) * 2017-06-08 2017-09-15 广州华多网络科技有限公司 Audio-visual synchronization method of testing and system

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2437123A (en) * 2006-04-10 2007-10-17 Vqual Ltd Method and apparatus for measuring audio/video sync delay
CN101742357A (en) * 2009-12-29 2010-06-16 北京牡丹电子集团有限责任公司 Method for measuring audio/video synchronous error of digital television device

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1207922C (en) * 2002-01-30 2005-06-22 华邦电子股份有限公司 Method and system for measuring accuracy of video/audio output synchronization, and analysis means
GB2437122B (en) * 2006-04-10 2011-01-05 Vqual Ltd Method and apparatus for measuring audio/video sync delay
CN100546391C (en) * 2007-03-09 2009-09-30 北京牡丹视源电子有限责任公司 Digital TV video and audio synchronous method of measurement

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2437123A (en) * 2006-04-10 2007-10-17 Vqual Ltd Method and apparatus for measuring audio/video sync delay
CN101742357A (en) * 2009-12-29 2010-06-16 北京牡丹电子集团有限责任公司 Method for measuring audio/video synchronous error of digital television device

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106101694A (en) * 2016-06-12 2016-11-09 深圳创维-Rgb电子有限公司 A kind of methods, devices and systems of automatic calibration labial synchronization
CN107172419A (en) * 2017-06-08 2017-09-15 广州华多网络科技有限公司 Audio-visual synchronization method of testing and system

Also Published As

Publication number Publication date
WO2014067364A1 (en) 2014-05-08

Similar Documents

Publication Publication Date Title
WO2017092343A1 (en) Video data detection method and device
CN108401217B (en) Signal output and TWS earphone testing method, terminal, device and storage medium
US8957972B2 (en) Automatic glass-to-glass video and A/V sync test tool
CN107018443A (en) Video recording method, device and electronic equipment
US9204234B2 (en) Audio testing system and method
KR102260946B1 (en) Techniques for detecting media playback errors
US20180376102A1 (en) Profile for frame rate conversion
CN112040225B (en) Playing delay difference measuring method, device, equipment, system and storage medium
CN110267083A (en) Detection method, device, equipment and the storage medium of audio-visual synchronization
CN105100794A (en) Audio and video synchronization test method and apparatus
CN100437506C (en) Method for detecting audio connection function of audio receiver and computer mainframe board
CN110585702A (en) Sound and picture synchronous data processing method, device, equipment and medium
US7938327B2 (en) Data capturing assembly and method and information storage apparatus
US8223136B2 (en) Error detection and prevention inacoustic data
CN104023176B (en) Handle method, device and the terminal device of audio and image information
CN105744325A (en) Audio/video play control method and audio/video play control device
CN103796006A (en) System and method for labial sound synchronization test
CN112004133A (en) Sound and picture synchronization method and device, projection equipment and readable storage medium
CN116527977A (en) Sound and picture synchronization method and device, electronic equipment and readable storage medium
CN212163616U (en) Microphone test circuit, circuit board and microphone testing arrangement
CN110234029B (en) Playing processing method, device, equipment and storage medium of multi-sensor data
CN101656596A (en) Media flow time delay detecting method and device
CN112566228A (en) Method and system for reducing equipment power consumption in network video conference
CN116437068A (en) Lip synchronization test method and device, electronic equipment and storage medium
CN107172419A (en) Audio-visual synchronization method of testing and system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20140514