WO2017206935A1 - 一种音视频同步测试的系统及方法 - Google Patents

一种音视频同步测试的系统及方法 Download PDF

Info

Publication number
WO2017206935A1
WO2017206935A1 PCT/CN2017/086916 CN2017086916W WO2017206935A1 WO 2017206935 A1 WO2017206935 A1 WO 2017206935A1 CN 2017086916 W CN2017086916 W CN 2017086916W WO 2017206935 A1 WO2017206935 A1 WO 2017206935A1
Authority
WO
WIPO (PCT)
Prior art keywords
audio
video
test signal
test
sound
Prior art date
Application number
PCT/CN2017/086916
Other languages
English (en)
French (fr)
Inventor
戎玲
史源
鲍逸明
杨竹莹
Original Assignee
公安部第三研究所
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 公安部第三研究所 filed Critical 公安部第三研究所
Priority to ES17805880T priority Critical patent/ES2921057T3/es
Priority to EP17805880.6A priority patent/EP3468183B1/en
Publication of WO2017206935A1 publication Critical patent/WO2017206935A1/zh

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/4302Content synchronisation processes, e.g. decoder synchronisation
    • H04N21/4307Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen
    • H04N21/43072Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen of multiple content streams on the same device
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N17/00Diagnosis, testing or measuring for television systems or their details
    • H04N17/004Diagnosis, testing or measuring for television systems or their details for digital television systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • H04N21/4394Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/44008Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream

Definitions

  • the present invention provides a system and method for audio and video synchronization testing of a digital end-to-end device that is non-standard interface or transmission protocol non-standard or adopts encryption technology.
  • the technical solution is specifically:
  • a system for synchronizing audio and video testing which can be applied to performance testing of devices to be tested for playing audio and video, especially for real-time remote audio and video communication (such as building video intercom equipment in home security equipment), etc.
  • the audio and video synchronization performance test performed in a device with high audio and video synchronization performance requirements includes:
  • the ⁇ T is the time difference
  • the T 1 is the audio delay time
  • the T 2 is the video delay time
  • the time of the ⁇ T, the T 1 and the T 2 The units are the same.
  • the time units of the ⁇ T, the T 1 and the T 2 are all milliseconds.
  • the LED pattern generator generates the graphic test signal
  • the image acquisition unit collects the graphic test signal and transmits the graphic test signal to the graphic display unit through the video transmission unit to display a transmission image test signal
  • the video The acquisition unit simultaneously acquires the real-time image test signal currently generated by the LED pattern generator and the transmission image test signal currently displayed by the image display unit
  • the first comparison unit collects the real-time image test signal and the The transmission image test signals are compared
  • the first calculation unit acquires and outputs the video delay time to the processing device according to the comparison result output by the first comparison unit.
  • the image test signal is collected and transmitted by the device under test to the first output end of the device to be tested to output a transmission image test signal;
  • the above-mentioned audio and video synchronization test system may further include:
  • the ⁇ T is the time difference
  • the T 1 is the audio delay time
  • the T 2 is the video delay time
  • the time of the ⁇ T, the T 1 and the T 2 The units are the same.
  • the video signal generating device includes an LED pattern generator, and the device under test includes an image acquisition unit, a video transmission unit, and an image display unit;
  • the time test device includes a video capture unit, a first comparison unit, and a first calculation unit;
  • the LED pattern generator includes at least one row of N LED lamps arranged in a line; and the N LED lamps are in the same direction along which they extend. f sequentially lights each LED light, and each of the LED lights has a lighting time of 1/(N*f), and the video delay time is:
  • T 3 is the video delay time
  • N is a positive integer greater than or equal to 5
  • n is a pulse pattern of the LED illuminated in the pulse pattern of the real-time image test signal and the test signal of the transmission image
  • the difference between the serial numbers of the LEDs, the n is a natural number, and the error range of the T 3 is
  • the audio collection unit includes a free field microphone, the audio analysis unit including an audio analyzer, the audio analyzer passing the acquisition of the free field microphone The input sound test signal outputs the audio delay time.
  • FIG. 4 is a schematic structural diagram of performing audio delay test on a visual intercom device to be tested according to an embodiment of the present invention
  • FIG. 5 is a CSS test signal group of an audio delay according to an embodiment of the present invention.
  • FIG. 1 is a schematic structural diagram of a system for audio and video synchronization testing according to an embodiment of the present invention.
  • this embodiment provides a system for testing audio and video synchronization, which can be applied to a device for transmitting audio and video to be tested.
  • the system can include:
  • the video delay test device is adjacent to the video signal generating device and the first output end of the device to be tested, so as to separately collect the transmitted image test signal output by the device under test and the video signal generating device currently generated.
  • the real-time image test signal, and the acquired transmission image test signal is compared and/or operated with the real-time image test signal, thereby acquiring and outputting the video delay time of the device to be tested;
  • the audio signal generating device can also be disposed adjacent to the device to be tested, so that the device under test collects the sound test signal generated by the audio signal generating device; the device under test collects the sound test signal generated by the audio signal generating device (generally can be utilized) After the speaker simulates the voice of the person to generate the sound test signal, the collected sound test signal is transmitted to the second output end of the device under test (the sound output end, such as an audio device, etc.) to output a sound test signal ( That is, the sound information output after being collected and transmitted by the device under test);
  • An audio delay test device respectively collects and compares the transmitted sound test signal output by the device under test and the real-time sound test signal currently generated by the audio signal generating device to obtain an audio delay time of the device to be tested;
  • the processing device is respectively connected to the video delay testing device and the audio delay testing device to obtain a time difference between the video delay time and the audio delay time.
  • the video signal generating device generates an image test signal
  • the device under test collects the graphic test signal and transmits the signal to the first output end of the remote end to output the image test signal, and the video delay test
  • the device simultaneously acquires the transmitted image
  • the test signal is compared with the real-time image test signal displayed by the video signal generating device at this time, that is, because the device under test needs to take a certain time to transmit the image test signal, thereby causing the test image output by the device under test to be tested.
  • the signal is not the real-time image test signal displayed by the current video signal generating device, so the device under test transmits a certain amount of delay in the image test signal, and the difference obtained by comparing the acquired transmitted image test signal with the real-time image test signal is obtained.
  • the signal can be obtained by using the rule that the video signal generating device generates the image test signal to obtain the video delay time of the device under test; similarly, the audio signal generating device generates the sound test signal, and the device under test collects the sound test.
  • the audio delay test device After the signal is transmitted and transmitted to the second output end of the remote end to output the transmitted sound test signal, the audio delay test device simultaneously collects the transmitted sound test signal and the real-time sound test signal played by the audio signal generating device at this time. And compare, because the device to be tested It takes a certain time to transmit the sound test signal, which in turn causes the transmitted sound test signal output by the device under test to be not the real-time sound test signal generated by the current audio signal generating device, so the device under test transmits the sound test signal.
  • the sound difference signal obtained by comparing the collected transmission sound test signal with the real-time sound test signal can obtain the audio delay of the device under test by using the rule that the audio signal generating device generates the sound test signal Time (note that the order between the steps of obtaining the video delay time and the step of obtaining the audio delay time is in no particular order, even under conditions that do not affect each other);
  • the processing device obtains the video delay time and the audio delay time to perform the comparison operation, and can obtain the time difference between the audio data and the video data when the device under test synchronously transmits the audio and video, and can determine the time difference according to the time difference. Audio and video synchronization of the device under test can.
  • the above system for audio and video synchronization testing may further comprise a judging device connected to the processing module (the judging device may be integrated with the processing module or capable of implementing the same component of the two functions), and the The determining device may pre-store a data table corresponding to the time difference and the audio and video synchronization performance parameter, so that the determining device can quickly retrieve the audio and video synchronization performance parameter corresponding to the time difference from the data table according to the received time difference, and output the data.
  • the audio and video synchronization performance of the device under test is quickly and image-outputted.
  • the time difference can be generally determined based on the audio delay time.
  • the calculation for example, by subtracting the video delay time from the audio delay time, can know how long the video lags the audio in the audio and video synchronization performance of the device under test; of course, the video delay time can also be used as a reference to calculate the time difference. It can be set specifically for actual test needs.
  • the audio and video synchronization performance parameter output by the judging device is the optimal parameter of the audio and video synchronization performance of the device to be tested, that is, regardless of the delay time, as long as the same, it means that the audio and video synchronization performance of the device under test reaches The ideal state is the best performance; when the value of ⁇ T is negative, it means that the delay time of transmitting the audio of the device under test is less than the delay time of the transmitted video, that is, the audio and video synchronization performance parameter output by the judging device is The video signal transmitted by
  • the above video signal generating device may include an LED graphic generator (such as an LED light array, etc.), and the device under test may include an image capturing unit, a video transmitting unit, and an image display unit, and the video.
  • the delay test device may further comprise a video capture unit, a first comparison unit and a first calculation unit, etc.; the graphic test signal is generated by the LED pattern generator, and the image acquisition unit collects the graphic test signal and transmits the graphic test signal to the graphic display through the video transmission unit.
  • the unit displays the transmission image test signal, and the video acquisition unit simultaneously collects the real-time image test signal currently generated by the LED pattern generator and the transmission image test signal currently displayed by the image display unit, and the first comparison unit then collects the collected real-time image test signal.
  • comparing the transmitted image test signals and outputting the comparison result, and the first calculating unit acquires and outputs the video delay time to the processing device according to the comparison result output by the first comparing unit.
  • the LED pattern generator may include at least one row of N LED lamps arranged in a straight line (such as an N*N LED lamp array, and an optional one row of LED lamps for testing); and the N LED lamps extend along the same
  • N LED lamps are sequentially illuminated at a frequency f in the direction, and the lighting time of each LED lamp can be 1/(N*f), and the first comparison unit performs the acquired real-time image test signal and the transmitted image test signal. Compare and get real-time imagery
  • the difference between the test signal and the transmitted image test signal is n LED lights (that is, the difference between the illuminated LED lights is n LED lights), and the calculation formula can use the following formula to obtain the video delay time, specifically:
  • T 3 is the video delay time
  • N is a positive integer greater than or equal to 5
  • n is the serial number between the LED illuminated in the pulse pattern of the real-time image test signal and the LED illuminated by the pulse pattern of the transmission image test signal Poor
  • n is a natural number
  • the error range of T 3 should be between.
  • the value of the frequency f of the illumination between adjacent LED lamps should be greater than or equal to the value of the frame rate that can be resolved by the human eye, and the value of the frame rate of the acquired image of the image acquisition unit is also greater than the frequency f. Value, otherwise you will not be able to capture graphics.
  • the audio signal generating apparatus includes an audio signal generator, and the device under test may further include a sound collecting unit, an audio transmitting unit, and a sound playing unit; and the audio delay testing device may include an audio collecting unit.
  • the audio signal generator generates a sound test signal
  • the sound collection unit collects the sound test signal and transmits it to the sound playing unit through the audio transmission unit to output the transmitted sound test signal
  • the audio collecting unit simultaneously acquires the audio signal generator currently generated
  • the audio analyzing unit analyzes and processes the collected real-time sound test signal and the transmitted sound test signal, and outputs the audio delay time to the processing device.
  • the audio signal generating device may further include a simulation mouth to equalize the sound test signal generated by the audio signal generator, and the sound collecting unit acquires the equalized sound test signal; and the audio collecting unit includes a free field microphone.
  • the audio analysis unit may include an audio analyzer, and the audio signal generator may further have a related instrument device for displaying a pulse pattern of the real-time image test signal, and the audio analyzer performs the real-time sound test signal collected by the free field microphone and the pulse of the sound test signal. After analyzing the comparison processing, the audio delay time of the device under test is output.
  • each audio delay test device includes an audio signal generator, a simulation mouth, a free field microphone and an audio analyzer; in order to avoid two ways The audio delay test devices interfere with each other, and the audio signal generator and the simulation mouth for generating the sound test signal at the same hands-free terminal can be connected
  • the line angle between the free field microphone and the audio analyzer transmitting the sound test signal is set to an angle greater than or equal to 32°, and the distance between the dummy mouth and/or the free field microphone and the hands-free terminal is greater than or equal to 10 cm. And less than 50cm (to avoid the proximity caused by interference, the distance is far from the end of the hands-free The terminal cannot collect enough sounds for testing.
  • the specific value can be set according to the performance parameters and test requirements of the actual device.
  • the two can be asynchronously
  • Each of the road audio delay test devices is tested to ensure the accuracy of each test result; that is, the audio signal generator and the simulation mouth are set only at a hands-free terminal of the visual intercom device of the building to be tested.
  • a free field microphone and an audio analyzer are placed adjacent to the location of the other hands-free terminal.
  • an audio signal generator is used to generate a test unit signal including a plurality of groups (such as four groups of CSS (composite source signal) signals, wherein the former Several groups (such as the first three groups) of CSS signals can be used for training to make the channel transmission reach the normal state, and the remaining group (such as the fourth group) CSS signal is used as the test signal, and the fourth group of CSS signals can be used to maintain the high level.
  • groups such as four groups of CSS (composite source signal) signals
  • the signal part is used for delay measurement; preferably, the pulse width of each group of CSS signals is 248.62 ms, and the adjacent CSS signals are separated by 101.38 ms, so that the signal length of each test unit is 1928.62 ms (the above specific values can be tested according to actual conditions).
  • the present application also provides a method for synchronizing audio and video testing, which can be applied to performance testing such as transmitting and playing audio and video waiting devices based on the above-mentioned audio and video synchronization testing system, especially for audio and video synchronization performance.
  • performance testing such as transmitting and playing audio and video waiting devices based on the above-mentioned audio and video synchronization testing system, especially for audio and video synchronization performance.
  • the method may include:
  • the image test signal is collected and transmitted by the device under test to the first output end of the device to be tested to output a transmission image test signal;
  • the sound test signal is collected and transmitted by the device under test to the second output end of the device to be tested to output a sound test signal;
  • a processing device receives and calculates a time difference between the video delay time and the audio delay time.
  • the above method may further include:
  • the method for synchronizing the audio and video in the present embodiment may be based on a system for synchronizing audio and video testing in the foregoing embodiments, and products and methods that can correspond to each other, so the two embodiments
  • the technical features that are the same or similar may be applied to each other, and are not described herein for the sake of brevity, but they should not be construed as limiting the application, as long as the person skilled in the art sees either of them.
  • the technical solutions that can be extended in the course of creative work should be within the scope of this application.
  • the audio delay referred to in the above embodiment of the building visual intercom device is generally that the audio signal is sent by the local intercom terminal mouth reference point, and after being visually intercommunicated with the components of the building, to the remote end
  • the ear reference point of the terminal receives the required one-way transmission time
  • the video delay is generally taken by the local intercom terminal camera, and after the building visual intercom system components, the display to the remote intercom terminal
  • the device displays the one-way transmission time required for the same frame; and the lip synchronization is the time domain relationship between the transmitted video and the image signal of the building video intercom system.
  • the value can be used to describe the synchronization relationship between the sound and the image signal, and the available time difference range Representation (other embodiments can also be defined with reference to the above semantics); audio delay and video delay in a building-compliant video intercom device that meets the standard should be no more than 300ms, and the video frame rate should be no less than 20fps. Audio delay during the process of lip-synchronization The time difference between the time and the video delay time ranges from -90ms to +185ms, that is, when the system transmits the audio and video signals at the same time, the time of the output audio signal corresponding to the corresponding video signal should be no more than 90ms, and the lag time should be Not more than 185ms.
  • the test environment noise should not exceed 40 dB when performing the audio delay test, and the video delay test, EUT (equipment under test, test equipment)
  • EUT equipment under test, test equipment
  • the intercom device should be in the normal image transmission state, and when the audio communication of the EUT cannot be cut off, the audio and video synchronization test must be performed to ensure that it does not cause changes in the video characteristic parameters and/or audio characteristic parameters. .
  • the LED image generator can be used as a signal source for video frame rate, video delay, and lip sync test, while the LED image signal generator should be placed directly in front of the video capture device and can be adjusted by adjusting the LED matrix and video.
  • the distance between the receiving devices is such that the LED matrix image on the screen of the video receiving device fills the screen on the horizontal or vertical axis, and the image on the screen of the video receiving device needs to be able to focus.
  • the above LED pattern generator can be composed of a 10 ⁇ 10 LED array and a programming controller, and the LED array can be used to generate a graphic pattern for measurement, that is, a single LED can be used as a basic unit, and sequentially passed in the vertical or horizontal direction. Each LED light is illuminated to cause the rendered image to scroll continuously, and the programming controller can control the LED array to produce a varying pattern at a set frequency.
  • the LED pattern generator can also have an LED position information display function and has an output interface for outputting two falling edge synchronization signals.
  • the LED pattern generator can synchronously control the same LED array of the two groups A and B, that is, the group A LED array is placed in front of the VCU camera as a test.
  • the signal is placed on the display terminal of the intercom terminal so that the digital camera can take it.
  • the digital video camera can adjust the video capture frame rate (used in the video delay test) and capture the still image after receiving the start signal (used in lip sync).
  • a continuous change image generated by a row of 10 LED light groups in the middle area of the LED array can be selected as a test signal to activate audio and video communication; then, from 1 Hz, gradually adjusted from low to high
  • the setting frequency of the LED pattern generator (each LED is activated once per second, the flashing time is 1/(10f)s), so that the LED light group is scrolled in the horizontal direction in the order of the cycle.
  • the output image obtained after the system transmission is observed on the display screen of the tested intercom terminal, and the adjustment is stopped when it is observed that the image of the 10 LED lamp groups is stable and not rolling, and the observation state is valid when the state retention time is greater than 10 s.
  • the frequency at which the LED pattern generator is generated at this time is the frame rate (fr) of the detected IP BIS.
  • the frame rate (fr) of the system is taken as the set frequency of the LED pattern generator, and the continuously changing image generated by the LED array is controlled by the LED pattern generator as a test signal. Then, using the video capture device (VCD) to capture the image generated by the LED image generator A, each frame of the digital camera should be able to simultaneously capture the image on the screen of the video receiving device and the image on the LED signal generator B, and simultaneously capture for 3 seconds. . Thereafter, the position of the LED lamp activated on the LED image signal generator B and the position of the LED lamp activated on the screen of the video receiving device in the same frame are compared, and n LED lights are separated from each other to obtain a video delay time. Finally, statistical processing is performed on the measured video delay value, and the upper limit of the 90% confidence interval is used as the video delay Td.
  • the test can be placed in front of the video capture device for testing using an LED light on the LED matrix adjacent to the video receiving device. This will be used in place of the above test methods using LED image generators A and B for testing.
  • the audio delay and the video delay test can be performed simultaneously, and the test conditions and test methods should satisfy both the audio delay and the video delay requirement:
  • the audio signal generator should play a continuous CSS signal until the end of the test.
  • the first 3 sets of CSS signals are used for training, and the 4th set of signals are used for audio delay testing (as described in 7.3.3).
  • the digital camera At the end of the fourth set of signal high frequency signals and at the beginning of the falling edge (ie, 248.62 ms of the CSS signal), the digital camera begins to capture each frame of the image. Therefore, each group of CSS signals will measure the audio delay and video delay at 248.62ms.
  • test should last for 10s (28 tests), and the video delay method is used to calculate the video delay. Then the audio analyzer is controlled to calculate the audio delay time. Finally, the simultaneously captured video delay value and audio delay value are calculated. :
  • Lip sync video delay - audio delay (if negative, the audio is longer than the video delay);
  • the measured lip sync value is statistically processed, and the lower and upper limits of the 90% confidence interval are taken as the forward and backward lip sync measurements, respectively.
  • the real-time audio and video transmission and playback system (such as building security system and video) with high requirements on audio and video synchronization performance can be realized simply and accurately. Performance test of communication system, etc.)
  • the video delay time and audio delay time of the audio and video synchronous transmission playback device can be obtained simply, accurately and easily by using the video delay test device and the audio delay test device respectively, and the obtained video delay time is obtained. After comparing the audio delay time, the audio and video synchronization performance of the audio and video synchronous transmission and playback device can be accurately learned.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)

Abstract

本发明涉及音视频设备测试技术领域,具体涉及一种音视频同步测试的系统及方法,可应用于对音视频同步传输播放设备的同步测试中,尤其可简单、精准地实现对音视频同步性能要求较高的实时的音视频传输播放系统(如楼宇安防系统、视频通讯系统等)的性能测试,即可通过分别利用视频延时测试装置及音频延时测试装置,可简单、精准、易实施地获取音视频同步传输播放设备的视频延时时间及音频延时时间,并对获取的视频延时时间及音频延时时间进行比较后就能精确地获悉该音视频同步传输播放设备的音视频同步性能。

Description

一种音视频同步测试的系统及方法 技术领域
本发明涉及音视频设备测试技术领域,具体涉及一种音视频同步测试的系统及方法。
背景技术
在音视频同步传输的系统中,采用无标准接口或传输协议非标准或采取加密技术的数字端对端设备非常多,但由于当前的数字端对端设备中对于音视频数据传输所产生的延时不同,进而会出现诸如视频通话过程中唇音不同步等缺陷的产生,尤其是诸如实时远程音视频通讯等对于音视频同步性能要求较高的应用中,会大大降低音视频设备的用户体验;例如,在当前的住宅安全技术防范产品楼寓对讲系统中,当访客在楼栋门口访客呼叫机端按下房号,室内接收机端的主人在听到呼叫声音并回答时,最好能及时观看到访客的实时图像,进而以便于主人较为准确的辨别来者为谁,而为了能使室内收机端听到呼叫者声音的同时观看到呼叫者的容貌,就需要使得设备能够实现唇音同步,而一旦唇音不同步,不但会降低用户体验,同时还会使得主人无法辨别访客身份,进而带来安全隐患。
目前,为了提升所研发或测试即将推广应用的远程音视频通讯设备的音视频同步的性能,一般是采用电信号-图像信号测试方法进行 测试(如唇音同步测试等),但由于其采用的诸如接口、视频设备等均可能为非标准的,所以只能采用诸如端对端的全程图像信号测试,进而会使得测试效果不甚理想,即无法准确的获取当前待测设备的音视频同步性能,从而也无法对研发的或待推广应用的产品的性能进行准备的评估。
发明内容
针对现有技术的不足,本发明提供了一种可应用于无标准接口或传输协议非标准或采取加密技术的数字端对端设备的音视频同步测试的系统及方法,该技术方案具体为:
一种音视频同步测试的系统,可应用于对传输播放音视频的待测设备的性能测试中,尤其可针对实时远程音视频通讯(例如居家安防设备中的楼宇可视对讲设备)等对于音视频同步性能要求较高的设备中进行的音视频同步性能测试,所述系统包括:
视频信号发生装置,临近所述待测设备设置,所述视频信号发生装置产生图像测试信号;所述待测设备采集并传送所述图像测试信号至该待测设备的第一输出端,以输出传输图像测试信号;
视频延时测试装置,分别采集并对比所述待测设备所输出的所述传输图像测试信号和所述视频信号发生装置当前所产生的实时图像测试信号,以获取所述待测设备的视频延时时间;
音频信号发生装置,临近所述待测设备设置,所述音频信号发生装置产生声音测试信号;所述待测设备采集并传送所述声音测试信号 至该待测设备的第二输出端,以输出传输声音测试信号;
音频延时测试装置,分别采集并对比所述待测设备所输出的所述传输声音测试信号和所述音频信号发生装置当前所产生的实时声音测试信号,以获取所述待测设备的音频延时时间;
处理装置,分别与所述视频延时测试装置及所述音频延时测试装置连接,以获取所述视频延时时间与所述音频延时时间之间的时间差。
作为一个优选的实施例,上述的音视频同步测试的系统,还可包括:
判断装置,与所述处理模块连接;
其中,所述判断装置中预存有时间差与音视频同步性能参数对应的数据表,所述判断装置接收并根据所述时间差从所述数据表中调取与该时间差所对应的音视频同步性能参数进行输出。
作为一个优选的实施例,上述的音视频同步测试的系统,所述处理装置可以所述音频延时时间为基准获取所述时间差。
作为一个优选的实施例,上述的音视频同步测试的系统,所述处理装置可根据公式ΔT=T1-T2计算所述时间差;
其中,所述ΔT为所述时间差,所述T1为所述音频延时时间,所述T2为所述视频延时时间,且所述ΔT、所述T1及所述T2的时间单位均相同。
作为一个优选的实施例,上述的音视频同步测试的系统,所述ΔT的值为0时,所述判断装置输出的所述音视频同步性能参数为所述待 测设备的音视频同步性能最优参数;
所述ΔT的值为负值时,所述判断装置输出的所述音视频同步性能参数为所述待测设备同时传输的视频信号滞后音频信号|ΔT|毫秒;以及
所述ΔT的值为正值时,所述判断装置输出的所述音视频同步性能参数为所述待测设备同时传输的音频信号滞后视频信号ΔT毫秒;
其中,所述ΔT、所述T1及所述T2的时间单位均为毫秒。
作为一个优选的实施例,上述的音视频同步测试的系统,所述视频信号发生装置包括LED图形发生器,所述待测设备包括图像采集单元、视频传输单元和图像显示单元;所述视频延时测试装置包括视频采集单元、第一比较单元和第一计算单元;
其中,所述LED图形发生器产生所述图形测试信号,所述图像采集单元采集所述图形测试信号并通过所述视频传输单元传送至所述图形显示单元以显示传输图像测试信号,所述视频采集单元同时采集所述LED图形发生器当前所生成的实时图像测试信号和所述图像显示单元当前所显示的传输图像测试信号,所述第一比较单元将采集的所述实时图像测试信号和所述传输图像测试信号进行比对,所述第一计算单元根据所述第一比较单元输出的比较结果获取并输出所述视频延时时间至所述处理装置。
作为一个优选的实施例,上述的音视频同步测试的系统,所述LED图形发生器包括至少一行沿直线排列的N个LED灯;且所述N个LED灯沿其延伸的同一方向上以频率f依次点亮每个LED灯,且 每个所述LED灯的点亮时间为1/(N*f),所述视频延时时间为:
Figure PCTCN2017086916-appb-000001
T3=0,n=0;
其中,T3为所述视频延时时间,N为大于或等于5的正整数,n为所述实时图像测试信号的脉冲图形中点亮的LED与所述传输图像测试信号的脉冲图形点亮的LED之间的序号差,所述n为自然数,且所述T3的误差范围在
Figure PCTCN2017086916-appb-000002
作为一个优选的实施例,上述的音视频同步测试的系统,所述LED灯之间点亮的频率f的值大于或等于人眼所能分辨的帧率的值,所述图像采集单元的采集图形的帧率的值大于所述频率f的值。
作为一个优选的实施例,上述的音视频同步测试的系统,所述音频信号发生装置包括音频信号发生器,所述待测设备还包括声音采集单元、音频传输单元和声音播放单元;所述音频延时测试装置包括音频采集单元和音频分析单元;
其中,所述音频信号发生器产生所述声音测试信号,所述声音采集单元采集所述声音测试信号并通过所述音频传输单元传送至所述声音播放单元以输出传输声音测试信号,所述音频采集单元同时采集所述音频信号发生器当前所生成的实时声音测试信号和所述声音播放单元当前所输出的传输声音测试信号;所述音频分析单元对所述音频采集单元采集的所述实时声音测试信号和所述传输声音测试信号进行分析处理后,输出所述音频延时时间至所述处理装置。
作为一个优选的实施例,上述的音视频同步测试的系统,所述音频信号发生装置还包括仿真嘴,以对所述音频信号发生器生成的所述声音测试信号进行均衡,且所述声音采集单元采集均衡后的所述声音测试信号;以及
所述音频采集单元包括自由场传声器,所述音频分析单元包括音频分析仪,所述音频分析仪通过对所述自由场传声器所采集的所述传输声音测试信号输出所述音频延时时间。
本申请还提供了一种音视频同步测试的方法,可应用于对传输播放音视频的待测设备的性能测试中,尤其是针对音视频同步性能要求较高的诸如实时音视频传输播放设备中,所述方法可包括:
于一视频信号发生装置产生图像测试信号后,利用所述待测设备采集并传送所述图像测试信号至该待测设备的第一输出端,以输出传输图像测试信号;
利用一视频延时测试装置分别采集并对比所述待测设备所输出的所述传输图像测试信号和所述视频信号发生装置当前所产生的实时图像测试信号,以获取所述待测设备的视频延时时间;
于一音频信号发生装置产生声音测试信号后,利用所述待测设备采集并传送所述声音测试信号至该待测设备的第二输出端,以输出传输声音测试信号;
利用一音频延时测试装置分别采集并对比所述待测设备所输出的所述传输声音测试信号和所述音频信号发生装置当前所产生的实时声音测试信号,以获取所述待测设备的音频延时时间;
利用一处理装置接收并计算出所述视频延时时间与所述音频延时时间之间的时间差。
作为一个优选的实施例,上述的音视频同步测试的系统,还可包括:
提供一预存有时间差与音视频同步性能参数对应的数据表的判断装置;
利用所述判断装置接收并根据所述时间差从所述数据表中调取与该时间差所对应的音视频同步性能参数进行输出。
作为一个优选的实施例,上述的音视频同步测试的系统,所述处理装置以所述音频延时时间为基准计算并输出所述时间差。
作为一个优选的实施例,上述的音视频同步测试的系统,所述处理装置根据公式ΔT=T1-T2计算所述时间差;
其中,所述ΔT为所述时间差,所述T1为所述音频延时时间,所述T2为所述视频延时时间,且所述ΔT、所述T1及所述T2的时间单位均相同。
作为一个优选的实施例,上述的音视频同步测试的系统,所述方法中:
当所述ΔT的值为0时,所述判断装置输出所述待测设备的音视频同步性能最优参数;
当所述ΔT的值为负值时,所述判断装置输出所述待测设备同时传输的视频信号滞后音频信号|ΔT|毫秒;以及
当所述ΔT的值为正值时,所述判断装置输出所述待测设备同时 传输的音频信号滞后视频信号ΔT毫秒;
其中,所述ΔT、所述T1及所述T2的时间单位均为毫秒。
作为一个优选的实施例,上述的音视频同步测试的系统,所述视频信号发生装置包括LED图形发生器,所述待测设备包括图像采集单元、视频传输单元和图像显示单元;所述视频延时测试装置包括视频采集单元、第一比较单元和第一计算单元;
其中,所述LED图形发生器产生所述图形测试信号,所述图像采集单元采集所述图形测试信号并通过所述视频传输单元传送至所述图形显示单元以显示传输图像测试信号,所述视频采集单元同时采集所述LED图形发生器当前所生成的实时图像测试信号和所述图像显示单元当前所显示的传输图像测试信号,所述第一比较单元将采集的所述实时图像测试信号和所述传输图像测试信号进行比对,所述第一计算单元根据所述第一比较单元输出的比较结果获取并输出所述视频延时时间至所述处理装置。
作为一个优选的实施例,上述的音视频同步测试的系统,所述LED图形发生器包括至少一行沿直线排列的N个LED灯;且所述N个LED灯沿其延伸的同一方向上以频率f依次点亮每个LED灯,且每个所述LED灯的点亮时间为1/(N*f),所述视频延时时间为:
Figure PCTCN2017086916-appb-000003
T3=0,n=0;
其中,T3为所述视频延时时间,N为大于或等于5的正整数,n为所述实时图像测试信号的脉冲图形中点亮的LED与所述传输图像测 试信号的脉冲图形点亮的LED之间的序号差,所述n为自然数,且所述T3的误差范围在
Figure PCTCN2017086916-appb-000004
作为一个优选的实施例,上述的音视频同步测试的系统,所述LED灯之间点亮的频率f的值大于或等于人眼所能分辨帧率的值,所述图像采集单元的采集图形的帧率的值大于所述频率f的值。
作为一个优选的实施例,上述的音视频同步测试的系统,所述音频信号发生装置包括音频信号发生器,所述待测设备还包括声音采集单元、音频传输单元和声音播放单元;所述音频延时测试装置包括音频采集单元和音频分析单元;
其中,所述音频信号发生器产生所述声音测试信号,所述声音采集单元采集所述声音测试信号并通过所述音频传输单元传送至所述声音播放单元以输出传输声音测试信号,所述音频采集单元同时采集所述音频信号发生器当前所生成的实时声音测试信号和所述声音播放单元当前所输出的传输声音测试信号;所述音频分析单元对所述音频采集单元采集的所述实时声音测试信号和所述传输声音测试信号进行分析处理后,输出所述音频延时时间至所述处理装置。
作为一个优选的实施例,上述的音视频同步测试的系统,所述音频信号发生装置还包括仿真嘴,以对所述音频信号发生器生成的所述声音测试信号进行均衡,且所述声音采集单元采集均衡后的所述声音测试信号;以及
所述音频采集单元包括自由场传声器,所述音频分析单元包括音频分析仪,所述音频分析仪通过对所述自由场传声器所采集的所述传 输声音测试信号输出所述音频延时时间。
与现有技术相比,本发明的优点是:
本申请中的音视频同步测试的系统及方法,可应用于对音视频同步传输播放设备的同步测试中,尤其可简单、精准地实现对音视频同步性能要求较高的实时的音视频传输播放系统(如楼宇安防系统、视频通讯系统等)的性能测试,即可通过分别利用视频延时测试装置及音频延时测试装置,可简单、精准、易实施地获取音视频同步传输播放设备的视频延时时间及音频延时时间,并对获取的视频延时时间及音频延时时间进行比较后就能精确地获悉该音视频同步传输播放设备的音视频同步性能。
附图说明
图1为本发明实施例中音视频同步测试的系统的结构示意图;
图2为本发明实施例中视频延时测试的结构示意图;
图3为本发明实施例中音频延时测试的结构示意图;
图4为本发明实施例中对待测楼宇可视对讲设备进行音频延时测试的结构示意图;
图5为本发明实施例中音频延时的CSS测试信号组;
图6为本发明实施例中音视频同步测试的方法的流程示意图。
具体实施方式
下面结合附图和具体实施例对本发明作进一步说明,但不作为本发明的限定。
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有付出创造性劳动的前提下所获得的所有其他实施方式,都属于本发明保护的范围。
需要说明的是,在不冲突的情况下,本发明中的实施例及实施例中的特征可以相互组合。
下面结合附图和具体实施例对本发明作进一步说明,但不作为本发明的限定。
图1为本发明实施例中音视频同步测试的系统的结构示意图,如图1所示,本实施例提供了一种音视频同步测试的系统,可应用于对传输播放音视频的待测设备(如影音设备等)的性能测试中,尤其可针对实时远程音视频通讯(例如居家安防设备中的楼宇可视对讲设备)等对于音视频同步性能要求较高的设备中进行的音视频同步性能测试,该系统可包括:
视频信号发生装置,可临近待测设备设置,以便于待测设备获取该视频信号发生装置所显示的图形测试信号;该视频信号发生装置可产生图像测试信号(一般可通过显示动态的光影显示信息),如可通过显示屏或LED灯矩阵等来产生上述的图形测试信号;在视频信号发生装置产生上述的图形测试信号后,利用上述的待测设备采集并传 送该图像测试信号至该待测设备的第一输出端(图像输出端,如显示屏等),进而输出传输图像测试信号(即经待测设备采集并传送后所输出的图像信息);
视频延时(Video Delay)测试装置,则临近上述的视频信号发生装置及待测设备的第一输出端,以便于分别采集待测设备所输出的传输图像测试信号及视频信号发生装置当前所产生的实时图像测试信号,并将所采集的传输图像测试信号与实时图像测试信号进行比对和/或运算,进而获取并输出待测设备的视频延时时间;
音频信号发生装置,同样可临近待测设备设置,以便于该待测设备采集音频信号发生装置所产生的声音测试信号;待测设备采集音频信号发生装置所产生的声音测试信号(一般可为利用喇叭模仿人的声音进行声音测试信号的生成)后,将采集到的声音测试信号传送至该待测设备的第二输出端(声音输出端,如音响设备等),以输出传输声音测试信号(即经待测设备采集并传送后所输出的声音信息);
音频延时(audio delay)测试装置,分别采集并对比待测设备所输出的传输声音测试信号和音频信号发生装置当前所产生的实时声音测试信号,以获取待测设备的音频延时时间;
处理装置,分别与视频延时测试装置及音频延时测试装置连接,以获取所述视频延时时间与所述音频延时时间之间的时间差。
具体的,上述的视频信号发生装置产生图像测试信号,而待测设备则采集该图形测试信号并进行传输后输送到远端的第一输出端,以输出传输图像测试信号后,视频延时测试装置则同时采集该传输图像 测试信号和此时视频信号发生装置所显示的实时图像测试信号并进行比对,即因为该待测设备传输图像测试信号需要花费一定的时间,进而会造成该待测设备所输出的传输图像测试信号并不是当前视频信号发生装置所显示的实时图像测试信号,所以待测设备传输图像测试信号有一定的延时,而通过将采集的传输图像测试信号与实时图像测试信号进行比较而获取的差异信号,可利用视频信号发生装置产生图像测试信号的规则就能获取该待测设备的视频延时时间;同样的,上述的音频信号发生装置产生声音测试信号,而待测设备则采集该声音测试信号并进行传输后输送到远端的第二输出端,以输出传输声音测试信号后,音频延时测试装置则同时采集该传输声音测试信号和此时音频信号发生装置所播放的实时声音测试信号并进行比对,即因为该待测设备传输声音测试信号也需要花费一定的时间,进而会造成该待测设备所输出的传输声音测试信号也并不是当前音频信号发生装置所产生的实时声音测试信号,所以待测设备传输声音测试信号也有一定的延时,而通过将采集的传输声音测试信号与实时声音测试信号进行比较而获取的声音差异信号,可利用音频信号发生装置产生声音测试信号的规则就能获取该待测设备的音频延时时间(需要注意到的是,获取上述的视频延时时间的步骤与获取音频延时时间的步骤之间的顺序不分先后,甚至在互相不影响的条件下可同时进行);最后,利用处理装置获取上述的视频延时时间及音频延时时间进行比对运算后就能获取该待测设备同步传输音视频时所音频数据与视频数据之间的时间差,并可依据该时间差来判断该待测设备的音视频同步性 能。
优选的,上述的音视频同步测试的系统还可包括一个与处理模块连接的判断装置(该判断装置可与上述的处理模块集成为一体或能够实现该两个功能的同一个部件),且该判断装置中可预存有时间差与音视频同步性能参数对应的数据表,这样判断装置就能根据接收到的时间差从数据表中快速地调取与该时间差所对应的音视频同步性能参数进行输出,进而及时的将待测设备的音视频同步性能快速、形象的输出。
优选的,为了较为方便地获取上述的时间差(因为视频数据一般相较于音频数据较大,故一般视频会在待测设备中滞后于音频的传输),一般可以音频延时时间为基准进行时间差的计算,如通过将音频延时时间减去视频延时时间就能知悉该待测设备的音视频同步性能中视频滞后音频多久了;当然,也可将视频延时时间作为基准进行时间差的计算,具体可以实际的测试需求而设定。
下面就以音频延时时间为基准进行举例说明,在获取上述的时间差时,处理装置可根据公式ΔT=T1-T2计算所述时间差;ΔT为时间差,T1为音频延时时间,T2为视频延时时间,且ΔT、T1及T2的时间单位均相同;例如,若获取的ΔT的值为0时,则说明该待测设备的音视频传输延时时间相同;相应的,判断装置就会输出的音视频同步性能参数为待测设备的音视频同步性能最优参数,即无论其延时时间为多少,只要相同,就说明该待测设备的音视频同步性能达到了理想状态,为最佳性能;而ΔT的值为负值时,就说明该待测设备传输音频的延 时时间小于传输视频的延时时间,即判断装置就会输出的音视频同步性能参数为待测设备同时传输的视频信号滞后音频信号|ΔT|毫秒;同时ΔT的值为正值时,则说明该待测设备传输音频的延时时间大于传输视频的延时时间,即判断装置输出的音视频同步性能参数为待测设备同时传输的音频信号滞后视频信号ΔT毫秒;其中,ΔT、T1及T2的时间单位可均为毫秒。
进一步的,如图2所示,上述的视频信号发生装置可包括LED图形发生器(如LED灯阵列等),待测设备则可包括图像采集单元、视频传输单元和图像显示单元等,而视频延时测试装置则可包括视频采集单元、第一比较单元和第一计算单元等;利用LED图形发生器产生图形测试信号,而图像采集单元则采集图形测试信号并通过视频传输单元传送至图形显示单元以显示传输图像测试信号,视频采集单元同时采集LED图形发生器当前所生成的实时图像测试信号和图像显示单元当前所显示的传输图像测试信号,第一比较单元再将采集的实时图像测试信号和传输图像测试信号进行比对并输出比对结果,而第一计算单元则根据第一比较单元输出的比较结果获取并输出视频延时时间至上述处理装置。
优选的,上述LED图形发生器可包括至少一行沿直线排列的N个LED灯(如N*N的LED灯阵列,可选一行LED灯作为测试使用);且N个LED灯沿其延伸的同一方向上以频率f依次点亮每个LED灯,且每个LED灯的点亮时间可为1/(N*f),而第一比较单元对采集的实时图像测试信号和传输图像测试信号进行比对,进而获取实时图像测 试信号与传输图像测试信号之间相差n个LED灯(即点亮的LED灯之间相差n个LED灯),而计算单就可以利用下面的公式获取视频延时时间,具体为:
Figure PCTCN2017086916-appb-000005
T3=0,n=0;
其中,T3为视频延时时间,N为大于或等于5的正整数,n为实时图像测试信号的脉冲图形中点亮的LED与传输图像测试信号的脉冲图形点亮的LED之间的序号差,n为自然数,且T3的误差范围应在
Figure PCTCN2017086916-appb-000006
之间。
优选的,相邻的LED灯之间点亮的频率f的值应大于或等于人眼所能分辨的帧率的值,且图像采集单元的采集图形的帧率的值也要大于频率f的值,否则就无法采集图形了。
进一步的,如图3所示,上述的音频信号发生装置包括音频信号发生器,待测设备还可包括声音采集单元、音频传输单元和声音播放单元;而音频延时测试装置可包括音频采集单元和音频分析单元;音频信号发生器产生声音测试信号,声音采集单元采集声音测试信号并通过音频传输单元传送至声音播放单元以输出传输声音测试信号,音频采集单元同时采集音频信号发生器当前所生成的实时声音测试信号和声音播放单元当前所输出的传输声音测试信号,音频分析单元将采集的实时声音测试信号和传输声音测试信号进行分析计算处理后输出音频延时时间至处理装置。
优选的,上述的音频信号发生装置还可包括仿真嘴,以对音频信号发生器生成的声音测试信号进行均衡,且声音采集单元采集均衡后的声音测试信号;以及音频采集单元包括自由场传声器,音频分析单元可包括音频分析仪,音频信号发生器还可具有显示实时图像测试信号的脉冲图形的相关仪器设备,音频分析仪通过自由场传声器采集的实时声音测试信号及传输声音测试信号的脉冲进行分析比对处理后,输出该待测设备的音频延时时间。
下面就以楼宇可视对讲设备的唇音同步(lip sync)测试为例进行详细说明,如图4所示,由于楼宇可视对讲设备中两个用户端需要对话通讯,故在待测楼宇可视对讲设备两个安装在挡板上的免提终端(需要注意的是,当被测的设备为手柄终端设备时,需去除图4中的挡板,并将仿真嘴及自由场传声器安装在LRGP(loudness rating guard-ring position,响度评定值保护环位置)头型架上进行音频延时测试)的处均设置有产生声音测试信号的音频信号发生器和仿真嘴,以及接受传输声音测试信号的自由场传声器和音频分析仪,即包括两路音频延时测试装置,每路音频延时测试装置均包括音频信号发生器、仿真嘴、自由场传声器和音频分析仪;为了避免两路音频延时测试装置之间相互干扰,可将位于同一免提终端的用以产生声音测试信号的音频信号发生器和仿真嘴与用以接收传输声音测试信号的自由场传声器和音频分析仪之间的线路夹角设置为大于或等于32°的夹角,而仿真嘴和/或自由场传声器与免提终端之间的距离大于或等于10cm且小于50cm(以避免距离近而导致干扰,距离远又使得免提终 端无法收集足够的声音进行测试,具体的值可依据实际设备的性能参数及测试需求而设定);较优的,为了避免两路音频延时测试装置之间的干扰,可分别异步对两路音频延时测试装置各进行一次测试,以确保每次测试结果的精确性;即每次仅在待测楼宇可视对讲设备的一免提终端设置音频信号发生器和仿真嘴,而在临近另一免提终端的位置处设置自由场传声器和音频分析仪。
为了阐述简便,下面就以一路音频延时测试装置进行说明,即先利用音频信号发生器生成包括多组(如四组)CSS(composite source signal,合成源信号)信号的测试单元信号,其中前几组(如前三组)CSS信号可用于训练以使得信道传输到达正常状态,并将剩余组(如第四组)CSS信号作为测试信号,即可利用第四组CSS信号的持续高电平信号部分用于延时测量;优选的,每组CSS信号的脉宽为248.62ms,相邻的CSS信号间隔101.38ms,这样每个测试单元信号长度为1298.62ms(上述具体的数值可依据实际测试需求而适应性的调整,在此仅作为示例进行说明);具体的,上述的音频信号发生器所产生的测试信号(如包括四组CSS信号的测试单元信号)经仿真嘴均衡后激励本地对讲终端(即待测楼宇可视对讲设备一免提终端中的麦克风)工作,并经传输网络传送至远端的对讲终端(即待测楼宇可视对讲设备一免提终端中的音响)进行播放,同时利用位于该免提终端位置处附近的自由场传声器对传输测试信号进行采集,以获取音频信号,通过利用音频分析仪就能比较判断出上述的诸如第四组CSS信号(即测试信号)与通过系统端对端传输采集到的信号之间的音频 延时时间。
本申请还提供了一种音视频同步测试的方法,可基于上述音视频同步测试的系统的基础上,应用于诸如对传输播放音视频等待测设备的性能测试中,尤其是针对音视频同步性能要求较高的诸如实时音视频传输播放设备中,所述方法可包括:
于一视频信号发生装置产生图像测试信号后,利用所述待测设备采集并传送所述图像测试信号至该待测设备的第一输出端,以输出传输图像测试信号;
利用一视频延时测试装置分别采集并对比所述待测设备所输出的所述传输图像测试信号和所述视频信号发生装置当前所产生的实时图像测试信号,以获取所述待测设备的视频延时时间;
于一音频信号发生装置产生声音测试信号后,利用所述待测设备采集并传送所述声音测试信号至该待测设备的第二输出端,以输出传输声音测试信号;
利用一音频延时测试装置分别采集并对比所述待测设备所输出的所述传输声音测试信号和所述音频信号发生装置当前所产生的实时声音测试信号,以获取所述待测设备的音频延时时间;
利用一处理装置接收并计算出所述视频延时时间与所述音频延时时间之间的时间差。
进一步的,上述的方法还可包括:
提供一预存有时间差与音视频同步性能参数对应的数据表的判 断装置;
利用所述判断装置接收并根据所述时间差从所述数据表中调取与该时间差所对应的音视频同步性能参数进行输出。
需要注意的是,上述获取视频延时时间的步骤与获取音频延时时间的步骤之间的先后顺序可以颠倒,也可在不影响测试结果的前提下同时进行。
由于本实施例中的音视频同步测试的方法可基于上述实施例中一种音视频同步测试的系统的基础上进行,及其相互之间可为相互对应的产品及方法,故两实施例之间相同或近似的技术特征均可相互的适用,而为了阐述简便,在此不予赘述,但其不应理解为对本申请的限制,只要本领域技术人员在看到两者中任何一个在不付出创造性劳动时所能延伸的技术方案,均应在本申请所记载的范围内。
另外,上述有关楼宇可视对讲设备的实施例中所指的音频延时一般为音频信号由本地对讲终端嘴参考点发送,并经楼宇可视对讲系统各部件后,至远端对讲终端的耳参考点接收所需要的单向传输时间;而视频延时则一般为由本地对讲终端摄像头摄取,并经楼宇可视对讲系统各部件后,至远端对讲终端的显示装置显示同一帧所需要的单向传输时间;而唇音同步则为楼宇可视对讲系统传输声音和图像信号间的时域关系,其值可用于描述声音和图像信号的同步关系,可用时间差范围表示(其他实施例方式也可参照上述语义进行定义);符合标准的楼宇可视对讲设备中音频延时及视频延时均应不大于300ms,而视频帧率应不低于20fps,而待测设备进行唇音同步过程中,音频延 时时间与视频延时时间之间的时间差范围为-90ms~+185ms,也即系统在同时传输音视频信号时,输出的音频信号超前相对应的视频信号的时间应不大于90ms,滞后时间应不大于185ms。
同时,在对楼宇可视对讲设备进行上述音视频同步性能测试时,进行音频延时测试时测试境噪声不应超过40dB,而在进行视频延时测试,EUT(equipment under test,受试设备)即可视对讲设备应处于正常图像传输状态,且当EUT的音频通讯不能切断需要进行音视频同步测试时则必须保证其相互之间不会引起视频特性参数和/或音频特性参数的改变。
例如,在进行视频帧率、视频延时、唇音同步测试可使用LED图像发生器作为信号源,而LED图像信号发生器则应置于视频捕捉设备的正前方,并可通过调整LED矩阵与视频接受设备之间的距离使得视频接受设备屏幕上的LED矩阵图像在横轴或纵轴上充满屏幕,且视频接受设备屏幕上的图像还需应能聚焦。
优选的,上述LED图形发生器可由10×10的LED阵列和编程控制器组成,该LED阵列可被用于产生测量用的图形图案,即可以单个LED为基本单元,在纵向或横向通过依次点亮每个LED灯以使得呈现的图像做连续滚动,而编程控制器可按设定频率控制LED阵列产生变化的图案。LED图形发生器还可具有LED位置信息显示功能,并具有输出2个下降沿同步信号的输出接口。另外,为防止音频对试验的影响,可使得LED图形发生器同步控制A、B两组相同的LED阵列,即A组LED阵列置于VCU摄像头的正前方,作为测试 信号;而B组LED阵列置于对讲终端显示屏端,以便于数字照相机能摄取。同时,数码摄像机可调整视频捕捉帧率(视频延时测试中使用)并能在接受到出发信号后捕捉静态图像(唇音同步中使用)。
优选的,在进行视频帧率试验时,可选择LED阵列中间区域的一行10个LED灯组产生的连续变化图像作为测试信号,激活音视频通讯;然后,从1Hz起,由低向高逐渐调节LED图形发生器的设置频率(每个LED每秒激活1次,闪烁时间为1/(10f)s),使LED灯组按设置循环依次以水平方向滚动点亮。在被测对讲终端显示屏观察经系统传输后得到的输出影像,当观察到测试用10个LED灯组影像保持稳定不滚动时停止调整,观察当该状态保持时间大于10s,即确认状态有效。最后,记录此时LED图形发生器的发生频率即为被检测IP BIS的帧率(fr)。
进一步的,将系统的帧率(fr)即f作为LED图形发生器的设置频率,并以LED图形发生器控制LED阵列产生的连续变化图像作为测试信号。然后,利用视频采集装置(VCD)抓拍到LED图像发生器A产生的图像,数字摄像机每一帧应能同时捕捉视频接受设备屏幕上的图像以及LED信号发生器B上的图像,同时采集3秒。之后,比较LED图像信号发生器B上所激活的LED灯位置和相同帧中视频接受设备屏幕上激活的LED灯的位置,计算二者间相隔n个LED灯,进而获取视频延时时间。最后,对所测得的视频延时值做统计处理,将其90%置信区间的上限作为的视频延时Td。
需要注意的是,如果BIS可以静音或者将音量调小以避免啸叫, 该测试可以使用LED矩阵上1个LED灯与视频接受设备相邻一起置于视频捕捉设备前方进行测试。这将会替代上述测试方法中使用LED图像发生器A和B进行测试。
优选的,在进行唇音同步测试时,可将音频延时和视频延时测试同步进行,试验条件和试验方法应同时满足音频延时和视频延时要求:
首先,音频信号发生器应该播放连续的CSS信号直到测试结束。前3组CSS信号用于训练,第4组信号用于进行音频延时测试(如7.3.3中描述)。在第4组信号高频信号结束,下降沿开始时(即CSS信号的248.62ms),数字摄像机开始捕捉每一帧图像。所以每一组CSS信号第248.62ms的时候都会测量一次音频延时和视频延时。
其次,该测试应持续10s(28次测试),并用视频延时方法计算视频延时,然后控制音频分析仪计算音频延时时间,最后将同时捕捉的视频延时值和音频延时值进行计算:
唇音同步=视频延时-音频延时(如为负值,则说明音频比视频延时大);
同样的,对所测得的唇音同步值做统计处理,将其90%置信区间的下限和上限分别作为前向和后向唇音同步测量值。
综上所述,可应用于对音视频同步传输播放设备的同步测试中,尤其可简单、精准地实现对音视频同步性能要求较高的实时的音视频传输播放系统(如楼宇安防系统、视频通讯系统等)的性能测试,即 可通过分别利用视频延时测试装置及音频延时测试装置,可简单、精准、易实施地获取音视频同步传输播放设备的视频延时时间及音频延时时间,并对获取的视频延时时间及音频延时时间进行比较后就能精确地获悉该音视频同步传输播放设备的音视频同步性能。
以上所述仅为本发明较佳的实施例,并非因此限制本发明的实施方式及保护范围,对于本领域技术人员而言,应当能够意识到凡运用本发明说明书及图示内容所作出的等同替换和显而易见的变化所得到的方案,均应当包含在本发明的保护范围内。

Claims (20)

  1. 一种音视频同步测试的系统,其特征在于,应用于对传输播放音视频的待测设备的性能测试中,所述系统包括:
    视频信号发生装置,临近所述待测设备设置,所述视频信号发生装置产生图像测试信号;所述待测设备采集并传送所述图像测试信号至该待测设备的第一输出端,以输出传输图像测试信号;
    视频延时测试装置,分别采集并对比所述待测设备所输出的所述传输图像测试信号和所述视频信号发生装置当前所产生的实时图像测试信号,以获取所述待测设备的视频延时时间;
    音频信号发生装置,临近所述待测设备设置,所述音频信号发生装置产生声音测试信号;所述待测设备采集并传送所述声音测试信号至该待测设备的第二输出端,以输出传输声音测试信号;
    音频延时测试装置,分别采集并对比所述待测设备所输出的所述传输声音测试信号和所述音频信号发生装置当前所产生的实时声音测试信号,以获取所述待测设备的音频延时时间;
    处理装置,分别与所述视频延时测试装置及所述音频延时测试装置连接,以获取所述视频延时时间与所述音频延时时间之间的时间差。
  2. 如权利要求1所述的音视频同步测试的系统,其特征在于,还包括:
    判断装置,与所述处理模块连接;
    其中,所述判断装置中预存有时间差与音视频同步性能参数对应 的数据表,所述判断装置接收并根据所述时间差从所述数据表中调取与该时间差所对应的音视频同步性能参数进行输出。
  3. 如权利要求2所述的音视频同步测试的系统,其特征在于,所述处理装置以所述音频延时时间为基准获取所述时间差。
  4. 如权利要求3所述的音视频同步测试的系统,其特征在于,所述处理装置根据公式ΔT=T1-T2计算所述时间差;
    其中,所述ΔT为所述时间差,所述T1为所述音频延时时间,所述T2为所述视频延时时间,且所述ΔT、所述T1及所述T2的时间单位均相同。
  5. 如权利要求4所述的音视频同步测试的系统,其特征在于,所述ΔT的值为0时,所述判断装置输出的所述音视频同步性能参数为所述待测设备的音视频同步性能最优参数;
    所述ΔT的值为负值时,所述判断装置输出的所述音视频同步性能参数为所述待测设备同时传输的视频信号滞后音频信号|ΔT|毫秒;以及
    所述ΔT的值为正值时,所述判断装置输出的所述音视频同步性能参数为所述待测设备同时传输的音频信号滞后视频信号ΔT毫秒;
    其中,所述ΔT、所述T1及所述T2的时间单位均为毫秒。
  6. 如权利要求1所述的音视频同步测试的系统,其特征在于, 所述视频信号发生装置包括LED图形发生器,所述待测设备包括图像采集单元、视频传输单元和图像显示单元;所述视频延时测试装置包括视频采集单元、第一比较单元和第一计算单元;
    其中,所述LED图形发生器产生所述图形测试信号,所述图像采集单元采集所述图形测试信号并通过所述视频传输单元传送至所述图形显示单元以显示传输图像测试信号,所述视频采集单元同时采集所述LED图形发生器当前所生成的实时图像测试信号和所述图像显示单元当前所显示的传输图像测试信号,所述第一比较单元将采集的所述实时图像测试信号和所述传输图像测试信号进行比对,所述第一计算单元根据所述第一比较单元输出的比较结果获取并输出所述视频延时时间至所述处理装置。
  7. 如权利要求6所述的音视频同步测试的系统,其特征在于,所述LED图形发生器包括至少一行沿直线排列的N个LED灯;且所述N个LED灯沿其延伸的同一方向上以频率f依次点亮每个LED灯,且每个所述LED灯的点亮时间为1/(N*f),所述视频延时时间为:
    Figure PCTCN2017086916-appb-100001
    T3=0,n=0;
    其中,T3为所述视频延时时间,N为大于或等于5的正整数,n为所述实时图像测试信号的脉冲图形中点亮的LED与所述传输图像测试信号的脉冲图形点亮的LED之间的序号差,所述n为自然数,且 所述T3的误差范围在
    Figure PCTCN2017086916-appb-100002
  8. 如权利要求7所述的音视频同步测试的系统,其特征在于,所述LED灯之间点亮的频率f的值大于或等于人眼分辨帧率的值,所述图像采集单元的采集图形的帧率的值大于所述频率f的值。
  9. 如权利要求1所述的音视频同步测试的系统,其特征在于,所述音频信号发生装置包括音频信号发生器,所述待测设备还包括声音采集单元、音频传输单元和声音播放单元;所述音频延时测试装置包括音频采集单元和音频分析单元;
    其中,所述音频信号发生器产生所述声音测试信号,所述声音采集单元采集所述声音测试信号并通过所述音频传输单元传送至所述声音播放单元以输出传输声音测试信号,所述音频采集单元同时采集所述音频信号发生器当前所生成的实时声音测试信号和所述声音播放单元当前所输出的传输声音测试信号;所述音频分析单元对所述音频采集单元采集的所述实时声音测试信号和所述传输声音测试信号进行分析处理后,输出所述音频延时时间至所述处理装置。
  10. 如权利要求9所述的音视频同步测试的系统,其特征在于,所述音频信号发生装置还包括仿真嘴,以对所述音频信号发生器生成的所述声音测试信号进行均衡,且所述声音采集单元采集均衡后的所述声音测试信号;以及
    所述音频采集单元包括自由场传声器,所述音频分析单元包括音 频分析仪,所述音频分析仪通过对所述自由场传声器所采集的所述传输声音测试信号输出所述音频延时时间。
  11. 一种音视频同步测试的方法,其特征在于,应用于对传输播放音视频的待测设备的性能测试中,所述方法包括:
    于一视频信号发生装置产生图像测试信号后,利用所述待测设备采集并传送所述图像测试信号至该待测设备的第一输出端,以输出传输图像测试信号;
    利用一视频延时测试装置分别采集并对比所述待测设备所输出的所述传输图像测试信号和所述视频信号发生装置当前所产生的实时图像测试信号,以获取所述待测设备的视频延时时间;
    于一音频信号发生装置产生声音测试信号后,利用所述待测设备采集并传送所述声音测试信号至该待测设备的第二输出端,以输出传输声音测试信号;
    利用一音频延时测试装置分别采集并对比所述待测设备所输出的所述传输声音测试信号和所述音频信号发生装置当前所产生的实时声音测试信号,以获取所述待测设备的音频延时时间;
    利用一处理装置接收并计算出所述视频延时时间与所述音频延时时间之间的时间差。
  12. 如权利要求11所述的音视频同步测试的方法,其特征在于,还包括:
    提供一预存有时间差与音视频同步性能参数对应的数据表的判断装置;
    利用所述判断装置接收并根据所述时间差从所述数据表中调取与该时间差所对应的音视频同步性能参数进行输出。
  13. 如权利要求12所述的音视频同步测试的方法,其特征在于,所述处理装置以所述音频延时时间为基准计算并输出所述时间差。
  14. 如权利要求13所述的音视频同步测试的方法,其特征在于,所述处理装置根据公式ΔT=T1-T2计算所述时间差;
    其中,所述ΔT为所述时间差,所述T1为所述音频延时时间,所述T2为所述视频延时时间,且所述ΔT、所述T1及所述T2的时间单位均相同。
  15. 如权利要求14所述的音视频同步测试的方法,其特征在于,所述方法中:
    当所述ΔT的值为0时,所述判断装置输出所述待测设备的音视频同步性能最优参数;
    当所述ΔT的值为负值时,所述判断装置输出所述待测设备同时传输的视频信号滞后音频信号|ΔT|毫秒;以及
    当所述ΔT的值为正值时,所述判断装置输出所述待测设备同时传输的音频信号滞后视频信号ΔT毫秒;
    其中,所述ΔT、所述T1及所述T2的时间单位均为毫秒。
  16. 如权利要求11所述的音视频同步测试的方法,其特征在于,所述视频信号发生装置包括LED图形发生器,所述待测设备包括图像采集单元、视频传输单元和图像显示单元;所述视频延时测试装置包括视频采集单元、第一比较单元和第一计算单元;
    其中,所述LED图形发生器产生所述图形测试信号,所述图像采集单元采集所述图形测试信号并通过所述视频传输单元传送至所述图形显示单元以显示传输图像测试信号,所述视频采集单元同时采集所述LED图形发生器当前所生成的实时图像测试信号和所述图像显示单元当前所显示的传输图像测试信号,所述第一比较单元将采集的所述实时图像测试信号和所述传输图像测试信号进行比对,所述第一计算单元根据所述第一比较单元输出的比较结果获取并输出所述视频延时时间至所述处理装置。
  17. 如权利要求16所述的音视频同步测试的方法,其特征在于,所述LED图形发生器包括至少一行沿直线排列的N个LED灯;且所述N个LED灯沿其延伸的同一方向上以频率f依次点亮每个LED灯,且每个所述LED灯的点亮时间为1/(N*f),所述视频延时时间为:
    Figure PCTCN2017086916-appb-100003
    T3=0,n=0;
    其中,T3为所述视频延时时间,N为大于或等于5的正整数,n为 所述实时图像测试信号的脉冲图形中点亮的LED与所述传输图像测试信号的脉冲图形点亮的LED之间的序号差,所述n为自然数,且所述T3的误差范围在
    Figure PCTCN2017086916-appb-100004
  18. 如权利要求17所述的音视频同步测试的方法,其特征在于,所述LED灯之间点亮的频率f的值大于或等于人眼所能分辨帧率的值,所述图像采集单元的采集图形的帧率的值大于所述频率f的值。
  19. 如权利要求11所述的音视频同步测试的方法,其特征在于,所述音频信号发生装置包括音频信号发生器,所述待测设备还包括声音采集单元、音频传输单元和声音播放单元;所述音频延时测试装置包括音频采集单元和音频分析单元;
    其中,所述音频信号发生器产生所述声音测试信号,所述声音采集单元采集所述声音测试信号并通过所述音频传输单元传送至所述声音播放单元以输出传输声音测试信号,所述音频采集单元同时采集所述音频信号发生器当前所生成的实时声音测试信号和所述声音播放单元当前所输出的传输声音测试信号;所述音频分析单元对所述音频采集单元采集的所述实时声音测试信号和所述传输声音测试信号进行分析处理后,输出所述音频延时时间至所述处理装置。
  20. 如权利要求19所述的音视频同步测试的方法,其特征在于,所述音频信号发生装置还包括仿真嘴,以对所述音频信号发生器生成的所述声音测试信号进行均衡,且所述声音采集单元采集均衡后的所 述声音测试信号;以及
    所述音频采集单元包括自由场传声器,所述音频分析单元包括音频分析仪,所述音频分析仪通过对所述自由场传声器所采集的所述传输声音测试信号输出所述音频延时时间。
PCT/CN2017/086916 2016-06-03 2017-06-02 一种音视频同步测试的系统及方法 WO2017206935A1 (zh)

Priority Applications (2)

Application Number Priority Date Filing Date Title
ES17805880T ES2921057T3 (es) 2016-06-03 2017-06-02 Sistema y método para realizar pruebas de sincronización de audio y vídeo
EP17805880.6A EP3468183B1 (en) 2016-06-03 2017-06-02 System and method for audio and video synchronization test

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610392099.3A CN106060534A (zh) 2016-06-03 2016-06-03 一种音视频同步测试的系统及方法
CN201610392099.3 2016-06-03

Publications (1)

Publication Number Publication Date
WO2017206935A1 true WO2017206935A1 (zh) 2017-12-07

Family

ID=57170321

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/086916 WO2017206935A1 (zh) 2016-06-03 2017-06-02 一种音视频同步测试的系统及方法

Country Status (4)

Country Link
EP (1) EP3468183B1 (zh)
CN (1) CN106060534A (zh)
ES (1) ES2921057T3 (zh)
WO (1) WO2017206935A1 (zh)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112382313A (zh) * 2020-12-02 2021-02-19 公安部第三研究所 一种音频通讯质量评价系统及方法
CN112752095A (zh) * 2020-12-29 2021-05-04 平安普惠企业管理有限公司 测试ai视频测试通话数据的方法、装置、设备和存储介质
CN113220517A (zh) * 2021-05-28 2021-08-06 Oppo广东移动通信有限公司 操作耗时测试系统、信号处理设备及信号处理方法
CN114039890A (zh) * 2021-11-04 2022-02-11 国家工业信息安全发展研究中心 一种语音识别时延测试方法、系统及存储介质
CN114710687A (zh) * 2022-03-22 2022-07-05 阿里巴巴(中国)有限公司 音视频同步方法、装置、设备及存储介质
CN114915574A (zh) * 2021-12-17 2022-08-16 天翼数字生活科技有限公司 一种通过声音自动检测智能门铃响应延迟的方法和系统
CN115035920A (zh) * 2021-03-04 2022-09-09 漳州立达信光电子科技有限公司 音乐照明同步装置、系统、方法、终端和可读存储介质
CN116915978A (zh) * 2023-08-07 2023-10-20 昆易电子科技(上海)有限公司 触发时间确定方法、数据采集系统、车辆以及工控机

Families Citing this family (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106060534A (zh) * 2016-06-03 2016-10-26 公安部第三研究所 一种音视频同步测试的系统及方法
CN106851259B (zh) * 2017-01-17 2021-03-12 中国科学院上海高等研究院 监控系统中的视频延时测试装置
CN106899862A (zh) * 2017-02-23 2017-06-27 杭州当虹科技有限公司 计算视频信号时间偏移量的方法和系统
CN107277296B (zh) * 2017-07-03 2020-05-05 中国舰船研究设计中心 适用于气泡声学研究的音频、视频信号同步装置及方法
CN108270622A (zh) * 2018-01-22 2018-07-10 杭州当虹科技有限公司 一种对视频通讯服务系统进行并发仿真测试的方法及系统
CN110446103B (zh) * 2018-05-04 2021-08-31 腾讯科技(深圳)有限公司 一种音像测试方法、装置及存储介质
CN109144642B (zh) * 2018-08-14 2022-02-18 Oppo广东移动通信有限公司 显示控制方法、装置、电子设备及存储介质
CN110636280A (zh) * 2019-09-18 2019-12-31 中铁检验认证中心有限公司 一种用于测试数字摄像机视音频失步时间的系统
CN111092667B (zh) * 2019-12-18 2023-09-01 公安部第三研究所 一种对讲终端音频建立时间的测试方法及测试系统
CN111277823A (zh) * 2020-03-05 2020-06-12 公安部第三研究所 一种音视频同步测试的系统及方法
CN114125258B (zh) * 2020-08-26 2023-04-18 华为技术有限公司 视频处理方法及电子设备
CN111988647A (zh) * 2020-08-27 2020-11-24 广州视源电子科技股份有限公司 音画同步调整方法、装置、设备以及介质
CN112040225B (zh) * 2020-09-02 2022-08-05 广州市百果园信息技术有限公司 播放延时差测量方法、装置、设备、系统及存储介质
EP4024878A1 (en) * 2020-12-30 2022-07-06 Advanced Digital Broadcast S.A. A method and a system for testing audio-video synchronization of an audio-video player
CN113645544A (zh) * 2021-07-02 2021-11-12 武汉市聚芯微电子有限责任公司 一种播放控制方法、装置和电子设备
EP4203470A1 (en) * 2021-12-21 2023-06-28 Rohde & Schwarz GmbH & Co. KG System and method for evaluation of audio-video desynchronization
TWI813213B (zh) * 2022-03-22 2023-08-21 瑞昱半導體股份有限公司 影音訊號同步方法與影音同步處理系統
CN115243030B (zh) * 2022-06-14 2024-03-01 天翼数字生活科技有限公司 一种终端能力测试系统、方法、设备和存储介质

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101796812A (zh) * 2006-03-31 2010-08-04 莱切技术国际公司 唇形同步系统和方法
CN104023229A (zh) * 2014-06-23 2014-09-03 公安部第三研究所 非接触式影像系统性能检测方法及系统
CN105100794A (zh) * 2014-05-13 2015-11-25 深圳Tcl新技术有限公司 音视频同步测试方法及装置
CN106060534A (zh) * 2016-06-03 2016-10-26 公安部第三研究所 一种音视频同步测试的系统及方法

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100551087C (zh) * 2004-11-30 2009-10-14 南京Lg新港显示有限公司 数字电视接收机的声像同步测试方法及其装置
KR100584615B1 (ko) * 2004-12-15 2006-06-01 삼성전자주식회사 오디오/비디오 동기 자동 조정 장치 및 그 방법
CN101742357B (zh) * 2009-12-29 2012-10-24 北京牡丹电子集团有限责任公司 数字电视设备音视频同步误差的测量方法
US8525885B2 (en) * 2011-05-15 2013-09-03 Videoq, Inc. Systems and methods for metering audio and video delays
CN202218352U (zh) * 2011-09-09 2012-05-09 华南理工大学 非介入式双端采集的视频端到端时延测量装置
CN202309990U (zh) * 2011-09-09 2012-07-04 华南理工大学 非介入式单端采集的视频端到端时延测量装置
CN103313089A (zh) * 2012-03-16 2013-09-18 三洋科技中心(深圳)有限公司 音唇同步检测装置及方法
CN103676453A (zh) * 2012-09-11 2014-03-26 北京航天计量测试技术研究所 一种相机快门延时时间测量方法及其装置
CN102917245B (zh) * 2012-11-08 2014-11-26 冠捷显示科技(厦门)有限公司 一种基于光电转换器的音画同步测试装置及方法
CN103117072A (zh) * 2013-01-23 2013-05-22 广东欧珀移动通信有限公司 一种音视频同步测试装置及方法
CN103219029A (zh) * 2013-03-25 2013-07-24 广东欧珀移动通信有限公司 自动调节音视频同步的方法和系统
TWI496455B (zh) * 2013-04-10 2015-08-11 Wistron Corp 影音同步檢測裝置與方法
US8957972B2 (en) * 2013-05-21 2015-02-17 Avaya Inc. Automatic glass-to-glass video and A/V sync test tool
US20150062353A1 (en) * 2013-08-30 2015-03-05 Microsoft Corporation Audio video playback synchronization for encoded media

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101796812A (zh) * 2006-03-31 2010-08-04 莱切技术国际公司 唇形同步系统和方法
CN105100794A (zh) * 2014-05-13 2015-11-25 深圳Tcl新技术有限公司 音视频同步测试方法及装置
CN104023229A (zh) * 2014-06-23 2014-09-03 公安部第三研究所 非接触式影像系统性能检测方法及系统
CN106060534A (zh) * 2016-06-03 2016-10-26 公安部第三研究所 一种音视频同步测试的系统及方法

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP3468183A4 *

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112382313A (zh) * 2020-12-02 2021-02-19 公安部第三研究所 一种音频通讯质量评价系统及方法
CN112752095A (zh) * 2020-12-29 2021-05-04 平安普惠企业管理有限公司 测试ai视频测试通话数据的方法、装置、设备和存储介质
CN115035920A (zh) * 2021-03-04 2022-09-09 漳州立达信光电子科技有限公司 音乐照明同步装置、系统、方法、终端和可读存储介质
CN113220517A (zh) * 2021-05-28 2021-08-06 Oppo广东移动通信有限公司 操作耗时测试系统、信号处理设备及信号处理方法
CN113220517B (zh) * 2021-05-28 2023-01-10 Oppo广东移动通信有限公司 操作耗时测试系统、信号处理设备及信号处理方法
CN114039890A (zh) * 2021-11-04 2022-02-11 国家工业信息安全发展研究中心 一种语音识别时延测试方法、系统及存储介质
CN114039890B (zh) * 2021-11-04 2023-01-31 国家工业信息安全发展研究中心 一种语音识别时延测试方法、系统及存储介质
CN114915574A (zh) * 2021-12-17 2022-08-16 天翼数字生活科技有限公司 一种通过声音自动检测智能门铃响应延迟的方法和系统
CN114915574B (zh) * 2021-12-17 2024-01-09 天翼数字生活科技有限公司 一种通过声音自动检测智能门铃响应延迟的方法和系统
CN114710687A (zh) * 2022-03-22 2022-07-05 阿里巴巴(中国)有限公司 音视频同步方法、装置、设备及存储介质
CN114710687B (zh) * 2022-03-22 2024-03-19 阿里巴巴(中国)有限公司 音视频同步方法、装置、设备及存储介质
CN116915978A (zh) * 2023-08-07 2023-10-20 昆易电子科技(上海)有限公司 触发时间确定方法、数据采集系统、车辆以及工控机

Also Published As

Publication number Publication date
EP3468183B1 (en) 2022-04-06
CN106060534A (zh) 2016-10-26
ES2921057T3 (es) 2022-08-17
EP3468183A1 (en) 2019-04-10
EP3468183A4 (en) 2020-01-01

Similar Documents

Publication Publication Date Title
WO2017206935A1 (zh) 一种音视频同步测试的系统及方法
US8836910B2 (en) Light and sound monitor
US7020894B1 (en) Video and audio synchronization
CN103313089A (zh) 音唇同步检测装置及方法
CN106488226B (zh) 一种生产线上的自动化检测方法和装置
CN102724604B (zh) 一种视频会议的声音处理方法
CN104023229B (zh) 非接触式影像系统性能检测方法及系统
Lindau The perception of system latency in dynamic binaural synthesis
CN113692091B (zh) 设备控制方法、装置、终端设备及存储介质
CN112017693B (zh) 一种音频质量评估方法及装置
CN109618151A (zh) 电视机图像的自动测试方法
US7845233B2 (en) Sound sensor array with optical outputs
CN105910702B (zh) 一种基于相位补偿的异步头相关传输函数测量方法
CN111277823A (zh) 一种音视频同步测试的系统及方法
Maempel et al. Audio-visual interaction of size and distance perception in concert halls-a preliminary study
CN102917245B (zh) 一种基于光电转换器的音画同步测试装置及方法
KR20010106412A (ko) 음성 및 오디오신호의 실시간 품질 분석기
CN113593536A (zh) 一种检测语音识别准确率的装置和系统
Valente et al. Subjective expectation adjustments of early-to-late reverberant energy ratio and reverberation time to match visual environmental cues of a musical performance
US11956417B2 (en) Method of measuring video latency
US20220358932A1 (en) Information processing apparatus, information processing method, and program
JP3727736B2 (ja) 映像と音声の時間差検知用の映像信号・音声信号発生装置
JP2001326950A (ja) 回線時間差測定装置及び回線時間差測定装置用信号発生器
CN106412514B (zh) 一种视频处理的方法和装置
JPWO2010137166A1 (ja) 遅延検出装置、遅延検出方法

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17805880

Country of ref document: EP

Kind code of ref document: A1