GB2437122A - Method and apparatus for measuring audio/video sync delay - Google Patents

Method and apparatus for measuring audio/video sync delay Download PDF

Info

Publication number
GB2437122A
GB2437122A GB0607214A GB0607214A GB2437122A GB 2437122 A GB2437122 A GB 2437122A GB 0607214 A GB0607214 A GB 0607214A GB 0607214 A GB0607214 A GB 0607214A GB 2437122 A GB2437122 A GB 2437122A
Authority
GB
United Kingdom
Prior art keywords
audio
video
event
events
time
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
GB0607214A
Other versions
GB2437122B (en
GB0607214D0 (en
Inventor
Matthew Alan Bowers
Scott Griffiths
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tektronix International Sales GmbH
Original Assignee
Tektronix International Sales GmbH
VQUAL Ltd
Tektronix Bristol Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tektronix International Sales GmbH, VQUAL Ltd, Tektronix Bristol Ltd filed Critical Tektronix International Sales GmbH
Priority to GB0607214A priority Critical patent/GB2437122B/en
Publication of GB0607214D0 publication Critical patent/GB0607214D0/en
Publication of GB2437122A publication Critical patent/GB2437122A/en
Application granted granted Critical
Publication of GB2437122B publication Critical patent/GB2437122B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/434Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
    • H04N21/4341Demultiplexing of audio and video streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N17/00Diagnosis, testing or measuring for television systems or their details
    • H04N17/004Diagnosis, testing or measuring for television systems or their details for digital television systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/236Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
    • H04N21/2368Multiplexing of audio and video streams

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

A method of determining the delay between an audio and visual signal, the method comprising providing a video signal having a predetermined sequence of visual events, each visual event occurring at a predetermined point in the sequence, providing an audio signal having a predetermined sequence of audio events, each audio event occurring at a predetermined point in the sequence, wherein the audio and video events are in synchronicity with one another, digitally encoding the video and audio signals to generate encoded video and audio data streams, detecting the occurrence of each event in the encoded video and audio data streams and their relative time of occurrence and from the detected time of occurrence of the audio and visual events measuring any time delay between the encoded video and audio data streams.

Description

<p>I</p>
<p>METHOD AND APPARATUS FOR MEASURING AUDIO/VIDEO SYNC DELAY In the field of providing video content it is common to provide that content as digital data.</p>
<p>In the context of this application the term "video" shall be taken to refer to the combination of audio and visual information. A common requirement is to pass the digital video data through one or more encoding processes, for example prior to the broadcast transmission of the digital video data. The coding processes habitually involve data compression and the use of digital audio filters to process the audio signal. The encoding process may also typically involve multiplexing a plurality of separate data streams together. Since it will commonly be the case that the audio data will be processed differently from the image data and each of the different stages in the encoding process can potentially introduce a time delay to the digital data signal, the overall encoding process can potentially introduce a loss of synchronisation between the audio and image data, which will be most unnoticeable as a loss of lip-sync in video footage of the speaking characters. The human brain can perceive even quite small time delays between the video and image data, with the circumstances in which the audio signal leads the video signal being most noticeable. For this reason, the applicable encoding and transmission standards stipulate maximum time delays between the audio and video data. For example, according to some standards the audio signal must not lead the corresponding video signal by a time delay greater than 4Oms.</p>
<p>It is therefore advantageous to be able to determine in advance the precise amount of time delay any given encoding system is likely to introduce between the audio and visual parts of the overall video signal.</p>
<p>According to a first aspect of the present invention there is provided a method of determining the delay between an audio and visual signal, the method comprising: Providing a video signal having a predetermined sequence of visual events, each visual event occurring at a predetermined point in the sequence; Providing an audio signal having a predetermined sequence of audio events, each audio event occurring at a predetermined point in the sequence, wherein the audio and video events are in synchronicity with one another; Digitally encoding the video and audio signals to generate encoded video and audio data streams; Detecting the occurrence of each event in the encoded video and audio data streams and the relative time of occurrence; and From the detected time of occurrence of the audio and visual events measuring any time delay between the encoded video and audio data streams.</p>
<p>Preferably an alert may be generated if the measured time delay between the encoded audio and video data streams exceeds a predetermined value.</p>
<p>Preferably each event within both sequences of audio and visual events has a different duration. The duration of the events may sequentially increase through the sequence.</p>
<p>Preferably each event within both sequences of audio and visual events is spaced apart by a time periods of differing duration. The time periods between events may sequentially increase throughout the sequence of events.</p>
<p>Preferably each video event comprises illuminating a predetermined display sector.</p>
<p>Preferably each audio event comprises generating an audible tone.</p>
<p>According to a second aspect of the present invention there is provided apparatus for determining the delay between encoded audio and video signals, wherein the video signal has a predetermined sequence of visual events, each visual event occurring at a predetermined point in the sequence and the audio signal has a predetermined sequence of audio events, each audio event occurring at a predetermined point in the sequence, the audio and video events being in synchronicity with one another, the apparatus comprising; A video event detector arranged to detect each video event and provide a first time signal corresponding to each video event; An audio event detector arranged to detect each audio event and provide a second time signal corresponding to each audio event; and A time comparator arranged to receive the first and second time signals and to measure any time delay between the signals.</p>
<p>The apparatus preferably further comprises an alert generator arranged to receive the measured time delay from the time comparator and generate an alert if the time delay exceeds a predetermined value.</p>
<p>Embodiments of the present invention will now be described below, by way of illustrative example only, with reference to the accompanying figures, of which: Figure 1 schematically illustrates the timing and duration of audio and video events included in a test signal according to embodiments of the present invention; Figure 2 schematically illustrates a time delay analysis system for determining the time delays between the audio and video signals shown in Figure 1; and Figure 3 schematically illustrates the relative timings of an audio and video signal as shown in Figure 1 in which there is a delay between the audio and video signals.</p>
<p>In embodiments of the present invention any time delay between audio and video data subsequent to an encoding process being performed on the originally available audio and video data is determined utilising a predetermined video sequence having known timing properties. The video/audio data sequence is provided in either an unencoded data format, such as RAW, or in a "basic" standard encoded data format, such as for example MPEG.</p>
<p>The predetermined video sequence comprises a series of visible "flashes" having a predetermined duration and time interval between each flash. The sequence also comprises a corresponding number of audible tones whose duration and time interval between tones exactly corresponds to the occurrences of the visible flashes. An example of an appropriate timing diagram for the visible and audible signals is schematically illustrated in Figure 1. Tn Figure 1 the upper signal trace 2 represents the binary levels for the visible signal, with the signal either being totally black or totally white in visible appearance. The lower signal trace 4 represents the audible signal, with the upper signal level representing a production of an audible tone and the lower signal level representing the absence of a tone.</p>
<p>As can be seen from Figure 1, after an initial time period of 1 unit, for example I second, during which neither a visible flash is produced nor an audible tone produced, a visible flash and audible tone of duration of 1 unit is subsequently produced. This is followed by a further time period during which no visible flash or audible tone is produced, this second time period having a duration of 2 units. This is then followed by the production of a visible flash and audible tone having a duration of 2 units, followed by a period of no visible flash or audible tone of duration 3 units and so on in the sequence illustrated in Figure 1, the total sequence comprises five periods during which a visible flash and audible tone are produced, each period lasting one time unit longer than the preceding period, with correspondingly increasing time periods in between during which no visible flash or audible tone is produced. In the example shown therefore the entire sequence lasts for a total of 30 time units, which will typically be 30 seconds. The entire sequence preferably continually repeats.</p>
<p>In a preferred embodiment of the present invention the visible flash is produced in at least the macro block, or at least an integer part thereof, that is shown at the top left hand corner of the display screen. Preferably a 2x2 array of blocks, i.e. 32x32 pixels, is used to encode the visible flash. This location is carefully chosen since, due to the scanning method of generating a displayed image as will be appreciated by those skilled in the art, the digital data representing this part of the display screen will occur very early in the relevant data stream and will consequently practically always be correctly encoded. The selection of the visible flash as a 32x32 pixel area will also tend to ensure the correct encoding of this video data. Similarly, the use of only black and white shades for the visible flash will maximise the likelihood of the video data being correctly encoded since these are "basic" digital values. In a similar fashion, the audio tone is provided as a tone whose only frequency allowance is at 10KHz, or some other single frequency. Since only a single frequency component is utilised for the audio tone it should be faithfully encoded by any audio encoder included within the encoding system under test.</p>
<p>Further visual data may be provided to the user, for example a larger visual representation of the visible flash, for example as a series of rotating circular segments, each segment being representative of a single time unit such that a complete sequence requires a full "revolution" through the multiple segments. It will of course be appreciated that such visual enhancements are merely for the convenience of the human operator and are not a necessary part of embodiments to the present invention.</p>
<p>The predetermined audio visual sequence is passed through the encoding system under test and the encoded digital data stream subsequently analysed. The analysis process comprises detecting one or both of the beginning and end of one of the visible flashes by detecting the point in time within the encoded data stream at which the 32x32 pixel macro block integer changes from "black" to "white" or vice versa. The time at which this occurs is accurate to within the duration of 1 frame of audio visual data, since the display is only refreshed every frame. A typical frame rate is 25 frames per second, concurrently the encoded audio signal is analysed to determine one or both of the beginning and end of the audio tones. A preferred method of detecting the beginning or end of the audio tone is to detect the sharply rising or falling amplitude of the tone as each transition from "tone" to "no tone" or vice versa occurs. The analysis process can thus determine any time delay between the video and audio "events" (an event being rising or falling audio or video signal edge). In preferred embodiments any determined delay that falls outside a predetermined set of parameters, such as those set by one or more transmission standards, causes an alert to be automatically generated.</p>
<p>A system in accordance with an embodiment of the present invention for determining any loss of video/audio synchronisation in an encoded data stream is schematically represented in Figure 2. A predetermined video stream 10 as described above in an un-encoded state is stored on a data storage medium such as a hard disk 12 and is provided as an input to the encoding system 14 to be tested. The encoding system will generally output an encoded data stream that can be decomposed to separate video 16 and audio 18 streams. Each of the video and audio streams are provided as inputs to an analysis engine 20 and input to separate video and audio event detection units 22, 24. It will be appreciated that although the video and audio streams are shown in Figure 2 as discrete inputs to the analysis engine, the decomposition of the encoded data stream provided by the encoding system 14 may equally be accomplished within the analysis engine, for example by means of a wrapper demux. Each event detection unit is arranged to detect the relevant video or audio events' of the encoded test data stream, these being the beginning or ends, or both, of the visible flashes' and audio tones as discussed above in relation to Figure 1, and to provide an output signal indicative of when each event occurs. The output signals from each of the audio and video event detection units 22, 24 is provided to a time comparison unit 26 that is arranged to measure any time interval present between the output signals from the event detection units and thus any time interval, be it lag or lead, between the occurrences of the audio and video events'. This time interval data is provided from the time comparison unit 26 to an output interface unit 28 that is arranged to provide the time interval data to an appropriate user interface. In preferred embodiments the output interface unit 28 is also arranged to compare any time delay between the audio and video signals with defined maximum permitted delays that may be stored in a further data storage area 30 or may be stored internally to the output interface unit. If a detected time interval exceeds a predefined value then the output interface unit may be arranged to provide an alarm signal.</p>
<p>As previously mentioned with reference to Figure 1, the sequence of visible flashes and audible tones comprises events' of increasing duration and with increasing time intervals between each event'. This ensures that should the time delay between the video and audio signals be great enough for one of the video events to coincide with an audio event this false' synchronisation, which would not cause the analysis engine to generate a report or alarm, will not be maintained at the next occurrence of a video and audio event. This is schematically illustrated in Figure 3, in which the upper trace 32 represents the video event signal and the lower trace 34 represents the audio event signal. As can be seen from Figure 3, the audio event signal has been delayed relative to the video signal by the encoding process by a time period of 3 time units, say seconds, as represented by arrow A. Consequently, the beginning of the second video event 36 occurs at the same time as the beginning of the first audio event 38. If these are the first video and audio events detected by the analysis engine then a false report of synchronisation between the audio and video streams may be provided. However, at the beginning of the next video event 40 it can be seen that the audio stream is out of synchronisation, since the events are not evenly spaced apart and do not have a constant duration. Consequently the analysis engine is able to determine that in fact the video and audio streams are not in synchronisation. If the analysis engine detects both the beginning and end of the video and audio events then the loss of synchronisation will be detected sooner since the end of the first audio event 38 will occur before the end of the second video event 36, even though the beginning of both events coincided. In this instance the loss of synchronisation between the audio and video streams is detected by the analysis engine within one time unit, for example one second.</p>
<p>The method and apparatus of embodiments of the present invention is thus able to detect and measure any time delay between encoded digital video and audio data streams quickly and reliably.</p>

Claims (1)

  1. <p>CLAIMS</p>
    <p>I. A method of determining the delay between an audio and visual signal, the method comprising: providing a video signal having a predetermined sequence of visual events, each visual event occurring at a predetermined point in the sequence; providing an audio signal having a predetermined sequence of audio events, each audio event occurring at a predetermined point in the sequence, wherein the audio and video events are in synchronicity with one another; digitally encoding the video and audio signals to generate encoded video and audio data streams; detecting the occurrence of each event in the encoded video and audio data streams and their relative time of occurrence; and from the detected time of occurrence of the audio and visual events measuring any time delay between the encoded video and audio data streams.</p>
    <p>2. The method of claim 1, wherein an alert is generated if the measured time delay between the encoded audio and video data streams exceeds a predetermined value.</p>
    <p>3. The method of claim I or 2, wherein each event within both sequences of audio and visual events has a different duration.</p>
    <p>4. The method of claim 3, wherein the duration of the events sequentially increases through the sequence.</p>
    <p>5. The method of any preceding claim, wherein each event within both sequences of audio and visual events is spaced apart by a time period, each respective time period having a different duration.</p>
    <p>6. The method of claim 5, wherein the time periods between events may sequentially increase throughout the sequence of events.</p>
    <p>7. The method of any preceding claim, wherein each video event comprises illuminating a predetermined display sector.</p>
    <p>8. The method of any preceding claim, wherein each audio event comprises generating an audible tone.</p>
    <p>9. Apparatus for determining the delay between encoded audio and video signals, wherein the video signal has a predetermined sequence of visual events, each visual event occurring at a predetermined point in the sequence, and the audio signal has a predetermined sequence of audio events, each audio event occurring at a predetermined point in the sequence, the audio and video events being in synchronicity with one another, the apparatus comprising: a video event detector arranged to detect each video event and provide a first time signal corresponding to each video event; an audio event detector arranged to detect each audio event and provide a second time signal corresponding to each audio event; and a time comparator arranged to receive the first and second time signals and to measure any time delay between the signals.</p>
    <p>10. The apparatus of claim 9 further comprising an alert generator arranged to receive the measured time delay from the time comparator and generate an alert if the time delay exceeds a predetermined value.</p>
GB0607214A 2006-04-10 2006-04-10 Method and apparatus for measuring audio/video sync delay Expired - Fee Related GB2437122B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
GB0607214A GB2437122B (en) 2006-04-10 2006-04-10 Method and apparatus for measuring audio/video sync delay

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
GB0607214A GB2437122B (en) 2006-04-10 2006-04-10 Method and apparatus for measuring audio/video sync delay

Publications (3)

Publication Number Publication Date
GB0607214D0 GB0607214D0 (en) 2006-05-17
GB2437122A true GB2437122A (en) 2007-10-17
GB2437122B GB2437122B (en) 2011-01-05

Family

ID=36539691

Family Applications (1)

Application Number Title Priority Date Filing Date
GB0607214A Expired - Fee Related GB2437122B (en) 2006-04-10 2006-04-10 Method and apparatus for measuring audio/video sync delay

Country Status (1)

Country Link
GB (1) GB2437122B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014067364A1 (en) * 2012-10-30 2014-05-08 中兴通讯股份有限公司 Lip synchronization test system and method
EP4203470A1 (en) 2021-12-21 2023-06-28 Rohde & Schwarz GmbH & Co. KG System and method for evaluation of audio-video desynchronization

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2341758A (en) * 1998-07-24 2000-03-22 John Michael Wilson Synchronization of video and audio signals
WO2005004470A1 (en) * 2003-07-01 2005-01-13 Lg Electronics Inc. Method and apparatus for testing lip-sync of digital television receiver
US20050219366A1 (en) * 2004-03-31 2005-10-06 Hollowbush Richard R Digital audio-video differential delay and channel analyzer
US7020894B1 (en) * 1998-07-24 2006-03-28 Leeds Technologies Limited Video and audio synchronization

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001298757A (en) * 2000-04-11 2001-10-26 Nippon Hoso Kyokai <Nhk> Video and audio delay time difference measuring device
JP2003259314A (en) * 2002-02-26 2003-09-12 Nippon Hoso Kyokai <Nhk> Video audio synchronization method and system thereof
JP4517727B2 (en) * 2004-05-27 2010-08-04 ヤマハ株式会社 Audio / Video Amplifier

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2341758A (en) * 1998-07-24 2000-03-22 John Michael Wilson Synchronization of video and audio signals
US7020894B1 (en) * 1998-07-24 2006-03-28 Leeds Technologies Limited Video and audio synchronization
WO2005004470A1 (en) * 2003-07-01 2005-01-13 Lg Electronics Inc. Method and apparatus for testing lip-sync of digital television receiver
US20050219366A1 (en) * 2004-03-31 2005-10-06 Hollowbush Richard R Digital audio-video differential delay and channel analyzer

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014067364A1 (en) * 2012-10-30 2014-05-08 中兴通讯股份有限公司 Lip synchronization test system and method
EP4203470A1 (en) 2021-12-21 2023-06-28 Rohde & Schwarz GmbH & Co. KG System and method for evaluation of audio-video desynchronization

Also Published As

Publication number Publication date
GB2437122B (en) 2011-01-05
GB0607214D0 (en) 2006-05-17

Similar Documents

Publication Publication Date Title
JP5025722B2 (en) Audio / video synchronization delay measuring method and apparatus
KR100499037B1 (en) Method and apparatus of dtv lip-sync test
KR102260946B1 (en) Techniques for detecting media playback errors
CN101796812B (en) Lip synchronization system and method
US20050219366A1 (en) Digital audio-video differential delay and channel analyzer
US20080037954A1 (en) Automatic Video Glitch Detection and Audio-Video Synchronization Assessment
EP3171593B1 (en) Testing system and method
CN101296344B (en) Scene change detection device, coding device, and scene change detection method
US8433143B1 (en) Automated detection of video artifacts in an information signal
EP1246476A2 (en) Frozen field detection of formerly encoded video signal
US20070296870A1 (en) Pulldown correction for progressive display of audiovisual recordings
GB2437122A (en) Method and apparatus for measuring audio/video sync delay
JP3944329B2 (en) Apparatus and method for controlling and detecting alignment of composite video color frame
EP2725578A1 (en) Loudness log for recovery of gated loudness measurements and associated analyzer
CN113055711B (en) Audio and video synchronous detection method and detection system thereof
KR101646847B1 (en) System and method for detecting abnormality of display device
JP2007312191A (en) Video quality evaluation apparatus, video quality monitoring apparatus and video quality monitoring system
CN107968942B (en) Method and system for measuring audio and video time difference of live broadcast platform
US10097819B2 (en) Testing system, testing method, computer program product, and non-transitory computer readable data carrier
KR101721224B1 (en) Apparatus and method for detecting real-time video and audio distortion
TWI625059B (en) Terminal device and method for audio video synchronization
JP4329619B2 (en) Inspection video output device, digital video reproduction device inspection device, and digital video reproduction device inspection system
JP2007312192A (en) Lip-sync evaluation apparatus and lip-sync adjustment apparatus
Kunić et al. Analysis of audio and video synchronization in TV digital broadcast devices
JP2010157914A (en) Broadcast result acquisition system, information embedding device, information detecting device and broadcast result acquisition method

Legal Events

Date Code Title Description
COOA Change in applicant's name or ownership of the application

Owner name: TEKTRONIX INTERNATIONAL SALES GMBH

Free format text: FORMER APPLICANT(S): TEKTRONIX BRISTOL LIMITED

PCNP Patent ceased through non-payment of renewal fee

Effective date: 20190410

732E Amendments to the register in respect of changes of name or changes affecting rights (sect. 32/1977)

Free format text: REGISTERED BETWEEN 20210812 AND 20210818