EP3235253A1 - Markierter audio-video-strom zur synchronisierung der komponenten davon, verfahren und ausrüstung zur analyse der artefakte und synchronisierung solch eines stroms - Google Patents

Markierter audio-video-strom zur synchronisierung der komponenten davon, verfahren und ausrüstung zur analyse der artefakte und synchronisierung solch eines stroms

Info

Publication number
EP3235253A1
EP3235253A1 EP15826139.6A EP15826139A EP3235253A1 EP 3235253 A1 EP3235253 A1 EP 3235253A1 EP 15826139 A EP15826139 A EP 15826139A EP 3235253 A1 EP3235253 A1 EP 3235253A1
Authority
EP
European Patent Office
Prior art keywords
audio
video
video stream
synchronism
marked
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
EP15826139.6A
Other languages
English (en)
French (fr)
Inventor
Franck HENNEQUIN
William Gontier
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Softathome SA
Original Assignee
Softathome SA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Softathome SA filed Critical Softathome SA
Publication of EP3235253A1 publication Critical patent/EP3235253A1/de
Ceased legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85Assembly of content; Generation of multimedia applications
    • H04N21/854Content authoring
    • H04N21/8547Content authoring involving timestamps for synchronizing content
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/4302Content synchronisation processes, e.g. decoder synchronisation
    • H04N21/4307Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen
    • H04N21/43072Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen of multiple content streams on the same device

Definitions

  • the present invention relates to a marked audio-video stream, a method of generating such a stream, a method and an installation for analyzing and synchronizing such a stream.
  • the present invention is in the field of visualization of media content, and more particularly audio-visual. State of the art
  • the invention relates to the visualization of audiovisual contents including in particular a video signal and / or an audio signal and / or a textual element broadcast superimposed, such as a subtitle for example. It can be seen that supply and demand for such audio-visual content has grown exponentially in the last decade, thus helping to transform the television (and internet) panorama as well as the way in which the public views such content.
  • Another aim of the invention is to solve at least one of these problems by a new audio-video stream which makes it possible to check the continuity and / or synchronism of the various textual, audio and video components.
  • Another object of the present invention is to provide a device that facilitates the analysis of audio-video streams and to reduce costs. Another goal is to improve the quality and the rendering of broadcast audio-video streams.
  • At least one of the above-mentioned objectives is achieved with a marked audio-video stream comprising (i) at least one video signal composed of at least one video frame, and (ii) at least one audio signal composed of at least one audio frame and / or (iii) at least one textual signal composed of at least one subtitle frame, and characterized in that the at least one video frame comprises means for identifying the video synchronism, and the at least one audio frame comprises means for identifying the audio synchronism and / or the at least one subtitle frame comprises means for identifying the text synchronism.
  • the audio-video stream marked according to the invention comprises several signals of the same type (for example several subtitles and / or several broadcast languages ...) then the corresponding identifiers make it possible to distinguish each signal of the same category;
  • the order in which the frames must be played may be contained in the synchronism identifier itself (using a numerical incrementation, for example) and / or in a database that gathers all the information concerning the audio-video stream Mark. It is therefore possible to know in what order each frame must be broadcast;
  • An audio-video stream according to the invention may comprise several instances of each audio and / or text signal.
  • an audio-video stream marked according to the invention, in the field of cinema may be composed of a video stream, of two different audio streams, corresponding for example to the dialogues in French and in English. , as well as five text streams corresponding to subtitles in different foreign languages.
  • the video stream can also be composed of several video signals.
  • the means of identification of the audio synchronism can be transported by a modulated carrier in agreement with said means of identification of the audio synchronism.
  • the audio synchronism identification means can be a counter for identifying the instant of broadcasting the corresponding audio frame, thereby determining when - and in what order - the audio frames are broadcast.
  • the means for identifying the video synchronism may comprise a unique identifier for indexing a database in order to be able to initiate comparisons with said database and to analyze the corresponding audio-video stream.
  • the identifier can be of any type in order to identify - at least - a cell in a database. For example, it may be a pair of integers respectively describing a row and column number of said database, or a QR code which makes it possible to store more information, such as by example a counter.
  • the means of identification of the video synchronism may further comprise a two-dimensional structure of predefined images in order to be able to detect the presence of possible artifacts by measuring the distortion of the diffused image compared to the predefined one and whose essential characteristics are recorded in the database. It can be a mosaic with particular images at particular positions. The characteristics concerning the images that constitute the mosaic and their positions are stored elsewhere in a database. Finally, the images may be different for each of the frames that make up the video stream.
  • a decoder of a marked audio-video stream comprising (i) means for reading said at least one audio-video stream marked according to the invention, (ii) means for for extracting at least one video synchronism identifier from the at least one video signal of said marked audio-video stream, (iii) means for extracting the at least one audio synchronism identifier from the at least one audio signal of said stream labeled audio-video, (iv) means for extracting the at least one text synchronous identifier from the at least one text signal of said marked audio-video stream, (v) a database comprising on the one hand the decoding information the video synchronism identifier of the at least one video signal of said marked audio-video stream and the audio signal and / or the textual signal of the at least one audio-video stream marked, and (vi) an analysis means for comparing the extracted synchronism identifiers with the contents of the database.
  • the analysis means make it possible, on the one hand, to verify that the at least one broadcast audio stream corresponds to the original audio signal, and on the other hand to check the possible presence of video artifacts such as the cuts or macroblocks by verifying that the images present on the identifier of the video synchronism are in conformity with those which have been used in the original signal and which are stored on the database. It is also possible to check that the text broadcast in closed captioning is in line with that of the original signal.
  • the invention also relates to a method for generating a marked audio-video stream according to the invention and which can comprise at least one iteration of the following steps:
  • a method for analyzing an audio-video stream marked using a processing unit to test (i) the continuity of the at least one video frame of the at least one video signal of said marked audio-video stream; and / or (ii) the continuity of the at least one audio frame of the at least one audio signal of said marked audio-video stream; and / or (iii) the continuity of the at least one subtitle frame of the at least one text signal of said marked audio-video stream.
  • the processing unit may comprise a microprocessor or a microcontroller and may be integrated for example in audio-video stream reading equipment, such as for example a "set-top-box", a digital television or any other type of digital decoder.
  • the analysis method according to the invention can further test the audio synchronism of the at least one audio frame with respect to the at least one video frame of said marked audio-video stream, and / or the text synchronism of the at least one subtitle frame with respect to the at least one video frame of said marked audio-video stream.
  • the analysis method according to the invention can also test the quality of the at least one video frame of said marked audio-video stream.
  • the analysis method according to the invention may comprise at least one iteration of the following steps:
  • a step of acquiring the synchronism identifiers comprising:
  • an artefact detection step comprising:
  • the present invention intends to cover the methods of synchronization and / or correction of artifacts of the component signals of the audio-video streams, regardless of the number of signals component said streams.
  • the present invention also extends to the synchronization of several audio-video marked streams, one of them being taken as reference.
  • the analysis method according to the invention may comprise beforehand at the reading step a step of generating audio-video streams marked from a standard audio-video stream and comprising the following steps:
  • FIG. 1 illustrates a principle diagram for the generation of a labeled audio-video stream according to the invention
  • FIG. 2 illustrates a particular example of video synchronism identifier
  • FIG. 3 illustrates a video stream marked according to the invention
  • FIG. 4 illustrates a principle diagram for the analysis of the different artifacts of a labeled audio-video stream according to the invention as well as the detection of synchronization errors
  • FIG. 5 illustrates an example of transmission of an audio frame counter by amplitude modulation
  • FIGURE 6 illustrates the equivalent spectrum of the generated audio signal.
  • FIG. 1 illustrates a principle diagram for the generation 100 of a labeled audio-video stream according to the invention.
  • the general principle is to divide each signal composing the original audio-video stream 101 into frames on which are introduced markers which will make it possible to check and / or correct the continuity and / or the synchronism between the signals. different signals and / or correct reading artifacts that may sometimes appear during the broadcast of said signals.
  • the method thus delivers a plurality of signals each composed of a plurality of marked frames, said signals comprising for example at least one video signal cut into frames and labeled 104, and / or at least one audio signal cut into frames and labeled 105, and or at least one textual signal cut into frames and marked 103.
  • the rasterized, text-cut text signal may represent the subtitle band associated with the video signal, or any other textual information overlaid on the image broadcast by the media.
  • the marking of the various signals that make up the audio-video stream marked includes a synchronism identifier, which may, however, contain more information than that necessary to check the continuity of the signals and synchronize the different signals.
  • the marking of the video signal cut into frames may comprise a counter to serve as a time reference for the synchronization of the other signals of the marked audio-video stream; it may also contain a unique identifier for indexing a database 110 described below.
  • the marking of the audio signal cut into frames can be carried out by amplitude modulation for example or by any other means making it possible to superimpose on the initial audio signal information making it possible to subsequently identify the synchronism of the audio signal with the video signal.
  • the code superimposed on the original audio signal and transported on each frame includes a counter or an identifier for accessing such a counter.
  • the marking of the textual signal cut into frames further comprises a counter to determine the time of appearance of said text frame.
  • a database 110 is filled with all the information that will allow the analysis and subsequent decoding of the audio-video stream marked.
  • the database may contain information about the original audio-video stream as well as the full description of the labeled audio-video stream generated by the present method, namely, and not limited to, the content and the relative position of each frame of each signal composing the original audio-video stream.
  • the database 110 contains all the information concerning the video, audio and text signals cut into frames and marked. More particularly, the database may contain a video signal counter which makes it possible to identify the synchronism of the other supported signals (audio and textual) and all the information describing all the frames of the audio-video stream marked, to know :
  • FIG. 2 illustrates a particular example of a video synchronism identifier comprising a two-dimensional structure 200 and for a given video frame. It is composed of eight predefined images 202 to 209 and different, positioned around an identification code 201 which indexes the database 110.
  • the identification code is a QR code, well known to those skilled in the art.
  • the present invention is not limited to this type of code but encompasses all the means capable of indexing a database. At a minimum, it may be a pair of integers that describe a row and column number of said database.
  • the cell thus pointed in the database 110 contains the information corresponding to the video, audio and / or textual frame currently being broadcast.
  • FIG. 3 schematically illustrates a labeled audio-video stream 106 according to the invention. In the example illustrated, the audio-video stream 106 is broken down into three distinct signals which have respectively been subsequently labeled and recombined:
  • a video signal 104 composed of a succession of images 341 and 344 at one frequency.
  • the image diffusion frequency is 25 frames per second, but this frequency can be higher in the case of high resolution filming, typically 50 to 60 images per second.
  • a video synchronism identifier 342 and 345 is associated, comprising in the illustrated example a mosaic of predefined images 343 and 346.
  • each frame is associated with a synchronism identifier.
  • the audio frame 351 is associated with the identifier 352 and the audio frame 354 is associated with the identifier 355.
  • Each identifier furthermore comprises a counter 353 and 356 which makes it possible to determine the instant of broadcast of said frame.
  • a text signal 103 for broadcasting the subtitles that correspond to the video and audio frames.
  • each frame is associated with a synchronism identifier.
  • the text frame 361 is associated with the identifier 362 and the text frame 364 is associated with the identifier 365.
  • Each identifier furthermore comprises a counter 363 and
  • FIG. 4 illustrates a principle diagram for the analysis of the different artifacts of an audio-video stream marked according to the invention as well as the detection of synchronization errors. It includes the following steps:
  • the acquisition of the synchronism identifiers 410 for each of the extracted signals namely at least one video signal 411 cut into frames and marked, at least one audio signal 412 cut into frames and marked, and / or at least one textual signal 413 cut into frames and marked.
  • the present invention is not limited to a particular type of means for the acquisition of different identifiers.
  • the acquisition of the video synchronism identifier can be performed using a camera or by screen capture; the acquisition of the audio synchronism identifier can be achieved using a microphone or an acquisition card; and the acquisition of the text synchronous identifier can be achieved using an optical character recognition software ...
  • - decoding synchronism identifiers 420 for each of the signals composing the audio-video stream labeled 106.
  • it may be for the video signal to decode the QR code; for the audio signal, it may be to demodulate the signal carrying the identifier; and for the textual signal, it may be to extract at least one alphanumeric code.
  • the extracted video identifier makes it possible to index the database 110 which contains all the information relating to the audio-video stream such that it should ideally be broadcast.
  • the information contained, for each frame, thus makes it possible to:
  • the audio-video stream marked 450 is thus corrected and / or read optimally.
  • the means for analyzing the audio-video streams marked according to the invention may consist of internal means by means of broadcasting said audio-video streams, such as the processor contained in a digital television or a digital decoder, for example, or using outsourced means such as a computer.
  • the analysis of the flow obtained after having crossed the processing chain consists first of all in identifying the date of occurrence of each change of identifier of the video signal using the QR code for example and the database and in a second step, to detect the occurrence of the counters of the other signals: for the audio signal, it is for example to detect the beginning of each new audio frame.
  • the comparison of dates for identical counter values then gives the offset value between the video and audio and / or textual signals as they are rendered by the broadcasting means and for each of the scanned frames.
  • the present invention thus makes it possible to detect synchronism faults greater than the acceptability threshold (located between -185 ms for a delay of the audio signal on the video signal, and +90 ms for an advance of the audio signal on the video signal) with an accuracy of +/- 10 ms in the case of an analysis performed at 50 images per second.
  • the audio-video stream is broadcast at 50 frames per second, and the analysis is performed every 500 ms, which means that every 25 frames of a video sequence, a synchronism identifier is inserted.
  • the counter of the video frame is extracted during the decoding of the video identifier which marks each video frame. In the example shown, counter # 5204 was not extracted.
  • the video clock corresponds to the date of change of the video identifier and represents the time reference for the other streams.
  • the counter of the audio frame is extracted by demodulation of the signal contained in each of the audio frames.
  • the audio clock corresponds to the detection date of the beginning of the audio frame.
  • Audio frame # 5203 is in phase with the video frame.
  • synchronization means can be integrated in the analysis means or constitute means in its own right. As non-limiting examples, they may consist of a digital television for example, a digital decoder or a computer.
  • FIG. 5 illustrates an example of transmission of an audio frame counter by amplitude modulation
  • the abscissa axis 501 representing the time in milliseconds
  • the ordinate axis 502 the amplitude of the signal encoding the frame counter 504
  • FIG. 6 illustrates the equivalent spectrum 600 of the audio signal generated on four carriers 603-606 and for which the amplitude 602 is represented as a function of the frequency 601.
  • the counter or the identifier making it possible to access such a counter is encoded on a modulated carrier in accordance with the latter.
  • the transmission of the counter or its identifier within the audio signal must not exceed 1 / Ft seconds, where Ft is the frequency of the video signal.
  • Ft is the frequency of the video signal.
  • the transmission of the code corresponding to the audio counter or its identifier must not exceed 20 ms.
  • each transmission start of the value of the counter or its identifier starts with a specific value indicating a "start of frame". In the example illustrated in FIGURES 5 and 6, the start of the frame is materialized by a maximum in the amplitude of the signal.
EP15826139.6A 2014-12-19 2015-12-17 Markierter audio-video-strom zur synchronisierung der komponenten davon, verfahren und ausrüstung zur analyse der artefakte und synchronisierung solch eines stroms Ceased EP3235253A1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
FR1462869A FR3030973B1 (fr) 2014-12-19 2014-12-19 Flux audio-video marque pour en synchroniser les differentes composantes, procede et installation pour l'analyse des artefacts et du synchronisme d'un tel flux
PCT/EP2015/080253 WO2016097165A1 (fr) 2014-12-19 2015-12-17 Flux audio-video marque pour en synchroniser les differentes composantes, procede et installation pour l'analyse des artefacts et du synchronisme d'un tel flux

Publications (1)

Publication Number Publication Date
EP3235253A1 true EP3235253A1 (de) 2017-10-25

Family

ID=53191771

Family Applications (1)

Application Number Title Priority Date Filing Date
EP15826139.6A Ceased EP3235253A1 (de) 2014-12-19 2015-12-17 Markierter audio-video-strom zur synchronisierung der komponenten davon, verfahren und ausrüstung zur analyse der artefakte und synchronisierung solch eines stroms

Country Status (3)

Country Link
EP (1) EP3235253A1 (de)
FR (1) FR3030973B1 (de)
WO (1) WO2016097165A1 (de)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108597497B (zh) * 2018-04-03 2020-09-08 中译语通科技股份有限公司 一种字幕语音精准同步系统及方法、信息数据处理终端

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070154171A1 (en) * 2006-01-04 2007-07-05 Elcock Albert F Navigating recorded video using closed captioning
US20130097632A1 (en) * 2009-05-27 2013-04-18 Ajay Shah Synchronization to broadcast media

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
IL119504A (en) * 1996-10-28 2000-09-28 Elop Electrooptics Ind Ltd Audio-visual content verification method and system
DE102011122659A1 (de) * 2011-12-30 2013-07-04 Moticon Gmbh Verfahren zur Synchronisierung von Multimediadaten

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070154171A1 (en) * 2006-01-04 2007-07-05 Elcock Albert F Navigating recorded video using closed captioning
US20130097632A1 (en) * 2009-05-27 2013-04-18 Ajay Shah Synchronization to broadcast media

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of WO2016097165A1 *

Also Published As

Publication number Publication date
WO2016097165A1 (fr) 2016-06-23
FR3030973A1 (fr) 2016-06-24
FR3030973B1 (fr) 2018-04-27

Similar Documents

Publication Publication Date Title
US20140201774A1 (en) Systems and methods for multi-broadcast differentiation
EP0838960A2 (de) System und Verfahren zum Nachprüfen eines audiovisuellen Inhalts
US20140282673A1 (en) Systems and methods for real-time television ad detection using an automated content recognition database
EP2982131B1 (de) Systeme und verfahren zur echtzeiterkennung von fernsehwerbung mithilfe einer datenbank für automatisierte inhaltserkennung
CN107306342B (zh) 一种监视视听内容的方法和系统
WO2017191397A1 (fr) Procédé et dispositif de synchronisation de sous-titres
CN110475156B (zh) 一种视频延迟值的计算方法及装置
CN102413357A (zh) 测试装置及其测试方法
EP2104937B1 (de) Verfahren zur erzeugung einer neuen zusammenfassung eines audiovisuellen dokuments, das bereits eine zusammenfassung und meldungen enthält, und empfänger, der das verfahren implementieren kann
US10965965B2 (en) Detecting of graphical objects to identify video demarcations
CN103731609A (zh) 一种视频播放方法及系统
EP3235253A1 (de) Markierter audio-video-strom zur synchronisierung der komponenten davon, verfahren und ausrüstung zur analyse der artefakte und synchronisierung solch eines stroms
WO2009083459A2 (fr) Procédé de synchronisation d'une action 'rich media'® avec un changement audiovisuel, dispositif et programme d'ordinateur correspondants, procédé de création d'une présentation 'rich media'® et programme d'ordinateur correspondant
JP2007158673A (ja) 危険映像検出方法、映像相違検出方法及び装置
WO2018114108A1 (fr) Procede d'enregistrement d'un programme telediffuse a venir
EP1046283A1 (de) Vorrichtung und verfahren zur manipulation von bildfrequenzen
EP0415494B1 (de) Verfahren und Vorrichtung zur Bildverarbeitung von verbesserter Bewegungseinschätzung
WO2001091462A1 (fr) Dispositif et procede de synchronisation de programmes audiovisuels diffuses et d'informations complementaires
WO2015140252A1 (fr) Procédé et dispositif d'identification d'un contenu en cours de visualisation sur un ecran
FR2942686A1 (fr) Traitement d'un flux de donnees numeriques.
LU86240A1 (fr) Procedes et dispositifs d'analyse et de reconnaissance du code de cryptage d'emissions de television
WO2021214097A1 (fr) Procédé de transposition d'un flux audiovisuel
Pitié et al. Assessment of audio/video synchronisation in streaming media
FR3042369A1 (fr) Enrichissement contextuel par reconnaissance audio
WO2021023397A1 (fr) Procede et appareil d'enrichissement de contenu multimedia par des meta-informations

Legal Events

Date Code Title Description
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE

PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20170719

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

AX Request for extension of the european patent

Extension state: BA ME

DAV Request for validation of the european patent (deleted)
DAX Request for extension of the european patent (deleted)
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: EXAMINATION IS IN PROGRESS

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: EXAMINATION IS IN PROGRESS

17Q First examination report despatched

Effective date: 20201221

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: EXAMINATION IS IN PROGRESS

REG Reference to a national code

Ref country code: DE

Ref legal event code: R003

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED

18R Application refused

Effective date: 20220925