EP0914638A2 - Procede et dispositif pour naviguer a travers des images video en affichant une pluralite d'images clef en parallele - Google Patents

Procede et dispositif pour naviguer a travers des images video en affichant une pluralite d'images clef en parallele

Info

Publication number
EP0914638A2
EP0914638A2 EP98900329A EP98900329A EP0914638A2 EP 0914638 A2 EP0914638 A2 EP 0914638A2 EP 98900329 A EP98900329 A EP 98900329A EP 98900329 A EP98900329 A EP 98900329A EP 0914638 A2 EP0914638 A2 EP 0914638A2
Authority
EP
European Patent Office
Prior art keywords
keyframes
keyframe
video
whilst
displaying
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP98900329A
Other languages
German (de)
English (en)
Inventor
Marco Paolo Ceccarelli
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Koninklijke Philips NV
Original Assignee
Koninklijke Philips Electronics NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics NV filed Critical Koninklijke Philips Electronics NV
Priority to EP98900329A priority Critical patent/EP0914638A2/fr
Priority claimed from PCT/IB1998/000091 external-priority patent/WO1998034182A2/fr
Publication of EP0914638A2 publication Critical patent/EP0914638A2/fr
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/34Indicating arrangements 
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/102Programmed access in sequence to addressed parts of tracks of operating record carriers
    • G11B27/107Programmed access in sequence to addressed parts of tracks of operating record carriers of operating tapes
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/11Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information not detectable on the record carrier
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/19Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier
    • G11B27/28Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/431Generation of visual interfaces for content selection or interaction; Content or additional data rendering
    • H04N21/4312Generation of visual interfaces for content selection or interaction; Content or additional data rendering involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/431Generation of visual interfaces for content selection or interaction; Content or additional data rendering
    • H04N21/4312Generation of visual interfaces for content selection or interaction; Content or additional data rendering involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations
    • H04N21/4316Generation of visual interfaces for content selection or interaction; Content or additional data rendering involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations for displaying supplemental content in a region of the screen, e.g. an advertisement in a separate window
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/47217End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for controlling playback functions for recorded or on-demand content, e.g. using progress bars, mode or play-point indicators or bookmarks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/482End-user interface for program selection
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/488Data services, e.g. news ticker
    • H04N21/4884Data services, e.g. news ticker for displaying subtitles
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8455Structuring of content, e.g. decomposing content into time segments involving pointers to the content, e.g. pointers to the I-frames of the video stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/44Receiver circuitry for the reception of television signals according to analogue transmission standards
    • H04N5/445Receiver circuitry for the reception of television signals according to analogue transmission standards for displaying additional information
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/44Receiver circuitry for the reception of television signals according to analogue transmission standards
    • H04N5/50Tuning indicators; Automatic tuning control
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/76Television signal recording
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B2220/00Record carriers by type
    • G11B2220/20Disc-shaped record carriers
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B2220/00Record carriers by type
    • G11B2220/40Combinations of multiple record carriers
    • G11B2220/45Hierarchical combination of record carriers, e.g. HDD for fast access, optical discs for long term storage or tapes for backup
    • G11B2220/455Hierarchical combination of record carriers, e.g. HDD for fast access, optical discs for long term storage or tapes for backup said record carriers being in one device and being used as primary and secondary/backup media, e.g. HDD-DVD combo device, or as source and target media, e.g. PC and portable player
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B2220/00Record carriers by type
    • G11B2220/90Tape-like record carriers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/426Internal components of the client ; Characteristics thereof
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/433Content storage operation, e.g. storage operation in response to a pause request, caching operations
    • H04N21/4331Caching operations, e.g. of an advertisement for later insertion during playback
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • H04N21/4394Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/44Receiver circuitry for the reception of television signals according to analogue transmission standards
    • H04N5/445Receiver circuitry for the reception of television signals according to analogue transmission standards for displaying additional information
    • H04N5/45Picture in picture, e.g. displaying simultaneously another television channel in a region of the screen
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/76Television signal recording
    • H04N5/78Television signal recording using magnetic recording
    • H04N5/781Television signal recording using magnetic recording on disks or drums

Definitions

  • the invention relates to a method according to the preamble of Claim 1.
  • a continuous video stream means that video remains "on", which may include animation, a series of stills, or an interactive sequence of images.
  • the character may be various, such as film, news, or for example a shopping list.
  • State of the art is represented by the article 'Content-Based Video Indexing and Retrieval' by S.W. Smoliar and H.J. Zhang, IEEE Multimedia, Summer 1994, pages 62-72.
  • Keyframes may be derived from video material upon its reception at the user's through a derivation algorithm, or keyframes may be labelled as such by a video provider, for example, in that each video shot will start with a keyframe.
  • a third scheme is that the frames succeed each other with uniform time intervals as relating to standard video speed. The present invention recognizes that keyframes should be utilized so as to give users a dynamic overview over the presentation, combined with useful facilities for enabling them to easier access the material, for selecting or deselecting for subsequent display, or for editing.
  • a particular problem with present-day projects for digital and compressed coding of video images is that storage thereof on mass media generally does not allow immediate access thereto, in particular in that the linear storage density such as expressable in frames per storage size is non-uniform. It has been proposed to supplement a high- capacity main storage medium such as tape with a secondary storage medium with smaller capacity and enhanced accessibility. In that case, the execution of trick modes, such as fast forward and fast reverse, as well as editing of the video material for subsequent presentation in an abstracted, modified, or rearranged form give rise to appreciable difficulties, both as seen from the aspect of the user interface, as well as perceived from the aspect of storage technology.
  • the invention is characterized according to the remainder of Claim 1.
  • Presenting the frames in a temporally ordered manner allows to effect fast forward and fast reverse in a simple manner, for example if the frames succeed each other with uniform time intervals as relating to standard video speed.
  • the easy change of hierarchical level with variable granularity in time between the frames allows easy accessing and editing. The same is true if the keyframes or at least a fraction thereof derive from filmshot commencements, or from other relevant events generated by the original film editor. In this manner, a clustering operation may be effected automatically.
  • the method may include highlighting a presently selected keyframe by enlarging it at a multiple-sized format with respect to other keyframes, whilst furthermore including detecting deleterious video interlacing effects and if so, reducing such effect by vertical decimation and/or including applying an upsampling filter to the image before display.
  • video distortions in relatively smallish keyframes have been experienced as tolerable, if a particular keyframe is enlarged, extra measures should be taken for picture improvement.
  • the inventor has recognized that this upgrading, although not always attaining the quality level present under standard rendering conditions, gives a pleasant and instructive improvement of picture quality.
  • the invention also relates to a device arranged for implementing the steps of the method as recited. Further advantageous aspects of the invention are recited in dependent Claims.
  • Figure 1 a block diagram of a TV-Recorder combination
  • Figure 2 an exemplary structure of a video recording
  • Figure 3 a design of a scrolling mosaic user interface
  • Figure 4 a design of a scrolling list user interface
  • Figure 5 a more extensive graphical user interface
  • Figure 7 a state diagram of the system operation.
  • the keyframes must be presented in such a manner that they are distinguishable from each other by a user person located at a typical TV viewing distance. • The number of keyframes presented simultaneously should be sufficient for providing a user person with an overview of a significant part of the contents of the digital video material.
  • the keyframes should be displayed in an undistorted fashion such as by retaining aspect ratio.
  • the remote control device of the TV set itself operates as user control device.
  • the user interface should reflect the familiar linear model of a video presentation.
  • FIG. 1 is a diagram showing a TV-Recorder combination according to the invention.
  • Item 20 represents the TV-set display and associated immediate control and powering.
  • Item 22 represents an antenna, or a connection with another type of signal distribution entity, such as cable distribution. This item includes, if appropriate, also the derivation of the digital video information or the digital signal part from the received signal.
  • Item 34 represents the routing of the video streams and associated information between the various subsystems of Figure 1. The routing is governed by control box 28 through control signals on line 35. The latter has been drawn as a single bidirectional interconnection but may in fact be built from any number of unidirectional or bidirectional lines.
  • the control box receives detection signals from display 20 on line 30 and from further subsystems 38, 40, whilst also controlling the latter two.
  • Block 38 is a linear tape recorder with a very high storage capacity in the multi-gigabyte region.
  • Block 40 is a magnetic disc recorder with a high storage capacity, but which is nevertheless only a fraction of that of recorder 38; on the other hand, access in recorder 40 is much faster through cross-track jumping.
  • blocks 38 and 40 form a two-level storage organization that is somewhat akin to a computer memory cache system, and stores all items of a video presentation at least once.
  • Item 24 represents a remote control device that by way of wireless 26 communicates with display device 20, and indirectly with subsystem 28 and further subsystems 38 and 40.
  • Figure 2 shows an exemplary structure of a video presentation.
  • bar 60 contains the video itself, either in the form of frames, or as a string of compressed video matter, such as MPEG-coded.
  • the information is stored along the bar as video time progresses, although actual storage requirements need not be uniform over replay time.
  • Interspersed keyframes have been indicated by dark vertical stripes such as 68.
  • a keyframe is used as representing, or as being typical of the overall video in the interval up to the next keyframe.
  • the keyframes may be singled out by a video provider as the first frame of each new shot through adding a label or inclusion in a "table of contents" (TOC).
  • TOC "table of contents”
  • the receiver through some algorithm, detects that the video content changes abruptly from one frame to the next.
  • the present invention takes the associated algorithms for granted. As shown, their distribution may be non-uniform. A further mechanism is that successive keyframes succeed each other at prescribed intervals, such as every 2-3 seconds. In the embodiment, at indication 62 only the keyframes are represented. Furthermore, the keyframes are organized in some hierarchy, in that indication 64 has only a limited set of highly relevant keyframes. This hierarchization may be multilevel, in that indication 66 is associated with only a single keyframe for all of the video presentation 60. The various levels of keyframes may be determined in different ones of the organizations recited supra, and may even exist side by side.
  • the storage mapping on Figure 1 may be effected in that the main body of the video presentation is stored in tape recorder 38, whereas at least the keyframes are reproduced in disc recorder 40, possibly together with short video and/or audio intervals immediately following the associated keyframe.
  • the length of such interval may correspond to the time latency of linear tape recorder 38, so that thereby real-time access may be attained.
  • the video presentation may be essentially linear, such as a film.
  • Alternative usage is that certain storage intervals may contain animation, stills, or other images to be used by a consumer present.
  • a possible influencing of a keyframe is to suppress it. This effectively joins the time interval before the keyframe in question with the time interval behind it.
  • a reset feature may again disjoin the interval.
  • keyframes may be suppressed, such as the class that is separated from each other by a fixed time interval.
  • Various different classes of keyframes could be available for one presentation, such as those introduced by the provider versus those that are generated by a local algorithm at reception.
  • Figure 3 is shows a design of a scrolling mosaic user interface. Every screen presents 20 keyframes starting from top left to right down: each keyframe has its number in the overall ranking of keyframes shown. Actually, keyframe 144 is highlighted by a rectangular control cursor. A user person may activate a remote control to move the cursor freely over the keyframes displayed, as well as over the buttons displayed at the top and bottom bars through the navigational controls on the cursor device. If the user moves the control cursor to the left in the top left corner, the display jumps back by 20 keyframes. Moving to the right in the lower right hand corner will cause a forward jump over 20 frames. Accessing the top bar of the screen will control accessing other parts of the presentation, in that the latter is divided into five equally long parts: a black horizontal bar indicates the total time covered by the twenty keyframes displayed here, of the overall presentation.
  • the latter During display, upon passing the instant in time pertaining to a particular keyframe, the latter becomes active and effectively displays the video stream, until arriving at the instant associated with the next keyframe. Thereupon, the latter becomes the active frame.
  • the above feature allows a user to straightforwardly program a video recorder for an interval display sequence such as by leaving out certain segments, such as advertising, or rather, to draw attention to certain details by means of slow-forward.
  • audio may be active or suppressed through a control button not shown. Alternatively, control may let audio go one, whereas the video cursor is discrete, in that it steps only from interval to interval though appropriate highlighting.
  • Figure 4 is an exemplary design of a scrolling list user interface.
  • the overall screen has five keyframes displayed, keyframe 145 being highlighted by a rectangular control cursor that runs along its edges.
  • Keyframe 145 is also displayed at a larger magnification in the background.
  • the control interface is the same as in Figure 3, although the button positions are different.
  • the enlarged keyframe is suppressed in the multi-keyframe bar.
  • Figure 5 shows a more extensive graphical user interface.
  • left and right are columns of control buttons for play, stop, select, cut, paste, fast reverse, zoom+ , zoom-, fast forward.
  • the bottom row has a sequence of nine keyframes that pertain to respective different scenes or shots, in they have hardly any correlations therebetween.
  • the inter-keyframe distance could be, for example, ten seconds, but greater and smaller spacings could be feasible. Especially with short distances between successive keyframes in time, features such as fast forward can be well realized. On the other hand, the same size of spacing could be used for full playback of all audio, whilst the video would only jump from one keyframe to the next.
  • the central keyframe is also represented in an enlarged manner. When playing closely spaced keyframes that have low enough granularity, the enlarged keyframe may be presented in a dynamic manner, for so effecting fast forward (or backward) mode.
  • FIG. 6 shows the presentation of subtitles, in the general format as discussed with reference to Figure 5.
  • space 50 has been devoted to the actual frame; space 52 has been devoted to displaying subtitles derived from, or associated to the video presentation, or to other relevant information, such as speech-to-text converted for the deaf, or a translation into another language than the one used for actual speech.
  • each keyframe has a time code 54 or other relevant data overlayed thereon.
  • the two columns of control buttons 56, 58 have been devoted to application operations at left, and intra-program operators at right.
  • the top of the screen has the title 60 of the actual video program displayed.
  • the organization described herein will allow browsing through information that is different and separate from the overall video string. Even if only the audio is played in a dynamic manner, whilst jumping from any keyframe to the next, the user gets a better impression of the underlying video, at particularly low storage requirements.
  • Figure 7 is a state diagram of the system operation.
  • the system awaits input from the user, while displaying the multiple keyframes. Such input may imply jumping among the displayed keyframes, jumping to another set of keyframes, selecting a keyframe for displaying the associated interval. Any such input effects arrow 104 and starts a new time interval. Absent any such input during n seconds (such as 20 seconds), effects arrow 108, so that state 102 is reached. Therein, the system runs the dynamic video cursor frame. As long as no user input is received, arrow 110 is effected, and the system continues as long as displayable video material is available. If user input is received however, arrow 106 is effected, and the system freezes, either at the actual content of the dynamic video cursor frame, or at the beginning of the actual interval.
  • Some of the keyframes used to browse the content of the video program may have been extracted from a sequence with high motion. This produces an annoying zigzag effect in case the video sequence was encoded with interlaced coding mode as normally is the case, a frame being made up of two fields which contribute to form the complete frame, where even lines belong to one field, odd lines to the other. This problem is more evident and annoying in small keyframes, where the effect is more visible, when the picture o is magnified and the lines become thick blocks.
  • the keyframes affected by such an interlacing effect must be detected. This effect can be observed on the rows of the image and luminance variations cause high frequency values. This can be exploited by splitting the spatial frequency spectrum into many sub-bands and then considering only the high frequency components. Actually, the effect we want to detect must present alternating values of luminance between even and odd lines, therefore the highest sampling frequency of the resulting picture.
  • the only coefficient that has to be computed is the highest frequency component of a frequency transformation (FFT or, better, DCT) on the columns. When the picture is affected by the zig-zag effect due to interlacing, this component will have a high value.
  • FFT frequency transformation
  • the detection and correction are effected as follows.
  • the first step is to discard one field by removing half of the rows, either even or odd; then, an upsampling by a factor of 2 is performed along the rows in order to recover the original size of the keyframe, followed by an interpolating filter.
  • the filter performs a simple linear interpolation.
  • the keyframe has to be enlarged at almost full screen size by means of an upsampling followed by an interpolating filter.
  • the keyframe has a low resolution, it has to be enlarged by quite a high factor. This means that if it is further processed, the result would not be good-looking, as pixels become large blocks. Therefore the picture must be filtered, but a trade-off must be found since we need to generate a good quality picture to be shown at high resolution, but also fast processing for the application to have a short response time.
  • subtitles are often transmitted along with the program (often in the Vertical Blanking Interval for analog systems or in a separate elementary stream in digital transmissions). This is normally used for programs distributed in foreign languages and not synchronized, or is meant for persons with hearing disability. Such information is normally superimposed on the screen, but could also be recorded on a storage medium. In this way, the speech of the program, and sometimes also some description of the sound for deaf people, is available for search operations.
  • Possible extension of such a system include: • extracting the text from the screen, such as by OCR techniques on still pictures, if the text is not available separately from the video • using speech recognition technology to extract the dialogues from the program.
  • the system will be always independent from the service offered by the broadcaster, so that even in case no subtitle is provided, text retrieval will always be possible at least on some specific keywords that the system can be trained to recognize.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • Databases & Information Systems (AREA)
  • Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Television Signal Processing For Recording (AREA)
  • Studio Circuits (AREA)

Abstract

L'invention concerne un procédé pour naviguer à travers des images vidéo en affichant une pluralité d'images clef en parallèle, tout en autorisant l'accès sélectif aux images clef affichées, pour ensuite contrôler l'accès réel auxdites images vidéo représentant un mappage des images clef ainsi accessibles. Ce procédé est caractérisé en ce qu'il permet, dans le cadre d'une seule organisation d'interface utilisateur, de choisir entre un premier mode de fonctionnement destiné à agencer sur l'écran des images clef de manière ordonnée dans le temps, et un deuxième mode de fonctionnement destiné à agencer les images clef, présentant de multiples granularités sélectionnables, entre des images clef contiguës affichées.
EP98900329A 1997-02-03 1998-01-22 Procede et dispositif pour naviguer a travers des images video en affichant une pluralite d'images clef en parallele Withdrawn EP0914638A2 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP98900329A EP0914638A2 (fr) 1997-02-03 1998-01-22 Procede et dispositif pour naviguer a travers des images video en affichant une pluralite d'images clef en parallele

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
EP97200286 1997-02-03
EP97200286 1997-02-03
EP97201802 1997-06-16
EP97201802 1997-06-16
EP98900329A EP0914638A2 (fr) 1997-02-03 1998-01-22 Procede et dispositif pour naviguer a travers des images video en affichant une pluralite d'images clef en parallele
PCT/IB1998/000091 WO1998034182A2 (fr) 1997-02-03 1998-01-22 Procede et dispositif pour naviguer a travers des images video en affichant une pluralite d'images clef en parallele

Publications (1)

Publication Number Publication Date
EP0914638A2 true EP0914638A2 (fr) 1999-05-12

Family

ID=27238396

Family Applications (1)

Application Number Title Priority Date Filing Date
EP98900329A Withdrawn EP0914638A2 (fr) 1997-02-03 1998-01-22 Procede et dispositif pour naviguer a travers des images video en affichant une pluralite d'images clef en parallele

Country Status (3)

Country Link
EP (1) EP0914638A2 (fr)
JP (1) JP2000516006A (fr)
KR (1) KR100552248B1 (fr)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2477800A (en) * 2010-02-16 2011-08-17 Nds Ltd Video trick mode playback
CN102016997B (zh) * 2008-05-08 2012-07-04 索尼爱立信移动通讯有限公司 将与广告内容有关的可寻址章节符号插入视频流中的电子器件和方法
CN116150428A (zh) * 2021-11-16 2023-05-23 腾讯科技(深圳)有限公司 视频标签获取方法、装置、电子设备及存储介质

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7954056B2 (en) 1997-12-22 2011-05-31 Ricoh Company, Ltd. Television-based visualization and navigation interface
US7596755B2 (en) 1997-12-22 2009-09-29 Ricoh Company, Ltd. Multimedia visualization and integration environment
US7861169B2 (en) 2001-11-19 2010-12-28 Ricoh Co. Ltd. Multimedia print driver dialog interfaces
US8635531B2 (en) 2002-02-21 2014-01-21 Ricoh Company, Ltd. Techniques for displaying information stored in multiple multimedia documents
US7747655B2 (en) 2001-11-19 2010-06-29 Ricoh Co. Ltd. Printable representations for time-based media
KR101537592B1 (ko) * 2008-09-03 2015-07-22 엘지전자 주식회사 이동단말기 및 그 제어 방법

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO9834182A2 *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102016997B (zh) * 2008-05-08 2012-07-04 索尼爱立信移动通讯有限公司 将与广告内容有关的可寻址章节符号插入视频流中的电子器件和方法
GB2477800A (en) * 2010-02-16 2011-08-17 Nds Ltd Video trick mode playback
WO2011101762A1 (fr) 2010-02-16 2011-08-25 Nds Limited Mécanisme de mode de trucage vidéo
US8958687B2 (en) 2010-02-16 2015-02-17 Cisco Technology Inc. Video trick mode mechanism
CN116150428A (zh) * 2021-11-16 2023-05-23 腾讯科技(深圳)有限公司 视频标签获取方法、装置、电子设备及存储介质
CN116150428B (zh) * 2021-11-16 2024-06-07 腾讯科技(深圳)有限公司 视频标签获取方法、装置、电子设备及存储介质

Also Published As

Publication number Publication date
JP2000516006A (ja) 2000-11-28
KR100552248B1 (ko) 2006-06-15
KR20000064909A (ko) 2000-11-06

Similar Documents

Publication Publication Date Title
US6222532B1 (en) Method and device for navigating through video matter by means of displaying a plurality of key-frames in parallel
WO1998034182A2 (fr) Procede et dispositif pour naviguer a travers des images video en affichant une pluralite d'images clef en parallele
US6340971B1 (en) Method and device for keyframe-based video displaying using a video cursor frame in a multikeyframe screen
JP3667262B2 (ja) ビデオスキミング方法及び装置
EP2127368B1 (fr) Présentation simultanée de segments vidéo permettant une compréhension de fichier vidéo rapide
KR100411437B1 (ko) 지능형 뉴스 비디오 브라우징 시스템
Yeung et al. Video visualization for compact presentation and fast browsing of pictorial content
Taniguchi et al. An intuitive and efficient access interface to real-time incoming video based on automatic indexing
US6154771A (en) Real-time receipt, decompression and play of compressed streaming video/hypervideo; with thumbnail display of past scenes and with replay, hyperlinking and/or recording permissively intiated retrospectively
Bolle et al. Video query: Research directions
KR100411340B1 (ko) 뉴스 비디오 콘텐트의 뉴스 기사 기반 요약 및 브라우징시스템
US9185379B2 (en) Medium and method for interactive seamless branching and/or telescopic advertising
KR100411342B1 (ko) 비디오 텍스트 합성 키프레임 생성방법
US20030126603A1 (en) Multimedia data searching and browsing system
WO2006126391A1 (fr) Dispositif de traitement de contenus, méthode de traitement de contenus et programme informatique
WO2001027876A1 (fr) Schema de description de resume video et procede et systeme de generation de donnees de description de resume video pour vue d'ensemble et exploration efficaces
US20040181545A1 (en) Generating and rendering annotated video files
KR101440168B1 (ko) 개요 및 리포트를 이미 포함하는 시청각 도큐먼트의 새로운 개요를 생성하기 위한 방법 및 상기 방법을 구현할 수 있는 수신기
Zhang Content-based video browsing and retrieval
KR100552248B1 (ko) 복수의키-프레임들병렬디스플레이에의해비디오재료를통해네비게이팅하는방법및장치
WO1998034181A2 (fr) Procede et dispositif pour affichage video a base d'images cles au moyen d'une image de curseur video dans un ecran a plusieurs images cles
JPH11220689A (ja) 映像ソフト処理装置及び同処理プログラム記録記憶媒体
KR20020023063A (ko) 비디오 콘텐트의 구조적 정보를 이용한 비디오 스키밍방법과 장치
KR100370249B1 (ko) 샷 세그멘테이션 정보를 이용한 비디오 스키밍 시스템
WO2009024966A2 (fr) Procédé pour adapter des médias à une visualisation sur petits écrans d'affichage

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT DE FR GB

17P Request for examination filed

Effective date: 19981103

17Q First examination report despatched

Effective date: 20040519

APBN Date of receipt of notice of appeal recorded

Free format text: ORIGINAL CODE: EPIDOSNNOA2E

APBR Date of receipt of statement of grounds of appeal recorded

Free format text: ORIGINAL CODE: EPIDOSNNOA3E

APAF Appeal reference modified

Free format text: ORIGINAL CODE: EPIDOSCREFNE

APAF Appeal reference modified

Free format text: ORIGINAL CODE: EPIDOSCREFNE

APBT Appeal procedure closed

Free format text: ORIGINAL CODE: EPIDOSNNOA9E

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20120801