WO1998034182A2 - A method and device for navigating through video matter by means of displaying a plurality of key-frames in parallel - Google Patents

A method and device for navigating through video matter by means of displaying a plurality of key-frames in parallel

Info

Publication number
WO1998034182A2
WO1998034182A2 (PCT/IB1998/000091)
Authority
WO
WIPO (PCT)
Prior art keywords
keyframes
keyframe
video
whilst
displaying
Prior art date
Application number
PCT/IB1998/000091
Other languages
French (fr)
Other versions
WO1998034182A3 (en)
Inventor
Marco Paolo Ceccarelli
Original Assignee
Koninklijke Philips Electronics N.V.
Philips Ab
Priority date
Filing date
Publication date
Application filed by Koninklijke Philips Electronics N.V. and Philips Ab
Priority to JP10529208A priority Critical patent/JP2000516006A/en
Priority to PCT/IB1998/000091 priority patent/WO1998034182A2/en
Priority to EP98900329A priority patent/EP0914638A2/en
Priority to KR1019980708181A priority patent/KR100552248B1/en
Publication of WO1998034182A2 publication Critical patent/WO1998034182A2/en
Publication of WO1998034182A3 publication Critical patent/WO1998034182A3/en

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/34Indicating arrangements 
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/74Browsing; Visualisation therefor
    • G06F16/745Browsing; Visualisation therefor the internal structure of a single video sequence
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/783Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/7844Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using original textual content or text extracted from visual content or transcript of audio data
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/102Programmed access in sequence to addressed parts of tracks of operating record carriers
    • G11B27/107Programmed access in sequence to addressed parts of tracks of operating record carriers of operating tapes
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/11Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information not detectable on the record carrier
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/19Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier
    • G11B27/28Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B2220/00Record carriers by type
    • G11B2220/20Disc-shaped record carriers
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B2220/00Record carriers by type
    • G11B2220/40Combinations of multiple record carriers
    • G11B2220/45Hierarchical combination of record carriers, e.g. HDD for fast access, optical discs for long term storage or tapes for backup
    • G11B2220/455Hierarchical combination of record carriers, e.g. HDD for fast access, optical discs for long term storage or tapes for backup said record carriers being in one device and being used as primary and secondary/backup media, e.g. HDD-DVD combo device, or as source and target media, e.g. PC and portable player
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B2220/00Record carriers by type
    • G11B2220/90Tape-like record carriers

Abstract

A method for navigating through video matter by means of displaying a plurality of key-frames in parallel, whilst allowing selective accessing of displayed keyframes for thereupon controlling actual access to said video matter as representing a mapping of so accessed keyframes, said method being characterized by allowing within a single user interface organization to select between a first operative mode for arranging keyframes in a temporally ordered manner on the screen and a second operative mode for arranging keyframes with multiple selectable granularities between contiguous keyframes as displayed.

Description

A method and device for navigating through video matter by means of displaying a plurality of key-frames in parallel.
BACKGROUND OF THE INVENTION
The invention relates to a method according to the preamble of Claim 1. The usage of keyframes as representative parts of a video presentation that is recorded for subsequent selective playback has been proposed elsewhere. A continuous video stream means that video remains "on", which may include animation, a series of stills, or an interactive sequence of images. The character may be various, such as film, news, or for example a shopping list. The state of the art is represented by the article 'Content-Based Video Indexing and Retrieval' by S.W. Smoliar and H.J. Zhang, IEEE Multimedia, Summer 1994, pages 62-72. Keyframes may be derived from video material upon its reception at the user's end through a derivation algorithm, or keyframes may be labelled as such by a video provider, for example in that each video shot will start with a keyframe. A third scheme is that the frames succeed each other with uniform time intervals as related to standard video speed. The present invention recognizes that keyframes should be utilized so as to give users a dynamic overview of the presentation, combined with useful facilities for enabling them to access the material more easily, for selecting or deselecting it for subsequent display, or for editing.
A particular problem with present-day projects for digital and compressed coding of video images is that storage thereof on mass media generally does not allow immediate access thereto, in particular in that the linear storage density, such as expressible in frames per storage size, is non-uniform. It has been proposed to supplement a high-capacity main storage medium such as tape with a secondary storage medium of smaller capacity and enhanced accessibility. In that case, the execution of trick modes, such as fast forward and fast reverse, as well as editing of the video material for subsequent presentation in an abstracted, modified, or rearranged form, give rise to appreciable difficulties, both from the aspect of the user interface and from the aspect of storage technology.
SUMMARY OF THE INVENTION
In consequence, amongst other things, it is an object of the present invention to introduce more flexibility into the organization, as well as to present a user with a more natural feel for the storage organization and for the video material proper, whilst obviating the need to continually access the main storage medium. Now therefore, according to one of its aspects, the invention is characterized according to the remainder of Claim 1. Presenting the frames in a temporally ordered manner allows fast forward and fast reverse to be effected in a simple manner, for example if the frames succeed each other with uniform time intervals as related to standard video speed. Furthermore, the easy change of hierarchical level with variable granularity in time between the frames allows easy accessing and editing. The same is true if the keyframes, or at least a fraction thereof, derive from filmshot commencements, or from other relevant events generated by the original film editor. In this manner, a clustering operation may be effected automatically.
Advantageously, the method may include highlighting a presently selected keyframe by enlarging it at a multiple-sized format with respect to other keyframes, whilst furthermore detecting deleterious video interlacing effects and, if detected, reducing such effects by vertical decimation and/or by applying an upsampling filter to the image before display. Whereas video distortions in relatively small keyframes have been experienced as tolerable, extra measures should be taken for picture improvement if a particular keyframe is enlarged. The inventor has recognized that this upgrading, although not always attaining the quality level present under standard rendering conditions, gives a pleasant and instructive improvement of picture quality.
The invention also relates to a device arranged for implementing the steps of the method as recited. Further advantageous aspects of the invention are recited in dependent Claims.
BRIEF DESCRIPTION OF THE DRAWING
These and other aspects and advantages of the invention will be discussed more in detail with reference to the disclosure of preferred embodiments hereinafter, and in particular with reference to the appended Figures that show:
Figure 1, a block diagram of a TV-Recorder combination;
Figure 2, an exemplary structure of a video recording;
Figure 3, a design of a scrolling mosaic user interface;
Figure 4, a design of a scrolling list user interface;
Figure 5, a more extensive graphical user interface;
Figure 6, the presentation of subtitles;
Figure 7, a state diagram of the system operation.
DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS
Various advantages, in particular, but not exclusively pertaining to an ordinary customer and private home use, imply the following:
• The keyframes must be presented in such a manner that they are distinguishable from each other by a user person located at a typical TV viewing distance. • The number of keyframes presented simultaneously should be sufficient for providing a user person with an overview of a significant part of the contents of the digital video material.
• The keyframes should be displayed in an undistorted fashion such as by retaining aspect ratio. • Preferably, the remote control device of the TV set itself operates as user control device.
• Feedback information should be perceivable from a typical viewing distance.
• Computer concepts such as "drag and drop" are generally not necessary.
• It must be feasible that the facilities be used only occasionally, rather than continually.
• The user interface should reflect the familiar linear model of a video presentation.
DISCLOSURE OF A PARTICULAR EMBODIMENT
Figure 1 is a diagram showing a TV-Recorder combination according to the invention. Item 20 represents the TV-set display and associated immediate control and powering. Item 22 represents an antenna, or a connection with another type of signal distribution entity, such as cable distribution. If appropriate, this item also includes the derivation of the digital video information or the digital signal part from the received signal. Item 34 represents the routing of the video streams and associated information between the various subsystems of Figure 1. The routing is governed by control box 28 through control signals on line 35. The latter has been drawn as a single bidirectional interconnection but may in fact be built from any number of unidirectional or bidirectional lines. The control box receives detection signals from display 20 on line 30 and from further subsystems 38, 40, whilst also controlling the latter two. Block 38 is a linear tape recorder with a very high storage capacity in the multi-gigabyte region. Block 40 is a magnetic disc recorder with a high storage capacity, which is nevertheless only a fraction of that of recorder 38; on the other hand, access in recorder 40 is much faster through cross-track jumping. Together, blocks 38 and 40 form a two-level storage organization that is somewhat akin to a computer memory cache system, and stores all items of a video presentation at least once. Item 24 represents a remote control device that by way of wireless link 26 communicates with display device 20, and indirectly with subsystem 28 and further subsystems 38 and 40.
Figure 2 shows an exemplary structure of a video presentation. For effecting the video matter functionality, bar 60 contains the video itself, either in the form of frames, or as a string of compressed video matter, such as MPEG-coded. The information is stored along the bar as video time progresses, although actual storage requirements need not be uniform over replay time. Interspersed keyframes have been indicated by dark vertical stripes such as 68. A keyframe is used as representing, or as being typical of the overall video in the interval up to the next keyframe. The keyframes may be singled out by a video provider as the first frame of each new shot through adding a label or inclusion in a "table of contents" (TOC). Alternatively, the receiver, through some algorithm, detects that the video content changes abruptly from one frame to the next. The present invention takes the associated algorithms for granted. As shown, their distribution may be non-uniform. A further mechanism is that successive keyframes succeed each other at prescribed intervals, such as every 2-3 seconds. In the embodiment, at indication 62 only the keyframes are represented. Furthermore, the keyframes are organized in some hierarchy, in that indication 64 has only a limited set of highly relevant keyframes. This hierarchization may be multilevel, in that indication 66 is associated with only a single keyframe for all of the video presentation 60. The various levels of keyframes may be determined in different ones of the organizations recited supra, and may even exist side by side.
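As an illustration of the receiver-side alternative just mentioned, the following minimal Python sketch flags a keyframe whenever the content changes abruptly from one frame to the next, and additionally whenever a prescribed interval has elapsed; the frame representation, the difference metric, the threshold and the interval length are illustrative assumptions, not prescriptions from this disclosure.

```python
import numpy as np

def select_keyframes(frames, threshold=30.0, interval=60):
    """Flag a frame as a keyframe when it differs strongly from its
    predecessor (a crude shot-change detector), and in any case whenever a
    prescribed number of frames has elapsed since the last keyframe.

    frames    -- iterable of greyscale frames as 2-D numpy arrays
    threshold -- mean absolute luminance difference counted as a content change
    interval  -- force a keyframe at least every `interval` frames
    """
    keyframe_indices = []
    previous = None
    last_key = None
    for index, frame in enumerate(frames):
        frame = frame.astype(np.float32)
        abrupt = previous is not None and np.abs(frame - previous).mean() > threshold
        overdue = last_key is None or index - last_key >= interval
        if abrupt or overdue:
            keyframe_indices.append(index)
            last_key = index
        previous = frame
    return keyframe_indices
```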
The storage mapping of Figure 1 may be effected in that the main body of the video presentation is stored in tape recorder 38, whereas at least the keyframes are reproduced in disc recorder 40, possibly together with short video and/or audio intervals immediately following the associated keyframe. The length of such an interval may correspond to the time latency of linear tape recorder 38, so that real-time access may thereby be attained. By itself, the video presentation may be essentially linear, such as a film. Alternative usage is that certain storage intervals may contain animation, stills, or other images to be used by a consumer. A possible influencing of a keyframe is to suppress it. This effectively joins the time interval before the keyframe in question with the time interval behind it. A reset feature may again disjoin the interval. Also, various classes of keyframes may be suppressed, such as the class whose members are separated from each other by a fixed time interval. Various different classes of keyframes could be available for one presentation, such as those introduced by the provider versus those that are generated by a local algorithm at reception.
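The two-level storage mapping can be pictured with a small index structure. The sketch below is a hypothetical Python illustration of keeping, per keyframe, a short bridging clip on the fast disc recorder sized to cover the seek latency of the tape recorder; the record fields, the driver objects (`disc`, `tape`) and the latency value are assumptions.

```python
from dataclasses import dataclass

TAPE_LATENCY_S = 4.0   # assumed worst-case seek time of linear tape recorder 38

@dataclass
class KeyframeEntry:
    time_s: float        # position of the keyframe in the presentation
    disc_clip: str       # short bridging clip held on disc recorder 40
    tape_offset: int     # where the full-rate material continues on tape recorder 38

def play_from(entry, disc, tape):
    """Start the bridging clip from the fast disc immediately and let the tape
    transport seek in parallel; because the clip lasts at least
    TAPE_LATENCY_S, the tape can take over without a visible gap."""
    disc.play(entry.disc_clip)       # hypothetical disc-recorder driver call
    tape.seek(entry.tape_offset)     # hypothetical tape-transport driver call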
Figure 3 shows a design of a scrolling mosaic user interface. Every screen presents 20 keyframes, ordered from top left to bottom right; each keyframe shows its number in the overall ranking of keyframes. Here, keyframe 144 is highlighted by a rectangular control cursor. A user person may activate a remote control to move the cursor freely over the keyframes displayed, as well as over the buttons displayed in the top and bottom bars, through the navigational controls on the cursor device. If the user moves the control cursor to the left in the top left corner, the display jumps back by 20 keyframes. Moving to the right in the lower right-hand corner causes a forward jump over 20 keyframes. Accessing the top bar of the screen controls access to other parts of the presentation, which is divided into five equally long parts; a black horizontal bar indicates which part of the overall presentation is covered by the twenty keyframes currently displayed.
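A minimal sketch of the paging behaviour described for Figure 3, assuming a flat list of keyframes; the function names and the clamping at the ends of the presentation are assumptions.

```python
PAGE_SIZE = 20

def page_of(keyframes, first_index):
    """Return the 20 keyframes shown on one mosaic screen."""
    return keyframes[first_index:first_index + PAGE_SIZE]

def move_cursor(first_index, cursor, step, total):
    """Move the rectangular cursor by `step` (+1 right, -1 left).  Stepping
    left past the top-left corner jumps back one page; stepping right past
    the bottom-right corner jumps forward one page."""
    cursor += step
    if cursor < 0:
        first_index = max(first_index - PAGE_SIZE, 0)
        cursor = PAGE_SIZE - 1
    elif cursor >= PAGE_SIZE:
        first_index = min(first_index + PAGE_SIZE, max(total - PAGE_SIZE, 0))
        cursor = 0
    return first_index, cursor
```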
Other functions are initiated by first selecting a particular keyframe and subsequently one of the bottom buttons. "View program" controls a start at a cursor-accessed keyframe. "View segment" does the same, but plays only a single segment that ends at the next keyframe. "View from x to y" controls a start at the earlier in time of two cursor-accessed keyframes, and stops at the later of the two. Other modes are feasible together with the keyframe-selecting functionalities. Examples are fast-forward or slow-forward, which allow a user person to check a particular interval for certain occurrences, or fast/slow reverse to attain certain video effects. During display, upon passing the instant in time pertaining to a particular keyframe, the latter becomes active and effectively displays the video stream until arriving at the instant associated with the next keyframe. Thereupon, the latter becomes the active frame. The above feature allows a user to straightforwardly program a video recorder for an interval display sequence, such as by leaving out certain segments (for example advertising), or rather to draw attention to certain details by means of slow-forward. During the display, audio may be active or suppressed through a control button not shown. Alternatively, control may let the audio go on, whereas the video cursor is discrete, in that it steps only from interval to interval through appropriate highlighting.
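The segment-oriented button functions can be expressed compactly. The sketch below assumes a sorted list of keyframe time stamps and merely returns the playback range, leaving actual transport control aside; the names are assumptions.

```python
def view_segment(keyframe_times, i, programme_end):
    """'View segment': play the single interval belonging to keyframe i,
    ending at the next keyframe (or at the end of the programme)."""
    start = keyframe_times[i]
    end = keyframe_times[i + 1] if i + 1 < len(keyframe_times) else programme_end
    return start, end

def view_from_to(keyframe_times, i, j):
    """'View from x to y': start at the earlier of the two cursor-accessed
    keyframes and stop at the later one, whatever the selection order."""
    first, last = sorted((i, j))
    return keyframe_times[first], keyframe_times[last]
```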
Figure 4 is an exemplary design of a scrolling list user interface. Here, at its bottom the overall screen has five keyframes displayed, keyframe 145 being highlighted by a rectangular control cursor that runs along its edges. Keyframe 145 is also displayed at a larger magnification in the background. The control interface is the same as in Figure 3, although the button positions are different. Alternatively, the enlarged keyframe is suppressed in the multi-keyframe bar.
Figure 5 shows a more extensive graphical user interface. At left and right are columns of control buttons for play, stop, select, cut, paste, fast reverse, zoom+, zoom-, and fast forward. The bottom row has a sequence of nine keyframes that pertain to respective different scenes or shots, in that they have hardly any correlation between them.
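A minimal sketch of how the nine-keyframe bottom row of Figure 5 can be maintained as a sliding window centred on the active keyframe, so that reaching the material of the next keyframe shifts the row one position to the left, as described in the next paragraph; the list handling and the names are assumptions.

```python
ROW_SIZE = 9

def bottom_row(keyframe_ids, active_index):
    """Keyframes shown in the bottom row: the active keyframe sits in the
    centre where possible; advancing the active keyframe by one shifts the
    whole row one position to the left."""
    half = ROW_SIZE // 2
    start = max(0, min(active_index - half, len(keyframe_ids) - ROW_SIZE))
    return keyframe_ids[start:start + ROW_SIZE]
```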
Through stepping in the hierarchical organization of the keyframes, a good overview of the scene-to-scene dynamics may be gathered. The inter-keyframe distance could be, for example, ten seconds, but greater and smaller spacings could be feasible. Especially with short distances in time between successive keyframes, features such as fast forward can be realized well. On the other hand, the same spacing could be used for full playback of all audio, whilst the video would only jump from one keyframe to the next. Now, the central keyframe is also represented in an enlarged manner. When playing closely spaced keyframes of low enough granularity, the enlarged keyframe may be presented in a dynamic manner, so effecting a fast forward (or backward) mode. Upon reaching the material of the next keyframe, here showing a sailing vessel, the bottom row shifts one position to the left, so that the "sun" at left becomes obscured and a new keyframe enters from the right. Such display could in particular be at a faster frame rate than standard video, as mapped on the presentation from the background storage medium. The reverse organization allows for fast reverse.
Figure 6 shows the presentation of subtitles, in the general format as discussed with reference to Figure 5. In the central field, space 50 has been devoted to the actual frame; space 52 has been devoted to displaying subtitles derived from, or associated with, the video presentation, or to other relevant information, such as speech-to-text converted for the deaf, or a translation into another language than the one used for the actual speech. It would not be necessary that the subtitles derive only from the range associated with the seven keyframes at the screen bottom. Their relevance could stretch much further. Further, each keyframe has a time code 54 or other relevant data overlaid thereon. The two columns of control buttons 56, 58 have been devoted to application operations at left, and intra-program operators at right. The top of the screen has the title 60 of the actual video program displayed.
The rationale for having a dynamic representation of the video cursor, which runs in time in the actually active keyframe field, is that the static representation of the keyframes alone does not sufficiently convey the dynamics of the video representation as a whole, whereas such dynamics let the user better understand the evolution of the events. Therefore, the semantics are enhanced as follows. After the system has been idle for a certain amount of time, the keyframe that the cursor 'encloses' will 'come alive', because it will start playing the digital video material in miniature, including any associated audio and further effects. If, during the playback, the next keyframe is reached, the cursor will automatically 'jump' to the next keyframe presented in the user interface, until the user (re)starts interacting with the system. In general, the organization described herein will allow browsing through information that is different and separate from the overall video string. Even if only the audio is played in a dynamic manner, whilst jumping from any keyframe to the next, the user gets a better impression of the underlying video, at particularly low storage requirements.
In this respect, Figure 7 is a state diagram of the system operation. In state 100, the system awaits input from the user while displaying the multiple keyframes. Such input may imply jumping among the displayed keyframes, jumping to another set of keyframes, or selecting a keyframe for displaying the associated interval. Any such input effects arrow 104 and starts a new time interval. The absence of any such input during n seconds (such as 20 seconds) effects arrow 108, so that state 102 is reached. Therein, the system runs the dynamic video cursor frame. As long as no user input is received, arrow 110 is effected, and the system continues as long as displayable video material is available. If user input is received, however, arrow 106 is effected, and the system freezes, either at the actual content of the dynamic video cursor frame or at the beginning of the actual interval.
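The state transitions of Figure 7 reduce to a few lines. The sketch below assumes a 20-second value for n and a monotonic clock, and leaves the actual rendering of states 100 and 102 to the surrounding application; the names are assumptions.

```python
import time

IDLE_TIMEOUT_S = 20.0                       # the 'n seconds' of the description
BROWSING, DYNAMIC = "state_100", "state_102"

def next_state(state, user_input, idle_since, now=None):
    """One step of the Figure 7 state machine.  Any user input returns the
    system to (or keeps it in) the static browsing state and restarts the idle
    timer (arrows 104/106); a long enough idle period starts the dynamic video
    cursor (arrow 108); otherwise nothing changes (arrow 110)."""
    now = time.monotonic() if now is None else now
    if user_input:
        return BROWSING, now
    if state == BROWSING and now - idle_since >= IDLE_TIMEOUT_S:
        return DYNAMIC, idle_since
    return state, idle_since
```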
DETECTION AND FILTERING OF KEYFRAMES AFFECTED BY "INTERLACING" EFFECT
Some of the keyframes used to browse the content of the video program may have been extracted from a sequence with high motion. This produces an annoying zigzag effect in case the video sequence was encoded in interlaced coding mode, as is normally the case: a frame is made up of two fields which together form the complete frame, the even lines belonging to one field and the odd lines to the other. The problem, barely visible in small keyframes, becomes evident and annoying when the picture is magnified and the lines become thick blocks.
First, the keyframes affected by such an interlacing effect must be detected. The effect can be observed across the rows of the image, where the line-to-line luminance variations cause high spatial-frequency values. This can be exploited by splitting the spatial frequency spectrum into many sub-bands and then considering only the high-frequency components. The effect to be detected presents alternating luminance values between even and odd lines, and therefore lies at the highest vertical sampling frequency of the picture. The only coefficient that has to be computed is thus the highest-frequency component of a frequency transformation (FFT or, better, DCT) taken along the columns. When the picture is affected by the zig-zag effect due to interlacing, this component will have a high value.
However, this effect will also be visible in correspondence with a moving object, especially one with components in the horizontal direction. Therefore we should not consider the total sum of the coefficients, as this would also yield high values in a picture with detailed and contrasted patterns and finally produce false positives. A better result can be obtained by splitting the image into several sub-parts and considering the greatest value per area. For example, by summing the two highest values of each area, the overall sum will be less susceptible to highly detailed images.
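A sketch of the detection idea under the stated refinement: the even/odd line alternation is measured per sub-area, and only the two highest values of each area contribute to the overall score, so a merely detailed picture does not trigger a false positive. The alternating-sign column sum here stands in for the highest-frequency DCT coefficient, and the block size and threshold are assumptions.

```python
import numpy as np

def interlace_score(luma, block=16):
    """Estimate how strongly a keyframe shows the interlacing zig-zag.

    For every block x block sub-area, the energy at the highest vertical
    frequency is approximated by an alternating-sign sum down the columns
    (the pattern an even/odd field comb produces); only the two highest
    column values of each area enter the overall sum."""
    luma = luma.astype(np.float32)
    h, w = luma.shape
    signs = np.where(np.arange(h) % 2 == 0, 1.0, -1.0)
    score = 0.0
    for by in range(0, h - block + 1, block):
        for bx in range(0, w - block + 1, block):
            patch = luma[by:by + block, bx:bx + block]
            cols = np.abs((patch * signs[by:by + block, None]).sum(axis=0))
            score += float(np.sort(cols)[-2:].sum())
    return score

def is_interlaced(luma, threshold=4000.0):   # threshold chosen empirically (assumption)
    return interlace_score(luma) > threshold
```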
Finally, as a lower vertical resolution is less annoying than the zig-zag effect, the simplest way of filtering such an image is to consider only one field and then to upsample it vertically by a factor of 2. An interpolating filter, as mentioned in the following section, can be applied before showing the resulting picture.
So, the detection and correction are effected as follows. The first step is to discard one field by removing half of the rows, either the even or the odd ones; then the remaining rows are upsampled vertically by a factor of 2 in order to recover the original size of the keyframe, followed by an interpolating filter. In this case, the filter performs a simple linear interpolation.
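A corresponding correction sketch: discard one field, upsample vertically by a factor of 2, and fill the inserted rows by simple linear interpolation. Working on a single luminance plane and assuming an even number of lines are simplifications of this sketch, not requirements of the disclosure.

```python
import numpy as np

def deinterlace(luma, keep_even=True):
    """Discard one field and rebuild the keyframe at its original height:
    keep every other row, duplicate the kept rows (vertical upsampling by 2),
    then replace each inserted row by the average of its neighbours, which is
    a simple linear interpolation."""
    field = luma[0::2] if keep_even else luma[1::2]
    out = np.repeat(field, 2, axis=0)[:luma.shape[0]].astype(np.float32)
    for r in range(1, out.shape[0] - 1, 2):
        out[r] = 0.5 * (out[r - 1] + out[r + 1])
    return out
```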
UPSAMPLING AND INTERPOLATION
To be easily visible from a TV-viewing distance, the keyframe has to be enlarged to almost full screen size by means of an upsampling followed by an interpolating filter. Since the keyframe generally has a low resolution, it has to be enlarged by quite a high factor. This means that if it is not further processed, the result would not be good-looking, as pixels become large blocks. Therefore the picture must be filtered, but a trade-off must be found, since we need to generate a good-quality picture to be shown at high resolution, but also fast processing for the application to have a short response time. The issue is that the enlargement must be performed on the fly, which means that the image cannot be enlarged and filtered just once, stored on the hard disk and re-used, because that would require too much storage space. Therefore the upsampling and filtering process must be as fast as possible while at the same time maintaining an acceptable result. Normally a usual interpolating filter may be employed (cf. any book on Digital Signal Processing; a relevant paper is: H.C. Andrews, C.L. Patterson, "Digital Interpolation of Discrete Images", IEEE Trans. Comput., 1976, vol. C-25, pages 196-202).
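A possible on-the-fly enlargement in plain numpy, using a separable (bilinear) interpolation as a stand-in for the usual interpolating filter; the integer enlargement factor and the absence of any pre-stored enlarged copy reflect the trade-off discussed above. The function name and default factor are assumptions.

```python
import numpy as np

def enlarge(luma, factor=4):
    """Enlarge a keyframe on the fly by an integer factor with separable
    (bilinear) interpolation, so that pixels do not turn into flat blocks."""
    luma = luma.astype(np.float32)
    h, w = luma.shape
    ys = np.linspace(0.0, h - 1.0, h * factor)   # fractional source row of each output row
    xs = np.linspace(0.0, w - 1.0, w * factor)   # fractional source column of each output column
    y0 = np.floor(ys).astype(int)
    x0 = np.floor(xs).astype(int)
    y1 = np.minimum(y0 + 1, h - 1)
    x1 = np.minimum(x0 + 1, w - 1)
    wy = (ys - y0)[:, None]
    wx = (xs - x0)[None, :]
    top = luma[y0][:, x0] * (1.0 - wx) + luma[y0][:, x1] * wx
    bottom = luma[y1][:, x0] * (1.0 - wx) + luma[y1][:, x1] * wx
    return top * (1.0 - wy) + bottom * wy
```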
Alternative techniques to improve the image quality can be used as well. Amongst them, wavelet-based and fractal approaches seem to lead to a higher computational burden, but show outstanding results in visual quality. In fact, fractal compression techniques are well known to be resolution independent: the details at a higher resolution can be reconstructed or simulated by applying the same decoding process iteratively. In this case what will be stored is a fractal-compressed picture, yielding a high compression factor. Similarly, by using a wavelet transformation, high-frequency components at higher scales can be predicted to obtain a higher-resolution image without blurring effects.
TEXTUAL SEARCH ON VIDEO PROGRAMMES BASED ON SUBTITLES
In current video transmissions, subtitles are often transmitted along with the program (often in the Vertical Blanking Interval for analog systems, or in a separate elementary stream in digital transmissions). This is normally used for programs distributed in foreign languages without dubbing, or is meant for persons with a hearing disability. Such information is normally superimposed on the screen, but could also be recorded on a storage medium. In this way, the speech of the program, and sometimes also some description of the sound for deaf people, is available for search operations.
The extraction of this kind of information should happen in real time, while the program is being recorded. If this technique is coupled to the keyframe extraction routines, we may link the picture to the related text, i.e. the dialogue that takes place in the part of the program from which the keyframe has been extracted. In this way, with current text retrieval techniques, we can perform text retrieval based on specific keywords. A specific tool of the application will offer the possibility to perform simple queries based on keywords and their composition, as now commonly used in "Web" search engines.
As an example, suppose a news program has been recorded. If we intend to retrieve news regarding France, when the word "France" is entered, the system will automatically look for this word in the text of the program. If the result is positive, the user is presented with the keyframe related to that part of the program and the specific part of the subtitles where the keyword was found. The user can then start watching the program from that particular point. If more keyframes are found as a result of the query, they will all be shown at the bottom of the screen, as in Figure 5, so that the user can analyze the related text one by one in the larger window. Of course, similar keywords (French, Paris) can be used if the result was negative. This system can also be useful in sports programs, to extract reports covering a specific team or sport.
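A sketch of the keyframe-to-subtitle index and the keyword query described above; the record fields, the way subtitle lines are attached to keyframe intervals, and the case-insensitive matching are illustrative assumptions.

```python
from dataclasses import dataclass
import re

@dataclass
class IndexedKeyframe:
    time_s: float          # time of the keyframe in the programme
    image: str             # e.g. path of the stored keyframe picture
    subtitles: str = ""    # subtitle text captured during the keyframe's interval

def build_index(keyframes, subtitle_events):
    """Attach each subtitle line (time, text) to the keyframe whose interval
    was active when the line was transmitted; keyframes must be sorted by time."""
    for t, text in subtitle_events:
        current = None
        for kf in keyframes:
            if kf.time_s <= t:
                current = kf
            else:
                break
        if current is not None:
            current.subtitles += " " + text
    return keyframes

def search(keyframes, keyword):
    """Return the keyframes whose linked subtitle text contains the keyword,
    ready to be shown along the bottom of the screen as in Figure 5."""
    pattern = re.compile(re.escape(keyword), re.IGNORECASE)
    return [kf for kf in keyframes if pattern.search(kf.subtitles)]
```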
Many other applications are feasible, for example checking whether a movie is to be allowed for children's viewing, by checking whether the words used in the dialogues are included in a list of "bad words" or not.
Possible extensions of such a system include: • extracting the text from the screen, such as by OCR techniques on still pictures, if the text is not available separately from the video; • using speech recognition technology to extract the dialogues from the program.
In this case the system will always be independent of the service offered by the broadcaster, so that even in case no subtitle is provided, text retrieval will always be possible, at least on some specific keywords that the system can be trained to recognize.

Claims

CLAIMS:
1. A method for navigating through video matter by means of displaying one or more series of a plurality of key-frames in parallel, whilst allowing selective accessing of displayed keyframes for thereupon controlling actual access to said video matter as representing a mapping of so accessed keyframes, said method being characterized by allowing within a single user interface organization to select between a first operative mode for arranging keyframes in a temporally ordered manner on the screen and a second operative mode for arranging keyframes with multiple selectable granularities between contiguous keyframes as displayed.
2. A method as claimed in Claim 1, and whilst in said temporally ordered manner progressively playing back an audio interval associated to a temporally centered keyframe.
3. A method as claimed in Claim 2, wherein successive audio intervals will constitute a substantially continuous audio representation with respect to a sequence of discretely spaced keyframes.
4. A method as claimed in Claim 1, and in the second operative mode playing back an audio interval associated to an actually accessed keyframe.
5. A method as claimed in Claim 1, characterized by highlighting a presently selected keyframe whilst simultaneously enlarging it at a multiple-sized format with respect to other keyframes, the method furthermore including detecting deleterious video interlacing effects and if so, reducing such effect by vertical decimation.
6. A method as claimed in Claim 1, characterized by highlighting a presently selected keyframe whilst simultaneously enlarging it at a multiple format with respect to other keyframes, the method furthermore including applying an upsampling filter to the image before display.
7. A method as claimed in Claim 1, whilst furthermore displaying associated to an actualized keyframe a subtitle or other relevant information extracted for an associated keyframe or sequence of keyframes.
8. A device being arranged for executing a method as claimed in Claim 1.
PCT/IB1998/000091 1997-02-03 1998-01-22 A method and device for navigating through video matter by means of displaying a plurality of key-frames in parallel WO1998034182A2 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
JP10529208A JP2000516006A (en) 1997-02-03 1998-01-22 Method and apparatus for navigating video content by displaying a plurality of key frames in parallel
PCT/IB1998/000091 WO1998034182A2 (en) 1997-02-03 1998-01-22 A method and device for navigating through video matter by means of displaying a plurality of key-frames in parallel
EP98900329A EP0914638A2 (en) 1997-02-03 1998-01-22 A method and device for navigating through video matter by means of displaying a plurality of key-frames in parallel
KR1019980708181A KR100552248B1 (en) 1997-02-03 1998-01-22 Method and apparatus for navigating through video material by multiple key-frames parallel display

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP97200286.9 1997-02-03
EP97201802.2 1997-06-16
PCT/IB1998/000091 WO1998034182A2 (en) 1997-02-03 1998-01-22 A method and device for navigating through video matter by means of displaying a plurality of key-frames in parallel

Publications (2)

Publication Number Publication Date
WO1998034182A2 true WO1998034182A2 (en) 1998-08-06
WO1998034182A3 WO1998034182A3 (en) 1998-10-08

Family

ID=11004656

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB1998/000091 WO1998034182A2 (en) 1997-02-03 1998-01-22 A method and device for navigating through video matter by means of displaying a plurality of key-frames in parallel

Country Status (1)

Country Link
WO (1) WO1998034182A2 (en)

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0915471A2 (en) * 1997-11-10 1999-05-12 Hitachi, Ltd. Video searching, video information production, and storage medium for storing processing program therefor
EP0989759A1 (en) * 1998-08-24 2000-03-29 Hitachi, Ltd. Method and apparatus for detecting local video features and recording medium for storage of data generated during the implementation of the method
EP0999504A1 (en) * 1998-11-06 2000-05-10 Sony Corporation Image processing apparatus, image processing method, and information providing medium
WO2000034953A1 (en) * 1998-12-10 2000-06-15 Arnold & Richter Cine Technik Gmbh & Co. Betriebs Kg Method for separating image sequences stored on supports such as motion-picture film, video tape or such like
KR20020074328A (en) * 2001-03-20 2002-09-30 서치캐스트 주식회사 Method for playing motion pictures using keyframe and apparatus thereof
EP1251515A1 (en) * 2001-04-19 2002-10-23 Koninklijke Philips Electronics N.V. Method and system for selecting a position in an image sequence
KR100374040B1 (en) * 2001-03-09 2003-03-03 엘지전자 주식회사 Method for detecting caption synthetic key frame in video stream
WO2003069621A1 (en) * 2002-02-14 2003-08-21 Koninklijke Philips Electronics N.V. Visual summary for scanning forwards and backwards in video content
KR100411340B1 (en) * 2001-03-09 2003-12-18 엘지전자 주식회사 Video browsing system based on article of news video content
KR100441342B1 (en) * 2001-04-07 2004-07-23 박범서 Tool for playing a multimedia data and method for playing a multimedia data using the same
KR100479653B1 (en) * 2000-07-01 2005-03-30 주식회사 디지트리얼테크놀로지 A method of panoramic video displaying
KR100512138B1 (en) * 2000-03-08 2005-09-02 엘지전자 주식회사 Video Browsing System With Synthetic Key Frame
AU2003204917B2 (en) * 2002-06-24 2006-01-12 Canon Kabushiki Kaisha Method and Apparatus for Synchronising a Keyframe with Sound
DE102004056445A1 (en) * 2004-11-23 2006-05-24 Siemens Ag Searching of a video image sequence displayed on a mobile telephone uses data stored as a sequence of time periods
KR100767673B1 (en) * 2005-06-20 2007-10-18 엘지전자 주식회사 Digital Broadcasting Terminal with Emboding Slide Show and Method of Emboding Slide Show Using Same
EP1924089A3 (en) * 2006-11-20 2008-08-20 Samsung Electronics Co., Ltd. Content playback
WO2009082579A3 (en) * 2007-12-18 2009-10-08 Netflix, Inc. Trick play of streaming media
EP2161722A1 (en) * 2008-09-03 2010-03-10 LG Electronics Inc. Terminal, controlling method thereof and recordable medium for the same
EP2079234A3 (en) * 2008-01-09 2010-12-01 Sony Corporation Video searching apparatus, editing apparatus, video searching method, and program
WO2011101762A1 (en) 2010-02-16 2011-08-25 Nds Limited Video trick mode mechanism
EP2846331A1 (en) * 2013-09-06 2015-03-11 Kabushiki Kaisha Toshiba Electronic apparatus, control method, and computer-readable storage medium
EP2980802A1 (en) * 2005-03-21 2016-02-03 Samsung Electronics Co., Ltd Optical disc recording apparatus and cell-by-cell editing method using the same
CN106576151A (en) * 2014-10-16 2017-04-19 三星电子株式会社 Video processing apparatus and method
EP1513151B1 (en) * 2003-09-02 2019-04-17 Sony Corporation Device and method for editing moving picture data

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0739129A2 (en) * 1995-04-21 1996-10-23 Eastman Kodak Company A system and method for creating high-quality stills from interlaced video images

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0739129A2 (en) * 1995-04-21 1996-10-23 Eastman Kodak Company A system and method for creating high-quality stills from interlaced video images

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
IEEE MULTIMEDIA, Summer 1994, S.W. SMOLIAR et al., "Content-Based Video Indexing and Retrieval", pages 62-72. *
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, Volume 41, No. 3, August 1995, H-Y. CHEN et al., "A Multi-Layer Video Browsing System", page 842. *
JOURNAL OF VISUAL LANGUAGES AND COMPUTING, Volume 1, 1990, Y. TONOMURA et al., "Content Oriented Visual Interface Using Video Icons for Visual Database Systems", pages 183-198. *
PROCEEDINGS OF THE SPIE, Volume 2417, 1995, H.J. ZHANG et al., "Content-Based Video Browsing Tools", pages 389-398. *

Cited By (44)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0915471A3 (en) * 1997-11-10 2001-08-01 Hitachi, Ltd. Video searching, video information production, and storage medium for storing processing program therefor
US6449608B1 (en) 1997-11-10 2002-09-10 Hitachi, Ltd. Video searching method and apparatus, video information producing method, and storage medium for storing processing program thereof
EP0915471A2 (en) * 1997-11-10 1999-05-12 Hitachi, Ltd. Video searching, video information production, and storage medium for storing processing program therefor
US6584463B2 (en) 1997-11-10 2003-06-24 Hitachi, Ltd. Video searching method, apparatus, and program product, producing a group image file from images extracted at predetermined intervals
EP0989759A1 (en) * 1998-08-24 2000-03-29 Hitachi, Ltd. Method and apparatus for detecting local video features and recording medium for storage of data generated during the implementation of the method
US6747674B1 (en) 1998-11-06 2004-06-08 Sony Corporation Image processing apparatus, image processing method, and information providing medium
EP0999504A1 (en) * 1998-11-06 2000-05-10 Sony Corporation Image processing apparatus, image processing method, and information providing medium
WO2000034953A1 (en) * 1998-12-10 2000-06-15 Arnold & Richter Cine Technik Gmbh & Co. Betriebs Kg Method for separating image sequences stored on supports such as motion-picture film, video tape or such like
US6987809B1 (en) 1998-12-10 2006-01-17 Arnold & Richter Cine Technik Gmbh & Co. Betriebs Kg Method for separating image sequences stored on supports such as motion-picture film, video tape or such like
KR100512138B1 (en) * 2000-03-08 2005-09-02 LG Electronics Inc. Video Browsing System With Synthetic Key Frame
KR100479653B1 (en) * 2000-07-01 2005-03-30 Digitreal Technology Co., Ltd. A method of panoramic video displaying
KR100411340B1 (en) * 2001-03-09 2003-12-18 LG Electronics Inc. Video browsing system based on article of news video content
KR100374040B1 (en) * 2001-03-09 2003-03-03 LG Electronics Inc. Method for detecting caption synthetic key frame in video stream
KR20020074328A (en) * 2001-03-20 2002-09-30 Searchcast Co., Ltd. Method for playing motion pictures using keyframe and apparatus thereof
KR100441342B1 (en) * 2001-04-07 2004-07-23 Park Beom-Seo Tool for playing a multimedia data and method for playing a multimedia data using the same
CN100346420C (en) * 2001-04-19 2007-10-31 Koninklijke Philips Electronics N.V. Keyframe-based playback position selection method and system
KR101073528B1 (en) * 2001-04-19 2011-10-17 Koninklijke Philips Electronics N.V. Keyframe-based playback position selection method and system
WO2002086897A1 (en) * 2001-04-19 2002-10-31 Koninklijke Philips Electronics N.V. Keyframe-based playback position selection method and system
EP1251515A1 (en) * 2001-04-19 2002-10-23 Koninklijke Philips Electronics N.V. Method and system for selecting a position in an image sequence
US7333712B2 (en) 2002-02-14 2008-02-19 Koninklijke Philips Electronics N.V. Visual summary for scanning forwards and backwards in video content
WO2003069621A1 (en) * 2002-02-14 2003-08-21 Koninklijke Philips Electronics N.V. Visual summary for scanning forwards and backwards in video content
AU2003204917B2 (en) * 2002-06-24 2006-01-12 Canon Kabushiki Kaisha Method and Apparatus for Synchronising a Keyframe with Sound
EP1513151B1 (en) * 2003-09-02 2019-04-17 Sony Corporation Device and method for editing moving picture data
DE102004056445A1 (en) * 2004-11-23 2006-05-24 Siemens Ag Searching of a video image sequence displayed on a mobile telephone uses data stored as a sequence of time periods
EP2980802A1 (en) * 2005-03-21 2016-02-03 Samsung Electronics Co., Ltd Optical disc recording apparatus and cell-by-cell editing method using the same
KR100767673B1 (en) * 2005-06-20 2007-10-18 LG Electronics Inc. Digital Broadcasting Terminal with Emboding Slide Show and Method of Emboding Slide Show Using Same
US7650577B2 (en) 2005-06-20 2010-01-19 Lg Electronics, Inc. Digital data receiver and method for constructing slideshows
EP1924089A3 (en) * 2006-11-20 2008-08-20 Samsung Electronics Co., Ltd. Content playback
WO2009082579A3 (en) * 2007-12-18 2009-10-08 Netflix, Inc. Trick play of streaming media
US8365235B2 (en) 2007-12-18 2013-01-29 Netflix, Inc. Trick play of streaming media
EP2079234A3 (en) * 2008-01-09 2010-12-01 Sony Corporation Video searching apparatus, editing apparatus, video searching method, and program
US8704794B2 (en) 2008-09-03 2014-04-22 Lg Electronics Inc. Terminal, controlling method thereof and recordable medium for the same
US8441451B2 (en) 2008-09-03 2013-05-14 Lg Electronics Inc. Terminal, controlling method thereof and recordable medium for the same
US8907916B2 (en) 2008-09-03 2014-12-09 Lg Electronics Inc. Terminal, controlling method thereof and recordable medium for the same
US9678587B2 (en) 2008-09-03 2017-06-13 Lg Electronics Inc. Terminal, controlling method thereof and recordable medium for the same
EP2296150B1 (en) * 2008-09-03 2018-03-28 Lg Electronics Inc. Terminal, controlling method thereof and recordable medium for the same
US10126866B2 (en) 2008-09-03 2018-11-13 Lg Electronics Inc. Terminal, controlling method thereof and recordable medium for the same
EP2161722A1 (en) * 2008-09-03 2010-03-10 LG Electronics Inc. Terminal, controlling method thereof and recordable medium for the same
WO2011101762A1 (en) 2010-02-16 2011-08-25 Nds Limited Video trick mode mechanism
EP2846331A1 (en) * 2013-09-06 2015-03-11 Kabushiki Kaisha Toshiba Electronic apparatus, control method, and computer-readable storage medium
CN106576151A (en) * 2014-10-16 2017-04-19 三星电子株式会社 Video processing apparatus and method
EP3143764A4 (en) * 2014-10-16 2017-12-27 Samsung Electronics Co., Ltd. Video processing apparatus and method
US10014029B2 (en) 2014-10-16 2018-07-03 Samsung Electronics Co., Ltd. Video processing apparatus and method
CN106576151B (en) * 2014-10-16 2021-06-15 三星电子株式会社 Video processing apparatus and method

Also Published As

Publication number Publication date
WO1998034182A3 (en) 1998-10-08

Similar Documents

Publication Publication Date Title
US6222532B1 (en) Method and device for navigating through video matter by means of displaying a plurality of key-frames in parallel
WO1998034182A2 (en) A method and device for navigating through video matter by means of displaying a plurality of key-frames in parallel
US6340971B1 (en) Method and device for keyframe-based video displaying using a video cursor frame in a multikeyframe screen
JP3667262B2 (en) Video skimming method and apparatus
KR100411437B1 (en) Intelligent news video browsing system
Yeung et al. Video visualization for compact presentation and fast browsing of pictorial content
Taniguchi et al. An intuitive and efficient access interface to real-time incoming video based on automatic indexing
US6154771A (en) Real-time receipt, decompression and play of compressed streaming video/hypervideo; with thumbnail display of past scenes and with replay, hyperlinking and/or recording permissively intiated retrospectively
US8301669B2 (en) Concurrent presentation of video segments enabling rapid video file comprehension
Bolle et al. Video query: Research directions
KR100411340B1 (en) Video browsing system based on article of news video content
US9185379B2 (en) Medium and method for interactive seamless branching and/or telescopic advertising
KR100411342B1 (en) Method for generating video text synthetic key frame
US20070101266A1 (en) Video summary description scheme and method and system of video summary description data generation for efficient overview and browsing
US20030126603A1 (en) Multimedia data searching and browsing system
WO2006126391A1 (en) Contents processing device, contents processing method, and computer program
EP1222634A1 (en) Video summary description scheme and method and system of video summary description data generation for efficient overview and browsing
KR101440168B1 (en) Method for creating a new summary of an audiovisual document that already includes a summary and reports and a receiver that can implement said method
Zhang Content-based video browsing and retrieval
KR100552248B1 (en) Method and apparatus for navigating through video material by multiple key-frames parallel display
WO1998034181A2 (en) A method and device for keyframe-based video displaying using a video cursor frame in a multikeyframe screen
JPH11220689A (en) Video software processor and medium for storing its program
JPH11224266A (en) Electronic video document preparation and utilization method and program storage medium
KR20020023063A (en) A method and apparatus for video skimming using structural information of video contents
KR100370249B1 (en) A system for video skimming using shot segmentation information

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): JP KR

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE

WWE Wipo information: entry into national phase

Ref document number: 1998900329

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 1019980708181

Country of ref document: KR

AK Designated states

Kind code of ref document: A3

Designated state(s): JP KR

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE

121 Ep: The EPO has been informed by WIPO that EP was designated in this application
WWP Wipo information: published in national office

Ref document number: 1998900329

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 1019980708181

Country of ref document: KR

WWR Wipo information: refused in national office

Ref document number: 1019980708181

Country of ref document: KR