EP0914638A2 - Procede et dispositif pour naviguer a travers des images video en affichant une pluralite d'images clef en parallele - Google Patents
Procede et dispositif pour naviguer a travers des images video en affichant une pluralite d'images clef en paralleleInfo
- Publication number
- EP0914638A2 EP0914638A2 EP98900329A EP98900329A EP0914638A2 EP 0914638 A2 EP0914638 A2 EP 0914638A2 EP 98900329 A EP98900329 A EP 98900329A EP 98900329 A EP98900329 A EP 98900329A EP 0914638 A2 EP0914638 A2 EP 0914638A2
- Authority
- EP
- European Patent Office
- Prior art keywords
- keyframes
- keyframe
- video
- whilst
- displaying
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 238000000034 method Methods 0.000 title claims abstract description 25
- 230000008520 organization Effects 0.000 claims abstract description 8
- 238000013507 mapping Methods 0.000 claims abstract description 3
- 235000019580 granularity Nutrition 0.000 claims abstract 2
- 230000000694 effects Effects 0.000 claims description 19
- 230000002939 deleterious effect Effects 0.000 claims description 2
- 238000013461 design Methods 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 230000009191 jumping Effects 0.000 description 4
- 238000001514 detection method Methods 0.000 description 3
- 238000001914 filtration Methods 0.000 description 3
- 206010011878 Deafness Diseases 0.000 description 2
- 230000002457 bidirectional effect Effects 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 230000006835 compression Effects 0.000 description 2
- 238000007906 compression Methods 0.000 description 2
- 238000009795 derivation Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 238000013459 approach Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
- G11B27/34—Indicating arrangements
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
- G11B27/102—Programmed access in sequence to addressed parts of tracks of operating record carriers
- G11B27/107—Programmed access in sequence to addressed parts of tracks of operating record carriers of operating tapes
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
- G11B27/11—Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information not detectable on the record carrier
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
- G11B27/19—Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier
- G11B27/28—Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/431—Generation of visual interfaces for content selection or interaction; Content or additional data rendering
- H04N21/4312—Generation of visual interfaces for content selection or interaction; Content or additional data rendering involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/431—Generation of visual interfaces for content selection or interaction; Content or additional data rendering
- H04N21/4312—Generation of visual interfaces for content selection or interaction; Content or additional data rendering involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations
- H04N21/4316—Generation of visual interfaces for content selection or interaction; Content or additional data rendering involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations for displaying supplemental content in a region of the screen, e.g. an advertisement in a separate window
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/472—End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
- H04N21/47217—End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for controlling playback functions for recorded or on-demand content, e.g. using progress bars, mode or play-point indicators or bookmarks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/482—End-user interface for program selection
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/488—Data services, e.g. news ticker
- H04N21/4884—Data services, e.g. news ticker for displaying subtitles
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/83—Generation or processing of protective or descriptive data associated with content; Content structuring
- H04N21/845—Structuring of content, e.g. decomposing content into time segments
- H04N21/8455—Structuring of content, e.g. decomposing content into time segments involving pointers to the content, e.g. pointers to the I-frames of the video stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/44—Receiver circuitry for the reception of television signals according to analogue transmission standards
- H04N5/445—Receiver circuitry for the reception of television signals according to analogue transmission standards for displaying additional information
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/44—Receiver circuitry for the reception of television signals according to analogue transmission standards
- H04N5/50—Tuning indicators; Automatic tuning control
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/76—Television signal recording
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B2220/00—Record carriers by type
- G11B2220/20—Disc-shaped record carriers
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B2220/00—Record carriers by type
- G11B2220/40—Combinations of multiple record carriers
- G11B2220/45—Hierarchical combination of record carriers, e.g. HDD for fast access, optical discs for long term storage or tapes for backup
- G11B2220/455—Hierarchical combination of record carriers, e.g. HDD for fast access, optical discs for long term storage or tapes for backup said record carriers being in one device and being used as primary and secondary/backup media, e.g. HDD-DVD combo device, or as source and target media, e.g. PC and portable player
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B2220/00—Record carriers by type
- G11B2220/90—Tape-like record carriers
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/41—Structure of client; Structure of client peripherals
- H04N21/426—Internal components of the client ; Characteristics thereof
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/433—Content storage operation, e.g. storage operation in response to a pause request, caching operations
- H04N21/4331—Caching operations, e.g. of an advertisement for later insertion during playback
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/439—Processing of audio elementary streams
- H04N21/4394—Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/44—Receiver circuitry for the reception of television signals according to analogue transmission standards
- H04N5/445—Receiver circuitry for the reception of television signals according to analogue transmission standards for displaying additional information
- H04N5/45—Picture in picture, e.g. displaying simultaneously another television channel in a region of the screen
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/76—Television signal recording
- H04N5/78—Television signal recording using magnetic recording
- H04N5/781—Television signal recording using magnetic recording on disks or drums
Definitions
- the invention relates to a method according to the preamble of Claim 1.
- a continuous video stream means that video remains "on", which may include animation, a series of stills, or an interactive sequence of images.
- the character may be various, such as film, news, or for example a shopping list.
- State of the art is represented by the article 'Content-Based Video Indexing and Retrieval' by S.W. Smoliar and H.J. Zhang, IEEE Multimedia, Summer 1994, pages 62-72.
- Keyframes may be derived from video material upon its reception at the user's through a derivation algorithm, or keyframes may be labelled as such by a video provider, for example, in that each video shot will start with a keyframe.
- a third scheme is that the frames succeed each other with uniform time intervals as relating to standard video speed. The present invention recognizes that keyframes should be utilized so as to give users a dynamic overview over the presentation, combined with useful facilities for enabling them to easier access the material, for selecting or deselecting for subsequent display, or for editing.
- a particular problem with present-day projects for digital and compressed coding of video images is that storage thereof on mass media generally does not allow immediate access thereto, in particular in that the linear storage density such as expressable in frames per storage size is non-uniform. It has been proposed to supplement a high- capacity main storage medium such as tape with a secondary storage medium with smaller capacity and enhanced accessibility. In that case, the execution of trick modes, such as fast forward and fast reverse, as well as editing of the video material for subsequent presentation in an abstracted, modified, or rearranged form give rise to appreciable difficulties, both as seen from the aspect of the user interface, as well as perceived from the aspect of storage technology.
- the invention is characterized according to the remainder of Claim 1.
- Presenting the frames in a temporally ordered manner allows to effect fast forward and fast reverse in a simple manner, for example if the frames succeed each other with uniform time intervals as relating to standard video speed.
- the easy change of hierarchical level with variable granularity in time between the frames allows easy accessing and editing. The same is true if the keyframes or at least a fraction thereof derive from filmshot commencements, or from other relevant events generated by the original film editor. In this manner, a clustering operation may be effected automatically.
- the method may include highlighting a presently selected keyframe by enlarging it at a multiple-sized format with respect to other keyframes, whilst furthermore including detecting deleterious video interlacing effects and if so, reducing such effect by vertical decimation and/or including applying an upsampling filter to the image before display.
- video distortions in relatively smallish keyframes have been experienced as tolerable, if a particular keyframe is enlarged, extra measures should be taken for picture improvement.
- the inventor has recognized that this upgrading, although not always attaining the quality level present under standard rendering conditions, gives a pleasant and instructive improvement of picture quality.
- the invention also relates to a device arranged for implementing the steps of the method as recited. Further advantageous aspects of the invention are recited in dependent Claims.
- Figure 1 a block diagram of a TV-Recorder combination
- Figure 2 an exemplary structure of a video recording
- Figure 3 a design of a scrolling mosaic user interface
- Figure 4 a design of a scrolling list user interface
- Figure 5 a more extensive graphical user interface
- Figure 7 a state diagram of the system operation.
- the keyframes must be presented in such a manner that they are distinguishable from each other by a user person located at a typical TV viewing distance. • The number of keyframes presented simultaneously should be sufficient for providing a user person with an overview of a significant part of the contents of the digital video material.
- the keyframes should be displayed in an undistorted fashion such as by retaining aspect ratio.
- the remote control device of the TV set itself operates as user control device.
- the user interface should reflect the familiar linear model of a video presentation.
- FIG. 1 is a diagram showing a TV-Recorder combination according to the invention.
- Item 20 represents the TV-set display and associated immediate control and powering.
- Item 22 represents an antenna, or a connection with another type of signal distribution entity, such as cable distribution. This item includes, if appropriate, also the derivation of the digital video information or the digital signal part from the received signal.
- Item 34 represents the routing of the video streams and associated information between the various subsystems of Figure 1. The routing is governed by control box 28 through control signals on line 35. The latter has been drawn as a single bidirectional interconnection but may in fact be built from any number of unidirectional or bidirectional lines.
- the control box receives detection signals from display 20 on line 30 and from further subsystems 38, 40, whilst also controlling the latter two.
- Block 38 is a linear tape recorder with a very high storage capacity in the multi-gigabyte region.
- Block 40 is a magnetic disc recorder with a high storage capacity, but which is nevertheless only a fraction of that of recorder 38; on the other hand, access in recorder 40 is much faster through cross-track jumping.
- blocks 38 and 40 form a two-level storage organization that is somewhat akin to a computer memory cache system, and stores all items of a video presentation at least once.
- Item 24 represents a remote control device that by way of wireless 26 communicates with display device 20, and indirectly with subsystem 28 and further subsystems 38 and 40.
- Figure 2 shows an exemplary structure of a video presentation.
- bar 60 contains the video itself, either in the form of frames, or as a string of compressed video matter, such as MPEG-coded.
- the information is stored along the bar as video time progresses, although actual storage requirements need not be uniform over replay time.
- Interspersed keyframes have been indicated by dark vertical stripes such as 68.
- a keyframe is used as representing, or as being typical of the overall video in the interval up to the next keyframe.
- the keyframes may be singled out by a video provider as the first frame of each new shot through adding a label or inclusion in a "table of contents" (TOC).
- TOC "table of contents”
- the receiver through some algorithm, detects that the video content changes abruptly from one frame to the next.
- the present invention takes the associated algorithms for granted. As shown, their distribution may be non-uniform. A further mechanism is that successive keyframes succeed each other at prescribed intervals, such as every 2-3 seconds. In the embodiment, at indication 62 only the keyframes are represented. Furthermore, the keyframes are organized in some hierarchy, in that indication 64 has only a limited set of highly relevant keyframes. This hierarchization may be multilevel, in that indication 66 is associated with only a single keyframe for all of the video presentation 60. The various levels of keyframes may be determined in different ones of the organizations recited supra, and may even exist side by side.
- the storage mapping on Figure 1 may be effected in that the main body of the video presentation is stored in tape recorder 38, whereas at least the keyframes are reproduced in disc recorder 40, possibly together with short video and/or audio intervals immediately following the associated keyframe.
- the length of such interval may correspond to the time latency of linear tape recorder 38, so that thereby real-time access may be attained.
- the video presentation may be essentially linear, such as a film.
- Alternative usage is that certain storage intervals may contain animation, stills, or other images to be used by a consumer present.
- a possible influencing of a keyframe is to suppress it. This effectively joins the time interval before the keyframe in question with the time interval behind it.
- a reset feature may again disjoin the interval.
- keyframes may be suppressed, such as the class that is separated from each other by a fixed time interval.
- Various different classes of keyframes could be available for one presentation, such as those introduced by the provider versus those that are generated by a local algorithm at reception.
- Figure 3 is shows a design of a scrolling mosaic user interface. Every screen presents 20 keyframes starting from top left to right down: each keyframe has its number in the overall ranking of keyframes shown. Actually, keyframe 144 is highlighted by a rectangular control cursor. A user person may activate a remote control to move the cursor freely over the keyframes displayed, as well as over the buttons displayed at the top and bottom bars through the navigational controls on the cursor device. If the user moves the control cursor to the left in the top left corner, the display jumps back by 20 keyframes. Moving to the right in the lower right hand corner will cause a forward jump over 20 frames. Accessing the top bar of the screen will control accessing other parts of the presentation, in that the latter is divided into five equally long parts: a black horizontal bar indicates the total time covered by the twenty keyframes displayed here, of the overall presentation.
- the latter During display, upon passing the instant in time pertaining to a particular keyframe, the latter becomes active and effectively displays the video stream, until arriving at the instant associated with the next keyframe. Thereupon, the latter becomes the active frame.
- the above feature allows a user to straightforwardly program a video recorder for an interval display sequence such as by leaving out certain segments, such as advertising, or rather, to draw attention to certain details by means of slow-forward.
- audio may be active or suppressed through a control button not shown. Alternatively, control may let audio go one, whereas the video cursor is discrete, in that it steps only from interval to interval though appropriate highlighting.
- Figure 4 is an exemplary design of a scrolling list user interface.
- the overall screen has five keyframes displayed, keyframe 145 being highlighted by a rectangular control cursor that runs along its edges.
- Keyframe 145 is also displayed at a larger magnification in the background.
- the control interface is the same as in Figure 3, although the button positions are different.
- the enlarged keyframe is suppressed in the multi-keyframe bar.
- Figure 5 shows a more extensive graphical user interface.
- left and right are columns of control buttons for play, stop, select, cut, paste, fast reverse, zoom+ , zoom-, fast forward.
- the bottom row has a sequence of nine keyframes that pertain to respective different scenes or shots, in they have hardly any correlations therebetween.
- the inter-keyframe distance could be, for example, ten seconds, but greater and smaller spacings could be feasible. Especially with short distances between successive keyframes in time, features such as fast forward can be well realized. On the other hand, the same size of spacing could be used for full playback of all audio, whilst the video would only jump from one keyframe to the next.
- the central keyframe is also represented in an enlarged manner. When playing closely spaced keyframes that have low enough granularity, the enlarged keyframe may be presented in a dynamic manner, for so effecting fast forward (or backward) mode.
- FIG. 6 shows the presentation of subtitles, in the general format as discussed with reference to Figure 5.
- space 50 has been devoted to the actual frame; space 52 has been devoted to displaying subtitles derived from, or associated to the video presentation, or to other relevant information, such as speech-to-text converted for the deaf, or a translation into another language than the one used for actual speech.
- each keyframe has a time code 54 or other relevant data overlayed thereon.
- the two columns of control buttons 56, 58 have been devoted to application operations at left, and intra-program operators at right.
- the top of the screen has the title 60 of the actual video program displayed.
- the organization described herein will allow browsing through information that is different and separate from the overall video string. Even if only the audio is played in a dynamic manner, whilst jumping from any keyframe to the next, the user gets a better impression of the underlying video, at particularly low storage requirements.
- Figure 7 is a state diagram of the system operation.
- the system awaits input from the user, while displaying the multiple keyframes. Such input may imply jumping among the displayed keyframes, jumping to another set of keyframes, selecting a keyframe for displaying the associated interval. Any such input effects arrow 104 and starts a new time interval. Absent any such input during n seconds (such as 20 seconds), effects arrow 108, so that state 102 is reached. Therein, the system runs the dynamic video cursor frame. As long as no user input is received, arrow 110 is effected, and the system continues as long as displayable video material is available. If user input is received however, arrow 106 is effected, and the system freezes, either at the actual content of the dynamic video cursor frame, or at the beginning of the actual interval.
- Some of the keyframes used to browse the content of the video program may have been extracted from a sequence with high motion. This produces an annoying zigzag effect in case the video sequence was encoded with interlaced coding mode as normally is the case, a frame being made up of two fields which contribute to form the complete frame, where even lines belong to one field, odd lines to the other. This problem is more evident and annoying in small keyframes, where the effect is more visible, when the picture o is magnified and the lines become thick blocks.
- the keyframes affected by such an interlacing effect must be detected. This effect can be observed on the rows of the image and luminance variations cause high frequency values. This can be exploited by splitting the spatial frequency spectrum into many sub-bands and then considering only the high frequency components. Actually, the effect we want to detect must present alternating values of luminance between even and odd lines, therefore the highest sampling frequency of the resulting picture.
- the only coefficient that has to be computed is the highest frequency component of a frequency transformation (FFT or, better, DCT) on the columns. When the picture is affected by the zig-zag effect due to interlacing, this component will have a high value.
- FFT frequency transformation
- the detection and correction are effected as follows.
- the first step is to discard one field by removing half of the rows, either even or odd; then, an upsampling by a factor of 2 is performed along the rows in order to recover the original size of the keyframe, followed by an interpolating filter.
- the filter performs a simple linear interpolation.
- the keyframe has to be enlarged at almost full screen size by means of an upsampling followed by an interpolating filter.
- the keyframe has a low resolution, it has to be enlarged by quite a high factor. This means that if it is further processed, the result would not be good-looking, as pixels become large blocks. Therefore the picture must be filtered, but a trade-off must be found since we need to generate a good quality picture to be shown at high resolution, but also fast processing for the application to have a short response time.
- subtitles are often transmitted along with the program (often in the Vertical Blanking Interval for analog systems or in a separate elementary stream in digital transmissions). This is normally used for programs distributed in foreign languages and not synchronized, or is meant for persons with hearing disability. Such information is normally superimposed on the screen, but could also be recorded on a storage medium. In this way, the speech of the program, and sometimes also some description of the sound for deaf people, is available for search operations.
- Possible extension of such a system include: • extracting the text from the screen, such as by OCR techniques on still pictures, if the text is not available separately from the video • using speech recognition technology to extract the dialogues from the program.
- the system will be always independent from the service offered by the broadcaster, so that even in case no subtitle is provided, text retrieval will always be possible at least on some specific keywords that the system can be trained to recognize.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Human Computer Interaction (AREA)
- Databases & Information Systems (AREA)
- Business, Economics & Management (AREA)
- Marketing (AREA)
- Television Signal Processing For Recording (AREA)
- Studio Circuits (AREA)
Abstract
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP98900329A EP0914638A2 (fr) | 1997-02-03 | 1998-01-22 | Procede et dispositif pour naviguer a travers des images video en affichant une pluralite d'images clef en parallele |
Applications Claiming Priority (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP97200286 | 1997-02-03 | ||
EP97200286 | 1997-02-03 | ||
EP97201802 | 1997-06-16 | ||
EP97201802 | 1997-06-16 | ||
EP98900329A EP0914638A2 (fr) | 1997-02-03 | 1998-01-22 | Procede et dispositif pour naviguer a travers des images video en affichant une pluralite d'images clef en parallele |
PCT/IB1998/000091 WO1998034182A2 (fr) | 1997-02-03 | 1998-01-22 | Procede et dispositif pour naviguer a travers des images video en affichant une pluralite d'images clef en parallele |
Publications (1)
Publication Number | Publication Date |
---|---|
EP0914638A2 true EP0914638A2 (fr) | 1999-05-12 |
Family
ID=27238396
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP98900329A Withdrawn EP0914638A2 (fr) | 1997-02-03 | 1998-01-22 | Procede et dispositif pour naviguer a travers des images video en affichant une pluralite d'images clef en parallele |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP0914638A2 (fr) |
JP (1) | JP2000516006A (fr) |
KR (1) | KR100552248B1 (fr) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2477800A (en) * | 2010-02-16 | 2011-08-17 | Nds Ltd | Video trick mode playback |
CN102016997B (zh) * | 2008-05-08 | 2012-07-04 | 索尼爱立信移动通讯有限公司 | 将与广告内容有关的可寻址章节符号插入视频流中的电子器件和方法 |
CN116150428A (zh) * | 2021-11-16 | 2023-05-23 | 腾讯科技(深圳)有限公司 | 视频标签获取方法、装置、电子设备及存储介质 |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7954056B2 (en) | 1997-12-22 | 2011-05-31 | Ricoh Company, Ltd. | Television-based visualization and navigation interface |
US7596755B2 (en) | 1997-12-22 | 2009-09-29 | Ricoh Company, Ltd. | Multimedia visualization and integration environment |
US7861169B2 (en) | 2001-11-19 | 2010-12-28 | Ricoh Co. Ltd. | Multimedia print driver dialog interfaces |
US8635531B2 (en) | 2002-02-21 | 2014-01-21 | Ricoh Company, Ltd. | Techniques for displaying information stored in multiple multimedia documents |
US7747655B2 (en) | 2001-11-19 | 2010-06-29 | Ricoh Co. Ltd. | Printable representations for time-based media |
KR101537592B1 (ko) * | 2008-09-03 | 2015-07-22 | 엘지전자 주식회사 | 이동단말기 및 그 제어 방법 |
-
1998
- 1998-01-22 EP EP98900329A patent/EP0914638A2/fr not_active Withdrawn
- 1998-01-22 JP JP10529208A patent/JP2000516006A/ja active Pending
- 1998-01-22 KR KR1019980708181A patent/KR100552248B1/ko not_active IP Right Cessation
Non-Patent Citations (1)
Title |
---|
See references of WO9834182A2 * |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102016997B (zh) * | 2008-05-08 | 2012-07-04 | 索尼爱立信移动通讯有限公司 | 将与广告内容有关的可寻址章节符号插入视频流中的电子器件和方法 |
GB2477800A (en) * | 2010-02-16 | 2011-08-17 | Nds Ltd | Video trick mode playback |
WO2011101762A1 (fr) | 2010-02-16 | 2011-08-25 | Nds Limited | Mécanisme de mode de trucage vidéo |
US8958687B2 (en) | 2010-02-16 | 2015-02-17 | Cisco Technology Inc. | Video trick mode mechanism |
CN116150428A (zh) * | 2021-11-16 | 2023-05-23 | 腾讯科技(深圳)有限公司 | 视频标签获取方法、装置、电子设备及存储介质 |
CN116150428B (zh) * | 2021-11-16 | 2024-06-07 | 腾讯科技(深圳)有限公司 | 视频标签获取方法、装置、电子设备及存储介质 |
Also Published As
Publication number | Publication date |
---|---|
JP2000516006A (ja) | 2000-11-28 |
KR100552248B1 (ko) | 2006-06-15 |
KR20000064909A (ko) | 2000-11-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6222532B1 (en) | Method and device for navigating through video matter by means of displaying a plurality of key-frames in parallel | |
WO1998034182A2 (fr) | Procede et dispositif pour naviguer a travers des images video en affichant une pluralite d'images clef en parallele | |
US6340971B1 (en) | Method and device for keyframe-based video displaying using a video cursor frame in a multikeyframe screen | |
JP3667262B2 (ja) | ビデオスキミング方法及び装置 | |
EP2127368B1 (fr) | Présentation simultanée de segments vidéo permettant une compréhension de fichier vidéo rapide | |
KR100411437B1 (ko) | 지능형 뉴스 비디오 브라우징 시스템 | |
Yeung et al. | Video visualization for compact presentation and fast browsing of pictorial content | |
Taniguchi et al. | An intuitive and efficient access interface to real-time incoming video based on automatic indexing | |
US6154771A (en) | Real-time receipt, decompression and play of compressed streaming video/hypervideo; with thumbnail display of past scenes and with replay, hyperlinking and/or recording permissively intiated retrospectively | |
Bolle et al. | Video query: Research directions | |
KR100411340B1 (ko) | 뉴스 비디오 콘텐트의 뉴스 기사 기반 요약 및 브라우징시스템 | |
US9185379B2 (en) | Medium and method for interactive seamless branching and/or telescopic advertising | |
KR100411342B1 (ko) | 비디오 텍스트 합성 키프레임 생성방법 | |
US20030126603A1 (en) | Multimedia data searching and browsing system | |
WO2006126391A1 (fr) | Dispositif de traitement de contenus, méthode de traitement de contenus et programme informatique | |
WO2001027876A1 (fr) | Schema de description de resume video et procede et systeme de generation de donnees de description de resume video pour vue d'ensemble et exploration efficaces | |
US20040181545A1 (en) | Generating and rendering annotated video files | |
KR101440168B1 (ko) | 개요 및 리포트를 이미 포함하는 시청각 도큐먼트의 새로운 개요를 생성하기 위한 방법 및 상기 방법을 구현할 수 있는 수신기 | |
Zhang | Content-based video browsing and retrieval | |
KR100552248B1 (ko) | 복수의키-프레임들병렬디스플레이에의해비디오재료를통해네비게이팅하는방법및장치 | |
WO1998034181A2 (fr) | Procede et dispositif pour affichage video a base d'images cles au moyen d'une image de curseur video dans un ecran a plusieurs images cles | |
JPH11220689A (ja) | 映像ソフト処理装置及び同処理プログラム記録記憶媒体 | |
KR20020023063A (ko) | 비디오 콘텐트의 구조적 정보를 이용한 비디오 스키밍방법과 장치 | |
KR100370249B1 (ko) | 샷 세그멘테이션 정보를 이용한 비디오 스키밍 시스템 | |
WO2009024966A2 (fr) | Procédé pour adapter des médias à une visualisation sur petits écrans d'affichage |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): AT DE FR GB |
|
17P | Request for examination filed |
Effective date: 19981103 |
|
17Q | First examination report despatched |
Effective date: 20040519 |
|
APBN | Date of receipt of notice of appeal recorded |
Free format text: ORIGINAL CODE: EPIDOSNNOA2E |
|
APBR | Date of receipt of statement of grounds of appeal recorded |
Free format text: ORIGINAL CODE: EPIDOSNNOA3E |
|
APAF | Appeal reference modified |
Free format text: ORIGINAL CODE: EPIDOSCREFNE |
|
APAF | Appeal reference modified |
Free format text: ORIGINAL CODE: EPIDOSCREFNE |
|
APBT | Appeal procedure closed |
Free format text: ORIGINAL CODE: EPIDOSNNOA9E |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
18D | Application deemed to be withdrawn |
Effective date: 20120801 |