EP2742599A1 - Logging events in media files including frame matching - Google Patents
Logging events in media files including frame matchingInfo
- Publication number
- EP2742599A1 EP2742599A1 EP12831288.1A EP12831288A EP2742599A1 EP 2742599 A1 EP2742599 A1 EP 2742599A1 EP 12831288 A EP12831288 A EP 12831288A EP 2742599 A1 EP2742599 A1 EP 2742599A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- image
- images
- search
- video
- target
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
- G11B27/19—Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier
- G11B27/28—Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/78—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/783—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
- G06F16/7847—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using low-level visual features of the video content
- G06F16/785—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using low-level visual features of the video content using colour or luminescence
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
- G11B27/102—Programmed access in sequence to addressed parts of tracks of operating record carriers
- G11B27/105—Programmed access in sequence to addressed parts of tracks of operating record carriers of operating discs
Definitions
- the present invention relates to logging events, and more specifically, to logging events in media files including frame matching.
- a method of comparing images includes: selecting a target image; selecting one or more search images; comparing the target image to a negative image corresponding to each search image to generate an image comparison score for each search image; and identifying the search image with the best image
- a non-transitory tangible storage medium storing a computer program for comparing images.
- the computer program includes
- executable instructions that cause a computer to: select a target image; select one or more search images; compare the target image to a negative image corresponding to each search image to generate an image comparison score for each search image; and identify the search image with the best image comparison score.
- FIG. 1 shows a screen shot of a queue page of a logger in accordance with one implementation of the present
- FIG. 2 shows a snapshot of a video page of the logger reached by clicking a title including a media file name.
- FIG. 3A shows a snapshot of a stack view in the video page of the logger in accordance with one implementation of the present invention.
- FIG. 3B shows a screenshot of a list of filters
- FIG. 3C shows video information displayed when the video info tab selected in the tabs area.
- FIG. 3D shows logos information displayed when logos is selected in the tabs area.
- FIG. 4A illustrates a representation of a computer system and a user.
- FIG. 4B is a functional block diagram illustrating the computer system hosting a logger.
- FIG. 5 is a flowchart illustrating a method of logging events in a media file in accordance with one implementation of the present invention.
- FIGS. 6-17 are illustrations of implementations of user interfaces for a logger, such as for presenting, selecting, conforming, matching, and logging audio and video elements.
- FIG. 18 shows a flowchart of one implementation of an image matching process.
- FIG. 19 shows a flowchart of one implementation of an image comparison process.
- a software tool referred to as a logger is used to log events in a media file, such as a movie.
- the logger tool provides a user interface allowing a user to view the video in multiple ways and add information to the file to track and log events in the file including the locations of bars and tone, slates, content, logos,
- the logger tool allows the user to capture and validate key events within the media file required to enable downstream automated post production processes and workflows.
- the user interface provides access to the media file and also provides an interface to create, track, and edit events for that media file.
- the user interface allows automatic presentation and association of events with the media file at their proper location, which can improve throughput and quality of the data.
- Events can be generated manually by the user within the logger tool and also generated by importing lists or tables of events created externally. The events can then be associated with the media file within the logger tool. For example, a user can import a quality control report into the logger tool and the logger tool is used to create events for the file matching the quality control entries.
- the logger tool can also present information and views on frame matching and/or differentiation based on imported matched and/or differentiated data.
- FIG. 1 shows a screen shot of a queue page 100 of a logger in accordance with one implementation of the present invention. Queues shown on the queue page 100 are designed to track the progress of media files through each status of a logging process.
- the queue page 100 of the logger includes following items/fields:
- status bar 110 item counter 112, 'sort by drop-down' 114, search field 116, 'expand/collapse all' 118, title 120, identifiers 130, expand 122, thumbnail 124, collapse 126, file specs 128, x add movie' field 132, and logout.
- the status bar 110 is clicked to display the files in the
- the item counter 112 displays the number of files showing for the chosen status.
- the 'sort by drop-down' item 114 is clicked to select an identifier (e.g., Title, Status, Task Id, Added date, Feature, User assigned, and Kit Id) in which the files will be arranged and viewed.
- the search field 116 displays the files that meet the entered keyword criteria.
- the 'expand/collapse all' item 118 is clicked to expand or collapse additional file information (i.e., file specs) for all files in the current status.
- the titlel20 includes a file name that is clicked to proceed to a video page of the logger.
- the identifiers field 130 shows file specific identifying information.
- the expand icon 122 is clicked to display additional file information.
- thumbnail 124 shows a single frame selected to visually represent the file.
- the collapse icon 126 is clicked to hide additional file information.
- the file specs 128 show
- the x add movie' field 132 is used to insert a selected file not currently in the logger tool into a loading status.
- FIG. 2 shows a snapshot of a video page 200 of the logger reached by clicking a title including a media file name (e.g., 120 in FIG. 1) .
- the video page 200 of the logger includes sections, controls, and commands that are used to view, verify, and capture events.
- the video page 200 of the logger includes sections, controls, and commands that are used to view, verify, and capture events.
- the video page 200 of the logger includes sections, controls, and commands that are used to view, verify, and capture events.
- an adjustable filmstrip of thumbnails for all or part of a video file provides/displays following: an adjustable filmstrip of thumbnails for all or part of a video file; audio waveforms for the video; the video with timing information (e.g., time code, tape time code, frame number) ; events associated with the video and their location in the file (e.g., by time code) ; interfaces to display and playback video and audio waveforms; interfaces to create, edit, and delete events for the video file; interfaces to create re-usable clips from a video file (e.g., creating new logos); interfaces for
- the video page 200 includes following sections, controls, and commands: page selector 210, event overview 212, master strip 214, looking glass 216, event strip 218, event indicator 220, anchor 222, audio waveform 224, audio magnification 226, standard timecode 228, tape timecode 230, frame number 232, player controls 234, magnification slider 236, volume slider 238, player pane 242, and stack view 240.
- the page selector 210 is used to choose which page to view (e.g., Queue, Video, or Audio) .
- the event overview 212 represents sections of the file containing events. In one case, known events and unknown events are represented by different colors.
- the master strip 214 represents the entire file timeline from start to end.
- the looking glass 216 is located in the master strip 214 and magnifies the surrounded section of the file in the event strip 218.
- the default position of the looking glass 216 upon opening a new file contains the entire file.
- the event strip 218 is a magnified section located inside the looking glass 216 on the master strip 214 that can split the file into events.
- the event indicator 220 is a stroke that outlines each individual event. For example, a first thumbnail within the event indicator 220 is the first frame of the event, and a last thumbnail within the event indicator 220 is the last frame of the event.
- the anchor 222 is represented by a vertical line that crosses the event strip 218 and audio waveform which represents the location in the file.
- the player controls234 are buttons that control basic playback tasks such as playing, pausing, fast forwarding, and rewinding.
- the magnification slider 236 adjusts the size of the looking glass 216 which can increase or decrease the amount of the master strip 214 that is displayed in the event strip 218.
- the player pane 242 displays the frame located to the right side of the anchor 222.
- the stack view section 240 is the action center of the logger video page 200.
- the video page 200 of the logger can be navigated using the sections, controls, and commands described above.
- the master strip 214 can be navigated by clicking and dragging the looking glass 216 to the right or left to view different sections of the file in the event strip 218.
- the size of the looking glass 216 can be adjusted by moving the magnification slider 236 toward minutes to increase the size of the looking glass 216 and toward frames to decrease the size of the looking glass 216.
- the event strip 218 can be navigated by clicking and dragging the anchor 222 to the right or left along the event strip 218. The event strip 218 can be dragged to the right or left while the anchor 222 remains in the same location.
- Dragging the event strip 218 also moves the looking glass 216 in the master strip 214.
- the event strip 218 will move to place the anchor 222 before the first frame of the selected event.
- Either the Enter key can be pressed or the event on the event strip 218 can be clicked to also expand the event in the center of the strip 218. Up or down arrow key can be used to move to the next or previous event.
- the event strip 218 will move to place the anchor 222 before the first frame of the selected event, and expand the event in the center of the event strip 218.
- FIG. 3A shows a snapshot of a stack view 300 in the video page 200 of the logger in accordance with one
- the stack view 300 shows the tasks being completed as well as filter tools and other information.
- the stack view pane 300 includes track information
- the stack view pane 300 further includes ⁇ ⁇ 11 Notes Expander' 316 and x Notes Expander' 318.
- the track information 310 section provides options to: import quality control report, captions, subtitles, or script alignment; copy from a selected title; or create an unknown default event that represent the entire file.
- FIG. 3B shows a screenshot of a list of filters 332 displayed when the filter tab is selected. A selection of one or more filter from the list of filters allows viewing of the events contained in an individual track by category.
- the filter can be selected to show in the track only the events in that filter category. More than one filter can be turned on at one time to allow viewing of the events in the selected filter categories by pressing multiple filter buttons.
- FIG. 3C shows video information 334 displayed when the video info tab selected in the tabs area 330.
- the video information 334 provides information such as frame rate, language, and other pertinent video information.
- FIG. 3D shows logos information 336 displayed when Logos is selected in the tabs area 330.
- logos To view logos in the logos window of the stack view 300, click the logos button under the track name.
- To search logos click to place the cursor in the search field with the logo window open.
- To create a new logo execute the following steps: create an event that represents the logo from start to end; click on the x edit mode' icon in the stack view for the event that contains the logo; choose x Logo' in the event category menu and the corresponding logo type (e.g., Logo, Production Company Logo, Distribution Logo, or Production Logo) ; place the anchor on the frame that most accurately represents the logo in the event strip; click the ⁇ ⁇ ' button or double-click the correct event type in the event category menu; type in the desired logo name in the search field when the logo window appears over the stack view; click the 'Create New' button; and click the 'submit' button to assign the newly created logo to the event when the new logo appears in the stack view .
- logo type e.g., Logo, Production Company Logo, Distribution logo, or Production Logo
- each event row 320 will display the event type it has been assigned, the event description, duration, as well as start and end. The measurement of the duration and start and end information will display based on the highlighted measurement field.
- Each event type is represented by a different color 322 in the 'event type' column in the stack view 300. able 1 shown below broadly defines the available event types.
- Audio Sync Point A hard effect that can be used to sync the audio track with the visual queue.
- Black Roll-up / Pre- Periods of black picture over MOS roll typically preceding bars and tone.
- Program Slate Information card that displays tape metadata relevant to the file such as feature title, aspect ratio, part - timecode, runtime, audio configuration, date P.O.# / vendor facility, textless material, source material, etc.
- Subtitles Subtitle (in picture) Textual versions of the dialog in films and television programs, usually displayed at the bottom of the screen. They can either be a form of written translation of a dialog in a foreign language, or a written rendering of the dialog in the same language, with or without added information to help viewers who are deaf and hard-of-hearing to follow the dialog.
- Each track includes at least one event that represents the entire file from beginning to end, or many imported or copied events that combined include the entire file.
- Each new event is a portion of an existing event.
- FIG. 4A illustrates a representation of a computer system 400 and a user 402.
- the user 402 uses the computer system 400 to log events in a media file, such as a movie.
- the computer system 400 stores and executes a logger490.
- FIG. 4B is a functional block diagram illustrating the computer system 400 hosting the logger490.
- the controller 410 is a programmable processor and controls the operation of the computer system 400 and its components.
- the controller 410 loads instructions (e.g., in the form of a computer program) from the memory 420 or an embedded controller memory (not shown) and executes these instructions to control the system.
- the controller 410 provides the logger490 as a software system, such as to enable logging of events in a media file. Alternatively, this service can be implemented as separate hardware components in the controller 410 or the computer system 400.
- Memory 420 stores data temporarily for use by the other components of the computer system 400.
- memory 420 is implemented as RAM. In one implementation, memory 420 also includes long-term or
- permanent memory such as flash memory and/or ROM.
- Storage 430 stores data temporarily or long term for use by other components of the computer system 400, such as for storing data used by the logger490.
- storage 430 is a hard disk drive.
- the media device 440 receives removable media and reads and/or writes data to the inserted media. In one
- the media device 440 is an optical disc drive.
- the user interface 450 includes components for accepting user input from the user of the computer system 400 and presenting information to the user.
- the user interface 450 includes a keyboard, a mouse, audio speakers, and a display.
- the controller 410 uses input from the user to adjust the operation of the computer system 400.
- the I/O interface 460 includes one or more I/O ports to connect to corresponding I/O devices, such as external storage or supplemental devices (e.g., a printer or a PDA) .
- the ports of the I/O interface 460 include ports such as: USB ports, PCMCIA ports, serial ports, and/or parallel ports.
- the I/O interface 460 includes a wireless interface for communication with external devices wirelessly.
- the network interface 470 includes a wired and/or wireless network connection, such as an RJ-45 or "Wi-Fi" interface (including, but not limited to 802.11) supporting an Ethernet connection.
- a wired and/or wireless network connection such as an RJ-45 or "Wi-Fi" interface (including, but not limited to 802.11) supporting an Ethernet connection.
- the computer system 400 includes additional hardware and software typical of computer systems (e.g., power, cooling, operating system) , though these components are not
- the method comprises configuring a logger tool, at box 510, to allow a user to view media in multiple ways (box 512) .
- the user also captures and validates key events within the media file, at box 514.
- Events in the media file are tracked and logged, at box 520, by adding information to the media file, at box 522, including locations of bars and tone, slates, content, logos,
- FIGS. 6-17 are illustrations of implementations of user interfaces for a logger, such as for presenting, selecting, conforming, matching, and logging audio and video elements (e.g., frames, tracks, segments, clips, waveforms, filmstrips, events) .
- audio and video elements e.g., frames, tracks, segments, clips, waveforms, filmstrips, events.
- Various implementations can include, but are not limited to, one or more of the following items: (a) Providing an adjustable filmstrip of thumbnails for all or part of a video file; (b) Displaying audio waveforms for the video; (c)
- Timing information e.g., time code, tape time code, frame number
- EDL Edit Decision List
- Quicktime reference movie Providing a UI and operations for on screen annotation in the video UI (i.e. hand draw on the frame);
- the logger includes components to support a set of features that allow users to match frames from the same, or different movie files.
- This frame matching feature's underlying algorithm centers around a basic concept known as the law of absolute difference, and compares positive and negative frames to determine relevance of the match and then returns the results based on a defined threshold.
- This functionality provides users with the ability to create textless masters and foreign texted masters by matching the inserts to the original program.
- a first file contains the original movie and a second file contains inserts, groups of frames to replace corresponding groups of frames in the original (e.g., frames with localized text for a particular language) .
- the user can identify the original frames that match the insert frames and then indicate which frames in the original movie file to replace with which frames from the insert file
- the logger can then output a new version using the original frames with selected frames replaced with the selected insert frames.
- the logger can create a file (e.g., a table of references) that guides playback between the original file and the insert file.
- a user can then create another file for different language using a different insert file.
- the UI provides the users with the ability to see the results play side by side in a player window and includes default storyboard mode as well as a view in "precise” mode where users can fix and adjust if there are insert/original frame mismatches of inconsistencies.
- users can preview their versions “real-time” and toggle between
- the audio analysis and/or audio UI can be implemented in a separate program or component.
- the following figures illustrate aspects of the audio UI .
- Users can select a "gold" reference channel - this is a channel that all of the other channels will be conformed to (e.g., offset or shifted to synchronize) . Once the results come back the audio channels will lock into place and offsets will be recorded. Users then validate that the conform results are accurate and lock the component. As new components are ingested for that title and this auto conform process runs, only unlocked components will be analyzed.
- the logger includes
- audio component creation provides the users a feature that allows users to ingest multi-part audio components, conform them and then render a new component. The resulting components could then be used in a distribution system or other post-production workflows.
- audio component creation also provides these features: sample rate conversion, sync pop removal, basic envelopes and real-time preview.
- a logger can be specialized for other video, such as television shows, internet video, or user generated content, or for audio, such as radio or podcasts, or other content, such as games or text, or
- a computer system provides a user interface and search functionality so that a user can select one or more frames of video and a target set of frames. The user can then request the best match for the selected frames in the target set.
- the computer system compares the frames by comparing the absolute difference between frame images and returns the frames with the best results.
- the computer system provides the frame matching as part of a video editing or video production tool.
- features provided in implementations can include, but are not limited to, one or more of the following items: (a) Providing a UI and operations for frame matching, allowing a user to match frames within or across files; (b) Matching one frame to one frame in a single set of frames; (c) Matching one frame to one frame in multiple sets of frames; (d)
- a computer system performs frame or image matching to locate a match for a selected image or images among one or more frames or images in a target set of frames or images.
- the images can be individual still images or frames from a sequence of images (e.g., a video file) .
- a software tool referred to as a logger is used to log events in a video file, such as a movie.
- the logger provides a user interface (UI) allowing a user to view the video in multiple ways and add information to the file to track events in the file (logging) , such as the locations of bars and tone, slates, content, logos, and commercial blacks.
- UI user interface
- the logger also provides frame matching to find desired frames in a selected file or files.
- the logger includes components to support a set of features that allow users to match frames from the same, or different movie files.
- This frame matching feature's underlying algorithm centers around a basic concept known as the law of absolute difference, and compares positive and negative frames to determine relevance of the match and then returns the results based on a defined threshold.
- a first file contains the original movie and a second file contains
- inserts, groups of frames to replace corresponding groups of frames in the original e.g., frames with localized text for a particular language
- groups of frames to replace corresponding groups of frames in the original e.g., frames with localized text for a particular language
- the user can identify the original frames that match the insert frames and then indicate which frames in the original movie file to replace with which frames from the insert file (manually and/or automatically) .
- the logger can then output a new
- the logger can create a file (e.g., a table of references) that guides playback between the original file and the insert file. A user can then create another file for different language using a different insert file.
- a file e.g., a table of references
- FIG. 18 shows a flowchart of one implementation of an image matching process 1800.
- This process searches for the best match for a single selected image among a set of images. For example, a user of the logger can use this process to select a frame and then search for that frame in a selected video file (e.g., a movie) .
- a selected video file e.g., a movie
- the user selects a target image, block 1810.
- the UI of the computer system provides a selection mechanism, such as through a search command
- the user selects a set of search images, block 1820.
- the search images can be in one or multiple files. For example, the user can select a video file through the UI and the computer system will use all the frame images in the video file as the search images.
- the computer system determines if a negative image is available for each search image, block 1830.
- a negative image of a search image is an inversion of the search image, with all the colors of the search image color reversed.
- the computer system creates negative images for all images when the images are made accessible to the frame matching tool (e.g., when they are ingested into the system) . If any search images do not have a corresponding negative image already created, the computer system generates a negative image for each search image without a negative image, block 1840.
- the negative image generation can be performed in parallel with comparing already generated or existing negative images.
- the computer system begins a comparison loop to compare the target image to the negative images for the search images.
- the computer system begins with the first search image and compares the target image to the negative image of the first search image to generate an image comparison score, block 1850.
- the computer system compares the images by determining the absolute difference between the target image and the negative image.
- FIG. 19 shows an
- the comparison generates an image comparison score or confidence score representing the level of similarity or difference between the target image compared to that negative image.
- the computer system stores the image comparison score, block 1860.
- the computer system determines if the target image has been compared to the negative images for all the search images, block 1870. If not, the computer system then compares the target image to the negative of the next search image
- the computer system continues to compare the target image to negative images of search images until the target image has been compared with the negative images for all the search images (looping through blocks 1880, 1860, 1870) .
- the computer system identifies the search image with the best image comparison score, block 1890.
- the computer system selects the search image with the highest score.
- the computer system selects all the search images that have image comparison scores above a threshold. If no search image has a score above the
- the computer system returns an error or can return a best guess (e.g., an image with the highest score) .
- the user can adjust the threshold to control the desired level of similarity and control results returned.
- FIG. 19 shows a flowchart of one implementation of an image comparison process 1900. This process combines the pixel values of a target image and a negative image to
- the computer system iterates
- the computer system selects the first pixel in the target image, block 1910, and selects the first pixel in the negative image, block 1920.
- the computer system starts with the pixel at the upper left corner of each image and proceeds left to right, top to bottom.
- the computer system combines the selected pixels to generate a combined pixel value, block 1930.
- the computer system combines or adds the pixel values.
- the computer system then compares the combined pixel value with a desired value to generate a pixel
- the desired value is set to indicate a desirable level of match or can be set to indicate a 100% match. In one implementation, if the negative image pixel is the reverse of the target image pixel, the
- the difference between the combined value and the desired value indicates a level of difference between the two pixels.
- Other pixel comparison techniques can also be used.
- the pixel comparison score indicates how similar or different the pixel in the target image is to the pixel in the base image (e.g., search image) corresponding to the pixel in the negative image. In one implementation, a high pixel comparison score indicates the pixels are highly similar.
- the computer system stores the pixel comparison score, block 1950.
- the computer system determines if all the pixels in the target image have been compared to all the pixels in the negative image, block 1960. If not, the computer system then selects the next pixel in the target image, block 1970, and the next pixel in the negative image, block 1980. Then, the computer system starts the comparison for the selected pixels by combining the selected pixels, returning to block 230. The computer system continues to compare the pixels of the target image to the pixels of the negative image until all the pixels of the target image have been compared with all the pixels of the negative image (looping through blocks 230, 240, 250, 260, 270, 280) .
- the computer system combines the pixel comparison scores to generate an image comparison score, block 290.
- the image comparison score indicates the level of similarity between the target image and the base image (e.g., search image) upon which the negative image is based. In one implementation, a higher score indicates a higher level of similarity between the images.
- larger numbers of images can be selected and compared.
- a single image or frame can be compared to images in multiple sets of images (e.g., multiple movie files) .
- multiple images e.g., a video clip or insert
- a user selects a video insert and requests a search of multiple movie or video files to matching frames.
- the user can use a video clip (or image) as a search term for a query to search video content. Additional information can also be used to help guide the search.
- the logger can automatically select files for search images based on the name of the file from which the target image clip (e.g., in insert) has been selected.
- the logger tool can use
- frame matching can be used as part of another process. Being able to identify the same or similar frames or images in separate sources can be helpful for organization and resource management.
- a video editing tool builds a cut list by using frame matching on multiple video samples and a final video file.
- a cut list provides a list of edits that were made to original video (such as film or video tape used to record live shooting of a television program or movie) to create a final version of the video. Without a cut list it can be difficult to determine which original video was used to create the final version (e.g., if a scene were shot multiple times on film, it may be difficult to know which shot was used, or how the pieces were edited together) .
- all the frames are ingested into the video tool or an
- Frames from the final version are used as target frames and the original video is used as the search frames.
- the video tool can match frames from the final version to frames in the original video.
- the video tool can then create a cut list by grouping together sequential frames from the same sources (e.g., frames 1-200 are frames 1251-1450 in source video A, frames 201-275 are frames 12001-1275 in source video B, etc.) .
- frame matching can be used to identify frames that are different as well as or instead of frames that are the same.
- One such implementation of a video tool uses differential analysis to compare versions of a movie (or other video content) to determine what is unique in each file. For example, the tool can use differential
- the tool can determine which frames are the same (or sufficiently similar) and which frames are different. In one implementation, the tool
- This identification shows the delta or differences between the two versions being compared (more versions can also be compared serially or in parallel) .
- the tool can present the unique frames from each version to the user or provide a list or report. The user can adjust the list or adjust the threshold to generate a new set of frames. The user can then use this information for additional editing or other operations, such as dynamic editing, re-editing, audio conform, subtitle confirm, and caption conform. In this way, the tool can re-create inserts from two or more larger video files, similar to a reverse process of matching an insert or clip to a larger video file.
- One implementation includes one or more programmable processors and corresponding computer system components to store and execute computer instructions, such as to provide the image comparison and pixel comparisons, and generation, storage and comparison of comparison scores.
- the data and/or instructions can be stored and accessed on a single device or system or across connected or networked devices or systems.
- the video files can be movies, game content, television, web video, etc.
- the frame matching can be used outside of professional video production, such as for consumer or user generated content, for organizing and matching images or video in personal collections, for searching locally stored or online content, etc. Accordingly, the claims are not limited only to the specific examples described above.
- a logger can be specialized for other video, such as television shows, internet video, or user generated content, or for audio, such as radio or podcasts, or other content, such as games or text, or combinations thereof (e.g., matching and conforming video, audio, and text, such as for screenplay matching and
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Library & Information Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Television Signal Processing For Recording (AREA)
Abstract
Description
Claims
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201161534275P | 2011-09-13 | 2011-09-13 | |
US201261624123P | 2012-04-13 | 2012-04-13 | |
PCT/US2012/055213 WO2013040244A1 (en) | 2011-09-13 | 2012-09-13 | Logging events in media files including frame matching |
Publications (2)
Publication Number | Publication Date |
---|---|
EP2742599A1 true EP2742599A1 (en) | 2014-06-18 |
EP2742599A4 EP2742599A4 (en) | 2016-01-13 |
Family
ID=47883740
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP12831288.1A Ceased EP2742599A4 (en) | 2011-09-13 | 2012-09-13 | Logging events in media files including frame matching |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP2742599A4 (en) |
CN (1) | CN104025465A (en) |
WO (1) | WO2013040244A1 (en) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2843960A1 (en) * | 2013-08-28 | 2015-03-04 | Thomson Licensing | Method and apparatus for managing metadata of media data |
CN105247878A (en) * | 2013-05-27 | 2016-01-13 | 汤姆逊许可公司 | Method and apparatus for visually representing metadata of media data |
CN110121098B (en) * | 2018-02-05 | 2021-08-17 | 腾讯科技(深圳)有限公司 | Video playing method and device, storage medium and electronic device |
US11659214B2 (en) | 2020-07-20 | 2023-05-23 | Netflix, Inc. | Automated workflows from media asset differentials |
CN117132925B (en) * | 2023-10-26 | 2024-02-06 | 成都索贝数码科技股份有限公司 | Intelligent stadium method and device for sports event |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050069291A1 (en) * | 2003-09-25 | 2005-03-31 | Voss James S. | Systems and methods for locating a video file |
US8489583B2 (en) * | 2004-10-01 | 2013-07-16 | Ricoh Company, Ltd. | Techniques for retrieving documents using an image capture device |
US8170109B2 (en) * | 2006-03-23 | 2012-05-01 | Nds Limited | System for analysis of motion |
JP4798018B2 (en) * | 2007-02-22 | 2011-10-19 | 株式会社明電舎 | Image matching device |
US8358380B2 (en) * | 2009-06-05 | 2013-01-22 | Cisco Technology, Inc. | Efficient spatial and temporal transform-based video preprocessing |
KR20140051115A (en) * | 2011-01-04 | 2014-04-30 | 소니 주식회사 | Logging events in media files |
-
2012
- 2012-09-13 EP EP12831288.1A patent/EP2742599A4/en not_active Ceased
- 2012-09-13 WO PCT/US2012/055213 patent/WO2013040244A1/en unknown
- 2012-09-13 CN CN201280052184.5A patent/CN104025465A/en active Pending
Also Published As
Publication number | Publication date |
---|---|
WO2013040244A1 (en) | 2013-03-21 |
CN104025465A (en) | 2014-09-03 |
EP2742599A4 (en) | 2016-01-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10015463B2 (en) | Logging events in media files including frame matching | |
US7432940B2 (en) | Interactive animation of sprites in a video production | |
JP6214619B2 (en) | Generating multimedia clips | |
US9026909B2 (en) | Keyword list view | |
US9881215B2 (en) | Apparatus and method for identifying a still image contained in moving image contents | |
US20100050080A1 (en) | Systems and methods for specifying frame-accurate images for media asset management | |
US8631047B2 (en) | Editing 3D video | |
WO2013040244A1 (en) | Logging events in media files including frame matching | |
GB2520041A (en) | Automated multimedia content editing | |
US20140006978A1 (en) | Intelligent browser for media editing applications | |
CA2823742A1 (en) | Logging events in media files | |
EP3252770A1 (en) | Automated identification and processing of audiovisual data | |
AU2002301447B2 (en) | Interactive Animation of Sprites in a Video Production | |
Denoue et al. | Content-based copy and paste from video documents | |
Costello | Non-Linear Editing | |
Davies et al. | Video document | |
Althagafi et al. | MIDB: A Web-Based Film Annotation Tool. | |
Boykin | Apple Pro Training Series: Final Cut Pro X 10.1 Quick-Reference Guide |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20140313 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
DAX | Request for extension of the european patent (deleted) | ||
RA4 | Supplementary search report drawn up and despatched (corrected) |
Effective date: 20151210 |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: G11B 27/10 20060101ALI20151204BHEP Ipc: G11B 27/28 20060101ALI20151204BHEP Ipc: H04B 1/66 20060101AFI20151204BHEP |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: EXAMINATION IS IN PROGRESS |
|
17Q | First examination report despatched |
Effective date: 20190102 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: EXAMINATION IS IN PROGRESS |
|
RAP3 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: SONY GROUP CORPORATION Owner name: SONY PICTURES TECHNOLOGIES INC. |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R003 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED |
|
18R | Application refused |
Effective date: 20220708 |