CN104025465A - Logging events in media files including frame matching - Google Patents

Logging events in media files including frame matching Download PDF

Info

Publication number
CN104025465A
CN104025465A CN201280052184.5A CN201280052184A CN104025465A CN 104025465 A CN104025465 A CN 104025465A CN 201280052184 A CN201280052184 A CN 201280052184A CN 104025465 A CN104025465 A CN 104025465A
Authority
CN
China
Prior art keywords
image
frame
negative
searching
video
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201280052184.5A
Other languages
Chinese (zh)
Inventor
J·布拉姆斯
O·卓克弗
O·沙奥弗
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sony Corp
Sony Pictures Entertainment Inc
Original Assignee
Sony Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Corp filed Critical Sony Corp
Publication of CN104025465A publication Critical patent/CN104025465A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/19Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier
    • G11B27/28Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/783Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/7847Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using low-level visual features of the video content
    • G06F16/785Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using low-level visual features of the video content using colour or luminescence
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/102Programmed access in sequence to addressed parts of tracks of operating record carriers
    • G11B27/105Programmed access in sequence to addressed parts of tracks of operating record carriers of operating discs

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Library & Information Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Television Signal Processing For Recording (AREA)

Abstract

Comparing images, including: selecting a target image; selecting one or more search images; comparing the target image to a negative image corresponding to each search image to generate an image comparison score for each search image; and identifying the search image with the best image comparison score. Keywords include logging events and frame matching.

Description

Comprise the event in the recording medium file of frame coupling
Quoting of related application
The U.S. Provisional Patent Application No.61/534 of pending trial when the application requires on September 13rd, 2011 to submit to, 275, " Tech Logger "; And the U.S. Provisional Patent Application No.61/624 of submission on April 13rd, 2012,123, the priority of " Frame Matching ".The disclosure of above-mentioned application is incorporated by reference at this.
Technical field
The present invention relates to recording events, more specifically, relate to the event of the recording medium file that comprises frame coupling.
Background technology
Manually for video file creating list of thing not only oppressiveness but also be easy to make mistakes.With a kind of instrument inspection tape or video file, can lead to errors with inconsistent with another kind of instrument manual input time of code simultaneously.These variety of issues can make to be more difficult to as one man process the video file in storehouse.
Summary of the invention
Embodiments of the invention are to show Voice & Video from data file, and metadata is appended to described file create conditions.
A kind of method of movement images is disclosed in one implementation.Described method comprises: select target image; Select one or more searching images; Comparison object image and corresponding to the negative-appearing image of each searching image, thus the image Comparison score of each searching image generated; Searching image with recognition image Comparison score the best.
A kind of non-provisional tangible storage medium of the computer program of preserving movement images is disclosed in another implementation.Described computer program comprises the executable instruction that makes computer carry out following operation: select target image; Select one or more searching images; Comparison object image and corresponding to the negative-appearing image of each searching image, thus the image Comparison score of each searching image generated; Searching image with recognition image Comparison score the best.
After checking following detailed description and accompanying drawing, to those skilled in the art, it is more apparent that other features and advantages of the present invention will become.
Brief description of the drawings
Fig. 1 represents according to the screenshot capture of the queue page of a kind of register of realizing of the present invention.
Fig. 2 represents to comprise by click the screenshot capture of the video page of the register that the title of media file name arrives.
Fig. 3 A represents according to the screenshot capture of the stack view in the video page of a kind of register of realizing of the present invention.
Fig. 3 B represents in the time selecting filter option card, the screenshot capture of the list of the filter of demonstration.
Fig. 3 C represents when select video information tab in tab district time, the video information of demonstration.
Fig. 3 D represents in the time of selection marker in tab district, the flag information of demonstration.
Unify user's the expression of Fig. 4 A graphic extension department of computer science.
Fig. 4 B is the functional-block diagram of the computer system of graphic extension trustship register.
Fig. 5 is that graphic extension is according to the flow chart of the method for the event in a kind of recording medium file of realizing of the present invention.
Fig. 6-17th, the user interface of register, such as for presenting, select, the illustration of the realization of the user interface of adaptive, coupling and record audio and video elementary.
A kind of flow chart of realizing of Figure 18 presentation video matching treatment.
A kind of flow chart of realizing that Figure 19 presentation video is relatively processed.
Embodiment
More disclosed herein be embodied as from data file show Voice & Video, and metadata appended to described file create conditions.After reading this explanation, in various alternative realizations and alternative application, how to realize the present invention and will become obvious.But although various realization of the present invention will be described here, but obviously these are realized and just providing as an example, instead of limitation of the present invention.Thereby the detailed description of various alternative realizations should not be understood as that and limit the scope of the invention or range.
In one implementation, utilize the Software tool recording medium file that is called register, such as the event in film.Register instrument provides user interface, described user interface allows user to check video with various ways, information is added in file, thereby the event in tracking and log file, described information comprises position, clap-stick, content, mark, commercial advertisement black (commercial blacks), quality Control, spolen title and the explanatory captions of color shades fence.Register instrument allows user to catch and verifies the critical event of needs in order to make it possible to realize the automatic post production process in downstream and workflow in media file.
In one implementation, this user interface provides the access to media file, and establishment is provided, follows the tracks of and edits the interface about the event of this media file.This user interface allows automatically to present event and automatically in the appropriate location of event, event is associated with media file, and this can improve throughput and the quality of data.Event can manually be produced by user in register instrument, also can produce by importing outside list of thing or the form creating.Can in register instrument, make event associated with media file subsequently.For example, user can import to Quality Control Report in register instrument, and register instrument is used to create the quality of match control strip object event about file.In another implementation, register instrument also can be according to coupling and/or the discrepant data that import, and present information and view about frame coupling and/or difference.
Fig. 1 represents according to the screenshot capture of the queue page 100 of a kind of register of realizing of the present invention.The queue illustrating on the queue page 100 is designed to every kind of state by recording processing, follows the tracks of the progress of media file.
In Fig. 1, in illustrative realization, the queue page 100 of register comprises following items/field: status bar 110, item counter 112, " drop-down classification " 114, search field 116, " launching/all folding " 118, title 120, identifier 130, launch 122, breviary Figure 124, folding 126, filespec 128, " interpolation film " field 132 and exit.Click status bar 110, with display file under the state selecting, described selection mode comprises all, loads, prepares record, user job, preparation inspection, completes and refuse.The quantity of item counter 112 show needles to the file shown in the state of selecting." drop-down classification " project 114 of click, will arrange by it and the identifier of viewing files (for example, title, state, task Id, add date, feature, user's appointment and Kit Id) selecting.Search field 116 shows the file of the keyword standard that meets input.Click " launch/all folding " project 118, with launch or folding current state under the appended document information (for example, filespec) of All Files.Title 120 comprises clicked to enter the filename of video page of register.Identifier field 130 represents file identification information specific.Click and launch icon 122, to show appended document information.Breviary Figure 124 represents selected with the single frames of render files visually.Click folding icon 126, to hide appended document information.Filespec 128 represents supplementary technology fileinfo." interpolation film " field 132 is not for being inserted in stress state at the selected file of register instrument current.
Fig. 2 expresses by click and comprises the title of media file name (for example, 120 in Fig. 1) and the screenshot capture of the video page 200 of the register that arrives.In one implementation, the video page 200 of register comprises for checking, examine and section, control and the order of capture events.For example,, below the video page 200 of register provides/shows: about the capable of regulating filmstrip of all or part of thumbnail of video file; The audio volume control of video; There is the video of timing information (for example, timing code, time of tape code, frame number); The event associated with video and these events position (for example,, according to timing code) hereof; The interface of demonstration and playback of video and audio volume control; Create, edit and delete the interface of the event of video file; For example, from the interface of the reusable montage of video file creating (, creating newly mark); For in file or across the interface of Document Editing, importing and duplicate event or many groups event; By the interface for user of web browser.
In the illustration of Fig. 2 realizes, video page 200 comprises with lower curtate, control and order: page selector 210, event general view 212, master control bar 214, sight glass 216, event bar 218, event indicating device 220, anchor 222, audio volume control 224, audio frequency amplification 226, standard time code 228, time of tape code 230, frame number 232, player control 234, amplification slide block 236, volume slide 238, player pane 242 and stack view 240.Page selector 210 for example, for selecting to check which page (, queue, video or audio frequency).Event general view 212 represents the section of the file that comprises event.In one case, known event and unknown event represent with different colours.
Master control bar 214 represents whole document time line from the beginning to the end.Sight glass 216 is arranged in master control bar 214, in event bar 218, amplify around the section of file.In the time opening new file, the default location of sight glass 216 comprises whole file.Event bar 218 is amplification sections sight glass 216, that can be event by Divide File that are positioned on master control bar 214.Event indicating device 220 be summarize each separate event retouch limit (stroke).For example, the first thumbnail in event indicating device 220 is the first frame of event, and the last thumbnail in event indicating device 220 is the last frame of event.That anchor 222 use are intersected with event bar 218 and audio volume control, represent that the vertical line of the position in file represents.This document location will be presented in player pane 242.Player control 234 is to control basic playback tasks, such as, broadcasting, time-out, F.F. and the button retreating.Amplify slide block 236 and adjust the size of sight glass 216, this can increase or reduce to be presented at the amount of the master control bar 214 in event bar 218.Player pane 242 shows the frame that is positioned at anchor 222 right sides.Stack view part 240 is action centers of register video page 200.
In one implementation, can be with the navigate video page 200 of register of above-mentioned section, control and order.For example,, by clicking sight glass 216 and dragging the sight glass 216 master control bar 214 that navigates to the right or left, with the different sections of viewing files in event bar 218.Can utilize following manner to adjust the size of sight glass 216, that is, towards minute mobile size of amplifying slide block 236 and increase sight glass 216, and move the size of amplifying slide block 236 and reduce sight glass 216 towards frame.In another example, by clicking anchor 222 and dragging anchor 222 to the right or left along event bar 218, can navigation event bar 218.When anchor 222 remains on same position, event bar 218 can be dragged to the right or left.Drag events bar 218 is mobile sight glass 216 in master control bar 214 also.In the time of the event of the expectation on click event bar 218, event bar 218 will be mobile with before anchor 222 is placed on to the first frame of selected event.Can press enter key, or event on can click event bar 218, with this event of center deployment at event bar 218 also.Directionkeys can be used for moving to next or previous event up or down.In another example, in the time that the event in stack view 240 is selected, event bar 218 will move, thereby before anchor 222 being placed on to the first frame of selected event, and in this event of center deployment of event bar 218.
Fig. 3 A represents according to the screenshot capture of the stack view 300 in the video page of a kind of register of realizing of the present invention 200.Except filter instrument and other information, the task that stack view 300 has also represented.In the illustration of Fig. 3 A realizes, stack view pane 300 comprises orbit information 310 (comprise the drop-down button 312 of track and add track button 314), be used for representing the tab 330 of filter 332 (referring to Fig. 3 B), video information 334 (referring to Fig. 3 C) and mark 336 (referring to Fig. 3 D), and event row 320.As mentioned above, known event and unknown event can represent with different colours 322.Stack view pane 300 also comprises " all annotating spreader " 316 and " annotation spreader " 318.Orbit information 310 parts provide following option: import Quality Control Report, explanatory captions, spolen title or drama alignment; From the replication of titles of selecting; Or create the unknown default event that represents whole file.
Fig. 3 B represents the screenshot capture of the list of the filter 332 showing in the time selecting filter option card.From the list of filter, select one or more filters to allow according to type to check the event being included in single track.Thereby, can select filter, so that the event in this filter kind to be only shown in track.By pressing multiple filter button, can open a more than filter, to allow to check the event in the filter kind of selection simultaneously.
Fig. 3 C represents when select video information tab in tab district 330 time, the video information 334 of demonstration.Video information 334 provides the information such as frame per second, the language video information relevant with other.
Fig. 3 D represents in the time of selection marker in tab district 330, the flag information 336 of demonstration.In order to check mark in the mark window of stack view 300, click the mark button below track name.For search sign, in the time that mark window is opened, thereby click, cursor is placed in search field.In order to create new mark, carry out following steps: create the event that represents from the beginning to the end mark; For the event that comprises this mark, click " edit pattern " icon in stack view; Select " mark " and corresponding type of sign (for example, mark, production company's mark, distribution mark or film-making mark) in kind of event menu; Anchor is placed in event bar and is the most accurately represented on the frame of this mark; Click " OK " button or double-click the correct kind of event in kind of event menu; In the time that mark window appears on stack view, the mark name that input is expected in search field; Click " creating newly mark " button; When appearing in stack view when new mark, click " submissions " button, so that a mark for new establishment is distributed to this event.
Return to Fig. 3 A, each event row 320 event type that it is endowed by demonstration, event description, the duration, and start and finish.The measurement of duration and beginning and ending message show the measurement field based on highlighting.In each kind of event " event type " row in stack view 300, represent with different colours 322.Table 1 summary definition below available event type.
Table 1
Each track comprises at least one event of expression whole file from the beginning to the end, or many importings or the duplicate event of combination comprise whole file.Each new events is a part for existing event.Therefore,, in order to create new events, anchor is placed on the first frame of the event creating in event bar or before tight.This will show the first frame of this event in player pane.Select current event to be divided into two events.The frame on anchor right side represents the first frame of new events now, and the frame in anchor left side represents the last frame of last event.This event will automatically classify as the unknown.
Fig. 4 A graphic extension computer system 400 and user's 402 expression.User 402 utilizes computer system 400 recording medium files, such as the event in film.Computer system 400 is preserved and executive logging device 490.
Fig. 4 B is the functional-block diagram of the computer system 400 of graphic extension trustship register 490.Controller 410 is programmable processors, controls the operation of computer system 400 and assembly thereof.Controller 410, from memory 420 or embedded controller memory (not shown), is written into instruction (for example, with the form of computer program), and carries out these instructions, with control system.In its implementation, controller 410 provides register 490 with the form of software systems, to can realize the record of the event in media file.On the other hand, this service can be realized as the independent hardware component in controller 410 or computer system 400.
Memory 420 is interim preserve for other assembly of computer system 400 data.In one implementation, memory 420 is realized as RAM.In one implementation, memory 420 also comprises long-term or permanent memory, such as flash memory and/or ROM.
Memory device 430 interim or long-term preserve for other assembly of computer system 400 data, the data that use such as keeping records device 490.In one implementation, memory device 430 is hard disk drives.
Medium apparatus 440 is received detachable media, reads and/or data is write in the medium of insertion.In one implementation, for example, medium apparatus 440 is CD drive.
User interface 450 comprises user's input of accepting from the user of computer system 400, and to the assembly of user's presentation information.In one implementation, user interface 450 comprises keyboard, mouse, loud speaker and display.Controller 410 utilizes the input from user, adjusts the operation of computer system 400.
I/O interface 460 comprises the I/O equipment that connection is corresponding, for example, such as one or more I/O ports of External memory equipment or auxiliary equipment (printer or PDA).In one implementation, the port of I/O interface 460 comprises such as port USB port, pcmcia port, serial port and/or parallel port.In another implementation, I/O interface 460 comprises wirelessly and the wave point of external device communication.
Network interface 470 comprises that wired and/or wireless network connects, such as RJ-45 or " Wi-Fi " interface (including but not limited to 802.11) of supporting that Ethernet connects.
Computer system 400 comprise the distinctive other hardware and software of computer system (such as, power supply, cooling, operating system), but for simplicity, in Fig. 4 B, clearly do not represent these assemblies.In other is realized, can use the different structure (for example, different buses or storage organization, or multi-processor structure) of computer system.
Fig. 5 is that graphic extension is according to the flow chart of the method 500 of the event in a kind of recording medium file of realizing of the present invention.In illustrative realization, described method is included in square frame 510 configuration record device instruments, to allow user to check in many ways media (square frame 512).At square frame 514, user also catches and verifies the critical event in media file.By at square frame 522, information is added in media file, event in square frame 520 tracking recording medium file, described information comprises position, clap-stick, content, mark, commercial advertisement black, quality Control, spolen title and the explanatory captions of color shades fence.
Fig. 6-17th, the user interface of register, such as for presenting, select, the illustration of the realization of the user interface of adaptive, coupling and record audio and video elementary (for example, frame, track, fragment, montage, waveform, filmstrip, event).
Various realizations can include, but is not limited to one or more in following project: (a) provide the capable of regulating filmstrip about all or part of thumbnail of video file; (b) audio volume control of display video; (c) show the video for example, with timing information (, timing code, time of tape code, frame number); (d) show the event associated with video and these events position (for example,, according to timing code) hereof; (e) provide and control the demonstration of Audio and Video waveform and the UI (user interface) of playback; (f) be provided as the UI of video file creating, editor and deletion event; (g) for example provide, from the UI of the reusable montage of video file creating (, creating newly mark); (h) be provided in file or across the UI of Document Editing, importing and duplicate event or many groups event; (i), for example, by Study document (, commercial advertisement black or clap-stick), automatically create the event of selecting; (j) be provided for UI and the operation that frame mates, to allow user in file or across file coupling frame; (k) be provided for UI and the operation of audio frequency adaptation, to find out resemblance and the difference in audio volume control; (l) be provided for UI and the operation that audio-frequency assembly creates; (m) be provided for UI and operation-frame matched data the is exported to ability of AvidAAF (senior making form)/EDL (editorial decision list) or Quicktime reference film that AVID derives; (n) in video UI, be provided on screen explaining UI and the operation of (for example Freehandhand-drawing on frame); (o) be provided for UI and the operation of QC report generation; (p) be provided for UI and the operation that automatic picture Chinese version detects; (q) be provided for UI and the operation that voice turn text-processing and have the result demonstration of edit capability; (r) be provided for manually copying UI and the operation of instrument; (s) provide the interface for user by web browser; (t) replace the download and the local replica that use file, or except using the download and local replica of file, use the flow transmission from server, provide Voice & Video by register.
In one implementation, register comprises that support allows the assembly of user's coupling from a stack features of the frame of identical or different movie file.The rudimentary algorithm of described frame matching characteristic is to be called centered by the basic conception of absolute difference law, and than calibration frame and negative frame, to determine the correlation of coupling, then the threshold value based on definition returns results.This function provides by coupling and inserts content and original program to user, creates the ability without text master control bar and foreign language master control bar.In an example, the first file including original film, the second file including replaces the insertion content (many framings) (for example, having the frame of the localized text of language-specific) of the each framing of correspondence in original film.Utilize frame coupling, user can identify and the primitive frame that inserts frame and mate, then indicate in original movie file which frame to use from which frame replacement of inserting file (manually and/or automatically).The primitive frame that then register can utilize the selecteed insertion frame of selected frame to replace, output redaction.On the other hand, register can create guiding original document and insert the file (for example, reference table) of the playback between file.Then user can utilize different insertion files, creates another file of different language.
UI provides the ability of checking the result of playing side by side in player window to user, and comprise the view under acquiescence Storyboard pattern and " accurately " pattern, under " accurately " pattern, if there is the mismatch of inconsistent insertion frame/primitive frame, user can repair and adjust so.In addition, user can their version of " in real time " preview, and switches between language.Before this allows in foreign language master control bar " virtual editor " is rendered into actual file, this foreign language master control bar " virtual editor " of preview.Playback duration in preview region, by the cut-out of the EDL by creating during matching treatment, immediately adds in film inserting content.User goes back selectable audio element and text element, plays up as a part for preview.
In one implementation, register comprises the assembly of a stack features of supporting and audio analysis adaptive for audio frequency, and these assemblies allow user mutually to compare waveform, find out resemblance and/or difference.This technology is a part for register feature set, and is a part of audio frequency UI.In another implementation, audio analysis and/or audio frequency UI can realize with independent program or assembly.The various aspects of following accompanying drawing graphic extension audio frequency UI.User can select " gold " reference channel-this be every other channel will with the channel of its adaptation (for example, be offset or be shifted, so that synchronous).Once result is returned, voice-grade channel is just by locks in place, and side-play amount will be recorded.Then user verifies that adaptive result is accurate, thereby locks this assembly.When being that this title sucks New Parent, and described automatic adaptation is while processing operation, will only analyze unblocked assembly.
In another implementation, register comprises the assembly of supporting the stack features for creating audio-frequency assembly.For example, user can be combined as many parts audio-frequency assembly a part, and for example, it is 1 long played file that 6 volume audio frequency are played up.In one implementation, this audio-frequency assembly establishment is also a part of register audio frequency UI.Audio frequency UI for user provide allow user suck many parts audio-frequency assembly, adaptive they, then play up the feature of New Parent.Then consequent assembly can be used in publishing system or other post-production workflows.The one that audio-frequency assembly creates realizes these features is also provided: sample rate conversion, synchronous pop remove, basic envelope and live preview.
The above explanation that disclosed realization is provided is in order to enable those skilled in the art to make or utilize the present invention.To one skilled in the art, be apparent to the various amendments of these realizations, and general principle disclosed herein realizes applicable to other, and do not depart from the spirit or scope of the present invention.Thereby other realization and variation are also within the scope of the invention.For example, example concentrates on for the demonstration of film and record, but register can be exclusively used in other video, such as the content of TV programme, internet video or user's generation, or is exclusively used in audio frequency, such as broadcast or blog, or other content, such as game or text, or their combination is (for example, coupling and adaptive video, audio frequency and text, such as mating for screen play and following the tracks of).In specific register is realized, not necessarily need all features of each example.In addition, understand that explanation given here and accompanying drawing represent the extensively theme of imagination of the present invention.Will understand in addition, scope of the present invention comprises apparent other to one skilled in the art completely and realizes, scope of the present invention thereby only limited by the claim of adding.
Some realization disclosed herein also provides the equipment and the method that realize the frame of match video or the technology of image.In one implementation, computer system provides user interface and function of search, so that user can select one or more frames of video, and the object set of frame.User asks the optimum Match of selected frame in target tightening subsequently.Computer system, by comparing the absolute difference between two field picture, compares each frame, and returns to the frame with best result.In a kind of such realization, computer system, with a part for video editing or video production instrument, provides frame coupling.
The feature providing in various realizations can include, but is not limited to one or more in following project: (a) be provided for UI and the operation of frame coupling, with allow user in file or across file mate frame; (b) frame in coupling one frame and single framing; (c) frame in coupling one frame and many framings; (d) multiframe in coupling multiframe and single framing; (e) multiframe in coupling multiframe and many framings; (f) by comparing the absolute difference between a frame and the negative-appearing image of another frame, relatively two frames; (g), according to the result that compares two frames, generate and put letter score.
In this new system, computer system is carried out frame or images match, among one or more frames or image with the target tightening at frame or image, searches the coupling thing of one or more images of selection.Image can be for example, each rest image or frame in a series of images (, video file).
In one implementation, the Software tool that is called register is used to recording medium file, such as the event in film.Register provides user interface (UI), described user interface allows user to check video with various ways, information is added in file, thereby the event in trace file (record), such as the position of color shades fence, clap-stick, content, mark and commercial advertisement black.Register is also provided in one or more files of selecting, finds out the frame coupling of the frame of expectation.
Register comprises supports to allow the assembly of user's coupling from a stack features of the frame of identical or different movie file.The rudimentary algorithm of described frame matching characteristic is to be called centered by the basic conception of absolute difference law, and than calibration frame and negative frame, to determine the correlation of coupling, then the threshold value based on definition returns results.This function provides by coupling and inserts content and original program to user, creates the ability without text master control bar and foreign language master control bar.In an example, the first file including original film, the second file including replaces the insertion content (many framings) (for example, having the frame of the localized text of language-specific) of the each framing of correspondence in original film.Utilize frame coupling, user can identify and the primitive frame that inserts frame and mate, then indicate in original movie file which frame to use from which frame replacement of inserting file (manually and/or automatically).The primitive frame that then register can utilize the selecteed insertion frame of selected frame to replace, output redaction.On the other hand, register can create guiding original document and insert the file (for example, reference table) of the playback between file.Then user can utilize different insertion files, creates another file of different language.
A kind of flow chart of realizing of Figure 18 presentation video matching treatment 1800.This processing, in one group of image, is searched for the optimum Match thing of single selection image.For example, the user of register can utilize this processing selecting one frame, then for example,, in the video file (, film) of selecting, searches for this frame.First, user's select target image (square frame 1810).The UI of computer system provides the mechanism of selecting, such as by with the search command of the image correlation of current demonstration.User selects last set image (square frame 1820) subsequently.Searching image can be in one or more files.For example, user can select video file by UI, and computer system will be used as searching image all two field pictures in video file.Computer system judges whether each searching image can obtain negative-appearing image (square frame 1830) subsequently.In one implementation, the negative-appearing image of searching image is the reversion of searching image, and all colours of searching image color is reversed.In one implementation, for example, in the time that frame matching tool can obtain image (, in the time that image is inhaled in system), computer system creates the negative-appearing image of all images.If any searching image does not have the corresponding negative-appearing image having created, computer system is not have each searching image of negative-appearing image to produce negative-appearing image (square frame 1840) so.In another implementation, can carry out concurrently negative-appearing image generation with negative-appearing image that more generate or existing.
In the time that all searching images all have negative-appearing image, computer system starts comparison loop, with the negative-appearing image of comparison object image and searching image.Computer system from first searching image, the relatively negative-appearing image of this target image and first searching image, thereby synthetic image Comparison score (square frame 1850).In one implementation, computer system is by determining the absolute difference between target image and negative-appearing image, movement images.Figure 19 represents a kind of realization of relatively processing.Described relatively produce represent target image and the negative-appearing image compared between similarity or the image Comparison score of diversity factor or put letter score.Computer system is preserved image Comparison score (square frame 1860).Computer system has determined whether to compare the negative-appearing image (square frame 1870) of target image and all searching images.If not, the computer system negative-appearing image of comparison object image and next searching image subsequently so, thus produce corresponding image Comparison score (square frame 1880), then return to square frame 1860, preserve described score.System can be utilized absolute difference comparison, or also can utilize the processing of Figure 19.Computer system continues the negative-appearing image of comparison object image and searching image, until compared the negative-appearing image (through the circulation of square frame 1880,1860,1870) of target image and all searching images.
When carried out all image ratios compared with time, computer system identification has the searching image (square frame 1890) of best image Comparison score.In one implementation, computer system selects to have the searching image of top score.In another implementation, computer system is selected all searching images of image Comparison score higher than threshold value.If do not have searching image to have the score higher than threshold value, computer system is returned to mistake so, or can return to best conjecture (for example, the highest image of score).In one implementation, user's adjustable thresholds, to control the similarity of expectation, thereby controls the result of returning.
Figure 19 presentation video is relatively processed a kind of flow chart of realizing of 1900.The pixel value of this processing combining target image and negative-appearing image, with determine target image and negative-appearing image based on image (for example, the searching image in Figure 18) between similarity.All pixels in two images of computer system iteration.At first, first pixel (square frame 1910) in computer system select target image, and select first pixel (square frame 1920) in negative-appearing image.In one implementation, computer system, from the pixel in the upper left corner at each image, and from left to right, is advanced from top to bottom.
The pixel that computer system combination is selected, thus packed-pixel value (square frame 1930) generated.In one implementation, computer system combination or addition pixel value.Computer system is comparison combination pixel value and desired value subsequently, thereby produces pixel Comparison score (square frame 1940).Desired value is configured to indicate desirable matching degree, or can be configured to indicate 100% coupling.In one implementation, if negative-appearing image pixel is the reversion of target image pixel, combination will produce desired value so.Difference between combined value and desired value is indicated the diversity factor between these two pixels.Also can use other pixel comparative approach.Pixel in pixel Comparison score indicating target image with for example, have corresponding to the pixel in the base image (, searching image) of the pixel in negative-appearing image heterogeneous like or different.In one implementation, higher pixel Comparison score instruction pixels tall is similar.Computer system is preserved pixel Comparison score (square frame 1950).
Computer system determines whether all pixels (square frame 1960) that compared in all pixels and the negative-appearing image in target image.If not, the computer system next pixel (square frame 1970) in select target image subsequently so, and next pixel (square frame 1980) in negative-appearing image.Subsequently, by returning to square frame 230, the pixel that combination is selected, computer system starts the comparison of selected pixel.Computer system continues the pixel of comparison object image and the pixel of negative-appearing image, until compared all pixels (through the circulation of square frame 230,240,250,260,270,280) in all pixels and the negative-appearing image in target image.
In the time carrying out all pixel comparisons, computer system packed-pixel Comparison score, thus produce image Comparison score (square frame 290).Image Comparison score indicating target image and negative-appearing image based on base image (for example, searching image) between similarity.In one implementation, the higher similarity between higher score indicating image.
In other is realized, can select and compare the image that number is larger.For example, can compare for example, image in single image or frame and multiple series of images (, multiple movie files).In another example, multiple images (for example, video clipping or insert) can be chosen as target, the image in the single group image among comparison object image and multiple series of images.In such example, user selects video insert, and the request multiple films of search or video file, to mate frame.In such example, user can utilize the search terms of video clipping (or image) as the inquiry of search video content.Also can utilize other information to help guidance search.For example, in a kind of register instrument is realized, register can be according to the title of file of therefrom having selected target image montage (in insert), and select File automatically, with searching image.In another example, register instrument can utilize about the target image of selecting or timestamp or the frame number information of montage, selects frame or frame scope from another file.
In some implementations, frame coupling can be as a part for another processing.Same or similar frame or the image that can identify in each source of separation can contribute to tissue and resource management.
In such example, video editing instrument, by multiple video samples and final video file, utilizes frame coupling, sets up montage list.Montage list is provided as the editor's who makes the final version of video and original video film or the video tape of the shooting of recording television programs or film (for example for) is carried out list.If there is no montage list, can be difficult to so determine which original video is used to make final version (for example,, if on film, certain scene is taken repeatedly, be difficult to so know to have used which cinema scene, or each section is how to edit together).First, all each frames is inhaled in video frequency tool or addressable database.Each frame from final version is used as target frame, and original video is used as search frame.Utilize frame coupling, video frequency tool can mate from the frame in frame and the original video of final version.By for example, concentrating in together (, frame 1-200 is the frame 1251-1450 in the video A of source, and frame 201-275 is the frame 12001-1275 in the video B of source, etc.) from the successive frame in identical source, video frequency tool can create montage list subsequently.
In another example, except identifying identical frame, or replace the identical frame of identification, frame coupling also can be used for identifying different frames.A kind of such realization of video frequency tool utilizes difference analysis to carry out each version of comparison film (or other video content), to determine that, in each file, what is unique.For example, described instrument can utilize relatively arenas version and director's montage version of film of difference analysis.Utilize frame coupling, described instrument can be determined which frame identical (or enough similar), different with which frame.In one implementation, described tool identification has the frame of putting letter score or Comparison score lower than the threshold value of definition, to identify fully different frame.This identification represents increment or the difference (also serializable ground or concurrently more more version) between two kinds of versions of comparison.Described instrument can be presented to user the unique frames in every kind of version, or list or report are provided.The list of user's capable of regulating, or adjust threshold value, to produce a new framing.User subsequently can be this information for other editor or other operation, such as dynamically editing, update, audio frequency adaptation, spolen title confirm and explanatory captions adaptation.Like this, described instrument can, from the larger video file of two or more, re-create insert, with mate insert or montage and larger video file contrary process similar.
Realize and comprise one or more programmable processors and a corresponding computer system component, to preserve and computer instructions, such as providing image ratio and pixel comparison, and the generation of Comparison score, storage and comparison.Can be in individual equipment or system, or multiple equipment or system preservation and visit data and/or instruction mid-span or networking.
Various variations in addition and realization are also possible.For example, video file can be film, game content, TV, web video etc.In other example, can outside making, professional video use frame coupling, such as the content for consumer or user's generation, for organizing and mating video in the individual image of collecting, for searching for local content or the online content of preserving, etc.Thereby claim is not limited to object lesson described above.
The above explanation that disclosed various realizations are provided is in order to enable those skilled in the art to make or utilize the present invention.To one skilled in the art, be apparent to the various amendments of these realizations, and general principle disclosed herein realizes applicable to other, and do not depart from the spirit or scope of the present invention.Thereby other realization and variation are also within the scope of the invention.For example, example concentrates on for the demonstration of film and record, but register can be exclusively used in other video, such as the content of TV programme, internet video or user's generation, or is exclusively used in audio frequency, such as broadcast or blog, or other content, such as game or text, or their combination is (for example, coupling and adaptive video, audio frequency and text, such as mating for screen play and following the tracks of).In specific register is realized, not necessarily need all features of each example.In addition, understand that explanation given here and accompanying drawing represent the extensively theme of imagination of the present invention.Will understand in addition, scope of the present invention comprises apparent other to one skilled in the art completely and realizes, scope of the present invention thereby only limited by the claim of adding.

Claims (13)

1. for a method for movement images, comprising:
Select target image;
Select one or more searching images;
Comparison object image and corresponding to the negative-appearing image of each searching image, thus the image Comparison score of each searching image generated; And
Identification has the searching image of optimized image Comparison score.
2. in accordance with the method for claim 1, wherein said one or more searching image represents the video file of the data of each frame of video corresponding to preservation.
3. the data representation film of wherein said video file in accordance with the method for claim 2.
4. in accordance with the method for claim 1, wherein comparison object image and negative-appearing image comprise the absolute difference of determining between target image and negative-appearing image.
5. in accordance with the method for claim 1, wherein comparison object image and negative-appearing image comprise:
The pixel value of the respective pixel in pixel value and the negative-appearing image of the each pixel in composite object image;
More each packed-pixel value and desired value, thereby the pixel Comparison score of the each pixel in generation target image;
The pixel Comparison score of all pixels in composite object image, thus the image Comparison score about the comparison of target image and negative-appearing image produced.
6. in accordance with the method for claim 1, also comprise and select at least one other target image, and the searching image of the target image of each selection and selection is compared.
7. the target image of wherein selecting in accordance with the method for claim 6, is corresponding to video clipping.
8. in accordance with the method for claim 1, wherein select at least two searching images.
9. in accordance with the method for claim 1, the searching image of wherein selecting comprises the image corresponding to multiple video files, and each file is preserved the data of the each frame that represents video.
10. preservation is for a non-provisional tangible storage medium for the computer program of movement images, and described computer program comprises the executable instruction that makes computer carry out following operation:
Select target image;
Select one or more searching images;
Comparison object image and corresponding to the negative-appearing image of each searching image, thus the image Comparison score of each searching image generated; And
Identification has the searching image of optimized image Comparison score.
11. according to non-provisional tangible storage medium claimed in claim 10, wherein makes the executable instruction of computer comparison object image and negative-appearing image comprise the executable instruction that makes computer carry out following operation:
Determine the absolute difference between target image and negative-appearing image.
12. according to non-provisional tangible storage medium claimed in claim 10, wherein makes the executable instruction of computer comparison object image and negative-appearing image comprise the executable instruction that makes computer carry out following operation:
The pixel value of the respective pixel in pixel value and the negative-appearing image of the each pixel in composite object image;
More each packed-pixel value and desired value, thereby the pixel Comparison score of the each pixel in generation target image;
The pixel Comparison score of all pixels in composite object image, thus the image Comparison score about the comparison of target image and negative-appearing image produced.
13. according to non-provisional tangible storage medium claimed in claim 10, also comprises the executable instruction that makes computer carry out following operation:
Select at least one other target image, and the searching image of the target image of each selection and selection is compared.
CN201280052184.5A 2011-09-13 2012-09-13 Logging events in media files including frame matching Pending CN104025465A (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201161534275P 2011-09-13 2011-09-13
US61/534,275 2011-09-13
US201261624123P 2012-04-13 2012-04-13
US61/624,123 2012-04-13
PCT/US2012/055213 WO2013040244A1 (en) 2011-09-13 2012-09-13 Logging events in media files including frame matching

Publications (1)

Publication Number Publication Date
CN104025465A true CN104025465A (en) 2014-09-03

Family

ID=47883740

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201280052184.5A Pending CN104025465A (en) 2011-09-13 2012-09-13 Logging events in media files including frame matching

Country Status (3)

Country Link
EP (1) EP2742599A4 (en)
CN (1) CN104025465A (en)
WO (1) WO2013040244A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110121098A (en) * 2018-02-05 2019-08-13 腾讯科技(深圳)有限公司 Video broadcasting method, device, storage medium and electronic device
CN117132925A (en) * 2023-10-26 2023-11-28 成都索贝数码科技股份有限公司 Intelligent stadium method and device for sports event

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2843960A1 (en) * 2013-08-28 2015-03-04 Thomson Licensing Method and apparatus for managing metadata of media data
KR20160013036A (en) * 2013-05-27 2016-02-03 톰슨 라이센싱 Method and apparatus for visually representing metadata of media data
US11659214B2 (en) 2020-07-20 2023-05-23 Netflix, Inc. Automated workflows from media asset differentials

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060085477A1 (en) * 2004-10-01 2006-04-20 Ricoh Company, Ltd. Techniques for retrieving documents using an image capture device
US20090310681A1 (en) * 2006-03-23 2009-12-17 Nicolas Gaude System for analysis of motion
US20100309379A1 (en) * 2009-06-05 2010-12-09 Schoenblum Joel W Efficient spatial and temporal transform-based video preprocessing

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050069291A1 (en) * 2003-09-25 2005-03-31 Voss James S. Systems and methods for locating a video file
JP4798018B2 (en) * 2007-02-22 2011-10-19 株式会社明電舎 Image matching device
EP2661701A1 (en) * 2011-01-04 2013-11-13 Sony Corporation Logging events in media files

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060085477A1 (en) * 2004-10-01 2006-04-20 Ricoh Company, Ltd. Techniques for retrieving documents using an image capture device
US20090310681A1 (en) * 2006-03-23 2009-12-17 Nicolas Gaude System for analysis of motion
US20100309379A1 (en) * 2009-06-05 2010-12-09 Schoenblum Joel W Efficient spatial and temporal transform-based video preprocessing

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110121098A (en) * 2018-02-05 2019-08-13 腾讯科技(深圳)有限公司 Video broadcasting method, device, storage medium and electronic device
CN110121098B (en) * 2018-02-05 2021-08-17 腾讯科技(深圳)有限公司 Video playing method and device, storage medium and electronic device
CN117132925A (en) * 2023-10-26 2023-11-28 成都索贝数码科技股份有限公司 Intelligent stadium method and device for sports event
CN117132925B (en) * 2023-10-26 2024-02-06 成都索贝数码科技股份有限公司 Intelligent stadium method and device for sports event

Also Published As

Publication number Publication date
EP2742599A4 (en) 2016-01-13
WO2013040244A1 (en) 2013-03-21
EP2742599A1 (en) 2014-06-18

Similar Documents

Publication Publication Date Title
US10015463B2 (en) Logging events in media files including frame matching
US11398171B2 (en) System for authoring and editing personalized message campaigns
US8302010B2 (en) Transcript editor
US6789109B2 (en) Collaborative computer-based production system including annotation, versioning and remote interaction
US9881215B2 (en) Apparatus and method for identifying a still image contained in moving image contents
US20100050080A1 (en) Systems and methods for specifying frame-accurate images for media asset management
US10242712B2 (en) Video synchronization based on audio
JPH07182365A (en) Device and method for assisting multimedia conference minutes generation
JP2004228779A (en) Information processor
CN104025465A (en) Logging events in media files including frame matching
US8819558B2 (en) Edited information provision device, edited information provision method, program, and recording medium
GB2520041A (en) Automated multimedia content editing
US20140006978A1 (en) Intelligent browser for media editing applications
US20080304747A1 (en) Identifiers for digital media
CN103534695A (en) Logging events in media files
BE1023431B1 (en) AUTOMATIC IDENTIFICATION AND PROCESSING OF AUDIOVISUAL MEDIA
US11380364B2 (en) Editing and tracking changes in visual effects
WO2016203469A1 (en) A digital media reviewing system and methods thereof
TWI497959B (en) Scene extraction and playback system, method and its recording media
Rosenberg Adobe Premiere Pro 2.0: Studio Techniques
JP7198564B2 (en) Content production system, content production device and content production method
Riley et al. The craft of the cut: the Final Cut Pro X editor's handbook
Boykin Apple Pro Training Series: Final Cut Pro X 10.1 Quick-Reference Guide
Shapiro et al. From Still to Motion: Editing DSLR Video with Final Cut Pro X

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20140903