CN101584001A

CN101584001A - Automated production of multiple output products

Info

Publication number: CN101584001A
Application number: CNA2007800477127A
Authority: CN
Inventors: J. A. Manico; T. J. Whitcher; J. R. McCoy; T. Arujunan
Original assignee: Eastman Kodak Co
Current assignee: Gaozhi 83 Foundation Co.,Ltd.
Priority date: 2006-12-20
Filing date: 2007-12-20
Publication date: 2009-11-18
Anticipated expiration: 2027-12-20
Also published as: CN101568969B; CN101584001B; CN101568969A

Abstract

A system and method simplifying the creation process for multimedia slideshows, collages, movies, and other imaging products. Users can share their stories using imaging services, which will handle the formatting and delivery of content for recipients. Recipients can then easily request output from the shared stories in the form of prints, DVDs, collage, poster, picture book or custom output.

Description

The automatic generation of multiple output products

Technical field

The present invention relates to an architecture, methods, and software for automatically producing story-sharing products. In particular, the present invention relates to simplifying the creation process for multimedia slideshows, collages, movies, photo albums, and other image products.

Summary of the invention

A preferred embodiment of the present invention comprises a computer system that includes storage for digital media resources and a program for automatically applying a selected digital theme treatment to those resources. Exemplary theme treatments include birthday, anniversary, vacation, holiday, family, or sports themes. A companion program automatically selects the resources and the theme to be applied to them, thereby producing a compelling visual story. This visual story is stored as a descriptor file that can be sent or transmitted to other computer systems or imaging devices for display. The term "display" as used herein also covers, for example, a printer that outputs hard copy for display, as well as any other output device such as a display screen. Another companion program that interoperates with the above programs includes a rendering application, which determines the compatibility of the descriptor file with a particular output imaging device and formats the descriptor file into an output file for a particular pre-selected output device. Exemplary output formats include a print, a photo album, a poster, a video, a DVD, a digital slideshow, a downloadable movie, or a web site.

Another preferred embodiment of the present invention comprises a previewer for displaying a representation of the output image product based on the output file and the selected output device.

Another preferred embodiment of the present invention comprises a plurality of digital effects that can be applied automatically to the digital resources together with the theme treatment. This embodiment requires a rule database for determining whether a particular theme or effect can be digitally applied to a particular resource. If a theme or effect cannot be applied to a resource, the rule database serves to constrain the application of that theme or effect to that resource. The rule database can contain several rules, including any combination of theme-related rules, zoom rules, algorithm applicability based on resource metadata, multiple-resource operation rules, operation order rules, operation substitution rules, price constraint rules, user privilege rules, and rendering rules. The rendering program can modify a resource according to the constraints imposed by the rule database.
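The rule-database check described above can be sketched as follows. This is only an illustrative sketch, not the patented implementation: the `Resource` class, the two rule functions, the effect names, and the 1600-pixel threshold are all hypothetical and do not appear in the source.

```python
from dataclasses import dataclass, field


@dataclass
class Resource:
    """A digital resource with the metadata a rule may inspect (hypothetical shape)."""
    name: str
    metadata: dict = field(default_factory=dict)


# Each rule returns True when the given effect may be applied to the resource.
def zoom_rule(resource, effect):
    # Hypothetical zoom rule: zooming needs enough pixels to stay sharp.
    if effect != "zoom":
        return True
    return resource.metadata.get("width", 0) >= 1600


def video_only_rule(resource, effect):
    # Hypothetical applicability rule driven by resource metadata.
    if effect != "slow_motion":
        return True
    return resource.metadata.get("kind") == "video"


RULE_DATABASE = [zoom_rule, video_only_rule]


def allowed_effects(resource, requested_effects, rules=RULE_DATABASE):
    """Keep only the effects that every rule permits for this resource."""
    return [
        effect for effect in requested_effects
        if all(rule(resource, effect) for rule in rules)
    ]


small_photo = Resource("beach.jpg", {"kind": "still", "width": 800})
print(allowed_effects(small_photo, ["zoom", "sepia", "slow_motion"]))
```

Modelling each rule as a small predicate keeps the database open to the other rule families listed in the text (price constraints, user privileges, and so on) without changing the filtering function.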

Another preferred embodiment of the present invention comprises a computer-implemented method that, as described above, selects a plurality of computer-accessible digital resources. As used herein, "computer-accessible" data may be stored on the computer's hard disk drive or other memory, on removable storage devices or magnetic media connected to the computer, or on a network server or network storage device with which the computer can communicate when connected to a network, including by wired and wireless communication. The method includes selecting a computer-accessible theme and applying theme elements to the selected resources to form a story descriptor file. The story descriptor file includes the resources and the theme elements. Effects can also be applied to the resources along with the theme elements. An output format or a preferred output device can be selected, which causes the computer to generate one or more output descriptor files based on the story descriptor file. Optionally, a rule database can be consulted as described above, which may determine that a particular effect or theme element cannot be applied to a resource, for example because of a technical incompatibility. In that case, the method includes constraining the application of that theme or effect. The method also includes modifying at least one resource in response to consulting the rule database. Optionally, the method allows previewing a representation of the output product of the story descriptor file, which depends on the selected output device or output format. The method also allows the descriptor file to be output as an image product on one device or on multiple output devices, again depending, as described above, on the compatibility of the output descriptor file with the device.

Further embodiments contemplated by the present invention include computer-readable media and program storage devices that tangibly embody or carry a program of instructions readable by a machine or processor, for causing the machine or computer processor to execute instructions or data structures stored thereon. Such computer-readable media can be any available media that can be accessed by a general-purpose or special-purpose computer. Such media can comprise physical computer-readable media such as RAM, ROM, EEPROM, CD-ROM, DVD, or other optical disk storage, magnetic disk storage, or other magnetic storage devices. Any other medium that can be used to carry or store software programs accessible by a general-purpose or special-purpose computer is considered within the scope of the present invention.

These and other aspects and objects of the present invention will be better appreciated and understood when considered in conjunction with the following description and the accompanying drawings. It should be understood, however, that the following description, while indicating preferred embodiments of the present invention and numerous specific details thereof, is given by way of illustration and not of limitation. Many changes and modifications may be made within the scope of the present invention without departing from its spirit, and the invention includes all such modifications. The figures below are not drawn to any precise scale with respect to size, angular relationship, or relative position.

Description of drawings

Fig. 1 is a block diagram of a computer system in which various embodiments of the present invention can be implemented;

Fig. 2 is a diagrammatic representation of the architecture of a system for composing a story constructed according to the present invention;

Fig. 3 is a flow chart of the operation of a composer module constructed according to the present invention;

Fig. 4 is a flow chart of the operation of a preview module constructed according to the present invention;

Fig. 5 is a flow chart of the operation of a rendering module constructed according to the present invention;

Fig. 6 is a list of extracted metadata tags obtained from capture and usage systems according to the present invention;

Fig. 7 is a list of derived metadata tags obtained from analysis of resource content and of existing extracted metadata tags according to the present invention;

Figs. 8A-8D illustrate a listing of an example story-sharing descriptor file showing how resource duration affects two different outputs according to the present invention;

Fig. 9 is an exemplary slideshow representation constructed according to the present invention; and

Fig. 10 is an exemplary collage representation constructed according to the present invention.

Embodiment

A resource is a digital file consisting of a picture, still image, text, graphics, music, movie, video, audio, multimedia presentation, or descriptor file. Several standard formats exist for each type of resource. The story-sharing system described herein is directed to easily creating intelligent, compelling stories in a sharable format and to delivering a consistently optimal playback experience across numerous imaging systems. Story sharing allows users to create, play, and share stories easily. Stories can include pictures, video, and/or audio. Users can share their stories using imaging services, which handle the formatting and delivery of content for recipients. Recipients can then easily request output from the shared stories in the form of prints, DVDs, or custom output such as a collage, poster, or picture book.

As shown in Fig. 1, a system for implementing the present invention includes a computer system 10. The computer system 10 includes a CPU 14, which communicates with other devices over a bus 12. The CPU 14 executes software stored, for example, on a hard disk drive 20. A video display device 52 is coupled to the CPU 14 via a display interface device 24. A mouse 44 and a keyboard 46 are coupled to the CPU 14 via a desktop interface device 28. The computer system 10 also contains a CD-R/W drive 30 for reading various CD media and for writing to CD-R or CD-RW writable media 42. A DVD drive 32 is also included for reading from and writing to DVD discs 40. An audio interface device 26 connected to the bus 12 permits audio data, for example from a digital sound file stored on the hard disk drive 20, to be converted to analog audio signals suitable for a speaker 50. The audio interface device 26 also converts analog audio signals from a microphone 48 into digital data suitable for storage, for example, on the hard disk drive 20. In addition, the computer system 10 is connected to an external network 60 via a network connection device 18. A digital camera 6 can be connected to the computer system 10 through, for example, a USB interface device 34 in order to transfer still images, audio/video, and sound files from the camera to the hard disk drive 20 and vice versa. The USB interface can also be used to connect USB-compatible removable storage devices to the computer system. A collection of digital multimedia or single-media objects (digital images) can reside exclusively on the hard disk drive 20, on a compact disc 42, or at a remote storage location such as a web server accessible via the network 60. The collection can also be distributed across any or all of these.

It should be understood that these digital multimedia objects can be digital still images, such as those produced by digital cameras; audio data, such as digital music or voice files in any of various formats such as the "WAV" or "MP3" audio file formats; or digital video segments with or without sound, such as MPEG-1 or MPEG-4 video. Digital multimedia objects also include files produced by graphics software. A database of digital multimedia objects can comprise only one type of object or any combination.

The story-sharing system can intelligently produce a story automatically with minimal user input. The story-sharing architecture and workflow of a system constructed according to the present invention are illustrated concisely in Fig. 2 and include the following elements:

Resources 110, which can be stored in computer memory, on computer-accessible storage, or on a network.

Story-sharing descriptor file 112.

Composed story-sharing descriptor file 115.

Theme descriptor file 111.

Output descriptor file 113.

Story composer/editor 114.

Story renderer/viewer 116.

Story authoring component 117.

In addition to the above elements there is a theme style sheet, which comprises the background and foreground resources for a theme. A foreground resource is an image that can be superimposed on another image. A background image is an image that provides a background pattern, such as a border or a location, for the subject of a digital photograph. Multiple layers of foreground and background resources can be added to an image to produce a unique product.

The initial story descriptor file 112 can be a default XML file that any system can optionally use to provide default information. Once this file has been completely populated by the composer 114, it becomes the composed story descriptor file 115. In its default version, the story descriptor file contains the basic information for composing a story. For example, it can define a simple slideshow format that displays one line of text, reserves blank areas for several images, defines a display duration for each, and allows background music to be selected.
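A default descriptor of this kind might be modelled as in the following sketch, built with Python's standard `xml.etree.ElementTree` module. The source does not publish the XML schema, so the element and attribute names used here (`story`, `title`, `backgroundMusic`, `imageSlot`, `duration`) are assumptions made purely for illustration.

```python
import xml.etree.ElementTree as ET


def build_default_story_descriptor(image_slots=3, seconds_per_item=5.0):
    """Build a default slideshow descriptor with empty slots (hypothetical schema)."""
    story = ET.Element("story", {"format": "slideshow"})
    ET.SubElement(story, "title").text = ""            # one line of text
    ET.SubElement(story, "backgroundMusic", {"src": ""})
    items = ET.SubElement(story, "items")
    for index in range(image_slots):
        ET.SubElement(items, "imageSlot", {
            "index": str(index),
            "src": "",                                  # blank area reserved for an image
            "duration": str(seconds_per_item),          # display duration in seconds
        })
    return story


def fill_slots(story, resource_paths):
    """Populate the reserved slots in order, as a composer might do."""
    for slot, path in zip(story.iter("imageSlot"), resource_paths):
        slot.set("src", path)
    return story


descriptor = fill_slots(build_default_story_descriptor(2), ["cake.jpg", "candles.jpg"])
print(ET.tostring(descriptor, encoding="unicode"))
```

Once every slot is filled, the same tree plays the role of the populated descriptor (file 115 in the text) and can be serialized for transfer to another system.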

The composed story descriptor file provides the information necessary to describe a compelling story. As described below, the composed story descriptor file contains the resource information, theme information, effects, transitions, metadata, and all other information required to construct a complete and compelling story. In some respects it is similar to a storyboard. It can be the default descriptor described above, minimally populated with selected resources, or it can, for example, include a large number of user or third-party resources with multiple effects and transitions.

Therefore, once this composed descriptor file 115 (which represents a story) has been created, it can be stored on a portable storage device together with the resources associated with the story, or transmitted to any imaging system and used there with the rendering component 116 to create a story-sharing output product. This allows a system to compose a story, persist the information in the composed story descriptor file, and later create rendered story-sharing output files (slideshows, movies, and so on) on a different computer or for a different output.

The theme descriptor file 111 is another XML file, for example, that provides the necessary theme information, such as artistic presentation. It includes:

The location of the theme, for example on the computer system or on a network such as the Internet.

Background/foreground information.

Special effects and transitions that are specific to a theme, such as a holiday theme, or that have personal significance.

Music files associated with the theme.

The theme descriptor file is, for example, in XML format and points to an image template file, such as a JPG file, that provides one or more spaces designated for displaying resources selected from the resource collection 110. For example, such a template may display the text message "Happy Birthday" in a birthday template.

The composer 114 used to develop the story uses the theme descriptor file 111, which contains the information described above. This module takes input from the three preceding components and can optionally apply automatic image selection algorithms to compose the story descriptor file 115. The user can select a theme, or a theme can be selected algorithmically from the content of the resources provided. The composer 114 uses the theme descriptor file 111 when building the composed story-sharing descriptor file 115.

The story composer 114 is a software component that intelligently creates the composed story descriptor file given the following inputs:

Resource locations and resource-related information (metadata). The user selects the resources 110, or the resources 110 can be selected automatically from an analysis of the associated metadata.

The theme descriptor file 111.

User input relating to effects, transitions, and image organization. Normally the theme descriptor file contains most of this information, but the user can choose to edit parts of it.

With this input information, the composer component 114 arranges the necessary information to compose a complete story in the composed story descriptor file, which contains all of the information required by the renderer. Any edits that the user makes through the composer are reflected in the story descriptor file 115.

Given the inputs, the composer performs the following operations:

Intelligent organization of the resources, such as grouping or establishing a chronology.

Applying suitable effects, transitions, and so on based on the selected theme.

Analyzing the resources and reading the information necessary to create a compelling story. This requires detailed descriptive information about the resources, which can be used to determine whether an effect is feasible for a particular resource.
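The first operation in the list above, organizing resources by grouping them and establishing a chronology, can be illustrated with a minimal sketch. It assumes each resource is a dictionary with a `name` and a `captured` timestamp (hypothetical field names) and simply groups by calendar day. The source does not specify the grouping method, so this stands in for whatever organization module 114 actually performs.

```python
from datetime import datetime
from itertools import groupby


def organize_chronologically(resources):
    """Sort resources by capture time, then group the names by calendar day."""
    ordered = sorted(resources, key=lambda r: r["captured"])
    return [
        [r["name"] for r in group]
        for _, group in groupby(ordered, key=lambda r: r["captured"].date())
    ]


photos = [
    {"name": "cake.jpg", "captured": datetime(2006, 5, 14, 15, 30)},
    {"name": "beach.jpg", "captured": datetime(2006, 7, 2, 10, 0)},
    {"name": "candles.jpg", "captured": datetime(2006, 5, 14, 15, 45)},
]
print(organize_chronologically(photos))
```

Sorting before grouping matters here because `itertools.groupby` only merges adjacent items that share a key.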

The output descriptor file 113 is, for example, an XML file that contains information about what output will be produced and the information required to produce that output. This file contains constraints based on the following factors:

The device capabilities of the output device.

Hard copy output formats.

Output file formats (MPEG, Flash, MOV, MPV).

The rendering rules used, as described below, to facilitate rendering of the story when the output modality requires information that is not contained in the story descriptor file (because the output device is not known in advance, the descriptor can be reused on another device).

Descriptor translation information, such as XSL Transformations (XSLT) programs used to modify the story descriptor file so that it contains no scalable information but only information specific to the output modality.

The renderer 116 uses the output descriptor file 113 to determine the available output formats.
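One way to picture how such constraints could be consumed is the sketch below, which parses a small output descriptor and checks resources against a minimum-resolution constraint. The XML layout, the attribute names (`minWidth`, `minHeight`, `fileFormat`), and the pixel values are assumptions for illustration; the source does not give the actual file layout.

```python
import xml.etree.ElementTree as ET

# Hypothetical output descriptor for a poster product.
OUTPUT_DESCRIPTOR_XML = """
<output name="poster" fileFormat="JPG">
  <constraint minWidth="3000" minHeight="2000"/>
</output>
"""


def load_constraints(xml_text):
    """Read the output name and resolution constraints from the descriptor."""
    root = ET.fromstring(xml_text.strip())
    constraint = root.find("constraint")
    return {
        "name": root.get("name"),
        "min_width": int(constraint.get("minWidth")),
        "min_height": int(constraint.get("minHeight")),
    }


def meets_constraints(resource, constraints):
    """True when the resource has enough pixels for the requested output."""
    return (resource["width"] >= constraints["min_width"]
            and resource["height"] >= constraints["min_height"])


constraints = load_constraints(OUTPUT_DESCRIPTOR_XML)
resources = [
    {"name": "portrait.jpg", "width": 4000, "height": 3000},
    {"name": "phone_snap.jpg", "width": 640, "height": 480},
]
usable = [r["name"] for r in resources if meets_constraints(r, constraints)]
print(constraints["name"], usable)
```

A resource that fails the check need not be discarded. The text describes the rendering program modifying resources or constraining effects instead, so a fuller version would return the reason for the failure rather than a plain boolean.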

The story renderer 116 is a configurable component made up of optional plug-ins that correspond to the different output formats supported by the rendering system. The story renderer 116 formats the story descriptor file 115 according to the output format selected for the story-sharing product. For example, the format can be modified depending on whether the output is to be viewed on a small mobile phone, on a large-screen device, or in a printed format such as a photo album. The renderer then determines the requirements that the resources must meet, such as resolution, based on the constraints of the output format. At run time, this component reads the composed story-sharing descriptor file 115 created by the composer 114 and acts on it by processing the story and creating the required output 18, such as a DVD or other hard copy format (slideshow, movie, custom output, and so on). The renderer 116 interprets the elements of the story descriptor file 115 and, depending on the selected output type, creates the story in the format required by the output system. For example, the renderer can read the composed story-sharing descriptor file 115 and create an MPEG-2 slideshow based on all of the information described in the composed story descriptor file 115. The renderer 116 performs the following functions:

Reading the composed story descriptor file 115 and interpreting it correctly.

Translating the interpretation and calling the appropriate plug-in to perform the actual encoding/transcoding.

Creating the required rendered output format.

This component takes the generated story and authors it by creating menus, titles, credits, and chapters as appropriate for the required output.

The authoring component 117 creates a consistent playback menu experience across different imaging systems. Optionally, this component contains recording capability. It also includes optional plug-in modules for producing specific outputs, such as a slideshow using software that implements MPEG-2, photo album software for producing a photo album, or a calendar plug-in for producing a calendar. Specific outputs in XML format can be passed directly to devices that interpret XML and therefore do not require a special plug-in.
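The plug-in arrangement described above, in which the output type selects the module that produces the result, can be sketched with a simple registry. The decorator, the plug-in names, and the string results are hypothetical stand-ins; real plug-ins would invoke encoders or layout software rather than return text.

```python
RENDER_PLUGINS = {}


def register_plugin(output_format):
    """Decorator that installs a plug-in for one output format."""
    def decorator(func):
        RENDER_PLUGINS[output_format] = func
        return func
    return decorator


@register_plugin("slideshow")
def render_slideshow(story):
    # Stand-in for an MPEG-2 slideshow encoder.
    return f"slideshow with {len(story['resources'])} slides"


@register_plugin("album")
def render_album(story):
    # Stand-in for photo album layout software.
    return f"album with {len(story['resources'])} pictures"


def render(story, output_format):
    """Dispatch the story to the plug-in registered for the selected output."""
    try:
        plugin = RENDER_PLUGINS[output_format]
    except KeyError:
        raise ValueError(f"no plug-in installed for {output_format!r}") from None
    return plugin(story)


story = {"resources": ["cake.jpg", "candles.jpg"]}
print(render(story, "slideshow"))
print(render(story, "album"))
```

Because the same story dictionary is passed to every plug-in, the sketch also mirrors the point made in the text that one descriptor can be reused to produce several different outputs.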

Once a particular story has been described in the composed story descriptor file 115, the file can be reused to create different output formats of that particular story. This allows a story to be composed by or on one computer system and persisted through the descriptor file. The composed story descriptor file can be stored on any system or portable storage device and later reused to create different outputs on different imaging systems.

In further embodiments of the present invention, the story descriptor file 115 does not contain presentation information but instead references an identifier for a particular presentation stored in the form of a template. In these embodiments, a template library, as described with respect to the theme descriptor file 111, is embedded in the composer 114 and the renderer 116. The story descriptor file then points to the template files, but they are not included as part of the descriptor file itself. In this way, the complete story is not exposed to a third party who may be an unintended recipient of the story descriptor file.

As described in the preferred embodiment, the three main modules of the story-sharing architecture, namely the composer module 114, the preview module (not shown in Fig. 2), and the rendering module 116, are illustrated in more detail in Figs. 3, 4, and 5, respectively, and are described in more detail below. Referring to Fig. 3, an operational flow chart of the composer module of the present invention is illustrated. In step 600, the user begins the process by identifying himself or herself to the system. This can take the form of a user name and password, a biometric ID, or selection of a pre-existing account. By providing an ID, the system can incorporate any user preferences and profile information, previous usage patterns, and personal information such as existing personal and family relationships and significant dates and events. This can also be used to provide access to the user's address book, phone, and/or e-mail lists, which are needed to facilitate sharing the finished product with intended recipients. The user ID can also be used to provide access to the user's resource collection, as shown in step 610. The user's resource collection can include personally or commercially generated third-party content, including digital still images, text, graphics, video clips, sound, music, poetry, and the like. At step 620, the system reads and records the existing metadata associated with each resource file, referred to herein as input metadata, such as time/date stamps, exposure information, video clip duration, GPS location, image orientation, and file names. At step 630, a series of resource analysis techniques, such as eye/face identification/recognition, object identification/recognition, text recognition, voice-to-text, indoor/outdoor determination, scene illuminant, and subject classification algorithms, are used to provide additional resource-derived metadata.

Several different image analysis and classification algorithms are described in several commonly owned patents and patent applications. For example, temporal event clusters of image resources are generated by automatically sorting, segmenting, and clustering an unorganized set of media resources into separate temporal events and sub-events, as described in detail in commonly assigned U.S. Patent No. 6606411, entitled "A Method For Automatically Classifying Images Into Events", issued on August 12, 2003, and commonly assigned U.S. Patent No. 6351556, entitled "A Method For Automatically Comparing Content of Images for Classification Into Events", issued on February 26, 2002. Content-based image retrieval (CBIR) retrieves images from a database that are similar to an example (or query) image, as described in detail in commonly assigned U.S. Patent No. 6480840, entitled "Method And Computer Program Product For Subjective Image Content Similarity-Based Retrieval", issued on November 12, 2002. Images can be judged to be similar based on many different metrics, for example similarity by color, texture, or other recognizable content such as faces. This concept can be extended to portions of images or regions of interest (ROI). The query can be either a whole image or a portion (ROI) of an image. The retrieved images can be matched as whole images, or each image can be searched for a corresponding region similar to the query. In the context of the present invention, CBIR can be used to automatically select and rank resources that are similar to other resources or to a theme. For example, a "Valentine's Day" theme might need to find images with a predominance of red, or autumn colors for a "Halloween" theme.

Scene classifiers identify or classify a scene into one or more scene types (for example, beach, indoor, and so on) or one or more activities (for example, running, and so on). Exemplary scene classification types and details of their operation are described in the following patents and patent applications: U.S. Patent No. 6282317, entitled "Method For Automatic Determination Of Main Subjects In Photographic Images"; U.S. Patent No. 6697502, entitled "Image Processing Method For Detecting Human Figures In A Digital Image Assets"; U.S. Patent No. 6504951, entitled "Method For Detecting Sky In Images"; U.S. Application Publication No. US 2005/0105776 A1, entitled "Method For Semantic Scene Classification Using Camera Metadata And Content-Based Cues"; U.S. Application Publication No. US 2005/0105775 A1, entitled "Method Of Using Temporal Context For Image Classification"; and U.S. Application Publication No. US 2004/003746 A1, entitled "Method For Detecting Objects In Digital Image Assets".

Face detection algorithms are used to find as many faces as possible in a resource collection, and are described in the following patents and applications: U.S. Patent No. 7110575, entitled "Method For Locating Faces In Digital Color Images", issued on September 19, 2006; U.S. Patent No. 6940545, entitled "Face Detecting Camera And Method", issued on September 6, 2006; and U.S. Application Publication No. US 2004/0179719 A1, entitled "Method And System For Face Detection In Digital Image Assets" (U.S. patent application filed on March 12, 2003). Face recognition is the identification or classification of a face, based on facial features, as an example of a person or as a label associated with a person, as described in the following patent applications: U.S. Patent Application No. 11/559544, entitled "User Interface For Face Recognition", filed on November 14, 2006; U.S. Patent Application No. 11/342053, entitled "Finding Images With Multiple People Or Objects", filed on January 27, 2006; and U.S. Patent Application No. 11/263156, entitled "Determining A Particular Person From A Collection", filed on October 31, 2005. Face clustering uses data generated by detection and feature extraction algorithms to group faces that appear to be similar. As described in detail below, this selection can be triggered based on a numeric confidence value.

Location-based data, as described in U.S. Application Publication No. US 2006/0126944 A1, entitled "Variance-Based Event Clustering", filed on November 17, 2004, can include cell tower locations, GPS coordinates, and network router locations. A capture device may or may not include metadata archived with an image or video file; however, such data is typically stored with the resource as metadata by the recording device that captures the image, video, or sound. Location-based metadata can be very powerful when used in concert with other attributes for media clustering. For example, the U.S. Geological Survey's Board on Geographic Names maintains the Geographic Names Information System, which provides a means of mapping latitude and longitude coordinates to commonly recognized feature names and feature types, including types such as church, park, or school.

The identification and classification of detected events into semantic categories such as birthday, wedding, and so on is described in detail in U.S. Patent Application Publication No. US 2007/0008321 A1, entitled "Identifying Collection Images With Special Events", filed on July 11, 2005. Media resources classified as an event can be so associated because of the same location, setting, or activity per unit of time, and are intended to relate to the subjective intent of the user or group of users. Within each event, media resources can also be clustered into separate groups of related content called sub-events. Media in an event are associated with the same setting or activity, while media in a sub-event have similar content within the event.

An image value index ("IVI") is defined as a measure of the degree of importance (significance, attractiveness, usefulness, or utility) that an individual user might associate with a particular resource, and it can be stored as metadata (it can also be a rating entered by the user). The image value index is described in detail in U.S. Patent Application No. 11/403686, entitled "Value Index From Incomplete Data", filed on April 13, 2006, and in U.S. Patent Application No. 11/403583, entitled "Camera User Input Based Image Value Index", filed on April 13, 2006. Automatic IVI algorithms can utilize image features such as sharpness, lighting, and other indications of quality. Camera-related metadata (exposure, time, date), image understanding (skin or face detection and the size of the skin/face area), or behavioral measures (viewing time, magnification, editing, printing, or sharing) can also be used to calculate an IVI for any particular media resource. The entire contents of the prior art references listed above are incorporated herein by reference.

At step 640, the newly derived metadata is stored together with the existing metadata associated with the corresponding resource, so as to augment the existing metadata. At step 650, the new set of metadata is used to organize and rank-order the user's resources. The ranking is based on the output of the analysis and classification algorithms according to relevance or, alternatively, on the image value index, which provides a quantitative result as described above.
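The ranking at step 650 can be pictured with the toy example below. It is not the IVI algorithm of the cited applications: the weights, the two features used (`sharpness` and `faces`), and the cap of three faces are invented for illustration, and a real index would draw on many more signals.

```python
def toy_value_index(resource):
    """Illustrative score only: weighted mix of sharpness and detected faces."""
    sharpness = resource.get("sharpness", 0.0)         # assumed to lie in [0, 1]
    face_score = min(resource.get("faces", 0), 3) / 3  # saturate at three faces
    return 0.6 * sharpness + 0.4 * face_score


def rank_resources(resources, top_n=None):
    """Order resources from highest to lowest score, optionally keeping the best few."""
    ranked = sorted(resources, key=toy_value_index, reverse=True)
    return ranked if top_n is None else ranked[:top_n]


collection = [
    {"name": "landscape.jpg", "sharpness": 0.9, "faces": 0},
    {"name": "family.jpg", "sharpness": 0.5, "faces": 3},
    {"name": "blurry.jpg", "sharpness": 0.2, "faces": 1},
]
print([r["name"] for r in rank_resources(collection)])
```

Passing `top_n` gives a simple stand-in for automatically selecting a subset of the ranked resources, which the user could then override manually.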

At decision step 660, a subset of the user's resources can be selected automatically based on the combined metadata and user preferences. This selection represents an edited set of resources, chosen using techniques such as rank ordering and the image value index as a measure of quality. At step 670, the user can optionally override the automatic selection and select and edit resources manually. At decision step 680, the combined metadata set and the selected resources are analyzed to determine whether an appropriate theme can be suggested. A theme in this context is a resource descriptor such as sports, vacation, family, holiday, birthday, or anniversary, and it can be suggested automatically from metadata such as a time/date stamp that coincides with a relative's birthday obtained from the user profile. This is advantageous because the themes available for consumer-generated resources are today almost unlimited. Searching through countless options to find a theme that conveys the right emotional mood and is compatible with the format and content characteristics of the user's resources is a daunting challenge for the user. By analyzing relationships and image content, a more specific theme can be suggested. For example, a face recognition algorithm may identify "Molly", and the user profile may indicate that "Molly" is the user's daughter. The user profile can also contain the information that at this time last year the user produced a commemorative DVD of "Molly's 4th birthday party". Dynamic themes can be provided that automatically customize a general theme such as "birthday" with additional detail. If the theme uses image templates that can be modified with automatic "fill-in-the-blank" text and graphics, "Happy Birthday" can be changed to "Happy 5th Birthday Molly" without requiring user intervention. Box 690 is included in step 680 and contains the list of available themes, which can be provided locally via a removable storage device such as a memory card or DVD, or through a network connection to a service provider. Third-party participants and copyright owners can also provide themes under a pay-per-use arrangement. The combined input and derived metadata, the output of the analysis and ranking algorithms, and the organized resource set are used to limit the user's choices to themes that suit the resource content and are compatible with the resource types. At step 200, the user can accept or reject the suggested theme. If no theme is suggested at step 680, or if the user rejects the suggested theme at step 200, she can select a theme manually at step 210, either from a limited list of themes or from the entire library of available themes.
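The "fill-in-the-blank" customization of a general theme can be sketched as follows. This is a minimal illustration, not the disclosed implementation: the function names are hypothetical, and it assumes that the recognized person's name and birth year are available from the user profile.

```python
from datetime import date
from typing import Optional


def ordinal(n: int) -> str:
    """Return n with its English ordinal suffix, e.g. 5 -> '5th'."""
    if 10 <= n % 100 <= 20:
        suffix = "th"
    else:
        suffix = {1: "st", 2: "nd", 3: "rd"}.get(n % 10, "th")
    return f"{n}{suffix}"


def customize_birthday_title(name: Optional[str],
                             birth_year: Optional[int],
                             capture: date) -> str:
    """Fill the blanks of a generic birthday title when profile data allows."""
    if name is None or birth_year is None:
        return "Happy Birthday"  # fall back to the general theme text
    age = capture.year - birth_year  # assumes the capture date is the birthday
    return f"Happy {ordinal(age)} Birthday {name}"
```

When the profile lacks the needed details, the sketch falls back to the general theme text, so no user intervention is required in either case.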

The selected theme is used, together with the metadata, to obtain theme-specific third-party resources and effects. At step 220, this additional content and processing can be provided on a removable storage device, accessed from a service provider over a communication network, or accessed through pointers to third-party vendors. Based on usage and popularity, the system can automatically monitor and document the arrangements between the different participants for distributing the revenue and fees arising from the use of these assets. These records can also be used to determine user preferences, so that popular theme-specific third-party resources and effects can be ranked higher or given higher priority, increasing the likelihood of customer satisfaction. These third-party resources and effects include dynamic auto-scaling image templates, automatic image layout algorithms, video scene transitions, scrolling titles, graphics, text, poetry, music, songs, and digital motion and still images of celebrities, popular personalities, and cartoon characters, all designed to be used with resources that the user has generated and/or acquired. Theme-specific third-party resources and effects are generally suitable both for hard copy, such as greeting cards, collages, posters, mouse pads, mugs, photo albums, and calendars, and for soft copy, such as movies, videos, digital slideshows, interactive games, web sites, DVDs, and digital cartoons. The selected resources and effects can be presented to the user for approval as graphic images, a storyboard, a descriptive list, or a multimedia presentation. At decision step 230, the user can accept or reject the theme-specific resources and effects; if she rejects them, the system presents an alternative set of resources and effects at step 250 for the user's approval or rejection. Once the user has accepted the theme-specific third-party resources and effects at step 230, they are combined with the organized user resources at step 240, and the preview module is started at step 260.

Referring now to Fig. 4, an operational flowchart of the preview module is shown. At step 270, the organized user resources and the theme-specific resources and effects are made available to the preview module. At step 280, the user selects the desired output type. Output types include various hard copy and soft copy formats, such as prints, photo albums, posters, videos, DVDs, digital slideshows, downloadable movies, and web sites. An output type can be static, such as a print or a photo album, or an interactive presentation, such as a DVD or a video game. The types are obtained from a look-up table (LUT) 290, which can be supplied to the preview module on removable media or accessed over a communication network. New output types can be added as they become available, and they can be supplied by third-party vendors. An output type contains all the rules and steps needed to present the user resources and the theme-specific resources and effects in a form compatible with the selected output format. The output type rules are used to select, from the user resources and the theme-specific resources and effects, the items that are appropriate for the output format. For example, if the song "Happy Birthday" is a designated theme-specific resource, then in a hard copy output such as a photo album it is presented as sheet music or omitted altogether. If a video, digital slideshow, or DVD is selected, the audio content of the song is selected instead. Similarly, if a face detection algorithm was used to generate content-derived metadata, the same information can be used to provide automatically cropped images for hard copy output applications, or dynamic face-centered zooms and pans for soft copy applications.
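The way output type rules adapt a resource to the selected output can be sketched as a small capability table. The table contents, key names, and return labels below are hypothetical stand-ins for the contents of LUT 290, which the description does not enumerate.

```python
# Hypothetical capability table standing in for the output type LUT 290.
OUTPUT_TYPES = {
    "photo_album": {"audio": False, "motion": False},
    "slideshow": {"audio": True, "motion": True},
    "dvd": {"audio": True, "motion": True},
}


def adapt_resource(kind: str, output_type: str) -> str:
    """Return the form in which a resource of the given kind is presented."""
    caps = OUTPUT_TYPES[output_type]
    if kind == "song":
        # Hard copy cannot play audio, so this sketch falls back to sheet
        # music (the description also allows omitting the song entirely).
        return "audio" if caps["audio"] else "sheet_music"
    if kind == "face_image":
        # Face metadata drives a pan/zoom in soft copy and a crop in hard copy.
        return "face_centered_pan_zoom" if caps["motion"] else "auto_crop"
    return kind  # other kinds pass through unchanged in this sketch
```

For instance, `adapt_resource("song", "photo_album")` yields the sheet-music form, while the same song in a DVD keeps its audio content.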

At step 300, the theme-specific effects are applied to the organized user resources and theme-specific resources for the desired output type. At step 310, a virtual output type draft is presented to the user together with the resources and output parameters provided in LUT 320. LUT 320 contains output-specific parameters such as the total number of images, the total number of video clips, clip durations, print sizes, photo album page layouts, music selections, and playing durations. At decision step 330, the user can accept the virtual output type draft or modify the resources and output parameters. If the user wants to modify the resources or output parameters, she proceeds to step 340. An example of how this would be used is shortening a downloadable video from a total duration of 6 minutes to 5 minutes. The user can edit the resources manually or let the system shorten the video automatically by removing resources, reducing their display times, speeding up transitions, and so on. Once the user is satisfied with the virtual output type draft at decision step 330, it is sent to the rendering module at step 350.
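One of the automatic shortening strategies mentioned above, reducing display times, can be sketched as a proportional rescaling. The function name is hypothetical, and uniform scaling is an assumption; the description equally allows removing resources or speeding up transitions.

```python
from typing import List


def fit_durations(durations: List[float], target_total: float) -> List[float]:
    """Scale per-resource display times so that they sum to target_total seconds."""
    total = sum(durations)
    if total <= target_total:
        return list(durations)  # already short enough, leave unchanged
    factor = target_total / total
    return [d * factor for d in durations]
```

Applied to the example in the text, a 360-second (6-minute) video made of three equal segments would be reduced to a 300-second (5-minute) video with each segment shortened by the same proportion.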

Referring now to Fig. 5, an operational flowchart of the rendering module 116 is shown. At step 360, the organized user resources and the theme-specific resources, with the effects applied for the desired output type, are made available to the rendering module. At step 370, the user selects an output format from the available look-up table shown at step 390. This LUT can be provided on a removable storage device or over a network connection. The output formats include the various digital formats supported by multimedia devices such as personal computers, mobile phones, server-based web sites, and HDTVs. They also include the digital formats, for example JPG and TIFF, required to produce hard copy output such as loose 4" × 6" prints, bound albums, and posters. At step 380, the processing specific to the output format selected by the user is applied to the organized user resources, the theme-specific resources, and the theme-specific effects. A virtual output draft is presented to the user at step 400, and at decision step 410 the user can approve or reject it. If the draft is rejected, the user can select an alternative output format; if it is approved, the output product is produced at step 420. The output product can be produced locally, for example with a home PC and/or printer, or remotely, for example through Kodak Easy Share Gallery™. At step 430, remotely produced soft copy output products are delivered to the user over a network connection, or output products are physically shipped to the user or a designated recipient.

Referring now to Fig. 6, a list of extracted metadata tags obtained from resource acquisition and utilization systems is shown. These systems include cameras, mobile phone cameras, personal computers, digital picture frames, camera docking systems, imaging devices, networked displays, and printers. Extracted metadata is synonymous with input metadata and includes information recorded automatically by the imaging device and information from the user's interaction with the device. Standard forms of extracted metadata include: time/date stamps; location information provided by the Global Positioning System (GPS), the nearest cell tower, or cell tower triangulation; camera settings; image and audio histograms; file format information; and automatic image corrections such as tone scale adjustment and red-eye removal. In addition to this automatic, device-centric recording, user interactions can also be recorded as metadata, including: "share", "favorite", or "do not erase" designations; "Digital Print Order Format (DPOF)" settings; user-selected "wallpaper" designations or "picture messaging" for mobile phone cameras; user-selected "picture messaging" recipients, identified by mobile phone number or e-mail address; and user-selected capture modes such as "sports", "macro/close-up", "fireworks", and "portrait". Image utilization devices, such as personal computers running Kodak Easy Share™ software or other image management systems, and stand-alone or connected image printers, are also sources of extracted metadata. This type of information includes print history, indicating how many times an image has been printed; storage history, indicating when and where an image has been stored or backed up; and editing history, indicating the type and amount of digital manipulation that has occurred. Extracted metadata is used to provide context that helps in obtaining derived metadata.

Referring now to Fig. 7, a list of derived metadata tags obtained from analysis of resource content and of existing extracted metadata tags is shown. Derived metadata tags can be produced by resource acquisition and utilization systems, including cameras, mobile phone cameras, personal computers, digital picture frames, camera docking systems, imaging devices, networked displays, and printers. Derived metadata tags can be produced automatically when certain predetermined criteria are met, or through direct interaction with the user. One example of the interaction between extracted and derived metadata is the use of a camera-generated image capture time/date stamp in conjunction with the user's digital calendar. Both systems can reside on the same device, such as a mobile phone camera, or can be distributed between an imaging device such as a camera and a personal computer camera docking system. A digital calendar can include significant dates of personal interest, such as "Mom and Dad's wedding anniversary", "Aunt Betty's birthday", and "Tommy's little league banquet", as well as significant dates of general interest, such as May 5, Independence Day, Halloween, and Christmas. The camera-generated time/date stamp can be used as a query against the digital calendar to determine whether any images or other resources were captured on a date of personal or general interest. If a match is found, the metadata can be updated to include this newly derived information. Further context can be established by including other extracted and derived metadata, such as location information and location recognition. Suppose, for example, that after several weeks of inactivity a series of images and videos is recorded on September 5 at a location identified as "Mom and Dad's house". In addition, the user's digital calendar shows that September 5 is "Mom and Dad's anniversary", and several of the images show a cake decorated with the text "Happy Anniversary Mom and Dad". The combined extracted and derived metadata can now automatically provide a very accurate context for the event, "Mom and Dad's anniversary". With this context established, only relevant theme choices need be offered to the user, which significantly reduces the effort required to find an appropriate theme. Because the system now knows the event type and the principal participants, it can also assist with or automate labeling, captioning, or blogging.
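The calendar query described above can be sketched as a simple date lookup. The calendar structure, the metadata keys `captured` and `derived_events`, and the function name are assumptions made for this example only.

```python
from datetime import datetime
from typing import Dict, List, Tuple

# Hypothetical calendar keyed by (month, day); entries mirror the examples in the text.
CALENDAR: Dict[Tuple[int, int], List[str]] = {
    (9, 5): ["Mom and Dad's anniversary"],
    (7, 4): ["Independence Day"],
    (10, 31): ["Halloween"],
}


def derive_event_tags(metadata: dict) -> dict:
    """Return a copy of the metadata augmented with calendar-derived event tags."""
    captured: datetime = metadata["captured"]  # extracted time/date stamp
    matches = CALENDAR.get((captured.month, captured.day), [])
    updated = dict(metadata)
    if matches:
        updated["derived_events"] = list(matches)  # newly derived metadata
    return updated
```

Location recognition or detected cake text could be combined with the returned tags to strengthen the inferred context; that step is omitted here.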

As mentioned above, another method of establishing context is referred to as "event segmentation". It uses time/date stamps to record usage patterns and, when used together with image histograms, provides a means of automatically grouping images, videos, and related resources into "events". This enables the user to organize and browse large resource collections by event.
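The time-stamp part of event segmentation can be illustrated by splitting a collection wherever a long pause in capture activity occurs. This sketch is an assumption-laden simplification: the six-hour gap threshold is arbitrary, and the histogram comparison mentioned in the text is left out.

```python
from datetime import datetime, timedelta
from typing import List


def segment_events(timestamps: List[datetime],
                   gap: timedelta = timedelta(hours=6)) -> List[List[datetime]]:
    """Group capture times into events, starting a new event after each long pause."""
    events: List[List[datetime]] = []
    for ts in sorted(timestamps):
        if events and ts - events[-1][-1] <= gap:
            events[-1].append(ts)  # close enough in time: same event
        else:
            events.append([ts])    # long pause (or first item): new event
    return events
```

In a fuller treatment, image histograms would be compared within each time-based group to refine event boundaries or to form sub-events.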

The content of image, video, and audio resources can be analyzed using face, object, speech, and text recognition algorithms. The number of faces and their relative positions in a scene or sequence of scenes can reveal important details that provide context for the resources. For example, a large number of faces arranged in rows and columns indicates a formal posed context, which could apply to a family reunion, a team sport, a graduation, and so on. Additional information narrows the context: team uniforms with recognizable logos and text indicate a "sports event"; matching caps and gowns indicate a "graduation"; assorted clothing indicates a "family reunion"; and a white gown, matching colored gowns, and men in formal attire indicate a "wedding". Combined with additional extracted and derived metadata, these indications provide an accurate context that enables the system to select appropriate resources, to offer themes relevant to the selected resources, and to supply relevant additional resources for the original resource collection.

Story sharing, rules within themes:

A theme is a component of story sharing that enhances the presentation of the user's resources. A particular story is built from user-provided content, third-party content, and a description of how that content is presented. The presentation can be hard copy or soft copy, and still, video, or audio, or a combination of any or all of these. The theme influences the selection of third-party content and the types of presentation options the story uses. Presentation options include backgrounds, transitions between visual resources, effects applied to visual resources, and supplementary audio, video, or still content. If the presentation is soft copy, the theme also influences the time base, that is, the rate at which the content is presented.

In a story, presentation involves content and operations on that content. Note that the operations are affected by the type of content they operate on. Not all operations included in a particular theme are appropriate for all of the content a particular story contains.

When the story composer determines the presentation of a story, it develops a description of a series of operations on a given set of content. The theme can contain information that serves as a framework for this series of operations in the story. A comprehensive framework is used in "one-button" story composition. A less comprehensive framework is used when the user interactively controls the composition process. The series of operations is commonly called a template. A template can be thought of as an unpopulated story, that is, one to which no resources have been assigned. In any case, when resources are assigned to a template, the operations described in the template follow rules when they are applied to the content.

In general, the rules associated with a theme take resources as input arguments. The rules constrain which operations can be performed on which content during story composition. In addition, the rules associated with a theme can modify or enhance the series of operations, or template, so that the story can become more complex if the resources contain particular metadata.

Rule examples:

1) Not all image files have the same resolution, so not all image files can support the same range of zoom operations. A rule that limits the zoom operations on a particular resource is based on some combination of the metadata associated with the resource, for example, resolution, subject distance, subject size, or focal length.

2) The operations used in composing a story depend on the presence of resources with particular metadata characteristics, or on the ability to apply a particular algorithm to a resource. If the presence or applicability condition cannot be met, the operation cannot be included for that resource. For example, if the composition search attribute is looking for "tree" and the collection contains no pictures of trees, no picture is selected. Consequently, an algorithm that looks for pictures of "Christmas tree ornaments" cannot be applied afterwards.

3) Some operations require two (or possibly more) resources. A transition is an example that requires two resources. The description of the series of operations must reference the correct number of resources for each operation. In addition, the operation must be appropriate to the types of the resources it references: a transition cannot occur between an audio resource and a still image. In general, operations are type-specific; for example, one cannot zoom in on an audio resource.

4) Depending on the operations used and the constraints imposed by the theme, the order of the operations performed on a resource may be constrained. For example, the composition process may require a pan to precede a zoom operation.

5) A particular theme can prohibit particular operations. For example, a story might contain no video content, only still images and audio.

6) A particular theme can limit the presentation time that any particular resource or resource type may have in a story. In this case the display, show, or play operations are limited. For audio or video, such a rule requires the composer to perform temporal preprocessing before the resource is included in the description of the series of operations.

7) A theme with a comprehensive framework may contain references to operations that do not exist in a particular version of the composer. The theme therefore needs to include operation substitution rules. Substitution applies particularly to transitions. A "wipe" between two resources can have several blending effects. If the composer cannot describe the more advanced transition, a simple sharp-edged wipe can be used as a substitute. Note that the renderer, which presents the transitions described in the story descriptor, also has substitution rules. In many cases an unsupported operation can be replaced by a null operation.

8) The rules of a particular theme can check whether a resource contains particular metadata. If it does, additional operations defined by the template in the theme can be performed on that resource. A particular theme can thus allow conditional execution of operations on content. This gives the appearance that the story changes dynamically as a function of which resources are associated with it or, more specifically, as a function of which metadata is associated with those resources.
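Three of the rules above (the zoom limit of rule 1, the type check of rule 3, and the operation substitution of rule 7) can be sketched as follows. The class, function, and operation names are hypothetical, and the zoom formula is an assumed heuristic, since the description names the relevant metadata but not a calculation.

```python
from dataclasses import dataclass


@dataclass
class Resource:
    name: str
    kind: str       # "image", "audio" or "video"
    width: int = 0  # pixel width; 0 for non-visual resources


def max_zoom(resource: Resource, display_width: int = 1920) -> float:
    """Rule 1: allow zooming only while the crop still fills the display width."""
    if resource.kind != "image":
        return 1.0  # zoom is type-specific; no zoom for other kinds here
    return max(1.0, resource.width / display_width)


def can_transition(a: Resource, b: Resource) -> bool:
    """Rule 3: a visual transition needs two visual resources."""
    visual = {"image", "video"}
    return a.kind in visual and b.kind in visual


# Hypothetical operation sets for one particular composer version.
SUPPORTED = {"cut", "hard_wipe"}
FALLBACKS = {"blended_wipe": "hard_wipe"}


def substitute(operation: str) -> str:
    """Rule 7: replace an unsupported operation by a simpler one, else a null operation."""
    if operation in SUPPORTED:
        return operation
    fallback = FALLBACKS.get(operation)
    return fallback if fallback in SUPPORTED else "noop"
```

A renderer could apply the same `substitute` logic with its own table of supported operations, in keeping with the note in rule 7 that the renderer also has substitution rules.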

Rules for commercial constraints:

Depending on the particular embodiment, a theme can restrict operations according to the sophistication or price of the composer, or according to user privileges. Rather than distributing different sets of themes to different composers, a single theme constrains the operations allowed in the composition process based on an identifier of the composer or of the user class.

Story sharing, additional applicable rules:

Presentation rules can be a component of a theme. When a theme is selected, the rules in the theme descriptor are embedded in the story descriptor. Presentation rules can also be embedded in the composer. A story descriptor can reference a large number of renditions that can be derived from a particular primary resource. Because the renditions referenced in the story descriptor must be produced and stored somewhere in the system before they can be referenced, including more renditions lengthens the time needed to compose a story. However, producing renditions makes rendering the story more efficient, especially for multimedia playback. As with the rules described for theme selection, the number and format of the renditions derived from a primary resource during composition are weighted most heavily by the renderings requested and recorded in the user profile, followed by those selected by the general population.

Rendering rules are a component of an output descriptor. When the user selects an output descriptor, these rules help direct the rendering process. A particular story descriptor references the primary encoding of a digital resource. For a still image, this is the original digital negative (ODN). The story descriptor will likely reference other renditions of this primary resource. An output descriptor will likely be associated with a particular output device, so rules exist in the output descriptor for selecting the particular rendition to be used for rendering.

Theme selection rules are embedded in the composer. User input to the composer and the metadata present in the user content guide the theme selection process. The metadata associated with a particular collection of user content may lead to the suggestion of several themes. The composer has access to a database that indicates which of the themes suggested on the basis of metadata has the highest probability of being selected by the user. The rules weight most heavily the themes that fit the user profile, followed by the themes selected by the general population.
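The weighting described above, user profile first and general popularity second, can be sketched as a weighted score. The function name, the 0-to-1 score ranges, and the default weight of 0.7 are assumptions for illustration; the description specifies only the order of priority.

```python
from typing import Dict, List


def rank_themes(candidates: List[str],
                profile_fit: Dict[str, float],
                popularity: Dict[str, float],
                profile_weight: float = 0.7) -> List[str]:
    """Order suggested themes, weighting user-profile fit above general popularity."""
    def score(theme: str) -> float:
        return (profile_weight * profile_fit.get(theme, 0.0)
                + (1.0 - profile_weight) * popularity.get(theme, 0.0))
    return sorted(candidates, key=score, reverse=True)
```

With these weights, a theme that fits the user profile well outranks a theme that is merely popular with the general population.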

Referring to Fig. 8, an exemplary fragment of a story sharing descriptor file is shown, which in this example defines a "slideshow" output format. The XML code begins with a standard header 801, and the resources to be included in the output product begin at the resource list 802. The variable information filled in by the composer module described above is shown in bold. The resources included in this descriptor file run from AASID0001 803 to ASID0005 804 and comprise MP3 audio files and JPG image files located in a local resource directory. Resources can be located on any storage device connected to the local system, or on a network server such as an Internet web site. This exemplary slideshow also displays the resource artist name 805. Shared resources, such as the background image resource 806 and the audio file 803, are also included in the slideshow. The story sharing information begins at the line "story shared segment" 807. The audio duration 808 is defined as 45 seconds. The display of resource ASID0001.jpg 809 is programmed with a display duration 810 of 5 seconds. The next resource, ASID0002.jpg 812, is programmed with a display duration 811 of 15 seconds. Various other specifications for presenting the resources in the slideshow are also included in this exemplary fragment of the descriptor file; they are well known to those skilled in the art and are not described further.
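Because Fig. 8 is not reproduced in this text, the following sketch uses an invented XML fragment to show how a renderer might read display durations from such a descriptor. All element and attribute names are hypothetical; only the file names and the 45-, 5-, and 15-second values come from the description.

```python
import xml.etree.ElementTree as ET

# Hypothetical fragment; the element and attribute names are illustrative only.
DESCRIPTOR = """
<storyshare output="slideshow">
  <resources>
    <resource id="ASID0001" file="ASID0001.jpg" type="image"/>
    <resource id="ASID0002" file="ASID0002.jpg" type="image"/>
  </resources>
  <story audioDuration="45">
    <show ref="ASID0001" duration="5"/>
    <show ref="ASID0002" duration="15"/>
  </story>
</storyshare>
"""


def display_durations(xml_text: str) -> dict:
    """Map each shown resource file to its display duration in seconds."""
    root = ET.fromstring(xml_text.strip())
    files = {r.get("id"): r.get("file") for r in root.iter("resource")}
    return {files[s.get("ref")]: float(s.get("duration"))
            for s in root.iter("show")}
```

Keeping the resource list separate from the presentation instructions, as in this fragment, is what allows the same descriptor to be reused for a different output format.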

Fig. 9 shows the slideshow output segment 900 for the two resources described above, ASID0001.jpg 910 and ASID0002.jpg 920. Resource ASID0003.jpg 930 has a display duration of 5 seconds in the slideshow segment. Fig. 10 illustrates reuse of the same story sharing descriptor file from Fig. 8 that generated the slideshow of Fig. 9, this time with a collage output format 1000. The collage output format gives a non-temporal representation, for example an increased size, of the temporal emphasis given to resource ASID0002.jpg 1020 in the slideshow format, because ASID0002.jpg 1020 has a longer duration than the other resources, ASID0001.jpg 1010 and ASID0003.jpg 1030. This illustrates the effect of resource duration in two different outputs, a slideshow and a collage.
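The mapping from temporal emphasis to size can be sketched as a relative scale factor. Linear proportionality to the display duration is an assumption; the description states only that the longer-displayed resource appears larger in the collage.

```python
from typing import Dict


def collage_scale(durations: Dict[str, float]) -> Dict[str, float]:
    """Give each resource a relative size factor; the shortest duration maps to 1.0."""
    shortest = min(durations.values())  # durations are assumed positive and non-empty
    return {name: seconds / shortest for name, seconds in durations.items()}
```

With the durations from the slideshow example (5, 15, and 5 seconds), ASID0002.jpg receives a factor three times that of the other two resources.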

List of parts

6 digital camera

10 computer system

12 data/address bus

14 CPU

16 read-only memory

18 network connection device

20 hard disk drive

22 random access memory

24 display interface device

26 audio interface device

28 desktop interface device

30 CD-R/W drive

32 DVD drive

34 USB interface device

40 DVD-based removable media, such as DVD-R or DVD+R

42 CD-based removable media, such as CD-ROM or CD-R/W

44 mouse

46 keyboard

48 microphone

50 speaker

52 video display

60 network

110 resources

111 theme descriptor and template file

112 default story sharing descriptor file

113 output descriptor file

114 story composer/editor module

115 composed story sharing descriptor file

116 story renderer/viewer module

117 story authoring module

118 different outputs produced

200 user accepts suggested theme?

210 user selects theme

220 use metadata to obtain theme-specific third-party resources and effects

230 user accepts theme-specific resources and effects?

240 organized user resources + theme-specific resources and effects

250 obtain alternative theme-specific third-party resources and effects

260 to preview module

270 organized user resources + theme-specific resources and effects

280 user selects desired output type

290 output type look-up table

300 apply theme-specific effects to organized user resources and theme-specific resources for the desired output type

310 present virtual output type draft, including resource/output parameters, to user

320 resource/output parameter look-up table

390 output format look-up table

400 virtual output draft

410 user approves?

420 produce output product

430 deliver output product

600 user ID/profile

610 user resource collection

620 acquire existing metadata

630 extract new metadata

640 process metadata

650 use metadata to organize and rank-order resources

660 automatic resource selection?

670 user resource selection

680 can metadata suggest a theme?

690 theme look-up table

700 XML code

710 resource

720 seconds

730 resource

800 slideshow representation

801 standard header

802 resource list

803 "AASID0001"

804 "ASID0005"

805 resource artist name

806 background image resource

807 story shared segment

808 audio duration

809 display of resource ASID0001.jpg

810 resource

811 15-second display duration

812 resource ASID0002.jpg

820 resource

830 resource

900 collage representation

910 resource

920 resource

930 resource

1000 collage output format

1010 ASID0001.jpg

1020 ASID0002.jpg

1030 ASID0003.jpg

Claims

1. system comprises:

Be used to store the memory storage of digital media resource;

Subject description symbol application program, its theme that is used for selecting are handled and are applied to the resource selected automatically;

The composer application program, it is used for selecting automatically theme to handle and resource, shares descriptor file with the story of the theme of writing the resource that comprises selection and selection;

The performance application program, it is used for explaining the information of the shared descriptor file of story, and be used to produce at least one output descriptor file, this output descriptor file is shared descriptor file corresponding to story, and comprise with by this at least one export descriptor file at the corresponding output information of output unit of at least one selection; And

The preview application program, it is used for showing based on described at least one output descriptor file at least one expression of output products, and the output unit of wherein said at least one selection shows corresponding output products based on described at least one output descriptor file.

2. the system as claimed in claim 1, wherein subject description symbol application program comprises and is used for handling the effect that is applied to the resource selected automatically with described theme, and wherein this system further comprises:

The rule database that comprises the addressable rule of composer application program, it is used to check whether the application of theme and effect is feasible for the resource of selecting, and wherein said performance application program is revised at least one described resource according to described rule database and described at least one output descriptor file.

3. The system as claimed in claim 1, wherein the digital media resources are selected from text, graphics, images, video, audio, or multimedia presentations.

4. The system as claimed in claim 1, wherein the output product is selected from a print, a photo album, a poster, a video, a DVD, a digital slideshow, a downloadable movie, a digital file, or a web site.

5. The system as claimed in claim 1, wherein the story-share descriptor file and the output descriptor file are in XML format.

6. The system as claimed in claim 2, wherein the rules include one or more of the following: theme-dependent rules, scaling rules, algorithm applicability based on resource metadata, multi-resource operations, operation sequence, operation substitution rules, price constraints, user privileges, and rendering rules.

7. The system as claimed in claim 1, wherein the theme treatment includes at least one of a birthday, anniversary, vacation, holiday, family, humor, special occasion, friends, or sports theme.

8. The system as claimed in claim 1, wherein the output device is selected from an HDTV, a digital picture frame, a printer, a video monitor, a mobile phone, or a PDA.
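By way of non-limiting illustration only, the feasibility check performed against the rule database of claims 2 and 6 might be sketched as follows. The `Resource` fields, the effect names `zoom` and `sepia`, the 1024-pixel threshold, and both rule functions are assumptions introduced for this example; the claims do not specify how rules are encoded.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Resource:
    """A digital media resource with a little metadata (hypothetical fields)."""
    name: str
    kind: str          # e.g. "image", "video", "audio"
    width: int = 0
    height: int = 0


# A rule answers: is this effect feasible for this resource?
Rule = Callable[[Resource, str], bool]


def scaling_rule(resource: Resource, effect: str) -> bool:
    """Hypothetical scaling rule: zooming needs at least 1024 px on the long side."""
    if effect != "zoom":
        return True
    return max(resource.width, resource.height) >= 1024


def applicability_rule(resource: Resource, effect: str) -> bool:
    """Hypothetical metadata rule: visual effects apply only to images and video."""
    if effect in {"zoom", "sepia"}:
        return resource.kind in {"image", "video"}
    return True


# Stand-in for the rule database: an ordered list of rule functions.
RULE_DATABASE: List[Rule] = [scaling_rule, applicability_rule]


def feasible_effects(resource: Resource, effects: List[str],
                     rules: List[Rule] = RULE_DATABASE) -> List[str]:
    """Keep only the effects that every rule allows for this resource."""
    return [e for e in effects if all(rule(resource, e) for rule in rules)]
```

In this sketch a composer would call `feasible_effects` for each selected resource before writing the descriptor file, so that an effect unsuited to a low-resolution image or to an audio clip is dropped rather than applied.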

9. A computer-implemented method comprising the steps of:

selecting a subset of computer-accessible digital resources;

selecting a computer-accessible story theme;

composing a story descriptor file that includes the subset of digital resources and the digital story theme;

generating a plurality of output descriptor files, each based on the story descriptor file;

accessing a rule database, including modifying at least one digital resource in the subset of digital resources in response to the rules; and

outputting a plurality of image products on an output device, each image product corresponding to one of the output descriptor files.

10. The method as claimed in claim 9, wherein the step of generating a plurality of output descriptor files comprises generating a plurality of output descriptor files each corresponding to one of a plurality of different output devices, and wherein the outputting step comprises outputting, on each of the plurality of different output devices, its corresponding image product based on one of the output descriptor files.

11. The method as claimed in claim 9, wherein the digital resources include images, video, audio, and multimedia presentations.

12. The method as claimed in claim 9, wherein the image products are selected from a print, a photo album, a poster, a video, a DVD, an HDTV, a digital picture frame, a digital slideshow, a downloadable movie, or a web site.

13. The method as claimed in claim 9, wherein the story descriptor file and the output descriptor files are in XML format.

14. The method as claimed in claim 9, wherein the rule database includes: theme-dependent rules, scaling rules, algorithm applicability based on resource metadata, multi-resource operation rules, operation sequence rules, operation substitution rules, price constraint rules, user privilege rules, and rendering rules.

15. The method as claimed in claim 9, further comprising the step of displaying a preview representation of at least one of the image products before the outputting step.

16. The method as claimed in claim 9, wherein the output device is selected from an HDTV, a digital picture frame, a printer, a video monitor, a mobile phone, or a PDA.

17. A computer-readable program storage device embodying a program of instructions executable by a computer to perform the method steps of claim 9.

18. The program storage device as claimed in claim 17, wherein the method steps further comprise the step of displaying a preview representation of one or more of the image products.

19. The program storage device as claimed in claim 17, wherein the digital resources include images, video, audio, and multimedia presentations.

20. The program storage device as claimed in claim 17, wherein the image products are selected from a print, a photo album, a poster, a video, a DVD, a digital picture frame, a digital slideshow, a downloadable movie, and a web site.

21. The program storage device as claimed in claim 17, wherein the story descriptor file and the output descriptor files are in XML format.

22. The program storage device as claimed in claim 17, wherein the rule database includes: theme-dependent rules, scaling rules, algorithm applicability based on resource metadata, multi-resource operation rules, operation sequence rules, operation substitution rules, price constraint rules, user privilege rules, and rendering rules.
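By way of non-limiting illustration only, the flow recited in claims 9, 10 and 13 (one XML story descriptor file yielding several device-specific XML output descriptor files) might be sketched as follows. The element and attribute names (`story`, `resource`, `output`, `href`, `device`, `product`), the `DEVICE_PROFILES` table, and its per-device limits are assumptions made for this example; the claims state only that the descriptor files are in XML format.

```python
import xml.etree.ElementTree as ET

# Hypothetical per-device profiles: which product a device shows and how many
# resources that product can hold. These values are illustrative assumptions.
DEVICE_PROFILES = {
    "printer": {"product": "collage", "max_resources": 3},
    "mobile_phone": {"product": "slideshow", "max_resources": 2},
}


def compose_story_descriptor(resources, theme):
    """Build a story descriptor holding the selected resources and theme."""
    story = ET.Element("story", theme=theme)
    for name in resources:
        ET.SubElement(story, "resource", href=name)
    return story


def generate_output_descriptor(story, device):
    """Derive one output descriptor from the story descriptor for one device."""
    profile = DEVICE_PROFILES[device]
    output = ET.Element("output", device=device,
                        product=profile["product"], theme=story.get("theme"))
    # Keep only as many resources as the target product can hold.
    for res in story.findall("resource")[:profile["max_resources"]]:
        ET.SubElement(output, "resource", href=res.get("href"))
    return output


def generate_all(story, devices):
    """Return a mapping of device name to serialized output descriptor XML."""
    return {
        device: ET.tostring(generate_output_descriptor(story, device),
                            encoding="unicode")
        for device in devices
    }
```

A caller would compose the story descriptor once, for example from `ASID0001.jpg`, `ASID0002.jpg` and `ASID0003.jpg` with a birthday theme, and then pass it to `generate_all` with the list of target devices. Each resulting XML string can then be handed to the corresponding device or preview step.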