WO2013074895A2 - Automatic tag generation based on image content - Google Patents
Automatic tag generation based on image content Download PDFInfo
- Publication number
- WO2013074895A2 WO2013074895A2 PCT/US2012/065467 US2012065467W WO2013074895A2 WO 2013074895 A2 WO2013074895 A2 WO 2013074895A2 US 2012065467 W US2012065467 W US 2012065467W WO 2013074895 A2 WO2013074895 A2 WO 2013074895A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- image
- information
- photo
- captured
- location
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/22—Character recognition characterised by the type of writing
- G06V30/224—Character recognition characterised by the type of writing of printed characters having additional code marks or containing code marks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/58—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/58—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/587—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using geographical or spatial information, e.g. location
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/78—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/10—Terrestrial scenes
Definitions
- tags are used to aid in the sorting, saving, and searching of digital photos.
- Tagging refers to a process of assigning keywords to digital data.
- the digital data can then be organized according to the keywords or 'tags'.
- the subject matter of a digital photo can be used to create keywords that are then associated with that digital photo as one or more tags.
- tags can be manually added to a particular digital photo to help in the categorizing and searching of the photos
- tags can be manually added to a particular digital photo to help in the categorizing and searching of the photos
- most cameras assign automatic tags of date and time to the digital photos.
- more and more cameras are including geographic location as part of the automatic tags of a photo.
- software solutions have been developed to provide automatic identification of the people in photos (and matching to a particular identity).
- information regarding certain conditions including, but not limited to, weather, geographical landmarks, architectural landmarks, and prominent ambient features can be extracted from an image.
- the time and geographic location metadata of a photo is used to extract the weather for that particular location and time. The extraction can be performed by querying weather databases to determine the weather for the particular location and time in which the photo was taken.
- geographic location metadata of a photo and image recognition is used to extract geographical and architectural landmarks.
- image recognition is used to extract prominent ambient features (including background, color, hue, and intensity) and known physical objects from images, and tags are automatically assigned to the photo based on the extracted features and objects.
- a database of keywords or object identifiers can be provided to be used as tags when one or more certain conditions are recognized in a photo.
- keywords or object identifiers associated with that particular condition are automatically assigned as tags for the photo.
- Tags previously associated with a particular photo can be used to generate additional tags.
- date information can be used to generate tags with keywords associated with that date, such as the season, school semester, holiday, and newsworthy event.
- recognized objects can be ranked by prominence and the ranking reflected as an additional tag.
- the database used in identifying the recognized objects can include various levels of specificity/granularity.
- FIG. 1 illustrates an automatic tag generation process in accordance with certain embodiments of the invention.
- FIG. 2 illustrates an image recognition process in accordance with certain embodiments of the invention.
- FIG. 3 shows an automatic tag generation process flow in accordance with certain embodiments of the invention.
- FIG. 4 illustrates a process of generating a tag by extracting an architectural landmark from a photo for an automatic tag generation process in accordance with an embodiment of the invention.
- FIG. 5 illustrates a process of generating a tag by extracting a geographical landmark from a photo for an automatic tag generation process in accordance with an embodiment of the invention.
- the automatic tagging can occur as a digital photo (or video) is loaded or otherwise transferred to a photo collection that may be stored on a local, remote, or distributed database. In other embodiments, the automatic tagging can occur upon the initiation of a user in order to tag existing photos.
- An image can include, but is not limited to, the visual representation of objects, shapes, and features of what appears in a photo or a video frame.
- an image may be captured by a digital camera (in the form of a photo or as part of a video), and may be realized in the form of pixels defined by image sensors of the digital camera.
- the term "photo image” is used herein to refer to the image of a digital photo as opposed to metadata or other elements associated with the photo and may be used interchangeably with the term “image” without departing from the scope of certain embodiments of the invention.
- the meaning of the terms "photo,” “image,” and “photo image” will be readily understood from their context.
- an image may refer to the visual representation of the electrical values obtained by the image sensors of a digital camera.
- An image file (and digital photo file) may refer to a form of the image that is computer- readable and storable in a storage device.
- the image file may include, but is not limited to, a .jpg, .gif, and .bmp.
- the image file can be reconstructed to provide the visual representation ("image") on, for example, a display device or substrate (e.g., by printing onto paper).
- Metadata written into a digital photo file often includes information identifying who owns the photo (including copyright and contact information) and the camera (and settings) that created the file, as well as descriptive information such as keywords about the photo for making the file searchable on a user's computer and/or over the Internet.
- Some metadata is written by the camera, while other metadata is input by a user either manually or automatically by software after transferring the digital photo file to a computer (or server) from a camera, memory device, or another computer.
- an image and its metadata are used to generate additional metadata.
- the additional metadata is generated by being extracted or inferred from the image and the metadata for the image.
- the metadata for the image can include the geo-location and date the image was taken, and any other information associated with the image that is available.
- the metadata for the image can be part of the image itself or provided separately.
- the metadata is part of the image itself, the data is first extracted from the digital file of the image before being used to generate the additional metadata. Once generated, the additional metadata can then be associated back to the original image or used for other purposes.
- the extracted and/or created metadata and additional metadata can be associated with the original image as a tag.
- One type of tag is a keyword tag.
- the keyword tag may be used in connection with performing operations on one or more images such as, for example, sorting, searching and/or retrieval of image files based on tags having keywords matching specified criteria.
- FIG. 1 illustrates an automatic tag generation process in accordance with certain embodiments of the invention.
- a photo having an image and corresponding metadata is received 100.
- the automatic tagging process of an embodiment of the invention can automatically begin upon receipt of the photo.
- the process can begin upon the user uploading a photo image file to a photo sharing site.
- the process can begin upon the user loading the photo from a camera onto a user's computer.
- a user's mobile phone can include an application for automatic tag generation where upon capturing an image using the mobile phone's camera or selecting the application, the tagging process can begin.
- Metadata associated with the photo is extracted 110.
- the extraction of the metadata can include reading and parsing the particular type(s) of metadata associated with the photo.
- the types of metadata that can be extracted may include, but are not limited to Exchangeable Image File Format (EXIF), International Press Telecommunication Council (IPTC), and Extensible Metadata Platform (XMP).
- image recognition is performed 120 to recognize and identify shapes and objects in the photo image.
- the particular image recognition algorithm used during the performing of the image recognition can be any suitable image or pattern recognition algorithm available for the particular application or processing constraints.
- the image recognition algorithm may be limited by available databases for providing the matching of objects in the photo to known objects.
- an image recognition algorithm can involve pre-processing of the image. Preprocessing can include, but is not limited to, adjusting the contrast of the image, converting to greyscale and/or black and white, cropping, resizing, rotating, and a combination thereof.
- a distinguishing feature such as (but not limited to) color, size, or shape, can be selected for use in detecting a particular object.
- a distinguishing feature such as (but not limited to) color, size, or shape
- multiple features providing distinguishing characteristics of the object may be used.
- Edge detection or border recognition
- Morphology may be performed in the image recognition algorithm to conduct actions on sets of pixels, including the removal of unwanted components.
- noise reduction and/or filling of regions may be performed.
- the one or more objects can each be located in the image and then classified.
- the located object(s) may be classified (i.e. identified as a particular shape or object) by evaluating the located object(s) according to particular specifications related to the distinguishing feature(s).
- the particular specifications may include mathematical calculations (or relations).
- pattern matching may be performed instead of (or in addition to) locating recognizable objects in the image. Matching may be carried out by comparing elements and/or objects in the image to "known" (previously identified or classified) objects and elements.
- the results (e.g., values) of the calculations and/or comparisons may be normalized to represent a best fit for the classifications, where a higher number (e.g., 0.9) signifies a higher likelihood of being correctly classified as the particular shape or object than a normalized result of a lower number (e.g., 0.2).
- a threshold value may be used to assign a label to the identified object.
- the image recognition algorithms can utilize neural networks (NN) and other learning algorithms.
- a video signal can be received by certain systems described herein and undergo an automatic tag generation process as described in accordance with certain embodiments of the invention.
- one or more video frames of a video signal can be received, where the video frame may include an image and metadata, and image recognition and metadata extraction can be performed.
- a first pass recognition step can be performed for an image to identify that a basic shape or object exists in the image. Once the basic shape or object is identified, a second pass recognition step is performed to obtain a more specific identification of the shape or object. For example, a first pass recognition step may identify that a building exists in the photo, and a second pass recognition step may identify the specific building. In one embodiment, the step of identifying that a building exists in the photo can be accomplished by pattern matching between the photo and a set of images or patterns available to the machine/device performing the image recognition. In certain embodiments, the result of the pattern matching for the first pass recognition step can be sufficient to identify the shape or object with sufficient specificity such that no additional recognition step is performed.
- the extracted metadata can be used to facilitate the image recognition by, for example, providing hints as to what the shape or object in the photo may be.
- geographical information extracted from the metadata can be used to facilitate the identification of the specific building.
- the performing of the image recognition 120 can be carried out using the image recognition process illustrated in FIG. 2. Referring to FIG. 2, a basic image recognition algorithm can be used to identify an object in an image 221. This image recognition algorithm is referred to as "basic" to indicate that the image recognition process in step 221 is not using the extracted metadata and should not be construed as indicating only a simplistic or otherwise limited process.
- the image recognition algorithm can be any suitable image or pattern recognition algorithm available for the particular application or processing constraints, and can also involve pre-processing of the image.
- the extracted metadata 211 can be used to obtain a name or label for the identified object by querying a database (e.g., "Identification DB") 222.
- the database can be any suitable database containing names and/or labels providing identification for the object within the constraints set by the query.
- the names and/or labels resulting from the Identification DB query can then be used to query a database (e.g., "Picture DB”) containing images to find images associated with the names and/or labels 223.
- the images resulting from the Picture DB search can then be used to perform pattern matching 224 to more specifically identify the object in the image.
- a score can be provided for how similar the images of objects resulting from the Picture DB search are to the identified object in the image undergoing the image recognition process.
- the basic image recognition 221 may be used to identify the OBJECT "building” and the algorithm may return, for example, "building,” “gray building,” or “tall building.”
- the extracted metadata 211 is the longitude and latitude at which the photo was taken (may be within a range on the order of ⁇ 10 2 feet)
- a query of an Identification DB 222 may be "find all buildings close to this geographical location" (where the geographical location is identified using the longitude and latitude provided by the extracted metadata).
- the Picture DB can be queried 223 to "find all known pictures for each of those specific buildings" (where the specific buildings are the identified buildings from the query of the Identification DB). Pattern matching 224 can then be performed to compare the images obtained by the query of the Picture DB with the image undergoing the image recognition process to determine whether there is a particularly obvious or close match.
- the relative location of objects to one another may also be recognized.
- an advanced recognition step can be performed to recognize that an identified boat is on an identified river or an identified person is in an identified pool.
- the extracted metadata and recognized/identified objects in the photo can then be used to obtain additional information for the photo by being used in querying databases for related information 130.
- Word matching can be performed to obtain results from the query. This step can include using geographical information, date/time information, identified objects in the image, or various combinations thereof to query a variety of databases to obtain related information about objects in the photo and events occurring in or near the photo.
- the results of the database querying can be received 140 and used as tags for the photo 150.
- a photo having an extracted date of November 24, 2011, an extracted location in the United states, and a recognized object of a cooked turkey on a table can result in an additional information tag of "Thanksgiving," whereas an extracted location of outside of the United states would not necessarily result in the tag of the additional information of "Thanksgiving" for the same image.
- a photo having an extracted date of the 2008 United States presidential election and an image recognized President Obama can result in an additional information tag of "presidential election" or, if the time also matches, the additional information tag can include "acceptance speech.”
- FIG. 3 illustrates an automatic tagging process in accordance with certain embodiments of the invention. Similar to the process described with respect to FIG. 1, a photo having an image 301 and corresponding metadata 302 is received. Any geographic information (310) and date/time information (320) available from the metadata 202 is extracted. If no geographic information and date/time information is available, a null result may be returned (as an end process).
- the image 301 is input into an image classifier 330 that scans for known objects (i.e. objects having been defined and/or catalogued in a database used by the image classifier) and identifies and extracts any known physical objects in the image.
- the image classifier uses a database of shapes and items (objects) to extract as much data as possible from the image.
- the image classifier can search and recognize a variety of objects, shapes, and/or features (e.g., color). Objects include, but are not limited to, faces, people, products, characters, animals, plants, displayed text, and other distinguishable content in an image.
- the database can include object identifiers (metadata) in association with the recognizable shapes and items (objects).
- the sensitivity of the image classifier can enable identification of an object even where only partial shapes or a portion of the object is available in the image.
- the metadata obtained from the image classifier process can be used as tags for the photo. The metadata may be written back into the photo or otherwise associated with the photo and stored (335).
- additional tags can be automatically generated by utilizing a combination of the metadata.
- the image can undergo one or more passes for identification and extraction of a variety of recognized features.
- a confidence value representing a probability that the recognized feature was correctly identified can be provided as part of the tag associated with the photo.
- the confidence value may be generated as part of the image recognition algorithm.
- the confidence value is the matching weight (which may be normalized) generated by the image recognition algorithm when matching a feature/object in the image to a base feature (or particular specification).
- the generated confidence value will depend on the algorithm being used and the delta between the images.
- the result may indicate a 90% match if the algorithm recognizes edges and colors, and in another case, the result may indicate a 100% match if the algorithm is only directed to edges, not color.
- the confidence values can be in the form of a table with levels of confidence.
- the table can be stored as part of the tags themselves.
- the table can include an attribute and associated certainty. For example, given a photo of a plantain (in which it is not clear that the plantain is a plantain or a banana), the photo (after undergoing an automatic tag generation process in accordance with an embodiment of the invention) may be tagged with Table 1 below. It should be understood that the table is provided for illustrative purposes only and should not be construed as limiting the form, organization, or attribute selection.
- the photo of the plantain may be obtained along with the Table 1.
- the user may, in some cases, be able to remove any attributes in the table that the user knows are incorrect and change the confidence value (or certainty) of the attribute the user knows is correct to 100% (or 1).
- the corrected table and photo can be used in an image matching algorithm to enable the image recognition algorithm to be more accurate.
- the extracted geographical information is used to facilitate a landmark recognition pass (340), through which the image is input, to identify and extract any recognized landmarks (geographical or architectural). Confidence values can also be associated with the tags generated from the landmark recognition pass. The tags generated from the landmark recognition pass can by written back into the photo image file or otherwise associated with the image and stored (345).
- a weather database is accessed to extrapolate the weather/temperature information at the time/location at which the image was captured by using the extracted metadata of geographical information and date/time information (350).
- the weather/temperature information can be written back into the photo or otherwise associated with the photo and stored (355).
- the automatic tags generated from each process may be stored in a same or separate storage location.
- the databases used by the tag generating system can be local databases or databases associated with other systems.
- a database can be included having keywords or object identifiers for use as tags when one or more specific conditions such as (but not limited to) the weather, geographical landmarks, and architectural landmarks, are determined to be present in a photo.
- This database can be part of or separate from the database used and/or accessed by the image classifier.
- the databases accessed and used for certain embodiments of the subject automatic tag generation processes can include any suitable databases available to search engines, enabling matching between images and tags.
- geotags include geographical location information such as the latitudinal and longitudinal coordinates of the location where a photo is captured.
- Automatic geotagging typically refers to using a device (e.g., digital still camera, digital video camera, mobile device with image sensor) having a geographical positioning system (GPS) when capturing the image for a photo such that the GPS coordinates are associated with the captured image when stored locally on the image capturing device (and/or uploaded into a remote database).
- GPS geographical positioning system
- CelllD also referred to as CID and which is the identifying number of a cellular network cell for a particular cell phone operator station or sector
- CID also referred to as CID and which is the identifying number of a cellular network cell for a particular cell phone operator station or sector
- a specialized automatic geotagging for geographical and architectural landmarks can be accomplished.
- the date/time and location information of a digital photo can be extracted from metadata of the digital photo and a database searched using the date/time and location codes.
- the database can be a weather database, where a query for the weather at the location and date/time extracted from the digital photo returns information (or code) related to the weather for that particular location and time.
- the result of the query can provide weather code and/or descriptions that can be used as a tag such as "Mostly Sunny,” “Sunny,” “Clear,” “Fair,” “Partly Cloudy,” “Cloudy,” “Mostly Cloudy,” “Rain,” “Showers,” “Sprinkles,” and “T-storms.”
- the weather code may include other weather related descriptors such as “Cold,” “Hot,” “Dry,” and “Humid.” Seasonal information can also be included.
- the weather database being searched may not store weather information for the exact location and time used in the query.
- a best matching search can be performed and weather information (along with a confidence value) can be provided for possible best matches to the location and date/time.
- a weather database may contain weather information updated for each hour according to city. A query of that weather database could then return the weather information for the city that the location falls within or is nearest (e.g., the location may be outside of designated city boundaries) for the closest time(s) to the particular time being searched.
- image recognition is performed on the photo image to extract feature information and a tag associated with the recognized object or feature is automatically assigned to the photo.
- prominent ambient features can be extracted from photos by using image (or pattern) recognition. Predominant colors can be identified and used as a tag.
- the image recognition algorithms can search for whether sky is a prominent feature in the photo and what colors or other highlights are in the photo. For example, the image recognition can automatically identify "blue sky” or “red sky” or “green grass” and the photo can be tagged with those terms.
- image recognition can be used to find as many objects as possible and automatically tag the photo appropriately. If a baseball bat, or a football, or a golf club, or a dog, is detected by the image recognition algorithm, tags with those terms can be automatically added as tags to the photo. In addition the objects could be automatically ranked by prominence.
- the photo can be tagged "chair,” “baseball,” and “table.”
- an extra tag can be included with an indicator that the main subject is (or is likely to be) a chair.
- the granularity of the tags can evolve.
- the database can have increasing granularity of recognizable objects, such as "automobile” to "BMW automobile” to "BMW Z4 automobile.”
- known geographic landmarks can be determined and the information extracted from a photo by using a combination of image recognition and geotagging.
- Data from the photo image itself can be extracted via image recognition and the image recognized shapes or objects compared to known geographic landmarks at or near the location corresponding to the location information extracted from the metadata or geotag of the photo.
- This can be accomplished by querying a database containing geographical landmark information.
- the database can be associated with a map having names and geographic locations of known rivers, lakes, mountains, and valleys.
- the existence of a body of water in the photo image may be recognized using image recognition.
- Combining the recognition that water is in the photograph with a geotag associated with the photograph that indicates that the location the photo image was captured is on or near a particular known body of water can result in automatic generation of tags for the photo of the name of the known body of water.
- a photo with a large body of water and a geotag indicating a location in England along the river Thames can be automatically tagged with "River Thames" and "River.”
- FIG. 4 illustrates one such process. Referring to FIG. 4, image recognition of a photo image 401 showing sunrise over a river can result in a determination that a river 402 is in the image 401.
- this information can then be extracted from the image and applied as a tag and/or used in generating the additional metadata. For example, a more specific identification for the "river" 402 can be achieved using the photo's corresponding metadata 403.
- the metadata 403 may include a variety of information such as location metadata and date time metadata.
- the combination of the location metadata (from the metadata 403) and the image-recognized identified object (402) is used to generate additional metadata.
- the metadata 403 indicates a location (not shown) near the Mississippi River and the image recognized object is a river. This results in the generation of the identifier "Mississippi River," which can be used as a tag for the photo.
- a shape or object that is recognized as being a river can be tagged with "River.”
- a shape or object that is recognized as being a beach can be tagged with "Beach” or "Coast.”
- known architectural landmarks can also be determined from a photo by using a combination of image recognition and geotagging.
- Data from the photo image itself can be extracted via image recognition and the image recognized shapes or objects compared to known architectural landmarks at or near the location corresponding to the location information extracted from the metadata or geotag of the photo. This can be accomplished by querying a database containing architectural landmark information.
- the photo can be automatically tagged with name of the architectural landmark.
- Architectural landmarks including the Eiffel tower, the Great Wall of China, or the Great Pyramid of Giza can be recognized due to their distinctive shapes and/or features.
- the existence of a particular structure in the photo may be recognized using image recognition and the photo tagged with a word associated with that structure or feature.
- the name of the particular structure determined from searching a database can be an additional tag.
- FIG. 5 illustrates one such process.
- image recognition of a photo image 501 showing a person in front of the base of the Eiffel tower can result in a determination that a building structure 502 is in the image 501.
- this information can then be extracted from the image and applied as a tag and/or used in generating the additional metadata.
- the photo can be tagged with a word or words associated with the image-recognized object of "building structure.”
- a more specific identification for the "building structure” can be achieved using the photo's corresponding metadata 503.
- the metadata 503 can include a variety of information such as location metadata and date time metadata.
- the metadata 503 of the photo can also include camera specific metadata and any user generated or other automatically generated tags. This listing of metadata 503 associated with the photo should not be construed as limiting or requiring the particular information associated with the photo and is merely intended to illustrate some common metadata.
- the combination of the location metadata (from the metadata 503) and the image-recognized identified object (502) is used to generate additional metadata.
- the metadata 503 indicates a location (not shown) near the Eiffel tower and the image recognized object is a building structure. This results in the generation of the identifier "Eiffel tower," which can be used as a tag for the photo.
- Similar processes can be conducted to automatically generate a tag of recognizable objects. For example, if a highway is recognized in a photo, the photo can be tagged as "highway.” If a known piece of art is recognized, then the photo can be tagged with the name of the piece of art. For example, a photo of Rodin's sculpture, The Thinker, can be tagged with "The Thinker" and "Rodin.”
- the known object database can be one database or multiple databases that may be accessible to the image recognition program.
- the image recognition processing can be conducted after accessing a database of images tagged or associated with the location at which the photo was taken, enabling additional datasets for comparison.
- a live video stream (having audio and visual components) can be imported and automatically tagged according to image recognized and extracted data from designated frames.
- Ambient sound can also undergo recognition algorithms to have features of the sound attached as a tag to the video.
- speech and tonal recognition, music recognition, and sound recognition e.g., car horns, clock tower bells, claps
- the video can be automatically tagged with emotive based terms, such as "angry.”
- the environment in which the automatic tagging occurs includes a user device and a tag generator provider that communicates with the user device over a network.
- the network can be, but is not limited to, a cellular (e.g., wireless phone) network, the Internet, a local area network (LAN), a wide area network (WAN), a WiFi network, or a combination thereof.
- the user device can include, but is not limited to a computer, mobile phone, or other device that can store and/or display photos or videos and send and access content (including the photos or videos) via a network.
- the tag generator provider is configured to receive content from the user device and perform automatic tag generation.
- the tag generator provider communicates with or is a part of a file sharing provider such as a photo sharing provider.
- the tag generator provider can include components providing and carrying out program modules. These components (which may be local or distributed) can include, but are not limited to, a processor (e.g., a central processing unit (CPU)) and memory.
- the automatic tagging can be accomplished via program modules directly as part of a user device (which includes components, such as a processor and memory, capable of carrying out the program modules).
- a user device which includes components, such as a processor and memory, capable of carrying out the program modules.
- no tag generator provider is used.
- the user device communicates with database providers (or other user or provider devices having databases stored thereon) over the network or accesses databases stored on or connected to the user device.
- program modules include routines, programs, objects, components, and data structures that perform particular tasks or implement particular abstract data types.
- functionality of the program modules may be combined or distributed as desired over a computing system or environment.
- Examples of computing systems, environments, and/or configurations include, but are not limited to, personal computers, server computers, hand- held or laptop devices, multiprocessor systems, microprocessor-based systems, programmable consumer electronics, and distributed computing environments that include any of the above systems or devices.
- computer readable media includes removable and nonremovable structures/devices that can be used for storage of information, such as computer readable instructions, data structures, program modules, and other data used by a computing system/environment, in the form of volatile and non-volatile memory, magnetic-based structures/de vices and optical-based structures/devices, and can be any available media that can be accessed by a user device.
- Computer readable media should not be construed or interpreted to include any propagating signals.
- any reference in this specification to "one embodiment,” “an embodiment,” “example embodiment,” etc. means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the invention.
- the appearances of such phrases in various places in the specification are not necessarily all referring to the same embodiment.
- any elements or limitations of any invention or embodiment thereof disclosed herein can be combined with any and/or all other elements or limitations (individually or in any combination) or any other invention or embodiment thereof disclosed herein, and all such combinations are contemplated within the scope of the invention without limitation thereto.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Library & Information Science (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Management Or Editing Of Information On Record Carriers (AREA)
Priority Applications (10)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2014542484A JP2015501982A (ja) | 2011-11-17 | 2012-11-16 | 画像コンテンツに基づいた自動タグ生成 |
| IN3322CHN2014 IN2014CN03322A (https=) | 2011-11-17 | 2012-11-16 | |
| CA2855836A CA2855836A1 (en) | 2011-11-17 | 2012-11-16 | Automatic tag generation based on image content |
| MX2014006000A MX2014006000A (es) | 2011-11-17 | 2012-11-16 | Generacion de etiqueta automatica basandose en contenido de imagen. |
| CN201280056443.1A CN103930901A (zh) | 2011-11-17 | 2012-11-16 | 基于图像内容的自动标记生成 |
| EP12850387.7A EP2780863A4 (en) | 2011-11-17 | 2012-11-16 | AUTOMATIC PRODUCTION OF LABELS PER PICTURE CONTENT |
| RU2014119859A RU2608261C2 (ru) | 2011-11-17 | 2012-11-16 | Автоматическое генерирование тега на основании содержания изображения |
| BR112014011739A BR112014011739A8 (pt) | 2011-11-17 | 2012-11-16 | Método de geração de etiqueta automática e meio legível por computador |
| KR1020147013107A KR20140091554A (ko) | 2011-11-17 | 2012-11-16 | 이미지 컨텐트 기반 자동 태그 생성 기법 |
| AU2012340354A AU2012340354A1 (en) | 2011-11-17 | 2012-11-16 | Automatic tag generation based on image content |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US13/298,310 | 2011-11-17 | ||
| US13/298,310 US20130129142A1 (en) | 2011-11-17 | 2011-11-17 | Automatic tag generation based on image content |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| WO2013074895A2 true WO2013074895A2 (en) | 2013-05-23 |
| WO2013074895A3 WO2013074895A3 (en) | 2013-07-11 |
Family
ID=48426987
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/US2012/065467 Ceased WO2013074895A2 (en) | 2011-11-17 | 2012-11-16 | Automatic tag generation based on image content |
Country Status (12)
| Country | Link |
|---|---|
| US (1) | US20130129142A1 (https=) |
| EP (1) | EP2780863A4 (https=) |
| JP (1) | JP2015501982A (https=) |
| KR (1) | KR20140091554A (https=) |
| CN (1) | CN103930901A (https=) |
| AU (1) | AU2012340354A1 (https=) |
| BR (1) | BR112014011739A8 (https=) |
| CA (1) | CA2855836A1 (https=) |
| IN (1) | IN2014CN03322A (https=) |
| MX (1) | MX2014006000A (https=) |
| RU (1) | RU2608261C2 (https=) |
| WO (1) | WO2013074895A2 (https=) |
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN103353879A (zh) * | 2013-06-18 | 2013-10-16 | 北京智谷睿拓技术服务有限公司 | 图像处理方法及设备 |
| CN104424262A (zh) * | 2013-08-29 | 2015-03-18 | 宏达国际电子股份有限公司 | 相关影像搜寻方法以及使用者界面控制方法 |
| WO2015101250A1 (zh) * | 2013-12-30 | 2015-07-09 | 腾讯科技(深圳)有限公司 | 图片处理方法及装置 |
Families Citing this family (112)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10786736B2 (en) | 2010-05-11 | 2020-09-29 | Sony Interactive Entertainment LLC | Placement of user information in a game space |
| US8989483B2 (en) * | 2011-06-10 | 2015-03-24 | Sri International | Method and apparatus for inferring the geographic location of captured scene depictions |
| WO2013116901A1 (en) * | 2012-02-08 | 2013-08-15 | Fairweather Corporation Pty Ltd. | Computing device for facilitating discounting and promotions using augmented reality |
| CN104126307B (zh) * | 2012-02-29 | 2018-02-06 | 杜比实验室特许公司 | 用于改善的图像处理和内容传递的图像元数据创建处理器及方法 |
| US9081798B1 (en) * | 2012-03-26 | 2015-07-14 | Amazon Technologies, Inc. | Cloud-based photo management |
| US9129179B1 (en) | 2012-05-10 | 2015-09-08 | Amazon Technologies, Inc. | Image-based object location |
| JP6351219B2 (ja) * | 2012-08-23 | 2018-07-04 | キヤノン株式会社 | 画像検索装置、画像検索方法及びプログラム |
| US20140089067A1 (en) * | 2012-09-23 | 2014-03-27 | Google Inc. | User rewards from advertisers for content provided by users of a social networking service |
| US10229138B2 (en) * | 2012-09-27 | 2019-03-12 | Nokia Technologies Oy | Method and apparatus for tagged deletion of user online history |
| US9817997B2 (en) | 2014-12-18 | 2017-11-14 | Banjo, Inc. | User-generated content permissions status analysis system and method |
| US9043329B1 (en) | 2013-12-19 | 2015-05-26 | Banjo, Inc. | Dynamic event detection system and method |
| US10678815B2 (en) | 2012-10-02 | 2020-06-09 | Banjo, Inc. | Dynamic event detection system and method |
| US9934368B2 (en) | 2012-10-02 | 2018-04-03 | Banjo, Inc. | User-generated content permissions status analysis system and method |
| US9652525B2 (en) | 2012-10-02 | 2017-05-16 | Banjo, Inc. | Dynamic event detection system and method |
| US10360352B2 (en) | 2012-10-02 | 2019-07-23 | Banjo, Inc. | System and method for event-based vehicle operation |
| KR20150011651A (ko) * | 2013-07-23 | 2015-02-02 | 주식회사 케이티 | 스토리텔링 컨텐츠를 제작하는 장치 및 방법 |
| US20150199872A1 (en) * | 2013-09-23 | 2015-07-16 | Konami Gaming, Inc. | System and methods for operating gaming environments |
| US20230156350A1 (en) * | 2013-09-30 | 2023-05-18 | Duelight Llc | Systems, methods, and computer program products for digital photography |
| US9058375B2 (en) | 2013-10-09 | 2015-06-16 | Smart Screen Networks, Inc. | Systems and methods for adding descriptive metadata to digital content |
| KR20150045721A (ko) * | 2013-10-21 | 2015-04-29 | 삼성전자주식회사 | 컨텐츠 제어 방법 및 그 전자 장치 |
| US10013639B1 (en) | 2013-12-16 | 2018-07-03 | Amazon Technologies, Inc. | Analyzing digital images based on criteria |
| WO2015103498A1 (en) * | 2014-01-03 | 2015-07-09 | Snapcious LLC | Method and system for distributed collection and distribution of photographs |
| CN103793521B (zh) * | 2014-02-18 | 2018-04-27 | 腾讯科技(深圳)有限公司 | 图片处理方法和装置 |
| KR102195899B1 (ko) | 2014-03-12 | 2020-12-28 | 삼성전자주식회사 | 컨텐트 관리 방법 및 이를 위한 클라우드 서버 |
| US9786010B2 (en) * | 2014-04-23 | 2017-10-10 | State Farm Mutual Automobile Insurance Company | Homeowners insurance application process using geotagged photos |
| US9459889B2 (en) | 2014-05-19 | 2016-10-04 | Qualcomm Incorporated | Systems and methods for context-aware application control |
| US10311303B2 (en) | 2014-05-22 | 2019-06-04 | Sony Corporation | Information processing apparatus, information processing method, and program |
| US20150356081A1 (en) * | 2014-06-04 | 2015-12-10 | Grandios Technologies, Llc | Advanced camera management function |
| CN105225212B (zh) * | 2014-06-27 | 2018-09-28 | 腾讯科技(深圳)有限公司 | 一种图片处理方法和装置 |
| US9697235B2 (en) * | 2014-07-16 | 2017-07-04 | Verizon Patent And Licensing Inc. | On device image keyword identification and content overlay |
| KR102279674B1 (ko) | 2014-09-02 | 2021-07-20 | 삼성전자주식회사 | 전자 장치의 멀티미디어 데이터 처리 방법 및 그 전자 장치 |
| US10216996B2 (en) * | 2014-09-29 | 2019-02-26 | Sony Interactive Entertainment Inc. | Schemes for retrieving and associating content items with real-world objects using augmented reality and object recognition |
| CN104318251B (zh) * | 2014-10-29 | 2018-05-01 | 广州视源电子科技股份有限公司 | 一种相片实时处理方法和装置 |
| KR102656605B1 (ko) | 2014-11-05 | 2024-04-12 | 삼성전자주식회사 | 복수의 단말기들 간의 화면 공유를 제어하는 방법, 장치 및 기록 매체 |
| US12401911B2 (en) | 2014-11-07 | 2025-08-26 | Duelight Llc | Systems and methods for generating a high-dynamic range (HDR) pixel stream |
| US12401912B2 (en) | 2014-11-17 | 2025-08-26 | Duelight Llc | System and method for generating a digital image |
| US9148475B1 (en) | 2014-12-01 | 2015-09-29 | Pleenq, LLC | Navigation control for network clients |
| WO2016130853A1 (en) * | 2015-02-11 | 2016-08-18 | AVG Netherlands B.V. | Systems and methods for identifying unwanted photos stored on a device |
| US9495614B1 (en) | 2015-02-27 | 2016-11-15 | Google Inc. | Verifying labels for images using image recognition |
| KR20160112391A (ko) | 2015-03-19 | 2016-09-28 | 정호재 | 태깅 기반 이미지 거래 시스템 및 방법 |
| WO2016157076A1 (en) * | 2015-03-30 | 2016-10-06 | Ghoson Ziad | Information processing system and method using image recognition |
| US12445736B2 (en) | 2015-05-01 | 2025-10-14 | Duelight Llc | Systems and methods for generating a digital image |
| CN105101073A (zh) * | 2015-05-22 | 2015-11-25 | 深圳市美贝壳科技有限公司 | 一种根据照片gps信息推送天气信息的方法 |
| US9563643B2 (en) * | 2015-06-25 | 2017-02-07 | Intel Corporation | Automatic metatagging in images |
| CN104965921A (zh) * | 2015-07-10 | 2015-10-07 | 陈包容 | 信息匹配的方法 |
| KR101606760B1 (ko) * | 2015-07-23 | 2016-03-28 | 연세대학교 산학협력단 | 영상 내 객체 기반 영상 감정 변환 장치 및 그 방법 |
| US10467284B2 (en) | 2015-08-03 | 2019-11-05 | Google Llc | Establishment anchoring with geolocated imagery |
| US20170039264A1 (en) * | 2015-08-04 | 2017-02-09 | Google Inc. | Area modeling by geographic photo label analysis |
| AU2015261614A1 (en) * | 2015-09-04 | 2017-03-23 | Musigma Business Solutions Pvt. Ltd. | Analytics system and method |
| US20170115853A1 (en) * | 2015-10-21 | 2017-04-27 | Google Inc. | Determining Image Captions |
| CN105472332B (zh) * | 2015-12-01 | 2018-07-20 | 广东弘栎电子设备有限公司 | 基于定位技术与视频技术的分析方法及其分析系统 |
| US11612092B1 (en) | 2015-12-18 | 2023-03-28 | Realmfive, Inc. | Agricultural system having actuatable work tool assemblies |
| CA3029134A1 (en) * | 2015-12-18 | 2017-06-22 | Realmfive, Inc. | Autonomous integrated farming system |
| CN105653633B (zh) * | 2015-12-25 | 2018-06-05 | 北京金山安全软件有限公司 | 一种图片景观信息的确定方法及装置 |
| JP6669390B2 (ja) * | 2016-03-09 | 2020-03-18 | キヤノン株式会社 | 情報処理装置、情報処理方法、及びプログラム |
| US11042753B2 (en) * | 2016-09-08 | 2021-06-22 | Goh Soo Siah | Video ingestion framework for visual search platform |
| DE102016012497A1 (de) * | 2016-10-19 | 2018-04-19 | PP Picturepartners GmbH | Vorrichtung zum Speichern, Verwalten und Zugriff auf digitale Fotografie, Verwendung einer Vorrichtung und Verfahren |
| AU2017348370A1 (en) | 2016-10-28 | 2019-06-13 | Axon Enterprise, Inc. | Systems and methods for supplementing captured data |
| CN111651628B (zh) * | 2016-12-07 | 2023-04-25 | 创新先进技术有限公司 | 基于图片实现业务操作的方法和装置 |
| CN106845068B (zh) * | 2016-12-13 | 2020-05-22 | 海纳医信(北京)软件科技有限责任公司 | 图片标注方法和装置 |
| US10970605B2 (en) | 2017-01-03 | 2021-04-06 | Samsung Electronics Co., Ltd. | Electronic apparatus and method of operating the same |
| CN108268575A (zh) * | 2017-01-04 | 2018-07-10 | 阿里巴巴集团控股有限公司 | 标注信息的处理方法、装置和系统 |
| US10290137B2 (en) * | 2017-01-31 | 2019-05-14 | Saudi Arabian Oil Company | Auto-generation of map landmarks using sensor readable tags |
| CN110036356B (zh) * | 2017-02-22 | 2020-06-26 | 腾讯科技(深圳)有限公司 | Vr系统中的图像处理 |
| CN107122393B (zh) * | 2017-03-09 | 2019-12-10 | 北京小度互娱科技有限公司 | 电子相册生成方法以及装置 |
| US12373503B2 (en) * | 2017-04-26 | 2025-07-29 | Cueback Technology, Inc. | Cue data model implementation for adaptive presentation of collaborative recollections of memories |
| US11151448B2 (en) * | 2017-05-26 | 2021-10-19 | International Business Machines Corporation | Location tagging for visual data of places using deep learning |
| CN107194370B (zh) * | 2017-06-10 | 2023-12-19 | 国网浙江省电力公司台州供电公司 | 基于地理位置信息的影像智能分析方法 |
| US20190012442A1 (en) * | 2017-07-06 | 2019-01-10 | Bylined Me, Inc. | Facilitating retrieval of permissions associated with a media item |
| US10970334B2 (en) | 2017-07-24 | 2021-04-06 | International Business Machines Corporation | Navigating video scenes using cognitive insights |
| US11074292B2 (en) * | 2017-12-29 | 2021-07-27 | Realwear, Inc. | Voice tagging of video while recording |
| US20190207889A1 (en) * | 2018-01-03 | 2019-07-04 | International Business Machines Corporation | Filtering graphic content in a message to determine whether to render the graphic content or a descriptive classification of the graphic content |
| AU2018201311B2 (en) * | 2018-02-22 | 2023-11-30 | Artlife Solutions Pty Ltd | A system and method for sorting digital images |
| US11163941B1 (en) * | 2018-03-30 | 2021-11-02 | Snap Inc. | Annotating a collection of media content items |
| CN108959345A (zh) * | 2018-04-11 | 2018-12-07 | 北京航星机器制造有限公司 | 一种安检图像记录快速回溯的方法和设备 |
| US11068705B2 (en) * | 2018-06-11 | 2021-07-20 | Cisco Technology, Inc. | Vector based object recognition in hybrid cloud |
| US10878037B2 (en) | 2018-06-21 | 2020-12-29 | Google Llc | Digital supplement association and retrieval for visual search |
| EP3811238A1 (en) * | 2018-06-21 | 2021-04-28 | Google LLC | Digital supplement association and retrieval for visual search |
| CN110647603B (zh) * | 2018-06-27 | 2022-05-27 | 百度在线网络技术(北京)有限公司 | 图像标注信息的处理方法、装置和系统 |
| US20200019968A1 (en) * | 2018-07-11 | 2020-01-16 | Capital One Services, Llc | System and method for authenticating transactions from a mobile device |
| KR102143985B1 (ko) * | 2018-07-18 | 2020-08-12 | 박노성 | 다수 이미지 배치-인쇄-절단 자동화 방법 |
| JP7095455B2 (ja) * | 2018-07-24 | 2022-07-05 | トヨタ自動車株式会社 | 情報処理装置および情報処理方法 |
| CN111078924B (zh) * | 2018-10-18 | 2024-03-01 | 深圳云天励飞技术有限公司 | 图像检索方法、装置、终端及存储介质 |
| US10936178B2 (en) * | 2019-01-07 | 2021-03-02 | MemoryWeb, LLC | Systems and methods for analyzing and organizing digital photos and videos |
| US11537653B2 (en) * | 2019-01-16 | 2022-12-27 | International Business Machines Corporation | Automated personalized identifier switching in view of closeness |
| CN109886258A (zh) * | 2019-02-19 | 2019-06-14 | 新华网(北京)科技有限公司 | 提供多媒体信息的关联信息的方法、装置及电子设备 |
| US11036785B2 (en) * | 2019-03-05 | 2021-06-15 | Ebay Inc. | Batch search system for providing batch search interfaces |
| CN109840285B (zh) * | 2019-03-12 | 2023-01-10 | 广东电网有限责任公司 | 基于目标的经纬度的图像自动命名方法 |
| US10887531B2 (en) | 2019-03-27 | 2021-01-05 | Rovi Guides, Inc. | Replacing a background portion of an image |
| US10944921B2 (en) | 2019-03-27 | 2021-03-09 | Rovi Guides, Inc. | Replacing a background portion of an image |
| KR102699095B1 (ko) * | 2019-04-01 | 2024-08-26 | 주식회사 넥슨코리아 | 통합 게임 신고 서비스를 제공하는 방법 및 장치 |
| KR102199247B1 (ko) * | 2019-04-02 | 2021-01-07 | 넷마블 주식회사 | 버그 제보를 위한 게임 서비스 제공 시스템 |
| CN114008607A (zh) * | 2019-06-04 | 2022-02-01 | 吉奥奎斯特系统公司 | 将地理标签应用于图像以用于识别勘探机会 |
| CN112348886B (zh) * | 2019-08-09 | 2024-05-14 | 华为技术有限公司 | 视觉定位方法、终端和服务器 |
| KR102093733B1 (ko) * | 2019-09-05 | 2020-03-26 | 김지수 | 이미지추천장치 및 이미지추천장치의 동작 방법 |
| CN110647886A (zh) | 2019-09-19 | 2020-01-03 | 北京百度网讯科技有限公司 | 兴趣点标注方法、装置、计算机设备和存储介质 |
| RU2742602C1 (ru) * | 2020-04-06 | 2021-02-09 | Самсунг Электроникс Ко., Лтд. | Распознавание событий на фотографиях с автоматическим выделением альбомов |
| KR20210076573A (ko) * | 2019-12-16 | 2021-06-24 | 삼성전자주식회사 | 태그를 제공하는 방법 및 이를 지원하는 전자 장치 |
| CN111355912A (zh) * | 2020-02-17 | 2020-06-30 | 江苏济楚信息技术有限公司 | 一种执法记录方法及系统 |
| KR102364881B1 (ko) * | 2020-03-24 | 2022-02-18 | 주식회사 에스아이에이 | 전문가 모사 모델 학습 방법 및 그 학습을 위한 장치 |
| IL273722A (en) * | 2020-03-31 | 2021-09-30 | Israel Aerospace Ind Ltd | Automatic labeling of objects in images |
| CN111639599B (zh) * | 2020-05-29 | 2024-04-02 | 北京百度网讯科技有限公司 | 物体图像挖掘方法、装置、设备以及存储介质 |
| JP7536545B2 (ja) * | 2020-07-22 | 2024-08-20 | キヤノン株式会社 | 画像処理装置およびその制御方法、並びにプログラム |
| KR20220020121A (ko) | 2020-08-11 | 2022-02-18 | 삼성전자주식회사 | 컨텐트와 연관된 텍스트를 제공하기 위한 전자 장치 및 그 동작 방법 |
| CN112235517B (zh) * | 2020-09-29 | 2023-09-12 | 北京小米松果电子有限公司 | 旁白添加方法、旁白添加装置及存储介质 |
| US20220101245A1 (en) * | 2020-09-29 | 2022-03-31 | International Business Machines Corporation | Automated computerized identification of assets |
| US11425852B2 (en) | 2020-10-16 | 2022-08-30 | Verdant Robotics, Inc. | Autonomous detection and control of vegetation |
| US11076589B1 (en) | 2020-10-16 | 2021-08-03 | Verdant Robotics, Inc. | Autonomous agricultural treatment system using map based targeting of agricultural objects |
| US11335091B1 (en) * | 2020-12-22 | 2022-05-17 | Sony Group Corporation | Enabling a device with camera and display to control other devices (universal remote control app) |
| US11399531B1 (en) | 2021-10-20 | 2022-08-02 | Verdant Robotics, Inc. | Precision detection and control of vegetation with real time pose estimation |
| US12288238B2 (en) * | 2022-06-02 | 2025-04-29 | Ebay Inc. | Complementary item recommendation system |
| CN117421380B (zh) * | 2023-11-09 | 2025-12-30 | 北京火山引擎科技有限公司 | 湖仓元数据标签创建方法、湖仓元数据标签查询方法 |
Family Cites Families (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR100257128B1 (ko) * | 1991-11-11 | 2000-05-15 | 시모야마 도시로 | 심볼정보판독장치 |
| US7464877B2 (en) * | 2003-11-13 | 2008-12-16 | Metrologic Instruments, Inc. | Digital imaging-based bar code symbol reading system employing image cropping pattern generator and automatic cropped image processor |
| DE10234276A1 (de) * | 2002-07-27 | 2004-02-12 | Rehau Ag + Co. | Verfahren zur Steuerung und Überwachung der Herstellung von thermoplastischen Extrusionsprofilen insbesondere in einem Inline-Produktionsprozess mit Bedruckungsvorgang |
| US7783135B2 (en) * | 2005-05-09 | 2010-08-24 | Like.Com | System and method for providing objectified image renderings using recognition information from images |
| JP2007233913A (ja) * | 2006-03-03 | 2007-09-13 | Fuji Xerox Co Ltd | 画像処理装置及びプログラム |
| JP2010509668A (ja) * | 2006-11-14 | 2010-03-25 | コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ | デジタル画像により取得されたオブジェクトを識別する方法及び装置 |
| US8218821B2 (en) * | 2007-01-23 | 2012-07-10 | Pips Technology, Inc. | Video preprocessing of image information for character recognition |
| KR101480400B1 (ko) * | 2007-10-24 | 2015-01-12 | 삼성전자주식회사 | 이미지 파일 관리 장치 및 그 관리 방법 |
| US9037583B2 (en) * | 2008-02-29 | 2015-05-19 | Ratnakar Nitesh | Geo tagging and automatic generation of metadata for photos and videos |
| WO2010102515A1 (en) * | 2009-03-11 | 2010-09-16 | Hong Kong Baptist University | Automatic and semi-automatic image classification, annotation and tagging through the use of image acquisition parameters and metadata |
| US8891832B2 (en) * | 2011-06-03 | 2014-11-18 | Facebook, Inc. | Computer-vision-assisted location check-in |
| US8792684B2 (en) * | 2011-08-11 | 2014-07-29 | At&T Intellectual Property I, L.P. | Method and apparatus for automated analysis and identification of a person in image and video content |
-
2011
- 2011-11-17 US US13/298,310 patent/US20130129142A1/en not_active Abandoned
-
2012
- 2012-11-16 CN CN201280056443.1A patent/CN103930901A/zh active Pending
- 2012-11-16 WO PCT/US2012/065467 patent/WO2013074895A2/en not_active Ceased
- 2012-11-16 EP EP12850387.7A patent/EP2780863A4/en not_active Withdrawn
- 2012-11-16 AU AU2012340354A patent/AU2012340354A1/en not_active Abandoned
- 2012-11-16 MX MX2014006000A patent/MX2014006000A/es not_active Application Discontinuation
- 2012-11-16 BR BR112014011739A patent/BR112014011739A8/pt not_active Application Discontinuation
- 2012-11-16 IN IN3322CHN2014 patent/IN2014CN03322A/en unknown
- 2012-11-16 CA CA2855836A patent/CA2855836A1/en not_active Abandoned
- 2012-11-16 RU RU2014119859A patent/RU2608261C2/ru not_active IP Right Cessation
- 2012-11-16 KR KR1020147013107A patent/KR20140091554A/ko not_active Withdrawn
- 2012-11-16 JP JP2014542484A patent/JP2015501982A/ja active Pending
Non-Patent Citations (1)
| Title |
|---|
| See references of EP2780863A4 * |
Cited By (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN103353879A (zh) * | 2013-06-18 | 2013-10-16 | 北京智谷睿拓技术服务有限公司 | 图像处理方法及设备 |
| CN104424262A (zh) * | 2013-08-29 | 2015-03-18 | 宏达国际电子股份有限公司 | 相关影像搜寻方法以及使用者界面控制方法 |
| CN104424262B (zh) * | 2013-08-29 | 2018-04-06 | 宏达国际电子股份有限公司 | 相关影像搜寻方法以及使用者界面控制方法 |
| WO2015101250A1 (zh) * | 2013-12-30 | 2015-07-09 | 腾讯科技(深圳)有限公司 | 图片处理方法及装置 |
Also Published As
| Publication number | Publication date |
|---|---|
| CN103930901A (zh) | 2014-07-16 |
| EP2780863A4 (en) | 2016-05-11 |
| US20130129142A1 (en) | 2013-05-23 |
| BR112014011739A8 (pt) | 2017-12-12 |
| AU2012340354A1 (en) | 2014-05-29 |
| EP2780863A2 (en) | 2014-09-24 |
| RU2608261C2 (ru) | 2017-01-17 |
| RU2014119859A (ru) | 2015-11-27 |
| WO2013074895A3 (en) | 2013-07-11 |
| BR112014011739A2 (pt) | 2017-05-02 |
| IN2014CN03322A (https=) | 2015-07-03 |
| MX2014006000A (es) | 2014-08-27 |
| KR20140091554A (ko) | 2014-07-21 |
| JP2015501982A (ja) | 2015-01-19 |
| CA2855836A1 (en) | 2013-05-23 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US20130129142A1 (en) | Automatic tag generation based on image content | |
| US8150098B2 (en) | Grouping images by location | |
| US9430719B2 (en) | System and method for providing objectified image renderings using recognition information from images | |
| KR101417548B1 (ko) | 사진 콜렉션에서 이벤트들을 생성하고 라벨링하는 방법 및 시스템 | |
| US8897505B2 (en) | System and method for enabling the use of captured images through recognition | |
| US7809192B2 (en) | System and method for recognizing objects from images and identifying relevancy amongst images and information | |
| US20110184953A1 (en) | On-location recommendation for photo composition | |
| Popescu et al. | MonuAnno: automatic annotation of georeferenced landmarks images | |
| CN104331509A (zh) | 照片管理方法及装置 | |
| CN110704712A (zh) | 基于图像检索的场景图片拍摄位置范围识别方法及系统 | |
| Gallagher et al. | Geo-location inference from image content and user tags | |
| CN103069415A (zh) | 用于图像处理的计算机实施的方法、计算机程序产品和计算机系统 | |
| EP2695134A2 (en) | Event determination from photos | |
| CN102422286A (zh) | 利用图像获取参数和元数据自动和半自动的图像分类、注释和标签 | |
| WO2008005175A1 (en) | Using background for searching image collections | |
| CN107578003B (zh) | 一种基于地理标记图像的遥感图像迁移学习方法 | |
| US20150086123A1 (en) | Photo Grouping System, Photo Grouping Method- and Non-Transitory Computer-Readable Storage Medium | |
| Weng et al. | Semantic signatures for large-scale visual localization | |
| Chen et al. | Integrated content and context analysis for mobile landmark recognition | |
| CN106250396A (zh) | 一种图像标签自动生成系统及方法 | |
| Phyu et al. | Self-adaptive feature extraction scheme for mobile image retrieval of flowers | |
| Luo et al. | Recognizing picture-taking environment from satellite images: a feasibility study | |
| Vajda et al. | Propagation of geotags based on object duplicate detection | |
| CN117332110A (zh) | 一种渔获相册智能分类的方法及系统 | |
| Chen et al. | A SURF Feature Based Building Recognition System for Distinctive Architectures |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| REEP | Request for entry into the european phase |
Ref document number: 2012850387 Country of ref document: EP |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2012850387 Country of ref document: EP |
|
| ENP | Entry into the national phase |
Ref document number: 2014542484 Country of ref document: JP Kind code of ref document: A |
|
| ENP | Entry into the national phase |
Ref document number: 2855836 Country of ref document: CA |
|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 12850387 Country of ref document: EP Kind code of ref document: A2 |
|
| ENP | Entry into the national phase |
Ref document number: 20147013107 Country of ref document: KR Kind code of ref document: A |
|
| ENP | Entry into the national phase |
Ref document number: 2014119859 Country of ref document: RU Kind code of ref document: A |
|
| WWE | Wipo information: entry into national phase |
Ref document number: MX/A/2014/006000 Country of ref document: MX |
|
| ENP | Entry into the national phase |
Ref document number: 2012340354 Country of ref document: AU Date of ref document: 20121116 Kind code of ref document: A |
|
| REG | Reference to national code |
Ref country code: BR Ref legal event code: B01A Ref document number: 112014011739 Country of ref document: BR |
|
| ENP | Entry into the national phase |
Ref document number: 112014011739 Country of ref document: BR Kind code of ref document: A2 Effective date: 20140515 |