WO2016048465A1 - Use of depth perception as indicator of search, user interest or preference - Google Patents

Use of depth perception as indicator of search, user interest or preference Download PDF

Info

Publication number
WO2016048465A1
WO2016048465A1 PCT/US2015/044778 US2015044778W WO2016048465A1 WO 2016048465 A1 WO2016048465 A1 WO 2016048465A1 US 2015044778 W US2015044778 W US 2015044778W WO 2016048465 A1 WO2016048465 A1 WO 2016048465A1
Authority
WO
WIPO (PCT)
Prior art keywords
preference information
objects
user
display
information
Prior art date
Application number
PCT/US2015/044778
Other languages
French (fr)
Inventor
Joel Fogelson
Juan M. NOGUEROL
Adam BALEST
Guillaume Andre Roger GOUSSARD
Original Assignee
Technicolor Usa, Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Technicolor Usa, Inc filed Critical Technicolor Usa, Inc
Priority to KR1020177007764A priority Critical patent/KR20170058942A/en
Priority to EP15756724.9A priority patent/EP3198473A1/en
Priority to CN201580051350.3A priority patent/CN107004004B/en
Priority to US15/513,101 priority patent/US11347793B2/en
Priority to JP2017514696A priority patent/JP2017535835A/en
Publication of WO2016048465A1 publication Critical patent/WO2016048465A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/583Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/73Querying
    • G06F16/735Filtering based on additional data, e.g. user or group profiles
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/783Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/7837Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using objects detected or recognised in the video content
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying
    • G06F16/9038Presentation of query results
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/12Edge-based segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor

Definitions

  • the present principles relate to the use of depth perception on a three- dimensional (3D) display or in a virtual reality (VR) space to indicate search results, user preferences or interest.
  • 3D three- dimensional
  • VR virtual reality
  • Image segmentation techniques are often used to separate different objects in images or video sequences. Object recognition techniques allow these objects to be identified or tracked within an existing sequence. In the medical imaging field, objects that appear to be tumors can be identified from medical video sequences by defining what a tumor may look like, and then searching for objects that reasonably fit this description in the sequence
  • Another challenge is to present the results of such a search in a meaningful way to a user, such that he can quickly identify those objects that he is looking for.
  • a method for displaying preference information in a three dimensional or virtual reality space includes a step for receiving preference information.
  • the method further includes a step for generating relevance data for at least one segmented and identified object in input image data, based on the preference information.
  • the method further includes a step for displaying the image data in at least two planes based on the generated relevance data.
  • an apparatus comprising a processor configured to receive preference information and generate relevance data for at least one segmented and identified object from input video data based on the preference information.
  • the apparatus further comprises a display processor to receive the relevance data and produce data to display the image data in at least two planes based on the relevance data.
  • Figure 1 shows a flow diagram of an exemplary method 100 for displaying preferences using a plurality of planes in a three dimensional or virtual reality space.
  • Figure 2 shows one embodiment of an apparatus for displaying preferences using a plurality of planes in a three dimensional or virtual reality space.
  • Figures 3a and 3b show a conceptual view of a three dimensional image with a plurality of blocks.
  • the present principles are directed to a method and apparatus for displaying preference information in a three dimensional or virtual reality space.
  • the information is displayed using depth perception, such that those items of most interest are displayed in planes appearing in the foreground, or closer to the viewer. Those items that are disliked are displayed in planes appearing in the background, or farther from the viewer.
  • the foreground and background planes can vary to the degree that they are forward or backward, based on the degree of relevance or interest to the user.
  • Figure 1 is a flow diagram of a method 100 for displaying preferences using a plurality of planes in a three dimensional or virtual reality space.
  • the method commences with a start at step 101 , and proceeds to step 1 10 for receiving preference information.
  • Preference information can comprised user input(s), stored profile information, or other such data. Preference information can also comprise some combination of this aforementioned information, or be based on this information.
  • the method then proceeds to step 120 for generating relevance data for at least one segmented and identified object in input image data, based on the preference information from step 1 10.
  • the method then proceeds to step 130 for displaying the image data in at least two planes based on the generated relevance data from step 120.
  • Figure 2 shows one exemplary embodiment of an apparatus 200 for displaying preferences using a plurality of planes in a three dimensional or virtual reality space.
  • the apparatus comprises a processor 210 configured to receive preference information and generate relevance data for at least one segmented and identified object from input video data based on the preference information.
  • Preference information can comprise user input(s), stored profile information, or other such data. Preference information can also comprise some combination of this aforementioned information, or be based on this information.
  • the segmentation information and object identification information can be generated locally as part of the present principles, or can be supplied by an external source.
  • the apparatus further comprises a display processor 220 that is in signal connectivity with the relevance data output of processor 210 and produces data to display the image data in at least two planes based on the relevance data.
  • the apparatus can also optionally receive input from the user who can adjust the plane of objects in an image that is then fed back to the user preferences to adjust his or her preferences for future use.
  • the depth information is displayed in a three dimensional (3D) or virtual reality (VR) space, assigned to at least one object in an image or image sequence that has been segmented and identified in the image(s).
  • 3D three dimensional
  • VR virtual reality
  • Preference information is used to generate the depth information, also referred to as relevance information.
  • the preference information can be derived in several ways. It can be based on user input, such as, for example, a search query. It can be based on user profile information, or it can be based on other information, for example some externally supplied information that indicates relevancy of objects in an input image or image sequence.
  • Segmentation information is also used to break an image into different objects.
  • the segmentation information can be generated locally as part of the present principles, or can be supplied by an external source.
  • Edge detection algorithms can be used to detect various objects and break them up like pieces of a jigsaw puzzle in the image.
  • Object identification or object recognition information is used to identify objects that have been segmented from the image.
  • the object identification information can also be generated locally as part of the present principles, or can be supplied by an external source.
  • a set of data from an external source can indicate actors appearing in certain movie scenes.
  • One example of this is DigitalSmiths data.
  • the preference information along with the segmentation information and object identification information in the input image, is used to generate relevance information for at least one of the objects in the input image.
  • the preference information can indicate how interested a user is in an object, its relevance to the user or some other metric that the preference information shows.
  • Objects that are favored are shown in foreground planes of the display, to varying degrees based on the strength of the preference.
  • Objects that are disfavored are shown in background planes of the display, also to varying degrees based on the strength of the preferences.
  • Unidentified or neutral objects are shown at a base level, neither foreground nor background.
  • the relevance information for an object or objects in an image is used to display that object in a video plane that is indicative of user interest relative to other objects in the image, or relative to the background.
  • Unidentified objects can be left at a base level that appears to neither be pushed in nor pushed out. For example, if a user is very interested in a particular object because, for example, the user has searched for this object, it can be shown in a foreground plane.
  • Another object is slightly less relevant than the first, but there is still some user interest, it may be shown in a plane that is slightly less foreground than the first object, but still in the foreground relative to neutral parts of the image, in which there is no indicated relevance. If, for example, a user profile indicates a strong dislike for something, and it also is contained in the image, it will appear in a plane that is shown in the background to indicate user disfavor. The rendering of the various objects with regard to the plane they appear is adjusted based on the preference information.
  • Figure 3a shows a front view 300 of five blocks in an image, labelled 1 through 5.
  • a user is most interested in, or likes, block 5 350, then block 3 330, then block 2 320.
  • the user is not interested in block 1 310, and very not interested in block 4 340.
  • Figure 3b shows a conceptual side view 360 of the image under the present principles. Because the user is most interested in block 5 350, it is "pushed forward", or shown in the foreground the closest. Next most forward is block 3 330, then block 2 320.
  • the user is not interested in block 1 310, so it is shown slightly “pushed back” into the background of the image. And the user is very not interested in block 4 340, so it is shown "pushed back” even farther into a background plane of the image.
  • a user would like to search a movie library (either local or online) for movies by Actor A. He also has a profile stored that indicates what actors/actresses he favors and which he disfavors. The profile can also indicate other preference information, such as genre, director, etc.
  • the user searches for movies by Actor A, the user receives a series of search results, in the form of images, clips or trailers, for movies that include Actor A. In these results, Actor A can be pushed into the foreground because of the user request from the search.
  • the clips can also show other actors/actresses in each of the results, and their image can appear to be pushed forward or backward, based on the user preference for that actor.
  • a similar idea can be applied to media asset titles, where those titles that are most appealing to a user can be pushed into the foreground and the unappealing titles pushed back.
  • a user may alter his preferences by directly adjusting the plane that the object, or actor, is in. For example, in the Actor A embodiment above, if a user decides that he has changed his opinion of one of the objects in an image, he can push it back or pull it forward, and his preference information or profile will automatically be updated and now influence the search in a new way.
  • processor or “controller” should not be construed to refer exclusively to hardware capable of executing software, and may implicitly include, without limitation, digital signal processor (“DSP”) hardware, read-only memory (“ROM”) for storing software, random access memory (“RAM”), and non-volatile storage.
  • DSP digital signal processor
  • ROM read-only memory
  • RAM random access memory
  • any switches shown in the figures are conceptual only. Their function may be carried out through the operation of program logic, through dedicated logic, through the interaction of program control and dedicated logic, or even manually, the particular technique being selectable by the implementer as more specifically understood from the context.
  • any element expressed as a means for performing a specified function is intended to encompass any way of performing that function including, for example, a) a combination of circuit elements that performs that function or b) software in any form, including, therefore, firmware, microcode or the like, combined with appropriate circuitry for executing that software to perform the function.
  • the present principles as defined by such claims reside in the fact that the
  • such phrasing is intended to encompass the selection of the first listed option (A) only, or the selection of the second listed option (B) only, or the selection of the third listed option (C) only, or the selection of the first and the second listed options (A and B) only, or the selection of the first and third listed options (A and C) only, or the selection of the second and third listed options (B and C) only, or the selection of all three options (A and B and C).
  • This may be extended, as readily apparent by one of ordinary skill in this and related arts, for as many items listed.
  • the teachings of the present principles are implemented as a combination of hardware and software.
  • the software may be implemented as an application program tangibly embodied on a program storage unit.
  • the application program may be uploaded to, and executed by, a machine comprising any suitable architecture.
  • the machine is implemented on a computer platform having hardware such as one or more central processing units (“CPU"), a random access memory (“RAM”), and input/output ("I/O") interfaces.
  • CPU central processing units
  • RAM random access memory
  • I/O input/output
  • the computer platform may also include an operating system and microinstruction code.
  • the various processes and functions described herein may be either part of the microinstruction code or part of the application program, or any combination thereof, which may be executed by a CPU.
  • various other peripheral units may be connected to the computer platform such as an additional data storage unit and a printing unit.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Library & Information Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • User Interface Of Digital Computer (AREA)
  • Processing Or Creating Images (AREA)

Abstract

A method and apparatus are provided. The method provides display of preference information using depth perception, such as on a three dimensional display or in a virtual reality space. Objects are identified in an image or image sequence and assigned relevance information based on the preference information. Objects that are favored are shown in foreground planes of the display, to varying degrees based on the strength of the preference. Objects that are disfavored are shown in background planes of the display, also to varying degrees based on the strength of the preferences. Unidentified or neutral objects are shown at a base level, neither foreground nor background. An exemplary embodiment is provided for a movie database application with various actors shown pushed in or out. Another embodiment allows a user to adjust the plane of the objects to alter his preferences.

Description

USE OF DEPTH PERCEPTION AS INDICATOR OF SEARCH, USER INTEREST OR
PREFERENCE
TECHNICAL FIELD
The present principles relate to the use of depth perception on a three- dimensional (3D) display or in a virtual reality (VR) space to indicate search results, user preferences or interest.
BACKGROUND
Image segmentation techniques are often used to separate different objects in images or video sequences. Object recognition techniques allow these objects to be identified or tracked within an existing sequence. In the medical imaging field, objects that appear to be tumors can be identified from medical video sequences by defining what a tumor may look like, and then searching for objects that reasonably fit this description in the sequence
But, if a user wants to search for an object, or some subject and isn't sure which media asset the subject might be contained in, or isn't certain of the exact appearance of the subject, image segmentation and object recognition techniques will fail.
Another challenge is to present the results of such a search in a meaningful way to a user, such that he can quickly identify those objects that he is looking for.
A need exists to identify subjects in video images or sequences and present them to a user in a way that also displays those items of interest in the image to a user.
SUMMARY
These and other drawbacks and disadvantages of the prior art are addressed by the present principles, which are directed to using depth perception on a three- dimensional (3D) display or in a virtual reality (VR) space to indicate search results, user preferences or interest.
According to an aspect of the present principles, there is provided a method for displaying preference information in a three dimensional or virtual reality space. The method includes a step for receiving preference information. The method further includes a step for generating relevance data for at least one segmented and identified object in input image data, based on the preference information. The method further includes a step for displaying the image data in at least two planes based on the generated relevance data.
According to another aspect of the present principles, there is provided an apparatus. The apparatus comprises a processor configured to receive preference information and generate relevance data for at least one segmented and identified object from input video data based on the preference information. The apparatus further comprises a display processor to receive the relevance data and produce data to display the image data in at least two planes based on the relevance data.
These and other aspects, features and advantages of the present principles will become apparent from the following detailed description of exemplary embodiments, which is to be read in connection with the accompanying drawings. BRIEF DESCRIPTION OF THE DRAWINGS
The present principles may be better understood in accordance with the following exemplary figures, in which:
Figure 1 shows a flow diagram of an exemplary method 100 for displaying preferences using a plurality of planes in a three dimensional or virtual reality space.
Figure 2 shows one embodiment of an apparatus for displaying preferences using a plurality of planes in a three dimensional or virtual reality space.
Figures 3a and 3b show a conceptual view of a three dimensional image with a plurality of blocks. DETAILED DESCRIPTION
The present principles are directed to a method and apparatus for displaying preference information in a three dimensional or virtual reality space. The information is displayed using depth perception, such that those items of most interest are displayed in planes appearing in the foreground, or closer to the viewer. Those items that are disliked are displayed in planes appearing in the background, or farther from the viewer. The foreground and background planes can vary to the degree that they are forward or backward, based on the degree of relevance or interest to the user.
One embodiment of the present principles is shown in Figure 1 which is a flow diagram of a method 100 for displaying preferences using a plurality of planes in a three dimensional or virtual reality space. The method commences with a start at step 101 , and proceeds to step 1 10 for receiving preference information. Preference information can comprised user input(s), stored profile information, or other such data. Preference information can also comprise some combination of this aforementioned information, or be based on this information. The method then proceeds to step 120 for generating relevance data for at least one segmented and identified object in input image data, based on the preference information from step 1 10. The method then proceeds to step 130 for displaying the image data in at least two planes based on the generated relevance data from step 120.
Figure 2 shows one exemplary embodiment of an apparatus 200 for displaying preferences using a plurality of planes in a three dimensional or virtual reality space. The apparatus comprises a processor 210 configured to receive preference information and generate relevance data for at least one segmented and identified object from input video data based on the preference information. Preference information can comprise user input(s), stored profile information, or other such data. Preference information can also comprise some combination of this aforementioned information, or be based on this information. The segmentation information and object identification information can be generated locally as part of the present principles, or can be supplied by an external source. The apparatus further comprises a display processor 220 that is in signal connectivity with the relevance data output of processor 210 and produces data to display the image data in at least two planes based on the relevance data.
The apparatus can also optionally receive input from the user who can adjust the plane of objects in an image that is then fed back to the user preferences to adjust his or her preferences for future use.
As previously stated, the present principles are directed to using depth
perception as an indicator of search results, user interest, or preferences. The depth information is displayed in a three dimensional (3D) or virtual reality (VR) space, assigned to at least one object in an image or image sequence that has been segmented and identified in the image(s). When referring to an image in the following description, it should be understood that the process can also be applied to an image sequence comprised of individual images.
Preference information is used to generate the depth information, also referred to as relevance information. The preference information can be derived in several ways. It can be based on user input, such as, for example, a search query. It can be based on user profile information, or it can be based on other information, for example some externally supplied information that indicates relevancy of objects in an input image or image sequence.
Segmentation information is also used to break an image into different objects. The segmentation information can be generated locally as part of the present principles, or can be supplied by an external source. Edge detection algorithms can be used to detect various objects and break them up like pieces of a jigsaw puzzle in the image.
Object identification or object recognition information is used to identify objects that have been segmented from the image. The object identification information can also be generated locally as part of the present principles, or can be supplied by an external source.
In at least one exemplary embodiment, a set of data from an external source can indicate actors appearing in certain movie scenes. One example of this is DigitalSmiths data.
The preference information, along with the segmentation information and object identification information in the input image, is used to generate relevance information for at least one of the objects in the input image. The preference information can indicate how interested a user is in an object, its relevance to the user or some other metric that the preference information shows.
Objects that are favored are shown in foreground planes of the display, to varying degrees based on the strength of the preference. Objects that are disfavored are shown in background planes of the display, also to varying degrees based on the strength of the preferences. Unidentified or neutral objects are shown at a base level, neither foreground nor background. The relevance information for an object or objects in an image is used to display that object in a video plane that is indicative of user interest relative to other objects in the image, or relative to the background. Unidentified objects can be left at a base level that appears to neither be pushed in nor pushed out. For example, if a user is very interested in a particular object because, for example, the user has searched for this object, it can be shown in a foreground plane. If another object is slightly less relevant than the first, but there is still some user interest, it may be shown in a plane that is slightly less foreground than the first object, but still in the foreground relative to neutral parts of the image, in which there is no indicated relevance. If, for example, a user profile indicates a strong dislike for something, and it also is contained in the image, it will appear in a plane that is shown in the background to indicate user disfavor. The rendering of the various objects with regard to the plane they appear is adjusted based on the preference information.
An example of foreground and background parts of an image in a 3D or VR space is indicated in Figure 3. Figure 3a shows a front view 300 of five blocks in an image, labelled 1 through 5. A user is most interested in, or likes, block 5 350, then block 3 330, then block 2 320. The user is not interested in block 1 310, and very not interested in block 4 340.
Figure 3b shows a conceptual side view 360 of the image under the present principles. Because the user is most interested in block 5 350, it is "pushed forward", or shown in the foreground the closest. Next most forward is block 3 330, then block 2 320.
The user is not interested in block 1 310, so it is shown slightly "pushed back" into the background of the image. And the user is very not interested in block 4 340, so it is shown "pushed back" even farther into a background plane of the image.
One example of an embodiment of the present principles can be illustrated through an example of a movie query application. A user would like to search a movie library (either local or online) for movies by Actor A. He also has a profile stored that indicates what actors/actresses he favors and which he disfavors. The profile can also indicate other preference information, such as genre, director, etc. Once the user searches for movies by Actor A, the user receives a series of search results, in the form of images, clips or trailers, for movies that include Actor A. In these results, Actor A can be pushed into the foreground because of the user request from the search. However, because other preferences from the profile information are used, the clips can also show other actors/actresses in each of the results, and their image can appear to be pushed forward or backward, based on the user preference for that actor.
If the user sees lots of foreground actors/actresses, that user may be eager to watch this movie because it contains many of his favorite stars. If, however, he sees a movie with Actor A in the foreground, but the film's other actors pushed back, he may decide he doesn't wish to view the film despite his desire to see an Actor A movie because of his dislike of the remaining cast.
A similar idea can be applied to media asset titles, where those titles that are most appealing to a user can be pushed into the foreground and the unappealing titles pushed back.
In another exemplary embodiment, once the display is shown with objects, such as actors, in their various planes, a user may alter his preferences by directly adjusting the plane that the object, or actor, is in. For example, in the Actor A embodiment above, if a user decides that he has changed his opinion of one of the objects in an image, he can push it back or pull it forward, and his preference information or profile will automatically be updated and now influence the search in a new way.
In a three dimensional display under the present principles, the various objects appear closer or farther in various planes in the image. In a virtual reality space, one can imagine the various planes like filing cabinets, with some drawers sticking out to varying degrees and others pushed in to varying degrees. A user would be able to walk around the files and determine the degree that they are pushed in or out.
The present description illustrates the present principles. It will thus be
appreciated that those skilled in the art will be able to devise various arrangements that, although not explicitly described or shown herein, embody the present principles and are included within the present principles. All examples and conditional language recited herein are intended for pedagogical purposes to aid the reader in understanding the present principles and the concepts contributed by the inventor(s) to furthering the art, and are to be construed as being without limitation to such specifically recited examples and conditions.
Moreover, all statements herein reciting principles, aspects, and embodiments of the present principles, as well as specific examples thereof, are intended to encompass both structural and functional equivalents thereof. Additionally, it is intended that such equivalents include both currently known equivalents as well as equivalents developed in the future, i.e., any elements developed that perform the same function, regardless of structure.
Thus, for example, it will be appreciated by those skilled in the art that the block diagrams presented herein represent conceptual views of illustrative circuitry
embodying the present principles. Similarly, it will be appreciated that any flow charts, flow diagrams, state transition diagrams, pseudocode, and the like represent various processes which may be substantially represented in computer readable media and so executed by a computer or processor, whether or not such computer or processor is explicitly shown.
The functions of the various elements shown in the figures may be provided through the use of dedicated hardware as well as hardware capable of executing software in association with appropriate software. When provided by a processor, the functions may be provided by a single dedicated processor, by a single shared processor, or by a plurality of individual processors, some of which may be shared. Moreover, explicit use of the term "processor" or "controller" should not be construed to refer exclusively to hardware capable of executing software, and may implicitly include, without limitation, digital signal processor ("DSP") hardware, read-only memory ("ROM") for storing software, random access memory ("RAM"), and non-volatile storage.
Other hardware, conventional and/or custom, may also be included. Similarly, any switches shown in the figures are conceptual only. Their function may be carried out through the operation of program logic, through dedicated logic, through the interaction of program control and dedicated logic, or even manually, the particular technique being selectable by the implementer as more specifically understood from the context.
In the claims hereof, any element expressed as a means for performing a specified function is intended to encompass any way of performing that function including, for example, a) a combination of circuit elements that performs that function or b) software in any form, including, therefore, firmware, microcode or the like, combined with appropriate circuitry for executing that software to perform the function. The present principles as defined by such claims reside in the fact that the
functionalities provided by the various recited means are combined and brought together in the manner which the claims call for. It is thus regarded that any means that can provide those functionalities are equivalent to those shown herein.
Reference in the specification to "one embodiment" or "an embodiment" of the present principles, as well as other variations thereof, means that a particular feature, structure, characteristic, and so forth described in connection with the embodiment is included in at least one embodiment of the present principles. Thus, the appearances of the phrase "in one embodiment" or "in an embodiment", as well any other variations, appearing in various places throughout the specification are not necessarily all referring to the same embodiment.
It is to be appreciated that the use of any of the following 7", "and/or", and "at least one of", for example, in the cases of "A/B", "A and/or B" and "at least one of A and B", is intended to encompass the selection of the first listed option (A) only, or the selection of the second listed option (B) only, or the selection of both options (A and B). As a further example, in the cases of "A, B, and/or C" and "at least one of A, B, and C", such phrasing is intended to encompass the selection of the first listed option (A) only, or the selection of the second listed option (B) only, or the selection of the third listed option (C) only, or the selection of the first and the second listed options (A and B) only, or the selection of the first and third listed options (A and C) only, or the selection of the second and third listed options (B and C) only, or the selection of all three options (A and B and C). This may be extended, as readily apparent by one of ordinary skill in this and related arts, for as many items listed. These and other features and advantages of the present principles may be readily ascertained by one of ordinary skill in the pertinent art based on the teachings herein. It is to be understood that the teachings of the present principles may be implemented in various forms of hardware, software, firmware, special purpose processors, or combinations thereof.
Most preferably, the teachings of the present principles are implemented as a combination of hardware and software. Moreover, the software may be implemented as an application program tangibly embodied on a program storage unit. The application program may be uploaded to, and executed by, a machine comprising any suitable architecture. Preferably, the machine is implemented on a computer platform having hardware such as one or more central processing units ("CPU"), a random access memory ("RAM"), and input/output ("I/O") interfaces. The computer platform may also include an operating system and microinstruction code. The various processes and functions described herein may be either part of the microinstruction code or part of the application program, or any combination thereof, which may be executed by a CPU. In addition, various other peripheral units may be connected to the computer platform such as an additional data storage unit and a printing unit.
It is to be further understood that, because some of the constituent system components and methods depicted in the accompanying drawings are preferably implemented in software, the actual connections between the system components or the process function blocks may differ depending upon the manner in which the present principles are programmed. Given the teachings herein, one of ordinary skill in the pertinent art will be able to contemplate these and similar implementations or configurations of the present principles.
Although the illustrative embodiments have been described herein with reference to the accompanying drawings, it is to be understood that the present principles is not limited to those precise embodiments, and that various changes and modifications may be effected therein by one of ordinary skill in the pertinent art without departing from the scope of the present principles. All such changes and modifications are intended to be included within the scope of the present principles as set forth in the appended claims.

Claims

CLAIMS:
1 . A method, comprising:
receiving preference information;
generating relevance data for at least one segmented and identified object, based on said preference information;
displaying image data in one of at least two planes based on said
relevance data.
2. The method of Claim 1 , wherein said displaying step occurs in a three dimensional or virtual reality space.
3. The method of Claim 1 , comprising the step of configuring stored
reference information responsive to at least one of user input and stored profile information.
4. The method of Claim 3, comprising the step of receiving a search query as a user input.
5. The method of Claim 3, comprising the step of configuring said preference information by combining at least one user input and stored profile information.
6. The method of Claim 1 , wherein said at least one object has been identified by a database of objects which are contained in said input video data.
7. The method of Claim 1 , wherein said at least one object that has been segmented is segmented using an edge detection process.
8. The method of Claim 1 , wherein said preference information is modified by a user altering a plane of an object through said display.
9. An apparatus, comprising:
a processor, configured to receive preference information and to generate relevance data for at least one object that has been segmented and identified from input video data based on said preference information;
a display processor that receives said relevance data and produces data to display said image data in one of at least two planes based on said relevance data.
10. The apparatus of Claim 9, wherein said display processor produces data to display said image data in one of at least two planes in a three dimensional or virtual reality space.
1 1 . The apparatus of Claim 9, wherein said preference information is based on at least one of user input and stored profile information.
12. The apparatus of Claim 1 1 , wherein said user input is a search query.
13. The apparatus of Claim 1 1 , wherein said preference information is a combination of at least one user input and stored preference information.
14. The apparatus of Claim 1 1 , wherein said at least one object has been identified by a database of objects which are contained in said input video data.
15. The apparatus of Claim 1 1 , wherein said at least one object that has been segmented is segmented using an edge detection process.
16. The apparatus of Claim 1 1 , wherein said preference information is modified by a user altering a plane of an object through said display.
PCT/US2015/044778 2014-09-22 2015-08-12 Use of depth perception as indicator of search, user interest or preference WO2016048465A1 (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
KR1020177007764A KR20170058942A (en) 2014-09-22 2015-08-12 Use of depth perception as indicator of search, user interest or preference
EP15756724.9A EP3198473A1 (en) 2014-09-22 2015-08-12 Use of depth perception as indicator of search, user interest or preference
CN201580051350.3A CN107004004B (en) 2014-09-22 2015-08-12 Using depth perception as an indicator of search, user interest or preference
US15/513,101 US11347793B2 (en) 2014-09-22 2015-08-12 Use of depth perception as indicator of search, user interest or preference
JP2017514696A JP2017535835A (en) 2014-09-22 2015-08-12 Using depth perception as an indicator of search, user interest or preference

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201462053349P 2014-09-22 2014-09-22
US62/053,349 2014-09-22

Publications (1)

Publication Number Publication Date
WO2016048465A1 true WO2016048465A1 (en) 2016-03-31

Family

ID=54012276

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2015/044778 WO2016048465A1 (en) 2014-09-22 2015-08-12 Use of depth perception as indicator of search, user interest or preference

Country Status (6)

Country Link
US (1) US11347793B2 (en)
EP (1) EP3198473A1 (en)
JP (1) JP2017535835A (en)
KR (1) KR20170058942A (en)
CN (1) CN107004004B (en)
WO (1) WO2016048465A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111159541B (en) * 2019-12-11 2023-08-25 微民保险代理有限公司 Method and device for determining account behavior preference
CN112612363A (en) * 2020-12-18 2021-04-06 上海影创信息科技有限公司 User non-preference comparison method and system based on afterglow area
US11334313B1 (en) * 2021-03-30 2022-05-17 Htc Corporation Managing conferences in virtual environment
CN113076436B (en) * 2021-04-09 2023-07-25 成都天翼空间科技有限公司 VR equipment theme background recommendation method and system
CN115423948B (en) * 2022-11-04 2023-02-21 江西省映尚科技有限公司 VR image processing method and system and readable storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001073596A2 (en) * 2000-03-29 2001-10-04 Koninklijke Philips Electronics N.V. Search user interface for constructing and managing user profiles and search criteria
GB2365300A (en) * 2000-06-07 2002-02-13 David Meakes Displaying search results according to relevance to query
US20100262616A1 (en) * 2009-04-09 2010-10-14 Nokia Corporation Method and apparatus for providing visual search engine results
US20110305437A1 (en) * 2010-06-15 2011-12-15 Kabushiki Kaisha Toshiba Electronic apparatus and indexing control method

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5664077A (en) * 1993-09-06 1997-09-02 Nec Corporation Three-dimensional graph displaying system
US6505194B1 (en) * 2000-03-29 2003-01-07 Koninklijke Philips Electronics N.V. Search user interface with enhanced accessibility and ease-of-use features based on visual metaphors
JP2002157269A (en) * 2000-11-22 2002-05-31 Nippon Telegr & Teleph Corp <Ntt> Video portal system and video providing method
US7995810B2 (en) * 2005-06-24 2011-08-09 The University Of Iowa Research Foundation System and methods for image segmentation in n-dimensional space
JP2009508274A (en) * 2005-09-13 2009-02-26 スペースタイムスリーディー・インコーポレーテッド System and method for providing a three-dimensional graphical user interface
US7870140B2 (en) * 2006-06-12 2011-01-11 D&S Consultants, Inc. System and method of incorporating user preferences in image searches
US8549436B1 (en) * 2007-06-04 2013-10-01 RedZ, Inc. Visual web search interface
JP2009080580A (en) 2007-09-25 2009-04-16 Toshiba Corp Image display device and display method
CN101510291A (en) 2008-02-15 2009-08-19 国际商业机器公司 Visualization method and apparatus for multidimensional data
US8520979B2 (en) * 2008-08-19 2013-08-27 Digimarc Corporation Methods and systems for content processing
JP5359266B2 (en) * 2008-12-26 2013-12-04 富士通株式会社 Face recognition device, face recognition method, and face recognition program
US8335784B2 (en) * 2009-08-31 2012-12-18 Microsoft Corporation Visual search and three-dimensional results
US20110063288A1 (en) 2009-09-11 2011-03-17 Siemens Medical Solutions Usa, Inc. Transfer function for volume rendering
JP2012064200A (en) * 2010-08-16 2012-03-29 Canon Inc Display controller, control method of display controller, program and recording medium
KR101777875B1 (en) 2011-04-28 2017-09-13 엘지디스플레이 주식회사 Stereoscopic image display and method of adjusting stereoscopic image thereof
WO2012164685A1 (en) * 2011-05-31 2012-12-06 楽天株式会社 Information providing device, information providing method, information providing processing program, recording medium recording information providing processing program, and information providing system
JP2013029451A (en) * 2011-07-29 2013-02-07 Ricoh Co Ltd Deposit detection device and deposit detection method
US8799263B2 (en) * 2011-09-04 2014-08-05 Leigh M Rothschild Systems, devices, and methods for providing multidimensional search results
KR101855939B1 (en) 2011-09-23 2018-05-09 엘지전자 주식회사 Method for operating an Image display apparatus
US8990201B1 (en) * 2011-11-03 2015-03-24 Google Inc. Image search results provisoning
US10032303B2 (en) * 2012-12-14 2018-07-24 Facebook, Inc. Scrolling 3D presentation of images
US9459697B2 (en) * 2013-01-15 2016-10-04 Leap Motion, Inc. Dynamic, free-space user interactions for machine control
US9269022B2 (en) * 2013-04-11 2016-02-23 Digimarc Corporation Methods for object recognition and related arrangements

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001073596A2 (en) * 2000-03-29 2001-10-04 Koninklijke Philips Electronics N.V. Search user interface for constructing and managing user profiles and search criteria
GB2365300A (en) * 2000-06-07 2002-02-13 David Meakes Displaying search results according to relevance to query
US20100262616A1 (en) * 2009-04-09 2010-10-14 Nokia Corporation Method and apparatus for providing visual search engine results
US20110305437A1 (en) * 2010-06-15 2011-12-15 Kabushiki Kaisha Toshiba Electronic apparatus and indexing control method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP3198473A1 *

Also Published As

Publication number Publication date
CN107004004A (en) 2017-08-01
US20170300502A1 (en) 2017-10-19
EP3198473A1 (en) 2017-08-02
CN107004004B (en) 2021-02-02
US11347793B2 (en) 2022-05-31
JP2017535835A (en) 2017-11-30
KR20170058942A (en) 2017-05-29

Similar Documents

Publication Publication Date Title
US11347793B2 (en) Use of depth perception as indicator of search, user interest or preference
US11435869B2 (en) Virtual reality environment based manipulation of multi-layered multi-view interactive digital media representations
US11960533B2 (en) Visual search using multi-view interactive digital media representations
EP2894634B1 (en) Electronic device and image compostition method thereof
KR101535579B1 (en) Augmented reality interaction implementation method and system
US9684818B2 (en) Method and apparatus for providing image contents
US20150134651A1 (en) Multi-dimensional surround view based search
CN107633023B (en) Image duplicate removal method and device
US8934759B2 (en) Video editing apparatus and video editing method
JP2019534494A (en) Automatic tagging of objects in multi-view interactive digital media representation of dynamic entities
Silva et al. Towards semantic fast-forward and stabilized egocentric videos
Yeh et al. Relative features for photo quality assessment
CN106528800A (en) Image generation method and apparatus based on real scenes
Ferreira et al. A generic framework for optimal 2D/3D key-frame extraction driven by aggregated saliency maps
KR20200064643A (en) Apparatus and method for providing style information of cloth
CN104850600A (en) Method and device for searching images containing faces
US11847829B2 (en) Method, apparatus, electronic device, and computer storage medium for video processing
Ejaz et al. Video summarization by employing visual saliency in a sufficient content change method
US10353946B2 (en) Client-server communication for live search using multi-view digital media representations
Ju et al. A semi-automatic 2D-to-3D video conversion with adaptive key-frame selection
Vandecasteele et al. Spatio-temporal wardrobe generation of actors’ clothing in video content
Ferreira et al. 3d key-frame extraction method based on visual saliency
Baldacci et al. Presentation of 3D scenes through video example
CN116634238A (en) Information display method, device, computer equipment and storage medium
Vendrig et al. Evaluation of logical story unit segmentation in video sequences

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15756724

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2017514696

Country of ref document: JP

Kind code of ref document: A

ENP Entry into the national phase

Ref document number: 20177007764

Country of ref document: KR

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

REEP Request for entry into the european phase

Ref document number: 2015756724

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2015756724

Country of ref document: EP