WO2016048465A1 - Use of depth perception as indicator of search, user interest or preference - Google Patents
Use of depth perception as indicator of search, user interest or preference Download PDFInfo
- Publication number
- WO2016048465A1 WO2016048465A1 PCT/US2015/044778 US2015044778W WO2016048465A1 WO 2016048465 A1 WO2016048465 A1 WO 2016048465A1 US 2015044778 W US2015044778 W US 2015044778W WO 2016048465 A1 WO2016048465 A1 WO 2016048465A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- preference information
- objects
- user
- display
- information
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/58—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/583—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/73—Querying
- G06F16/735—Filtering based on additional data, e.g. user or group profiles
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/78—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/783—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
- G06F16/7837—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using objects detected or recognised in the video content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/903—Querying
- G06F16/9038—Presentation of query results
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/10—Segmentation; Edge detection
- G06T7/12—Edge-based segmentation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
Definitions
- the present principles relate to the use of depth perception on a three- dimensional (3D) display or in a virtual reality (VR) space to indicate search results, user preferences or interest.
- 3D three- dimensional
- VR virtual reality
- Image segmentation techniques are often used to separate different objects in images or video sequences. Object recognition techniques allow these objects to be identified or tracked within an existing sequence. In the medical imaging field, objects that appear to be tumors can be identified from medical video sequences by defining what a tumor may look like, and then searching for objects that reasonably fit this description in the sequence
- Another challenge is to present the results of such a search in a meaningful way to a user, such that he can quickly identify those objects that he is looking for.
- a method for displaying preference information in a three dimensional or virtual reality space includes a step for receiving preference information.
- the method further includes a step for generating relevance data for at least one segmented and identified object in input image data, based on the preference information.
- the method further includes a step for displaying the image data in at least two planes based on the generated relevance data.
- an apparatus comprising a processor configured to receive preference information and generate relevance data for at least one segmented and identified object from input video data based on the preference information.
- the apparatus further comprises a display processor to receive the relevance data and produce data to display the image data in at least two planes based on the relevance data.
- Figure 1 shows a flow diagram of an exemplary method 100 for displaying preferences using a plurality of planes in a three dimensional or virtual reality space.
- Figure 2 shows one embodiment of an apparatus for displaying preferences using a plurality of planes in a three dimensional or virtual reality space.
- Figures 3a and 3b show a conceptual view of a three dimensional image with a plurality of blocks.
- the present principles are directed to a method and apparatus for displaying preference information in a three dimensional or virtual reality space.
- the information is displayed using depth perception, such that those items of most interest are displayed in planes appearing in the foreground, or closer to the viewer. Those items that are disliked are displayed in planes appearing in the background, or farther from the viewer.
- the foreground and background planes can vary to the degree that they are forward or backward, based on the degree of relevance or interest to the user.
- Figure 1 is a flow diagram of a method 100 for displaying preferences using a plurality of planes in a three dimensional or virtual reality space.
- the method commences with a start at step 101 , and proceeds to step 1 10 for receiving preference information.
- Preference information can comprised user input(s), stored profile information, or other such data. Preference information can also comprise some combination of this aforementioned information, or be based on this information.
- the method then proceeds to step 120 for generating relevance data for at least one segmented and identified object in input image data, based on the preference information from step 1 10.
- the method then proceeds to step 130 for displaying the image data in at least two planes based on the generated relevance data from step 120.
- Figure 2 shows one exemplary embodiment of an apparatus 200 for displaying preferences using a plurality of planes in a three dimensional or virtual reality space.
- the apparatus comprises a processor 210 configured to receive preference information and generate relevance data for at least one segmented and identified object from input video data based on the preference information.
- Preference information can comprise user input(s), stored profile information, or other such data. Preference information can also comprise some combination of this aforementioned information, or be based on this information.
- the segmentation information and object identification information can be generated locally as part of the present principles, or can be supplied by an external source.
- the apparatus further comprises a display processor 220 that is in signal connectivity with the relevance data output of processor 210 and produces data to display the image data in at least two planes based on the relevance data.
- the apparatus can also optionally receive input from the user who can adjust the plane of objects in an image that is then fed back to the user preferences to adjust his or her preferences for future use.
- the depth information is displayed in a three dimensional (3D) or virtual reality (VR) space, assigned to at least one object in an image or image sequence that has been segmented and identified in the image(s).
- 3D three dimensional
- VR virtual reality
- Preference information is used to generate the depth information, also referred to as relevance information.
- the preference information can be derived in several ways. It can be based on user input, such as, for example, a search query. It can be based on user profile information, or it can be based on other information, for example some externally supplied information that indicates relevancy of objects in an input image or image sequence.
- Segmentation information is also used to break an image into different objects.
- the segmentation information can be generated locally as part of the present principles, or can be supplied by an external source.
- Edge detection algorithms can be used to detect various objects and break them up like pieces of a jigsaw puzzle in the image.
- Object identification or object recognition information is used to identify objects that have been segmented from the image.
- the object identification information can also be generated locally as part of the present principles, or can be supplied by an external source.
- a set of data from an external source can indicate actors appearing in certain movie scenes.
- One example of this is DigitalSmiths data.
- the preference information along with the segmentation information and object identification information in the input image, is used to generate relevance information for at least one of the objects in the input image.
- the preference information can indicate how interested a user is in an object, its relevance to the user or some other metric that the preference information shows.
- Objects that are favored are shown in foreground planes of the display, to varying degrees based on the strength of the preference.
- Objects that are disfavored are shown in background planes of the display, also to varying degrees based on the strength of the preferences.
- Unidentified or neutral objects are shown at a base level, neither foreground nor background.
- the relevance information for an object or objects in an image is used to display that object in a video plane that is indicative of user interest relative to other objects in the image, or relative to the background.
- Unidentified objects can be left at a base level that appears to neither be pushed in nor pushed out. For example, if a user is very interested in a particular object because, for example, the user has searched for this object, it can be shown in a foreground plane.
- Another object is slightly less relevant than the first, but there is still some user interest, it may be shown in a plane that is slightly less foreground than the first object, but still in the foreground relative to neutral parts of the image, in which there is no indicated relevance. If, for example, a user profile indicates a strong dislike for something, and it also is contained in the image, it will appear in a plane that is shown in the background to indicate user disfavor. The rendering of the various objects with regard to the plane they appear is adjusted based on the preference information.
- Figure 3a shows a front view 300 of five blocks in an image, labelled 1 through 5.
- a user is most interested in, or likes, block 5 350, then block 3 330, then block 2 320.
- the user is not interested in block 1 310, and very not interested in block 4 340.
- Figure 3b shows a conceptual side view 360 of the image under the present principles. Because the user is most interested in block 5 350, it is "pushed forward", or shown in the foreground the closest. Next most forward is block 3 330, then block 2 320.
- the user is not interested in block 1 310, so it is shown slightly “pushed back” into the background of the image. And the user is very not interested in block 4 340, so it is shown "pushed back” even farther into a background plane of the image.
- a user would like to search a movie library (either local or online) for movies by Actor A. He also has a profile stored that indicates what actors/actresses he favors and which he disfavors. The profile can also indicate other preference information, such as genre, director, etc.
- the user searches for movies by Actor A, the user receives a series of search results, in the form of images, clips or trailers, for movies that include Actor A. In these results, Actor A can be pushed into the foreground because of the user request from the search.
- the clips can also show other actors/actresses in each of the results, and their image can appear to be pushed forward or backward, based on the user preference for that actor.
- a similar idea can be applied to media asset titles, where those titles that are most appealing to a user can be pushed into the foreground and the unappealing titles pushed back.
- a user may alter his preferences by directly adjusting the plane that the object, or actor, is in. For example, in the Actor A embodiment above, if a user decides that he has changed his opinion of one of the objects in an image, he can push it back or pull it forward, and his preference information or profile will automatically be updated and now influence the search in a new way.
- processor or “controller” should not be construed to refer exclusively to hardware capable of executing software, and may implicitly include, without limitation, digital signal processor (“DSP”) hardware, read-only memory (“ROM”) for storing software, random access memory (“RAM”), and non-volatile storage.
- DSP digital signal processor
- ROM read-only memory
- RAM random access memory
- any switches shown in the figures are conceptual only. Their function may be carried out through the operation of program logic, through dedicated logic, through the interaction of program control and dedicated logic, or even manually, the particular technique being selectable by the implementer as more specifically understood from the context.
- any element expressed as a means for performing a specified function is intended to encompass any way of performing that function including, for example, a) a combination of circuit elements that performs that function or b) software in any form, including, therefore, firmware, microcode or the like, combined with appropriate circuitry for executing that software to perform the function.
- the present principles as defined by such claims reside in the fact that the
- such phrasing is intended to encompass the selection of the first listed option (A) only, or the selection of the second listed option (B) only, or the selection of the third listed option (C) only, or the selection of the first and the second listed options (A and B) only, or the selection of the first and third listed options (A and C) only, or the selection of the second and third listed options (B and C) only, or the selection of all three options (A and B and C).
- This may be extended, as readily apparent by one of ordinary skill in this and related arts, for as many items listed.
- the teachings of the present principles are implemented as a combination of hardware and software.
- the software may be implemented as an application program tangibly embodied on a program storage unit.
- the application program may be uploaded to, and executed by, a machine comprising any suitable architecture.
- the machine is implemented on a computer platform having hardware such as one or more central processing units (“CPU"), a random access memory (“RAM”), and input/output ("I/O") interfaces.
- CPU central processing units
- RAM random access memory
- I/O input/output
- the computer platform may also include an operating system and microinstruction code.
- the various processes and functions described herein may be either part of the microinstruction code or part of the application program, or any combination thereof, which may be executed by a CPU.
- various other peripheral units may be connected to the computer platform such as an additional data storage unit and a printing unit.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Library & Information Science (AREA)
- General Physics & Mathematics (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- User Interface Of Digital Computer (AREA)
- Processing Or Creating Images (AREA)
Abstract
Description
Claims
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020177007764A KR20170058942A (en) | 2014-09-22 | 2015-08-12 | Use of depth perception as indicator of search, user interest or preference |
EP15756724.9A EP3198473A1 (en) | 2014-09-22 | 2015-08-12 | Use of depth perception as indicator of search, user interest or preference |
CN201580051350.3A CN107004004B (en) | 2014-09-22 | 2015-08-12 | Using depth perception as an indicator of search, user interest or preference |
US15/513,101 US11347793B2 (en) | 2014-09-22 | 2015-08-12 | Use of depth perception as indicator of search, user interest or preference |
JP2017514696A JP2017535835A (en) | 2014-09-22 | 2015-08-12 | Using depth perception as an indicator of search, user interest or preference |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201462053349P | 2014-09-22 | 2014-09-22 | |
US62/053,349 | 2014-09-22 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2016048465A1 true WO2016048465A1 (en) | 2016-03-31 |
Family
ID=54012276
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2015/044778 WO2016048465A1 (en) | 2014-09-22 | 2015-08-12 | Use of depth perception as indicator of search, user interest or preference |
Country Status (6)
Country | Link |
---|---|
US (1) | US11347793B2 (en) |
EP (1) | EP3198473A1 (en) |
JP (1) | JP2017535835A (en) |
KR (1) | KR20170058942A (en) |
CN (1) | CN107004004B (en) |
WO (1) | WO2016048465A1 (en) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111159541B (en) * | 2019-12-11 | 2023-08-25 | 微民保险代理有限公司 | Method and device for determining account behavior preference |
CN112612363A (en) * | 2020-12-18 | 2021-04-06 | 上海影创信息科技有限公司 | User non-preference comparison method and system based on afterglow area |
US11334313B1 (en) * | 2021-03-30 | 2022-05-17 | Htc Corporation | Managing conferences in virtual environment |
CN113076436B (en) * | 2021-04-09 | 2023-07-25 | 成都天翼空间科技有限公司 | VR equipment theme background recommendation method and system |
CN115423948B (en) * | 2022-11-04 | 2023-02-21 | 江西省映尚科技有限公司 | VR image processing method and system and readable storage medium |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2001073596A2 (en) * | 2000-03-29 | 2001-10-04 | Koninklijke Philips Electronics N.V. | Search user interface for constructing and managing user profiles and search criteria |
GB2365300A (en) * | 2000-06-07 | 2002-02-13 | David Meakes | Displaying search results according to relevance to query |
US20100262616A1 (en) * | 2009-04-09 | 2010-10-14 | Nokia Corporation | Method and apparatus for providing visual search engine results |
US20110305437A1 (en) * | 2010-06-15 | 2011-12-15 | Kabushiki Kaisha Toshiba | Electronic apparatus and indexing control method |
Family Cites Families (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5664077A (en) * | 1993-09-06 | 1997-09-02 | Nec Corporation | Three-dimensional graph displaying system |
US6505194B1 (en) * | 2000-03-29 | 2003-01-07 | Koninklijke Philips Electronics N.V. | Search user interface with enhanced accessibility and ease-of-use features based on visual metaphors |
JP2002157269A (en) * | 2000-11-22 | 2002-05-31 | Nippon Telegr & Teleph Corp <Ntt> | Video portal system and video providing method |
US7995810B2 (en) * | 2005-06-24 | 2011-08-09 | The University Of Iowa Research Foundation | System and methods for image segmentation in n-dimensional space |
JP2009508274A (en) * | 2005-09-13 | 2009-02-26 | スペースタイムスリーディー・インコーポレーテッド | System and method for providing a three-dimensional graphical user interface |
US7870140B2 (en) * | 2006-06-12 | 2011-01-11 | D&S Consultants, Inc. | System and method of incorporating user preferences in image searches |
US8549436B1 (en) * | 2007-06-04 | 2013-10-01 | RedZ, Inc. | Visual web search interface |
JP2009080580A (en) | 2007-09-25 | 2009-04-16 | Toshiba Corp | Image display device and display method |
CN101510291A (en) | 2008-02-15 | 2009-08-19 | 国际商业机器公司 | Visualization method and apparatus for multidimensional data |
US8520979B2 (en) * | 2008-08-19 | 2013-08-27 | Digimarc Corporation | Methods and systems for content processing |
JP5359266B2 (en) * | 2008-12-26 | 2013-12-04 | 富士通株式会社 | Face recognition device, face recognition method, and face recognition program |
US8335784B2 (en) * | 2009-08-31 | 2012-12-18 | Microsoft Corporation | Visual search and three-dimensional results |
US20110063288A1 (en) | 2009-09-11 | 2011-03-17 | Siemens Medical Solutions Usa, Inc. | Transfer function for volume rendering |
JP2012064200A (en) * | 2010-08-16 | 2012-03-29 | Canon Inc | Display controller, control method of display controller, program and recording medium |
KR101777875B1 (en) | 2011-04-28 | 2017-09-13 | 엘지디스플레이 주식회사 | Stereoscopic image display and method of adjusting stereoscopic image thereof |
WO2012164685A1 (en) * | 2011-05-31 | 2012-12-06 | 楽天株式会社 | Information providing device, information providing method, information providing processing program, recording medium recording information providing processing program, and information providing system |
JP2013029451A (en) * | 2011-07-29 | 2013-02-07 | Ricoh Co Ltd | Deposit detection device and deposit detection method |
US8799263B2 (en) * | 2011-09-04 | 2014-08-05 | Leigh M Rothschild | Systems, devices, and methods for providing multidimensional search results |
KR101855939B1 (en) | 2011-09-23 | 2018-05-09 | 엘지전자 주식회사 | Method for operating an Image display apparatus |
US8990201B1 (en) * | 2011-11-03 | 2015-03-24 | Google Inc. | Image search results provisoning |
US10032303B2 (en) * | 2012-12-14 | 2018-07-24 | Facebook, Inc. | Scrolling 3D presentation of images |
US9459697B2 (en) * | 2013-01-15 | 2016-10-04 | Leap Motion, Inc. | Dynamic, free-space user interactions for machine control |
US9269022B2 (en) * | 2013-04-11 | 2016-02-23 | Digimarc Corporation | Methods for object recognition and related arrangements |
-
2015
- 2015-08-12 EP EP15756724.9A patent/EP3198473A1/en not_active Ceased
- 2015-08-12 CN CN201580051350.3A patent/CN107004004B/en active Active
- 2015-08-12 US US15/513,101 patent/US11347793B2/en active Active
- 2015-08-12 WO PCT/US2015/044778 patent/WO2016048465A1/en active Application Filing
- 2015-08-12 KR KR1020177007764A patent/KR20170058942A/en not_active Application Discontinuation
- 2015-08-12 JP JP2017514696A patent/JP2017535835A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2001073596A2 (en) * | 2000-03-29 | 2001-10-04 | Koninklijke Philips Electronics N.V. | Search user interface for constructing and managing user profiles and search criteria |
GB2365300A (en) * | 2000-06-07 | 2002-02-13 | David Meakes | Displaying search results according to relevance to query |
US20100262616A1 (en) * | 2009-04-09 | 2010-10-14 | Nokia Corporation | Method and apparatus for providing visual search engine results |
US20110305437A1 (en) * | 2010-06-15 | 2011-12-15 | Kabushiki Kaisha Toshiba | Electronic apparatus and indexing control method |
Non-Patent Citations (1)
Title |
---|
See also references of EP3198473A1 * |
Also Published As
Publication number | Publication date |
---|---|
CN107004004A (en) | 2017-08-01 |
US20170300502A1 (en) | 2017-10-19 |
EP3198473A1 (en) | 2017-08-02 |
CN107004004B (en) | 2021-02-02 |
US11347793B2 (en) | 2022-05-31 |
JP2017535835A (en) | 2017-11-30 |
KR20170058942A (en) | 2017-05-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11347793B2 (en) | Use of depth perception as indicator of search, user interest or preference | |
US11435869B2 (en) | Virtual reality environment based manipulation of multi-layered multi-view interactive digital media representations | |
US11960533B2 (en) | Visual search using multi-view interactive digital media representations | |
EP2894634B1 (en) | Electronic device and image compostition method thereof | |
KR101535579B1 (en) | Augmented reality interaction implementation method and system | |
US9684818B2 (en) | Method and apparatus for providing image contents | |
US20150134651A1 (en) | Multi-dimensional surround view based search | |
CN107633023B (en) | Image duplicate removal method and device | |
US8934759B2 (en) | Video editing apparatus and video editing method | |
JP2019534494A (en) | Automatic tagging of objects in multi-view interactive digital media representation of dynamic entities | |
Silva et al. | Towards semantic fast-forward and stabilized egocentric videos | |
Yeh et al. | Relative features for photo quality assessment | |
CN106528800A (en) | Image generation method and apparatus based on real scenes | |
Ferreira et al. | A generic framework for optimal 2D/3D key-frame extraction driven by aggregated saliency maps | |
KR20200064643A (en) | Apparatus and method for providing style information of cloth | |
CN104850600A (en) | Method and device for searching images containing faces | |
US11847829B2 (en) | Method, apparatus, electronic device, and computer storage medium for video processing | |
Ejaz et al. | Video summarization by employing visual saliency in a sufficient content change method | |
US10353946B2 (en) | Client-server communication for live search using multi-view digital media representations | |
Ju et al. | A semi-automatic 2D-to-3D video conversion with adaptive key-frame selection | |
Vandecasteele et al. | Spatio-temporal wardrobe generation of actors’ clothing in video content | |
Ferreira et al. | 3d key-frame extraction method based on visual saliency | |
Baldacci et al. | Presentation of 3D scenes through video example | |
CN116634238A (en) | Information display method, device, computer equipment and storage medium | |
Vendrig et al. | Evaluation of logical story unit segmentation in video sequences |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 15756724 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2017514696 Country of ref document: JP Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 20177007764 Country of ref document: KR Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
REEP | Request for entry into the european phase |
Ref document number: 2015756724 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2015756724 Country of ref document: EP |