EP3323478A1 - Appareil et procédés associés - Google Patents

Appareil et procédés associés Download PDF

Info

Publication number
EP3323478A1
EP3323478A1 EP16199916.4A EP16199916A EP3323478A1 EP 3323478 A1 EP3323478 A1 EP 3323478A1 EP 16199916 A EP16199916 A EP 16199916A EP 3323478 A1 EP3323478 A1 EP 3323478A1
Authority
EP
European Patent Office
Prior art keywords
virtual reality
video imagery
audio
boundary
spatial
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP16199916.4A
Other languages
German (de)
English (en)
Inventor
Antti Eronen
Miikka Vilermo
Arto Lehtiniemi
Jussi LEPPÄNEN
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nokia Technologies Oy
Original Assignee
Nokia Technologies Oy
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nokia Technologies Oy filed Critical Nokia Technologies Oy
Priority to EP16199916.4A priority Critical patent/EP3323478A1/fr
Publication of EP3323478A1 publication Critical patent/EP3323478A1/fr
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • AHUMAN NECESSITIES
    • A63SPORTS; GAMES; AMUSEMENTS
    • A63FCARD, BOARD, OR ROULETTE GAMES; INDOOR GAMES USING SMALL MOVING PLAYING BODIES; VIDEO GAMES; GAMES NOT OTHERWISE PROVIDED FOR
    • A63F13/00Video games, i.e. games using an electronically generated display having two or more dimensions
    • A63F13/50Controlling the output signals based on the game progress
    • A63F13/54Controlling the output signals based on the game progress involving acoustic signals, e.g. for simulating revolutions per minute [RPM] dependent engine sounds in a driving game or reverberation against a virtual wall
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/167Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/302Electronic adaptation of stereophonic sound system to listener position or orientation
    • H04S7/303Tracking of listener position or orientation
    • H04S7/304For headphones
    • AHUMAN NECESSITIES
    • A63SPORTS; GAMES; AMUSEMENTS
    • A63FCARD, BOARD, OR ROULETTE GAMES; INDOOR GAMES USING SMALL MOVING PLAYING BODIES; VIDEO GAMES; GAMES NOT OTHERWISE PROVIDED FOR
    • A63F2300/00Features of games using an electronically generated display having two or more dimensions, e.g. on a television screen, showing representations related to the game
    • A63F2300/80Features of games using an electronically generated display having two or more dimensions, e.g. on a television screen, showing representations related to the game specially adapted for executing a specific type of game
    • A63F2300/8082Virtual reality
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/11Positioning of individual sound objects, e.g. moving airplane, within a sound field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/01Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]

Definitions

  • the present disclosure relates to the field of virtual reality and, in particular, to the presentation of spatial audio in virtual reality content. Associated methods, computer programs and apparatus is also disclosed. Certain disclosed aspects/examples relate to portable electronic devices.
  • Virtual reality may use a headset, such as glasses or goggles, or one or more displays that surround a user to provide the user with an immersive virtual experience.
  • a virtual reality apparatus may present multimedia virtual reality content representative of a virtual reality space to a user to simulate the user being present within the virtual reality space.
  • the virtual reality space may be provided by a panoramic video, such as a video having a wide or 360° field of view (which may include above and/or below a horizontally oriented field of view). Spatial audio comprising sound that appears to originate at a particular origin location may also be provided as part of the consumption of virtual reality.
  • an apparatus comprising:
  • the apparatus is caused to provide for audio presentation of the audio track with a spatial audio effect that places the origin location of the audio track at a point on a boundary of the object and provides for movement of the origin location along the boundary of the object based on changes in a view point of the video imagery relative to the object.
  • the apparatus is caused to provide for audio presentation of the audio track with a spatial audio effect that places the origin location of the audio track at a point on a boundary of the object and provides for one or more of:
  • the apparatus being caused to provide for audio presentation of the audio track with a spatial audio effect that places the origin location of the audio track at a point on a surface of the object and provides for movement of the origin location along the surface of the object based on changes in a view point of the video imagery relative to the object, comprises identifying a point on the surface of the object within a threshold distance to the view point of the video imagery and providing for placement of the original location at the identified point.
  • the apparatus provides for identification of a point on the boundary of the object closest to the view point of the video imagery and provides for placement of the original location at the identified point.
  • the boundary is defined by a plurality of surfaces and the apparatus is caused to identify one of the plurality of surfaces that is visible in the virtual reality view and provide for audio presentation of the audio track with a spatial audio effect that places the origin location over the region defined by the identified surface.
  • the apparatus is configured to determine the location of the object in the virtual reality space based on 3-dimensional model information that is associated with the virtual reality content.
  • the 3-dimensional model information may comprise complete or partial virtual models of one or more objects in the scene captured in the VR content.
  • the 3-dimensional model information comprises one or more of depth information such as from the view point of the VR content capture device that captured the video imagery of the scene and three-dimensional model information, such as from a building or city plan.
  • the object comprises a three-dimensional object that appears in the video imagery and at least part of the boundary of the three-dimensional object is known to the apparatus by one or more of:
  • computer readable medium comprising computer program code stored thereon, the computer readable medium and computer program code being configured to, when run on at least one processor, perform the method of. in respect of virtual reality content comprising video imagery, the video imagery presented in a virtual reality space for viewing in virtual reality, wherein a virtual reality view presented to a user provides for viewing of the video imagery, the virtual reality view comprising a spatial portion of the video imagery and being smaller in spatial extent than the spatial extent of the video imagery as presented in the virtual reality space; based on the virtual reality content including at least one audio track comprising audio that is for audible presentation to a user with a spatial audio effect configured such that the audio is audibly perceived to originate from an origin location in the virtual reality space, the audio track associated with an object viewable in the video imagery, the object having a known location in the virtual reality space; providing for audio presentation of the audio track with one or more of;
  • the apparatus is caused to provide for association of the audio track with one or more of; an area defining a boundary of the object; a volume defining the boundary of the object; the boundary of the object; and at least one of a plurality of surfaces that form the boundary of the object.
  • the object comprises a plurality of surfaces that form the boundary of the object and the apparatus provides for visual identification of said plurality of surfaces to enable user selection of at least one of the plurality of surfaces for association of the audio track therewith.
  • a fifth example aspect there is provided a method, the method comprising:
  • an apparatus comprising:
  • the present disclosure includes one or more corresponding aspects, examples or features in isolation or in various combinations whether or not specifically stated (including claimed) in that combination or in isolation.
  • Corresponding means and corresponding functional units e.g., function enabler, video imagery extractor, video imagery compiler, viewing direction measurer, viewing direction modifier, video player, direction sensor
  • function enabler e.g., function enabler, video imagery extractor, video imagery compiler, viewing direction measurer, viewing direction modifier, video player, direction sensor
  • Virtual reality may use a headset, such as glasses or goggles, or one or more displays that surround a user to provide a user with an immersive virtual experience.
  • a virtual reality apparatus may present multimedia virtual reality content representative of a virtual reality space to a user to simulate the user being present within the virtual reality space.
  • the virtual reality space may replicate a real world environment to simulate the user being physically present at a real world location or the virtual reality space may be computer generated or a combination of computer generated and real world multimedia content.
  • the virtual reality space may be provided by video imagery, such as a panoramic video.
  • the video imagery may comprise a video having a wide or 360° field of view (which may include above and/or below a horizontally oriented field of view).
  • the virtual reality space may thus provide a virtual 3-dimensional environment in which the video imagery of the VR content is provided for display.
  • a user may be provided with a spatial subset of the virtual reality space as their VR view, i.e. a view port to view the video imagery of the VR space.
  • the virtual reality apparatus may provide for user interaction with the virtual reality space displayed.
  • the virtual reality content provided to the user may comprise live or recorded images of the real world, captured by a virtual reality content capture device such as a panoramic video capture device or virtual reality content capture device, for example.
  • a virtual reality content capture device is a Nokia OZO camera.
  • the virtual reality space may provide a 360° or more field of view and may provide for panning/rotating around said field of view based on movement of the VR user's head or eyes.
  • the virtual reality view of a virtual reality space may be provided to said user by virtual reality apparatus via displays in the headset.
  • the virtual reality space may appear to the user of the VR apparatus as a three dimensional space created from images of the virtual reality content.
  • the VR content may comprise images taken in multiple viewing directions that can be displayed and arranged together to form a (uninterrupted, continuous) wrap around field of view.
  • Virtual reality content may, by its nature, be immersive and may thereby comprise a large amount of data.
  • the virtual reality content may comprise video imagery (i.e. moving images) that have a large spatial extent, such as to surround the user.
  • the virtual reality view provided to the user may only cover an area of the video imagery smaller than the total area and the user may need to move the virtual reality view around to appreciate the whole spatial extent of the virtual reality space that the video imagery covers.
  • VR content may be presented with spatial audio, otherwise known as three dimensional audio or directional audio, which presents audio to a user (consumer) of virtual reality such that the user perceives the audio as originating from a particular direction relative to the VR content.
  • stereo sound may comprise two channels with one channel provided to each ear provides an element of left-right directionality
  • spatial audio may use head-related transfer functions to virtually position the source of the audio such that it is perceived to originate from a particular position in space.
  • the relative volume and relative timing of the presentation of audio to each ear may provide for virtual positioning of the perceived origin location or source of the audio in space.
  • the audio presented to a user consuming VR content may comprise ambient or non-directional audio such as background sounds or music which is perceived as not coming from a particular direction and directional audio which is perceived by the user as originating from a particular direction.
  • ambient or non-directional audio such as background sounds or music which is perceived as not coming from a particular direction
  • directional audio which is perceived by the user as originating from a particular direction.
  • events in the virtual reality content generate accompanying audio, such as characters speaking, cars revving, hockey pucks being struck.
  • the voice of a character speaking who visually appears to the left of the user's virtual reality view may be presented as directional audio originating from that left-hand side.
  • Figure 1 shows an apparatus 100 configured to provide for audio presentation of an audio track with a spatial audio effect based on the association of the audio track with an object viewable in the video imagery.
  • a user may therefore hear the audio of the audio track as if it originates from the object, as described below.
  • the apparatus 100 may be part of a virtual reality device 101 configured to present the virtual reality content to a user.
  • the virtual reality device 101 may present a VR view of a VR space to a user via a VR display 102, which may comprise a headset.
  • the VR view provided to the user on the VR display 102 may be based on the orientation of the VR display, such that a user can look around the VR space by moving their head. It will be appreciated that other types of display and other means to "look around" the virtual reality space may be provided.
  • the VR content may be provided by a virtual reality content capture device 104.
  • the apparatus 105 may comprise an apparatus for use in production or editing of the VR content.
  • the apparatus 105 may provide an interface for associating the audio track with objects in the VR content, wherein the association may be stored with the audio track and/or VR content in the VR content store 103.
  • the apparatus 100 is functionally provided by a computer or a computer server, each of which may comprise a memory and at least one processor (not shown), similar to that described for the apparatus 100.
  • the processor may be a general purpose processor dedicated to executing/processing information received from other components, such as from content store 103 and/or VR device 101 for apparatus 100 and VR content capture device 104 for apparatus 105, in accordance with instructions stored in the form of computer program code in the memory.
  • the output signalling generated by such operations of the processor is provided onwards to further components, such as the VR apparatus 101 for audible presentation of the audio track or to the VR content store 103 for storing of an object-associated audio track.
  • the memory (not necessarily a single memory unit) is a computer readable medium (solid state memory in this example, but may be other types of memory such as a hard drive, ROM, RAM, Flash or the like) that stores computer program code.
  • This computer program code stores instructions that are executable by the processor, when the program code is run on the processor.
  • the internal connections between the memory and the processor can be understood to, in one or more example embodiments, provide an active coupling between the processor and the memory to allow the processor to access the computer program code stored on the memory.
  • the processor and memory are all electrically connected to one another internally to allow for electrical communication between the respective components.
  • the components are all located proximate to one another so as to be formed together as an ASIC, in other words, so as to be integrated together as a single chip/circuit that can be installed into an electronic device.
  • one or more or all of the components may be located separately from one another.
  • FIG. 1 the apparatus for associating an audio track with virtual reality content (performed by apparatus 105) is shown as well as the apparatus for providing for audio presentation of the audio track with a particular spatial audio effect (performed by apparatus 100) for appreciation of the whole system.
  • apparatus 105 the apparatus for associating an audio track with virtual reality content
  • apparatus 100 the apparatus for providing for audio presentation of the audio track with a particular spatial audio effect
  • the association of an audio track with an object in the virtual reality content may be performed and the resultant VR content with associated audio track, which may be provided to store 103, may be viewed by many different VR devices 101 and associated apparatuses 100 for the provision of audio of the audio track with a particular spatial audio effect based on the association.
  • Figure 2 shows a scene 200 that is captured for presentation in virtual reality of the Palace of
  • the virtual reality content may comprise a documentary about the Palace of
  • the VR content may be about any topic.
  • the scene includes an object 201 comprising the Palace.
  • An arrow shows a path 202 taken by a virtual reality content capture device 203 as it captures video imagery for presentation in virtual reality.
  • the video imagery comprises imagery covering all viewing directions (or substantially all) relative to the content capture device.
  • a user watching the captured virtual reality content will be able to look around at the Palace 201 and gardens 204 as the view point of the video imagery progresses along the path 202 over time.
  • the view point comprises the location from which the video imagery was captured at a particular time relative to the location of an object.
  • the user may be able to look in many different viewing directions to see the video imagery that was captured of the scene around the view point.
  • the audio track may comprise audio other than commentary.
  • the audio track may have been recorded in advance, during or after the capture of the video imagery.
  • the audio track may have been recorded separately from any audio captured at the scene by the VR content capture device 203. It may be desirable to provide for audible presentation of the audio track, perhaps as well as audio captured by the VR content capture device 203.
  • the audio track in this example is about the object (Palace) 201 and it may be desirable to provide for audible presentation of the audio track with a spatial audio effect that associates the audio with the object. Accordingly, the spatial audio effect may provide for the perception that audio of the audio track originates at a point 205 comprising the Palace 201.
  • the presentation of audio in this way may be confusing when the video imagery is captured from a distance close to the object (for example, below a threshold distance) or when the video imagery is captured of an object that covers a large area of the video imagery, such as more than a threshold ratio of area of the object in the video imagery to the area of the video imagery viewable in the virtual reality view at any one time.
  • Figure 3 illustrates a virtual reality view 300 presented to the user based on the video imagery captured by the VR content capture device 203 at the view point at which it is shown in Figure 2 and with a viewing direction shown by arrow 206.
  • the VR view 300 comprises the portion of the Palace 201 labelled 207 in Figure 2 .
  • the audio track having an audio effect applied such that it is perceived to originate from point 205, it will be heard by the user presented with VR view 300 from their left side, illustrated by arrow 301.
  • the commentary or audio track appears to originate not from the Palace but from a different direction, despite point 205 being in the centre of the Palace 201.
  • the presentation of an audio track with a spatial audio effect may be difficult to implement for objects 201 of a particular area or that are captured from a particular distance, as mentioned above.
  • Figure 4 shows the same scene of the Palace (the object in the VR imagery) 201 and wherein during capture of the video imagery of the VR content the VR content capture device 203 has followed the same path 202 illustrated by the arrow.
  • the same VR content capture device 203 is shown at several view points 401, 402, 403, 404, 405 along the path 202 during its progression therealong.
  • the audio track has been associated with the Palace and may be about the Palace, such as commentary.
  • the audio track is provided for audible presentation with a spatial audio effect which places the origin location of the audio track at a point on or within a boundary 406 of the object 201 and provides for movement of the origin location along or within the boundary of the object based on changes in a view point 401, 402, 403, 404, 405 of the video imagery relative to the object 201.
  • the apparatus 100 may provide for audible presentation of the audio of the audio track such that it is perceived to originate from the object 201 but not from a fixed origin location within the object or on its boundary but instead from an origin location that lies on or within the boundary of the object and that is moved based on where the view point of the video imagery is currently located.
  • the origin location of the audio track may be moved in a step wise manner with movement of the view point 401, 402, 403, 404, 405 or continuously with the movement of the view point 401, 402, 403, 404, 405.
  • the origin location may be placed at location on or within the boundary of the object that is closest to the current view point 401, 402, 403, 404, 405.
  • the object may be divided into a plurality of sections (either automatically by the apparatus 100 based on its size and/or distance or based on predetermined sections).
  • the origin location may be placed at a location within one of the predetermined sections of the object that is closest to the current view point 401, 402, 403, 404, 405.
  • the boundary of an object may comprise a surface that delimits the object from other objects or free space.
  • the boundary may comprise the outside walls of the building.
  • the boundary may comprise a region at the perimeter of the park or football field.
  • the outside surface of the animal or person may comprise the boundary.
  • an approximation of the object may be used to define the boundary.
  • a bounding geometric shape that may fit around the object 201 may be used as an approximation of the boundary.
  • the object 201 is identified to have a plurality of surfaces 407, 408, 409, 410, 411 that define some of the walls of the Palace 201.
  • the surfaces 407, 408, 409, 410, 411 may be determined from a three-dimensional model of the object.
  • the three-dimensional model may be obtained from one or more of predetermined model data provided to the apparatus 100, 105, image analysis of the video imagery to derive the shape/boundaries of objects therein, and depth information obtained from the VR content capture device 203.
  • the depth information may comprise information indicative of how far away objects are from the VR content capture device from the view point of the VR content capture device.
  • the depth information which may be collated with depth information from different view points, may provide for the derivation of models of objects visible in the video imagery.
  • the surfaces may be identified by any appropriate algorithm to identify the shape of an object from a model or the model itself may include information that designates its surfaces.
  • the apparatus 100 is configured to provide for audio presentation of the audio track with a spatial audio effect that places the origin location of the audio track at a point on one of the surfaces 407, 408, 409, 410, 411 of the object 201 and provides for movement of the origin location along one or more of surfaces 407, 408, 409, 410, 411 of the object based on changes in a view point 401, 402, 403, 404, 405 of the video imagery relative to the object.
  • the apparatus 100 may be caused to identify a point on one or more of the surfaces of the object within a threshold distance to the view point of the video imagery and provide for placement of the original location at the identified point. Accordingly, the origin location moves along the boundary or surface(s) based on the view point.
  • the apparatus 100 may be caused to identify which of the plurality of surfaces 407, 408, 409, 410, 411 is closest or within a threshold distance of the current view point 401, 402, 403, 404, 405 of the video imagery and provide for placement of the origin location at a fixed point on the identified surface, such as at the centre of the surface or other point thereon. Accordingly, the origin location moves between surfaces based on the view point.
  • the origin location may be placed at location on a boundary surface of the object that is closest to the current view point 401, 402, 403, 404, 405.
  • an arrow is shown designated 421, 422, 423, 424, 425 respectively which illustrates the direction from which the user hears the audio track.
  • the origin location provided by the spatial audio effect is shown, in this example, to move along each of the surfaces with the view point such that the origin location is positioned at the closest position on one of the surfaces to the current view point.
  • the audio track is associated with the object 201 and the apparatus 100 provides for application of an appropriate spatial audio effect to the audio track based on the view point of the video imagery relative to that object.
  • the audio track may include more information than just the object 201 to enable application of an appropriate spatial audio effect.
  • the audio track may be associated with information that designates different parts of the object over time, such as the above-mentioned sections or surfaces.
  • the information may be provided by apparatus 105. Accordingly, the apparatus 100 may not be required to determine the surface or section with which to place the origin location of the audio as this information may be provided with the audio track or VR content having been generated by the apparatus 105.
  • the apparatus 100 may therefore, in this example, provide for audible presentation of the audio track with a spatial audio effect based on the information provided by the apparatus 105.
  • Figure 5 shows a further example of a spatial audio effect applied to "object assigned audio tracks".
  • Figure 5 shows the same scene of the Palace (the object in the VR imagery) 201 and wherein during capture of the video imagery of the VR content the VR content capture device 203 has followed the same path 202 illustrated by the arrow.
  • the same VR content capture device 203 is shown at several view points 501, 502 along the path 202 during its progression therealong.
  • the audio track has been associated with the Palace and may be about the Palace, such as commentary.
  • the object 201 is identified to have a plurality of surfaces 407,408, 409, 410, 411 that define some of the walls of the Palace 201.
  • the surfaces 407, 408, 409, 410, 411 may be determined from a three-dimensional model of the object, which may be performed by the apparatus 100 or the apparatus 100 may be provided with information regarding the location of the surfaces.
  • the origin location was a point, however in this example the origin location is a region. Accordingly, the spatial audio effect is such that the audio of the audio track is perceived to originate from a region rather than a single point.
  • the region may be implemented as multiple points arranged together over the region or as one-dimensional, two-dimensional or three-dimensional origin location that forms the region.
  • the region comprises one or more of the surfaces 407, 408, 409, 410, 411 of the object 201.
  • the spatial audio effect is applied to cause the perception that the audio originates from the region defined by surface 407.
  • the spatial audio effect is applied to cause the perception that the audio originates from the region defined by surface 409.
  • the other surfaces may comprise the regions from which audio is perceived at view points other than those exemplified by 501 and 502.
  • a pair of arrows is shown designated 503, 504 and 505, 506 respectively which illustrates the range of directions from which the user hears the audio of the audio track simultaneously.
  • the origin location provided by the spatial audio effect is shown, in this example, to extend over the surface (region) 407, 408, 409, 410, 411 that is positioned at a position closest to the current view point 501, 502.
  • the apparatus 100 may determine which of the plurality of surfaces 407, 408, 409, 410, 411 is closest to the current view point 501, 502 or it may be provided with this information by a different apparatus.
  • the origin location may comprise a point and that point may be moved within the boundary of an object, along its surface, between surfaces in a continuous or step wise manner based on the view point of the video imagery relative to the object.
  • the origin location may be positioned such that the audio track appears to originate from the object within a predetermined distance of the current view point.
  • the origin location may extend over a region, such as over one or more of a plurality of surfaces of the object and may be moved between surfaces in a continuous or step wise manner based on the view point of the video imagery relative to the object.
  • the origin location may be positioned such that the audio track appears to originate from the object within a predetermined distance of the current view point.
  • the apparatus 100 may be configured to determine whether to provide such a movable origin location or a fixed origin location independent of the view point based on one or more of the distance between the view point and the object and the size of the object. For example, in situations where the object is small or far away from the view point, the movement of the origin location may be unnecessary.
  • the apparatus may be configured perform geometrical calculations to identify the position or surface that is closest to the current view point.
  • Various algorithms will be known to those skilled in the art on how to calculate the closest distance between points and lines or planes.
  • the apparatus 105 based on at least one audio track comprising audio that is for audible presentation to a user with a spatial audio effect that provides the perception that the audio originates from an origin location in the virtual reality space, and a user selection of an object 201 viewable in the video imagery, the object 201 having a known location in the video imagery, the apparatus 105 is caused to provide for association of the audio track with the object 201.
  • the association of the audio track with the object 201 may be achieved in various ways.
  • the association may comprise tagging the audio track with the name of the object 201 e.g. "Palace of Paris". It may then be left to the apparatus 100, when providing for audible presentation of the audio track to identify the tagged object in the video imagery, such as by image analysis of the video imagery/object recognition of the video imagery and the known or acquired image of the object/Palace of York, geographic maps based on geographically tagged VR content or content analysis (semantic for example) of the VR content itself to identify where it was recorded, building or city model information based on geographically tagged VR content or content analysis (semantic for example) of the VR content itself to identify where it was recorded.
  • the apparatus 100 may then provide for appropriate positioning of the origin location.
  • the association of the audio track with the object 201 may comprise the apparatus 105 being provided with information about objects present in the VR content and allowing for selection of one of the objects.
  • the apparatus 105 may provide for highlighting or otherwise identifying the objects in the video imagery with selectable elements to provide for user selection of one of the objects to provide for association with the audio track.
  • the audio track may then be tagged with a location, such as a geographic location that can be translated into the location in the video imagery by apparatus 100 or a location in the virtual reality space in which the video imagery is displayed by the VR apparatus 101 or an area or volume that defines the boundary of the object 201.
  • the apparatus 100 with knowledge of the location of the object 201 or the area/volume defining the object 201 may determine any required movement of the origin location with respect to the object.
  • the association of the audio track with the object 201 may comprise the apparatus 105 being provided with information about objects present in the VR content and allowing for selection of one of the objects as well as one or more points and/or regions on the object that the origin location may be placed.
  • the audio track may then be tagged with the location(s).
  • the apparatus 100 with knowledge of the locations on the boundary of/within the object may determine any required movement of the origin location between those locations.
  • the apparatus 105 is caused to provide for association of the audio track with one or more of; the boundary of the object; and at least one of a plurality of surfaces that form the boundary of the object.
  • the apparatus 105 may provide for selection of one or more audio tracks comprising audio that is for audible presentation to a user with a spatial audio effect such that it is audibly perceived to originate from an origin location in the virtual reality space as well as selection of an object or surface thereof and based on the selection(s), provide for association of one with the other.
  • a sound engineer or other user may select an object (rather than multiple points) that is identified in the scene by the apparatus 105 which may then provide for the audio track to be subsequently audibly presented by apparatus 100 such that the spatial audio effect causes the perception that the origin location is at two or more different locations within the boundary of or on the boundary of the object based on the view point.
  • the apparatus 105 may provide for associated of one or more setting parameters with the audio track for control of how the apparatus 100 subsequently provides for application of a spatial audio effect. For example, based on a user selection the apparatus 105 may provide for setting of a movement control parameter that defines how accurately the boundary or surfaces thereof are followed based on the view point. For example, the parameter may cause the apparatus 100 to provide for the surface to be followed exactly in accordance with a model of the building with one parameter setting and with a different parameter setting may only roughly follow the general shape of the model or may move along the surface of the model with steps of different granularity levels, such as from surface to surface from a single point on each surface.
  • the apparatus 105 may provide for setting of one or more of a view-point-following parameter and a spatial-extent parameter.
  • the view-point-following parameter may provide for control of how closely the movement of the origin location along or within the boundary of the object is based on changes in a view point of the video imagery relative to the object.
  • the view-point-following parameter may be used by the apparatus 100 to determine whether the origin location moved over every detail in the boundary of the object or if boundary/surface direction changes above a threshold frequency are smoothed out.
  • the spatial-extent parameter may be used by the apparatus 100 to control the spatial extent of the origin location from a point to an area greater in extent than the point.
  • one or more parameters may provide for limits on the maximum or minimum size of the origin location.
  • a maximum size limit parameter set when the apparatus 100 determines that the closest surface is a large surface (e.g. 407 or 411) that happens to be greater than the maximum size limit parameter, it may limit the spatial extent of the spatial audio effect to cover only part of the large surface(s).
  • Figure 6 shows a flow diagram illustrating the steps, in respect of virtual reality content comprising video imagery, the video imagery presented in a virtual reality space for viewing in virtual reality, wherein a virtual reality view presented to a user provides for viewing of the video imagery, the virtual reality view comprising a spatial portion of the video imagery and being smaller in spatial extent than the spatial extent of the video imagery as presented in the virtual reality space; based on 600 the virtual reality content including at least one audio track comprising audio that is for audible presentation to a user with a spatial audio effect configured such that the audio is audibly perceived to originate from an origin location in the virtual reality space, the audio track associated with an object viewable in the video imagery, the object having a known location in the virtual reality space; providing for 601 audio presentation of the audio track with one or more of;
  • Figure 7 shows a flow diagram illustrating the steps of, in respect of virtual reality content comprising video imagery, the video imagery for presentation in a virtual reality space for viewing in virtual reality, wherein a virtual reality view presented to a user provides for viewing of the video imagery, the virtual reality view comprising a spatial portion of the video imagery and being smaller in spatial extent than the spatial extent of the video imagery as presented in the virtual reality space; based on 700 at least one audio track comprising audio that is for audible presentation to a user with a spatial audio effect such that it is audibly perceived to originate from an origin location in the virtual reality space, and a user selection of an object viewable in the video imagery, the object having a known location in the video imagery; providing for 701 association of the audio track with the object.
  • Figure 8 illustrates schematically a computer/processor readable medium 800 providing a program according to an example.
  • the computer/processor readable medium is a disc such as a digital versatile disc (DVD) or a compact disc (CD).
  • DVD digital versatile disc
  • CD compact disc
  • the computer readable medium may be any medium that has been programmed in such a way as to carry out an inventive function.
  • the computer program code may be distributed between the multiple memories of the same type, or multiple memories of a different type, such as ROM, RAM, flash, hard disk, solid state, etc.
  • the apparatus shown in the above examples may be a portable electronic device, a laptop computer, a mobile phone, a Smartphone, a tablet computer, a personal digital assistant, a digital camera, a smartwatch, smart eyewear, a pen based computer, a non-portable electronic device, a desktop computer, a monitor, a household appliance, a smart TV, a server, a wearable apparatus, a virtual reality apparatus, or a module/circuitry for one or more of the same.
  • Any mentioned apparatus and/or other features of particular mentioned apparatus may be provided by apparatus arranged such that they become configured to carry out the desired operations only when enabled, e.g. switched on, or the like. In such cases, they may not necessarily have the appropriate software loaded into the active memory in the non-enabled (e.g. switched off state) and only load the appropriate software in the enabled (e.g. on state).
  • the apparatus may comprise hardware circuitry and/orfirmware.
  • the apparatus may comprise software loaded onto memory. Such software/computer programs may be recorded on the same memory/processor/functional units and/or on one or more memories/processors/ functional units.
  • a particular mentioned apparatus may be pre-programmed with the appropriate software to carry out desired operations, and wherein the appropriate software can be enabled for use by a user downloading a "key", for example, to unlock/enable the software and its associated functionality.
  • Advantages associated with such examples can include a reduced requirement to download data when further functionality is required for a device, and this can be useful in examples where a device is perceived to have sufficient capacity to store such pre-programmed software for functionality that may not be enabled by a user.
  • Any mentioned apparatus/circuitry/elements/processor may have other functions in addition to the mentioned functions, and that these functions may be performed by the same apparatus/circuitry/elements/processor.
  • One or more disclosed aspects may encompass the electronic distribution of associated computer programs and computer programs (which may be source/transport encoded) recorded on an appropriate carrier (e.g. memory, signal).
  • signal may refer to one or more signals transmitted as a series of transmitted and/or received electrical/optical signals.
  • the series of signals may comprise one, two, three, four or even more individual signal components or distinct signals to make up said signalling. Some or all of these individual signals may be transmitted/received by wireless or wired communication simultaneously, in sequence, and/or such that they temporally overlap one another.
  • processors and memory may comprise a computer processor, Application Specific Integrated Circuit (ASIC), field-programmable gate array (FPGA), and/or other hardware components that have been programmed in such a way to carry out the inventive function.
  • ASIC Application Specific Integrated Circuit
  • FPGA field-programmable gate array

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Stereophonic System (AREA)
EP16199916.4A 2016-11-22 2016-11-22 Appareil et procédés associés Withdrawn EP3323478A1 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP16199916.4A EP3323478A1 (fr) 2016-11-22 2016-11-22 Appareil et procédés associés

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP16199916.4A EP3323478A1 (fr) 2016-11-22 2016-11-22 Appareil et procédés associés

Publications (1)

Publication Number Publication Date
EP3323478A1 true EP3323478A1 (fr) 2018-05-23

Family

ID=57391819

Family Applications (1)

Application Number Title Priority Date Filing Date
EP16199916.4A Withdrawn EP3323478A1 (fr) 2016-11-22 2016-11-22 Appareil et procédés associés

Country Status (1)

Country Link
EP (1) EP3323478A1 (fr)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6330486B1 (en) * 1997-07-16 2001-12-11 Silicon Graphics, Inc. Acoustic perspective in a virtual three-dimensional environment
KR20080053875A (ko) * 2006-12-11 2008-06-16 한국전자통신연구원 가상현실을 위한 오디오 음상 제어 장치 및 그 방법
US20100302441A1 (en) * 2009-06-02 2010-12-02 Canon Kabushiki Kaisha Information processing apparatus, information processing method, and program
US20140079225A1 (en) * 2012-09-17 2014-03-20 Navteq, B.V. Method and apparatus for associating audio objects with content and geo-location

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6330486B1 (en) * 1997-07-16 2001-12-11 Silicon Graphics, Inc. Acoustic perspective in a virtual three-dimensional environment
KR20080053875A (ko) * 2006-12-11 2008-06-16 한국전자통신연구원 가상현실을 위한 오디오 음상 제어 장치 및 그 방법
US20100302441A1 (en) * 2009-06-02 2010-12-02 Canon Kabushiki Kaisha Information processing apparatus, information processing method, and program
US20140079225A1 (en) * 2012-09-17 2014-03-20 Navteq, B.V. Method and apparatus for associating audio objects with content and geo-location

Similar Documents

Publication Publication Date Title
US20190180509A1 (en) Apparatus and associated methods for presentation of first and second virtual-or-augmented reality content
US8644467B2 (en) Video conferencing system, method, and computer program storage device
US10798518B2 (en) Apparatus and associated methods
EP3343349B1 (fr) Appareil et procédés associés dans le domaine de la réalité virtuelle
CN110036416B (zh) 用于空间音频的装置和相关方法
EP3236346A1 (fr) Appareil et procédés associés
US11416201B2 (en) Apparatus and associated methods for communication between users experiencing virtual reality
US11210518B2 (en) Apparatus and associated methods for display of virtual reality content
EP3264222B1 (fr) Appareil et procédés associés
EP3506213A1 (fr) Appareil et procédés associés de présentation de contenu de réalité augmentée
EP3236336B1 (fr) Contenu résumé causal de réalité virtuelle
US20200128348A1 (en) An Apparatus and Associated Methods for Presentation of Spatial Audio
US11223925B2 (en) Apparatus and associated methods for presentation of captured spatial audio content
US20190058861A1 (en) Apparatus and associated methods
US11076253B2 (en) Apparatus and associated methods for presentation of a bird's eye view
EP3323478A1 (fr) Appareil et procédés associés

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN PUBLISHED

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

AX Request for extension of the european patent

Extension state: BA ME

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20181114

RBV Designated contracting states (corrected)

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: EXAMINATION IS IN PROGRESS

17Q First examination report despatched

Effective date: 20190215

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: GRANT OF PATENT IS INTENDED

INTG Intention to grant announced

Effective date: 20190522

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: NOKIA TECHNOLOGIES OY

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20191002